Query gi|254780976|ref|YP_003065389.1| Holliday junction resolvase YqgF [Candidatus Liberibacter asiaticus str. psy62]
Match_columns 160
No_of_seqs 112 out of 1903
Neff 6.3
Searched_HMMs 39220
Date Mon May 30 02:26:42 2011
Command /home/congqian_1/programs/hhpred/hhsearch -i 254780976.hhm -d /home/congqian_1/database/cdd/Cdd.hhm
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PRK00109 Holliday junction res 100.0 1.5E-44 0 290.1 15.9 140 14-154 1-140 (141)
2 pfam03652 UPF0081 Uncharacteri 100.0 3.6E-43 0 281.9 15.6 134 17-151 1-134 (134)
3 COG0816 Predicted endonuclease 100.0 5.9E-41 1.4E-45 268.5 15.9 138 16-154 1-139 (141)
4 TIGR00250 TIGR00250 conserved 100.0 1.1E-38 2.9E-43 254.7 10.2 130 20-150 1-133 (133)
5 smart00732 YqgFc Likely ribonu 99.9 1.5E-27 3.7E-32 187.4 10.6 99 17-118 1-99 (99)
6 COG2433 Uncharacterized conser 97.7 0.0021 5.3E-08 41.3 12.7 119 17-156 244-362 (652)
7 PRK00039 ruvC Holliday junctio 97.5 0.00031 7.9E-09 46.3 6.5 103 16-123 1-113 (169)
8 cd00529 RuvC_resolvase Hollida 97.5 0.0017 4.4E-08 41.7 10.1 104 18-126 1-114 (154)
9 COG2183 Tex Transcriptional ac 97.4 0.0035 8.8E-08 39.9 10.6 104 5-116 317-427 (780)
10 pfam04312 DUF460 Protein of un 97.3 0.0042 1.1E-07 39.4 10.2 109 15-142 30-138 (138)
11 pfam02075 RuvC Crossover junct 97.2 0.0047 1.2E-07 39.1 9.0 104 19-127 1-113 (148)
12 COG1548 Predicted transcriptio 95.8 0.035 9E-07 33.8 6.3 105 16-122 2-109 (330)
13 COG0817 RuvC Holliday junction 95.2 0.084 2.1E-06 31.6 6.4 103 20-127 1-113 (160)
14 COG1940 NagC Transcriptional r 95.1 0.24 6E-06 28.8 8.6 108 14-122 3-126 (314)
15 PRK09698 D-allose kinase; Prov 94.1 0.53 1.4E-05 26.7 8.3 95 16-111 3-113 (302)
16 TIGR00228 ruvC crossover junct 90.7 0.051 1.3E-06 32.9 -0.4 109 19-131 1-118 (158)
17 PRK13310 N-acetyl-D-glucosamin 89.5 1.6 4E-05 23.9 6.3 91 19-110 2-104 (302)
18 PRK05082 N-acetylmannosamine k 89.3 1.9 4.8E-05 23.4 6.7 97 17-114 1-108 (291)
19 PRK00290 dnaK molecular chaper 88.5 0.55 1.4E-05 26.6 3.5 24 16-39 1-24 (631)
20 PRK09557 fructokinase; Reviewe 88.0 2.3 5.9E-05 22.8 6.7 100 19-119 2-113 (301)
21 CHL00094 dnaK heat shock prote 87.7 0.67 1.7E-05 26.1 3.5 23 16-38 1-23 (622)
22 pfam00480 ROK ROK family. 86.3 2.9 7.4E-05 22.2 6.9 100 21-121 1-112 (181)
23 PRK13410 molecular chaperone D 85.8 0.99 2.5E-05 25.0 3.5 25 16-40 1-25 (719)
24 PRK13411 molecular chaperone D 85.2 1 2.6E-05 25.0 3.3 23 16-38 1-23 (655)
25 TIGR01821 5aminolev_synth 5-am 84.6 2.2 5.7E-05 22.9 4.9 67 41-114 152-218 (427)
26 COG1214 Inactive homolog of me 84.5 1.4 3.6E-05 24.1 3.8 95 17-120 1-102 (220)
27 PRK13311 N-acetyl-D-glucosamin 83.4 3.9 0.0001 21.4 7.1 59 19-78 2-67 (256)
28 COG0418 PyrC Dihydroorotase [N 81.0 1.4 3.7E-05 24.1 2.7 79 55-149 116-202 (344)
29 COG0443 DnaK Molecular chapero 80.6 0.59 1.5E-05 26.4 0.7 23 16-38 4-26 (579)
30 KOG0237 consensus 79.6 5.4 0.00014 20.6 6.4 73 53-126 54-143 (788)
31 pfam07318 DUF1464 Protein of u 79.5 2.3 6E-05 22.8 3.4 96 21-120 1-116 (336)
32 pfam04848 Pox_A22 Poxvirus A22 78.0 6 0.00015 20.3 6.3 85 18-110 2-88 (143)
33 TIGR02089 TTC tartrate dehydro 77.2 2.1 5.3E-05 23.1 2.6 54 87-146 167-233 (355)
34 TIGR01865 cas_Csn1 CRISPR-asso 76.6 5.2 0.00013 20.7 4.5 34 17-50 1-34 (1140)
35 PTZ00186 heat shock 70 kDa pre 75.5 4.2 0.00011 21.3 3.8 25 14-38 24-48 (657)
36 PRK00292 glk glucokinase; Prov 73.1 8.1 0.00021 19.5 5.0 61 16-76 1-63 (315)
37 cd01987 USP_OKCHK USP domain i 72.0 8.6 0.00022 19.4 5.2 51 54-109 72-123 (124)
38 TIGR02026 BchE magnesium-proto 69.6 9.8 0.00025 19.0 5.9 52 53-109 229-280 (506)
39 PRK06395 phosphoribosylamine-- 68.9 10 0.00026 19.0 6.3 72 54-127 52-142 (435)
40 TIGR02858 spore_III_AA stage I 66.5 11 0.00029 18.7 6.2 83 25-113 71-168 (282)
41 PRK11678 putative chaperone; P 66.3 5.6 0.00014 20.5 2.7 25 19-43 2-26 (450)
42 pfam03932 CutC CutC family. Co 65.9 12 0.00029 18.6 8.6 83 40-129 49-143 (202)
43 PRK13529 malate dehydrogenase; 62.7 13 0.00034 18.2 4.3 124 14-154 153-288 (563)
44 cd00378 SHMT Serine-glycine hy 62.6 13 0.00034 18.2 5.7 65 8-76 97-170 (402)
45 cd01844 SGNH_hydrolase_like_6 61.9 14 0.00035 18.1 6.3 54 57-110 47-100 (177)
46 pfam00012 HSP70 Hsp70 protein. 61.9 7.2 0.00018 19.8 2.6 20 19-38 1-20 (598)
47 TIGR00856 pyrC_dimer dihydroor 61.9 14 0.00035 18.1 5.1 93 55-155 116-222 (364)
48 TIGR01766 tspaseT_teng_C trans 58.0 16 0.00041 17.7 5.0 62 54-115 11-86 (86)
49 PRK13928 rod shape-determining 57.8 16 0.00041 17.7 9.2 101 16-117 2-110 (325)
50 PRK13930 rod shape-determining 56.9 17 0.00043 17.6 10.3 103 14-117 6-116 (336)
51 PRK10490 sensor protein KdpD; 55.9 17 0.00045 17.5 4.3 52 53-110 322-373 (895)
52 cd03301 ABC_MalK_N The N-termi 55.2 7.8 0.0002 19.6 1.9 88 22-119 67-164 (213)
53 PRK05451 dihydroorotase; Provi 54.5 18 0.00047 17.4 4.8 52 73-126 130-189 (345)
54 PRK13034 serine hydroxymethylt 53.9 19 0.00048 17.3 4.5 68 8-76 109-180 (422)
55 COG3513 Predicted CRISPR-assoc 53.8 13 0.00034 18.2 2.9 24 15-38 2-25 (1088)
56 PTZ00317 malic enzyme; Provisi 53.5 17 0.00043 17.6 3.4 124 14-154 152-287 (570)
57 COG1693 Repressor of nif and g 53.1 12 0.0003 18.6 2.5 19 16-34 271-289 (325)
58 PTZ00294 glycerol kinase; Prov 52.4 17 0.00044 17.5 3.3 25 16-40 1-25 (510)
59 pfam00370 FGGY_N FGGY family o 51.1 13 0.00034 18.2 2.5 22 19-40 2-23 (245)
60 COG3839 MalK ABC-type sugar tr 50.8 21 0.00054 17.0 3.8 90 24-121 72-169 (338)
61 COG1831 Predicted metal-depend 50.5 21 0.00054 17.0 5.8 73 55-128 109-186 (285)
62 pfam07282 Transposase_35 Putat 49.7 16 0.0004 17.8 2.7 47 100-146 13-68 (69)
63 TIGR01264 tyr_amTase_E tyrosin 47.8 24 0.0006 16.7 5.1 74 34-114 144-220 (415)
64 pfam07355 GRDB Glycine/sarcosi 47.6 24 0.0006 16.7 4.9 54 49-107 62-115 (349)
65 PRK00976 hypothetical protein; 47.3 24 0.00061 16.7 4.1 55 19-76 3-59 (324)
66 PRK10939 autoinducer-2 (AI-2) 47.2 16 0.00041 17.7 2.4 25 15-39 1-25 (521)
67 pfam06723 MreB_Mbl MreB/Mbl pr 47.1 24 0.00062 16.7 8.3 100 17-118 1-109 (327)
68 cd00458 SugarP_isomerase Sugar 47.0 24 0.00062 16.7 4.2 53 55-114 8-61 (169)
69 PRK00047 glpK glycerol kinase; 46.6 25 0.00063 16.6 5.4 28 13-40 1-28 (498)
70 TIGR03275 methan_mark_8 putati 46.5 25 0.00063 16.6 3.9 55 20-93 157-211 (259)
71 PRK13789 phosphoribosylamine-- 46.3 25 0.00063 16.6 5.4 103 54-157 55-184 (426)
72 pfam00464 SHMT Serine hydroxym 46.2 25 0.00064 16.6 4.2 68 8-76 98-169 (380)
73 PRK12767 carbamoyl phosphate s 45.6 26 0.00065 16.5 6.2 56 13-75 22-77 (325)
74 PRK11572 copper homeostasis pr 45.4 26 0.00066 16.5 8.4 84 40-130 50-145 (248)
75 TIGR02166 dmsA_ynfE anaerobic 44.5 20 0.0005 17.2 2.5 84 69-155 221-308 (849)
76 PRK13929 rod-share determining 44.2 27 0.00068 16.4 7.5 100 16-117 3-113 (335)
77 PRK13790 phosphoribosylamine-- 43.6 27 0.0007 16.3 4.6 74 53-127 49-139 (415)
78 cd01988 Na_H_Antiporter_C The 42.0 29 0.00074 16.2 4.4 51 54-109 81-131 (132)
79 pfam03709 OKR_DC_1_N Orn/Lys/A 41.8 29 0.00074 16.2 6.1 59 44-117 18-77 (111)
80 TIGR01975 isoAsp_dipep beta-as 41.8 29 0.00075 16.2 4.5 64 27-109 151-221 (391)
81 pfam01385 Transposase_2 Probab 41.4 30 0.00075 16.1 3.6 23 15-38 121-143 (226)
82 PRK03317 histidinol-phosphate 40.6 31 0.00078 16.0 5.1 62 49-116 142-203 (369)
83 pfam00390 malic Malic enzyme, 40.3 31 0.00079 16.0 4.8 98 13-124 72-180 (182)
84 pfam08968 DUF1885 Domain of un 39.8 20 0.00051 17.2 2.0 76 27-112 32-121 (128)
85 PRK00885 phosphoribosylamine-- 38.7 33 0.00083 15.9 6.0 101 54-155 49-176 (424)
86 TIGR01187 potA polyamine ABC t 37.9 34 0.00086 15.8 4.9 72 38-117 61-132 (331)
87 TIGR00063 folE GTP cyclohydrol 37.3 22 0.00055 17.0 1.8 39 65-121 87-126 (183)
88 COG2205 KdpD Osmosensitive K+ 36.6 35 0.0009 15.7 6.2 58 53-115 320-378 (890)
89 PTZ00094 serine hydroxymethylt 36.6 35 0.0009 15.7 4.9 65 7-76 112-188 (450)
90 TIGR01769 GGGP geranylgeranylg 36.4 35 0.0009 15.7 5.9 96 50-153 9-111 (212)
91 COG4020 Uncharacterized protei 36.4 36 0.00091 15.6 4.3 59 18-76 4-66 (332)
92 PRK13330 consensus 36.0 36 0.00092 15.6 7.1 85 19-113 2-94 (257)
93 COG0112 GlyA Glycine/serine hy 35.7 36 0.00093 15.6 4.0 68 8-76 104-175 (413)
94 pfam09298 DUF1969 Domain of un 35.7 17 0.00044 17.5 1.1 15 24-38 13-27 (101)
95 TIGR01579 MiaB-like-C MiaB-lik 34.9 38 0.00096 15.5 3.7 78 54-141 352-433 (492)
96 PRK10331 L-fuculokinase; Provi 34.1 39 0.00099 15.4 2.9 24 17-40 2-25 (470)
97 PRK00011 glyA serine hydroxyme 33.8 39 0.001 15.4 4.7 67 8-76 103-174 (415)
98 cd02064 Flavokinase_C Riboflav 33.3 40 0.001 15.3 4.4 64 59-127 90-159 (179)
99 PRK13317 pantothenate kinase; 32.8 41 0.001 15.3 8.1 87 16-122 1-87 (273)
100 KOG3289 consensus 32.1 42 0.0011 15.2 5.1 101 55-156 59-171 (199)
101 TIGR02350 prok_dnaK chaperone 31.7 42 0.0011 15.2 3.1 53 18-78 1-53 (598)
102 PRK04870 histidinol-phosphate 31.4 43 0.0011 15.2 5.7 60 50-116 136-195 (356)
103 PRK05183 hscA chaperone protei 31.2 30 0.00077 16.1 1.8 23 16-38 18-40 (621)
104 cd02016 TPP_E1_OGDC_like Thiam 31.1 38 0.00096 15.5 2.2 50 24-90 10-59 (265)
105 cd01017 AdcA Metal binding pro 30.6 44 0.0011 15.1 7.3 49 79-129 198-246 (282)
106 PRK09267 flavodoxin FldA; Vali 30.0 45 0.0012 15.0 3.4 13 69-81 48-60 (169)
107 PTZ00062 glutaredoxin; Provisi 29.1 47 0.0012 14.9 2.9 62 53-123 114-175 (219)
108 TIGR01510 coaD_prev_kdtB pante 28.8 48 0.0012 14.9 4.2 54 68-121 26-83 (163)
109 cd01019 ZnuA Zinc binding prot 27.9 49 0.0013 14.8 7.3 49 79-129 206-254 (286)
110 PRK04152 consensus 27.6 50 0.0013 14.8 5.1 57 53-114 138-194 (364)
111 COG0151 PurD Phosphoribosylami 27.4 50 0.0013 14.7 7.5 73 54-128 50-140 (428)
112 PRK13334 consensus 27.1 51 0.0013 14.7 9.0 89 15-113 2-95 (262)
113 cd01487 E1_ThiF_like E1_ThiF_l 26.9 52 0.0013 14.7 4.4 58 89-150 53-111 (174)
114 cd01025 TOPRIM_recR TOPRIM_rec 26.8 52 0.0013 14.7 5.6 44 38-83 29-72 (112)
115 cd01829 SGNH_hydrolase_peri2 S 26.6 52 0.0013 14.6 6.1 47 54-100 95-141 (200)
116 pfam05913 DUF871 Bacterial pro 26.6 52 0.0013 14.6 7.5 93 24-124 167-261 (357)
117 COG3703 ChaC Uncharacterized p 26.6 52 0.0013 14.6 2.8 23 14-36 55-77 (190)
118 TIGR01756 LDH_protist lactate 26.6 50 0.0013 14.7 2.2 29 53-81 86-117 (314)
119 PRK13927 rod shape-determining 26.3 53 0.0013 14.6 9.7 102 14-117 5-118 (338)
120 TIGR03281 methan_mark_12 putat 25.8 43 0.0011 15.1 1.8 55 19-76 1-59 (326)
121 pfam08426 ICE2 ICE2. ICE2 is a 25.7 12 0.0003 18.5 -1.1 28 20-47 19-46 (409)
122 cd01018 ZntC Metal binding pro 25.5 55 0.0014 14.5 6.6 74 58-142 179-252 (266)
123 TIGR00502 nagB glucosamine-6-p 25.4 54 0.0014 14.6 2.2 88 55-147 16-123 (260)
124 TIGR01098 3A0109s03R phosphona 25.4 55 0.0014 14.5 4.5 50 64-116 46-96 (299)
125 TIGR01759 MalateDH-SF1 malate 24.9 36 0.00093 15.6 1.3 39 58-102 113-152 (329)
126 PRK10824 hypothetical protein; 24.8 56 0.0014 14.4 3.6 61 54-123 3-63 (115)
127 pfam00940 RNA_pol DNA-dependen 24.6 57 0.0014 14.4 2.6 33 56-88 67-99 (391)
128 PTZ00009 heat shock 70 kDa pro 24.4 57 0.0015 14.4 3.2 22 17-38 4-25 (657)
129 pfam07066 Phage_Lacto_M3 Lacto 24.3 58 0.0015 14.4 4.6 54 16-70 1-67 (162)
130 pfam03343 SART-1 SART-1 family 24.2 58 0.0015 14.4 4.5 30 99-128 526-558 (577)
131 cd06295 PBP1_CelR Ligand bindi 24.1 58 0.0015 14.4 9.7 85 55-152 52-151 (275)
132 PRK08153 histidinol-phosphate 24.1 58 0.0015 14.4 5.6 57 54-115 143-199 (370)
133 TIGR00591 phr2 deoxyribodipyri 23.8 59 0.0015 14.3 2.9 41 84-124 297-345 (471)
134 PRK10812 putative metallodepen 23.8 59 0.0015 14.3 7.1 73 53-128 74-148 (265)
135 PRK01433 hscA chaperone protei 23.3 60 0.0015 14.3 3.0 22 17-38 19-40 (595)
136 PRK09404 kgd alpha-ketoglutara 23.1 61 0.0015 14.2 2.5 62 89-152 146-216 (931)
137 TIGR01212 TIGR01212 radical SA 22.8 61 0.0016 14.2 3.7 62 17-78 109-192 (307)
138 COG0567 SucA 2-oxoglutarate de 22.7 61 0.0016 14.2 2.1 65 86-151 126-196 (906)
139 cd05695 S1_Rrp5_repeat_hs3 S1_ 22.6 52 0.0013 14.6 1.7 16 16-31 50-65 (66)
140 PRK05166 histidinol-phosphate 22.2 63 0.0016 14.1 5.0 58 51-114 146-203 (374)
141 COG0675 Transposase and inacti 22.1 64 0.0016 14.1 5.5 132 13-152 171-350 (364)
142 TIGR01266 fum_ac_acetase fumar 22.0 41 0.001 15.3 1.1 64 24-99 27-93 (436)
143 TIGR02852 spore_dpaB dipicolin 22.0 64 0.0016 14.1 5.0 54 27-92 1-55 (188)
144 KOG4013 consensus 21.5 65 0.0017 14.0 6.2 63 60-129 87-153 (255)
145 KOG1856 consensus 21.2 66 0.0017 14.0 7.0 93 18-114 604-712 (1299)
146 pfam00899 ThiF ThiF family. Th 21.2 66 0.0017 14.0 3.2 57 89-149 56-112 (134)
147 TIGR03031 cas_csx12 CRISPR-ass 21.2 40 0.001 15.3 0.9 16 20-35 2-17 (802)
148 PRK13321 pantothenate kinase; 21.2 67 0.0017 14.0 7.9 84 19-112 2-91 (256)
149 COG2441 Predicted butyrate kin 20.9 63 0.0016 14.1 1.8 59 20-79 1-69 (374)
150 TIGR03123 one_C_unchar_1 proba 20.4 69 0.0018 13.9 8.4 92 20-117 1-102 (318)
151 cd07025 Peptidase_S66 LD-Carbo 20.3 69 0.0018 13.9 4.0 42 66-109 222-263 (282)
No 1
>PRK00109 Holliday junction resolvase-like protein; Reviewed
Probab=100.00 E-value=1.5e-44 Score=290.13 Aligned_cols=140 Identities=37% Similarity=0.508 Sum_probs=133.3
Q ss_pred CCCCCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHH
Q ss_conf 89983999986887689996307651101210110477415689899998651677589972135788888447999999
Q gi|254780976|r 14 KPNQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAF 93 (160)
Q Consensus 14 ~~~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f 93 (160)
+.++|+||||||+||||+|+||+.+.+|+|++|+.+++....+.+|.+++++|+|+.||||+|++++|+++++++.++.|
T Consensus 1 ~~~griLgiD~G~kriGlAisD~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iViGlP~~~~g~~~~~~~~v~~F 80 (141)
T PRK00109 1 MMSGRILGFDVGTKRIGVAIGDPLGGTARPLETIKAQNGEPDWDALEKLLKEWQPDLLVVGLPLNMDGTEGPMTERARKF 80 (141)
T ss_pred CCCCCEEEEEECCCEEEEEEECCCCCEEECCEEEECCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHH
T ss_conf 99888899981899799999469998474414377277068999999999983999899944799998844889999999
Q ss_pred HHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 9999840489977998367167999999998698644546437899899999999968645
Q gi|254780976|r 94 VHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSIAAALILQEVLDRISF 154 (160)
Q Consensus 94 ~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~AA~iILq~~Ld~~~~ 154 (160)
++.|. +.+++||++|||||||++|++.|++.|++++++|+.+|++||++|||+|||+.+.
T Consensus 81 ~~~L~-~~~~l~v~~~DEr~TS~~A~~~l~~~~~~~~~~k~~iD~~AA~iILq~~Ld~~~~ 140 (141)
T PRK00109 81 ANRLE-GRFGLPVELVDERLTTVEAERMLFEAGVSRAKRKGVVDSLAAVIILQSYLDSRPK 140 (141)
T ss_pred HHHHH-HHHCCCEEEEECCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC
T ss_conf 99999-9729987986243279999999997365332125226799999999999972767
No 2
>pfam03652 UPF0081 Uncharacterized protein family (UPF0081).
Probab=100.00 E-value=3.6e-43 Score=281.88 Aligned_cols=134 Identities=44% Similarity=0.616 Sum_probs=129.0
Q ss_pred CCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHH
Q ss_conf 83999986887689996307651101210110477415689899998651677589972135788888447999999999
Q gi|254780976|r 17 QPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHN 96 (160)
Q Consensus 17 ~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~ 96 (160)
+|+||||||+||||+|+||+.+.+|+|++|+.+++..+.+.+|.+++.+|+|+.||||+|++|||+++++++.++.|++.
T Consensus 1 ~riLgiD~G~kriGvAisd~~~~~a~Pl~~i~~~~~~~~~~~i~~ii~e~~i~~iVvGlP~~~dG~~~~~~~~v~~f~~~ 80 (134)
T pfam03652 1 GRILGLDVGTKRIGVAISDPLGILASPLETIERKNGKPDLEELAELIKEWQPDGIVVGLPLNMDGSEGEQTKRVRKFARR 80 (134)
T ss_pred CCEEEEECCCCEEEEEEECCCCCEEECEEEEECCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHH
T ss_conf 97999980899799999359998250248898588558999999999983999899955899998808899999999999
Q ss_pred HHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 9840489977998367167999999998698644546437899899999999968
Q gi|254780976|r 97 MIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSIAAALILQEVLDR 151 (160)
Q Consensus 97 L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~AA~iILq~~Ld~ 151 (160)
|. ..+++||++|||||||++|++.|++.|++++++++.+|++||++|||+|||.
T Consensus 81 L~-~~~~i~v~~~DEr~TS~~A~~~l~~~~~~~~k~k~~iD~~AA~iILq~~L~~ 134 (134)
T pfam03652 81 LK-KRFGLPVELVDERLTTVEAERILREAGLSRKKRKEVVDSIAAVIILQSYLDS 134 (134)
T ss_pred HH-HHHCCCEEEEECCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHCC
T ss_conf 99-8619986886343279999999997374421025216799999999998668
No 3
>COG0816 Predicted endonuclease involved in recombination (possible Holliday junction resolvase in Mycoplasmas and B. subtilis) [DNA replication, recombination, and repair]
Probab=100.00 E-value=5.9e-41 Score=268.46 Aligned_cols=138 Identities=43% Similarity=0.685 Sum_probs=131.3
Q ss_pred CCCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCC-HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHH
Q ss_conf 98399998688768999630765110121011047741-56898999986516775899721357888884479999999
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVT-QTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFV 94 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~-~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~ 94 (160)
++++||+|||+||||+|+||+...+|+|++++.+.+.. ..+..|.+++++|+++.+|||+|+||+|+++++++.++.|+
T Consensus 1 ~~~ilalD~G~KrIGvA~sd~~~~~A~pl~~i~~~~~~~~~~~~l~~li~~~~~~~vVVGlP~~m~g~~~~~~~~~~~f~ 80 (141)
T COG0816 1 GMRILALDVGTKRIGVAVSDILGSLASPLETIKRKNGKPQDFNALLKLVKEYQVDTVVVGLPLNMDGTEGPRAELARKFA 80 (141)
T ss_pred CCEEEEEECCCCEEEEEEECCCCCCCCCHHHEEECCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHH
T ss_conf 96499996278748899764777521241430203353756999999999838887999667688887134699999999
Q ss_pred HHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 999840489977998367167999999998698644546437899899999999968645
Q gi|254780976|r 95 HNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSIAAALILQEVLDRISF 154 (160)
Q Consensus 95 ~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~AA~iILq~~Ld~~~~ 154 (160)
+.|. +.+++||++||||+||++|++.|.+.|++++++|+.+|++||++|||+|||+...
T Consensus 81 ~~L~-~r~~lpv~l~DERltTv~A~~~L~~~~~~~~~rk~~iD~~AA~~ILq~~ld~~~~ 139 (141)
T COG0816 81 ERLK-KRFNLPVVLWDERLSTVEAERMLIEAGVSRKKRKGVIDSLAAVLILESYLDRNHA 139 (141)
T ss_pred HHHH-HHCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHCCHHHHHHHHHHHHHHHHHHHH
T ss_conf 9998-7649878998186579999999997587344434143899999999999975200
No 4
>TIGR00250 TIGR00250 conserved hypothetical protein TIGR00250; InterPro: IPR005227 Holliday junction resolvases (HJRs) are key enzymes of DNA recombination. The principal HJRs are now known or confidently predicted for all bacteria and archaea whose genomes have been completely sequenced, with many species encoding multiple potential HJRs. Structural and evolutionary relationships of HJRs and related nucleases suggests that the HJR function has evolved independently from at least four distinct structural folds, namely RNase H, endonuclease, endonuclease VIIcolicin E and RusA (IPR008822 from INTERPRO): The endonuclease fold, whose structural prototypes are the phage exonuclease, the very short patch repair nuclease (Vsr) and type II restriction enzymes, is shown to encompass by far a greater diversity of nucleases than previously suspected. This fold unifies archaeal HJRs (IPR002732 from INTERPRO), repair nucleases such as RecB (IPR004586 from INTERPRO) and Vsr (IPR004603 from INTERPRO), restriction enzymes and a variety of predicted nucleases whose specific activities remain to be determined. The RNase H fold characterises the RuvC family (IPR002176 from INTERPRO), which is nearly ubiquitous in bacteria, and in addition the YqgF family (IPR005227 from INTERPRO). The proteins of this family, typified by Escherichia coli YqgF, are likely to function as an alternative to RuvC in most bacteria, but could be the principal HJRs in low-GC Gram-positive bacteria and Aquifex. Endonuclease VII of phage T4 (IPR004211 from INTERPRO) is shown to serve as a structural template for many nucleases, including McrA and other type II restriction enzymes. Together with colicin E7, endonuclease VII defines a distinct metal-dependent nuclease fold. Horizontal gene transfer, lineage-specific gene loss and gene family expansion, and non-orthologous gene displacement seem to have been major forces in the evolution of HJRs and related nucleases. A remarkable case of displacement is seen in the Lyme disease spirochete Borrelia burgdorferi, which does not possess any of the typical HJRs, but instead encodes, in its chromosome and each of the linear plasmids, members of the exonuclease family predicted to function as HJRs. The diversity of HJRs and related nucleases in bacteria and archaea contrasts with their near absence in eukaryotes. The few detected eukaryotic representatives of the endonuclease fold and the RNase H fold have probably been acquired from bacteria via horizontal gene transfer. The identity of the principal HJR(s) involved in recombination in eukaryotes remains uncertain; this function could be performed by topoisomerase IB or by a novel, so far undetected, class of enzymes. Likely HJRs and related nucleases were identified in the genomes of numerous bacterial and eukaryotic DNA viruses. Gene flow between viral and cellular genomes has probably played a major role in the evolution of this class of enzymes. This family represents the YqgF family of putative Holliday junction resolvases. With the exception of the spirochetes, the YqgF family is represented in all bacterial lineages, including the mycoplasmas with their highly degenerate genomes. The RuvC resolvases are conspicuously absent in the low-GC Gram-positive bacterial lineage, with the exception of Ureaplasma urealyticum (Q9PQY7 from SWISSPROT, ). Furthermore, loss of function ruvC mutants of E. coli show a residual HJR activity that cannot be ascribed to the prophage-encoded RusA resolvase . This suggests that the YqgF family proteins could be alternative HJRs whose function partially overlaps with that of RuvC . ; GO: 0000150 recombinase activity, 0003677 DNA binding, 0004518 nuclease activity, 0006281 DNA repair, 0006310 DNA recombination, 0006974 response to DNA damage stimulus.
Probab=100.00 E-value=1.1e-38 Score=254.65 Aligned_cols=130 Identities=32% Similarity=0.452 Sum_probs=123.9
Q ss_pred EEEECCCCEEEEE-EEECCCCEECCCEEEECCCCCHHH--HHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHH
Q ss_conf 9998688768999-630765110121011047741568--9899998651677589972135788888447999999999
Q gi|254780976|r 20 ASIDLGTKRIGLA-ISDPGRRFAHPRPFLVRKKVTQTA--LELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHN 96 (160)
Q Consensus 20 LgiD~G~kriGiA-isd~~~~~a~Pl~~i~~~~~~~~~--~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~ 96 (160)
||+|+|+|+|||| -||..+.+|.|+++|.+....++| ..|.++++||+|+.||||+|+||+|++++++..++.||+.
T Consensus 1 LglD~GtK~iGvA~g~~~tG~tA~gi~tik~~~~~~d~Gl~~i~~l~ke~~~d~~vvGlP~nM~Gt~g~~~~~~~kFA~r 80 (133)
T TIGR00250 1 LGLDLGTKSIGVAVGQDITGLTAQGIPTIKAQDGEPDWGLSRIEELLKEWKVDKIVVGLPLNMDGTVGPLTKRAQKFAKR 80 (133)
T ss_pred CCCCCCCCEEEEEECCCCCCCEECCCHHHHHCCCCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHHHHHHHHH
T ss_conf 96324641354676366755210440311210488862179999987315889788617878788725026888999988
Q ss_pred HHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 984048997799836716799999999869864454643789989999999996
Q gi|254780976|r 97 MIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSIAAALILQEVLD 150 (160)
Q Consensus 97 L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~AA~iILq~~Ld 150 (160)
+. ..+++||.+|||||||.+|++.|.+.+..+..+|+.||+.||++|||+|||
T Consensus 81 ~~-~~~~v~v~l~DERL~t~~A~~~L~~~~~~~~~~Kg~iD~~aA~~IL~~yld 133 (133)
T TIGR00250 81 LE-GRFGVPVELMDERLSTVEAESQLFARGGFRALRKGKIDKVAAVIILQSYLD 133 (133)
T ss_pred HH-HHHCCCEEEECCCCCHHHHHHHHHHHCCHHHHHCCCHHHHHHHHHHHHHCC
T ss_conf 65-540664378617613789999888630011221574247899999876329
No 5
>smart00732 YqgFc Likely ribonuclease with RNase H fold. YqgF proteins are likely to function as an alternative to RuvC in most bacteria, and could be the principal holliday junction resolvases in low-GC Gram-positive bacteria. In Spt6p orthologues, the catalytic residues are substituted indicating that they lack enzymatic functions.
Probab=99.95 E-value=1.5e-27 Score=187.44 Aligned_cols=99 Identities=33% Similarity=0.525 Sum_probs=92.7
Q ss_pred CCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHH
Q ss_conf 83999986887689996307651101210110477415689899998651677589972135788888447999999999
Q gi|254780976|r 17 QPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHN 96 (160)
Q Consensus 17 ~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~ 96 (160)
+|+||||||+||||+|+||+.+.+|+|+.++.+++..+.+..|.+++++|+|+.||||+|++|||+++++++ +.|++.
T Consensus 1 krvLgiD~G~kriG~Ai~D~~~~~a~pl~~i~~~~~~~~~~~l~~~i~~~~~~~iviG~P~~~~g~~~~~~~--~~f~~~ 78 (99)
T smart00732 1 KRVLGLDPGRKGIGVAVVDETGKLADPLEVIPRTNKEADAARLKKLIKKYQPDLIVIGLPLNMNGTASRETE--EAFAEL 78 (99)
T ss_pred CCEEEEEECCCCEEEEEEECCCCEECCCEEEECCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCHHHH--HHHHHH
T ss_conf 979999217896999999899973213189985982699999999999849988997475248998199999--999999
Q ss_pred HHHHCCCCCEEEECCCCCHHHH
Q ss_conf 9840489977998367167999
Q gi|254780976|r 97 MIDRKVYVPFVFWDERLTTVSA 118 (160)
Q Consensus 97 L~~~~~~l~v~~~DEr~TS~~A 118 (160)
| ++.+++||++|||||||++|
T Consensus 79 l-~~~~~i~v~~~DEr~tS~~A 99 (99)
T smart00732 79 L-KERFNLPVVLVDERLATVYA 99 (99)
T ss_pred H-HHCCCCCEEEEECCCCCCCC
T ss_conf 8-51789988999589724249
No 6
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=97.70 E-value=0.0021 Score=41.29 Aligned_cols=119 Identities=20% Similarity=0.231 Sum_probs=82.7
Q ss_pred CCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHH
Q ss_conf 83999986887689996307651101210110477415689899998651677589972135788888447999999999
Q gi|254780976|r 17 QPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHN 96 (160)
Q Consensus 17 ~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~ 96 (160)
.-|+|||+|+ .+|+|+-|-.+.. +.+..+++ -...++..+|.+|+--.+|-- |= ++.-+.|+ +
T Consensus 244 ~lIVGIDPGi-ttgiAvldldGev---l~~~S~r~--~~~~eVve~I~~lG~PvvVAt-----DV--tp~P~~V~----K 306 (652)
T COG2433 244 SLIVGIDPGI-TTGIAVLDLDGEV---LDLESRRG--IDRSEVVEFISELGKPVVVAT-----DV--TPAPETVK----K 306 (652)
T ss_pred CEEEEECCCC-EEEEEEEECCCCE---EEEECCCC--CCHHHHHHHHHHCCCCEEEEE-----CC--CCCCHHHH----H
T ss_conf 4489737985-3357897148818---87532167--888999999998398369980-----68--89807899----9
Q ss_pred HHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 984048997799836716799999999869864454643789989999999996864566
Q gi|254780976|r 97 MIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSIAAALILQEVLDRISFLE 156 (160)
Q Consensus 97 L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~AA~iILq~~Ld~~~~~~ 156 (160)
+. ..|+-+.+-=|+++|+.+-...++..+.+.....+ -|++||++ ..|+.+.+.+.
T Consensus 307 iA-asf~A~ly~P~~dLsveEK~~~~r~~~~~~~ddH~-RDALAAA~--kAY~~yk~kl~ 362 (652)
T COG2433 307 IA-ASFNAVLYTPDRDLSVEEKQEALRTLKISVSDDHE-RDALAAAY--KAYLAYKPKLE 362 (652)
T ss_pred HH-HHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCHH-HHHHHHHH--HHHHHHHHHHH
T ss_conf 99-97398423785547878889988516888899447-78999999--99999899999
No 7
>PRK00039 ruvC Holliday junction resolvase; Reviewed
Probab=97.53 E-value=0.00031 Score=46.27 Aligned_cols=103 Identities=14% Similarity=0.116 Sum_probs=61.5
Q ss_pred CCCEEEEECCCCEEEEEEEECCCC--EECCCEEEECCCCC-------HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHH
Q ss_conf 983999986887689996307651--10121011047741-------568989999865167758997213578888844
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGRR--FAHPRPFLVRKKVT-------QTALELLSFITTENIAAFIIGLPLNMNGSEGPR 86 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~~--~a~Pl~~i~~~~~~-------~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~ 86 (160)
+.||||||+|..++|.|+-+.... ...-.++|...... .....+.+++++|+|+.++|=-+.--.+..+..
T Consensus 1 mMrILGIDPGl~~tG~gvie~~~~~~~~i~~G~I~t~~~~~~~~RL~~I~~~l~~li~~~~P~~~AiE~~F~~kN~~tai 80 (169)
T PRK00039 1 MMRILGIDPGLRRTGYGVIEVEGRKLSYVASGVIRTTSDDDLPERLKQIFDGLSEVIDQYQPDEAAIEQVFVNKNPQSTL 80 (169)
T ss_pred CCEEEEECCCCCCEEEEEEEEECCEEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHCCCHHHHH
T ss_conf 95899988366860899999769968999988995799999899999999999999996599656453788733688999
Q ss_pred -HHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHH
Q ss_conf -7999999999984048997799836716799999999
Q gi|254780976|r 87 -VHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILI 123 (160)
Q Consensus 87 -~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~ 123 (160)
...++--+- +.-...++||+.+ |..+.+..+-
T Consensus 81 ~lgqARGvil-laa~~~~i~v~ey----sP~~VK~ait 113 (169)
T PRK00039 81 KLGQARGVAI-LAAARRGLPVAEY----TPLQVKKAVV 113 (169)
T ss_pred HHHHHHHHHH-HHHHHHCCCEEEE----CCCHHHHHHH
T ss_conf 9999999999-9999838975888----7074455331
No 8
>cd00529 RuvC_resolvase Holliday junction resolvases (HJRs) are endonucleases that specifically resolve Holliday junction DNA intermediates during homologous recombination. HJR's occur in archaea, bacteria, and in the mitochondria of certain fungi, however this CD includes only the bacterial and mitochondrial HJR's. These are referred to as the RuvC family of Holliday junction resolvases, RuvC being the E.coli HJR. RuvC and its orthologs are homodimers and are structurely similar to RNase H and Hsp70.
Probab=97.51 E-value=0.0017 Score=41.75 Aligned_cols=104 Identities=12% Similarity=0.109 Sum_probs=64.5
Q ss_pred CEEEEECCCCEEEEEEEECCCCEEC--CCEEEECCCCC-------HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCH-HH
Q ss_conf 3999986887689996307651101--21011047741-------56898999986516775899721357888884-47
Q gi|254780976|r 18 PIASIDLGTKRIGLAISDPGRRFAH--PRPFLVRKKVT-------QTALELLSFITTENIAAFIIGLPLNMNGSEGP-RV 87 (160)
Q Consensus 18 riLgiD~G~kriGiAisd~~~~~a~--Pl~~i~~~~~~-------~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~-~~ 87 (160)
||||||+|..++|.|+-+....-.. -.+++...... .....+.+++++|+|+.++|=-+.--.+..+. ..
T Consensus 1 RILGIDPGl~~~G~aiie~~~~~~~~l~~g~I~t~~~~~~~~RL~~I~~~l~~li~~~~P~~vaiE~~f~~~n~~tal~l 80 (154)
T cd00529 1 RILGIDPGSRNTGYGVIEQEGRKLIYLASGVIRTSSDAPLPSRLKTIYDGLNEVIDQFQPDVVAIERVFFAKNPDSALKL 80 (154)
T ss_pred CEEEECCCCCCCEEEEEEEECCEEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEHHHHHCCHHHHHHH
T ss_conf 96998756687159999957997899998889669999989999999999999998349978986546560188889999
Q ss_pred HHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCC
Q ss_conf 999999999984048997799836716799999999869
Q gi|254780976|r 88 HSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMN 126 (160)
Q Consensus 88 ~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g 126 (160)
.+++--+-... ...++||..+ |..+.++..-..|
T Consensus 81 g~arGvi~~~~-~~~~i~v~ey----~P~~VK~~vtG~G 114 (154)
T cd00529 81 GQARGALILAL-ANRNLPVFEY----TPNQVKKAVTGYG 114 (154)
T ss_pred HHHHHHHHHHH-HHHCCCCEEE----CHHHHHHHHHCCC
T ss_conf 99999999999-9817986168----8899889986797
No 9
>COG2183 Tex Transcriptional accessory protein [Transcription]
Probab=97.41 E-value=0.0035 Score=39.92 Aligned_cols=104 Identities=18% Similarity=0.255 Sum_probs=69.8
Q ss_pred CHHHHHHHCCCCC-CEEEEECCCCE-EEEEEEECCCCEECCCEEEECC---CCCHHHHHHHHHHHHHCCCEEEEEECCCC
Q ss_conf 8899998358998-39999868876-8999630765110121011047---74156898999986516775899721357
Q gi|254780976|r 5 LIEDLVKSLKPNQ-PIASIDLGTKR-IGLAISDPGRRFAHPRPFLVRK---KVTQTALELLSFITTENIAAFIIGLPLNM 79 (160)
Q Consensus 5 ~~~~~~~~~~~~~-riLgiD~G~kr-iGiAisd~~~~~a~Pl~~i~~~---~~~~~~~~l~~ii~e~~i~~iVvGlP~~~ 79 (160)
++++++-..|.++ .+||+|+|.+. +=+|+.|..+.+.---.+++.. ........|..++..|+|..+.||
T Consensus 317 nL~~lLl~aP~~~~~~lglDPg~rtG~k~Avvd~tGk~l~~~~Iyp~~p~~~~~~~~~~l~~l~~~~~Ve~iaIG----- 391 (780)
T COG2183 317 NLKDLLLAAPAKPKATLGLDPGFRTGCKVAVVDDTGKLLDTATIYPHPPVNQSDKAEATLKDLIRKYKVELIAIG----- 391 (780)
T ss_pred HHHHHHHCCCCCCCCEEECCCCCCCCCEEEEECCCCCEECEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEE-----
T ss_conf 889987248888860120487630130799983788555316997279754157899999999998371079873-----
Q ss_pred CCCCCHHHHHHHHHHHHHHHHC--CCCCEEEECCCCCHH
Q ss_conf 8888844799999999998404--899779983671679
Q gi|254780976|r 80 NGSEGPRVHSTRAFVHNMIDRK--VYVPFVFWDERLTTV 116 (160)
Q Consensus 80 ~g~~~~~~~~v~~f~~~L~~~~--~~l~v~~~DEr~TS~ 116 (160)
+|+.+. .+..|+..++... .++....+.|..-|.
T Consensus 392 ngTaSr---ete~fv~~vl~~~~~~~~~~viVsEagAsv 427 (780)
T COG2183 392 NGTASR---ETEKFVADVLKELPKEKVLKVIVSEAGASV 427 (780)
T ss_pred CCCCCH---HHHHHHHHHHHHCCCCCCCEEEECCCCCCH
T ss_conf 687655---479999999984667898679972653102
No 10
>pfam04312 DUF460 Protein of unknown function (DUF460). Archaeal protein of unknown function.
Probab=97.33 E-value=0.0042 Score=39.40 Aligned_cols=109 Identities=16% Similarity=0.175 Sum_probs=73.2
Q ss_pred CCCCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHH
Q ss_conf 99839999868876899963076511012101104774156898999986516775899721357888884479999999
Q gi|254780976|r 15 PNQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFV 94 (160)
Q Consensus 15 ~~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~ 94 (160)
...-|+|||.|+ .+|+|+-|-.+.. +.+...++ -...++.+.|.+++-- +||.- | .++.-..|+.++
T Consensus 30 r~~lIVGIDPG~-ttgiAildLdG~~---l~~~S~R~--~~~~evi~~I~~~G~P-viVAt----D--V~p~P~~V~Kia 96 (138)
T pfam04312 30 RRYLIVGIDPGI-TTGIAILDLDGEV---LDLYSSRN--MDRGEVIELIYELGKP-VIVAT----D--VSPPPETVKKIA 96 (138)
T ss_pred CCCEEEEECCCC-EEEEEEEECCCCE---EEEEECCC--CCHHHHHHHHHHCCCE-EEEEE----C--CCCCCHHHHHHH
T ss_conf 676799978991-4378898258849---98770367--9989999999974976-99982----6--989828999999
Q ss_pred HHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHH
Q ss_conf 999840489977998367167999999998698644546437899899
Q gi|254780976|r 95 HNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSIAAA 142 (160)
Q Consensus 95 ~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~AA~ 142 (160)
.. ++-..+.=++.++..|=.+..++.+.+.+. ...-|++||+
T Consensus 97 ~~-----f~A~ly~P~~dl~veEK~~l~~~~~~~~~n-~HeRDALaAA 138 (138)
T pfam04312 97 RS-----FGAVLYTPERDLSVEEKRELARKYSWKYDD-DHERDALAAA 138 (138)
T ss_pred HH-----HCCEEECCCCCCCHHHHHHHHHHHCCCCCC-CHHHHHHHCC
T ss_conf 97-----198115788767689999999871776799-4366564239
No 11
>pfam02075 RuvC Crossover junction endodeoxyribonuclease RuvC.
Probab=97.16 E-value=0.0047 Score=39.11 Aligned_cols=104 Identities=11% Similarity=0.094 Sum_probs=64.4
Q ss_pred EEEEECCCCEEEEEEEECCCCE--ECCCEEEECCC------CCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCH-HHHH
Q ss_conf 9999868876899963076511--01210110477------4156898999986516775899721357888884-4799
Q gi|254780976|r 19 IASIDLGTKRIGLAISDPGRRF--AHPRPFLVRKK------VTQTALELLSFITTENIAAFIIGLPLNMNGSEGP-RVHS 89 (160)
Q Consensus 19 iLgiD~G~kriGiAisd~~~~~--a~Pl~~i~~~~------~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~-~~~~ 89 (160)
|||||+|..++|.|+-+....- ..-.+++...+ .......|.+++++|+|+.++|=-|.-..+..+. ...+
T Consensus 1 ILGIDPGl~~tG~avi~~~~~~~~~v~~G~I~t~~~~~~~RL~~I~~~l~~ii~~~~P~~~aiE~~F~~~N~~ta~~lgq 80 (148)
T pfam02075 1 ILGIDPGSRVTGYGVIEHRGRQLSYLTSGVIRTQSDPLPERLLAIFDAVEEVLDTHQPNAVAIEQVFFAKNVDSALKLAQ 80 (148)
T ss_pred CEEECCCCCCEEEEEEEEECCEEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHCCHHHHHHHHH
T ss_conf 98878886860799999759847999951597899887999999999999999964998674758777128699999999
Q ss_pred HHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCC
Q ss_conf 99999999840489977998367167999999998698
Q gi|254780976|r 90 TRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNV 127 (160)
Q Consensus 90 v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~ 127 (160)
++--+-. .-...++||+.+ |..+.++.+-..|.
T Consensus 81 aRGvill-~~~~~~i~v~ey----~P~~VKk~itG~G~ 113 (148)
T pfam02075 81 ARGVAIL-AAAERGISVFEY----TPSQVKKAVTGNGS 113 (148)
T ss_pred HHHHHHH-HHHHHCCCEEEE----CHHHHHHHHHCCCC
T ss_conf 9999999-999807980578----88998888737985
No 12
>COG1548 Predicted transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]
Probab=95.78 E-value=0.035 Score=33.83 Aligned_cols=105 Identities=14% Similarity=0.159 Sum_probs=63.9
Q ss_pred CCCEEEEECCCCEEEEEEEECCCCEECCCEEEEC-CCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHH
Q ss_conf 9839999868876899963076511012101104-774156898999986516775899721357888884479999999
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVR-KKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFV 94 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~-~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~ 94 (160)
+.++||||.|-.++-+|.+|....- .-+-.++. ++.+.+-..|.+++..++++.+-|=+--.+..-.....+-|.+.+
T Consensus 2 ~~kilGiDIGGAntk~a~~DG~~~~-~d~~YlPMWk~k~rL~~~Lkei~~k~~~~~vgvvMTaELaD~f~tk~eGVe~Ii 80 (330)
T COG1548 2 KMKILGIDIGGANTKIASSDGDNYK-IDHIYLPMWKKKDRLEETLKEIVHKDNVDYVGVVMTAELADAFKTKAEGVEDII 80 (330)
T ss_pred CCEEEEEECCCCCCHHHHCCCCEEE-EEEEEECCCCCHHHHHHHHHHHHCCCCCCEEEEEEEHHHHHHHHHHHHHHHHHH
T ss_conf 7337775216754301320587014-667872000115578999999850378550688863878877551776799999
Q ss_pred HHHHHHCCCCCEEEECCCCC--HHHHHHHH
Q ss_conf 99984048997799836716--79999999
Q gi|254780976|r 95 HNMIDRKVYVPFVFWDERLT--TVSAQQIL 122 (160)
Q Consensus 95 ~~L~~~~~~l~v~~~DEr~T--S~~A~~~l 122 (160)
+.. ++.++.||+++|=... |.+|.+..
T Consensus 81 ~~v-~~Af~~pv~~v~~~G~~~ssEa~~~~ 109 (330)
T COG1548 81 DTV-EKAFNCPVYVVDVNGNFLSSEALKNP 109 (330)
T ss_pred HHH-HHHCCCCEEEEECCCCCCCHHHHCCH
T ss_conf 999-87428956999156867676675388
No 13
>COG0817 RuvC Holliday junction resolvasome, endonuclease subunit [DNA replication, recombination, and repair]
Probab=95.16 E-value=0.084 Score=31.55 Aligned_cols=103 Identities=13% Similarity=0.105 Sum_probs=62.9
Q ss_pred EEEECCCCEEEEEEEECCCCEECCC--EEEECC-C------CCHHHHHHHHHHHHHCCCEEEEEECCCC-CCCCCHHHHH
Q ss_conf 9998688768999630765110121--011047-7------4156898999986516775899721357-8888844799
Q gi|254780976|r 20 ASIDLGTKRIGLAISDPGRRFAHPR--PFLVRK-K------VTQTALELLSFITTENIAAFIIGLPLNM-NGSEGPRVHS 89 (160)
Q Consensus 20 LgiD~G~kriGiAisd~~~~~a~Pl--~~i~~~-~------~~~~~~~l~~ii~e~~i~~iVvGlP~~~-~g~~~~~~~~ 89 (160)
||||+|.+++|.++=+....-..++ ++|... + .......+.+++.+|.|+.+.|--+.-- |-+..-....
T Consensus 1 lGIDPGl~~~G~gvI~~~~~~l~~v~~G~I~t~~~~~l~~RL~~l~~~l~~vl~~~~P~~~AIE~~F~~kN~~s~lklgQ 80 (160)
T COG0817 1 LGIDPGLRRTGYGVIEVEGRQLSYLASGVIRTSSDAPLAERLKQLYDGLSEVLDEYQPDEVAIEQVFVNKNADSALKLGQ 80 (160)
T ss_pred CCCCCCCCCCCEEEEECCCCEEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEHHHHHHHCCHHHHHHHHH
T ss_conf 99587754101589982698489986438965887627999999999999999974988465417777128089999999
Q ss_pred HHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCC
Q ss_conf 99999999840489977998367167999999998698
Q gi|254780976|r 90 TRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNV 127 (160)
Q Consensus 90 v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~ 127 (160)
.+-.+- |.-..-++||+. ||-.++++.....|.
T Consensus 81 ARGv~~-la~~~~~l~v~e----Y~p~~VKkavvG~G~ 113 (160)
T COG0817 81 ARGVAL-LAAARRGLPVFE----YTPNQVKKAVVGNGK 113 (160)
T ss_pred HHHHHH-HHHHHCCCCHHH----CCHHHHHHHHHCCCC
T ss_conf 988999-999984997623----268887888510772
No 14
>COG1940 NagC Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]
Probab=95.10 E-value=0.24 Score=28.83 Aligned_cols=108 Identities=13% Similarity=0.180 Sum_probs=62.5
Q ss_pred CCCCCEEEEECCCCEEEEEEEECCCCE-ECCCEEEECCCCC-HHHHHHHH----HHHHH----CCCEEEEEECCCCCCCC
Q ss_conf 899839999868876899963076511-0121011047741-56898999----98651----67758997213578888
Q gi|254780976|r 14 KPNQPIASIDLGTKRIGLAISDPGRRF-AHPRPFLVRKKVT-QTALELLS----FITTE----NIAAFIIGLPLNMNGSE 83 (160)
Q Consensus 14 ~~~~riLgiD~G~kriGiAisd~~~~~-a~Pl~~i~~~~~~-~~~~~l~~----ii~e~----~i~~iVvGlP~~~~g~~ 83 (160)
+....++|||.|..++=+|+.|..+.+ .......+..... .....+.+ .+..+ .+.+|.++.|...+...
T Consensus 3 ~~~~~~lgidIggt~i~~~l~d~~g~~l~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~iGIgi~~pg~~~~~~ 82 (314)
T COG1940 3 PEAMTVLGIDIGGTKIKVALVDLDGEILLRERIPTPTPDPEEAILEAILALVAELLKQAQGRVAIIGIGIPGPGDVDNGT 82 (314)
T ss_pred CCCEEEEEEEECCCEEEEEEEECCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEEECCCCC
T ss_conf 87629999998897899999937997898899877888845679999999999998752767756999997646126873
Q ss_pred ----CHHHHHH--HHHHHHHHHHCCCCCEEEECCCCCHHHHHHHH
Q ss_conf ----8447999--99999998404899779983671679999999
Q gi|254780976|r 84 ----GPRVHST--RAFVHNMIDRKVYVPFVFWDERLTTVSAQQIL 122 (160)
Q Consensus 84 ----~~~~~~v--~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l 122 (160)
.+..... .+|.+.| ++.+++||...+.--...-|+..+
T Consensus 83 ~~~~~~~~~~~~~~~l~~~L-~~~~~~Pv~veNDan~aAlaE~~~ 126 (314)
T COG1940 83 VIVPAPNLGWWNGVDLAEEL-EARLGLPVFVENDANAAALAEAWF 126 (314)
T ss_pred EEECCCCCCCCCCCCHHHHH-HHHHCCCEEEECCHHHHHHHHHHH
T ss_conf 67425766544467799999-998899879925199999999872
No 15
>PRK09698 D-allose kinase; Provisional
Probab=94.05 E-value=0.53 Score=26.70 Aligned_cols=95 Identities=14% Similarity=0.291 Sum_probs=55.7
Q ss_pred CCCEEEEECCCCEEEEEEEECCCCEECCCEEE-ECCCCCHHHHHHHHHHHH------HCCCEEEEEECCCCCCCCC----
Q ss_conf 98399998688768999630765110121011-047741568989999865------1677589972135788888----
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGRRFAHPRPFL-VRKKVTQTALELLSFITT------ENIAAFIIGLPLNMNGSEG---- 84 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i-~~~~~~~~~~~l~~ii~e------~~i~~iVvGlP~~~~g~~~---- 84 (160)
+.=|+|||+|-.+|=.|+.|..+.+-.-...- ......+.+..+..++.+ ..+.++-||.|=..|-..+
T Consensus 3 ~~~viGIDiGGT~i~~~l~d~~g~i~~~~~~~t~~~~~~~~~~~i~~~i~e~~~~~~~~i~gIgig~pG~vd~~~g~v~~ 82 (302)
T PRK09698 3 HNVVAGIDMGATHIRFCLVDAEGEILHCEKRRTAEVIAPDLVSGISEMIDEQLRRFNARCHGLVMGFPALVSKDKRTIIS 82 (302)
T ss_pred CCEEEEEEECCCEEEEEEECCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCEEEE
T ss_conf 76899999873249999996999799999966987676799999999999997651885246999958960599998983
Q ss_pred -HH---HHH-HHHHHHHHHHHCCCCCEEEECC
Q ss_conf -44---799-9999999984048997799836
Q gi|254780976|r 85 -PR---VHS-TRAFVHNMIDRKVYVPFVFWDE 111 (160)
Q Consensus 85 -~~---~~~-v~~f~~~L~~~~~~l~v~~~DE 111 (160)
+. ... -..+.+.| ++.+++||++-+.
T Consensus 83 ~~nl~~~~~~~~~l~~~l-~~~~~~pv~veND 113 (302)
T PRK09698 83 TPNLPLTAADLYDLADKL-ENTLNCPVFFSRD 113 (302)
T ss_pred CCCCCCCCCCCCCHHHHH-HHHHCCCEEEECC
T ss_conf 387666667677469999-9987999899653
No 16
>TIGR00228 ruvC crossover junction endodeoxyribonuclease RuvC; InterPro: IPR002176 The Escherichia coli ruvC gene is involved in DNA repair and in the late step of RecE and RecF pathway recombination . RuvC protein (3.1.22.4 from EC) cleaves cruciform junctions, which are formed by the extrusion of inverted repeat sequences from a super-coiled plasmid and which are structurally analogous to Holliday junctions, by introducing nicks into strands with the same polarity. The nicks leave a 5'terminal phosphate and a 3'terminal hydroxyl group which are ligated by E. coli or Bacteriophage T4 DNA ligases. Analysis of the cleavage sites suggests that DNA topology rather than a particular sequence determines the cleavage site. RuvC protein also cleaves Holliday junctions that are formed between gapped circular and linear duplex DNA by the function of RecA protein. The active form of RuvC protein is a dimer. This is mechanistically suited for an endonuclease involved in swapping DNA strands at the crossover junctions. It is inferred that RuvC protein is an endonuclease that resolves Holliday structures in vivo . RucC is a small protein of about 20 kD. It requires and binds a magnesium ion. The structure of E. coli ruvC is a 3-layer alpha-beta sandwich containing a 5-stranded beta-sheet sandwiched between 5 alpha-helices .; GO: 0004520 endodeoxyribonuclease activity, 0006281 DNA repair, 0006310 DNA recombination.
Probab=90.74 E-value=0.051 Score=32.86 Aligned_cols=109 Identities=11% Similarity=0.129 Sum_probs=56.1
Q ss_pred EEEEECCCCEEEEEE----EECCCCEECCCEEEECCC----CCHHHHHHHHHHHHHCCCEEEEEECCCCCC-CCCHHHHH
Q ss_conf 999986887689996----307651101210110477----415689899998651677589972135788-88844799
Q gi|254780976|r 19 IASIDLGTKRIGLAI----SDPGRRFAHPRPFLVRKK----VTQTALELLSFITTENIAAFIIGLPLNMNG-SEGPRVHS 89 (160)
Q Consensus 19 iLgiD~G~kriGiAi----sd~~~~~a~Pl~~i~~~~----~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g-~~~~~~~~ 89 (160)
|||||+|++.+|-|| |..+.-+....-.+.-+. .......+.+++..|+|+.+-|--=.--.+ ..--....
T Consensus 1 ILGIDPGSR~~GY~vI~~~~~~L~~l~~G~I~~~~~~L~~~l~~~~~~~~~~~~~f~~n~~AIE~vF~a~N~~S~lKL~Q 80 (158)
T TIGR00228 1 ILGIDPGSRVTGYGVIRQAGKKLSYLGSGFIRIKVTDLPERLKDIYAGVTEIITQFQPNEFAIEQVFMAKNADSALKLGQ 80 (158)
T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCEEEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHHHHHHHH
T ss_conf 97768987511258988753443334245003303468899999999999998430666001212432027257888764
Q ss_pred HHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHH
Q ss_conf 999999998404899779983671679999999986986445
Q gi|254780976|r 90 TRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKK 131 (160)
Q Consensus 90 v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~ 131 (160)
.|-.+-.+ .-.-.+|++ --||....++.+...|.-.|.
T Consensus 81 ~RG~A~~~-~v~~~l~~F---sEY~~~~VK~~~~G~G~A~K~ 118 (158)
T TIGR00228 81 ARGVAIVA-AVNERLPVF---SEYAARQVKKTVVGIGSAAKE 118 (158)
T ss_pred HHHHHHHH-HHHHHCCCC---CHHHHHHHHHHHCCCCCCCHH
T ss_conf 44699999-998626873---105577665653057730038
No 17
>PRK13310 N-acetyl-D-glucosamine kinase; Provisional
Probab=89.45 E-value=1.6 Score=23.88 Aligned_cols=91 Identities=12% Similarity=0.155 Sum_probs=50.7
Q ss_pred EEEEECCCCEEEEEEEECCCCEECCCEEE-ECCCCCHHHHHHHHHHHHHC-----CCEEEEEECCCCCCCCCH-----HH
Q ss_conf 99998688768999630765110121011-04774156898999986516-----775899721357888884-----47
Q gi|254780976|r 19 IASIDLGTKRIGLAISDPGRRFAHPRPFL-VRKKVTQTALELLSFITTEN-----IAAFIIGLPLNMNGSEGP-----RV 87 (160)
Q Consensus 19 iLgiD~G~kriGiAisd~~~~~a~Pl~~i-~~~~~~~~~~~l~~ii~e~~-----i~~iVvGlP~~~~g~~~~-----~~ 87 (160)
++|||.|-.+|=+|+.|....+-....+- +..+....+..+.++++++. +..+-||.|=..|-..+- ..
T Consensus 2 ~lGiDIGGT~i~~~l~d~~g~i~~~~~~~Tp~~~~~~~~~~i~~~i~~~~~~~~~~~~iGig~pG~vd~~~g~v~~~n~~ 81 (302)
T PRK13310 2 YYGFDIGGTKIELGVFDEKLELQWEKRVPTPRDSYDAFLDAVCELVAEADQRFGCKGSVGIGIPGMPETEDGTLYAANVP 81 (302)
T ss_pred EEEEEECCHHEEEEEEECCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCEEEECCCC
T ss_conf 99999170049999997899799999987999999999999999999987655887869999857652999889936986
Q ss_pred H-HHHHHHHHHHHHCCCCCEEEEC
Q ss_conf 9-9999999998404899779983
Q gi|254780976|r 88 H-STRAFVHNMIDRKVYVPFVFWD 110 (160)
Q Consensus 88 ~-~v~~f~~~L~~~~~~l~v~~~D 110 (160)
. .-..+.+.| ++.+++||++-+
T Consensus 82 ~~~~~~l~~~L-~~~~g~pV~ieN 104 (302)
T PRK13310 82 AASGKPLRADL-SARLGRDVRLDN 104 (302)
T ss_pred CCCCCCHHHHH-HHHHCCCEEEEC
T ss_conf 41489759999-999799889951
No 18
>PRK05082 N-acetylmannosamine kinase; Provisional
Probab=89.34 E-value=1.9 Score=23.37 Aligned_cols=97 Identities=10% Similarity=0.141 Sum_probs=54.2
Q ss_pred CCEEEEECCCCEEEEEEEECCCCEECCCEEEE-C-CCCCHHHHHHHHHHHHH--CCCEEEEEECCCCCCCCC-----HHH
Q ss_conf 83999986887689996307651101210110-4-77415689899998651--677589972135788888-----447
Q gi|254780976|r 17 QPIASIDLGTKRIGLAISDPGRRFAHPRPFLV-R-KKVTQTALELLSFITTE--NIAAFIIGLPLNMNGSEG-----PRV 87 (160)
Q Consensus 17 ~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~-~-~~~~~~~~~l~~ii~e~--~i~~iVvGlP~~~~g~~~-----~~~ 87 (160)
+.+||||+|-.+|=+|+.|..+.+..-...-. . ...+.....+.++++++ +++.+-|+.|=-.|.... +..
T Consensus 1 M~~lgiDiGGT~ik~~lvd~~G~i~~~~~~~t~~~~~~~~~~~~i~~~~~~~~~~~~~igi~~pG~v~~~~~~~~~~~nl 80 (291)
T PRK05082 1 MTTLAIDIGGTKIAAALVGEDGQISQRRQIPTPASQTPEALRDALEALVSPLQAQADRVAVASTGIINDGVLTALNPHNL 80 (291)
T ss_pred CCEEEEEECCCEEEEEEECCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCEEEECCEEEECCCEEEEECCCCC
T ss_conf 96999995734099999959995999999979998899999999999999988556968874222054986699668777
Q ss_pred HH--HHHHHHHHHHHCCCCCEEEECCCCC
Q ss_conf 99--9999999984048997799836716
Q gi|254780976|r 88 HS--TRAFVHNMIDRKVYVPFVFWDERLT 114 (160)
Q Consensus 88 ~~--v~~f~~~L~~~~~~l~v~~~DEr~T 114 (160)
.. -..+.+.| ++.+++||.+-+.--.
T Consensus 81 ~~~~~~pl~~~l-~~~~~~pv~v~NDan~ 108 (291)
T PRK05082 81 GGLLHFPLVQTL-EQITDLPTIALNDAQA 108 (291)
T ss_pred CCCCCCCHHHHH-HHHHCCCEEEECCHHH
T ss_conf 875697569999-9977989899750678
No 19
>PRK00290 dnaK molecular chaperone DnaK; Provisional
Probab=88.53 E-value=0.55 Score=26.59 Aligned_cols=24 Identities=33% Similarity=0.436 Sum_probs=20.5
Q ss_pred CCCEEEEECCCCEEEEEEEECCCC
Q ss_conf 983999986887689996307651
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGRR 39 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~~ 39 (160)
|++|+|||+|+.+.=+|+.+....
T Consensus 1 M~~VIGIDLGTTnS~VAv~~~g~~ 24 (631)
T PRK00290 1 MGKIIGIDLGTTNSCVAVMEGGEP 24 (631)
T ss_pred CCCEEEEECCHHCEEEEEEECCEE
T ss_conf 998899985802089999989988
No 20
>PRK09557 fructokinase; Reviewed
Probab=88.01 E-value=2.3 Score=22.82 Aligned_cols=100 Identities=12% Similarity=0.154 Sum_probs=54.5
Q ss_pred EEEEECCCCEEEEEEEECCCCEECCCEEE-ECCCCCHHHHHHHHHHHHH-----CCCEEEEEECCCCCCCCC----HHHH
Q ss_conf 99998688768999630765110121011-0477415689899998651-----677589972135788888----4479
Q gi|254780976|r 19 IASIDLGTKRIGLAISDPGRRFAHPRPFL-VRKKVTQTALELLSFITTE-----NIAAFIIGLPLNMNGSEG----PRVH 88 (160)
Q Consensus 19 iLgiD~G~kriGiAisd~~~~~a~Pl~~i-~~~~~~~~~~~l~~ii~e~-----~i~~iVvGlP~~~~g~~~----~~~~ 88 (160)
.+|||.|-.+|=+|+.|..+.+-.-...- ++......+..+..++++. .+..+-||.|=-.|...+ ....
T Consensus 2 ~iGiDIGGT~i~~~lvd~~G~il~~~~~~t~~~~~~~~~~~i~~li~~~~~~~~~~~~IGIg~PG~vd~~~G~v~~~~~~ 81 (301)
T PRK09557 2 RIGIDLGGTKIEVIALDDAGEELFRKRLPTPRDDYQQTIEAIATLVDMAEQATGQRGTVGVGIPGVISPYTGLVKNANST 81 (301)
T ss_pred EEEEEECCCEEEEEEEECCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCEEECCCCC
T ss_conf 89999873209999997999799999987999887999999999999988763887769999207771999989668864
Q ss_pred H--HHHHHHHHHHHCCCCCEEEECCCCCHHHHH
Q ss_conf 9--999999998404899779983671679999
Q gi|254780976|r 89 S--TRAFVHNMIDRKVYVPFVFWDERLTTVSAQ 119 (160)
Q Consensus 89 ~--v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~ 119 (160)
. -..+.+.| ++.+++||...+.----.-|+
T Consensus 82 ~~~~~~l~~~l-~~~~~~pV~v~NDanaaalgE 113 (301)
T PRK09557 82 WLNGQPLDKDL-SARLNREVRLANDANCLAVSE 113 (301)
T ss_pred CCCCCCHHHHH-HHHHCCCEEEECCHHHHHHHH
T ss_conf 21685479999-998799889946198999999
No 21
>CHL00094 dnaK heat shock protein 70
Probab=87.70 E-value=0.67 Score=26.09 Aligned_cols=23 Identities=30% Similarity=0.482 Sum_probs=20.1
Q ss_pred CCCEEEEECCCCEEEEEEEECCC
Q ss_conf 98399998688768999630765
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGR 38 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~ 38 (160)
|++++|||+|+.+.=+|+.+...
T Consensus 1 M~~VIGIDlGTTnS~VAv~~~g~ 23 (622)
T CHL00094 1 MGKVVGIDLGTTNSVVAVMEGGK 23 (622)
T ss_pred CCCEEEEECCHHCEEEEEEECCE
T ss_conf 99889997580208999998998
No 22
>pfam00480 ROK ROK family.
Probab=86.32 E-value=2.9 Score=22.24 Aligned_cols=100 Identities=18% Similarity=0.282 Sum_probs=56.7
Q ss_pred EEECCCCEEEEEEEECCCCEECCCEE-EECCCCCHHHHHHHHHHHHH-----CCCEEEEEECCCCCCCC-----CHHHHH
Q ss_conf 99868876899963076511012101-10477415689899998651-----67758997213578888-----844799
Q gi|254780976|r 21 SIDLGTKRIGLAISDPGRRFAHPRPF-LVRKKVTQTALELLSFITTE-----NIAAFIIGLPLNMNGSE-----GPRVHS 89 (160)
Q Consensus 21 giD~G~kriGiAisd~~~~~a~Pl~~-i~~~~~~~~~~~l~~ii~e~-----~i~~iVvGlP~~~~g~~-----~~~~~~ 89 (160)
|||.|-.+|=+|+.|..+.+-.-... .+.......+..+.+.+++. .+.++-||.|=..|... ++....
T Consensus 1 GiDiGgt~i~~~l~d~~G~i~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~~~giGi~~pG~vd~~~g~v~~~~~~~w 80 (181)
T pfam00480 1 GIDLGGTKIELALFDEDGEILDRERIPTPETTPEETLDALVDFVEEAITQYGASLGIGIGSPGIIDDDQGTITLTPNIGW 80 (181)
T ss_pred CEEECCCEEEEEEECCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCEECCCCCEEEECCCCCC
T ss_conf 99978858999999499979999999899999999999999999998875198326999985348189487961488884
Q ss_pred -HHHHHHHHHHHCCCCCEEEECCCCCHHHHHHH
Q ss_conf -99999999840489977998367167999999
Q gi|254780976|r 90 -TRAFVHNMIDRKVYVPFVFWDERLTTVSAQQI 121 (160)
Q Consensus 90 -v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~ 121 (160)
-..+.+.| ++.+++||.+.+.--.-.-|+..
T Consensus 81 ~~~~l~~~l-~~~~~~Pv~v~NDa~aaalaE~~ 112 (181)
T pfam00480 81 ADFDLVKKL-ESRFNVPVYIENDANAAALGEKV 112 (181)
T ss_pred CCCCHHHHH-HHHHCCCEEECCHHHHHHHHHHH
T ss_conf 478789999-99869980432226499999998
No 23
>PRK13410 molecular chaperone DnaK; Provisional
Probab=85.85 E-value=0.99 Score=25.05 Aligned_cols=25 Identities=28% Similarity=0.433 Sum_probs=21.0
Q ss_pred CCCEEEEECCCCEEEEEEEECCCCE
Q ss_conf 9839999868876899963076511
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGRRF 40 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~~~ 40 (160)
|++|+|||+|+.+.=||+.+.....
T Consensus 1 Mg~VIGIDLGTTNS~VAv~~~G~~~ 25 (719)
T PRK13410 1 MGKVVGIDLGTTNSVVAVMEGGKPV 25 (719)
T ss_pred CCCEEEEECCCCCEEEEEEECCEEE
T ss_conf 9986999858102899999899889
No 24
>PRK13411 molecular chaperone DnaK; Provisional
Probab=85.23 E-value=1 Score=25.01 Aligned_cols=23 Identities=30% Similarity=0.478 Sum_probs=20.0
Q ss_pred CCCEEEEECCCCEEEEEEEECCC
Q ss_conf 98399998688768999630765
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGR 38 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~ 38 (160)
|++|+|||+|+.+.=+|+-....
T Consensus 1 M~~VIGIDLGTTnS~VAv~~~G~ 23 (655)
T PRK13411 1 MGKVIGIDLGTTNSCVAVLEGGK 23 (655)
T ss_pred CCCEEEEECCHHHEEEEEEECCE
T ss_conf 99869998680338999998998
No 25
>TIGR01821 5aminolev_synth 5-aminolevulinic acid synthase; InterPro: IPR010961 Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway . The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin . The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA. The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III. Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) . This entry represents 5-aminoaevulinic acid (ALA) synthase (2.3.1.37 from EC), which catalyses the first stage of tetrapyrrole biosynthesis by the C4 pathway, namely the condensation of succinyl CoA and glycine. ALA synthase is a pyridoxal-phosphate-dependent enzyme. During catalysis, glycine initially binds to the enzyme cofactor, and after condensation with succinyl CoA, CoA, carbon dioxide and 5-aminolevulinic acid are produced .; GO: 0003870 5-aminolevulinate synthase activity, 0030170 pyridoxal phosphate binding, 0033014 tetrapyrrole biosynthetic process.
Probab=84.62 E-value=2.2 Score=22.92 Aligned_cols=67 Identities=15% Similarity=0.206 Sum_probs=51.2
Q ss_pred ECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCC
Q ss_conf 01210110477415689899998651677589972135788888447999999999984048997799836716
Q gi|254780976|r 41 AHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLT 114 (160)
Q Consensus 41 a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~T 114 (160)
+..-+.|.+.|..+++++|++-..-..|..|++===++|||...|..+.|+ |.++ ++ -.+|+||=.+
T Consensus 152 S~~~k~iFrHND~~hLe~LL~~~d~~~PKiIAFESvYSMDGd~aPi~eiCD-----lA~k-YG-ALTy~DEVHA 218 (427)
T TIGR01821 152 SGAEKFIFRHNDVAHLEKLLQSVDPNAPKIIAFESVYSMDGDIAPIKEICD-----LADK-YG-ALTYLDEVHA 218 (427)
T ss_pred CCCCEEEEECCCHHHHHHHHHHCCCCCCCEEEEEEHCCCCCCCCCHHHHHH-----HHHH-HC-CCCEECCEEE
T ss_conf 789505630599789999986477998956886100016787266788966-----8887-18-8630011465
No 26
>COG1214 Inactive homolog of metal-dependent proteases, putative molecular chaperone [Posttranslational modification, protein turnover, chaperones]
Probab=84.45 E-value=1.4 Score=24.13 Aligned_cols=95 Identities=18% Similarity=0.217 Sum_probs=62.4
Q ss_pred CCEEEEECCCCEEEEEEEEC--CCCEECCCEEEECCCCCHHHHHHHHHHHHH-----CCCEEEEEECCCCCCCCCHHHHH
Q ss_conf 83999986887689996307--651101210110477415689899998651-----67758997213578888844799
Q gi|254780976|r 17 QPIASIDLGTKRIGLAISDP--GRRFAHPRPFLVRKKVTQTALELLSFITTE-----NIAAFIIGLPLNMNGSEGPRVHS 89 (160)
Q Consensus 17 ~riLgiD~G~kriGiAisd~--~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~-----~i~~iVvGlP~~~~g~~~~~~~~ 89 (160)
+++|+||--++.+++|+-+. ...++.-.....+++.+.....+.++..+- +++.|+||.= =|+.+- .+.
T Consensus 1 m~iLaiDTs~~~~s~ai~~~~~~~vl~~~~~~~~r~hse~l~~~i~~ll~~~~~~~~dld~Iav~~G---PGSFTG-lRI 76 (220)
T COG1214 1 MKILAIDTSTSALSVALYLADDGKVLAEHTEKLKRNHAERLMPMIDELLKEAGLSLQDLDAIAVAKG---PGSFTG-LRI 76 (220)
T ss_pred CCEEEEECCCHHHHHHHEECCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHCCCCHHHCCEEEEECC---CCCCHH-HHH
T ss_conf 9189998673453166142478958899997158328899999999999875999778488999308---974223-889
Q ss_pred HHHHHHHHHHHCCCCCEEEECCCCCHHHHHH
Q ss_conf 9999999984048997799836716799999
Q gi|254780976|r 90 TRAFVHNMIDRKVYVPFVFWDERLTTVSAQQ 120 (160)
Q Consensus 90 v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~ 120 (160)
-..+++-|. ...++|++=+ ||-++..
T Consensus 77 G~~~AkgLA-~~l~iplvgv----ssL~~~A 102 (220)
T COG1214 77 GVAFAKGLA-LALNIPLVGV----SSLEALA 102 (220)
T ss_pred HHHHHHHHH-HHCCCCEEEE----CHHHHHH
T ss_conf 999999998-7529868995----7699998
No 27
>PRK13311 N-acetyl-D-glucosamine kinase; Provisional
Probab=83.41 E-value=3.9 Score=21.43 Aligned_cols=59 Identities=19% Similarity=0.185 Sum_probs=36.1
Q ss_pred EEEEECCCCEEEEEEEECCCCEECCCEEE--ECCCCCHHHHHHHHHHHHHC-----CCEEEEEECCC
Q ss_conf 99998688768999630765110121011--04774156898999986516-----77589972135
Q gi|254780976|r 19 IASIDLGTKRIGLAISDPGRRFAHPRPFL--VRKKVTQTALELLSFITTEN-----IAAFIIGLPLN 78 (160)
Q Consensus 19 iLgiD~G~kriGiAisd~~~~~a~Pl~~i--~~~~~~~~~~~l~~ii~e~~-----i~~iVvGlP~~ 78 (160)
++|||.|-.+|=+++.|....+..- ..+ ++.+..+.+..+.+++++.. ...+-||.|=.
T Consensus 2 ylGiDIGGTki~~~l~d~~g~~~~~-~~~~T~~~~~~~~~~~i~~l~~~~~~~~~~~~~IGig~pG~ 67 (256)
T PRK13311 2 YYGFDMGGTKIELGVFDENLQRIWH-KRVPTPREDYPQLLQILRDLTEEADTYCGVQGSVGIGIPGL 67 (256)
T ss_pred EEEEECCCHHEEEEEEECCCCEEEE-EEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCC
T ss_conf 9999934101102899699989999-99749985799999999999999887469887599980777
No 28
>COG0418 PyrC Dihydroorotase [Nucleotide transport and metabolism]
Probab=80.97 E-value=1.4 Score=24.08 Aligned_cols=79 Identities=23% Similarity=0.325 Sum_probs=48.3
Q ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHH----HHHHH----HHHHHHCCCCCEEEECCCCCHHHHHHHHHHCC
Q ss_conf 68989999865167758997213578888844799----99999----99984048997799836716799999999869
Q gi|254780976|r 55 TALELLSFITTENIAAFIIGLPLNMNGSEGPRVHS----TRAFV----HNMIDRKVYVPFVFWDERLTTVSAQQILIDMN 126 (160)
Q Consensus 55 ~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~----v~~f~----~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g 126 (160)
.+..+..+.+..+ =+|+|+...|..+...-. -..|+ +.|.++++.+.|.+ |+.||+.|-+..++.+
T Consensus 116 ~~~~~~pvle~Mq----~~gmpLlvHGEvt~~~vDifdrE~~Fi~~vl~pl~~~fP~LKIV~--EHiTT~dav~~v~~~~ 189 (344)
T COG0418 116 DIEKIYPVLEAMQ----KIGMPLLVHGEVTDAEVDIFDREAAFIESVLEPLRQRFPKLKIVL--EHITTKDAVEYVKDAN 189 (344)
T ss_pred CHHHHHHHHHHHH----HCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEE--EEECCHHHHHHHHHCC
T ss_conf 3788789999999----719868970465775456113578789999999986587636998--8715088999999648
Q ss_pred CCHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 86445464378998999999999
Q gi|254780976|r 127 VSRKKRIQKVDSIAAALILQEVL 149 (160)
Q Consensus 127 ~~~k~~k~~iD~~AA~iILq~~L 149 (160)
.. +||+|-.+.-+
T Consensus 190 ~n----------laATIT~hHL~ 202 (344)
T COG0418 190 NN----------LAATITPHHLL 202 (344)
T ss_pred CC----------EEEEEEHHHEE
T ss_conf 65----------04673145602
No 29
>COG0443 DnaK Molecular chaperone [Posttranslational modification, protein turnover, chaperones]
Probab=80.65 E-value=0.59 Score=26.42 Aligned_cols=23 Identities=30% Similarity=0.380 Sum_probs=20.1
Q ss_pred CCCEEEEECCCCEEEEEEEECCC
Q ss_conf 98399998688768999630765
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGR 38 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~ 38 (160)
.+.++|||+|+.+.=+|+.++..
T Consensus 4 ~~~~iGIDlGTTNS~vA~~~~~~ 26 (579)
T COG0443 4 AKKAIGIDLGTTNSVVAVMRGGG 26 (579)
T ss_pred CCEEEEEECCCCCEEEEEECCCC
T ss_conf 77178996177628999980798
No 30
>KOG0237 consensus
Probab=79.56 E-value=5.4 Score=20.60 Aligned_cols=73 Identities=15% Similarity=0.286 Sum_probs=51.5
Q ss_pred CHHHHHHHHHHHHHCCCEEEEEECCCCC-----------------CCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCH
Q ss_conf 1568989999865167758997213578-----------------88884479999999999840489977998367167
Q gi|254780976|r 53 TQTALELLSFITTENIAAFIIGLPLNMN-----------------GSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTT 115 (160)
Q Consensus 53 ~~~~~~l~~ii~e~~i~~iVvGlP~~~~-----------------g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS 115 (160)
..++..+.+..++|+|..+|+|=-..+- .....|.+..+.|++.+..+ .++|-.-|+----.
T Consensus 54 ~~d~~ala~f~~e~~I~lVvvGPE~PL~~Gl~~~l~~~gi~~FGPs~~aAqlE~sK~fsK~fm~r-~~IPTA~y~~ft~~ 132 (788)
T KOG0237 54 VADFEALASFCKEHNINLVVVGPELPLVAGLADVLRSAGIPCFGPSKQAAQLEASKNFSKDFMHR-HNIPTAKYKTFTDP 132 (788)
T ss_pred HHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEECCHHHHHHHHHHHHHHHHHHHH-CCCCCCEEEEECCH
T ss_conf 65599999999874621899787326542356666405851017437777766417889988986-39983202640787
Q ss_pred HHHHHHHHHCC
Q ss_conf 99999999869
Q gi|254780976|r 116 VSAQQILIDMN 126 (160)
Q Consensus 116 ~~A~~~l~~~g 126 (160)
.+|+..+....
T Consensus 133 e~a~sfi~~~~ 143 (788)
T KOG0237 133 EEAKSFIQSAT 143 (788)
T ss_pred HHHHHHHHHCC
T ss_conf 99899998489
No 31
>pfam07318 DUF1464 Protein of unknown function (DUF1464). This family consists of several hypothetical archaeal proteins of around 350 residues in length. The function of this family is unknown.
Probab=79.45 E-value=2.3 Score=22.80 Aligned_cols=96 Identities=25% Similarity=0.255 Sum_probs=57.7
Q ss_pred EEECCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEE----EECCCCCCCCCHHH---------
Q ss_conf 99868876899963076511012101104774156898999986516775899----72135788888447---------
Q gi|254780976|r 21 SIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFII----GLPLNMNGSEGPRV--------- 87 (160)
Q Consensus 21 giD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVv----GlP~~~~g~~~~~~--------- 87 (160)
|||.|++-..++.-|....+ . -..++.....+.-..+.+++++++|+.++. |+|+...-..++..
T Consensus 1 GIDPGT~S~dv~g~~dG~v~-~-~~sipt~~v~~~p~~iv~~i~e~~~d~IagPSGYGlPlk~~~eltd~ei~L~Tl~~~ 78 (336)
T pfam07318 1 GIDPGTGSYDVAGVDDGNVI-L-DKSIPTDEVTKDPRSLVRLIEEARPDAIAGPSGYGLPLKRARELTEREIFLATLADP 78 (336)
T ss_pred CCCCCCCCEEEEEEECCEEE-E-EEECCHHHHHHCHHHHHHHHHHCCCCEEECCCCCCCCCCCHHHCCHHHHHHEEECCC
T ss_conf 97989861679997288599-9-875278987558699999999649988964776786302456578675756241454
Q ss_pred -----HHHHHHHHHHHHHCCCCCEEEEC--CCCCHHHHHH
Q ss_conf -----99999999998404899779983--6716799999
Q gi|254780976|r 88 -----HSTRAFVHNMIDRKVYVPFVFWD--ERLTTVSAQQ 120 (160)
Q Consensus 88 -----~~v~~f~~~L~~~~~~l~v~~~D--Er~TS~~A~~ 120 (160)
.-.++|...+.++ ++|+++.. =++.|+-+.+
T Consensus 79 ~~~g~~gL~~~~~~~~~~--~ln~~fiPgVIhLpTVP~~R 116 (336)
T pfam07318 79 SEEGLGGLRNLMRLMASK--DLNVFFIPGVIHLPTVPAWR 116 (336)
T ss_pred CCCCCHHHHHHHHHHHHC--CCCEEEECCEEECCCCCHHH
T ss_conf 443303499999999856--89867757557768880454
No 32
>pfam04848 Pox_A22 Poxvirus A22 protein.
Probab=77.96 E-value=6 Score=20.30 Aligned_cols=85 Identities=13% Similarity=0.112 Sum_probs=46.2
Q ss_pred CEEEEECCCCEEEEEEEECCCCEECCCEEE--ECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHH
Q ss_conf 399998688768999630765110121011--047741568989999865167758997213578888844799999999
Q gi|254780976|r 18 PIASIDLGTKRIGLAISDPGRRFAHPRPFL--VRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVH 95 (160)
Q Consensus 18 riLgiD~G~kriGiAisd~~~~~a~Pl~~i--~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~ 95 (160)
.|++||+|.|+-+.++-+.....-.-+.+- +-. .......+.. +-+++++.+++-- ++..++.++.+ .|..
T Consensus 2 iIlSiDiGiKNlA~c~~e~~~n~I~~~~v~~~d~~-~d~~~~v~~d-~~~~~~d~VLiEk----Qp~r~~~~k~i-~fi~ 74 (143)
T pfam04848 2 IICAIDIGTKNPARTIIEIKDNKIKLIDISKLDWS-SDWEYRVARD-LYNYNYDVVLLER----QGRRSPNIKFI-YFIK 74 (143)
T ss_pred EEEEEECCCCCCEEEEEECCCCEEEEEEEECCCCC-CCHHHHHHHH-HHCCCCCEEEEEE----CCCCCCCEEEH-HHHH
T ss_conf 79998427777015899737984999986048988-5279999999-7435899899960----79999847641-3356
Q ss_pred HHHHHCCCCCEEEEC
Q ss_conf 998404899779983
Q gi|254780976|r 96 NMIDRKVYVPFVFWD 110 (160)
Q Consensus 96 ~L~~~~~~l~v~~~D 110 (160)
.+. ...+..|.-.+
T Consensus 75 g~f-y~~~~kvi~~~ 88 (143)
T pfam04848 75 GFL-YNKQTKVICVN 88 (143)
T ss_pred HHH-CCCCCCEEEEC
T ss_conf 582-16777578708
No 33
>TIGR02089 TTC tartrate dehydrogenase; InterPro: IPR011829 Tartrate dehydrogenase catalyzes the oxidation of both meso- and (+)-tartrate as well as a D-malate . These enzymes are closely related to the 3-isopropylmalate and isohomocitrate dehydrogenases found in IPR004429 from INTERPRO and IPR011828 from INTERPRO, respectively.; GO: 0009027 tartrate dehydrogenase activity, 0051287 NAD binding.
Probab=77.17 E-value=2.1 Score=23.10 Aligned_cols=54 Identities=22% Similarity=0.404 Sum_probs=38.7
Q ss_pred HHHHHHHHHHHHHHCCCC-------------CEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH
Q ss_conf 799999999998404899-------------779983671679999999986986445464378998999999
Q gi|254780976|r 87 VHSTRAFVHNMIDRKVYV-------------PFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSIAAALILQ 146 (160)
Q Consensus 87 ~~~v~~f~~~L~~~~~~l-------------~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~AA~iILq 146 (160)
..++.+|+=+|.++++.- ---||||.....+++- -..+..+..||.+||-+.|.
T Consensus 167 veRilrFAFeLA~~RPrk~lTs~TKSNgi~~~MpfWDEv~~~~A~~y------p~v~~d~~hiDalaa~fv~~ 233 (355)
T TIGR02089 167 VERILRFAFELAQKRPRKHLTSATKSNGIKHSMPFWDEVFEEVAAEY------PDVEWDSYHIDALAARFVLK 233 (355)
T ss_pred HHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHC------CCCEEEHHHHHHHHHHHHCC
T ss_conf 46888999999975786441225622454316852678999997337------89524045786799886438
No 34
>TIGR01865 cas_Csn1 CRISPR-associated protein, Csn1 family; InterPro: IPR010145 CRISPR loci appear to be mobile elements with a wide host range. This entry represents proteins found only in CRISPR-containing species, near other CRISPR-associated proteins (cas). The species range so far for these proteins is pathogenic bacteria only..
Probab=76.58 E-value=5.2 Score=20.68 Aligned_cols=34 Identities=15% Similarity=0.338 Sum_probs=28.9
Q ss_pred CCEEEEECCCCEEEEEEEECCCCEECCCEEEECC
Q ss_conf 8399998688768999630765110121011047
Q gi|254780976|r 17 QPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRK 50 (160)
Q Consensus 17 ~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~ 50 (160)
.++||||.|+.=||.||.+.......|-..+...
T Consensus 1 ~y~lGLDiGi~SVGWA~~e~d~~~~~~~~~id~g 34 (1140)
T TIGR01865 1 EYILGLDIGIASVGWAVVEDDYKVEAKKRLIDGG 34 (1140)
T ss_pred CCEEEEEECCCCEEEEEEECCCCCCCCCCEEECC
T ss_conf 9348887020123405775464740124144048
No 35
>PTZ00186 heat shock 70 kDa precursor protein; Provisional
Probab=75.53 E-value=4.2 Score=21.28 Aligned_cols=25 Identities=28% Similarity=0.356 Sum_probs=21.4
Q ss_pred CCCCCEEEEECCCCEEEEEEEECCC
Q ss_conf 8998399998688768999630765
Q gi|254780976|r 14 KPNQPIASIDLGTKRIGLAISDPGR 38 (160)
Q Consensus 14 ~~~~riLgiD~G~kriGiAisd~~~ 38 (160)
+-+++|+|||+|+.+.=||+-+...
T Consensus 24 ~~~g~VIGIDLGTTnS~VAv~~~g~ 48 (657)
T PTZ00186 24 KVQGDVIGVDLGTTYSCVATMDGDK 48 (657)
T ss_pred CCCCCEEEEECCCCCEEEEEEECCE
T ss_conf 7888789996386447899998998
No 36
>PRK00292 glk glucokinase; Provisional
Probab=73.10 E-value=8.1 Score=19.53 Aligned_cols=61 Identities=8% Similarity=0.108 Sum_probs=34.2
Q ss_pred CCCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHC--CCEEEEEEC
Q ss_conf 9839999868876899963076511012101104774156898999986516--775899721
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTEN--IAAFIIGLP 76 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~--i~~iVvGlP 76 (160)
|+.+|.-|.|-.++=+|+.+....-.....++...+...+..-+..++.+.+ |...+++.+
T Consensus 1 M~~iLvaDIGGTn~R~al~~~~~~~~~~~~~~~~~~~~s~~~~l~~~l~~~~~~~~~a~iavA 63 (315)
T PRK00292 1 MKPALVGDIGGTNARFALCDDASGEILQAKVYSTADYPSLEEAVRVYLDEHGVQVGDACFAIA 63 (315)
T ss_pred CCCEEEEECCHHHHHHEEEECCCCCEEEEEEEECCCCCCHHHHHHHHHHHCCCCCCEEEEEEE
T ss_conf 985389981754444424536998620025895688789999999999857998354999997
No 37
>cd01987 USP_OKCHK USP domain is located between the N-terminal sensor domain and C-terminal catalytic domain of this Osmosensitive K+ channel histidine kinase family. The family of KdpD sensor kinase proteins regulates the kdpFABC operon responsible for potassium transport. The USP domain is homologous to the universal stress protein Usp Usp is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. Usp enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity.
Probab=72.03 E-value=8.6 Score=19.37 Aligned_cols=51 Identities=12% Similarity=0.199 Sum_probs=35.3
Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHH-HHHHHHHHHHHHHCCCCCEEEE
Q ss_conf 5689899998651677589972135788888447-9999999999840489977998
Q gi|254780976|r 54 QTALELLSFITTENIAAFIIGLPLNMNGSEGPRV-HSTRAFVHNMIDRKVYVPFVFW 109 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~-~~v~~f~~~L~~~~~~l~v~~~ 109 (160)
.....|.++.++++++.||+|-|- .+.+. -....+++.+.+..+++.|+.+
T Consensus 72 d~~~~I~~~A~~~~~t~IVlG~~~-----~~~~~~~~~gSv~~~vl~~a~~idv~iV 123 (124)
T cd01987 72 DVAEAIVEFAREHNVTQIVVGKSR-----RSRWRELFRGSLVDRLLRRAGNIDVHIV 123 (124)
T ss_pred CHHHHHHHHHHHCCCCEEEECCCC-----CCHHHHHHCCCHHHHHHHHCCCCCEEEE
T ss_conf 989999999998499899976898-----8546887238899999974999988985
No 38
>TIGR02026 BchE magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase; InterPro: IPR011772 This entry respresents the cobalamin-dependent oxidative cyclase responsible for forming the distinctive E-ring of the chlorin ring system under anaerobic conditions . This step is essential in the biosynthesis of both bacteriochlorophyll and chlorophyll under anaerobic conditions (a separate enzyme, AcsF, acts under aerobic conditions). This entry identifies two clades of sequences, one from photosynthetic, non-cyanobacterial bacteria and another including Synechocystis and several non-photosynthetic bacteria. The function of the Synechocystis gene is supported by gene clustering with other photosynthetic genes, so the purpose of the gene in the non-photosynthetic bacteria is uncertain. Note that homologues of this gene are not found in plants which rely solely on the aerobic cyclase.; GO: 0016709 oxidoreductase activity acting on paired donors with incorporation or reduction of molecular oxygen NADH or NADPH as one donor and incorporation of one atom of oxygen, 0050661 NADP binding, 0015995 chlorophyll biosynthetic process.
Probab=69.58 E-value=9.8 Score=19.05 Aligned_cols=52 Identities=12% Similarity=0.255 Sum_probs=45.2
Q ss_pred CHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE
Q ss_conf 156898999986516775899721357888884479999999999840489977998
Q gi|254780976|r 53 TQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFW 109 (160)
Q Consensus 53 ~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~ 109 (160)
.+..++|..++++|+|..|++- |..++-.-+.-..|+++++.+.+.+.|...
T Consensus 229 kKfvdEI~~L~r~hgVgfF~LA-----DEePT~Nr~~f~efCEe~Iar~l~~~v~~g 280 (506)
T TIGR02026 229 KKFVDEIEKLVREHGVGFFILA-----DEEPTVNRKKFQEFCEELIARNLQIKVTWG 280 (506)
T ss_pred CHHHHHHHHHHHHCCCCEEEEC-----CCCCCCCHHHHHHHHHHHHHCCCCEEEEEE
T ss_conf 1389999999863185336632-----788730168999999999857895699996
No 39
>PRK06395 phosphoribosylamine--glycine ligase; Provisional
Probab=68.86 E-value=10 Score=18.95 Aligned_cols=72 Identities=14% Similarity=0.247 Sum_probs=52.6
Q ss_pred HHHHHHHHHHHHHCCCEEEEEE--CCCCCC----------------CCCHHHHHHHHHHHHHHHHCCCCC-EEEECCCCC
Q ss_conf 5689899998651677589972--135788----------------888447999999999984048997-799836716
Q gi|254780976|r 54 QTALELLSFITTENIAAFIIGL--PLNMNG----------------SEGPRVHSTRAFVHNMIDRKVYVP-FVFWDERLT 114 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i~~iVvGl--P~~~~g----------------~~~~~~~~v~~f~~~L~~~~~~l~-v~~~DEr~T 114 (160)
.+...|.+++++++|+.+|||= |+- +| ....+.+..+.|++++..+ .++| -.-+.+-++
T Consensus 52 ~d~~~i~~~a~~~~idLvvvGPE~pL~-~Gi~D~l~~~gi~vFGP~k~aA~LE~SK~FaK~fm~~-~~IP~ta~~~~~~~ 129 (435)
T PRK06395 52 KDYDLIEDFALKNNVDIVFVGPDPVLA-TPLVNNLLKRGIKVASPTMEAAMIETSKMFMRYLMER-HNIPGNINFNACFS 129 (435)
T ss_pred CCHHHHHHHHHHCCCCEEEECCCHHHH-CCHHHHHHHCCCCEECCHHHHHHHHCCHHHHHHHHHH-CCCCCCCCCCCCCC
T ss_conf 699999999998499999989767886-6145599768994668428873263279889999998-09985664532399
Q ss_pred HHHHHHHHHHCCC
Q ss_conf 7999999998698
Q gi|254780976|r 115 TVSAQQILIDMNV 127 (160)
Q Consensus 115 S~~A~~~l~~~g~ 127 (160)
..+|.+.+...+.
T Consensus 130 ~~ea~~~~~~~~~ 142 (435)
T PRK06395 130 EKDAARDYITSMK 142 (435)
T ss_pred HHHHHHHHHHCCC
T ss_conf 9999999997099
No 40
>TIGR02858 spore_III_AA stage III sporulation protein AA; InterPro: IPR014217 Proteins in this entry include the stage III sporulation protein AA that is encoded by one of several genes in the spoIIIA locus. This protein is only found in species that are capable of endospore formation..
Probab=66.46 E-value=11 Score=18.66 Aligned_cols=83 Identities=22% Similarity=0.263 Sum_probs=46.2
Q ss_pred CCCEEEEEE----EECCCCEECCCEEE---ECCCCCHHHHHHHHHHHHHC---CCEEEEEECCCCCCCCCHHHHHHHHHH
Q ss_conf 887689996----30765110121011---04774156898999986516---775899721357888884479999999
Q gi|254780976|r 25 GTKRIGLAI----SDPGRRFAHPRPFL---VRKKVTQTALELLSFITTEN---IAAFIIGLPLNMNGSEGPRVHSTRAFV 94 (160)
Q Consensus 25 G~kriGiAi----sd~~~~~a~Pl~~i---~~~~~~~~~~~l~~ii~e~~---i~~iVvGlP~~~~g~~~~~~~~v~~f~ 94 (160)
|-.|||+|= .|..-.+-.+..-+ ..+...-..++|...+-+++ .+.+|||=|- .| +|. ..|+.+
T Consensus 71 GGHRVGlaG~~V~E~~~vk~ik~v~S~NiRIaRE~~G~A~~~~~yL~d~~~~~~NTLiIsPPq--~G-KTT---lLRDla 144 (282)
T TIGR02858 71 GGHRVGLAGRCVLENGKVKTIKNVASLNIRIAREVLGAADKILPYLVDRNGRVLNTLIISPPQ--CG-KTT---LLRDLA 144 (282)
T ss_pred CCEEEEEECEEEEECCCEEEEECCCCCCEEEEEECCCCCHHHHHHHHCCCCCEEEEEEECCCC--CC-CCC---HHHHHH
T ss_conf 672675303799507846451054642133020005775666887730589446788886889--88-510---488898
Q ss_pred HHHHH-----HCCCCCEEEECCCC
Q ss_conf 99984-----04899779983671
Q gi|254780976|r 95 HNMID-----RKVYVPFVFWDERL 113 (160)
Q Consensus 95 ~~L~~-----~~~~l~v~~~DEr~ 113 (160)
+.+.. ...+++|-.+|||-
T Consensus 145 R~~StG~~~~~~~g~KVgivDERS 168 (282)
T TIGR02858 145 RILSTGISKLGLKGKKVGIVDERS 168 (282)
T ss_pred HHHHCCCCCCCCCCCCEEEEECCH
T ss_conf 886078542468997469984324
No 41
>PRK11678 putative chaperone; Provisional
Probab=66.35 E-value=5.6 Score=20.51 Aligned_cols=25 Identities=24% Similarity=0.399 Sum_probs=18.9
Q ss_pred EEEEECCCCEEEEEEEECCCCEECC
Q ss_conf 9999868876899963076511012
Q gi|254780976|r 19 IASIDLGTKRIGLAISDPGRRFAHP 43 (160)
Q Consensus 19 iLgiD~G~kriGiAisd~~~~~a~P 43 (160)
.+|||+||.+.-+|+.+.......|
T Consensus 2 ~IGIDfGTTNS~VAv~~~g~~~vi~ 26 (450)
T PRK11678 2 FIGFDYGTANCSVAVMRDGKPQLLP 26 (450)
T ss_pred EEEEEECHHHEEEEEEECCEEEEEE
T ss_conf 6999847132332899899889998
No 42
>pfam03932 CutC CutC family. Copper transport in Escherichia coli is mediated by the products of at least six genes, cutA, cutB, cutC, cutD, cutE, and cutF. A mutation in one or more of these genes results in an increased copper sensitivity. Members of this family are between 200 and 300 amino acids in length are found in both eukaryotes and bacteria.
Probab=65.95 E-value=12 Score=18.60 Aligned_cols=83 Identities=16% Similarity=0.171 Sum_probs=53.7
Q ss_pred EECCCEEEECCCCC------HHHH---HHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE-
Q ss_conf 10121011047741------5689---8999986516775899721357888884479999999999840489977998-
Q gi|254780976|r 40 FAHPRPFLVRKKVT------QTAL---ELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFW- 109 (160)
Q Consensus 40 ~a~Pl~~i~~~~~~------~~~~---~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~- 109 (160)
...|+.+..|-... ..+. +=...+++.++++||+|. ++.||+... .+.++|.+...++|++|+
T Consensus 49 ~~ipv~vMIRPR~G~F~Ys~~E~~~M~~dI~~~~~~G~~GvV~G~-L~~d~~iD~------~~~~~li~~a~~l~~TFHR 121 (202)
T pfam03932 49 AKIPVYVMIRPRGGDFVYSDHELKAMLEDILKAVELGAPGVVLGA-LTADGEIDT------KRMEKLIEAAGGLGVTFHR 121 (202)
T ss_pred CCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEE-ECCCCCCCH------HHHHHHHHHHCCCCEEEEC
T ss_conf 599749998427998864989999999999999986989789988-889998299------9999999974688559862
Q ss_pred --CCCCCHHHHHHHHHHCCCCH
Q ss_conf --36716799999999869864
Q gi|254780976|r 110 --DERLTTVSAQQILIDMNVSR 129 (160)
Q Consensus 110 --DEr~TS~~A~~~l~~~g~~~ 129 (160)
|+--.-.+|-+.|.+.|..+
T Consensus 122 AfD~~~d~~~al~~L~~lG~~r 143 (202)
T pfam03932 122 AFDMCPDPEEALEQLIELGCER 143 (202)
T ss_pred HHHCCCCHHHHHHHHHHCCCCE
T ss_conf 0430599999999999759987
No 43
>PRK13529 malate dehydrogenase; Provisional
Probab=62.75 E-value=13 Score=18.23 Aligned_cols=124 Identities=20% Similarity=0.204 Sum_probs=71.4
Q ss_pred CCCCCEEEE-ECCCCEEEEEEEECCCCEE----CCCEEE-----ECCCCCHHHHHHHHHHHHHCCCEEEEEECCC-CCCC
Q ss_conf 899839999-8688768999630765110----121011-----0477415689899998651677589972135-7888
Q gi|254780976|r 14 KPNQPIASI-DLGTKRIGLAISDPGRRFA----HPRPFL-----VRKKVTQTALELLSFITTENIAAFIIGLPLN-MNGS 82 (160)
Q Consensus 14 ~~~~riLgi-D~G~kriGiAisd~~~~~a----~Pl~~i-----~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~-~~g~ 82 (160)
...+||||| |.|.-=+|++++--..-+| .|-.++ .-.|++. |+ + +-+.+|++-. ..|.
T Consensus 153 TDGerILGlGDlG~~GmgI~iGKl~lYta~aGi~P~~~LPV~LDvGTnNe~----LL---~----DPlYlG~r~~R~~g~ 221 (563)
T PRK13529 153 TDGERILGLGDQGIGGMGIPIGKLSLYTACGGISPARTLPVVLDVGTNNEQ----LL---N----DPLYLGWRHPRITGE 221 (563)
T ss_pred ECCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHCCCEEEECCCCCHH----HC---C----CCCCCCCCCCCCCHH
T ss_conf 467300245765678776514389887621588842205569953889677----63---5----963357678888768
Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHH-HHHHHHHHHHHHHHH
Q ss_conf 884479999999999840489977998367167999999998698644546437899-899999999968645
Q gi|254780976|r 83 EGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSI-AAALILQEVLDRISF 154 (160)
Q Consensus 83 ~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~-AA~iILq~~Ld~~~~ 154 (160)
+--..++.|.....++.++ +..+=|.|++..|.+.|..-.-+ ----+.|.. -|++.|-+.|+.+..
T Consensus 222 --~Y~~fidefv~av~~~fP~--~liqfEDF~~~nA~~lL~rYr~~--~~~FNDDiQGTaaV~LAgll~Alri 288 (563)
T PRK13529 222 --EYDEFVDEFVQAVKRRFPN--ALLQFEDFAQKNAMRLLERYRDE--ICTFNDDIQGTAAVTLAGLLAALKI 288 (563)
T ss_pred --HHHHHHHHHHHHHHHHCCC--CEEEHHCCCCHHHHHHHHHHHHC--CCEECCCCCHHHHHHHHHHHHHHHH
T ss_conf --8999999999999997898--27850107961389999998650--6422453103779999999999998
No 44
>cd00378 SHMT Serine-glycine hydroxymethyltransferase (SHMT). This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). SHMT carries out interconversion of serine and glycine; it catalyzes the transfer of hydroxymethyl group of N5, N10-methylene tetrahydrofolate to glycine resulting in the formation of serine and tetrahydrofolate. Both eukaryotic and prokaryotic SHMT enzymes form tight obligate homodimers; the mammalian enzyme forms a homotetramer comprising four pyridoxal phosphate-bound active sites.
Probab=62.61 E-value=13 Score=18.22 Aligned_cols=65 Identities=15% Similarity=0.080 Sum_probs=41.3
Q ss_pred HHHHHCCCCCCEEEEEC--------CCC-EEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC
Q ss_conf 99983589983999986--------887-6899963076511012101104774156898999986516775899721
Q gi|254780976|r 8 DLVKSLKPNQPIASIDL--------GTK-RIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLP 76 (160)
Q Consensus 8 ~~~~~~~~~~riLgiD~--------G~k-riGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP 76 (160)
=|...++|..+|||+|. |.. +++ ++ ...-...|+.+ ..++..-+++++.++++++.|..||+|..
T Consensus 97 v~~all~pGD~il~l~l~~GGHltHg~~~~v~--~~-g~~~~~~~Y~v-d~~~~~IDyd~~~~~a~~~kPklIi~G~S 170 (402)
T cd00378 97 VYFALLEPGDTIMGLDLSHGGHLTHGSFTKVS--AS-GKLFESVPYGV-DPETGLIDYDALEKMALEFKPKLIVAGAS 170 (402)
T ss_pred HHHHHHCCCCEEEEECCCCCCCCCCCCCCCCE--EE-EEEEEEEECCC-CCCCCCCCHHHHHHHHHHCCCCEEEECCC
T ss_conf 99997279998998664347775667787720--21-35678896324-55658718999999998659969997771
No 45
>cd01844 SGNH_hydrolase_like_6 SGNH_hydrolase subfamily. SGNH hydrolases are a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases.
Probab=61.94 E-value=14 Score=18.14 Aligned_cols=54 Identities=15% Similarity=0.204 Sum_probs=44.7
Q ss_pred HHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEC
Q ss_conf 989999865167758997213578888844799999999998404899779983
Q gi|254780976|r 57 LELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWD 110 (160)
Q Consensus 57 ~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~D 110 (160)
..+.+++.+..++.+|+-+=.|+.+......+....|.+.+.++.++.||.++.
T Consensus 47 ~~~a~~i~~~~a~~~vi~~g~N~~~~~~~~~~~~~~~V~~ir~~~p~tPIvlv~ 100 (177)
T cd01844 47 PEVAELLRDVPADLYIIDCGPNIVGAEAMVRERLGPLVKGLRETHPDTPILLVS 100 (177)
T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEE
T ss_conf 889999861899889996346788747899999999999999878899789983
No 46
>pfam00012 HSP70 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase domain and the carboxyl terminus is the substrate binding region.
Probab=61.88 E-value=7.2 Score=19.84 Aligned_cols=20 Identities=35% Similarity=0.562 Sum_probs=15.9
Q ss_pred EEEEECCCCEEEEEEEECCC
Q ss_conf 99998688768999630765
Q gi|254780976|r 19 IASIDLGTKRIGLAISDPGR 38 (160)
Q Consensus 19 iLgiD~G~kriGiAisd~~~ 38 (160)
|+|||+|+.++=+|+.+...
T Consensus 1 ViGIDlGTtns~vAv~~~g~ 20 (598)
T pfam00012 1 VIGIDLGTTNSCVAVMEGGG 20 (598)
T ss_pred CEEEECCHHCEEEEEEECCE
T ss_conf 98998460148999998998
No 47
>TIGR00856 pyrC_dimer dihydroorotase, homodimeric type; InterPro: IPR004721 Dihydroorotase belongs to MEROPS peptidase family M38 (clan MJ), where it is classified as a non-peptidase homologue. DHOase catalyses the third step in the de novo biosynthesis of pyrimidine, the conversion of ureidosuccinic acid (N-carbamoyl-L-aspartate) into dihydroorotate. Dihydroorotase binds a zinc ion which is required for its catalytic activity . In bacteria, DHOase is a dimer of identical chains of about 400 amino-acid residues (gene pyrC). In the metazoa, DHOase is part of a large multi-functional protein known as 'rudimentary' in Drosophila melanogaster and CAD in mammals and which catalyzes the first three steps of pyrimidine biosynthesis . The DHOase domain is located in the central part of this polyprotein. In yeast, DHOase is encoded by a monofunctional protein (gene URA4). However, a defective DHOase domain is found in a multifunctional protein (gene URA2) that catalyzes the first two steps of pyrimidine biosynthesis. The comparison of DHOase sequences from various sources shows that there are two highly conserved regions. The first located in the N-terminal extremity contains two histidine residues suggested to be involved in binding the zinc ion. The second is found in the C-terminal part. Members of this family of proteins are predicted to adopt a TIM barrel fold . This family represents the homodimeric form of dihydroorotase 3.5.2.3 from EC. It is found in bacteria, plants and fungi; URA4 of yeast is a member of this group of sequences. ; GO: 0004151 dihydroorotase activity, 0019856 pyrimidine base biosynthetic process.
Probab=61.88 E-value=14 Score=18.14 Aligned_cols=93 Identities=22% Similarity=0.280 Sum_probs=57.7
Q ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCCCCCH----------HHHHHHHH----HHHHHHHCCCCCEEEECCCCCHHHHHH
Q ss_conf 6898999986516775899721357888884----------47999999----999984048997799836716799999
Q gi|254780976|r 55 TALELLSFITTENIAAFIIGLPLNMNGSEGP----------RVHSTRAF----VHNMIDRKVYVPFVFWDERLTTVSAQQ 120 (160)
Q Consensus 55 ~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~----------~~~~v~~f----~~~L~~~~~~l~v~~~DEr~TS~~A~~ 120 (160)
++.++..+++.-+ =+|+|+..-|+.+. .-.+-..| -+.|..+++.|.|.+ |+.||+.|-+
T Consensus 116 ~~~~~~~vL~~m~----~~~l~L~~HgE~t~vhdgdkePI~~ld~E~~f~~~~L~~L~~~fP~Lkiil--EH~t~~~ai~ 189 (364)
T TIGR00856 116 LIEAILPVLEAMQ----KVGLPLLLHGEVTSVHDGDKEPIDILDREELFIEKVLEPLREKFPKLKIIL--EHITTEDAIK 189 (364)
T ss_pred HHHHHHHHHHHHH----HCCCEEEECCCCCCCCCCCCCCCCHHCHHHHHHHHHHHHHHHHCCCCEEEE--EECCCHHHHH
T ss_conf 4678999999887----629836631677765567887532103546768999899986277641899--8529889999
Q ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 99986986445464378998999999999686456
Q gi|254780976|r 121 ILIDMNVSRKKRIQKVDSIAAALILQEVLDRISFL 155 (160)
Q Consensus 121 ~l~~~g~~~k~~k~~iD~~AA~iILq~~Ld~~~~~ 155 (160)
...+.+.+.|+- .-=.+||+|-+|.-+=.++.+
T Consensus 190 ~i~~~n~~vkka--tdvk~aATlT~~HL~~~~d~l 222 (364)
T TIGR00856 190 LIEDINNRVKKA--TDVKLAATLTPHHLLFTRDDL 222 (364)
T ss_pred HHHHHHHCCCHH--HHHHHHHHHHHHHHHHHHHHH
T ss_conf 998640001012--101131031077887307888
No 48
>TIGR01766 tspaseT_teng_C transposase, IS605 OrfB family; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase ..
Probab=57.96 E-value=16 Score=17.72 Aligned_cols=62 Identities=6% Similarity=0.104 Sum_probs=35.9
Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCC-CCCC-CCH-HH----HHHHH-----HHHHHH--HHCCCCCEEEECCCCCH
Q ss_conf 5689899998651677589972135-7888-884-47----99999-----999998--40489977998367167
Q gi|254780976|r 54 QTALELLSFITTENIAAFIIGLPLN-MNGS-EGP-RV----HSTRA-----FVHNMI--DRKVYVPFVFWDERLTT 115 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i~~iVvGlP~~-~~g~-~~~-~~----~~v~~-----f~~~L~--~~~~~l~v~~~DEr~TS 115 (160)
+....+.+...+.+...||+|---+ .... ... .. +.... +...+. ...++++|.++++.|||
T Consensus 11 k~~~~~~~~~~~~~~~~~v~e~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~G~~v~~vnp~yts 86 (86)
T TIGR01766 11 KLAKQIVELALELNNGTIVLEDLKGRIKESDVDKDKGKKLRRKLHQWSFRKLISKLKYKAEEYGIEVILVNPSYTS 86 (86)
T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCC
T ss_conf 9999999988860798688625544100000134320466766647689999999998888709624764787789
No 49
>PRK13928 rod shape-determining protein Mbl; Provisional
Probab=57.76 E-value=16 Score=17.70 Aligned_cols=101 Identities=14% Similarity=0.159 Sum_probs=51.3
Q ss_pred CCCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCH--HHHHHHHHHHHHCCCEEEEEECCCCCCCCC--HHH-HHH
Q ss_conf 983999986887689996307651101210110477415--689899998651677589972135788888--447-999
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQ--TALELLSFITTENIAAFIIGLPLNMNGSEG--PRV-HST 90 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~--~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~--~~~-~~v 90 (160)
+++.+|||+||.++-+++.+....+..|=-+-..++... .+.+-..-.-...|..+++=.|+. +|..+ +.+ ...
T Consensus 2 ~~~~iGIDLGTtn~~v~~~~~~iv~~ePSvVa~~~~~~~~l~vG~~Ak~~~~~~p~~~~~~~pl~-~g~i~d~e~~~~~L 80 (325)
T PRK13928 2 FGRDIGIDLGTANVLVYVKGKGIVLNEPSVVAIDRNTNKVLAVGEEARRMVGRTPGNIVAIRPLR-DGVIADYDVTEKML 80 (325)
T ss_pred CCCEEEEECCHHCEEEEECCCCEECCCCEEEEEECCCCEEEEEHHHHHHHCCCCCCCEEEEEECC-CCEECCHHHHHHHH
T ss_conf 87744899860258999879987987884999988999799865999985447998789999568-97663678889989
Q ss_pred HHHHHHHHHH-CCCC--CEEEECCCCCHHH
Q ss_conf 9999999840-4899--7799836716799
Q gi|254780976|r 91 RAFVHNMIDR-KVYV--PFVFWDERLTTVS 117 (160)
Q Consensus 91 ~~f~~~L~~~-~~~l--~v~~~DEr~TS~~ 117 (160)
+.|..+...+ .+.. -|.-+.-.+|..|
T Consensus 81 ~~l~~~~~~~~~~~~~~vVItVPa~f~~~q 110 (325)
T PRK13928 81 KYFINKACGKRFFSKPRIMICIPSGITSVE 110 (325)
T ss_pred HHHHHHHHCCCCCCCCCEEEEECCCCCHHH
T ss_conf 999999730235799819999797999999
No 50
>PRK13930 rod shape-determining protein MreB; Provisional
Probab=56.95 E-value=17 Score=17.62 Aligned_cols=103 Identities=18% Similarity=0.111 Sum_probs=48.0
Q ss_pred CCCCCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHH--HHHHHHHHHHHCCCEEEEEECCCCCCCCCH---HHH
Q ss_conf 899839999868876899963076511012101104774156--898999986516775899721357888884---479
Q gi|254780976|r 14 KPNQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQT--ALELLSFITTENIAAFIIGLPLNMNGSEGP---RVH 88 (160)
Q Consensus 14 ~~~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~--~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~---~~~ 88 (160)
.-+++.+|||+|+.++-+|+.+....+-.|=-+-...+.... +..-..-.-..++..+.+=.|+. +|...+ .+.
T Consensus 6 ~~~~~~iGIDLGTtns~i~~~~~gii~~~PSvVa~~~~~~~i~~vG~~Ak~~~~~~p~~~~~~~pl~-~g~i~d~e~~~~ 84 (336)
T PRK13930 6 GFFSKDIGIDLGTANTLVYVKGKGIVLNEPSVVAVDKKTGKVLAVGEEAKAMLGRTPGNIEAIRPLK-DGVIADFEATEA 84 (336)
T ss_pred HHHCCEEEEECCHHCEEEEEECCCEECCCCEEEEEECCCCEEEEHHHHHHHHCCCCCCCEEEEEECC-CCEECCHHHHHH
T ss_conf 4545766999863538999918988898885999987999898438999985048977569999678-981548899999
Q ss_pred HHHHHHHHHHHH-CCCC--CEEEECCCCCHHH
Q ss_conf 999999999840-4899--7799836716799
Q gi|254780976|r 89 STRAFVHNMIDR-KVYV--PFVFWDERLTTVS 117 (160)
Q Consensus 89 ~v~~f~~~L~~~-~~~l--~v~~~DEr~TS~~ 117 (160)
..+.+..+...+ .... -|.-+.-.+|..+
T Consensus 85 ~l~~l~~~~~~~~~~~~~~~VItVPa~f~~~q 116 (336)
T PRK13930 85 MLRYFIKKARGRSSFGRPRIVICVPSGITEVE 116 (336)
T ss_pred HHHHHHHHHHHHCCCCCCEEEEEECCCCCHHH
T ss_conf 99999999986458899819999399999999
No 51
>PRK10490 sensor protein KdpD; Provisional
Probab=55.92 E-value=17 Score=17.52 Aligned_cols=52 Identities=13% Similarity=0.207 Sum_probs=39.0
Q ss_pred CHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEC
Q ss_conf 1568989999865167758997213578888844799999999998404899779983
Q gi|254780976|r 53 TQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWD 110 (160)
Q Consensus 53 ~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~D 110 (160)
......|.++.++.+++.||+|-|.... + ..-..+..+|.....++.|+.+.
T Consensus 322 ~dva~~il~~Ar~~nvT~IVlGrs~~rr-----w-~~r~sl~drLi~~a~~IDVhiV~ 373 (895)
T PRK10490 322 PAEEKAVVRYAREHNLGKIIIGRPASRR-----W-WRRETFADRLARLAPDLDQVIVA 373 (895)
T ss_pred CCHHHHHHHHHHHCCCCEEEECCCCCCC-----C-CCCCCHHHHHHHHCCCCEEEEEC
T ss_conf 9889999999998399989988878875-----0-33788999998718997499953
No 52
>cd03301 ABC_MalK_N The N-terminal ATPase domain of the maltose transporter, MalK. ATP binding cassette (ABC) proteins function from bacteria to human, mediating the translocation of substances into and out of cells or organelles. ABC transporters contain two transmembrane-spanning domains (TMDs) or subunits and two nucleotide binding domains (NBDs) or subunits that couple transport to the hydrolysis of ATP. In the maltose transport system, the periplasmic maltose binding protein (MBP) stimulates the ATPase activity of the membrane-associated transporter, which consists of two transmembrane subunits, MalF and MalG, and two copies of the ATP binding subunit, MalK, and becomes tightly bound to the transporter in the catalytic transition state, ensuring that maltose is passed to the transporter as ATP is hydrolyzed.
Probab=55.24 E-value=7.8 Score=19.63 Aligned_cols=88 Identities=11% Similarity=0.166 Sum_probs=54.7
Q ss_pred EECCCCEEEEEEEECCCCEECCCEEEE----------CCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHH
Q ss_conf 986887689996307651101210110----------4774156898999986516775899721357888884479999
Q gi|254780976|r 22 IDLGTKRIGLAISDPGRRFAHPRPFLV----------RKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTR 91 (160)
Q Consensus 22 iD~G~kriGiAisd~~~~~a~Pl~~i~----------~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~ 91 (160)
++...++||+.+-|.. -+|+-++. .........++.++++.-++..+.==+|..++|.+- |+|
T Consensus 67 ~~~~~R~ig~VfQ~~~---LfP~ltV~eNI~~~l~~~~~~~~e~~~~v~~~l~~~gl~~~~~~~P~~LSGGqk---QRV- 139 (213)
T cd03301 67 LPPKDRDIAMVFQNYA---LYPHMTVYDNIAFGLKLRKVPKDEIDERVREVAELLQIEHLLDRKPKQLSGGQR---QRV- 139 (213)
T ss_pred CCHHHCCEEEEECCCC---CCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHH---HHH-
T ss_conf 9976788789945876---465470999999899985999899999999999875992465099556999999---999-
Q ss_pred HHHHHHHHHCCCCCEEEECCCCCHHHHH
Q ss_conf 9999998404899779983671679999
Q gi|254780976|r 92 AFVHNMIDRKVYVPFVFWDERLTTVSAQ 119 (160)
Q Consensus 92 ~f~~~L~~~~~~l~v~~~DEr~TS~~A~ 119 (160)
.+++.|. .+-++.+.||-+|+..+.
T Consensus 140 aiARAl~---~~P~lLLlDEP~saLD~~ 164 (213)
T cd03301 140 ALGRAIV---REPKVFLMDEPLSNLDAK 164 (213)
T ss_pred HHHHHHH---CCCCEEEECCCCCCCCHH
T ss_conf 9999987---599989983887642989
No 53
>PRK05451 dihydroorotase; Provisional
Probab=54.53 E-value=18 Score=17.38 Aligned_cols=52 Identities=27% Similarity=0.394 Sum_probs=32.4
Q ss_pred EEECCCCCCCCCHH----HHHHHHHH----HHHHHHCCCCCEEEECCCCCHHHHHHHHHHCC
Q ss_conf 97213578888844----79999999----99984048997799836716799999999869
Q gi|254780976|r 73 IGLPLNMNGSEGPR----VHSTRAFV----HNMIDRKVYVPFVFWDERLTTVSAQQILIDMN 126 (160)
Q Consensus 73 vGlP~~~~g~~~~~----~~~v~~f~----~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g 126 (160)
+|+|+...|..+.. ..+-..|. ..|..+++++.|.+ |+-||++|-+..++.+
T Consensus 130 ~g~pLliHGEv~~~~idifdrE~~Fi~~~l~~l~~~fP~LkIvl--EHISTk~aV~~V~~~~ 189 (345)
T PRK05451 130 IGMPLLVHGEVTDFDIDIFDREAVFIDRVLEPLRRRFPKLKIVF--EHITTKDAVDYVREAN 189 (345)
T ss_pred CCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEE--EECCHHHHHHHHHHCC
T ss_conf 59904521454677667110587766999999998687876898--7435799999999579
No 54
>PRK13034 serine hydroxymethyltransferase; Reviewed
Probab=53.91 E-value=19 Score=17.32 Aligned_cols=68 Identities=18% Similarity=0.187 Sum_probs=41.5
Q ss_pred HHHHHCCCCCCEEEEECCC---CEEEEEEEECCCCE-ECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC
Q ss_conf 9998358998399998688---76899963076511-012101104774156898999986516775899721
Q gi|254780976|r 8 DLVKSLKPNQPIASIDLGT---KRIGLAISDPGRRF-AHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLP 76 (160)
Q Consensus 8 ~~~~~~~~~~riLgiD~G~---kriGiAisd~~~~~-a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP 76 (160)
=|...+.|..+|||+|.-. -.-|-.++-....+ ..++++ ..++..-+++++.++++++.|..||+|..
T Consensus 109 v~~all~PGD~im~l~l~~GGHltHG~~~~~sg~~~~~~~Y~v-d~~~~~IDyd~l~~~a~~~kPklIi~G~S 180 (422)
T PRK13034 109 VYLALLKPGDTILGMSLDAGGHLTHGAKVALSGKWFNAVQYGV-DRDTGLIDYDQVERLAVEHKPKLIIAGFS 180 (422)
T ss_pred HHHHHCCCCCEEEECCCCCCCCCCCCCCCCCCCCEEEEEEEEE-CCCCCCCCHHHHHHHHHHCCCCEEEECCC
T ss_conf 9999648998688436777762557887553456047898217-53238227999999987319986996661
No 55
>COG3513 Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]
Probab=53.81 E-value=13 Score=18.24 Aligned_cols=24 Identities=21% Similarity=0.459 Sum_probs=20.0
Q ss_pred CCCCEEEEECCCCEEEEEEEECCC
Q ss_conf 998399998688768999630765
Q gi|254780976|r 15 PNQPIASIDLGTKRIGLAISDPGR 38 (160)
Q Consensus 15 ~~~riLgiD~G~kriGiAisd~~~ 38 (160)
++..+||+|.|+.-||-|+....-
T Consensus 2 ~~~yilglDIGi~SVGWAvve~de 25 (1088)
T COG3513 2 KKAYILGLDIGINSVGWAVVEDDE 25 (1088)
T ss_pred CCCEEEEEECCCCCEEEEEEECCC
T ss_conf 864378751034520258862233
No 56
>PTZ00317 malic enzyme; Provisional
Probab=53.51 E-value=17 Score=17.61 Aligned_cols=124 Identities=18% Similarity=0.173 Sum_probs=70.6
Q ss_pred CCCCCEEEE-ECCCCEEEEEEEECCCCE---------ECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCC-CCCC
Q ss_conf 899839999-868876899963076511---------01210110477415689899998651677589972135-7888
Q gi|254780976|r 14 KPNQPIASI-DLGTKRIGLAISDPGRRF---------AHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLN-MNGS 82 (160)
Q Consensus 14 ~~~~riLgi-D~G~kriGiAisd~~~~~---------a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~-~~g~ 82 (160)
....||||| |.|.-=+|++++--..-+ ..|+..=.-.|++ +|+ + +-+.+|++-. ..|.
T Consensus 152 TDGerILGlGDlG~~GmgI~iGKl~lYta~~Gi~P~~~LPV~LDvGTnN~----~LL---~----Dp~YlG~r~~R~~g~ 220 (570)
T PTZ00317 152 TDGSRILGLGDLGANGIGISIGKCSLYVAGAGIKPSRVLPVVLDVGTNNE----KYL---N----DPLYLGLREKRLGDE 220 (570)
T ss_pred ECCCCEEECCCCCCCCCCCHHHHHHHHHHHCCCCHHCCCCEEEECCCCHH----HHC---C----CCCCCCCCCCCCCHH
T ss_conf 45721120266667854400888999987158883005876875688838----873---6----974467678888767
Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHH-HHHHHHHHHHHHHHH
Q ss_conf 884479999999999840489977998367167999999998698644546437899-899999999968645
Q gi|254780976|r 83 EGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSI-AAALILQEVLDRISF 154 (160)
Q Consensus 83 ~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~-AA~iILq~~Ld~~~~ 154 (160)
+--..++.|.....++.++ +..+=|.|++.-|.+.|..-.- +-.--+-|.. -|++.|-..|+-+.-
T Consensus 221 --~Y~~fvdefv~av~~~~P~--~liqfEDF~~~nA~~lL~~Yr~--~~~~FNDDiQGTaaV~LAgll~Alri 287 (570)
T PTZ00317 221 --EFYALLDEFMEAASATWPS--AVVQFEDFSNNHCFDMLERYQK--KYRCFNDDIQGTGAVIAAGFLNAVKL 287 (570)
T ss_pred --HHHHHHHHHHHHHHHHCCC--EEEEECCCCCHHHHHHHHHHHH--CCCEECCCCHHHHHHHHHHHHHHHHH
T ss_conf --8999999999999987898--2897244895448999999985--37642144213889999999999998
No 57
>COG1693 Repressor of nif and glnA expression [Transcription]
Probab=53.11 E-value=12 Score=18.57 Aligned_cols=19 Identities=5% Similarity=0.469 Sum_probs=7.6
Q ss_pred CCCEEEEECCCCEEEEEEE
Q ss_conf 9839999868876899963
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAIS 34 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAis 34 (160)
..++||++++....|+|+.
T Consensus 271 ~e~vLGlPv~~~~~Gvvvi 289 (325)
T COG1693 271 SERVLGLPVADRKVGVVVI 289 (325)
T ss_pred CCCEECCCCCCCCEEEEEE
T ss_conf 7625375458882579997
No 58
>PTZ00294 glycerol kinase; Provisional
Probab=52.35 E-value=17 Score=17.52 Aligned_cols=25 Identities=36% Similarity=0.403 Sum_probs=20.3
Q ss_pred CCCEEEEECCCCEEEEEEEECCCCE
Q ss_conf 9839999868876899963076511
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGRRF 40 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~~~ 40 (160)
|+-+||||+|+.-+=.++-|..+.+
T Consensus 1 MkYvlgID~GTts~Ka~l~d~~G~~ 25 (510)
T PTZ00294 1 MKYIGSIDQGTTSTRFIIFDEKQRI 25 (510)
T ss_pred CCEEEEEECCCCCEEEEEECCCCCE
T ss_conf 9699999716521477699599999
No 59
>pfam00370 FGGY_N FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain.
Probab=51.10 E-value=13 Score=18.23 Aligned_cols=22 Identities=18% Similarity=0.267 Sum_probs=19.2
Q ss_pred EEEEECCCCEEEEEEEECCCCE
Q ss_conf 9999868876899963076511
Q gi|254780976|r 19 IASIDLGTKRIGLAISDPGRRF 40 (160)
Q Consensus 19 iLgiD~G~kriGiAisd~~~~~ 40 (160)
+||||+|+..+=+++-|..+.+
T Consensus 2 ~lgIDiGTt~iKa~l~d~~g~~ 23 (245)
T pfam00370 2 VLGIDCGTTSTKAIVFNKQGKV 23 (245)
T ss_pred EEEEEHHHCCEEEEEECCCCCE
T ss_conf 9999803002100078599999
No 60
>COG3839 MalK ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism]
Probab=50.78 E-value=21 Score=17.01 Aligned_cols=90 Identities=14% Similarity=0.195 Sum_probs=54.6
Q ss_pred CCCCEEEEEEEEC--------CCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHH
Q ss_conf 6887689996307--------65110121011047741568989999865167758997213578888844799999999
Q gi|254780976|r 24 LGTKRIGLAISDP--------GRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVH 95 (160)
Q Consensus 24 ~G~kriGiAisd~--------~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~ 95 (160)
...+.||.-+-+. -..+|||++.-...+ ...-.++.+..+--+++.+.==+|..++|. |.|+| ..++
T Consensus 72 P~~R~iamVFQ~yALyPhMtV~~Niaf~Lk~~~~~k-~ei~~rV~eva~~L~l~~lL~r~P~~LSGG---QrQRV-AlaR 146 (338)
T COG3839 72 PEKRGIAMVFQNYALYPHMTVYENIAFGLKLRGVPK-AEIDKRVKEVAKLLGLEHLLNRKPLQLSGG---QRQRV-ALAR 146 (338)
T ss_pred HHHCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHCCCHHHHHCCCCCCCHH---HHHHH-HHHH
T ss_conf 557888999378301578769997341664479956-888999999998739866773590117725---67899-9987
Q ss_pred HHHHHCCCCCEEEECCCCCHHHHHHH
Q ss_conf 99840489977998367167999999
Q gi|254780976|r 96 NMIDRKVYVPFVFWDERLTTVSAQQI 121 (160)
Q Consensus 96 ~L~~~~~~l~v~~~DEr~TS~~A~~~ 121 (160)
.|. +.+ .+++.||=+|...|+-.
T Consensus 147 AlV-r~P--~v~L~DEPlSnLDaklR 169 (338)
T COG3839 147 ALV-RKP--KVFLLDEPLSNLDAKLR 169 (338)
T ss_pred HHH-CCC--CEEEECCCHHHHHHHHH
T ss_conf 775-478--87884476467659999
No 61
>COG1831 Predicted metal-dependent hydrolase (urease superfamily) [General function prediction only]
Probab=50.50 E-value=21 Score=16.99 Aligned_cols=73 Identities=8% Similarity=0.129 Sum_probs=50.6
Q ss_pred HHHHHHHHHHHHCCCEE-EEEEC-CCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHH---HHHHHCCCC
Q ss_conf 68989999865167758-99721-3578888844799999999998404899779983671679999---999986986
Q gi|254780976|r 55 TALELLSFITTENIAAF-IIGLP-LNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQ---QILIDMNVS 128 (160)
Q Consensus 55 ~~~~l~~ii~e~~i~~i-VvGlP-~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~---~~l~~~g~~ 128 (160)
-++-..+++.|..+.+| =+|.| +.-+...=+.+..+..++-++ .+..++||.+.-|+++.+.-+ ++.++.|++
T Consensus 109 ~lelA~k~v~eg~avaiGEvGrPHypVs~~v~~~~n~vl~~a~el-A~dvdc~vqLHtes~~~~~~~~i~~~ak~~G~~ 186 (285)
T COG1831 109 ALELAAKLVEEGKAVAIGEVGRPHYPVSEEVWEASNEVLEYAMEL-AKDVDCAVQLHTESLDEETYEEIAEMAKEAGIK 186 (285)
T ss_pred HHHHHHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHH-HHCCCCCEEEECCCCCHHHHHHHHHHHHHHCCC
T ss_conf 999999998503323310258888989999999999999999998-534798379753778867899999999981899
No 62
>pfam07282 Transposase_35 Putative transposase DNA-binding domain. This putative domain is found at the C-terminus of a large number of transposase proteins. This domain contains four conserved cysteines suggestive of a zinc binding domain. Given the need for transposases to bind DNA as well as the large number of DNA-binding zinc fingers we hypothesize this domain is DNA-binding.
Probab=49.73 E-value=16 Score=17.78 Aligned_cols=47 Identities=19% Similarity=0.203 Sum_probs=26.3
Q ss_pred HCCCCCEEEECCCCCHHHHHHH--HHHCCCCHHH-------HHHHHHHHHHHHHHH
Q ss_conf 0489977998367167999999--9986986445-------464378998999999
Q gi|254780976|r 100 RKVYVPFVFWDERLTTVSAQQI--LIDMNVSRKK-------RIQKVDSIAAALILQ 146 (160)
Q Consensus 100 ~~~~l~v~~~DEr~TS~~A~~~--l~~~g~~~k~-------~k~~iD~~AA~iILq 146 (160)
...+++|..+||++||+.=-.= ..+...+.+. ....-|-.||.-||+
T Consensus 13 ~~~G~~v~~v~~~~TS~~C~~Cg~~~~~~~~~r~~~C~~Cg~~~~rD~nAA~NI~~ 68 (69)
T pfam07282 13 KEYGIPVVYVNPAYTSKTCSVCGHKNKESLSGRTFKCPNCGFVADRDVNAAINILK 68 (69)
T ss_pred HHHCCEEEEECCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCEECCHHHHHHHHHC
T ss_conf 98399899989998864279998998674888789899899689959999999963
No 63
>TIGR01264 tyr_amTase_E tyrosine aminotransferase; InterPro: IPR005957 This entry describes tyrosine aminotransferase (2.6.1.5 from EC) as found in animals and Trypanosoma cruzi (Euglenozoa). It is the first enzyme of a pathway of tyrosine degradation via homogentisate. L-tyrosine + 2-oxoglutarate = 4-hydroxyphenylpyruvate + L-glutamate.; GO: 0004838 tyrosine transaminase activity, 0030170 pyridoxal phosphate binding, 0009072 aromatic amino acid family metabolic process.
Probab=47.76 E-value=24 Score=16.73 Aligned_cols=74 Identities=20% Similarity=0.127 Sum_probs=49.1
Q ss_pred EECCCCEECCCEE-EECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHH--HHHHHHHHHHHHHCCCCCEEEEC
Q ss_conf 3076511012101-10477415689899998651677589972135788888447--99999999998404899779983
Q gi|254780976|r 34 SDPGRRFAHPRPF-LVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRV--HSTRAFVHNMIDRKVYVPFVFWD 110 (160)
Q Consensus 34 sd~~~~~a~Pl~~-i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~--~~v~~f~~~L~~~~~~l~v~~~D 110 (160)
|.+..+-..++.. ++-++.+.++.+|.+++.| +=..|||-+|-|.-|+--... +++..|+++ ..+| .+-|
T Consensus 144 c~~~~i~~~~Y~~d~~d~~w~~DL~~l~~LiDd-~T~~l~vnNPSNPcGSvF~~~Hl~~l~a~A~~-----~~Lp-i~AD 216 (415)
T TIGR01264 144 CKALGIEVKLYNLDLPDEDWEIDLKQLESLIDD-KTKALIVNNPSNPCGSVFRREHLEELLALAER-----LKLP-IIAD 216 (415)
T ss_pred HHHCCCCCCEECCCCCCCHHHHHHHHHHHHCCC-CCCEEEEECCCCCCCCCCCHHHHHHHHHHHHH-----CCCC-EEEE
T ss_conf 401476200002579873356648988631256-74078883757788988787889999999875-----3872-3664
Q ss_pred CCCC
Q ss_conf 6716
Q gi|254780976|r 111 ERLT 114 (160)
Q Consensus 111 Er~T 114 (160)
|=|-
T Consensus 217 EIY~ 220 (415)
T TIGR01264 217 EIYG 220 (415)
T ss_pred EECC
T ss_conf 2025
No 64
>pfam07355 GRDB Glycine/sarcosine/betaine reductase selenoprotein B (GRDB). This family represents a conserved region approximately 350 residues long within the selenoprotein B component of the bacterial glycine, sarcosine and betaine reductase complexes.
Probab=47.63 E-value=24 Score=16.71 Aligned_cols=54 Identities=20% Similarity=0.365 Sum_probs=40.0
Q ss_pred CCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEE
Q ss_conf 47741568989999865167758997213578888844799999999998404899779
Q gi|254780976|r 49 RKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFV 107 (160)
Q Consensus 49 ~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~ 107 (160)
..|.+....++.+.+++++++.||-|=-.| .|..+-.|..+-. ... ..+++|+.
T Consensus 62 ~en~dea~~~il~mv~~~~pDlfiAGPAFn-AGRYGvACG~i~k---aV~-e~l~IP~v 115 (349)
T pfam07355 62 NENIEEAVAEILEMLKEEKPDLFIAGPAFN-AGRYGVACGTIAK---AVK-EELGIPAV 115 (349)
T ss_pred HCCHHHHHHHHHHHHHHCCCCEEEECCCCC-CCCHHHHHHHHHH---HHH-HHHCCCEE
T ss_conf 409999999999999842999898766325-6425888999999---999-98699638
No 65
>PRK00976 hypothetical protein; Provisional
Probab=47.34 E-value=24 Score=16.69 Aligned_cols=55 Identities=22% Similarity=0.250 Sum_probs=35.6
Q ss_pred EEEEECCCCEEEEEEEECCCCEECCCEEEECCCC--CHHHHHHHHHHHHHCCCEEEEEEC
Q ss_conf 9999868876899963076511012101104774--156898999986516775899721
Q gi|254780976|r 19 IASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKV--TQTALELLSFITTENIAAFIIGLP 76 (160)
Q Consensus 19 iLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~--~~~~~~l~~ii~e~~i~~iVvGlP 76 (160)
++|+|.|+.-|-+|+-+.....-|- +.|+.. ....+.|...+.-++++++-+-|.
T Consensus 3 fvGiDHGTtgirfa~~~~~~~~~fk---i~R~e~~~~s~~~~l~~~v~~~eIdliavtYs 59 (324)
T PRK00976 3 FVGIDHGTTGIRFAIIEGEVKSTFK---IPREEAKEMSFLDELEKHVPLEEIDLIAVTYS 59 (324)
T ss_pred EEEECCCCCEEEEEEECCCCEEEEE---ECHHHHHHCCHHHHHHHCCCHHHCEEEEEEEC
T ss_conf 8864478751799972697048998---62898752208888875088414138999620
No 66
>PRK10939 autoinducer-2 (AI-2) kinase; Provisional
Probab=47.24 E-value=16 Score=17.75 Aligned_cols=25 Identities=24% Similarity=0.239 Sum_probs=21.2
Q ss_pred CCCCEEEEECCCCEEEEEEEECCCC
Q ss_conf 9983999986887689996307651
Q gi|254780976|r 15 PNQPIASIDLGTKRIGLAISDPGRR 39 (160)
Q Consensus 15 ~~~riLgiD~G~kriGiAisd~~~~ 39 (160)
.|+-+||||+|+.-+=.++-|..+.
T Consensus 1 ~M~Y~lgiD~GTss~Ka~l~d~~G~ 25 (521)
T PRK10939 1 SMSYLMALDAGTGSIRAVIFDLNGN 25 (521)
T ss_pred CCCEEEEEECCCCCHHHEEEECCCC
T ss_conf 9878999986102100206989999
No 67
>pfam06723 MreB_Mbl MreB/Mbl protein. This family consists of bacterial MreB and Mbl proteins as well as two related archaeal sequences. MreB is known to be a rod shape-determining protein in bacteria and goes to make up the bacterial cytoskeleton. Genes coding for MreB/Mbl are only found in elongated bacteria, not in coccoid forms. It has been speculated that constituents of the eukaryotic cytoskeleton (tubulin, actin) may have evolved from prokaryotic precursor proteins closely related to today's bacterial proteins FtsZ and MreB/Mbl.
Probab=47.12 E-value=24 Score=16.67 Aligned_cols=100 Identities=14% Similarity=0.167 Sum_probs=46.8
Q ss_pred CCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHH--HHHHHHHHHHHCCCEEEEEECCCCCCCCC--HHHHHH-H
Q ss_conf 839999868876899963076511012101104774156--89899998651677589972135788888--447999-9
Q gi|254780976|r 17 QPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQT--ALELLSFITTENIAAFIIGLPLNMNGSEG--PRVHST-R 91 (160)
Q Consensus 17 ~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~--~~~l~~ii~e~~i~~iVvGlP~~~~g~~~--~~~~~v-~ 91 (160)
++.+|||+|+.+.-+++.+....+-.|=-+-..++.... +..-.+-.-...+..+..=.|+. +|... +.++.+ +
T Consensus 1 ~~~iGIDLGTtnS~i~~~~~g~v~~~PSvVa~~~~~~~~i~vG~~Ak~~~~~~p~~~~~~~~~~-~g~~~~~e~~~~il~ 79 (327)
T pfam06723 1 SKDIGIDLGTANTLVYVKGKGIVLNEPSVVAINTKTKKVLAVGNEAKKMLGRTPGNIVAVRPLK-DGVIADFEVTEAMLK 79 (327)
T ss_pred CCEEEEECCHHHEEEEECCCCEECCCCCEEEEECCCCEEEECCHHHHHHCCCCCCCEEEEEECC-CCEECCHHHHHHHHH
T ss_conf 9768998755408999879987988885999987998899723999973547999769999778-986757888799999
Q ss_pred HHHHHHHHHCCCC----CEEEECCCCCHHHH
Q ss_conf 9999998404899----77998367167999
Q gi|254780976|r 92 AFVHNMIDRKVYV----PFVFWDERLTTVSA 118 (160)
Q Consensus 92 ~f~~~L~~~~~~l----~v~~~DEr~TS~~A 118 (160)
.+.+... ..... -|.-+.-.||..+=
T Consensus 80 ~l~~~~~-~~~~~~~~~aVItVPa~f~~~qR 109 (327)
T pfam06723 80 YFIKKVH-GRRSLSKPRVVICVPSGITEVER 109 (327)
T ss_pred HHHHHHH-HHCCCCCCCEEEEECCCCCHHHH
T ss_conf 9999997-40278999789997999999999
No 68
>cd00458 SugarP_isomerase SugarP_isomerase: Sugar Phosphate Isomerase family; includes type A ribose 5-phosphate isomerase (RPI_A), glucosamine-6-phosphate (GlcN6P) deaminase, and 6-phosphogluconolactonase (6PGL). RPI catalyzes the reversible conversion of ribose-5-phosphate to ribulose 5-phosphate, the first step of the non-oxidative branch of the pentose phosphate pathway. GlcN6P deaminase catalyzes the reversible conversion of GlcN6P to D-fructose-6-phosphate (Fru6P) and ammonium, the last step of the metabolic pathway of N-acetyl-D-glucosamine-6-phosphate. 6PGL converts 6-phosphoglucono-1,5-lactone to 6-phosphogluconate, the second step of the oxidative phase of the pentose phosphate pathway.
Probab=47.04 E-value=24 Score=16.66 Aligned_cols=53 Identities=15% Similarity=0.143 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHH-HCCCCCEEEECCCCC
Q ss_conf 689899998651677589972135788888447999999999984-048997799836716
Q gi|254780976|r 55 TALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMID-RKVYVPFVFWDERLT 114 (160)
Q Consensus 55 ~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~-~~~~l~v~~~DEr~T 114 (160)
..+.|.+++++. ..+|+|+| -| .+|.. .-+.+++.... ..-++-++..|||+-
T Consensus 8 ia~~i~k~i~~K--~~~vLGLa---tG-STP~~-~Y~~Li~~~~~~sf~nV~tFnlDEY~~ 61 (169)
T cd00458 8 IEDKXEKLLEEK--DDMVIGLG---TG-STPAY-FYKLLGEKLKRGEISDIVGFPTDERYV 61 (169)
T ss_pred HHHHHHHHHHCC--CCEEEEEC---CC-CCHHH-HHHHHHHHHHCCCCCEEEEEEEHCCCC
T ss_conf 999999999607--78499957---99-88799-999999997447801389984000047
No 69
>PRK00047 glpK glycerol kinase; Provisional
Probab=46.56 E-value=25 Score=16.61 Aligned_cols=28 Identities=21% Similarity=0.195 Sum_probs=22.5
Q ss_pred CCCCCCEEEEECCCCEEEEEEEECCCCE
Q ss_conf 5899839999868876899963076511
Q gi|254780976|r 13 LKPNQPIASIDLGTKRIGLAISDPGRRF 40 (160)
Q Consensus 13 ~~~~~riLgiD~G~kriGiAisd~~~~~ 40 (160)
|..+.-+||||+|+.-+=.++-|..+.+
T Consensus 1 ~Mm~~yvlgiD~GTts~Ka~l~d~~G~~ 28 (498)
T PRK00047 1 MMMKKYILALDQGTTSSRAIIFDHKGNI 28 (498)
T ss_pred CCCCCEEEEEECCCCEEEEEEECCCCCE
T ss_conf 9876879999746330356699699999
No 70
>TIGR03275 methan_mark_8 putative methanogenesis marker protein 8. Members of this protein family, to date, are found in a completed prokaryotic genome if and only if the species is one of the archaeal methanogens. The exact function is unknown, but likely is linked to methanogenesis or a process closely connected to it.
Probab=46.53 E-value=25 Score=16.61 Aligned_cols=55 Identities=18% Similarity=0.388 Sum_probs=37.5
Q ss_pred EEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHH
Q ss_conf 99986887689996307651101210110477415689899998651677589972135788888447999999
Q gi|254780976|r 20 ASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAF 93 (160)
Q Consensus 20 LgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f 93 (160)
.+++.|-|||+|-++|+ .+.+++.++-.+++++.+++|- |..|-..+-++.+-.+
T Consensus 157 ~Aie~Gyk~IaVTV~~~-----------------~~a~~iRele~~~~~~~~if~V--HtTGis~eea~~~~~~ 211 (259)
T TIGR03275 157 KAIELGYKKIAVTVADA-----------------EDAKAIRELESESGIDIIIFAV--HTTGIDREDAEEVVQY 211 (259)
T ss_pred HHHHCCCCEEEEEECCH-----------------HHHHHHHHHCCCCCCCEEEEEE--ECCCCCHHHHHHHHHH
T ss_conf 99972875589994588-----------------8899999752246984899999--6689987899999886
No 71
>PRK13789 phosphoribosylamine--glycine ligase; Provisional
Probab=46.28 E-value=25 Score=16.59 Aligned_cols=103 Identities=16% Similarity=0.186 Sum_probs=62.5
Q ss_pred HHHHHHHHHHHHHCCCEEEEEEC--CC--------CCC-------CCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHH
Q ss_conf 56898999986516775899721--35--------788-------88844799999999998404899779983671679
Q gi|254780976|r 54 QTALELLSFITTENIAAFIIGLP--LN--------MNG-------SEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTV 116 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i~~iVvGlP--~~--------~~g-------~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~ 116 (160)
.+...+.+++++++|+..|||=- +. ..| ....+.+..+.|++++..+ .++|---+-.--+..
T Consensus 55 ~d~~~i~~fa~~~~idLvvvGPE~PL~~Gi~D~l~~~gi~vfGP~k~aA~LE~SK~faK~fm~~-~~IPTa~~~~f~~~~ 133 (426)
T PRK13789 55 LDKSSVQSFLKSNPFDLIVVGPEDPLVAGFADWAAELGIPCFGPDSYCAQVEGSKHFAKSLMKE-AKIPTASYKTFTEYS 133 (426)
T ss_pred CCHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCCEEECCCHHHHHHHCCHHHHHHHHHH-CCCCCCEEEEECCHH
T ss_conf 6999999999984999999896688863179998416991689598995051269899999997-299976015736899
Q ss_pred HHHHHHHHCCCCHHHH-------HHH---HHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 9999999869864454-------643---7899899999999968645662
Q gi|254780976|r 117 SAQQILIDMNVSRKKR-------IQK---VDSIAAALILQEVLDRISFLES 157 (160)
Q Consensus 117 ~A~~~l~~~g~~~k~~-------k~~---iD~~AA~iILq~~Ld~~~~~~~ 157 (160)
+|.+.+.+.+...--+ |+. -|...|.-.|+..+....+-+.
T Consensus 134 ~A~~~l~~~~~P~VIKaDGLAaGKGV~V~~~~~eA~~al~~i~~~~~fg~a 184 (426)
T PRK13789 134 SSLSYLESEMLPIVIKADGLAAGKGVTVATEKKMAKRALKEIFKDKKFGQS 184 (426)
T ss_pred HHHHHHHHCCCCEEEECCCCCCCCCEEEECCHHHHHHHHHHHHHHHHHHCC
T ss_conf 999999857998899678768999669539999999999997544233035
No 72
>pfam00464 SHMT Serine hydroxymethyltransferase.
Probab=46.21 E-value=25 Score=16.58 Aligned_cols=68 Identities=15% Similarity=0.050 Sum_probs=40.9
Q ss_pred HHHHHCCCCCCEEEEECCC--C-EEEEEEEECC-CCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC
Q ss_conf 9998358998399998688--7-6899963076-511012101104774156898999986516775899721
Q gi|254780976|r 8 DLVKSLKPNQPIASIDLGT--K-RIGLAISDPG-RRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLP 76 (160)
Q Consensus 8 ~~~~~~~~~~riLgiD~G~--k-riGiAisd~~-~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP 76 (160)
=|...++|..+|||+|... - .-|-.++-.. .--+.|+++ ..++..-+++++.++++++.|..||+|..
T Consensus 98 v~~all~PGD~il~l~l~~GGHltHg~~~~~~g~~~~~~~Y~~-d~~~~~ID~d~l~~~a~~~kPklIi~G~S 169 (380)
T pfam00464 98 VYTALLNPGDRIMGLDLPHGGHLTHGYPVNFSGKFFESMPYGV-DPDTGLIDYDQLEKNAKLFKPKLIVAGTS 169 (380)
T ss_pred HHHHHHCCCCEEEECCCCCCCCCCCCCCCCEECCEEEEEECCC-CCCCCCCCHHHHHHHHHHCCCCEEEECCC
T ss_conf 9999837999899536777754567986562011467787434-64548668999999999709988998740
No 73
>PRK12767 carbamoyl phosphate synthase-like protein; Provisional
Probab=45.55 E-value=26 Score=16.52 Aligned_cols=56 Identities=20% Similarity=0.123 Sum_probs=43.0
Q ss_pred CCCCCCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEE
Q ss_conf 589983999986887689996307651101210110477415689899998651677589972
Q gi|254780976|r 13 LKPNQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGL 75 (160)
Q Consensus 13 ~~~~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGl 75 (160)
.....+|+|.|.-..-.|..++|..-.+ |. ......+..|.+++++++|+.++-|.
T Consensus 22 ~~~~~~Vi~~D~~~~a~~~~~aD~~y~~--P~-----~~d~~y~~~ll~i~~~~~id~iiP~~ 77 (325)
T PRK12767 22 SLLGGKVIGADISPLAPALYFADKFYVV--PK-----VTDPNYIDALLDICKKENIDALIPLI 77 (325)
T ss_pred CCCCCEEEEECCCCCCCCHHHCCEEEEC--CC-----CCCHHHHHHHHHHHHHHCCCEEEECC
T ss_conf 6998599996899899534454889987--88-----89878999999999987999999778
No 74
>PRK11572 copper homeostasis protein CutC; Provisional
Probab=45.37 E-value=26 Score=16.50 Aligned_cols=84 Identities=13% Similarity=0.201 Sum_probs=53.7
Q ss_pred EECCCEEEECCCCC------HHHHH---HHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE-
Q ss_conf 10121011047741------56898---999986516775899721357888884479999999999840489977998-
Q gi|254780976|r 40 FAHPRPFLVRKKVT------QTALE---LLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFW- 109 (160)
Q Consensus 40 ~a~Pl~~i~~~~~~------~~~~~---l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~- 109 (160)
...|+-++.|-... ..+.. =...+++.++++||+|. ++.||+.... ..+ +|.+...++|++|+
T Consensus 50 ~~iPV~vMIRPR~GdF~Ys~~E~~~M~~dI~~~~~~Ga~GvV~G~-L~~dg~iD~~--~~~----~Li~~a~~l~vTFHR 122 (248)
T PRK11572 50 VTIPVHPIIRPRGGDFCYSDGEFAAMLEDIRTVRELGFPGLVTGV-LDVDGHVDMP--RME----KIMAAAGPLAVTFHR 122 (248)
T ss_pred CCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEE-ECCCCCCCHH--HHH----HHHHHHCCCEEEEEC
T ss_conf 699738999426998867989999999999999986999679966-8899984999--999----999974898079862
Q ss_pred --CCCCCHHHHHHHHHHCCCCHH
Q ss_conf --367167999999998698644
Q gi|254780976|r 110 --DERLTTVSAQQILIDMNVSRK 130 (160)
Q Consensus 110 --DEr~TS~~A~~~l~~~g~~~k 130 (160)
|.----.+|-+.|.+.|..+=
T Consensus 123 AfD~~~dp~~ale~Li~lG~~rI 145 (248)
T PRK11572 123 AFDMCANPLNALKNLAELGVARI 145 (248)
T ss_pred HHHCCCCHHHHHHHHHHCCCCEE
T ss_conf 02214999999999997599989
No 75
>TIGR02166 dmsA_ynfE anaerobic dimethyl sulfoxide reductase, A subunit, DmsA/YnfE family; InterPro: IPR011888 Many bacterial species are capable of anaerobic growth by using dimethylsulphoxide (DMSO) as the terminal electron acceptor, with DMSO reductase as the terminal elctron transfer enzyme. In Escherichia coli and many other Gram-negative bacteria DMSO reductase is a membrane-bound enzyme composed of three subunits; a catalytic molybdenum-containing subunit (DmsA), an electron transfer subunit containing a [4Fe-4S] cluster (DmsB), and a hydrophobic membrane-spanning anchor subunit which attaches the enzyme to the cytoplasmic membrane (DmsC) , . It is generally thought now that DmsAB faces the periplasm, contradicting previous results suggesting a cytoplasmic location. The N-terminal region of DmsA contains a "twin-arginine" signal sequence, suggesting export to the periplasm occurs via the TAT secretion pathway. This entry represents known and predicted bacterial DmsA polypeptides. Several species contain one or more paralogs of DmsA. In E. coli, the two paralogs of DmsA, YnfE and YnfF, are encoded within the ynfEFGHI operon . YnfE and YnfF cannot form a functional complex with DmsBC, but YnfFGH can restore growth on DMSO when DmABC is deleted. The function of YnfE is not known and it appears to prevent formation of the YnfFGH complex if present.; GO: 0009389 dimethyl sulfoxide reductase activity, 0030151 molybdenum ion binding, 0051539 4 iron 4 sulfur cluster binding.
Probab=44.47 E-value=20 Score=17.22 Aligned_cols=84 Identities=18% Similarity=0.155 Sum_probs=56.6
Q ss_pred CEEEEEECCCCCCCCCHHH--HHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH
Q ss_conf 7589972135788888447--99999999998404899779983671679999999986986445464378998999999
Q gi|254780976|r 69 AAFIIGLPLNMNGSEGPRV--HSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSIAAALILQ 146 (160)
Q Consensus 69 ~~iVvGlP~~~~g~~~~~~--~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~AA~iILq 146 (160)
..+||++=. |-.|+.++ .....|.+.+. ++-+..++.+|=|||=..|-+.=...-+.-.--...++++|=++|=|
T Consensus 221 skLvv~fG~--NP~ETRmSGGG~~y~~~~A~~-~ns~a~~I~iDPRYtDt~ag~~deWiPI~PGTDAALv~A~ayVlI~E 297 (849)
T TIGR02166 221 SKLVVMFGN--NPAETRMSGGGQTYYFLQALE-KNSNAKVIVIDPRYTDTVAGREDEWIPIRPGTDAALVAAIAYVLISE 297 (849)
T ss_pred CCEEEEECC--CCHHEEECCCCCHHHHHHHHH-HHCCCEEEEECCCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCC
T ss_conf 876898157--810203227860456899997-52386189975888720323014786788772589999988875517
Q ss_pred HHHH--HHHHH
Q ss_conf 9996--86456
Q gi|254780976|r 147 EVLD--RISFL 155 (160)
Q Consensus 147 ~~Ld--~~~~~ 155 (160)
+|.| ..+||
T Consensus 298 Gwvnl~D~~FL 308 (849)
T TIGR02166 298 GWVNLHDKAFL 308 (849)
T ss_pred CCCCCCCHHHH
T ss_conf 73253267778
No 76
>PRK13929 rod-share determining protein MreBH; Provisional
Probab=44.23 E-value=27 Score=16.39 Aligned_cols=100 Identities=14% Similarity=0.131 Sum_probs=51.4
Q ss_pred CCCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCH--HHHHHHHHHHHHCCCEEEEEECCCCCCCCCH--HHH-HH
Q ss_conf 983999986887689996307651101210110477415--6898999986516775899721357888884--479-99
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQ--TALELLSFITTENIAAFIIGLPLNMNGSEGP--RVH-ST 90 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~--~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~--~~~-~v 90 (160)
+++.+|||+|+.++=++..+....+-.|--+-...+... .+..-..-.-...|..+++-.|+. +|..++ .+. ..
T Consensus 3 ~~~~iGIDLGTtns~v~~~~~gii~~ePSvva~d~~~~~~~avG~~Ak~~~~~~p~~i~~~~p~~-~g~i~d~e~s~~iL 81 (335)
T PRK13929 3 QSTEIGIDLGTANILVYSKNKGIILNEPSVVAVDTETKAVLAIGTEAKNMIGKTPGKIVAVRPMK-DGVIADYDMTTDLL 81 (335)
T ss_pred CCCEEEEECCHHHEEEEECCCCEECCCCCEEEEECCCCEEEEECHHHHHHHCCCCCCEEEEEECC-CCEECCHHHHHHHH
T ss_conf 88816898753528999878998884786899977997299964999997156977689998578-96345667679999
Q ss_pred HHHHHHHHHHCCCC------CEEEECCCCCHHH
Q ss_conf 99999998404899------7799836716799
Q gi|254780976|r 91 RAFVHNMIDRKVYV------PFVFWDERLTTVS 117 (160)
Q Consensus 91 ~~f~~~L~~~~~~l------~v~~~DEr~TS~~ 117 (160)
+.|.++. ...++. -|.-+.-.+|..|
T Consensus 82 ~~l~~~a-~~~~g~~~~~p~~VItVPa~ft~~q 113 (335)
T PRK13929 82 KQIMKKA-GKNIGMTFRKPTVVVCTPSGSTAVE 113 (335)
T ss_pred HHHHHHH-HHHCCCCCCCCCEEEEECCCCCHHH
T ss_conf 9999999-9851876688878999299999999
No 77
>PRK13790 phosphoribosylamine--glycine ligase; Provisional
Probab=43.55 E-value=27 Score=16.33 Aligned_cols=74 Identities=18% Similarity=0.379 Sum_probs=49.0
Q ss_pred CHHHHHHHHHHHHHCCCEEEEEE--CCC--------CCC-------CCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCH
Q ss_conf 15689899998651677589972--135--------788-------8884479999999999840489977998367167
Q gi|254780976|r 53 TQTALELLSFITTENIAAFIIGL--PLN--------MNG-------SEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTT 115 (160)
Q Consensus 53 ~~~~~~l~~ii~e~~i~~iVvGl--P~~--------~~g-------~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS 115 (160)
..+...+.+++++++|+..|||= |+. ..| ....+.+..+.|++++..+ .++|---+-.--+-
T Consensus 49 ~~d~~~i~~~a~~~~iDLvvvGPE~PL~~GivD~l~~~gi~vfGP~k~aA~LE~SK~faK~~m~~-~~IPTa~~~~f~~~ 127 (415)
T PRK13790 49 ESDHQAILDFAKQQNVDWVVIGPEQPLIDGLADILRANGFKVFGPNKQAAQIEGSKLFAKKIMEK-YNIPTADYKEVERK 127 (415)
T ss_pred CCCHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHCCCCEEECCCHHHHHHHCCHHHHHHHHHH-CCCCCCCEEEECCH
T ss_conf 55899999999981999999896078663488886438975989497995151269999999998-29797772550879
Q ss_pred HHHHHHHHHCCC
Q ss_conf 999999998698
Q gi|254780976|r 116 VSAQQILIDMNV 127 (160)
Q Consensus 116 ~~A~~~l~~~g~ 127 (160)
.+|.+.+.+.+.
T Consensus 128 ~~A~~~l~~~~~ 139 (415)
T PRK13790 128 KDALTYIENCEL 139 (415)
T ss_pred HHHHHHHHHCCC
T ss_conf 999999861799
No 78
>cd01988 Na_H_Antiporter_C The C-terminal domain of a subfamily of Na+ /H+ antiporter existed in bacteria and archea . Na+/H+ exchange proteins eject protons from cells, effectively eliminating excess acid from actively metabolising cells. Na+ /H+ exchange activity is also crucial for the regulation of cell volume, and for the reabsorption of NaCl across renal, intestinal, and other epithelia. These antiports exchange Na+ for H+ in an electroneutral manner, and this activity is carried out by a family of Na+ /H+ exchangers, or NHEs, which are known to be present in both prokaryotic and eukaryotic cells. These exchangers are highly-regulated (glyco)phosphoproteins, which, based on their primary structure, appear to contain 10-12 membrane-spanning regions (M) at the N-terminus and a large cytoplasmic region at the C-terminus. The transmembrane regions M3-M12 share identity wit h other members of the family. The M6 and M7 regions are highly conserved. Thus, this is thought to be the regio
Probab=42.03 E-value=29 Score=16.19 Aligned_cols=51 Identities=8% Similarity=0.053 Sum_probs=28.4
Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE
Q ss_conf 56898999986516775899721357888884479999999999840489977998
Q gi|254780976|r 54 QTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFW 109 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~ 109 (160)
.-...|.+.+++++++.||+|.+-. .|-..-. .-..++.+. +...+||..+
T Consensus 81 ~~~~~I~~~a~e~~~DlIVmG~~g~-~~~~~~~---lGSv~~~V~-~~a~cpVlvV 131 (132)
T cd01988 81 DIASGILRTAKERQADLIIMGWHGS-TSLRDRL---FGGVIDQVL-ESAPCDVAVV 131 (132)
T ss_pred CHHHHHHHHHHHHCCCEEEECCCCC-CCCCCCC---CCCHHHHHH-HCCCCCEEEE
T ss_conf 9799999999984999999837999-8754244---475799998-4599988996
No 79
>pfam03709 OKR_DC_1_N Orn/Lys/Arg decarboxylase, N-terminal domain. This domain has a flavodoxin-like fold, and is termed the "wing" domain because of its position in the overall 3D structure.
Probab=41.83 E-value=29 Score=16.17 Aligned_cols=59 Identities=22% Similarity=0.330 Sum_probs=38.9
Q ss_pred CEEEECCCCCHHHHHHHHHHH-HHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHH
Q ss_conf 101104774156898999986-51677589972135788888447999999999984048997799836716799
Q gi|254780976|r 44 RPFLVRKKVTQTALELLSFIT-TENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVS 117 (160)
Q Consensus 44 l~~i~~~~~~~~~~~l~~ii~-e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~ 117 (160)
..++...+ ...+..+++ .-.+.++|+-+ .+....+...+..+.+++||++.-|+.|+..
T Consensus 18 ~~vv~~~~----~~d~~~~~~~~~~i~avil~~-----------~~~~~~ll~~ir~~n~~lPvFl~~~~~~~~~ 77 (111)
T pfam03709 18 REVVDATS----TDDLLSLIETFTDIAAVVLSW-----------DDEARGLLDEIRRRNFDLPVFLLAETRTSED 77 (111)
T ss_pred CEEEEECC----HHHHHHHHHHCCCCCEEEEEE-----------CHHHHHHHHHHHHHCCCCCEEEEECHHHHHC
T ss_conf 88997487----899999998387876899984-----------6068999999997478999898844125750
No 80
>TIGR01975 isoAsp_dipep beta-aspartyl peptidase; InterPro: IPR010229 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of proteins include metallopeptidases belonging to the MEROPS peptidase family M38 (clan MJ, beta-aspartyl dipeptidase family). This entry includes the beta-aspartyl dipeptidase from Escherichia coli, (3.4.19.5 from EC, IadA), which degrades isoaspartyl dipeptides and may unblock degradation of proteins that cannot be repaired. This entry also describes closely related proteins from other species (e.g. Clostridium perfringens, Thermoanaerobacter tengcongensis) that may have an equivalent in function. This family shows homology to dihydroorotases. The L-isoaspartyl derivative of Asp arises non-enzymatically over time as a form of protein damage. In this isomerisation, the connectivity of the polypeptide changes to pass through the beta-carboxyl of the side chain. Much but not all of this damage can be repaired by protein-L-isoaspartate (D-aspartate) O-methyltransferase..
Probab=41.75 E-value=29 Score=16.16 Aligned_cols=64 Identities=28% Similarity=0.442 Sum_probs=43.6
Q ss_pred CEEEE---EEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC----CCCCCCCCHHHHHHHHHHHHHHH
Q ss_conf 76899---963076511012101104774156898999986516775899721----35788888447999999999984
Q gi|254780976|r 27 KRIGL---AISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLP----LNMNGSEGPRVHSTRAFVHNMID 99 (160)
Q Consensus 27 kriGi---Aisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP----~~~~g~~~~~~~~v~~f~~~L~~ 99 (160)
|=||+ |+|| ++.|. +...+|.+++-|=.|.++.=|.| +|+=.+ + +...+..+|.+
T Consensus 151 kvIGvGeiAiSD--HRSaq-----------Pt~~~L~~~aAeARVGGLLgGK~Giv~~H~Gds--~---~~L~~i~~~v~ 212 (391)
T TIGR01975 151 KVIGVGEIAISD--HRSAQ-----------PTVEELTNLAAEARVGGLLGGKPGIVNLHVGDS--K---RKLEPIEELVE 212 (391)
T ss_pred EEEEEEEEEECC--CCCCC-----------CCHHHHHHHHHHHCCCCCCCCCCCEEEEEECCC--H---HHHHHHHHHHH
T ss_conf 355424572214--77887-----------448999999775112411168875689963698--6---77779999985
Q ss_pred HCCCCCEEEE
Q ss_conf 0489977998
Q gi|254780976|r 100 RKVYVPFVFW 109 (160)
Q Consensus 100 ~~~~l~v~~~ 109 (160)
+ .++||...
T Consensus 213 ~-~dvPi~q~ 221 (391)
T TIGR01975 213 E-TDVPITQF 221 (391)
T ss_pred H-CCCCCCCC
T ss_conf 0-58870025
No 81
>pfam01385 Transposase_2 Probable transposase. This family includes IS891, IS1136 and IS1341. DUF1225, pfam06774, has now been merged into this family.
Probab=41.43 E-value=30 Score=16.13 Aligned_cols=23 Identities=26% Similarity=0.364 Sum_probs=14.0
Q ss_pred CCCCEEEEECCCCEEEEEEEECCC
Q ss_conf 998399998688768999630765
Q gi|254780976|r 15 PNQPIASIDLGTKRIGLAISDPGR 38 (160)
Q Consensus 15 ~~~riLgiD~G~kriGiAisd~~~ 38 (160)
.....+|||+|.+.. +|++|..+
T Consensus 121 ~~~~~~GIDlGi~~l-~t~~~~~~ 143 (226)
T pfam01385 121 NPNKAAGIDLGINNL-ATVSTEKG 143 (226)
T ss_pred CCCCEEEEECCCCEE-EEEECCCC
T ss_conf 877358995277657-99955963
No 82
>PRK03317 histidinol-phosphate aminotransferase; Provisional
Probab=40.59 E-value=31 Score=16.05 Aligned_cols=62 Identities=15% Similarity=0.098 Sum_probs=42.6
Q ss_pred CCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHH
Q ss_conf 47741568989999865167758997213578888844799999999998404899779983671679
Q gi|254780976|r 49 RKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTV 116 (160)
Q Consensus 49 ~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~ 116 (160)
..++..+++.+.+-+.++.+..+++-.|=|..|+.-+.. .+..+.+. ... .+..+||-|.-.
T Consensus 142 ~~df~~d~d~l~~ai~~~~~klv~l~nPNNPTG~~~~~~-~l~~i~~~----~~~-~~vvvDEAY~ef 203 (369)
T PRK03317 142 AADFGLDVDAAVAAIREHRPDVVFLTSPNNPTGTALPLD-DVEALLDA----APP-GIVVVDEAYAEF 203 (369)
T ss_pred CCCCCCCHHHHHHHHHCCCCCEEEECCCCCCCCEECCHH-HHHHHHHH----CCC-CEEEECCCHHHH
T ss_conf 878883999999986326985899758999877123789-99999985----887-779943877986
No 83
>pfam00390 malic Malic enzyme, N-terminal domain.
Probab=40.28 E-value=31 Score=16.02 Aligned_cols=98 Identities=17% Similarity=0.106 Sum_probs=58.0
Q ss_pred CCCCCCEEEE-ECCCCEEEEEEEECCCCE---------ECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCC-CC
Q ss_conf 5899839999-868876899963076511---------012101104774156898999986516775899721357-88
Q gi|254780976|r 13 LKPNQPIASI-DLGTKRIGLAISDPGRRF---------AHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNM-NG 81 (160)
Q Consensus 13 ~~~~~riLgi-D~G~kriGiAisd~~~~~---------a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~-~g 81 (160)
+....||||+ |.|.--+|++++--..-+ +.|+..=.-.|.+ +|+ + +-+.+|++-.. .|
T Consensus 72 VTDGerILGLGDlG~~Gm~I~iGKl~lYta~aGi~P~~~LPV~LDvGTnN~----~Ll---~----Dp~YlG~r~~R~~g 140 (182)
T pfam00390 72 VTDGERILGLGDLGVAGMPIMEGKLALYTAFAGIDPSRVLPIVLDVGTNNE----KLL---N----DPLYLGLRHKRVRG 140 (182)
T ss_pred EECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHCCCEEEECCCCHH----HHC---C----CCCCCCCCCCCCCH
T ss_conf 856730146676565765024459999988538874216877886687858----871---5----97556647788975
Q ss_pred CCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHH
Q ss_conf 8884479999999999840489977998367167999999998
Q gi|254780976|r 82 SEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILID 124 (160)
Q Consensus 82 ~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~ 124 (160)
. +--+.++.|...+.+. ++-....+=|.+++..|.+.|..
T Consensus 141 ~--eYd~f~deFv~av~~~-~~p~~liqfEDF~~~nA~~lLer 180 (182)
T pfam00390 141 E--EYDEFVDEFVEAVKAL-FPPFGGIQFEDFGAPNAFEILER 180 (182)
T ss_pred H--HHHHHHHHHHHHHHHH-CCCCCEEEECCCCCHHHHHHHHH
T ss_conf 8--9999999999999987-09984898642895428999985
No 84
>pfam08968 DUF1885 Domain of unknown function (DUF1885). This domain is found in a set of hypothetical proteins produced by bacteria of the Bacillus genus.
Probab=39.85 E-value=20 Score=17.18 Aligned_cols=76 Identities=18% Similarity=0.215 Sum_probs=37.6
Q ss_pred CEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEE-------------ECCC-CCCCCCHHHHHHHH
Q ss_conf 768999630765110121011047741568989999865167758997-------------2135-78888844799999
Q gi|254780976|r 27 KRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIG-------------LPLN-MNGSEGPRVHSTRA 92 (160)
Q Consensus 27 kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvG-------------lP~~-~~g~~~~~~~~v~~ 92 (160)
+++|.-++=.....|||.++.......- .|.. ....-..|+|| ||-+ .-|..++.-+.++.
T Consensus 32 ~kTG~Ql~w~Y~~aAFPY~i~~~~d~~l---~L~~--~~~rY~~I~vgVg~e~~~sfIqi~LP~~AT~GDKgKANEfckf 106 (128)
T pfam08968 32 SKTGDQIGWAYEQAAFPYEIRDTSQSTL---YLQS--DHDRYNGIYIGVGTEGEQTFIQISLPPTATFGDKGKANEFSKF 106 (128)
T ss_pred HHHHHHCCCHHHCCCCCEEEEECCCCEE---EECC--CCCCEEEEEEEECCCCCCEEEEEECCCCCCCCCCHHHHHHHHH
T ss_conf 7544020603340468668996378689---9804--7875505999635678715799988999877762227689999
Q ss_pred HHHHHHHHCCCCCEEEECCC
Q ss_conf 99999840489977998367
Q gi|254780976|r 93 FVHNMIDRKVYVPFVFWDER 112 (160)
Q Consensus 93 f~~~L~~~~~~l~v~~~DEr 112 (160)
+++.|. -+.++.+-|
T Consensus 107 LAK~l~-----geL~LFNGR 121 (128)
T pfam08968 107 LAKKLE-----GELQLFNGR 121 (128)
T ss_pred HHHHHH-----CEEEEECCE
T ss_conf 999741-----015775470
No 85
>PRK00885 phosphoribosylamine--glycine ligase; Provisional
Probab=38.66 E-value=33 Score=15.87 Aligned_cols=101 Identities=16% Similarity=0.221 Sum_probs=64.0
Q ss_pred HHHHHHHHHHHHHCCCEEEEEEC--CC--------CCC-------CCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHH
Q ss_conf 56898999986516775899721--35--------788-------88844799999999998404899779983671679
Q gi|254780976|r 54 QTALELLSFITTENIAAFIIGLP--LN--------MNG-------SEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTV 116 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i~~iVvGlP--~~--------~~g-------~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~ 116 (160)
.+...+.+.+++++|+..|||=- +. ..| ....+.+..+.|++++..+ .++|--.+..--+..
T Consensus 49 ~d~~~l~~~a~~~~idlviiGPE~pL~~Gi~D~l~~~gi~vfGP~k~aA~lE~SK~faK~~m~~-~~IPTa~~~~f~~~~ 127 (424)
T PRK00885 49 TDIEALVAFAKEEGIDLTVVGPEAPLVAGIVDAFRAAGLKIFGPTKAAAQLEGSKAFAKDFMAR-YGIPTAAYEVFTDAE 127 (424)
T ss_pred CCHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCCCEECCCHHHHHHHHHHHHHHHHHHH-CCCCCCCEEEECCHH
T ss_conf 7999999999984999999896678873579999506994689497898876637999999998-298987548868999
Q ss_pred HHHHHHHHCCCCHHHH-------HHH---HHHHHHHHHHHHHHHHHHHH
Q ss_conf 9999999869864454-------643---78998999999999686456
Q gi|254780976|r 117 SAQQILIDMNVSRKKR-------IQK---VDSIAAALILQEVLDRISFL 155 (160)
Q Consensus 117 ~A~~~l~~~g~~~k~~-------k~~---iD~~AA~iILq~~Ld~~~~~ 155 (160)
+|.+.+.+.+...--+ |+. -|...|.-.|+..|++..+-
T Consensus 128 ~A~~~l~~~~~p~VIKaDGLAaGKGV~V~~~~~ea~~al~~i~~~~~fg 176 (424)
T PRK00885 128 EAKAYLDEQGAPIVVKADGLAAGKGVVVAMTLEEAEAAVDDMLAGNKFG 176 (424)
T ss_pred HHHHHHHHCCCCEEEECCCCCCCCCEEEECCHHHHHHHHHHHHCCCCCC
T ss_conf 9999998579988996176567775176378999999999985033234
No 86
>TIGR01187 potA polyamine ABC transporter, ATP-binding protein; InterPro: IPR005893 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family comprises the spermidine/putrescine ABC transporter, ATP binding subunit in bacteria and its equivalents in archaea. This transport system belongs to the larger ATP-Binding Cassette (ABC) transporter superfamily. Polyamines like spermidine and putrescine play a vital role in cell proliferation, differentiation, and ion homeostasis. The concentration of polyamines within the cell are regulated by biosynthesis, degradation and transport (uptake and efflux included).; GO: 0015417 polyamine-transporting ATPase activity, 0015846 polyamine transport, 0016020 membrane.
Probab=37.92 E-value=34 Score=15.79 Aligned_cols=72 Identities=11% Similarity=0.174 Sum_probs=47.0
Q ss_pred CCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHH
Q ss_conf 51101210110477415689899998651677589972135788888447999999999984048997799836716799
Q gi|254780976|r 38 RRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVS 117 (160)
Q Consensus 38 ~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~ 117 (160)
.++|+||+.=... ...--.+..+.+.-=+...+-==+|..++|. |-|+| ..|+.+..+ | .|.+.||.+|--.
T Consensus 61 ~NvAfgLk~~k~~-~~ei~~RV~e~L~~V~L~~~a~rkp~qLSGG---Q~QRv-AlARa~v~k-P--k~LLlDEpLsALD 132 (331)
T TIGR01187 61 ENVAFGLKMRKVP-RAEIKPRVKEALRLVQLEEFAKRKPHQLSGG---QQQRV-ALARALVFK-P--KILLLDEPLSALD 132 (331)
T ss_pred HHCCCCCCCCCCC-HHHHHHHHHHHHHHCCCCHHHCCCCCCCCCC---HHHHH-HHHHHHHCC-C--CEEEECCCHHHHH
T ss_conf 6454443517888-5668899999974213001104673104685---28999-999998608-9--5677117722643
No 87
>TIGR00063 folE GTP cyclohydrolase I; InterPro: IPR001474 GTP cyclohydrolase I (3.5.4.16 from EC) catalyzes the biosynthesis of formic acid and dihydroneopterin triphosphate from GTP. This reaction is the first step in the biosynthesis of tetrahydrofolate in prokaryotes, of tetrahydrobiopterin in vertebrates, and of pteridine-containing pigments in insects. The comparison of the sequence of the enzyme from bacterial and eukaryotic sources shows that the structure of this enzyme has been extremely well conserved throughout evolution .; GO: 0003934 GTP cyclohydrolase I activity, 0019438 aromatic compound biosynthetic process, 0005737 cytoplasm.
Probab=37.26 E-value=22 Score=16.96 Aligned_cols=39 Identities=23% Similarity=0.243 Sum_probs=28.3
Q ss_pred HHCCCE-EEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHH
Q ss_conf 516775-899721357888884479999999999840489977998367167999999
Q gi|254780976|r 65 TENIAA-FIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQI 121 (160)
Q Consensus 65 e~~i~~-iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~ 121 (160)
-|=|.. =|||+ |+.++.|..|+++|+-+ ||+|+.-|...
T Consensus 87 aYIPk~GkViGL--------SKlaR~ve~farRlQvQ----------ERLT~QIa~a~ 126 (183)
T TIGR00063 87 AYIPKDGKVIGL--------SKLARIVEMFARRLQVQ----------ERLTKQIAEAL 126 (183)
T ss_pred EEECCCCEEEEH--------HHHHHHHHHHHCCCCHH----------HHHHHHHHHHH
T ss_conf 988189648706--------68889999986265146----------88999999999
No 88
>COG2205 KdpD Osmosensitive K+ channel histidine kinase [Signal transduction mechanisms]
Probab=36.59 E-value=35 Score=15.67 Aligned_cols=58 Identities=7% Similarity=0.044 Sum_probs=41.2
Q ss_pred CHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHH-HHHHHHHHHHHHCCCCCEEEECCCCCH
Q ss_conf 156898999986516775899721357888884479-999999999840489977998367167
Q gi|254780976|r 53 TQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVH-STRAFVHNMIDRKVYVPFVFWDERLTT 115 (160)
Q Consensus 53 ~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~-~v~~f~~~L~~~~~~l~v~~~DEr~TS 115 (160)
......+.++.++++++.||+|-|.. +.+.. .-..|+..|.....++.|+.+......
T Consensus 320 ~dv~~~i~~ya~~~~~TkiViG~~~~-----~rw~~~~~~~l~~~L~~~~~~idv~ii~~~~~~ 378 (890)
T COG2205 320 GDVAKAIARYAREHNATKIVIGRSRR-----SRWRRLFKGSLADRLAREAPGIDVHIVALDAPP 378 (890)
T ss_pred CCHHHHHHHHHHHCCCEEEEECCCCC-----HHHHHHHCCCHHHHHHHCCCCCEEEEEECCCCC
T ss_conf 73899999999972981599568743-----178987505389999735899539994088875
No 89
>PTZ00094 serine hydroxymethyltransferase; Provisional
Probab=36.56 E-value=35 Score=15.66 Aligned_cols=65 Identities=17% Similarity=0.123 Sum_probs=40.4
Q ss_pred HHHHHHCCCCCCEEEEEC--CC----------CEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEE
Q ss_conf 999983589983999986--88----------768999630765110121011047741568989999865167758997
Q gi|254780976|r 7 EDLVKSLKPNQPIASIDL--GT----------KRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIG 74 (160)
Q Consensus 7 ~~~~~~~~~~~riLgiD~--G~----------kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvG 74 (160)
--|...++|..+|||+|. |- +++. ++- ..--..|+.+= .+..-+++++.++++++.|..||+|
T Consensus 112 av~~all~pGD~im~l~l~~GGHltHG~~~~~~~v~--~sg-~~~~~~~Y~vd--~~~~IDyd~~~~~a~~~kPklIi~G 186 (450)
T PTZ00094 112 AVYTALLQPHDRIMGLDLPSGGHLTHGFYTAKKKVS--ATS-IFFESLPYQVN--EEGLIDYDKLEKLAKAFRPKLIIAG 186 (450)
T ss_pred HHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCC--CCC-CEEEEEEECCC--CCCCCCHHHHHHHHHHHCCCEEEEC
T ss_conf 999997079997998435668865543346777754--013-45556773246--4897489999999986199879988
Q ss_pred EC
Q ss_conf 21
Q gi|254780976|r 75 LP 76 (160)
Q Consensus 75 lP 76 (160)
..
T Consensus 187 ~S 188 (450)
T PTZ00094 187 AS 188 (450)
T ss_pred CC
T ss_conf 82
No 90
>TIGR01769 GGGP geranylgeranylglyceryl phosphate synthase; InterPro: IPR010946 This entry represents geranylgeranylglyceryl phosphate synthase which catalyses the first committed step in the synthesis of ether-linked membrane lipids in archaea ..
Probab=36.44 E-value=35 Score=15.65 Aligned_cols=96 Identities=17% Similarity=0.201 Sum_probs=57.5
Q ss_pred CCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCH--HHHHHHHHHCCC
Q ss_conf 774156898999986516775899721357888884479999999999840489977998367167--999999998698
Q gi|254780976|r 50 KKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTT--VSAQQILIDMNV 127 (160)
Q Consensus 50 ~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS--~~A~~~l~~~g~ 127 (160)
+....+..+|.+-+++..-+.|.||=-+ +=.....+...+.+ ++..++||.+.-=..+- .-|...|--+=+
T Consensus 9 ~~~~~~~~~IAk~~~~aGtD~ilvGGs~------Gi~e~~ld~~v~~i-k~~~~~p~iLFPG~~~~Ls~~ADAvlFmSlL 81 (212)
T TIGR01769 9 KKSSDELKKIAKNAKEAGTDAILVGGSL------GISEDKLDQVVKKI-KEITNLPVILFPGSVNLLSRKADAVLFMSLL 81 (212)
T ss_pred CCCHHHHHHHHHHHHHCCCCEEEECCCC------CCCHHHHHHHHHHH-HHCCCCCEEECCCCHHHHHHHHHHHHHHHHH
T ss_conf 2584569999999985289889836633------73354788999986-4015786688278845788886598888865
Q ss_pred CHHHHHHHH---HHHHHHHHHHHH--HHHHH
Q ss_conf 644546437---899899999999--96864
Q gi|254780976|r 128 SRKKRIQKV---DSIAAALILQEV--LDRIS 153 (160)
Q Consensus 128 ~~k~~k~~i---D~~AA~iILq~~--Ld~~~ 153 (160)
+ +.....| --.||.+|..-+ ||..+
T Consensus 82 N-S~d~yfivGaqi~aA~~i~~~~PrlE~Ip 111 (212)
T TIGR01769 82 N-SADTYFIVGAQIKAAVLIKKLQPRLEVIP 111 (212)
T ss_pred C-CCCCCEEECCHHHHHHHHHHHCCCCCCCC
T ss_conf 3-69973674151578999987278856365
No 91
>COG4020 Uncharacterized protein conserved in archaea [Function unknown]
Probab=36.38 E-value=36 Score=15.65 Aligned_cols=59 Identities=25% Similarity=0.173 Sum_probs=38.9
Q ss_pred CEEEEECCCCEEEEEEEECCCCEECCCEEEECCCC--CHHHHHHHHH--HHHHCCCEEEEEEC
Q ss_conf 39999868876899963076511012101104774--1568989999--86516775899721
Q gi|254780976|r 18 PIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKV--TQTALELLSF--ITTENIAAFIIGLP 76 (160)
Q Consensus 18 riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~--~~~~~~l~~i--i~e~~i~~iVvGlP 76 (160)
-++|+|.|+.-|-+|+-|.....-|-+..-+.... ...+.+|..- +.-++++.+-+-|.
T Consensus 4 ~fVGiDHGTsgi~~ai~d~e~~~~Fklgrae~~~~~ek~~L~~l~de~~i~l~eidlialtYs 66 (332)
T COG4020 4 MFVGIDHGTSGIKFAIYDGEKDPEFKLGRAELRKVAEKSLLRELEDEARIALEEIDLIALTYS 66 (332)
T ss_pred EEEEECCCCCCEEEEEECCCCCCEEEECHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEC
T ss_conf 999554787616899975898814772505666565889999746863776342117988521
No 92
>PRK13330 consensus
Probab=36.03 E-value=36 Score=15.61 Aligned_cols=85 Identities=15% Similarity=0.237 Sum_probs=44.4
Q ss_pred EEEEECCCCEEEEEEEECCCCEE-CCCEEEECCCC--CHHHHHHHHHHHHHCCC-----EEEEEECCCCCCCCCHHHHHH
Q ss_conf 99998688768999630765110-12101104774--15689899998651677-----589972135788888447999
Q gi|254780976|r 19 IASIDLGTKRIGLAISDPGRRFA-HPRPFLVRKKV--TQTALELLSFITTENIA-----AFIIGLPLNMNGSEGPRVHST 90 (160)
Q Consensus 19 iLgiD~G~kriGiAisd~~~~~a-~Pl~~i~~~~~--~~~~~~l~~ii~e~~i~-----~iVvGlP~~~~g~~~~~~~~v 90 (160)
+|.+|.|-.++=+++-+....+. +...+...++. ++....+..++.+++++ ..|+. +--|..
T Consensus 2 ~L~iDIGNT~i~~glf~~~~~i~~~~~~~~t~~~~t~de~~~~~~~~l~~~~~~~~~i~~~i~s-------SVVP~l--- 71 (257)
T PRK13330 2 LLVVDVGNTNTVFGIFENGKKIPLFHKRTVTRKDRTSDELGLFFRGFLREFKIENEAITGGIYS-------SVVPTL--- 71 (257)
T ss_pred EEEEEECCCCEEEEEEECCEEEEEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCHHHCCCEEEE-------CCCCCH---
T ss_conf 7999977880699999899799999986305842057899999999998839953220142772-------536655---
Q ss_pred HHHHHHHHHHCCCCCEEEECCCC
Q ss_conf 99999998404899779983671
Q gi|254780976|r 91 RAFVHNMIDRKVYVPFVFWDERL 113 (160)
Q Consensus 91 ~~f~~~L~~~~~~l~v~~~DEr~ 113 (160)
....+++.+..++++...++...
T Consensus 72 ~~~~~~~~~~~~~~~~~~v~~~~ 94 (257)
T PRK13330 72 NPILDRMFQDWFKIEAIRVHYQM 94 (257)
T ss_pred HHHHHHHHHHHCCCCEEEECCCC
T ss_conf 99999999997198779977888
No 93
>COG0112 GlyA Glycine/serine hydroxymethyltransferase [Amino acid transport and metabolism]
Probab=35.70 E-value=36 Score=15.58 Aligned_cols=68 Identities=15% Similarity=0.075 Sum_probs=41.1
Q ss_pred HHHHHCCCCCCEEEEECCC---CEEEEEEEECCC-CEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC
Q ss_conf 9998358998399998688---768999630765-11012101104774156898999986516775899721
Q gi|254780976|r 8 DLVKSLKPNQPIASIDLGT---KRIGLAISDPGR-RFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLP 76 (160)
Q Consensus 8 ~~~~~~~~~~riLgiD~G~---kriGiAisd~~~-~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP 76 (160)
=|...+++..+|||+|.-. -.=|-.++-+.. --+.|+++ ...+..-+.+++.+++.++.|..||.|..
T Consensus 104 v~~All~pGD~imgm~l~~GGHltHg~~v~~sG~~~~~v~Y~v-d~et~~IDyD~~~k~a~e~kPKlii~G~S 175 (413)
T COG0112 104 VYLALLQPGDTIMGLDLSHGGHLTHGSPVNFSGKLFNVVSYGV-DPETGLIDYDEVEKLAKEVKPKLIIAGGS 175 (413)
T ss_pred HHHHHCCCCCEEEECCCCCCCCCCCCCCCCCCCCEEEEEECCC-CCCCCCCCHHHHHHHHHHHCCCEEEECCC
T ss_conf 9999738988575056787876557998776640577676240-63347517999999999849978997760
No 94
>pfam09298 DUF1969 Domain of unknown function (DUF1969). The N-terminal domain of fumarylacetoacetate hydrolase is functionally uncharacterized, and adopts a structure consisting of an SH3-like barrel.
Probab=35.68 E-value=17 Score=17.53 Aligned_cols=15 Identities=40% Similarity=0.377 Sum_probs=11.3
Q ss_pred CCCCEEEEEEEECCC
Q ss_conf 688768999630765
Q gi|254780976|r 24 LGTKRIGLAISDPGR 38 (160)
Q Consensus 24 ~G~kriGiAisd~~~ 38 (160)
-+..|+||||+|-.-
T Consensus 13 ~~~~R~GvAIGd~Vl 27 (101)
T pfam09298 13 GPAPRIGVAIGDQVL 27 (101)
T ss_pred CCCCCCEEEECCEEE
T ss_conf 999721577675102
No 95
>TIGR01579 MiaB-like-C MiaB-like tRNA modifying enzyme; InterPro: IPR006467 This clade of sequences is closely related to MiaB, a modifier of isopentenylated adenosine-37 of certain eukaryotic and bacterial tRNAs (see IPR006463 from INTERPRO). Sequence alignments suggest that these sequences perform the same chemical transformation as MiaB, perhaps on a different (or differently modified) tRNA base substrate. This clade represents a subfamily that spans low GC Gram-positive bacteria, alpha and epsilon proteobacteria, Campylobacter, Porphyromonas, Aquifex, Thermotoga, Chlamydia, Treponema and Fusobacterium , , ..
Probab=34.89 E-value=38 Score=15.50 Aligned_cols=78 Identities=17% Similarity=0.206 Sum_probs=47.6
Q ss_pred HHHHHHHHHHHHHCC-CEEEEEECCCCCCCCCHHH-HHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHC--CCCH
Q ss_conf 568989999865167-7589972135788888447-99999999998404899779983671679999999986--9864
Q gi|254780976|r 54 QTALELLSFITTENI-AAFIIGLPLNMNGSEGPRV-HSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDM--NVSR 129 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i-~~iVvGlP~~~~g~~~~~~-~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~--g~~~ 129 (160)
+...+|...-.++.. +-+|||.| +|++.. +.+.++.+++ ....+=|+-+=-|=-|.+|.= +. .+..
T Consensus 352 ~~v~~lr~~~p~~~~gtD~IVGFP-----~E~eedFq~t~~~~~~~--~~~~~HiFpyS~R~~T~A~~m---~~rf~v~~ 421 (492)
T TIGR01579 352 KLVQKLRSVRPDYAFGTDVIVGFP-----GESEEDFQETLRIVKEI--EFSGLHIFPYSARPGTPASAM---DDRFKVPE 421 (492)
T ss_pred HHHHHHHHCCCCCCCCCCEEEECC-----CCCHHHHHHHHHHHHHC--CCCEEEEECCCCCCCCCCCCC---CCCCCCCC
T ss_conf 999999850776303760374088-----89889999999998526--602133542688432812047---87442761
Q ss_pred HHHHHHHHHHHH
Q ss_conf 454643789989
Q gi|254780976|r 130 KKRIQKVDSIAA 141 (160)
Q Consensus 130 k~~k~~iD~~AA 141 (160)
+..|++.+.++.
T Consensus 422 ~~~KeR~~~L~~ 433 (492)
T TIGR01579 422 TIKKERVKRLKE 433 (492)
T ss_pred HHHHHHHHHHHH
T ss_conf 158899999999
No 96
>PRK10331 L-fuculokinase; Provisional
Probab=34.13 E-value=39 Score=15.42 Aligned_cols=24 Identities=17% Similarity=0.256 Sum_probs=20.0
Q ss_pred CCEEEEECCCCEEEEEEEECCCCE
Q ss_conf 839999868876899963076511
Q gi|254780976|r 17 QPIASIDLGTKRIGLAISDPGRRF 40 (160)
Q Consensus 17 ~riLgiD~G~kriGiAisd~~~~~ 40 (160)
.-|||||+|+.-+=.++-|..+.+
T Consensus 2 d~iLgID~GTts~Ka~l~d~~G~i 25 (470)
T PRK10331 2 DVILVLDCGATNVRAIAVDRQGKI 25 (470)
T ss_pred CEEEEEECCCCCEEEEEECCCCCE
T ss_conf 489999446321035698399999
No 97
>PRK00011 glyA serine hydroxymethyltransferase; Reviewed
Probab=33.81 E-value=39 Score=15.39 Aligned_cols=67 Identities=16% Similarity=0.082 Sum_probs=40.1
Q ss_pred HHHHHCCCCCCEEEEECCC--C-EEE--EEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC
Q ss_conf 9998358998399998688--7-689--9963076511012101104774156898999986516775899721
Q gi|254780976|r 8 DLVKSLKPNQPIASIDLGT--K-RIG--LAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLP 76 (160)
Q Consensus 8 ~~~~~~~~~~riLgiD~G~--k-riG--iAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP 76 (160)
=|...++|..+|||+|.-. - .-| ++++ ...--..++++ ..++..-+++++.++++++.|..||+|..
T Consensus 103 v~~all~PGD~im~l~l~~GGHltHg~~~~~~-~~~~~~~~y~~-d~~~~~IDyd~~~~~a~~~kPklIi~G~S 174 (415)
T PRK00011 103 VYFALLKPGDTILGMDLAHGGHLTHGSPVNFS-GKLYNVVSYGV-DEETGLIDYDEVEKLALEHKPKLIIAGAS 174 (415)
T ss_pred HHHHHCCCCCEEEECCCCCCCCCCCCCCCCCC-CCEEEEEEEEE-CCCCCCCCHHHHHHHHHHHCCCEEEECCC
T ss_conf 99996278985876686667876578675544-65247898678-64348167999999998609988996461
No 98
>cd02064 Flavokinase_C Riboflavin kinase (Flavokinase). This family represents the C-terminal region of the bifunctional riboflavin biosynthesis protein riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin. 5'-phospho transferase and ATP:FMN-adenylyltransferase activities . The C-terminal domain has FMN-adenylyltransferase activitie. They catalyse the 5'-phosphorylation of riboflavin to FMN and the adenylylation of FMN to FAD . A domain has been identified in the N-terminal region that is well conserved in all the bacterial FAD synthetases.This domain has remote similarity to nucleotidyl transferases and, hence, it may be involved in the adenylylation reaction of FAD synthetases .
Probab=33.25 E-value=40 Score=15.34 Aligned_cols=64 Identities=9% Similarity=0.180 Sum_probs=34.7
Q ss_pred HHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCC------EEEECCCCCHHHHHHHHHHCCC
Q ss_conf 99998651677589972135788888447999999999984048997------7998367167999999998698
Q gi|254780976|r 59 LLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVP------FVFWDERLTTVSAQQILIDMNV 127 (160)
Q Consensus 59 l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~------v~~~DEr~TS~~A~~~l~~~g~ 127 (160)
+..++...++..+|||.=...--...--....+.++++ .+.. +..-+++.||..-++.+.+..+
T Consensus 90 i~~~l~~l~~~~iVvG~Df~FG~~r~G~~~~L~~~~~~-----~~~~v~~i~~~~~~~~~ISSs~IR~~i~~G~i 159 (179)
T cd02064 90 VKDLLVKLNAKHVVVGFDFRFGKGRSGNAELLRELGEK-----YGFEVEIVPPVTIDGEKVSSTRIREALAEGDV 159 (179)
T ss_pred HHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHH-----HCCEEEEECCEEECCCEEEHHHHHHHHHCCCH
T ss_conf 99874147963999777302578988799999999987-----29649993769608916738999999985999
No 99
>PRK13317 pantothenate kinase; Provisional
Probab=32.81 E-value=41 Score=15.29 Aligned_cols=87 Identities=10% Similarity=0.114 Sum_probs=43.8
Q ss_pred CCCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHH
Q ss_conf 98399998688768999630765110121011047741568989999865167758997213578888844799999999
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVH 95 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~ 95 (160)
|+-.+|||+|...+=+++.+....+-+ ..++.++.+..... +++..++..+.+ -|.. ...|.+
T Consensus 1 m~m~iGID~G~T~tKiv~~~~~~~l~f--~~~~s~~i~~~~~~---~~~~~~i~~i~v------TGgg------s~~l~~ 63 (273)
T PRK13317 1 MEMKIGIDAGGTLTKIVYLEEKNQRTF--KTVLSTDIKKVIDW---LKNQNDIEKICL------TGGK------AGQLAQ 63 (273)
T ss_pred CCCEEEEEECCCEEEEEEECCCCCEEE--EECCCCCHHHHHHH---HHHCCCCCEEEE------ECCC------HHHHHH
T ss_conf 985699998777078999889997988--73134778999999---762445228999------7771------787889
Q ss_pred HHHHHCCCCCEEEECCCCCHHHHHHHH
Q ss_conf 998404899779983671679999999
Q gi|254780976|r 96 NMIDRKVYVPFVFWDERLTTVSAQQIL 122 (160)
Q Consensus 96 ~L~~~~~~l~v~~~DEr~TS~~A~~~l 122 (160)
.+ . .++|+.++||-.....+-+.|
T Consensus 64 ~~-~--~~~~i~k~~E~~A~~~G~~~l 87 (273)
T PRK13317 64 LL-N--IGYKIVEFVEFEATLQGVRYL 87 (273)
T ss_pred HH-H--CCCCCCEEEHHHHHHHHHHHH
T ss_conf 97-2--589974654598999999998
No 100
>KOG3289 consensus
Probab=32.14 E-value=42 Score=15.23 Aligned_cols=101 Identities=11% Similarity=0.121 Sum_probs=61.9
Q ss_pred HHHHHHHHHHHHC--CCEEEEEECCCCCCCCCHH--HHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHH----HHHHCC
Q ss_conf 6898999986516--7758997213578888844--7999999999984048997799836716799999----999869
Q gi|254780976|r 55 TALELLSFITTEN--IAAFIIGLPLNMNGSEGPR--VHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQ----ILIDMN 126 (160)
Q Consensus 55 ~~~~l~~ii~e~~--i~~iVvGlP~~~~g~~~~~--~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~----~l~~~g 126 (160)
.++--..+|..|. -...|+|| ++.+...+.+ ..-+.+.+.++.+.+++-++..+|-..-|.+-++ .+.+.|
T Consensus 59 mlEvAl~lId~~~~~~GlviaGy-y~Ane~~~D~s~~~~A~kiadrIse~f~~A~ilv~dn~~l~~~~e~~~v~v~e~~g 137 (199)
T KOG3289 59 MLEVALNLIDVWGAQAGLVIAGY-YHANERVNDQSLNPVALKIADRISEFFPDAAILVLDNKKLVPQCERPPVIVLEDQG 137 (199)
T ss_pred HHHHHHHHHHHHHHHCCEEEEEE-EECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEECCC
T ss_conf 89999999988887568079998-63478744467337989998899863898729999632126666778778762467
Q ss_pred CCHHHHHH----HHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 86445464----3789989999999996864566
Q gi|254780976|r 127 VSRKKRIQ----KVDSIAAALILQEVLDRISFLE 156 (160)
Q Consensus 127 ~~~k~~k~----~iD~~AA~iILq~~Ld~~~~~~ 156 (160)
.+.+.++. ..|-..|..+|+.+|++.++.+
T Consensus 138 ~rW~~~d~~~~~~~d~~e~~~~ls~ll~~~~~r~ 171 (199)
T KOG3289 138 LRWRPKDKTLVQWSDWLEGRQMLSALLESRAYRD 171 (199)
T ss_pred CCEEECCCCHHHHHCCHHHHHHHHHHHHHHHHHH
T ss_conf 6313447716666415567789999986524544
No 101
>TIGR02350 prok_dnaK chaperone protein DnaK; InterPro: IPR012725 Molecular chaperones are a diverse family of proteins that function to protect proteins in the intracellular milieu from irreversible aggregation during synthesis and in times of cellular stress. The bacterial molecular chaperone DnaK is an enzyme that couples cycles of ATP binding, hydrolysis, and ADP release by an N-terminal ATP-hydrolysing domain to cycles of sequestration and release of unfolded proteins by a C-terminal substrate binding domain. In prokaryotes, the grpE protein is a co-chaperone for DnaK, and acts as a nucleotide exchange factor, stimulating the rate of ADP release 5000-fold . DnaK is itself a weak ATPase; ATP hydrolysis by DnaK is stimulated by its interaction with another co-chaperone, DnaJ. Thus the co-chaperones DnaJ and GrpE are capable of tightly regulating the nucleotide-bound and substrate-bound state of DnaK in ways that are necessary for the normal housekeeping functions and stress-related functions of the DnaK molecular chaperone cycle. Members of this family are the chaperone DnaK, of the DnaK-DnaJ-GrpE chaperone system. All members of the seed alignment were taken from completely sequenced bacterial or archaeal genomes and (except for the Mycoplasma sequence) found clustered with other genes of this systems. This entry excludes DnaK homologues that are not DnaK itself, such as the heat shock cognate protein HscA (IPR010236 from INTERPRO). However, it is not designed to distinguish among DnaK paralogs in eukaryotes. Note that a number of DnaK genes have shadow ORFs in the same reverse (relative to dnaK) reading frame, a few of which have been assigned glutamate dehydrogenase activity. The significance of this observation is unclear; the lengths of such shadow ORFs are highly variable as if the presumptive protein product is not conserved.; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding.
Probab=31.69 E-value=42 Score=15.18 Aligned_cols=53 Identities=23% Similarity=0.326 Sum_probs=30.6
Q ss_pred CEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCC
Q ss_conf 3999986887689996307651101210110477415689899998651677589972135
Q gi|254780976|r 18 PIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLN 78 (160)
Q Consensus 18 riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~ 78 (160)
+|+|||.||.+=-+||.+.... .+|.+.......- .+|-=.+-...+||-|=.
T Consensus 1 KiiGIDLGTTNS~vAV~eGg~p-----~Vi~n~EG~rTtP---S~VAF~~~Ge~lVG~~Ak 53 (598)
T TIGR02350 1 KIIGIDLGTTNSCVAVMEGGEP-----VVIPNAEGARTTP---SVVAFTKNGERLVGQPAK 53 (598)
T ss_pred CEEEEECCCCCEEEEEEECCCC-----EEECCCCCCCCCC---CEEEECCCCCEECCHHHH
T ss_conf 9677304741016877866866-----6850788795466---178984898777141541
No 102
>PRK04870 histidinol-phosphate aminotransferase; Provisional
Probab=31.43 E-value=43 Score=15.15 Aligned_cols=60 Identities=18% Similarity=0.150 Sum_probs=40.6
Q ss_pred CCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHH
Q ss_conf 7741568989999865167758997213578888844799999999998404899779983671679
Q gi|254780976|r 50 KKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTV 116 (160)
Q Consensus 50 ~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~ 116 (160)
.++..+...+.+-+.++++..+++..|=|.-|+.-+.. .+..|. +.... +..+||.|.-.
T Consensus 136 ~d~~~d~~~~~~~i~~~~~klv~i~nPNNPTG~~~~~~-~l~~l~----~~~~~--lvVvDEAY~eF 195 (356)
T PRK04870 136 ADFTLDLPAMLAAIAEHRPALVYLAYPNNPTGNLFDDA-DVEAII----AAAPG--LVVIDEAYQPF 195 (356)
T ss_pred CCCCCCHHHHHHHHCCCCCCEEEECCCCCCCCCCCCHH-HHHHHH----HCCCC--EEEEECCCHHH
T ss_conf 76562128899874146873899768999766336878-899998----50698--65522540755
No 103
>PRK05183 hscA chaperone protein HscA; Provisional
Probab=31.22 E-value=30 Score=16.07 Aligned_cols=23 Identities=30% Similarity=0.325 Sum_probs=18.1
Q ss_pred CCCEEEEECCCCEEEEEEEECCC
Q ss_conf 98399998688768999630765
Q gi|254780976|r 16 NQPIASIDLGTKRIGLAISDPGR 38 (160)
Q Consensus 16 ~~riLgiD~G~kriGiAisd~~~ 38 (160)
....+|||+|+.+.=||+.....
T Consensus 18 ~~~aIGIDLGTTnS~VA~~~~g~ 40 (621)
T PRK05183 18 RRLAVGIDLGTTNSLVAAVRSGV 40 (621)
T ss_pred CCCEEEEECCHHHEEEEEEECCE
T ss_conf 77089999782347999999998
No 104
>cd02016 TPP_E1_OGDC_like Thiamine pyrophosphate (TPP) family, E1 of OGDC-like subfamily, TPP-binding module; composed of proteins similar to the E1 component of the 2-oxoglutarate dehydrogenase multienzyme complex (OGDC). OGDC catalyzes the oxidative decarboxylation of 2-oxoglutarate to succinyl-CoA and carbon dioxide, a key reaction of the tricarboxylic acid cycle.
Probab=31.07 E-value=38 Score=15.49 Aligned_cols=50 Identities=20% Similarity=0.242 Sum_probs=29.6
Q ss_pred CCCCEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHH
Q ss_conf 6887689996307651101210110477415689899998651677589972135788888447999
Q gi|254780976|r 24 LGTKRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHST 90 (160)
Q Consensus 24 ~G~kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v 90 (160)
.|.||-|+--+++. -..+.++.+...+.+++.+|+|+|= -|.-+-.+...
T Consensus 10 ~g~KRFslEG~Esl---------------Ip~l~~li~~a~~~gv~~iviGMaH--RGRLNvL~nvl 59 (265)
T cd02016 10 PGQKRFGLEGAESL---------------IPALDELIDRAAELGVEEVVIGMAH--RGRLNVLANVL 59 (265)
T ss_pred CCCCEECCCCHHHH---------------HHHHHHHHHHHHHCCCCEEEECCCC--CCHHHHHHHHH
T ss_conf 88745415668899---------------9999999999987698679982566--65188999986
No 105
>cd01017 AdcA Metal binding protein AcdA. These proteins have been shown to function in the ABC uptake of Zn2+ and Mn2+ and in competence for genetic transformation and adhesion. The AcdA proteins belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and they bind their ligand in the cleft between these domains. In addition, many of these proteins have a low complexity region containing metal binding histidine-rich motif (repetitive HDH sequence).
Probab=30.64 E-value=44 Score=15.07 Aligned_cols=49 Identities=14% Similarity=0.240 Sum_probs=35.4
Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCH
Q ss_conf 788888447999999999984048997799836716799999999869864
Q gi|254780976|r 79 MNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSR 129 (160)
Q Consensus 79 ~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~ 129 (160)
.+....+..+.+..+.+.+. .-++++.|++..+++..|+.+-++.|.+.
T Consensus 198 ~~~~~eps~~~l~~l~~~ik--~~~v~~If~e~~~~~~~~~~ia~e~g~~v 246 (282)
T cd01017 198 VSPEVEPSPKQLAELVEFVK--KSDVKYIFFEENASSKIAETLAKETGAKL 246 (282)
T ss_pred CCCCCCCCHHHHHHHHHHHH--HCCCCEEEEECCCCCHHHHHHHHHHCCCE
T ss_conf 48888999999999999999--85998999818999099999999819967
No 106
>PRK09267 flavodoxin FldA; Validated
Probab=30.03 E-value=45 Score=15.01 Aligned_cols=13 Identities=31% Similarity=0.542 Sum_probs=6.5
Q ss_pred CEEEEEECCCCCC
Q ss_conf 7589972135788
Q gi|254780976|r 69 AAFIIGLPLNMNG 81 (160)
Q Consensus 69 ~~iVvGlP~~~~g 81 (160)
+.+|+|-|=--.|
T Consensus 48 d~lIlG~~Twg~G 60 (169)
T PRK09267 48 DLLILGIPTWGYG 60 (169)
T ss_pred CEEEEECCCCCCC
T ss_conf 8799975767888
No 107
>PTZ00062 glutaredoxin; Provisional
Probab=29.06 E-value=47 Score=14.91 Aligned_cols=62 Identities=18% Similarity=0.187 Sum_probs=41.3
Q ss_pred CHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHH
Q ss_conf 15689899998651677589972135788888447999999999984048997799836716799999999
Q gi|254780976|r 53 TQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILI 123 (160)
Q Consensus 53 ~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~ 123 (160)
...+++|..+|++..|..|.-|-|. .|+|......+.-|. ..+++...+| -+...+.++.++
T Consensus 114 ~~t~~RI~~LI~~n~VvLFMKGtp~------~PqCGFS~~vv~iL~--~~~v~f~~~n-VL~D~~vR~~LK 175 (219)
T PTZ00062 114 KQTVDKIERLLRENKIILFMKGDKT------FPQCKFSNAVVFMLN--SSKVKYETYN-ILDDPDLREELK 175 (219)
T ss_pred CCHHHHHHHHHHCCCEEEEECCCCC------CCCCCCHHHHHHHHH--HCCCCCCEEE-CCCCHHHHHHHH
T ss_conf 3089999999733968999678999------998834799999999--8199966053-579889997335
No 108
>TIGR01510 coaD_prev_kdtB pantetheine-phosphate adenylyltransferase; InterPro: IPR001980 Temperature-sensitive mutants of Escherichia coli, defective in the transfer of 3-deoxy-D-manno-octulosonic acid (KDO) from CMP-KDO to a tetraacyldisaccharide 1,4'-bisphosphate precursor of lipid A, have been used to map KDO transferase activity on the E. coli chromosome . The KDO transferase gene, designated kdtA, was shown to code for a 43kDa polypeptide. Overexpression of this single gene product greatly stimulates incorporation of two stereochemically distinct KDO residues during lipopolysaccharide biosynthesis in extracts of E. coli . From these experiments this protein was thought to play a role in lipopolysaccharide biosynthesis, however now it is annotated as phosphopantetheine adenylyltransferase (2.7.7.3 from EC), which catalyses the reversible transfer of an adenylyl group from ATP to 4'-phosphopantetheine to give dephospho-CoA (DPCOA) and pyrophosphate in the fourth (and penultimate) step of coenzyme A biosynthesis .; GO: 0004595 pantetheine-phosphate adenylyltransferase activity, 0015937 coenzyme A biosynthetic process.
Probab=28.80 E-value=48 Score=14.88 Aligned_cols=54 Identities=13% Similarity=0.167 Sum_probs=42.2
Q ss_pred CCEEEEEECCCCCCCCCHHHHHHHHHHHHHH-HHC---CCCCEEEECCCCCHHHHHHH
Q ss_conf 7758997213578888844799999999998-404---89977998367167999999
Q gi|254780976|r 68 IAAFIIGLPLNMNGSEGPRVHSTRAFVHNMI-DRK---VYVPFVFWDERLTTVSAQQI 121 (160)
Q Consensus 68 i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~-~~~---~~l~v~~~DEr~TS~~A~~~ 121 (160)
.+.+|||--.|.+.+.-...++-....++.. +.. +++.|..+|.||+---|++.
T Consensus 26 FDeV~Vav~~N~~K~~~f~~eeR~~~~~~~tk~~l~~~~NV~V~~f~~~L~Vd~Ak~~ 83 (163)
T TIGR01510 26 FDEVIVAVAINPSKKPLFSLEERVELIEDATKKHLKKIPNVEVDVFDGRLLVDYAKEL 83 (163)
T ss_pred CCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECCCCEEEEEHHHC
T ss_conf 2177788887488985689899999999999997236981489840671212101366
No 109
>cd01019 ZnuA Zinc binding protein ZnuA. These proteins have been shown to function as initial receptors in the ABC uptake of Zn2+. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a single helix and bind their specific ligands in the cleft between these domains. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=27.90 E-value=49 Score=14.78 Aligned_cols=49 Identities=8% Similarity=0.078 Sum_probs=36.9
Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCH
Q ss_conf 788888447999999999984048997799836716799999999869864
Q gi|254780976|r 79 MNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSR 129 (160)
Q Consensus 79 ~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~ 129 (160)
.+....+..+.+..+.+.+. .-++++.|++..++++.|+.+-++.|.+.
T Consensus 206 ~~~~~eps~~~l~~l~~~ik--~~~v~~If~e~~~~~k~a~~ia~e~g~~v 254 (286)
T cd01019 206 IDPEIDPGAKRLAKIRKEIK--EKGATCVFAEPQFHPKIAETLAEGTGAKV 254 (286)
T ss_pred ECCCCCCCHHHHHHHHHHHH--HCCCCEEEEECCCCHHHHHHHHHHCCCCE
T ss_conf 24677899999999999999--83998899828989399999999719938
No 110
>PRK04152 consensus
Probab=27.64 E-value=50 Score=14.76 Aligned_cols=57 Identities=18% Similarity=0.207 Sum_probs=38.0
Q ss_pred CHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCC
Q ss_conf 15689899998651677589972135788888447999999999984048997799836716
Q gi|254780976|r 53 TQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLT 114 (160)
Q Consensus 53 ~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~T 114 (160)
..++..+.+.+...+.+.+++..|=|..|+.-... ....+.+.+ .+ + -+..+||.|-
T Consensus 138 ~~dl~~~~~~~~~~~~~lv~l~nPNNPTG~~~~~~-~l~~~~~~~-~~--~-~~vvvDEAY~ 194 (364)
T PRK04152 138 EHDLNEFLELYKSHKPKIIFLCLPNNPLGECLDAS-EVYKFIKGI-DE--D-TLVVIDAAYM 194 (364)
T ss_pred CCCHHHHHHHHHCCCCCEEEECCCCCCCCCCCCCC-CHHHHHHHC-CC--C-CEEEEEHHHH
T ss_conf 33989999986377985899879999867246651-249999754-55--7-7687416778
No 111
>COG0151 PurD Phosphoribosylamine-glycine ligase [Nucleotide transport and metabolism]
Probab=27.39 E-value=50 Score=14.73 Aligned_cols=73 Identities=16% Similarity=0.345 Sum_probs=49.7
Q ss_pred HHHHHHHHHHHHHCCCEEEEEE--CCC--------CC-------CCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCC-H
Q ss_conf 5689899998651677589972--135--------78-------8888447999999999984048997799836716-7
Q gi|254780976|r 54 QTALELLSFITTENIAAFIIGL--PLN--------MN-------GSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLT-T 115 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i~~iVvGl--P~~--------~~-------g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~T-S 115 (160)
.+...|.++.++++++..|||= ||- .. .....+.+..+.|++.+..+ .++|--.. +.+| .
T Consensus 50 ~~~~~lv~fA~~~~idl~vVGPE~pL~~GvvD~l~~~Gi~vFGPsk~AA~lE~SK~faK~fm~k-~~IPta~y-~~f~~~ 127 (428)
T COG0151 50 TDHEALVAFAKEKNVDLVVVGPEAPLVAGVVDALRAAGIPVFGPTKAAAQLEGSKAFAKDFMKK-YGIPTAEY-EVFTDP 127 (428)
T ss_pred CCHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCCCEECCCHHHHHHHHHHHHHHHHHHH-CCCCCCCC-EECCCH
T ss_conf 5889999999973999899897178764468999877996238678898877539999999998-19996411-100898
Q ss_pred HHHHHHHHHCCCC
Q ss_conf 9999999986986
Q gi|254780976|r 116 VSAQQILIDMNVS 128 (160)
Q Consensus 116 ~~A~~~l~~~g~~ 128 (160)
.+|+..+.+.|..
T Consensus 128 e~a~ayi~~~g~p 140 (428)
T COG0151 128 EEAKAYIDEKGAP 140 (428)
T ss_pred HHHHHHHHHCCCC
T ss_conf 9999999972998
No 112
>PRK13334 consensus
Probab=27.14 E-value=51 Score=14.70 Aligned_cols=89 Identities=19% Similarity=0.305 Sum_probs=45.3
Q ss_pred CCCCEEEEECCCCEEEEEEEECCCCEECCCEEEECC--CCCHHHHHHHHHHHHHC---CCEEEEEECCCCCCCCCHHHHH
Q ss_conf 998399998688768999630765110121011047--74156898999986516---7758997213578888844799
Q gi|254780976|r 15 PNQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRK--KVTQTALELLSFITTEN---IAAFIIGLPLNMNGSEGPRVHS 89 (160)
Q Consensus 15 ~~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~--~~~~~~~~l~~ii~e~~---i~~iVvGlP~~~~g~~~~~~~~ 89 (160)
|.-.+|.+|.|-.+|=+++.|....+......-... ..+.....+..+...++ +..+++- +--| ..
T Consensus 2 ~~~~lL~IDIGNT~i~~glf~~~~~l~~~~r~~t~~~~~~d~~~~~~~~~~~~~~~~~~~~~iis-------SVVP--~~ 72 (262)
T PRK13334 2 PAFPLLAVDIGNTSTVLGLADGSELLTHTWRIRTNRMRLPDDLALQLHGLFTLAGAPPPRAAVLS-------SVAP--PV 72 (262)
T ss_pred CCCEEEEEEECCCCEEEEEEECCEEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEE-------ECCC--HH
T ss_conf 98708999977880599999789179999995167100777999999999976399868879999-------6371--57
Q ss_pred HHHHHHHHHHHCCCCCEEEECCCC
Q ss_conf 999999998404899779983671
Q gi|254780976|r 90 TRAFVHNMIDRKVYVPFVFWDERL 113 (160)
Q Consensus 90 v~~f~~~L~~~~~~l~v~~~DEr~ 113 (160)
...+...+ ++.++.++..++...
T Consensus 73 ~~~~~~~l-~~~~~~~~~~v~~~~ 95 (262)
T PRK13334 73 GRNYARAL-ERHFGVDAFVVEAEN 95 (262)
T ss_pred HHHHHHHH-HHHHCCCEEEECCCC
T ss_conf 99999999-998699869988987
No 113
>cd01487 E1_ThiF_like E1_ThiF_like. Member of superfamily of activating enzymes (E1) of the ubiquitin-like proteins. The common reaction mechanism catalyzed by E1-like enzymes begins with a nucleophilic attack of the C-terminal carboxylate of the ubiquitin-like substrate, on the alpha-phosphate of an ATP molecule bound at the active site of the activating enzymes, leading to the formation of a high-energy acyladenylate intermediate and subsequently to the formation of a thiocarboxylate at the C termini of the substrate. The exact function of this family is unknown.
Probab=26.87 E-value=52 Score=14.67 Aligned_cols=58 Identities=12% Similarity=0.226 Sum_probs=39.1
Q ss_pred HHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH-HHHHH
Q ss_conf 999999999840489977998367167999999998698644546437899899999-99996
Q gi|254780976|r 89 STRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSIAAALIL-QEVLD 150 (160)
Q Consensus 89 ~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~AA~iIL-q~~Ld 150 (160)
++...++.|.+.++.+.|.-++|++|...+.+.+.+..+-- .-.|..++..+| +.+.-
T Consensus 53 Ka~~a~~~l~~iNp~i~i~~~~~~l~~~n~~~l~~~~D~Vi----D~~Dn~~tr~~l~~~~~~ 111 (174)
T cd01487 53 KVEALKENLREINPFVKIEAINIKIDENNLEGLFGDCDIVV----EAFDNAETKAMLAESLLG 111 (174)
T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCEEE----ECCCCHHHHHHHHHHHHH
T ss_conf 99999999998797988999914449999999983799999----999998999999999999
No 114
>cd01025 TOPRIM_recR TOPRIM_recR: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in Escherichia coli RecR. RecR participates in the RecFOR pathway of homologous recombinational repair in prokaryotes. This pathway provides a single-stranded DNA molecule coated with RecA to allow invasion of a homologous molecule. The RecFOR system directs the loading of RecA onto gapped DNA coated with SSB protein. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). In RecR sequences this glutamate in the first turn of the TOPRIM domain is semiconserved, the DXD motif is not conserved.
Probab=26.77 E-value=52 Score=14.66 Aligned_cols=44 Identities=14% Similarity=0.138 Sum_probs=31.1
Q ss_pred CCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCC
Q ss_conf 5110121011047741568989999865167758997213578888
Q gi|254780976|r 38 RRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSE 83 (160)
Q Consensus 38 ~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~ 83 (160)
+..-.|+.-+... +-.+.+|.+.+++.++..+++.++-+++|.-
T Consensus 29 ~G~lspl~g~~p~--~l~i~~L~~ri~~~~i~EvIlA~~~t~EGe~ 72 (112)
T cd01025 29 GGLISPLDGIGPD--DLNIDKLLERIAKGQVKEVILATNPTVEGEA 72 (112)
T ss_pred CCCCCCCCCCCCC--CCCHHHHHHHHHCCCCEEEEEECCCCCCCHH
T ss_conf 2634755699833--0319999999835894189997489966089
No 115
>cd01829 SGNH_hydrolase_peri2 SGNH_peri2; putative periplasmic member of the SGNH-family of hydrolases, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases.
Probab=26.64 E-value=52 Score=14.65 Aligned_cols=47 Identities=15% Similarity=0.222 Sum_probs=31.3
Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHH
Q ss_conf 56898999986516775899721357888884479999999999840
Q gi|254780976|r 54 QTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDR 100 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~ 100 (160)
+-+.++..++.+.++..+.+|+|.--+...+..........++...+
T Consensus 95 ~rv~~~l~~~~~~g~~ViWvglP~~r~~~~~~~~~~~n~l~r~~~~~ 141 (200)
T cd01829 95 QRIDELLNVARAKGVPVIWVGLPAMRSPKLSADMVYLNSLYREEVAK 141 (200)
T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHH
T ss_conf 99999999997459829998389758600223699999999999997
No 116
>pfam05913 DUF871 Bacterial protein of unknown function (DUF871). This family consists of several conserved hypothetical proteins from bacteria and archaea. The function of this family is unknown.
Probab=26.61 E-value=52 Score=14.64 Aligned_cols=93 Identities=13% Similarity=0.135 Sum_probs=51.9
Q ss_pred CCCCEEEEEEEECC--CCEECCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHC
Q ss_conf 68876899963076--5110121011047741568989999865167758997213578888844799999999998404
Q gi|254780976|r 24 LGTKRIGLAISDPG--RRFAHPRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRK 101 (160)
Q Consensus 24 ~G~kriGiAisd~~--~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~ 101 (160)
||.+....--||.. +.+--.|+|+..............+.....++.++||-|. . .....+.+++-+....
T Consensus 167 ~gi~~~AFI~g~~~~rGPl~eGLPTLE~HR~~~~~~aa~~L~~~~~iD~V~IGD~~-----~--s~~el~~~~~~~~~~~ 239 (357)
T pfam05913 167 YGIKTAAFIPGDGGLRGPLYEGLPTLEKHRYLPPLAAAKHLLATGLIDDVLIGDAY-----A--SEEELKQLSEYFNEDV 239 (357)
T ss_pred CCCCEEEEEECCCCCCCCCCCCCCCCHHHCCCCHHHHHHHHHHCCCCCEEEECCCC-----C--CHHHHHHHHHHHHCCE
T ss_conf 79968999727986668836887741987699999999999972898979986999-----9--9999999999875686
Q ss_pred CCCCEEEECCCCCHHHHHHHHHH
Q ss_conf 89977998367167999999998
Q gi|254780976|r 102 VYVPFVFWDERLTTVSAQQILID 124 (160)
Q Consensus 102 ~~l~v~~~DEr~TS~~A~~~l~~ 124 (160)
+.++|.+. +.+|..+.+..+..
T Consensus 240 ~~L~v~~~-~~~~~~e~~~l~~~ 261 (357)
T pfam05913 240 ITLRVEPA-EDITEEEKKILFEE 261 (357)
T ss_pred EEEEEEEC-CCCCHHHHHHHHHC
T ss_conf 89998835-89888999998506
No 117
>COG3703 ChaC Uncharacterized protein involved in cation transport [Inorganic ion transport and metabolism]
Probab=26.59 E-value=52 Score=14.64 Aligned_cols=23 Identities=22% Similarity=0.274 Sum_probs=19.0
Q ss_pred CCCCCEEEEECCCCEEEEEEEEC
Q ss_conf 89983999986887689996307
Q gi|254780976|r 14 KPNQPIASIDLGTKRIGLAISDP 36 (160)
Q Consensus 14 ~~~~riLgiD~G~kriGiAisd~ 36 (160)
..-+.+||+|.|..-.|+|+-=|
T Consensus 55 ~~PGlvl~L~~GGsc~GvafRip 77 (190)
T COG3703 55 EQPGLVLGLDRGGSCEGVAYRIP 77 (190)
T ss_pred CCCCEEEEEECCCCEEEEEEECC
T ss_conf 79956988607993789999737
No 118
>TIGR01756 LDH_protist lactate dehydrogenase; InterPro: IPR011272 This entry represents a family of protist (Trichomonad) lactate dehydrogenases which have apparently evolved from a recent protist malate dehydrogenase ancestor . Lactate dehydrogenase converts the hydroxyl at C-2 of lactate to a carbonyl in the product, pyruvate. The preference of this enzyme for NAD or NADP has not been determined. A critical residue in malate dehydrogenase, arginine-91 (Trichomonas vaginalis numbering) has been mutated to a leucine, eliminating the positive charge which complemented the carboxylate in malate which is absent in lactate. Several other more subtle changes are proposed to make the active site smaller to accommodate the less bulky lactate molecule ..
Probab=26.59 E-value=50 Score=14.75 Aligned_cols=29 Identities=17% Similarity=0.221 Sum_probs=20.9
Q ss_pred CHHHHHHHHHHHHHC---CCEEEEEECCCCCC
Q ss_conf 156898999986516---77589972135788
Q gi|254780976|r 53 TQTALELLSFITTEN---IAAFIIGLPLNMNG 81 (160)
Q Consensus 53 ~~~~~~l~~ii~e~~---i~~iVvGlP~~~~g 81 (160)
..-++.+-+.+.+|- |..+|||+|.|.|-
T Consensus 86 ~~IFKa~GE~Ls~yAK~TVk~LviGNP~N~Nc 117 (314)
T TIGR01756 86 ASIFKATGEALSEYAKPTVKVLVIGNPVNTNC 117 (314)
T ss_pred CHHHHHHHHHHHHHCCCEEEEEEECCCCCHHH
T ss_conf 45666777787750244147888418985257
No 119
>PRK13927 rod shape-determining protein MreB; Provisional
Probab=26.31 E-value=53 Score=14.61 Aligned_cols=102 Identities=17% Similarity=0.204 Sum_probs=47.9
Q ss_pred CCCCCEEEEECCCCEEEEEEEECCCCEECCCEEEECCCCC-H---HHHHHHHHHHHHCCCEEEEEECCCCCCCCC--HHH
Q ss_conf 8998399998688768999630765110121011047741-5---689899998651677589972135788888--447
Q gi|254780976|r 14 KPNQPIASIDLGTKRIGLAISDPGRRFAHPRPFLVRKKVT-Q---TALELLSFITTENIAAFIIGLPLNMNGSEG--PRV 87 (160)
Q Consensus 14 ~~~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~~~-~---~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~--~~~ 87 (160)
.-+++.+|||+||.+.-+++.+....+--|=-+-..++.. . .-.+-.+.+ ...|..+.+=.|+. +|-.. +.+
T Consensus 5 ~~~~~~iGIDLGTtns~v~~~~~giv~~ePSvVa~~~~~~~~i~aVG~~Ak~~~-gr~p~~i~~irpl~-~Gvi~d~~~~ 82 (338)
T PRK13927 5 GLFSNDLGIDLGTANTLIYVKGKGIVLNEPSVVAIRQDTGGKVLAVGSEAKQML-GRTPGNIVAIRPMK-DGVIADFDVT 82 (338)
T ss_pred HHHCCEEEEECCHHCEEEEECCCCEECCCCCEEEEECCCCEEEEHHHHHHHHHC-CCCCCCEEEEEECC-CCCCCCHHHH
T ss_conf 443662289766340899988998785688089998389848761279999854-58967279998767-8810454778
Q ss_pred -HHHHHHHHHHHHHC-C--CCC--EEEECCCCCHHH
Q ss_conf -99999999998404-8--997--799836716799
Q gi|254780976|r 88 -HSTRAFVHNMIDRK-V--YVP--FVFWDERLTTVS 117 (160)
Q Consensus 88 -~~v~~f~~~L~~~~-~--~l~--v~~~DEr~TS~~ 117 (160)
+..+.|..+...+. + ..| |.-+.-.+|..|
T Consensus 83 ~~~l~~li~~~~~~~~~~~~~~~vVItVPa~ft~~q 118 (338)
T PRK13927 83 EKMLKYFIKKVHKRRGFFRPSPRVVVCVPSGSTQVE 118 (338)
T ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCHHH
T ss_conf 999999999997402434799969996199998899
No 120
>TIGR03281 methan_mark_12 putative methanogenesis marker protein 12. Members of this protein family, to date, are found in a completed prokaryotic genome if and only if the species is one of the archaeal methanogens. The exact function is unknown, but likely is linked to methanogenesis or a process closely connected to it.
Probab=25.83 E-value=43 Score=15.13 Aligned_cols=55 Identities=25% Similarity=0.240 Sum_probs=34.5
Q ss_pred EEEEECCCCEEEEEEEECCCCEECCCEEEECCC----CCHHHHHHHHHHHHHCCCEEEEEEC
Q ss_conf 999986887689996307651101210110477----4156898999986516775899721
Q gi|254780976|r 19 IASIDLGTKRIGLAISDPGRRFAHPRPFLVRKK----VTQTALELLSFITTENIAAFIIGLP 76 (160)
Q Consensus 19 iLgiD~G~kriGiAisd~~~~~a~Pl~~i~~~~----~~~~~~~l~~ii~e~~i~~iVvGlP 76 (160)
++|+|.|+.-|-+|+-+.....-| -+.|.. ..-...++...+.-++++++-+-|.
T Consensus 1 fiGiDHGTtgirfa~~~~~~~~~F---kl~R~e~se~~~~~~~el~~~v~~~eIdliavtYS 59 (326)
T TIGR03281 1 FVGIDHGTTGIRFAIIDGEKEPVF---KLPRTELSEKEILESDELEDRVALEEIDLIALTYS 59 (326)
T ss_pred CCCCCCCCCCEEEEEECCCCEEEE---EECHHHHHHCCCEEHHHHHCCCCHHHCEEEEEEEC
T ss_conf 946246876079999769841788---70467640123110235443798414138999610
No 121
>pfam08426 ICE2 ICE2. ICE2 is a fungal ER protein which has been shown to play an important role in forming/maintaining the cortical ER. It has also bee identified as a protein which is necessary for nuclear inner membrane targeting.
Probab=25.69 E-value=12 Score=18.52 Aligned_cols=28 Identities=25% Similarity=0.180 Sum_probs=22.3
Q ss_pred EEEECCCCEEEEEEEECCCCEECCCEEE
Q ss_conf 9998688768999630765110121011
Q gi|254780976|r 20 ASIDLGTKRIGLAISDPGRRFAHPRPFL 47 (160)
Q Consensus 20 LgiD~G~kriGiAisd~~~~~a~Pl~~i 47 (160)
|++|+|-+.||+|.|=.+-.+-+-+.++
T Consensus 19 laFdVGG~~CGLafSltL~~~YF~~stl 46 (409)
T pfam08426 19 IAFDVGGRTCGLAFSLTLFVLYFVLTTL 46 (409)
T ss_pred CEEEECCCCHHHHHHHHHHHHHHHHHHH
T ss_conf 0466478112189999999999999999
No 122
>cd01018 ZntC Metal binding protein ZntC. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and bind their specific ligands in the cleft between these domains. In addition, many of these proteins possess a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=25.49 E-value=55 Score=14.52 Aligned_cols=74 Identities=19% Similarity=0.240 Sum_probs=48.3
Q ss_pred HHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHH
Q ss_conf 89999865167758997213578888844799999999998404899779983671679999999986986445464378
Q gi|254780976|r 58 ELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVD 137 (160)
Q Consensus 58 ~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD 137 (160)
.+.-+.++|+... ++. ...|. .+..+.+..+.+.+. .-++++.|..+.++++.|+.+-++.|++.- .+|
T Consensus 179 af~Yf~~~ygl~~--~~i--~~~~~-eps~~~l~~l~~~ik--~~~v~~If~E~~~~~k~~~~ia~etg~kv~----~ld 247 (266)
T cd01018 179 AWGYFARDYGLTQ--IPI--EEEGK-EPSPADLKRLIDLAK--EKGVRVVFVQPQFSTKSAEAIAREIGAKVV----TID 247 (266)
T ss_pred CHHHHHHHCCCEE--EEC--CCCCC-CCCHHHHHHHHHHHH--HCCCCEEEEECCCCCHHHHHHHHHHCCCEE----EEC
T ss_conf 4799998679879--860--58888-989999999999999--839989997089990999999997099379----967
Q ss_pred HHHHH
Q ss_conf 99899
Q gi|254780976|r 138 SIAAA 142 (160)
Q Consensus 138 ~~AA~ 142 (160)
-+++-
T Consensus 248 pL~~~ 252 (266)
T cd01018 248 PLAAD 252 (266)
T ss_pred CCCCC
T ss_conf 86402
No 123
>TIGR00502 nagB glucosamine-6-phosphate isomerase; InterPro: IPR004547 Glucosamine-6-phosphate isomerase (3.5.99.6 from EC) catalyses the conversion of D-glucosamine 6-phosphate and water to D-fructose 6-phosphate in the N-acetylglucosamine utilization pathway. The enzyme was formerly classified as EC 5.3.1.10. This family also includes a closely related pair of proteins from Bacillus subtilis, one of which is uncharacterised but included as a member of the orthologous set.; GO: 0004342 glucosamine-6-phosphate deaminase activity, 0006044 N-acetylglucosamine metabolic process.
Probab=25.42 E-value=54 Score=14.56 Aligned_cols=88 Identities=15% Similarity=0.155 Sum_probs=42.9
Q ss_pred HHHHHHHHHHHHCCC-E--EEEEECCCCCCCCCHHHHHHHHHHHHHHHH--CCCCCE-----------EEECCCCCHHHH
Q ss_conf 689899998651677-5--899721357888884479999999999840--489977-----------998367167999
Q gi|254780976|r 55 TALELLSFITTENIA-A--FIIGLPLNMNGSEGPRVHSTRAFVHNMIDR--KVYVPF-----------VFWDERLTTVSA 118 (160)
Q Consensus 55 ~~~~l~~ii~e~~i~-~--iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~--~~~l~v-----------~~~DEr~TS~~A 118 (160)
.+..+.+-|+++.|+ . ||+||| -|+ +|.. .-+.+++--... .|.--| .-..|+|-|..-
T Consensus 16 AA~~i~~~I~~Fkp~~~~PFVLGLp---TGg-TP~g-~Yk~LI~l~qa~~~sF~~v~TfNlDEY~GL~~~hP~SYh~FM~ 90 (260)
T TIGR00502 16 AARHIANRINEFKPTAARPFVLGLP---TGG-TPVG-TYKQLIRLYQAGKISFQNVVTFNLDEYVGLSEEHPESYHSFMH 90 (260)
T ss_pred HHHHHHHHHHCCCCCCCCCEEEECC---CCC-CHHH-HHHHHHHHHHCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHHH
T ss_conf 9999999864037762367461068---888-7578-9999999850487432410553141001587888852137866
Q ss_pred HHHHHHCCCCHHHH---HHH-HHHHHHHHHHHH
Q ss_conf 99999869864454---643-789989999999
Q gi|254780976|r 119 QQILIDMNVSRKKR---IQK-VDSIAAALILQE 147 (160)
Q Consensus 119 ~~~l~~~g~~~k~~---k~~-iD~~AA~iILq~ 147 (160)
.+.+..-..+...- .+. -|--|+|-=-|.
T Consensus 91 ~~fF~HId~~~~~i~IlnGna~dl~aeCr~YE~ 123 (260)
T TIGR00502 91 NNFFQHIDIKPENINILNGNADDLEAECRRYEE 123 (260)
T ss_pred HHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHH
T ss_conf 512354477734455888686557889989999
No 124
>TIGR01098 3A0109s03R phosphonate ABC transporter, periplasmic phosphonate-binding protein; InterPro: IPR005770 Bacterial binding protein-dependent transport systems , are multicomponent systems typically composed of a periplasmic substrate-binding protein, one or two reciprocally homologous integral inner-membrane proteins and one or two peripheral membrane ATP-binding proteins that couple energy to the active transport system. The integral inner-membrane proteins translocate the substrate across the membrane. It has been shown , that most of these proteins contain a conserved region located about 80 to 100 residues from their C-terminal extremity. This region seems to be located in a cytoplasmic loop between two transmembrane domains. Apart from the conserved region, the sequence of these proteins is quite divergent, and they have a variable number of transmembrane helices. This is a family of periplasmic proteins which are part of the transport system for alkylphosphonate uptake. ; GO: 0015604 phosphonate transmembrane transporter activity, 0015716 phosphonate transport, 0030288 outer membrane-bounded periplasmic space.
Probab=25.38 E-value=55 Score=14.50 Aligned_cols=50 Identities=8% Similarity=0.149 Sum_probs=37.2
Q ss_pred HHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE-CCCCCHH
Q ss_conf 6516775899721357888884479999999999840489977998-3671679
Q gi|254780976|r 64 TTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFW-DERLTTV 116 (160)
Q Consensus 64 ~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~-DEr~TS~ 116 (160)
+.+.+..|++|+ -...+.+.+.....-|...| ++.+++||.++ =-.|+..
T Consensus 46 ~~~~p~~l~~g~--~P~e~~~~~~~~~~pL~~~L-ek~Lg~kV~~~~~~dY~a~ 96 (299)
T TIGR01098 46 KSETPKELNFGI--LPGENASNLARRWEPLKDYL-EKKLGIKVELFVATDYSAV 96 (299)
T ss_pred CCCCCCEEEEEE--ECCCCHHHHHHHHHHHHHHH-HHHCCCCEEEEEECCCCCE
T ss_conf 036777268998--44887557777668999988-8755886799861683103
No 125
>TIGR01759 MalateDH-SF1 malate dehydrogenase; InterPro: IPR010945 Malate dehydrogenases catalyse the interconversion of malate and oxaloacetate using dinucleotide cofactors . The enzymes in this entry are found in archaea, bacteria and eukaryotes and fall into two distinct groups. The first group are cytoplamsic, NAD-dependent enzymes which participate in the citric acid cycle (1.1.1.37 from EC). The second group are found in plant chloroplasts, use NADP as cofactor, and participate in the C4 cycle (1.1.1.82 from EC). Structural studies indicate that these enzymes are homodimers with very sinmilar overall topology, though the chloroplast enzymes also have N- and C-terminal extensions, and all contain the classical Rossman fold for NAD(P)H binding , , , . Substrate specificity is determined by a mobile loop at the active site which uses charge balancing to discriminate between the correct substrates (malate and oxaloacetate) and other potential oxo/hydroxyacid substrates the enzyme may encounter within the cell .; GO: 0016615 malate dehydrogenase activity, 0006108 malate metabolic process.
Probab=24.91 E-value=36 Score=15.59 Aligned_cols=39 Identities=10% Similarity=0.266 Sum_probs=22.1
Q ss_pred HHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHH-HHHHHHCC
Q ss_conf 8999986516775899721357888884479999999-99984048
Q gi|254780976|r 58 ELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFV-HNMIDRKV 102 (160)
Q Consensus 58 ~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~-~~L~~~~~ 102 (160)
+.++-.-..++.-+|||+|-|. -|--...++ ..|..+++
T Consensus 113 ~aLn~~Ak~~vKVLVVGNPaNT------NaLIa~~~AP~~ip~~Nf 152 (329)
T TIGR01759 113 KALNKVAKKDVKVLVVGNPANT------NALIASKNAPKKIPPKNF 152 (329)
T ss_pred HHHHHHCCCCCEEEEECCCCHH------HHHHHHHCCCCCCCCCCC
T ss_conf 9999861898569985298627------999999706865651111
No 126
>PRK10824 hypothetical protein; Provisional
Probab=24.84 E-value=56 Score=14.44 Aligned_cols=61 Identities=18% Similarity=0.163 Sum_probs=40.4
Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHH
Q ss_conf 5689899998651677589972135788888447999999999984048997799836716799999999
Q gi|254780976|r 54 QTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILI 123 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~ 123 (160)
...++|.++|+++.|..+.=|.|- .|+|......+.-|. ..+.++..+|= +...+.++.++
T Consensus 3 ~~~e~I~~~v~~~~VvlFMKGtp~------~P~CGFS~~~v~iL~--~~gv~f~~vnV-l~d~~ir~~lk 63 (115)
T PRK10824 3 TTIEKIQRQIAENPILLYMKGSPK------LPSCGFSAQAVQALS--ACGERFAYVDI-LQNPDIRAELP 63 (115)
T ss_pred HHHHHHHHHHHCCCEEEEECCCCC------CCCCCCHHHHHHHHH--HHCCCEEEEEC-CCCHHHHHHHH
T ss_conf 799999999843988999488999------997802899999999--92998068867-88988999989
No 127
>pfam00940 RNA_pol DNA-dependent RNA polymerase. This is a family of single chain RNA polymerases.
Probab=24.57 E-value=57 Score=14.41 Aligned_cols=33 Identities=27% Similarity=0.407 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHH
Q ss_conf 898999986516775899721357888884479
Q gi|254780976|r 56 ALELLSFITTENIAAFIIGLPLNMNGSEGPRVH 88 (160)
Q Consensus 56 ~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~ 88 (160)
..++.+.+++-++..++-++|+++||+.+-.-.
T Consensus 67 c~e~~~~~~~~~~~~~~s~lPv~~DgTcsGlQH 99 (391)
T pfam00940 67 CLELRKALRSGDPEEYISHLPIHQDGSCNGLQH 99 (391)
T ss_pred HHHHHHHHHCCCCCCCEECCCCCCCCCCCHHHH
T ss_conf 999999997699223422575454576648999
No 128
>PTZ00009 heat shock 70 kDa protein; Provisional
Probab=24.42 E-value=57 Score=14.39 Aligned_cols=22 Identities=27% Similarity=0.323 Sum_probs=16.9
Q ss_pred CCEEEEECCCCEEEEEEEECCC
Q ss_conf 8399998688768999630765
Q gi|254780976|r 17 QPIASIDLGTKRIGLAISDPGR 38 (160)
Q Consensus 17 ~riLgiD~G~kriGiAisd~~~ 38 (160)
.-.+|||+|+.+.=+|+.....
T Consensus 4 ~~aIGIDlGTtnS~vAv~~~g~ 25 (657)
T PTZ00009 4 EGAIGIDLGTTYSCVGVWQNER 25 (657)
T ss_pred CCEEEEECCHHCEEEEEEECCE
T ss_conf 8789998571138999996991
No 129
>pfam07066 Phage_Lacto_M3 Lactococcus phage M3 protein. This family consists of several Lactococcus phage middle-3 (M3) proteins of around 160 residues in length. The function of this family is unknown.
Probab=24.27 E-value=58 Score=14.38 Aligned_cols=54 Identities=15% Similarity=0.068 Sum_probs=35.0
Q ss_pred CCCEEEEECCCCE-----EEEEEEECCCCEECCCEEEECCCCC--------HHHHHHHHHHHHHCCCE
Q ss_conf 9839999868876-----8999630765110121011047741--------56898999986516775
Q gi|254780976|r 16 NQPIASIDLGTKR-----IGLAISDPGRRFAHPRPFLVRKKVT--------QTALELLSFITTENIAA 70 (160)
Q Consensus 16 ~~riLgiD~G~kr-----iGiAisd~~~~~a~Pl~~i~~~~~~--------~~~~~l~~ii~e~~i~~ 70 (160)
|+++|+|||.+.. +|-|+-.|+..-.. ++.+...+.. .....+.++|.+++.-.
T Consensus 1 mkk~laidfstankkgegtgyafr~pl~~e~~-vgsika~~~kk~~~era~~i~d~i~diid~fdl~~ 67 (162)
T pfam07066 1 MKKALAIDFSTANKKGEGTGYAFRKPLDGEVV-VGSIKAGGKKKDALERAFDIADAIKDIIDEFDLFD 67 (162)
T ss_pred CCCEEEEEEECCCCCCCCCCEEEECCCCCCEE-EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCHHH
T ss_conf 97138886312566787652376047888689-99874078566889999989999999987632233
No 130
>pfam03343 SART-1 SART-1 family. SART-1 is a protein involved in cell cycle arrest and pre-mRNA splicing. It has been shown to be a component of U4/U6 x U5 tri-snRNP complex in human, Schizosaccharomyces pombe and Saccharomyces cerevisiae. SART-1 is a known tumour antigen in a range of cancers recognized by T cells.
Probab=24.21 E-value=58 Score=14.37 Aligned_cols=30 Identities=27% Similarity=0.174 Sum_probs=23.4
Q ss_pred HHCCCCCEEEECC---CCCHHHHHHHHHHCCCC
Q ss_conf 4048997799836---71679999999986986
Q gi|254780976|r 99 DRKVYVPFVFWDE---RLTTVSAQQILIDMNVS 128 (160)
Q Consensus 99 ~~~~~l~v~~~DE---r~TS~~A~~~l~~~g~~ 128 (160)
...|++.+.|+|| .+|++||.+.|.+.-..
T Consensus 526 ~YkPdVkL~Y~DE~Gr~Lt~KEAFk~LSHkFHG 558 (577)
T pfam03343 526 NYKPDVKLKYVDEFGRELTPKEAFKYLSHKFHG 558 (577)
T ss_pred CCCCCCCEEEECCCCCCCCHHHHHHHHHHCCCC
T ss_conf 689987268878989878889999998741158
No 131
>cd06295 PBP1_CelR Ligand binding domain of a transcription regulator of cellulose genes, CelR, which is highly homologous to the LacI-GalR family of bacterial transcription regulators. This group includes the ligand binding domain of a transcription regulator of cellulose genes, CelR, which is highly homologous to the LacI-GalR family of bacterial transcription regulators. The binding of CelR to the celE promoter is inhibited specifically by cellobiose. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn chang
Probab=24.15 E-value=58 Score=14.36 Aligned_cols=85 Identities=14% Similarity=0.151 Sum_probs=41.3
Q ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCC--------------HHHHHH
Q ss_conf 689899998651677589972135788888447999999999984048997799836716--------------799999
Q gi|254780976|r 55 TALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLT--------------TVSAQQ 120 (160)
Q Consensus 55 ~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~T--------------S~~A~~ 120 (160)
.-.++..++..++++++|+ .|.+.+ . ...+.+. ..++||.++|.... ...|-+
T Consensus 52 ~e~~~~~~l~~~~vdGiIi-~~~~~~------~----~~~~~l~--~~~iPvV~~d~~~~~~~~~~V~~d~~~a~~~~~~ 118 (275)
T cd06295 52 DRDWLARYLASGRADGVIL-IGQHDQ------D----PLPERLA--ETGLPFVVWGRPLPGQPYCYVGSDNVGGGRLATE 118 (275)
T ss_pred HHHHHHHHHHHCCCCEEEE-ECCCCC------H----HHHHHHH--HCCCCEEEECCCCCCCCCCEEEECHHHHHHHHHH
T ss_conf 8999999998489988999-799899------7----9999999--5799999998626899997898287999999999
Q ss_pred HHHHCCCCHHH-HHHHHHHHHHHHHHHHHHHHH
Q ss_conf 99986986445-464378998999999999686
Q gi|254780976|r 121 ILIDMNVSRKK-RIQKVDSIAAALILQEVLDRI 152 (160)
Q Consensus 121 ~l~~~g~~~k~-~k~~iD~~AA~iILq~~Ld~~ 152 (160)
.|.+.|.++=- -.+..+..++.-=+++|.+.+
T Consensus 119 ~L~~~G~~~I~~i~~~~~~~~~~~R~~Gf~~a~ 151 (275)
T cd06295 119 HLLARGRRRIAFLGGPQDMPEGEERLEGYREAL 151 (275)
T ss_pred HHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHH
T ss_conf 999809987987058866726999999999999
No 132
>PRK08153 histidinol-phosphate aminotransferase; Provisional
Probab=24.08 E-value=58 Score=14.35 Aligned_cols=57 Identities=23% Similarity=0.230 Sum_probs=37.4
Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCH
Q ss_conf 56898999986516775899721357888884479999999999840489977998367167
Q gi|254780976|r 54 QTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLTT 115 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS 115 (160)
.++..+..-+.+.+...+++..|=|..|+.-+.. .+..|++.+ ..--+..+||.|--
T Consensus 143 ~d~~~~l~~~~~~~~klv~l~nPNNPTG~~~~~~-~l~~l~~~~----~~~~lvVvDEAY~e 199 (370)
T PRK08153 143 EDLDALLDAARRENAPLVYLANPDNPMGSWHPAA-DIVAFIEAL----PETTLLVLDEAYCE 199 (370)
T ss_pred CCHHHHHHHCCCCCCCEEEECCCCCCCCCCCCHH-HHHHHHHHC----CCCCEEEEEHHHHH
T ss_conf 4869999860378973899768959857767868-899999847----77738962153564
No 133
>TIGR00591 phr2 deoxyribodipyrimidine photolyase; InterPro: IPR008148 Deoxyribodipyrimidine photolyase (DNA photolyase) is a DNA repair enzyme. It binds to UV-damaged DNA containing pyrimidine dimers and, upon absorbing a near-UV photon (300 to 500 nm), breaks the cyclobutane ring joining the two pyrimidines of the dimer. DNA photolyase is an enzyme that requires two choromophore-cofactors for its activity: a reduced FADH2 and either 5,10-methenyltetrahydrofolate (5,10-MTFH) or an oxidized 8-hydroxy-5- deazaflavin (8-HDF) derivative (F420). The folate or deazaflavin chromophore appears to function as an antenna, while the FADH2 chromophore is thought to be responsible for electron transfer. On the basis of sequence similarities DNA photolyases can be grouped into two classes. The second class contains enzymes from Myxococcus xanthus, methanogenic archaebacteria, insects, fish and marsupial mammals. It is not yet known what second cofactor is bound to class 2 enzymes. There are a number of conserved sequence regions in all known class 2 DNA photolyases, especially in the C-terminal part. ; GO: 0003904 deoxyribodipyrimidine photo-lyase activity, 0006281 DNA repair.
Probab=23.84 E-value=59 Score=14.33 Aligned_cols=41 Identities=29% Similarity=0.520 Sum_probs=25.7
Q ss_pred CHHHHHHHHHHHHHHH-HCCCCCEEEEC--CCCCHHH-----HHHHHHH
Q ss_conf 8447999999999984-04899779983--6716799-----9999998
Q gi|254780976|r 84 GPRVHSTRAFVHNMID-RKVYVPFVFWD--ERLTTVS-----AQQILID 124 (160)
Q Consensus 84 ~~~~~~v~~f~~~L~~-~~~~l~v~~~D--Er~TS~~-----A~~~l~~ 124 (160)
+...+.|..|.++|+. +.+.-...|+. +.|.|.. |++.|.+
T Consensus 297 ~~~~~sv~~FlEEL~VRREL~dNFCfY~PG~~Yds~~gl~~WA~~TL~~ 345 (471)
T TIGR00591 297 GNSSASVDAFLEELIVRRELADNFCFYNPGPSYDSLKGLYEWAQKTLMD 345 (471)
T ss_pred CCCHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCHHHHHHHHH
T ss_conf 6783357765565665444205622407888744335662789999875
No 134
>PRK10812 putative metallodependent hydrolase; Provisional
Probab=23.84 E-value=59 Score=14.33 Aligned_cols=73 Identities=19% Similarity=0.312 Sum_probs=48.9
Q ss_pred CHHHHHHHHHHHHHCCCEE-EEEECCCCCCCCCHHHHHHHHHHHHHH-HHCCCCCEEEECCCCCHHHHHHHHHHCCCC
Q ss_conf 1568989999865167758-997213578888844799999999998-404899779983671679999999986986
Q gi|254780976|r 53 TQTALELLSFITTENIAAF-IIGLPLNMNGSEGPRVHSTRAFVHNMI-DRKVYVPFVFWDERLTTVSAQQILIDMNVS 128 (160)
Q Consensus 53 ~~~~~~l~~ii~e~~i~~i-VvGlP~~~~g~~~~~~~~v~~f~~~L~-~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~ 128 (160)
...+.++..++.+..+.+| =+||.+..+.+ ....+...|...|. ...+++||.++ .|-+..++.+.|++.+..
T Consensus 74 ~~~~~~l~~l~~~~~vvAIGEiGLDy~~~~~--~~~~Q~~~F~~ql~lA~~~~lPviiH-~R~A~~d~~~iLk~~~~~ 148 (265)
T PRK10812 74 PYDVEDLRRLAAEEGVVAMGETGLDYFYTPE--TKVRQQASFIHHIQIGRELNKPVIVH-TRDARADTLAILREEKVT 148 (265)
T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCCCCCC--CHHHHHHHHHHHHHHHHHHCCCEEEE-ECCCHHHHHHHHHHHCCC
T ss_conf 3109999998518993898505799999998--79999999999999999747986998-554348999999982477
No 135
>PRK01433 hscA chaperone protein HscA; Provisional
Probab=23.29 E-value=60 Score=14.26 Aligned_cols=22 Identities=32% Similarity=0.379 Sum_probs=18.3
Q ss_pred CCEEEEECCCCEEEEEEEECCC
Q ss_conf 8399998688768999630765
Q gi|254780976|r 17 QPIASIDLGTKRIGLAISDPGR 38 (160)
Q Consensus 17 ~riLgiD~G~kriGiAisd~~~ 38 (160)
.-++|||+|+.+.=||+-....
T Consensus 19 ~~aIGIDLGTTnS~VAv~~~g~ 40 (595)
T PRK01433 19 QIAVGIDFGTTNSLIAIATNRK 40 (595)
T ss_pred EEEEEEECCHHHEEEEEEECCE
T ss_conf 4899995680101789998998
No 136
>PRK09404 kgd alpha-ketoglutarate decarboxylase; Reviewed
Probab=23.15 E-value=61 Score=14.24 Aligned_cols=62 Identities=11% Similarity=0.185 Sum_probs=35.9
Q ss_pred HHHHHHHHHHHHCC-CCCEEEECCCCCHHHHHHHHHH---C-----CCCHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 99999999984048-9977998367167999999998---6-----986445464378998999999999686
Q gi|254780976|r 89 STRAFVHNMIDRKV-YVPFVFWDERLTTVSAQQILID---M-----NVSRKKRIQKVDSIAAALILQEVLDRI 152 (160)
Q Consensus 89 ~v~~f~~~L~~~~~-~l~v~~~DEr~TS~~A~~~l~~---~-----g~~~k~~k~~iD~~AA~iILq~~Ld~~ 152 (160)
..+.....|.+..+ .+-++|. +.+..+-++.+++ . ..+...++...+.+..+-.+|.||...
T Consensus 146 tLreIi~~Lk~tYCgsIG~Eym--HI~d~eer~Wl~~riE~~~~~~~~s~eeKk~IL~~L~~Ae~FE~FL~~K 216 (931)
T PRK09404 146 TLREIIEALKKTYCGSIGVEFM--HISDPEERRWLQERIESPRGRPTFSAEEKKAILERLTAAEGFERFLHTK 216 (931)
T ss_pred CHHHHHHHHHHHHCCCHHHEEE--CCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf 2999999999984675864000--7899899999999973847789989999999999999999999999887
No 137
>TIGR01212 TIGR01212 radical SAM protein, TIGR01212 family; InterPro: IPR005911 This uncharacterised protein family shows significant similarity to IPR005910 from INTERPRO, a longer protein that is a histone acetyltransferase at its C-terminus and is a subunit of RNA polymerase II (in yeast). This family lacks the GNAT acetyltransferase domain..
Probab=22.84 E-value=61 Score=14.21 Aligned_cols=62 Identities=13% Similarity=0.146 Sum_probs=40.9
Q ss_pred CCEEEEECCCC----------------EEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHH---HHHC---CCEEEEE
Q ss_conf 83999986887----------------689996307651101210110477415689899998---6516---7758997
Q gi|254780976|r 17 QPIASIDLGTK----------------RIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFI---TTEN---IAAFIIG 74 (160)
Q Consensus 17 ~riLgiD~G~k----------------riGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii---~e~~---i~~iVvG 74 (160)
.-|+||++||+ .=|+-|.=.++.-+---.|+.+=|...++..+.+.+ +.++ ..=+|+|
T Consensus 109 ~~vVGlsvgTRPDC~P~~VLDlL~ey~~~GyevWvELGLQtah~~TL~~INRgHd~~~y~~a~~~~~krGikVC~H~I~G 188 (307)
T TIGR01212 109 DDVVGLSVGTRPDCVPDEVLDLLAEYKERGYEVWVELGLQTAHDRTLKKINRGHDFACYVDAVKRARKRGIKVCSHVILG 188 (307)
T ss_pred CCEEEEEECCCCCCCHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCEEEEEEEEC
T ss_conf 78057753688987747899999999549758999605356558999985143787899999999976598899998742
Q ss_pred ECCC
Q ss_conf 2135
Q gi|254780976|r 75 LPLN 78 (160)
Q Consensus 75 lP~~ 78 (160)
||..
T Consensus 189 LPgE 192 (307)
T TIGR01212 189 LPGE 192 (307)
T ss_pred CCCC
T ss_conf 8988
No 138
>COG0567 SucA 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes [Energy production and conversion]
Probab=22.67 E-value=61 Score=14.21 Aligned_cols=65 Identities=8% Similarity=0.069 Sum_probs=35.8
Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHH---HC---CCCHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 47999999999984048997799836716799999999---86---98644546437899899999999968
Q gi|254780976|r 86 RVHSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILI---DM---NVSRKKRIQKVDSIAAALILQEVLDR 151 (160)
Q Consensus 86 ~~~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~---~~---g~~~k~~k~~iD~~AA~iILq~~Ld~ 151 (160)
.....+.+...|.+..+ -.|-+-=-+-+..+-++.+. +. ..+..+++.....+.|+-=+|.||..
T Consensus 126 ~~~~l~e~~~~l~~~Yc-gsig~E~~hi~~~~~~~Wl~~riE~~~~~~~~e~k~~~l~~L~~ae~fE~fl~~ 196 (906)
T COG0567 126 ETMTLRELIEILKKTYC-GSIGVEYMHISDPEEKRWLQERIESGKPTFTAEEKKAILKRLTAAEGFERFLHT 196 (906)
T ss_pred CCCCHHHHHHHHHHHHC-CCEEEEEECCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHC
T ss_conf 42319999999998724-650356633599899999999971578887889999999999999999998624
No 139
>cd05695 S1_Rrp5_repeat_hs3 S1_Rrp5_repeat_hs3: Rrp5 is a trans-acting factor important for biogenesis of both the 40S and 60S eukaryotic ribosomal subunits. Rrp5 has two distinct regions, an N-terminal region containing tandemly repeated S1 RNA-binding domains (12 S1 repeats in Saccharomyces cerevisiae Rrp5 and 14 S1 repeats in Homo sapiens Rrp5) and a C-terminal region containing tetratricopeptide repeat (TPR) motifs thought to be involved in protein-protein interactions. Mutational studies have shown that each region represents a specific functional domain. Deletions within the S1-containing region inhibit pre-rRNA processing at either site A3 or A2, whereas deletions within the TPR region confer an inability to support cleavage of A0-A2. This CD includes H. sapiens S1 repeat 3 (hs3). Rrp5 is found in eukaryotes but not in prokaryotes or archaea.
Probab=22.59 E-value=52 Score=14.64 Aligned_cols=16 Identities=38% Similarity=0.488 Sum_probs=12.9
Q ss_pred CCCEEEEECCCCEEEE
Q ss_conf 9839999868876899
Q gi|254780976|r 16 NQPIASIDLGTKRIGL 31 (160)
Q Consensus 16 ~~riLgiD~G~kriGi 31 (160)
+.|||.+|+-+|+||+
T Consensus 50 ~ArIL~Vdp~tK~V~L 65 (66)
T cd05695 50 RARILYVDPSTKVVGL 65 (66)
T ss_pred EEEEEEECCCCCEEEC
T ss_conf 9999999887478853
No 140
>PRK05166 histidinol-phosphate aminotransferase; Provisional
Probab=22.20 E-value=63 Score=14.13 Aligned_cols=58 Identities=16% Similarity=0.151 Sum_probs=37.3
Q ss_pred CCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCC
Q ss_conf 7415689899998651677589972135788888447999999999984048997799836716
Q gi|254780976|r 51 KVTQTALELLSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFWDERLT 114 (160)
Q Consensus 51 ~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~DEr~T 114 (160)
+..-++..+.+.+.+ +...+++..|=|..|..-+.. ..+.+++.+ .. + -+..+||-|-
T Consensus 146 ~~~~d~~~l~~ai~~-~tklv~l~nPnNPTG~~~~~~-~l~~l~~~~-~~--~-~~vvvDEaY~ 203 (374)
T PRK05166 146 DLTVDVDALLAAVAR-KPRMLMFSNPMNPVGSWLTPG-ALARLLDAV-PP--E-TLIVVDEAYA 203 (374)
T ss_pred CCCCCHHHHHHHHCC-CCCEEEEECCCCCCCCCCCHH-HHHHHHHHC-CC--C-EEEEECHHHH
T ss_conf 679799999986156-987899828989987658847-799999745-88--7-3999504789
No 141
>COG0675 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=22.12 E-value=64 Score=14.12 Aligned_cols=132 Identities=15% Similarity=0.130 Sum_probs=68.8
Q ss_pred CCCCCCEEEEECCCCEEEEEEEECCCCEECCCEEE--ECC---------------------------------CC----C
Q ss_conf 58998399998688768999630765110121011--047---------------------------------74----1
Q gi|254780976|r 13 LKPNQPIASIDLGTKRIGLAISDPGRRFAHPRPFL--VRK---------------------------------KV----T 53 (160)
Q Consensus 13 ~~~~~riLgiD~G~kriGiAisd~~~~~a~Pl~~i--~~~---------------------------------~~----~ 53 (160)
....++++|+|+|.+..- ++++. .....|.... ... +. .
T Consensus 171 ~~~~~~~~giD~g~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~k~ls~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~ 248 (364)
T COG0675 171 VPKPGKVVGIDLGLKNLA-TVSDN-GTVPIPRDGRKLLKRLKKAQKRLSRKKSRSKRRKKAKLKLARLRERIRNRRKDFH 248 (364)
T ss_pred CCCCCCEEEEECCCCEEE-EEECC-CCEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf 367786789941775378-99647-6145124441467888999988766654566542225656667888988788899
Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHH-HHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCC-CHHH
Q ss_conf 5689899998651677589972135788888447-9999999999840489977998367167999999998698-6445
Q gi|254780976|r 54 QTALELLSFITTENIAAFIIGLPLNMNGSEGPRV-HSTRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNV-SRKK 131 (160)
Q Consensus 54 ~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~~~~~-~~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~-~~k~ 131 (160)
.....+.+......+..++++.... +-+.+.++ ...+.+.+...+. .+++|..++|.+||+.--. .|. +.+.
T Consensus 249 k~~~~~v~~~~~~~~~~~~~~~~~~-~~~i~~~~~~~~~~~l~yka~~-~~~~v~~~~~~~tS~~C~~----cg~~~~r~ 322 (364)
T COG0675 249 KLAKKLVELADLVGVETLVVEDLVK-RRSISDWAFGELRRQLEYKAEW-GGIVVKVVPPYYTSKTCPC----CGHLSGRL 322 (364)
T ss_pred HHHHHHHHHHHHCCCCEEEECCHHH-CCHHHHHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCC----CCCCCCCE
T ss_conf 9999999988860544589645142-0225674699999999999973-7925998089987777887----78747854
Q ss_pred -------HHHHHHHHHHHHHHHHHHHHH
Q ss_conf -------464378998999999999686
Q gi|254780976|r 132 -------RIQKVDSIAAALILQEVLDRI 152 (160)
Q Consensus 132 -------~k~~iD~~AA~iILq~~Ld~~ 152 (160)
....=|-.||.=|+..++...
T Consensus 323 ~~C~~cg~~~~rD~naa~Ni~~~~~~~~ 350 (364)
T COG0675 323 FKCPRCGFVHDRDVNAALNIARRALGLL 350 (364)
T ss_pred EECCCCCCEECCHHHHHHHHHHHHCCCC
T ss_conf 8889989740653889999998522567
No 142
>TIGR01266 fum_ac_acetase fumarylacetoacetase; InterPro: IPR005959 Fumarylacetoacetase (3.7.1.2 from EC; also known as fumarylacetoacetate hydrolase or FAH) catalyses the hydrolytic cleavage of a carbon-carbon bond in fumarylacetoacetate to yield fumarate and acetoacetate as the final step in phenylalanine and tyrosine degradation . This is an essential metabolic function in humans, the lack of FAH causing type I tyrosinaemia, which is associated with liver and kidney abnormalities and neurological disorders , . The enzyme mechanism involves a catalytic metal ion, a Glu/His catalytic dyad, and a charged oxyanion hole . FAH folds into two domains: an N-terminal domain SH3-like beta-barrel, and a C-terminal with an unusual fold consisting of three layers of beta-sheet structures .; GO: 0004334 fumarylacetoacetase activity, 0009072 aromatic amino acid family metabolic process.
Probab=22.04 E-value=41 Score=15.29 Aligned_cols=64 Identities=20% Similarity=0.336 Sum_probs=35.1
Q ss_pred CCCCEEEEEEEECCCCEECCCE--EEECCCCCHHHHHHHHHHHHHCCCEE-EEEECCCCCCCCCHHHHHHHHHHHHHHH
Q ss_conf 6887689996307651101210--11047741568989999865167758-9972135788888447999999999984
Q gi|254780976|r 24 LGTKRIGLAISDPGRRFAHPRP--FLVRKKVTQTALELLSFITTENIAAF-IIGLPLNMNGSEGPRVHSTRAFVHNMID 99 (160)
Q Consensus 24 ~G~kriGiAisd~~~~~a~Pl~--~i~~~~~~~~~~~l~~ii~e~~i~~i-VvGlP~~~~g~~~~~~~~v~~f~~~L~~ 99 (160)
--.+||||||+|...-++.-.. .+ .. ...+.+-.++.++--.+.| -+|-| .-..+|....+|..
T Consensus 27 ~~s~~~gvAIGD~iLdL~~~~~~GlF--~g-~p~~~~h~~vf~~~tLNaF~aLG~p---------a~~~~R~~lq~lls 93 (436)
T TIGR01266 27 DSSRRIGVAIGDQILDLSKIAAAGLF--EG-PPALKEHQEVFKQSTLNAFMALGRP---------AWKEARARLQELLS 93 (436)
T ss_pred CCCCCEEEEECCHHHHHHHHHHCCCC--CC-HHHHHHHHHHHCCCHHHHHHHCCHH---------HHHHHHHHHHHHHC
T ss_conf 78883047616224218999854686--66-3678999987402212565507868---------99999999999833
No 143
>TIGR02852 spore_dpaB dipicolinic acid synthetase, B subunit; InterPro: IPR014214 Members of this entry represent the B subunit of dipicolinic acid synthetase, an enzyme that synthesizes a small molecule that appears to confer heat stability to bacterial endospores such as those of Bacillus subtilis. The A and B subunits are together in what was originally designated the spoVF locus for stage V of endospore formation..
Probab=21.96 E-value=64 Score=14.10 Aligned_cols=54 Identities=13% Similarity=0.193 Sum_probs=33.0
Q ss_pred CEEEEEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHH-HCCCEEEEEECCCCCCCCCHHHHHHHH
Q ss_conf 768999630765110121011047741568989999865-167758997213578888844799999
Q gi|254780976|r 27 KRIGLAISDPGRRFAHPRPFLVRKKVTQTALELLSFITT-ENIAAFIIGLPLNMNGSEGPRVHSTRA 92 (160)
Q Consensus 27 kriGiAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e-~~i~~iVvGlP~~~~g~~~~~~~~v~~ 92 (160)
||||+|++-+=+ |+ +..+-+|++++.+ -+|.-|+=--=.+.+-..|+-.+++++
T Consensus 1 krIGfglTGSHC-------t~-----~~~~p~le~L~~~GAev~P~~s~~V~~T~TrFG~~~dwi~~ 55 (188)
T TIGR02852 1 KRIGFGLTGSHC-------TL-----EAVMPQLEKLVEEGAEVTPIVSETVQTTDTRFGDGADWIKK 55 (188)
T ss_pred CEEEEECCCCCC-------CH-----HHHHHHHHHHHHCCCEEEEEEEECCCCCCCCCCCHHHHHHH
T ss_conf 934311362011-------66-----78989999999669848888630004356556861889999
No 144
>KOG4013 consensus
Probab=21.50 E-value=65 Score=14.04 Aligned_cols=63 Identities=19% Similarity=0.379 Sum_probs=40.1
Q ss_pred HHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE---CCCCCHHHHHH-HHHHCCCCH
Q ss_conf 99986516775899721357888884479999999999840489977998---36716799999-999869864
Q gi|254780976|r 60 LSFITTENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFW---DERLTTVSAQQ-ILIDMNVSR 129 (160)
Q Consensus 60 ~~ii~e~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~---DEr~TS~~A~~-~l~~~g~~~ 129 (160)
.++++++..++||+|- ++.||+...-. + ..++...-.+||+|+ |=-|--+.+.+ .|.+.|-++
T Consensus 87 v~llk~~GAdGfVFGa-Lt~dgsid~~~--C----~si~~~~rplPVTFHRAfD~~~D~k~~lE~~l~~lGF~r 153 (255)
T KOG4013 87 VELLKKAGADGFVFGA-LTSDGSIDRTS--C----QSIIETARPLPVTFHRAFDVAYDWKTCLEDALLDLGFKR 153 (255)
T ss_pred HHHHHHCCCCCEEEEE-CCCCCCCCHHH--H----HHHHHHCCCCCEEEEEEHHHHCCHHHHHHHHHHHHHHHH
T ss_conf 9988874997268740-27888758899--9----999986589862644302321488888899999853899
No 145
>KOG1856 consensus
Probab=21.23 E-value=66 Score=14.01 Aligned_cols=93 Identities=15% Similarity=0.177 Sum_probs=52.7
Q ss_pred CEEEEECCCCEEE----EEEEECCCCEECCCEEEE---CCC-------CCHHHHHHHHHHHHHCCCEEEEEECCCCCCCC
Q ss_conf 3999986887689----996307651101210110---477-------41568989999865167758997213578888
Q gi|254780976|r 18 PIASIDLGTKRIG----LAISDPGRRFAHPRPFLV---RKK-------VTQTALELLSFITTENIAAFIIGLPLNMNGSE 83 (160)
Q Consensus 18 riLgiD~G~kriG----iAisd~~~~~a~Pl~~i~---~~~-------~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~ 83 (160)
|+|++=+|+.+=- +-+. ..+-+.--+.... +++ ...+...+.++|+...|+.|.|+-| +-..
T Consensus 604 rvl~~~~~~~~~~a~f~v~vn-~~Gd~vD~lrl~~~~kr~~~~n~~~r~~k~~d~f~kFI~~~kP~vi~v~g~---~r~~ 679 (1299)
T KOG1856 604 RVLAVCGGTERSDAIFCVLVN-FEGDLVDYLRLVDITKRKTLVNDEERKKKFQDLFKKFIEKKKPHVIGVSGE---NRLK 679 (1299)
T ss_pred EEEEECCCCCCCCEEEEEEEC-CCCCEEEEEECCCHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECC---CCHH
T ss_conf 478741588887517999976-777624456515303332023203436668999999998439988996277---7126
Q ss_pred CHHHHHHHHHHHHHHH--HCCCCCEEEECCCCC
Q ss_conf 8447999999999984--048997799836716
Q gi|254780976|r 84 GPRVHSTRAFVHNMID--RKVYVPFVFWDERLT 114 (160)
Q Consensus 84 ~~~~~~v~~f~~~L~~--~~~~l~v~~~DEr~T 114 (160)
......++...+++.- ....+||+++|+.+-
T Consensus 680 q~~~~~I~~~v~el~~~~~~~~ipv~~vd~ela 712 (1299)
T KOG1856 680 QKIYEAIRQLVHELLISDQGHPIPVIYVDNELA 712 (1299)
T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCEEECCCHHH
T ss_conf 779999999998435312479865240361888
No 146
>pfam00899 ThiF ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family.
Probab=21.22 E-value=66 Score=14.01 Aligned_cols=57 Identities=18% Similarity=0.219 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH
Q ss_conf 9999999998404899779983671679999999986986445464378998999999999
Q gi|254780976|r 89 STRAFVHNMIDRKVYVPFVFWDERLTTVSAQQILIDMNVSRKKRIQKVDSIAAALILQEVL 149 (160)
Q Consensus 89 ~v~~f~~~L~~~~~~l~v~~~DEr~TS~~A~~~l~~~g~~~k~~k~~iD~~AA~iILq~~L 149 (160)
++...++++.+.++.+.|..+++++++....+.+.+..+=. .-.|..++...|..+-
T Consensus 56 Ka~~~~~~l~~~np~~~i~~~~~~i~~~~~~~~~~~~DvVi----~~~Dn~~~r~~ln~~c 112 (134)
T pfam00899 56 KAEVAKERLRAINPDVEVEAYPERLTPENLEELLKGADLVV----DALDNFAARYLLNDAC 112 (134)
T ss_pred EHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHCCCCCCEEE----ECCCCHHHHHHHHHHH
T ss_conf 07999999997789958999925679778764264488999----9989999999999999
No 147
>TIGR03031 cas_csx12 CRISPR-associated protein, Csx12 family. Members of this family of CRISPR-associated (cas) protein are found, so far, in CRISPR/cas loci in Wolinella succinogenes DSM 1740, Legionella pneumophila str. Paris, and Francisella tularensis, where the last probably is an example of a degenerate CRISPR locus, having neither repeats nor a functional Cas1. The characteristic repeat length is 37 base pairs and period is about 72. One region of this large protein shows sequence similarity to pfam01844, HNH endonuclease.
Probab=21.19 E-value=40 Score=15.34 Aligned_cols=16 Identities=44% Similarity=0.576 Sum_probs=14.4
Q ss_pred EEEECCCCEEEEEEEE
Q ss_conf 9998688768999630
Q gi|254780976|r 20 ASIDLGTKRIGLAISD 35 (160)
Q Consensus 20 LgiD~G~kriGiAisd 35 (160)
+|||.|.|.+||+.+.
T Consensus 2 I~IDLg~KnTGvfs~~ 17 (802)
T TIGR03031 2 IGIDLGGKNTGVFSSH 17 (802)
T ss_pred CEEECCCCCCEEEHHH
T ss_conf 4341377521212677
No 148
>PRK13321 pantothenate kinase; Reviewed
Probab=21.15 E-value=67 Score=14.00 Aligned_cols=84 Identities=18% Similarity=0.188 Sum_probs=41.9
Q ss_pred EEEEECCCCEEEEEEEECCCCEEC-CCEEEECCCCCHHHHHHHHHHHHH-----CCCEEEEEECCCCCCCCCHHHHHHHH
Q ss_conf 999986887689996307651101-210110477415689899998651-----67758997213578888844799999
Q gi|254780976|r 19 IASIDLGTKRIGLAISDPGRRFAH-PRPFLVRKKVTQTALELLSFITTE-----NIAAFIIGLPLNMNGSEGPRVHSTRA 92 (160)
Q Consensus 19 iLgiD~G~kriGiAisd~~~~~a~-Pl~~i~~~~~~~~~~~l~~ii~e~-----~i~~iVvGlP~~~~g~~~~~~~~v~~ 92 (160)
+|.||.|-.++=+|+.+....... -+.+-.....+.....+..+.+.. +++.+++.-= -+. ....
T Consensus 2 ~L~IDiGNT~ik~g~~~~~~~~~~~~~~t~~~~~~de~~~~l~~~~~~~~~~~~~i~~v~isSV-------vp~--~~~~ 72 (256)
T PRK13321 2 LLCIDVGNTNIVLGIFDGDQLVHNWRLPTDTTRTSDELGILLLSLFRHAGLDPEDIRAVVISSV-------VPP--LDYS 72 (256)
T ss_pred EEEEEECCHHEEEEEEECCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCHHHCCEEEEEEC-------CCC--HHHH
T ss_conf 8999987315199999899999999965886558789999999999874999767867999724-------704--7999
Q ss_pred HHHHHHHHCCCCCEEEECCC
Q ss_conf 99999840489977998367
Q gi|254780976|r 93 FVHNMIDRKVYVPFVFWDER 112 (160)
Q Consensus 93 f~~~L~~~~~~l~v~~~DEr 112 (160)
+.+ ...+.++++.+++.+.
T Consensus 73 ~~~-~~~~~~~~~~~~v~~~ 91 (256)
T PRK13321 73 LES-ACKKYFGIKPLVVGPG 91 (256)
T ss_pred HHH-HHHHHHCCCCEECCCC
T ss_conf 999-9999839982653888
No 149
>COG2441 Predicted butyrate kinase [Energy production and conversion]
Probab=20.91 E-value=63 Score=14.13 Aligned_cols=59 Identities=25% Similarity=0.388 Sum_probs=31.3
Q ss_pred EEEECCCCEEE-EEEEECCCCEECCCEEEECCCCCHHHHHHHHHHHHHC-----CCEEE----EEECCCC
Q ss_conf 99986887689-9963076511012101104774156898999986516-----77589----9721357
Q gi|254780976|r 20 ASIDLGTKRIG-LAISDPGRRFAHPRPFLVRKKVTQTALELLSFITTEN-----IAAFI----IGLPLNM 79 (160)
Q Consensus 20 LgiD~G~kriG-iAisd~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~-----i~~iV----vGlP~~~ 79 (160)
+|||.|++-.- +++-|..+..-. ...++++...+.-.-+.++++|.+ ++.+| -|+|+..
T Consensus 1 ~GIDpGT~smdvfgfdDEsg~vi~-~~~I~rdeVtk~p~iiv~ii~e~~~e~g~~daivgpSGyGlPlk~ 69 (374)
T COG2441 1 IGIDPGTGSMDVFGFDDESGNVIV-DVAIPRDEVTKSPRIIVDIIEEVQAEVGGIDAIVGPSGYGLPLKR 69 (374)
T ss_pred CCCCCCCCCEEEEEEECCCCCEEE-EEECCHHHHCCCCHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCC
T ss_conf 984899873568988537887899-974378883148427999999875653464401246667876542
No 150
>TIGR03123 one_C_unchar_1 probable H4MPT-linked C1 transfer pathway protein. This protein family was identified, by the method of partial phylogenetic profiling, as related to the use of tetrahydromethanopterin (H4MPT) as a C-1 carrier. Characteristic markers of the H4MPT-linked C1 transfer pathway include formylmethanofuran dehydrogenase subunits, methenyltetrahydromethanopterin cyclohydrolase, etc. Tetrahydromethanopterin, a tetrahydrofolate analog, occurs in methanogenic archaea, bacterial methanotrophs, planctomycetes, and a few other lineages.
Probab=20.38 E-value=69 Score=13.90 Aligned_cols=92 Identities=16% Similarity=0.171 Sum_probs=49.7
Q ss_pred EEEECCCCEEEEEEEECCCCEEC----CCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCC----CHHHHHHH
Q ss_conf 99986887689996307651101----21011047741568989999865167758997213578888----84479999
Q gi|254780976|r 20 ASIDLGTKRIGLAISDPGRRFAH----PRPFLVRKKVTQTALELLSFITTENIAAFIIGLPLNMNGSE----GPRVHSTR 91 (160)
Q Consensus 20 LgiD~G~kriGiAisd~~~~~a~----Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVvGlP~~~~g~~----~~~~~~v~ 91 (160)
||+|+|-.++=+|.-+..+.+.. |++.. ++.+.+-..|.++...+..... ++ +.|.|+- ..+.+=|+
T Consensus 1 iG~DIGGAnlK~a~~~~~g~i~~~~~~~~plW--k~~~~L~~~l~~i~~~~~~~~~-~a--vTMTgELaD~F~~R~~GV~ 75 (318)
T TIGR03123 1 LGIDIGGANTKAAELDEDGRIKEVHQLYCPLW--KGNDKLAETLKEISQDLSSADN-VA--VTMTGELADCFEDKAEGVE 75 (318)
T ss_pred CCEEECCCEEEEEEECCCCCEEEEEEEECCCC--CCCHHHHHHHHHHHHHCCCCCE-EE--EEEEHHHHHHHCCHHHHHH
T ss_conf 91552550147888538986889998517021--7836789999999985583131-89--9920657654316999999
Q ss_pred HHHHHHHHHCCCCCEEEE--CCCCCHHH
Q ss_conf 999999840489977998--36716799
Q gi|254780976|r 92 AFVHNMIDRKVYVPFVFW--DERLTTVS 117 (160)
Q Consensus 92 ~f~~~L~~~~~~l~v~~~--DEr~TS~~ 117 (160)
...+.+. ..+.-++.++ |..+-+.+
T Consensus 76 ~I~~~~~-~~~~~~~~~y~~~g~f~~~~ 102 (318)
T TIGR03123 76 FILAAVE-SAFGSPVSVFASDGGFVSAE 102 (318)
T ss_pred HHHHHHH-HHCCCCEEEEECCCCCCCHH
T ss_conf 9999999-87499669994688533388
No 151
>cd07025 Peptidase_S66 LD-Carboxypeptidase, a serine protease, includes microcin C7 self immunity protein. LD-carboxypeptidase (Muramoyltetrapeptide carboxypeptidase; EC 3.4.17.13; Merops family S66; initially described as Carboxypeptidase II) family also includes the microcin c7 self-immunity protein (MccF) as well as uncharacterized proteins including hypothetical proteins. LD-carboxypeptidase hydrolyzes the amide bond that links the dibasic amino acids to C-terminal D-amino acids. The physiological substrates of LD-carboxypeptidase are tetrapeptide fragments (such as UDP-MurNAc-tetrapeptides) that are produced when bacterial cell walls are degraded; they contain an L-configured residue (L-lysine or meso-diaminopimelic acid residue) as the penultimate residue and D-alanine as the ultimate residue. A possible role of LD-carboxypeptidase is in peptidoglycan recycling whereby the resulting tripeptide (precursor for murein synthesis) can be reconverted into peptidoglycan by attachment o
Probab=20.27 E-value=69 Score=13.88 Aligned_cols=42 Identities=10% Similarity=0.262 Sum_probs=25.0
Q ss_pred HCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE
Q ss_conf 16775899721357888884479999999999840489977998
Q gi|254780976|r 66 ENIAAFIIGLPLNMNGSEGPRVHSTRAFVHNMIDRKVYVPFVFW 109 (160)
Q Consensus 66 ~~i~~iVvGlP~~~~g~~~~~~~~v~~f~~~L~~~~~~l~v~~~ 109 (160)
.++.+||+|-+.+.+... +....+.++..+.... +++||.+-
T Consensus 222 ~~~~gii~G~f~~~~~~~-~~~~~~~~il~~~~~~-~~iPv~~~ 263 (282)
T cd07025 222 DKVAGIILGRFTDCEDND-DFGYTLEEVLKEVLGD-LGIPVLYG 263 (282)
T ss_pred CCCCEEEEECCCCCCCCC-CCCCCHHHHHHHHHHH-CCCCEEEC
T ss_conf 357089995687788877-6573399999998751-79818989
Done!