Query gi|254780758|ref|YP_003065171.1| hypothetical protein CLIBASIA_03225 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 216 No_of_seqs 176 out of 7460 Neff 8.5 Searched_HMMs 39220 Date Sun May 29 19:56:34 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780758.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR03413 GSH_gloB hydroxyacyl 100.0 0 0 334.5 16.0 182 8-215 1-219 (248) 2 PRK10241 hydroxyacylglutathion 100.0 0 0 327.5 16.8 183 7-215 2-220 (251) 3 KOG0813 consensus 100.0 1.4E-38 3.5E-43 244.0 13.0 175 15-214 12-232 (265) 4 KOG0814 consensus 100.0 8.5E-34 2.2E-38 216.1 10.1 177 15-213 20-202 (237) 5 smart00849 Lactamase_B Metallo 100.0 8.1E-30 2.1E-34 193.0 15.9 177 13-196 3-183 (183) 6 COG0491 GloB Zn-dependent hydr 100.0 3.8E-28 9.8E-33 183.3 15.7 197 3-200 9-214 (252) 7 PRK11921 metallo-beta-lactamas 99.8 5.7E-20 1.5E-24 135.8 13.1 168 13-200 28-227 (395) 8 COG0426 FpaA Uncharacterized f 99.8 1.1E-19 2.8E-24 134.1 12.6 170 13-201 33-227 (388) 9 pfam00753 Lactamase_B Metallo- 99.8 7.5E-20 1.9E-24 135.1 10.7 141 13-158 3-146 (148) 10 PRK05452 anaerobic nitric oxid 99.8 3.6E-19 9.1E-24 131.2 11.2 166 16-200 35-231 (479) 11 COG0595 mRNA degradation ribon 99.8 4.6E-18 1.2E-22 124.7 11.0 192 3-206 6-217 (555) 12 TIGR00649 MG423 conserved hypo 99.7 3.9E-17 1E-21 119.3 9.5 194 6-212 1-234 (593) 13 PRK11539 hypothetical protein; 99.7 2.9E-15 7.3E-20 108.5 14.6 167 3-210 498-680 (754) 14 TIGR03675 arCOG00543 arCOG0178 99.6 4.4E-16 1.1E-20 113.2 7.3 149 5-156 174-347 (630) 15 COG2333 ComEC Predicted hydrol 99.6 7.1E-14 1.8E-18 100.4 11.6 175 5-210 45-232 (293) 16 PRK00055 ribonuclease Z; Revie 99.5 9.4E-14 2.4E-18 99.7 8.6 145 4-158 2-165 (259) 17 COG1236 YSH1 Predicted exonucl 99.5 1.3E-13 3.3E-18 98.9 6.2 137 16-156 14-164 (427) 18 PRK02113 putative hydrolase; P 99.3 3.7E-11 9.4E-16 84.6 10.1 133 15-156 33-170 (252) 19 COG1237 Metal-dependent hydrol 99.2 1.4E-10 3.7E-15 81.2 11.4 66 17-84 23-93 (259) 20 COG1782 Predicted metal-depend 99.2 4.4E-11 1.1E-15 84.2 6.7 150 5-157 180-354 (637) 21 TIGR02651 RNase_Z ribonuclease 99.2 6E-11 1.5E-15 83.4 7.0 128 16-147 18-169 (327) 22 COG2015 Alkyl sulfatase and re 99.2 5.2E-10 1.3E-14 77.9 11.6 196 7-206 116-356 (655) 23 PRK11244 phnP carbon-phosphoru 99.1 1.1E-10 2.8E-15 81.9 6.6 124 17-157 37-164 (250) 24 PRK05184 pyrroloquinoline quin 99.1 5.6E-10 1.4E-14 77.7 8.3 76 18-93 40-122 (302) 25 TIGR03307 PhnP phosphonate met 99.1 4.4E-10 1.1E-14 78.4 6.5 122 18-156 38-163 (249) 26 PRK02126 ribonuclease Z; Provi 99.0 3.1E-09 7.9E-14 73.5 9.6 85 1-89 1-90 (334) 27 KOG1136 consensus 98.9 2.7E-09 6.8E-14 73.8 6.1 150 4-158 2-181 (501) 28 KOG4736 consensus 98.9 1.7E-09 4.4E-14 74.9 5.0 153 15-203 94-261 (302) 29 PRK00685 metal-dependent hydro 98.9 5.2E-09 1.3E-13 72.2 6.6 163 11-198 3-194 (228) 30 COG1234 ElaC Metal-dependent h 98.8 3.8E-08 9.6E-13 67.2 8.6 76 4-84 2-93 (292) 31 KOG1137 consensus 98.8 1.8E-09 4.5E-14 74.9 1.5 149 5-156 13-182 (668) 32 COG1235 PhnP Metal-dependent h 98.8 3.7E-07 9.3E-12 61.4 13.3 81 1-84 1-95 (269) 33 PRK04286 hypothetical protein; 98.7 5.1E-07 1.3E-11 60.6 13.2 147 4-156 1-186 (294) 34 KOG1135 consensus 98.6 7.7E-08 2E-12 65.3 6.3 138 15-155 14-171 (764) 35 pfam02112 PDEase_II cAMP phosp 98.6 3.3E-07 8.3E-12 61.7 8.6 151 6-156 1-204 (323) 36 COG2220 Predicted Zn-dependent 98.5 5.1E-07 1.3E-11 60.6 7.5 172 9-197 7-212 (258) 37 COG2248 Predicted hydrolase (m 98.2 2.2E-05 5.7E-10 51.0 10.9 145 7-156 2-185 (304) 38 KOG1361 consensus 97.5 7.7E-05 2E-09 47.9 3.4 110 29-156 91-205 (481) 39 PRK11709 putative L-ascorbate 97.4 0.00082 2.1E-08 41.9 7.4 134 51-198 109-284 (355) 40 KOG2121 consensus 97.2 8.2E-05 2.1E-09 47.8 0.1 56 18-74 463-524 (746) 41 COG5212 PDE1 Low-affinity cAMP 97.0 0.00076 2E-08 42.1 3.7 106 51-156 112-232 (356) 42 TIGR00361 ComEC_Rec2 DNA inter 96.9 0.0078 2E-07 36.3 8.0 172 5-205 482-686 (731) 43 TIGR02649 true_RNase_BN ribonu 96.6 0.0052 1.3E-07 37.3 5.5 108 29-145 33-149 (304) 44 TIGR02650 RNase_Z_T_toga ribon 96.3 0.0014 3.5E-08 40.6 1.1 55 28-85 19-82 (277) 45 TIGR02108 PQQ_syn_pqqB coenzym 94.3 0.19 4.9E-06 28.2 6.4 74 15-91 36-121 (314) 46 TIGR01393 lepA GTP-binding pro 93.5 0.3 7.6E-06 27.1 6.3 22 169-192 306-327 (598) 47 COG0462 PrsA Phosphoribosylpyr 84.7 1.7 4.3E-05 22.7 4.3 31 26-56 51-90 (314) 48 PTZ00145 phosphoribosylpyropho 83.7 3.2 8.2E-05 21.1 5.4 31 181-211 360-391 (443) 49 COG0481 LepA Membrane GTPase L 83.7 3.4 8.6E-05 20.9 7.0 52 137-190 256-328 (603) 50 TIGR02884 spore_pdaA delta-lac 82.8 2 5.1E-05 22.3 4.0 56 15-74 25-84 (225) 51 PRK04923 ribose-phosphate pyro 77.7 5.5 0.00014 19.7 4.9 32 181-212 238-270 (319) 52 PRK02458 ribose-phosphate pyro 77.4 4.4 0.00011 20.3 4.3 38 174-212 233-271 (323) 53 PRK02269 ribose-phosphate pyro 77.3 4 0.0001 20.5 4.1 30 181-210 238-268 (321) 54 PRK06827 phosphoribosylpyropho 77.3 5.4 0.00014 19.8 4.8 31 175-207 281-312 (381) 55 PRK07199 phosphoribosylpyropho 76.5 5.6 0.00014 19.7 4.7 37 174-211 226-263 (301) 56 PRK01999 consensus 76.2 4.7 0.00012 20.1 4.2 30 181-210 233-263 (311) 57 PRK04554 consensus 76.1 4 0.0001 20.5 3.8 35 175-210 231-266 (327) 58 PRK02270 consensus 76.0 3.9 0.0001 20.6 3.7 37 175-212 230-267 (327) 59 PRK01506 consensus 75.5 5.8 0.00015 19.6 4.5 29 182-210 239-268 (317) 60 PRK00553 ribose-phosphate pyro 75.2 5.5 0.00014 19.7 4.3 30 181-210 247-277 (340) 61 TIGR02168 SMC_prok_B chromosom 74.9 3.7 9.4E-05 20.7 3.4 22 128-149 1085-1107(1191) 62 PRK02039 consensus 74.7 4.5 0.00012 20.2 3.8 30 181-210 235-265 (316) 63 PRK05259 consensus 74.0 4.2 0.00011 20.4 3.5 34 175-209 225-259 (310) 64 PRK10966 exonuclease subunit S 73.9 3.9 9.9E-05 20.6 3.3 10 170-179 290-299 (402) 65 PRK03675 consensus 73.6 6.8 0.00017 19.2 4.5 36 174-210 216-252 (279) 66 PRK01259 ribose-phosphate pyro 73.6 5.9 0.00015 19.5 4.2 30 181-210 229-259 (309) 67 PRK02812 ribose-phosphate pyro 73.6 4.5 0.00011 20.3 3.5 27 182-208 252-279 (331) 68 PRK05038 consensus 72.7 5.3 0.00014 19.8 3.7 35 175-210 229-264 (315) 69 PRK03092 ribose-phosphate pyro 72.6 7.5 0.00019 19.0 4.6 29 182-210 223-252 (304) 70 TIGR01011 rpsB_bact ribosomal 71.1 7.6 0.00019 18.9 4.2 17 47-63 178-194 (227) 71 PRK01132 consensus 70.7 6 0.00015 19.5 3.6 33 174-207 218-251 (286) 72 TIGR01978 sufC FeS assembly AT 70.0 5.4 0.00014 19.8 3.3 14 55-68 79-92 (248) 73 PRK00934 ribose-phosphate pyro 68.0 9.4 0.00024 18.4 4.4 36 174-210 219-255 (286) 74 PRK04117 consensus 67.3 7.5 0.00019 18.9 3.5 36 174-210 226-262 (309) 75 KOG3592 consensus 67.0 4 0.0001 20.5 2.1 24 51-74 81-104 (934) 76 KOG3798 consensus 66.2 6.2 0.00016 19.4 2.9 93 51-158 132-241 (343) 77 TIGR01821 5aminolev_synth 5-am 64.0 4 0.0001 20.5 1.7 19 123-141 390-408 (427) 78 PRK09382 ispDF bifunctional 2- 62.7 7.5 0.00019 18.9 2.8 70 114-187 232-309 (382) 79 TIGR01501 MthylAspMutase methy 61.5 5.7 0.00014 19.6 2.0 45 150-207 85-130 (134) 80 PRK02277 orotate phosphoribosy 60.5 13 0.00033 17.5 5.1 14 145-158 139-152 (201) 81 COG4566 TtrR Response regulato 57.9 15 0.00037 17.3 4.3 44 28-72 50-96 (202) 82 PRK11624 cdsA CDP-diglyceride 57.8 5 0.00013 20.0 1.2 26 173-198 234-259 (285) 83 TIGR03348 VI_IcmF type VI secr 57.1 15 0.00038 17.2 4.1 44 16-63 153-213 (1169) 84 TIGR01344 malate_syn_A malate 55.9 6.1 0.00016 19.5 1.4 40 161-215 257-296 (522) 85 TIGR00475 selB selenocysteine- 55.4 7.6 0.00019 18.9 1.9 20 42-61 101-120 (627) 86 TIGR00619 sbcd nuclease SbcCD, 55.0 13 0.00033 17.6 2.9 43 42-84 76-120 (275) 87 cd04170 EF-G_bact Elongation f 54.0 14 0.00035 17.4 3.0 50 29-79 92-148 (268) 88 TIGR02427 protocat_pcaD 3-oxoa 53.5 11 0.00028 18.0 2.4 33 35-67 24-61 (256) 89 PRK01215 competence damage-ind 52.9 18 0.00045 16.8 5.8 82 1-84 1-97 (264) 90 pfam01148 CTP_transf_1 Cytidyl 52.4 7.6 0.00019 18.9 1.5 27 173-199 212-238 (259) 91 TIGR01090 apt adenine phosphor 51.7 19 0.00047 16.7 3.7 32 146-190 113-144 (175) 92 PRK10161 transcriptional regul 51.1 19 0.00048 16.6 5.1 57 144-200 166-225 (229) 93 COG0575 CdsA CDP-diglyceride s 50.6 9.5 0.00024 18.3 1.7 27 173-199 217-243 (265) 94 TIGR02362 dhaK1b probable dihy 49.7 13 0.00034 17.5 2.3 16 41-56 114-129 (328) 95 TIGR00972 3a0107s01c2 phosphat 49.3 19 0.00049 16.6 3.1 30 25-54 26-61 (248) 96 PRK09836 DNA-binding transcrip 47.7 21 0.00055 16.3 3.7 56 144-199 162-220 (226) 97 PRK05433 GTP-binding protein L 45.1 23 0.00059 16.1 3.0 55 137-193 254-329 (601) 98 KOG1448 consensus 43.3 25 0.00064 15.9 5.8 34 175-209 230-264 (316) 99 COG0104 PurA Adenylosuccinate 43.1 25 0.00065 15.9 3.8 50 3-57 49-100 (430) 100 TIGR01307 pgm_bpd_ind 2,3-bisp 42.7 9.1 0.00023 18.4 0.6 45 39-83 100-152 (529) 101 PRK01117 adenylosuccinate synt 41.6 22 0.00057 16.2 2.5 45 3-52 49-93 (431) 102 TIGR02515 IV_pilus_PilQ type I 39.2 29 0.00074 15.5 4.3 39 19-57 159-197 (464) 103 pfam00709 Adenylsucc_synt Aden 37.8 28 0.00073 15.6 2.5 45 3-52 47-91 (420) 104 pfam10070 DUF2309 Uncharacteri 37.1 26 0.00065 15.8 2.2 20 186-212 494-513 (783) 105 TIGR00151 ispF 2C-methyl-D-ery 36.9 32 0.00081 15.3 3.2 68 113-186 9-90 (159) 106 pfam08747 DUF1788 Domain of un 35.5 15 0.00038 17.2 0.8 58 155-214 62-121 (125) 107 TIGR02199 rfaE_dom_II rfaE bif 34.8 20 0.00052 16.4 1.4 30 42-74 3-34 (144) 108 pfam06415 iPGM_N BPG-independe 34.7 35 0.00088 15.1 3.8 40 41-80 19-63 (223) 109 PRK10816 DNA-binding transcrip 34.4 35 0.00089 15.1 6.7 57 143-199 160-219 (223) 110 TIGR00968 3a0106s01 sulfate AB 34.3 35 0.0009 15.0 3.5 28 173-200 167-197 (241) 111 PRK05434 phosphoglyceromutase; 33.3 37 0.00093 14.9 3.7 40 42-81 102-147 (511) 112 TIGR00644 recJ single-stranded 32.4 38 0.00096 14.9 4.2 31 28-60 116-147 (705) 113 PRK13784 adenylosuccinate synt 31.3 39 0.001 14.8 2.4 45 3-52 49-93 (428) 114 PRK13811 orotate phosphoribosy 30.0 42 0.0011 14.6 5.5 10 61-70 56-65 (170) 115 TIGR02998 RraA_entero regulato 29.3 25 0.00065 15.9 1.1 33 7-39 35-70 (161) 116 smart00788 Adenylsucc_synt Ade 28.7 44 0.0011 14.5 2.7 45 3-52 47-91 (421) 117 TIGR01369 CPSaseII_lrg carbamo 28.5 44 0.0011 14.5 6.4 48 105-158 761-809 (1089) 118 PRK11148 cyclic 3',5'-adenosin 28.5 44 0.0011 14.5 3.9 50 32-82 36-91 (275) 119 PRK13787 adenylosuccinate synt 28.3 43 0.0011 14.5 2.1 45 3-52 49-93 (423) 120 TIGR02439 catechol_proteo cate 27.9 45 0.0012 14.4 2.7 74 123-201 144-235 (288) 121 TIGR00505 ribA GTP cyclohydrol 27.9 45 0.0012 14.4 3.0 18 33-50 76-93 (227) 122 TIGR01738 bioH putative pimelo 27.6 46 0.0012 14.4 2.5 29 172-202 204-236 (248) 123 TIGR01394 TypA_BipA GTP-bindin 27.3 30 0.00078 15.4 1.2 36 29-65 98-142 (609) 124 PRK00143 trmU tRNA (5-methylam 27.2 47 0.0012 14.3 4.4 101 35-136 14-126 (355) 125 TIGR01935 NOT-MenG RraA family 27.2 24 0.00062 16.0 0.7 44 8-51 37-89 (155) 126 cd04168 TetM_like Tet(M)-like 26.6 48 0.0012 14.3 4.0 10 172-181 193-202 (237) 127 COG0075 Serine-pyruvate aminot 25.6 38 0.00098 14.8 1.5 34 28-62 105-142 (383) 128 TIGR01452 PGP_euk phosphoglyco 25.3 51 0.0013 14.1 3.5 19 38-58 23-41 (288) 129 COG0856 Orotate phosphoribosyl 25.0 51 0.0013 14.1 5.3 10 148-157 142-151 (203) 130 COG1107 Archaea-specific RecJ- 24.8 52 0.0013 14.1 2.8 10 55-64 447-456 (715) 131 PRK00090 bioD dithiobiotin syn 24.3 53 0.0013 14.0 4.2 22 39-60 18-41 (223) 132 pfam03054 tRNA_Me_trans tRNA m 23.2 56 0.0014 13.9 4.1 101 35-136 10-124 (354) 133 COG0311 PDX2 Predicted glutami 23.2 35 0.00089 15.1 0.9 20 31-50 42-61 (194) 134 TIGR02191 RNaseIII ribonucleas 22.8 34 0.00087 15.1 0.8 12 195-206 98-109 (228) 135 TIGR02855 spore_yabG sporulati 22.8 57 0.0014 13.8 2.2 27 173-199 148-174 (292) 136 PRK13783 adenylosuccinate synt 22.5 58 0.0015 13.8 2.7 44 3-51 48-91 (404) 137 cd01449 TST_Repeat_2 Thiosulfa 22.4 58 0.0015 13.8 3.7 55 133-201 38-92 (118) 138 COG4822 CbiK Cobalamin biosynt 22.1 59 0.0015 13.8 4.0 10 41-50 65-74 (265) 139 COG2241 CobL Precorrin-6B meth 21.9 59 0.0015 13.7 2.3 46 30-77 48-94 (210) 140 pfam08759 DUF1792 Domain of un 21.7 60 0.0015 13.7 3.8 53 151-203 128-182 (225) 141 pfam02542 YgbB YgbB family. Th 21.3 54 0.0014 13.9 1.6 31 115-145 12-43 (157) 142 TIGR02017 hutG_amidohyd N-form 20.5 63 0.0016 13.6 3.9 53 134-197 82-148 (269) 143 PRK12312 infB translation init 20.5 44 0.0011 14.5 1.0 29 25-55 162-190 (610) 144 TIGR03597 GTPase_YqeH ribosome 20.4 64 0.0016 13.5 3.7 32 41-74 116-147 (360) 145 cd06315 PBP1_ABC_sugar_binding 20.1 65 0.0016 13.5 5.6 58 25-82 29-87 (280) No 1 >TIGR03413 GSH_gloB hydroxyacylglutathione hydrolase. Members of this protein family are hydroxyacylglutathione hydrolase, a detoxification enzyme known as glyoxalase II. It follows lactoylglutathione lyase, or glyoxalase I, and acts to remove the toxic metabolite methylglyoxal and related compounds. This protein belongs to the broader metallo-beta-lactamase family (pfam00753). Probab=100.00 E-value=0 Score=334.46 Aligned_cols=182 Identities=35% Similarity=0.618 Sum_probs=163.6 Q ss_pred EEECCCCCCCE-EEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHHCCCEEECCHHH Q ss_conf 99708765716-99998899849998389999999999997499752999869885210110567886079554058438 Q gi|254780758|r 8 IVQVTPFKQNC-TFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKEDLSLTIIGPHKDD 86 (216) Q Consensus 8 ~i~v~~~~~N~-~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~~~~~i~~~~~~~ 86 (216) +++++.++.|. |+|.+++ ++|++|||| +.+.+++.+++.++++++|++||.|+||++|+..|++++++++|++..+. T Consensus 1 i~~ip~l~DNy~YLi~d~~-~~a~vIDP~-d~~~i~~~l~~~~l~l~~IL~TH~H~DHi~G~~~L~~~~~~~Vyg~~~~~ 78 (248) T TIGR03413 1 IIPIPALSDNYIWLLHDPD-GQAAVVDPG-EAEPVLDALEARGLTLTAILLTHHHHDHVGGVAELLEAFPAPVYGPAEER 78 (248) T ss_pred CEEECEECCEEEEEEEECC-CCEEEECCC-CCHHHHHHHHHCCCEEEEEEECCCCCCCCCCHHHHHHHCCCCCCCCHHHC T ss_conf 9533542565899999189-858998799-80999999998799678999489886001318999975146612011116 Q ss_pred HHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCC Q ss_conf 98840123444432002346666741896478399762012331012224721000001234321111000001356634 Q gi|254780758|r 87 AAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNFENNFAHLGDTLFRSSIGRTD 166 (216) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~~~~~lftGD~lf~~~~G~~d 166 (216) ....++.+.+||++.+|+.++++++|||||+||+||+++++++|||||+||.+||||. T Consensus 79 ---------------------~~~~d~~l~dgd~i~ig~~~~~vi~TPGHT~ghv~f~~~~~~~lFtGDTLF~gGcGR~- 136 (248) T TIGR03413 79 ---------------------IPGITHPVKDGDTVTLGGLEFEVLAVPGHTLGHIAYYLPDSPALFCGDTLFSAGCGRL- 136 (248) T ss_pred ---------------------CCCCEEEECCCCEEEECCEEEEEEECCCCCCCCEEEEECCCEEEEECCEEECCCCCCC- T ss_conf ---------------------8777078258988988998899996789973006899557407984463441650267- Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCEEEEECCCC------------------------------------CCCHHHHHHHCH Q ss_conf 67989999999999998519997899848879------------------------------------888899988281 Q gi|254780758|r 167 ILHGDYQQLINSINNKILPLGDEISFVCGHGP------------------------------------NSTIGRERRLNP 210 (216) Q Consensus 167 ~~~~d~~~~~~sl~~~l~~l~~~~~v~PgHG~------------------------------------~tt~~~e~~~n~ 210 (216) ++|++++|++|| +||.+||++|+|||||.+ |||++.||+.|| T Consensus 137 -feG~~~~m~~SL-~kl~~Lpd~T~vypGHeYt~~Nl~Fa~~v~p~n~~l~~~~~~~~~~~~~~~~tvPstl~~E~~~NP 214 (248) T TIGR03413 137 -FEGTPEQMYDSL-QRLAALPDDTLVYCAHEYTLSNLRFALTVEPDNPALQERLKEVEALRAQGQPTLPSTLGLERATNP 214 (248) T ss_pred -CCCCHHHHHHHH-HHHHCCCCCEEEEECCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEHHHHHHHCC T ss_conf -898999999999-998648997289847550477899998748999999999999999987799816642999860494 Q ss_pred HHHCC Q ss_conf 45148 Q gi|254780758|r 211 FLKSN 215 (216) Q Consensus 211 ~~~~~ 215 (216) |||++ T Consensus 215 FlR~~ 219 (248) T TIGR03413 215 FLRAD 219 (248) T ss_pred CCCCC T ss_conf 40589 No 2 >PRK10241 hydroxyacylglutathione hydrolase; Provisional Probab=100.00 E-value=0 Score=327.49 Aligned_cols=183 Identities=30% Similarity=0.608 Sum_probs=158.1 Q ss_pred EEEECCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHHCCCEEECCHHH Q ss_conf 99970876571699998899849998389999999999997499752999869885210110567886079554058438 Q gi|254780758|r 7 HIVQVTPFKQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKEDLSLTIIGPHKDD 86 (216) Q Consensus 7 ~~i~v~~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~~~~~i~~~~~~~ 86 (216) ++++++.++.|+.||+.+++++|++|||| +++.+++.+++.++++++|++||+|+||++|+..|+++++...++...+. T Consensus 2 ~i~~ip~~~DNy~yli~d~~~~a~vIDPg-d~~~i~~~l~~~~l~l~~IL~TH~H~DHi~G~~~L~~~~~~~~v~g~~~~ 80 (251) T PRK10241 2 NLNSIPAFDDNYIWVLNDEAGRCLIVDPG-EAEPVLNAIAANNWQPEAIFLTHHHHDHVGGVKELVEKFPQIVVYGPQET 80 (251) T ss_pred EEEEEEEECCEEEEEEECCCCCEEEECCC-CHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHHCCCCEEECHHHC T ss_conf 67996125566899998599838999499-70999999997799178999458955766329999987689858670332 Q ss_pred HHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCC Q ss_conf 98840123444432002346666741896478399762012331012224721000001234321111000001356634 Q gi|254780758|r 87 AAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNFENNFAHLGDTLFRSSIGRTD 166 (216) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~~~~~lftGD~lf~~~~G~~d 166 (216) .....++.+++||++.+|+.+++|++|||||+||+||+. ++.|||||+||.+|||| T Consensus 81 --------------------~~~~~~~~v~dgd~i~~g~~~~~vi~TPGHT~Ghi~y~~--~~~lFtGDTLF~gGcGR-- 136 (251) T PRK10241 81 --------------------QDKGTTQVVKDGETAFVLGHEFSVFATPGHTLGHICYFS--KPYLFCGDTLFSGGCGR-- 136 (251) T ss_pred --------------------CCCCCCEEECCCCEEEECCEEEEEEECCCCCCCHHHHHC--CCCEEECCCCCCCCCCC-- T ss_conf --------------------676898784899999989989999977999830001113--77067515300466147-- Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCEEEEECCCC------------------------------------CCCHHHHHHHCH Q ss_conf 67989999999999998519997899848879------------------------------------888899988281 Q gi|254780758|r 167 ILHGDYQQLINSINNKILPLGDEISFVCGHGP------------------------------------NSTIGRERRLNP 210 (216) Q Consensus 167 ~~~~d~~~~~~sl~~~l~~l~~~~~v~PgHG~------------------------------------~tt~~~e~~~n~ 210 (216) +++|+++||++|| +||.+||++|+|||||.+ |||++.||+.|| T Consensus 137 ~feG~~~~m~~SL-~kl~~LP~~T~vy~GHeYt~~Nl~Fa~~vep~N~~l~~~~~~~~~~r~~~~~t~Pstl~~E~~~Np 215 (251) T PRK10241 137 LFEGTPSQMYQSL-KKINALPDDTLICCAHEYTLSNMKFALSILPHDLSINDYYRKVKELRAKNQITLPVILKNERQINL 215 (251) T ss_pred CCCCCHHHHHHHH-HHHHCCCCCCEEEECCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCC T ss_conf 8998999999999-999678998789726332287899998858999999999999999997799958854899974494 Q ss_pred HHHCC Q ss_conf 45148 Q gi|254780758|r 211 FLKSN 215 (216) Q Consensus 211 ~~~~~ 215 (216) |||++ T Consensus 216 FlR~~ 220 (251) T PRK10241 216 FLRTE 220 (251) T ss_pred CCCCC T ss_conf 11589 No 3 >KOG0813 consensus Probab=100.00 E-value=1.4e-38 Score=243.95 Aligned_cols=175 Identities=32% Similarity=0.579 Sum_probs=147.3 Q ss_pred CCCEEEEEE-CCCCEEEEECCCCCHHHHHHHHHH---CCCCEEEEEECCCCCCCCCCCHHHHHHHC--CCEEECCHHHHH Q ss_conf 571699998-899849998389999999999997---49975299986988521011056788607--955405843898 Q gi|254780758|r 15 KQNCTFLFD-EESLEAVVVDPGGDLDIIKDVIQS---RNFHVKQIWITHGHVDHVGGAAQLKEDLS--LTIIGPHKDDAA 88 (216) Q Consensus 15 ~~N~~li~~-~~~~~~ilID~G~~~~~i~~~l~~---~~~~i~~Ii~TH~H~DH~gg~~~l~~~~~--~~i~~~~~~~~~ 88 (216) ++|+||+.+ +.++.+.++||. .++.+...+++ .+.++++|+.||.|+||+||+..|.+.++ +.+|...+. T Consensus 12 ~Ny~YLl~~~~~~~~a~~vDP~-~pe~v~~~~~~~~~~~~~l~~Il~THhH~DHsGGn~~i~~~~~~~~~v~g~~~~--- 87 (265) T KOG0813 12 DNYMYLLGDGDKTIDADLVDPA-EPEYVIPSLKKLDDENRRLTAILTTHHHYDHSGGNEDIKREIPYDIKVIGGADD--- 87 (265) T ss_pred CCEEEEEECCCCEEEEEEECCC-CHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHCCCCCEEECCCHH--- T ss_conf 7569997536411365322676-318888888765530476468995222421467678887413578279668700--- Q ss_pred HHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCHHHHCCCCCC Q ss_conf 84012344443200234666674189647839976201233101222472100000123---432111100000135663 Q gi|254780758|r 89 MMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNFE---NNFAHLGDTLFRSSIGRT 165 (216) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~~---~~~lftGD~lf~~~~G~~ 165 (216) ......+.++++|++.+|+.++++++|||||.||+|||+.+ ++++||||+||..+||| T Consensus 88 ------------------r~~~i~~~~~~~e~~~~~g~~v~~l~TPgHT~~hi~~~~~~~~~e~~iFtGDtlf~~GcG~- 148 (265) T KOG0813 88 ------------------RIPGITRGLKDGETVTVGGLEVRCLHTPGHTAGHICYYVTESTGERAIFTGDTLFGAGCGR- 148 (265) T ss_pred ------------------CCCCCCCCCCCCCEEEECCEEEEEEECCCCCCCCEEEEEECCCCCCEEEECCCEEECCCCC- T ss_conf ------------------0755344678886799898889998389856880899951578997588578265358623- Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCC-------------------------------------CCCHHHHHHH Q ss_conf 467989999999999998519997899848879-------------------------------------8888999882 Q gi|254780758|r 166 DILHGDYQQLINSINNKILPLGDEISFVCGHGP-------------------------------------NSTIGRERRL 208 (216) Q Consensus 166 d~~~~d~~~~~~sl~~~l~~l~~~~~v~PgHG~-------------------------------------~tt~~~e~~~ 208 (216) +++|.+.+|..|+.+ +..|+++++|||||++ ++|+++||.. T Consensus 149 -~FEgt~~~M~~sl~~-l~~L~~~t~iypGHeYt~~n~kf~~~ve~~n~~~q~~l~~~~~~~~~~~~t~~p~tl~~e~~~ 226 (265) T KOG0813 149 -FFEGTAEQMDSSLNE-LIALPDDTRIYPGHEYTKSNLKFARYVEPRNEVEQEKLDWLVERRSKEKPTMVPSTLGEEKLY 226 (265) T ss_pred -HHCCCHHHHHHHHHH-HHCCCCCCEECCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHC T ss_conf -315778999985887-624798856846842330301045420555388999999999875326764670048899861 Q ss_pred CHHHHC Q ss_conf 814514 Q gi|254780758|r 209 NPFLKS 214 (216) Q Consensus 209 n~~~~~ 214 (216) |||+|- T Consensus 227 Npf~r~ 232 (265) T KOG0813 227 NPFMRL 232 (265) T ss_pred CCHHHC T ss_conf 941112 No 4 >KOG0814 consensus Probab=100.00 E-value=8.5e-34 Score=216.12 Aligned_cols=177 Identities=31% Similarity=0.412 Sum_probs=154.5 Q ss_pred CCCEEEEEECCCCEEEEECCCCC-HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHHC-CCEEECCHHHHHHHCC Q ss_conf 57169999889984999838999-999999999749975299986988521011056788607-9554058438988401 Q gi|254780758|r 15 KQNCTFLFDEESLEAVVVDPGGD-LDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKEDLS-LTIIGPHKDDAAMMGK 92 (216) Q Consensus 15 ~~N~~li~~~~~~~~ilID~G~~-~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~~~-~~i~~~~~~~~~~~~~ 92 (216) .+-.|++.|..++++++|||-.+ .++=.+.+++.|+++.|-++||.|.||+-|...|+..+| ++-+.+ T Consensus 20 sTytYll~d~~~~~AviIDPV~et~~RD~qlikdLgl~LiYa~NTH~HADHiTGtg~Lkt~~pg~kSVis---------- 89 (237) T KOG0814 20 STYTYLLGDHKTGKAVIIDPVLETVSRDAQLIKDLGLDLIYALNTHVHADHITGTGLLKTLLPGCKSVIS---------- 89 (237) T ss_pred CEEEEEEEECCCCCEEEECCHHHCCCCHHHHHHHCCCEEEEEECCEEECCCCCCCCHHHHHCCCHHHHHH---------- T ss_conf 5178885507888558964056325305799986180014423040320330163337776421888865---------- Q ss_pred HHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCH Q ss_conf 23444432002346666741896478399762012331012224721000001234321111000001356634679899 Q gi|254780758|r 93 VDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNFENNFAHLGDTLFRSSIGRTDILHGDY 172 (216) Q Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~~~~~lftGD~lf~~~~G~~d~~~~d~ 172 (216) .......|..+++||.+++|+..+++..|||||+|++.|...+.+..||||+|+.++||||||.+|.+ T Consensus 90 ------------~~SGakAD~~l~~Gd~i~~G~~~le~ratPGHT~GC~TyV~~d~~~aFTGDalLIRgCGRTDFQqG~~ 157 (237) T KOG0814 90 ------------SASGAKADLHLEDGDIIEIGGLKLEVRATPGHTNGCVTYVEHDLRMAFTGDALLIRGCGRTDFQQGCP 157 (237) T ss_pred ------------HCCCCCCCCCCCCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCEEEECCEEEEECCCCCCHHCCCH T ss_conf ------------40365312134889778774579998518998776379996575125624656785147630311685 Q ss_pred HHHHHHHHHHHHCCCCCEEEEECCCC----CCCHHHHHHHCHHHH Q ss_conf 99999999998519997899848879----888899988281451 Q gi|254780758|r 173 QQLINSINNKILPLGDEISFVCGHGP----NSTIGRERRLNPFLK 213 (216) Q Consensus 173 ~~~~~sl~~~l~~l~~~~~v~PgHG~----~tt~~~e~~~n~~~~ 213 (216) +++++|+.++|..||.+++|||+|.+ .+|+.+||+.||-|. T Consensus 158 ~~LyesVH~kIFTLP~d~~iYpaHdY~G~~~stV~EEk~~NPRLT 202 (237) T KOG0814 158 ASLYESVHSKIFTLPEDYLIYPAHDYKGFLVSTVWEEKNLNPRLT 202 (237) T ss_pred HHHHHHHHHHHEECCCCEEEEECCCCCCEEEEEHHHHHCCCCCCC T ss_conf 888998867752078751896535647558651344533396222 No 5 >smart00849 Lactamase_B Metallo-beta-lactamase superfamily. Apart from the beta-lactamases a number of other proteins contain this domain PUBMED:7588620. These proteins include thiolesterases, members of the glyoxalase II family, that catalyse the hydrolysis of S-D-lactoyl-glutathione to form glutathione and D-lactic acid and a competence protein that is essential for natural transformation in Neisseria gonorrhoeae and could be a transporter involved in DNA uptake. Except for the competence protein these proteins bind two zinc ions per molecule as cofactor. Probab=99.97 E-value=8.1e-30 Score=193.00 Aligned_cols=177 Identities=28% Similarity=0.434 Sum_probs=132.6 Q ss_pred CCCCCEEEEEECCCCEEEEECCCCC-HHHHHHHHHHCCC-CEEEEEECCCCCCCCCCCHHHHHHHCCCEEECCHHHHHHH Q ss_conf 7657169999889984999838999-9999999997499-7529998698852101105678860795540584389884 Q gi|254780758|r 13 PFKQNCTFLFDEESLEAVVVDPGGD-LDIIKDVIQSRNF-HVKQIWITHGHVDHVGGAAQLKEDLSLTIIGPHKDDAAMM 90 (216) Q Consensus 13 ~~~~N~~li~~~~~~~~ilID~G~~-~~~i~~~l~~~~~-~i~~Ii~TH~H~DH~gg~~~l~~~~~~~i~~~~~~~~~~~ 90 (216) ...+|||+|.+++ +++|||||.. .+.+++.+++.+. +|++|++||.|+||+||+..|++++++++|++..+...+ T Consensus 3 ~~~~n~~li~~~~--~~vliD~G~~~~~~~~~~l~~~~~~~i~~vv~TH~H~DH~gg~~~~~~~~~~~i~~~~~~~~~~- 79 (183) T smart00849 3 GVGVNSYLVEGDG--GAILIDTGPGEAEDLLAELKKLGPKDIDAIILTHGHPDHIGGLPELLEAPGAPVYAPEGTAELL- 79 (183) T ss_pred CCCCEEEEEEECC--EEEEEECCCCHHHHHHHHHHHCCCCCCCEEEECCCCCCHHCHHHHHHHHCCCEEEECHHHHHHH- T ss_conf 8641899999899--8999929898099999999965998874999899994003309999873299899879999998- Q ss_pred CCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCC--C Q ss_conf 01234444320023466667418964783997620123310122247210000012343211110000013566346--7 Q gi|254780758|r 91 GKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNFENNFAHLGDTLFRSSIGRTDI--L 168 (216) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~~~~~lftGD~lf~~~~G~~d~--~ 168 (216) ...... ............+...+++++++.+++..++++++||||+|+++++.++.++||+||++|....+.... + T Consensus 80 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pGHt~g~~~~~~~~~~vl~~GD~~~~~~~~~~~~~~~ 157 (183) T smart00849 80 KDLLKL--GGALGAEAPPPPPDRTLKDGEELDLGGLELEVIHTPGHTPGSIVLYLPEGKILFTGDLLFSGGIGRTDDDGG 157 (183) T ss_pred HCCHHC--CCCCCCCCCCCCCCEECCCCCEEEECCEEEEEEECCCCCCCCEEEEECCCCEEEEEEEECCCCCCCCCCCCC T ss_conf 610010--333333467776616728999999899869986468878898999989999999988989798777478999 Q ss_pred CCCHHHHHHHHHHHHHCCCCCEEEEECC Q ss_conf 9899999999999985199978998488 Q gi|254780758|r 169 HGDYQQLINSINNKILPLGDEISFVCGH 196 (216) Q Consensus 169 ~~d~~~~~~sl~~~l~~l~~~~~v~PgH 196 (216) ..+..++++++ +++..+.+++ ++||| T Consensus 158 ~~~~~~~~~~l-~~~~~~~~~~-i~pgH 183 (183) T smart00849 158 DASASDSLESL-LKLLALDPEL-VVPGH 183 (183) T ss_pred CHHHHHHHHHH-HHHHCCCCCE-EECCC T ss_conf 56599999999-9987799999-97797 No 6 >COG0491 GloB Zn-dependent hydrolases, including glyoxylases [General function prediction only] Probab=99.96 E-value=3.8e-28 Score=183.28 Aligned_cols=197 Identities=30% Similarity=0.493 Sum_probs=134.1 Q ss_pred CCEEEEEECC--CCCCCEEEEEECCCCEEEEECCCCC---HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHHC- Q ss_conf 7359999708--7657169999889984999838999---999999999749975299986988521011056788607- Q gi|254780758|r 3 KLSFHIVQVT--PFKQNCTFLFDEESLEAVVVDPGGD---LDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKEDLS- 76 (216) Q Consensus 3 ~~~~~~i~v~--~~~~N~~li~~~~~~~~ilID~G~~---~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~~~- 76 (216) .+.+..++.+ .+..|++++........+|||+|.. ...+.+.+...+.++++|++||.|.||+||+..+.+..+ T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~liDtG~~~~~~~~~~~~l~~~~~di~~vilTH~H~DH~gg~~~~~~~~~~ 88 (252) T COG0491 9 LDGITAFPIGVGPLSGNSVYLLVDGEGGAVLIDTGLGDADAEALLEALAALGLDVDAILLTHGHFDHIGGAAVLKEAFGA 88 (252) T ss_pred CCCEEEEECCCCCCCCCEEEEEECCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHHHHHHHHCCC T ss_conf 34027872167656773399998489848999499984057999999886699877899198807878279999874587 Q ss_pred CCEEECCHHHHHHHCCHHHHHHHH-HCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 955405843898840123444432-0023466667418964783997620123310122247210000012343211110 Q gi|254780758|r 77 LTIIGPHKDDAAMMGKVDEQARLL-SIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNFENNFAHLGD 155 (216) Q Consensus 77 ~~i~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~~~~~lftGD 155 (216) +++ ..+.....+........... ..............+.+++.+.+++..++++++|||||||+++++++++++|+|| T Consensus 89 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~pGHT~g~~~~~~~~~~~l~~gD 167 (252) T COG0491 89 APV-IAPAEVPLLLREEILRKAGVTAEAYAAPGASPLRALEDGDELDLGGLELEVLHTPGHTPGHIVFLLEDGGVLFTGD 167 (252) T ss_pred CEE-EECCHHHHHHHCCCCCCCCCCHHHCCCCCCCCCCCCCCCCEEEECCCEEEEEECCCCCCCCEEEEECCCCEEEEEC T ss_conf 349-8040255443012111221001211477333443357898899789269999899888558899989998999932 Q ss_pred HHHHCC--CCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCC Q ss_conf 000013--566346798999999999999851999789984887988 Q gi|254780758|r 156 TLFRSS--IGRTDILHGDYQQLINSINNKILPLGDEISFVCGHGPNS 200 (216) Q Consensus 156 ~lf~~~--~G~~d~~~~d~~~~~~sl~~~l~~l~~~~~v~PgHG~~t 200 (216) ++|... +++...+..+..++++|+++......+...|+||||+.. T Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~p~Hg~~~ 214 (252) T COG0491 168 TLFAGDTGVGRLDLPGGDAAQLLASLRRLLLLLLPDTLVLPGHGPDE 214 (252) T ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCHH T ss_conf 11678767775555777999999999999843589859996898544 No 7 >PRK11921 metallo-beta-lactamase/flavodoxin domain-containing protein; Provisional Probab=99.84 E-value=5.7e-20 Score=135.79 Aligned_cols=168 Identities=20% Similarity=0.261 Sum_probs=115.4 Q ss_pred CCCC--CEEEEEECCCCEEEEECCCCC--HHHHHHHHHHC-C-CCEEEEEECCCCCCCCCCCHHHHHHHC-CCEEECCHH Q ss_conf 7657--169999889984999838999--99999999974-9-975299986988521011056788607-955405843 Q gi|254780758|r 13 PFKQ--NCTFLFDEESLEAVVVDPGGD--LDIIKDVIQSR-N-FHVKQIWITHGHVDHVGGAAQLKEDLS-LTIIGPHKD 85 (216) Q Consensus 13 ~~~~--N~~li~~~~~~~~ilID~G~~--~~~i~~~l~~~-~-~~i~~Ii~TH~H~DH~gg~~~l~~~~~-~~i~~~~~~ 85 (216) +.++ |+|||.++ +.+|||+... .+..++.|++. + .+|+||++.|.-+||+|+++.|.+.+| ++++++... T Consensus 28 ~~GtTYNSYLI~~e---k~aLiDtv~~~f~~e~l~~l~~vi~~~~IdYiVvnH~EPDHsg~l~~ll~~~p~~~vv~s~~~ 104 (395) T PRK11921 28 HRGSSYNSYLIKDE---KTVLIDTVWAPFAKEFVENLKKEIDLDKIDYIVANHGEIDHSGALPELMKEIPDTPIYCTANG 104 (395) T ss_pred CCCEEEEEEEECCC---CEEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHHCCCCEEEECHHH T ss_conf 99645675799689---789992898438999999999645965699999389997578999999998899999987899 Q ss_pred HHHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCHHHHCCCC- Q ss_conf 8988401234444320023466667418964783997620123310122-24721000001234321111000001356- Q gi|254780758|r 86 DAAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCP-GHSPGHVIYVNFENNFAHLGDTLFRSSIG- 163 (216) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tP-GHt~g~~~~~~~~~~~lftGD~lf~~~~G- 163 (216) ...+.... . .....+.+++||++.+|+.+++++.|| =|.|++++-|.+++++|||.|+ | ++.+ T Consensus 105 ~~~l~~~~-------~------~~~~~~~Vk~gd~L~LG~~tL~F~~tP~lHWPdtm~Ty~~edkiLFs~Da-F-G~h~a 169 (395) T PRK11921 105 AKSLKGHY-------H------QDWNFVVVKTGDRLEIGSKELIFIEAPMLHWPDSMFCYLTGDNILFSNDA-F-GQHYA 169 (395) T ss_pred HHHHHHHH-------C------CCCCEEEECCCCEEECCCCEEEEEECCCCCCCCCEEEEECCCCEEECCCC-C-CCCCC T ss_conf 99999872-------8------88773896789888428966999978988788843799757877860355-4-45433 Q ss_pred ---CCCCCC--C----CHHHH------------HHHHHHHHHCCC--CCEEEEECCCCCC Q ss_conf ---634679--8----99999------------999999985199--9789984887988 Q gi|254780758|r 164 ---RTDILH--G----DYQQL------------INSINNKILPLG--DEISFVCGHGPNS 200 (216) Q Consensus 164 ---~~d~~~--~----d~~~~------------~~sl~~~l~~l~--~~~~v~PgHG~~t 200 (216) +.|-.. . +...+ .+.| +++..|+ .+ .|+|+|||-- T Consensus 170 ~~~~fdd~~d~~~~~~ea~~YY~nI~~p~~~~V~~~l-~kl~~l~l~i~-~IaPsHGpi~ 227 (395) T PRK11921 170 SELMYNDLVDQGELYQEAIKYYANILTPFSPLVIKKI-EEILSLNLPVD-MICPSHGVIW 227 (395) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHCCCCCC-EEECCCCCEE T ss_conf 7100023354055699999999998663189999999-99984799976-9988985054 No 8 >COG0426 FpaA Uncharacterized flavoproteins [Energy production and conversion] Probab=99.83 E-value=1.1e-19 Score=134.14 Aligned_cols=170 Identities=22% Similarity=0.238 Sum_probs=116.1 Q ss_pred CCCCCEEEEEECCCCEEEEECCCCC--HHHHHHHHHHC-CC-CEEEEEECCCCCCCCCCCHHHHHHHC-CCEEECCHHHH Q ss_conf 7657169999889984999838999--99999999974-99-75299986988521011056788607-95540584389 Q gi|254780758|r 13 PFKQNCTFLFDEESLEAVVVDPGGD--LDIIKDVIQSR-NF-HVKQIWITHGHVDHVGGAAQLKEDLS-LTIIGPHKDDA 87 (216) Q Consensus 13 ~~~~N~~li~~~~~~~~ilID~G~~--~~~i~~~l~~~-~~-~i~~Ii~TH~H~DH~gg~~~l~~~~~-~~i~~~~~~~~ 87 (216) ....|+|||.+ ++.+||||+.. .+.+++.+++. +. +|+||+..|..+||+|+++.+.+.+| ++++++..... T Consensus 33 GttyNSYLI~~---~k~aLID~~~~~f~~~~l~~l~~~id~k~iDYIi~~H~ePDhsg~l~~ll~~~p~a~ii~s~~~~~ 109 (388) T COG0426 33 GTTYNSYLIVG---DKTALIDTVGEKFFDEYLENLSKYIDPKEIDYIIVNHTEPDHSGSLPELLELAPNAKIICSKLAAR 109 (388) T ss_pred CCEEEEEEEEC---CCEEEECCCCCCHHHHHHHHHHHHCCHHCCEEEEECCCCCCHHHHHHHHHHHCCCCEEEEEHHHHH T ss_conf 71445589967---967998899801699999999863480017199978889633546999998688978996389999 Q ss_pred HHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCHHHH-CCCCCC Q ss_conf 88401234444320023466667418964783997620123310122-2472100000123432111100000-135663 Q gi|254780758|r 88 AMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCP-GHSPGHVIYVNFENNFAHLGDTLFR-SSIGRT 165 (216) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tP-GHt~g~~~~~~~~~~~lftGD~lf~-~~~G~~ 165 (216) ++..-.. .+.. ...++.||++.+|+.+++++++| =|+||+++.|.+.+++|||+|+ |- .-|.. T Consensus 110 ~L~~~~~------------~~~~-~~ivk~Gd~ldlGg~tL~Fi~ap~LHWPd~m~TYd~~~kILFS~D~-fG~h~~~~- 174 (388) T COG0426 110 FLKGFYH------------DPEW-FKIVKTGDTLDLGGHTLKFIPAPFLHWPDTMFTYDPEDKILFSCDA-FGAHVCDD- 174 (388) T ss_pred HHHHHCC------------CCCC-EEECCCCCEECCCCCEEEEEECCCCCCCCCEEEEECCCCEEECCCC-CCCCCCCH- T ss_conf 9997448------------8541-3634788776448947999957877798733676247757771465-66424652- Q ss_pred CCCCCCHHHH------------------HHHHHHHHHCCCCCEEEEECCCCCCC Q ss_conf 4679899999------------------99999998519997899848879888 Q gi|254780758|r 166 DILHGDYQQL------------------INSINNKILPLGDEISFVCGHGPNST 201 (216) Q Consensus 166 d~~~~d~~~~------------------~~sl~~~l~~l~~~~~v~PgHG~~tt 201 (216) ..+..+...+ .....+++..++-+ .|.|+|||.-. T Consensus 175 ~~fded~~~~~~~~~~Y~~~lm~p~~~~v~~~l~~~~~l~i~-~IaP~HG~i~~ 227 (388) T COG0426 175 YRFDEDIEELLPDMRKYYANLMAPNARLVLWALKKIKLLKIE-MIAPSHGPIWR 227 (388) T ss_pred HCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCC-EEECCCCCEEE T ss_conf 112467999879899999996052179999998540546741-89759985255 No 9 >pfam00753 Lactamase_B Metallo-beta-lactamase superfamily. Probab=99.82 E-value=7.5e-20 Score=135.10 Aligned_cols=141 Identities=28% Similarity=0.369 Sum_probs=98.7 Q ss_pred CCCCCEEEEEECCCCEEEEECCCCCHHHHHHH---HHHCCCCEEEEEECCCCCCCCCCCHHHHHHHCCCEEECCHHHHHH Q ss_conf 76571699998899849998389999999999---997499752999869885210110567886079554058438988 Q gi|254780758|r 13 PFKQNCTFLFDEESLEAVVVDPGGDLDIIKDV---IQSRNFHVKQIWITHGHVDHVGGAAQLKEDLSLTIIGPHKDDAAM 89 (216) Q Consensus 13 ~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~---l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~~~~~i~~~~~~~~~~ 89 (216) ....|||+|.++ ++++|||||......... .+....+|++|++||.|+||+||+..++++++...+......... T Consensus 3 ~~~~~~~li~~~--~~~vLiD~G~~~~~~~~~~~~~~~~~~~v~~vi~TH~H~DH~gg~~~~~~~~~~~~~~~~~~~~~~ 80 (148) T pfam00753 3 GVGSNSYLVEGD--GGAILIDTGLGADDALLLLAALGLDPKDIDAIILTHAHADHIGGLPELKEATPAPVVAAPEDAAAL 80 (148) T ss_pred CCEEEEEEEEEC--CEEEEEECCCCHHHHHHHHHHHCCCCCCCCEEEECCCCHHHHCCHHHHHHHCCCCEEECCHHHHHH T ss_conf 855589999989--989999598991589999998379976864999799985565568999987599899826888765 Q ss_pred HCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 401234444320023466667418964783997620123310122247210000012343211110000 Q gi|254780758|r 90 MGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNFENNFAHLGDTLF 158 (216) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~~~~~lftGD~lf 158 (216) ........... ...............+....+...+.+...+||++++++++.+++++||+||++| T Consensus 81 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~GD~~~ 146 (148) T pfam00753 81 LRLGLDDAELR---KLVDVLPPDVDLEGGDGILGGGTLLFVTPHPGHGPGHVVVYLPGGKVLFTGDLLF 146 (148) T ss_pred HHHHHCCCCCC---CCCCCCCCEEEECCCEEEECCCEEEEEEECCCCCCCCEEEEECCCCEEEECCCCC T ss_conf 42100001002---4666666505405741992498899999789988985999999999999898267 No 10 >PRK05452 anaerobic nitric oxide reductase flavorubredoxin; Provisional Probab=99.81 E-value=3.6e-19 Score=131.18 Aligned_cols=166 Identities=18% Similarity=0.209 Sum_probs=112.3 Q ss_pred CCEEEEEECCCCEEEEECCCCC--HHHHHHHHHHC-CC-CEEEEEECCCCCCCCCCCHHHHHHHC-CCEEECCHHHHHHH Q ss_conf 7169999889984999838999--99999999974-99-75299986988521011056788607-95540584389884 Q gi|254780758|r 16 QNCTFLFDEESLEAVVVDPGGD--LDIIKDVIQSR-NF-HVKQIWITHGHVDHVGGAAQLKEDLS-LTIIGPHKDDAAMM 90 (216) Q Consensus 16 ~N~~li~~~~~~~~ilID~G~~--~~~i~~~l~~~-~~-~i~~Ii~TH~H~DH~gg~~~l~~~~~-~~i~~~~~~~~~~~ 90 (216) -|+|||.++ +.+|||+... .+.+++.|++. +. +|+||++-|.-+||+|+++.|.+.+| ++++++......+. T Consensus 35 YNSYLI~de---K~aLIDtv~~~f~~e~l~~L~~~id~~~IDYIIvnH~EpDHSGsL~~Lle~~P~~~Iv~s~~a~~~l~ 111 (479) T PRK05452 35 YNSYLIREE---KNVLIDTVDHKFSREFVQNLRMEIDLADIDYIVINHAEEDHAGALTELMAQIPDTPIYCTANAIDSIN 111 (479) T ss_pred EEEEEECCC---CEEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf 343698489---88999299762899999999974596679889968999754768999999889999998789999999 Q ss_pred CCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEE-CCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCHHHHC-CCC--CC Q ss_conf 0123444432002346666741896478399762-0123310122-24721000001234321111000001-356--63 Q gi|254780758|r 91 GKVDEQARLLSIRMNARNASSDRWLQDGDTLLLG-THIFKVFHCP-GHSPGHVIYVNFENNFAHLGDTLFRS-SIG--RT 165 (216) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg-~~~~~vi~tP-GHt~g~~~~~~~~~~~lftGD~lf~~-~~G--~~ 165 (216) .... .+--..+.+++||++++| |.+++++.|| =|.|++++-|++++++|||.|++ -. -+. +. T Consensus 112 ~~~~------------~~~~~~~vVk~Gd~LdLG~GktL~Fi~tPmlHWPDsM~TY~~edkILFS~DaF-G~H~~~~~~f 178 (479) T PRK05452 112 GHHH------------HPEWNFNVVKTGDTLDIGNGKQLIFVETPMLHWPDSMMTYLTGDAVLFSNDAF-GQHYCDEHLF 178 (479) T ss_pred HHHC------------CCCCCEEEECCCCEEECCCCCEEEEEECCCCCCCCCEEEEECCCCEEEECCCC-CCEECCCCCC T ss_conf 8527------------87667378468985726998468996189877887326886267568612555-3212275444 Q ss_pred CCCCCCH-------HHH------------HHHHHHHHHC--CCCCEEEEECCCCCC Q ss_conf 4679899-------999------------9999999851--999789984887988 Q gi|254780758|r 166 DILHGDY-------QQL------------INSINNKILP--LGDEISFVCGHGPNS 200 (216) Q Consensus 166 d~~~~d~-------~~~------------~~sl~~~l~~--l~~~~~v~PgHG~~t 200 (216) |- +-|. ..+ .+.| +++.. |+.+ .|.|+|||-- T Consensus 179 dd-evd~~~~~~ea~rYYanIl~Pf~~~V~~~l-~kl~~l~L~I~-~IaPsHGpI~ 231 (479) T PRK05452 179 ND-EVDQTELFEQCQRYYANILTPFSRLVTPKI-TEILGFNLPVD-MIATSHGVVW 231 (479) T ss_pred CC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHCCCCHH-EEECCCCCEE T ss_conf 33-245687899999999998764079999999-99973799842-4875788233 No 11 >COG0595 mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair] Probab=99.77 E-value=4.6e-18 Score=124.71 Aligned_cols=192 Identities=22% Similarity=0.264 Sum_probs=124.0 Q ss_pred CCEEEEEECCCCC---CCEEEEEECCCCEEEEECCCCC---H-----HHH---HHHHHHCCCCEEEEEECCCCCCCCCCC Q ss_conf 7359999708765---7169999889984999838999---9-----999---999997499752999869885210110 Q gi|254780758|r 3 KLSFHIVQVTPFK---QNCTFLFDEESLEAVVVDPGGD---L-----DII---KDVIQSRNFHVKQIWITHGHVDHVGGA 68 (216) Q Consensus 3 ~~~~~~i~v~~~~---~N~~li~~~~~~~~ilID~G~~---~-----~~i---~~~l~~~~~~i~~Ii~TH~H~DH~gg~ 68 (216) ..++.++++|.++ -|||++..++ +.+++|+|.- . +.+ ..++++...++++|++||+|.||+|++ T Consensus 6 ~~~i~i~~lGG~~EiGkN~~vve~~~--~i~i~D~G~~fp~~~~~gvDliIPd~~yl~~n~~kvkgI~lTHgHeDHIGai 83 (555) T COG0595 6 KAKIKIFALGGVGEIGKNMYVVEYGD--DIIILDAGLKFPEDDLLGVDLIIPDFSYLEENKDKVKGIFLTHGHEDHIGAL 83 (555) T ss_pred CCCEEEEEECCHHHHCCCEEEEEECC--CEEEEECCCCCCCCCCCCCCEEECCHHHHHHCCCCCEEEEECCCCHHHCCCH T ss_conf 78558998058102226238999789--6899977656686435555588258277431500213999468744432346 Q ss_pred HHHHHHHC-CCEEECCHHHHHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCC-CCCCCCC-- Q ss_conf 56788607-9554058438988401234444320023466667418964783997620123310122247-2100000-- Q gi|254780758|r 69 AQLKEDLS-LTIIGPHKDDAAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHS-PGHVIYV-- 144 (216) Q Consensus 69 ~~l~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt-~g~~~~~-- 144 (216) +++..+.+ +|+|++......+..+.... ...........+..++.+++++..++++++ -|| |+++++. T Consensus 84 p~ll~~~~~~piy~s~lt~~Li~~k~~~~-------~~~~~~~~~~ev~~~~~i~~~~~~v~f~~v-tHSIPds~g~~i~ 155 (555) T COG0595 84 PYLLKQVLFAPIYASPLTAALIKEKLKEH-------GLFKNENELHEVKPGSEIKFGSFEVEFFPV-THSIPDSLGIVIK 155 (555) T ss_pred HHHHHCCCCCCEECCHHHHHHHHHHHHHH-------CCCCCCCCEEEECCCCEEEECCEEEEEEEE-CCCCCCCEEEEEE T ss_conf 78985277684444776699999898873-------211356743994799857767679999960-3557451689998 Q ss_pred CCCCCCCCCCCHHHHCCCCCCCCCCCCHHHHHHHHHH-HHHCCCCCEEEE-ECCCCCCCHHHHH Q ss_conf 1234321111000001356634679899999999999-985199978998-4887988889998 Q gi|254780758|r 145 NFENNFAHLGDTLFRSSIGRTDILHGDYQQLINSINN-KILPLGDEISFV-CGHGPNSTIGRER 206 (216) Q Consensus 145 ~~~~~~lftGD~lf~~~~G~~d~~~~d~~~~~~sl~~-~l~~l~~~~~v~-PgHG~~tt~~~e~ 206 (216) .+++.+++|||.-|-+... +=+..|...+.+--++ ++.-+.+.|.+- ||..++...-.|. T Consensus 156 Tp~G~Iv~TGDFk~d~~~~--~g~~~d~~r~~~~g~eGVl~LisdsTna~~pg~t~SE~~v~~~ 217 (555) T COG0595 156 TPEGNIVYTGDFKFDPTPV--DGEPTDLARLAEIGKEGVLALISDSTNAENPGFTPSESEVGEN 217 (555) T ss_pred CCCCCEEEECCEEECCCCC--CCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCCHHHHHHH T ss_conf 7984389968778558867--7776788999974667847999578656899989788999999 No 12 >TIGR00649 MG423 conserved hypothetical protein; InterPro: IPR004613 This is a family of conserved hypothetical proteins, the members of which contain an ATP-binding domain at the N-terminal end of the protein. It is possibly part of a superfamily of beta-lactmases.. Probab=99.72 E-value=3.9e-17 Score=119.33 Aligned_cols=194 Identities=22% Similarity=0.281 Sum_probs=128.2 Q ss_pred EEEEECCCCC---CCEEEEEECCCCEEEEECCCCCH-------H----HH-------------HHHHHHCCCCEEEEEEC Q ss_conf 9999708765---71699998899849998389999-------9----99-------------99999749975299986 Q gi|254780758|r 6 FHIVQVTPFK---QNCTFLFDEESLEAVVVDPGGDL-------D----II-------------KDVIQSRNFHVKQIWIT 58 (216) Q Consensus 6 ~~~i~v~~~~---~N~~li~~~~~~~~ilID~G~~~-------~----~i-------------~~~l~~~~~~i~~Ii~T 58 (216) ++++++|.+. -|||++.-+ +++++||.|... + .+ ..+|++..-+|++|++| T Consensus 1 ~~i~alGG~~E~GKNm~vve~~--~ev~i~D~G~~~PRVlIHed~di~~~~G~dL~~~~vIPD~~~l~~~~~kvkgI~~t 78 (593) T TIGR00649 1 IKIFALGGLEEIGKNMTVVEVD--DEVVIIDAGIRFPRVLIHEDTDIAEMLGVDLIEKGVIPDFTYLKENEDKVKGIVIT 78 (593) T ss_pred CEEECCCCCCCCCCCEEEEEEC--CEEEEEECCCCCCCEEEECCCCHHHHCCHHHHHCCCCCCCHHHHHHCCCEEEEEEC T ss_conf 9672358865417623899877--85999823223796464155203544041143203168961675212506788970 Q ss_pred CCCCCCCCCCHHHHHHHCCC-EEECCHHHHHHHCCHH-HHHHHHHCCCCCCCCCC-EEEECCCCEEEEE---CCCCCCCC Q ss_conf 98852101105678860795-5405843898840123-44443200234666674-1896478399762---01233101 Q gi|254780758|r 59 HGHVDHVGGAAQLKEDLSLT-IIGPHKDDAAMMGKVD-EQARLLSIRMNARNASS-DRWLQDGDTLLLG---THIFKVFH 132 (216) Q Consensus 59 H~H~DH~gg~~~l~~~~~~~-i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~l~~gd~i~lg---~~~~~vi~ 132 (216) |+|.||+||.++|..+++.+ +|++.-.-..+..+.. +... ..... -.++..++.++++ +..++++. T Consensus 79 HGH~DHIGav~yL~~~~~~~PiYGt~lt~~L~~~ki~~E~~l--------~~~~~yl~~~~~~~~~~~~~i~~~~~eFir 150 (593) T TIGR00649 79 HGHEDHIGAVPYLLHQYGFPPIYGTPLTIALIKSKIKAEHGL--------NVRTDYLQEINEGEPVQVGKIENFAIEFIR 150 (593) T ss_pred CCCCCCCCHHHHHHHHCCCCCEECCHHHHHHHHHHHHHHHCC--------CCCCCHHHEECCCCEEEEEECCCEEEEEEE T ss_conf 871000122888986568882404378999999988654200--------225750210158960888750643884577 Q ss_pred CCCCC-CCCCC--CCCCCCCCCCCCCHHHHCCCCCCCCCCCCHHHHHHHHH-HHHHCCCCCEEEE---ECCCCCCCHHHH Q ss_conf 22247-21000--00123432111100000135663467989999999999-9985199978998---488798888999 Q gi|254780758|r 133 CPGHS-PGHVI--YVNFENNFAHLGDTLFRSSIGRTDILHGDYQQLINSIN-NKILPLGDEISFV---CGHGPNSTIGRE 205 (216) Q Consensus 133 tPGHt-~g~~~--~~~~~~~~lftGD~lf~~~~G~~d~~~~d~~~~~~sl~-~~l~~l~~~~~v~---PgHG~~tt~~~e 205 (216) + -|| |+|++ +..+.+.++++||.-|-.. +.+-+..|..++.+-.+ .+|+-|.+-|.+= ||--||.....+ T Consensus 151 ~-tHSIPdS~~~a~hTp~G~IVy~~DFKfD~~--p~~~~~~Dl~~~~~~G~~GVL~LlsdsT~~~er~~g~TpSE~~~~~ 227 (593) T TIGR00649 151 V-THSIPDSVGLALHTPEGSIVYAGDFKFDNT--PVIGEPPDLNRIAELGKKGVLLLLSDSTNVEERNEGKTPSEAVVSE 227 (593) T ss_pred E-EECCCCCEEEEEECCCCEEEEECCEEEECC--CCCCCCCCHHHHHHHHCCCCEEEEECCEECCCCCCCCCCHHHHHHH T ss_conf 1-432412218889769834998325455377--7889860079999973288489995020226757886871889989 Q ss_pred HHHCHHH Q ss_conf 8828145 Q gi|254780758|r 206 RRLNPFL 212 (216) Q Consensus 206 ~~~n~~~ 212 (216) .-.+.|. T Consensus 228 ~l~~~~~ 234 (593) T TIGR00649 228 LLDDIFK 234 (593) T ss_pred HHHHHHH T ss_conf 9999862 No 13 >PRK11539 hypothetical protein; Provisional Probab=99.68 E-value=2.9e-15 Score=108.50 Aligned_cols=167 Identities=17% Similarity=0.195 Sum_probs=109.7 Q ss_pred CCEEEEEECCCCCCCEEEEEECCCCEEEEECCCCC-------HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHH Q ss_conf 73599997087657169999889984999838999-------99999999974997529998698852101105678860 Q gi|254780758|r 3 KLSFHIVQVTPFKQNCTFLFDEESLEAVVVDPGGD-------LDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKEDL 75 (216) Q Consensus 3 ~~~~~~i~v~~~~~N~~li~~~~~~~~ilID~G~~-------~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~~ 75 (216) .+++.++.|| |-.+.+|.. +++++|+|+|.. ...+.-+++.+|++++++++||.|.||+||++.+.+++ T Consensus 498 ~~~v~~LDVG--QG~avlI~~--~~~~~L~DtG~~~~~~~~g~~vi~P~L~~~Gi~lD~lvlSH~D~DH~GGl~~ll~~~ 573 (754) T PRK11539 498 EWRVDMLDVG--QGLAMVIER--NGKAILYDTGNAWPEGDSAQQVIIPWLRWHNLTPEGVILSHEHLDHRGGLASLLHAW 573 (754) T ss_pred CCEEEEEECC--CCEEEEEEE--CCEEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCHHHHHHHC T ss_conf 6089999727--851899998--997899957998888872377889999967997208994588500126899999848 Q ss_pred CCCEEECCHHHHHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCC-----CCCCCCCCCC--CC Q ss_conf 7955405843898840123444432002346666741896478399762012331012224-----7210000012--34 Q gi|254780758|r 76 SLTIIGPHKDDAAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGH-----SPGHVIYVNF--EN 148 (216) Q Consensus 76 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGH-----t~g~~~~~~~--~~ 148 (216) +...+.+..... .......|+..+.++.++++++-|.. -..||++.+. +. T Consensus 574 p~~~i~~~~~~~-----------------------~~~~c~~G~~~~~~g~~~~vL~P~~~~~~~~N~~ScVl~i~~g~~ 630 (754) T PRK11539 574 PMAWIRSPLNWA-----------------------NHLPCVRGERWQWQGLTFSVHWPLQQSNDAGNNDSCVVKVDDGKH 630 (754) T ss_pred CCEEEECCCCCC-----------------------CCCCEECCCEEEECCEEEEEECCCCCCCCCCCCCCEEEEEEECCE T ss_conf 830998656656-----------------------765511687899999899997799867899998848999998996 Q ss_pred CCCCCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCC--CHHHHHHHCH Q ss_conf 3211110000013566346798999999999999851999789984887988--8899988281 Q gi|254780758|r 149 NFAHLGDTLFRSSIGRTDILHGDYQQLINSINNKILPLGDEISFVCGHGPNS--TIGRERRLNP 210 (216) Q Consensus 149 ~~lftGD~lf~~~~G~~d~~~~d~~~~~~sl~~~l~~l~~~~~v~PgHG~~t--t~~~e~~~n~ 210 (216) ++|||||. + ..-..++++ +.-..+..|+.+.|-||..| |-.+-++.+| T Consensus 631 s~LltGDi------e-----~~~E~~Ll~---~~~~~L~~dVLkvpHHGSktSSs~~Fl~~v~P 680 (754) T PRK11539 631 SILLTGDI------E-----AQAEQKMLS---RYWQQLQATLLQVPHHGSNTSSSLPFIQRVNG 680 (754) T ss_pred EEEEECCC------C-----HHHHHHHHH---CCCCCCCCCEEECCCCCCCCCCHHHHHHHCCC T ss_conf 89996279------9-----899999986---37668888689889989784665999985499 No 14 >TIGR03675 arCOG00543 arCOG01782 universal archaeal KH-domain/beta-lactamase-domain protein. This family of proteins is universal in the archaea and consistsof an N-terminal type-1 KH-domain (pfam00013) a central beta-lactamase-domain (pfam00753) with a C-terminal motif associated with RNA metabolism (pfam07521). KH-domains are associated with RNA-binding, so taken together, this protein is a likely metal-dependent RNAase. This family was defined in as arCOG01782. Probab=99.64 E-value=4.4e-16 Score=113.21 Aligned_cols=149 Identities=16% Similarity=0.165 Sum_probs=96.0 Q ss_pred EEEEEECCCCC---CCEEEEEECCCCEEEEECCCCCHH----HHHHHHH--HCCC-CEEEEEECCCCCCCCCCCHHHHH- Q ss_conf 59999708765---716999988998499983899999----9999999--7499-75299986988521011056788- Q gi|254780758|r 5 SFHIVQVTPFK---QNCTFLFDEESLEAVVVDPGGDLD----IIKDVIQ--SRNF-HVKQIWITHGHVDHVGGAAQLKE- 73 (216) Q Consensus 5 ~~~~i~v~~~~---~N~~li~~~~~~~~ilID~G~~~~----~i~~~l~--~~~~-~i~~Ii~TH~H~DH~gg~~~l~~- 73 (216) -+++.++|+++ --|.++...++ .+|+|||.++. ...-++. +.++ .|++|++||+|.||+|.++.|.+ T Consensus 174 ~vrit~LG~~~eVGRSc~l~~~~~~--~illDcG~~~~~~~~~~~P~~d~~e~~~~~IDAVvlTHAHLDH~G~LP~L~k~ 251 (630) T TIGR03675 174 WVRVTALGGFREVGRSALLLSTPES--RILLDCGVNVGANGDNAYPYLDVPEFQLDELDAVVITHAHLDHSGLVPLLFKY 251 (630) T ss_pred EEEEEECCCCCCCCCEEEEEECCCE--EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCHHHHC T ss_conf 4999834676702645899963981--79998888877777545886567779801185899833532125632188764 Q ss_pred HHCCCEEECCHHHHHHHCCHHHHHHHHHCCCCCC---------CCCCEEEECCCCEEEEE-CCCCCCCCCCCCCCCCCCC Q ss_conf 6079554058438988401234444320023466---------66741896478399762-0123310122247210000 Q gi|254780758|r 74 DLSLTIIGPHKDDAAMMGKVDEQARLLSIRMNAR---------NASSDRWLQDGDTLLLG-THIFKVFHCPGHSPGHVIY 143 (216) Q Consensus 74 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~l~~gd~i~lg-~~~~~vi~tPGHt~g~~~~ 143 (216) -|..++|+.+++...+.-......+......... .......+..++.++++ +.+++++.+ ||-.||.++ T Consensus 252 Gf~GpIy~T~pT~dL~~llL~D~~ki~~~e~~~plYt~~dv~~~l~~~~~l~y~~~~~i~~~ik~tf~~A-GHILGSAmv 330 (630) T TIGR03675 252 GYDGPVYCTPPTRDLMTLLQLDYIDVAQREGKKPPYSSKDVREALKHTITLDYGEVTDIAPDIKLTFYNA-GHILGSAIA 330 (630) T ss_pred CCCCCEEECHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCEECCCCCEEECCCCEEEEEECC-CCCCEEEEE T ss_conf 8889888654599999999998999987058999989999999984456248998597179849999768-861044799 Q ss_pred CC--CCC--CCCCCCCH Q ss_conf 01--234--32111100 Q gi|254780758|r 144 VN--FEN--NFAHLGDT 156 (216) Q Consensus 144 ~~--~~~--~~lftGD~ 156 (216) ++ .++ +++||||. T Consensus 331 ~l~i~~g~~~ilfTGD~ 347 (630) T TIGR03675 331 HLHIGDGLYNIVYTGDF 347 (630) T ss_pred EEEECCCCEEEEEECCC T ss_conf 99976896799995998 No 15 >COG2333 ComEC Predicted hydrolase (metallo-beta-lactamase superfamily) [General function prediction only] Probab=99.56 E-value=7.1e-14 Score=100.40 Aligned_cols=175 Identities=20% Similarity=0.274 Sum_probs=110.9 Q ss_pred EEEEEECCCCCCCEEEEEECCCCEEEEECCCC--CHHHHHHHHHHCCCC-EEEEEECCCCCCCCCCCHHHHHHHCCCEEE Q ss_conf 59999708765716999988998499983899--999999999974997-529998698852101105678860795540 Q gi|254780758|r 5 SFHIVQVTPFKQNCTFLFDEESLEAVVVDPGG--DLDIIKDVIQSRNFH-VKQIWITHGHVDHVGGAAQLKEDLSLTIIG 81 (216) Q Consensus 5 ~~~~i~v~~~~~N~~li~~~~~~~~ilID~G~--~~~~i~~~l~~~~~~-i~~Ii~TH~H~DH~gg~~~l~~~~~~~i~~ 81 (216) +++.+.+| +--+.++..++ + .+++|+|. .-..++.+|+++|++ |+.+++||.|.||+||+..+.+.+..+.++ T Consensus 45 ~~~~lDvG--qg~a~li~~~~-~-~~l~dtg~~~~~~~iip~Lk~~GV~~iD~lIlTH~d~DHiGg~~~vl~~~~v~~~~ 120 (293) T COG2333 45 KVHMLDVG--QGLATLIRSEG-K-TILYDTGNSMGQDVIIPYLKSLGVRKLDQLILTHPDADHIGGLDEVLKTIKVPELW 120 (293) T ss_pred EEEEEECC--CCEEEEEEECC-C-EEEEECCCCCCCEEEHHHHHHCCCCCCCEEEECCCCCCCCCCHHHHHHHCCCCCEE T ss_conf 59999768--97278985189-1-18862575447243635576749860137893168744447899998528877179 Q ss_pred C-CHHHHHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCC-----CCCCCCCC--CCCCCCC Q ss_conf 5-84389884012344443200234666674189647839976201233101222472-----10000012--3432111 Q gi|254780758|r 82 P-HKDDAAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSP-----GHVIYVNF--ENNFAHL 153 (216) Q Consensus 82 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~-----g~~~~~~~--~~~~lft 153 (216) . ....... .... + ....+.+...-|+.+++++..++++.-++.+. .|+++++. +..+||| T Consensus 121 i~~~~~~~~----~~~~-----~---~~~~~~~~~~~G~~~~~~~~~f~vl~P~~~~~~~~N~~S~Vl~v~~g~~s~LlT 188 (293) T COG2333 121 IYAGSDSTS----TFVL-----R---DAGIPVRSCKAGDSWQWGGVVFQVLSPVGGVSDDLNNDSCVLRVTFGGNSFLLT 188 (293) T ss_pred EECCCCCCC----HHHH-----H---HCCCCEECCCCCCEEEECCEEEEEECCCCCCCCCCCCCCEEEEEEECCEEEEEE T ss_conf 827877651----4556-----5---337741200127547888857999768866665556762599999589169996 Q ss_pred CCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCC--CHHHHHHHCH Q ss_conf 10000013566346798999999999999851999789984887988--8899988281 Q gi|254780758|r 154 GDTLFRSSIGRTDILHGDYQQLINSINNKILPLGDEISFVCGHGPNS--TIGRERRLNP 210 (216) Q Consensus 154 GD~lf~~~~G~~d~~~~d~~~~~~sl~~~l~~l~~~~~v~PgHG~~t--t~~~e~~~n~ 210 (216) ||. +......|. +.-..|..+++.+|.||..| +..+-.+.+| T Consensus 189 GD~-----------e~~~E~~l~----~~~~~l~~dVLkV~HHGS~tSss~~Fl~~v~P 232 (293) T COG2333 189 GDL-----------EEKGEKLLK----KYGPDLRADVLKVGHHGSKTSSSLAFLEAVKP 232 (293) T ss_pred CCC-----------CHHHHHHHH----HHCCCCCCEEEEECCCCCCCCCCHHHHHHCCC T ss_conf 578-----------705689998----43787442089816677665672889974398 No 16 >PRK00055 ribonuclease Z; Reviewed Probab=99.50 E-value=9.4e-14 Score=99.68 Aligned_cols=145 Identities=21% Similarity=0.183 Sum_probs=85.9 Q ss_pred CEEEEEECC-CC-----CCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCC---CEEEEEECCCCCCCCCCCHHHHHH Q ss_conf 359999708-76-----571699998899849998389999999999997499---752999869885210110567886 Q gi|254780758|r 4 LSFHIVQVT-PF-----KQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNF---HVKQIWITHGHVDHVGGAAQLKED 74 (216) Q Consensus 4 ~~~~~i~v~-~~-----~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~---~i~~Ii~TH~H~DH~gg~~~l~~~ 74 (216) |++.++=.| ++ .+.||+|..++ ..+|||||.+. +..+.+.++ ++++|++||.|+||+.|++.|... T Consensus 2 mkit~LGtg~~~p~~~R~~s~~li~~~~--~~iliD~G~G~---~~~l~~~~~~~~~l~~I~iTH~H~DH~~Gl~~ll~~ 76 (259) T PRK00055 2 MELTFLGTGSGVPTPTRNVSSILLRLGG--ELWLFDCGEGT---QRQMLRAGIGPRKIDRIFITHLHGDHIFGLPGLLST 76 (259) T ss_pred CEEEEEECCCCCCCCCCCCCEEEEEECC--EEEEEECCCHH---HHHHHHCCCCHHHCCEEEEECCCHHHHCHHHHHHHH T ss_conf 4899990689988999964889999999--08999588159---999998199955765899964335766528999999 Q ss_pred H-------CCCEEECCHHHHHHHCCHHHHHH-HHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0-------79554058438988401234444-320023466667418964783997620123310122247210000012 Q gi|254780758|r 75 L-------SLTIIGPHKDDAAMMGKVDEQAR-LLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNF 146 (216) Q Consensus 75 ~-------~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~ 146 (216) . +..+|++......+ ........ ....... ........++..++.++.+++.+.++ |+..++.|.+. T Consensus 77 ~~~~~~~~~l~i~gp~~~~~~l-~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~i~~~~~~-H~~~s~Gyri~ 151 (259) T PRK00055 77 RSLSGRTEPLTIYGPKGIKEFV-ETLLRASGSHLPYRIA---IHEIDEKTEGVVFDDDGFTVTAFPLD-HSIPCLGYRFE 151 (259) T ss_pred HHHCCCCCCEEEEECHHHHHHH-HHHHHHHCCCCCCCEE---EEECCCCCCCEEEECCCEEEEEEECC-CCCCEEEEEEE T ss_conf 8745999716999177899999-9998850668897256---76325798732896499899999867-99862799999 Q ss_pred C--CCCCCCCCHHH Q ss_conf 3--43211110000 Q gi|254780758|r 147 E--NNFAHLGDTLF 158 (216) Q Consensus 147 ~--~~~lftGD~lf 158 (216) . .+++|+||+-+ T Consensus 152 ~~g~~i~ysgDt~~ 165 (259) T PRK00055 152 EKGRKIAYVGDTRP 165 (259) T ss_pred ECCCEEEEECCCCC T ss_conf 68816999527886 No 17 >COG1236 YSH1 Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis] Probab=99.45 E-value=1.3e-13 Score=98.87 Aligned_cols=137 Identities=18% Similarity=0.207 Sum_probs=89.0 Q ss_pred CCEEEEEECCCCEEEEECCCCCHHHHHHH--HH-HCCCCEEEEEECCCCCCCCCCCHHHHHH-HCCCEEECCHHHHHHHC Q ss_conf 71699998899849998389999999999--99-7499752999869885210110567886-07955405843898840 Q gi|254780758|r 16 QNCTFLFDEESLEAVVVDPGGDLDIIKDV--IQ-SRNFHVKQIWITHGHVDHVGGAAQLKED-LSLTIIGPHKDDAAMMG 91 (216) Q Consensus 16 ~N~~li~~~~~~~~ilID~G~~~~~i~~~--l~-~~~~~i~~Ii~TH~H~DH~gg~~~l~~~-~~~~i~~~~~~~~~~~~ 91 (216) .-|.++.... ..+++|||.....-... +. ... +++++++||+|.||+|+++.+... +..++|+..++...... T Consensus 14 ~s~~~l~~~~--~~il~D~G~~~~~~~~~~p~~~~~~-~vDavllTHaHlDH~g~lp~l~~~~~~~~v~aT~~T~~l~~~ 90 (427) T COG1236 14 RSCVLLETGG--TRILLDCGLFPGDPSPERPLLPPFP-KVDAVLLTHAHLDHIGALPYLVRNGFEGPVYATPPTAALLKV 90 (427) T ss_pred CEEEEEEECC--CEEEECCCCCCCCCCCCCCCCCCCC-CCCEEEEECCCHHHHCCCHHHHHHCCCCCEEECHHHHHHHHH T ss_conf 5468987479--7699658988576765576788888-767799600652432346788763127865753518999998 Q ss_pred CHHHHHHHHHC--------CCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCH Q ss_conf 12344443200--------23466667418964783997620123310122247210000012--3432111100 Q gi|254780758|r 92 KVDEQARLLSI--------RMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNF--ENNFAHLGDT 156 (216) Q Consensus 92 ~~~~~~~~~~~--------~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~--~~~~lftGD~ 156 (216) ...+....... ..-.......+.+..++.+++++.+++++.+ ||.+|+.++++. .++++||||. T Consensus 91 ~l~d~~~~~~~~~~~~~~~~d~~~~~~~~~~~~yg~~~~v~~~~v~~~~A-GHilGsa~~~le~~~~~ilytGD~ 164 (427) T COG1236 91 LLGDSLKLAEGPDKPPYSEEDVERVPDLIRPLPYGEPVEVGGVKVTFYNA-GHILGSAAILLEVDGGRILYTGDV 164 (427) T ss_pred HHHCCCCCCCCCCCCCCCHHHHHHHHHCEEEECCCCEEEECCEEEEEECC-CCCCCEEEEEEEECCCEEEEEECC T ss_conf 73111010357777765434566555222770589817865289999717-974102799999689479998236 No 18 >PRK02113 putative hydrolase; Provisional Probab=99.28 E-value=3.7e-11 Score=84.63 Aligned_cols=133 Identities=17% Similarity=0.229 Sum_probs=79.2 Q ss_pred CCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCC-CEEEEEECCCCCCCCCCCHHHHHH---HCCCEEECCHHHHHHH Q ss_conf 571699998899849998389999999999997499-752999869885210110567886---0795540584389884 Q gi|254780758|r 15 KQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNF-HVKQIWITHGHVDHVGGAAQLKED---LSLTIIGPHKDDAAMM 90 (216) Q Consensus 15 ~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~-~i~~Ii~TH~H~DH~gg~~~l~~~---~~~~i~~~~~~~~~~~ 90 (216) +++|.+++..++ +.+|||+|++. ..++-+.++ ++++|++||.|.||++|+..|+.- -+.++|+.......+. T Consensus 33 R~r~s~lv~~~~-~~iLID~gpD~---r~Q~lr~~i~~iDavllTH~H~DHi~GlddLR~f~~~~~ipvy~~~~t~~~l~ 108 (252) T PRK02113 33 RLRTSALVETEG-ARILIDCGPDF---REQMLRLPFGKIDAVLITHEHYDHVGGLDDLRPFCRFGEVPIYAEQYVAERLR 108 (252) T ss_pred EEEEEEEEEECC-CEEEEECCCCH---HHHHHHCCCCCCCEEEEECCCHHHHCCHHHHHHHHCCCCCCEEECHHHHHHHH T ss_conf 246489999899-27999689138---99998779876148999745704426989999885458744665789999998 Q ss_pred CCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCH Q ss_conf 012344443200234666674189647839976201233101222472-100000123432111100 Q gi|254780758|r 91 GKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSP-GHVIYVNFENNFAHLGDT 156 (216) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~-g~~~~~~~~~~~lftGD~ 156 (216) ........ .......+....+.++.+..+.+++.+++.+.++ |.. ..+.|.+ +++.+..|+ T Consensus 109 ~~~~y~f~--~~~~~~~~~l~~~~i~~~~~f~i~~~~v~p~~v~-Hg~~~~~G~ri--g~~aY~tD~ 170 (252) T PRK02113 109 SRMPYCFV--EHSYPGVPNIPLREIEPDRPFLVNHTEVTPLRVM-HGKLPILGYRI--GKMAYITDM 170 (252) T ss_pred HHCHHHHC--CCCCCCCCCCEEEEECCCCCEEECEEEEEEEEEC-CCCCEECCEEC--CCEEEECCC T ss_conf 63311214--6888886563157715999467420799878506-88821644750--446993787 No 19 >COG1237 Metal-dependent hydrolases of the beta-lactamase superfamily II [General function prediction only] Probab=99.24 E-value=1.4e-10 Score=81.19 Aligned_cols=66 Identities=23% Similarity=0.402 Sum_probs=51.1 Q ss_pred CEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCC---EEEEEECCCCCCCCCCCHHHHHHH--CCCEEECCH Q ss_conf 16999988998499983899999999999974997---529998698852101105678860--795540584 Q gi|254780758|r 17 NCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFH---VKQIWITHGHVDHVGGAAQLKEDL--SLTIIGPHK 84 (216) Q Consensus 17 N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~---i~~Ii~TH~H~DH~gg~~~l~~~~--~~~i~~~~~ 84 (216) -+++|.++ +..+|+|+|.++..++..++..|+. ++++++||.|+||+||+.++.+.- +.++|+... T Consensus 23 fS~LVE~~--~~riLFDtG~~~~~ll~Na~~lgvd~~did~vvlSHgH~DH~GGL~~~~~~~~~~i~v~ahp~ 93 (259) T COG1237 23 FSALVEDE--GTRILFDTGTDSDVLLHNARLLGVDLRDIDAVVLSHGHYDHTGGLPYLLEENNPGIPVYAHPD 93 (259) T ss_pred EEEEEECC--CEEEEEECCCCCHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCHHHHHHCCCCCCEEEECHH T ss_conf 49999758--717999579985788877998199813474799857873134756768752677850786858 No 20 >COG1782 Predicted metal-dependent RNase, consists of a metallo-beta-lactamase domain and an RNA-binding KH domain [General function prediction only] Probab=99.19 E-value=4.4e-11 Score=84.16 Aligned_cols=150 Identities=17% Similarity=0.142 Sum_probs=88.7 Q ss_pred EEEEEECCCCC---CCEEEEEECCCCEEEEECCCCCHHH-------HHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHH- Q ss_conf 59999708765---7169999889984999838999999-------999999749975299986988521011056788- Q gi|254780758|r 5 SFHIVQVTPFK---QNCTFLFDEESLEAVVVDPGGDLDI-------IKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKE- 73 (216) Q Consensus 5 ~~~~i~v~~~~---~N~~li~~~~~~~~ilID~G~~~~~-------i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~- 73 (216) -+++..+|.|+ --|+++-.++++ +|+|||.+... .++.-+-.-..+++|++||+|.||+|-++.|-+ T Consensus 180 wvRvt~LGg~~EVGRSa~lv~T~eSr--VLlDcG~n~a~~~~~~~Pyl~vpE~~~~~lDAViiTHAHLDH~G~lP~LfkY 257 (637) T COG1782 180 WVRVTALGGFREVGRSALLVSTPESR--VLLDCGVNVAGNGEDAFPYLDVPEFQPDELDAVIITHAHLDHCGFLPLLFKY 257 (637) T ss_pred EEEEEEECCCHHCCCEEEEEECCCCE--EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHC T ss_conf 59999612411206256997368833--8996365677886435744556102512053699850234434521456651 Q ss_pred HHCCCEEECCHHHHHHHCCHHHHHHHHHCCCCCCC---------CCCEEEECCCCEEEE-ECCCCCCCCCCCCCCCCCCC Q ss_conf 60795540584389884012344443200234666---------674189647839976-20123310122247210000 Q gi|254780758|r 74 DLSLTIIGPHKDDAAMMGKVDEQARLLSIRMNARN---------ASSDRWLQDGDTLLL-GTHIFKVFHCPGHSPGHVIY 143 (216) Q Consensus 74 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~l~~gd~i~l-g~~~~~vi~tPGHt~g~~~~ 143 (216) .|..|+|+..+....+.--...............+ ..-...+..|+.-++ .+.+++++-+ ||--||-+. T Consensus 258 gy~GPVY~T~PTRDlm~LLq~Dyi~va~keg~~ppY~~k~v~~~lkhtItldYgevTDIaPDirLTf~NA-GHILGSA~~ 336 (637) T COG1782 258 GYDGPVYCTPPTRDLMVLLQLDYIEVAEKEGGEPPYESKDVRKVLKHTITLDYGEVTDIAPDIRLTFYNA-GHILGSAMA 336 (637) T ss_pred CCCCCEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHEEEEECCCCCCCCCCCCEEEEECC-CCHHCCEEE T ss_conf 7888743179718899999887999999649999988789998875454102576244687617898416-410131015 Q ss_pred CC--CCC--CCCCCCCHH Q ss_conf 01--234--321111000 Q gi|254780758|r 144 VN--FEN--NFAHLGDTL 157 (216) Q Consensus 144 ~~--~~~--~~lftGD~l 157 (216) .+ .++ +++||||.= T Consensus 337 HlHIGdGlyNi~yTGDfk 354 (637) T COG1782 337 HLHIGDGLYNIVYTGDFK 354 (637) T ss_pred EEEECCCCEEEEEECCCC T ss_conf 888648713579943444 No 21 >TIGR02651 RNase_Z ribonuclease Z; InterPro: IPR013471 All tRNA molecules are synthesised as long precursors with extra sequence at both the 5' and 3' ends, which undergo several processing steps to form the mature tRNA molecule. In many bacteria the tRNA precursors do not contain the CCA sequence which forms the 3' end of the mature tRNA and is essential for aminoacylation. In these organisms ribonucleotide Z cleaves the precursor to form a 3' end to which CCA is added. Species where all tRNA sequences already have the CCA tail, such as E. coli, do not generally posses this form of ribonuclease Z. ; GO: 0016891 endoribonuclease activity producing 5'-phosphomonoesters, 0042779 removal of tRNA 3'-trailer sequence. Probab=99.18 E-value=6e-11 Score=83.39 Aligned_cols=128 Identities=23% Similarity=0.210 Sum_probs=78.7 Q ss_pred CCEEEEEEC-CCCEEEEECCCCCHHHHHHHHHHCCC---CEEEEEECCCCCCCCCCCHHHHHHH---------CCCEEEC Q ss_conf 716999988-99849998389999999999997499---7529998698852101105678860---------7955405 Q gi|254780758|r 16 QNCTFLFDE-ESLEAVVVDPGGDLDIIKDVIQSRNF---HVKQIWITHGHVDHVGGAAQLKEDL---------SLTIIGP 82 (216) Q Consensus 16 ~N~~li~~~-~~~~~ilID~G~~~~~i~~~l~~~~~---~i~~Ii~TH~H~DH~gg~~~l~~~~---------~~~i~~~ 82 (216) ..+.+|... +.++.+|+||| |....++.+.++ +|+.|++||.|.||+.|++=|.+.. +..||++ T Consensus 18 ~~si~l~~~~e~g~~~LFDCG---EGTQ~Q~~~~~~~~~~i~~IFITH~HGDHi~GLPGLl~t~sf~G~~r~~pL~IyGP 94 (327) T TIGR02651 18 VSSIALKLEDERGELWLFDCG---EGTQRQMLRSGISPMKIDRIFITHLHGDHILGLPGLLSTMSFQGGGRKEPLTIYGP 94 (327) T ss_pred CCEEEEECHHHCCCEEEEECC---CHHHHHHHHCCCCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEECC T ss_conf 758999602242874255477---18999999716886311136870663145542368888310378886445288687 Q ss_pred CHHHHHHHCCHHHHHHHHHCCCCCCCCCCEEE-----------ECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 84389884012344443200234666674189-----------647839976201233101222472100000123 Q gi|254780758|r 83 HKDDAAMMGKVDEQARLLSIRMNARNASSDRW-----------LQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNFE 147 (216) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~~ 147 (216) .....+.............+.........+.. +.++..++-++.+|+.+.+ -|+-.|+.|.+.+ T Consensus 95 ~G~~e~~~~~l~~~~~~~~y~i~~~E~~~~~~~P~~~~~k~k~l~~~~~~~~~~f~v~~~~l-~H~~~~~GY~~~E 169 (327) T TIGR02651 95 PGIKEFIETSLRVSYTYLNYPIKIHEIEEKGVLPEFVVYKEKLLEDGLVFEDDGFKVEAFPL-DHSIPSLGYRFEE 169 (327) T ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCHHHHHCCCEEECCEEEECCCEEEEEEEC-CCCCEEEEEEEEE T ss_conf 44799999999862237884159997268997731231025612023267658448999950-5784146789986 No 22 >COG2015 Alkyl sulfatase and related hydrolases [Secondary metabolites biosynthesis, transport, and catabolism] Probab=99.18 E-value=5.2e-10 Score=77.95 Aligned_cols=196 Identities=19% Similarity=0.273 Sum_probs=112.0 Q ss_pred EEEECCCCC-CCEEEEEECCCCEEEEECCCCCHHH---HHHHHHH-CC-CCEEEEEECCCCCCCCCCCHHHHHHH----- Q ss_conf 999708765-7169999889984999838999999---9999997-49-97529998698852101105678860----- Q gi|254780758|r 7 HIVQVTPFK-QNCTFLFDEESLEAVVVDPGGDLDI---IKDVIQS-RN-FHVKQIWITHGHVDHVGGAAQLKEDL----- 75 (216) Q Consensus 7 ~~i~v~~~~-~N~~li~~~~~~~~ilID~G~~~~~---i~~~l~~-~~-~~i~~Ii~TH~H~DH~gg~~~l~~~~----- 75 (216) .+.+|-.|+ .|..++.++.+. ++|||--..+. .++...+ ++ .+|++|+.||+|.||.||..-+..+- T Consensus 116 ~iYQVRG~DisNITfveGdtg~--IViDpL~t~~tA~aAldl~~~~~g~rPV~aVIYtHsH~DHfGGVkGiv~eadV~sG 193 (655) T COG2015 116 GIYQVRGFDISNITFVEGDTGW--IVIDPLVTPETAKAALDLYNQHRGQRPVVAVIYTHSHSDHFGGVKGIVSEADVKSG 193 (655) T ss_pred CEEEEECCCCCCEEEECCCCCE--EEECCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCEEECCCHHHCCCC T ss_conf 6357602013322798078526--99815678288999999999765889748999612542013771001578884667 Q ss_pred CCCEEECCHH------HHHHHCCHHH--HHHHHHC-------------------CCCCCCCCCEEEE-CCCCEEEEECCC Q ss_conf 7955405843------8988401234--4443200-------------------2346666741896-478399762012 Q gi|254780758|r 76 SLTIIGPHKD------DAAMMGKVDE--QARLLSI-------------------RMNARNASSDRWL-QDGDTLLLGTHI 127 (216) Q Consensus 76 ~~~i~~~~~~------~~~~~~~~~~--~~~~~~~-------------------~~~~~~~~~~~~l-~~gd~i~lg~~~ 127 (216) .++++++..- +..+..+... ....++. ..+.....|.+.+ +.|+++.+.|.+ T Consensus 194 kV~iiAP~GFme~avaENvlAGnaM~RRa~YqyG~~Lp~g~~G~V~~giGk~la~G~vsLiaPT~~I~~~gE~~~iDGV~ 273 (655) T COG2015 194 KVQIIAPAGFMEEAVAENVLAGNAMSRRAQYQYGTLLPPGAQGQVGCGIGKTLATGEVSLIAPTKIIEETGETLTIDGVE 273 (655) T ss_pred CEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCEEEEECEE T ss_conf 41376460578999987544105676666643123358886675040103322468525645567862367258870507 Q ss_pred CCCCCCCC-CCCCCCCCCCCCCCCCCCCCHHHHCCCCC-CCCC---CCCHHHHHHHHHHHHHCCCCCE-EEEECCCCCCC Q ss_conf 33101222-47210000012343211110000013566-3467---9899999999999985199978-99848879888 Q gi|254780758|r 128 FKVFHCPG-HSPGHVIYVNFENNFAHLGDTLFRSSIGR-TDIL---HGDYQQLINSINNKILPLGDEI-SFVCGHGPNST 201 (216) Q Consensus 128 ~~vi~tPG-Ht~g~~~~~~~~~~~lftGD~lf~~~~G~-~d~~---~~d~~~~~~sl~~~l~~l~~~~-~v~PgHG~~tt 201 (216) +++--||| .+|-..-+|+|..++|......- ++.-. .-+. --|..+|-+-|.+-+..+.+++ .++-.|++|+ T Consensus 274 ~~Fq~tPgtEaPAEM~~y~P~~kaL~mAEnat-~~lHNlytlRGa~vRD~~~Ws~ylneal~~fg~~adVmfa~H~WP~- 351 (655) T COG2015 274 FEFQMTPGTEAPAEMHFYFPRLKALCMAENAT-HTLHNLYTLRGAEVRDAKAWSKYLNEALDMFGDDADVMFASHTWPR- 351 (655) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHHHHCC-CCCEEEEECCCCEECCHHHHHHHHHHHHHHHCCCCCEEEEECCCCC- T ss_conf 89962889888377751226788888775221-2100001015510035677888899999984455537986068874- Q ss_pred HHHHH Q ss_conf 89998 Q gi|254780758|r 202 IGRER 206 (216) Q Consensus 202 ~~~e~ 206 (216) +|.++ T Consensus 352 wG~~~ 356 (655) T COG2015 352 WGNAH 356 (655) T ss_pred CCHHH T ss_conf 22378 No 23 >PRK11244 phnP carbon-phosphorus lyase complex accessory protein; Provisional Probab=99.14 E-value=1.1e-10 Score=81.92 Aligned_cols=124 Identities=20% Similarity=0.219 Sum_probs=72.5 Q ss_pred CEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHH--HHCCCEEECCHHHHHHHCCHH Q ss_conf 169999889984999838999999999999749975299986988521011056788--607955405843898840123 Q gi|254780758|r 17 NCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKE--DLSLTIIGPHKDDAAMMGKVD 94 (216) Q Consensus 17 N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~--~~~~~i~~~~~~~~~~~~~~~ 94 (216) .|.+++..+ +..+|||+|. ..+.+.+.. -+|++|++||.|.||++|+..|+. ..+.++|+..... .+.. . T Consensus 37 r~s~lv~~~-~~~iLID~g~--~dl~~~~~~--~~iDaVllTH~H~DHi~GlddlR~~~~~~ipvy~~~~~~-~~~~-l- 108 (250) T PRK11244 37 PCSALLEFN-GARTLIDAGL--HDLAERFPP--GSLQAFLLTHYHMDHVQGLFPLRWGVGDVIPVYGPPDPE-GCDD-L- 108 (250) T ss_pred EEEEEEEEC-CEEEEEECCC--HHHHHHCCC--CCCCEEEEECCCHHHHCCHHHHHHCCCCCEEEEECCCHH-HHHH-H- T ss_conf 348999989-9279998980--899975555--687789982575334337688871379835677377577-8999-8- Q ss_pred HHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCHH Q ss_conf 44443200234666674189647839976201233101222472100000123--4321111000 Q gi|254780758|r 95 EQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNFE--NNFAHLGDTL 157 (216) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~~--~~~lftGD~l 157 (216) .+..+ .......+..++.+.+++.+++.++.. |..-++.|.+.. .++++++|+- T Consensus 109 --~~~~~------~~~~~~~~~~~~~~~i~g~~itp~~~~-Hg~~~~Gy~i~~~g~~~aY~tD~~ 164 (250) T PRK11244 109 --FKHPG------ILDFSHPLEAFEPFDLGGLQVTPLPLN-HSKLTFGYLLETAHSRFAYLTDTV 164 (250) T ss_pred --HHCCC------CCCCCCCCCCCCCEEECCEEEEEEEEC-CCCCCEEEEEECCCCEEEEECCCC T ss_conf --50788------434236658888668688389999977-889736999964893299966778 No 24 >PRK05184 pyrroloquinoline quinone biosynthesis protein PqqB; Provisional Probab=99.09 E-value=5.6e-10 Score=77.75 Aligned_cols=76 Identities=16% Similarity=0.251 Sum_probs=55.4 Q ss_pred EEEEEECCCCEEEEECCCCCHHHHHHHHHH-------CCCCEEEEEECCCCCCCCCCCHHHHHHHCCCEEECCHHHHHHH Q ss_conf 699998899849998389999999999997-------4997529998698852101105678860795540584389884 Q gi|254780758|r 18 CTFLFDEESLEAVVVDPGGDLDIIKDVIQS-------RNFHVKQIWITHGHVDHVGGAAQLKEDLSLTIIGPHKDDAAMM 90 (216) Q Consensus 18 ~~li~~~~~~~~ilID~G~~~~~i~~~l~~-------~~~~i~~Ii~TH~H~DH~gg~~~l~~~~~~~i~~~~~~~~~~~ 90 (216) |.+.+..+++.-+|||+.+|...-+....+ ++.+|++|++||+|.||+.|+..|++..+.++|+.......+. T Consensus 40 sSi~v~~dg~~~~LiDasPDlr~QL~~~~~l~p~~~~r~s~i~~V~lTHaH~dH~~GL~~LRe~~~i~vya~~~~~~~L~ 119 (302) T PRK05184 40 SSIAVSADGEDWVLVNASPDIRQQIRATPALQPARGLRDTPIAAVVLTDGQIDHVTGLLSLREGQPFPLYATPMVLEDLS 119 (302) T ss_pred EEEEEECCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCCCHHCCHHHHHCCCCCCEEECHHHHHHHH T ss_conf 48999828986999989938999986514417543445787348998657500331788871569720674689999998 Q ss_pred CCH Q ss_conf 012 Q gi|254780758|r 91 GKV 93 (216) Q Consensus 91 ~~~ 93 (216) .+. T Consensus 120 ~~~ 122 (302) T PRK05184 120 TGF 122 (302) T ss_pred HCC T ss_conf 639 No 25 >TIGR03307 PhnP phosphonate metabolism protein PhnP. This family of proteins found in operons encoding phosphonate C-P lyase systems as is observed in E. coli and is a member of the metallo-beta-lactamase superfamily (pfam00753). As defined by this model, all instances of this protein are associated with the C-P lyase, but not all genomes containing the C-P lyase system contain phnP. Probab=99.05 E-value=4.4e-10 Score=78.40 Aligned_cols=122 Identities=20% Similarity=0.244 Sum_probs=70.1 Q ss_pred EEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHH--HHCCCEEECCHHHHHHHCCHHH Q ss_conf 69999889984999838999999999999749975299986988521011056788--6079554058438988401234 Q gi|254780758|r 18 CTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKE--DLSLTIIGPHKDDAAMMGKVDE 95 (216) Q Consensus 18 ~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~--~~~~~i~~~~~~~~~~~~~~~~ 95 (216) |.+++..++ +-+|||+|. ..+.+.+... +|++|++||.|.||+.|+..|+- ..+.++|+...... +. . T Consensus 38 ~s~li~~~~-~~iLID~g~--pdlr~ql~~~--~IdaVllTH~H~DHi~GlddlR~~~~~~ipvy~~~~~~~-~~----~ 107 (249) T TIGR03307 38 CSAVIEVNG-VRTLIDAGL--TDLAERFPPG--SLQAILLTHYHMDHVQGLFPLRWGMGEPIPVYGPPDPEG-CD----D 107 (249) T ss_pred EEEEEEECC-CEEEEECCC--HHHHHHCCCC--CCCEEEEECCCHHHHCCCHHHHHCCCCCEEEEECCCHHH-HH----H T ss_conf 189999899-289998997--7999860516--878899965334544662445521688657886887889-99----9 Q ss_pred HHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCH Q ss_conf 444320023466667418964783997620123310122247210000012--3432111100 Q gi|254780758|r 96 QARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNF--ENNFAHLGDT 156 (216) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~--~~~~lftGD~ 156 (216) ..+..+. ......+...+.+.+++.++..+++. |..-.+.|.+. ..+++++.|+ T Consensus 108 ~~~~~~~------~~~~~~~~~~~~~~i~g~~i~p~~~~-H~~~~~Gy~i~~~g~~~aY~tD~ 163 (249) T TIGR03307 108 LFKHPGI------LDFSKPLLAFEPFHLGGLRVTPVPLV-HSKLTFGYLLETDGQRLAYLTDT 163 (249) T ss_pred HHCCCCC------CCCCCCCCCCCCEEECCEEEEEEEEE-CCCCCEEEEEECCCCEEEEECCC T ss_conf 8626984------24346547989677398189999996-07864699995189559996788 No 26 >PRK02126 ribonuclease Z; Provisional Probab=99.01 E-value=3.1e-09 Score=73.46 Aligned_cols=85 Identities=20% Similarity=0.249 Sum_probs=59.2 Q ss_pred CCCCEEEEEECCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHH----- Q ss_conf 977359999708765716999988998499983899999999999974997529998698852101105678860----- Q gi|254780758|r 1 MSKLSFHIVQVTPFKQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKEDL----- 75 (216) Q Consensus 1 m~~~~~~~i~v~~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~~----- 75 (216) |.+|---.+..++|+.++.+|.-...++++|+|||-..+.....+ .+|++|++||.|.||+.|++-|.+.+ T Consensus 1 ~~~~~~~~~~n~~~~Dp~l~v~~~~~~~~lLfDcGe~~~l~~~~l----~~Id~IFITHlHgDH~~GL~gLLr~~~gr~~ 76 (334) T PRK02126 1 MFRLLEPRLVNGPFDDPGLYVDFLFERRALLFDLGDLHHLPPREL----LRISHIFVSHTHMDHFIGFDRLLRLCLGRPR 76 (334) T ss_pred CCCCCCCEEECCCCCCCEEEEEECCCCCEEEEECCCCCCCCHHHH----CCCCEEEEECCCCCHHCCCHHHHHHHCCCCC T ss_conf 965454167568899987999716798189998998430064564----6547699826762212357999986279998 Q ss_pred CCCEEECCHHHHHH Q ss_conf 79554058438988 Q gi|254780758|r 76 SLTIIGPHKDDAAM 89 (216) Q Consensus 76 ~~~i~~~~~~~~~~ 89 (216) +..+|++......+ T Consensus 77 ~L~IyGP~Gl~~~v 90 (334) T PRK02126 77 RLRLFGPPGFADQV 90 (334) T ss_pred CEEEECCCCHHHHH T ss_conf 62999982489999 No 27 >KOG1136 consensus Probab=98.91 E-value=2.7e-09 Score=73.83 Aligned_cols=150 Identities=13% Similarity=0.154 Sum_probs=91.2 Q ss_pred CEEEEEECCCCC---CCEEEEEECCCCEEEEECCCCCH-----HHH--HHHHHHCC---CCEEEEEECCCCCCCCCCCHH Q ss_conf 359999708765---71699998899849998389999-----999--99999749---975299986988521011056 Q gi|254780758|r 4 LSFHIVQVTPFK---QNCTFLFDEESLEAVVVDPGGDL-----DII--KDVIQSRN---FHVKQIWITHGHVDHVGGAAQ 70 (216) Q Consensus 4 ~~~~~i~v~~~~---~N~~li~~~~~~~~ilID~G~~~-----~~i--~~~l~~~~---~~i~~Ii~TH~H~DH~gg~~~ 70 (216) +.++++|+|+=| ..|.++...+ +-+++|||... .+. ..++...| .-++-|++||+|.||+|++++ T Consensus 2 p~i~v~pLGAGQdvGrSCilvsi~G--k~iM~DCGMHMG~nD~rRfPdFSyI~~~g~~~~~idCvIIsHFHlDHcGaLPy 79 (501) T KOG1136 2 PEIKVTPLGAGQDVGRSCILVSIGG--KNIMFDCGMHMGFNDDRRFPDFSYISKSGRFTDAIDCVIISHFHLDHCGALPY 79 (501) T ss_pred CCCEEEECCCCCCCCCEEEEEEECC--CEEEEECCCCCCCCCCCCCCCCEEECCCCCCCCCEEEEEEEEECCCCCCCCCC T ss_conf 8514775467762574279999779--07899545455557656688852366888745404589985321100466400 Q ss_pred HHHH--HCCCEEECCHHHHH---HHCCHHHHHHHHHCCCCCCCC---------CCEEEECCCCEEEEE-CCCCCCCCCCC Q ss_conf 7886--07955405843898---840123444432002346666---------741896478399762-01233101222 Q gi|254780758|r 71 LKED--LSLTIIGPHKDDAA---MMGKVDEQARLLSIRMNARNA---------SSDRWLQDGDTLLLG-THIFKVFHCPG 135 (216) Q Consensus 71 l~~~--~~~~i~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~---------~~~~~l~~gd~i~lg-~~~~~vi~tPG 135 (216) +.+. |+.|+|+..+..+. +++....... ....+...+ .....+.-.+++++. ++.++.+.+ | T Consensus 80 fsEv~GY~GPIYMt~PTkaicPvlLeDyRkv~v--d~kGe~n~FT~q~I~nCMKKVv~i~l~qt~~vD~dl~IrayYA-G 156 (501) T KOG1136 80 FSEVVGYDGPIYMTYPTKAICPVLLEDYRKVAV--DRKGESNFFTTQDIKNCMKKVVAIDLHQTIQVDEDLQIRAYYA-G 156 (501) T ss_pred HHHHHCCCCCEEEECCHHHHCHHHHHHHHHHHC--CCCCCCCCEEHHHHHHHHHHEEEEEEHHEEEECCCCEEEEEEC-C T ss_conf 576617788647855635305388988898761--6567533200898999876506733022077255413643312-3 Q ss_pred CCCCCCCCCCC--CCCCCCCCCHHH Q ss_conf 47210000012--343211110000 Q gi|254780758|r 136 HSPGHVIYVNF--ENNFAHLGDTLF 158 (216) Q Consensus 136 Ht~g~~~~~~~--~~~~lftGD~lf 158 (216) |.-|..+||.. ++.+++|||.=+ T Consensus 157 HVLGAaMf~ikvGd~svvYTGDYnm 181 (501) T KOG1136 157 HVLGAAMFYIKVGDQSVVYTGDYNM 181 (501) T ss_pred CCCCEEEEEEEECCEEEEEECCCCC T ss_conf 2000057999845535898467667 No 28 >KOG4736 consensus Probab=98.91 E-value=1.7e-09 Score=74.91 Aligned_cols=153 Identities=26% Similarity=0.369 Sum_probs=86.8 Q ss_pred CCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCC---CEEEEEECCCCCCCCCCCHHHHHHHCCCEEECCHHHHHHHC Q ss_conf 571699998899849998389999999999997499---75299986988521011056788607955405843898840 Q gi|254780758|r 15 KQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNF---HVKQIWITHGHVDHVGGAAQLKEDLSLTIIGPHKDDAAMMG 91 (216) Q Consensus 15 ~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~---~i~~Ii~TH~H~DH~gg~~~l~~~~~~~i~~~~~~~~~~~~ 91 (216) +.+..++.| +...+++|+|.. .+.+.++ +++.+++||.+++|+|++..+... +++ .+. ..+ T Consensus 94 ~~~~tl~~d--~~~v~v~~~gls------~lak~~vt~d~i~~vv~t~~~~~hlgn~~~f~~s---p~l-~~s--~e~-- 157 (302) T KOG4736 94 QGQITLVVD--GGDVVVVDTGLS------VLAKEGVTLDQIDSVVITHKSPGHLGNNNLFPQS---PIL-YHS--MEY-- 157 (302) T ss_pred HCCCCEEEC--CCCEEEEECCCC------HHHHCCCCHHHCCEEEEECCCCCCCCCCCCCCCC---HHH-HHH--HHH-- T ss_conf 204540312--883499955872------2421476753455168732475542653346677---777-664--560-- Q ss_pred CHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCC----CCCCCCCCCHHHHCCCCCCCC Q ss_conf 1234444320023466667418964783997620123310122247210000012----343211110000013566346 Q gi|254780758|r 92 KVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNF----ENNFAHLGDTLFRSSIGRTDI 167 (216) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~----~~~~lftGD~lf~~~~G~~d~ 167 (216) ....+.+ ..|+++..+.++ -.+++..|||||+.+++..+. -+.+.++|| ||++. +-.|- T Consensus 158 -------------~gr~~~p-t~l~e~~~~~l~-~~~~V~~TpGht~~~isvlv~n~~~~GTv~itGD-Lf~~~-~dlde 220 (302) T KOG4736 158 -------------IGRHVTP-TELDERPYLKLS-PNVEVWKTPGHTQHDISVLVHNVDLYGTVAITGD-LFPRE-EDLDE 220 (302) T ss_pred -------------CCCCCCH-HHHCCCCCCCCC-CCEEEEECCCCCCCCEEEEEEEECCCCEEEEEEE-CCCCC-CCCCC T ss_conf -------------3785674-240337740017-7546750799987635799985045524899853-24577-65443 Q ss_pred C------CC--CHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHH Q ss_conf 7------98--999999999999851999789984887988889 Q gi|254780758|r 168 L------HG--DYQQLINSINNKILPLGDEISFVCGHGPNSTIG 203 (216) Q Consensus 168 ~------~~--d~~~~~~sl~~~l~~l~~~~~v~PgHG~~tt~~ 203 (216) + ++ |...-+++ +++..-|++ ++.||||++..+. T Consensus 221 ~d~i~~~e~s~d~~~kr~~-r~~~v~l~D--~ivpgHg~~f~v~ 261 (302) T KOG4736 221 KDDIMSQEGSEDNAAKRQS-RNRYVCLAD--WIVPGHGPPFRVL 261 (302) T ss_pred HHHHHHHCCCCCHHHHHHH-HHCEEEEEE--EEECCCCCCEEEC T ss_conf 4556541368744454653-411798743--4533888722012 No 29 >PRK00685 metal-dependent hydrolase; Provisional Probab=98.88 E-value=5.2e-09 Score=72.16 Aligned_cols=163 Identities=21% Similarity=0.218 Sum_probs=90.5 Q ss_pred CCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHHCCCEEECCHHHHHHH Q ss_conf 08765716999988998499983899999999999974997529998698852101105678860795540584389884 Q gi|254780758|r 11 VTPFKQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKEDLSLTIIGPHKDDAAMM 90 (216) Q Consensus 11 v~~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~~~~~i~~~~~~~~~~~ 90 (216) +..++-.|++|... +..+||||=..... ...++....+++.|++||.|.||..-...+.++.+++++++......+ T Consensus 3 i~wlGha~~li~~~--g~~iLiDP~~s~~~-~~~~~~~~~~~D~VliSH~H~DHl~d~~~~~~~~~~~~i~p~~~~~~~- 78 (228) T PRK00685 3 ITWLGHSAFRIETG--GKKILIDPFLTGNP-KADLKPEDVKVDHILLTHGHGDHVGDTVEIAKRTGAVVVANFELATYL- 78 (228) T ss_pred EEEECCCEEEEEEC--CEEEEECCCCCCCC-CCCCCCCCCCCCEEEECCCCCCCCCCHHHHHHHCCCEEEECHHHHHHH- T ss_conf 99940569999989--99999888668998-778885446799999668883435658888751599899636899999- Q ss_pred CCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCC------------C---CCCCCCCCCCCCCCCC Q ss_conf 012344443200234666674189647839976201233101222472------------1---0000012343211110 Q gi|254780758|r 91 GKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSP------------G---HVIYVNFENNFAHLGD 155 (216) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~------------g---~~~~~~~~~~~lftGD 155 (216) ... + .....+..++++++++.+++.+++ =|.+ | +.++-.++.++.|+|| T Consensus 79 ~~~-------g--------~~~~~~~~~~~~~~~gi~i~~vpa-~H~~~~~~~~~~~~~~g~~~g~vi~~~g~~iy~~GD 142 (228) T PRK00685 79 GWQ-------G--------VKTHPMNIGGTVEFDGGKVKFTQA-LHSSSYITEDGVSIYLGNPAGIVFTFEGKTIYHAGD 142 (228) T ss_pred HHC-------C--------CCEEEECCCCEEEECCEEEEEEEC-CCCCCCCCCCCCCEECCCCEEEEEEECCEEEEEECC T ss_conf 865-------9--------818993278749978889999966-546665577886268778779999979938998337 Q ss_pred HHHHCC---CC---CCCC---CCC-----CHHHHHHHHHHHHHCCCCCEEEEECCCC Q ss_conf 000013---56---6346---798-----9999999999998519997899848879 Q gi|254780758|r 156 TLFRSS---IG---RTDI---LHG-----DYQQLINSINNKILPLGDEISFVCGHGP 198 (216) Q Consensus 156 ~lf~~~---~G---~~d~---~~~-----d~~~~~~sl~~~l~~l~~~~~v~PgHG~ 198 (216) +-+... ++ ++|+ |-| |+.+. + +....+.+.+ ++|-|=. T Consensus 143 T~~~~~~~~i~~~~~pDlallpiGg~~~m~p~eA---~-~~~~~l~~k~-~IP~H~g 194 (228) T PRK00685 143 TDLFSDMKLIGELHQPDIALLPIGDRFTMGPADA---A-LAAEWLKPKT-VVPMHYN 194 (228) T ss_pred CCCCHHHHHHHHHCCCCEEEEECCCCCCCCHHHH---H-HHHHHHCCCE-EEECCCC T ss_conf 6665579997765599699991899610599999---9-9999739999-9981888 No 30 >COG1234 ElaC Metal-dependent hydrolases of the beta-lactamase superfamily III [General function prediction only] Probab=98.79 E-value=3.8e-08 Score=67.15 Aligned_cols=76 Identities=22% Similarity=0.315 Sum_probs=46.1 Q ss_pred CEEEEEECCCC-----C-CCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCC---CEEEEEECCCCCCCCCCCHHHHHH Q ss_conf 35999970876-----5-71699998899849998389999999999997499---752999869885210110567886 Q gi|254780758|r 4 LSFHIVQVTPF-----K-QNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNF---HVKQIWITHGHVDHVGGAAQLKED 74 (216) Q Consensus 4 ~~~~~i~v~~~-----~-~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~---~i~~Ii~TH~H~DH~gg~~~l~~~ 74 (216) |++.++=.+.- + ...+++.. .++..|+|||.. ....+.+.++ ++++|++||.|.||+.|++.|... T Consensus 2 m~i~fLGtg~~~Pt~~r~~~s~ll~~--~~~~~L~DcGeG---t~~~l~~~~~~~~~i~~IfITH~H~DHi~gL~~ll~~ 76 (292) T COG1234 2 MEITFLGTGGAVPTKDRNVSSILLRL--EGEKFLFDCGEG---TQHQLLRAGLPPRKIDAIFITHLHGDHIAGLPGLLVS 76 (292) T ss_pred CEEEEEECCCCCCCCCCCCCEEEEEE--CCEEEEEECCHH---HHHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHH T ss_conf 78999966998774767766799981--992799977867---9999998038820268899827863423777899996 Q ss_pred H-------CCCEEECCH Q ss_conf 0-------795540584 Q gi|254780758|r 75 L-------SLTIIGPHK 84 (216) Q Consensus 75 ~-------~~~i~~~~~ 84 (216) . +..+|++.. T Consensus 77 ~~~~~~~~~l~iygP~g 93 (292) T COG1234 77 RSFRGRREPLKIYGPPG 93 (292) T ss_pred HHCCCCCCCCEEECCCC T ss_conf 14018999824988853 No 31 >KOG1137 consensus Probab=98.78 E-value=1.8e-09 Score=74.85 Aligned_cols=149 Identities=15% Similarity=0.176 Sum_probs=93.4 Q ss_pred EEEEEECCC---CCCCEEEEEECCCCEEEEECCCCCHH----HHHHHHHHCCC-CEEEEEECCCCCCCCCCCHHHHHHHC Q ss_conf 599997087---65716999988998499983899999----99999997499-75299986988521011056788607 Q gi|254780758|r 5 SFHIVQVTP---FKQNCTFLFDEESLEAVVVDPGGDLD----IIKDVIQSRNF-HVKQIWITHGHVDHVGGAAQLKEDLS 76 (216) Q Consensus 5 ~~~~i~v~~---~~~N~~li~~~~~~~~ilID~G~~~~----~i~~~l~~~~~-~i~~Ii~TH~H~DH~gg~~~l~~~~~ 76 (216) .+.+.|+|+ .+.-|-++...+ +.++.|||-.+. ..+-+....+. .++..++||+|.||++.++++.+++. T Consensus 13 ~l~~~pLGag~EVGRSC~ile~kG--k~iMld~gvhpaysg~aslpf~d~vd~s~id~llIthFhldh~aslp~~~qkTs 90 (668) T KOG1137 13 QLKFTPLGAGNEVGRSCHILEYKG--KTIMLDCGVHPAYSGMASLPFYDEVDLSAIDPLLITHFHLDHAASLPFTLQKTS 90 (668) T ss_pred CEEEEECCCCCCCCCEEEEEEECC--EEEEECCCCCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHCCCCCCEEEECC T ss_conf 179987788762573379999668--379853666745344444642100460105478876653532266621000002 Q ss_pred --CCEEECCHHHHHHHCCHHHHH---------HHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf --955405843898840123444---------432002346666741896478399762012331012224721000001 Q gi|254780758|r 77 --LTIIGPHKDDAAMMGKVDEQA---------RLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVN 145 (216) Q Consensus 77 --~~i~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~ 145 (216) ..+++.|.+.+.+........ ..+...........-..+...++.++.+.++..+++ ||--|.++|.+ T Consensus 91 f~grvfmth~TkAi~kwllsdyvrvs~~s~~~~Ly~e~dl~~s~dKie~idfhe~~ev~gIkf~p~~a-GhVlgacMf~v 169 (668) T KOG1137 91 FIGRVFMTHPTKAIYKWLLSDYVRVSNRSGDDRLYTEGDLMESMDKIETIDFHETVEVNGIKFWPYHA-GHVLGACMFMV 169 (668) T ss_pred CCCEEEEECCHHHHHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHHEEEEECCCCCCCCEEEEEECC-CHHHHHEEEEE T ss_conf 25616882441887876642336765135753343510477765532222200233348748996045-41233213666 Q ss_pred CC--CCCCCCCCH Q ss_conf 23--432111100 Q gi|254780758|r 146 FE--NNFAHLGDT 156 (216) Q Consensus 146 ~~--~~~lftGD~ 156 (216) +- -++|||||. T Consensus 170 eiagv~lLyTGd~ 182 (668) T KOG1137 170 EIAGVRLLYTGDY 182 (668) T ss_pred EECEEEEEECCCC T ss_conf 5240799962566 No 32 >COG1235 PhnP Metal-dependent hydrolases of the beta-lactamase superfamily I [General function prediction only] Probab=98.78 E-value=3.7e-07 Score=61.41 Aligned_cols=81 Identities=19% Similarity=0.309 Sum_probs=49.7 Q ss_pred CCCCEEEEEECCCCCCCEEEEEECC--------------CCEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCC Q ss_conf 9773599997087657169999889--------------98499983899999999999974997529998698852101 Q gi|254780758|r 1 MSKLSFHIVQVTPFKQNCTFLFDEE--------------SLEAVVVDPGGDLDIIKDVIQSRNFHVKQIWITHGHVDHVG 66 (216) Q Consensus 1 m~~~~~~~i~v~~~~~N~~li~~~~--------------~~~~ilID~G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH~g 66 (216) |..|++.++-.|. +.|+.++.++. +.+.++||+|.+. ..+.++..-..+++|++||.|+||+. T Consensus 1 ~~~~~f~~lgsG~-~gg~p~~~~~~~~c~~~~~~v~~~~~~~~~lid~g~~~--~~~~~~~~~~~idai~~TH~H~DHi~ 77 (269) T COG1235 1 TGGMRFTVLGSGS-SGGVPVIGCDCRACGGNRLRVDCGVGVKTLLIDAGPDL--RDQGLRLGVSDLDAILLTHEHSDHIQ 77 (269) T ss_pred CCCCEEEEEEEEC-CCCEEEECCCCCCCCCCEEEEEEECCCEEEEEECCHHH--HHHHHHHCCCCCCEEEEECCCHHHHC T ss_conf 9852999999834-89803525886566687088702536047999428777--65423306764567999158567657 Q ss_pred CCHHHHHHHCCCEEECCH Q ss_conf 105678860795540584 Q gi|254780758|r 67 GAAQLKEDLSLTIIGPHK 84 (216) Q Consensus 67 g~~~l~~~~~~~i~~~~~ 84 (216) |+..|+..+..+++.... T Consensus 78 Gl~~l~~~~~~~~~~~~~ 95 (269) T COG1235 78 GLDDLRRAYTLPIYVNPG 95 (269) T ss_pred CHHHHHHHHCCCCEEEEC T ss_conf 888899885588457532 No 33 >PRK04286 hypothetical protein; Provisional Probab=98.75 E-value=5.1e-07 Score=60.56 Aligned_cols=147 Identities=16% Similarity=0.208 Sum_probs=74.5 Q ss_pred CEEEEEECCCCCC--CEEEEEECCCCEEEEECCCCC-----------H---HH---HHHHHHHCCCCEEEEEECCCCCCC Q ss_conf 3599997087657--169999889984999838999-----------9---99---999999749975299986988521 Q gi|254780758|r 4 LSFHIVQVTPFKQ--NCTFLFDEESLEAVVVDPGGD-----------L---DI---IKDVIQSRNFHVKQIWITHGHVDH 64 (216) Q Consensus 4 ~~~~~i~v~~~~~--N~~li~~~~~~~~ilID~G~~-----------~---~~---i~~~l~~~~~~i~~Ii~TH~H~DH 64 (216) |++..+..-.++. =|.++.. ++..+|||||-. . ++ +.+.+.+.-.+-+.|++||.|.|| T Consensus 1 Mki~pla~eSLGvRSma~~Vet--~d~~IlIDPgvaLap~RygLpPhp~E~~~l~~~r~kI~~~a~~advvvISHYHyDH 78 (294) T PRK04286 1 MKIIPLASESLGVRSMATFVET--RDVRILIDPGVSLAPRRYGLPPHPIELERLEEVREKILELAKKADVVTISHYHYDH 78 (294) T ss_pred CEEEEEEECCCCCCEEEEEEEE--CCEEEEECCCHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCC T ss_conf 9569953056774325799983--88579977702118766798998099999999999999998429999991336667 Q ss_pred CCC-C--H-------HHHHHHCCCEEEC-CHHHHHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCC Q ss_conf 011-0--5-------6788607955405-843898840123444432002346666741896478399762012331012 Q gi|254780758|r 65 VGG-A--A-------QLKEDLSLTIIGP-HKDDAAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHC 133 (216) Q Consensus 65 ~gg-~--~-------~l~~~~~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~t 133 (216) .-= . . .-.+-|..++++. ++.+. . + ..+..+...+............+.||.++.+|+..+++-.. T Consensus 79 ~~p~~~~~Y~~~~~~~~~eiY~gK~~~~K~P~~~-I-N-~SQr~Ra~~flk~~~~~~~~i~~aDg~~f~~g~~~i~fS~p 155 (294) T PRK04286 79 HTPFYESVYEATSEEEYKEIYGGKIVLIKDPTEN-I-N-WSQRRRAYGFLKAVKGIAKKIEYADGKEFRFGGTVIEFSPP 155 (294) T ss_pred CCCCCCCHHHCCCCCCHHHHHCCCEEEECCCHHH-C-C-HHHHHHHHHHHHHHHHHCCEEEECCCCEEEECCEEEEECCC T ss_conf 7855343010244423588857858998194264-3-8-88999999999976520434897588668878888996688 Q ss_pred -CCCCC-CC-C----CCCC--CCCCCCCCCCH Q ss_conf -22472-10-0----0001--23432111100 Q gi|254780758|r 134 -PGHSP-GH-V----IYVN--FENNFAHLGDT 156 (216) Q Consensus 134 -PGHt~-g~-~----~~~~--~~~~~lftGD~ 156 (216) | |-+ |+ + ++.+ .+.+++|+-|. T Consensus 156 vp-HG~~gsklG~Vi~~~I~dg~~~~~~~SDv 186 (294) T PRK04286 156 VP-HGAEGSKLGYVVMVRISDGDESFVFASDV 186 (294) T ss_pred CC-CCCCCCCCCEEEEEEEECCCEEEEECCCC T ss_conf 77-89888855679999997497689980556 No 34 >KOG1135 consensus Probab=98.64 E-value=7.7e-08 Score=65.34 Aligned_cols=138 Identities=14% Similarity=0.155 Sum_probs=80.6 Q ss_pred CCCEEEEEECCCCEEEEECCCCCHHH---HHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHH--CCCEEECCHHH--- Q ss_conf 57169999889984999838999999---99999974997529998698852101105678860--79554058438--- Q gi|254780758|r 15 KQNCTFLFDEESLEAVVVDPGGDLDI---IKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKEDL--SLTIIGPHKDD--- 86 (216) Q Consensus 15 ~~N~~li~~~~~~~~ilID~G~~~~~---i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~~--~~~i~~~~~~~--- 86 (216) .+-||++.-++. -+||||||+..- .++.++..--.+++|++||...-|+||+++...++ ++++|+.-+.- T Consensus 14 ~~~cyllqiD~~--~iLiDcGwd~~f~~~~i~~l~~~i~~iDaILLShpd~~hlGaLpY~~~k~gl~~~VYAT~PV~~mG 91 (764) T KOG1135 14 GPLCYLLQIDGV--RILIDCGWDESFDMSMIKELKPVIPTIDAILLSHPDILHLGALPYAVGKLGLNAPVYATLPVIKMG 91 (764) T ss_pred CCCEEEEEECCE--EEEEECCCCCHHCCCHHHHHHCCCCCCCEEEECCCCHHHHCCCHHHHHHCCCCCEEEEECCHHHHH T ss_conf 864589997580--899727996010660100121146644379955898677365053686377252278843433431 Q ss_pred -HHHHCCHHHHHH-----HHHCCCCCCCCCCEEEECCCCEEEEECC----CCCCCCCCCCCCCCCCCCC--CCCCCCCCC Q ss_conf -988401234444-----3200234666674189647839976201----2331012224721000001--234321111 Q gi|254780758|r 87 -AAMMGKVDEQAR-----LLSIRMNARNASSDRWLQDGDTLLLGTH----IFKVFHCPGHSPGHVIYVN--FENNFAHLG 154 (216) Q Consensus 87 -~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~gd~i~lg~~----~~~vi~tPGHt~g~~~~~~--~~~~~lftG 154 (216) .++......... .+....-...+..-..++..|...+-+. ++..+.+ ||++|.+.+-+ ..++++++= T Consensus 92 ~m~myD~~~S~~~~~df~l~sldDvd~aFd~I~~LKYsQ~v~L~gk~~Gl~itaynA-GhmiGGsIWkI~k~~E~ivYav 170 (764) T KOG1135 92 QMFMYDLYRSHGNVGDFDLFSLDDVDAAFDKIIQLKYSQPVALKGKGSGLTITAYNA-GHMIGGSIWKISKVGEDIVYAV 170 (764) T ss_pred HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHEEEEECCCEEEECCCCCCEEEEEECC-CCCCCCEEEEEEECCCEEEEEE T ss_conf 441799876036643234443666677775413321155477426557248964047-7756745799873575389997 Q ss_pred C Q ss_conf 0 Q gi|254780758|r 155 D 155 (216) Q Consensus 155 D 155 (216) | T Consensus 171 d 171 (764) T KOG1135 171 D 171 (764) T ss_pred E T ss_conf 0 No 35 >pfam02112 PDEase_II cAMP phosphodiesterases class-II. Probab=98.60 E-value=3.3e-07 Score=61.70 Aligned_cols=151 Identities=19% Similarity=0.133 Sum_probs=79.6 Q ss_pred EEEEECC----CCCCC--EEEEEECCCCEEEEECCCCCHHHHHHHHHHC---------------------------CCCE Q ss_conf 9999708----76571--6999988998499983899999999999974---------------------------9975 Q gi|254780758|r 6 FHIVQVT----PFKQN--CTFLFDEESLEAVVVDPGGDLDIIKDVIQSR---------------------------NFHV 52 (216) Q Consensus 6 ~~~i~v~----~~~~N--~~li~~~~~~~~ilID~G~~~~~i~~~l~~~---------------------------~~~i 52 (216) +++|.+| |...| .|++.+...+..+.+|+|.-...+.+.+.+. .-.| T Consensus 1 F~vivLG~~GGp~E~n~ts~Lvr~~~~~~~ialDAGt~l~gI~~~~~~~~~~~~~~~~~p~~~~~~~p~~~a~~i~~~~I 80 (323) T pfam02112 1 FATNLLGQVGGIDEGNLSSFLIEDEAQETFIKLDAGSVLQGYPCLTVSKYLSPNVTITTPFSNFESGPYIKHTYLLFNRI 80 (323) T ss_pred CEEEEECCCCCCCCCCEEEEEEEECCCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHC T ss_conf 94999589889788872489965469886899977655888999986410477542357765534571356789998522 Q ss_pred EEEEECCCCCCCCCCCHHHHHH-----HCCCEEECCHHHHHHHCCHHHHHHHHHCCCCCCC----CCCEEEECCCCEEEE Q ss_conf 2999869885210110567886-----0795540584389884012344443200234666----674189647839976 Q gi|254780758|r 53 KQIWITHGHVDHVGGAAQLKED-----LSLTIIGPHKDDAAMMGKVDEQARLLSIRMNARN----ASSDRWLQDGDTLLL 123 (216) Q Consensus 53 ~~Ii~TH~H~DH~gg~~~l~~~-----~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~l~~gd~i~l 123 (216) ...++||.|.||+.|+.--... -+-.+|+....-..+........-|......... ...-..+..++...+ T Consensus 81 ~~ylITH~HLDHIaGLvinsp~~~~~~~~K~i~gl~~ti~aLk~hiFN~~iWPNl~~~g~~~~~~~~~~~~l~~~~~~~~ 160 (323) T pfam02112 81 KNYLITHSHLDHVCGLVINSPGFYKQMSKKTIYGLPYTINALQKHLFNNLVWPNLPSFGIVNLIYKVKMFDLSPGEFNKL 160 (323) T ss_pred CEEEECCCCHHHHHHHHCCCCCCCCCCCCCEEEECHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCC T ss_conf 13993265277664543058543346799518865899999987331674267886678877431699887387751024 Q ss_pred ECCCCCCCCCC-CCCC------CCCCCCCCCC----CCCCCCCH Q ss_conf 20123310122-2472------1000001234----32111100 Q gi|254780758|r 124 GTHIFKVFHCP-GHSP------GHVIYVNFEN----NFAHLGDT 156 (216) Q Consensus 124 g~~~~~vi~tP-GHt~------g~~~~~~~~~----~~lftGD~ 156 (216) ....+.+.+-| -|.. .|.+|++.+. .++|-||+ T Consensus 161 ~~~~~~v~~fp~sH~~~~~~~~~SsAflird~~t~~~il~fGD~ 204 (323) T pfam02112 161 TETTMSVVPFPVNHGGLIIKEALSTAFLFTDSVSGDSILVFGDV 204 (323) T ss_pred CCCCCEEEEECCCCCCCCCCCCCCEEEEEEECCCCCEEEEECCC T ss_conf 67763156744537887677634407999955899789998288 No 36 >COG2220 Predicted Zn-dependent hydrolases of the beta-lactamase fold [General function prediction only] Probab=98.50 E-value=5.1e-07 Score=60.56 Aligned_cols=172 Identities=19% Similarity=0.184 Sum_probs=84.4 Q ss_pred EECCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHH----H-HHCCCCEEEEEECCCCCCCCCCCHHHHHHHC-CCEEEC Q ss_conf 970876571699998899849998389999999999----9-9749975299986988521011056788607-955405 Q gi|254780758|r 9 VQVTPFKQNCTFLFDEESLEAVVVDPGGDLDIIKDV----I-QSRNFHVKQIWITHGHVDHVGGAAQLKEDLS-LTIIGP 82 (216) Q Consensus 9 i~v~~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~----l-~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~~~-~~i~~~ 82 (216) ..+..++..|++|...+ ..+||||.......-.. . ...-.++++|++||.|.||++-........+ ++++.. T Consensus 7 m~itwlGha~~lie~~~--~~iliDP~~~~~~~~~~~~~~~~~~~~~~~D~ilitH~H~DHl~~~~~~~~~~~~~~~~~~ 84 (258) T COG2220 7 MKITWLGHAAFLIETGG--KRILIDPVLSGAPSPSNFPGGLFEDLLPPIDYILITHDHYDHLDDETLIALRTNKAPVVVV 84 (258) T ss_pred CEEEEEEEEEEEEEECC--EEEEECCCCCCCCCCCCCCCCCCHHHCCCCCEEEEECCCCHHCCCHHHHHHHCCCCCEEEC T ss_conf 17999512689999588--6899899867887744225667643247888899736861105707665542488748964 Q ss_pred CHHH-HHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCC---CCCCCC-CCCC--------CCCCCC--CC Q ss_conf 8438-9884012344443200234666674189647839976201233---101222-4721--------000001--23 Q gi|254780758|r 83 HKDD-AAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFK---VFHCPG-HSPG--------HVIYVN--FE 147 (216) Q Consensus 83 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~---vi~tPG-Ht~g--------~~~~~~--~~ 147 (216) .+.. ..+....... .....+..++++++++.++. +.+.+. ++++ .++|.+ ++ T Consensus 85 p~~~~~~~~~~g~~~-------------~~~~~~~~g~~~~~~~~~i~~~~a~h~~~~~~~~~~~~~~~~~~~~vi~~~g 151 (258) T COG2220 85 PLGAGDLLIRDGVEA-------------ERVHELGWGDVIELGDLEITAVPAYHVSARHLPGRGIRPTGLWVGYVIETPG 151 (258) T ss_pred CCHHHHHHHCCCCCC-------------CEEEECCCCCEEECCCCEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEECC T ss_conf 603444442158772-------------0556147886397356469888626324333588774346761699996099 Q ss_pred CCCCCCCCHHH--HC---CCCCCC---CCCCC--HHHH---HHHHHHHHHCCCCCEEEEECCC Q ss_conf 43211110000--01---356634---67989--9999---9999999851999789984887 Q gi|254780758|r 148 NNFAHLGDTLF--RS---SIGRTD---ILHGD--YQQL---INSINNKILPLGDEISFVCGHG 197 (216) Q Consensus 148 ~~~lftGD~lf--~~---~~G~~d---~~~~d--~~~~---~~sl~~~l~~l~~~~~v~PgHG 197 (216) .++.+.||+=+ .. ..|++| +|-|. ...+ .+.+ +....|.+. +++|.|= T Consensus 152 ~~iyh~GDt~~~~~~~~~~~~~~DvallPig~~~~~~~~~~~~~~-~~~~~l~~~-~viP~Hy 212 (258) T COG2220 152 GRVYHAGDTGYLFLIIEELDGPVDVALLPIGGYPNATMMPPEAAV-AAAEVLRPK-RVIPMHY 212 (258) T ss_pred EEEEECCCCHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCHHHHH-HHHHHHCCC-EEEECCC T ss_conf 279962567067665542057777899455456677657899999-999850856-4997543 No 37 >COG2248 Predicted hydrolase (metallo-beta-lactamase superfamily) [General function prediction only] Probab=98.25 E-value=2.2e-05 Score=51.02 Aligned_cols=145 Identities=17% Similarity=0.243 Sum_probs=74.4 Q ss_pred EEEECC--CC--CCCEEEEEECCCCEEEEECCCCC--------------HHHH---HHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 999708--76--57169999889984999838999--------------9999---999997499752999869885210 Q gi|254780758|r 7 HIVQVT--PF--KQNCTFLFDEESLEAVVVDPGGD--------------LDII---KDVIQSRNFHVKQIWITHGHVDHV 65 (216) Q Consensus 7 ~~i~v~--~~--~~N~~li~~~~~~~~ilID~G~~--------------~~~i---~~~l~~~~~~i~~Ii~TH~H~DH~ 65 (216) +++|++ .+ +.-|.++... .-.+|||||-. .+++ .+.+++.-.+.+-|.+||.|.||. T Consensus 2 kV~Pla~eSLGVRSmAt~vet~--dv~ILiDpGVsLaPkRy~LPPh~~E~erl~~~r~~i~~~ak~a~VitISHYHYDHh 79 (304) T COG2248 2 KVIPLASESLGVRSMATFVETK--DVGILIDPGVSLAPKRYGLPPHQRELERLRQAREKIQRYAKKADVITISHYHYDHH 79 (304) T ss_pred CEEECCCCCCCHHHHHHEEECC--CEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCEEEEEEECCCCC T ss_conf 3253131232344243213217--73699778654485346899887999999999999999986488899864012457 Q ss_pred CC---------CHHHHHHHCCCEEEC-CHHHHHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCC Q ss_conf 11---------056788607955405-84389884012344443200234666674189647839976201233101222 Q gi|254780758|r 66 GG---------AAQLKEDLSLTIIGP-HKDDAAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPG 135 (216) Q Consensus 66 gg---------~~~l~~~~~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPG 135 (216) -- ...-++-|..+++.. ++.+. +++.+. .+.+.+-........+..+.||.++++|+..+++=..=- T Consensus 80 tPf~~~~y~~s~e~~~eiY~gK~lLlKhPte~--IN~SQ~-~Ra~~fl~~~~~~~~~ie~ADgk~f~fG~t~IefS~pvp 156 (304) T COG2248 80 TPFFDGIYEASGETAKEIYKGKLLLLKHPTEN--INRSQR-RRAYRFLESLKDIAREIEYADGKTFEFGGTVIEFSPPVP 156 (304) T ss_pred CCCCCCHHHHCCCCHHHHHCCCEEEECCCHHH--HCHHHH-HHHHHHHHHHHHHCCEEEECCCCEEEECCEEEEECCCCC T ss_conf 86655212310324677735837885195565--577878-789999987643111468337865875778999568888 Q ss_pred CCCC-C-----CCCCC--CCCCCCCCCCH Q ss_conf 4721-0-----00001--23432111100 Q gi|254780758|r 136 HSPG-H-----VIYVN--FENNFAHLGDT 156 (216) Q Consensus 136 Ht~g-~-----~~~~~--~~~~~lftGD~ 156 (216) |-++ + +++.+ .+..++|+.|+ T Consensus 157 HG~eGskLGyVl~v~V~dg~~~i~faSDv 185 (304) T COG2248 157 HGREGSKLGYVLMVAVTDGKSSIVFASDV 185 (304) T ss_pred CCCCCCCCCEEEEEEEECCCEEEEECCCC T ss_conf 88766622149999994397689971366 No 38 >KOG1361 consensus Probab=97.53 E-value=7.7e-05 Score=47.92 Aligned_cols=110 Identities=21% Similarity=0.226 Sum_probs=65.7 Q ss_pred EEEECCCCCHHHHHHHHHHC-CCCEEEEEECCCCCCCCCCCHHHHHHHCCC-EEECCHHHHHHHCCHHHHHHHHHCCCCC Q ss_conf 99983899999999999974-997529998698852101105678860795-5405843898840123444432002346 Q gi|254780758|r 29 AVVVDPGGDLDIIKDVIQSR-NFHVKQIWITHGHVDHVGGAAQLKEDLSLT-IIGPHKDDAAMMGKVDEQARLLSIRMNA 106 (216) Q Consensus 29 ~ilID~G~~~~~i~~~l~~~-~~~i~~Ii~TH~H~DH~gg~~~l~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 106 (216) .+.++||... ..++.+.. .....+-++||+|.||..|+ ...+..+ +|++......+..... T Consensus 91 ~~~~~p~~~f--~VD~f~~~~~~~~s~yFLsHFHSDHy~GL---~~sW~~p~lYCS~ita~Lv~~~~~------------ 153 (481) T KOG1361 91 VIKVLPGGEF--SVDAFRYGHIEGCSAYFLSHFHSDHYIGL---TKSWSHPPLYCSPITARLVPLKVS------------ 153 (481) T ss_pred CEEECCCCCE--EEEHHHCCCCCCCCEEEEECCCCCCCCCC---CCCCCCCCCCCCCCCHHHHHHHCC------------ T ss_conf 2460699727--87432237766511012201455431435---444558861014122655544011------------ Q ss_pred CCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCH Q ss_conf 66674189647839976201233101222472100000123---432111100 Q gi|254780758|r 107 RNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVNFE---NNFAHLGDT 156 (216) Q Consensus 107 ~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~~~---~~~lftGD~ 156 (216) .....-+.+.-++.+.+.+..+.++.+ -|.||+++|+++. ..+|.|||. T Consensus 154 v~~~~i~~l~l~~~~~i~~~~vt~ldA-nHCPGa~mf~F~~~~~~~~lhtGDF 205 (481) T KOG1361 154 VTKQSIQALDLNQPLEIPGIQVTLLDA-NHCPGAVMFLFELSFGPCILHTGDF 205 (481) T ss_pred CCHHHCEEECCCCCEEECCEEEEEECC-CCCCCCEEEEEECCCCCEEEECCCC T ss_conf 474114131478836624568998516-6699855897534788627864773 No 39 >PRK11709 putative L-ascorbate 6-phosphate lactonase; Provisional Probab=97.41 E-value=0.00082 Score=41.94 Aligned_cols=134 Identities=19% Similarity=0.290 Sum_probs=71.0 Q ss_pred CEEEEEECCCCCCCCCCC--HHHHHHH--CCCEEECCHHHHHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECC Q ss_conf 752999869885210110--5678860--795540584389884012344443200234666674189647839976201 Q gi|254780758|r 51 HVKQIWITHGHVDHVGGA--AQLKEDL--SLTIIGPHKDDAAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTH 126 (216) Q Consensus 51 ~i~~Ii~TH~H~DH~gg~--~~l~~~~--~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~ 126 (216) ++++++.||.|.||+--+ +.+.+.. .++.+++......+... + .+...-..+++||++.+++. T Consensus 109 ~lDavl~TH~H~DHiD~n~AAAv~~n~~~~v~FiGP~~~vd~W~~W--------G-----VP~eR~ivvkPGD~~kvkDi 175 (355) T PRK11709 109 EIDAVLATHDHSDHIDVNVAAAVMQNCADHVKFIGPQTCVDLWIGW--------G-----VPKERCIVVKPGDVVKVKDI 175 (355) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHCCCCCCCEECCHHHHHHHHHC--------C-----CCHHHEEEECCCCEEEEEEE T ss_conf 3207998511434300899999982685578531858999888862--------9-----96788799548987887126 Q ss_pred CCCCCC--------C-C-------CCCCC-----CCCCCC--CCCCCCCCCCHHHHCC---CCC---CCC----CCCCH- Q ss_conf 233101--------2-2-------24721-----000001--2343211110000013---566---346----79899- Q gi|254780758|r 127 IFKVFH--------C-P-------GHSPG-----HVIYVN--FENNFAHLGDTLFRSS---IGR---TDI----LHGDY- 172 (216) Q Consensus 127 ~~~vi~--------t-P-------GHt~g-----~~~~~~--~~~~~lftGD~lf~~~---~G~---~d~----~~~d~- 172 (216) +++++. | | |-++. .+.|++ +.+.+.++||.-|... .|+ .|. .+.++ T Consensus 176 ~i~alesfDRT~lvT~~~~~~~~~g~~~~~Md~~AVnYl~kT~gGnvYH~gDSHySn~fakhG~~~~IDVal~~yGeNP~ 255 (355) T PRK11709 176 EIHALDAFDRTALITLPADGKAAGGVLPDGMDRRAVNYLFKTPGGNLYHSGDSHYSNYYAKHGNEYQIDVALGSYGENPR 255 (355) T ss_pred EEEEECCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHEEEECCCCCEEECCCCHHHHHHHHHCCCCCCEEEEECCCCCCC T ss_conf 99985045624798517875446899972166763414897699767755761355678872761031178732566898 Q ss_pred ---HHHHHH-HHHHHHCCCCCEEEEECCCC Q ss_conf ---999999-99998519997899848879 Q gi|254780758|r 173 ---QQLINS-INNKILPLGDEISFVCGHGP 198 (216) Q Consensus 173 ---~~~~~s-l~~~l~~l~~~~~v~PgHG~ 198 (216) ..|..+ +..+-.+|..++ |+|=|=. T Consensus 256 GitDKmts~DiLRmaE~L~aKV-vIP~H~D 284 (355) T PRK11709 256 GITDKMTSIDILRMAESLNAKV-VIPVHHD 284 (355) T ss_pred CCCCCCCHHHHHHHHHHCCCCE-EEECHHH T ss_conf 7616576999999998639858-9513065 No 40 >KOG2121 consensus Probab=97.17 E-value=8.2e-05 Score=47.76 Aligned_cols=56 Identities=20% Similarity=0.270 Sum_probs=33.6 Q ss_pred EEEEEECCCCEEEEECCCCC-HHHHHHHHH-----HCCCCEEEEEECCCCCCCCCCCHHHHHH Q ss_conf 69999889984999838999-999999999-----7499752999869885210110567886 Q gi|254780758|r 18 CTFLFDEESLEAVVVDPGGD-LDIIKDVIQ-----SRNFHVKQIWITHGHVDHVGGAAQLKED 74 (216) Q Consensus 18 ~~li~~~~~~~~ilID~G~~-~~~i~~~l~-----~~~~~i~~Ii~TH~H~DH~gg~~~l~~~ 74 (216) +|++... ....+|.|||-+ ...+...-. ..-.++++|+++|.|.||..|+.-+.++ T Consensus 463 S~lv~i~-~~~~IlLDCGEgTlgql~R~YG~~~~~~~lr~LraI~ISHlHADHh~Gl~~vL~~ 524 (746) T KOG2121 463 SILVRID-SDDSILLDCGEGTLGQLVRHYGVENVDTALRKLRAIFISHLHADHHLGLISVLQA 524 (746) T ss_pred EEEEECC-CCCCEEEECCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 9997026-7753786448844878889861301678987589999986200001147899999 No 41 >COG5212 PDE1 Low-affinity cAMP phosphodiesterase [Signal transduction mechanisms] Probab=96.99 E-value=0.00076 Score=42.12 Aligned_cols=106 Identities=20% Similarity=0.251 Sum_probs=53.1 Q ss_pred CEEEEEECCCCCCCCCCC----HHHHHHHCCCEEECCHHHHHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECC Q ss_conf 752999869885210110----5678860795540584389884012344443200234666674189647839976201 Q gi|254780758|r 51 HVKQIWITHGHVDHVGGA----AQLKEDLSLTIIGPHKDDAAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTH 126 (216) Q Consensus 51 ~i~~Ii~TH~H~DH~gg~----~~l~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~ 126 (216) .|+.-++||+|.||+.|+ +.+.++.+-.+|+....-..+........-+..........-..+.+.+.+...++-. T Consensus 112 ~I~~y~ITH~HLDHIsGlVinSp~~~~qkkkTI~gl~~tIDvL~khvFN~lvWP~lt~~gs~~~~~qvv~P~~~~slt~t 191 (356) T COG5212 112 SINSYFITHAHLDHISGLVINSPDDSKQKKKTIYGLADTIDVLRKHVFNWLVWPNLTDSGSGTYRMQVVRPAQSLSLTLT 191 (356) T ss_pred HHHHEEECCCCCCCHHCEEECCCCCCCCCCCEEEECHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEECHHHEEEEEEE T ss_conf 12223752302000001363486434557733772402799999875100015774414585488898473471055334 Q ss_pred CCCCCCCC---CCCCC----CCCCCCCCCC----CCCCCCH Q ss_conf 23310122---24721----0000012343----2111100 Q gi|254780758|r 127 IFKVFHCP---GHSPG----HVIYVNFENN----FAHLGDT 156 (216) Q Consensus 127 ~~~vi~tP---GHt~g----~~~~~~~~~~----~lftGD~ 156 (216) .+.+++-| |-.-| |.+|.+.+.+ ++..||+ T Consensus 192 ~l~~~pfpv~Hg~ktG~p~ySs~~lfr~nkS~~~f~~fGDv 232 (356) T COG5212 192 RLTGEPFPVSHGKKTGSPSYSSMLLFRSNKSNEFFAYFGDV 232 (356) T ss_pred EECCEEEECCCCCCCCCCCCCEEEEEECCCCCCEEEEECCC T ss_conf 40222564137866688442148998537876248994687 No 42 >TIGR00361 ComEC_Rec2 DNA internalization-related competence protein ComEC/Rec2; InterPro: IPR004797 This is the DNA internalisation-related competence protein ComEC/Rec2 family. Apparent orthologs are found in 5 species so far (Haemophilus influenzae, Escherichia coli, Bacillus subtilis, Neisseria gonorrhoeae, Streptococcus pneumoniae), of which all but E. coli are model systems for the study of competence for natural transformation. This protein is a predicted multiple membrane-spanning protein likely to be involved in DNA internalisation.; GO: 0030420 establishment of competence for transformation, 0016021 integral to membrane. Probab=96.87 E-value=0.0078 Score=36.26 Aligned_cols=172 Identities=14% Similarity=0.223 Sum_probs=90.3 Q ss_pred EEEEEECCCCCCCEEEEEECCCCEE-EEECCCCC------------------HHHHHHHHHHCCCC-EEEEEECCCCCCC Q ss_conf 5999970876571699998899849-99838999------------------99999999974997-5299986988521 Q gi|254780758|r 5 SFHIVQVTPFKQNCTFLFDEESLEA-VVVDPGGD------------------LDIIKDVIQSRNFH-VKQIWITHGHVDH 64 (216) Q Consensus 5 ~~~~i~v~~~~~N~~li~~~~~~~~-ilID~G~~------------------~~~i~~~l~~~~~~-i~~Ii~TH~H~DH 64 (216) .+..+.+| +....++. ..+.+. +++|+|+. ...++.++...++. ++.++++|.|.|| T Consensus 482 ~~~~~d~g--~g~~~~~~-~~~~~~~~l~d~gg~~~~~~~~w~~~~~p~~~~~~~~~p~l~~~~~~~~~~~~l~~~~~d~ 558 (731) T TIGR00361 482 EVDLLDVG--QGLALLIG-KNGQKGRILYDTGGTLSYSSLPWREGQHPFSLGELLLLPWLTWEGLKQLEGLILSHDDLDH 558 (731) T ss_pred CEEEEECC--CCCEEEEE-CCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCCCCC T ss_conf 13454226--76213344-4786531686347621125434666667521002455434543021012344200232111 Q ss_pred CCCCHHHHHHHCCCEEECC------HHHHHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCC--C-- Q ss_conf 0110567886079554058------43898840123444432002346666741896478399762012331012--2-- Q gi|254780758|r 65 VGGAAQLKEDLSLTIIGPH------KDDAAMMGKVDEQARLLSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHC--P-- 134 (216) Q Consensus 65 ~gg~~~l~~~~~~~i~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~t--P-- 134 (216) .||+..+.+.++..-+... +.+...+.... .. .........|+...+-+..+.++.- | T Consensus 559 ~g~~~~~l~~~~~~~~~~p~~~~~~p~d~~~~~~~~---------~~---~~~~~~~~~g~~~~~~~l~~~~~~p~~~~~ 626 (731) T TIGR00361 559 AGGADTLLKHWPVLWLVTPKGFNSEPKDEKVLQAAR---------LG---GEYHEPCKRGDDWQWQGLRFHVLSPLAPDP 626 (731) T ss_pred CCHHHHHHHHHHHHEEECCCCCCCCCCHHHHHHHHH---------HC---CCHHHHHHCCCCEEECCCEEEEECCCCCCC T ss_conf 240567776420010321544456640356776654---------04---401112213541011141135415545666 Q ss_pred CC-CCCCCCCCCCCC--CCCCCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHH Q ss_conf 24-721000001234--321111000001356634679899999999999985199978998488798888999 Q gi|254780758|r 135 GH-SPGHVIYVNFEN--NFAHLGDTLFRSSIGRTDILHGDYQQLINSINNKILPLGDEISFVCGHGPNSTIGRE 205 (216) Q Consensus 135 GH-t~g~~~~~~~~~--~~lftGD~lf~~~~G~~d~~~~d~~~~~~sl~~~l~~l~~~~~v~PgHG~~tt~~~e 205 (216) .- ...++.++.+++ .++.+||.--+ ....... +....+..+..-+|.||..++-+.+ T Consensus 627 ~~~~~~~~~~~~~~g~~~~~l~gd~~~~-----------~~~~~~~---~~~~~~~~~~l~~~~~g~~~~~~~~ 686 (731) T TIGR00361 627 DSGNNHSCVLWVDDGGNSWLLTGDLEAP-----------GEQKELA---RTWPKIDADVLQVGHHGSKTSTGEP 686 (731) T ss_pred CCCCCCEEEEEEECCCCEEEEECCCCCC-----------HHHHHHH---HHHCCHHHHHEECCCCCCCCCCHHH T ss_conf 6677631689971687137872253451-----------2578898---7630000121013666766541257 No 43 >TIGR02649 true_RNase_BN ribonuclease BN; InterPro: IPR013469 Proteins in this entry include ribonuclease BN of Escherichia coli (strain K-12) and closely related proteins believed to be equivalent in function. E. coli appears to lack RNase Z, and this ribonuclease appears to be orthologous to (but not functionally equivalent to) RNase Z of Bacillus subtilis and various other species. ; GO: 0016891 endoribonuclease activity producing 5'-phosphomonoesters, 0042779 removal of tRNA 3'-trailer sequence. Probab=96.61 E-value=0.0052 Score=37.31 Aligned_cols=108 Identities=18% Similarity=0.196 Sum_probs=60.3 Q ss_pred EEEECCCCCHHHHHHHHHHC--CCCEEEEEECCCCCCCCCCCHHHHHH-------HCCCEEECCHHHHHHHCCHHHHHHH Q ss_conf 99983899999999999974--99752999869885210110567886-------0795540584389884012344443 Q gi|254780758|r 29 AVVVDPGGDLDIIKDVIQSR--NFHVKQIWITHGHVDHVGGAAQLKED-------LSLTIIGPHKDDAAMMGKVDEQARL 99 (216) Q Consensus 29 ~ilID~G~~~~~i~~~l~~~--~~~i~~Ii~TH~H~DH~gg~~~l~~~-------~~~~i~~~~~~~~~~~~~~~~~~~~ 99 (216) .-|+|||-..+ .+.+... --+++.|+++|.|.||..|++-|.-. .|..+|.+..-..+..........+ T Consensus 33 ~WlfdCGeGtq--hq~l~taf~PGkl~~ifishlhGdhlfGlPGllCsrsm~G~~~Plt~yGPqG~ref~~t~lr~sGsW 110 (304) T TIGR02649 33 LWLFDCGEGTQ--HQLLNTAFNPGKLEKIFISHLHGDHLFGLPGLLCSRSMAGIIQPLTVYGPQGVREFVETALRLSGSW 110 (304) T ss_pred EEEEECCCCHH--HHHHHHHCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHCCCCCCEEEECCCHHHHHHHHHHHHCCCC T ss_conf 06887687526--7888741488750035754223650015531344333137776406756621688987664211110 Q ss_pred HHCCCCCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 2002346666741896478399762012331012224721000001 Q gi|254780758|r 100 LSIRMNARNASSDRWLQDGDTLLLGTHIFKVFHCPGHSPGHVIYVN 145 (216) Q Consensus 100 ~~~~~~~~~~~~~~~l~~gd~i~lg~~~~~vi~tPGHt~g~~~~~~ 145 (216) ..+..... .+.-|+.++-|=.++..+.. -|--.+..|.+ T Consensus 111 td~Pl~i~------ei~aG~ilddGlrkvta~~l-ehPleCyGyr~ 149 (304) T TIGR02649 111 TDYPLEIV------EISAGEILDDGLRKVTAYRL-EHPLECYGYRV 149 (304) T ss_pred CCCCEEEE------EECCCCHHHHCCHHEEEECC-CCCCEECCEEE T ss_conf 13651578------62254112200010012014-68610011067 No 44 >TIGR02650 RNase_Z_T_toga ribonuclease Z, Thermotoga type; InterPro: IPR013470 All tRNA molecules are synthesised as long precursors with extra sequence at both the 5' and 3' ends, which undergo several processing steps to form the mature tRNA molecule. In many bacteria the tRNA precursors do not contain the CCA sequence which forms the 3' end of the mature tRNA and is essential for aminoacylation. In these organisms ribonucleotide Z cleaves the precursor to form a 3' end to which CCA is added. Proteins in this entry, by contrast are ribonuclease Z enzymes with unusual activity. Organisms containing these proteins include the CCA sequence in the tRNA precursor and ribonuclease Z cleaves the molecule immediately after this sequence producing a mature 3' end. The sequence of these proteins differs considerably from the majority of ribonuclease Z sequences.. Probab=96.31 E-value=0.0014 Score=40.63 Aligned_cols=55 Identities=27% Similarity=0.593 Sum_probs=35.6 Q ss_pred EEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHH---------HHHHCCCEEECCHH Q ss_conf 49998389999999999997499752999869885210110567---------88607955405843 Q gi|254780758|r 28 EAVVVDPGGDLDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQL---------KEDLSLTIIGPHKD 85 (216) Q Consensus 28 ~~ilID~G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l---------~~~~~~~i~~~~~~ 85 (216) +.+|+|+|-... ..|...-.--++|++||+|.||+.|+=-+ -+..|..++++... T Consensus 19 erilfdaGeGvs---ttlGskvyafkyvflthGhvdhiaGlWGvvnirnnGmGdrekPldvfyP~Gn 82 (277) T TIGR02650 19 ERILFDAGEGVS---TTLGSKVYAFKYVFLTHGHVDHIAGLWGVVNIRNNGMGDREKPLDVFYPKGN 82 (277) T ss_pred CEEEEECCCCCC---HHHCCEEEEEEEEEEECCCHHHHCCEEEEEEEECCCCCCCCCCCEEECCCCC T ss_conf 312543578500---1111101223455432362122100023677521788864575222358865 No 45 >TIGR02108 PQQ_syn_pqqB coenzyme PQQ biosynthesis protein B; InterPro: IPR011842 This entry describes coenzyme PQQ biosynthesis protein B, a gene required for the biosynthesis of pyrrolo-quinoline-quinone (coenzyme PQQ). PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases. Note that this gene appears to be required for PQQ in biosynthesis in Methylobacterium extorquens (under the name pqqG) and in Klebsiella pneumoniae but that the equivalent pqqV in Acinetobacter calcoaceticus is not necessary for heterologous expression of PQQ biosynthesis in Escherichia coli. Based on this latter finding, it is suggested that PqqB might be a transporter or a PQQ-dependent enzyme rather than a PQQ biosynthesis enzyme .; GO: 0006810 transport, 0018189 pyrroloquinoline quinone biosynthetic process. Probab=94.27 E-value=0.19 Score=28.20 Aligned_cols=74 Identities=20% Similarity=0.370 Sum_probs=55.5 Q ss_pred CCCEEEEEECCCCEEEEECCCCCHHHHHHHHHH-------CC-C---CEEEEEECCCCCCCCCCCHHHHHH-HCCCEEEC Q ss_conf 571699998899849998389999999999997-------49-9---752999869885210110567886-07955405 Q gi|254780758|r 15 KQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQS-------RN-F---HVKQIWITHGHVDHVGGAAQLKED-LSLTIIGP 82 (216) Q Consensus 15 ~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~-------~~-~---~i~~Ii~TH~H~DH~gg~~~l~~~-~~~~i~~~ 82 (216) +|=+++-++++|.+=+|+-+-+| |++.+.+ .+ + +|..||+|-+-.||+.|+--|++. -|..+|++ T Consensus 36 RTQsSiAvS~DG~rW~L~NASPD---iRQQi~a~paL~P~~~aLR~TpI~~V~Lt~g~iDH~tGLL~LREgq~pf~~yat 112 (314) T TIGR02108 36 RTQSSIAVSADGERWVLLNASPD---IRQQIQAFPALHPQRGALRDTPIEGVVLTDGEIDHVTGLLSLREGQQPFTLYAT 112 (314) T ss_pred CCCEEEEECCCCCCEEEECCCHH---HHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHCCCCCCCEEEEC T ss_conf 54116888678873586707964---888886203215777888657604788206622456412333037885148627 Q ss_pred CHHHHHHHC Q ss_conf 843898840 Q gi|254780758|r 83 HKDDAAMMG 91 (216) Q Consensus 83 ~~~~~~~~~ 91 (216) +.--..+.. T Consensus 113 ~~Vlq~L~~ 121 (314) T TIGR02108 113 EMVLQDLSD 121 (314) T ss_pred CHHHHHHHC T ss_conf 157888754 No 46 >TIGR01393 lepA GTP-binding protein LepA; InterPro: IPR006297 LepA (GUF1 in Saccaromyces) is a GTP-binding membrane protein related to EF-G and EF-Tu. Two types of phylogenetic tree, rooted by other GTP-binding proteins, suggest that eukaryotic homologs (including GUF1 of yeast) originated within the bacterial LepA family. The function of the proteins in this family are unknown. ; GO: 0005525 GTP binding. Probab=93.55 E-value=0.3 Score=27.08 Aligned_cols=22 Identities=27% Similarity=0.471 Sum_probs=14.6 Q ss_pred CCCHHHHHHHHHHHHHCCCCCEEE Q ss_conf 989999999999998519997899 Q gi|254780758|r 169 HGDYQQLINSINNKILPLGDEISF 192 (216) Q Consensus 169 ~~d~~~~~~sl~~~l~~l~~~~~v 192 (216) ..++.+|+++| +|| +|+|.-+. T Consensus 306 ~~~Ye~LrdAL-eKL-~LNDAsL~ 327 (598) T TIGR01393 306 TEDYEDLRDAL-EKL-KLNDASLT 327 (598) T ss_pred CHHHHHHHHHH-HHH-HHHHHHHH T ss_conf 03468999997-555-44025421 No 47 >COG0462 PrsA Phosphoribosylpyrophosphate synthetase [Nucleotide transport and metabolism / Amino acid transport and metabolism] Probab=84.74 E-value=1.7 Score=22.72 Aligned_cols=31 Identities=3% Similarity=0.300 Sum_probs=13.8 Q ss_pred CCEEEEECCCCC------HHH--HHHHHHHCCC-CEEEEE Q ss_conf 984999838999------999--9999997499-752999 Q gi|254780758|r 26 SLEAVVVDPGGD------LDI--IKDVIQSRNF-HVKQIW 56 (216) Q Consensus 26 ~~~~ilID~G~~------~~~--i~~~l~~~~~-~i~~Ii 56 (216) +.+++++.++.. .+. +.+++++.+- +|+.|+ T Consensus 51 g~dVfI~qs~~~pvnd~lmELLi~idA~k~asA~~It~Vi 90 (314) T COG0462 51 GKDVFIIQSTSPPVNDNLMELLIMIDALKRASAKRITAVI 90 (314) T ss_pred CCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 8769999089998688999999999998865886589993 No 48 >PTZ00145 phosphoribosylpyrophosphate synthetase; Provisional Probab=83.73 E-value=3.2 Score=21.07 Aligned_cols=31 Identities=16% Similarity=0.089 Sum_probs=15.4 Q ss_pred HHHHCCCC-CEEEEECCCCCCCHHHHHHHCHH Q ss_conf 99851999-78998488798888999882814 Q gi|254780758|r 181 NKILPLGD-EISFVCGHGPNSTIGRERRLNPF 211 (216) Q Consensus 181 ~~l~~l~~-~~~v~PgHG~~tt~~~e~~~n~~ 211 (216) +.|..... ++..+.-||=.+--+-||..|.- T Consensus 360 ~~Lke~GAk~VyA~aTHGVfSg~AierI~~S~ 391 (443) T PTZ00145 360 KQLKKHGARRVFAFATHGLFSGPAIDRIEKSP 391 (443) T ss_pred HHHHHCCCCEEEEEEECCCCCHHHHHHHHCCC T ss_conf 99996799879999977307854898975489 No 49 >COG0481 LepA Membrane GTPase LepA [Cell envelope biogenesis, outer membrane] Probab=83.68 E-value=3.4 Score=20.94 Aligned_cols=52 Identities=29% Similarity=0.358 Sum_probs=23.6 Q ss_pred CCCCCCCCCCC---CCCCCCCCHHHHCCC-------CC----C-----CCC--CCCHHHHHHHHHHHHHCCCCCE Q ss_conf 72100000123---432111100000135-------66----3-----467--9899999999999985199978 Q gi|254780758|r 137 SPGHVIYVNFE---NNFAHLGDTLFRSSI-------GR----T-----DIL--HGDYQQLINSINNKILPLGDEI 190 (216) Q Consensus 137 t~g~~~~~~~~---~~~lftGD~lf~~~~-------G~----~-----d~~--~~d~~~~~~sl~~~l~~l~~~~ 190 (216) ..|.+.|+... -+-.-.||++-..+. |. + -+| ..++..+++++ +|| .|+|.. T Consensus 256 ~aGeVG~~~a~iK~v~d~~VGDTiT~~~~p~~e~LpGfk~~~P~Vf~GlyPid~~dye~LrdAl-eKL-~LNDas 328 (603) T COG0481 256 KAGEVGYIIAGIKDVRDARVGDTITLASNPATEPLPGFKEVKPMVFAGLYPVDSDDYEDLRDAL-EKL-QLNDAS 328 (603) T ss_pred CCCCEEEEEEEEEECCCCCCCCEEECCCCCCCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH-HHC-CCCCCE T ss_conf 5773448998511115686555675067877666888776785599841116666789999999-744-335301 No 50 >TIGR02884 spore_pdaA delta-lactam-biosynthetic de-N-acetylase; InterPro: IPR014235 Muramic delta-lactam is an unusual constituent of peptidoglycan, found only in bacterial spores, in the peptidoglycan wall or spore cortex. This entry represents a subset of the larger polysaccharide deacetylase family that is specifically involved in delta-lactam biosynthesis. PdaA from Bacillus subtilis acts immediately after CwlD, an N-acetylmuramoyl-L-alanine amidase and performs a de-N-acetylation. PdaA may also perform the subsequent transpeptidation that leads to lactam ring formation, as heterologous expression in Escherichia coli of CwlD and PdaA together is sufficient for delta-lactam production.. Probab=82.84 E-value=2 Score=22.26 Aligned_cols=56 Identities=23% Similarity=0.306 Sum_probs=37.3 Q ss_pred CCCEEEEEECCCCEEEE-ECCCCC---HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHH Q ss_conf 57169999889984999-838999---9999999997499752999869885210110567886 Q gi|254780758|r 15 KQNCTFLFDEESLEAVV-VDPGGD---LDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKED 74 (216) Q Consensus 15 ~~N~~li~~~~~~~~il-ID~G~~---~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~ 74 (216) .-|||+|.|...+...| +|=|++ -++|++-||+..++=.. |+| .|. +--.+.|..+ T Consensus 25 Ky~~~y~GDt~~K~iYLTFDnGYEnGYT~kILDVLKe~~V~AaF-FVT-ghY--~K~~pdLvKR 84 (225) T TIGR02884 25 KYNAYYLGDTSKKVIYLTFDNGYENGYTPKILDVLKENKVPAAF-FVT-GHY--IKTQPDLVKR 84 (225) T ss_pred HCCEEEEECCCCEEEEEECCCCCCCCCCHHHHHHHHCCCCCEEE-EEE-CCC--CCCCHHHHHH T ss_conf 26748863387204885143553357703342332037883114-640-773--1476676665 No 51 >PRK04923 ribose-phosphate pyrophosphokinase; Provisional Probab=77.65 E-value=5.5 Score=19.73 Aligned_cols=32 Identities=16% Similarity=-0.024 Sum_probs=17.4 Q ss_pred HHHHCCCCC-EEEEECCCCCCCHHHHHHHCHHH Q ss_conf 998519997-89984887988889998828145 Q gi|254780758|r 181 NKILPLGDE-ISFVCGHGPNSTIGRERRLNPFL 212 (216) Q Consensus 181 ~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~~~ 212 (216) +.|...+.. +.++--||=.+.-+.||-.|..+ T Consensus 238 ~~Lk~~GA~~V~~~~THglfs~~A~~rl~~s~i 270 (319) T PRK04923 238 AALKQRGALKVVAYITHPVLSGPAVDNINNSQL 270 (319) T ss_pred HHHHHCCCCEEEEEEECCCCCHHHHHHHHHCCC T ss_conf 999875998689999797688679999970899 No 52 >PRK02458 ribose-phosphate pyrophosphokinase; Provisional Probab=77.41 E-value=4.4 Score=20.27 Aligned_cols=38 Identities=16% Similarity=0.067 Sum_probs=21.5 Q ss_pred HHHHHHHHHHHCCCCC-EEEEECCCCCCCHHHHHHHCHHH Q ss_conf 9999999998519997-89984887988889998828145 Q gi|254780758|r 174 QLINSINNKILPLGDE-ISFVCGHGPNSTIGRERRLNPFL 212 (216) Q Consensus 174 ~~~~sl~~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~~~ 212 (216) .+.++. +.|...... +.++--||=.+--+.||..|..+ T Consensus 233 Tl~~aa-~~Lk~~GA~~V~~~~THglfs~~A~~rl~~s~i 271 (323) T PRK02458 233 TFAEAA-KILERSGATEIYAVASHGLFAGGAAEVLETAPI 271 (323) T ss_pred HHHHHH-HHHHHCCCCEEEEEEECCCCCHHHHHHHHCCCC T ss_conf 899999-999964997689999764257079999864998 No 53 >PRK02269 ribose-phosphate pyrophosphokinase; Provisional Probab=77.35 E-value=4 Score=20.51 Aligned_cols=30 Identities=13% Similarity=0.144 Sum_probs=14.7 Q ss_pred HHHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 998519997-899848879888899988281 Q gi|254780758|r 181 NKILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 181 ~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) +.|...... +.++--||=.+.-+.||-.+. T Consensus 238 ~~Lk~~GA~~V~~~~THgvfs~~A~~rl~~s 268 (321) T PRK02269 238 DALAEAGATAVYASCTHPVLSGPALDNIQKS 268 (321) T ss_pred HHHHHCCCCEEEEEEECCCCCHHHHHHHHCC T ss_conf 9998489982799997802784799998608 No 54 >PRK06827 phosphoribosylpyrophosphate synthetase; Provisional Probab=77.34 E-value=5.4 Score=19.77 Aligned_cols=31 Identities=13% Similarity=0.192 Sum_probs=13.8 Q ss_pred HHHHHHHHHHCCCCC-EEEEECCCCCCCHHHHHH Q ss_conf 999999998519997-899848879888899988 Q gi|254780758|r 175 LINSINNKILPLGDE-ISFVCGHGPNSTIGRERR 207 (216) Q Consensus 175 ~~~sl~~~l~~l~~~-~~v~PgHG~~tt~~~e~~ 207 (216) +.+.. +.|...... +.++--||=.|- +-||- T Consensus 281 l~~aA-~~Lk~~GA~~V~a~aTHgvfS~-a~e~~ 312 (381) T PRK06827 281 VLDAA-KELKSRGAKKIICAVSFPFFTE-GLEKF 312 (381) T ss_pred HHHHH-HHHHHCCCCEEEEEEECHHCCC-HHHHH T ss_conf 99999-9999879988999997366047-69999 No 55 >PRK07199 phosphoribosylpyrophosphate synthetase; Provisional Probab=76.52 E-value=5.6 Score=19.67 Aligned_cols=37 Identities=19% Similarity=0.095 Sum_probs=20.3 Q ss_pred HHHHHHHHHHHCCCCC-EEEEECCCCCCCHHHHHHHCHH Q ss_conf 9999999998519997-8998488798888999882814 Q gi|254780758|r 174 QLINSINNKILPLGDE-ISFVCGHGPNSTIGRERRLNPF 211 (216) Q Consensus 174 ~~~~sl~~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~~ 211 (216) .+.++. +.+...+.. +.++--||=.+.-+.||-.|.. T Consensus 226 Tl~~aa-~~Lk~~GA~~V~~~aTHgvfs~~A~~~i~~s~ 263 (301) T PRK07199 226 TLIEAA-RQLLAAGAASPVCVGVHALFAPDAYSALLAAG 263 (301) T ss_pred HHHHHH-HHHHHCCCCEEEEEEECCCCCHHHHHHHHHCC T ss_conf 799999-99998699858999978647856999998489 No 56 >PRK01999 consensus Probab=76.23 E-value=4.7 Score=20.13 Aligned_cols=30 Identities=20% Similarity=0.267 Sum_probs=15.1 Q ss_pred HHHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 998519997-899848879888899988281 Q gi|254780758|r 181 NKILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 181 ~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) +.+...... +.++--||=.+--+.||-.|. T Consensus 233 ~~L~~~GA~~V~~~~THglfs~~A~~rl~~s 263 (311) T PRK01999 233 DLMMEKGAKSVRAIASHAVMSGPASERVENS 263 (311) T ss_pred HHHHHCCCCEEEEEEECCCCCCHHHHHHHHC T ss_conf 9998559973899994554585599999748 No 57 >PRK04554 consensus Probab=76.07 E-value=4 Score=20.52 Aligned_cols=35 Identities=17% Similarity=0.087 Sum_probs=17.1 Q ss_pred HHHHHHHHHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 999999998519997-899848879888899988281 Q gi|254780758|r 175 LINSINNKILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 175 ~~~sl~~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) +.+.. +.|...... +.++--||=.+--+.||-.|. T Consensus 231 l~~aa-~~Lk~~GA~~V~~~aTHglfs~~A~~rl~~s 266 (327) T PRK04554 231 LCKAA-VALKERGAERVLAYASHAVFSGEAVSRIASS 266 (327) T ss_pred HHHHH-HHHHHCCCCEEEEEEECCCCCCHHHHHHHHC T ss_conf 99999-9999769987899987916063399999718 No 58 >PRK02270 consensus Probab=75.99 E-value=3.9 Score=20.56 Aligned_cols=37 Identities=24% Similarity=0.211 Sum_probs=17.7 Q ss_pred HHHHHHHHHHCCCCC-EEEEECCCCCCCHHHHHHHCHHH Q ss_conf 999999998519997-89984887988889998828145 Q gi|254780758|r 175 LINSINNKILPLGDE-ISFVCGHGPNSTIGRERRLNPFL 212 (216) Q Consensus 175 ~~~sl~~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~~~ 212 (216) +.+.. +.|...+.. +.++--||=.+--..++..++.+ T Consensus 230 l~~aa-~~Lk~~GA~~V~~~aTHgvfs~~a~~~~~~s~i 267 (327) T PRK02270 230 IINAA-EVLKEKGAKKIVLAATHGLFSKGFEIFEENPNI 267 (327) T ss_pred HHHHH-HHHHHCCCCEEEEEEECHHCCCHHHHHHHCCCC T ss_conf 99999-999987998799999774438049999658998 No 59 >PRK01506 consensus Probab=75.47 E-value=5.8 Score=19.60 Aligned_cols=29 Identities=24% Similarity=0.245 Sum_probs=13.6 Q ss_pred HHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 98519997-899848879888899988281 Q gi|254780758|r 182 KILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 182 ~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) .+...... +.++--||=.+.-+.||-.+. T Consensus 239 ~Lk~~GA~~V~~~~THglfs~~A~~rl~~s 268 (317) T PRK01506 239 ALVENGASEVYACCTHPVLSGPAIERIQNS 268 (317) T ss_pred HHHHCCCCEEEEEEECCCCCCHHHHHHHCC T ss_conf 998659975899997700681799998618 No 60 >PRK00553 ribose-phosphate pyrophosphokinase; Provisional Probab=75.20 E-value=5.5 Score=19.74 Aligned_cols=30 Identities=7% Similarity=-0.016 Sum_probs=15.0 Q ss_pred HHHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 998519997-899848879888899988281 Q gi|254780758|r 181 NKILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 181 ~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) +.|..-+.. +.++--||=.+--+-||..+. T Consensus 247 ~~Lk~~GA~~V~a~aTHgvfsg~A~eri~~~ 277 (340) T PRK00553 247 KLLKKQKAKKVCVMATHGLFNKNAIQLFDEA 277 (340) T ss_pred HHHHHCCCCEEEEEEECCCCCHHHHHHHHHC T ss_conf 9999879987999997951485799999852 No 61 >TIGR02168 SMC_prok_B chromosome segregation protein SMC; InterPro: IPR011890 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . The smc gene is often associated with scpB (IPR005234 from INTERPRO) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle , , .; GO: 0005515 protein binding, 0005524 ATP binding, 0005694 chromosome. Probab=74.94 E-value=3.7 Score=20.73 Aligned_cols=22 Identities=9% Similarity=0.249 Sum_probs=11.6 Q ss_pred CCCCCC-CCCCCCCCCCCCCCCC Q ss_conf 331012-2247210000012343 Q gi|254780758|r 128 FKVFHC-PGHSPGHVIYVNFENN 149 (216) Q Consensus 128 ~~vi~t-PGHt~g~~~~~~~~~~ 149 (216) |+++.- ||=-+-.+.++-.++| T Consensus 1085 ieI~A~PPGKk~~~l~LLSGGEK 1107 (1191) T TIGR02168 1085 IEIMAQPPGKKNQNLSLLSGGEK 1107 (1191) T ss_pred CEEEEECCCCCCCCCCCCCCHHH T ss_conf 56785276398766430201579 No 62 >PRK02039 consensus Probab=74.74 E-value=4.5 Score=20.21 Aligned_cols=30 Identities=17% Similarity=0.021 Sum_probs=15.3 Q ss_pred HHHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 998519997-899848879888899988281 Q gi|254780758|r 181 NKILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 181 ~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) +.|...... +.++--||=.+--+.||-.+. T Consensus 235 ~~Lk~~GA~~V~~~~THgvfs~~A~~rl~~s 265 (316) T PRK02039 235 QVLKERGAKQVFAYATHPVLSGGAAERIAAS 265 (316) T ss_pred HHHHHCCCCEEEEEEECCCCCCHHHHHHHCC T ss_conf 9999769985799997842573499998608 No 63 >PRK05259 consensus Probab=74.01 E-value=4.2 Score=20.43 Aligned_cols=34 Identities=24% Similarity=0.224 Sum_probs=15.7 Q ss_pred HHHHHHHHHHCCCCC-EEEEECCCCCCCHHHHHHHC Q ss_conf 999999998519997-89984887988889998828 Q gi|254780758|r 175 LINSINNKILPLGDE-ISFVCGHGPNSTIGRERRLN 209 (216) Q Consensus 175 ~~~sl~~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n 209 (216) +.+.. +.+...... +.++--||=.+--+.||-.+ T Consensus 225 l~~aa-~~Lk~~GA~~V~~~~THgvfs~~A~~ri~~ 259 (310) T PRK05259 225 LCNAA-EALLANGANSVTAYITHGVLSGGAVARIAS 259 (310) T ss_pred HHHHH-HHHHHCCCCEEEEEEECCCCCCHHHHHHHC T ss_conf 99999-999877998699999785368569999862 No 64 >PRK10966 exonuclease subunit SbcD; Provisional Probab=73.91 E-value=3.9 Score=20.59 Aligned_cols=10 Identities=20% Similarity=0.511 Sum_probs=4.0 Q ss_pred CCHHHHHHHH Q ss_conf 8999999999 Q gi|254780758|r 170 GDYQQLINSI 179 (216) Q Consensus 170 ~d~~~~~~sl 179 (216) |+...+.+.+ T Consensus 290 G~l~el~~~l 299 (402) T PRK10966 290 GDLASIEEQL 299 (402) T ss_pred CCHHHHHHHH T ss_conf 8999999999 No 65 >PRK03675 consensus Probab=73.64 E-value=6.8 Score=19.20 Aligned_cols=36 Identities=25% Similarity=0.274 Sum_probs=20.5 Q ss_pred HHHHHHHHHHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 9999999998519997-899848879888899988281 Q gi|254780758|r 174 QLINSINNKILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 174 ~~~~sl~~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) .+.+.. +.|...+.. +.++--||=.+.-+.||-.|. T Consensus 216 Tl~~aa-~~Lk~~GA~~V~~~aTHglfs~~A~~~l~~~ 252 (279) T PRK03675 216 TMIRAA-EILRKLGAKKIFVVATHGVFAEGAIERVSKA 252 (279) T ss_pred HHHHHH-HHHHHCCCCEEEEEEECCCCCCHHHHHHHHH T ss_conf 899999-9999769986999997810680699999974 No 66 >PRK01259 ribose-phosphate pyrophosphokinase; Provisional Probab=73.60 E-value=5.9 Score=19.54 Aligned_cols=30 Identities=20% Similarity=0.083 Sum_probs=14.3 Q ss_pred HHHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 998519997-899848879888899988281 Q gi|254780758|r 181 NKILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 181 ~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) +.+...... +.++--||=.+.-+.||-.+. T Consensus 229 ~~Lk~~GA~~V~~~~THglfs~~A~~rl~~s 259 (309) T PRK01259 229 EALKERGAKSVFAYATHPVLSGGAAERIANS 259 (309) T ss_pred HHHHHCCCCEEEEEEECHHCCHHHHHHHHHC T ss_conf 9997569966999987631394599998708 No 67 >PRK02812 ribose-phosphate pyrophosphokinase; Provisional Probab=73.59 E-value=4.5 Score=20.25 Aligned_cols=27 Identities=19% Similarity=0.055 Sum_probs=11.6 Q ss_pred HHHCCCCC-EEEEECCCCCCCHHHHHHH Q ss_conf 98519997-8998488798888999882 Q gi|254780758|r 182 KILPLGDE-ISFVCGHGPNSTIGRERRL 208 (216) Q Consensus 182 ~l~~l~~~-~~v~PgHG~~tt~~~e~~~ 208 (216) .|..-... +.++--||=.+--+.||.. T Consensus 252 ~Lk~~GA~~V~a~aTHgvfsg~A~~rl~ 279 (331) T PRK02812 252 LLRKEGAKRVYACATHAVFSPPAIERLS 279 (331) T ss_pred HHHHCCCCEEEEEEECCCCCHHHHHHHH T ss_conf 9985799878999989655877999997 No 68 >PRK05038 consensus Probab=72.69 E-value=5.3 Score=19.80 Aligned_cols=35 Identities=17% Similarity=0.074 Sum_probs=16.2 Q ss_pred HHHHHHHHHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 999999998519997-899848879888899988281 Q gi|254780758|r 175 LINSINNKILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 175 ~~~sl~~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) +.+.. +.|...+.. +.++--||=.+--+.||-.+. T Consensus 229 l~~aa-~~Lk~~GA~~V~~~~THglfs~~A~~~l~~s 264 (315) T PRK05038 229 LCKAA-EALKERGAKRVFAYATHPVFSGNAAENIRNS 264 (315) T ss_pred HHHHH-HHHHHCCCCCCEEEEECCCCCCHHHHHHHCC T ss_conf 99999-9999659986359986535583699998608 No 69 >PRK03092 ribose-phosphate pyrophosphokinase; Provisional Probab=72.61 E-value=7.5 Score=18.95 Aligned_cols=29 Identities=24% Similarity=0.231 Sum_probs=13.9 Q ss_pred HHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 98519997-899848879888899988281 Q gi|254780758|r 182 KILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 182 ~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) .+...... +.++--||=.+--+.||..|. T Consensus 223 ~Lk~~GA~~V~~~~THglfs~~A~~rl~~s 252 (304) T PRK03092 223 ALKEAGAKDVIIAATHGVLSDPAAERLKNC 252 (304) T ss_pred HHHHCCCCEEEEEEECCCCCHHHHHHHHHC T ss_conf 998669983999997934596799999738 No 70 >TIGR01011 rpsB_bact ribosomal protein S2; InterPro: IPR005706 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family describes the bacterial, mitochondrial and chloroplast forms of ribosomal protein S2.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=71.07 E-value=7.6 Score=18.92 Aligned_cols=17 Identities=24% Similarity=0.048 Sum_probs=6.1 Q ss_pred HCCCCEEEEEECCCCCC Q ss_conf 74997529998698852 Q gi|254780758|r 47 SRNFHVKQIWITHGHVD 63 (216) Q Consensus 47 ~~~~~i~~Ii~TH~H~D 63 (216) ++|+||-+|+-|-+.|| T Consensus 178 kLgIPvvA~vDTNCdPd 194 (227) T TIGR01011 178 KLGIPVVAIVDTNCDPD 194 (227) T ss_pred HCCCCEEEEECCCCCCC T ss_conf 37997898704788888 No 71 >PRK01132 consensus Probab=70.65 E-value=6 Score=19.49 Aligned_cols=33 Identities=15% Similarity=0.094 Sum_probs=16.9 Q ss_pred HHHHHHHHHHHCCCCC-EEEEECCCCCCCHHHHHH Q ss_conf 9999999998519997-899848879888899988 Q gi|254780758|r 174 QLINSINNKILPLGDE-ISFVCGHGPNSTIGRERR 207 (216) Q Consensus 174 ~~~~sl~~~l~~l~~~-~~v~PgHG~~tt~~~e~~ 207 (216) .+.+.. +.|...... +.++--||=.+--+.||. T Consensus 218 Tl~~aa-~~Lk~~GA~~V~a~~THglfs~~A~~ri 251 (286) T PRK01132 218 TIAKSS-RILRDKGASKIYVSAVHGLFVNNSEAKI 251 (286) T ss_pred HHHHHH-HHHHHCCCCEEEEEEECCCCCHHHHHHH T ss_conf 999999-9999879984999996820671699998 No 72 >TIGR01978 sufC FeS assembly ATPase SufC; InterPro: IPR010230 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents SufC, which acts as an ATPase in the SUF system. SufC belongs to the ATP-binding cassette transporter family (IPR003439 from INTERPRO) but is no longer thought to be part of a transporter. The complex is reported as cytosolic or associated with the membrane.; GO: 0005524 ATP binding, 0006810 transport. Probab=70.01 E-value=5.4 Score=19.78 Aligned_cols=14 Identities=7% Similarity=0.382 Sum_probs=6.3 Q ss_pred EEECCCCCCCCCCC Q ss_conf 99869885210110 Q gi|254780758|r 55 IWITHGHVDHVGGA 68 (216) Q Consensus 55 Ii~TH~H~DH~gg~ 68 (216) ||+.=-+|-=|-|. T Consensus 79 lFLaFQ~P~EIPGV 92 (248) T TIGR01978 79 LFLAFQYPEEIPGV 92 (248) T ss_pred CEECCCCCCCCCCC T ss_conf 51015888556885 No 73 >PRK00934 ribose-phosphate pyrophosphokinase; Provisional Probab=67.98 E-value=9.4 Score=18.36 Aligned_cols=36 Identities=14% Similarity=0.146 Sum_probs=20.2 Q ss_pred HHHHHHHHHHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 9999999998519997-899848879888899988281 Q gi|254780758|r 174 QLINSINNKILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 174 ~~~~sl~~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) .+.++. +.|...... +.++--||=.+.-+.||..+. T Consensus 219 Tl~~aa-~~Lk~~GA~~V~~~~THglfs~~A~~rl~~s 255 (286) T PRK00934 219 TMAKAI-SILKEQGAKKIYVSCVHPVLVEDAINKIHSA 255 (286) T ss_pred HHHHHH-HHHHHCCCCEEEEEEECCCCCHHHHHHHHHC T ss_conf 899999-9999869986999997900780599999748 No 74 >PRK04117 consensus Probab=67.29 E-value=7.5 Score=18.95 Aligned_cols=36 Identities=17% Similarity=0.228 Sum_probs=17.8 Q ss_pred HHHHHHHHHHHCCCCC-EEEEECCCCCCCHHHHHHHCH Q ss_conf 9999999998519997-899848879888899988281 Q gi|254780758|r 174 QLINSINNKILPLGDE-ISFVCGHGPNSTIGRERRLNP 210 (216) Q Consensus 174 ~~~~sl~~~l~~l~~~-~~v~PgHG~~tt~~~e~~~n~ 210 (216) .+.+.. +.+...... +.++--||=.+--+.||-.|. T Consensus 226 Ti~~aa-~~L~~~GA~~V~~~~THglfs~~A~~rl~~s 262 (309) T PRK04117 226 TIVKAA-EALKEKGATSVMACCTHAVLSGPAYERIAKG 262 (309) T ss_pred HHHHHH-HHHHHCCCCEEEEEEECCCCCCHHHHHHHHC T ss_conf 999999-9998669974899994554583699999838 No 75 >KOG3592 consensus Probab=66.99 E-value=4 Score=20.51 Aligned_cols=24 Identities=25% Similarity=0.421 Sum_probs=20.2 Q ss_pred CEEEEEECCCCCCCCCCCHHHHHH Q ss_conf 752999869885210110567886 Q gi|254780758|r 51 HVKQIWITHGHVDHVGGAAQLKED 74 (216) Q Consensus 51 ~i~~Ii~TH~H~DH~gg~~~l~~~ 74 (216) +|++|++||--.|..+|+.-|.++ T Consensus 81 rVdaVLLthpg~dNLpginsllqr 104 (934) T KOG3592 81 RVDAVLLTHPGADNLPGINSLLQR 104 (934) T ss_pred HHHHHHHCCCCCCCCCCCHHHHHH T ss_conf 434654016666766550278899 No 76 >KOG3798 consensus Probab=66.23 E-value=6.2 Score=19.44 Aligned_cols=93 Identities=19% Similarity=0.218 Sum_probs=42.9 Q ss_pred CEEEEEECCCCCCCCCCCHHHHHHHC--CCEEECCHHHHHHHCCHHHHHHHHHCCCCCCCCCCEEEECCCCE--EEEECC Q ss_conf 75299986988521011056788607--95540584389884012344443200234666674189647839--976201 Q gi|254780758|r 51 HVKQIWITHGHVDHVGGAAQLKEDLS--LTIIGPHKDDAAMMGKVDEQARLLSIRMNARNASSDRWLQDGDT--LLLGTH 126 (216) Q Consensus 51 ~i~~Ii~TH~H~DH~gg~~~l~~~~~--~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~gd~--i~lg~~ 126 (216) +++.++.+|.|+||.-.-+ +++-.+ .++....+-...++.. ........+..+++ +.-++. T Consensus 132 ~~d~~~vsh~h~dhld~~~-~~~~~~~~~~~wfvp~g~k~~m~~--------------~gc~~v~el~wwe~~~~vkn~~ 196 (343) T KOG3798 132 DLDFAVVSHDHYDHLDADA-VKKITDRNPQIWFVPLGMKKWMEG--------------DGSSTVTELNWGESSEFVKNGK 196 (343) T ss_pred CCCEECCCCCCCCCCCHHH-HHHHHCCCCCCEECHHHHHHEECC--------------CCCCCEEEEECCHHHCEECCCC T ss_conf 9663034366523236688-876505685210303222012437--------------8877036750100220511895 Q ss_pred CCCCCCCCC-CCCC------------CCCCCCCCCCCCCCCCHHH Q ss_conf 233101222-4721------------0000012343211110000 Q gi|254780758|r 127 IFKVFHCPG-HSPG------------HVIYVNFENNFAHLGDTLF 158 (216) Q Consensus 127 ~~~vi~tPG-Ht~g------------~~~~~~~~~~~lftGD~lf 158 (216) ++.+..+|. |..+ |-++.-+..++.|+||+=| T Consensus 197 ~~ti~~tPaqHw~~R~L~D~Nk~LW~sw~v~g~~nrfffaGDTGy 241 (343) T KOG3798 197 TYTIWCLPAQHWGQRGLFDRNKRLWSSWAVIGENNRFFFAGDTGY 241 (343) T ss_pred EEEEEECCHHHHCCCCCCCCCCCEEEEEEEECCCCEEEECCCCCC T ss_conf 799997532431245433478621775578637845874687776 No 77 >TIGR01821 5aminolev_synth 5-aminolevulinic acid synthase; InterPro: IPR010961 Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway . The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin . The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA. The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III. Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) . This entry represents 5-aminoaevulinic acid (ALA) synthase (2.3.1.37 from EC), which catalyses the first stage of tetrapyrrole biosynthesis by the C4 pathway, namely the condensation of succinyl CoA and glycine. ALA synthase is a pyridoxal-phosphate-dependent enzyme. During catalysis, glycine initially binds to the enzyme cofactor, and after condensation with succinyl CoA, CoA, carbon dioxide and 5-aminolevulinic acid are produced .; GO: 0003870 5-aminolevulinate synthase activity, 0030170 pyridoxal phosphate binding, 0033014 tetrapyrrole biosynthetic process. Probab=64.04 E-value=4 Score=20.50 Aligned_cols=19 Identities=21% Similarity=0.354 Sum_probs=11.9 Q ss_pred EECCCCCCCCCCCCCCCCC Q ss_conf 6201233101222472100 Q gi|254780758|r 123 LGTHIFKVFHCPGHSPGHV 141 (216) Q Consensus 123 lg~~~~~vi~tPGHt~g~~ 141 (216) -|...+++.+||.||+..+ T Consensus 390 RG~ERLRitPtP~H~~~~i 408 (427) T TIGR01821 390 RGTERLRITPTPVHTDKLI 408 (427) T ss_pred CCCCCCCCCCCCCCCHHHH T ss_conf 8750244268889887789 No 78 >PRK09382 ispDF bifunctional 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase/2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase protein; Provisional Probab=62.65 E-value=7.5 Score=18.92 Aligned_cols=70 Identities=19% Similarity=0.249 Sum_probs=38.1 Q ss_pred EECCCCEEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCC-------HHHHHHHHHHHHHC Q ss_conf 9647839976201233101-222472100000123432111100000135663467989-------99999999999851 Q gi|254780758|r 114 WLQDGDTLLLGTHIFKVFH-CPGHSPGHVIYVNFENNFAHLGDTLFRSSIGRTDILHGD-------YQQLINSINNKILP 185 (216) Q Consensus 114 ~l~~gd~i~lg~~~~~vi~-tPGHt~g~~~~~~~~~~~lftGD~lf~~~~G~~d~~~~d-------~~~~~~sl~~~l~~ 185 (216) -+.+|..+.+||.++..-. .-|||+|++.+.---+.+| | ++=.+.+| ..||.+| ...+++...+++.. T Consensus 232 ~~~~~~~l~LgGv~i~~~~~l~gHSDgDV~lHAl~DAiL--G-A~~~gDIG-~~Fp~sd~k~k~~~S~~~L~~~~~~~~~ 307 (382) T PRK09382 232 AFEEGDPVTLCGVKIPHEFGLKGHSDADVALHALTDALL--G-AIGAGDIG-EHFPDSDPQWKGAASKILLEHAVDLVRE 307 (382) T ss_pred CCCCCCCEEEEEEEECCCCCCEEECCCCHHHHHHHHHHH--H-HHCCCCCH-HCCCCCCHHHCCCCHHHHHHHHHHHHHH T ss_conf 147898479987898898735057687168999999998--8-74299702-0089987565147799999999999998 Q ss_pred CC Q ss_conf 99 Q gi|254780758|r 186 LG 187 (216) Q Consensus 186 l~ 187 (216) .. T Consensus 308 ~g 309 (382) T PRK09382 308 AG 309 (382) T ss_pred CC T ss_conf 69 No 79 >TIGR01501 MthylAspMutase methylaspartate mutase, S subunit; InterPro: IPR006394 These sequences represent the S (sigma) subunit of methylaspartate mutase (glutamate mutase), a cobalamin-dependent enzyme that catalyzes the first step in a pathway of glutamate fermentation. ; GO: 0016866 intramolecular transferase activity, 0019670 anaerobic glutamate catabolic process. Probab=61.53 E-value=5.7 Score=19.65 Aligned_cols=45 Identities=13% Similarity=0.320 Sum_probs=29.2 Q ss_pred CCCCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEE-ECCCCCCCHHHHHH Q ss_conf 21111000001356634679899999999999985199978998-48879888899988 Q gi|254780758|r 150 FAHLGDTLFRSSIGRTDILHGDYQQLINSINNKILPLGDEISFV-CGHGPNSTIGRERR 207 (216) Q Consensus 150 ~lftGD~lf~~~~G~~d~~~~d~~~~~~sl~~~l~~l~~~~~v~-PgHG~~tt~~~e~~ 207 (216) .|+.|=.| -+|+.||+ .+++|...+.-| +|| |+|++..-+..-|+ T Consensus 85 llYVGGNl---vVGk~df~---------dV~~rFkeMGfD-RVfap~t~~E~v~~~~k~ 130 (134) T TIGR01501 85 LLYVGGNL---VVGKTDFE---------DVEKRFKEMGFD-RVFAPQTDLEAVVDALKK 130 (134) T ss_pred EEEECCEE---EECCCCHH---------HHHHHHHHCCCC-EEECCCCCHHHHHHHHHH T ss_conf 99876766---55776736---------788887645873-323698582278999987 No 80 >PRK02277 orotate phosphoribosyltransferase-like protein; Provisional Probab=60.51 E-value=13 Score=17.53 Aligned_cols=14 Identities=7% Similarity=0.005 Sum_probs=8.9 Q ss_pred CCCCCCCCCCCHHH Q ss_conf 12343211110000 Q gi|254780758|r 145 NFENNFAHLGDTLF 158 (216) Q Consensus 145 ~~~~~~lftGD~lf 158 (216) +...++++.=|.+- T Consensus 139 V~Gk~VlIVDDViT 152 (201) T PRK02277 139 VAGKRCVIVDDVIT 152 (201) T ss_pred CCCCEEEEEEECCC T ss_conf 37878999970237 No 81 >COG4566 TtrR Response regulator [Signal transduction mechanisms] Probab=57.85 E-value=15 Score=17.25 Aligned_cols=44 Identities=25% Similarity=0.393 Sum_probs=31.8 Q ss_pred EEEEEC---CCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHH Q ss_conf 499983---899999999999974997529998698852101105678 Q gi|254780758|r 28 EAVVVD---PGGDLDIIKDVIQSRNFHVKQIWITHGHVDHVGGAAQLK 72 (216) Q Consensus 28 ~~ilID---~G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~ 72 (216) -|+|.| ||.+.-.+.+.|.+.|.+.--|++| +|.|=-.....++ T Consensus 50 GclllDvrMPg~sGlelq~~L~~~~~~~PVIfiT-GhgDIpmaV~AmK 96 (202) T COG4566 50 GCLLLDVRMPGMSGLELQDRLAERGIRLPVIFLT-GHGDIPMAVQAMK 96 (202) T ss_pred CEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEE-CCCCHHHHHHHHH T ss_conf 7699855788885089999999539998879982-7887289999997 No 82 >PRK11624 cdsA CDP-diglyceride synthase; Provisional Probab=57.79 E-value=5 Score=19.99 Aligned_cols=26 Identities=23% Similarity=0.326 Sum_probs=19.9 Q ss_pred HHHHHHHHHHHHCCCCCEEEEECCCC Q ss_conf 99999999998519997899848879 Q gi|254780758|r 173 QQLINSINNKILPLGDEISFVCGHGP 198 (216) Q Consensus 173 ~~~~~sl~~~l~~l~~~~~v~PgHG~ 198 (216) .++.+|.-||-....|.=.++||||- T Consensus 234 GDL~eS~~KR~~~VKDSG~llPGHGG 259 (285) T PRK11624 234 GDLTESMFKREAGIKDSGHLIPGHGG 259 (285) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 08999999775799978887887552 No 83 >TIGR03348 VI_IcmF type VI secretion protein IcmF. Members of this protein family are IcmF homologs and tend to be associated with type VI secretion systems. Probab=57.13 E-value=15 Score=17.18 Aligned_cols=44 Identities=20% Similarity=0.339 Sum_probs=24.6 Q ss_pred CCE-EEEEECCCCEEEEECCCCC--------------HHHHHHHHHHC--CCCEEEEEECCCCCC Q ss_conf 716-9999889984999838999--------------99999999974--997529998698852 Q gi|254780758|r 16 QNC-TFLFDEESLEAVVVDPGGD--------------LDIIKDVIQSR--NFHVKQIWITHGHVD 63 (216) Q Consensus 16 ~N~-~li~~~~~~~~ilID~G~~--------------~~~i~~~l~~~--~~~i~~Ii~TH~H~D 63 (216) .|| +++ +++++|||+.+- ...+++.|++. .-+|..|++|=.=.| T Consensus 153 ~~cdwwf----t~~AVliDtaGry~~Q~~~~~~d~~~W~~fL~lLkk~R~r~piNGvil~is~~~ 213 (1169) T TIGR03348 153 RNCDWWF----TDEAVLIDTAGRYTTQDSDPEEDAAAWLGFLGLLRKHRRRQPLNGVVVTVSLAD 213 (1169) T ss_pred CCCCEEE----ECCEEEEECCCCEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEHHH T ss_conf 5557165----278799947976026888640018999999999986489899876899978999 No 84 >TIGR01344 malate_syn_A malate synthase A; InterPro: IPR006252 These sequences represent plant malate synthase and one of two bacterial forms, designated malate synthase A. This enzyme and isocitrate lyase are the two characteristic enzymes of the glyoxylate shunt. The shunt enables the cell to use acetyl-CoA to generate increased levels of TCA cycle intermediates for biosynthetic pathways such as gluconeogenesis.; GO: 0004474 malate synthase activity, 0006097 glyoxylate cycle. Probab=55.88 E-value=6.1 Score=19.46 Aligned_cols=40 Identities=25% Similarity=0.280 Sum_probs=27.3 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHCHHHHCC Q ss_conf 3566346798999999999999851999789984887988889998828145148 Q gi|254780758|r 161 SIGRTDILHGDYQQLINSINNKILPLGDEISFVCGHGPNSTIGRERRLNPFLKSN 215 (216) Q Consensus 161 ~~G~~d~~~~d~~~~~~sl~~~l~~l~~~~~v~PgHG~~tt~~~e~~~n~~~~~~ 215 (216) .|||+| |+=|.-|+|.+..++ .|+|--+-.| .+-|||++| T Consensus 257 NCGRWD--------YIFS~IK~L~~~G~~-~VLPDR~~vT------M~kpFl~AY 296 (522) T TIGR01344 257 NCGRWD--------YIFSFIKTLRKAGPE-FVLPDRDAVT------MDKPFLNAY 296 (522) T ss_pred CCCHHH--------HHHHHHHHHHHCCCC-CCCCCCCEEE------CCCHHHHHH T ss_conf 662467--------888898876416885-1468897035------484768899 No 85 >TIGR00475 selB selenocysteine-specific translation elongation factor; InterPro: IPR004535 In prokaryotes, the incorporation of selenocysteine as the 21st amino acid, encoded by TGA, requires several elements: SelC is the tRNA itself, SelD acts as a donor of reduced selenium, SelA modifies a serine residue on SelC into selenocysteine, and SelB is a selenocysteine-specific translation elongation factor. 3' or 5' non-coding elements of mRNA have been found as probable structures for directing selenocysteine incorporation . This family describes the elongation factor SelB, a close homologue of EF-Tu. It may function by replacing EF-Tu. A C-terminal domain not found in EF-Tu is in all SelB sequences in the seed alignment except that from Methanococcus jannaschii. This family should not include an equivalent protein for eukaryotes. ; GO: 0003723 RNA binding, 0003746 translation elongation factor activity, 0005525 GTP binding, 0001514 selenocysteine incorporation, 0005737 cytoplasm. Probab=55.41 E-value=7.6 Score=18.89 Aligned_cols=20 Identities=10% Similarity=0.047 Sum_probs=11.2 Q ss_pred HHHHHHCCCCEEEEEECCCC Q ss_conf 99999749975299986988 Q gi|254780758|r 42 KDVIQSRNFHVKQIWITHGH 61 (216) Q Consensus 42 ~~~l~~~~~~i~~Ii~TH~H 61 (216) +.-++..+.+-.-|++|=.. T Consensus 101 L~vl~~l~~~~~ivvltK~D 120 (627) T TIGR00475 101 LAVLKLLGIPELIVVLTKAD 120 (627) T ss_pred HHHHHHHCCCEEEEEEECCC T ss_conf 99999708961999973467 No 86 >TIGR00619 sbcd nuclease SbcCD, D subunit; InterPro: IPR004593 All proteins in this family for which functions are known are double-stranded DNA exonuclease (as part of a complex with SbcC homologs). This complex functions in the initiation of recombination and recombinational repair and is particularly important in regulating the stability of DNA sections that can form secondary structures. This family is likely to be homologous to the MRE11 family.; GO: 0004527 exonuclease activity, 0006259 DNA metabolic process. Probab=54.96 E-value=13 Score=17.58 Aligned_cols=43 Identities=19% Similarity=0.312 Sum_probs=24.0 Q ss_pred HHHHHHCCCCEEEEEEC--CCCCCCCCCCHHHHHHHCCCEEECCH Q ss_conf 99999749975299986--98852101105678860795540584 Q gi|254780758|r 42 KDVIQSRNFHVKQIWIT--HGHVDHVGGAAQLKEDLSLTIIGPHK 84 (216) Q Consensus 42 ~~~l~~~~~~i~~Ii~T--H~H~DH~gg~~~l~~~~~~~i~~~~~ 84 (216) +..|.+.+.++--|++. |=-.|-.-....|.+..+.-++.... T Consensus 76 l~~L~~~~~R~p~Vvi~GNHD~~~~Ls~~~~Ll~~~~~~v~~~~~ 120 (275) T TIGR00619 76 LLNLSDANPRLPIVVISGNHDSADRLSAAKKLLKELGVFVVGKPE 120 (275) T ss_pred HHHHHHCCCCEEEEEECCCCCCHHHHCHHHHHHHHCCCEEEECCC T ss_conf 999985389612787047888778611167789770954420247 No 87 >cd04170 EF-G_bact Elongation factor G (EF-G) subfamily. Translocation is mediated by EF-G (also called translocase). The structure of EF-G closely resembles that of the complex between EF-Tu and tRNA. This is an example of molecular mimicry; a protein domain evolved so that it mimics the shape of a tRNA molecule. EF-G in the GTP form binds to the ribosome, primarily through the interaction of its EF-Tu-like domain with the 50S subunit. The binding of EF-G to the ribosome in this manner stimulates the GTPase activity of EF-G. On GTP hydrolysis, EF-G undergoes a conformational change that forces its arm deeper into the A site on the 30S subunit. To accommodate this domain, the peptidyl-tRNA in the A site moves to the P site, carrying the mRNA and the deacylated tRNA with it. The ribosome may be prepared for these rearrangements by the initial binding of EF-G as well. The dissociation of EF-G leaves the ribosome ready to accept the next aminoacyl-tRNA into the A site. This group Probab=53.99 E-value=14 Score=17.40 Aligned_cols=50 Identities=16% Similarity=0.205 Sum_probs=22.0 Q ss_pred EEEECCCCC----HHHHHHHHHHCCCCEEEEEECCCCCCC---CCCCHHHHHHHCCCE Q ss_conf 999838999----999999999749975299986988521---011056788607955 Q gi|254780758|r 29 AVVVDPGGD----LDIIKDVIQSRNFHVKQIWITHGHVDH---VGGAAQLKEDLSLTI 79 (216) Q Consensus 29 ~ilID~G~~----~~~i~~~l~~~~~~i~~Ii~TH~H~DH---~gg~~~l~~~~~~~i 79 (216) .++||+... .+.+.+.+++.+++ ..++++....+. ---+..++++++.++ T Consensus 92 v~Vida~~GVe~~T~~~w~~~~~~~iP-~i~fINKmDr~~ad~~~~l~~i~~~lg~~~ 148 (268) T cd04170 92 LVVVSAQSGVEVGTEKLWEFADEAGIP-RIIFINKMDRERADFDKTLAALQEAFGRPV 148 (268) T ss_pred EEEECCCCCHHHHHHHHHHHHHHCCCC-EEEEEECCCCCCCCHHHHHHHHHHHHCCCE T ss_conf 999418754768799999999985999-899997878789964779999999868984 No 88 >TIGR02427 protocat_pcaD 3-oxoadipate enol-lactonase; InterPro: IPR012790 These proteins are 3-oxoadipate enol-lactonases (also called beta-ketoadipate enol-lactone hydrolase), which catalyse the fourth step in the protocatechuate degradation to beta-ketoadipate and then to succinyl-CoA and acetyl-CoA. 4-hydroxybenzoate, 3-hydroxybenzoate, and vanillate can all be converted in one step to protocatechuate. This enzyme also acts in catechol degradation. In genomes that catabolize both catechol and protocatechuate, two forms of this enzyme may be found. All members of the seed alignment for this entry were chosen from within protocatechuate degradation operons of at least three genes of the pathway, and from genomes with the complete pathway through beta-ketoadipate.; GO: 0047570 3-oxoadipate enol-lactonase activity, 0019614 catechol catabolic process. Probab=53.46 E-value=11 Score=18.00 Aligned_cols=33 Identities=24% Similarity=0.355 Sum_probs=13.3 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEE---CCCCC--CCCCC Q ss_conf 99999999999974997529998---69885--21011 Q gi|254780758|r 35 GGDLDIIKDVIQSRNFHVKQIWI---THGHV--DHVGG 67 (216) Q Consensus 35 G~~~~~i~~~l~~~~~~i~~Ii~---TH~H~--DH~gg 67 (216) |-|...+-..+..+.....-|.+ =|++. |=..+ T Consensus 24 GTDl~~Wd~v~~~L~~~f~~lryD~RGHG~Srad~~~~ 61 (256) T TIGR02427 24 GTDLRMWDPVLPALTADFRVLRYDKRGHGLSRADVPEG 61 (256) T ss_pred HCCHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCC T ss_conf 04465568998620279179988508898755778888 No 89 >PRK01215 competence damage-inducible protein A; Provisional Probab=52.88 E-value=18 Score=16.77 Aligned_cols=82 Identities=15% Similarity=0.103 Sum_probs=44.9 Q ss_pred CCCCEEEEEECCC-------CCCCEEEEEEC---CCCEEE-EECCCCCHHHHHHHHHHCCCCEEEEEECCC----CCCCC Q ss_conf 9773599997087-------65716999988---998499-983899999999999974997529998698----85210 Q gi|254780758|r 1 MSKLSFHIVQVTP-------FKQNCTFLFDE---ESLEAV-VVDPGGDLDIIKDVIQSRNFHVKQIWITHG----HVDHV 65 (216) Q Consensus 1 m~~~~~~~i~v~~-------~~~N~~li~~~---~~~~~i-lID~G~~~~~i~~~l~~~~~~i~~Ii~TH~----H~DH~ 65 (216) |.+|+-.+|.+|- .++|+.++... -|-++. ..=-|.+.+.+.+.++..-.+-+.|+.|=+ |-|.. T Consensus 1 ~~~~kA~II~IG~ElL~G~i~dtNs~~la~~L~~~G~~v~~~~~V~D~~~~I~~~l~~a~~r~d~Vi~tGGLGPT~DDlT 80 (264) T PRK01215 1 MEKGRAWIITVGNELLIGRTVNTNASWIARRLTYLGYSVRRIVVVPDDEEEIVEAFREAIGRADVVVSTGGLGPTYDDMT 80 (264) T ss_pred CCCCEEEEEEECCCCCCCCEECHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHH T ss_conf 99855999998243106610516799999999977993889999289899999999999743899999489579985679 Q ss_pred CCCHHHHHHHCCCEEECCH Q ss_conf 1105678860795540584 Q gi|254780758|r 66 GGAAQLKEDLSLTIIGPHK 84 (216) Q Consensus 66 gg~~~l~~~~~~~i~~~~~ 84 (216) - ..+.+.++.+.....+ T Consensus 81 ~--eavA~a~g~~L~~~~e 97 (264) T PRK01215 81 N--EGFAKALGVELELNED 97 (264) T ss_pred H--HHHHHHHCCCEEECHH T ss_conf 9--9999984998498499 No 90 >pfam01148 CTP_transf_1 Cytidylyltransferase family. The members of this family are integral membrane protein cytidylyltransferases. The family includes phosphatidate cytidylyltransferase EC:2.7.7.41 as well as Sec59 from yeast. Sec59 is a dolichol kinase EC:2.7.1.108. Probab=52.44 E-value=7.6 Score=18.90 Aligned_cols=27 Identities=22% Similarity=0.268 Sum_probs=20.1 Q ss_pred HHHHHHHHHHHHCCCCCEEEEECCCCC Q ss_conf 999999999985199978998488798 Q gi|254780758|r 173 QQLINSINNKILPLGDEISFVCGHGPN 199 (216) Q Consensus 173 ~~~~~sl~~~l~~l~~~~~v~PgHG~~ 199 (216) .++.+|.-||-....|.=.++||||-- T Consensus 212 GDL~eS~~KR~~~vKDsg~llPGHGG~ 238 (259) T pfam01148 212 GDLVESGFKRDFGIKDSGKLIPGHGGI 238 (259) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCH T ss_conf 099999998805999777877875407 No 91 >TIGR01090 apt adenine phosphoribosyltransferase; InterPro: IPR005764 Adenine phosphoribosyltransferase (APRTase, 2.4.2.7 from EC) is a widely distributed enzyme, and its deficiency in humans causes the accumulation of 2,8-dihydroxyadenine. It is the sole catalyst for adenine recycling in most eukaryotes.AMP + diphosphate = adenine + 5-phospho-alpha-D-ribose 1-diphosphate ; GO: 0003999 adenine phosphoribosyltransferase activity, 0006168 adenine salvage. Probab=51.69 E-value=19 Score=16.66 Aligned_cols=32 Identities=22% Similarity=0.252 Sum_probs=17.1 Q ss_pred CCCCCCCCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 234321111000001356634679899999999999985199978 Q gi|254780758|r 146 FENNFAHLGDTLFRSSIGRTDILHGDYQQLINSINNKILPLGDEI 190 (216) Q Consensus 146 ~~~~~lftGD~lf~~~~G~~d~~~~d~~~~~~sl~~~l~~l~~~~ 190 (216) +..++|++=|.| . . +|. +.+++ +.+..|..++ T Consensus 113 ~g~RVliVDDLl-A--T------GGT---~~A~~-eLi~~Lggev 144 (175) T TIGR01090 113 PGQRVLIVDDLL-A--T------GGT---AEATL-ELIKKLGGEV 144 (175) T ss_pred CCCEEEEEECCC-C--C------HHH---HHHHH-HHHHHHCCEE T ss_conf 789089983220-1--2------678---99999-9999859616 No 92 >PRK10161 transcriptional regulator PhoB; Provisional Probab=51.13 E-value=19 Score=16.61 Aligned_cols=57 Identities=14% Similarity=0.143 Sum_probs=30.3 Q ss_pred CCCCCCCCCCCCHHHHCCCCCCCCCC-CCHHHHHHHHHHHHHCCCCC-E-EEEECCCCCC Q ss_conf 01234321111000001356634679-89999999999998519997-8-9984887988 Q gi|254780758|r 144 VNFENNFAHLGDTLFRSSIGRTDILH-GDYQQLINSINNKILPLGDE-I-SFVCGHGPNS 200 (216) Q Consensus 144 ~~~~~~~lftGD~lf~~~~G~~d~~~-~d~~~~~~sl~~~l~~l~~~-~-~v~PgHG~~t 200 (216) +....+-+++=|.|+..-.|.-...+ -+....+..||+||..-+.. . .-+.|.|+-. T Consensus 166 L~~~~g~vvsr~~L~~~vwg~~~~~~~rtldv~I~rLRkKl~~~~~~~~I~TvrG~GYr~ 225 (229) T PRK10161 166 FMTHPERVYSREQLLNHVWGTNVYVEDRTVDVHIRRLRKALEPGGHDRMVQTVRGTGYRF 225 (229) T ss_pred HHHCCCCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCEEEECCCCEEE T ss_conf 997799879799999986399989884739999999999745679998199865847405 No 93 >COG0575 CdsA CDP-diglyceride synthetase [Lipid metabolism] Probab=50.60 E-value=9.5 Score=18.34 Aligned_cols=27 Identities=26% Similarity=0.357 Sum_probs=20.3 Q ss_pred HHHHHHHHHHHHCCCCCEEEEECCCCC Q ss_conf 999999999985199978998488798 Q gi|254780758|r 173 QQLINSINNKILPLGDEISFVCGHGPN 199 (216) Q Consensus 173 ~~~~~sl~~~l~~l~~~~~v~PgHG~~ 199 (216) .++.+|.-++.....|.-.++||||-- T Consensus 217 GDL~eS~iKR~~giKDsg~liPGHGGi 243 (265) T COG0575 217 GDLFESYIKRLLGIKDSGWLIPGHGGI 243 (265) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 789999999971999787778876631 No 94 >TIGR02362 dhaK1b probable dihydroxyacetone kinase DhaK1b subunit; InterPro: IPR012735 Two types of dihydroxyacetone kinase (glycerone kinase) are described. In yeast and a few bacteria, e.g. Citrobacter freundii, the enzyme is a single chain that uses ATP as phosphoryl donor and is designated 2.7.1.29 from EC. By contract, Escherichia coli and many other bacterial species have a multisubunit form with a phosphoprotein donor related to PTS transport proteins. This family represents a protein, unique to the firmicutes (low GC Gram-positives), that appears to be a divergent second copy of the K subunit of that complex; its gene is always found in operons with the other three proteins of the complex.. Probab=49.67 E-value=13 Score=17.48 Aligned_cols=16 Identities=13% Similarity=0.086 Sum_probs=9.1 Q ss_pred HHHHHHHCCCCEEEEE Q ss_conf 9999997499752999 Q gi|254780758|r 41 IKDVIQSRNFHVKQIW 56 (216) Q Consensus 41 i~~~l~~~~~~i~~Ii 56 (216) ..+.+++.|.++.+|+ T Consensus 114 A~~qar~EG~~~~~~i 129 (328) T TIGR02362 114 AIEQARQEGRDIKYII 129 (328) T ss_pred HHHHHHHCCCCEEEEE T ss_conf 9999986389503799 No 95 >TIGR00972 3a0107s01c2 phosphate ABC transporter, ATP-binding protein; InterPro: IPR005670 This is a family of phosphate transport system permease proteins.; GO: 0005315 inorganic phosphate transmembrane transporter activity, 0015114 phosphate transmembrane transporter activity, 0006817 phosphate transport, 0016020 membrane. Probab=49.35 E-value=19 Score=16.58 Aligned_cols=30 Identities=7% Similarity=0.248 Sum_probs=15.5 Q ss_pred CCCEEEEECCCC-CHHHHHHHHHHC-----CCCEEE Q ss_conf 998499983899-999999999974-----997529 Q gi|254780758|r 25 ESLEAVVVDPGG-DLDIIKDVIQSR-----NFHVKQ 54 (216) Q Consensus 25 ~~~~~ilID~G~-~~~~i~~~l~~~-----~~~i~~ 54 (216) .+.-.+||=|-+ .=+.++..|++. +.+++. T Consensus 26 ~n~vTAlIGPSGCGKSTlLR~lNRMnDl~~~~r~~G 61 (248) T TIGR00972 26 KNQVTALIGPSGCGKSTLLRSLNRMNDLVPGVRIEG 61 (248) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEE T ss_conf 770589877889867899999887764078816888 No 96 >PRK09836 DNA-binding transcriptional activator CusR; Provisional Probab=47.68 E-value=21 Score=16.29 Aligned_cols=56 Identities=9% Similarity=-0.141 Sum_probs=28.7 Q ss_pred CCCCCCCCCCCCHHHHCCCCCCCCC-CCCHHHHHHHHHHHHHCCC-CCEE-EEECCCCC Q ss_conf 0123432111100000135663467-9899999999999985199-9789-98488798 Q gi|254780758|r 144 VNFENNFAHLGDTLFRSSIGRTDIL-HGDYQQLINSINNKILPLG-DEIS-FVCGHGPN 199 (216) Q Consensus 144 ~~~~~~~lftGD~lf~~~~G~~d~~-~~d~~~~~~sl~~~l~~l~-~~~~-v~PgHG~~ 199 (216) +....+.+++=|.|+..-.|..... .-.....+..||+||..-. +..+ =+.|-|+- T Consensus 162 L~~~~g~vvsre~l~~~vw~~~~~~~~~~vdv~I~rLR~Kl~~~~~~~~I~TvrG~GY~ 220 (226) T PRK09836 162 FLRHQGEVLPRSLIASQVWDMNFDSDTNAIDVAVKRLRGKIDNDFEPKLIQTVRGVGYM 220 (226) T ss_pred HHHCCCCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCEE T ss_conf 98689964879999999628887888588999999999985788999839996681569 No 97 >PRK05433 GTP-binding protein LepA; Provisional Probab=45.14 E-value=23 Score=16.08 Aligned_cols=55 Identities=27% Similarity=0.306 Sum_probs=27.4 Q ss_pred CCCCCCCCCCC---CCCCCCCCHHHHCCC-------CC----C-----CCC--CCCHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 72100000123---432111100000135-------66----3-----467--9899999999999985199978998 Q gi|254780758|r 137 SPGHVIYVNFE---NNFAHLGDTLFRSSI-------GR----T-----DIL--HGDYQQLINSINNKILPLGDEISFV 193 (216) Q Consensus 137 t~g~~~~~~~~---~~~lftGD~lf~~~~-------G~----~-----d~~--~~d~~~~~~sl~~~l~~l~~~~~v~ 193 (216) .+|.+.|.+.. -+-+.+||||-...- |. + -+| ..|...+.++| +|| .|+|-...+ T Consensus 254 ~aGeVGyiiagiK~~~d~~vGDTit~~~~~~~~pLpGf~~~kP~Vfagi~P~~~~d~~~Lr~AL-~KL-~LnD~Sl~~ 329 (601) T PRK05433 254 SAGEVGYIIAGIKDVRDARVGDTITLAKNPAEEPLPGFKEVKPMVFAGLYPVDSDDYEDLRDAL-EKL-QLNDASLTY 329 (601) T ss_pred CCCCEEEEEECCCCHHHCCCCCEEECCCCCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHH-HHH-HHCCCCEEE T ss_conf 3784479982454443233476055477777534677889998699723427700689999999-988-635775376 No 98 >KOG1448 consensus Probab=43.27 E-value=25 Score=15.88 Aligned_cols=34 Identities=29% Similarity=0.358 Sum_probs=13.0 Q ss_pred HHHHHHHHHHCCC-CCEEEEECCCCCCCHHHHHHHC Q ss_conf 9999999985199-9789984887988889998828 Q gi|254780758|r 175 LINSINNKILPLG-DEISFVCGHGPNSTIGRERRLN 209 (216) Q Consensus 175 ~~~sl~~~l~~l~-~~~~v~PgHG~~tt~~~e~~~n 209 (216) +.++- ++|..-. .++..+--||=.+-=.-||..| T Consensus 230 l~~aa-~~L~~~GA~kV~a~~THgVfs~~a~er~~~ 264 (316) T KOG1448 230 LIKAA-DKLLEHGAKKVYAIVTHGVFSGPAIERLNE 264 (316) T ss_pred HHHHH-HHHHHCCCCEEEEEECCEECCCCHHHHHHH T ss_conf 99999-999865874599997132226318877344 No 99 >COG0104 PurA Adenylosuccinate synthase [Nucleotide transport and metabolism] Probab=43.08 E-value=25 Score=15.86 Aligned_cols=50 Identities=24% Similarity=0.369 Sum_probs=31.2 Q ss_pred CCEEEEEECCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCEE--EEEE Q ss_conf 735999970876571699998899849998389999999999997499752--9998 Q gi|254780758|r 3 KLSFHIVQVTPFKQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHVK--QIWI 57 (216) Q Consensus 3 ~~~~~~i~v~~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i~--~Ii~ 57 (216) +-.+|.||.|-+..|+..++.. + ++|||-.-.+. ++.|+++|+... .+++ T Consensus 49 ~~~LhLiPSGil~~~~~~vIGn--G--vVvdP~~L~~E-i~~L~~~g~~~~~~rL~I 100 (430) T COG0104 49 KYKLHLIPSGILRPGATLVIGN--G--VVVDPEVLLKE-IEELEERGVDVVRDRLRI 100 (430) T ss_pred EEEEEECCCCCCCCCCEEEECC--C--EEECHHHHHHH-HHHHHHCCCCCCCCEEEE T ss_conf 9999966644567996588638--6--78879999999-999986488666010787 No 100 >TIGR01307 pgm_bpd_ind 2,3-bisphosphoglycerate-independent phosphoglycerate mutase; InterPro: IPR005995 This 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (iPGAM) is a metalloenzyme found particularly in eubacteria and higher plants. It is distantly related to archaeal iPGAM (IPR004456 from INTERPRO) and distinct from the unrelated cofactor-dependent PGAM (PIRSF001490 from PIRSF). Activity has been demonstrated for proteins from a variety of organisms, including Pseudomonas syringae pv. tomato , Bacillus subtilis , Bacillus stearothermophilus , maize , castor bean , and Trypanosoma brucei . The structure of the B. stearothermophilus enzyme (PDB:1EJJ) has two domains . Residues 1-76 and 311-511 form the phosphatase domain, containing the active site residue and two metal-binding sites. This domain is similar to alkaline phosphatase (PDB:1ALK) and arylsulphatase (PDB:1AUK), which are members of the SCOP alkaline phosphatase-like superfamily, but there is meagre sequence similarity outside of the metal-binding segments. Residues 77-310 form the phosphotransferase domain, which is poorly conserved (or perhaps unrelated) in the archaeal enzymes.; GO: 0004619 phosphoglycerate mutase activity, 0006007 glucose catabolic process. Probab=42.65 E-value=9.1 Score=18.44 Aligned_cols=45 Identities=18% Similarity=0.324 Sum_probs=29.1 Q ss_pred HHHHHHHHH---CCCCEEEE-EEC----CCCCCCCCCCHHHHHHHCCCEEECC Q ss_conf 999999997---49975299-986----9885210110567886079554058 Q gi|254780758|r 39 DIIKDVIQS---RNFHVKQI-WIT----HGHVDHVGGAAQLKEDLSLTIIGPH 83 (216) Q Consensus 39 ~~i~~~l~~---~~~~i~~I-i~T----H~H~DH~gg~~~l~~~~~~~i~~~~ 83 (216) +.+++.+++ .+.+|--+ |++ |+|.||+-|+..+.+.-++..++.| T Consensus 100 ~a~~~a~~~a~~~~~~lHl~GL~S~GGVHSh~~Hl~AL~~~A~~~G~~k~~lH 152 (529) T TIGR01307 100 PALLGAIDRAKDANGKLHLMGLVSDGGVHSHIDHLIALIELAAERGIEKVCLH 152 (529) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHCCCEEEEEE T ss_conf 27999999998468917999733889503608899999999966893179998 No 101 >PRK01117 adenylosuccinate synthetase; Provisional Probab=41.59 E-value=22 Score=16.18 Aligned_cols=45 Identities=22% Similarity=0.341 Sum_probs=28.4 Q ss_pred CCEEEEEECCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCE Q ss_conf 73599997087657169999889984999838999999999999749975 Q gi|254780758|r 3 KLSFHIVQVTPFKQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHV 52 (216) Q Consensus 3 ~~~~~~i~v~~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i 52 (216) +..+|++|.|-|..|+..++.. + ++|||-.-.+. ++.|++.|+++ T Consensus 49 k~~lhllPSGil~~~~~~vIG~--G--vVidp~~L~~E-i~~L~~~Gi~~ 93 (431) T PRK01117 49 KYKLHLIPSGILRPGKTCVIGN--G--VVLDPEALLKE-IDELEARGVDT 93 (431) T ss_pred EEEEEECCCEECCCCCCEEECC--C--EEECHHHHHHH-HHHHHHCCCCC T ss_conf 9999607827516997178579--7--89869999999-99999769998 No 102 >TIGR02515 IV_pilus_PilQ type IV pilus secretin PilQ; InterPro: IPR013355 A number of proteins homologous to PilQ are involved in type IV pilus formation, competence for transformation, type III secretion, and type II secretion (also referred to as the main terminal branch of the general secretion pathway). Members of this family include PilQ itself, which is a component of the type IV pilus structure, from a number of species. In Haemophilus influenzae, the member of this family is associated with the uptake of exogenous DNA, rather than with formation of a type IV pilus - the surface structure required for this may be considered an unusual, incomplete type IV pilus structure.; GO: 0009297 pilus biogenesis, 0009306 protein secretion. Probab=39.25 E-value=29 Score=15.51 Aligned_cols=39 Identities=26% Similarity=0.352 Sum_probs=29.9 Q ss_pred EEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 999988998499983899999999999974997529998 Q gi|254780758|r 19 TFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHVKQIWI 57 (216) Q Consensus 19 ~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i~~Ii~ 57 (216) .+.+|+-++..++=|.....+++.+.+++.+.+++.|++ T Consensus 159 Svt~D~RTN~Liv~D~~~~l~~~r~li~~lD~Pv~QV~I 197 (464) T TIGR02515 159 SVTVDPRTNTLIVTDIPEKLARIRELIAELDIPVKQVLI 197 (464) T ss_pred CEEEECCCCEEEEEECHHHHHHHHHHHHHCCCCCCCEEE T ss_conf 089742425387862157899999999961898433899 No 103 >pfam00709 Adenylsucc_synt Adenylosuccinate synthetase. Probab=37.79 E-value=28 Score=15.57 Aligned_cols=45 Identities=20% Similarity=0.350 Sum_probs=28.1 Q ss_pred CCEEEEEECCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCE Q ss_conf 73599997087657169999889984999838999999999999749975 Q gi|254780758|r 3 KLSFHIVQVTPFKQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHV 52 (216) Q Consensus 3 ~~~~~~i~v~~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i 52 (216) +..+|++|.|-|..|+-.++. + .++|||-.-.+. ++.|++.|+.+ T Consensus 47 k~~lhliPSGil~~~~~~vIG--~--GvVidp~~L~~E-i~~L~~~Gi~~ 91 (420) T pfam00709 47 KYKLHLLPSGILNPGVINVIG--N--GVVVDPEALLEE-IEELEKKGVDV 91 (420) T ss_pred EEEEECCCCEECCCCCEEEEC--C--CEEECHHHHHHH-HHHHHHCCCCC T ss_conf 999951783632799718989--9--789859999999-99999759897 No 104 >pfam10070 DUF2309 Uncharacterized protein conserved in bacteria (DUF2309). Members of this family of hypothetical bacterial proteins have no known function. Probab=37.14 E-value=26 Score=15.83 Aligned_cols=20 Identities=35% Similarity=0.627 Sum_probs=13.9 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHCHHH Q ss_conf 999789984887988889998828145 Q gi|254780758|r 186 LGDEISFVCGHGPNSTIGRERRLNPFL 212 (216) Q Consensus 186 l~~~~~v~PgHG~~tt~~~e~~~n~~~ 212 (216) +.+ ..++-|||..|+ ||||- T Consensus 494 FA~-lV~l~GHGS~s~------NNP~~ 513 (783) T pfam10070 494 FAP-LVVLVGHGSQST------NNPHA 513 (783) T ss_pred CCC-EEEEECCCCCCC------CCCHH T ss_conf 777-589951676767------88046 No 105 >TIGR00151 ispF 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; InterPro: IPR003526 This entry represents MECDP (2-C-methyl-D-erythritol 2,4-cyclodiphosphate) synthetase, an enzyme in the non-mevalonate pathway of isoprenoid synthesis, isoprenoids being essential in all organisms. Isoprenoids can also be synthesized through the mevalonate pathway. The non-mevolante route is used by many bacteria and human pathogens, including Mycobacterium tuberculosis and Plasmodium falciparum. This route appears to involve seven enzymes. MECDP synthetase catalyses the intramolecular attack by a phosphate group on a diphosphate, with cytidine monophosphate (CMP) acting as the leaving group to give the cyclic diphosphate product MEDCP. The enzyme is a trimer with three active sites shared between adjacent copies of the protein. The enzyme also has two metal binding sites, the metals playing key roles in catalysis. A number of proteins from eukaryotes and prokaryotes share this common N-terminal signature and appear to be involved in terpenoid biosynthesis. The ygbB protein is a putative enzyme of this type .; GO: 0008685 2-C-methyl-D-erythritol 24-cyclodiphosphate synthase activity, 0016114 terpenoid biosynthetic process. Probab=36.85 E-value=32 Score=15.29 Aligned_cols=68 Identities=28% Similarity=0.342 Sum_probs=38.9 Q ss_pred EEECCC---CEEEEECCCCCCCC-C-CCCCCCCCCCCCCCC---CCCCCCCHHHHCCCCC----CC--CCCCCHHHHHHH Q ss_conf 896478---39976201233101-2-224721000001234---3211110000013566----34--679899999999 Q gi|254780758|r 113 RWLQDG---DTLLLGTHIFKVFH-C-PGHSPGHVIYVNFEN---NFAHLGDTLFRSSIGR----TD--ILHGDYQQLINS 178 (216) Q Consensus 113 ~~l~~g---d~i~lg~~~~~vi~-t-PGHt~g~~~~~~~~~---~~lftGD~lf~~~~G~----~d--~~~~d~~~~~~s 178 (216) ..|.+| ..+.+||..+.--. . -|||.|++.+.---+ +.+--|| +|. +| |-+.|..+|++- T Consensus 9 H~~~~g~~P~~liLGGV~ip~~~~Gl~gHSDGDvllHAl~DAlLGA~glGD------IG~~FPdtd~~wKgaDS~~lL~~ 82 (159) T TIGR00151 9 HRLVPGDDPRPLILGGVEIPHEKLGLLGHSDGDVLLHALTDALLGALGLGD------IGKHFPDTDPRWKGADSRVLLRK 82 (159) T ss_pred EECCCCCCCCEEEEEEEEEECCCCCEEEECHHHHHHHHHHHHHHHHHHHCC------CCCCCCCCCHHHCCCCHHHHHHH T ss_conf 103878788817880067405455512416278899999988887752000------22437862334568877899999 Q ss_pred HHHHHHCC Q ss_conf 99998519 Q gi|254780758|r 179 INNKILPL 186 (216) Q Consensus 179 l~~~l~~l 186 (216) +.+.+... T Consensus 83 ~~~~~~~~ 90 (159) T TIGR00151 83 AVELVKEK 90 (159) T ss_pred HHHHHHHC T ss_conf 99999855 No 106 >pfam08747 DUF1788 Domain of unknown function (DUF1788). Putative uncharacterized domain in proteins of length around 200 amino acids. Probab=35.45 E-value=15 Score=17.22 Aligned_cols=58 Identities=12% Similarity=0.191 Sum_probs=27.8 Q ss_pred CHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCH--HHHHHHCHHHHC Q ss_conf 000001356634679899999999999985199978998488798888--999882814514 Q gi|254780758|r 155 DTLFRSSIGRTDILHGDYQQLINSINNKILPLGDEISFVCGHGPNSTI--GRERRLNPFLKS 214 (216) Q Consensus 155 D~lf~~~~G~~d~~~~d~~~~~~sl~~~l~~l~~~~~v~PgHG~~tt~--~~e~~~n~~~~~ 214 (216) |.+|.-++|..+ |---...+++.+..+....| -+..|||+=.-+++ ..+.+.+.|+|+ T Consensus 62 dvvfltGvG~vy-P~iR~h~lLn~L~~~~~~~P-~v~FyPG~y~g~~L~lFg~l~d~~YYRA 121 (125) T pfam08747 62 DIVFLTGVGEVY-PLIRSHNLLNNLHSVMGDVP-LVMFYPGEYDGQSLRLFGRLKDDNYYRA 121 (125) T ss_pred CEEEEECCCCCC-HHHHHHHHHHHHHHHHCCCC-EEEECCCCCCCCEEEECCCCCCCCCCCH T ss_conf 489993774215-06868999998888754881-8997587426865553356688774231 No 107 >TIGR02199 rfaE_dom_II rfaE bifunctional protein, domain II; InterPro: IPR011914 RfaE is a protein involved in the biosynthesis of ADP-L-glycero-D-manno-heptose, a precursor for LPS inner core biosynthesis. RfaE is a bifunctional protein in Escherichia coli, and separate proteins in some other genome. Domain I (IPR011913 from INTERPRO) is suggested to act in D-glycero-D-manno-heptose 1-phosphate biosynthesis, while domain II (this family) adds ADP to yield ADP-D-glycero-D-manno-heptose .; GO: 0016773 phosphotransferase activity alcohol group as acceptor, 0016779 nucleotidyltransferase activity, 0005975 carbohydrate metabolic process. Probab=34.78 E-value=20 Score=16.41 Aligned_cols=30 Identities=23% Similarity=0.371 Sum_probs=19.2 Q ss_pred HHHHHHCCCCEEEEEECCCCCC--CCCCCHHHHHH Q ss_conf 9999974997529998698852--10110567886 Q gi|254780758|r 42 KDVIQSRNFHVKQIWITHGHVD--HVGGAAQLKED 74 (216) Q Consensus 42 ~~~l~~~~~~i~~Ii~TH~H~D--H~gg~~~l~~~ 74 (216) .+.+++++.+ |+.|-+=+| |.|-..+|.++ T Consensus 3 ~~~~r~~g~k---vVFTNGCFDiLH~GHV~YL~~A 34 (144) T TIGR02199 3 VAEARARGKK---VVFTNGCFDILHAGHVSYLQQA 34 (144) T ss_pred CHHHHHCCCC---EEECCCCCCCCCCCHHHHHHHH T ss_conf 0378837897---7851773101021137689999 No 108 >pfam06415 iPGM_N BPG-independent PGAM N-terminus (iPGM_N). This family represents the N-terminal region of the 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (or phosphoglyceromutase or BPG-independent PGAM) protein (EC:5.4.2.1). The family is found in conjunction with pfam01676 (located in the C-terminal region of the protein). Probab=34.67 E-value=35 Score=15.08 Aligned_cols=40 Identities=20% Similarity=0.339 Sum_probs=27.1 Q ss_pred HHHHHHHCCCCEEEE-EE----CCCCCCCCCCCHHHHHHHCCCEE Q ss_conf 999999749975299-98----69885210110567886079554 Q gi|254780758|r 41 IKDVIQSRNFHVKQI-WI----THGHVDHVGGAAQLKEDLSLTII 80 (216) Q Consensus 41 i~~~l~~~~~~i~~I-i~----TH~H~DH~gg~~~l~~~~~~~i~ 80 (216) +++.+++.+.++.-+ ++ -|+|.||+-++-.+..+.+++.+ T Consensus 19 ~~~~~k~~~~~lHL~GL~SdGGVHShidHl~al~~~a~~~gv~~v 63 (223) T pfam06415 19 AIDHVKKNNGALHLIGLLSDGGVHSHIDHLFALLKLAKERGVKKV 63 (223) T ss_pred HHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCEE T ss_conf 999999739859999941499763247999999999997188708 No 109 >PRK10816 DNA-binding transcriptional regulator PhoP; Provisional Probab=34.41 E-value=35 Score=15.05 Aligned_cols=57 Identities=18% Similarity=0.147 Sum_probs=30.5 Q ss_pred CCCCCCCCCCCCCHHHHCCCCCCCCCCC-CHHHHHHHHHHHHHCCCC-CE-EEEECCCCC Q ss_conf 0012343211110000013566346798-999999999999851999-78-998488798 Q gi|254780758|r 143 YVNFENNFAHLGDTLFRSSIGRTDILHG-DYQQLINSINNKILPLGD-EI-SFVCGHGPN 199 (216) Q Consensus 143 ~~~~~~~~lftGD~lf~~~~G~~d~~~~-d~~~~~~sl~~~l~~l~~-~~-~v~PgHG~~ 199 (216) ++....+-+++-|.|+..-.|..+.+.. +...+...||+|+....+ +. .=+.|-|+- T Consensus 160 ~L~~~~~~vvsre~l~~~vw~~~~~~~~~sld~~I~rLRkKl~~~~~~~~I~tvrG~GY~ 219 (223) T PRK10816 160 TLIRNNGKVVSKDSLMLQLYPDAELRESHTIDVLMGRLRKKIQAQYPQEVITTVRGQGYL 219 (223) T ss_pred HHHHCCCEEECHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCEEEECCCCEE T ss_conf 999879923839999998549987888677999999999974335899818997895679 No 110 >TIGR00968 3a0106s01 sulfate ABC transporter, ATP-binding protein; InterPro: IPR005666 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). These proteins are involved in the transmembrane transport of sulphate and thiosulphate.; GO: 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0016020 membrane. Probab=34.29 E-value=35 Score=15.04 Aligned_cols=28 Identities=14% Similarity=0.271 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHCCCCCE---EEEECCCCCC Q ss_conf 999999999985199978---9984887988 Q gi|254780758|r 173 QQLINSINNKILPLGDEI---SFVCGHGPNS 200 (216) Q Consensus 173 ~~~~~sl~~~l~~l~~~~---~v~PgHG~~t 200 (216) ++.++.||..|+.|.+++ .|+--|+... T Consensus 167 AkvRk~LR~WLR~LH~e~~~T~VfVTHD~~E 197 (241) T TIGR00968 167 AKVRKELRAWLRKLHDEVHVTTVFVTHDQEE 197 (241) T ss_pred HHHHHHHHHHHHHHCCEEEEEEEEEECCHHH T ss_conf 8999999999987403056779998628589 No 111 >PRK05434 phosphoglyceromutase; Provisional Probab=33.28 E-value=37 Score=14.94 Aligned_cols=40 Identities=15% Similarity=0.218 Sum_probs=25.8 Q ss_pred HHHHHHCCCCEEEE-EE----CCCCCCCCCCCHHHHHHHCCC-EEE Q ss_conf 99999749975299-98----698852101105678860795-540 Q gi|254780758|r 42 KDVIQSRNFHVKQI-WI----THGHVDHVGGAAQLKEDLSLT-IIG 81 (216) Q Consensus 42 ~~~l~~~~~~i~~I-i~----TH~H~DH~gg~~~l~~~~~~~-i~~ 81 (216) ++.+++.+.++.-+ ++ -|+|.||+.++-.+..+.+++ ++. T Consensus 102 i~~~k~~~~~lHl~GLlSdGGVHSH~~Hl~al~~~a~~~gv~~v~v 147 (511) T PRK05434 102 IDKAKKNGGALHLMGLLSDGGVHSHIDHLFALLELAAKEGVKKVYL 147 (511) T ss_pred HHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEE T ss_conf 9999973981699985238843133789999999999708884899 No 112 >TIGR00644 recJ single-stranded-DNA-specific exonuclease RecJ; InterPro: IPR004610 All proteins in this family are 5'-3' single-strand DNA exonucleases. These proteins are used in some aspects of mismatch repair, recombination, and recombinational repair. RecJ is a single-stranded DNA-specific endonuclease which is required for many types of recombination events, although the stringency of the requirement for RecJ appears to vary with the type of recombinational event monitored, and the other recombination gene products which are available.; GO: 0008409 5'-3' exonuclease activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=32.40 E-value=38 Score=14.86 Aligned_cols=31 Identities=19% Similarity=0.191 Sum_probs=20.0 Q ss_pred EEEE-ECCCCCHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 4999-83899999999999974997529998698 Q gi|254780758|r 28 EAVV-VDPGGDLDIIKDVIQSRNFHVKQIWITHG 60 (216) Q Consensus 28 ~~il-ID~G~~~~~i~~~l~~~~~~i~~Ii~TH~ 60 (216) +.++ ||||-..-.-.+++++.|+.| |++=|+ T Consensus 116 ~LiiTVD~Gi~a~~e~~~a~~~G~dV--IVtDHH 147 (705) T TIGR00644 116 SLIITVDNGISAHEEIEYAKELGIDV--IVTDHH 147 (705) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCEE--EEECCC T ss_conf 39998268742699999998669819--997256 No 113 >PRK13784 adenylosuccinate synthetase; Provisional Probab=31.33 E-value=39 Score=14.75 Aligned_cols=45 Identities=18% Similarity=0.271 Sum_probs=27.7 Q ss_pred CCEEEEEECCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCE Q ss_conf 73599997087657169999889984999838999999999999749975 Q gi|254780758|r 3 KLSFHIVQVTPFKQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHV 52 (216) Q Consensus 3 ~~~~~~i~v~~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i 52 (216) +..+|++|.|-+..|+..++. . .++|||..-.+. ++.|++.|+++ T Consensus 49 k~~lhliPSGvl~~~~~~~ig--~--GvVidp~~L~~E-i~~l~~~gv~~ 93 (428) T PRK13784 49 KTFLHLIPSGVLHQHTKCVIG--H--GVVLDPVALDEE-ITRLQAKGIAI 93 (428) T ss_pred EEEEEECCCEECCCCCEEEEC--C--CEEECHHHHHHH-HHHHHHCCCCC T ss_conf 999970682763899738889--9--499979999999-99999619998 No 114 >PRK13811 orotate phosphoribosyltransferase; Provisional Probab=29.97 E-value=42 Score=14.61 Aligned_cols=10 Identities=40% Similarity=0.531 Sum_probs=5.2 Q ss_pred CCCCCCCCHH Q ss_conf 8521011056 Q gi|254780758|r 61 HVDHVGGAAQ 70 (216) Q Consensus 61 H~DH~gg~~~ 70 (216) ..|.++|.+. T Consensus 56 ~~d~I~G~a~ 65 (170) T PRK13811 56 DFDAVAGVAV 65 (170) T ss_pred CCCEEECCCH T ss_conf 9889971410 No 115 >TIGR02998 RraA_entero regulator of ribonuclease activity A; InterPro: IPR014339 This entry includes a number of closely related sequences from the gammaproteobacteria. The Escherichia coli member of this family, RraA, has been characterised as a regulator of RNase E (see IPR004659 from INTERPRO), and its crystal structure has been analysed . E. coli RraA acts as a trans-acting modulator of RNA turnover, binding essential endonuclease RNase E and inhibiting RNA processing . RNase E forms the core of a large RNA-catalysis machine termed the degradosomes. RraA (and RraB) causes remodelling of degradosome composition, which is associated with alterations in RNA decay and global transcript abundance and as such is a bacterial mechanism for the regulation of RNA cleavage .; GO: 0008428 ribonuclease inhibitor activity, 0051252 regulation of RNA metabolic process. Probab=29.25 E-value=25 Score=15.85 Aligned_cols=33 Identities=24% Similarity=0.433 Sum_probs=20.8 Q ss_pred EEEECCCCCCCEE---EEEECCCCEEEEECCCCCHH Q ss_conf 9997087657169---99988998499983899999 Q gi|254780758|r 7 HIVQVTPFKQNCT---FLFDEESLEAVVVDPGGDLD 39 (216) Q Consensus 7 ~~i~v~~~~~N~~---li~~~~~~~~ilID~G~~~~ 39 (216) ++++|.+|..|-- |+....+++++|||-|+... T Consensus 35 ~v~TvKCFE~ngLi~ell~~~G~g~VL~iDGGGs~R 70 (161) T TIGR02998 35 KVVTVKCFEDNGLIAELLEQNGTGRVLLIDGGGSVR 70 (161) T ss_pred EEEEEEEECCHHHHHHHHHHCCCCEEEEEECCCCHH T ss_conf 689998606714789999725893389996895155 No 116 >smart00788 Adenylsucc_synt Adenylosuccinate synthetase. Adenylosuccinate synthetase plays an important role in purine biosynthesis, by catalyzing the GTP-dependent conversion of IMP and aspartic acid to AMP. Adenylosuccinate synthetase has been characterized from various sources ranging from Escherichia coli (gene purA) to vertebrate tissues. In vertebrates, two isozymes are present - one involved in purine biosynthesis and the other in the purine nucleotide cycle. The crystal structure of adenylosuccinate synthetase from E. coli reveals that the dominant structural element of each monomer of the homodimer is a central beta-sheet of 10 strands. The first nine strands of the sheet are mutually parallel with right-handed crossover connections between the strands. The 10th strand is antiparallel with respect to the first nine strands. In addition, the enzyme has two antiparallel beta-sheets, comprised of two strands and three strands each, 11 alpha-helices and two short 3/10-helices. Furt Probab=28.66 E-value=44 Score=14.48 Aligned_cols=45 Identities=20% Similarity=0.369 Sum_probs=27.9 Q ss_pred CCEEEEEECCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCE Q ss_conf 73599997087657169999889984999838999999999999749975 Q gi|254780758|r 3 KLSFHIVQVTPFKQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHV 52 (216) Q Consensus 3 ~~~~~~i~v~~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i 52 (216) +..+|++|.|-|..|+..++.. .++|||-.-.+. ++.|++.|+.+ T Consensus 47 k~~lhllPSGi~~~~~~~vIG~----GvVidp~~L~~E-i~~L~~~gi~~ 91 (421) T smart00788 47 KYKLHLLPSGILNPNKKCVIGN----GVVIDPEALFKE-IDQLEAKGVDV 91 (421) T ss_pred EEEEEECCCEECCCCCEEEECC----CEEECHHHHHHH-HHHHHHCCCCC T ss_conf 9999717835137996078779----689859999999-99999628998 No 117 >TIGR01369 CPSaseII_lrg carbamoyl-phosphate synthase, large subunit; InterPro: IPR006275 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains . This entry represents glutamine-dependent CPSase (6.3.5.5 from EC) from prokaryotes and eukaryotes (CPSase II). ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process. Probab=28.53 E-value=44 Score=14.47 Aligned_cols=48 Identities=17% Similarity=0.154 Sum_probs=26.1 Q ss_pred CCCCCCCEEEECCCCEEEEECCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 46666741896478399762012-3310122247210000012343211110000 Q gi|254780758|r 105 NARNASSDRWLQDGDTLLLGTHI-FKVFHCPGHSPGHVIYVNFENNFAHLGDTLF 158 (216) Q Consensus 105 ~~~~~~~~~~l~~gd~i~lg~~~-~~vi~tPGHt~g~~~~~~~~~~~lftGD~lf 158 (216) ...|+-.+++++++.+++++-.. -+-+..| .++=+++..+ +++||+.. T Consensus 761 ~~~PVlidky~e~A~E~dVD~v~Dg~~v~i~-----gI~EHiE~AG-VHSGDs~~ 809 (1089) T TIGR01369 761 PEHPVLIDKYLEDAVEIDVDAVSDGEEVLIP-----GIMEHIEEAG-VHSGDSTC 809 (1089) T ss_pred CCCCEEEEEECCCCEEEEEEEEEECCEEEEE-----EHHHHHHHCC-CCCCCHHH T ss_conf 8997698885288889999899728848996-----2312342136-76130531 No 118 >PRK11148 cyclic 3',5'-adenosine monophosphate phosphodiesterase; Provisional Probab=28.50 E-value=44 Score=14.46 Aligned_cols=50 Identities=14% Similarity=0.091 Sum_probs=29.1 Q ss_pred ECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCC------CHHHHHHHCCCEEEC Q ss_conf 838999999999999749975299986988521011------056788607955405 Q gi|254780758|r 32 VDPGGDLDIIKDVIQSRNFHVKQIWITHGHVDHVGG------AAQLKEDLSLTIIGP 82 (216) Q Consensus 32 ID~G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH~gg------~~~l~~~~~~~i~~~ 82 (216) +||-...+.+++.+++...+++.|++| +..=+-|. +..+.+.+++|+++. T Consensus 36 ~dt~~~l~~vl~~i~~~~~~~D~viiT-GDLs~dgs~esY~~l~~~L~~l~~P~~~l 91 (275) T PRK11148 36 VNTWESYQAVLEAIRAEQHEFDLIVAT-GDLAQDHSAEAYQHFAEGIAPLRKPCVWL 91 (275) T ss_pred CCHHHHHHHHHHHHHHCCCCCCEEEEC-CCCCCCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 698999999999998459998899976-40258999999999999997269998995 No 119 >PRK13787 adenylosuccinate synthetase; Provisional Probab=28.28 E-value=43 Score=14.55 Aligned_cols=45 Identities=27% Similarity=0.302 Sum_probs=27.4 Q ss_pred CCEEEEEECCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCE Q ss_conf 73599997087657169999889984999838999999999999749975 Q gi|254780758|r 3 KLSFHIVQVTPFKQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFHV 52 (216) Q Consensus 3 ~~~~~~i~v~~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~i 52 (216) +..+|++|.|.+..++.-++.. + ++|||..-.+. ++.|++.|+++ T Consensus 49 k~~lhllPSGil~~~~~~vIG~--G--vVvdp~~L~~E-i~~L~~~g~~~ 93 (423) T PRK13787 49 KYVFHLVPSGVIYDQTICVIGN--G--VVLDPLFFIEE-CDRLQKEGFPV 93 (423) T ss_pred EEEEECCCCEECCCCCEEEECC--C--EEECHHHHHHH-HHHHHHCCCCC T ss_conf 9999527717607996489899--8--99969999999-99986365056 No 120 >TIGR02439 catechol_proteo catechol 1,2-dioxygenase; InterPro: IPR012801 Members of this family known so far are catechol 1,2-dioxygenases of the proteobacteria. They are distinct from catechol 1,2-dioxygenases and chlorocatechol 1,2-dioxygenases of the actinobacteria, which are quite similar to each other and resolved by separate entries. This enzyme catalyses intradiol cleavage in which catechol + O2 becomes cis,cis-muconate. Catechol is an intermediate in the catabolism of many different aromatic compounds, as is the alternative intermediate protocatechuate. In Acinetobacter lwoffii, two isozymes are present with abilities, differing somewhat, to act on catechol analogues 3-methylcatechol, 4-methylcatechol, 4-methoxycatechol, and 4-chlorocatechol.; GO: 0005506 iron ion binding, 0018576 catechol 12-dioxygenase activity, 0019614 catechol catabolic process. Probab=27.90 E-value=45 Score=14.40 Aligned_cols=74 Identities=19% Similarity=0.192 Sum_probs=49.2 Q ss_pred EECCCCCCCCCCCCCCCCCCCCCCC--C------CCCCCCC-------HHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 6201233101222472100000123--4------3211110-------00001356634679899999999999985199 Q gi|254780758|r 123 LGTHIFKVFHCPGHSPGHVIYVNFE--N------NFAHLGD-------TLFRSSIGRTDILHGDYQQLINSINNKILPLG 187 (216) Q Consensus 123 lg~~~~~vi~tPGHt~g~~~~~~~~--~------~~lftGD-------~lf~~~~G~~d~~~~d~~~~~~sl~~~l~~l~ 187 (216) +.+-.|+++|+- |.|..|||.+. + +.+.|++ +|++-++| + -|.|-.+|+++.| -|=-..| T Consensus 144 i~gA~VE~WHAN--~~G~YS~FDksG~Qs~fNLRR~I~TD~~G~Y~ars~~P~GYG-c-pP~G~TQqlLn~L-GRHG~RP 218 (288) T TIGR02439 144 IAGAKVEVWHAN--SKGNYSHFDKSGEQSEFNLRRTIITDANGKYRARSIVPSGYG-C-PPQGPTQQLLNLL-GRHGNRP 218 (288) T ss_pred CCCCEEEEEECC--CCCCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEECCCCC-C-CCCCCHHHHHHHC-CCCCCCC T ss_conf 478766577517--888711227888879745653055179971799987001356-7-9687079998541-7888898 Q ss_pred CCEEE---EECCCCCCC Q ss_conf 97899---848879888 Q gi|254780758|r 188 DEISF---VCGHGPNST 201 (216) Q Consensus 188 ~~~~v---~PgHG~~tt 201 (216) .-+-. -|||-.-|| T Consensus 219 AHvHFFvSAPG~rkLTT 235 (288) T TIGR02439 219 AHVHFFVSAPGYRKLTT 235 (288) T ss_pred CCEEEEECCCCCCCCCC T ss_conf 60687565874331100 No 121 >TIGR00505 ribA GTP cyclohydrolase II; InterPro: IPR000926 GTP cyclohydrolase II catalyses the first committed step in the biosynthesis of riboflavin. The enzyme converts GTP and water to formate, 2,5-diamino-6-hydroxy-4-(5-phosphoribosylamino)- pyrimidine and pyrophosphate, and requires magnesium as a cofactor. It is sometimes found as a bifunctional enzyme with 3,4-dihydroxy-2-butanone 4-phosphate synthase (DHBP_synthase) IPR000422 from INTERPRO. ; GO: 0003935 GTP cyclohydrolase II activity, 0009231 riboflavin biosynthetic process. Probab=27.86 E-value=45 Score=14.40 Aligned_cols=18 Identities=22% Similarity=0.193 Sum_probs=9.6 Q ss_pred CCCCCHHHHHHHHHHCCC Q ss_conf 389999999999997499 Q gi|254780758|r 33 DPGGDLDIIKDVIQSRNF 50 (216) Q Consensus 33 D~G~~~~~i~~~l~~~~~ 50 (216) |||...+..++.|.+.|. T Consensus 76 DCG~qL~~AL~~I~~eG~ 93 (227) T TIGR00505 76 DCGEQLERALKQIAEEGR 93 (227) T ss_pred CHHHHHHHHHHHHHHCCC T ss_conf 716899999999875288 No 122 >TIGR01738 bioH putative pimeloyl-BioC--CoA transferase BioH; InterPro: IPR010076 This CoA-binding enzyme is required for the production of pimeloyl-coenzyme A, the substrate of the BioF protein early in the biosynthesis of biotin. Its exact function is unknown. This enzyme belongs to the alpha/beta hydrolase fold family (IPR000073 from INTERPRO). Members of this family are restricted to the proteobacteria.; GO: 0009102 biotin biosynthetic process. Probab=27.64 E-value=46 Score=14.37 Aligned_cols=29 Identities=14% Similarity=0.398 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHHHC--CCCCEEEE--ECCCCCCCH Q ss_conf 99999999999851--99978998--488798888 Q gi|254780758|r 172 YQQLINSINNKILP--LGDEISFV--CGHGPNSTI 202 (216) Q Consensus 172 ~~~~~~sl~~~l~~--l~~~~~v~--PgHG~~tt~ 202 (216) |..+.+.+ +++.+ -.. ..++ .+|=|.-|= T Consensus 204 P~~~~~~l-~~l~psG~se-~~~~~~aaHAPFlSh 236 (248) T TIGR01738 204 PAKVIEAL-DRLAPSGQSE-LLVFEKAAHAPFLSH 236 (248) T ss_pred CHHHHHHH-HHHCCCCCCE-EEEEECCCCCCCCCC T ss_conf 17899999-9746789831-212406688860100 No 123 >TIGR01394 TypA_BipA GTP-binding protein TypA; InterPro: IPR006298 This bacterial (and Arabidopsis) protein, termed TypA or BipA, is a GTP-binding protein. It is phosphorylated on a tyrosine residue under some cellular conditions. Mutants show altered regulation of some pathways, but the precise function is unknown.; GO: 0005525 GTP binding, 0005622 intracellular. Probab=27.34 E-value=30 Score=15.40 Aligned_cols=36 Identities=14% Similarity=0.182 Sum_probs=16.9 Q ss_pred EEEECCC--CCHH---HHHHHHHHCCCCEEEEEE----CCCCCCCC Q ss_conf 9998389--9999---999999974997529998----69885210 Q gi|254780758|r 29 AVVVDPG--GDLD---IIKDVIQSRNFHVKQIWI----THGHVDHV 65 (216) Q Consensus 29 ~ilID~G--~~~~---~i~~~l~~~~~~i~~Ii~----TH~H~DH~ 65 (216) .+|||+. +=++ .+.++| +.|++|-=|++ =++-++-+ T Consensus 98 lLlVDA~EGPMPQTrFVL~KAL-~~gLkPIVViNKiDrp~ARP~eV 142 (609) T TIGR01394 98 LLLVDASEGPMPQTRFVLKKAL-ELGLKPIVVINKIDRPSARPDEV 142 (609) T ss_pred EEEEECCCCCCCCCHHHHHHHH-HCCCCEEEEECCCCCCCCCHHHH T ss_conf 9998578889885347899999-56893699971347887883788 No 124 >PRK00143 trmU tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase; Reviewed Probab=27.19 E-value=47 Score=14.33 Aligned_cols=101 Identities=10% Similarity=0.049 Sum_probs=55.5 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEECCCCCCC----------CCCCHHHHHHHCCCEEECCHHHHHHHCCHHHHHHHHHCC- Q ss_conf 999999999999749975299986988521----------011056788607955405843898840123444432002- Q gi|254780758|r 35 GGDLDIIKDVIQSRNFHVKQIWITHGHVDH----------VGGAAQLKEDLSLTIIGPHKDDAAMMGKVDEQARLLSIR- 103 (216) Q Consensus 35 G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH----------~gg~~~l~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~- 103 (216) |-|+......|++.|..+..|.+--+..+. +.-+..+.+..+.+.+...-.+.++..-.+.+...+... T Consensus 14 GVDSsVaA~LL~~~Gy~V~Gv~m~~w~~~~~~~~C~~~~d~~dA~~va~~LgIp~~v~d~~~~f~~~Vi~~f~~~Y~~G~ 93 (355) T PRK00143 14 GVDSSVAAALLKEQGYDVIGLFMKLWDDDDGSGGCCAEEDIADARAVADKLGIPLYVVDFAKEFWDNVFDYFLDEYKAGR 93 (355) T ss_pred CHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECHHHHHHHHHHHHHHHHHHCCC T ss_conf 89999999999977995899999887688777899857889999999998698579968699876788999999997699 Q ss_pred CCCCCCCCEEEECCCCEEEE-ECCCCCCCCCCCC Q ss_conf 34666674189647839976-2012331012224 Q gi|254780758|r 104 MNARNASSDRWLQDGDTLLL-GTHIFKVFHCPGH 136 (216) Q Consensus 104 ~~~~~~~~~~~l~~gd~i~l-g~~~~~vi~tPGH 136 (216) .+.+.+..++.++-+.-++. -......+.| || T Consensus 94 TPNPcv~CN~~IKFg~l~~~A~~lgad~iAT-GH 126 (355) T PRK00143 94 TPNPCVLCNKEIKFKAFLDYALELGADYIAT-GH 126 (355) T ss_pred CCCCCHHCCCCCCHHHHHHHHHHCCCCEECC-CC T ss_conf 9997334087233999999998739984233-52 No 125 >TIGR01935 NOT-MenG RraA family; InterPro: IPR010203 This entry includes a number of closely related sequences bacteria and plants. The Escherichia coli member of this family has been characterised as a regulator of RNase E (see IPR004659 from INTERPRO), and its crystal structure has been analysed . E. coli RraA acts as a trans-acting modulator of RNA turnover, binding essential endonuclease RNase E and inhibiting RNA processing . RNase E forms the core of a large RNA-catalysis machine termed the degradosomes. RraA (and RraB) causes remodelling of degradosome composition, which is associated with alterations in RNA decay and global transcript abundance and as such is a bacterial mechanism for the regulation of RNA cleavage .; GO: 0008428 ribonuclease inhibitor activity, 0051252 regulation of RNA metabolic process. Probab=27.18 E-value=24 Score=15.97 Aligned_cols=44 Identities=27% Similarity=0.422 Sum_probs=26.0 Q ss_pred EEECCCCCCCEE---EEEECCCCEEEEECCCCCHH------HHHHHHHHCCCC Q ss_conf 997087657169---99988998499983899999------999999974997 Q gi|254780758|r 8 IVQVTPFKQNCT---FLFDEESLEAVVVDPGGDLD------IIKDVIQSRNFH 51 (216) Q Consensus 8 ~i~v~~~~~N~~---li~~~~~~~~ilID~G~~~~------~i~~~l~~~~~~ 51 (216) +++|..|..|+= ++..+..+++++||-|+... .+-+...++|+. T Consensus 37 ivTvkcFEDNslvr~~L~qpG~GrVLVVDGgGSlr~ALlGd~lA~~A~~NGWe 89 (155) T TIGR01935 37 IVTVKCFEDNSLVREVLEQPGAGRVLVVDGGGSLRRALLGDNLAELAEENGWE 89 (155) T ss_pred CEEEEEEECCHHHHHHHHCCCCCCEEEEECCCHHHHHCCCCHHHHHHHHCCCC T ss_conf 03899970473688998259997279995885010210167567886505951 No 126 >cd04168 TetM_like Tet(M)-like subfamily. Tet(M), Tet(O), Tet(W), and OtrA are tetracycline resistance genes found in Gram-positive and Gram-negative bacteria. Tetracyclines inhibit protein synthesis by preventing aminoacyl-tRNA from binding to the ribosomal acceptor site. This subfamily contains tetracycline resistance proteins that function through ribosomal protection and are typically found on mobile genetic elements, such as transposons or plasmids, and are often conjugative. Ribosomal protection proteins are homologous to the elongation factors EF-Tu and EF-G. EF-G and Tet(M) compete for binding on the ribosomes. Tet(M) has a higher affinity than EF-G, suggesting these two proteins may have overlapping binding sites and that Tet(M) must be released before EF-G can bind. Tet(M) and Tet(O) have been shown to have ribosome-dependent GTPase activity. These proteins are part of the GTP translation factor family, which includes EF-G, EF-Tu, EF2, LepA, and SelB. Probab=26.60 E-value=48 Score=14.26 Aligned_cols=10 Identities=20% Similarity=0.169 Sum_probs=3.9 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254780758|r 172 YQQLINSINN 181 (216) Q Consensus 172 ~~~~~~sl~~ 181 (216) ..++.+.|++ T Consensus 193 ~eel~~~L~~ 202 (237) T cd04168 193 ELELDNELSA 202 (237) T ss_pred HHHHHHHHHH T ss_conf 8899999999 No 127 >COG0075 Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase [Amino acid transport and metabolism] Probab=25.58 E-value=38 Score=14.82 Aligned_cols=34 Identities=18% Similarity=0.354 Sum_probs=18.5 Q ss_pred EEEEECCCC----CHHHHHHHHHHCCCCEEEEEECCCCC Q ss_conf 499983899----99999999997499752999869885 Q gi|254780758|r 28 EAVVVDPGG----DLDIIKDVIQSRNFHVKQIWITHGHV 62 (216) Q Consensus 28 ~~ilID~G~----~~~~i~~~l~~~~~~i~~Ii~TH~H~ 62 (216) ++.++++.| +++.+.+.|++ +-+++.|.+||..- T Consensus 105 ~v~~~~~~wg~~v~p~~v~~~L~~-~~~~~~V~~vH~ET 142 (383) T COG0075 105 EVVVLEVEWGEAVDPEEVEEALDK-DPDIKAVAVVHNET 142 (383) T ss_pred CEEEEECCCCCCCCHHHHHHHHHC-CCCCCEEEEEECCC T ss_conf 669984788887999999999852-89950899981157 No 128 >TIGR01452 PGP_euk phosphoglycolate/pyridoxal phosphate phosphatase family; InterPro: IPR006349 This family of sequences represent 2-phosphoglycolate phosphatase which is limited to the eukaryotic lineage. PGP is an essential enzyme in the glycolate salvage pathway in higher organisms (photorespiration in plants). Phosphoglycolate results from the oxidase activity of RubisCO in the Calvin cycle when concentrations of carbon dioxide are low relative to oxygen. In mammals, PGP is found in many tissues, notably in red blood cells where P-glycolate is and important activator of the hydrolysis of 2,3-bisphosphoglycerate, a major modifier of the oxygen affinity of hemoglobin. The PGPase enzyme described here is a member of the Haloacid dehalogenase superfamily of hydrolase enzymes. Unlike the bacterial PGP equivalog (IPR006346 from INTERPRO), which is a member of class (subfamily) I, this enzyme is a member of class (subfamily) II . These two enzymes have almost certainly arisen from convergent evolution (although these two ancestors may themselves have diverged from a more distant HAD superfamily progenitor). One of the sequences in the group comes from Chlamydomonas reinhardtii, a photosynthetic alga. The enzyme has been purified and characterised and these data are fully consistent with the assignment of function as a PGPase involved in photorespiration . Another, from Homo sapiens, is supported as a member of the family indirectly. Biochemical characterization of partially purified PGP's from various tissues including red blood cells have been performed while one gene for PGP has been localized to chromosome 16p13.3 . The Homo sapiens sequence maps to chromosome 22. There is indeed a related gene on chromosome 16 (and it is expressed, since EST's are found) which shows 46 0dentity and 59603b08ositives by BLAST2 (E=1e-66). Presumably, these two genes are isoforms and have the same catalytic function while being expressed in different tissues and may be differently regulated. The sequence from Caenorhabditis elegans, is only supported by sequence similarity. This family is closely related to a family of bacterial sequences including the Escherichia coli NagD and Bacillus subtilus AraL genes which are characterised by the ability to hydrolyze para-nitrophenylphosphate (pNPPases or NPPases). The Chlamydomonas reinhardtii PGPase does not catalyze this reaction and so presumably these two groups have different functions and substrate specificities. Many of the genes in this family have been annotated as pNPPases due to this association.; GO: 0016791 phosphoric monoester hydrolase activity, 0008152 metabolic process. Probab=25.31 E-value=51 Score=14.12 Aligned_cols=19 Identities=16% Similarity=0.247 Sum_probs=14.9 Q ss_pred HHHHHHHHHHCCCCEEEEEEC Q ss_conf 999999999749975299986 Q gi|254780758|r 38 LDIIKDVIQSRNFHVKQIWIT 58 (216) Q Consensus 38 ~~~i~~~l~~~~~~i~~Ii~T 58 (216) .+++++.|.+.|+++ +++| T Consensus 23 ~Pelld~L~~~gK~~--~fvt 41 (288) T TIGR01452 23 APELLDRLAKAGKKV--LFVT 41 (288) T ss_pred CHHHHHHHHHCCCEE--EEEE T ss_conf 727899997459879--9981 No 129 >COG0856 Orotate phosphoribosyltransferase homologs [Nucleotide transport and metabolism] Probab=24.99 E-value=51 Score=14.09 Aligned_cols=10 Identities=10% Similarity=0.255 Sum_probs=4.2 Q ss_pred CCCCCCCCHH Q ss_conf 4321111000 Q gi|254780758|r 148 NNFAHLGDTL 157 (216) Q Consensus 148 ~~~lftGD~l 157 (216) .++++.-|.+ T Consensus 142 K~cvIVDDvi 151 (203) T COG0856 142 KRCVIVDDVI 151 (203) T ss_pred CEEEEEECCC T ss_conf 5489983222 No 130 >COG1107 Archaea-specific RecJ-like exonuclease, contains DnaJ-type Zn finger domain [DNA replication, recombination, and repair] Probab=24.83 E-value=52 Score=14.07 Aligned_cols=10 Identities=20% Similarity=0.311 Sum_probs=4.6 Q ss_pred EEECCCCCCC Q ss_conf 9986988521 Q gi|254780758|r 55 IWITHGHVDH 64 (216) Q Consensus 55 Ii~TH~H~DH 64 (216) |++-|+|+|- T Consensus 447 vVVDHH~Pde 456 (715) T COG1107 447 VVVDHHYPDE 456 (715) T ss_pred EEECCCCCCC T ss_conf 9971788960 No 131 >PRK00090 bioD dithiobiotin synthetase; Reviewed Probab=24.33 E-value=53 Score=14.01 Aligned_cols=22 Identities=5% Similarity=0.143 Sum_probs=13.4 Q ss_pred HHHHHHHHHCCCCEEEE--EECCC Q ss_conf 99999999749975299--98698 Q gi|254780758|r 39 DIIKDVIQSRNFHVKQI--WITHG 60 (216) Q Consensus 39 ~~i~~~l~~~~~~i~~I--i~TH~ 60 (216) -.+.+.++++|.++.+. +.|.. T Consensus 18 ~~L~~~l~~~G~~v~~~KPv~tG~ 41 (223) T PRK00090 18 AALAQALREQGYRVAGYKPVQSGC 41 (223) T ss_pred HHHHHHHHHCCCCEEEECEEEECC T ss_conf 999999997899489975120489 No 132 >pfam03054 tRNA_Me_trans tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs. Probab=23.23 E-value=56 Score=13.89 Aligned_cols=101 Identities=11% Similarity=0.105 Sum_probs=53.9 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEECCCCCCCC-----------CCCHHHHHHHCCCEEECCHHHHHHHCCHHHHHHHHHCC Q ss_conf 9999999999997499752999869885210-----------11056788607955405843898840123444432002 Q gi|254780758|r 35 GGDLDIIKDVIQSRNFHVKQIWITHGHVDHV-----------GGAAQLKEDLSLTIIGPHKDDAAMMGKVDEQARLLSIR 103 (216) Q Consensus 35 G~~~~~i~~~l~~~~~~i~~Ii~TH~H~DH~-----------gg~~~l~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 103 (216) |-|+......|++.|..+..+.+-....+-. --+..+.++.+.+.+...-.+.++..-.+.....+... T Consensus 10 GVDSsVaA~LL~~~Gy~V~Gv~m~~w~~~~~~~~~C~~~~d~~dA~~va~~LgIp~~v~d~~~~f~~~Vi~~fv~~Y~~G 89 (354) T pfam03054 10 GVDSSVAAYLLKEQGYEVIGVFMKNWDEEDEFGHGCCSEEDLADAQRVCEQLGIPLYVVNFEKEYWEKVFEPFLDEYKNG 89 (354) T ss_pred CHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECHHHHHHHHHHHHHHHHHHCC T ss_conf 89999999999977996399999956688766788998899999999999869978997879987589999999999769 Q ss_pred -CCCCCCCCEEEECCCCEEEEE--CCCCCCCCCCCC Q ss_conf -346666741896478399762--012331012224 Q gi|254780758|r 104 -MNARNASSDRWLQDGDTLLLG--THIFKVFHCPGH 136 (216) Q Consensus 104 -~~~~~~~~~~~l~~gd~i~lg--~~~~~vi~tPGH 136 (216) .+.+.+..++.++-+.-++.. ......+.| || T Consensus 90 ~TPNPcv~CN~~IKFg~l~~~A~~~lGad~iAT-GH 124 (354) T pfam03054 90 RTPNPDILCNREIKFGALLDYAKQVLGADYLAT-GH 124 (354) T ss_pred CCCCCCHHHCCCCCHHHHHHHHHHHCCCCCCCC-CC T ss_conf 999841441896438999999998669981044-74 No 133 >COG0311 PDX2 Predicted glutamine amidotransferase involved in pyridoxine biosynthesis [Coenzyme metabolism] Probab=23.18 E-value=35 Score=15.07 Aligned_cols=20 Identities=20% Similarity=0.517 Sum_probs=11.4 Q ss_pred EECCCCCHHHHHHHHHHCCC Q ss_conf 98389999999999997499 Q gi|254780758|r 31 VVDPGGDLDIIKDVIQSRNF 50 (216) Q Consensus 31 lID~G~~~~~i~~~l~~~~~ 50 (216) ||=||+++..+-..+++.|+ T Consensus 42 LIiPGGESTTi~rL~~~~gl 61 (194) T COG0311 42 LIIPGGESTTIGRLLKRYGL 61 (194) T ss_pred EEECCCCHHHHHHHHHHCCC T ss_conf 89549507899999987384 No 134 >TIGR02191 RNaseIII ribonuclease III; InterPro: IPR011907 This family consists almost exclusively of bacterial examples of ribonuclease III (RNase III). This ubiquitous enzyme specifically cleaves double-stranded rRNA and is found in all bacteria and eukaryotes . In bacteria its main role is the processing of pre-rRNAs, where the large precursor ribosomal RNA molecules are at cleaved at specific sites to produce the immediate precursors of the functional molecules. RNase III also functions in the maturation and degradation of mRNAs, and the maturaton of tRNAs. In some organisms (eg. E. coli) cells are viable without this enzyme, though they are impeded in growth, but in others (eg. B. subtilis and M. genitalium) this enzyme is essential. The bacterial RNase III enzymes so far characterised are homodimers with a molecular mass of ~50 kDa , . The endonuclease domain is located within the N-terminal two-thirds of the protein, containing several alpha helices, but no beta strands. The double-stranded RNA binding domain is found at the C-terminal third of the protein, forming the alpha-beta(3)-alpha fold common to dsRNA-binding proteins. A signature box of 11 conserved amino acids found in the N-terminal region of RNase III may contain the active site, though this has not been proven.; GO: 0003723 RNA binding, 0004525 ribonuclease III activity, 0016075 rRNA catabolic process. Probab=22.83 E-value=34 Score=15.11 Aligned_cols=12 Identities=42% Similarity=0.770 Sum_probs=5.1 Q ss_pred CCCCCCCHHHHH Q ss_conf 887988889998 Q gi|254780758|r 195 GHGPNSTIGRER 206 (216) Q Consensus 195 gHG~~tt~~~e~ 206 (216) |+|..-|-|.+| T Consensus 98 G~GE~~~gGr~r 109 (228) T TIGR02191 98 GKGEEKSGGRRR 109 (228) T ss_pred CHHHHHCCCCCC T ss_conf 502543265442 No 135 >TIGR02855 spore_yabG sporulation peptidase YabG; InterPro: IPR008764 Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This is a group of peptidases belong to MEROPS peptidase family U57 (clan U-). The type example is the YabG protein of Bacillus subtilis. This is a protease involved in the synthesis and maturation of the spore coat proteins SpoIVA and YrbA of Bacillus subtilis .. Probab=22.81 E-value=57 Score=13.84 Aligned_cols=27 Identities=15% Similarity=0.302 Sum_probs=18.2 Q ss_pred HHHHHHHHHHHHCCCCCEEEEECCCCC Q ss_conf 999999999985199978998488798 Q gi|254780758|r 173 QQLINSINNKILPLGDEISFVCGHGPN 199 (216) Q Consensus 173 ~~~~~sl~~~l~~l~~~~~v~PgHG~~ 199 (216) .+|=+.+.+.+....||.+|+.||+.- T Consensus 148 ~emPe~v~~L~~~~~PDIlViTGHDa~ 174 (292) T TIGR02855 148 KEMPEKVLDLIEEVRPDILVITGHDAY 174 (292) T ss_pred CCCCHHHHHHHHHHCCCEEEEECCCCE T ss_conf 218088999997309978999466630 No 136 >PRK13783 adenylosuccinate synthetase; Provisional Probab=22.47 E-value=58 Score=13.80 Aligned_cols=44 Identities=18% Similarity=0.257 Sum_probs=26.9 Q ss_pred CCEEEEEECCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHHHHCCCC Q ss_conf 7359999708765716999988998499983899999999999974997 Q gi|254780758|r 3 KLSFHIVQVTPFKQNCTFLFDEESLEAVVVDPGGDLDIIKDVIQSRNFH 51 (216) Q Consensus 3 ~~~~~~i~v~~~~~N~~li~~~~~~~~ilID~G~~~~~i~~~l~~~~~~ 51 (216) +..+|++|.|-+..++..++.. + ++|||-.-.+.+ +.|++.+.. T Consensus 48 k~~lhliPSGi~~~~~~~vIG~--G--vVidp~~L~~Ei-~~L~~~g~~ 91 (404) T PRK13783 48 KVVHHLLPSIDFKKNSGAFIGN--G--VVLDLEVLVEEL-EELKAKFPD 91 (404) T ss_pred EEEEEECCCCCCCCCCEEEECC--C--EEECHHHHHHHH-HHHHHHCCC T ss_conf 9999607855358995079889--7--899699999999-999973899 No 137 >cd01449 TST_Repeat_2 Thiosulfate sulfurtransferase (TST), C-terminal, catalytic domain. TST contains 2 copies of the Rhodanese Homology Domain; this is the second repeat. Only the second repeat contains the catalytically active Cys residue. Probab=22.36 E-value=58 Score=13.78 Aligned_cols=55 Identities=20% Similarity=0.230 Sum_probs=24.7 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCC Q ss_conf 222472100000123432111100000135663467989999999999998519997899848879888 Q gi|254780758|r 133 CPGHSPGHVIYVNFENNFAHLGDTLFRSSIGRTDILHGDYQQLINSINNKILPLGDEISFVCGHGPNST 201 (216) Q Consensus 133 tPGHt~g~~~~~~~~~~~lftGD~lf~~~~G~~d~~~~d~~~~~~sl~~~l~~l~~~~~v~PgHG~~tt 201 (216) -+||-||++.+-..+ ++..|..|+ +..++.+-+.+.-......+++|.+-|-.++ T Consensus 38 r~GhIpgAinip~~~---~~~~~~~~~-----------~~~~l~~~~~~~gi~~~~~iV~yC~sG~rA~ 92 (118) T cd01449 38 RSGHIPGAVNIPWTS---LLDEDGTFK-----------SPEELRALFAALGITPDKPVIVYCGSGVTAC 92 (118) T ss_pred CCCCCCCCEECCCCC---CCCCCCCCC-----------CHHHHHHHHHHHCCCCCCCEEEECCCCHHHH T ss_conf 188989953547425---313577658-----------9999999987608998886777479988999 No 138 >COG4822 CbiK Cobalamin biosynthesis protein CbiK, Co2+ chelatase [Coenzyme metabolism] Probab=22.12 E-value=59 Score=13.75 Aligned_cols=10 Identities=0% Similarity=0.315 Sum_probs=3.7 Q ss_pred HHHHHHHCCC Q ss_conf 9999997499 Q gi|254780758|r 41 IKDVIQSRNF 50 (216) Q Consensus 41 i~~~l~~~~~ 50 (216) .++.+++.|. T Consensus 65 aL~klk~~gy 74 (265) T COG4822 65 ALNKLKDQGY 74 (265) T ss_pred HHHHHHHCCC T ss_conf 9999997151 No 139 >COG2241 CobL Precorrin-6B methylase 1 [Coenzyme metabolism] Probab=21.92 E-value=59 Score=13.73 Aligned_cols=46 Identities=15% Similarity=0.229 Sum_probs=24.6 Q ss_pred EEECCCCCHHHHHHHHHHC-CCCEEEEEECCCCCCCCCCCHHHHHHHCC Q ss_conf 9983899999999999974-99752999869885210110567886079 Q gi|254780758|r 30 VVVDPGGDLDIIKDVIQSR-NFHVKQIWITHGHVDHVGGAAQLKEDLSL 77 (216) Q Consensus 30 ilID~G~~~~~i~~~l~~~-~~~i~~Ii~TH~H~DH~gg~~~l~~~~~~ 77 (216) .......+.+.+.+.+++. +.++ +++.++.+=..|--..++..+.+ T Consensus 48 ~~~~~~~~~~~l~~i~~~~~g~~v--~VLasGDP~f~G~g~~l~~~~~~ 94 (210) T COG2241 48 IIWPYPFDAESLEEILAERKGRDV--VVLASGDPLFSGVGRLLRRKFSC 94 (210) T ss_pred EEECCCCCHHHHHHHHHHHCCCCE--EEEECCCCCHHHHHHHHHHHCCC T ss_conf 982465416899999987379976--99963896035368999986476 No 140 >pfam08759 DUF1792 Domain of unknown function (DUF1792). This putative domain is probably missannotated as a glycosyl transferase 8 family member. This domain is found at the C-terminus of proteins that also contain the glycosyl transferase domain at the N-terminus. Probab=21.73 E-value=60 Score=13.71 Aligned_cols=53 Identities=19% Similarity=0.241 Sum_probs=33.6 Q ss_pred CCCCCHHHHC--CCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHH Q ss_conf 1111000001--3566346798999999999999851999789984887988889 Q gi|254780758|r 151 AHLGDTLFRS--SIGRTDILHGDYQQLINSINNKILPLGDEISFVCGHGPNSTIG 203 (216) Q Consensus 151 lftGD~lf~~--~~G~~d~~~~d~~~~~~sl~~~l~~l~~~~~v~PgHG~~tt~~ 203 (216) +-.|--||.+ |+-|.--|..|+-+-++.|.+.+.....+.+|+=.=||+.||- T Consensus 128 ~GvGNdLF~naksI~rIicPsknAy~~yd~I~e~i~~~~k~~LiLlaLGPTAkVL 182 (225) T pfam08759 128 SGVGNDLFDNAKSIKRIICPSKNAYSKYDEIEEAIRKNAKDRLILLALGPTAKVL 182 (225) T ss_pred CCCCHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHH T ss_conf 4677155636640788957877479999999999998467879998608613654 No 141 >pfam02542 YgbB YgbB family. The ygbB protein is a putative enzyme of deoxy-xylulose pathway (terpenoid biosynthesis). Probab=21.30 E-value=54 Score=13.94 Aligned_cols=31 Identities=29% Similarity=0.383 Sum_probs=22.2 Q ss_pred ECCCCEEEEECCCCCCC-CCCCCCCCCCCCCC Q ss_conf 64783997620123310-12224721000001 Q gi|254780758|r 115 LQDGDTLLLGTHIFKVF-HCPGHSPGHVIYVN 145 (216) Q Consensus 115 l~~gd~i~lg~~~~~vi-~tPGHt~g~~~~~~ 145 (216) +.++..+.+||..++-- ..-|||.|++.+.- T Consensus 12 ~~~~~~l~LgGv~i~~~~~l~gHSDGDv~~HA 43 (157) T pfam02542 12 FVEGRPLILGGVEIPHEKGLLGHSDGDVLLHA 43 (157) T ss_pred ECCCCCEEEEEEEECCCCCCCCCCHHHHHHHH T ss_conf 65999789976970798886626728999999 No 142 >TIGR02017 hutG_amidohyd N-formylglutamate amidohydrolase; InterPro: IPR010247 In some species, histidine is converted to via urocanate and then formimino-L-glutamate to glutamate in four steps, where the fourth step is conversion of N-formimino-L-glutamate to L-glutamate and formamide. In others, that pathway from formimino-L-glutamate may differ, with the next enzyme being formiminoglutamate hydrolase (HutF) yielding N-formyl-L-glutamate. This entry represents the enzyme N-formylglutamate deformylase, also called N-formylglutamate amidohydrolase, which then produces glutamate.. Probab=20.54 E-value=63 Score=13.56 Aligned_cols=53 Identities=19% Similarity=0.256 Sum_probs=26.2 Q ss_pred CC-CCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCC-------------HHHHHHHHHHHHHCCCCCEEEEECCC Q ss_conf 22-472100000123432111100000135663467989-------------99999999999851999789984887 Q gi|254780758|r 134 PG-HSPGHVIYVNFENNFAHLGDTLFRSSIGRTDILHGD-------------YQQLINSINNKILPLGDEISFVCGHG 197 (216) Q Consensus 134 PG-Ht~g~~~~~~~~~~~lftGD~lf~~~~G~~d~~~~d-------------~~~~~~sl~~~l~~l~~~~~v~PgHG 197 (216) || +|-+ ++|.. .|=|+.||+.|-+ |.++ ++++.+-| +||+++-...++|=+|- T Consensus 82 pGQ~TT~----L~P~T--~FDG~ply~dG~~----P~~~ei~~r~~~~f~PYH~aL~~Ei-~RLra~hg~~vLyDaHS 148 (269) T TIGR02017 82 PGQATTG----LCPET--TFDGEPLYRDGEA----PSPAEIDSRLTQYFRPYHAALRAEI-ERLRAQHGYAVLYDAHS 148 (269) T ss_pred CCCCCCC----CCCCC--CCCCCHHHHCCCC----CCHHHHHHHHHHHCCCHHHHHHHHH-HHHHHHCCCEEEECCCC T ss_conf 9766665----67675--6777667644789----7988999998862422689999999-99884358079871642 No 143 >PRK12312 infB translation initiation factor IF-2; Provisional Probab=20.50 E-value=44 Score=14.48 Aligned_cols=29 Identities=17% Similarity=0.316 Sum_probs=15.3 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHCCCCEEEE Q ss_conf 9984999838999999999999749975299 Q gi|254780758|r 25 ESLEAVVVDPGGDLDIIKDVIQSRNFHVKQI 55 (216) Q Consensus 25 ~~~~~ilID~G~~~~~i~~~l~~~~~~i~~I 55 (216) +++....||+++. + ....++++|..++.| T Consensus 162 ~~~~itFiDTPGH-e-AFt~mR~RGa~vtDI 190 (610) T PRK12312 162 QGKKITFIDTPGH-E-AFTEMRARGAKVTDI 190 (610) T ss_pred CCCEEEEECCCCH-H-HHHHHHHCCCCCCCE T ss_conf 7976899728967-9-899999707765457 No 144 >TIGR03597 GTPase_YqeH ribosome biogenesis GTPase YqeH. This family describes YqeH, a member of a larger family of GTPases involved in ribosome biogenesis. Like YqlF, it shows a cyclical permutation relative to GTPases EngA (in which the GTPase domain is duplicated), Era, and others. Members of this protein family are found in a relatively small number of bacterial species, including Bacillus subtilis but not Escherichia coli. Probab=20.41 E-value=64 Score=13.54 Aligned_cols=32 Identities=13% Similarity=0.213 Sum_probs=20.6 Q ss_pred HHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHH Q ss_conf 9999997499752999869885210110567886 Q gi|254780758|r 41 IKDVIQSRNFHVKQIWITHGHVDHVGGAAQLKED 74 (216) Q Consensus 41 i~~~l~~~~~~i~~Ii~TH~H~DH~gg~~~l~~~ 74 (216) +.+.+++.|+++..|+++-++-. +|...|.+. T Consensus 116 v~~~~~~~gl~~~~V~lvSa~~g--~gi~~l~~~ 147 (360) T TIGR03597 116 MKKRAKELGLKPVDIILVSAKKG--NGIDELLDK 147 (360) T ss_pred HHHHHHHCCCCCCCEEEEECCCC--CCHHHHHHH T ss_conf 99999985998366899968889--899999999 No 145 >cd06315 PBP1_ABC_sugar_binding_like_6 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=20.08 E-value=65 Score=13.50 Aligned_cols=58 Identities=9% Similarity=0.107 Sum_probs=39.1 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHC-CCCEEEEEECCCCCCCCCCCHHHHHHHCCCEEEC Q ss_conf 998499983899999999999974-9975299986988521011056788607955405 Q gi|254780758|r 25 ESLEAVVVDPGGDLDIIKDVIQSR-NFHVKQIWITHGHVDHVGGAAQLKEDLSLTIIGP 82 (216) Q Consensus 25 ~~~~~ilID~G~~~~~i~~~l~~~-~~~i~~Ii~TH~H~DH~gg~~~l~~~~~~~i~~~ 82 (216) -|++..++|+..++....+.+++. ..+++.|++.-...+-+...-...++-+.|++.- T Consensus 29 lGw~v~v~D~~g~~~~~~~~i~~ai~~k~D~Iii~~~D~~~~~~~l~~A~~agIPvv~~ 87 (280) T cd06315 29 IGWNLRILDGRGSEAGQAAALNQAIALKPDGIVLGGVDAAELQAELELAQKAGIPVVGW 87 (280) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 59879998899999999999999996399999999829788789999999879978962 Done!