Query gi|254780214|ref|YP_003064627.1| metal-dependent hydrolase protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 271 No_of_seqs 173 out of 2320 Neff 8.4 Searched_HMMs 39220 Date Mon May 23 19:35:00 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780214.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK02113 putative hydrolase; P 100.0 0 0 501.8 21.4 251 5-268 1-252 (252) 2 TIGR03307 PhnP phosphonate met 100.0 0 0 428.3 19.8 235 5-266 1-249 (249) 3 PRK11244 phnP carbon-phosphoru 100.0 0 0 424.1 20.4 236 5-268 1-250 (250) 4 PRK05184 pyrroloquinoline quin 100.0 0 0 365.3 16.9 249 5-268 1-302 (302) 5 PRK00055 ribonuclease Z; Revie 100.0 0 0 338.4 18.4 236 4-270 1-257 (259) 6 TIGR02651 RNase_Z ribonuclease 100.0 0 0 333.7 15.4 238 6-268 1-327 (327) 7 COG1234 ElaC Metal-dependent h 100.0 5.8E-40 1.5E-44 268.0 18.7 244 4-269 1-292 (292) 8 COG1235 PhnP Metal-dependent h 100.0 4.5E-40 1.1E-44 268.7 16.9 257 2-270 1-265 (269) 9 TIGR02108 PQQ_syn_pqqB coenzym 100.0 5.6E-40 1.4E-44 268.1 12.2 251 6-268 1-314 (314) 10 PRK02126 ribonuclease Z; Provi 100.0 3.6E-33 9.1E-38 225.8 13.0 206 40-253 17-328 (334) 11 KOG2121 consensus 100.0 7.5E-34 1.9E-38 230.0 2.5 240 3-268 441-715 (746) 12 TIGR02649 true_RNase_BN ribonu 99.9 1.3E-27 3.4E-32 191.2 10.2 231 6-267 1-303 (304) 13 COG0595 mRNA degradation ribon 99.7 1.1E-15 2.7E-20 117.2 12.2 168 3-212 7-203 (555) 14 TIGR03675 arCOG00543 arCOG0178 99.7 1.6E-16 4E-21 122.4 7.6 178 2-210 172-374 (630) 15 PRK00685 metal-dependent hydro 99.7 9E-15 2.3E-19 111.4 14.7 200 5-268 1-228 (228) 16 TIGR02650 RNase_Z_T_toga ribon 99.6 3.1E-15 7.8E-20 114.3 6.2 209 40-261 10-270 (277) 17 COG1236 YSH1 Predicted exonucl 99.5 1.2E-13 3.1E-18 104.4 8.4 187 5-225 1-203 (427) 18 pfam02112 PDEase_II cAMP phosp 99.5 9.6E-13 2.5E-17 98.8 13.0 215 5-243 1-289 (323) 19 COG1237 Metal-dependent hydrol 99.5 2E-12 5.2E-17 96.8 14.3 215 37-268 20-259 (259) 20 COG1782 Predicted metal-depend 99.4 1.2E-12 3E-17 98.3 8.2 189 3-225 179-395 (637) 21 PRK04286 hypothetical protein; 99.4 2.7E-12 6.8E-17 96.1 7.6 224 5-263 1-275 (294) 22 smart00849 Lactamase_B Metallo 99.3 1.5E-11 3.8E-16 91.4 8.9 135 37-188 4-144 (183) 23 COG2220 Predicted Zn-dependent 99.1 4.4E-09 1.1E-13 76.1 13.5 187 1-241 3-213 (258) 24 pfam00753 Lactamase_B Metallo- 99.1 4E-10 1E-14 82.5 8.1 55 37-96 4-63 (148) 25 TIGR00649 MG423 conserved hypo 99.1 3.4E-10 8.6E-15 83.0 6.0 125 73-212 69-212 (593) 26 KOG1136 consensus 99.0 1.3E-09 3.3E-14 79.3 6.0 173 4-209 3-205 (501) 27 COG2248 Predicted hydrolase (m 98.9 4.6E-09 1.2E-13 75.9 6.5 212 5-253 1-261 (304) 28 TIGR03413 GSH_gloB hydroxyacyl 98.8 8.9E-09 2.3E-13 74.2 6.7 105 53-187 20-127 (248) 29 PRK11539 hypothetical protein; 98.8 2.1E-07 5.5E-12 65.6 13.2 63 4-96 498-570 (754) 30 PRK10241 hydroxyacylglutathion 98.8 1.9E-08 4.9E-13 72.1 7.4 106 53-187 22-128 (251) 31 KOG1135 consensus 98.8 4.4E-08 1.1E-12 69.9 8.1 157 41-208 17-198 (764) 32 COG5212 PDE1 Low-affinity cAMP 98.7 5.9E-08 1.5E-12 69.1 6.4 161 74-241 111-314 (356) 33 COG0491 GloB Zn-dependent hydr 98.6 2.5E-07 6.3E-12 65.2 8.4 127 53-189 35-170 (252) 34 KOG1137 consensus 98.6 4.2E-08 1.1E-12 70.0 4.1 172 2-208 11-207 (668) 35 PRK11709 putative L-ascorbate 98.6 1.5E-06 3.8E-11 60.3 11.1 176 40-240 46-283 (355) 36 COG2333 ComEC Predicted hydrol 98.5 5.8E-06 1.5E-10 56.7 12.6 186 39-254 54-257 (293) 37 KOG1361 consensus 98.4 2.8E-06 7.2E-11 58.6 8.7 176 56-256 92-288 (481) 38 COG0426 FpaA Uncharacterized f 98.2 2.7E-06 6.9E-11 58.7 5.6 121 41-186 38-165 (388) 39 KOG0813 consensus 97.8 0.00013 3.3E-09 48.3 7.3 36 74-112 50-85 (265) 40 PRK05452 anaerobic nitric oxid 97.5 0.00016 4E-09 47.8 5.2 122 41-186 37-166 (479) 41 PRK11921 metallo-beta-lactamas 97.4 0.00025 6.5E-09 46.5 5.2 116 52-186 40-162 (395) 42 KOG3798 consensus 96.9 0.0064 1.6E-07 37.7 8.1 153 71-241 128-300 (343) 43 COG2015 Alkyl sulfatase and re 96.6 0.0021 5.3E-08 40.8 3.5 66 51-116 133-205 (655) 44 KOG0814 consensus 94.5 0.11 2.7E-06 30.2 5.5 53 43-98 24-80 (237) 45 KOG3592 consensus 92.6 0.11 2.7E-06 30.2 2.9 50 40-95 49-101 (934) 46 COG0391 Uncharacterized conser 87.8 2.1 5.3E-05 22.1 7.5 208 1-240 4-230 (323) 47 KOG2121 consensus 87.8 0.24 6E-06 28.0 1.2 67 53-119 73-143 (746) 48 KOG1138 consensus 87.5 0.79 2E-05 24.8 3.8 39 73-116 94-132 (653) 49 cd00530 PTE Phosphotriesterase 86.7 1.3 3.4E-05 23.4 4.5 45 220-264 163-209 (293) 50 TIGR01970 DEAH_box_HrpB ATP-de 82.7 2 5.2E-05 22.2 4.0 54 61-116 98-160 (858) 51 COG1735 Php Predicted metal-de 75.7 6 0.00015 19.3 4.8 44 221-264 179-225 (316) 52 KOG4736 consensus 75.1 2.7 7E-05 21.4 2.8 41 50-93 101-143 (302) 53 pfam02126 PTE Phosphotriestera 74.9 5.7 0.00015 19.4 4.3 45 220-264 169-215 (308) 54 pfam07521 RMMBL RNA-metabolisi 74.4 4.1 0.0001 20.3 3.5 26 215-240 15-40 (43) 55 TIGR02964 xanthine_xdhC xanthi 67.1 4.3 0.00011 20.2 2.3 38 226-265 179-218 (270) 56 cd07187 YvcK_like family of mo 64.5 11 0.00027 17.8 5.2 69 175-243 150-219 (308) 57 TIGR00140 hupD hydrogenase exp 59.8 11 0.00027 17.7 3.2 55 192-246 46-126 (156) 58 TIGR02817 adh_fam_1 zinc-bindi 58.9 3 7.6E-05 21.2 0.2 44 40-89 188-231 (338) 59 TIGR00143 hypF [NiFe] hydrogen 58.5 9.9 0.00025 17.9 2.8 75 183-263 430-523 (799) 60 TIGR01826 CofD_related conserv 58.1 14 0.00035 17.0 5.9 73 170-242 154-233 (331) 61 PRK09875 putative hydrolase; P 55.7 15 0.00038 16.8 4.9 47 220-266 165-213 (292) 62 TIGR00658 orni_carb_tr ornithi 53.8 16 0.00041 16.6 4.7 48 35-87 56-113 (341) 63 PRK11664 ATP-dependent RNA hel 52.8 17 0.00043 16.5 6.3 47 43-90 83-131 (812) 64 TIGR00412 redox_disulf_2 redox 49.8 8.5 0.00022 18.3 1.4 11 4-14 1-11 (78) 65 TIGR00361 ComEC_Rec2 DNA inter 47.7 4.7 0.00012 20.0 -0.3 26 70-95 540-565 (731) 66 TIGR01931 cysJ sulfite reducta 45.6 21 0.00053 15.9 2.8 55 180-234 544-621 (628) 67 TIGR00432 arcsn_tRNA_tgt archa 45.2 9 0.00023 18.2 0.9 13 21-33 276-288 (658) 68 pfam01872 RibD_C RibD C-termin 43.0 24 0.00061 15.6 2.8 10 74-83 39-48 (196) 69 KOG2882 consensus 39.3 27 0.00069 15.2 4.9 12 195-206 174-185 (306) 70 TIGR02417 fruct_sucro_rep D-fr 37.7 29 0.00073 15.0 3.0 52 52-111 89-146 (335) 71 TIGR00107 deoD purine nucleosi 37.4 17 0.00043 16.5 1.3 14 6-21 53-66 (234) 72 TIGR02181 GRX_bact glutaredoxi 37.0 6 0.00015 19.3 -1.1 62 23-98 7-72 (82) 73 cd01946 ribokinase_group_C Rib 36.8 29 0.00075 15.0 2.5 49 191-239 154-202 (277) 74 PRK11200 grxA glutaredoxin 1; 34.8 19 0.00047 16.2 1.2 74 5-99 1-79 (87) 75 cd07044 CofD_YvcK Family of Co 34.8 32 0.00082 14.8 4.9 66 176-241 149-216 (309) 76 TIGR01212 TIGR01212 radical SA 34.1 29 0.00073 15.1 2.0 52 185-240 120-185 (307) 77 TIGR00216 ispH_lytB 4-hydroxy- 33.0 34 0.00087 14.6 3.0 53 177-238 241-296 (354) 78 pfam08915 tRNA-Thr_ED Archaea- 32.4 35 0.00089 14.5 3.5 26 220-245 59-87 (137) 79 PRK13789 phosphoribosylamine-- 31.8 21 0.00054 15.9 1.1 14 1-14 1-14 (426) 80 COG4566 TtrR Response regulato 31.2 37 0.00093 14.4 2.6 37 53-91 49-91 (202) 81 CHL00194 ycf39 Ycf39; Provisio 30.9 37 0.00095 14.4 5.8 27 215-241 205-231 (319) 82 TIGR02363 dhaK1 dihydroxyaceto 27.3 43 0.0011 14.0 3.3 15 188-202 82-96 (354) 83 TIGR03471 HpnJ hopanoid biosyn 26.6 44 0.0011 13.9 6.2 30 56-86 48-79 (472) 84 PRK07261 topology modulation p 26.4 33 0.00084 14.7 1.3 12 5-16 1-12 (171) 85 TIGR01110 mdcA malonate decarb 26.3 16 0.00042 16.6 -0.3 37 38-85 172-210 (551) 86 TIGR02633 xylG D-xylose ABC tr 26.3 42 0.0011 14.0 1.8 11 74-84 190-200 (501) 87 TIGR01751 crot-CoA-red crotony 24.9 47 0.0012 13.7 2.1 51 60-114 179-234 (409) 88 TIGR01302 IMP_dehydrog inosine 24.9 47 0.0012 13.7 1.8 27 199-232 250-276 (476) 89 pfam05571 DUF766 Protein of un 22.8 52 0.0013 13.5 1.7 27 8-41 9-35 (296) 90 smart00812 Alpha_L_fucos Alpha 22.5 53 0.0013 13.4 3.7 10 79-88 98-107 (384) 91 TIGR01621 RluA-like pseudourid 22.1 54 0.0014 13.4 2.3 44 50-94 16-59 (222) 92 cd00518 H2MP Hydrogenase speci 22.0 54 0.0014 13.3 5.8 22 218-239 89-112 (139) 93 COG0455 flhG Antiactivator of 21.8 54 0.0014 13.3 2.5 13 52-64 31-44 (262) 94 pfam09108 Xol-1_N Switch prote 21.7 37 0.00094 14.4 0.8 32 79-112 70-105 (164) 95 KOG0978 consensus 21.2 15 0.00038 16.8 -1.3 10 80-89 63-72 (698) 96 cd07186 CofD_like LPPG:FO 2-ph 20.9 57 0.0014 13.2 4.7 62 179-240 161-224 (303) 97 TIGR00337 PyrG CTP synthase; I 20.9 57 0.0015 13.2 4.2 40 200-241 142-184 (571) 98 KOG4584 consensus 20.8 51 0.0013 13.5 1.4 30 185-214 284-313 (348) 99 pfam09788 Tmemb_55A Transmembr 20.6 54 0.0014 13.4 1.4 12 12-23 59-70 (258) 100 TIGR01469 cobA_cysG_Cterm urop 20.4 58 0.0015 13.1 1.7 29 205-233 198-227 (242) 101 PRK00771 signal recognition pa 20.1 59 0.0015 13.1 3.9 44 75-120 235-278 (433) No 1 >PRK02113 putative hydrolase; Provisional Probab=100.00 E-value=0 Score=501.76 Aligned_cols=251 Identities=34% Similarity=0.541 Sum_probs=224.8 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHHHHHHCCCCCCCEEEEECC Q ss_conf 59999854178965133788787777876774443069999336778971899987984156675223342059999743 Q gi|254780214|r 5 YHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQVLREQVLSIDAVLYTHP 84 (271) Q Consensus 5 m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~~i~~Id~I~iTH~ 84 (271) |||||||||+|+|+|++||+|++|+++++||+|+|+|++|+. +++++|||||||+|.|+++.++++||+||+||. T Consensus 1 Mki~~LGtG~s~GvP~~~C~C~~C~~~~~~~~R~r~s~lv~~-----~~~~iLID~gpD~r~Q~lr~~i~~iDavllTH~ 75 (252) T PRK02113 1 MKIRILGSGTSTGVPEIGCTCPVCTSKDPRDNRLRTSALVET-----EGARILIDCGPDFREQMLRLPFGKIDAVLITHE 75 (252) T ss_pred CEEEEEEEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEEEE-----CCCEEEEECCCCHHHHHHHCCCCCCCEEEEECC T ss_conf 989999770896742799978769886878872464899998-----992799968913899998779876148999745 Q ss_pred CHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEE Q ss_conf 52422011102433317821110131120232100222210136677531001232112344333420115874899999 Q gi|254780214|r 85 HADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICMKSAGGVIEAIPI 164 (271) Q Consensus 85 H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~ 164 (271) |+||+.||++||+|. +.+++|||+++.+.+.++++|+|+|......+. +.+....++ ....+...+++++|+ T Consensus 76 H~DHi~GlddLR~f~--~~~~ipvy~~~~t~~~l~~~~~y~f~~~~~~~~-~~l~~~~i~-----~~~~f~i~~~~v~p~ 147 (252) T PRK02113 76 HYDHVGGLDDLRPFC--RFGEVPIYAEQYVAERLRSRMPYCFVEHSYPGV-PNIPLREIE-----PDRPFLVNHTEVTPL 147 (252) T ss_pred CHHHHCCHHHHHHHH--CCCCCCEEECHHHHHHHHHHCHHHHCCCCCCCC-CCCEEEEEC-----CCCCEEECEEEEEEE T ss_conf 704426989999885--458744665789999998633112146888886-563157715-----999467420799878 Q ss_pred EEECCCEECCCCCCCCEEEECCCCCCHHHHHHHHCCHHEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCC Q ss_conf 95077621123346862530323300167888622211025302335768875799999999997299779997688898 Q gi|254780214|r 165 LQQHGRISSLGFRFGNVAYCTDVNAFPAESLEKLQNLDFLIIDALKNGLHSSHFSLSESLKKIELINPKNAILTHMHVDL 244 (271) Q Consensus 165 ~~~Hg~~~~~Gyri~~~~Y~~Dt~~~~e~~~~~l~~~D~lI~d~l~~~~~~~H~~~~ea~~~~~~~~~k~~~LtHls~~~ 244 (271) +++||..+++||||++++|++|++.+||+++++++|+|+||+||++..+|++|++++||++++++++|||++||||||++ T Consensus 148 ~v~Hg~~~~~G~rig~~aY~tD~~~ip~~~~~~L~~~D~liid~l~~~~h~sH~~~~eal~~~~~i~pkr~~ltH~sH~~ 227 (252) T PRK02113 148 RVMHGKLPILGYRIGKMAYITDMLTMPEEEYEQLQGIDVLVMNALRIAPHPTHQSLEEALENIKRIGAKETYLIHMSHHI 227 (252) T ss_pred EECCCCCEECCEECCCEEEECCCCCCCHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCCC T ss_conf 50688821644750446993787657989999753588578743567999997899999999997099829998988986 Q ss_pred C-HHHHHHHCCCCCEEEEECCEEEE Q ss_conf 8-88999867998199020448996 Q gi|254780214|r 245 D-YDMVLKSTPSRVVPAFDGMQFSS 268 (271) Q Consensus 245 ~-~~~~~~~~~~~v~~A~DGm~i~l 268 (271) + |+++++.+|+||++|||||+|++ T Consensus 228 ~~h~~~~~~lp~~v~~AYDGL~i~f 252 (252) T PRK02113 228 GLHADVEKELPPHVHFAYDGLEIIF 252 (252) T ss_pred CHHHHHHHHCCCCCEEEECCEEEEC T ss_conf 2699999667899889668949849 No 2 >TIGR03307 PhnP phosphonate metabolism protein PhnP. This family of proteins found in operons encoding phosphonate C-P lyase systems as is observed in E. coli and is a member of the metallo-beta-lactamase superfamily (pfam00753). As defined by this model, all instances of this protein are associated with the C-P lyase, but not all genomes containing the C-P lyase system contain phnP. Probab=100.00 E-value=0 Score=428.30 Aligned_cols=235 Identities=29% Similarity=0.422 Sum_probs=193.0 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCC--CCCCCCCCCEEEEEECCCCCCCEEEEEECC-CCHHHHHHHCCCCCCCEEEE Q ss_conf 599998541789651337887877778--767744430699993367789718999879-84156675223342059999 Q gi|254780214|r 5 YHFTILGCGASLGVPRITGDWGACDPT--NPKNRRTRSSLKISRVSERGSNTTVIVDTG-PDFYMQVLREQVLSIDAVLY 81 (271) Q Consensus 5 m~l~~LGtg~s~g~P~~~c~c~~C~~~--~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G-~d~r~ql~~~~i~~Id~I~i 81 (271) |||||||||+|+|||++||+|++|+++ +++|+|+|+|++|+. ++++|||||| ||+|.|+.. ++||+||+ T Consensus 1 Mki~~LGtG~s~GvP~~~C~C~~C~~a~~~~~~rr~r~s~li~~-----~~~~iLID~g~pdlr~ql~~---~~IdaVll 72 (249) T TIGR03307 1 MRLTFLGTGGAQQVPVYGCDCVACQRARRNQDYRRQPCSAVIEV-----NGVRTLIDAGLTDLAERFPP---GSLQAILL 72 (249) T ss_pred CEEEEEEECCCCCCCCCCCCCCCCCHHCCCCCCCCCCEEEEEEE-----CCCEEEEECCCHHHHHHCCC---CCCCEEEE T ss_conf 98999965489764578998865742101877665111899998-----99289998997799986051---68788999 Q ss_pred ECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCEEE Q ss_conf 74352422011102433317821110131120232100222210136677531001232112344333420115874899 Q gi|254780214|r 82 THPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICMKSAGGVIEA 161 (271) Q Consensus 82 TH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i 161 (271) ||.|+||+.||++|| | ..++++|||+++.+. .+...|.| .....+. ..... ...+..+++++ T Consensus 73 TH~H~DHi~GlddlR-~--~~~~~ipvy~~~~~~-~~~~~~~~----~~~~~~~-----~~~~~-----~~~~~i~g~~i 134 (249) T TIGR03307 73 THYHMDHVQGLFPLR-W--GMGEPIPVYGPPDPE-GCDDLFKH----PGILDFS-----KPLLA-----FEPFHLGGLRV 134 (249) T ss_pred ECCCHHHHCCCHHHH-H--CCCCCEEEEECCCHH-HHHHHHCC----CCCCCCC-----CCCCC-----CCCEEECCEEE T ss_conf 653345446624455-2--168865788688788-99998626----9842434-----65479-----89677398189 Q ss_pred EEEEEECCCEECCCCCC----CCEEEECCCCCCHHHHHHHHCC--HHEEEEECC---CCCCCCCCCCHHHHHHHHHHCCC Q ss_conf 99995077621123346----8625303233001678886222--110253023---35768875799999999997299 Q gi|254780214|r 162 IPILQQHGRISSLGFRF----GNVAYCTDVNAFPAESLEKLQN--LDFLIIDAL---KNGLHSSHFSLSESLKKIELINP 232 (271) Q Consensus 162 ~~~~~~Hg~~~~~Gyri----~~~~Y~~Dt~~~~e~~~~~l~~--~D~lI~d~l---~~~~~~~H~~~~ea~~~~~~~~~ 232 (271) +|+++.|+. .++||+| +++||+||++.+|+++++++++ +|+||+||+ +..+|++|++++||++++++++| T Consensus 135 ~p~~~~H~~-~~~Gy~i~~~g~~~aY~tD~~~ip~es~~~L~~~~lD~li~d~~~~~~~~~~~sH~~~~eale~~~~l~p 213 (249) T TIGR03307 135 TPVPLVHSK-LTFGYLLETDGQRLAYLTDTAGLPPDTEAFLKNHPLDLLILDCSHPPQEDAPRNHNDLTRALAIYEQLRP 213 (249) T ss_pred EEEEEECCC-CCEEEEEECCCCEEEEECCCCCCCHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHCC T ss_conf 999996078-6469999518955999678887798999997289988899627877767888887899999999997199 Q ss_pred CEEEEECCCCCCCH--HHHHHHCCCCCEEEEECCEE Q ss_conf 77999768889888--89998679981990204489 Q gi|254780214|r 233 KNAILTHMHVDLDY--DMVLKSTPSRVVPAFDGMQF 266 (271) Q Consensus 233 k~~~LtHls~~~~~--~~~~~~~~~~v~~A~DGm~i 266 (271) ||++||||||++|+ .+..+.+|+||++|||||+| T Consensus 214 kr~~lTHmsh~ld~~~~~~~~~lp~~v~~ayDGl~i 249 (249) T TIGR03307 214 KQLILTHISHQLDAWLMENPDCLPDGVAIGYDGQTL 249 (249) T ss_pred CEEEEECCCCCCCHHHHHHHHHCCCCCEEECCCCCC T ss_conf 849998988574468876575489996892378499 No 3 >PRK11244 phnP carbon-phosphorus lyase complex accessory protein; Provisional Probab=100.00 E-value=0 Score=424.11 Aligned_cols=236 Identities=29% Similarity=0.440 Sum_probs=193.1 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCC--CCCCCCCCCEEEEEECCCCCCCEEEEEECC-CCHHHHHHHCCCCCCCEEEE Q ss_conf 599998541789651337887877778--767744430699993367789718999879-84156675223342059999 Q gi|254780214|r 5 YHFTILGCGASLGVPRITGDWGACDPT--NPKNRRTRSSLKISRVSERGSNTTVIVDTG-PDFYMQVLREQVLSIDAVLY 81 (271) Q Consensus 5 m~l~~LGtg~s~g~P~~~c~c~~C~~~--~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G-~d~r~ql~~~~i~~Id~I~i 81 (271) |||||||||+|+|||++||+|++|++| +++|+|+|+|++|+. ++.+|||||| ||+|.|+. .++||+||+ T Consensus 1 Mki~~LGtG~s~GvP~~~C~C~~C~~a~~~~~~rR~r~s~lv~~-----~~~~iLID~g~~dl~~~~~---~~~iDaVll 72 (250) T PRK11244 1 MRLTLLGTGGAQGVPVYGCECVACARARRDPAYRRRPCSALLEF-----NGARTLIDAGLHDLAERFP---PGSLQAFLL 72 (250) T ss_pred CEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEE-----CCEEEEEECCCHHHHHHCC---CCCCCEEEE T ss_conf 98999973589862478998833730014855450334899998-----9927999898089997555---568778998 Q ss_pred ECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCEECCCCCEE Q ss_conf 74352422011102433317821110131120232100222210136677531001232-11234433342011587489 Q gi|254780214|r 82 THPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNP-IVIENNDVPICMKSAGGVIE 160 (271) Q Consensus 82 TH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~i~ 160 (271) ||.|+||+.||++|| | ..++++|||+++.+. .+.. +|+.. .+... ..++. ...+..++++ T Consensus 73 TH~H~DHi~GlddlR-~--~~~~~ipvy~~~~~~-~~~~----l~~~~------~~~~~~~~~~~-----~~~~~i~g~~ 133 (250) T PRK11244 73 THYHMDHVQGLFPLR-W--GVGDVIPVYGPPDPE-GCDD----LFKHP------GILDFSHPLEA-----FEPFDLGGLQ 133 (250) T ss_pred ECCCHHHHCCHHHHH-H--CCCCCEEEEECCCHH-HHHH----HHHCC------CCCCCCCCCCC-----CCCEEECCEE T ss_conf 257533433768887-1--379835677377577-8999----85078------84342366588-----8866868838 Q ss_pred EEEEEEECCCEECCCCCCCC----EEEECCCCCCHHHHHHHHC--CHHEEEEECC---CCCCCCCCCCHHHHHHHHHHCC Q ss_conf 99999507762112334686----2530323300167888622--2110253023---3576887579999999999729 Q gi|254780214|r 161 AIPILQQHGRISSLGFRFGN----VAYCTDVNAFPAESLEKLQ--NLDFLIIDAL---KNGLHSSHFSLSESLKKIELIN 231 (271) Q Consensus 161 i~~~~~~Hg~~~~~Gyri~~----~~Y~~Dt~~~~e~~~~~l~--~~D~lI~d~l---~~~~~~~H~~~~ea~~~~~~~~ 231 (271) ++|++++|+. +++||+|++ ++|++|++.+||+++++++ ++|+||+||+ +..++++|++++||++++++++ T Consensus 134 itp~~~~Hg~-~~~Gy~i~~~g~~~aY~tD~~~ipe~~~~~L~~~~ld~lv~d~~~~~~~~~~~sH~~~~~ale~i~~l~ 212 (250) T PRK11244 134 VTPLPLNHSK-LTFGYLLETAHSRFAYLTDTVGLPEDTAKFLRNNQPDLLVLDCSHPPQEDAPRNHNDLTTALAIIEVLR 212 (250) T ss_pred EEEEEECCCC-CCEEEEEECCCCEEEEECCCCCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHC T ss_conf 9999977889-736999964893299966778779999999861899899986887887887988799999999999729 Q ss_pred CCEEEEECCCCCCCH-HHHHHHCCCCCEEEEECCEEEE Q ss_conf 977999768889888-8999867998199020448996 Q gi|254780214|r 232 PKNAILTHMHVDLDY-DMVLKSTPSRVVPAFDGMQFSS 268 (271) Q Consensus 232 ~k~~~LtHls~~~~~-~~~~~~~~~~v~~A~DGm~i~l 268 (271) |||++||||||++|+ .+..+.+|+||++|||||+|+| T Consensus 213 pkr~~lTH~sh~~d~~~~~~~~lp~~v~~aYDGl~i~l 250 (250) T PRK11244 213 PPRVILTHISHQLDAWLMENAALPSGVEVAYDGMEIGL 250 (250) T ss_pred CCEEEEECCCCCCCHHHHHHHCCCCCCEEEECCEEEEC T ss_conf 98799989667763577776408999689628989769 No 4 >PRK05184 pyrroloquinoline quinone biosynthesis protein PqqB; Provisional Probab=100.00 E-value=0 Score=365.31 Aligned_cols=249 Identities=21% Similarity=0.295 Sum_probs=192.6 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCCC----CCCCCCCEEEEEECCCCCCCEEEEEECCCCHHHHHHHC-------CC Q ss_conf 59999854178965133788787777876----77444306999933677897189998798415667522-------33 Q gi|254780214|r 5 YHFTILGCGASLGVPRITGDWGACDPTNP----KNRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQVLRE-------QV 73 (271) Q Consensus 5 m~l~~LGtg~s~g~P~~~c~c~~C~~~~~----~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~-------~i 73 (271) ||++|||||+++|+|+|||+|++|++++. .+.|+|+|++|+.++.+ .+||||+||+|.||++. ++ T Consensus 1 M~v~vLGta~~gGvPq~gC~c~~C~~ar~~~~~~~~Rt~sSi~v~~dg~~----~~LiDasPDlr~QL~~~~~l~p~~~~ 76 (302) T PRK05184 1 MRIIVLGSAAGGGFPQWNCNCPNCRGARAGTIRAQPRTQSSIAVSADGED----WVLVNASPDIRQQIRATPALQPARGL 76 (302) T ss_pred CEEEEEEECCCCCCCCCCCCCCCCCHHHCCCCCCCCCCEEEEEEECCCCE----EEEEECCCCHHHHHHHHHHHCCCCCC T ss_conf 98999997778962558989701704224885566541148999828986----99998993899998651441754344 Q ss_pred --CCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf --420599997435242201110243331782111013112023210022221013667753100123211234433342 Q gi|254780214|r 74 --LSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPIC 151 (271) Q Consensus 74 --~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (271) +.||+||+||+|.||+.||.+|| .+++++||+++.+.+.|+++|++ |........ .....+...+ +.. T Consensus 77 r~s~i~~V~lTHaH~dH~~GL~~LR-----e~~~i~vya~~~~~~~L~~~~p~-~~~l~~~~~---v~~~~i~~~~-~~~ 146 (302) T PRK05184 77 RDTPIAAVVLTDGQIDHVTGLLSLR-----EGQPFPLYATPMVLEDLSTGFPI-FNVLDHYCG---VDRRPIALDG-PFA 146 (302) T ss_pred CCCCCCEEEEECCCCCHHCCHHHHH-----CCCCCCEEECHHHHHHHHHCCCH-HHHHCCCCC---CCCEECCCCC-CCC T ss_conf 5787348998657500331788871-----56972067468999999863988-785333466---2102414688-623 Q ss_pred EECCCCCEEEEEEEEE----------CCC--EECCCCCC------CCEEEECCCCCCHHHHHHHHCCHHEEEEECCCCCC Q ss_conf 0115874899999950----------776--21123346------86253032330016788862221102530233576 Q gi|254780214|r 152 MKSAGGVIEAIPILQQ----------HGR--ISSLGFRF------GNVAYCTDVNAFPAESLEKLQNLDFLIIDALKNGL 213 (271) Q Consensus 152 ~~~~~~~i~i~~~~~~----------Hg~--~~~~Gyri------~~~~Y~~Dt~~~~e~~~~~l~~~D~lI~d~l~~~~ 213 (271) . ....+++++||+++ |+. .+++||+| ++++|++|++.+++++++.++++|++++|++++.. T Consensus 147 ~-~~~~gl~v~p~~Vp~k~p~~~~h~~~~~~~etvG~~i~~~~~gk~~~YipD~~~~~~~~~~~l~~~D~~l~DGTf~~~ 225 (302) T PRK05184 147 V-PGLPGLRFTAFPVPSKAPPYSPHRGDPHPGDNIGLRIEDTATGRRLFYAPGLAEVTDDLRARLAGADCVLFDGTLWRD 225 (302) T ss_pred C-CCCCCCEEEEEECCCCCCCCCCCCCCCCCCCEEEEEEECCCCCEEEEEECCCCCCCHHHHHHHHCCCEEEEECCCCCC T ss_conf 4-787981899986488777654445678768468999972788717999448565688999998469989995002245 Q ss_pred -----------------CCCCCCHHHHHHHHHHCCCCEEEEECCCCCC-----CHHHHHHHCCCCCEEEEECCEEEE Q ss_conf -----------------8875799999999997299779997688898-----888999867998199020448996 Q gi|254780214|r 214 -----------------HSSHFSLSESLKKIELINPKNAILTHMHVDL-----DYDMVLKSTPSRVVPAFDGMQFSS 268 (271) Q Consensus 214 -----------------~~~H~~~~ea~~~~~~~~~k~~~LtHls~~~-----~~~~~~~~~~~~v~~A~DGm~i~l 268 (271) |..|....++++++++++++|.+||||||.. +..|.+..++.++++|||||+|+| T Consensus 226 dEm~~~~~~~k~~r~m~H~p~~~~~~sl~~l~~l~~~rkifiHiNhTNP~l~~~s~e~~~v~~~G~~vA~dGm~~~L 302 (302) T PRK05184 226 DEMIRAGLGTKTGRRMGHLAQSGPGGMIAALADLPIARKVLIHINNTNPILDEDSPERAELEAAGWEVAHDGMEIEL 302 (302) T ss_pred CCCHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHCCCEEECCCCEEEC T ss_conf 20000366766667788788787477999997479887899982699814376878788787679888068878879 No 5 >PRK00055 ribonuclease Z; Reviewed Probab=100.00 E-value=0 Score=338.44 Aligned_cols=236 Identities=22% Similarity=0.263 Sum_probs=179.5 Q ss_pred EEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHHHHHHCCC--CCCCEEEE Q ss_conf 2599998541789651337887877778767744430699993367789718999879841566752233--42059999 Q gi|254780214|r 4 YYHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQVLREQV--LSIDAVLY 81 (271) Q Consensus 4 ~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~~i--~~Id~I~i 81 (271) .|||||||||++ +|.. .|.+||++|+. ++.+||||||++++.||.+.++ ++||+||| T Consensus 1 mmkit~LGtg~~--~p~~--------------~R~~s~~li~~-----~~~~iliD~G~G~~~~l~~~~~~~~~l~~I~i 59 (259) T PRK00055 1 MMELTFLGTGSG--VPTP--------------TRNVSSILLRL-----GGELWLFDCGEGTQRQMLRAGIGPRKIDRIFI 59 (259) T ss_pred CCEEEEEECCCC--CCCC--------------CCCCCEEEEEE-----CCEEEEEECCCHHHHHHHHCCCCHHHCCEEEE T ss_conf 948999906899--8899--------------99648899999-----99089995881599999981999557658999 Q ss_pred ECCCHHHHHHCCHHH--HHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCE Q ss_conf 743524220111024--333178211101311202321002222101366775310012321123443334201158748 Q gi|254780214|r 82 THPHADHIHGIDGLR--GYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICMKSAGGVI 159 (271) Q Consensus 82 TH~H~DHi~Gl~~Lr--~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 159 (271) ||.|+||+.||++|. .+...+.+|++||+|+++.+.++....+... .. .++ ....+.........+..+++ T Consensus 60 TH~H~DH~~Gl~~ll~~~~~~~~~~~l~i~gp~~~~~~l~~~~~~~~~-~~--~~~----~~~~~~~~~~~~~~~~~~~~ 132 (259) T PRK00055 60 THLHGDHIFGLPGLLSTRSLSGRTEPLTIYGPKGIKEFVETLLRASGS-HL--PYR----IAIHEIDEKTEGVVFDDDGF 132 (259) T ss_pred ECCCHHHHCHHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHHCC-CC--CCC----EEEEECCCCCCCEEEECCCE T ss_conf 643357665289999998745999716999177899999999885066-88--972----56763257987328964998 Q ss_pred EEEEEEEECCCEECCCCCCC----CEEEECCCCCCHHHHHHHHCCHHEEEEECCCCCC------CCCCCCHHHHHHHHHH Q ss_conf 99999950776211233468----6253032330016788862221102530233576------8875799999999997 Q gi|254780214|r 160 EAIPILQQHGRISSLGFRFG----NVAYCTDVNAFPAESLEKLQNLDFLIIDALKNGL------HSSHFSLSESLKKIEL 229 (271) Q Consensus 160 ~i~~~~~~Hg~~~~~Gyri~----~~~Y~~Dt~~~~e~~~~~l~~~D~lI~d~l~~~~------~~~H~~~~ea~~~~~~ 229 (271) ++++++++|+ .+|+||||+ +++|++||. .+++.+++++|+|+||+||++... .++|+|++||+++|++ T Consensus 133 ~i~~~~~~H~-~~s~Gyri~~~g~~i~ysgDt~-~~~~l~~~~~gaDlLI~ea~~~~~~~~~~~~~~Hst~~~a~~~a~~ 210 (259) T PRK00055 133 TVTAFPLDHS-IPCLGYRFEEKGRKIAYVGDTR-PCEALVELAKGADLLVHEATFGDEDEAKAKEHYHSTAKQAAEIAKE 210 (259) T ss_pred EEEEEECCCC-CCEEEEEEEECCCEEEEECCCC-CCHHHHHHHCCCCEEEEECCCCCCCHHHCCCCCCCCHHHHHHHHHH T ss_conf 9999986799-8627999996881699952788-6888998860699999946557742244137766899999999998 Q ss_pred CCCCEEEEECCCCCCCH---H----HHHHHCCCCCEEEEECCEEEEEC Q ss_conf 29977999768889888---8----99986799819902044899626 Q gi|254780214|r 230 INPKNAILTHMHVDLDY---D----MVLKSTPSRVVPAFDGMQFSSPI 270 (271) Q Consensus 230 ~~~k~~~LtHls~~~~~---~----~~~~~~~~~v~~A~DGm~i~l~~ 270 (271) ++||+++|||+++..+. . ++++.. .||++|||||+|+||- T Consensus 211 ~~~k~lvLtH~~p~~~~~~~~~~~~e~~~~f-~~v~~A~Dgm~i~l~~ 257 (259) T PRK00055 211 AGVKRLILTHFSPRYTDPEELQLLKEAREIF-PNTELAEDFMRVEVPK 257 (259) T ss_pred CCCCEEEEEEECCCCCCCHHHHHHHHHHHHC-CCCEECCCCCEEECCC T ss_conf 4999999992077889824999999999878-9929816998996279 No 6 >TIGR02651 RNase_Z ribonuclease Z; InterPro: IPR013471 All tRNA molecules are synthesised as long precursors with extra sequence at both the 5' and 3' ends, which undergo several processing steps to form the mature tRNA molecule. In many bacteria the tRNA precursors do not contain the CCA sequence which forms the 3' end of the mature tRNA and is essential for aminoacylation. In these organisms ribonucleotide Z cleaves the precursor to form a 3' end to which CCA is added. Species where all tRNA sequences already have the CCA tail, such as E. coli, do not generally posses this form of ribonuclease Z. ; GO: 0016891 endoribonuclease activity producing 5'-phosphomonoesters, 0042779 removal of tRNA 3'-trailer sequence. Probab=100.00 E-value=0 Score=333.74 Aligned_cols=238 Identities=28% Similarity=0.371 Sum_probs=180.5 Q ss_pred EEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHHHHHHCCC--CCCCEEEEEC Q ss_conf 99998541789651337887877778767744430699993367789718999879841566752233--4205999974 Q gi|254780214|r 6 HFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQVLREQV--LSIDAVLYTH 83 (271) Q Consensus 6 ~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~~i--~~Id~I~iTH 83 (271) +|||||||+ |+|.. .|+-+|++|...... |..+|||||++.+.||++.++ ++|+.||||| T Consensus 1 ~i~fLGTGg--~~Pt~--------------~Rn~~si~l~~~~e~--g~~~LFDCGEGTQ~Q~~~~~~~~~~i~~IFITH 62 (327) T TIGR02651 1 EITFLGTGG--GVPTK--------------ERNVSSIALKLEDER--GELWLFDCGEGTQRQMLRSGISPMKIDRIFITH 62 (327) T ss_pred CEEEEECCC--CCCCC--------------CCCCCEEEEECHHHC--CCEEEEECCCHHHHHHHHCCCCCCCCCEEEEEC T ss_conf 958985277--88775--------------647758999602242--874255477189999997168863111368706 Q ss_pred CCHHHHHHCCHHHH-HH-H--CCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC----------- Q ss_conf 35242201110243-33-1--782111013112023210022221013667753100123211234433----------- Q gi|254780214|r 84 PHADHIHGIDGLRG-YF-L--KQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDV----------- 148 (271) Q Consensus 84 ~H~DHi~Gl~~Lr~-~~-~--~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~----------- 148 (271) +|.|||.|||.|.. +. . .+++||.||||+++.+.++..+...... -.|+. ....++.... T Consensus 63 ~HGDHi~GLPGLl~t~sf~G~~r~~pL~IyGP~G~~e~~~~~l~~~~~~---~~y~i--~~~E~~~~~~~P~~~~~k~k~ 137 (327) T TIGR02651 63 LHGDHILGLPGLLSTMSFQGGGRKEPLTIYGPPGIKEFIETSLRVSYTY---LNYPI--KIHEIEEKGVLPEFVVYKEKL 137 (327) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCEEEECCHHHHHHHHHHHHHHCCC---CCCCE--EEEEECCCCCCCHHHHHCCCE T ss_conf 6314554236888831037888644528868744799999999862237---88415--999726899773123102561 Q ss_pred -CCCEECCCCCEEEEEEEEECCCEECCCCCCC------------------------------------------------ Q ss_conf -3420115874899999950776211233468------------------------------------------------ Q gi|254780214|r 149 -PICMKSAGGVIEAIPILQQHGRISSLGFRFG------------------------------------------------ 179 (271) Q Consensus 149 -~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri~------------------------------------------------ 179 (271) .....+....++|.++++.| .++|+||+|. T Consensus 138 l~~~~~~~~~~f~v~~~~l~H-~~~~~GY~~~Ekd~~g~~~~eka~~~g~p~Gp~~~~L~~G~~v~~~~G~~i~~~~v~G 216 (327) T TIGR02651 138 LEDGLVFEDDGFKVEAFPLDH-SIPSLGYRFEEKDRPGKFDREKAKELGIPPGPLYGKLKRGETVTLIDGRIIDPEDVLG 216 (327) T ss_pred EECCEEEECCCEEEEEEECCC-CCEEEEEEEEECCCCCCCCHHHHHHCCCCCCHHHHHHHCCCEEEECCCEEECHHHHCC T ss_conf 202326765844899995057-8414678998638899873899995589977368997658618963886878034058 Q ss_pred ------CEEEECCCCCCHHHHHHHHCCHHEEEEECCCCC-CC------CCCCCHHHHHHHHHHCCCCEEEEECCCCCCCH Q ss_conf ------625303233001678886222110253023357-68------87579999999999729977999768889888 Q gi|254780214|r 180 ------NVAYCTDVNAFPAESLEKLQNLDFLIIDALKNG-LH------SSHFSLSESLKKIELINPKNAILTHMHVDLDY 246 (271) Q Consensus 180 ------~~~Y~~Dt~~~~e~~~~~l~~~D~lI~d~l~~~-~~------~~H~~~~ea~~~~~~~~~k~~~LtHls~~~~~ 246 (271) +++|+|||.+ +++++++++|||+|||||+... .. .+|||..||+++|++++||+++|||+|..++- T Consensus 217 ~~rkG~~~~y~gDT~p-~~~~~~~a~~~DlLiHEaTF~~~~~~~~A~e~~HST~~~AA~~Ak~A~Vk~L~LTH~S~RY~~ 295 (327) T TIGR02651 217 PPRKGRKIVYTGDTRP-CEEVIELAKNADLLIHEATFLDSEDKKLAKEYGHSTAAQAAEIAKEANVKKLILTHISPRYTD 295 (327) T ss_pred CCCCCEEEEEEECCCC-CHHHHHHHHCCCEEEEECCCCCCHHHHHHHHCCCCCHHHHHHHHHHHCCCEEEEEEECCCCCC T ss_conf 7888608999717754-778999983198899805777712589998607983899999998707755898885266786 Q ss_pred ------HHH---H-HHCCCCCEEEEECCEEEE Q ss_conf ------899---9-867998199020448996 Q gi|254780214|r 247 ------DMV---L-KSTPSRVVPAFDGMQFSS 268 (271) Q Consensus 247 ------~~~---~-~~~~~~v~~A~DGm~i~l 268 (271) .++ + +....|+.+|+|.|++++ T Consensus 296 ~~~~~~~~ll~eAr~~~f~~~~~a~D~~~~~i 327 (327) T TIGR02651 296 DESDDLEELLEEARKKIFPNTYIAEDFMEIEI 327 (327) T ss_pred CCCHHHHHHHHHHHHCCCCCEEEECCCEEEEC T ss_conf 32110799999973145776488414106619 No 7 >COG1234 ElaC Metal-dependent hydrolases of the beta-lactamase superfamily III [General function prediction only] Probab=100.00 E-value=5.8e-40 Score=268.02 Aligned_cols=244 Identities=24% Similarity=0.318 Sum_probs=167.1 Q ss_pred EEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHHHHHHCCCC--CCCEEEE Q ss_conf 25999985417896513378878777787677444306999933677897189998798415667522334--2059999 Q gi|254780214|r 4 YYHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQVLREQVL--SIDAVLY 81 (271) Q Consensus 4 ~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~~i~--~Id~I~i 81 (271) .|+++|||||++ +|.. .|..+|++|.. ++..+|||||++...|+++.+++ +|++||| T Consensus 1 ~m~i~fLGtg~~--~Pt~--------------~r~~~s~ll~~-----~~~~~L~DcGeGt~~~l~~~~~~~~~i~~IfI 59 (292) T COG1234 1 MMEITFLGTGGA--VPTK--------------DRNVSSILLRL-----EGEKFLFDCGEGTQHQLLRAGLPPRKIDAIFI 59 (292) T ss_pred CCEEEEEECCCC--CCCC--------------CCCCCEEEEEE-----CCEEEEEECCHHHHHHHHHHCCCCCCCCEEEE T ss_conf 978999966998--7747--------------67766799981-----99279997786799999980388202688998 Q ss_pred ECCCHHHHHHCCHHHH--HHHCCCCEEEEEECCEEEEEECCCCC-------C--CCCCCC--CCCCCCCCCCCCCCCCCC Q ss_conf 7435242201110243--33178211101311202321002222-------1--013667--753100123211234433 Q gi|254780214|r 82 THPHADHIHGIDGLRG--YFLKQKRPIDVYAAPDCMKHLFESFR-------Y--CFKALD--DRTYPPIVNPIVIENNDV 148 (271) Q Consensus 82 TH~H~DHi~Gl~~Lr~--~~~~~~~~i~vy~~~~~~~~l~~~~~-------y--~f~~~~--~~~~~~~~~~~~~~~~~~ 148 (271) ||.|.||+.||+.|.. ++.....|+.||+|++..+++..... + .+.... ...+........+..... T Consensus 60 TH~H~DHi~gL~~ll~~~~~~~~~~~l~iygP~g~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~v~~~~~~h~~~~~~y 139 (292) T COG1234 60 THLHGDHIAGLPGLLVSRSFRGRREPLKIYGPPGIKEFVETSLRLSYSKLTYEIIGHEIEEDAFEVEALELDHGVPALGY 139 (292) T ss_pred ECCCCCCCCCHHHHHHHHHCCCCCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEEECCCCCEEEEEEECCCCCCCCEE T ss_conf 27863423777899996140189998249888530021211211134445506999982466358988762766543224 Q ss_pred CCCEECCCC------------CEEEEEEEEECC---CEECCCCCC------CCEEEECCCCCCHHHHHHHHCCHHEEEEE Q ss_conf 342011587------------489999995077---621123346------86253032330016788862221102530 Q gi|254780214|r 149 PICMKSAGG------------VIEAIPILQQHG---RISSLGFRF------GNVAYCTDVNAFPAESLEKLQNLDFLIID 207 (271) Q Consensus 149 ~~~~~~~~~------------~i~i~~~~~~Hg---~~~~~Gyri------~~~~Y~~Dt~~~~e~~~~~l~~~D~lI~d 207 (271) ........+ ...+..++..|. .+.+.++++ .+++|++||++ +++.+++++|+|+||+| T Consensus 140 ~~~e~~~~~~~~~~~~~~~~~g~~~~~l~~~h~~~~~~~~~~~~~~~~~~G~~v~ysGDT~p-~~~~~~~a~~aDlLiHE 218 (292) T COG1234 140 RIEEPDRPGRFDAEKLKGLPPGPLITALKAGHPVEERVITPADRIGEPRKGKSVVYSGDTRP-CDELIDLAKGADLLIHE 218 (292) T ss_pred ECCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCEEECCHHHEEECCCCCEEEEEECCCCC-CHHHHHHHCCCCEEEEE T ss_conf 41367765545877751588886178641598431255148885305889709999799977-76899976689899996 Q ss_pred CCCCC-----CCCC-CCCHHHHHHHHHHCCCCEEEEECCCCCCC--HH----HHHHHCCCCCEEEEECCEEEEE Q ss_conf 23357-----6887-57999999999972997799976888988--88----9998679981990204489962 Q gi|254780214|r 208 ALKNG-----LHSS-HFSLSESLKKIELINPKNAILTHMHVDLD--YD----MVLKSTPSRVVPAFDGMQFSSP 269 (271) Q Consensus 208 ~l~~~-----~~~~-H~~~~ea~~~~~~~~~k~~~LtHls~~~~--~~----~~~~~~~~~v~~A~DGm~i~l~ 269 (271) |+... .+.. |+|.+||+++|+++++|+++|||+|+... .+ +..+..+.++.+|.|||+++++ T Consensus 219 at~~~~~~~~a~~~~HsT~~eAa~iA~~A~vk~LiLtH~s~ry~~~~~~~~~ea~~~f~~~~~~a~D~~~~~v~ 292 (292) T COG1234 219 ATFEDDLEDLANEGGHSTAEEAAEIAKEAGVKKLILTHFSPRYPKDDEELLKEARAIFPGETIVARDGLVFEVP 292 (292) T ss_pred EECCCCHHHHHCCCCCCCHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHHCCCCEEECCCCEEEECC T ss_conf 12476513440447999999999999975988699982276664407999999998589976874487178589 No 8 >COG1235 PhnP Metal-dependent hydrolases of the beta-lactamase superfamily I [General function prediction only] Probab=100.00 E-value=4.5e-40 Score=268.74 Aligned_cols=257 Identities=31% Similarity=0.408 Sum_probs=177.5 Q ss_pred CCEEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHHHHHHCCCCCCCEEEE Q ss_conf 87259999854178965133788787777876774443069999336778971899987984156675223342059999 Q gi|254780214|r 2 KDYYHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQVLREQVLSIDAVLY 81 (271) Q Consensus 2 ~~~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~~i~~Id~I~i 81 (271) +..|++++||||+++|+|+++|+|.+|. |.|.|.+..+.. +++|||+|+++|.|.++.+++++|+||+ T Consensus 1 ~~~~~f~~lgsG~~gg~p~~~~~~~~c~-----~~~~~v~~~~~~-------~~~lid~g~~~~~~~~~~~~~~idai~~ 68 (269) T COG1235 1 TGGMRFTVLGSGSSGGVPVIGCDCRACG-----GNRLRVDCGVGV-------KTLLIDAGPDLRDQGLRLGVSDLDAILL 68 (269) T ss_pred CCCCEEEEEEEECCCCEEEECCCCCCCC-----CCEEEEEEECCC-------EEEEEECCHHHHHHHHHHCCCCCCEEEE T ss_conf 9852999999834898035258865666-----870887025360-------4799942877765423306764567999 Q ss_pred ECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCEECCCCCEE Q ss_conf 74352422011102433317821110131120232100222210136677-53100123211234433342011587489 Q gi|254780214|r 82 THPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDD-RTYPPIVNPIVIENNDVPICMKSAGGVIE 160 (271) Q Consensus 82 TH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 160 (271) ||.|+||+.|+++|++++.......+..........+.+.++|.+..... ...+.................+..++.+. T Consensus 69 TH~H~DHi~Gl~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~hd~~~~~~~~~~ 148 (269) T COG1235 69 THEHSDHIQGLDDLRRAYTLPIYVNPGTLRASTSDRLLGGFPYLFRHPFPPFSLPAIGGLEVTPFPVPHDAIEPVGFVII 148 (269) T ss_pred ECCCHHHHCCHHHHHHHHCCCCEEEECEECCCCCCHHHCCCCHHHCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCEE T ss_conf 15856765788889988558845753200134321233026034317677656312373178658989855437776134 Q ss_pred EEEEEEECCCEECCC--CCCCCEEEECCCCCCHHHHHHHHCCHHEEEEEC---CCC--CCCCCCCCHHHHHHHHHHCCCC Q ss_conf 999995077621123--346862530323300167888622211025302---335--7688757999999999972997 Q gi|254780214|r 161 AIPILQQHGRISSLG--FRFGNVAYCTDVNAFPAESLEKLQNLDFLIIDA---LKN--GLHSSHFSLSESLKKIELINPK 233 (271) Q Consensus 161 i~~~~~~Hg~~~~~G--yri~~~~Y~~Dt~~~~e~~~~~l~~~D~lI~d~---l~~--~~~~~H~~~~ea~~~~~~~~~k 233 (271) .+...+-++....+| +++++++|++|+..++++.-..+...+.+..+. ... ..+++|.++++|.+++++.++| T Consensus 149 ~~~~~~~~~~~~g~~~~~~~~~vay~~Dt~~~~~~~d~~l~~~~~~~~~~~~~~~~~~gh~~~h~~~~~a~~~~~~~~~~ 228 (269) T COG1235 149 RTGRKLHGGTDIGYGLEWRIGDVAYLTDTELFPSNHDVELLDNGLYPLDIKDRILPDPGHLSNHLSAEEALELIEKLKPK 228 (269) T ss_pred ECCEEEECCCCEEEEEEEECCCEEEECCCCCCCHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCCC T ss_conf 13503405653376788761618996654336404489885176313444431258888787731399999997537866 Q ss_pred EEEEECCCCCCCHHHHHHHCCCCCEEEEECCEEEEEC Q ss_conf 7999768889888899986799819902044899626 Q gi|254780214|r 234 NAILTHMHVDLDYDMVLKSTPSRVVPAFDGMQFSSPI 270 (271) Q Consensus 234 ~~~LtHls~~~~~~~~~~~~~~~v~~A~DGm~i~l~~ 270 (271) |++|||+||.+++++..+..+..+..+++++.+..++ T Consensus 229 rivLtHls~~~~~~~~~~~~~~~v~~~~~~~~~~~~~ 265 (269) T COG1235 229 RLVLTHLSHKNDDEELPELEAEEVEPAYGGMGIDVGE 265 (269) T ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 6999964876874566677775431112520354014 No 9 >TIGR02108 PQQ_syn_pqqB coenzyme PQQ biosynthesis protein B; InterPro: IPR011842 This entry describes coenzyme PQQ biosynthesis protein B, a gene required for the biosynthesis of pyrrolo-quinoline-quinone (coenzyme PQQ). PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases. Note that this gene appears to be required for PQQ in biosynthesis in Methylobacterium extorquens (under the name pqqG) and in Klebsiella pneumoniae but that the equivalent pqqV in Acinetobacter calcoaceticus is not necessary for heterologous expression of PQQ biosynthesis in Escherichia coli. Based on this latter finding, it is suggested that PqqB might be a transporter or a PQQ-dependent enzyme rather than a PQQ biosynthesis enzyme .; GO: 0006810 transport, 0018189 pyrroloquinoline quinone biosynthetic process. Probab=100.00 E-value=5.6e-40 Score=268.11 Aligned_cols=251 Identities=22% Similarity=0.314 Sum_probs=188.5 Q ss_pred EEEEEEECCCCCEECCCCCCCCCCCCCCC----CCCCCCEEEEEECCCCCCCEEEEEECCCCHHHHHHH-------CC-C Q ss_conf 99998541789651337887877778767----744430699993367789718999879841566752-------23-3 Q gi|254780214|r 6 HFTILGCGASLGVPRITGDWGACDPTNPK----NRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQVLR-------EQ-V 73 (271) Q Consensus 6 ~l~~LGtg~s~g~P~~~c~c~~C~~~~~~----~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~-------~~-i 73 (271) +|.+|||++.+|+|||+|+|++|+.+|.. ..||+||+.++.+|.+| +||.++||+|+|+.. .+ + T Consensus 1 ~i~VLGSAAGGGfPQWNC~C~~C~~aR~~~~~ak~RTQsSiAvS~DG~rW----~L~NASPDiRQQi~a~paL~P~~~aL 76 (314) T TIGR02108 1 HIVVLGSAAGGGFPQWNCNCPNCRGARAGTDGAKARTQSSIAVSADGERW----VLLNASPDIRQQIQAFPALHPQRGAL 76 (314) T ss_pred CEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEECCCCCCE----EEECCCHHHHHHHHHHHHHCCCCCCC T ss_conf 95886522678877525687034687730022033541168886788735----86707964888886203215777888 Q ss_pred C--CCCEEEEECCCHHHHHHCCHHHHHHHCCC-CEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4--20599997435242201110243331782-11101311202321002222101366775310012321123443334 Q gi|254780214|r 74 L--SIDAVLYTHPHADHIHGIDGLRGYFLKQK-RPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPI 150 (271) Q Consensus 74 ~--~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~-~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~ 150 (271) + .|.+|+||-.-.|||.||..|| .+ .|..||+++.|++.|.+-++ +|..+..... ......+...+... T Consensus 77 R~TpI~~V~Lt~g~iDH~tGLL~LR-----Egq~pf~~yat~~Vlq~L~~GNp-~F~~L~~~~G--gv~r~Pial~~~~~ 148 (314) T TIGR02108 77 RDTPIEGVVLTDGEIDHVTGLLSLR-----EGQQPFTLYATEMVLQDLSDGNP-IFNVLNHWNG--GVRRQPIALDEKFE 148 (314) T ss_pred CCCCEEEEEECCCCHHHHHHHHHHC-----CCCCCCEEEECCHHHHHHHCCCC-CCCCCCCCCC--CCCCCCCCCCCCCC T ss_conf 6576047882066224564123330-----37885148627157888754886-0212252237--65442311588867 Q ss_pred C-EE-CCCCCEEEEEEEEEC---------------C---CEECCCCCC------CCEEEECCCCCCHHHHHHHHCCHHEE Q ss_conf 2-01-158748999999507---------------7---621123346------86253032330016788862221102 Q gi|254780214|r 151 C-MK-SAGGVIEAIPILQQH---------------G---RISSLGFRF------GNVAYCTDVNAFPAESLEKLQNLDFL 204 (271) Q Consensus 151 ~-~~-~~~~~i~i~~~~~~H---------------g---~~~~~Gyri------~~~~Y~~Dt~~~~e~~~~~l~~~D~l 204 (271) . .. ..--.++++||.++- | +.+++|..| +++.|++-+..++|++.+.+++||++ T Consensus 149 ~~~~DG~~~gL~~t~f~vpgkaPlY~e~rs~~dl~g~~~~gdT~Gl~I~D~~~~k~~~YiPGca~vt~~L~a~~~~Ad~v 228 (314) T TIGR02108 149 FKIKDGAAPGLEFTAFAVPGKAPLYSEHRSGEDLAGDPHPGDTLGLLIEDGKTGKRLFYIPGCAEVTDDLKARMAGADLV 228 (314) T ss_pred CCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCEEEEECCCCCCCHHHHHHHHCCCEE T ss_conf 43147789886273566277777578100352136875788704899975367654787035666386889898617737 Q ss_pred EEECCC--------------CCCCCCCCCHH---HHHHHHHHCCCCEEEEECCCCCC-----CHHHHHHHCCCCCEEEEE Q ss_conf 530233--------------57688757999---99999997299779997688898-----888999867998199020 Q gi|254780214|r 205 IIDALK--------------NGLHSSHFSLS---ESLKKIELINPKNAILTHMHVDL-----DYDMVLKSTPSRVVPAFD 262 (271) Q Consensus 205 I~d~l~--------------~~~~~~H~~~~---ea~~~~~~~~~k~~~LtHls~~~-----~~~~~~~~~~~~v~~A~D 262 (271) ++|++- .+..-+|+..+ .+++...++..-|-||.|||... +..|.++....+|++||| T Consensus 229 ~fDGTlW~DDEM~~~Gvg~K~G~~MGH~~~sG~gG~~~~La~~~~~RKVLiHINNTNPiL~e~S~Era~~~a~Gw~VA~D 308 (314) T TIGR02108 229 FFDGTLWRDDEMIRAGVGTKTGRRMGHVSISGEGGSLAVLADLEIARKVLIHINNTNPILDEDSPERAEVEAAGWEVAHD 308 (314) T ss_pred EECCCCCCCHHHHHCCCCCCCCCCCCCEEECCCCCHHHHHHCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHCCCEECCC T ss_conf 75474023334641147873343125165368784556530689887789862277721244674478886468602157 Q ss_pred CCEEEE Q ss_conf 448996 Q gi|254780214|r 263 GMQFSS 268 (271) Q Consensus 263 Gm~i~l 268 (271) ||+|.| T Consensus 309 Gm~i~L 314 (314) T TIGR02108 309 GMEIEL 314 (314) T ss_pred CCEECC T ss_conf 756429 No 10 >PRK02126 ribonuclease Z; Provisional Probab=100.00 E-value=3.6e-33 Score=225.84 Aligned_cols=206 Identities=17% Similarity=0.276 Sum_probs=142.0 Q ss_pred CEEEEEECCCCCCCEEEEEECCCCHHHHHHHCCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEEC Q ss_conf 06999933677897189998798415667522334205999974352422011102433317821110131120232100 Q gi|254780214|r 40 SSLKISRVSERGSNTTVIVDTGPDFYMQVLREQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLF 119 (271) Q Consensus 40 ~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~ 119 (271) +.++|... ..+..+|||||. ..||....+.+||+|||||.|.||+.||+.|...+..+.+|+.||||++..+.++ T Consensus 17 p~l~v~~~---~~~~~lLfDcGe--~~~l~~~~l~~Id~IFITHlHgDH~~GL~gLLr~~~gr~~~L~IyGP~Gl~~~v~ 91 (334) T PRK02126 17 PGLYVDFL---FERRALLFDLGD--LHHLPPRELLRISHIFVSHTHMDHFIGFDRLLRLCLGRPRRLRLFGPPGFADQVE 91 (334) T ss_pred CEEEEEEC---CCCCEEEEECCC--CCCCCHHHHCCCCEEEEECCCCCHHCCCHHHHHHHCCCCCCEEEECCCCHHHHHH T ss_conf 87999716---798189998998--4300645646547699826762212357999986279998629999824899999 Q ss_pred CCCC-CCCCCCCCCCCCCCCCCCC-----------------------CCCCCCCCCEECCCCCEEEEEEEEECCCEECCC Q ss_conf 2222-1013667753100123211-----------------------234433342011587489999995077621123 Q gi|254780214|r 120 ESFR-YCFKALDDRTYPPIVNPIV-----------------------IENNDVPICMKSAGGVIEAIPILQQHGRISSLG 175 (271) Q Consensus 120 ~~~~-y~f~~~~~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~i~i~~~~~~Hg~~~~~G 175 (271) ..+. |-++... .++....... ...........+....++|.+++++| .++|+| T Consensus 92 ~~L~~y~wnl~~--~~~~~~~v~~~e~~~~~~~~~~f~~~~~F~~~~~~~~~~~~~~i~e~~~~~V~a~~l~H-~ip~~g 168 (334) T PRK02126 92 HKLAGYTWNLVE--NYPTTFRVHEVELHDGRIRRALFRCRRAFRREAEEELSREDGVLLDEPWFRVRAAFLDH-GIPCLA 168 (334) T ss_pred HHHHHHHHCCCC--CCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCEEEEEEECCC-CCCCCE T ss_conf 986331110124--57741389999605653222112432123334322345778657857987999997479-876407 Q ss_pred CCCC-----------------------------------------------------------------------CEEEE Q ss_conf 3468-----------------------------------------------------------------------62530 Q gi|254780214|r 176 FRFG-----------------------------------------------------------------------NVAYC 184 (271) Q Consensus 176 yri~-----------------------------------------------------------------------~~~Y~ 184 (271) |+|. +++|+ T Consensus 169 Y~i~E~~~~~i~~~kl~~~Gv~~Gp~l~~LK~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~l~~~v~~~~~G~ki~y~ 248 (334) T PRK02126 169 FALEEKAHVNIDKNRLAELGLPPGPWLRELKHAVLRGEPDDTPIRVPWRDGGGERERTRPLGELKEAILRIVPGQKIGYV 248 (334) T ss_pred EEEEECCCCCCCHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHCCCCCCCCEEEEE T ss_conf 99997887671899999769998738999976553146688458842255665554335402322201457899789999 Q ss_pred CCCCCCHH---HHHHHHCCHHEEEEECCCCCC------CCCCCCHHHHHHHHHHCCCCEEEEECCCCCCC--HHHHHHHC Q ss_conf 32330016---788862221102530233576------88757999999999972997799976888988--88999867 Q gi|254780214|r 185 TDVNAFPA---ESLEKLQNLDFLIIDALKNGL------HSSHFSLSESLKKIELINPKNAILTHMHVDLD--YDMVLKST 253 (271) Q Consensus 185 ~Dt~~~~e---~~~~~l~~~D~lI~d~l~~~~------~~~H~~~~ea~~~~~~~~~k~~~LtHls~~~~--~~~~~~~~ 253 (271) |||..-++ .++++++|+|+||+||+.... ..+|+|..||+++|++++||+++|||+|+... .+++.++. T Consensus 249 tDT~~~~~n~~~i~~~a~~aDlLvhEaTF~~~d~~~A~~~~HsTa~qAa~iAk~a~vk~LiLtHfS~RY~~~~~~l~~Ea 328 (334) T PRK02126 249 TDIGYTPENLARIVELAAGVDLLFIEAVFLDEDAEKARRKNHLTARQAGRLAREAGVRRLVPFHFSPRYQGRPAELYREA 328 (334) T ss_pred EECCCCCHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHH T ss_conf 76788811789999986479899996567622798886559985999999999859998998750634589869999999 No 11 >KOG2121 consensus Probab=99.98 E-value=7.5e-34 Score=230.05 Aligned_cols=240 Identities=22% Similarity=0.265 Sum_probs=162.5 Q ss_pred CEEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHHHHHHC-C-------CC Q ss_conf 725999985417896513378878777787677444306999933677897189998798415667522-3-------34 Q gi|254780214|r 3 DYYHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQVLRE-Q-------VL 74 (271) Q Consensus 3 ~~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~-~-------i~ 74 (271) +.|+|.|||||++. |. .+|+-+|+++..... .+||+|||++...|+.|. + ++ T Consensus 441 ~~~eIi~LGTGSai--Ps--------------kyRNVSS~lv~i~~~----~~IlLDCGEgTlgql~R~YG~~~~~~~lr 500 (746) T KOG2121 441 KDPEIIFLGTGSAI--PS--------------KYRNVSSILVRIDSD----DSILLDCGEGTLGQLVRHYGVENVDTALR 500 (746) T ss_pred CCCEEEEECCCCCC--CC--------------CCCCEEEEEEECCCC----CCEEEECCCCHHHHHHHHHHHCCHHHHHH T ss_conf 88479996577667--77--------------653137999702677----53786448844878889861301678987 Q ss_pred CCCEEEEECCCHHHHHHCCHH-HHHHH--C--CCCEEEEEECCEEEEEECCCC--CCC-CCCCCCCCCCCCCCC-CCCC- Q ss_conf 205999974352422011102-43331--7--821110131120232100222--210-136677531001232-1123- Q gi|254780214|r 75 SIDAVLYTHPHADHIHGIDGL-RGYFL--K--QKRPIDVYAAPDCMKHLFESF--RYC-FKALDDRTYPPIVNP-IVIE- 144 (271) Q Consensus 75 ~Id~I~iTH~H~DHi~Gl~~L-r~~~~--~--~~~~i~vy~~~~~~~~l~~~~--~y~-f~~~~~~~~~~~~~~-~~~~- 144 (271) ++.+|||||.|+||..||..+ +.|.. . ...|+-|.+|+...+.++..- .+. ...+.....+..+.. ...+ T Consensus 501 ~LraI~ISHlHADHh~Gl~~vL~~r~k~~k~~~~~pl~vv~P~ql~~wl~~y~~~~~~~~~~~~~i~~~g~lf~~~s~~s 580 (746) T KOG2121 501 KLRAIFISHLHADHHLGLISVLQARTKLLKGVENSPLLVVAPRQLKKWLQEYHRCPSFPASSVAKIGAPGALFAQKSPDS 580 (746) T ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCCEEEECHHHHHHHHHHHHCCCCCCHHHHHHHCCCHHHHHCCCCCC T ss_conf 58999998620000114789999999850456668149968199999999973175431226654068501321138654 Q ss_pred CCC-CCCCEECCCCCEEEEEEEEECCCEECCCCCC-----CCEEEECCCCCCHHHHHHHHCCHHEEEEECCCCC------ Q ss_conf 443-3342011587489999995077621123346-----8625303233001678886222110253023357------ Q gi|254780214|r 145 NND-VPICMKSAGGVIEAIPILQQHGRISSLGFRF-----GNVAYCTDVNAFPAESLEKLQNLDFLIIDALKNG------ 212 (271) Q Consensus 145 ~~~-~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri-----~~~~Y~~Dt~~~~e~~~~~l~~~D~lI~d~l~~~------ 212 (271) ..+ .....-...+...+...++.|++ .++|-++ .|++|+|||+. ++...+..+|+|+||+||+.+. T Consensus 581 ~~~~~~~~~l~~~~l~~i~tc~viHCp-~syg~~i~~~~~~Ki~YSGDTrP-~~~~v~~g~datlLIHEAT~ED~l~EeA 658 (746) T KOG2121 581 VPERLLSYLLRELGLESIQTCPVIHCP-QSYGCSITHGSGWKIVYSGDTRP-CEDLVKAGKDATLLIHEATLEDDLEEEA 658 (746) T ss_pred CCHHHHHHHHHHCCCEEEEECCCEECC-HHHCEEEECCCCEEEEECCCCCC-CHHHHHHCCCCCEEEEEHHHCHHHHHHH T ss_conf 420245678775485268864717547-10053586166248998378987-5667653268716876001010478999 Q ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHCC-----CCCEEEEECCEEEE Q ss_conf 688757999999999972997799976888988889998679-----98199020448996 Q gi|254780214|r 213 LHSSHFSLSESLKKIELINPKNAILTHMHVDLDYDMVLKSTP-----SRVVPAFDGMQFSS 268 (271) Q Consensus 213 ~~~~H~~~~ea~~~~~~~~~k~~~LtHls~~~~~~~~~~~~~-----~~v~~A~DGm~i~l 268 (271) ..++|+|.+||+++++++++|+++|||+|+... ++- ++ .++-+|+|+|.|.. T Consensus 659 v~k~HST~sEAi~V~~~m~ar~liLTHFSQRY~--K~p--l~~d~~~~~~~~afd~m~v~~ 715 (746) T KOG2121 659 VEKGHSTTSEAISVAKKMNAKRLILTHFSQRYP--KVP--LPSDGEMDPVCVAFDKMAVSV 715 (746) T ss_pred HHHCCCCHHHHHHHHHHCCCHHHHHHHHHCCCC--CCC--CCCCCCCCHHHHHHHCCEEEC T ss_conf 982899799999999863515344543100168--898--798323106787641014420 No 12 >TIGR02649 true_RNase_BN ribonuclease BN; InterPro: IPR013469 Proteins in this entry include ribonuclease BN of Escherichia coli (strain K-12) and closely related proteins believed to be equivalent in function. E. coli appears to lack RNase Z, and this ribonuclease appears to be orthologous to (but not functionally equivalent to) RNase Z of Bacillus subtilis and various other species. ; GO: 0016891 endoribonuclease activity producing 5'-phosphomonoesters, 0042779 removal of tRNA 3'-trailer sequence. Probab=99.95 E-value=1.3e-27 Score=191.23 Aligned_cols=231 Identities=19% Similarity=0.270 Sum_probs=171.0 Q ss_pred EEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHHHHHHC--CCCCCCEEEEEC Q ss_conf 999985417896513378878777787677444306999933677897189998798415667522--334205999974 Q gi|254780214|r 6 HFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQVLRE--QVLSIDAVLYTH 83 (271) Q Consensus 6 ~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~--~i~~Id~I~iTH 83 (271) +|+||||. .|||. |. |+-+.+++...... ...-+|+|||++.++||+.. ++.++|.||||| T Consensus 1 eliflGts--aGvPt-----------r~---rnvtaill~lqhPt-q~G~WlfdCGeGtqhq~l~taf~PGkl~~ifish 63 (304) T TIGR02649 1 ELIFLGTS--AGVPT-----------RS---RNVTAILLNLQHPT-QSGLWLFDCGEGTQHQLLNTAFNPGKLEKIFISH 63 (304) T ss_pred CEEEEECC--CCCCC-----------CC---HHHHHHEEECCCCC-CCCEEEEECCCCHHHHHHHHHCCCCCCEEEEEEE T ss_conf 94774215--68875-----------51---01200001004877-6740688768752678887414887500357542 Q ss_pred CCHHHHHHCCHH---HHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCEE Q ss_conf 352422011102---43331782111013112023210022221013667753100123211234433342011587489 Q gi|254780214|r 84 PHADHIHGIDGL---RGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICMKSAGGVIE 160 (271) Q Consensus 84 ~H~DHi~Gl~~L---r~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 160 (271) .|.||..|||.| |.+ .....|+.||||.++.+++...... -..|.. ..++..+..-+.....+--+ T Consensus 64 lhGdhlfGlPGllCsrsm-~G~~~Plt~yGPqG~ref~~t~lr~-sGsWtd---------~Pl~i~ei~aG~ilddGlrk 132 (304) T TIGR02649 64 LHGDHLFGLPGLLCSRSM-AGIIQPLTVYGPQGVREFVETALRL-SGSWTD---------YPLEIVEISAGEILDDGLRK 132 (304) T ss_pred ECCCCCCCCCHHHHHHHH-CCCCCCEEEECCCHHHHHHHHHHHH-CCCCCC---------CCEEEEEECCCCHHHHCCHH T ss_conf 236500155313443331-3777640675662168898766421-111013---------65157862254112200010 Q ss_pred EEEEEEECCCEECCCCCCC------------------------------------------------------CEEEECC Q ss_conf 9999950776211233468------------------------------------------------------6253032 Q gi|254780214|r 161 AIPILQQHGRISSLGFRFG------------------------------------------------------NVAYCTD 186 (271) Q Consensus 161 i~~~~~~Hg~~~~~Gyri~------------------------------------------------------~~~Y~~D 186 (271) +++++..| +.+|+|||+. .++.-+| T Consensus 133 vta~~leh-PleCyGyr~~ehdkPGalna~alkaaGv~PGPlfq~lkaGkt~tl~dGrqinGadyla~~v~Gk~~aifGd 211 (304) T TIGR02649 133 VTAYRLEH-PLECYGYRVVEHDKPGALNARALKAAGVTPGPLFQELKAGKTVTLEDGRQINGADYLAAAVAGKALAIFGD 211 (304) T ss_pred EEEECCCC-CCEECCEEEEECCCCCCHHHHHHHHCCCCCCHHHHHHHCCCEEEECCCCEECCCHHHCCCCCCCEEEEECC T ss_conf 01201468-61001106775178750224777645899851467653486577226742033011012468755898527 Q ss_pred CCCCHHHHHHHHCCHHEEEEECCCC------CCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHHHH-------HHHC Q ss_conf 3300167888622211025302335------76887579999999999729977999768889888899-------9867 Q gi|254780214|r 187 VNAFPAESLEKLQNLDFLIIDALKN------GLHSSHFSLSESLKKIELINPKNAILTHMHVDLDYDMV-------LKST 253 (271) Q Consensus 187 t~~~~e~~~~~l~~~D~lI~d~l~~------~~~~~H~~~~ea~~~~~~~~~k~~~LtHls~~~~~~~~-------~~~~ 253 (271) |.. ++..++.++++|++|+|++-. ...++|++..++..+++++.+.|+++||+|-..|-+-. .... T Consensus 212 t~P-C~aal~la~Gvdvmvhe~tld~~me~kan~rGhsstrq~atlarea~vG~li~th~ssryddkGCq~ll~eCr~if 290 (304) T TIGR02649 212 TAP-CEAALDLAKGVDVMVHEATLDISMEEKANARGHSSTRQAATLAREAAVGKLIITHVSSRYDDKGCQRLLAECRAIF 290 (304) T ss_pred CCH-HHHHHHHHCCCEEEEECHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHHHHHHHHHHC T ss_conf 750-5899997568447885002313466542015652024566665542024044542000003256899998765313 Q ss_pred CCCCEEEEECCEEE Q ss_conf 99819902044899 Q gi|254780214|r 254 PSRVVPAFDGMQFS 267 (271) Q Consensus 254 ~~~v~~A~DGm~i~ 267 (271) | ..++|||--.++ T Consensus 291 P-atela~df~vf~ 303 (304) T TIGR02649 291 P-ATELAYDFSVFN 303 (304) T ss_pred C-HHHHHCCCEEEC T ss_conf 3-124320212313 No 13 >COG0595 mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair] Probab=99.67 E-value=1.1e-15 Score=117.17 Aligned_cols=168 Identities=25% Similarity=0.318 Sum_probs=102.1 Q ss_pred CEEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHH-------------HHHH Q ss_conf 725999985417896513378878777787677444306999933677897189998798415-------------6675 Q gi|254780214|r 3 DYYHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFY-------------MQVL 69 (271) Q Consensus 3 ~~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r-------------~ql~ 69 (271) ..|++..|| |+=-+|-|| ..++. ++.-+++|||--+. ..-+ T Consensus 7 ~~i~i~~lG-----G~~EiGkN~----------------~vve~-----~~~i~i~D~G~~fp~~~~~gvDliIPd~~yl 60 (555) T COG0595 7 AKIKIFALG-----GVGEIGKNM----------------YVVEY-----GDDIIILDAGLKFPEDDLLGVDLIIPDFSYL 60 (555) T ss_pred CCEEEEEEC-----CHHHHCCCE----------------EEEEE-----CCCEEEEECCCCCCCCCCCCCCEEECCHHHH T ss_conf 855899805-----810222623----------------89997-----8968999776566864355555882582774 Q ss_pred HCCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 22334205999974352422011102433317821110131120232100222210136677531001232112344333 Q gi|254780214|r 70 REQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVP 149 (271) Q Consensus 70 ~~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~ 149 (271) +.+..+|++|||||.|.||++|+| +.+.+...+|||+++-+.+.++..... . .... ...+..+.+ T Consensus 61 ~~n~~kvkgI~lTHgHeDHIGaip----~ll~~~~~~piy~s~lt~~Li~~k~~~---~---~~~~-----~~~~~~ev~ 125 (555) T COG0595 61 EENKDKVKGIFLTHGHEDHIGALP----YLLKQVLFAPIYASPLTAALIKEKLKE---H---GLFK-----NENELHEVK 125 (555) T ss_pred HHCCCCCEEEEECCCCHHHCCCHH----HHHHCCCCCCEECCHHHHHHHHHHHHH---H---CCCC-----CCCCEEEEC T ss_conf 315002139994687444323467----898527768444477669999989887---3---2113-----567439947 Q ss_pred CCEECCCCCEEEEEEEEECCCEECCCCCC----CCEEEECCCCCCHH--------H--HHH-HH-CCHHEEEEECCCCC Q ss_conf 42011587489999995077621123346----86253032330016--------7--888-62-22110253023357 Q gi|254780214|r 150 ICMKSAGGVIEAIPILQQHGRISSLGFRF----GNVAYCTDVNAFPA--------E--SLE-KL-QNLDFLIIDALKNG 212 (271) Q Consensus 150 ~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri----~~~~Y~~Dt~~~~e--------~--~~~-~l-~~~D~lI~d~l~~~ 212 (271) .......+.++++++++.|.-.+++||.+ +.++|++|.+ +.. + .+. .. +|+.+||+|+++-. T Consensus 126 ~~~~i~~~~~~v~f~~vtHSIPds~g~~i~Tp~G~Iv~TGDFk-~d~~~~~g~~~d~~r~~~~g~eGVl~LisdsTna~ 203 (555) T COG0595 126 PGSEIKFGSFEVEFFPVTHSIPDSLGIVIKTPEGNIVYTGDFK-FDPTPVDGEPTDLARLAEIGKEGVLALISDSTNAE 203 (555) T ss_pred CCCEEEECCEEEEEEEECCCCCCCEEEEEECCCCCEEEECCEE-ECCCCCCCCCCCHHHHHHHCCCCCEEEEECCCCCC T ss_conf 9985776767999996035574516899987984389968778-55886777767889999746678479995786568 No 14 >TIGR03675 arCOG00543 arCOG01782 universal archaeal KH-domain/beta-lactamase-domain protein. This family of proteins is universal in the archaea and consistsof an N-terminal type-1 KH-domain (pfam00013) a central beta-lactamase-domain (pfam00753) with a C-terminal motif associated with RNA metabolism (pfam07521). KH-domains are associated with RNA-binding, so taken together, this protein is a likely metal-dependent RNAase. This family was defined in as arCOG01782. Probab=99.67 E-value=1.6e-16 Score=122.38 Aligned_cols=178 Identities=21% Similarity=0.215 Sum_probs=100.1 Q ss_pred CCEEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHH---------HHHHCC Q ss_conf 87259999854178965133788787777876774443069999336778971899987984156---------675223 Q gi|254780214|r 2 KDYYHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYM---------QVLREQ 72 (271) Q Consensus 2 ~~~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~---------ql~~~~ 72 (271) ++..++|+||.|.-- .||+++++. .+++||+|||-..-. ..-... T Consensus 172 ~~~vrit~LG~~~eV---------------------GRSc~l~~~-----~~~~illDcG~~~~~~~~~~~P~~d~~e~~ 225 (630) T TIGR03675 172 DRWVRVTALGGFREV---------------------GRSALLLST-----PESRILLDCGVNVGANGDNAYPYLDVPEFQ 225 (630) T ss_pred CCEEEEEECCCCCCC---------------------CCEEEEEEC-----CCEEEEEECCCCCCCCCCCCCCCCCCCCCC T ss_conf 974999834676702---------------------645899963-----981799988888777775458865677798 Q ss_pred CCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCC-CCCCCCCCCCC-CCCCCCCC---CC-CCC Q ss_conf 34205999974352422011102433317821110131120232100222-21013667753-10012321---12-344 Q gi|254780214|r 73 VLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESF-RYCFKALDDRT-YPPIVNPI---VI-ENN 146 (271) Q Consensus 73 i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~-~y~f~~~~~~~-~~~~~~~~---~~-~~~ 146 (271) ++.||||+|||+|.||++-||-| +..+..-|||+++.|.+-..-.. +|+ +...... .++..... .+ ... T Consensus 226 ~~~IDAVvlTHAHLDH~G~LP~L----~k~Gf~GpIy~T~pT~dL~~llL~D~~-ki~~~e~~~plYt~~dv~~~l~~~~ 300 (630) T TIGR03675 226 LDELDAVVITHAHLDHSGLVPLL----FKYGYDGPVYCTPPTRDLMTLLQLDYI-DVAQREGKKPPYSSKDVREALKHTI 300 (630) T ss_pred CCCCCEEEEECCCCCCCCCCCHH----HHCCCCCCEEECHHHHHHHHHHHHHHH-HHHHCCCCCCCCCHHHHHHHHHCCE T ss_conf 01185899833532125632188----764888988865459999999999899-9987058999989999999984456 Q ss_pred CCCCCEECCC-CCEEEEEEEEECCCEECCC-CCCC----CEEEECCCCCCHHHHHHHH----CCHHEEEEECCC Q ss_conf 3334201158-7489999995077621123-3468----6253032330016788862----221102530233 Q gi|254780214|r 147 DVPICMKSAG-GVIEAIPILQQHGRISSLG-FRFG----NVAYCTDVNAFPAESLEKL----QNLDFLIIDALK 210 (271) Q Consensus 147 ~~~~~~~~~~-~~i~i~~~~~~Hg~~~~~G-yri~----~~~Y~~Dt~~~~e~~~~~l----~~~D~lI~d~l~ 210 (271) ..+....... .+++++....-|--..+.- +.++ +++|+||.+.-++.+++-+ ..+|+||+|++. T Consensus 301 ~l~y~~~~~i~~~ik~tf~~AGHILGSAmv~l~i~~g~~~ilfTGD~~~~~~rll~~A~~~~~~~D~LI~ESTY 374 (630) T TIGR03675 301 TLDYGEVTDIAPDIKLTFYNAGHILGSAIAHLHIGDGLYNIVYTGDFKYEKTRLLDPAVNKFPRVETLIMESTY 374 (630) T ss_pred ECCCCCEEECCCCEEEEEECCCCCCEEEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCEEEEECCC T ss_conf 24899859717984999976886104479999976896799995998999776768756787777789997435 No 15 >PRK00685 metal-dependent hydrolase; Provisional Probab=99.65 E-value=9e-15 Score=111.43 Aligned_cols=200 Identities=18% Similarity=0.203 Sum_probs=123.2 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEEC----CCCHHHHHHHCCCCCCCEEE Q ss_conf 59999854178965133788787777876774443069999336778971899987----98415667522334205999 Q gi|254780214|r 5 YHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDT----GPDFYMQVLREQVLSIDAVL 80 (271) Q Consensus 5 m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~----G~d~r~ql~~~~i~~Id~I~ 80 (271) ||||+||- +|.+|+. ++.+||+|. +|... ++...-++|.|+ T Consensus 1 m~i~wlGh---------------------------a~~li~~-----~g~~iLiDP~~s~~~~~~---~~~~~~~~D~Vl 45 (228) T PRK00685 1 MKITWLGH---------------------------SAFRIET-----GGKKILIDPFLTGNPKAD---LKPEDVKVDHIL 45 (228) T ss_pred CEEEEECC---------------------------CEEEEEE-----CCEEEEECCCCCCCCCCC---CCCCCCCCCEEE T ss_conf 98999405---------------------------6999998-----999999888668998778---885446799999 Q ss_pred EECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCEE Q ss_conf 97435242201110243331782111013112023210022221013667753100123211234433342011587489 Q gi|254780214|r 81 YTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICMKSAGGVIE 160 (271) Q Consensus 81 iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 160 (271) +||.|.||.. +... ...+...++.+|.+....+.... . + ..+...+.....++++ T Consensus 46 iSH~H~DHl~---d~~~--~~~~~~~~~i~p~~~~~~~~~~g--------~-------~-----~~~~~~~~~~~~~gi~ 100 (228) T PRK00685 46 LTHGHGDHVG---DTVE--IAKRTGAVVVANFELATYLGWQG--------V-------K-----THPMNIGGTVEFDGGK 100 (228) T ss_pred ECCCCCCCCC---CHHH--HHHHCCCEEEECHHHHHHHHHCC--------C-------C-----EEEECCCCEEEECCEE T ss_conf 6688834356---5888--87515998996368999998659--------8-------1-----8993278749978889 Q ss_pred EEEEEEECCCE------------ECCCCCCC----CEEEECCCCCCHH-HHHHHHCCHHEEEEECCCCCCCCCCCCHHHH Q ss_conf 99999507762------------11233468----6253032330016-7888622211025302335768875799999 Q gi|254780214|r 161 AIPILQQHGRI------------SSLGFRFG----NVAYCTDVNAFPA-ESLEKLQNLDFLIIDALKNGLHSSHFSLSES 223 (271) Q Consensus 161 i~~~~~~Hg~~------------~~~Gyri~----~~~Y~~Dt~~~~e-~~~~~l~~~D~lI~d~l~~~~~~~H~~~~ea 223 (271) ++.++..|++. ...||.++ ++.++|||.++++ +.+....+.|++++.. + ...||+.+|| T Consensus 101 i~~vpa~H~~~~~~~~~~~~~~g~~~g~vi~~~g~~iy~~GDT~~~~~~~~i~~~~~pDlallpi---G-g~~~m~p~eA 176 (228) T PRK00685 101 VKFTQALHSSSYITEDGVSIYLGNPAGIVFTFEGKTIYHAGDTDLFSDMKLIGELHQPDIALLPI---G-DRFTMGPADA 176 (228) T ss_pred EEEEECCCCCCCCCCCCCCEECCCCEEEEEEECCEEEEEECCCCCCHHHHHHHHHCCCCEEEEEC---C-CCCCCCHHHH T ss_conf 99996654666557788626877877999997993899833766655799977655996999918---9-9610599999 Q ss_pred HHHHHHCCCCEEEEECCCCC----CCHHHHHHHC---CCCCEEEEECCEEEE Q ss_conf 99999729977999768889----8888999867---998199020448996 Q gi|254780214|r 224 LKKIELINPKNAILTHMHVD----LDYDMVLKST---PSRVVPAFDGMQFSS 268 (271) Q Consensus 224 ~~~~~~~~~k~~~LtHls~~----~~~~~~~~~~---~~~v~~A~DGm~i~l 268 (271) +++++.++||.++-+|...- -+-+++++.+ ..+|.+=..|-+|+| T Consensus 177 ~~~~~~l~~k~~IP~H~gtf~~~~~~p~~~~~~~~~~~~~v~i~~~Ge~~~l 228 (228) T PRK00685 177 ALAAEWLKPKTVVPMHYNTFPVIEQDPEKFVAGLEGVKTTVVILEPGESIEL 228 (228) T ss_pred HHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHHHCCCCEEEECCCCCEEEC T ss_conf 9999973999999818888886537999999987524987997699998869 No 16 >TIGR02650 RNase_Z_T_toga ribonuclease Z, Thermotoga type; InterPro: IPR013470 All tRNA molecules are synthesised as long precursors with extra sequence at both the 5' and 3' ends, which undergo several processing steps to form the mature tRNA molecule. In many bacteria the tRNA precursors do not contain the CCA sequence which forms the 3' end of the mature tRNA and is essential for aminoacylation. In these organisms ribonucleotide Z cleaves the precursor to form a 3' end to which CCA is added. Proteins in this entry, by contrast are ribonuclease Z enzymes with unusual activity. Organisms containing these proteins include the CCA sequence in the tRNA precursor and ribonuclease Z cleaves the molecule immediately after this sequence producing a mature 3' end. The sequence of these proteins differs considerably from the majority of ribonuclease Z sequences.. Probab=99.58 E-value=3.1e-15 Score=114.34 Aligned_cols=209 Identities=19% Similarity=0.257 Sum_probs=134.3 Q ss_pred CEEEEEECCCCCCCEEEEEECCCCHHHHHHHCCCCCCCEEEEECCCHHHHHHCCHH---HHHHH-CCCCEEEEEECCEEE Q ss_conf 06999933677897189998798415667522334205999974352422011102---43331-782111013112023 Q gi|254780214|r 40 SSLKISRVSERGSNTTVIVDTGPDFYMQVLREQVLSIDAVLYTHPHADHIHGIDGL---RGYFL-KQKRPIDVYAAPDCM 115 (271) Q Consensus 40 ~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~~i~~Id~I~iTH~H~DHi~Gl~~L---r~~~~-~~~~~i~vy~~~~~~ 115 (271) .|.+|-+. -.+||+|+|++.-.. +..++-....||+||.|.||++||-.+ |---+ .+.+|++||-|++-. T Consensus 10 fstWiyys-----PerilfdaGeGvstt-lGskvyafkyvflthGhvdhiaGlWGvvnirnnGmGdrekPldvfyP~Gnr 83 (277) T TIGR02650 10 FSTWIYYS-----PERILFDAGEGVSTT-LGSKVYAFKYVFLTHGHVDHIAGLWGVVNIRNNGMGDREKPLDVFYPKGNR 83 (277) T ss_pred HHHHHEEC-----CCEEEEECCCCCCHH-HCCEEEEEEEEEEECCCHHHHCCEEEEEEEECCCCCCCCCCCEEECCCCCH T ss_conf 55231035-----431254357850011-111012234554323621221000236775217888645752223588650 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEEECC-CEECCCCCC---------------- Q ss_conf 210022221013667753100123211234433342011587489999995077-621123346---------------- Q gi|254780214|r 116 KHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICMKSAGGVIEAIPILQQHG-RISSLGFRF---------------- 178 (271) Q Consensus 116 ~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~Hg-~~~~~Gyri---------------- 178 (271) + +++.-.++-+ ..+...-..+..+++..+--......+..--+.|++..|- ..-++||-| T Consensus 84 a-veeyt~fik~--anP~lrfsfnvhPl~eG~rvflr~aGGfkryvqPfrtkhv~~evsfGyhifevrrklk~efqGlds 160 (277) T TIGR02650 84 A-VEEYTEFIKK--ANPELRFSFNVHPLKEGERVFLRDAGGFKRYVQPFRTKHVASEVSFGYHIFEVRRKLKKEFQGLDS 160 (277) T ss_pred H-HHHHHHHHHH--CCCCCEEEEECCCCCCCCEEEEECCCCCCCCCCCCCCEEEEEEEECCHHHHHHHHHHHHHHCCCCH T ss_conf 4-8888999863--386633665313134687789844787200125300000100000130244677888886236405 Q ss_pred --------------------CC-EEEECCCCCCHHHHHHHHCCHHEEEEECC----CCCCCCCCCCHHHHHHHHHHCCCC Q ss_conf --------------------86-25303233001678886222110253023----357688757999999999972997 Q gi|254780214|r 179 --------------------GN-VAYCTDVNAFPAESLEKLQNLDFLIIDAL----KNGLHSSHFSLSESLKKIELINPK 233 (271) Q Consensus 179 --------------------~~-~~Y~~Dt~~~~e~~~~~l~~~D~lI~d~l----~~~~~~~H~~~~ea~~~~~~~~~k 233 (271) ++ +..++|.-.++++. ++|.++||+||+ +...+.+|.+++|+++-.+++++| T Consensus 161 k~i~rlvkekGrdfvt~eyhkk~ltisGdslaldPee---~~GtelliheCtfld~rdrryknha~idevme~vk~aGvk 237 (277) T TIGR02650 161 KVIARLVKEKGRDFVTREYHKKVLTISGDSLALDPEE---VRGTELLIHECTFLDARDRRYKNHAAIDEVMESVKKAGVK 237 (277) T ss_pred HHHHHHHHHCCCHHHHHHHCCEEEEEECCCCCCCCCC---CCCCEEEEEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 8888888640520132221223567515500037020---2541456531000261121112122488999998734640 Q ss_pred EEEEECCCCCCC--H----HHHHHHCCCCCEEEE Q ss_conf 799976888988--8----899986799819902 Q gi|254780214|r 234 NAILTHMHVDLD--Y----DMVLKSTPSRVVPAF 261 (271) Q Consensus 234 ~~~LtHls~~~~--~----~~~~~~~~~~v~~A~ 261 (271) +++|.|+|...= - .+..+.+| .|++.| T Consensus 238 ~vilyhistryir~~~svikkyre~~P-dvei~y 270 (277) T TIGR02650 238 KVILYHISTRYIRSLKSVIKKYREELP-DVEILY 270 (277) T ss_pred EEEEEEHHHHHHHHHHHHHHHHHHHCC-CEEEEE T ss_conf 777641116889999999999876268-616887 No 17 >COG1236 YSH1 Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis] Probab=99.49 E-value=1.2e-13 Score=104.44 Aligned_cols=187 Identities=21% Similarity=0.185 Sum_probs=101.0 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHHH--HHHC--CCCCCCEEE Q ss_conf 599998541789651337887877778767744430699993367789718999879841566--7522--334205999 Q gi|254780214|r 5 YHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQ--VLRE--QVLSIDAVL 80 (271) Q Consensus 5 m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~q--l~~~--~i~~Id~I~ 80 (271) |++.|+|.+..- .|++.+++. ++.+||+|||...-.+ .... ...++|+++ T Consensus 1 ~~~~~~g~~~ev---------------------g~s~~~l~~-----~~~~il~D~G~~~~~~~~~~p~~~~~~~vDavl 54 (427) T COG1236 1 MTLRFLGAAREV---------------------GRSCVLLET-----GGTRILLDCGLFPGDPSPERPLLPPFPKVDAVL 54 (427) T ss_pred CCEECCCCCCCC---------------------CCEEEEEEE-----CCCEEEECCCCCCCCCCCCCCCCCCCCCCCEEE T ss_conf 914534655787---------------------754689874-----797699658988576765576788888767799 Q ss_pred EECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCC-----CCCCCCCCCCEECC Q ss_conf 9743524220111024333178211101311202321002222101366775310012321-----12344333420115 Q gi|254780214|r 81 YTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPI-----VIENNDVPICMKSA 155 (271) Q Consensus 81 iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~ 155 (271) |||+|.||++|++-+ ....-.-+||+++.|.+...-.+.-.++.... .+.+..... .......+...... T Consensus 55 lTHaHlDH~g~lp~l----~~~~~~~~v~aT~~T~~l~~~~l~d~~~~~~~-~~~~~~~~~d~~~~~~~~~~~~yg~~~~ 129 (427) T COG1236 55 LTHAHLDHIGALPYL----VRNGFEGPVYATPPTAALLKVLLGDSLKLAEG-PDKPPYSEEDVERVPDLIRPLPYGEPVE 129 (427) T ss_pred EECCCHHHHCCCHHH----HHHCCCCCEEECHHHHHHHHHHHHCCCCCCCC-CCCCCCCHHHHHHHHHCEEEECCCCEEE T ss_conf 600652432346788----76312786575351899999873111010357-7777654345665552227705898178 Q ss_pred CCCEEEEEEEEECCCEECCCCCC----CCEEEECCCCCCHHHHHHH---HCCHHEEEEECCCCCCCCCCCCHHHHHH Q ss_conf 87489999995077621123346----8625303233001678886---2221102530233576887579999999 Q gi|254780214|r 156 GGVIEAIPILQQHGRISSLGFRF----GNVAYCTDVNAFPAESLEK---LQNLDFLIIDALKNGLHSSHFSLSESLK 225 (271) Q Consensus 156 ~~~i~i~~~~~~Hg~~~~~Gyri----~~~~Y~~Dt~~~~e~~~~~---l~~~D~lI~d~l~~~~~~~H~~~~ea~~ 225 (271) ..++++++++.-|-. .+-+|.+ ++++|++|.+...+..+.- ...+|+||+|++.-. ..|.+.+++.+ T Consensus 130 v~~~~v~~~~AGHil-Gsa~~~le~~~~~ilytGD~~~~~~~l~~~a~~~~~~DvLI~EsTYg~--~~~~~r~~~e~ 203 (427) T COG1236 130 VGGVKVTFYNAGHIL-GSAAILLEVDGGRILYTGDVKRRKDRLLNGAELPPCIDVLIVESTYGD--RLHPNRDEVER 203 (427) T ss_pred ECCEEEEEECCCCCC-CEEEEEEEECCCEEEEEECCCCCCCCCCCCCCCCCCCCEEEEECCCCC--CCCCCHHHHHH T ss_conf 652899997179741-027999996894799982367777877777677888878998065588--77888789999 No 18 >pfam02112 PDEase_II cAMP phosphodiesterases class-II. Probab=99.49 E-value=9.6e-13 Score=98.82 Aligned_cols=215 Identities=20% Similarity=0.205 Sum_probs=121.4 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHH--HHHHC----------- Q ss_conf 59999854178965133788787777876774443069999336778971899987984156--67522----------- Q gi|254780214|r 5 YHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYM--QVLRE----------- 71 (271) Q Consensus 5 m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~--ql~~~----------- 71 (271) +++++||+++. |+=+ ..+|.||...+ .+..|-+|+|..+.. +++.. T Consensus 1 F~vivLG~~GG---p~E~---------------n~ts~Lvr~~~---~~~~ialDAGt~l~gI~~~~~~~~~~~~~~~~~ 59 (323) T pfam02112 1 FATNLLGQVGG---IDEG---------------NLSSFLIEDEA---QETFIKLDAGSVLQGYPCLTVSKYLSPNVTITT 59 (323) T ss_pred CEEEEECCCCC---CCCC---------------CEEEEEEEECC---CCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCC T ss_conf 94999589889---7888---------------72489965469---886899977655888999986410477542357 Q ss_pred ---------------CC-CCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCC Q ss_conf ---------------33-42059999743524220111024333178211101311202321002222101366775310 Q gi|254780214|r 72 ---------------QV-LSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYP 135 (271) Q Consensus 72 ---------------~i-~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~ 135 (271) -+ +.|.+.||||.|.||+.||-.--|.......|-.|||.+.|.+.|++ ++|+..-+...+ T Consensus 60 p~~~~~~~p~~~a~~i~~~~I~~ylITH~HLDHIaGLvinsp~~~~~~~~K~i~gl~~ti~aLk~---hiFN~~iWPNl~ 136 (323) T pfam02112 60 PFSNFESGPYIKHTYLLFNRIKNYLITHSHLDHVCGLVINSPGFYKQMSKKTIYGLPYTINALQK---HLFNNLVWPNLP 136 (323) T ss_pred CCCCCCCCCHHHHHHHHHHHCCEEEECCCCHHHHHHHHCCCCCCCCCCCCCEEEECHHHHHHHHH---HHCCCCCCCCCC T ss_conf 76553457135678999852213993265277664543058543346799518865899999987---331674267886 Q ss_pred CCCC---CCCCCCCCC--CCCEECCCCCEEEEEEEEECCC-----EECCCCCCC------CEEEECCCCC-------CHH Q ss_conf 0123---211234433--3420115874899999950776-----211233468------6253032330-------016 Q gi|254780214|r 136 PIVN---PIVIENNDV--PICMKSAGGVIEAIPILQQHGR-----ISSLGFRFG------NVAYCTDVNA-------FPA 192 (271) Q Consensus 136 ~~~~---~~~~~~~~~--~~~~~~~~~~i~i~~~~~~Hg~-----~~~~Gyri~------~~~Y~~Dt~~-------~~e 192 (271) .... ......... ..........+.+.+++++|+. +++.+|.|. .++|.||+.. ..+ T Consensus 137 ~~g~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~fp~sH~~~~~~~~~SsAflird~~t~~~il~fGD~epDsvs~~~~~~ 216 (323) T pfam02112 137 SFGIVNLIYKVKMFDLSPGEFNKLTETTMSVVPFPVNHGGLIIKEALSTAFLFTDSVSGDSILVFGDVEPDLVESESLNL 216 (323) T ss_pred CCCCCCCCEEEEEEECCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCEEEEEEECCCCCEEEEECCCCCCCCCCCCHHH T ss_conf 67887743169988738775102467763156744537887677634407999955899789998288886335781589 Q ss_pred HHHHHH------CCHHEEEEECCCCCCC-----CCCCCHHHHHH----HHHHCC----C---CEEEEECCCCC Q ss_conf 788862------2211025302335768-----87579999999----999729----9---77999768889 Q gi|254780214|r 193 ESLEKL------QNLDFLIIDALKNGLH-----SSHFSLSESLK----KIELIN----P---KNAILTHMHVD 243 (271) Q Consensus 193 ~~~~~l------~~~D~lI~d~l~~~~~-----~~H~~~~ea~~----~~~~~~----~---k~~~LtHls~~ 243 (271) .+|+.+ +...-+++||...... .+|++..-.++ ++...+ + =++++||+=.. T Consensus 217 ~iW~~~Ap~I~~~kLk~I~IEcS~~n~~~d~~LyGHLtP~~Li~EL~~L~~~~~~~~~pL~gL~ViItHvK~~ 289 (323) T pfam02112 217 FIWSVIASLIKQNKLKAILIECSFPNEIPDNELFGHLTPRLLINELSQLETLTTSTSQPLLGLNVIVTHVKSP 289 (323) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCC T ss_conf 9999986442124447899996799999844024579849999999999987266788779984899986786 No 19 >COG1237 Metal-dependent hydrolases of the beta-lactamase superfamily II [General function prediction only] Probab=99.49 E-value=2e-12 Score=96.80 Aligned_cols=215 Identities=20% Similarity=0.215 Sum_probs=107.9 Q ss_pred CCCCEEEEEECCCCCCCEEEEEECCCCH---HH--HHHHCCCCCCCEEEEECCCHHHHHHCCHHHHHHHC-CCCEEEEEE Q ss_conf 4430699993367789718999879841---56--67522334205999974352422011102433317-821110131 Q gi|254780214|r 37 RTRSSLKISRVSERGSNTTVIVDTGPDF---YM--QVLREQVLSIDAVLYTHPHADHIHGIDGLRGYFLK-QKRPIDVYA 110 (271) Q Consensus 37 R~~~s~~i~~~~~~~~~~~iLiD~G~d~---r~--ql~~~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~-~~~~i~vy~ 110 (271) +-.-|+||+. ++++||+|+|++- .. +++..++++||+|+|||.|+||++||+ ++.. ....++||+ T Consensus 20 ~hGfS~LVE~-----~~~riLFDtG~~~~~ll~Na~~lgvd~~did~vvlSHgH~DH~GGL~----~~~~~~~~~i~v~a 90 (259) T COG1237 20 EHGFSALVED-----EGTRILFDTGTDSDVLLHNARLLGVDLRDIDAVVLSHGHYDHTGGLP----YLLEENNPGIPVYA 90 (259) T ss_pred CCCEEEEEEC-----CCEEEEEECCCCCHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCHH----HHHHCCCCCCEEEE T ss_conf 6754999975-----87179995799857888779981998134747998578731347567----68752677850786 Q ss_pred CCEEEEEECCCCCCCCCC-CCC-CCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEE----ECC------CEECCCCCC Q ss_conf 120232100222210136-677-53100123211234433342011587489999995----077------621123346 Q gi|254780214|r 111 APDCMKHLFESFRYCFKA-LDD-RTYPPIVNPIVIENNDVPICMKSAGGVIEAIPILQ----QHG------RISSLGFRF 178 (271) Q Consensus 111 ~~~~~~~l~~~~~y~f~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~----~Hg------~~~~~Gyri 178 (271) +|+..+.....+.++... ... ....-+.++..+.. .... .+.+...+++. .-| +-.++.+.- T Consensus 91 hp~af~~~~~~~~~~gi~e~~~~~~~~~~~~~~~I~~-----g~~~-~Gei~~~~~e~~~~~~dg~~D~~~de~aLi~~~ 164 (259) T COG1237 91 HPDAFKAKIEVFREIGIPELEELARLILSEEPDEIVE-----GVIT-LGEIPKVTFEKGGYFEDGEPDPVLDEQALIVET 164 (259) T ss_pred CHHHHHHHCCCCCCCCCHHHHHCCCEEECCCCCEEEC-----CEEE-ECCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEC T ss_conf 8588765130444255413443043354178826445-----8278-325676543334434567878766741799964 Q ss_pred C-CEEEECCCCC--CHHHHHHHHCCH--H--EEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHH Q ss_conf 8-6253032330--016788862221--1--0253023357688757999999999972997799976888988889998 Q gi|254780214|r 179 G-NVAYCTDVNA--FPAESLEKLQNL--D--FLIIDALKNGLHSSHFSLSESLKKIELINPKNAILTHMHVDLDYDMVLK 251 (271) Q Consensus 179 ~-~~~Y~~Dt~~--~~e~~~~~l~~~--D--~lI~d~l~~~~~~~H~~~~ea~~~~~~~~~k~~~LtHls~~~~~~~~~~ 251 (271) + .++.++-|+. + .+..+.++.. | ..|+.++..-. .+-..+.+.++..++.++++++=.|.+=......+.+ T Consensus 165 ~~GlvvItGCsH~GI-~niv~~~~~~~g~rv~~ViGGFHL~~-~~~~~l~~~~~~l~el~v~~i~pcHCTg~~a~~~l~~ 242 (259) T COG1237 165 EKGLVVITGCSHPGI-VNIVEWAKERSGDRVKAVIGGFHLIG-ASEERLEEVADYLKELGVEKIYPCHCTGEKAKRYLRR 242 (259) T ss_pred CCCEEEEECCCCCCH-HHHHHHHHHHCCCEEEEEEEEECCCC-CCHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHH T ss_conf 786399982786438-99999999860644578962102378-7688999999999855887698207788899999998 Q ss_pred HCCCCCEEEEECCEEEE Q ss_conf 67998199020448996 Q gi|254780214|r 252 STPSRVVPAFDGMQFSS 268 (271) Q Consensus 252 ~~~~~v~~A~DGm~i~l 268 (271) .++.+..-.+=|++|++ T Consensus 243 ~~~~~~~~v~~G~~ie~ 259 (259) T COG1237 243 VFGEKYEEVGVGTEIEV 259 (259) T ss_pred HCCCCEEECCCCEEEEC T ss_conf 72765022157438849 No 20 >COG1782 Predicted metal-dependent RNase, consists of a metallo-beta-lactamase domain and an RNA-binding KH domain [General function prediction only] Probab=99.40 E-value=1.2e-12 Score=98.26 Aligned_cols=189 Identities=23% Similarity=0.260 Sum_probs=99.3 Q ss_pred CEEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHH---------HHHHHCCC Q ss_conf 725999985417896513378878777787677444306999933677897189998798415---------66752233 Q gi|254780214|r 3 DYYHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFY---------MQVLREQV 73 (271) Q Consensus 3 ~~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r---------~ql~~~~i 73 (271) ...++|+||.-.- =.|||++++. .+.++|+|||-.-- .+.-.... T Consensus 179 ~wvRvt~LGg~~E---------------------VGRSa~lv~T-----~eSrVLlDcG~n~a~~~~~~~Pyl~vpE~~~ 232 (637) T COG1782 179 RWVRVTALGGFRE---------------------VGRSALLVST-----PESRVLLDCGVNVAGNGEDAFPYLDVPEFQP 232 (637) T ss_pred CEEEEEEECCCHH---------------------CCCEEEEEEC-----CCCEEEEECCCCCCCCCCCCCCCCCCCCCCC T ss_conf 4599996124112---------------------0625699736-----8833899636567788643574455610251 Q ss_pred CCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECC-CCCCC---CCCCCCCCCCC------CCCCCCC Q ss_conf 42059999743524220111024333178211101311202321002-22210---13667753100------1232112 Q gi|254780214|r 74 LSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFE-SFRYC---FKALDDRTYPP------IVNPIVI 143 (271) Q Consensus 74 ~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~-~~~y~---f~~~~~~~~~~------~~~~~~~ 143 (271) ..||||+|||+|.||.+=| |+.++-+..-|||+++.|.+.+-- ..+|+ -+......|.+ +.....+ T Consensus 233 ~~lDAViiTHAHLDH~G~l----P~LfkYgy~GPVY~T~PTRDlm~LLq~Dyi~va~keg~~ppY~~k~v~~~lkhtItl 308 (637) T COG1782 233 DELDAVIITHAHLDHCGFL----PLLFKYGYDGPVYCTPPTRDLMVLLQLDYIEVAEKEGGEPPYESKDVRKVLKHTITL 308 (637) T ss_pred CCCCEEEEEECCCCCCCCH----HHHHHCCCCCCEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHEEEEE T ss_conf 2053699850234434521----456651788874317971889999988799999964999998878999887545410 Q ss_pred CCCCCCCCEECCCCCEEEEEEEEECCCEECC-CCCCC----CEEEECCCCCCHHHHHHH----HCCHHEEEEECCCCCCC Q ss_conf 3443334201158748999999507762112-33468----625303233001678886----22211025302335768 Q gi|254780214|r 144 ENNDVPICMKSAGGVIEAIPILQQHGRISSL-GFRFG----NVAYCTDVNAFPAESLEK----LQNLDFLIIDALKNGLH 214 (271) Q Consensus 144 ~~~~~~~~~~~~~~~i~i~~~~~~Hg~~~~~-Gyri~----~~~Y~~Dt~~~~e~~~~~----l~~~D~lI~d~l~~~~~ 214 (271) +..+... . .-+++.+....-|--..+. -+-|+ +++|++|.++-...+++. ...++.||.|++.-+.. T Consensus 309 dYgevTD--I--aPDirLTf~NAGHILGSA~~HlHIGdGlyNi~yTGDfk~~~trLl~~A~n~FpRvEtlimEsTYGg~~ 384 (637) T COG1782 309 DYGEVTD--I--APDIRLTFYNAGHILGSAMAHLHIGDGLYNIVYTGDFKFEKTRLLEPANNKFPRVETLIMESTYGGRD 384 (637) T ss_pred CCCCCCC--C--CCCCEEEEECCCCHHCCEEEEEEECCCCEEEEEECCCCCCCEECCCHHHCCCCCHHEEEEEECCCCCC T ss_conf 2576244--6--87617898416410131015888648713579943444440120573333683232046530137754 Q ss_pred CCCCCHHHHHH Q ss_conf 87579999999 Q gi|254780214|r 215 SSHFSLSESLK 225 (271) Q Consensus 215 ~~H~~~~ea~~ 225 (271) .-+.+-+||.+ T Consensus 385 d~q~~R~eaE~ 395 (637) T COG1782 385 DVQPPREEAEK 395 (637) T ss_pred CCCCCHHHHHH T ss_conf 45776789999 No 21 >PRK04286 hypothetical protein; Provisional Probab=99.35 E-value=2.7e-12 Score=96.08 Aligned_cols=224 Identities=25% Similarity=0.302 Sum_probs=114.4 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCH---HH------------HHH Q ss_conf 599998541789651337887877778767744430699993367789718999879841---56------------675 Q gi|254780214|r 5 YHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDF---YM------------QVL 69 (271) Q Consensus 5 m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~---r~------------ql~ 69 (271) ||+..||+- |.|+ |. .|.+|+. ++.+||||.|-.+ |+ +.. T Consensus 1 Mki~pla~e-SLGv------------------RS-ma~~Vet-----~d~~IlIDPgvaLap~RygLpPhp~E~~~l~~~ 55 (294) T PRK04286 1 MKIIPLASE-SLGV------------------RS-MATFVET-----RDVRILIDPGVSLAPRRYGLPPHPIELERLEEV 55 (294) T ss_pred CEEEEEEEC-CCCC------------------CE-EEEEEEE-----CCEEEEECCCHHCCCCCCCCCCCHHHHHHHHHH T ss_conf 956995305-6774------------------32-5799983-----885799777021187667989980999999999 Q ss_pred HCCC----CCCCEEEEECCCHHHHHHCCHHHHHH-HCCCCEEEEEECCEEEE-----EECC---CCCCCCCCCCCCCCCC Q ss_conf 2233----42059999743524220111024333-17821110131120232-----1002---2221013667753100 Q gi|254780214|r 70 REQV----LSIDAVLYTHPHADHIHGIDGLRGYF-LKQKRPIDVYAAPDCMK-----HLFE---SFRYCFKALDDRTYPP 136 (271) Q Consensus 70 ~~~i----~~Id~I~iTH~H~DHi~Gl~~Lr~~~-~~~~~~i~vy~~~~~~~-----~l~~---~~~y~f~~~~~~~~~~ 136 (271) +..| ++-|.|+|||+|.||..-..+ ..+. .....+..||.-+-++. .++. +-.+.|... ... T Consensus 56 r~kI~~~a~~advvvISHYHyDH~~p~~~-~~Y~~~~~~~~~eiY~gK~~~~K~P~~~IN~SQr~Ra~~flk~----~~~ 130 (294) T PRK04286 56 REKILELAKKADVVTISHYHYDHHTPFYE-SVYEATSEEEYKEIYGGKIVLIKDPTENINWSQRRRAYGFLKA----VKG 130 (294) T ss_pred HHHHHHHHHHCCEEEEECCCCCCCCCCCC-CHHHCCCCCCHHHHHCCCEEEECCCHHHCCHHHHHHHHHHHHH----HHH T ss_conf 99999998429999991336667785534-3010244423588857858998194264388899999999997----652 Q ss_pred CCCCCCCCCCCCCCCEECCCCCEEEEEEEEECCCE-ECCCCCC--------CCEEEECCCC-CCHHHHHHHH--CCHHEE Q ss_conf 12321123443334201158748999999507762-1123346--------8625303233-0016788862--221102 Q gi|254780214|r 137 IVNPIVIENNDVPICMKSAGGVIEAIPILQQHGRI-SSLGFRF--------GNVAYCTDVN-AFPAESLEKL--QNLDFL 204 (271) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~Hg~~-~~~Gyri--------~~~~Y~~Dt~-~~~e~~~~~l--~~~D~l 204 (271) ....+...+- ..+.+.+..|++ +-+++||.. .-+||.+ .+++|++|+. .+.++.++++ ++.|++ T Consensus 131 --~~~~i~~aDg-~~f~~g~~~i~f-S~pvpHG~~gsklG~Vi~~~I~dg~~~~~~~SDvqG~~~~e~v~~il~~~P~iv 206 (294) T PRK04286 131 --IAKKIEYADG-KEFRFGGTVIEF-SPPVPHGAEGSKLGYVVMVRISDGDESFVFASDVQGPLNDEAVEFILEKKPTVV 206 (294) T ss_pred --HCCEEEECCC-CEEEECCEEEEE-CCCCCCCCCCCCCCEEEEEEEECCCEEEEECCCCCCCCCHHHHHHHHHCCCCEE T ss_conf --0434897588-668878888996-688778988885567999999749768998055667687899999986399989 Q ss_pred EEECCC-CCC-CCCCCCHHH----HHHHHHHCCCCEEEEEC-CCCCCCHHHHHHHCC----CCCEEEEEC Q ss_conf 530233-576-887579999----99999972997799976-888988889998679----981990204 Q gi|254780214|r 205 IIDALK-NGL-HSSHFSLSE----SLKKIELINPKNAILTH-MHVDLDYDMVLKSTP----SRVVPAFDG 263 (271) Q Consensus 205 I~d~l~-~~~-~~~H~~~~e----a~~~~~~~~~k~~~LtH-ls~~~~~~~~~~~~~----~~v~~A~DG 263 (271) |+|+-. +=. ..+.-.++. ..++++. .++.++|-| +--+..+.+..+++. .+|..|-+- T Consensus 207 ii~GPPtYL~gr~~~~~le~~i~Nl~~ii~~-~~~~lIlDHHllRD~~~re~l~~v~~~ae~~v~taAe~ 275 (294) T PRK04286 207 IIGGPPTYLLYRVSGEDLEKGIENLERIILL-RPKTLILDHHLLRDINYREKLKELAEVAESRVITAAEF 275 (294) T ss_pred EECCCCHHHHHHCCHHHHHHHHHHHHHHHHH-CCCEEEEECHHHCCCCHHHHHHHHHHHHHHCEEEHHHH T ss_conf 9679845652204676699999999999981-89879981413305268999999998765410469997 No 22 >smart00849 Lactamase_B Metallo-beta-lactamase superfamily. Apart from the beta-lactamases a number of other proteins contain this domain PUBMED:7588620. These proteins include thiolesterases, members of the glyoxalase II family, that catalyse the hydrolysis of S-D-lactoyl-glutathione to form glutathione and D-lactic acid and a competence protein that is essential for natural transformation in Neisseria gonorrhoeae and could be a transporter involved in DNA uptake. Except for the competence protein these proteins bind two zinc ions per molecule as cofactor. Probab=99.30 E-value=1.5e-11 Score=91.39 Aligned_cols=135 Identities=21% Similarity=0.233 Sum_probs=76.1 Q ss_pred CCCCEEEEEECCCCCCCEEEEEECCCCHHHH----HHHCCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECC Q ss_conf 4430699993367789718999879841566----752233420599997435242201110243331782111013112 Q gi|254780214|r 37 RTRSSLKISRVSERGSNTTVIVDTGPDFYMQ----VLREQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAP 112 (271) Q Consensus 37 R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~q----l~~~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~ 112 (271) +...|.+|+. ++..+|||||++...+ +.+.+.++|++|++||.|.||++|+..+. .+...+||+++ T Consensus 4 ~~~n~~li~~-----~~~~vliD~G~~~~~~~~~~l~~~~~~~i~~vv~TH~H~DH~gg~~~~~-----~~~~~~i~~~~ 73 (183) T smart00849 4 VGVNSYLVEG-----DGGAILIDTGPGEAEDLLAELKKLGPKDIDAIILTHGHPDHIGGLPELL-----EAPGAPVYAPE 73 (183) T ss_pred CCCEEEEEEE-----CCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEECCCCCCHHCHHHHHH-----HHCCCEEEECH T ss_conf 6418999998-----9989999298980999999999659988749998999940033099998-----73299899879 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEEECCCEECCCCCC--CCEEEECCCC Q ss_conf 023210022221013667753100123211234433342011587489999995077621123346--8625303233 Q gi|254780214|r 113 DCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICMKSAGGVIEAIPILQQHGRISSLGFRF--GNVAYCTDVN 188 (271) Q Consensus 113 ~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri--~~~~Y~~Dt~ 188 (271) .+.+.+++......... ........ .........+...+..+++...+ .|. ...++|.+ .+++|++|+. T Consensus 74 ~~~~~~~~~~~~~~~~~--~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~p-GHt-~g~~~~~~~~~~vl~~GD~~ 144 (183) T smart00849 74 GTAELLKDLLKLGGALG--AEAPPPPP---DRTLKDGEELDLGGLELEVIHTP-GHT-PGSIVLYLPEGKILFTGDLL 144 (183) T ss_pred HHHHHHHCCHHCCCCCC--CCCCCCCC---CEECCCCCEEEECCEEEEEEECC-CCC-CCCEEEEECCCCEEEEEEEE T ss_conf 99999861001033333--34677766---16728999999899869986468-878-89899998999999998898 No 23 >COG2220 Predicted Zn-dependent hydrolases of the beta-lactamase fold [General function prediction only] Probab=99.11 E-value=4.4e-09 Score=76.07 Aligned_cols=187 Identities=16% Similarity=0.134 Sum_probs=102.0 Q ss_pred CCCEEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHHH-------HHHCCC Q ss_conf 9872599998541789651337887877778767744430699993367789718999879841566-------752233 Q gi|254780214|r 1 MKDYYHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQ-------VLREQV 73 (271) Q Consensus 1 m~~~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~q-------l~~~~i 73 (271) +...|++|.||. ++++|+. ++++||||..-.-... .....+ T Consensus 3 ~~~~m~itwlGh---------------------------a~~lie~-----~~~~iliDP~~~~~~~~~~~~~~~~~~~~ 50 (258) T COG2220 3 SAEDMKITWLGH---------------------------AAFLIET-----GGKRILIDPVLSGAPSPSNFPGGLFEDLL 50 (258) T ss_pred CCCCCEEEEEEE---------------------------EEEEEEE-----CCEEEEECCCCCCCCCCCCCCCCCCHHHC T ss_conf 776717999512---------------------------6899995-----88689989986788774422566764324 Q ss_pred CCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEE-EEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 4205999974352422011102433317821110-131120232100222210136677531001232112344333420 Q gi|254780214|r 74 LSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPID-VYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICM 152 (271) Q Consensus 74 ~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~-vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (271) ..+|+|++||.|.||+.-= .++ .....+.+ ++.|......+... ... . ....+.+.+. T Consensus 51 ~~~D~ilitH~H~DHl~~~-~~~---~~~~~~~~~~~~p~~~~~~~~~~------g~~--------~---~~~~~~~~g~ 109 (258) T COG2220 51 PPIDYILITHDHYDHLDDE-TLI---ALRTNKAPVVVVPLGAGDLLIRD------GVE--------A---ERVHELGWGD 109 (258) T ss_pred CCCCEEEEECCCCHHCCCH-HHH---HHHCCCCCEEECCCHHHHHHHCC------CCC--------C---CEEEECCCCC T ss_conf 7888899736861105707-665---54248874896460344444215------877--------2---0556147886 Q ss_pred ECCCCCEEEEEEEEECCC------------EECCCCCCC----CEEEECCCCCCHHHHHHHHCCHHEEEEECCCCCCCCC Q ss_conf 115874899999950776------------211233468----6253032330016788862221102530233576887 Q gi|254780214|r 153 KSAGGVIEAIPILQQHGR------------ISSLGFRFG----NVAYCTDVNAFPAESLEKLQNLDFLIIDALKNGLHSS 216 (271) Q Consensus 153 ~~~~~~i~i~~~~~~Hg~------------~~~~Gyri~----~~~Y~~Dt~~~~e~~~~~l~~~D~lI~d~l~~~~~~~ 216 (271) ......++++.++..|.+ ....||.|. ++..++||..+........-.+|++.+..-.+. ... T Consensus 110 ~~~~~~~~i~~~~a~h~~~~~~~~~~~~~~~~~~~~vi~~~g~~iyh~GDt~~~~~~~~~~~~~~DvallPig~~~-~~~ 188 (258) T COG2220 110 VIELGDLEITAVPAYHVSARHLPGRGIRPTGLWVGYVIETPGGRVYHAGDTGYLFLIIEELDGPVDVALLPIGGYP-NAT 188 (258) T ss_pred EEECCCCEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEECCEEEEECCCCHHHHHHHHHCCCCCCEEEECCCCCC-CCC T ss_conf 3973564698886263243335887743467616999960992799625670676655420577778994554566-776 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCC Q ss_conf 5799999999997299779997688 Q gi|254780214|r 217 HFSLSESLKKIELINPKNAILTHMH 241 (271) Q Consensus 217 H~~~~ea~~~~~~~~~k~~~LtHls 241 (271) +++..++.++++.+++|+++-+|.+ T Consensus 189 ~~~~~~~~~~~~~l~~~~viP~Hy~ 213 (258) T COG2220 189 MMPPEAAVAAAEVLRPKRVIPMHYG 213 (258) T ss_pred CCCHHHHHHHHHHHCCCEEEECCCC T ss_conf 5789999999985085649975435 No 24 >pfam00753 Lactamase_B Metallo-beta-lactamase superfamily. Probab=99.10 E-value=4e-10 Score=82.52 Aligned_cols=55 Identities=31% Similarity=0.451 Sum_probs=41.7 Q ss_pred CCCCEEEEEECCCCCCCEEEEEECCCCHHHHHH-----HCCCCCCCEEEEECCCHHHHHHCCHHH Q ss_conf 443069999336778971899987984156675-----223342059999743524220111024 Q gi|254780214|r 37 RTRSSLKISRVSERGSNTTVIVDTGPDFYMQVL-----REQVLSIDAVLYTHPHADHIHGIDGLR 96 (271) Q Consensus 37 R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~-----~~~i~~Id~I~iTH~H~DHi~Gl~~Lr 96 (271) +...|.+|+. ++..+|||||.+...+.. +...++|++|++||.|.||++|+..++ T Consensus 4 ~~~~~~li~~-----~~~~vLiD~G~~~~~~~~~~~~~~~~~~~v~~vi~TH~H~DH~gg~~~~~ 63 (148) T pfam00753 4 VGSNSYLVEG-----DGGAILIDTGLGADDALLLLAALGLDPKDIDAIILTHAHADHIGGLPELK 63 (148) T ss_pred CEEEEEEEEE-----CCEEEEEECCCCHHHHHHHHHHHCCCCCCCCEEEECCCCHHHHCCHHHHH T ss_conf 5558999998-----99899995989915899999983799768649997999855655689999 No 25 >TIGR00649 MG423 conserved hypothetical protein; InterPro: IPR004613 This is a family of conserved hypothetical proteins, the members of which contain an ATP-binding domain at the N-terminal end of the protein. It is possibly part of a superfamily of beta-lactmases.. Probab=99.05 E-value=3.4e-10 Score=83.01 Aligned_cols=125 Identities=18% Similarity=0.219 Sum_probs=77.4 Q ss_pred CCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCE-EEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCC Q ss_conf 342059999743524220111024333178211-1013112023210022221013667753100123-21123443334 Q gi|254780214|r 73 VLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRP-IDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVN-PIVIENNDVPI 150 (271) Q Consensus 73 i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~-i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~-~~~~~~~~~~~ 150 (271) -.+|.||||||.|.|||+|++.| + .+.+ .||||++-++.-+++.+.- ...+..... ..+++.. . T Consensus 69 ~~kvkgI~~tHGH~DHIGav~yL----~-~~~~~~PiYGt~lt~~L~~~ki~~------E~~l~~~~~yl~~~~~~---~ 134 (593) T TIGR00649 69 EDKVKGIVITHGHEDHIGAVPYL----L-HQYGFPPIYGTPLTIALIKSKIKA------EHGLNVRTDYLQEINEG---E 134 (593) T ss_pred CCCEEEEEECCCCCCCCCHHHHH----H-HHCCCCCEECCHHHHHHHHHHHHH------HHCCCCCCCHHHEECCC---C T ss_conf 25067889708710001228889----8-656888240437899999998865------42002257502101589---6 Q ss_pred CEEC-CCCCEEEEEEEEECCCEECCCCCC----CCEEEECCCCCCH--------HHH---HHHH-CCHHEEEEECCCCC Q ss_conf 2011-587489999995077621123346----8625303233001--------678---8862-22110253023357 Q gi|254780214|r 151 CMKS-AGGVIEAIPILQQHGRISSLGFRF----GNVAYCTDVNAFP--------AES---LEKL-QNLDFLIIDALKNG 212 (271) Q Consensus 151 ~~~~-~~~~i~i~~~~~~Hg~~~~~Gyri----~~~~Y~~Dt~~~~--------e~~---~~~l-~~~D~lI~d~l~~~ 212 (271) ...+ ....+.++++++-|+=.+|+|+.+ |.++|++|.+ |+ .+. .+.. +|+=+||.|.++-+ T Consensus 135 ~~~~~~i~~~~~eFir~tHSIPdS~~~a~hTp~G~IVy~~DFK-fD~~p~~~~~~Dl~~~~~~G~~GVL~LlsdsT~~~ 212 (593) T TIGR00649 135 PVQVGKIENFAIEFIRVTHSIPDSVGLALHTPEGSIVYAGDFK-FDNTPVIGEPPDLNRIAELGKKGVLLLLSDSTNVE 212 (593) T ss_pred EEEEEECCCEEEEEEEEEECCCCCEEEEEECCCCEEEEECCEE-EECCCCCCCCCCHHHHHHHHCCCCEEEEECCEECC T ss_conf 0888750643884577143241221888976983499832545-53777889860079999973288489995020226 No 26 >KOG1136 consensus Probab=98.96 E-value=1.3e-09 Score=79.35 Aligned_cols=173 Identities=22% Similarity=0.283 Sum_probs=86.5 Q ss_pred EEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCHHHHHHH----------CC- Q ss_conf 2599998541789651337887877778767744430699993367789718999879841566752----------23- Q gi|254780214|r 4 YYHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQVLR----------EQ- 72 (271) Q Consensus 4 ~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~----------~~- 72 (271) .|+++-||.|..- .||+++++. ++++|++|||...-.+=.| .+ T Consensus 3 ~i~v~pLGAGQdv---------------------GrSCilvsi-----~Gk~iM~DCGMHMG~nD~rRfPdFSyI~~~g~ 56 (501) T KOG1136 3 EIKVTPLGAGQDV---------------------GRSCILVSI-----GGKNIMFDCGMHMGFNDDRRFPDFSYISKSGR 56 (501) T ss_pred CCEEEECCCCCCC---------------------CCEEEEEEE-----CCCEEEEECCCCCCCCCCCCCCCCEEECCCCC T ss_conf 5147754677625---------------------742799997-----79078995454555576566888523668887 Q ss_pred C-CCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEE---ECCCCCCCCCCCCCCCCCCCCCCCCCCCC-- Q ss_conf 3-42059999743524220111024333178211101311202321---00222210136677531001232112344-- Q gi|254780214|r 73 V-LSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKH---LFESFRYCFKALDDRTYPPIVNPIVIENN-- 146 (271) Q Consensus 73 i-~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~---l~~~~~y~f~~~~~~~~~~~~~~~~~~~~-- 146 (271) + +-||.|+|||+|.||++.||.+-. ..+..-|||-+-.|.+- +.+-|.-+ ..+..+-..++....+..+ T Consensus 57 ~~~~idCvIIsHFHlDHcGaLPyfsE---v~GY~GPIYMt~PTkaicPvlLeDyRkv--~vd~kGe~n~FT~q~I~nCMK 131 (501) T KOG1136 57 FTDAIDCVIISHFHLDHCGALPYFSE---VVGYDGPIYMTYPTKAICPVLLEDYRKV--AVDRKGESNFFTTQDIKNCMK 131 (501) T ss_pred CCCCEEEEEEEEECCCCCCCCCCHHH---HHCCCCCEEEECCHHHHCHHHHHHHHHH--HCCCCCCCCCEEHHHHHHHHH T ss_conf 45404589985321100466400576---6177886478556353053889888987--616567533200898999876 Q ss_pred -----CCCCCEECCCCCEEEEEEEEECCCEECCCC--CCC--CEEEECCCCCCHHHHHH--HH--CCHHEEEEECC Q ss_conf -----333420115874899999950776211233--468--62530323300167888--62--22110253023 Q gi|254780214|r 147 -----DVPICMKSAGGVIEAIPILQQHGRISSLGF--RFG--NVAYCTDVNAFPAESLE--KL--QNLDFLIIDAL 209 (271) Q Consensus 147 -----~~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gy--ri~--~~~Y~~Dt~~~~e~~~~--~l--~~~D~lI~d~l 209 (271) ....... ...+++++++-..| ...+.-| +.+ +++|++|-+--|+.-+. ++ -..|+||.|++ T Consensus 132 KVv~i~l~qt~~-vD~dl~IrayYAGH-VLGAaMf~ikvGd~svvYTGDYnmTpDrHLGaA~id~~rpdlLIsEST 205 (501) T KOG1136 132 KVVAIDLHQTIQ-VDEDLQIRAYYAGH-VLGAAMFYIKVGDQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISEST 205 (501) T ss_pred HEEEEEEHHEEE-ECCCCEEEEEECCC-CCCEEEEEEEECCEEEEEECCCCCCCCCCCCHHHHCCCCCCEEEEECC T ss_conf 506733022077-25541364331232-000057999845535898467667864345604331336766875310 No 27 >COG2248 Predicted hydrolase (metallo-beta-lactamase superfamily) [General function prediction only] Probab=98.89 E-value=4.6e-09 Score=75.95 Aligned_cols=212 Identities=24% Similarity=0.310 Sum_probs=107.7 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCCH---H-------------HHH Q ss_conf 599998541789651337887877778767744430699993367789718999879841---5-------------667 Q gi|254780214|r 5 YHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPDF---Y-------------MQV 68 (271) Q Consensus 5 m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~---r-------------~ql 68 (271) ||++-||+- |.||-+ -+.+++. .+..||||.|-.+ | .|+ T Consensus 1 MkV~Pla~e-SLGVRS-------------------mAt~vet-----~dv~ILiDpGVsLaPkRy~LPPh~~E~erl~~~ 55 (304) T COG2248 1 MKVIPLASE-SLGVRS-------------------MATFVET-----KDVGILIDPGVSLAPKRYGLPPHQRELERLRQA 55 (304) T ss_pred CCEEECCCC-CCCHHH-------------------HHHEEEC-----CCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 932531312-323442-------------------4321321-----773699778654485346899887999999999 Q ss_pred HHCCC----CCCCEEEEECCCHHHHHHCCHHHHHHHC-CCCEEEEEECCEEE-----EEECCCC---CCCCCCCCCCCCC Q ss_conf 52233----4205999974352422011102433317-82111013112023-----2100222---2101366775310 Q gi|254780214|r 69 LREQV----LSIDAVLYTHPHADHIHGIDGLRGYFLK-QKRPIDVYAAPDCM-----KHLFESF---RYCFKALDDRTYP 135 (271) Q Consensus 69 ~~~~i----~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~-~~~~i~vy~~~~~~-----~~l~~~~---~y~f~~~~~~~~~ 135 (271) +..+ ++.|-|.|||+|.||..-+.+ ..+.. ....-.+|+.+-.+ +.|++.. .|-|-...+ T Consensus 56 -r~~i~~~ak~a~VitISHYHYDHhtPf~~--~~y~~s~e~~~eiY~gK~lLlKhPte~IN~SQ~~Ra~~fl~~~~---- 128 (304) T COG2248 56 -REKIQRYAKKADVITISHYHYDHHTPFFD--GIYEASGETAKEIYKGKLLLLKHPTENINRSQRRRAYRFLESLK---- 128 (304) T ss_pred -HHHHHHHHHHCCEEEEEEECCCCCCCCCC--CHHHHCCCCHHHHHCCCEEEECCCHHHHCHHHHHHHHHHHHHHH---- T ss_conf -99999998648889986401245786655--21231032467773583788519556557787878999998764---- Q ss_pred CCCCCCCCCCCCCCCCEECCCCCEEEEEEEEECCCE-ECCCCCC--------CCEEEECCCC-CCHHHHHHHH--CCHHE Q ss_conf 012321123443334201158748999999507762-1123346--------8625303233-0016788862--22110 Q gi|254780214|r 136 PIVNPIVIENNDVPICMKSAGGVIEAIPILQQHGRI-SSLGFRF--------GNVAYCTDVN-AFPAESLEKL--QNLDF 203 (271) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~Hg~~-~~~Gyri--------~~~~Y~~Dt~-~~~e~~~~~l--~~~D~ 203 (271) .-...++..+ ...+.+.+..|++. -+++||.. .-+||.+ .++++++|+. .+.++.++++ ++.|+ T Consensus 129 --~~~~~ie~AD-gk~f~fG~t~IefS-~pvpHG~eGskLGyVl~v~V~dg~~~i~faSDvqGp~~~~~l~~i~e~~P~v 204 (304) T COG2248 129 --DIAREIEYAD-GKTFEFGGTVIEFS-PPVPHGREGSKLGYVLMVAVTDGKSSIVFASDVQGPINDEALEFILEKRPDV 204 (304) T ss_pred --HHCCEEEECC-CCEEEECCEEEEEC-CCCCCCCCCCCCCEEEEEEEECCCEEEEECCCCCCCCCCHHHHHHHHCCCCE T ss_conf --3111468337-86587577899956-8888887666221499999943976899713666888608889998518988 Q ss_pred EEEECCCC--CCC-CCCCCHHHHH----HHHHHCCCCEEEEEC-CCCCCCHHHHHHHC Q ss_conf 25302335--768-8757999999----999972997799976-88898888999867 Q gi|254780214|r 204 LIIDALKN--GLH-SSHFSLSESL----KKIELINPKNAILTH-MHVDLDYDMVLKST 253 (271) Q Consensus 204 lI~d~l~~--~~~-~~H~~~~ea~----~~~~~~~~k~~~LtH-ls~~~~~~~~~~~~ 253 (271) +|+++-.. -.+ .+--+++-++ +++++.+ +++++-| +--+.++.+..+++ T Consensus 205 ~ii~GPpty~lg~r~~~~~~E~~irNl~~ii~~~~-~~lViDHHllRD~~y~e~l~~l 261 (304) T COG2248 205 LIIGGPPTYLLGYRVGPKSLEKGIRNLERIIEETN-ATLVIDHHLLRDKNYREFLEEL 261 (304) T ss_pred EEECCCCHHHHHHHCCHHHHHHHHHHHHHHHHHCC-CEEEEEEHHHCCCCHHHHHHHH T ss_conf 99669826676655076779999989999997076-6499852023177789999999 No 28 >TIGR03413 GSH_gloB hydroxyacylglutathione hydrolase. Members of this protein family are hydroxyacylglutathione hydrolase, a detoxification enzyme known as glyoxalase II. It follows lactoylglutathione lyase, or glyoxalase I, and acts to remove the toxic metabolite methylglyoxal and related compounds. This protein belongs to the broader metallo-beta-lactamase family (pfam00753). Probab=98.84 E-value=8.9e-09 Score=74.17 Aligned_cols=105 Identities=20% Similarity=0.312 Sum_probs=60.2 Q ss_pred CEEEEEECCCC-HHHHHHHCCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCC Q ss_conf 71899987984-15667522334205999974352422011102433317821110131120232100222210136677 Q gi|254780214|r 53 NTTVIVDTGPD-FYMQVLREQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDD 131 (271) Q Consensus 53 ~~~iLiD~G~d-~r~ql~~~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~ 131 (271) +..++||+|.. -..+.++..--+|.+|++||.|+||+.|...|+ .+.+.+||+++... .+.. T Consensus 20 ~~a~vIDP~d~~~i~~~l~~~~l~l~~IL~TH~H~DHi~G~~~L~-----~~~~~~Vyg~~~~~------~~~~------ 82 (248) T TIGR03413 20 GQAAVVDPGEAEPVLDALEARGLTLTAILLTHHHHDHVGGVAELL-----EAFPAPVYGPAEER------IPGI------ 82 (248) T ss_pred CCEEEECCCCCHHHHHHHHHCCCEEEEEEECCCCCCCCCCHHHHH-----HHCCCCCCCCHHHC------CCCC------ T ss_conf 858998799809999999987996789994898860013189999-----75146612011116------8777------ Q ss_pred CCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEEECCCEECCCCCCC--CEEEECCC Q ss_conf 531001232112344333420115874899999950776211233468--62530323 Q gi|254780214|r 132 RTYPPIVNPIVIENNDVPICMKSAGGVIEAIPILQQHGRISSLGFRFG--NVAYCTDV 187 (271) Q Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri~--~~~Y~~Dt 187 (271) . ..+. .+..+..+++++..+.++-....++.|.+. +++++||| T Consensus 83 -------d-~~l~-----dgd~i~ig~~~~~vi~TPGHT~ghv~f~~~~~~~lFtGDT 127 (248) T TIGR03413 83 -------T-HPVK-----DGDTVTLGGLEFEVLAVPGHTLGHIAYYLPDSPALFCGDT 127 (248) T ss_pred -------E-EEEC-----CCCEEEECCEEEEEEECCCCCCCCEEEEECCCEEEEECCE T ss_conf -------0-7825-----8988988998899996789973006899557407984463 No 29 >PRK11539 hypothetical protein; Provisional Probab=98.82 E-value=2.1e-07 Score=65.58 Aligned_cols=63 Identities=24% Similarity=0.416 Sum_probs=45.5 Q ss_pred EEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCC-----HHHH-----HHHCCC Q ss_conf 259999854178965133788787777876774443069999336778971899987984-----1566-----752233 Q gi|254780214|r 4 YYHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPD-----FYMQ-----VLREQV 73 (271) Q Consensus 4 ~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d-----~r~q-----l~~~~i 73 (271) .+++|||=-|. ..|++|+. +++.+|+|+|+. .-.+ |...++ T Consensus 498 ~~~v~~LDVGQ------------------------G~avlI~~-----~~~~~L~DtG~~~~~~~~g~~vi~P~L~~~Gi 548 (754) T PRK11539 498 EWRVDMLDVGQ------------------------GLAMVIER-----NGKAILYDTGNAWPEGDSAQQVIIPWLRWHNL 548 (754) T ss_pred CCEEEEEECCC------------------------CEEEEEEE-----CCEEEEEECCCCCCCCCCHHHHHHHHHHHCCC T ss_conf 60899997278------------------------51899998-----99789995799888887237788999996799 Q ss_pred CCCCEEEEECCCHHHHHHCCHHH Q ss_conf 42059999743524220111024 Q gi|254780214|r 74 LSIDAVLYTHPHADHIHGIDGLR 96 (271) Q Consensus 74 ~~Id~I~iTH~H~DHi~Gl~~Lr 96 (271) +||++++||.|.||++|+..+. T Consensus 549 -~lD~lvlSH~D~DH~GGl~~ll 570 (754) T PRK11539 549 -TPEGVILSHEHLDHRGGLASLL 570 (754) T ss_pred -CCEEEEECCCCCCCCCCHHHHH T ss_conf -7208994588500126899999 No 30 >PRK10241 hydroxyacylglutathione hydrolase; Provisional Probab=98.80 E-value=1.9e-08 Score=72.10 Aligned_cols=106 Identities=19% Similarity=0.243 Sum_probs=61.2 Q ss_pred CEEEEEECCCCH-HHHHHHCCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCC Q ss_conf 718999879841-5667522334205999974352422011102433317821110131120232100222210136677 Q gi|254780214|r 53 NTTVIVDTGPDF-YMQVLREQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDD 131 (271) Q Consensus 53 ~~~iLiD~G~d~-r~ql~~~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~ 131 (271) +..++||+|..- -.+.++..--++++|++||.|+||++|...|+.-+ ..++||++......-. T Consensus 22 ~~a~vIDPgd~~~i~~~l~~~~l~l~~IL~TH~H~DHi~G~~~L~~~~----~~~~v~g~~~~~~~~~------------ 85 (251) T PRK10241 22 GRCLIVDPGEAEPVLNAIAANNWQPEAIFLTHHHHDHVGGVKELVEKF----PQIVVYGPQETQDKGT------------ 85 (251) T ss_pred CCEEEECCCCHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHHC----CCCEEECHHHCCCCCC------------ T ss_conf 838999499709999999977991789994589557663299999876----8985867033267689------------ Q ss_pred CCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEEECCCEECCCCCCCCEEEECCC Q ss_conf 53100123211234433342011587489999995077621123346862530323 Q gi|254780214|r 132 RTYPPIVNPIVIENNDVPICMKSAGGVIEAIPILQQHGRISSLGFRFGNVAYCTDV 187 (271) Q Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri~~~~Y~~Dt 187 (271) ...++. +..+..++.++..+.++-....++.|..++.+++||| T Consensus 86 --------~~~v~d-----gd~i~~g~~~~~vi~TPGHT~Ghi~y~~~~~lFtGDT 128 (251) T PRK10241 86 --------TQVVKD-----GETAFVLGHEFSVFATPGHTLGHICYFSKPYLFCGDT 128 (251) T ss_pred --------CEEECC-----CCEEEECCEEEEEEECCCCCCCHHHHHCCCCEEECCC T ss_conf --------878489-----9999989989999977999830001113770675153 No 31 >KOG1135 consensus Probab=98.76 E-value=4.4e-08 Score=69.85 Aligned_cols=157 Identities=22% Similarity=0.322 Sum_probs=84.3 Q ss_pred EEEEEECCCCCCCEEEEEECCCC--HHHHHHHC---CCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEE Q ss_conf 69999336778971899987984--15667522---33420599997435242201110243331782111013112023 Q gi|254780214|r 41 SLKISRVSERGSNTTVIVDTGPD--FYMQVLRE---QVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCM 115 (271) Q Consensus 41 s~~i~~~~~~~~~~~iLiD~G~d--~r~ql~~~---~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~ 115 (271) +-+++. ++.+||||||-+ +..|++.. -+.+||||+|||.-.-|++||+.. +...+-..+||++-.+. T Consensus 17 cyllqi-----D~~~iLiDcGwd~~f~~~~i~~l~~~i~~iDaILLShpd~~hlGaLpY~---~~k~gl~~~VYAT~PV~ 88 (764) T KOG1135 17 CYLLQI-----DGVRILIDCGWDESFDMSMIKELKPVIPTIDAILLSHPDILHLGALPYA---VGKLGLNAPVYATLPVI 88 (764) T ss_pred EEEEEE-----CCEEEEEECCCCCHHCCCHHHHHHCCCCCCCEEEECCCCHHHHCCCHHH---HHHCCCCCEEEEECCHH T ss_conf 589997-----5808997279960106601001211466443799558986773650536---86377252278843433 Q ss_pred EEECCCCCC-CCCC---------CCCCCCC-CCCCCCCCCCCCCCCCEECCCCCEEEEEEEEECCCEECCCCCC----CC Q ss_conf 210022221-0136---------6775310-0123211234433342011587489999995077621123346----86 Q gi|254780214|r 116 KHLFESFRY-CFKA---------LDDRTYP-PIVNPIVIENNDVPICMKSAGGVIEAIPILQQHGRISSLGFRF----GN 180 (271) Q Consensus 116 ~~l~~~~~y-~f~~---------~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri----~~ 180 (271) +. -+.|-| .+.. ......- .+.....++. ..+......+.+++++|...-|.... .-|+| ++ T Consensus 89 ~m-G~m~myD~~~S~~~~~df~l~sldDvd~aFd~I~~LKY-sQ~v~L~gk~~Gl~itaynAGhmiGG-sIWkI~k~~E~ 165 (764) T KOG1135 89 KM-GQMFMYDLYRSHGNVGDFDLFSLDDVDAAFDKIIQLKY-SQPVALKGKGSGLTITAYNAGHMIGG-SIWKISKVGED 165 (764) T ss_pred HH-HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHEEEEEC-CCEEEECCCCCCEEEEEECCCCCCCC-EEEEEEECCCE T ss_conf 43-14417998760366432344436666777754133211-55477426557248964047775674-57998735753 Q ss_pred EEEECCCCC-----CHHHHHHHHCCHHEEEEEC Q ss_conf 253032330-----0167888622211025302 Q gi|254780214|r 181 VAYCTDVNA-----FPAESLEKLQNLDFLIIDA 208 (271) Q Consensus 181 ~~Y~~Dt~~-----~~e~~~~~l~~~D~lI~d~ 208 (271) ++|+.|.+. +.-.+++.+..+.+||.|+ T Consensus 166 ivYavd~NHkKe~HLNG~~l~~l~RPsllITda 198 (764) T KOG1135 166 IVYAVDFNHKKERHLNGCSLSGLNRPSLLITDA 198 (764) T ss_pred EEEEEECCCCHHCCCCCCCCCCCCCCCEEEECC T ss_conf 899970365210005786421137863698336 No 32 >COG5212 PDE1 Low-affinity cAMP phosphodiesterase [Signal transduction mechanisms] Probab=98.67 E-value=5.9e-08 Score=69.06 Aligned_cols=161 Identities=20% Similarity=0.231 Sum_probs=88.2 Q ss_pred CCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCC----CCCCCCCCCCCCCC Q ss_conf 42059999743524220111024333178211101311202321002222101366775310----01232112344333 Q gi|254780214|r 74 LSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYP----PIVNPIVIENNDVP 149 (271) Q Consensus 74 ~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~----~~~~~~~~~~~~~~ 149 (271) ..|..-||||.|.|||.|+--=-|. ...+++=.|||.+.|.+.+++ |.|+..-....- ...+..+++... T Consensus 111 Q~I~~y~ITH~HLDHIsGlVinSp~-~~~qkkkTI~gl~~tIDvL~k---hvFN~lvWP~lt~~gs~~~~~qvv~P~~-- 184 (356) T COG5212 111 QSINSYFITHAHLDHISGLVINSPD-DSKQKKKTIYGLADTIDVLRK---HVFNWLVWPNLTDSGSGTYRMQVVRPAQ-- 184 (356) T ss_pred HHHHHEEECCCCCCCHHCEEECCCC-CCCCCCCEEEECHHHHHHHHH---HHHCCCCCCCCCCCCCCEEEEEEECHHH-- T ss_conf 3122237523020000013634864-345577337724027999998---7510001577441458548889847347-- Q ss_pred CCEECCCCCEEEEEEEEECCC---EE--CCCCCCC------CEEEECCCCC-------CHHHHHHHH------CCHHEEE Q ss_conf 420115874899999950776---21--1233468------6253032330-------016788862------2211025 Q gi|254780214|r 150 ICMKSAGGVIEAIPILQQHGR---IS--SLGFRFG------NVAYCTDVNA-------FPAESLEKL------QNLDFLI 205 (271) Q Consensus 150 ~~~~~~~~~i~i~~~~~~Hg~---~~--~~Gyri~------~~~Y~~Dt~~-------~~e~~~~~l------~~~D~lI 205 (271) ........+++.||+++||. .+ +..|.|. -|+|.+|+.. .-++.+..+ +...-++ T Consensus 185 -~~slt~t~l~~~pfpv~Hg~ktG~p~ySs~~lfr~nkS~~~f~~fGDvepD~vese~ll~~~Wr~~ae~I~q~~Lkgil 263 (356) T COG5212 185 -SLSLTLTRLTGEPFPVSHGKKTGSPSYSSMLLFRSNKSNEFFAYFGDVEPDDVESEKLLDTVWRKLAEKITQQQLKGIL 263 (356) T ss_pred -EEEEEEEEECCEEEECCCCCCCCCCCCCEEEEEECCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCEE T ss_conf -1055334402225641378666884421489985378762489946878203210177999999998764687627169 Q ss_pred EECCCCC-----CCCCCCCHHHHHHHHHHC----------CCCEEEEECCC Q ss_conf 3023357-----688757999999999972----------99779997688 Q gi|254780214|r 206 IDALKNG-----LHSSHFSLSESLKKIELI----------NPKNAILTHMH 241 (271) Q Consensus 206 ~d~l~~~-----~~~~H~~~~ea~~~~~~~----------~~k~~~LtHls 241 (271) +||.-+. .-.+|++..-.+.-.+++ +-=.+++||+- T Consensus 264 iEcS~P~~~~~~~LfGH~~P~~L~nEL~~L~~l~~s~~~l~gL~vviTHiK 314 (356) T COG5212 264 IECSYPNDVADNKLFGHMTPTWLLNELKKLEQLSGSGQPLKGLPVVITHIK 314 (356) T ss_pred EEECCCCCCCHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECC T ss_conf 994389987756765056849999999999988625877789607888325 No 33 >COG0491 GloB Zn-dependent hydrolases, including glyoxylases [General function prediction only] Probab=98.61 E-value=2.5e-07 Score=65.19 Aligned_cols=127 Identities=21% Similarity=0.193 Sum_probs=60.4 Q ss_pred CEEEEEECCCCHH------HHHHHCCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCC Q ss_conf 7189998798415------6675223342059999743524220111024333178211101311202321002222101 Q gi|254780214|r 53 NTTVIVDTGPDFY------MQVLREQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCF 126 (271) Q Consensus 53 ~~~iLiD~G~d~r------~ql~~~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f 126 (271) +..+|||||.... .++...+. +|++|++||.|.||++|+..++... . ..++|.++.............. T Consensus 35 ~~~~liDtG~~~~~~~~~~~~l~~~~~-di~~vilTH~H~DH~gg~~~~~~~~---~-~~~~~~~~~~~~~~~~~~~~~~ 109 (252) T COG0491 35 GGAVLIDTGLGDADAEALLEALAALGL-DVDAILLTHGHFDHIGGAAVLKEAF---G-AAPVIAPAEVPLLLREEILRKA 109 (252) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHCCC-CCCEEEECCCCHHHHHHHHHHHHHC---C-CCEEEECCHHHHHHHCCCCCCC T ss_conf 848999499984057999999886699-8778991988078782799998745---8-7349804025544301211122 Q ss_pred CC-CCCCCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEEECCCEECCCCCCCC--EEEECCCCC Q ss_conf 36-6775310012321123443334201158748999999507762112334686--253032330 Q gi|254780214|r 127 KA-LDDRTYPPIVNPIVIENNDVPICMKSAGGVIEAIPILQQHGRISSLGFRFGN--VAYCTDVNA 189 (271) Q Consensus 127 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri~~--~~Y~~Dt~~ 189 (271) .. ......++. ...............+..+++.+.+ -| .....+|.+.+ +.|++|.-. T Consensus 110 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~i~~p-GH-T~g~~~~~~~~~~~l~~gD~~~ 170 (252) T COG0491 110 GVTAEAYAAPGA---SPLRALEDGDELDLGGLELEVLHTP-GH-TPGHIVFLLEDGGVLFTGDTLF 170 (252) T ss_pred CCCHHHCCCCCC---CCCCCCCCCCEEEECCCEEEEEECC-CC-CCCCEEEEECCCCEEEEECCCC T ss_conf 100121147733---3443357898899789269999899-88-8558899989998999932116 No 34 >KOG1137 consensus Probab=98.60 E-value=4.2e-08 Score=69.99 Aligned_cols=172 Identities=24% Similarity=0.284 Sum_probs=84.2 Q ss_pred CCEEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECC--CCHHHH-HH----HCCCC Q ss_conf 872599998541789651337887877778767744430699993367789718999879--841566-75----22334 Q gi|254780214|r 2 KDYYHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTG--PDFYMQ-VL----REQVL 74 (271) Q Consensus 2 ~~~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G--~d~r~q-l~----~~~i~ 74 (271) +|-++++=||.|.- -.||+.+++. +|++|++||| |++..- .+ ..+++ T Consensus 11 ~d~l~~~pLGag~E---------------------VGRSC~ile~-----kGk~iMld~gvhpaysg~aslpf~d~vd~s 64 (668) T KOG1137 11 SDQLKFTPLGAGNE---------------------VGRSCHILEY-----KGKTIMLDCGVHPAYSGMASLPFYDEVDLS 64 (668) T ss_pred CCCEEEEECCCCCC---------------------CCCEEEEEEE-----CCEEEEECCCCCCCCCCCCCCCCHHHCCCC T ss_conf 77179987788762---------------------5733799996-----683798536667453444446421004601 Q ss_pred CCCEEEEECCCHHHHHHCCHHHHHHHCCCC-EEEEEECCEEEE---EECCCCCCCCCC-CCCCCCCC-C--CCCCCCCCC Q ss_conf 205999974352422011102433317821-110131120232---100222210136-67753100-1--232112344 Q gi|254780214|r 75 SIDAVLYTHPHADHIHGIDGLRGYFLKQKR-PIDVYAAPDCMK---HLFESFRYCFKA-LDDRTYPP-I--VNPIVIENN 146 (271) Q Consensus 75 ~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~-~i~vy~~~~~~~---~l~~~~~y~f~~-~~~~~~~~-~--~~~~~~~~~ 146 (271) +||..+|||+|.||.+-|| |++.+-. .-.+|-+..|.+ .+..-|--.-+. .+...|-. . .....++.. T Consensus 65 ~id~llIthFhldh~aslp----~~~qkTsf~grvfmth~TkAi~kwllsdyvrvs~~s~~~~Ly~e~dl~~s~dKie~i 140 (668) T KOG1137 65 AIDPLLITHFHLDHAASLP----FTLQKTSFIGRVFMTHPTKAIYKWLLSDYVRVSNRSGDDRLYTEGDLMESMDKIETI 140 (668) T ss_pred CCCHHHHHHHHHHHCCCCC----CEEEECCCCCEEEEECCHHHHHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHHEEE T ss_conf 0547887665353226662----100000225616882441887876642336765135753343510477765532222 Q ss_pred CCCCCEECCCCCEEEEEEEEECCCEECCCCCCC----CEEEECCCCCCHHHHHHH--H----CCHHEEEEEC Q ss_conf 333420115874899999950776211233468----625303233001678886--2----2211025302 Q gi|254780214|r 147 DVPICMKSAGGVIEAIPILQQHGRISSLGFRFG----NVAYCTDVNAFPAESLEK--L----QNLDFLIIDA 208 (271) Q Consensus 147 ~~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri~----~~~Y~~Dt~~~~e~~~~~--l----~~~D~lI~d~ 208 (271) ++ .......+|++.|+..-| -..+.-|.++ ++.|+||.+ +|+.... + .+.|++|.+. T Consensus 141 df--he~~ev~gIkf~p~~aGh-VlgacMf~veiagv~lLyTGd~s--reeDrhl~aae~P~~~~dvli~es 207 (668) T KOG1137 141 DF--HETVEVNGIKFWPYHAGH-VLGACMFMVEIAGVRLLYTGDYS--REEDRHLIAAEMPPTGPDVLITES 207 (668) T ss_pred EE--CCCCCCCCEEEEEECCCH-HHHHEEEEEEECEEEEEECCCCC--HHHCCCCCCHHCCCCCCCEEEEEE T ss_conf 20--023334874899604541-23321366652407999625666--210423320017999864799973 No 35 >PRK11709 putative L-ascorbate 6-phosphate lactonase; Provisional Probab=98.56 E-value=1.5e-06 Score=60.34 Aligned_cols=176 Identities=20% Similarity=0.213 Sum_probs=94.9 Q ss_pred CEEEEEECCCCCCCEEEEEE--CCCCH----------HHHHHH--------------------CCCCCCCEEEEECCCHH Q ss_conf 06999933677897189998--79841----------566752--------------------23342059999743524 Q gi|254780214|r 40 SSLKISRVSERGSNTTVIVD--TGPDF----------YMQVLR--------------------EQVLSIDAVLYTHPHAD 87 (271) Q Consensus 40 ~s~~i~~~~~~~~~~~iLiD--~G~d~----------r~ql~~--------------------~~i~~Id~I~iTH~H~D 87 (271) ++++|... ++.+|+|| ||.+- ++|+.| +.|+.|||++.||.|.| T Consensus 46 ~GiW~Kt~----~~tNi~iDlW~gtGK~t~~~~~m~~~HQm~rM~G~~klQPNLR~~P~ViDPFaik~lDavl~TH~H~D 121 (355) T PRK11709 46 TGIWLKTE----GGTNVCVDFWCGTGKQTHGNPLMKRGHQMARMAGVKKLQPNLRTQPFVLDPFAINEIDAVLATHDHSD 121 (355) T ss_pred CEEEEECC----CCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCEEECCHHCCCCCEEEEEECCCC T ss_conf 45999748----99359999503766444567777706688762387536854247973137411153207998511434 Q ss_pred HHHHCCHHHHHHHC-CCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEE Q ss_conf 22011102433317-82111013112023210022221013667753100123211234433342011587489999995 Q gi|254780214|r 88 HIHGIDGLRGYFLK-QKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICMKSAGGVIEAIPILQ 166 (271) Q Consensus 88 Hi~Gl~~Lr~~~~~-~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~ 166 (271) |+-=- ..+.+.. ....++..||+.+.+....+ +.|. ....+++. +..++..++++.++.. T Consensus 122 HiD~n--~AAAv~~n~~~~v~FiGP~~~vd~W~~W-----------GVP~-eR~ivvkP-----GD~~kvkDi~i~ales 182 (355) T PRK11709 122 HIDVN--VAAAVMQNCADHVKFIGPQTCVDLWIGW-----------GVPK-ERCIVVKP-----GDVVKVKDIEIHALDA 182 (355) T ss_pred CHHHH--HHHHHHHCCCCCCCEECCHHHHHHHHHC-----------CCCH-HHEEEECC-----CCEEEEEEEEEEEECC T ss_conf 30089--9999982685578531858999888862-----------9967-88799548-----9878871269998504 Q ss_pred E---------------CCCE------ECCCCCCC----CEEEECCCCCCHHHHHHHHC--CHHEEEEECCCCCC--CCCC Q ss_conf 0---------------7762------11233468----62530323300167888622--21102530233576--8875 Q gi|254780214|r 167 Q---------------HGRI------SSLGFRFG----NVAYCTDVNAFPAESLEKLQ--NLDFLIIDALKNGL--HSSH 217 (271) Q Consensus 167 ~---------------Hg~~------~~~Gyri~----~~~Y~~Dt~~~~e~~~~~l~--~~D~lI~d~l~~~~--~~~H 217 (271) - .|.. .++.|.|+ ++..++|+. ++.--.+.-+ ..|+.+. +.-+++ ...- T Consensus 183 fDRT~lvT~~~~~~~~~g~~~~~Md~~AVnYl~kT~gGnvYH~gDSH-ySn~fakhG~~~~IDVal~-~yGeNP~GitDK 260 (355) T PRK11709 183 FDRTALITLPADGKAAGGVLPDGMDRRAVNYLFKTPGGNLYHSGDSH-YSNYYAKHGNEYQIDVALG-SYGENPRGITDK 260 (355) T ss_pred CCCEEEEECCCCCCCCCCCCCCHHHHHHHHEEEECCCCCEEECCCCH-HHHHHHHHCCCCCCEEEEE-CCCCCCCCCCCC T ss_conf 56247985178754468999721667634148976997677557613-5567887276103117873-256689876165 Q ss_pred CCHHHHHHHHHHCCCCEEEEECC Q ss_conf 79999999999729977999768 Q gi|254780214|r 218 FSLSESLKKIELINPKNAILTHM 240 (271) Q Consensus 218 ~~~~ea~~~~~~~~~k~~~LtHl 240 (271) ||..+.+++|+.+++|-++-.|. T Consensus 261 mts~DiLRmaE~L~aKVvIP~H~ 283 (355) T PRK11709 261 MTSIDILRMAESLNAKVVIPVHH 283 (355) T ss_pred CCHHHHHHHHHHCCCCEEEECHH T ss_conf 76999999998639858951306 No 36 >COG2333 ComEC Predicted hydrolase (metallo-beta-lactamase superfamily) [General function prediction only] Probab=98.49 E-value=5.8e-06 Score=56.69 Aligned_cols=186 Identities=22% Similarity=0.213 Sum_probs=85.7 Q ss_pred CCEEEEEECCCCCCCEEEEEECCC--C---HHHHHHHCCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCE Q ss_conf 306999933677897189998798--4---15667522334205999974352422011102433317821110131120 Q gi|254780214|r 39 RSSLKISRVSERGSNTTVIVDTGP--D---FYMQVLREQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPD 113 (271) Q Consensus 39 ~~s~~i~~~~~~~~~~~iLiD~G~--d---~r~ql~~~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~ 113 (271) ..+++++. ++.++|+|+|. + +.--|-..++++||.+++||.|.||++|++.+..-+ .-..+-++.+.. T Consensus 54 g~a~li~~-----~~~~~l~dtg~~~~~~~iip~Lk~~GV~~iD~lIlTH~d~DHiGg~~~vl~~~--~v~~~~i~~~~~ 126 (293) T COG2333 54 GLATLIRS-----EGKTILYDTGNSMGQDVIIPYLKSLGVRKLDQLILTHPDADHIGGLDEVLKTI--KVPELWIYAGSD 126 (293) T ss_pred CEEEEEEE-----CCCEEEEECCCCCCCEEEHHHHHHCCCCCCCEEEECCCCCCCCCCHHHHHHHC--CCCCEEEECCCC T ss_conf 72789851-----89118862575447243635576749860137893168744447899998528--877179827877 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEE-EE----ECCCEECCCCCC----CCEEEE Q ss_conf 232100222210136677531001232112344333420115874899999-95----077621123346----862530 Q gi|254780214|r 114 CMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICMKSAGGVIEAIPI-LQ----QHGRISSLGFRF----GNVAYC 184 (271) Q Consensus 114 ~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~-~~----~Hg~~~~~Gyri----~~~~Y~ 184 (271) ...... ............-+.....+++.+..+ +. .-.+-.+...++ .++..+ T Consensus 127 ~~~~~~------------------~~~~~~~~~~~~~G~~~~~~~~~f~vl~P~~~~~~~~N~~S~Vl~v~~g~~s~LlT 188 (293) T COG2333 127 STSTFV------------------LRDAGIPVRSCKAGDSWQWGGVVFQVLSPVGGVSDDLNNDSCVLRVTFGGNSFLLT 188 (293) T ss_pred CCCHHH------------------HHHCCCCEECCCCCCEEEECCEEEEEECCCCCCCCCCCCCCEEEEEEECCEEEEEE T ss_conf 651455------------------65337741200127547888857999768866665556762599999589169996 Q ss_pred CCCCCCHHHHHHHHC-CHHEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEC-CCCC--CCHHHHHHHCC Q ss_conf 323300167888622-2110253023357688757999999999972997799976-8889--88889998679 Q gi|254780214|r 185 TDVNAFPAESLEKLQ-NLDFLIIDALKNGLHSSHFSLSESLKKIELINPKNAILTH-MHVD--LDYDMVLKSTP 254 (271) Q Consensus 185 ~Dt~~~~e~~~~~l~-~~D~lI~d~l~~~~~~~H~~~~ea~~~~~~~~~k~~~LtH-ls~~--~~~~~~~~~~~ 254 (271) +|...-.|. ..++ +.| |=+|-++...|-+.. +-..++.++++||-+++.= -... ..|.++.+.+. T Consensus 189 GD~e~~~E~--~l~~~~~~-l~~dVLkV~HHGS~t--Sss~~Fl~~v~Pk~AliS~G~~N~yghPh~~Vl~rl~ 257 (293) T COG2333 189 GDLEEKGEK--LLKKYGPD-LRADVLKVGHHGSKT--SSSLAFLEAVKPKVALISSGRNNRYGHPHQEVLERLQ 257 (293) T ss_pred CCCCHHHHH--HHHHHCCC-CCCEEEEECCCCCCC--CCCHHHHHHCCCCEEEEEEECCCCCCCCCHHHHHHHH T ss_conf 578705689--99843787-442089816677665--6728899743984799975236778997399999998 No 37 >KOG1361 consensus Probab=98.38 E-value=2.8e-06 Score=58.63 Aligned_cols=176 Identities=18% Similarity=0.218 Sum_probs=91.9 Q ss_pred EEEECCCCHHHHHHHCC-CCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCC Q ss_conf 99987984156675223-34205999974352422011102433317821110131120232100222210136677531 Q gi|254780214|r 56 VIVDTGPDFYMQVLREQ-VLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTY 134 (271) Q Consensus 56 iLiD~G~d~r~ql~~~~-i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~ 134 (271) +.++.|.-|..-..+.+ +....+-|+||+|+||..||.- .| . .| ++|+.+-|...+...+.+-+. T Consensus 92 ~~~~p~~~f~VD~f~~~~~~~~s~yFLsHFHSDHy~GL~~--sW--~--~p-~lYCS~ita~Lv~~~~~v~~~------- 157 (481) T KOG1361 92 IKVLPGGEFSVDAFRYGHIEGCSAYFLSHFHSDHYIGLTK--SW--S--HP-PLYCSPITARLVPLKVSVTKQ------- 157 (481) T ss_pred EEECCCCCEEEEHHHCCCCCCCCEEEEECCCCCCCCCCCC--CC--C--CC-CCCCCCCCHHHHHHHCCCCHH------- T ss_conf 4606997278743223776651101220145543143544--45--5--88-610141226555440114741------- Q ss_pred CCCCCCCCCCCCCCCCCEECCCCCEEEEEEEEECCCEECCCCCC-----CCEEEECCCCCCHHHHHH----HH-CCHHEE Q ss_conf 00123211234433342011587489999995077621123346-----862530323300167888----62-221102 Q gi|254780214|r 135 PPIVNPIVIENNDVPICMKSAGGVIEAIPILQQHGRISSLGFRF-----GNVAYCTDVNAFPAESLE----KL-QNLDFL 204 (271) Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri-----~~~~Y~~Dt~~~~e~~~~----~l-~~~D~l 204 (271) . +...+.....+. ..+.++.+..+|++. ++-|.| ..+.++||.+ .++.... .. +..|.+ T Consensus 158 ----~---i~~l~l~~~~~i--~~~~vt~ldAnHCPG-a~mf~F~~~~~~~~lhtGDFR-~s~~m~~~p~~~~~~~i~~l 226 (481) T KOG1361 158 ----S---IQALDLNQPLEI--PGIQVTLLDANHCPG-AVMFLFELSFGPCILHTGDFR-ASADMSKEPALTLEQTIDIL 226 (481) T ss_pred ----H---CEEECCCCCEEE--CCEEEEEECCCCCCC-CEEEEEECCCCCEEEECCCCC-CCHHHHHCHHHHCCCCCCEE T ss_conf ----1---413147883662--456899851666998-558975347886278647731-38656319687658764658 Q ss_pred EEECCCCCCC----CCCCCHHHHHHHHHHCCCC--EEEEECCCCCCC----HHHHHHHCCCC Q ss_conf 5302335768----8757999999999972997--799976888988----88999867998 Q gi|254780214|r 205 IIDALKNGLH----SSHFSLSESLKKIELINPK--NAILTHMHVDLD----YDMVLKSTPSR 256 (271) Q Consensus 205 I~d~l~~~~~----~~H~~~~ea~~~~~~~~~k--~~~LtHls~~~~----~~~~~~~~~~~ 256 (271) .+|++.-.+. .-+-.++++.+++.....+ |+++.+-+..+. ..++++++... T Consensus 227 yLDtTycnp~y~Fpsq~esvq~v~~~i~~~~~~~~~~Li~v~~ysiGkE~l~~eia~~l~~k 288 (481) T KOG1361 227 YLDTTYCNPKYDFPSQEESVQEVVDVIRSHASKNDRVLIVVGTYSIGKEKLLLEIARILNSK 288 (481) T ss_pred EEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEEECCHHHHHHHHHHHHCCC T ss_conf 87501017777885578899999999975554077427999887514057999999985883 No 38 >COG0426 FpaA Uncharacterized flavoproteins [Energy production and conversion] Probab=98.22 E-value=2.7e-06 Score=58.74 Aligned_cols=121 Identities=16% Similarity=0.184 Sum_probs=72.5 Q ss_pred EEEEEECCCCCCCEEEEEECC-CCHHHHHHH-----CCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEE Q ss_conf 699993367789718999879-841566752-----23342059999743524220111024333178211101311202 Q gi|254780214|r 41 SLKISRVSERGSNTTVIVDTG-PDFYMQVLR-----EQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDC 114 (271) Q Consensus 41 s~~i~~~~~~~~~~~iLiD~G-~d~r~ql~~-----~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~ 114 (271) |-||. +++.+|||++ +.+..+++. .++++||+|+++|.-.||.+.|++|.... ....|++++.. T Consensus 38 SYLI~------~~k~aLID~~~~~f~~~~l~~l~~~id~k~iDYIi~~H~ePDhsg~l~~ll~~~----p~a~ii~s~~~ 107 (388) T COG0426 38 SYLIV------GDKTALIDTVGEKFFDEYLENLSKYIDPKEIDYIIVNHTEPDHSGSLPELLELA----PNAKIICSKLA 107 (388) T ss_pred EEEEE------CCCEEEECCCCCCHHHHHHHHHHHHCCHHCCEEEEECCCCCCHHHHHHHHHHHC----CCCEEEEEHHH T ss_conf 58996------796799889980169999999986348001719997888963354699999868----89789963899 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEEECCCEECCCCCC-CCEEEECC Q ss_conf 3210022221013667753100123211234433342011587489999995077621123346-86253032 Q gi|254780214|r 115 MKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICMKSAGGVIEAIPILQQHGRISSLGFRF-GNVAYCTD 186 (271) Q Consensus 115 ~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri-~~~~Y~~D 186 (271) .+.|+..+.. .. + +............+..+++.+.+-.|-+-.-+-|.- .++.|+.| T Consensus 108 ~~~L~~~~~~----~~--------~---~~ivk~Gd~ldlGg~tL~Fi~ap~LHWPd~m~TYd~~~kILFS~D 165 (388) T COG0426 108 ARFLKGFYHD----PE--------W---FKIVKTGDTLDLGGHTLKFIPAPFLHWPDTMFTYDPEDKILFSCD 165 (388) T ss_pred HHHHHHHCCC----CC--------C---EEECCCCCEECCCCCEEEEEECCCCCCCCCEEEEECCCCEEECCC T ss_conf 9999974488----54--------1---363478877644894799995787779873367624775777146 No 39 >KOG0813 consensus Probab=97.76 E-value=0.00013 Score=48.28 Aligned_cols=36 Identities=28% Similarity=0.385 Sum_probs=26.3 Q ss_pred CCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECC Q ss_conf 420599997435242201110243331782111013112 Q gi|254780214|r 74 LSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAP 112 (271) Q Consensus 74 ~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~ 112 (271) .+|.+|+.||.|+||.+|+.+|.-.. ...+.+|+.. T Consensus 50 ~~l~~Il~THhH~DHsGGn~~i~~~~---~~~~~v~g~~ 85 (265) T KOG0813 50 RRLTAILTTHHHYDHSGGNEDIKREI---PYDIKVIGGA 85 (265) T ss_pred CCEEEEEECCCCCCCCCCHHHHHHHC---CCCCEEECCC T ss_conf 76468995222421467678887413---5782796687 No 40 >PRK05452 anaerobic nitric oxide reductase flavorubredoxin; Provisional Probab=97.55 E-value=0.00016 Score=47.77 Aligned_cols=122 Identities=16% Similarity=0.217 Sum_probs=70.5 Q ss_pred EEEEEECCCCCCCEEEEEECC-CCHHHHHHH-----CCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEE Q ss_conf 699993367789718999879-841566752-----23342059999743524220111024333178211101311202 Q gi|254780214|r 41 SLKISRVSERGSNTTVIVDTG-PDFYMQVLR-----EQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDC 114 (271) Q Consensus 41 s~~i~~~~~~~~~~~iLiD~G-~d~r~ql~~-----~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~ 114 (271) |-||. +++..|||+- ..+..+++. -++++||+|++.|.-.||.+.|+.|... -...+||+++.. T Consensus 37 SYLI~------deK~aLIDtv~~~f~~e~l~~L~~~id~~~IDYIIvnH~EpDHSGsL~~Lle~----~P~~~Iv~s~~a 106 (479) T PRK05452 37 SYLIR------EEKNVLIDTVDHKFSREFVQNLRMEIDLADIDYIVINHAEEDHAGALTELMAQ----IPDTPIYCTANA 106 (479) T ss_pred EEEEC------CCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHH----CCCCEEEECHHH T ss_conf 36984------89889992997628999999999745966798899689997547689999998----899999987899 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECC-CCCEEEEEEEEECCCEECCCCCC-CCEEEECC Q ss_conf 32100222210136677531001232112344333420115-87489999995077621123346-86253032 Q gi|254780214|r 115 MKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPICMKSA-GGVIEAIPILQQHGRISSLGFRF-GNVAYCTD 186 (271) Q Consensus 115 ~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~i~i~~~~~~Hg~~~~~Gyri-~~~~Y~~D 186 (271) .+.|...+.. .+ .+..+++.. ...... +..+++.+.+..|=+-..+-|.- .++.|+.| T Consensus 107 ~~~l~~~~~~----~~-------~~~~vVk~G---d~LdLG~GktL~Fi~tPmlHWPDsM~TY~~edkILFS~D 166 (479) T PRK05452 107 IDSINGHHHH----PE-------WNFNVVKTG---DTLDIGNGKQLIFVETPMLHWPDSMMTYLTGDAVLFSND 166 (479) T ss_pred HHHHHHHHCC----CC-------CCEEEECCC---CEEECCCCCEEEEEECCCCCCCCCEEEEECCCCEEEECC T ss_conf 9999985278----76-------673784689---857269984689961898778873268862675686125 No 41 >PRK11921 metallo-beta-lactamase/flavodoxin domain-containing protein; Provisional Probab=97.45 E-value=0.00025 Score=46.47 Aligned_cols=116 Identities=19% Similarity=0.226 Sum_probs=69.8 Q ss_pred CCEEEEEECC-CCHHHHHHH-----CCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCC Q ss_conf 9718999879-841566752-----2334205999974352422011102433317821110131120232100222210 Q gi|254780214|r 52 SNTTVIVDTG-PDFYMQVLR-----EQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYC 125 (271) Q Consensus 52 ~~~~iLiD~G-~d~r~ql~~-----~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~ 125 (271) +++..|||+- +.+..+++. .+.++||.|++.|.-.||.+.|+.|.. .-....|++++...+.|++.|.. T Consensus 40 ~ek~aLiDtv~~~f~~e~l~~l~~vi~~~~IdYiVvnH~EPDHsg~l~~ll~----~~p~~~vv~s~~~~~~l~~~~~~- 114 (395) T PRK11921 40 DEKTVLIDTVWAPFAKEFVENLKKEIDLDKIDYIVANHGEIDHSGALPELMK----EIPDTPIYCTANGAKSLKGHYHQ- 114 (395) T ss_pred CCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHH----HCCCCEEEECHHHHHHHHHHHCC- T ss_conf 8978999289843899999999964596569999938999757899999999----88999999878999999987288- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEEECCCEECCCCCC-CCEEEECC Q ss_conf 13667753100123211234433342011587489999995077621123346-86253032 Q gi|254780214|r 126 FKALDDRTYPPIVNPIVIENNDVPICMKSAGGVIEAIPILQQHGRISSLGFRF-GNVAYCTD 186 (271) Q Consensus 126 f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyri-~~~~Y~~D 186 (271) . .+..+++.. ......+..+++...+-.|=+-..+-|.- +++.|+.| T Consensus 115 -----~------~~~~~Vk~g---d~L~LG~~tL~F~~tP~lHWPdtm~Ty~~edkiLFs~D 162 (395) T PRK11921 115 -----D------WNFVVVKTG---DRLEIGSKELIFIEAPMLHWPDSMFCYLTGDNILFSND 162 (395) T ss_pred -----C------CCEEEECCC---CEEECCCCEEEEEECCCCCCCCCEEEEECCCCEEECCC T ss_conf -----8------773896789---88842896699997898878884379975787786035 No 42 >KOG3798 consensus Probab=96.95 E-value=0.0064 Score=37.74 Aligned_cols=153 Identities=16% Similarity=0.194 Sum_probs=83.1 Q ss_pred CCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 23342059999743524220111024333178211101311202321002222101366775310012321123443334 Q gi|254780214|r 71 EQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIENNDVPI 150 (271) Q Consensus 71 ~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~~~~~~~ 150 (271) ..+.++|-+..||.|.||.- ..++.. +....+..=++|.+.+..+... +.. ........+ .. T Consensus 128 ~~~p~~d~~~vsh~h~dhld-~~~~~~--~~~~~~~~wfvp~g~k~~m~~~---------gc~-----~v~el~wwe-~~ 189 (343) T KOG3798 128 EDLPDLDFAVVSHDHYDHLD-ADAVKK--ITDRNPQIWFVPLGMKKWMEGD---------GSS-----TVTELNWGE-SS 189 (343) T ss_pred CCCCCCCEECCCCCCCCCCC-HHHHHH--HHCCCCCCEECHHHHHHEECCC---------CCC-----CEEEEECCH-HH T ss_conf 37899663034366523236-688876--5056852103032220124378---------877-----036750100-22 Q ss_pred CEECCCCCEEEEEEEEECCCEECC---------CCCC----CCEEEECCCCCCHHH---HHHHHCCHHEEEEECCCCCCC Q ss_conf 201158748999999507762112---------3346----862530323300167---888622211025302335768 Q gi|254780214|r 151 CMKSAGGVIEAIPILQQHGRISSL---------GFRF----GNVAYCTDVNAFPAE---SLEKLQNLDFLIIDALKNGLH 214 (271) Q Consensus 151 ~~~~~~~~i~i~~~~~~Hg~~~~~---------Gyri----~~~~Y~~Dt~~~~e~---~~~~l~~~D~lI~d~l~~~~~ 214 (271) .....+..+++...+..|-.--++ ++.+ .++.+.+||.+.+.+ .-+.+.-+|+..+-+-.+.+. T Consensus 190 ~~vkn~~~~ti~~tPaqHw~~R~L~D~Nk~LW~sw~v~g~~nrfffaGDTGyc~~~F~~IgerfGpfdLAaiPiGaYePr 269 (343) T KOG3798 190 EFVKNGKTYTIWCLPAQHWGQRGLFDRNKRLWSSWAVIGENNRFFFAGDTGYCDGEFKKIGERFGPFDLAAIPIGAYEPR 269 (343) T ss_pred CEECCCCEEEEEECCHHHHCCCCCCCCCCCEEEEEEEECCCCEEEECCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCCH T ss_conf 05118957999975324312454334786217755786378458746877766279999987648763330224456834 Q ss_pred ----CCCCCHHHHHHHHHHCCCCEEEEECCC Q ss_conf ----875799999999997299779997688 Q gi|254780214|r 215 ----SSHFSLSESLKKIELINPKNAILTHMH 241 (271) Q Consensus 215 ----~~H~~~~ea~~~~~~~~~k~~~LtHls 241 (271) +-|.+.+||+++-+.+++|+.+-+|-. T Consensus 270 WfmK~~HInPeEav~Ihkdv~arns~gIHWG 300 (343) T KOG3798 270 WFMKSQHINPEEAVEIHKDVRAKNSIGIHWG 300 (343) T ss_pred HHCCCCCCCHHHHHHHHHHHHHHCCEEEEEE T ss_conf 3245446998999999887754110367663 No 43 >COG2015 Alkyl sulfatase and related hydrolases [Secondary metabolites biosynthesis, transport, and catabolism] Probab=96.62 E-value=0.0021 Score=40.78 Aligned_cols=66 Identities=23% Similarity=0.393 Sum_probs=40.2 Q ss_pred CCCEEEEEECC--C-CHHHHH--HH--CCCCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEE Q ss_conf 89718999879--8-415667--52--2334205999974352422011102433317821110131120232 Q gi|254780214|r 51 GSNTTVIVDTG--P-DFYMQV--LR--EQVLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMK 116 (271) Q Consensus 51 ~~~~~iLiD~G--~-d~r~ql--~~--~~i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~ 116 (271) ++..-|+||.= | ..+..| .+ -+-+.|.+|+.||.|.||++|.-.+-.-.-....+++|.+|++-++ T Consensus 133 Gdtg~IViDpL~t~~tA~aAldl~~~~~g~rPV~aVIYtHsH~DHfGGVkGiv~eadV~sGkV~iiAP~GFme 205 (655) T COG2015 133 GDTGWIVIDPLVTPETAKAALDLYNQHRGQRPVVAVIYTHSHSDHFGGVKGIVSEADVKSGKVQIIAPAGFME 205 (655) T ss_pred CCCCEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCEEECCCHHHCCCCCEEEECCHHHHH T ss_conf 7852699815678288999999999765889748999612542013771001578884667413764605789 No 44 >KOG0814 consensus Probab=94.46 E-value=0.11 Score=30.15 Aligned_cols=53 Identities=26% Similarity=0.355 Sum_probs=32.1 Q ss_pred EEEECCCCCCCEEEEEECCCCH--H-HHHHHC-CCCCCCEEEEECCCHHHHHHCCHHHHH Q ss_conf 9993367789718999879841--5-667522-334205999974352422011102433 Q gi|254780214|r 43 KISRVSERGSNTTVIVDTGPDF--Y-MQVLRE-QVLSIDAVLYTHPHADHIHGIDGLRGY 98 (271) Q Consensus 43 ~i~~~~~~~~~~~iLiD~G~d~--r-~ql~~~-~i~~Id~I~iTH~H~DHi~Gl~~Lr~~ 98 (271) ++..+.. .++-++||.=-+. | .|+++. ++ ++-.-+=||.|+|||.|-.-|+.. T Consensus 24 Yll~d~~--~~~AviIDPV~et~~RD~qlikdLgl-~LiYa~NTH~HADHiTGtg~Lkt~ 80 (237) T KOG0814 24 YLLGDHK--TGKAVIIDPVLETVSRDAQLIKDLGL-DLIYALNTHVHADHITGTGLLKTL 80 (237) T ss_pred EEEEECC--CCCEEEECCHHHCCCCHHHHHHHCCC-EEEEEECCEEECCCCCCCCHHHHH T ss_conf 8855078--88558964056325305799986180-014423040320330163337776 No 45 >KOG3592 consensus Probab=92.60 E-value=0.11 Score=30.20 Aligned_cols=50 Identities=30% Similarity=0.592 Sum_probs=38.2 Q ss_pred CEEEEEECCCCCCCEEEEEECCCC---HHHHHHHCCCCCCCEEEEECCCHHHHHHCCHH Q ss_conf 069999336778971899987984---15667522334205999974352422011102 Q gi|254780214|r 40 SSLKISRVSERGSNTTVIVDTGPD---FYMQVLREQVLSIDAVLYTHPHADHIHGIDGL 95 (271) Q Consensus 40 ~s~~i~~~~~~~~~~~iLiD~G~d---~r~ql~~~~i~~Id~I~iTH~H~DHi~Gl~~L 95 (271) .|+|... +|.+||++-|.+ ....|.|+ +.+||+|+|||--.|-..||..| T Consensus 49 daALFav-----nGf~iLv~GgserKS~fwklVrH-ldrVdaVLLthpg~dNLpginsl 101 (934) T KOG3592 49 DAALFAV-----NGFNILVNGGSERKSCFWKLVRH-LDRVDAVLLTHPGADNLPGINSL 101 (934) T ss_pred CCEEEEE-----CCEEEEECCCCCCCCCHHHHHHH-HHHHHHHHHCCCCCCCCCCCHHH T ss_conf 5015752-----46478615776666506888887-76434654016666766550278 No 46 >COG0391 Uncharacterized conserved protein [Function unknown] Probab=87.81 E-value=2.1 Score=22.14 Aligned_cols=208 Identities=14% Similarity=0.133 Sum_probs=87.4 Q ss_pred CCCEEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCC-CCCEEEEEECCCCCCCEEEEEECC----CCHHHHHHHCCCC- Q ss_conf 9872599998541789651337887877778767744-430699993367789718999879----8415667522334- Q gi|254780214|r 1 MKDYYHFTILGCGASLGVPRITGDWGACDPTNPKNRR-TRSSLKISRVSERGSNTTVIVDTG----PDFYMQVLREQVL- 74 (271) Q Consensus 1 m~~~m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R-~~~s~~i~~~~~~~~~~~iLiD~G----~d~r~ql~~~~i~- 74 (271) |...+++++||-|+ |.|..--.-. ..- .+-++.++...+-|...++..|.| .|+|.-+.-.+++ T Consensus 4 l~~~~kvvvlgGGt--Gl~~lL~gLk--------~~~~~~iTaIVtvaDdggssG~lr~~~~~~~~GD~rn~l~al~~~~ 73 (323) T COG0391 4 LAKKPKVVVLGGGT--GLPKLLSGLK--------RLLPSEITAIVTVADDGGSSGRLRLDTGLYPPGDLRNCLAALGIDE 73 (323) T ss_pred CCCCCEEEEECCCC--CHHHHHHHHH--------HHCCCEEEEEEEECCCCCCCCEEEEECCCCCCHHHHHHHHHHCCCC T ss_conf 55485499986877--7889999877--------6167159999996556776760200017978712677888743678 Q ss_pred -------CC-CEEEEECC--CHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf -------20-59999743--524220111024333178211101311202321002222101366775310012321123 Q gi|254780214|r 75 -------SI-DAVLYTHP--HADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFESFRYCFKALDDRTYPPIVNPIVIE 144 (271) Q Consensus 75 -------~I-d~I~iTH~--H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~~~~y~f~~~~~~~~~~~~~~~~~~ 144 (271) ++ ..-|=-+. =.||-.|--.|+....... .+....+.+.+.+. ...+.+|....+..+. T Consensus 74 e~~~~~e~L~qyrf~~~~g~L~gh~lgnl~l~a~~~~~~------~~~~Ai~~~~~~l~-----v~~~vlP~sdd~v~l~ 142 (323) T COG0391 74 ETFRTHERLFQYRFGEGNGELGGHDLGNLMLAALSLISG------SLSEAIDALSKLLG-----VKGRVLPMSDDPVDLV 142 (323) T ss_pred CHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCC------CHHHHHHHHHHHHC-----CCCEEEECCCCCEEEE T ss_conf 406679999860537888723675313489998874058------88999999999867-----8856954678860689 Q ss_pred CCCCCCCEECCCCCEEEEEEEEECCCEECCCCC-CCCEEEEC-CCCCCHHHHHHHHCCHHEEEEEC-CCCCCCCCCCCHH Q ss_conf 443334201158748999999507762112334-68625303-23300167888622211025302-3357688757999 Q gi|254780214|r 145 NNDVPICMKSAGGVIEAIPILQQHGRISSLGFR-FGNVAYCT-DVNAFPAESLEKLQNLDFLIIDA-LKNGLHSSHFSLS 221 (271) Q Consensus 145 ~~~~~~~~~~~~~~i~i~~~~~~Hg~~~~~Gyr-i~~~~Y~~-Dt~~~~e~~~~~l~~~D~lI~d~-l~~~~~~~H~~~~ 221 (271) . +...+...+ .-+. .+...|.. +.++.|.+ .--.-+++.++.++++|+.|+.- .....---++.++ T Consensus 143 a-------~~~dG~~~v-~gE~---~i~~~~~~~v~~V~~~~~~~~~a~~eaveAI~~AD~IviGPgSl~TSIlP~Lllp 211 (323) T COG0391 143 A-------ETEDGRRIV-FGES---WIAELGGPPVHRVRLEGPEKPSAAPEAVEAIKEADLIVIGPGSLFTSILPILLLP 211 (323) T ss_pred E-------ECCCCCEEE-EEEE---CHHHCCCCCCEEEEEECCCCCCCCHHHHHHHHHCCEEEECCCCCHHHHCHHHCCH T ss_conf 9-------836785778-4013---0644389974079983688898889999999858989986883276643030105 Q ss_pred HHHHHHHHCCCCEEEEECC Q ss_conf 9999999729977999768 Q gi|254780214|r 222 ESLKKIELINPKNAILTHM 240 (271) Q Consensus 222 ea~~~~~~~~~k~~~LtHl 240 (271) +..+..++..+++++..-+ T Consensus 212 ~I~eaLr~~~ap~i~v~n~ 230 (323) T COG0391 212 GIAEALRETVAPIVYVCNL 230 (323) T ss_pred HHHHHHHHCCCCEEEECCC T ss_conf 7999998678977986147 No 47 >KOG2121 consensus Probab=87.81 E-value=0.24 Score=28.02 Aligned_cols=67 Identities=15% Similarity=0.157 Sum_probs=45.7 Q ss_pred CEEEEEECCCCHHHHHHH--CCCCCCCEEEEECCCHHHHHHCCHHHH-HH-HCCCCEEEEEECCEEEEEEC Q ss_conf 718999879841566752--233420599997435242201110243-33-17821110131120232100 Q gi|254780214|r 53 NTTVIVDTGPDFYMQVLR--EQVLSIDAVLYTHPHADHIHGIDGLRG-YF-LKQKRPIDVYAAPDCMKHLF 119 (271) Q Consensus 53 ~~~iLiD~G~d~r~ql~~--~~i~~Id~I~iTH~H~DHi~Gl~~Lr~-~~-~~~~~~i~vy~~~~~~~~l~ 119 (271) -++.++.||++.+.-+-+ ..++++|.||+|-.+|+-++|++.|.- .. .+...++.+|||+.....+. T Consensus 73 ~~~~~~n~Geg~qr~~~ehk~~~sk~~~iflt~~~w~~~GglpGl~ltl~~~G~~g~~~l~gP~~l~~~l~ 143 (746) T KOG2121 73 RKRFIFNCGEGTQRLLTEHKIKLSKLDSIFLTRVCWSSCGGLPGLLLTLADIGEPGPVVLHGPSDLNYILS 143 (746) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCHHHHCCCCCCEEEHHHCCCCCCCCCCCCHHHHHHHH T ss_conf 22443000178999998866642124443750244887478864155464407898644028636889999 No 48 >KOG1138 consensus Probab=87.54 E-value=0.79 Score=24.77 Aligned_cols=39 Identities=15% Similarity=0.148 Sum_probs=24.3 Q ss_pred CCCCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEE Q ss_conf 34205999974352422011102433317821110131120232 Q gi|254780214|r 73 VLSIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMK 116 (271) Q Consensus 73 i~~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~ 116 (271) .+.||.|+||..|. ..|||.+ .-+.+-.-.||+++.|.+ T Consensus 94 ~stiDvILISNy~~--mlgLPfi---TentGF~gkiY~TE~t~q 132 (653) T KOG1138 94 ASTIDVILISNYMG--MLGLPFI---TENTGFFGKIYATEPTAQ 132 (653) T ss_pred CCCEEEEEECCHHH--HCCCCEE---ECCCCCEEEEEEECHHHH T ss_conf 34303999716002--0056336---417883268998224899 No 49 >cd00530 PTE Phosphotriesterase (PTE) catalyzes the hydrolysis of organophosphate nerve agents, including the chemical warfare agents VX, soman, and sarin as well as the insecticide paraoxon. PTE exists as a homodimer with one active site per monomer. The active site is located next to a binuclear metal center, at the C-terminal end of a TIM alpha- beta barrel motif. The native enzyme contains two zinc ions at the active site however these can be replaced with other metals such as cobalt, cadmium, nickel or manganese and the enzyme remains active. Probab=86.70 E-value=1.3 Score=23.36 Aligned_cols=45 Identities=27% Similarity=0.376 Sum_probs=33.9 Q ss_pred HHHHHHHHHHCC--CCEEEEECCCCCCCHHHHHHHCCCCCEEEEECC Q ss_conf 999999999729--977999768889888899986799819902044 Q gi|254780214|r 220 LSESLKKIELIN--PKNAILTHMHVDLDYDMVLKSTPSRVVPAFDGM 264 (271) Q Consensus 220 ~~ea~~~~~~~~--~k~~~LtHls~~~~~~~~~~~~~~~v~~A~DGm 264 (271) ..|.+++.++.+ +.|+++.|+....|.+...+.+..++.++||+. T Consensus 163 ~~e~~~il~~~G~d~~rvvigH~D~~~D~~~~~~la~~G~~l~fD~~ 209 (293) T cd00530 163 GLEQLRILEEEGVDPSKVVIGHLDRNDDPDYLLKIAALGAYLEFDGI 209 (293) T ss_pred CHHHHHHHHHCCCCHHHEEEECCCCCCCHHHHHHHHHCCCEEEECCC T ss_conf 09999999985989225789636898999999999976988987778 No 50 >TIGR01970 DEAH_box_HrpB ATP-dependent helicase HrpB; InterPro: IPR010225 This entry represents HrpB, one of two related predicted DEAH-box ATP-dependent helicases of unknown function found in many proteobacteria, but also in a few species of other lineages. The member from Rhizobium meliloti, designated HelO, has been studied but is not essential for growth and mutants have no obvious phenotype . HrpB is typically about 800 residues in length, while its paralog HrpA (IPR010222 from INTERPRO), also uncharacterised, is about 1300 amino acids long. Related characterised eukaryotic proteins are RNA helicases associated with pre-mRNA processing .. Probab=82.67 E-value=2 Score=22.22 Aligned_cols=54 Identities=9% Similarity=0.302 Sum_probs=33.0 Q ss_pred CCCHHHHHHHCCC--CCCCEEEEECCCHHHHHHCCHHH-HHHH------CCCCEEEEEECCEEEE Q ss_conf 9841566752233--42059999743524220111024-3331------7821110131120232 Q gi|254780214|r 61 GPDFYMQVLREQV--LSIDAVLYTHPHADHIHGIDGLR-GYFL------KQKRPIDVYAAPDCMK 116 (271) Q Consensus 61 G~d~r~ql~~~~i--~~Id~I~iTH~H~DHi~Gl~~Lr-~~~~------~~~~~i~vy~~~~~~~ 116 (271) .+|+.+.|+-.++ .-|+.|||==||-=|+.+ ||. .+.. +..-|+.|..-..|++ T Consensus 98 TEGiLTRMlQ~DP~L~GVg~liFDEFHERsL~a--DLaLALaLdVQs~LRdDPPLkil~MSATLd 160 (858) T TIGR01970 98 TEGILTRMLQDDPELEGVGLLIFDEFHERSLDA--DLALALALDVQSALRDDPPLKILIMSATLD 160 (858) T ss_pred ECCHHHHHCCCCCCCCCCCEEEECHHHHHHHHH--HHHHHHHHHHHHHHCCCCCCEEHHCCCCCH T ss_conf 232687740168884535232311022434678--899999885334310686400000002632 No 51 >COG1735 Php Predicted metal-dependent hydrolase with the TIM-barrel fold [General function prediction only] Probab=75.73 E-value=6 Score=19.27 Aligned_cols=44 Identities=18% Similarity=0.184 Sum_probs=28.9 Q ss_pred HHHHHHHHHC--CCCEEEEECCCCCCCHHHHHHHC-CCCCEEEEECC Q ss_conf 9999999972--99779997688898888999867-99819902044 Q gi|254780214|r 221 SESLKKIELI--NPKNAILTHMHVDLDYDMVLKST-PSRVVPAFDGM 264 (271) Q Consensus 221 ~ea~~~~~~~--~~k~~~LtHls~~~~~~~~~~~~-~~~v~~A~DGm 264 (271) .|.+++..+- .+.++.+-||+++.+--...+++ ..++.++||+- T Consensus 179 ~eq~~il~~egvdl~~v~igH~d~n~dd~~y~~~l~~~Ga~l~fD~i 225 (316) T COG1735 179 LEQLRILAEEGVDLRKVSIGHMDPNTDDVYYQKKLADRGAFLEFDRI 225 (316) T ss_pred HHHHHHHHHCCCCHHHEEEECCCCCCCHHHHHHHHHHCCCEEEECCC T ss_conf 99999999729986673673258999739999999845856873245 No 52 >KOG4736 consensus Probab=75.10 E-value=2.7 Score=21.40 Aligned_cols=41 Identities=32% Similarity=0.443 Sum_probs=30.4 Q ss_pred CCCCEEEEEECCCCHHHHHHHCCC--CCCCEEEEECCCHHHHHHCC Q ss_conf 789718999879841566752233--42059999743524220111 Q gi|254780214|r 50 RGSNTTVIVDTGPDFYMQVLREQV--LSIDAVLYTHPHADHIHGID 93 (271) Q Consensus 50 ~~~~~~iLiD~G~d~r~ql~~~~i--~~Id~I~iTH~H~DHi~Gl~ 93 (271) +.++.-+++|.|-. .|.++++ ++|+.+.+||.|.+|..++. T Consensus 101 ~d~~~v~v~~~gls---~lak~~vt~d~i~~vv~t~~~~~hlgn~~ 143 (302) T KOG4736 101 VDGGDVVVVDTGLS---VLAKEGVTLDQIDSVVITHKSPGHLGNNN 143 (302) T ss_pred ECCCCEEEEECCCC---HHHHCCCCHHHCCEEEEECCCCCCCCCCC T ss_conf 12883499955872---24214767534551687324755426533 No 53 >pfam02126 PTE Phosphotriesterase family. Probab=74.87 E-value=5.7 Score=19.41 Aligned_cols=45 Identities=9% Similarity=0.060 Sum_probs=33.4 Q ss_pred HHHHHHHHHHCC--CCEEEEECCCCCCCHHHHHHHCCCCCEEEEECC Q ss_conf 999999999729--977999768889888899986799819902044 Q gi|254780214|r 220 LSESLKKIELIN--PKNAILTHMHVDLDYDMVLKSTPSRVVPAFDGM 264 (271) Q Consensus 220 ~~ea~~~~~~~~--~k~~~LtHls~~~~~~~~~~~~~~~v~~A~DGm 264 (271) ..|.+++.++.+ ++|+++.|+....|.+...+.+..++.++||+. T Consensus 169 ~~e~l~il~e~Gvd~~rvvigH~D~~~D~~y~~~la~~G~~l~fD~~ 215 (308) T pfam02126 169 GLQQIRILQEEGVDLSRVVIGHCDDIDDLKYLLELAALGCYLGYDLF 215 (308) T ss_pred HHHHHHHHHHCCCCHHHEEEECCCCCCCHHHHHHHHHCCCEEEECCC T ss_conf 19999999985989446788526888998999999974978985356 No 54 >pfam07521 RMMBL RNA-metabolising metallo-beta-lactamase. The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in pfam00753 and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in RNA metabolism. Probab=74.36 E-value=4.1 Score=20.32 Aligned_cols=26 Identities=38% Similarity=0.562 Sum_probs=23.5 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEECC Q ss_conf 87579999999999729977999768 Q gi|254780214|r 215 SSHFSLSESLKKIELINPKNAILTHM 240 (271) Q Consensus 215 ~~H~~~~ea~~~~~~~~~k~~~LtHl 240 (271) .+|.+-+|..++++.++|++++++|= T Consensus 15 SgHAd~~eL~~~i~~~~p~~v~~vHG 40 (43) T pfam07521 15 SGHADQEELLELLKLLKPKKVILVHG 40 (43) T ss_pred ECCCCHHHHHHHHHHCCCCEEEEECC T ss_conf 73689899999999669998999439 No 55 >TIGR02964 xanthine_xdhC xanthine dehydrogenase accessory protein XdhC; InterPro: IPR014308 Members of this protein are the accessory protein XdhC, found in baceria, that is responsible for insertion of the molybdenum cofactor into the xanthine dehydrogenase large chain, XdhB. This protein is not part of the mature xanthine dehydrogenase. Xanthine dehydrogenase is an enzyme for purine catabolism, from other purines to xanthine to urate to further breakdown products.. Probab=67.08 E-value=4.3 Score=20.20 Aligned_cols=38 Identities=21% Similarity=0.197 Sum_probs=26.1 Q ss_pred HHHHC--CCCEEEEECCCCCCCHHHHHHHCCCCCEEEEECCE Q ss_conf 99972--99779997688898888999867998199020448 Q gi|254780214|r 226 KIELI--NPKNAILTHMHVDLDYDMVLKSTPSRVVPAFDGMQ 265 (271) Q Consensus 226 ~~~~~--~~k~~~LtHls~~~~~~~~~~~~~~~v~~A~DGm~ 265 (271) ..+++ +.--+|||| ||.+|.+=..+.|. +--+||=||. T Consensus 179 ~v~~aP~~s~~lvlTH-dHaLD~~L~~~iL~-R~DfAY~GlI 218 (270) T TIGR02964 179 EVAEAPPGSYFLVLTH-DHALDLELCEAILA-RGDFAYFGLI 218 (270) T ss_pred HHHHCCCCCEEEEEEC-CHHHHHHHHHHHHC-CCCEEEEECH T ss_conf 9983799965999605-85899999999962-5990688020 No 56 >cd07187 YvcK_like family of mostly uncharacterized proteins similar to B.subtilis YvcK. One member of this protein family, YvcK from Bacillus subtilis, has been proposed to play a role in carbon metabolism, since its function is essential for growth on intermediates of the Krebs cycle and the pentose phosphate pathway. In general, this family of mostly uncharacterized proteins is related to the CofD-like protein family. CofD has been characterized as a 2-phospho-L-lactate transferase involved in F420 biosynthesis. This family appears to have the same conserved phosphate binding site as the other family in this hierarchy, but a different substrate binding site. Probab=64.50 E-value=11 Score=17.75 Aligned_cols=69 Identities=10% Similarity=0.086 Sum_probs=45.9 Q ss_pred CCCCCCEEEECCCCCCHHHHHHHHCCHHEEEEE-CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 334686253032330016788862221102530-233576887579999999999729977999768889 Q gi|254780214|r 175 GFRFGNVAYCTDVNAFPAESLEKLQNLDFLIID-ALKNGLHSSHFSLSESLKKIELINPKNAILTHMHVD 243 (271) Q Consensus 175 Gyri~~~~Y~~Dt~~~~e~~~~~l~~~D~lI~d-~l~~~~~~~H~~~~ea~~~~~~~~~k~~~LtHls~~ 243 (271) .-+|+++.+....-...++.++.++++|++|+. +..+..--.++-+++..+-+++.+++++++.-+..+ T Consensus 150 ~~~I~~v~l~p~~~~~~~~a~~aI~~ADlIv~gPGSlyTSiiPnLlv~gi~~AI~~s~a~kV~i~Nl~tq 219 (308) T cd07187 150 GSPIKRVFLEPPDPKANPEALEAIEEADLIVYGPGSLYTSILPNLLVKGIAEAIRASKAPKVYICNLMTQ 219 (308) T ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHCCEEEECCCCCHHHHCCHHCCHHHHHHHHHCCCCEEEEEECCCC T ss_conf 8996489976799998999999999799899879840665511422366999999679978999617899 No 57 >TIGR00140 hupD hydrogenase expression/formation protein; InterPro: IPR004419 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to the MEROPS peptidase family A31 (HybD endopeptidase family, clan AE). The family contains the HyaD and HybD endopeptidases and their homologues. ; GO: 0046872 metal ion binding, 0016485 protein processing. Probab=59.79 E-value=11 Score=17.71 Aligned_cols=55 Identities=27% Similarity=0.454 Sum_probs=32.5 Q ss_pred HHHHHHHCCHHEE-EEECCCCC-CCCC---------------------C-CCHHHHHHHHHHCC--CCEEEEECCCCCCC Q ss_conf 6788862221102-53023357-6887---------------------5-79999999999729--97799976888988 Q gi|254780214|r 192 AESLEKLQNLDFL-IIDALKNG-LHSS---------------------H-FSLSESLKKIELIN--PKNAILTHMHVDLD 245 (271) Q Consensus 192 e~~~~~l~~~D~l-I~d~l~~~-~~~~---------------------H-~~~~ea~~~~~~~~--~k~~~LtHls~~~~ 245 (271) -.++..++.+|.| |+||..++ ..|+ | ..+.|++.+++-.+ ||+++|.-+-|.++ T Consensus 46 ~~L~~~~~~~d~Liv~DAv~~G~~~pGt~~~~~dd~vp~~~Ga~K~S~Hq~g~~~vL~~~~~~G~~P~~~~liG~~P~~~ 125 (156) T TIGR00140 46 LYLLGLIEEADKLIVLDAVDYGGLEPGTLYILRDDEVPKFLGAKKMSLHQTGFQEVLALAELLGHLPKELVLIGVQPEVE 125 (156) T ss_pred HHHHHHHHHCCCEEEEEEEECCCCCCEEEEEEECCCCHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCCEEEEEEEEEEE T ss_conf 89999987379168997664579995068985255340330502367450028999999986056863038873346774 Q ss_pred H Q ss_conf 8 Q gi|254780214|r 246 Y 246 (271) Q Consensus 246 ~ 246 (271) . T Consensus 126 l 126 (156) T TIGR00140 126 L 126 (156) T ss_pred E T ss_conf 2 No 58 >TIGR02817 adh_fam_1 zinc-binding alcohol dehydrogenase family protein; InterPro: IPR014182 Members of this entry form a distinct subset of the larger family of oxidoreductases that includes zinc-binding alcohol dehydrogenases and NADPH:quinone reductases. Some current members of this entry carry designations as putative alginate lyase, however none have been characterised as such.. Probab=58.95 E-value=3 Score=21.17 Aligned_cols=44 Identities=14% Similarity=0.166 Sum_probs=27.1 Q ss_pred CEEEEEECCCCCCCEEEEEECCCCHHHHHHHCCCCCCCEEEEECCCHHHH Q ss_conf 06999933677897189998798415667522334205999974352422 Q gi|254780214|r 40 SSLKISRVSERGSNTTVIVDTGPDFYMQVLREQVLSIDAVLYTHPHADHI 89 (271) Q Consensus 40 ~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~~i~~Id~I~iTH~H~DHi 89 (271) |+.++...| -...||=+..+..||.+-+++.+++||=| -|-|-. T Consensus 188 s~~Wv~~LG-----AH~VIDHskPL~~ql~~L~l~~v~~V~Sl-T~TDqH 231 (338) T TIGR02817 188 SQEWVLELG-----AHHVIDHSKPLKAQLEKLGLEAVSYVFSL-THTDQH 231 (338) T ss_pred HHHHHHHCC-----CCEEECCCCCHHHHHHHHCCCCCCEEECC-CHHHHH T ss_conf 999999739-----91886588436899998288998557516-607899 No 59 >TIGR00143 hypF [NiFe] hydrogenase maturation protein HypF; InterPro: IPR004421 The large subunit of [NiFe]-hydrogenase, as well as other nickel metalloenzymes, is synthesized as a precursor devoid of the metalloenzyme active site. This precursor then undergoes a complex post-translational maturation process that requires a number of accessory proteins , , . Members of the HypF family are accessory proteins involved in hydrogenase maturation. They contain the following domains: acylphosphatase, zinc fingers (2 repeats), a YrdC-like domain, and a C-terminal domain with a putative O-carbamoyltransferase motif. The presence of CO and CN- ligands of the active site iron atoms is essential for [NiFe]-hydrogenase enzyme activity . Both ligands have been suggested to originate from carbamoylphosphate , which is required for maturation of [NiFe]-hydrogenases . Escherichia coli HypF interacts with carbamoylphosphate as a substrate and releases inorganic phosphate . In addition, HypF also cleaves ATP into AMP and pyrophosphate in the presence of carbamoylphosphate. This, and the fact that HypF catalyzes a carbamoylphosphate-dependent pyrophosphate ATP exchange reaction, suggest that the protein catalyzes the activation of carbamoylphosphate . The mechanism of action of HypF, as well as of its individual domains, is not yet clear. Mutations in any of the three major signature motifs, the acylphosphatase, the zinc fingers, and the O-carbamoyltransferase motif, can block carbamoylphosphate phosphatase activity. This indicates an integrated cooperativity between these domains in the cleavage reaction . The N-terminal acylphosphatase (ACP) domain is thought to support the conversion of carbamoylphosphate into CO and CN- , . Biochemical results demonstrating its ACP activity are not available , . ACPs are small enzymes that specifically catalyze the hydrolysis of carboxylphosphate bonds in acylphosphates, including carbamoylphosphate . Zinc fingers have been implicated in bivalent cation binding or as part of a chaperone domain interacting with the large subunit precursor, but experimental studies on such a function are lacking thus far. The YrdC-like domain is present in protein families with regulatory functions (IPR012200 from INTERPRO, IPR010923 from INTERPRO) and has been implicated in RNA binding . It is not clear what function it may have in members of the HypF family. A C-terminal domain is distantly related to peptidase M22, but contains a conserved O-carbamoyltransferase motif required for the carbamoylphosphate phosphatase activity . The function of this domain is not clear. Nomenclature note: the following names are used as synonyms of HypF: HupY in Azotobacter chroococcum, HupN in Rhizobium leguminosarum, HydA in Escherichia coli. In other organisms, these names are used to designate various "hydrogenase cluster" proteins unrelated to the members of this family. ; GO: 0030528 transcription regulator activity. Probab=58.48 E-value=9.9 Score=17.93 Aligned_cols=75 Identities=19% Similarity=0.205 Sum_probs=43.9 Q ss_pred EECCCCCCHHH------HHHHH------CCHHEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCE-EEEECCCCCCCHHHH Q ss_conf 30323300167------88862------221102530233576887579999999999729977-999768889888899 Q gi|254780214|r 183 YCTDVNAFPAE------SLEKL------QNLDFLIIDALKNGLHSSHFSLSESLKKIELINPKN-AILTHMHVDLDYDMV 249 (271) Q Consensus 183 Y~~Dt~~~~e~------~~~~l------~~~D~lI~d~l~~~~~~~H~~~~ea~~~~~~~~~k~-~~LtHls~~~~~~~~ 249 (271) |+|||+.++-. .++++ +..|++|+| .||.-.|-.=|-+++++.+++- .|.-|+.|-+. --. T Consensus 430 h~Gdls~~~t~n~~fkeA~~~~~~~~d~~~~~~iv~D-----~HP~y~st~~Ae~l~~~fs~~~~~VQHHhAHalA-~~a 503 (799) T TIGR00143 430 HIGDLSKYETYNKFFKEAVNFLLRLYDFEPIDVIVCD-----LHPQYNSTQYAEELAKKFSLPVLRVQHHHAHALA-VMA 503 (799) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEEC-----CCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHH-HHH T ss_conf 5256556303568899999999974189850488864-----8888310789999998718750100147899999-998 Q ss_pred HHH-----CCCCCE-EEEEC Q ss_conf 986-----799819-90204 Q gi|254780214|r 250 LKS-----TPSRVV-PAFDG 263 (271) Q Consensus 250 ~~~-----~~~~v~-~A~DG 263 (271) .+. +.+-|. +|-|| T Consensus 504 e~~~~~Ph~~~~v~gi~~DG 523 (799) T TIGR00143 504 EAGLQDPHLEEAVIGIALDG 523 (799) T ss_pred HCCCCCCCCCCCEEEEEECC T ss_conf 61788888888668998325 No 60 >TIGR01826 CofD_related conserved hypothetical protein; InterPro: IPR010119 This entry represents a subfamily of conserved hypothetical proteins that forms a sister group to the family of CofD, (IPR010115 from INTERPRO), LPPG:Fo 2-phospho-L-lactate transferase, an enzyme of cytochrome F420 biosynthesis.. Probab=58.10 E-value=14 Score=17.05 Aligned_cols=73 Identities=15% Similarity=0.214 Sum_probs=48.3 Q ss_pred CEE--CCCCC-CCCEEEEC---CCCCCHHHHHHHHCCHHEEEEE-CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 621--12334-68625303---2330016788862221102530-23357688757999999999972997799976888 Q gi|254780214|r 170 RIS--SLGFR-FGNVAYCT---DVNAFPAESLEKLQNLDFLIID-ALKNGLHSSHFSLSESLKKIELINPKNAILTHMHV 242 (271) Q Consensus 170 ~~~--~~Gyr-i~~~~Y~~---Dt~~~~e~~~~~l~~~D~lI~d-~l~~~~~~~H~~~~ea~~~~~~~~~k~~~LtHls~ 242 (271) .++ ..+=+ |.++.-.+ |.....++.++.++.||+.|+. +..++.--.++-+.+..+-.++.+|+++|..-+.. T Consensus 154 ~I~e~~~~~~Pi~rv~LePal~~~~a~a~eAv~aI~eADlIilGPGSLyTSI~PnLLvp~i~~A~~~s~A~kvYV~Nlmt 233 (331) T TIGR01826 154 NIPELKKGGKPIERVRLEPALEDVKALAREAVEAIREADLIILGPGSLYTSIIPNLLVPEIAEALRESKAPKVYVCNLMT 233 (331) T ss_pred EEEHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHCCEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 10040258888048883367888487515899999666927877724577742432167899999728998999733566 No 61 >PRK09875 putative hydrolase; Provisional Probab=55.72 E-value=15 Score=16.80 Aligned_cols=47 Identities=17% Similarity=0.126 Sum_probs=32.0 Q ss_pred HHHHHHHHHHCC--CCEEEEECCCCCCCHHHHHHHCCCCCEEEEECCEE Q ss_conf 999999999729--97799976888988889998679981990204489 Q gi|254780214|r 220 LSESLKKIELIN--PKNAILTHMHVDLDYDMVLKSTPSRVVPAFDGMQF 266 (271) Q Consensus 220 ~~ea~~~~~~~~--~k~~~LtHls~~~~~~~~~~~~~~~v~~A~DGm~i 266 (271) ..|.+++.++.+ +.|+++.|+....|.+...+.+..++.++||+.-. T Consensus 165 g~e~l~il~e~Gvd~~rVvigH~D~~~D~~~~~~la~~G~~l~fD~~G~ 213 (292) T PRK09875 165 GLEQLALLQAHGVDLSRVTVGHCDLKDNLDNILKMIDLGAYVQFDTIGK 213 (292) T ss_pred HHHHHHHHHHCCCCHHHEEEECCCCCCCHHHHHHHHHCCCEEEECCCCC T ss_conf 7999999998498921578965789899899999996597897567873 No 62 >TIGR00658 orni_carb_tr ornithine carbamoyltransferase; InterPro: IPR002292 The ornithine carbamoyltransferases are a family of enzymes that catalyse the production of citrulline from carbamoyl-phosphate and ornithine . This reaction is part of the arginine biosynthetic pathway, and in some organisms the reaction is also part of the arginine deaminase pathway . In most prokaryotes, and some lower eukaryotes, the enzyme is found in the cytoplasm, but in higher eukaryotes, such as mammals, the enzyme is found in the mitochondrial matrix and functions as part of the urea cycle. In most organisms analysed to date, the enzyme has been shown to consist of a trimer of identical or nonidentical subunits . Extensive similarity has been found within the ornithine carbamoyltransferase family, some of the conserved areas being important in catalysis . The C-terminal region of these enzymes shows a degree of similarity to the aspartate carbamoyltransferases, which are also known to bind carbamoyl- phosphate . This region contains a highly conserved Cys residue (in a His-Cys-Lys-Pro motif) implicated in ornithine binding . ; GO: 0004585 ornithine carbamoyltransferase activity, 0006520 amino acid metabolic process, 0009348 ornithine carbamoyltransferase complex. Probab=53.83 E-value=16 Score=16.61 Aligned_cols=48 Identities=21% Similarity=0.356 Sum_probs=22.9 Q ss_pred CCCCCCEEEEEECCCCCCCEEEEEECCCCHHHHHH-HCCC--------CCCCEEEE-ECCCHH Q ss_conf 74443069999336778971899987984156675-2233--------42059999-743524 Q gi|254780214|r 35 NRRTRSSLKISRVSERGSNTTVIVDTGPDFYMQVL-REQV--------LSIDAVLY-THPHAD 87 (271) Q Consensus 35 ~~R~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~-~~~i--------~~Id~I~i-TH~H~D 87 (271) -.|||.|.=+.... -+++-+-++.. | -||= ++.| +-||+|++ .+.|.| T Consensus 56 STRTRvSFEvA~y~--lGg~~lyL~~~-~--~QlG~gEsi~DTARVLsRyVD~I~~R~~~h~~ 113 (341) T TIGR00658 56 STRTRVSFEVAAYQ--LGGQPLYLNPS-D--LQLGRGESIKDTARVLSRYVDGIMARVYKHED 113 (341) T ss_pred CCCCCCCHHHHHHH--CCCCEEECCCC-C--CEECCCCCCCCHHHHHCCEEEEEEEECCCHHH T ss_conf 86521007799986--49821224875-3--20036887000110203156567773255378 No 63 >PRK11664 ATP-dependent RNA helicase HrpB; Provisional Probab=52.76 E-value=17 Score=16.50 Aligned_cols=47 Identities=15% Similarity=0.296 Sum_probs=28.7 Q ss_pred EEEECCCCCCCEEEEEECCCCHHHHHHHCC--CCCCCEEEEECCCHHHHH Q ss_conf 999336778971899987984156675223--342059999743524220 Q gi|254780214|r 43 KISRVSERGSNTTVIVDTGPDFYMQVLREQ--VLSIDAVLYTHPHADHIH 90 (271) Q Consensus 43 ~i~~~~~~~~~~~iLiD~G~d~r~ql~~~~--i~~Id~I~iTH~H~DHi~ 90 (271) .|..+...+.+++|.+ |.+++..+.+..+ ++.+++|+|--+|.=++. T Consensus 83 ~vR~e~~~s~~Tri~~-~T~GiLlr~l~~dp~L~~~~~vI~DE~HER~l~ 131 (812) T PRK11664 83 RMRAESKVGPNTRLEV-VTEGILTRMIQRDPELSGVGLVILDEFHERSLQ 131 (812) T ss_pred EECCCCCCCCCCEEEE-ECHHHHHHHHHHCCCCCCCCEEEEECCCCCCHH T ss_conf 8256778899857999-755899999724977677888999575468751 No 64 >TIGR00412 redox_disulf_2 redox-active disulfide protein 2; InterPro: IPR005243 This small protein is found in three archaeal species so far (Methanococcus jannaschii, Archaeoglobus fulgidus, and Methanobacterium thermoautotrophicum (strain Marburg / DSM 2133)) as well as in Anabaena. It is homologous to thioredoxins, glutaredoxins, and protein disulphide isomerases, and shares with them a redox-active disulphide. The redox active disulphide region CXXC motif resembles neither thioredoxin nor glutaredoxin. . Probab=49.77 E-value=8.5 Score=18.34 Aligned_cols=11 Identities=27% Similarity=0.534 Sum_probs=8.8 Q ss_pred EEEEEEEEECC Q ss_conf 25999985417 Q gi|254780214|r 4 YYHFTILGCGA 14 (271) Q Consensus 4 ~m~l~~LGtg~ 14 (271) .|+|.+.|||+ T Consensus 1 MvkI~i~GtGC 11 (78) T TIGR00412 1 MVKIKIYGTGC 11 (78) T ss_pred CEEEEEECCCC T ss_conf 92788852999 No 65 >TIGR00361 ComEC_Rec2 DNA internalization-related competence protein ComEC/Rec2; InterPro: IPR004797 This is the DNA internalisation-related competence protein ComEC/Rec2 family. Apparent orthologs are found in 5 species so far (Haemophilus influenzae, Escherichia coli, Bacillus subtilis, Neisseria gonorrhoeae, Streptococcus pneumoniae), of which all but E. coli are model systems for the study of competence for natural transformation. This protein is a predicted multiple membrane-spanning protein likely to be involved in DNA internalisation.; GO: 0030420 establishment of competence for transformation, 0016021 integral to membrane. Probab=47.66 E-value=4.7 Score=19.97 Aligned_cols=26 Identities=27% Similarity=0.506 Sum_probs=21.8 Q ss_pred HCCCCCCCEEEEECCCHHHHHHCCHH Q ss_conf 22334205999974352422011102 Q gi|254780214|r 70 REQVLSIDAVLYTHPHADHIHGIDGL 95 (271) Q Consensus 70 ~~~i~~Id~I~iTH~H~DHi~Gl~~L 95 (271) .+++.++++++++|.|.||.+|...+ T Consensus 540 ~~~~~~~~~~~l~~~~~d~~g~~~~~ 565 (731) T TIGR00361 540 WEGLKQLEGLILSHDDLDHAGGADTL 565 (731) T ss_pred HCCCHHHHHHHHCCCCCCCCCHHHHH T ss_conf 30210123442002321112405677 No 66 >TIGR01931 cysJ sulfite reductase [NADPH] flavoprotein, alpha-component; InterPro: IPR010199 This entry describes an NADPH-dependent sulphite reductase flavoprotein subunit. Most members of the proteins of this entry are found in Cys biosynthesis gene clusters. The closest homologues are designated as subunits nitrate reductase.; GO: 0004783 sulfite reductase (NADPH) activity, 0000103 sulfate assimilation, 0019344 cysteine biosynthetic process. Probab=45.63 E-value=21 Score=15.93 Aligned_cols=55 Identities=15% Similarity=0.144 Sum_probs=36.6 Q ss_pred CEEEECCCCC-------C---HHHHHHHH-CCHHEEEEE-CCCC-----------CCCCCCCCHHHHHHHHHHCCCCE Q ss_conf 6253032330-------0---16788862-221102530-2335-----------76887579999999999729977 Q gi|254780214|r 180 NVAYCTDVNA-------F---PAESLEKL-QNLDFLIID-ALKN-----------GLHSSHFSLSESLKKIELINPKN 234 (271) Q Consensus 180 ~~~Y~~Dt~~-------~---~e~~~~~l-~~~D~lI~d-~l~~-----------~~~~~H~~~~ea~~~~~~~~~k~ 234 (271) ++|+|=|... + ..++++++ .||.+-||. |.+- -...+||+.++|-++...++..| T Consensus 544 DlAfSRDq~eKiYVQhrirE~g~ELw~Wlq~GA~iYVCGDA~~MAKDVh~aLL~vi~~~G~~d~E~A~eyL~~LR~~k 621 (628) T TIGR01931 544 DLAFSRDQAEKIYVQHRIREQGAELWQWLQEGAHIYVCGDAKKMAKDVHQALLDVIAKEGGLDAEEAEEYLTDLRVEK 621 (628) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCC T ss_conf 230032656531155677874899999985897489826820103479999999999618988889999999887618 No 67 >TIGR00432 arcsn_tRNA_tgt archaeosine tRNA-ribosyltransferase; InterPro: IPR004804 The archaeosine tRNA-guanine transglycosylase (tgt) differs from the tgt of Escherichia coli and other bacteria in the site of action and the modification that results. It exchanges 7-cyano-7-deazaguanine (preQ0) with guanine at position 15 of archaeal tRNA; this nucleotide is subsequently converted to archaeosine, found exclusively in the archaea. In contrast, bacterial tgt catalyzes the exchange of preQ0 or preQ1 for the guanine base at position 34; this nucleotide is subsequently modified to queuosine (IPR004803 from INTERPRO). Archaeoglobus fulgidus has both enzymes. . Probab=45.25 E-value=9 Score=18.18 Aligned_cols=13 Identities=15% Similarity=0.375 Sum_probs=9.6 Q ss_pred CCCCCCCCCCCCC Q ss_conf 3788787777876 Q gi|254780214|r 21 ITGDWGACDPTNP 33 (271) Q Consensus 21 ~~c~c~~C~~~~~ 33 (271) +=|.||||++-+| T Consensus 276 fPCsCpVCs~ytP 288 (658) T TIGR00432 276 FPCSCPVCSDYTP 288 (658) T ss_pred CCCCCCCCCCCCH T ss_conf 7898878889882 No 68 >pfam01872 RibD_C RibD C-terminal domain. The function of this domain is not known, but it is thought to be involved in riboflavin biosynthesis. This domain is found in the C terminus of RibD/RibG, in combination with pfam00383, as well as in isolation in some archaebacterial proteins. This family appears to be related to pfam00186. Probab=42.97 E-value=24 Score=15.57 Aligned_cols=10 Identities=30% Similarity=0.361 Sum_probs=5.8 Q ss_pred CCCCEEEEEC Q ss_conf 4205999974 Q gi|254780214|r 74 LSIDAVLYTH 83 (271) Q Consensus 74 ~~Id~I~iTH 83 (271) ...|+|++.- T Consensus 39 ~~~daIlvG~ 48 (196) T pfam01872 39 AEADAILVGR 48 (196) T ss_pred HHCCEEEECH T ss_conf 6189599745 No 69 >KOG2882 consensus Probab=39.32 E-value=27 Score=15.21 Aligned_cols=12 Identities=42% Similarity=0.504 Sum_probs=7.0 Q ss_pred HHHHCCHHEEEE Q ss_conf 886222110253 Q gi|254780214|r 195 LEKLQNLDFLII 206 (271) Q Consensus 195 ~~~l~~~D~lI~ 206 (271) ..++++..++.+ T Consensus 174 ~~yLqnP~clfl 185 (306) T KOG2882 174 LNYLQNPGCLFL 185 (306) T ss_pred HHHHCCCCCEEE T ss_conf 997479980898 No 70 >TIGR02417 fruct_sucro_rep D-fructose-responsive transcription factor; InterPro: IPR012781 Members of this family belong the lacI helix-turn-helix family of DNA-binding transcriptional regulators. All members are from the proteobacteria. Characterised members act as positive and negative transcriptional regulators of fructose and sucrose transport and metabolism. Sucrose is a disaccharide composed of fructose and glucose; D-fructose-1-phosphate rather than an intact sucrose moiety has been shown to act as the inducer.; GO: 0003677 DNA binding, 0006355 regulation of transcription DNA-dependent, 0009750 response to fructose stimulus, 0045449 regulation of transcription. Probab=37.69 E-value=29 Score=15.05 Aligned_cols=52 Identities=15% Similarity=0.267 Sum_probs=32.3 Q ss_pred CCEEEEEECCCC---HHHHHHHCCC-CCCCEEEEECCCH-HHHHHCCHHHHHHHCCC-CEEEEEEC Q ss_conf 971899987984---1566752233-4205999974352-42201110243331782-11101311 Q gi|254780214|r 52 SNTTVIVDTGPD---FYMQVLREQV-LSIDAVLYTHPHA-DHIHGIDGLRGYFLKQK-RPIDVYAA 111 (271) Q Consensus 52 ~~~~iLiD~G~d---~r~ql~~~~i-~~Id~I~iTH~H~-DHi~Gl~~Lr~~~~~~~-~~i~vy~~ 111 (271) .|.+|||=|+.| -=.++...=+ ++||++|+.=.|. +| ++|.+-. +++||.+- T Consensus 89 ~GyQLLIaCSDD~P~~E~~~~e~l~~Rq~DaLiVAs~~~p~~--------~~Yqkl~n~~~Pv~~l 146 (335) T TIGR02417 89 AGYQLLIACSDDNPDQEKKVIERLLARQVDALIVASCLPPED--------EYYQKLQNEGLPVVAL 146 (335) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCC--------HHHHHHCCCCCCEEEE T ss_conf 175566641788813489999998616736588438877687--------4889836689978886 No 71 >TIGR00107 deoD purine nucleoside phosphorylase; InterPro: IPR004402 Purine nucleoside phosphorylase 2.4.2.1 from EC also called inosine phosphorylase (pnp) cleave guanosine or inosine to respective bases and sugar-1-phosphate molecules. These enzymes are involved in the salvage of nucleotides and nucleosides.; GO: 0004731 purine-nucleoside phosphorylase activity, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=37.40 E-value=17 Score=16.51 Aligned_cols=14 Identities=36% Similarity=1.066 Sum_probs=7.3 Q ss_pred EEEEEEECCCCCEECC Q ss_conf 9999854178965133 Q gi|254780214|r 6 HFTILGCGASLGVPRI 21 (271) Q Consensus 6 ~l~~LGtg~s~g~P~~ 21 (271) +|++.|+| +|||++ T Consensus 53 ~IsvmGhG--MGIPSi 66 (234) T TIGR00107 53 KISVMGHG--MGIPSI 66 (234) T ss_pred EEEEEECC--CCCCCH T ss_conf 58886405--655535 No 72 >TIGR02181 GRX_bact glutaredoxin 3; InterPro: IPR011900 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This family of glutaredoxins includes the E. coli protein GrxC (Grx3) which appears to have a secondary role in reducing ribonucleotide reductase (in the absence of GrxA) possibly indicating a role in the reduction of other protein disulphides .; GO: 0006118 electron transport, 0045454 cell redox homeostasis. Probab=37.04 E-value=6 Score=19.30 Aligned_cols=62 Identities=13% Similarity=0.195 Sum_probs=37.9 Q ss_pred CCCCCCCCCCCCCCCCCCEEEEEECCC-CCCCEEEEEECCC-CHHHHHHH-C-CCCCCCEEEEECCCHHHHHHCCHHHHH Q ss_conf 887877778767744430699993367-7897189998798-41566752-2-334205999974352422011102433 Q gi|254780214|r 23 GDWGACDPTNPKNRRTRSSLKISRVSE-RGSNTTVIVDTGP-DFYMQVLR-E-QVLSIDAVLYTHPHADHIHGIDGLRGY 98 (271) Q Consensus 23 c~c~~C~~~~~~~~R~~~s~~i~~~~~-~~~~~~iLiD~G~-d~r~ql~~-~-~i~~Id~I~iTH~H~DHi~Gl~~Lr~~ 98 (271) -.||-|..|- .||+..|. ..+. |=||--| ..|..|+. . +=+.|==|||= .-||+|-+||-.+ T Consensus 7 ~~CPYC~rAK---------~LL~~kGv~~~~E--i~vd~dpD~~r~eM~~Rs~Gr~TVPQIFIg---~~HVGGcDDL~~L 72 (82) T TIGR02181 7 PYCPYCSRAK---------ALLESKGVGTFTE--IRVDGDPDALRDEMMQRSGGRRTVPQIFIG---DVHVGGCDDLYAL 72 (82) T ss_pred CCCCCCHHHH---------HHHHHCCCCCCEE--EECCCCCCHHHHHHHHHHCCCCCCCEEEEC---CCCCCCCHHHHHH T ss_conf 9988026889---------9998569886303--445798106889999982899404337629---9024543799998 No 73 >cd01946 ribokinase_group_C Ribokinase-like subgroup C. Found only in bacteria, this subgroup is part of the ribokinase/pfkB superfamily. Its oligomerization state is unknown at this time. Probab=36.80 E-value=29 Score=15.00 Aligned_cols=49 Identities=20% Similarity=0.295 Sum_probs=30.7 Q ss_pred HHHHHHHHCCHHEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEC Q ss_conf 1678886222110253023357688757999999999972997799976 Q gi|254780214|r 191 PAESLEKLQNLDFLIIDALKNGLHSSHFSLSESLKKIELINPKNAILTH 239 (271) Q Consensus 191 ~e~~~~~l~~~D~lI~d~l~~~~~~~H~~~~ea~~~~~~~~~k~~~LtH 239 (271) ++...+.++.+|+++..-..-..-.+-.++.++.+...+.++|.+++|- T Consensus 154 ~~~~~~~l~~~dil~~Ne~Ea~~l~g~~~~~~a~~~l~~~g~k~vvVT~ 202 (277) T cd01946 154 PEKLKKVLAKVDVVIINDGEARQLTGAANLVKAARLILAMGPKALIIKR 202 (277) T ss_pred HHHHHHHHHHCCEEEECHHHHHHHHCCCCHHHHHHHHHHHCCCEEEECC T ss_conf 3554558853763222689999984985067899999862446589831 No 74 >PRK11200 grxA glutaredoxin 1; Provisional Probab=34.79 E-value=19 Score=16.23 Aligned_cols=74 Identities=18% Similarity=0.229 Sum_probs=37.5 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEECCCC-HHHHHHHCC----CCCCCEE Q ss_conf 59999854178965133788787777876774443069999336778971899987984-156675223----3420599 Q gi|254780214|r 5 YHFTILGCGASLGVPRITGDWGACDPTNPKNRRTRSSLKISRVSERGSNTTVIVDTGPD-FYMQVLREQ----VLSIDAV 79 (271) Q Consensus 5 m~l~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s~~i~~~~~~~~~~~iLiD~G~d-~r~ql~~~~----i~~Id~I 79 (271) |++++.+.. .||-|..|.. .-.+.-++..+ =...-+|-..+ +..+.+... ++.+=-| T Consensus 1 M~V~Iysk~----------~CpyC~~AK~----Ll~~l~~~~~~----~~~~~idi~~~~~~~~~l~~~~g~~~~TVPQI 62 (87) T PRK11200 1 MFVVIFGRP----------GCPYCVRAKE----LAEKLSEERDD----FDYRYVDIHAEGISKADLEKTVGKPVETVPQI 62 (87) T ss_pred CEEEEEECC----------CCHHHHHHHH----HHHHHCCCCCC----CCEEEEECCCCHHHHHHHHHHHCCCCCCCCEE T ss_conf 989998189----------9866999999----99982533576----20799726777678999999848998743879 Q ss_pred EEECCCHHHHHHCCHHHHHH Q ss_conf 99743524220111024333 Q gi|254780214|r 80 LYTHPHADHIHGIDGLRGYF 99 (271) Q Consensus 80 ~iTH~H~DHi~Gl~~Lr~~~ 99 (271) || -.-||+|..||..|. T Consensus 63 fI---~g~~IGGy~dL~~~~ 79 (87) T PRK11200 63 FV---DQKHIGGCTDFEAYV 79 (87) T ss_pred EE---CCEEECCHHHHHHHH T ss_conf 89---999853999999999 No 75 >cd07044 CofD_YvcK Family of CofD-like proteins and proteins related to YvcK. CofD is a 2-phospho-L-lactate transferase that catalyzes the last step in the biosynthesis of coenzyme F(420)-0 (F(420) without polyglutamate) by transferring the lactyl phosphate moiety of lactyl(2)diphospho-(5')guanosine (LPPG) to 7,8-didemethyl-8-hydroxy-5-deazariboflavin ribitol (F0). F420 is a hydride carrier, important for energy metabolism of methanogenic archaea, as well as for the biosynthesis of other natural products, like tetracycline in Streptomyces. F420 and some of its precursors are also utilized as cofactors for enzymes, like DNA photolyase in Mycobacterium tuberculosis. YvcK from Bacillus subtilis is a member of a family of mostly uncharacterized proteins and has been proposed to play a role in carbon metabolism, since its function is essential for growth on intermediates of the Krebs cycle and pentose phosphate pathway. Both families appear to have a conserved phosphate binding site, but ha Probab=34.75 E-value=32 Score=14.76 Aligned_cols=66 Identities=14% Similarity=0.148 Sum_probs=41.8 Q ss_pred CCCCCEEEEC-CCCCCHHHHHHHHCCHHEEEEE-CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC Q ss_conf 3468625303-2330016788862221102530-2335768875799999999997299779997688 Q gi|254780214|r 176 FRFGNVAYCT-DVNAFPAESLEKLQNLDFLIID-ALKNGLHSSHFSLSESLKKIELINPKNAILTHMH 241 (271) Q Consensus 176 yri~~~~Y~~-Dt~~~~e~~~~~l~~~D~lI~d-~l~~~~~~~H~~~~ea~~~~~~~~~k~~~LtHls 241 (271) -+|.++.+.+ +.-...++.++.++++|+.++. +..+..--.++-+++..+.+++.++++++..-+. T Consensus 149 ~~I~~v~l~~~~~~~~~~~~l~AI~~ADlIvlgPGSlyTSIiPnLlv~gI~eAi~~s~a~kvyi~Nlm 216 (309) T cd07044 149 KKIDRVFLTPVDEASPSREVLEAIEKADNIVIGPGSLYTSILPNISVPGIREALKKTXAKKVYVSNIX 216 (309) T ss_pred CCCEEEEECCCCCCCCCHHHHHHHHHCCEEEECCCCCHHHHCCCCCCHHHHHHHHHCCCCEEEEECCC T ss_conf 87217995378879889999999986899998798447776641255569999997599889980789 No 76 >TIGR01212 TIGR01212 radical SAM protein, TIGR01212 family; InterPro: IPR005911 This uncharacterised protein family shows significant similarity to IPR005910 from INTERPRO, a longer protein that is a histone acetyltransferase at its C-terminus and is a subunit of RNA polymerase II (in yeast). This family lacks the GNAT acetyltransferase domain.. Probab=34.13 E-value=29 Score=15.07 Aligned_cols=52 Identities=19% Similarity=0.361 Sum_probs=21.8 Q ss_pred CCCCCCHHHHHHHH-----CCHHEEEEECCCCC-------CCCCC--CCHHHHHHHHHHCCCCEEEEECC Q ss_conf 32330016788862-----22110253023357-------68875--79999999999729977999768 Q gi|254780214|r 185 TDVNAFPAESLEKL-----QNLDFLIIDALKNG-------LHSSH--FSLSESLKKIELINPKNAILTHM 240 (271) Q Consensus 185 ~Dt~~~~e~~~~~l-----~~~D~lI~d~l~~~-------~~~~H--~~~~ea~~~~~~~~~k~~~LtHl 240 (271) +|| +||++++.| ++.|++|==+|+.. =.++| +++-+|+.-+++-+.| +-+|+ T Consensus 120 PDC--~P~~VLDlL~ey~~~GyevWvELGLQtah~~TL~~INRgHd~~~y~~a~~~~~krGik--VC~H~ 185 (307) T TIGR01212 120 PDC--VPDEVLDLLAEYKERGYEVWVELGLQTAHDRTLKKINRGHDFACYVDAVKRARKRGIK--VCSHV 185 (307) T ss_pred CCC--CHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCE--EEEEE T ss_conf 987--7478999999995497589996053565589999851437878999999999765988--99998 No 77 >TIGR00216 ispH_lytB 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; InterPro: IPR003451 Terpenes are among the largest groups of natural products and include compounds such as vitamins, cholesterol and carotenoids. The biosynthesis of all terpenoids begins with one or both of the two C5 precursors of the pathway: isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). In animals, fungi, and certain bacteria, the synthesis of IPP and DMAPP occurs via the well-known mevalonate pathway, however, a second, nonmevalonate terpenoid pathway has been identified in many eubacteria, algae and the chloroplasts of higher plants . LytB(IspH) catalyses the conversion of 1-hydroy-2-methyl-2-(E)-butenyl 4-diphosphate into IPP and DMAPP in this second pathway The enzyme appears to be responsible for a branch-step in the nonmevalonate pathway, in that IPP and DMAPP are produced in parallel from a single precursor although the exact mechanism of this is not currently fully understood . Escherichia coli LytB protein had been found to regulate the activity of RelA (guanosine 3',5'-bispyrophosphate synthetase I), which in turn controls the level of a regulatory metabolite. It is involved in penicillin tolerance and the stringent response .; GO: 0019288 isopentenyl diphosphate biosynthetic process mevalonate-independent pathway. Probab=32.98 E-value=34 Score=14.58 Aligned_cols=53 Identities=19% Similarity=0.260 Sum_probs=38.4 Q ss_pred CCCCEEEECCCCCCHHHHHHHHCC-HHEEEE-ECCCCCCCCCCCC-HHHHHHHHHHCCCCEEEEE Q ss_conf 468625303233001678886222-110253-0233576887579-9999999997299779997 Q gi|254780214|r 177 RFGNVAYCTDVNAFPAESLEKLQN-LDFLII-DALKNGLHSSHFS-LSESLKKIELINPKNAILT 238 (271) Q Consensus 177 ri~~~~Y~~Dt~~~~e~~~~~l~~-~D~lI~-d~l~~~~~~~H~~-~~ea~~~~~~~~~k~~~Lt 238 (271) -|+.++|.|.-+ -+.+.+.++. +|++|+ .+ .||+ -....++|++.++++.|+. T Consensus 241 sfntIC~AT~~R--Q~Av~~la~~D~Dl~~ViGg-------~nSsNt~rL~~iA~~~~~~~~~~i 296 (354) T TIGR00216 241 SFNTICYATQNR--QDAVKELAPEDVDLMLVIGG-------KNSSNTKRLLEIAEEEGGKKSYLI 296 (354) T ss_pred CHHHHHHHHHHH--HHHHHHHHHHCCEEEEEECC-------CCCCCHHHHHHHHHHHCCCEEEEE T ss_conf 243476788888--99999832205148999757-------777533689999998488715860 No 78 >pfam08915 tRNA-Thr_ED Archaea-specific editing domain of threonyl-tRNA synthetase. Archaea-specific editing domain of threonyl-tRNA synthetase, with marked structural similarity to D-amino acids deacylases found in eubacteria and eukaryotes. This domain can bind D-amino acids, and ensures high fidelity during translation. It is especially responsible for removing incorrectly attached serine from tRNA-Thr. The domain forms a fold that can be be defined as two layers of beta-sheets (a three-stranded sheet and a five-stranded sheet), with two alpha-helices located adjacent to the five-stranded sheet. Probab=32.37 E-value=35 Score=14.52 Aligned_cols=26 Identities=27% Similarity=0.379 Sum_probs=20.8 Q ss_pred HHHHHHHHHHCCCCEEEEE---CCCCCCC Q ss_conf 9999999997299779997---6888988 Q gi|254780214|r 220 LSESLKKIELINPKNAILT---HMHVDLD 245 (271) Q Consensus 220 ~~ea~~~~~~~~~k~~~Lt---Hls~~~~ 245 (271) +++..++++++++++++|. |+|-++. T Consensus 59 v~eI~~~a~kv~~~~ivlYPyAHLSs~La 87 (137) T pfam08915 59 AEEILKVASKVKATRVVLYPYAHLSSDLA 87 (137) T ss_pred HHHHHHHHHHCCCCEEEEECHHHHCCCCC T ss_conf 99999999844986799945033126558 No 79 >PRK13789 phosphoribosylamine--glycine ligase; Provisional Probab=31.83 E-value=21 Score=15.89 Aligned_cols=14 Identities=21% Similarity=0.468 Sum_probs=13.2 Q ss_pred CCCEEEEEEEEECC Q ss_conf 98725999985417 Q gi|254780214|r 1 MKDYYHFTILGCGA 14 (271) Q Consensus 1 m~~~m~l~~LGtg~ 14 (271) |...||+.++|+|+ T Consensus 1 m~~~MkVLviGsGG 14 (426) T PRK13789 1 MQVKLKVLLIGSGG 14 (426) T ss_pred CCCCCEEEEECCCH T ss_conf 96798899989888 No 80 >COG4566 TtrR Response regulator [Signal transduction mechanisms] Probab=31.20 E-value=37 Score=14.40 Aligned_cols=37 Identities=14% Similarity=0.267 Sum_probs=22.1 Q ss_pred CEEEEEEC------CCCHHHHHHHCCCCCCCEEEEECCCHHHHHH Q ss_conf 71899987------9841566752233420599997435242201 Q gi|254780214|r 53 NTTVIVDT------GPDFYMQVLREQVLSIDAVLYTHPHADHIHG 91 (271) Q Consensus 53 ~~~iLiD~------G~d~r~ql~~~~i~~Id~I~iTH~H~DHi~G 91 (271) -..+|+|. |.++..+|...++. +--||+| .|.|==.. T Consensus 49 pGclllDvrMPg~sGlelq~~L~~~~~~-~PVIfiT-GhgDIpma 91 (202) T COG4566 49 PGCLLLDVRMPGMSGLELQDRLAERGIR-LPVIFLT-GHGDIPMA 91 (202) T ss_pred CCEEEEECCCCCCCHHHHHHHHHHCCCC-CCEEEEE-CCCCHHHH T ss_conf 9769985578888508999999953999-8879982-78872899 No 81 >CHL00194 ycf39 Ycf39; Provisional Probab=30.91 E-value=37 Score=14.36 Aligned_cols=27 Identities=19% Similarity=0.241 Sum_probs=15.9 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEECCC Q ss_conf 875799999999997299779997688 Q gi|254780214|r 215 SSHFSLSESLKKIELINPKNAILTHMH 241 (271) Q Consensus 215 ~~H~~~~ea~~~~~~~~~k~~~LtHls 241 (271) |.-+|..|.++++.++-=|+.-.+.+. T Consensus 205 P~a~T~~EIa~l~~~~~Gk~~~i~~vP 231 (319) T CHL00194 205 PKSWNSSEIISLCEQLSGQKAKVTRIP 231 (319) T ss_pred CCCCCHHHHHHHHHHHHCCCCCEEECC T ss_conf 863899999999999859998778689 No 82 >TIGR02363 dhaK1 dihydroxyacetone kinase, DhaK subunit; InterPro: IPR012736 In bacteria, dihydroxyacetone is formed by the oxidation of glycerol or the aldol cleavage of fructose-6-phosphate. Dihydroxyacetone kinase converts this compound to the glycolytic intermediate dihydroxyacetone phosphate. Two forms of this enzyme have been shown to exist, using either ATP or a phosphoprotein of the phosphoenolpyruvate: sugar phosphotransferase system (PTS) as phosphate donor. The ATP-dependent kinases (2.7.1.29 from EC) are single-subunit enzymes that are composed of two domains . The phosphoprotein-dependent kinases are composed of three subunits: DhaK and DhaL, which are homologous to the domains from the ATP-dependent enzyme; and DhaM which is a component of the PTS system . This entry represents the DhaK subunit of phosphoprotein-dependent dihydroxyacetone kinase. The model excludes the DhaK paralog DhaK2 (IPR012735 from INTERPRO) encoded in the same operon as DhaK in the firmicutes, with which it sometimes exists as a DhaK/DhaK2 fusion. This subunit is a homodimer which contains the dihydroxyacetone-binding site . The overall fold consists of two six-stranded mixed beta-sheets surrounded by nine alpha-helices and a beta-ribbon covering the exposed edge strand of one sheet. . Probab=27.30 E-value=43 Score=13.98 Aligned_cols=15 Identities=27% Similarity=0.532 Sum_probs=5.9 Q ss_pred CCCHHHHHHHHCCHH Q ss_conf 300167888622211 Q gi|254780214|r 188 NAFPAESLEKLQNLD 202 (271) Q Consensus 188 ~~~~e~~~~~l~~~D 202 (271) +.-||..++.+|-+| T Consensus 82 SPTPDqI~~AikAVD 96 (354) T TIGR02363 82 SPTPDQILEAIKAVD 96 (354) T ss_pred CCCCCHHHHHHHHHC T ss_conf 978548999898303 No 83 >TIGR03471 HpnJ hopanoid biosynthesis associated radical SAM protein HpnJ. One of the well-described hopanoid intermediates is bacteriohopanetetrol. In the conversion from hopene several reactions must occur in the side chain for which a radical mechanism might be reasonable. These include the four (presumably anaerobic) hydroxylations and a methyl shift. Probab=26.58 E-value=44 Score=13.90 Aligned_cols=30 Identities=20% Similarity=0.235 Sum_probs=15.0 Q ss_pred EEEECCCC-H-HHHHHHCCCCCCCEEEEECCCH Q ss_conf 99987984-1-5667522334205999974352 Q gi|254780214|r 56 VIVDTGPD-F-YMQVLREQVLSIDAVLYTHPHA 86 (271) Q Consensus 56 iLiD~G~d-~-r~ql~~~~i~~Id~I~iTH~H~ 86 (271) =+|||-++ + ..++++ .+++.|-|++|..=+ T Consensus 48 ~~iDa~~~~~~~~~~~~-~~~~~d~v~i~t~Tp 79 (472) T TIGR03471 48 RLVDAPPHGVTIDDTLA-IAKDYDLVVLHTSTP 79 (472) T ss_pred CEEECCCCCCCHHHHHH-HHCCCCEEEEEECCC T ss_conf 37527766899899999-850699899992586 No 84 >PRK07261 topology modulation protein; Provisional Probab=26.39 E-value=33 Score=14.68 Aligned_cols=12 Identities=25% Similarity=0.379 Sum_probs=10.0 Q ss_pred EEEEEEEECCCC Q ss_conf 599998541789 Q gi|254780214|r 5 YHFTILGCGASL 16 (271) Q Consensus 5 m~l~~LGtg~s~ 16 (271) |||.+.|+++|+ T Consensus 1 MrI~IiG~sGsG 12 (171) T PRK07261 1 MKIAIIGYSGSG 12 (171) T ss_pred CEEEEECCCCCC T ss_conf 989998899986 No 85 >TIGR01110 mdcA malonate decarboxylase, alpha subunit; InterPro: IPR005777 This model describes malonate decarboxylase alpha subunit. Malonate decarboxylase Na+ pump is the paradigm of the family of Na+ transport decarboxylases. Essentially, it couples the energy derived from decarboxylation of a carboxylic acid substrate to move Na+ ion across the bilayer. Functional malonate decarboylase is a multi subunit protein. The alpha subunit enzymatically performs the transfer of malonate (substrate) to an acyl carrier protein subunit for subsequent decarboxylation, hence the name: acetyl-S-acyl carrier protein:malonate carrier protein-SH transferase.; GO: 0016831 carboxy-lyase activity, 0006814 sodium ion transport. Probab=26.30 E-value=16 Score=16.58 Aligned_cols=37 Identities=19% Similarity=0.240 Sum_probs=17.1 Q ss_pred CCCEEEEEECCCCCCCEEEEEECCCCHHHHHHHCCC--CCCCEEEEECCC Q ss_conf 430699993367789718999879841566752233--420599997435 Q gi|254780214|r 38 TRSSLKISRVSERGSNTTVIVDTGPDFYMQVLREQV--LSIDAVLYTHPH 85 (271) Q Consensus 38 ~~~s~~i~~~~~~~~~~~iLiD~G~d~r~ql~~~~i--~~Id~I~iTH~H 85 (271) ++..+.|... |=|+|--.++. |-.| +=||+++.+--| T Consensus 172 F~~GIVIAQV-------N~lVD~~~~lp----RVDiPa~WvD~vi~s~kp 210 (551) T TIGR01110 172 FKDGIVIAQV-------NELVDDVDKLP----RVDIPASWVDFVIESPKP 210 (551) T ss_pred CCCCCEEEEC-------CCCCCCCCCCC----CCCCCCCCEEEEEECCCC T ss_conf 4687179741-------30006434666----300482605068876888 No 86 >TIGR02633 xylG D-xylose ABC transporter, ATP-binding protein; InterPro: IPR013455 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Several bacterial species use the enzymes xylose isomerase and xylulokinase for xylose utilisation. This entry represents the ATP-binding cassette (ABC) subunit of the known, or predicted, high-affinity xylose ABC transporter used for xylose import. The genes encoding these proteins, which closely resemble other sugar transport ABC transporter genes, are typically found near xylose utilisation enzymes and regulatory proteins. . Probab=26.29 E-value=42 Score=14.00 Aligned_cols=11 Identities=18% Similarity=0.341 Sum_probs=7.2 Q ss_pred CCCCEEEEECC Q ss_conf 42059999743 Q gi|254780214|r 74 LSIDAVLYTHP 84 (271) Q Consensus 74 ~~Id~I~iTH~ 84 (271) +.|..|+|||- T Consensus 190 ~g~AC~YISHK 200 (501) T TIGR02633 190 KGVACVYISHK 200 (501) T ss_pred CCCEEEEEECC T ss_conf 69359998223 No 87 >TIGR01751 crot-CoA-red crotonyl-CoA reductase; InterPro: IPR010085 Crotonyl-CoA reductase is responsible for the conversion of crotonyl-CoA to butyryl-CoA. In serine cycle methylotrophic bacteria this enzyme is involved in the process of acetyl-CoA to glyoxylate , . In other bacteria the enzyme is used to produce butyrate for incorporation into polyketides such as tylosin from Streptomyces fradiae and coronatine from Pseudomonas syringae .. Probab=24.94 E-value=47 Score=13.71 Aligned_cols=51 Identities=24% Similarity=0.219 Sum_probs=27.2 Q ss_pred CCCCHHHHHHHCCCC---CCCEEEEECCCHHHHHHCCHHHH-HH-HCCCCEEEEEECCEE Q ss_conf 798415667522334---20599997435242201110243-33-178211101311202 Q gi|254780214|r 60 TGPDFYMQVLREQVL---SIDAVLYTHPHADHIHGIDGLRG-YF-LKQKRPIDVYAAPDC 114 (271) Q Consensus 60 ~G~d~r~ql~~~~i~---~Id~I~iTH~H~DHi~Gl~~Lr~-~~-~~~~~~i~vy~~~~~ 114 (271) |+++.+.||...+.. +=|.||| |---+||..+.. +. .....||-|...|.- T Consensus 179 ~~~TAYRqL~g~~p~~~KpGD~VLI----WGAaGGLGs~A~Ql~~a~Gg~Pv~VVSSp~k 234 (409) T TIGR01751 179 TGATAYRQLVGWKPAEVKPGDNVLI----WGAAGGLGSYAIQLARAGGGIPVAVVSSPDK 234 (409) T ss_pred HHHHHHHHHCCCCCCCCCCCCCEEE----ECCCCCHHHHHHHHHHHCCCCEEEEECCHHH T ss_conf 1344554420687776688883798----4266625689999999548952898467789 No 88 >TIGR01302 IMP_dehydrog inosine-5'-monophosphate dehydrogenase; InterPro: IPR005990 Synonyms: Inosine-5'-monophosphate dehydrogenase, Inosinic acid dehydrogenase IMP dehydrogenase (1.1.1.205 from EC,IMPDH) catalyzes the rate-limiting reaction of de novo GTP biosynthesis, the NAD-dependent reduction of IMP into XMP .Inosine 5-phosphate + NAD+ + H2O = xanthosine 5-phosphate + NADH IMP dehydrogenase is associated with cell proliferation and is a possible target for cancer chemotherapy. Mammalian and bacterial IMPDHs are tetramers of identical chains. There are two IMP dehydrogenase isozymes in humans . IMP dehydrogenase nearly always contains a long insertion that has two CBS domains within it and adopts a TIM barrel structure.; GO: 0003938 IMP dehydrogenase activity, 0006177 GMP biosynthetic process. Probab=24.93 E-value=47 Score=13.74 Aligned_cols=27 Identities=30% Similarity=0.367 Sum_probs=19.0 Q ss_pred CCHHEEEEECCCCCCCCCCCCHHHHHHHHHHCCC Q ss_conf 2211025302335768875799999999997299 Q gi|254780214|r 199 QNLDFLIIDALKNGLHSSHFSLSESLKKIELINP 232 (271) Q Consensus 199 ~~~D~lI~d~l~~~~~~~H~~~~ea~~~~~~~~~ 232 (271) -|+|++++|+ +|-+....++.+|+++. T Consensus 250 AGvDv~viDs-------shGhs~~vl~~ik~~k~ 276 (476) T TIGR01302 250 AGVDVIVIDS-------SHGHSIYVLDSIKKIKK 276 (476) T ss_pred CCCCEEEEEC-------CCCCCHHHHHHHHHHHH T ss_conf 5965899816-------65453789999999986 No 89 >pfam05571 DUF766 Protein of unknown function (DUF766). This family consists of several eukaryotic proteins of unknown function. Probab=22.84 E-value=52 Score=13.46 Aligned_cols=27 Identities=30% Similarity=0.541 Sum_probs=16.4 Q ss_pred EEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCE Q ss_conf 9985417896513378878777787677444306 Q gi|254780214|r 8 TILGCGASLGVPRITGDWGACDPTNPKNRRTRSS 41 (271) Q Consensus 8 ~~LGtg~s~g~P~~~c~c~~C~~~~~~~~R~~~s 41 (271) +.|+.+++.|. ..+|.+| ||++|.+.+ T Consensus 9 ~~L~n~~~~~~---~s~CGaC----prG~R~n~~ 35 (296) T pfam05571 9 TLLFNNSSTDI---YSECGAC----PRGYRVNAQ 35 (296) T ss_pred EECCCCCCCCC---CCCCCCC----CCCCCCCCC T ss_conf 44158987766---5368889----985214987 No 90 >smart00812 Alpha_L_fucos Alpha-L-fucosidase. O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site PUBMED:. Because the fold of proteins is better conserved than their sequences, some of the families can be grouped in 'clans'. Family 29 encompasses alpha-L-fucosidases, which is a lysosomal enzyme responsible for hydrolyzing the alpha-1,6-linked fucose joined to the reducing-end N-acetylglucosamine of the carbohydrate moieties of glycoproteins. Deficiency of alpha-L-fucosidase results in the lysosomal storage disease fucosidosis. Probab=22.47 E-value=53 Score=13.41 Aligned_cols=10 Identities=40% Similarity=0.494 Sum_probs=3.8 Q ss_pred EEEECCCHHH Q ss_conf 9997435242 Q gi|254780214|r 79 VLYTHPHADH 88 (271) Q Consensus 79 I~iTH~H~DH 88 (271) |++|=-|.|= T Consensus 98 ~V~TaKHHDG 107 (384) T smart00812 98 VVLTAKHHDG 107 (384) T ss_pred EEEEEEECCC T ss_conf 7752130477 No 91 >TIGR01621 RluA-like pseudouridine synthase Rlu family protein, TIGR01621; InterPro: IPR006508 These sequences represent a clade within the pseudouridine synthase superfamily. The superfamily includes E. coli proteins: RluA, RluB, RluC, RluD, and RsuA. The sequences in this family are most closely related to RluA ; only Neisseria does not appear to have an RluA homolog. It is presumed that these sequences function as pseudouridine synthases, although perhaps with different specificity.. Probab=22.09 E-value=54 Score=13.36 Aligned_cols=44 Identities=18% Similarity=0.231 Sum_probs=24.6 Q ss_pred CCCCEEEEEECCCCHHHHHHHCCCCCCCEEEEECCCHHHHHHCCH Q ss_conf 789718999879841566752233420599997435242201110 Q gi|254780214|r 50 RGSNTTVIVDTGPDFYMQVLREQVLSIDAVLYTHPHADHIHGIDG 94 (271) Q Consensus 50 ~~~~~~iLiD~G~d~r~ql~~~~i~~Id~I~iTH~H~DHi~Gl~~ 94 (271) +.+|-.+==|-|+....|.+-.-+ .+.-|+|-|-=-=-+.||.- T Consensus 16 K~~giSVHkD~getgL~~~vA~Ql-gv~~vwLvHRLDK~TSGiLl 59 (222) T TIGR01621 16 KSEGISVHKDDGETGLLREVAKQL-GVEKVWLVHRLDKVTSGILL 59 (222) T ss_pred CCCCEEEEECCCCHHHHHHHHHHH-CCCCEEEEEECCHHHHHHHH T ss_conf 799832772571113899999981-89808998700034468999 No 92 >cd00518 H2MP Hydrogenase specific C-terminal endopeptidases, also called Hydrogen Maturation Proteases (H2MP). These enzymes belong to the peptidase family M52. Maturation of [FeNi] hydrogenases includes formation of the nickel metallocenter, proteolytic processing and assembly with other subunits. Hydrogenase maturation endopeptidases are responsible for the proteolytic processing, liberating a short C-terminal peptide by cleaving after a His or an Arg residue, e.g., HycI (E. coli) is involved in processing of HypE, the large subunit of hydrogenase 3. This cleavage is nickel dependent. This CD also includes such hydrogenase-processing proteins as HydD, HupW, and HoxW, as well as, proteins of the F420-reducing hydrogenase of methanogens (e.g., FrcD). Also included, is the Pyrococcus furiosus FrxA protein, a bifunctional endopeptidase/ sulfhydrogenase found in NADP-reducing hyperthermophiles.The Pyrococcus FrxA is not related to those found in Helicobacter pylori. Probab=21.98 E-value=54 Score=13.35 Aligned_cols=22 Identities=27% Similarity=0.516 Sum_probs=10.9 Q ss_pred CCHHHHHHHHHHCC--CCEEEEEC Q ss_conf 79999999999729--97799976 Q gi|254780214|r 218 FSLSESLKKIELIN--PKNAILTH 239 (271) Q Consensus 218 ~~~~ea~~~~~~~~--~k~~~LtH 239 (271) +++.+++++++..+ +.++++.- T Consensus 89 ~~~~~~L~~~~~~~~~~~~~~ivg 112 (139) T cd00518 89 LGLAELLALLRLLGGLPPEVVLIG 112 (139) T ss_pred CCHHHHHHHHHHHCCCCCCEEEEE T ss_conf 999999999998379998089999 No 93 >COG0455 flhG Antiactivator of flagellar biosynthesis FleN, an ATPase [Cell motility] Probab=21.82 E-value=54 Score=13.33 Aligned_cols=13 Identities=8% Similarity=0.355 Sum_probs=7.8 Q ss_pred CCEE-EEEECCCCH Q ss_conf 9718-999879841 Q gi|254780214|r 52 SNTT-VIVDTGPDF 64 (271) Q Consensus 52 ~~~~-iLiD~G~d~ 64 (271) ++++ ++||+=.++ T Consensus 31 ~~k~V~~iDaD~g~ 44 (262) T COG0455 31 GGKVVLLIDADLGL 44 (262) T ss_pred CCCEEEEEECCCCC T ss_conf 89769999658888 No 94 >pfam09108 Xol-1_N Switch protein XOL-1, N-terminal. Members of this family, which are required for the formation of the active site of the sex-determining protein Xol-1, adopt a secondary structure consisting of five alpha helices and six antiparallel beta sheets, in a beta-alpha-beta-beta-beta-alpha-beta-alpha-alpha-alpha-beta arrangement. The fold of this family is similar to that found in ribosomal protein S5 domain 2-like. Probab=21.72 E-value=37 Score=14.39 Aligned_cols=32 Identities=31% Similarity=0.428 Sum_probs=19.0 Q ss_pred EEEECCCHHHHHHCCHHHHH----HHCCCCEEEEEECC Q ss_conf 99974352422011102433----31782111013112 Q gi|254780214|r 79 VLYTHPHADHIHGIDGLRGY----FLKQKRPIDVYAAP 112 (271) Q Consensus 79 I~iTH~H~DHi~Gl~~Lr~~----~~~~~~~i~vy~~~ 112 (271) ..--|+|.||..|| +|.| -+.+-+.|.+|.+- T Consensus 70 ~~a~~~h~~~~l~i--IrDfl~~leL~nvY~I~iy~~~ 105 (164) T pfam09108 70 DLADHAHDDHRLGI--IRDFIDDLELHNVYEINFTDEH 105 (164) T ss_pred HHHHHCCCCCHHHH--HHHHHHHHHHHEEEEEEECCCC T ss_conf 66665056520589--9999998631025899860565 No 95 >KOG0978 consensus Probab=21.17 E-value=15 Score=16.81 Aligned_cols=10 Identities=30% Similarity=0.414 Sum_probs=5.9 Q ss_pred EEECCCHHHH Q ss_conf 9974352422 Q gi|254780214|r 80 LYTHPHADHI 89 (271) Q Consensus 80 ~iTH~H~DHi 89 (271) ..+|.|.+|. T Consensus 63 ~~~~~~~~~~ 72 (698) T KOG0978 63 LADHLQEKHA 72 (698) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 96 >cd07186 CofD_like LPPG:FO 2-phospho-L-lactate transferase; important in F420 biosynthesis. CofD is a 2-phospho-L-lactate transferase that catalyzes the last step in the biosynthesis of coenzyme F(420)-0 (F(420) without polyglutamate) by transferring the lactyl phosphate moiety of lactyl(2)diphospho-(5')guanosine (LPPG) to 7,8-didemethyl-8-hydroxy-5-deazariboflavin ribitol (F0). F420 is a hydride carrier, important for energy metabolism of methanogenic archaea, as well as for the biosynthesis of other natural products, like tetracycline in Streptomyces. F420 and some of its precursors are also utilized as cofactors for enzymes, like DNA photolyase in Mycobacterium tuberculosis. Probab=20.90 E-value=57 Score=13.21 Aligned_cols=62 Identities=13% Similarity=0.153 Sum_probs=42.3 Q ss_pred CCEEEEC-CCCCCHHHHHHHHCCHHEEEEE-CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECC Q ss_conf 8625303-2330016788862221102530-233576887579999999999729977999768 Q gi|254780214|r 179 GNVAYCT-DVNAFPAESLEKLQNLDFLIID-ALKNGLHSSHFSLSESLKKIELINPKNAILTHM 240 (271) Q Consensus 179 ~~~~Y~~-Dt~~~~e~~~~~l~~~D~lI~d-~l~~~~~~~H~~~~ea~~~~~~~~~k~~~LtHl 240 (271) .++.|.+ +....+++.++.++++|++|+. +..+..---|+.+++.-+..++.++++++..-| T Consensus 161 ~~v~~~~~e~a~~~p~al~AI~~AD~IIiGPgnp~tSI~PiL~VpgI~eAL~~s~A~vVaVSpi 224 (303) T cd07186 161 RDVRFVGAEEARPAPEVLEAIEDADLVIIGPSNPVTSIGPILALPGIREALRDKKAPVVAVSPI 224 (303) T ss_pred EEEEECCCCCCCCCHHHHHHHHHCCEEEECCCCCHHHHCCCCCCHHHHHHHHHCCCCEEEECCC T ss_conf 8899658887878989999998599899858862132153114556999998579998998988 No 97 >TIGR00337 PyrG CTP synthase; InterPro: IPR004468 CTP synthase is involved in pyrimidine ribonucleotide/ribonucleoside metabolism. The enzyme catalyzes the reaction L-glutamine + H2O + UTP + ATP = CTP + phosphate + ADP + L-glutamate. The enzyme exists as a dimer of identical chains that aggregates as a tetramer. This gene has been found circa 500 bp 5 upstream of enolase in both beta (Nitrosomonas europaea) and gamma (Escherichia coli) subdivisions of Proteobacterium .; GO: 0003883 CTP synthase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=20.86 E-value=57 Score=13.21 Aligned_cols=40 Identities=18% Similarity=0.335 Sum_probs=19.8 Q ss_pred CHHEEEEEC--CCCCCCCCCCCHHHHHH-HHHHCCCCEEEEECCC Q ss_conf 211025302--33576887579999999-9997299779997688 Q gi|254780214|r 200 NLDFLIIDA--LKNGLHSSHFSLSESLK-KIELINPKNAILTHMH 241 (271) Q Consensus 200 ~~D~lI~d~--l~~~~~~~H~~~~ea~~-~~~~~~~k~~~LtHls 241 (271) |+|++|+|- + .+--.+ +-+=||+. +..+++-.++.+.|++ T Consensus 142 ~~Dv~IvEiGGT-VGDIEs-~PFLEAiRQ~~~e~G~Env~~iHvT 184 (571) T TIGR00337 142 GADVVIVEIGGT-VGDIES-LPFLEAIRQLKKEVGRENVLFIHVT 184 (571) T ss_pred CCCEEEEEECCC-CCCCCC-HHHHHHHHHHHHHHCCCCEEEEEEE T ss_conf 974799983770-000036-2589999999987389867999840 No 98 >KOG4584 consensus Probab=20.84 E-value=51 Score=13.48 Aligned_cols=30 Identities=20% Similarity=0.355 Sum_probs=20.5 Q ss_pred CCCCCCHHHHHHHHCCHHEEEEECCCCCCC Q ss_conf 323300167888622211025302335768 Q gi|254780214|r 185 TDVNAFPAESLEKLQNLDFLIIDALKNGLH 214 (271) Q Consensus 185 ~Dt~~~~e~~~~~l~~~D~lI~d~l~~~~~ 214 (271) -|.+.++.++-...+++|++|+|++-..-| T Consensus 284 iDlrrvsqeLa~l~~daDLVViEGMGRalh 313 (348) T KOG4584 284 IDLRRVSQELAYLSSDADLVVIEGMGRALH 313 (348) T ss_pred EEHHHHHHHHHHHHCCCCEEEEECCCHHHH T ss_conf 218761489998744898899844443554 No 99 >pfam09788 Tmemb_55A Transmembrane protein 55A. Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction: 1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate. Probab=20.61 E-value=54 Score=13.37 Aligned_cols=12 Identities=17% Similarity=0.105 Sum_probs=5.5 Q ss_pred ECCCCCEECCCC Q ss_conf 417896513378 Q gi|254780214|r 12 CGASLGVPRITG 23 (271) Q Consensus 12 tg~s~g~P~~~c 23 (271) ++.+++.|.+.| T Consensus 59 s~~~~~~p~V~C 70 (258) T pfam09788 59 SPDSGSAPVVNC 70 (258) T ss_pred CCCCCCCCEEEE T ss_conf 887899976877 No 100 >TIGR01469 cobA_cysG_Cterm uroporphyrin-III C-methyltransferase; InterPro: IPR006366 This domain is found in a family of sequences that represent enzymes, or enzyme domains, with uroporphyrin-III C-methyltransferase activity. This enzyme catalyzes the first step committed to the biosynthesis of either siroheme or cobalamin (vitamin B12) rather than protoheme (heme). Cobalamin contains cobalt while siroheme contains iron. Siroheme is a cofactor for nitrite and sulphite reductases and therefore plays a role in cysteine biosynthesis; many members of this family are CysG, siroheme synthase, with an additional N-terminal domain and with additional oxidation and iron insertion activities.; GO: 0008168 methyltransferase activity, 0006779 porphyrin biosynthetic process. Probab=20.43 E-value=58 Score=13.15 Aligned_cols=29 Identities=17% Similarity=-0.071 Sum_probs=17.7 Q ss_pred EEECCCCCCCCCCCCHHHHHHHHHHC-CCC Q ss_conf 53023357688757999999999972-997 Q gi|254780214|r 205 IIDALKNGLHSSHFSLSESLKKIELI-NPK 233 (271) Q Consensus 205 I~d~l~~~~~~~H~~~~ea~~~~~~~-~~k 233 (271) |-.|+..+.-.-+.|+.+..+-+.+. +.+ T Consensus 198 v~~At~~~Q~~~~~tL~~~~~~~~~~W~~~ 227 (242) T TIGR01469 198 VEWATTPNQRVVIGTLGDLAEKAAEEWNLK 227 (242) T ss_pred EECCCCCCCEEEEEEHHHHHHHHHHHCCCC T ss_conf 432885120255653888999999726999 No 101 >PRK00771 signal recognition particle protein Srp54; Provisional Probab=20.14 E-value=59 Score=13.11 Aligned_cols=44 Identities=14% Similarity=0.147 Sum_probs=24.7 Q ss_pred CCCEEEEECCCHHHHHHCCHHHHHHHCCCCEEEEEECCEEEEEECC Q ss_conf 2059999743524220111024333178211101311202321002 Q gi|254780214|r 75 SIDAVLYTHPHADHIHGIDGLRGYFLKQKRPIDVYAAPDCMKHLFE 120 (271) Q Consensus 75 ~Id~I~iTH~H~DHi~Gl~~Lr~~~~~~~~~i~vy~~~~~~~~l~~ 120 (271) .|++|++|-...|-=+|- ..+.....+.||...|.-+..+.|+. T Consensus 235 ~i~gvIlTKlDgdarGGa--aLSi~~~t~~PI~FiG~GEk~~dle~ 278 (433) T PRK00771 235 GIGGIIITKLDGTAKGGG--ALSAVAETGAPIKFIGTGETIDDLEP 278 (433) T ss_pred CCCEEEEECCCCCCCCCH--HHHHHHHHCCCCEEEECCCCCCCCCC T ss_conf 887379972567887305--42189887899568861787214886 Done!