Query gi|254780652|ref|YP_003065065.1| Fructose-bisphosphate aldolase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 339 No_of_seqs 143 out of 956 Neff 4.4 Searched_HMMs 39220 Date Sun May 29 20:50:30 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780652.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PTZ00019 fructose-bisphosphate 100.0 0 0 1092.2 34.8 337 2-338 6-345 (356) 2 pfam00274 Glycolytic Fructose- 100.0 0 0 1078.3 33.2 335 4-338 1-337 (348) 3 cd00948 FBP_aldolase_I_a Fruct 100.0 0 0 1075.4 35.1 327 3-329 2-330 (330) 4 cd00344 FBP_aldolase_I Fructos 100.0 0 0 1053.0 34.3 324 3-326 2-328 (328) 5 KOG1557 consensus 100.0 0 0 1027.1 30.3 334 3-336 14-349 (363) 6 COG3588 Fructose-1,6-bisphosph 100.0 0 0 893.2 27.6 331 1-331 1-332 (332) 7 cd00949 FBP_aldolase_I_bact Fr 100.0 0 0 679.2 16.6 261 10-305 4-275 (292) 8 PRK05377 fructose-1,6-bisphosp 100.0 0 0 649.5 19.7 261 10-305 7-278 (296) 9 PRK04161 tagatose 1,6-diphosph 95.4 0.26 6.5E-06 29.1 22.0 275 10-317 12-303 (326) 10 PRK12399 tagatose 1,6-diphosph 94.2 0.52 1.3E-05 27.1 23.0 261 13-303 12-289 (324) 11 cd00949 FBP_aldolase_I_bact Fr 87.4 0.36 9.1E-06 28.1 2.0 63 58-133 76-138 (292) 12 PRK05377 fructose-1,6-bisphosp 85.0 0.4 1E-05 27.9 1.2 61 60-133 81-141 (296) 13 PRK12858 tagatose 1,6-diphosph 84.0 3.7 9.4E-05 21.5 22.5 259 13-303 15-292 (340) 14 COG0285 FolC Folylpolyglutamat 76.3 5.4 0.00014 20.5 4.5 105 208-319 286-396 (427) 15 TIGR02203 MsbA_lipidA lipid A 74.5 6.3 0.00016 20.0 4.5 94 171-270 411-514 (603) 16 COG3684 LacD Tagatose-1,6-bisp 66.3 11 0.00028 18.5 12.6 274 7-317 9-300 (306) 17 KOG3697 consensus 65.1 8.2 0.00021 19.3 3.4 100 208-319 68-169 (345) 18 TIGR00364 TIGR00364 exsB prote 60.3 6.8 0.00017 19.8 2.3 33 255-289 2-35 (227) 19 TIGR00389 glyS_dimeric glycyl- 59.4 11 0.00027 18.5 3.2 34 4-39 528-565 (606) 20 cd01995 ExsB ExsB is a transcr 54.9 15 0.00038 17.6 3.3 10 279-288 137-146 (169) 21 cd00945 Aldolase_Class_I Class 50.7 20 0.00051 16.7 11.9 169 60-273 25-193 (201) 22 PRK13772 formimidoylglutamase; 50.1 20 0.00052 16.7 5.3 87 91-189 42-128 (313) 23 KOG0964 consensus 49.4 11 0.00028 18.4 1.9 30 256-285 1096-1125(1200) 24 COG0367 AsnB Asparagine syntha 48.2 22 0.00056 16.5 5.7 45 232-276 211-255 (542) 25 pfam02938 GAD GAD domain. This 47.8 20 0.00051 16.7 3.0 41 82-130 1-41 (94) 26 PRK13946 shikimate kinase; Pro 41.9 27 0.00069 15.9 7.4 124 17-165 46-190 (195) 27 cd02426 Pol_gamma_b_Cterm C-te 40.3 27 0.00069 15.9 2.7 12 178-189 63-74 (128) 28 cd03239 ABC_SMC_head The struc 39.3 30 0.00076 15.6 2.8 52 257-318 94-145 (178) 29 PRK13948 shikimate kinase; Pro 36.1 33 0.00085 15.3 5.9 121 17-163 36-177 (182) 30 pfam12227 DUF3603 Protein of u 36.1 11 0.00029 18.4 0.2 14 91-104 4-17 (214) 31 PTZ00077 asparagine synthetase 34.7 35 0.00089 15.1 2.9 32 231-262 217-248 (610) 32 PRK01722 formimidoylglutamase; 34.4 36 0.00091 15.1 5.3 89 91-190 44-132 (320) 33 cd01399 GlcN6P_deaminase GlcN6 34.1 36 0.00091 15.1 4.2 19 253-271 182-200 (232) 34 pfam11285 DUF3086 Protein of u 33.9 32 0.00082 15.4 2.3 82 253-336 99-210 (283) 35 KOG1997 consensus 33.8 36 0.00092 15.1 2.6 91 200-322 1392-1484(1518) 36 PRK13774 formimidoylglutamase; 33.4 37 0.00094 15.0 6.5 90 90-190 45-135 (311) 37 PRK09431 asnB asparagine synth 32.9 37 0.00095 15.0 3.1 31 233-263 209-239 (555) 38 pfam08742 C8 C8 domain. This d 32.9 34 0.00087 15.2 2.3 20 154-173 45-64 (74) 39 PRK04173 glycyl-tRNA synthetas 31.7 37 0.00095 15.0 2.3 18 303-320 374-391 (460) 40 TIGR00243 Dxr 1-deoxy-D-xylulo 30.2 41 0.0011 14.7 2.5 126 6-142 110-254 (406) 41 COG1926 Predicted phosphoribos 30.1 42 0.0011 14.6 3.6 62 116-179 103-166 (220) 42 pfam10312 Cactin_mid Conserved 30.0 42 0.0011 14.6 3.8 65 134-207 24-90 (190) 43 KOG1014 consensus 29.9 42 0.0011 14.6 7.8 100 3-106 84-196 (312) 44 PRK10528 multifunctional acyl- 28.6 44 0.0011 14.5 6.6 145 76-245 26-184 (191) 45 TIGR02087 LEUD_arch 3-isopropy 28.0 45 0.0012 14.4 3.0 68 220-291 13-87 (159) 46 TIGR01536 asn_synth_AEB aspara 27.7 46 0.0012 14.4 2.9 182 40-262 105-322 (646) 47 TIGR02907 spore_VI_D stage VI 27.0 23 0.00058 16.4 0.5 13 132-144 321-334 (371) 48 pfam00218 IGPS Indole-3-glycer 26.8 31 0.00078 15.5 1.2 83 158-271 146-228 (254) 49 KOG3153 consensus 26.4 23 0.00058 16.4 0.5 119 72-208 29-156 (250) 50 KOG0225 consensus 25.9 32 0.00081 15.4 1.1 18 82-99 239-256 (394) 51 pfam03932 CutC CutC family. Co 25.9 49 0.0013 14.2 5.5 22 251-272 85-109 (202) 52 cd03272 ABC_SMC3_euk Eukaryoti 25.7 20 0.00052 16.7 0.1 40 253-292 154-193 (243) 53 KOG2311 consensus 24.7 52 0.0013 14.0 2.0 102 15-123 76-217 (679) 54 TIGR00088 trmD tRNA (guanine-N 24.6 37 0.00094 15.0 1.3 125 66-203 7-154 (247) 55 pfam07395 Mig-14 Mig-14. This 24.4 49 0.0012 14.2 1.8 24 263-292 225-248 (264) 56 KOG0416 consensus 24.1 48 0.0012 14.3 1.8 73 134-225 47-120 (189) 57 pfam04577 DUF563 Protein of un 24.0 50 0.0013 14.2 1.8 12 198-209 132-143 (246) 58 COG5400 Uncharacterized protei 24.0 42 0.0011 14.6 1.5 102 7-126 48-151 (205) 59 pfam02463 SMC_N RecF/RecN/SMC 24.0 48 0.0012 14.2 1.8 31 255-285 1076-1106(1162) 60 pfam10058 DUF2296 Predicted in 23.8 13 0.00034 17.9 -1.1 20 160-179 16-39 (51) 61 pfam04015 DUF362 Domain of unk 23.7 54 0.0014 13.9 4.8 21 155-176 88-108 (262) 62 pfam03140 DUF247 Plant protein 23.6 10 0.00026 18.6 -1.7 17 119-135 223-239 (390) 63 pfam02329 HDC Histidine carbox 23.6 23 0.00059 16.3 0.1 48 77-131 26-73 (306) 64 COG4598 HisP ABC-type histidin 23.5 36 0.00091 15.1 1.0 20 256-275 151-170 (256) 65 COG4909 PduC Propanediol dehyd 23.3 53 0.0014 14.0 1.9 39 63-105 365-404 (554) 66 PRK00026 trmD tRNA (guanine-N( 23.2 55 0.0014 13.9 2.5 13 102-114 52-64 (226) 67 TIGR02177 PorB_KorB 2-oxoacid: 23.1 34 0.00087 15.2 0.8 16 198-213 13-28 (302) 68 KOG2229 consensus 23.1 56 0.0014 13.8 2.2 84 230-317 288-379 (616) 69 COG4277 Predicted DNA-binding 22.5 57 0.0015 13.8 2.2 13 45-57 98-110 (404) 70 KOG0057 consensus 22.3 36 0.00093 15.0 0.8 98 155-270 394-500 (591) 71 COG0363 NagB 6-phosphogluconol 22.2 58 0.0015 13.7 5.0 18 253-270 186-203 (238) 72 PRK13689 hypothetical protein; 21.6 39 0.00099 14.9 0.9 32 191-223 9-42 (75) 73 TIGR00445 mraY phospho-N-acety 21.0 40 0.001 14.8 0.8 19 105-123 155-173 (336) 74 KOG3935 consensus 20.8 62 0.0016 13.5 2.8 29 233-263 348-376 (446) 75 TIGR00960 3a0501s02 Type II (G 20.8 56 0.0014 13.8 1.6 10 253-262 206-215 (216) 76 TIGR02111 PQQ_syn_pqqC coenzym 20.8 62 0.0016 13.5 2.1 54 40-96 64-117 (239) 77 smart00832 C8 C8 domain. This 20.4 63 0.0016 13.5 2.4 20 154-173 47-66 (76) 78 TIGR00452 TIGR00452 methyltran 20.4 63 0.0016 13.5 3.7 62 130-209 241-303 (316) 79 KOG1344 consensus 20.2 63 0.0016 13.5 4.3 80 180-268 223-308 (324) No 1 >PTZ00019 fructose-bisphosphate aldolase; Provisional Probab=100.00 E-value=0 Score=1092.24 Aligned_cols=337 Identities=49% Similarity=0.796 Sum_probs=333.0 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHCCCCCCHHHHHHHHHHHEECCCCHHEEEEEECCHHHHHHCCCCCCC Q ss_conf 47999999999747990999146887757878857899986788754564100778023046434677884100048874 Q gi|254780652|r 2 MQDLENSVAMMLQAGKGILAADESNATIQKRFKAIHLESTENSRRDYREMLFRAKDAMQYISSILLYEETLYQNAQDGTS 81 (339) Q Consensus 2 ~~eL~~~a~~~~~~gKGilAaDeS~gt~~Krl~~igventeenr~~~R~~l~tsp~~~~~IsGvILfeeTl~q~~~~g~~ 81 (339) ++||.+||++|++|||||||||||+|||+|||++|||||||||||.||++|||||++++||||||||||||||+++||++ T Consensus 6 ~~eL~~tA~~i~~~gKGILAaDES~~Ti~KRf~~igventeenRr~yRelLftt~~l~~yIsGvILfdETl~Q~t~dG~~ 85 (356) T PTZ00019 6 AKELAETAKKIVSPGKGILAADESTGTIKKRFDNIGLENTEENRAAYRELLFGTKGLGKFISGAILFEETLFQKNAAGVP 85 (356) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCHHHHEEEEECCHHHHCCCCCCCCC T ss_conf 99999999998179985887257752677667745999965778788875437876676010376758886030448988 Q ss_pred HHHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCC--CCCCCHHHHHHHHHHHH Q ss_conf 6798995798789942785421789998621578357999999854339555010002336--77874088889989999 Q gi|254780652|r 82 FVDLISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSV--SDVLPSMTVVKANMHIL 159 (339) Q Consensus 82 ~~~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i--~~~~Ps~~~I~~na~~l 159 (339) |+++|.++|||||||||+|++||+|+++|++|+|||||++||++||++||||||||+||+| ++++||+.||.+|++.| T Consensus 86 f~~~l~~~givpGIKVD~G~~~l~g~~~E~~T~GLDgL~eR~~~Y~~~GArFAKWRaV~~I~~~~~~PS~~~I~~Na~~L 165 (356) T PTZ00019 86 MVDLLREEGILPGIKVDKGLVTIPCTDGETSTQGLDGLAERCKKYYKAGARFAKWRAVLVIDLAKGKPSDLSITETAHTL 165 (356) T ss_pred HHHHHHHCCCEEEEEECCCCCCCCCCCCCEECCCCCCHHHHHHHHHHCCCCCEECCEEEEECCCCCCCCHHHHHHHHHHH T ss_conf 38989867971017717885218999823110464559999999998698610002588965899998699999869999 Q ss_pred HHHHHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC-CCCHHHHHH Q ss_conf 9999999844787201101103775798999999999999999998763853111000025565764443-589889987 Q gi|254780652|r 160 ARYAVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKSNMILPGKSAP-QVSDEEIAS 238 (339) Q Consensus 160 arYA~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~~~-~~~~eevA~ 238 (339) ||||+|||++||||||||||||||+|+||+|++||++||+.||++|.+|+|+|||+|||||||+||.+|+ ++++++||+ T Consensus 166 ArYA~icQ~~GlVPIVEPEVLmdG~H~ie~c~~VT~~vL~~vf~~L~~~~V~leG~lLKPnMV~pG~~~~~~~s~eeVA~ 245 (356) T PTZ00019 166 ARYAAICQENGLVPIVEPEILADGSHTIEVCAEVTEKVLAAVFKALNDHGVLLEGALLKPNMVTAGYDCPIKATSQDVGF 245 (356) T ss_pred HHHHHHHHHHCCCCCCCCHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEECCCCCCCCCCCCCCCCHHHHHH T ss_conf 99999998708734214032037765679999999999999999999749740661511651356643777699899999 Q ss_pred HHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 77999850287438625860588889999998999862789980599851078637889881899889999999999999 Q gi|254780652|r 239 NTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMGDCPWRLSFSYGRALQESVLRAWNGKQDNVVVAQRVLSHRA 318 (339) Q Consensus 239 ~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~~~pW~lsfSygRALQ~~~Lk~W~G~~eN~~~aQ~~~~~Ra 318 (339) +|++||+++||++||||+|||||||+++||.||||||++++.||.||||||||||+++|++|+||++|+.+||++|++|| T Consensus 246 ~Tv~~L~~~VP~aVPGI~FLSGGQSeeeAT~~LnamNk~~~~PW~LSFSYgRALQ~s~Lk~W~Gk~eNv~aAQ~al~~RA 325 (356) T PTZ00019 246 LTVRTLRRTVPPALPGVVFLSGGQSEEEASVNLNEINKLGPHPWALSFSYGRALQASVLKTWKGKKSNVEAARKVLLKRA 325 (356) T ss_pred HHHHHHHHCCCCCCCCCEECCCCCCHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 78999986199668822358899988999999999845999996487635788739999984897777999999999999 Q ss_pred HHHHHHHCCCCCCCCCCCCC Q ss_conf 74158978974867682125 Q gi|254780652|r 319 HMNSLATKGCWKKSLENSNG 338 (339) Q Consensus 319 k~ns~A~~G~y~~~~e~~~~ 338 (339) ++||+|++|+|++++|++++ T Consensus 326 k~Ns~A~~G~y~~~~~~~~a 345 (356) T PTZ00019 326 KANSLAQLGKYKGGAGGSDA 345 (356) T ss_pred HHHHHHHCCCCCCCCCCCCC T ss_conf 97379876887899997542 No 2 >pfam00274 Glycolytic Fructose-bisphosphate aldolase class-I. Probab=100.00 E-value=0 Score=1078.32 Aligned_cols=335 Identities=50% Similarity=0.824 Sum_probs=330.4 Q ss_pred HHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHCCCCCCHHHHHHHHHHHEECCCCHHEEEEEECCHHHHHHCCCCCCCHH Q ss_conf 99999999974799099914688775787885789998678875456410077802304643467788410004887467 Q gi|254780652|r 4 DLENSVAMMLQAGKGILAADESNATIQKRFKAIHLESTENSRRDYREMLFRAKDAMQYISSILLYEETLYQNAQDGTSFV 83 (339) Q Consensus 4 eL~~~a~~~~~~gKGilAaDeS~gt~~Krl~~igventeenr~~~R~~l~tsp~~~~~IsGvILfeeTl~q~~~~g~~~~ 83 (339) ||.+||++|+++||||||||||+|||+|||++|||||||||||+||+||||||++++||||||||||||||+++||++|+ T Consensus 1 EL~~~a~~~~~~gKGILAaDES~~T~~KRf~~igventeenRr~yRelLfttp~~~~~IsGvILfeETl~Q~~~dG~~f~ 80 (348) T pfam00274 1 ELIKTAKAIVAPGKGILAADESTGTIGKRLASIGVENTEENRRAYRQLLFTTPGLGEYISGVILFEETLYQKTDDGKPFV 80 (348) T ss_pred CHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCHHHCCCEEECCHHHHHCCCCCCCCHH T ss_conf 96899999817998388725776458777886499985566888877532577755511324531877503376898858 Q ss_pred HHHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 98995798789942785421789998621578357999999854339555010002336778740888899899999999 Q gi|254780652|r 84 DLISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHILARYA 163 (339) Q Consensus 84 ~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~larYA 163 (339) ++|.++|||||||||||++|++|+++|++|+|||||.+||++||++||||||||+||+|++++||+.||.+|++.||||| T Consensus 81 ~~L~~~GivpGIKVDkG~~~l~g~~~E~~T~GLDgL~eR~~~Y~~~GArFAKWRaVi~I~~~~PS~~~I~~Na~~LArYA 160 (348) T pfam00274 81 DVLKEKGIVPGIKVDKGLVPLAGTDGETTTQGLDGLAERCAEYYKQGARFAKWRAVLKISPNTPSELAIEENANVLARYA 160 (348) T ss_pred HHHHHCCCCCCEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEEEEECCCCCCCHHHHHHHHHHHHHHH T ss_conf 99985698553534778435899975420346456899999999769852201125552799985999999999999999 Q ss_pred HHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC-CCCHHHHHHHHHH Q ss_conf 999844787201101103775798999999999999999998763853111000025565764443-5898899877799 Q gi|254780652|r 164 VLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKSNMILPGKSAP-QVSDEEIASNTIK 242 (339) Q Consensus 164 ~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~~~-~~~~eevA~~T~~ 242 (339) +|||++||||||||||||||+|+|++|++||++||+.||++|++|+|+|||+|||||||+||.+|+ ++++++||.+|++ T Consensus 161 ~icQ~~glVPIVEPEVlmdG~H~ie~~~~vt~~vl~~vf~~L~~~~V~LeG~lLKPnMV~pG~~~~~~~~~e~VA~~Tv~ 240 (348) T pfam00274 161 AICQQNGLVPIVEPEILLDGDHDLERCQEVTEKVLAAVFKALNDHHVMLEGTLLKPNMVTPGADCPKKATPEDVAEATVR 240 (348) T ss_pred HHHHHCCCCCCCCCCEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHH T ss_conf 99986596220033131047656899999999999999999998498612522356602257768887999999999999 Q ss_pred HHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCC-CCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH Q ss_conf 985028743862586058888999999899986278-9980599851078637889881899889999999999999741 Q gi|254780652|r 243 IFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMG-DCPWRLSFSYGRALQESVLRAWNGKQDNVVVAQRVLSHRAHMN 321 (339) Q Consensus 243 ~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~-~~pW~lsfSygRALQ~~~Lk~W~G~~eN~~~aQ~~~~~Rak~n 321 (339) ||+++||++||||+|||||||+++||.|||+||++. ..||.||||||||||+++|++|+|+.+|+.+||++|++||++| T Consensus 241 ~l~~~VP~aVpGI~FLSGGqSeeeAt~~LnamN~~~~~~PW~LsFSygRALQ~s~Lk~W~G~~eNv~aAQ~~l~~Ra~~N 320 (348) T pfam00274 241 ALRRTVPPAVPGIVFLSGGQSEEEATLNLNAMNQLPLKKPWALSFSYGRALQASVLKAWGGKPENVKAAQEALLARAKAN 320 (348) T ss_pred HHHHHCCCCCCCCEECCCCCCHHHHHHHHHHHHCCCCCCCCEEEEEHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH T ss_conf 99863985478344368887589999999998636789998688727788619999980898788999999999999874 Q ss_pred HHHHCCCCCCCCCCCCC Q ss_conf 58978974867682125 Q gi|254780652|r 322 SLATKGCWKKSLENSNG 338 (339) Q Consensus 322 s~A~~G~y~~~~e~~~~ 338 (339) |+|++|+|++++|.+++ T Consensus 321 s~A~~G~y~~~~e~~~~ 337 (348) T pfam00274 321 SLAQLGKYTGGGEGAAA 337 (348) T ss_pred HHHHCCCCCCCCCCCCC T ss_conf 79875875777777555 No 3 >cd00948 FBP_aldolase_I_a Fructose-1,6-bisphosphate aldolase. The enzyme catalyzes the cleavage of fructose 1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate (DHAP). This family includes proteins found in vertebrates, plants, and bacterial plant pathogens. Mutations in the aldolase genes in humans cause hemolytic anemia and hereditary fructose intolerance. The enzyme is a member of the class I aldolase family, which utilizes covalent catalysis through a Schiff base formed between a lysine residue of the enzyme and ketose substrates. Probab=100.00 E-value=0 Score=1075.38 Aligned_cols=327 Identities=53% Similarity=0.851 Sum_probs=324.2 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHCCCCCCHHHHHHHHHHHEECCCCHHEEEEEECCHHHHHHCCCCCCCH Q ss_conf 79999999997479909991468877578788578999867887545641007780230464346778841000488746 Q gi|254780652|r 3 QDLENSVAMMLQAGKGILAADESNATIQKRFKAIHLESTENSRRDYREMLFRAKDAMQYISSILLYEETLYQNAQDGTSF 82 (339) Q Consensus 3 ~eL~~~a~~~~~~gKGilAaDeS~gt~~Krl~~igventeenr~~~R~~l~tsp~~~~~IsGvILfeeTl~q~~~~g~~~ 82 (339) +||.+||++|+++||||||||||+|||+|||++|||||||||||+||++|||||++++||||||||||||||+++||++| T Consensus 2 ~eL~~ta~~~~~~gKGILAaDES~~T~~KRf~~igventeenRr~yRelLftt~~~~~yIsGvILfdETl~Q~t~dG~~~ 81 (330) T cd00948 2 EELIKTAKAIVAPGKGILAADESTGTIGKRFASIGVENTEENRRAYRELLFTTPGLGQYISGVILFEETLYQKTDDGKPF 81 (330) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHCCCCCCHHHHHHHHHHHEECCCHHHCEEEEECCHHHHCCCCCCCCCH T ss_conf 58999999981699869872577536776677549998578898998885108873435237877677743456568814 Q ss_pred HHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 79899579878994278542178999862157835799999985433955501000233677874088889989999999 Q gi|254780652|r 83 VDLISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHILARY 162 (339) Q Consensus 83 ~~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~larY 162 (339) +++|.++||+||||||||++|++|+++|++|+|||||++||++||++||||||||+||+|++++||+.||.+|++.|||| T Consensus 82 ~~~L~~~givpGIKVD~G~~~l~g~~~E~~T~GLDgL~eR~~~Y~~~GArFAKWRaVi~I~~~~PS~~~I~~Na~~LArY 161 (330) T cd00948 82 VDILKEKGIVPGIKVDKGLVPLAGTDGETTTQGLDGLAERCAKYYKQGARFAKWRAVLKIGNGTPSELAIKENAHGLARY 161 (330) T ss_pred HHHHHHCCCEEEEECCCCCCCCCCCCCCEECCCCCCHHHHHHHHHHCCCCEEEEEEEEECCCCCCCHHHHHHHHHHHHHH T ss_conf 79998679734597379861689998341046745399999999986998234026787699998699999989999999 Q ss_pred HHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC-CCCHHHHHHHHH Q ss_conf 9999844787201101103775798999999999999999998763853111000025565764443-589889987779 Q gi|254780652|r 163 AVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKSNMILPGKSAP-QVSDEEIASNTI 241 (339) Q Consensus 163 A~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~~~-~~~~eevA~~T~ 241 (339) |+|||++||||||||||||||+|+||+|++||++||+.||++|++|+|+|||+|||||||+||++|+ ++++++||++|+ T Consensus 162 A~icQ~~glVPIVEPEVLmdG~H~ie~c~~vt~~vL~~vf~~L~~~~V~leG~lLKPnMV~~G~~~~~~~~~eeVA~~Tv 241 (330) T cd00948 162 AAICQENGLVPIVEPEVLMDGDHDIERCQEVTEKVLAAVYKALNDHHVLLEGTLLKPNMVTPGADCKKKASPEEVAEYTV 241 (330) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEECCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99999859813215010016763479999999999999999999829765774402650136765777599899999899 Q ss_pred HHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCC-CCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 9985028743862586058888999999899986278-998059985107863788988189988999999999999974 Q gi|254780652|r 242 KIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMG-DCPWRLSFSYGRALQESVLRAWNGKQDNVVVAQRVLSHRAHM 320 (339) Q Consensus 242 ~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~-~~pW~lsfSygRALQ~~~Lk~W~G~~eN~~~aQ~~~~~Rak~ 320 (339) +||+++||++||||+|||||||+++||.|||+||+++ +.||.||||||||||+++|++|+||++|+++||++|++||++ T Consensus 242 ~~L~~~VP~avPGI~FLSGGQSee~At~~LnamNk~~~~~PW~LsFSYgRALQ~s~Lk~W~G~~~Nv~aAQ~~l~~RAk~ 321 (330) T cd00948 242 RALRRTVPAAVPGIVFLSGGQSEEEATLNLNAMNKLPLPKPWALSFSYGRALQASALKAWGGKKENVEAAQKALLKRAKA 321 (330) T ss_pred HHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHHHCCCCCCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 99986099668821567799898999999999856999999668864368874999999589878899999999999997 Q ss_pred HHHHHCCCC Q ss_conf 158978974 Q gi|254780652|r 321 NSLATKGCW 329 (339) Q Consensus 321 ns~A~~G~y 329 (339) ||+|++|+| T Consensus 322 Ns~A~~G~y 330 (330) T cd00948 322 NSLAALGKY 330 (330) T ss_pred HHHHHCCCC T ss_conf 368866799 No 4 >cd00344 FBP_aldolase_I Fructose-bisphosphate aldolase class I. Fructose-1,6-bisphosphate aldolase is an enzyme of the glycolytic and gluconeogenic pathways found in vertebrates, plants, and bacteria. The enzyme catalyzes the cleavage of fructose 1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate (DHAP). Mutations in the aldolase genes in humans cause hemolytic anemia and hereditary fructose intolerance. The enzyme is a member of the class I aldolase family, which utilizes covalent catalysis through a Schiff base formed between a lysine residue of the enzyme and ketose substrates. Although structurally similar, the class II aldolases use a different mechanism and are believed to have an independent evolutionary origin. Probab=100.00 E-value=0 Score=1053.03 Aligned_cols=324 Identities=48% Similarity=0.779 Sum_probs=320.1 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHCCCCCCHHHHHHHHHHHEECCC-CHHEEEEEECCHHHHHHCCCCCCC Q ss_conf 7999999999747990999146887757878857899986788754564100778-023046434677884100048874 Q gi|254780652|r 3 QDLENSVAMMLQAGKGILAADESNATIQKRFKAIHLESTENSRRDYREMLFRAKD-AMQYISSILLYEETLYQNAQDGTS 81 (339) Q Consensus 3 ~eL~~~a~~~~~~gKGilAaDeS~gt~~Krl~~igventeenr~~~R~~l~tsp~-~~~~IsGvILfeeTl~q~~~~g~~ 81 (339) +||.+||++|++|||||||||||+|||+|||++|||||||||||+||++|||+|+ +++||||||||||||||+++||++ T Consensus 2 ~eL~~~a~~l~~~gKGILAaDES~~T~~KRf~~igventeenRr~yRelLftt~~~~~~~IsGvILfeETl~Q~~~~G~~ 81 (328) T cd00344 2 KELSDIAHRIVAPGKGILAADESTGSIAKRLQSIGTENTEENRRFYRQLLLTADDRVNPRIGGVILFHETLYQKADDGRP 81 (328) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHCCCEEECCHHHHCCCCCCCCC T ss_conf 58999999980799858872578645765676459999757699999998158246662470365648873330359972 Q ss_pred HHHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 67989957987899427854217899986215783579999998543395550100023367787408888998999999 Q gi|254780652|r 82 FVDLISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHILAR 161 (339) Q Consensus 82 ~~~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~lar 161 (339) |+++|.++|||||||||||++|++|+++|++|+|||||++||++||++||||||||+||+|++++||+.||.+|+++||| T Consensus 82 ~~~~l~~~givPGIKVDkG~~~l~g~~~E~~T~GLDgL~~R~~~Y~~~GarFAKWRaVi~I~~~~PS~~~I~~Na~~LAr 161 (328) T cd00344 82 FPQVIKSKGGVVGIKVDKGVVPLAGTNGETTTQGLDGLSERCAQYKKDGADFAKWRCVLKIGEHTPSALAIMENANVLAR 161 (328) T ss_pred HHHHHHHCCCEEEEEECCCCCCCCCCCCCEECCCHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCHHHHHHHHHHHHH T ss_conf 17999867982348707986238999702114771469999999997799854323788669999858999999999999 Q ss_pred HHHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC-CCCHHHHHHHH Q ss_conf 99999844787201101103775798999999999999999998763853111000025565764443-58988998777 Q gi|254780652|r 162 YAVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKSNMILPGKSAP-QVSDEEIASNT 240 (339) Q Consensus 162 YA~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~~~-~~~~eevA~~T 240 (339) ||+|||++||||||||||||||+|+|++|++||++||.++|++|.+|+|+|||+|||||||+||.+|+ ++++++||++| T Consensus 162 YA~icQ~~GLVPIVEPEVlmdG~H~ie~c~~Vt~~vL~~vf~~L~~~~V~leG~lLKpnMV~~G~~~~~~~~~~~VA~~T 241 (328) T cd00344 162 YASICQQNGIVPIVEPEILPDGDHDLKRCQYVTEKVLAAVYKALSDHHIYLEGTLLKPNMVTPGHACTQKFSHEEIAMAT 241 (328) T ss_pred HHHHHHHCCCCCCCCCHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCEEECCCCCCCCCCHHHHHHHH T ss_conf 99999970877600500215788778999999999999999999970776667452055210686456649989999999 Q ss_pred HHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCC-CCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 99985028743862586058888999999899986278-99805998510786378898818998899999999999997 Q gi|254780652|r 241 IKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMG-DCPWRLSFSYGRALQESVLRAWNGKQDNVVVAQRVLSHRAH 319 (339) Q Consensus 241 ~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~-~~pW~lsfSygRALQ~~~Lk~W~G~~eN~~~aQ~~~~~Rak 319 (339) ++||+++||++||||+|||||||++|||.|||+||++. ..||.||||||||||+++|++|+||++|+.+||++|++||+ T Consensus 242 v~~l~~~VP~aVpGI~FLSGGqSeeeAt~~LnamN~~~~~~PW~lsFSygRALQ~~~Lk~W~G~~~Nv~aaQ~~l~~Rak 321 (328) T cd00344 242 VTALRRTVPPAVTGVTFLSGGQSEEEASINLNAINKCPLLKPWALTFSYGRALQASALKAWGGKKENLKAAQEEYVKRAL 321 (328) T ss_pred HHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHHHCCCCCCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 99998529976887677779999999999999982699999956987526987399999848976639999999999999 Q ss_pred HHHHHHC Q ss_conf 4158978 Q gi|254780652|r 320 MNSLATK 326 (339) Q Consensus 320 ~ns~A~~ 326 (339) +||+|++ T Consensus 322 ~NslA~q 328 (328) T cd00344 322 ANSLAAQ 328 (328) T ss_pred HHHHHHC T ss_conf 7577319 No 5 >KOG1557 consensus Probab=100.00 E-value=0 Score=1027.09 Aligned_cols=334 Identities=50% Similarity=0.799 Sum_probs=328.7 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHCCCCCCHHHHHHHHHHHEECCCCHHEEEEEECCHHHHHHCCCCCCCH Q ss_conf 79999999997479909991468877578788578999867887545641007780230464346778841000488746 Q gi|254780652|r 3 QDLENSVAMMLQAGKGILAADESNATIQKRFKAIHLESTENSRRDYREMLFRAKDAMQYISSILLYEETLYQNAQDGTSF 82 (339) Q Consensus 3 ~eL~~~a~~~~~~gKGilAaDeS~gt~~Krl~~igventeenr~~~R~~l~tsp~~~~~IsGvILfeeTl~q~~~~g~~~ 82 (339) +||.+||++|++|||||||||||+|||||||.+|||||||+|||+||++|||+|++.+||||||||||||||+++||+|| T Consensus 14 ~EL~~~A~~I~~pGkGILAaDES~~T~gkRl~sIgveNtE~NRr~yRelLfttpg~~~~IsGvILfeETlyQkt~dGkpf 93 (363) T KOG1557 14 DELIKIAKKIVTPGKGILAADESTGTIGKRLASIGVENTEENRRAYRELLFTTPGLNQYISGVILFEETLYQKTDDGKPF 93 (363) T ss_pred HHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCHHHCCCEEEEEEEHHEEECCCCCCH T ss_conf 99999999862799724753577535877888617866188899999896358871103424786520010137899786 Q ss_pred HHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 79899579878994278542178999862157835799999985433955501000233677874088889989999999 Q gi|254780652|r 83 VDLISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHILARY 162 (339) Q Consensus 83 ~~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~larY 162 (339) +++|.++|||||||||||++||+|+++|++|||||||.+||++||+.||||||||||++|++++||.++|.+|++.|||| T Consensus 94 ~~~L~~~givpGIKvDKG~vplaGt~~E~ttqGLD~L~~Rca~y~k~Ga~FAKWR~vlki~~~~PS~lai~EnA~~LARY 173 (363) T KOG1557 94 VDLLKEKGIVPGIKVDKGLVPLAGTNGETTTQGLDGLAERCAQYYKDGARFAKWRAVLKIGDGTPSALAIKENANGLARY 173 (363) T ss_pred HHHHHHCCCCCCEEECCCCCCCCCCCCCCEEECHHHHHHHHHHHHHCCCCHHHEEEEEEECCCCCHHHHHHHHHHHHHHH T ss_conf 99999759821257557752455567862020322488999999875775210468888369994189999888899999 Q ss_pred HHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHH Q ss_conf 999984478720110110377579899999999999999999876385311100002556576444-3589889987779 Q gi|254780652|r 163 AVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKSNMILPGKSA-PQVSDEEIASNTI 241 (339) Q Consensus 163 A~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~~-~~~~~eevA~~T~ 241 (339) |.|||++||||||||||+.||||+|++|++|||+||+.||++|++|+|+||||||||||||||++| ++++||+||++|| T Consensus 174 A~IcQ~nGLVPIVEPEil~dGdHdi~r~~~VtE~Vla~vykaL~~hhV~lEGtLLKPnMVTpG~~s~~K~tpe~iA~~Tv 253 (363) T KOG1557 174 ASICQQNGLVPIVEPEILPDGDHDIKRCQYVTEKVLAAVYKALNDHHVYLEGTLLKPNMVTPGAESTEKYTPEQIALATV 253 (363) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEECEECCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99985369301026502158863489999999999999999862220565121414552368764566588899999999 Q ss_pred HHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCC-CCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 9985028743862586058888999999899986278-998059985107863788988189988999999999999974 Q gi|254780652|r 242 KIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMG-DCPWRLSFSYGRALQESVLRAWNGKQDNVVVAQRVLSHRAHM 320 (339) Q Consensus 242 ~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~-~~pW~lsfSygRALQ~~~Lk~W~G~~eN~~~aQ~~~~~Rak~ 320 (339) ++|+|+||++||||+|||||||+||||.||||||+.+ ..||.||||||||||+|+|++|+||+||+.+||++|++||++ T Consensus 254 taLrrtVP~AVPGI~FLSGgqseeeAt~nLnAiN~~~~~kpW~LtFSygRALq~s~L~aW~Gk~eni~~aq~~~l~ra~a 333 (363) T KOG1557 254 TALRRTVPAAVPGIVFLSGGQSEEEATLNLNAINQEPGPKPWSLTFSYGRALQASVLKAWGGKDENIAAAQEALLKRAKA 333 (363) T ss_pred HHHHHCCCCCCCEEEEECCCCCHHHHHCCHHHHHCCCCCCCCEEEEHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 99984399878656996177626654224677645888887356411788988999998559777799999999999861 Q ss_pred HHHHHCCCCCCCCCCC Q ss_conf 1589789748676821 Q gi|254780652|r 321 NSLATKGCWKKSLENS 336 (339) Q Consensus 321 ns~A~~G~y~~~~e~~ 336 (339) ||+|++|+|..+-+.. T Consensus 334 n~~a~~Gky~~~~~~~ 349 (363) T KOG1557 334 NSLAALGKYTGQASGG 349 (363) T ss_pred CCHHHCCCCCCCCCCC T ss_conf 4511205656655565 No 6 >COG3588 Fructose-1,6-bisphosphate aldolase [Carbohydrate transport and metabolism] Probab=100.00 E-value=0 Score=893.19 Aligned_cols=331 Identities=47% Similarity=0.694 Sum_probs=326.9 Q ss_pred CHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHCCCCCCHHHHHHHHHHHEECCCC-HHEEEEEECCHHHHHHCCCCC Q ss_conf 9479999999997479909991468877578788578999867887545641007780-230464346778841000488 Q gi|254780652|r 1 MMQDLENSVAMMLQAGKGILAADESNATIQKRFKAIHLESTENSRRDYREMLFRAKDA-MQYISSILLYEETLYQNAQDG 79 (339) Q Consensus 1 m~~eL~~~a~~~~~~gKGilAaDeS~gt~~Krl~~igventeenr~~~R~~l~tsp~~-~~~IsGvILfeeTl~q~~~~g 79 (339) |..+|+.||.+|+++||||||||||+||++|||++||||+||++||+||+|||+||++ ++||+|||||||||+|++++| T Consensus 1 m~erl~~~a~~~v~nGKG~iAADeS~gt~~krf~~~Gie~te~srrd~Re~l~~s~~~~~~yI~GaILfeeTm~q~~~~g 80 (332) T COG3588 1 MTERLNDTALKKVANGKGFIAADESGGTTPKRFDSYGIEETEYSRRDMRERLFTSPDFMEDYILGAILFEETMDQKADGG 80 (332) T ss_pred CCCCHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHEEHHHHHHHHHHCCC T ss_conf 95106689999975699658635789852668998188873030688998883585400223111001487887751589 Q ss_pred CCHHHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHHH Q ss_conf 74679899579878994278542178999862157835799999985433955501000233677874088889989999 Q gi|254780652|r 80 TSFVDLISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHIL 159 (339) Q Consensus 80 ~~~~~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~l 159 (339) +|++++|+++||+||||||||+.|++|+++|++|+|||||++||++||++|+||||||+||+|+++.||..||++|+++| T Consensus 81 ~p~~~l~~~kgivPgiKvDkGl~~~~g~~~ek~t~gLd~L~~R~~~~~~~GarfaKwRsvI~~~~~~ps~~~I~~nv~~l 160 (332) T COG3588 81 YPADYLWKEKGIVPGIKVDKGLKPLAGVQLEKPTEGLDGLLKRAKEYHIFGARFAKWRSVIKIADGIPSWGGIKANVHQL 160 (332) T ss_pred CCHHHHHHHCCCCCCEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHH T ss_conf 88799998549873056357865446996455776779999999986541433988999987126887611188999999 Q ss_pred HHHHHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 99999998447872011011037757989999999999999999987638531110000255657644435898899877 Q gi|254780652|r 160 ARYAVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKSNMILPGKSAPQVSDEEIASN 239 (339) Q Consensus 160 arYA~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~~~~~~~eevA~~ 239 (339) ||||++||++||||||||||+|||+||+++|++||+.+|..+|++|++..|.++|++||||||++|+.|.++||++||+. T Consensus 161 a~yAa~cq~aGlVPIVEPEV~mdg~~d~~~~eeVtk~~L~k~~~~L~~~~vvm~g~~lk~smv~~g~~~~~~s~~~vae~ 240 (332) T COG3588 161 AEYAALCQAAGLVPIVEPEVDIDGDHDKARSEEVTKAELRKLLNALNEERVVMLGLILKTSMVISGKKSREASPDEVAED 240 (332) T ss_pred HHHHHHHHHCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCHHHHHHH T ss_conf 99999999779832105404226763089999999999999998742667573044046200136521232466777777 Q ss_pred HHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 79998502874386258605888899999989998627899805998510786378898818998899999999999997 Q gi|254780652|r 240 TIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMGDCPWRLSFSYGRALQESVLRAWNGKQDNVVVAQRVLSHRAH 319 (339) Q Consensus 240 T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~~~pW~lsfSygRALQ~~~Lk~W~G~~eN~~~aQ~~~~~Rak 319 (339) |+.+++.+||+.||||||||||||++||++|||+||...++||.||||||||||+.+|++|+||.||++++|.+|++|+. T Consensus 241 tl~~~~~tvP~~vpgIvfLSGG~s~eeA~~~Lsa~n~~~~~pw~L~fSf~RAL~~~~L~aw~~d~enf~~~q~~~~e~~~ 320 (332) T COG3588 241 TLYSLLSTVPAVVPGIVFLSGGYSSEEANAHLSANNGERPLPWSLIFSFSRALQEGALKAWQGDEENFAKAQAASIERIY 320 (332) T ss_pred HHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 89999850774455048835876568899998750477888701333088886550665422527889999999999999 Q ss_pred HHHHHHCCCCCC Q ss_conf 415897897486 Q gi|254780652|r 320 MNSLATKGCWKK 331 (339) Q Consensus 320 ~ns~A~~G~y~~ 331 (339) +||+|+.|+|+. T Consensus 321 ~asl~a~gkw~~ 332 (332) T COG3588 321 EASLAALGKWKQ 332 (332) T ss_pred HHHHHHCCCCCC T ss_conf 987764077789 No 7 >cd00949 FBP_aldolase_I_bact Fructose-1.6-bisphosphate aldolase found in gram +/- bacteria. The enzyme catalyzes the cleavage of fructose 1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate (DHAP). The enzyme is member of the class I aldolase family, which utilizes covalent catalysis through a Schiff base formed between a lysine residue of the enzyme and ketose substrates. Probab=100.00 E-value=0 Score=679.18 Aligned_cols=261 Identities=25% Similarity=0.298 Sum_probs=228.4 Q ss_pred HHHHHCCCEEEEE-CCCCCCHHHHHHHCCCCCCHHHH--------HHHHHHHEECCCC-HHEEEEEECCHHHHHHCCCCC Q ss_conf 9997479909991-46887757878857899986788--------7545641007780-230464346778841000488 Q gi|254780652|r 10 AMMLQAGKGILAA-DESNATIQKRFKAIHLESTENSR--------RDYREMLFRAKDA-MQYISSILLYEETLYQNAQDG 79 (339) Q Consensus 10 ~~~~~~gKGilAa-DeS~gt~~Krl~~igventeenr--------~~~R~~l~tsp~~-~~~IsGvILfeeTl~q~~~~g 79 (339) .+++++||||||| |||+|||||||++||||+|++|+ |+||+||||||+| ++||||||||||||+|++ || T Consensus 4 l~~v~~GkGilAAlDES~gT~~Krl~~igve~~~~~~~~~~f~~~~~~R~~l~ts~~~~~~~IsGvILfeeTl~q~~-dG 82 (292) T cd00949 4 LERMKSGKGFIAALDQSGGSTPKALAAYGIEEDAYSNEEEMFDLVHEMRTRIITSPAFDGDKILGAILFEQTMDREI-EG 82 (292) T ss_pred HHHHCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEEHHHHCCCC-CC T ss_conf 87334898279700678882777899769998865668888888999999985698646682599996377426312-69 Q ss_pred CCHHHHH-HHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHH Q ss_conf 7467989-957987899427854217899986215783579999998543395550100023367787408888998999 Q gi|254780652|r 80 TSFVDLI-SSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHI 158 (339) Q Consensus 80 ~~~~~~l-~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~ 158 (339) .||+++| +++|||||||||||++|++ +|++.|+|+|||++||++|++.||+||||||||++ |+..+|.+|+++ T Consensus 83 ~p~~~~L~~~~GIvpgIKVDkGl~~~~--~g~~~~k~~dgLd~~~~r~~~~g~~gaKwRaVi~~----~~~~~i~~~~~~ 156 (292) T cd00949 83 KPTADYLWEKKQIVPFLKVDKGLAEEK--NGVQLMKPIPNLDELLMRAKEKGVFGTKMRSVIKE----ANPKGIAAVVDQ 156 (292) T ss_pred CCCHHHHHHCCCCCCEEEECCCCCCCC--CCCCCCCCCCCHHHHHHHHHHCCCCCEEHHHHHCC----CCHHHHHHHHHH T ss_conf 826999997289874068478976678--99711315878999999999805674306776348----987899999999 Q ss_pred HHHHHHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 99999999844787201101103775798999999999999999998763853111000025565764443589889987 Q gi|254780652|r 159 LARYAVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKSNMILPGKSAPQVSDEEIAS 238 (339) Q Consensus 159 larYA~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~~~~~~~eevA~ 238 (339) +|+||++||++||||||||||+||++|+.+.+..+...++.++++.+..++|+|+ ++++ +++. T Consensus 157 ~~~~A~~~q~~GLVPIVEPEV~i~~~~~~e~e~~l~~~l~~~l~~l~~~~~v~lK-----~tlP------------~~~~ 219 (292) T cd00949 157 QFELAKQILSHGLVPIIEPEVDIHSADKAKCEAILKAEILKHLDKLPEGQQVMLK-----LTLP------------TEAN 219 (292) T ss_pred HHHHHHHHHHCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEC-----CCCC------------CCCC T ss_conf 9999999998798245776316776658889999999999999845677674660-----5756------------7555 Q ss_pred HHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHCCCHH Q ss_conf 7799985028743862586058888999999899986278998059985107863788988189988 Q gi|254780652|r 239 NTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMGDCPWRLSFSYGRALQESVLRAWNGKQD 305 (339) Q Consensus 239 ~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~~~pW~lsfSygRALQ~~~Lk~W~G~~e 305 (339) .+..++.+ |+||||||||||||++|||.|||+|| +||||||||||++ |++|++++| T Consensus 220 ~~~~~~~~---P~V~~VvfLSGGqSeeeAt~~Ln~~n-------~~~~SfgRALqeg-L~~~q~~~e 275 (292) T cd00949 220 FYSELIEH---PKVLRVVALSGGYSREEANELLAKNN-------GVIASFSRALTEG-LSADQSDAE 275 (292) T ss_pred CCCCCCCC---CCCCEEEECCCCCCHHHHHHHHHHCC-------CCEEEEHHHHHCC-CCCCCCHHH T ss_conf 32010478---67661343588889899999997367-------9568717965581-021359999 No 8 >PRK05377 fructose-1,6-bisphosphate aldolase; Reviewed Probab=100.00 E-value=0 Score=649.45 Aligned_cols=261 Identities=26% Similarity=0.309 Sum_probs=238.5 Q ss_pred HHHHHCCCEEEEE-CCCCCCHHHHHHHCCCCCCHHHH--------HHHHHHHEECCCC-HHEEEEEECCHHHHHHCCCCC Q ss_conf 9997479909991-46887757878857899986788--------7545641007780-230464346778841000488 Q gi|254780652|r 10 AMMLQAGKGILAA-DESNATIQKRFKAIHLESTENSR--------RDYREMLFRAKDA-MQYISSILLYEETLYQNAQDG 79 (339) Q Consensus 10 ~~~~~~gKGilAa-DeS~gt~~Krl~~igventeenr--------~~~R~~l~tsp~~-~~~IsGvILfeeTl~q~~~~g 79 (339) .+.|+.||||||| |||+|||||||++||||++|+|| |+||+||||||+| ++||+|||||||||+|++ +| T Consensus 7 ~~~~~~GkGfiAAlDeS~gs~~Krl~~~gve~~e~~~~~emf~~~h~mR~ri~tsp~f~~~~I~GaILFeeTm~~~~-~G 85 (296) T PRK05377 7 LEKMKNGKGFIAALDQSGGSTPKALKLYGVEEDAYSNEEEMFDLVHEMRTRIITSPAFTGDKILGAILFEQTMDREI-EG 85 (296) T ss_pred HHHHHCCCCEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEHHHHHCCC-CC T ss_conf 98774698059963678881677899859997522568888999999999872687545670589997587651546-79 Q ss_pred CCHHHHH-HHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHH Q ss_conf 7467989-957987899427854217899986215783579999998543395550100023367787408888998999 Q gi|254780652|r 80 TSFVDLI-SSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHI 158 (339) Q Consensus 80 ~~~~~~l-~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~ 158 (339) .+|+++| +++||||||||||||.|+. +|++.|+|+|||++||++|++.|++||||||||+ .||..+|.+|+++ T Consensus 86 ~~~~~~L~~~kGIvPgIKVDkGl~~~~--~G~q~~k~~~gLd~ll~r~~~~g~~GtKwRsVI~----~~~~~~I~a~v~q 159 (296) T PRK05377 86 KPTADYLWEDKGVVPFLKVDKGLAEEA--NGVQLMKPIPNLDDLLARAVEKGIFGTKMRSVIK----EANEQGIKAVVAQ 159 (296) T ss_pred CCCHHHHHHCCCCCCEEEECCCCCCCC--CCCEEECCCCCHHHHHHHHHHCCCCHHHHHHHHC----CCCHHHHHHHHHH T ss_conf 720999986479860377367876678--9855402676589999999985774255787642----8999999999999 Q ss_pred HHHHHHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 99999999844787201101103775798999999999999999998763853111000025565764443589889987 Q gi|254780652|r 159 LARYAVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKSNMILPGKSAPQVSDEEIAS 238 (339) Q Consensus 159 larYA~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~~~~~~~eevA~ 238 (339) |||||++||++||||||||||+|||+|+.+.|..+++.++..+++...+|+|+|++|| | + ++. T Consensus 160 q~~yA~~iq~~GLVPIVEPEV~id~~~k~e~E~~l~~~il~~L~~l~~d~~VmLKlTl--P--------------e-~~~ 222 (296) T PRK05377 160 QFEVAKQILAAGLVPIIEPEVDINSPDKAEAEALLKAEILKQLDSLPEDQQVMLKLTI--P--------------T-EAN 222 (296) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEECCC--C--------------C-CCC T ss_conf 9999999998798321066535688226789999999999999735788717986247--6--------------5-100 Q ss_pred HHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHCCCHH Q ss_conf 7799985028743862586058888999999899986278998059985107863788988189988 Q gi|254780652|r 239 NTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMGDCPWRLSFSYGRALQESVLRAWNGKQD 305 (339) Q Consensus 239 ~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~~~pW~lsfSygRALQ~~~Lk~W~G~~e 305 (339) .+..+..+ |.|++|||||||||++|||.+|+. +|+|+|||||||+++ |.+|++++| T Consensus 223 ~y~~l~~h---p~V~rVVaLSGGys~~eA~~~L~~-------n~~~iaSFSRAL~eg-L~~~qsd~e 278 (296) T PRK05377 223 LYKELIDH---PKVLRVVALSGGYSRDEANELLAR-------NHGMIASFSRALTEG-LSAQQSDEE 278 (296) T ss_pred HHHHHCCC---CCCCEEEEECCCCCHHHHHHHHHC-------CCCCEEEEHHHHHCC-CCCCCCHHH T ss_conf 15663358---774358773588787999999843-------998478756864386-766689899 No 9 >PRK04161 tagatose 1,6-diphosphate aldolase; Reviewed Probab=95.41 E-value=0.26 Score=29.11 Aligned_cols=275 Identities=17% Similarity=0.203 Sum_probs=147.0 Q ss_pred HHHHHCCCEEEEE---CCCCCCHHHHHHHC-CCCCCHHHHHHHHHHHEECCCCHHEEEEEECCHHHHHHCCCCCCCHHHH Q ss_conf 9997479909991---46887757878857-8999867887545641007780230464346778841000488746798 Q gi|254780652|r 10 AMMLQAGKGILAA---DESNATIQKRFKAI-HLESTENSRRDYREMLFRAKDAMQYISSILLYEETLYQNAQDGTSFVDL 85 (339) Q Consensus 10 ~~~~~~gKGilAa---DeS~gt~~Krl~~i-gventeenr~~~R~~l~tsp~~~~~IsGvILfeeTl~q~~~~g~~~~~~ 85 (339) ++|+ ...|+++| ||- |++.|-++.. |.+.+.+.--.+..++. ..++.|-|+ ||.|--. |.|.... T Consensus 12 ~~ls-~~~G~~~~lAiDQR-gslk~mia~~~~~~~~~~~l~~fK~~v~--~~Ltp~ASa-ILlDp~y------g~pa~~~ 80 (326) T PRK04161 12 EKVS-HQNGIISALAFDQR-GALKQMMAAHQEGEATVTQIETLKVLVS--EELTPYASS-ILLDPEY------GLPATKV 80 (326) T ss_pred HHHC-CCCCCEEEEEECCC-HHHHHHHHHCCCCCCCHHHHHHHHHHHH--HHHHHHCCE-EEECCCC------CCCCHHC T ss_conf 9863-98998799985655-7899999854589997789999999999--986552022-6657654------7730320 Q ss_pred H-HHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 9-957987899427854217899986215783579999998543395550100023367787408888998999999999 Q gi|254780652|r 86 I-SSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHILARYAV 164 (339) Q Consensus 86 l-~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~larYA~ 164 (339) + .+.|.+-. .++-- .+.++....+.-+|+.. ....+..|+.--|-=.-++.++ +...........-|+.. T Consensus 81 ~~~~~GLlla--~E~tg--yd~t~~gRl~~li~~ws--v~rik~~GadAvK~L~yy~pD~---~~ein~~k~a~ve~ig~ 151 (326) T PRK04161 81 RDNQTGLLLA--YEKTG--YDATTTSRLPDCLVEWS--VKRLKEAGADAVKFLLYYDVDG---DEEINLQKQAYIERIGS 151 (326) T ss_pred CCCCCCEEEE--EECCC--CCCCCCCCCCCCCCCCC--HHHHHHHCCCCEEEEEEECCCC---CHHHHHHHHHHHHHHHH T ss_conf 2888765999--61346--77778765766676668--9999982846137999978999---87888999999999999 Q ss_pred HHHHCCCCCEECHHHC---CCCCCCHHHHHHHHHHHHHHHHHHHHH--CCCCCCCCCCCCCCCCC-----CCCCC--CCC Q ss_conf 9984478720110110---377579899999999999999999876--38531110000255657-----64443--589 Q gi|254780652|r 165 LCQNAGLVPIVEPEVL---MEGDHTIERCAEVTEFVLRTLFNELYT--MRVCFEGLVLKSNMILP-----GKSAP--QVS 232 (339) Q Consensus 165 i~Q~~GLVPIVEPEVl---~dG~H~ie~~~~vte~vL~~lf~~L~~--~~V~leg~lLKpnMv~p-----G~~~~--~~~ 232 (339) .|..+++-=++||=+. ++...+.+-....-++|+.. .++..+ ++|+ +||.-++.. |-... -++ T Consensus 152 eC~~~dipf~lE~l~Y~~~~~~~~~~eya~~Kp~~V~~~-~kefs~~~ygvD----vlKvE~Pvn~~~veg~~~~~~~yt 226 (326) T PRK04161 152 ECTAEDIPFFLELLSYDERISDNNSAAYAKLKPHKVNGA-MKVFSDKRFGVD----VLKVEVPVNMAYVEGFTEGEVVYS 226 (326) T ss_pred HHHHCCCCEEEEEEECCCCCCCCCCHHHHHHCHHHHHHH-HHHHCCCCCCCC----EEEECCCCCCCCCCCCCCCCCCCC T ss_conf 998779983898851078777766788998673899999-999614456997----798526776001024576654136 Q ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 88998777999850287438625860588889999998999862789980599851078637889881899889999999 Q gi|254780652|r 233 DEEIASNTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMGDCPWRLSFSYGRALQESVLRAWNGKQDNVVVAQR 312 (339) Q Consensus 233 ~eevA~~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~~~pW~lsfSygRALQ~~~Lk~W~G~~eN~~~aQ~ 312 (339) -++ |.....-+... ..+| -+|||.|-+.+.=..-|.-=.+. ...+ .-|==|||+=..+++.|.... -.++++ T Consensus 227 ~~e-a~~~fk~~~~a--~~~P-wv~LSAGV~~~~F~~~l~~A~~a-Gasf-~G~L~GRA~W~d~i~~~~~~g--~~~~~~ 298 (326) T PRK04161 227 QEE-AIKAFKDQEAA--THLP-YIYLSAGVSAKLFQETLYFAAEA-GAQF-NGVLCGRATWAGSVPVYITKG--EEAARK 298 (326) T ss_pred HHH-HHHHHHHHHHC--CCCC-EEEECCCCCHHHHHHHHHHHHHC-CCCC-CCEEECHHHHHHHHHHHCCCC--HHHHHH T ss_conf 999-99999998735--3999-79983999989999999999986-9984-567653687776776411326--899999 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780652|r 313 VLSHR 317 (339) Q Consensus 313 ~~~~R 317 (339) -|... T Consensus 299 wL~t~ 303 (326) T PRK04161 299 WLCTE 303 (326) T ss_pred HHHHH T ss_conf 99998 No 10 >PRK12399 tagatose 1,6-diphosphate aldolase; Reviewed Probab=94.19 E-value=0.52 Score=27.08 Aligned_cols=261 Identities=16% Similarity=0.209 Sum_probs=148.8 Q ss_pred HHCCCEEEEE---CCCCCCHHHHHH-HCCCCCCHHHHHHHHHHHEECCCCHHEEEEEECCHHHHHHCCCCCCCHHHHH-H Q ss_conf 7479909991---468877578788-5789998678875456410077802304643467788410004887467989-9 Q gi|254780652|r 13 LQAGKGILAA---DESNATIQKRFK-AIHLESTENSRRDYREMLFRAKDAMQYISSILLYEETLYQNAQDGTSFVDLI-S 87 (339) Q Consensus 13 ~~~gKGilAa---DeS~gt~~Krl~-~igventeenr~~~R~~l~tsp~~~~~IsGvILfeeTl~q~~~~g~~~~~~l-~ 87 (339) ++...|+++| ||-+ ++.|-++ .-|.+.+.+.--++..++.. .++.|-|++ |+|--. |.|..+.+ . T Consensus 12 ls~~~G~f~~lAiDQRg-sLkkmia~~~g~~~~~~~l~~fK~~v~~--~Lt~~ASaI-LlDpey------g~pa~~~~~~ 81 (324) T PRK12399 12 LSNDNGVISALAFDQRG-ALKRMMAQHQTEEPTVAQIEELKVLVSE--ELTPYASSI-LLDPEY------GLPASKARDE 81 (324) T ss_pred HCCCCCCEEEEEECCCH-HHHHHHHHHCCCCCCHHHHHHHHHHHHH--HHHHHCCEE-EECCCC------CCCHHHHCCC T ss_conf 54989976888855546-8999999833899988899999999999--876532346-657443------7611320288 Q ss_pred HCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 57987899427854217899986215783579999998543395550100023367787408888998999999999998 Q gi|254780652|r 88 SSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHILARYAVLCQ 167 (339) Q Consensus 88 ~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~larYA~i~Q 167 (339) +.|.+-.. ++.- .+.+++...+.-+|+..- ..-+..|+.--|-=.-++.++. .......-...-|+...|. T Consensus 82 ~~GLlla~--E~tg--yd~~~~gRl~~~i~~wSv--~rik~~GadAvK~L~yy~pD~~---~~in~~k~~~Verig~eC~ 152 (324) T PRK12399 82 EAGLLLAY--EKTG--YDATTTGRLPDCLDVWSA--KRIKEEGADAVKFLLYYDVDED---DEINEQKKAYIERIGSECV 152 (324) T ss_pred CCCEEEEE--CCCC--CCCCCCCCCCCCCCCCCH--HHHHHCCCCEEEEEEEECCCCC---HHHHHHHHHHHHHHHHHHH T ss_conf 88638996--0445--667888767443344589--9999706573889999779997---7888999999999999999 Q ss_pred HCCCCCEECHHHC---CCCCCCHHHHHHHHHHHHHHHHHHHHH--CCCCCCCCCCCCCCCCC-----CCCC--CCCCHHH Q ss_conf 4478720110110---377579899999999999999999876--38531110000255657-----6444--3589889 Q gi|254780652|r 168 NAGLVPIVEPEVL---MEGDHTIERCAEVTEFVLRTLFNELYT--MRVCFEGLVLKSNMILP-----GKSA--PQVSDEE 235 (339) Q Consensus 168 ~~GLVPIVEPEVl---~dG~H~ie~~~~vte~vL~~lf~~L~~--~~V~leg~lLKpnMv~p-----G~~~--~~~~~ee 235 (339) .+++-=++||=+. ++...+.+......++|+..+ ++..+ ++|++ ||.-.+.. |-.. .-++-++ T Consensus 153 ~~dipf~lE~l~Y~~~~~~~~~~eya~~Kp~~V~~~~-kefs~~~ygvDv----lKvE~Pvn~~~veg~~~ge~~yt~~e 227 (324) T PRK12399 153 AEDIPFFLEILTYDEKIADNGSVEYAKVKPHKVNEAM-KVFSKPRFNVDV----LKVEVPVNMKYVEGFAEGEVVYTKEE 227 (324) T ss_pred HCCCCCEEEEECCCCCCCCCCCHHHHHHCHHHHHHHH-HHHHCCCCCCCE----EEEECCCCCCCCCCCCCCCCCCCHHH T ss_conf 7699715765113677778766889876648999999-997033569876----98615766010135566752112999 Q ss_pred HHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHCCC Q ss_conf 98777999850287438625860588889999998999862789980599851078637889881899 Q gi|254780652|r 236 IASNTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMGDCPWRLSFSYGRALQESVLRAWNGK 303 (339) Q Consensus 236 vA~~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~~~pW~lsfSygRALQ~~~Lk~W~G~ 303 (339) .+.+.-.+-.. ..+| -+|||.|-+.+.=..-|..=.+. ..++ --|==|||+=..+++.|... T Consensus 228 a~~~fke~~~a---~~~P-wv~LSAGV~~~~F~~~l~~A~~a-Ga~f-sG~L~GRA~W~d~i~~~~~~ 289 (324) T PRK12399 228 AAQHFKDQDAA---TDLP-YIYLSAGVSAELFQETLVFAHEA-GAKF-NGVLCGRATWAGSVKVYIEE 289 (324) T ss_pred HHHHHHHHHHC---CCCC-EEEEECCCCHHHHHHHHHHHHHC-CCCC-CCEEECHHHHHHHHHHHCCC T ss_conf 99999998744---5999-79980899989999999999985-9973-35534588888787652121 No 11 >cd00949 FBP_aldolase_I_bact Fructose-1.6-bisphosphate aldolase found in gram +/- bacteria. The enzyme catalyzes the cleavage of fructose 1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate (DHAP). The enzyme is member of the class I aldolase family, which utilizes covalent catalysis through a Schiff base formed between a lysine residue of the enzyme and ketose substrates. Probab=87.43 E-value=0.36 Score=28.14 Aligned_cols=63 Identities=16% Similarity=0.095 Sum_probs=40.8 Q ss_pred CHHEEEEEECCHHHHHHCCCCCCCHHHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 0230464346778841000488746798995798789942785421789998621578357999999854339555 Q gi|254780652|r 58 AMQYISSILLYEETLYQNAQDGTSFVDLISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARV 133 (339) Q Consensus 58 ~~~~IsGvILfeeTl~q~~~~g~~~~~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~f 133 (339) +.+.+.|.-..+.-+..+ +-+|+. =-++|..|.- .|. ..++.++|||+|.+|+++|+.+||+. T Consensus 76 l~q~~dG~p~~~~L~~~~--GIvpgI--KVDkGl~~~~---~g~------~~~k~~dgLd~~~~r~~~~g~~gaKw 138 (292) T cd00949 76 MDREIEGKPTADYLWEKK--QIVPFL--KVDKGLAEEK---NGV------QLMKPIPNLDELLMRAKEKGVFGTKM 138 (292) T ss_pred HCCCCCCCCCHHHHHHCC--CCCCEE--EECCCCCCCC---CCC------CCCCCCCCHHHHHHHHHHCCCCCEEH T ss_conf 263126982699999728--987406--8478976678---997------11315878999999999805674306 No 12 >PRK05377 fructose-1,6-bisphosphate aldolase; Reviewed Probab=84.96 E-value=0.4 Score=27.87 Aligned_cols=61 Identities=18% Similarity=0.199 Sum_probs=35.5 Q ss_pred HEEEEEECCHHHHHHCCCCCCCHHHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 30464346778841000488746798995798789942785421789998621578357999999854339555 Q gi|254780652|r 60 QYISSILLYEETLYQNAQDGTSFVDLISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARV 133 (339) Q Consensus 60 ~~IsGvILfeeTl~q~~~~g~~~~~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~f 133 (339) +.+.|.-.-+. |..+ .+-+||.. -++|..|.- -|. +.++.+.|||+|.+||.+|+.+|++. T Consensus 81 ~~~~G~~~~~~-L~~~-kGIvPgIK--VDkGl~~~~---~G~------q~~k~~~gLd~ll~r~~~~g~~GtKw 141 (296) T PRK05377 81 REIEGKPTADY-LWED-KGVVPFLK--VDKGLAEEA---NGV------QLMKPIPNLDDLLARAVEKGIFGTKM 141 (296) T ss_pred CCCCCCCCHHH-HHHC-CCCCCEEE--ECCCCCCCC---CCC------EEECCCCCHHHHHHHHHHCCCCHHHH T ss_conf 54679720999-9864-79860377--367876678---985------54026765899999999857742557 No 13 >PRK12858 tagatose 1,6-diphosphate aldolase; Reviewed Probab=83.99 E-value=3.7 Score=21.54 Aligned_cols=259 Identities=19% Similarity=0.231 Sum_probs=143.9 Q ss_pred HHCCCEEEEE---CCCCCCHHHHHH-HCCCCCCHHHHHHHHHHHEECCCCHHEEEEEECCHHHHHHCCCCCCCHHHHH-H Q ss_conf 7479909991---468877578788-5789998678875456410077802304643467788410004887467989-9 Q gi|254780652|r 13 LQAGKGILAA---DESNATIQKRFK-AIHLESTENSRRDYREMLFRAKDAMQYISSILLYEETLYQNAQDGTSFVDLI-S 87 (339) Q Consensus 13 ~~~gKGilAa---DeS~gt~~Krl~-~igventeenr~~~R~~l~tsp~~~~~IsGvILfeeTl~q~~~~g~~~~~~l-~ 87 (339) ++...|+++| ||- |++.|-+. .-+-+.+.+.--++..++. ..++.|-|++.|-.+ . |.| ... . T Consensus 15 ls~~~G~f~alAiDQR-gslk~mia~~~~~~~~~~~l~~fK~~vs--~~Ltp~ASaILlDp~-y------glp--a~~~~ 82 (340) T PRK12858 15 LANERGVFRAAAMDQR-GSLKKMLAKARTDEATYRDLVDFKLAVS--EALTPYASAILLDPE-Y------GLP--AALVR 82 (340) T ss_pred HCCCCCCEEEEEECCC-HHHHHHHHHHCCCCCCHHHHHHHHHHHH--HHHHHHCCEEEECCC-C------CCH--HHHHC T ss_conf 5398998799985655-7899999874489998889999999999--987653136775734-3------751--56640 Q ss_pred --HCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf --579878994278542178999862157835799999985433955501000233677874088889989999999999 Q gi|254780652|r 88 --SSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHILARYAVL 165 (339) Q Consensus 88 --~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~larYA~i 165 (339) +.|.+-. .++-- ...+.+...+.-+|+..-+ ..+..||.--|-=.-+..++ ++.....+-...-|+... T Consensus 83 ~~~~GLlla--~E~tg--yd~~~~gRl~~li~~wsv~--rik~~GadAvK~L~yyrpD~---~~~in~~k~~~verig~e 153 (340) T PRK12858 83 DPNCGLLLS--YEKTG--YDATAPGRLPDLIDNWSVL--KIKEAGADAVKLLLYYRPDD---DKAINDRKQAFVERVGAE 153 (340) T ss_pred CCCCCEEEE--EECCC--CCCCCCCCCCCCCCCCCHH--HHHHCCCCEEEEEEEECCCC---CHHHHHHHHHHHHHHHHH T ss_conf 899775999--60124--6678998776677657899--99973968378999967999---768899999999999999 Q ss_pred HHHCCCCCEECHHHCCC---CCCCHHHHHHHHHHHHHHHHHHHH--HCCCCCCCCCCCCCCCCCCCCCC-------CCCH Q ss_conf 98447872011011037---757989999999999999999987--63853111000025565764443-------5898 Q gi|254780652|r 166 CQNAGLVPIVEPEVLME---GDHTIERCAEVTEFVLRTLFNELY--TMRVCFEGLVLKSNMILPGKSAP-------QVSD 233 (339) Q Consensus 166 ~Q~~GLVPIVEPEVl~d---G~H~ie~~~~vte~vL~~lf~~L~--~~~V~leg~lLKpnMv~pG~~~~-------~~~~ 233 (339) |..+++-=++||=+--. -.++.+.....-++|+..+ +++. +++|+ +||.-++...+..+ -++- T Consensus 154 C~~~dipf~lE~l~Y~~~~~d~~~~eyak~Kp~~V~~a~-kefs~~~ygvD----vlKvE~Pvn~~~veg~~~~~~~yt~ 228 (340) T PRK12858 154 CRANDIPFFLELLTYDFKISDKKSEEFAKVKPELVIATM-KEFSKPRYGVD----VLKVEVPVDMKFVEGRDGFAAAYTQ 228 (340) T ss_pred HHHCCCCEEEEEEECCCCCCCCCCHHHHHHCHHHHHHHH-HHHCCCCCCCC----EEEECCCCCCCCCCCCCCCCCCCCH T ss_conf 987799807887401688887657989986738899999-98637666986----7996266661202245675321359 Q ss_pred HHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHCCC Q ss_conf 8998777999850287438625860588889999998999862789980599851078637889881899 Q gi|254780652|r 234 EEIASNTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMGDCPWRLSFSYGRALQESVLRAWNGK 303 (339) Q Consensus 234 eevA~~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~~~pW~lsfSygRALQ~~~Lk~W~G~ 303 (339) ++...+ ++-+-.. ..+| -+|||.|-+.+.=..-|.-=.+. ...+ --|==|||+=..+++.+... T Consensus 229 ~ea~~~-fk~~~~a--~~~P-wv~LSAGV~~~~F~~~l~~A~~a-Ga~f-sG~L~GRA~W~dai~~~~~~ 292 (340) T PRK12858 229 EEAFKL-FREQSDA--TPLP-FIFLSAGVTPELFLRELEFAKQA-GADF-NGVLCGRATWQDAIEIFAPE 292 (340) T ss_pred HHHHHH-HHHHHHC--CCCC-EEEECCCCCHHHHHHHHHHHHHC-CCCC-CCEEECHHHHHHHHHCCCCC T ss_conf 999999-9999735--4999-79981899989999999999985-9972-35534578777577414643 No 14 >COG0285 FolC Folylpolyglutamate synthase [Coenzyme metabolism] Probab=76.27 E-value=5.4 Score=20.47 Aligned_cols=105 Identities=18% Similarity=0.198 Sum_probs=67.6 Q ss_pred CCCCCCCCCC----CCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCE Q ss_conf 3853111000----025565764443589889987779998502874386258605888899999989998627899805 Q gi|254780652|r 208 MRVCFEGLVL----KSNMILPGKSAPQVSDEEIASNTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMGDCPWR 283 (339) Q Consensus 208 ~~V~leg~lL----KpnMv~pG~~~~~~~~eevA~~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~~~pW~ 283 (339) .+|.++|=+= +|.+.+-|...| .-|.+-.+.|+...+. .+.+.++.|-+++.+...-|.....+ ..-|. T Consensus 286 ~~~~wpGR~e~l~~~p~i~lDgAHNp-----~aa~~La~~l~~~~~~-~~~~~~v~g~l~dKd~~~~l~~L~~~-~~~~~ 358 (427) T COG0285 286 ANVDWPGRLERLSENPLILLDGAHNP-----HAARALAETLKTLFND-RPRLTLVFGMLKDKDIAGMLAALLPI-VDEIY 358 (427) T ss_pred HHCCCCCEEEEECCCCEEEEECCCCH-----HHHHHHHHHHHHHHCC-CCCEEEEEEEECCCCHHHHHHHHHCC-CCEEE T ss_conf 86768733788337974999888999-----9999999999977335-77528999832588889999974064-76899 Q ss_pred EE-EEHHHHHHHHHHHHHCCCHHHHH-HHHHHHHHHHH Q ss_conf 99-85107863788988189988999-99999999997 Q gi|254780652|r 284 LS-FSYGRALQESVLRAWNGKQDNVV-VAQRVLSHRAH 319 (339) Q Consensus 284 ls-fSygRALQ~~~Lk~W~G~~eN~~-~aQ~~~~~Rak 319 (339) ++ |.+-||+-...|..-.+...+.. ...+..+.+++ T Consensus 359 ~~~~~~~ra~~~~~l~~~~~~~~~~~~~~~~~a~~~~~ 396 (427) T COG0285 359 TTPLPWPRALDAEELLAFAGERGGVELDDVAEALELAL 396 (427) T ss_pred ECCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 82589735489999999987635785003999999999 No 15 >TIGR02203 MsbA_lipidA lipid A export permease/ATP-binding protein MsbA; InterPro: IPR011917 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family consists of a single polypeptide chain transporter in the ATP-binding cassette (ABC) transporter family, MsbA, which exports lipid A. It may also act in multidrug resistance. Lipid A, a part of lipopolysaccharide, is found in the outer leaflet of the outer membrane of most Gram-negative bacteria. Members of this family are restricted to the Proteobacteria (although lipid A is more broadly distributed) and often are clustered with lipid A biosynthesis genes .; GO: 0005524 ATP binding, 0006869 lipid transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=74.54 E-value=6.3 Score=20.02 Aligned_cols=94 Identities=26% Similarity=0.317 Sum_probs=59.0 Q ss_pred CCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC-CCCCCHHH----HHHHHHHHHH Q ss_conf 872011011037757989999999999999999987638531110000255657644-43589889----9877799985 Q gi|254780652|r 171 LVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKSNMILPGKS-APQVSDEE----IASNTIKIFK 245 (339) Q Consensus 171 LVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~-~~~~~~ee----vA~~T~~~l~ 245 (339) ++-+=+=+|++|| |+|+ +.+=+-|..-+.....+=+.|.+|+ -|=|-.|.. ..+++-++ ++.+...-+- T Consensus 411 Fy~p~~G~IllDG-~~l~---d~~L~~LR~q~alVsQ~V~LFdDTi--A~NvaYg~~da~~~~~a~v~~A~~~Ayl~dfv 484 (603) T TIGR02203 411 FYEPDSGQILLDG-VDLQ---DYTLADLRRQVALVSQDVVLFDDTI--ANNVAYGRLDAEQVDRAEVERALAAAYLQDFV 484 (603) T ss_pred CCCCCCCCEEECC-CHHH---HCCHHHHHHHHHHHCCCEEECCCCH--HHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 6045888565278-4044---3026635623000215237205326--77644042340017878999999986468887 Q ss_pred HCCCCCCCEE-----EEECCCCCHHHHHHH Q ss_conf 0287438625-----860588889999998 Q gi|254780652|r 246 RVVPCAVQGI-----AFLSGGQSEQDATAR 270 (339) Q Consensus 246 ~~vP~~VpgI-----vFLSGGqs~eeAt~~ 270 (339) +..|..+.-- +=|||||-+=-|.+| T Consensus 485 d~lP~Gldt~vG~NG~~LSGGQRQRlAIAR 514 (603) T TIGR02203 485 DKLPLGLDTPVGENGSRLSGGQRQRLAIAR 514 (603) T ss_pred HCCCCCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 147888563424132237760458999999 No 16 >COG3684 LacD Tagatose-1,6-bisphosphate aldolase [Carbohydrate transport and metabolism] Probab=66.33 E-value=11 Score=18.46 Aligned_cols=274 Identities=21% Similarity=0.232 Sum_probs=130.6 Q ss_pred HHHHHHHHCCCEEEEE---CCCCCCHHHHHHHCCCCC--CHHHHHHHHHHHEECCCCHHEEEEEECCHHHHHHCCCCCCC Q ss_conf 9999997479909991---468877578788578999--86788754564100778023046434677884100048874 Q gi|254780652|r 7 NSVAMMLQAGKGILAA---DESNATIQKRFKAIHLES--TENSRRDYREMLFRAKDAMQYISSILLYEETLYQNAQDGTS 81 (339) Q Consensus 7 ~~a~~~~~~gKGilAa---DeS~gt~~Krl~~igven--teenr~~~R~~l~tsp~~~~~IsGvILfeeTl~q~~~~g~~ 81 (339) .+.+++.. -+|+++| ||-+ .+.+-|.+-+-+. |...--++..+ -+.-++.|-|.+.|-.|-=+..+ T Consensus 9 ~~~~~is~-~~G~iaaLA~DQRg-amr~m~a~~~~~epvt~~~led~Kv~--va~eLtpyASaiLLDpeyg~pa~----- 79 (306) T COG3684 9 YGIEDISR-ASGGIAALAFDQRG-AMRRMMAAAQAEEPVTVSQLEDFKVL--VAEELTPYASAILLDPEYGLPAI----- 79 (306) T ss_pred HHHHHHHC-CCCCEEEEEECCHH-HHHHHHHHCCCCCCCHHHHHHHHHHH--HHHHHHHHHHHHHCCHHHCCHHH----- T ss_conf 67998604-57866888854376-89999985788888478889999999--99986665677621742163688----- Q ss_pred HHHHHHHCCCEEEEEECCCEE-ECC-CCCCCCCCCCHHHHHHH--HHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHH Q ss_conf 679899579878994278542-178-99986215783579999--99854339555010002336778740888899899 Q gi|254780652|r 82 FVDLISSSGVLVGIKVDRGLK-PYP-LYPGEHITVGLDDLDVR--LKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMH 157 (339) Q Consensus 82 ~~~~l~~kGIvpgIKVDkGl~-~l~-g~~~e~~t~GLD~L~eR--~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~ 157 (339) ..+|.+ --|.||. ... -..+-+++.-|.+|... ++.-+..|+..-|.---+..++ | +.=..+-. T Consensus 80 -----~~~~~~---~~~~~lL~a~dktgyd~t~~~rlp~l~~~isa~riK~~G~~avK~Lvy~~~D~--~--e~neqk~a 147 (306) T COG3684 80 -----VERNAI---AKSCALLLAYDKTGYDNTIPVRLPDLLRKISAKRIKEDGGDAVKFLVYYRSDE--D--EINEQKLA 147 (306) T ss_pred -----HHHCCC---CCCCCEEEEHHHHCCCCCCCCCCHHHHHHHCHHHHHHHCCCCEEEEEEECCCC--H--HHHHHHHH T ss_conf -----875211---33431045403106888987653123111078999871666168999974896--1--77699999 Q ss_pred HHHHHHHHHHHCCCCCEECHHHCC-CCCCCHHHHHHHH-HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 999999999844787201101103-7757989999999-99999999998763853111000025565764443589889 Q gi|254780652|r 158 ILARYAVLCQNAGLVPIVEPEVLM-EGDHTIERCAEVT-EFVLRTLFNELYTMRVCFEGLVLKSNMILPGKSAPQVSDEE 235 (339) Q Consensus 158 ~larYA~i~Q~~GLVPIVEPEVl~-dG~H~ie~~~~vt-e~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~~~~~~~ee 235 (339) -+-|+...|-+.+|.-++||=+-- ++.++ +.-+++. .+| ..-.+.+.+-+|+ +||.-|+..+... -+| T Consensus 148 ~ierigsec~aedi~f~lE~ltyd~~~~d~-~eyak~kp~kV-~~a~k~fsd~Gad----vlKvevPvyveGe----~~e 217 (306) T COG3684 148 YIERIGSECHAEDLPFFLEPLTYDPRIGDK-EEYAKRKPQKV-IEAMKEFSDSGAD----VLKVEVPVYVEGE----QEE 217 (306) T ss_pred HHHHHHHHHHHCCCCEEEEEEECCCCCCCH-HHHHHHCHHHH-HHHHHHHCCCCCC----EEEEECCEECCCC----CHH T ss_conf 999987775006875167653038777886-88874065989-9999984337875----6986233002676----199 Q ss_pred HHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHH-HHHCC------CCCCCEEEEEHHHHHHHHHHHHHCCCHHHHH Q ss_conf 9877799985028743862586058888999999899-98627------8998059985107863788988189988999 Q gi|254780652|r 236 IASNTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLS-AMNSM------GDCPWRLSFSYGRALQESVLRAWNGKQDNVV 308 (339) Q Consensus 236 vA~~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~Ln-aiNk~------~~~pW~lsfSygRALQ~~~Lk~W~G~~eN~~ 308 (339) .+ ....-+..+ ..+|=| .||-|-|.+-=-.-+- ||..- +..-|.=+-+-|+.-+...|++-+- .|+. T Consensus 218 a~-~~f~~~~~~--~~lP~i-~LSAGV~~klF~~tv~fA~eaGAsGvL~GRAtWa~~v~~g~d~~re~Lrt~g~--~ni~ 291 (306) T COG3684 218 AA-AAFQRQNDH--INLPWI-YLSAGVSAKLFQRTVRFAMEAGASGVLAGRATWAGVVEQGEDAAREWLRTVGF--PNLD 291 (306) T ss_pred HH-HHHHHHHCC--CCCCEE-EEECCCCHHHHHHHHHHHHHCCCCEEEECHHHHHCCCCCCCHHHHHHHHHHCC--CCHH T ss_conf 99-999874047--789869-88467667775899999997388606743234412200395789999986153--2289 Q ss_pred HHHHHHHHH Q ss_conf 999999999 Q gi|254780652|r 309 VAQRVLSHR 317 (339) Q Consensus 309 ~aQ~~~~~R 317 (339) .-++.+.+- T Consensus 292 eL~~vlde~ 300 (306) T COG3684 292 ELNKVLDET 300 (306) T ss_pred HHHHHHHHH T ss_conf 999999986 No 17 >KOG3697 consensus Probab=65.07 E-value=8.2 Score=19.27 Aligned_cols=100 Identities=16% Similarity=0.250 Sum_probs=56.5 Q ss_pred CCCCCCCCCCCCCCCCCCCCCC-CCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 3853111000025565764443-589889987779998502874386258605888899999989998627899805998 Q gi|254780652|r 208 MRVCFEGLVLKSNMILPGKSAP-QVSDEEIASNTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSMGDCPWRLSF 286 (339) Q Consensus 208 ~~V~leg~lLKpnMv~pG~~~~-~~~~eevA~~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~~~~pW~lsf 286 (339) -+-.+.-++.++||-.+|.+-. ..+.-.+...+..|-+.---..+++|-|-|||-.+ ....+-++|| ++..|.--. T Consensus 68 ~n~~~s~i~gr~~lq~a~m~I~itvst~sL~l~~~d~kqiianh~m~~isfasGgD~d--t~~~~ayvaK-D~~~rRac~ 144 (345) T KOG3697 68 INPVLSSILGRSNLQFAGMSITLTISTSSLNLMTADCKQIIANHHMQSISFASGGDPD--TADYVAYVAK-DPVNRRACH 144 (345) T ss_pred CCCCCHHHCCCCCCCCCCCCEEEEEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH--HHHHHHHHCC-CCCCCCEEE T ss_conf 8642011137642114785369984244555420676552121245321245689920--6667765406-864330037 Q ss_pred EHHHHHHHHHHHHHCCCHHH-HHHHHHHHHHHHH Q ss_conf 51078637889881899889-9999999999997 Q gi|254780652|r 287 SYGRALQESVLRAWNGKQDN-VVVAQRVLSHRAH 319 (339) Q Consensus 287 SygRALQ~~~Lk~W~G~~eN-~~~aQ~~~~~Rak 319 (339) - |.-..|+.++ +..+-++|--|++ T Consensus 145 V---------lec~~g~a~DVI~Tigqaf~lrfk 169 (345) T KOG3697 145 I---------LECCDGLAQDVIGTIGQAFELRFK 169 (345) T ss_pred E---------EECCCCHHHHHHHHHHHHHEEECC T ss_conf 8---------860574257888765343200025 No 18 >TIGR00364 TIGR00364 exsB protein; InterPro: IPR004479 This protein family is represented by a single member in nearly every completed large (> 1000 genes) prokaryotic genome. In Sinorhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA , . In Arthrobacter viscosus, the homologous gene is designated alu1 and is associated with an aluminum tolerance phenotype. When expressed in Escherichia coli, it conferred aliminium tolerance . The entry also contains the gene queC, which is responsible for the conversion of GTP to 7-cyano-7-deazaguanine (preQ0). The biosynthesis of hypermodified tRNA nucleoside queuosine only occurs in eubacteria. It occupies the wobble position for all known tRNAs that are specific for Asp, Asn, His or Tyr .. Probab=60.26 E-value=6.8 Score=19.81 Aligned_cols=33 Identities=36% Similarity=0.578 Sum_probs=13.9 Q ss_pred EEEECCCCCHHHHHHHHHHHHCCC-CCCCEEEEEHH Q ss_conf 586058888999999899986278-99805998510 Q gi|254780652|r 255 IAFLSGGQSEQDATARLSAMNSMG-DCPWRLSFSYG 289 (339) Q Consensus 255 IvFLSGGqs~eeAt~~LnaiNk~~-~~pW~lsfSyg 289 (339) ||-|||||.- +|.-+-|+...+ ..=..+||-|| T Consensus 2 v~~lSGG~DS--tT~~~~a~~~~GkyeV~a~TF~YG 35 (227) T TIGR00364 2 VVVLSGGQDS--TTVLLIALDEGGKYEVHAITFDYG 35 (227) T ss_pred EEEEECCHHH--HHHHHHHHHCCCCEEEEEECCCCH T ss_conf 6874373468--999999996179507998546501 No 19 >TIGR00389 glyS_dimeric glycyl-tRNA synthetase; InterPro: IPR002315 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . In eubacteria, glycyl-tRNA synthetase (6.1.1.14 from EC) is an alpha2/beta2 tetramer composed of 2 different subunits , , . In some eubacteria, in archaea and eukaryota, glycyl-tRNA synthetase is an alpha2 dimer, this family. It belongs to class IIc and is one of the most complex synthetases. What is most interesting is the lack of similarity between the two types: divergence at the sequence level is so great that it is impossible to infer descent from common genes. The alpha (see IPR002310 from INTERPRO) and beta subunits (see IPR002311 from INTERPRO) also lack significant sequence similarity. However, they are translated from a single mRNA , and a single chain glycyl-tRNA synthetase from Chlamydia trachomatis has been found to have significant similarity with both domains, suggesting divergence from a single polypeptide chain . The sequence and crystal structure of the homodimeric glycyl-tRNA synthetase from Thermus thermophilus, shows that each monomer consists of an active site strongly resembling that of the aspartyl and seryl enzymes, a C-terminal anticodon recognition domain of 100 residues and a third domain unusually inserted between motifs 1 and 2 almost certainly interacting with the acceptor arm of tRNA(Gly). The C-terminal domain has a novel five-stranded parallel-antiparallel beta-sheet structure with three surrounding helices. The active site residues most probably responsible for substrate recognition, in particular in the Gly binding pocket, can be identified by inference from aspartyl-tRNA synthetase due to the conserved nature of the class II active site , .; GO: 0000166 nucleotide binding, 0004820 glycine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006426 glycyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=59.36 E-value=11 Score=18.50 Aligned_cols=34 Identities=24% Similarity=0.361 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHCCCEEEEE-CCCCCCHHHHH---HHCCCC Q ss_conf 9999999997479909991-46887757878---857899 Q gi|254780652|r 4 DLENSVAMMLQAGKGILAA-DESNATIQKRF---KAIHLE 39 (339) Q Consensus 4 eL~~~a~~~~~~gKGilAa-DeS~gt~~Krl---~~igve 39 (339) ....|.+.|.. .||.+. |.||.|||||. +.||++ T Consensus 528 ~a~~i~~~Lr~--~~i~~~~DdSG~SIGkRYrR~DEIGtP 565 (606) T TIGR00389 528 IAKEIEQALRK--SGIIIEYDDSGTSIGKRYRRADEIGTP 565 (606) T ss_pred HHHHHHHHHHH--CCCEEEECCCCCCHHHHHHCCCCCCCC T ss_conf 99999999862--697699877898610031014436887 No 20 >cd01995 ExsB ExsB is a transcription regulator related protein. It is a subfamily of a Adenosine nucleotide binding superfamily of proteins. This protein family is represented by a single member in nearly every completed large ( 1000 genes) prokaryotic genome. In Rhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA. In Arthrobacter viscosus, the homologous gene is designated ALU1 and is associated with an aluminum tolerance phenotype. The function is unknown Probab=54.94 E-value=15 Score=17.60 Aligned_cols=10 Identities=30% Similarity=0.607 Sum_probs=5.0 Q ss_pred CCCCEEEEEH Q ss_conf 9980599851 Q gi|254780652|r 279 DCPWRLSFSY 288 (339) Q Consensus 279 ~~pW~lsfSy 288 (339) ..||.+|+|= T Consensus 137 gv~~~~T~SC 146 (169) T cd01995 137 GVPLELTWSC 146 (169) T ss_pred CCCHHHCCCC T ss_conf 9979982157 No 21 >cd00945 Aldolase_Class_I Class I aldolases. The class I aldolases use an active-site lysine which stablilzes a reaction intermediates via Schiff base formation, and have TIM beta/alpha barrel fold. The members of this family include 2-keto-3-deoxy-6-phosphogluconate (KDPG) and 2-keto-4-hydroxyglutarate (KHG) aldolases, transaldolase, dihydrodipicolinate synthase sub-family, Type I 3-dehydroquinate dehydratase, DeoC and DhnA proteins, and metal-independent fructose-1,6-bisphosphate aldolase. Although structurally similar, the class II aldolases use a different mechanism and are believed to have an independent evolutionary origin. Probab=50.69 E-value=20 Score=16.74 Aligned_cols=169 Identities=15% Similarity=0.038 Sum_probs=94.7 Q ss_pred HEEEEEECCHHHHHHCCCCCCCHHHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEEC Q ss_conf 30464346778841000488746798995798789942785421789998621578357999999854339555010002 Q gi|254780652|r 60 QYISSILLYEETLYQNAQDGTSFVDLISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVV 139 (339) Q Consensus 60 ~~IsGvILfeeTl~q~~~~g~~~~~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaV 139 (339) ..+.++.++...+..- .+.+...++.....||-.+-. ..++.+...++++...||..-.+..- T Consensus 25 ~~~~av~v~p~~v~~~-------~~~l~~~~~~v~~vv~fp~g~----------~~~~~k~~e~~~ai~~GAdeid~v~~ 87 (201) T cd00945 25 YGFAAVCVNPGYVRLA-------ADALAGSDVPVIVVVGFPTGL----------TTTEVKVAEVEEAIDLGADEIDVVIN 87 (201) T ss_pred HCCCEEEECHHHHHHH-------HHHCCCCCCCEEEEECCCCCC----------CCHHHHHHHHHHHHHHCCCEEEEECC T ss_conf 2982999889999999-------998089998389995889999----------97779999999999909998997405 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 33677874088889989999999999984478720110110377579899999999999999999876385311100002 Q gi|254780652|r 140 LSVSDVLPSMTVVKANMHILARYAVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKS 219 (339) Q Consensus 140 i~i~~~~Ps~~~I~~na~~larYA~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKp 219 (339) +.-....- -....+.+......| +.|++.+|.+|.-.. .+ ...+....+...+-+ -..+|. T Consensus 88 ~~~~~~~~----~~~~~~ei~~v~~~~-~~g~~~kvi~e~~~l--~~--------~~~i~~a~~~~~~~G----adfvKt 148 (201) T cd00945 88 IGSLKEGD----WEEVLEEIAAVVEAA-DGGLPLKVILETRGL--KT--------ADEIAKAARIAAEAG----ADFIKT 148 (201) T ss_pred HHHHHCCC----HHHHHHHHHHHHHHH-CCCCCEEEEECCCCC--CC--------HHHHHHHHHHHHHHC----CCEEEE T ss_conf 67775668----899999999999973-579837999616778--99--------999999999999809----987985 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHH Q ss_conf 556576444358988998777999850287438625860588889999998999 Q gi|254780652|r 220 NMILPGKSAPQVSDEEIASNTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSA 273 (339) Q Consensus 220 nMv~pG~~~~~~~~eevA~~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~Lna 273 (339) +..-. ...++++.+ +.+++.++.. .+|..-+|-.+-+++...|.+ T Consensus 149 stG~~---~~~at~~~v-----~~m~~~~~~~-~~vk~sGGi~~~~~a~~~l~a 193 (201) T cd00945 149 STGFG---GGGATVEDV-----KLMKEAVGGR-VGVKAAGGIKTLEDALAAIEA 193 (201) T ss_pred CCCCC---CCCCCHHHH-----HHHHHHHCCC-CEEECCCCCCCHHHHHHHHHH T ss_conf 58878---898999999-----9999982878-638635897999999999982 No 22 >PRK13772 formimidoylglutamase; Provisional Probab=50.15 E-value=20 Score=16.68 Aligned_cols=87 Identities=21% Similarity=0.266 Sum_probs=61.4 Q ss_pred CEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 87899427854217899986215783579999998543395550100023367787408888998999999999998447 Q gi|254780652|r 91 VLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHILARYAVLCQNAG 170 (339) Q Consensus 91 IvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~larYA~i~Q~~G 170 (339) ++-|.-.|.|+.-..|-+|-+ +|=+.+++-++..-.. ..+.|+..+|-......+.+.-..|+....-+.+.| T Consensus 42 ~llGf~~D~GV~rN~GR~GA~--~GP~aIR~ala~l~~~-----~~~~v~D~GDv~~~~~dLe~aq~~l~~~v~~ll~~g 114 (313) T PRK13772 42 VLLGFACDAGVLRNQGRPGAA--HGPRAIRRALANVPAH-----GLPALYDAGDVVCEDGDLESAQDALAGTVAALLARG 114 (313) T ss_pred EEEECCCCCCHHCCCCCCCHH--HCHHHHHHHHHCCCCC-----CCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 999236530011159970555--6989999998468756-----898068788877599899999999999999999779 Q ss_pred CCCEECHHHCCCCCCCHHH Q ss_conf 8720110110377579899 Q gi|254780652|r 171 LVPIVEPEVLMEGDHTIER 189 (339) Q Consensus 171 LVPIVEPEVl~dG~H~ie~ 189 (339) -.||| +=|+|++-- T Consensus 115 ~~piv-----LGGdH~iA~ 128 (313) T PRK13772 115 ARPLV-----LGGGHEVAW 128 (313) T ss_pred CEEEE-----ECCCCHHHH T ss_conf 85799-----888532556 No 23 >KOG0964 consensus Probab=49.38 E-value=11 Score=18.41 Aligned_cols=30 Identities=33% Similarity=0.354 Sum_probs=24.7 Q ss_pred EEECCCCCHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 860588889999998999862789980599 Q gi|254780652|r 256 AFLSGGQSEQDATARLSAMNSMGDCPWRLS 285 (339) Q Consensus 256 vFLSGGqs~eeAt~~LnaiNk~~~~pW~ls 285 (339) --|||||-..-|.+-.=||.++.+.|+.|- T Consensus 1096 ~QLSGGQKsvvALaLIFaIQrcDPAPFYlf 1125 (1200) T KOG0964 1096 EQLSGGQKSVVALALIFAIQRCDPAPFYLF 1125 (1200) T ss_pred HHHCCCHHHHHHHHHHHHHHHCCCCCHHHH T ss_conf 883474287999999999982687401447 No 24 >COG0367 AsnB Asparagine synthase (glutamine-hydrolyzing) [Amino acid transport and metabolism] Probab=48.19 E-value=22 Score=16.49 Aligned_cols=45 Identities=24% Similarity=0.130 Sum_probs=26.4 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHC Q ss_conf 988998777999850287438625860588889999998999862 Q gi|254780652|r 232 SDEEIASNTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNS 276 (339) Q Consensus 232 ~~eevA~~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk 276 (339) ..+++...-..+.++++-+.||..+|||||-.----...+...+. T Consensus 211 ~~~~l~~~l~~sV~~r~~advpvg~~lSGGlDSS~Iaa~a~~~~~ 255 (542) T COG0367 211 LAEHLRSLLEDAVKRRLVADVPVGVFLSGGLDSSLIAAIAAEELG 255 (542) T ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHHC T ss_conf 889999999999986621588789996776229999999998623 No 25 >pfam02938 GAD GAD domain. This domain is found in some members of the GatB and aspartyl tRNA synthetases. Probab=47.80 E-value=20 Score=16.71 Aligned_cols=41 Identities=20% Similarity=0.358 Sum_probs=29.9 Q ss_pred HHHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 6798995798789942785421789998621578357999999854339 Q gi|254780652|r 82 FVDLISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAG 130 (339) Q Consensus 82 ~~~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~G 130 (339) |.+.+++.|+|-+|.|..+.. +.. .-+|.|.+.+++|-..| T Consensus 1 F~~~l~~GG~Vkai~vp~~a~-~sR-------k~id~l~~~ak~~GakG 41 (94) T pfam02938 1 FSEALKKGGSVKALCVPGGAG-LSR-------KQIDELERFAKEFGAKG 41 (94) T ss_pred CCHHHHCCCEEEEEECCCCCC-CCH-------HHHHHHHHHHHHHCCCC T ss_conf 934765899899998789862-258-------88999999999949884 No 26 >PRK13946 shikimate kinase; Provisional Probab=41.88 E-value=27 Score=15.87 Aligned_cols=124 Identities=22% Similarity=0.249 Sum_probs=63.9 Q ss_pred CEEEEECC-----CCCCHHHHHHHCCCCCCHHHHHHHHHHHE-ECCCCHHEE----EEEECCHHHHHHCCCCCCCHHHHH Q ss_conf 90999146-----88775787885789998678875456410-077802304----643467788410004887467989 Q gi|254780652|r 17 KGILAADE-----SNATIQKRFKAIHLESTENSRRDYREMLF-RAKDAMQYI----SSILLYEETLYQNAQDGTSFVDLI 86 (339) Q Consensus 17 KGilAaDe-----S~gt~~Krl~~igventeenr~~~R~~l~-tsp~~~~~I----sGvILfeeTl~q~~~~g~~~~~~l 86 (339) --|+-.|+ .+.|++.=|+.+| |+.+|++-...+ ........| +|++++++++ ++| T Consensus 46 ~~fiD~D~~IE~~~g~sI~eIF~~~G----E~~FR~~E~~~l~~l~~~~~~VIstGGG~v~~~~n~-----------~~L 110 (195) T PRK13946 46 LPFLDADTEIERAARMTIPEIFATYG----EPEFRDLERRVIARLLKGGPLVLATGGGAFMNEETR-----------AAI 110 (195) T ss_pred CCEEECHHHHHHHHCCCHHHHHHHHC----HHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHH-----------HHH T ss_conf 79898859999980998999999869----799999899999998648987997587423689999-----------999 Q ss_pred HHCCCEEEEEECCCE----------EECCCCCCCCCCCCHHHHH-HHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHH Q ss_conf 957987899427854----------2178999862157835799-99998543395550100023367787408888998 Q gi|254780652|r 87 SSSGVLVGIKVDRGL----------KPYPLYPGEHITVGLDDLD-VRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKAN 155 (339) Q Consensus 87 ~~kGIvpgIKVDkGl----------~~l~g~~~e~~t~GLD~L~-eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~n 155 (339) .++|++..|+++--. .|+- .+....+-+..|. +|.. +|.. |.++ |.. ++.+.+..+.+. T Consensus 111 ~~~g~vI~L~~~~e~l~~Rl~~~~~RPLl--~~~~~~~~l~~l~~~R~~-lY~~-Ad~~-----I~t-~~~s~~~ia~eI 180 (195) T PRK13946 111 REKGISVWLKADLDVLWERVSRRDTRPLL--RTADPKETLARLMEERYP-VYAQ-ADLT-----VES-RDVPHEVIADEV 180 (195) T ss_pred HHCCEEEEEECCHHHHHHHHCCCCCCCCC--CCCCHHHHHHHHHHHHHH-HHHH-CCEE-----EEC-CCCCHHHHHHHH T ss_conf 95796899849999999997289999989--998879999999999999-9997-8989-----889-989999999999 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254780652|r 156 MHILARYAVL 165 (339) Q Consensus 156 a~~larYA~i 165 (339) .+.|..|-.. T Consensus 181 i~~L~~~~~~ 190 (195) T PRK13946 181 IEALAAYLEV 190 (195) T ss_pred HHHHHHHHHH T ss_conf 9999999856 No 27 >cd02426 Pol_gamma_b_Cterm C-terminal domain of mitochondrial DNA polymerase gamma B subunit, which is required for processivity. Polymerase gamma replicates and repairs mitochondrial DNA. The c-terminal domain of its B subunit is strikingly similar to the anticodon-binding domain of glycyl tRNA synthetase. Probab=40.32 E-value=27 Score=15.86 Aligned_cols=12 Identities=17% Similarity=0.241 Sum_probs=4.3 Q ss_pred HHCCCCCCCHHH Q ss_conf 110377579899 Q gi|254780652|r 178 EVLMEGDHTIER 189 (339) Q Consensus 178 EVl~dG~H~ie~ 189 (339) -+..|.+.+|.+ T Consensus 63 ~~~~d~~~sIGk 74 (128) T cd02426 63 GYLETQHSSLEQ 74 (128) T ss_pred CCCCCCCCCHHH T ss_conf 635454347889 No 28 >cd03239 ABC_SMC_head The structural maintenance of chromosomes (SMC) proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and each has five distinct domains: amino- and carboxy-terminal globular domains, which contain sequences characteristic of ATPases, two coiled-coil regions separating the terminal domains , and a central flexible hinge. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair, and epigenetic silencing of gene expression. Probab=39.31 E-value=30 Score=15.61 Aligned_cols=52 Identities=23% Similarity=0.279 Sum_probs=33.7 Q ss_pred EECCCCCHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 60588889999998999862789980599851078637889881899889999999999999 Q gi|254780652|r 257 FLSGGQSEQDATARLSAMNSMGDCPWRLSFSYGRALQESVLRAWNGKQDNVVVAQRVLSHRA 318 (339) Q Consensus 257 FLSGGqs~eeAt~~LnaiNk~~~~pW~lsfSygRALQ~~~Lk~W~G~~eN~~~aQ~~~~~Ra 318 (339) +||||+-.--|...+=||+++.+.|.-+-=----| | |+.|+..-...+.+.+ T Consensus 94 lLSGGEksl~alal~~ai~~~~p~p~~iLDEvdAa-----L-----D~~N~~r~~~~i~el~ 145 (178) T cd03239 94 ILSGGEKSLSALALIFALQEIKPSPFYVLDEIDAA-----L-----DPTNRRRVSDMIKEMA 145 (178) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCC-----C-----CHHHHHHHHHHHHHHH T ss_conf 16752589999999999971389966998276554-----7-----9889999999999997 No 29 >PRK13948 shikimate kinase; Provisional Probab=36.13 E-value=33 Score=15.29 Aligned_cols=121 Identities=18% Similarity=0.185 Sum_probs=58.1 Q ss_pred CEEEEECC-----CCCCHHHHHHHCCCCCCHHHHHHHHHH-HEECCCCHHEE----EEEECCHHHHHHCCCCCCCHHHHH Q ss_conf 90999146-----887757878857899986788754564-10077802304----643467788410004887467989 Q gi|254780652|r 17 KGILAADE-----SNATIQKRFKAIHLESTENSRRDYREM-LFRAKDAMQYI----SSILLYEETLYQNAQDGTSFVDLI 86 (339) Q Consensus 17 KGilAaDe-----S~gt~~Krl~~igventeenr~~~R~~-l~tsp~~~~~I----sGvILfeeTl~q~~~~g~~~~~~l 86 (339) --|+-.|+ .+-|++.=|+.+| |+.+|+.-.- |-........| +|++++++++ ++| T Consensus 36 ~~fiD~D~~Ie~~~g~sI~eIF~~~G----E~~FR~~E~~~l~~l~~~~~~VIStGGG~v~~~~n~-----------~~l 100 (182) T PRK13948 36 LHFVDTDKLITRVVGKSIPEVFAQEG----EEYFRACEKEVVRRVTRLDYAVISLGGGTFIHEENR-----------RAL 100 (182) T ss_pred CCEEECHHHHHHHHCCCHHHHHHHHC----HHHHHHHHHHHHHHHHCCCCEEEECCCCEECCHHHH-----------HHH T ss_conf 59888849999988939999999848----999999999999999747996997588500589999-----------999 Q ss_pred HHCCCEEEEEECCCE---------EECCCCCCCCCCCCHHHH-HHHHHHHHHCCCEEECCEECCCC-CCCCCHHHHHHHH Q ss_conf 957987899427854---------217899986215783579-99999854339555010002336-7787408888998 Q gi|254780652|r 87 SSSGVLVGIKVDRGL---------KPYPLYPGEHITVGLDDL-DVRLKQYKDAGARVAKWRVVLSV-SDVLPSMTVVKAN 155 (339) Q Consensus 87 ~~kGIvpgIKVDkGl---------~~l~g~~~e~~t~GLD~L-~eR~~~y~~~Ga~faKwRaVi~i-~~~~Ps~~~I~~n 155 (339) .++|.|..++++--. .|+- ......+-|..| .+|..-|. . |. +.| .++.+.+..+.+. T Consensus 101 ~~~g~vv~L~~~~~~i~~R~~~~~RPll--~~~~~~~~l~~l~~eR~~~Y~-~-A~-------~~I~td~~~~~eiv~~I 169 (182) T PRK13948 101 LGRGPVVVLWASPETVYQRTKHSDRPLL--QVEDPLERIRTLMEEREPVYR-Q-GT-------IHVHSDGRPVEEIVEEV 169 (182) T ss_pred HHCCEEEEEECCHHHHHHHCCCCCCCCC--CCCCHHHHHHHHHHHHHHHHH-H-CC-------EEEECCCCCHHHHHHHH T ss_conf 9689899996999999988178999988--899879999999999999999-6-78-------89889989999999999 Q ss_pred HHHHHHHH Q ss_conf 99999999 Q gi|254780652|r 156 MHILARYA 163 (339) Q Consensus 156 a~~larYA 163 (339) .+.|..|+ T Consensus 170 i~~L~a~~ 177 (182) T PRK13948 170 VERLWAWA 177 (182) T ss_pred HHHHHHHH T ss_conf 99999998 No 30 >pfam12227 DUF3603 Protein of unknown function (DUF3603). This protein is found in bacteria and eukaryotes. Proteins in this family are about 250 amino acids in length. Probab=36.05 E-value=11 Score=18.35 Aligned_cols=14 Identities=29% Similarity=0.233 Sum_probs=10.3 Q ss_pred CEEEEEECCCEEEC Q ss_conf 87899427854217 Q gi|254780652|r 91 VLVGIKVDRGLKPY 104 (339) Q Consensus 91 IvpgIKVDkGl~~l 104 (339) -||-+||++.|-+. T Consensus 4 QVpllkVs~~l~~y 17 (214) T pfam12227 4 QVPLLKVSPELFHY 17 (214) T ss_pred CCCEEEECHHHHHH T ss_conf 66267866889999 No 31 >PTZ00077 asparagine synthetase; Provisional Probab=34.69 E-value=35 Score=15.14 Aligned_cols=32 Identities=28% Similarity=0.135 Sum_probs=19.6 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCEEEEECCCC Q ss_conf 89889987779998502874386258605888 Q gi|254780652|r 231 VSDEEIASNTIKIFKRVVPCAVQGIAFLSGGQ 262 (339) Q Consensus 231 ~~~eevA~~T~~~l~~~vP~~VpgIvFLSGGq 262 (339) .+.+++-..-..+.++.+-+.||==+|||||- T Consensus 217 ~~~~~lr~~L~~AV~~rL~sDVPvG~lLSGGL 248 (610) T PTZ00077 217 ADFEEIRIHLEKAVIKRLMGDVPFGILLSGGL 248 (610) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 54999999999999998225672366634770 No 32 >PRK01722 formimidoylglutamase; Provisional Probab=34.36 E-value=36 Score=15.11 Aligned_cols=89 Identities=22% Similarity=0.296 Sum_probs=51.4 Q ss_pred CEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 87899427854217899986215783579999998543395550100023367787408888998999999999998447 Q gi|254780652|r 91 VLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHILARYAVLCQNAG 170 (339) Q Consensus 91 IvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~larYA~i~Q~~G 170 (339) .+-|.-.|.|+.-.-|-+|-. +|=+.+++.+..+.. +.++-.|+..+|-.-....+.+.-..++..-.-+.+.| T Consensus 44 allG~~~D~GV~rN~GR~GA~--~GP~aIR~al~~l~~----~~~~~~l~D~GDv~~~~~~le~~~~~l~~~v~~ll~~g 117 (320) T PRK01722 44 ALIGFACDEGVKRNKGRSGAS--FAPDAIRRALANMAS----HAGHSRLYDLGDIVCHVTDLEESQQALADTVSHCLKAN 117 (320) T ss_pred EEECCCCCCCCCCCCCCCCHH--HCHHHHHHHHHHCCC----CCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 999035541435689983455--699999998740778----78998067568723489889999999999999999679 Q ss_pred CCCEECHHHCCCCCCCHHHH Q ss_conf 87201101103775798999 Q gi|254780652|r 171 LVPIVEPEVLMEGDHTIERC 190 (339) Q Consensus 171 LVPIVEPEVl~dG~H~ie~~ 190 (339) ..||| +-|+|+|--. T Consensus 118 ~~pIv-----LGGdHsia~g 132 (320) T PRK01722 118 PRTIV-----LGGGHEIAFG 132 (320) T ss_pred CEEEE-----ECCCCHHHHH T ss_conf 86899-----8476425778 No 33 >cd01399 GlcN6P_deaminase GlcN6P_deaminase: Glucosamine-6-phosphate (GlcN6P) deaminase subfamily; GlcN6P deaminase catalyzes the reversible conversion of GlcN6P to D-fructose-6-phosphate (Fru6P) and ammonium. The reaction is an aldo-keto isomerization coupled with an amination or deamination. It is the last step of the metabolic pathway of N-acetyl-D-glucosamine-6-phosphate (GlcNAc6P). GlcN6P deaminase is a hexameric enzyme that is allosterically activated by GlcNAc6P. Probab=34.11 E-value=36 Score=15.08 Aligned_cols=19 Identities=21% Similarity=0.447 Sum_probs=9.5 Q ss_pred CEEEEECCCCCHHHHHHHH Q ss_conf 6258605888899999989 Q gi|254780652|r 253 QGIAFLSGGQSEQDATARL 271 (339) Q Consensus 253 pgIvFLSGGqs~eeAt~~L 271 (339) -.|+|+..|..-.++..+. T Consensus 182 ~~i~~~~~G~~Ka~~v~~~ 200 (232) T cd01399 182 KEILLLATGEGKAEAVKKA 200 (232) T ss_pred CEEEEEECCHHHHHHHHHH T ss_conf 8699996582799999999 No 34 >pfam11285 DUF3086 Protein of unknown function (DUF3086). This family of proteins with unknown function appears to be restricted to Cyanobacteria. Probab=33.90 E-value=32 Score=15.39 Aligned_cols=82 Identities=16% Similarity=0.291 Sum_probs=43.4 Q ss_pred CEEEEECCCCCHHHHHHHHHHHHCC---C---CCCCEEEEEHHHHHHHHHHHHH-------------CCCHHHHH--HHH Q ss_conf 6258605888899999989998627---8---9980599851078637889881-------------89988999--999 Q gi|254780652|r 253 QGIAFLSGGQSEQDATARLSAMNSM---G---DCPWRLSFSYGRALQESVLRAW-------------NGKQDNVV--VAQ 311 (339) Q Consensus 253 pgIvFLSGGqs~eeAt~~LnaiNk~---~---~~pW~lsfSygRALQ~~~Lk~W-------------~G~~eN~~--~aQ 311 (339) +.+.|++++..+++.-.+=. +.+. + .-||.|-=||--.= ...+.-| +....|+- +|- T Consensus 99 ~~~~~~~~~F~~d~~~Ir~~-l~~yr~qPDyYa~PWqLRRs~e~~~-~e~~~~WFf~qGGRGa~~s~gsR~~NiLvasa~ 176 (283) T pfam11285 99 STPPAAADTFKPDERLIRQL-LEQFREQPDYYADPWKLRRSLEPVD-AELLEDWFFNQGGRGAQPSRGSRLQNILVASAL 176 (283) T ss_pred CCCCHHHHCCCHHHHHHHHH-HHHHHHCCCCCCCHHHHHCCCCHHH-HHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 88740333045369999999-9998519987787067651556777-999998876479874556777417899999999 Q ss_pred HHHH-----HHH----HHHHHHHCCCCCCCCCCC Q ss_conf 9999-----999----741589789748676821 Q gi|254780652|r 312 RVLS-----HRA----HMNSLATKGCWKKSLENS 336 (339) Q Consensus 312 ~~~~-----~Ra----k~ns~A~~G~y~~~~e~~ 336 (339) -..+ .|. .+|+--++|+|-..+.+. T Consensus 177 isiL~eLYGd~~~~LvLA~~PERLGEWRRgLQDc 210 (283) T pfam11285 177 ISILGELYGDRFQTLVLAGQPERLGEWRRGLQDC 210 (283) T ss_pred HHHHHHHHCCHHEEEEECCCCHHHHHHHHHHHHH T ss_conf 9999998311311323459804666788888886 No 35 >KOG1997 consensus Probab=33.82 E-value=36 Score=15.05 Aligned_cols=91 Identities=16% Similarity=0.127 Sum_probs=44.5 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCCC--CCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHCC Q ss_conf 999998763853111000025565764443--589889987779998502874386258605888899999989998627 Q gi|254780652|r 200 TLFNELYTMRVCFEGLVLKSNMILPGKSAP--QVSDEEIASNTIKIFKRVVPCAVQGIAFLSGGQSEQDATARLSAMNSM 277 (339) Q Consensus 200 ~lf~~L~~~~V~leg~lLKpnMv~pG~~~~--~~~~eevA~~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~LnaiNk~ 277 (339) ++-++...+.++.++ ..|++.|.-.. +..|-++|.. |||+-+-+.--..++| |+ T Consensus 1392 el~~a~~~~~~d~k~----LQ~~LQGsV~~tVN~Gpl~~a~v-----------------Fl~~~~~~~~~~~~~~---kL 1447 (1518) T KOG1997 1392 ELRKATNQEPPDPKM----LQLVLQGSVGVTVNAGPLAYARV-----------------FLSEEPADRYPDRHLN---KL 1447 (1518) T ss_pred HHHHHHCCCCCCHHH----HHHHHHCCCCCEECCCHHHHHHH-----------------HHCCCCCCCCCCHHHH---HH T ss_conf 999860579987799----99997136103145660999999-----------------9735867666425789---99 Q ss_pred CCCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 899805998510786378898818998899999999999997415 Q gi|254780652|r 278 GDCPWRLSFSYGRALQESVLRAWNGKQDNVVVAQRVLSHRAHMNS 322 (339) Q Consensus 278 ~~~pW~lsfSygRALQ~~~Lk~W~G~~eN~~~aQ~~~~~Rak~ns 322 (339) .++| |-++..|-.|-.-...-+..-|+++-.-.+-|- T Consensus 1448 -----r~~f---r~f~~~C~~Al~~n~~lI~~Dq~ey~~~L~~nf 1484 (1518) T KOG1997 1448 -----RLCF---REFMKVCQDALELNERLIGEDQKEYHEELESNF 1484 (1518) T ss_pred -----HHHH---HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf -----9999---999999999999989863612899999999739 No 36 >PRK13774 formimidoylglutamase; Provisional Probab=33.40 E-value=37 Score=15.01 Aligned_cols=90 Identities=17% Similarity=0.227 Sum_probs=59.4 Q ss_pred CC-EEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98-78994278542178999862157835799999985433955501000233677874088889989999999999984 Q gi|254780652|r 90 GV-LVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSDVLPSMTVVKANMHILARYAVLCQN 168 (339) Q Consensus 90 GI-vpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~~~Ps~~~I~~na~~larYA~i~Q~ 168 (339) +| +-|.-.|.|+.-..|-+|-. +|=+.+++-+..-. .+-..+.++..+|-.-....+.+..+.+++..+-+.+ T Consensus 45 ~vallGf~~D~GV~rN~GR~GA~--~gP~aiR~~l~~l~----~~~~~~~i~D~Gdv~~~~~~Le~aq~~l~~~v~~~l~ 118 (311) T PRK13774 45 GVGILGYAVDKGVALNKGRIGAK--EGPDAIKQAFAGLP----DLNQCETLVDYGNVYHDHEELIDTQKEFAMLAAKSIA 118 (311) T ss_pred CEEEEEECCCCCEECCCCCCCHH--HCHHHHHHHHHCCC----CCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 87999213681412478986633--59599999985687----6576775565787775972399999999999999997 Q ss_pred CCCCCEECHHHCCCCCCCHHHH Q ss_conf 4787201101103775798999 Q gi|254780652|r 169 AGLVPIVEPEVLMEGDHTIERC 190 (339) Q Consensus 169 ~GLVPIVEPEVl~dG~H~ie~~ 190 (339) .|..||| +-|+|+|.-. T Consensus 119 ~g~~piv-----LGGdH~ia~~ 135 (311) T PRK13774 119 NHRQTFL-----LGGGHDIAYA 135 (311) T ss_pred CCCCEEE-----ECCCCHHHHH T ss_conf 6995289-----8886078889 No 37 >PRK09431 asnB asparagine synthetase B; Provisional Probab=32.94 E-value=37 Score=14.96 Aligned_cols=31 Identities=23% Similarity=0.219 Sum_probs=20.0 Q ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEECCCCC Q ss_conf 8899877799985028743862586058888 Q gi|254780652|r 233 DEEIASNTIKIFKRVVPCAVQGIAFLSGGQS 263 (339) Q Consensus 233 ~eevA~~T~~~l~~~vP~~VpgIvFLSGGqs 263 (339) .+++-+.-..+.++.+.+.||==+|||||-. T Consensus 209 ~~~lr~lL~~AV~~rL~sDvpvG~~LSGGLD 239 (555) T PRK09431 209 KNELREALEAAVKKRLMSDVPYGVLLSGGLD 239 (555) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEECCCHH T ss_conf 9999999999999983568862432057706 No 38 >pfam08742 C8 C8 domain. This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826. Probab=32.93 E-value=34 Score=15.22 Aligned_cols=20 Identities=35% Similarity=0.491 Sum_probs=16.2 Q ss_pred HHHHHHHHHHHHHHHCCCCC Q ss_conf 98999999999998447872 Q gi|254780652|r 154 ANMHILARYAVLCQNAGLVP 173 (339) Q Consensus 154 ~na~~larYA~i~Q~~GLVP 173 (339) .....|+.||..||..|..+ T Consensus 45 ~~C~~l~~Ya~~C~~~Gv~i 64 (74) T pfam08742 45 CLCSALAAYARACAQAGVCV 64 (74) T ss_pred HHCHHHHHHHHHHHHCCCCC T ss_conf 30499999999999971968 No 39 >PRK04173 glycyl-tRNA synthetase; Provisional Probab=31.72 E-value=37 Score=14.98 Aligned_cols=18 Identities=11% Similarity=0.015 Sum_probs=9.4 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 988999999999999974 Q gi|254780652|r 303 KQDNVVVAQRVLSHRAHM 320 (339) Q Consensus 303 ~~eN~~~aQ~~~~~Rak~ 320 (339) |++-...|++.+....+. T Consensus 374 k~~l~~~a~~i~~~L~~~ 391 (460) T PRK04173 374 KDKLSEKAREIYAELRKD 391 (460) T ss_pred CCCHHHHHHHHHHHHHHC T ss_conf 612589999999999877 No 40 >TIGR00243 Dxr 1-deoxy-D-xylulose 5-phosphate reductoisomerase; InterPro: IPR003821 1-deoxy-D-xylulose 5-phosphate reductoisomerase synthesizes 2-C-methyl-D-erythritol 4-phosphate from 1-deoxy-D-xylulose 5-phosphate in a single step by intramolecular rearrangement and reduction and is responsible for terpenoid biosynthesis in some organisms . In Arabidopsis thaliana 1-deoxy-D-xylulose 5-phosphate reductoisomerase is the first committed enzyme of the non-mevalonate pathway for isoprenoid biosynthesis.; GO: 0003824 catalytic activity, 0008299 isoprenoid biosynthetic process. Probab=30.22 E-value=41 Score=14.67 Aligned_cols=126 Identities=23% Similarity=0.157 Sum_probs=72.1 Q ss_pred HHHHHHHHHCCCEEEEE-CCCCCCHHHHH----HHCCCC----CCHHH--HHHH----HHHHEECCCCHHEEEEEECCHH Q ss_conf 99999997479909991-46887757878----857899----98678--8754----5641007780230464346778 Q gi|254780652|r 6 ENSVAMMLQAGKGILAA-DESNATIQKRF----KAIHLE----STENS--RRDY----REMLFRAKDAMQYISSILLYEE 70 (339) Q Consensus 6 ~~~a~~~~~~gKGilAa-DeS~gt~~Krl----~~igve----nteen--r~~~----R~~l~tsp~~~~~IsGvILfee 70 (339) .-|-+++- .||-|+=| -||==|+|.-| +.||+. ++|-| ++.. -+.|=-++--...+.-.|| T Consensus 110 lPtl~Ai~-~~K~iaLANKEsLVt~G~l~~d~vk~~~~~LlPvDSEH~AiFq~LP~~~Q~~~G~~~~~~~~~~~iil--- 185 (406) T TIGR00243 110 LPTLKAIE-AGKTIALANKESLVTAGHLVLDAVKKYGVKLLPVDSEHSAIFQSLPQEIQNKLGYAGLKELGVKRIIL--- 185 (406) T ss_pred HHHHHHHH-HCCCEEECCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCHHHHHHCCCCCCCCCCCCEEEE--- T ss_conf 88899998-47814401205477556789988973488104423216889760797788616867722168407898--- Q ss_pred HHHHCCCCCCCHHHHHHHCCCEEEEEECCCEEECCCCCCCCCCC----CHHHHHHHHHHHHHCCCEEECCEECCCC Q ss_conf 84100048874679899579878994278542178999862157----8357999999854339555010002336 Q gi|254780652|r 71 TLYQNAQDGTSFVDLISSSGVLVGIKVDRGLKPYPLYPGEHITV----GLDDLDVRLKQYKDAGARVAKWRVVLSV 142 (339) Q Consensus 71 Tl~q~~~~g~~~~~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~----GLD~L~eR~~~y~~~Ga~faKwRaVi~i 142 (339) |..|=+|.|.-.+. +|.+-||.-|+.-.=+=|.|+|- -+.-+.|-...|+=||+..-.+-.||+- T Consensus 186 -----taSGG~FRd~Ple~--l~~~t~~~AlkHPNWSMG~KItiDSAtmmNKGLE~iEA~WLF~~s~~~idv~ih~ 254 (406) T TIGR00243 186 -----TASGGAFRDTPLEE--LPIVTVEDALKHPNWSMGKKITIDSATMMNKGLEVIEAHWLFGASYEQIDVVIHP 254 (406) T ss_pred -----ECCCCCCCCCHHHH--HCCCCHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHCEEEECC T ss_conf -----56885210221565--1267858873587866748322001767866899999988607886474164667 No 41 >COG1926 Predicted phosphoribosyltransferases [General function prediction only] Probab=30.05 E-value=42 Score=14.65 Aligned_cols=62 Identities=23% Similarity=0.220 Sum_probs=39.6 Q ss_pred HHHHHHHHHHHHHCCCEE-ECCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHH-CCCCCEECHHH Q ss_conf 357999999854339555-01000233677874088889989999999999984-47872011011 Q gi|254780652|r 116 LDDLDVRLKQYKDAGARV-AKWRVVLSVSDVLPSMTVVKANMHILARYAVLCQN-AGLVPIVEPEV 179 (339) Q Consensus 116 LD~L~eR~~~y~~~Ga~f-aKwRaVi~i~~~~Ps~~~I~~na~~larYA~i~Q~-~GLVPIVEPEV 179 (339) ...|..|-..|...-..| -|||+||-.++|..+-....+-+..+- ++.|++ -=.||+--++. T Consensus 103 ~~El~rrr~~yr~~~~~~~~~g~~VIlVDDGiATGatm~aAi~~~r--~~~~~~IviAVPV~p~~a 166 (220) T COG1926 103 RKELLRRREAYRGGRPVPSLKGRTVILVDDGIATGATMKAAVRALR--AKGPKEIVIAVPVAPEDA 166 (220) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHH--HCCCCEEEEECCCCCHHH T ss_conf 9999999999728899878888989997077212077999999998--628860899844597899 No 42 >pfam10312 Cactin_mid Conserved mid region of cactin. This is the conserved middle region of a family of proteins referred to as cactins. The region contains two of three predicted coiled-coil domains. Most members of this family have a CactinC_cactus pfam09732 domain at the C-terminal end. Upstream of Mid_cactin in Drosophila members are a serine-rich region, some non-typical RD motifs and three predicted bipartite nuclear localisation signals, none of which are well-conserved. Cactin associates with IkappaB-cactus as one of the intracellular members of the Rel (NF-kappaB) pathway which is conserved in invertebrates and vertebrates. In mammals, this pathway controls the activities of the immune and inflammatory response genes as well as viral genes, and is critical for cell growth and survival. In Drosophila, the Rel pathway functions in the innate cellular and humoral immune response, in muscle development, and in the establishment of dorsal-ventral polarity in the early embryo. Probab=30.00 E-value=42 Score=14.64 Aligned_cols=65 Identities=23% Similarity=0.273 Sum_probs=44.5 Q ss_pred ECCEECCCCCCCCCHHHHHHHHHHHHHHHHHHH--HHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 010002336778740888899899999999999--84478720110110377579899999999999999999876 Q gi|254780652|r 134 AKWRVVLSVSDVLPSMTVVKANMHILARYAVLC--QNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYT 207 (339) Q Consensus 134 aKwRaVi~i~~~~Ps~~~I~~na~~larYA~i~--Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~ 207 (339) +|-||-|.|.+|-+-.- +.|++|..++ .+..=+.+.||..+++|- +.+...+..+.+. +|..|.. T Consensus 24 ak~Rs~iRi~~gRaKpI------D~La~~l~~~~~~~~~e~~~~eP~~~l~gL-~~~eleeL~~dIk--~y~~Le~ 90 (190) T pfam10312 24 AKKRSEIRIKEGRAKPI------DLLARNLRFIDPEKDVEFDLHEPYTVLEGL-TVDELEELEEDIK--MYLELEK 90 (190) T ss_pred HHHHHHHHHHCCCCCHH------HHHHHHHHHCCCCCCCCCCCCCHHHHHCCC-CHHHHHHHHHHHH--HHHHHHC T ss_conf 99998848870899708------999999764131011133435899998269-9999999999999--9999874 No 43 >KOG1014 consensus Probab=29.94 E-value=42 Score=14.64 Aligned_cols=100 Identities=21% Similarity=0.346 Sum_probs=61.9 Q ss_pred HHHHHHHHHHHHCCC---EEEEECCCCCC-----HHHHHHHCCCC---CCHHHHHHHHHHHEECCC-CHHEEEEEECCHH Q ss_conf 799999999974799---09991468877-----57878857899---986788754564100778-0230464346778 Q gi|254780652|r 3 QDLENSVAMMLQAGK---GILAADESNAT-----IQKRFKAIHLE---STENSRRDYREMLFRAKD-AMQYISSILLYEE 70 (339) Q Consensus 3 ~eL~~~a~~~~~~gK---GilAaDeS~gt-----~~Krl~~igve---nteenr~~~R~~l~tsp~-~~~~IsGvILfee 70 (339) +.|..+++.|....| -++++|=..++ +.+-|..+.|. |+--+.+.|.+.+...|. .-+.|.-|=++-- T Consensus 84 ~KL~~v~kEI~~~~~vev~~i~~Dft~~~~~ye~i~~~l~~~~VgILVNNvG~~~~~P~~f~~~~~~~~~~ii~vN~~~~ 163 (312) T KOG1014 84 EKLEAVAKEIEEKYKVEVRIIAIDFTKGDEVYEKLLEKLAGLDVGILVNNVGMSYDYPESFLKYPEGELQNIINVNILSV 163 (312) T ss_pred HHHHHHHHHHHHHHCCEEEEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHCCHHHHHHEEEEECCHH T ss_conf 99999999998875807999998648981568999988627864899965531678837787385564531467743268 Q ss_pred HHHHCCCCCCCHHHHH-HHCCCEEEEEECCCEEECCC Q ss_conf 8410004887467989-95798789942785421789 Q gi|254780652|r 71 TLYQNAQDGTSFVDLI-SSSGVLVGIKVDRGLKPYPL 106 (339) Q Consensus 71 Tl~q~~~~g~~~~~~l-~~kGIvpgIKVDkGl~~l~g 106 (339) ++--.. -+|... ..+|++..|.--.|+.|.|. T Consensus 164 ~~~t~~----ilp~M~~r~~G~IvnigS~ag~~p~p~ 196 (312) T KOG1014 164 TLLTQL----ILPGMVERKKGIIVNIGSFAGLIPTPL 196 (312) T ss_pred HHHHHH----HHHHHHCCCCCEEEEECCCCCCCCCHH T ss_conf 999988----505553378866998226335566715 No 44 >PRK10528 multifunctional acyl-CoA thioesterase I and protease I and lysophospholipase L1; Provisional Probab=28.59 E-value=44 Score=14.49 Aligned_cols=145 Identities=14% Similarity=0.188 Sum_probs=78.5 Q ss_pred CCCCCCHHHHHHHCCCEEEEEECC-CEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCC-C----CCCHH Q ss_conf 048874679899579878994278-54217899986215783579999998543395550100023367-7----87408 Q gi|254780652|r 76 AQDGTSFVDLISSSGVLVGIKVDR-GLKPYPLYPGEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVS-D----VLPSM 149 (339) Q Consensus 76 ~~~g~~~~~~l~~kGIvpgIKVDk-Gl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~-~----~~Ps~ 149 (339) .+.+..+|.+|.++ .+. +-+-..|..|+|...||.-|..-+.++.-.- .+|-.+ | +.|.+ T Consensus 26 ~~~~~~~p~~L~~~-------l~~~~~viNaGvSGdTT~~gL~Rl~~~L~~~~P~~-------Vii~lGgND~LRg~~~~ 91 (191) T PRK10528 26 MPASAAWPALLNDK-------WQSKTSVVNASISGDTSQQGLARLPALLKQHQPRW-------VLVELGGNDGLRGFPPQ 91 (191) T ss_pred CCCCCCHHHHHHHH-------HHHCCCEEECCCCHHHHHHHHHHHHHHHHCCCCCE-------EEEEECCCCHHCCCCHH T ss_conf 88443759999999-------86189488668763325469999898874079998-------99984684343089989 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCEE-CHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHH-CC-----CCCCCCCCCCCCC Q ss_conf 88899899999999999844787201-10110377579899999999999999999876-38-----5311100002556 Q gi|254780652|r 150 TVVKANMHILARYAVLCQNAGLVPIV-EPEVLMEGDHTIERCAEVTEFVLRTLFNELYT-MR-----VCFEGLVLKSNMI 222 (339) Q Consensus 150 ~~I~~na~~larYA~i~Q~~GLVPIV-EPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~-~~-----V~leg~lLKpnMv 222 (339) . +.+|.. +.-..|++.|..||+ +..+. .++.-.. ++ -...+|..+.. ++ .+++++.++|.+. T Consensus 92 ~-~~~NL~---~ii~~~~~~g~~vlL~g~~~p--~n~g~~y----~~-~f~~iy~~lA~e~~v~l~pf~l~~v~~~~~~~ 160 (191) T PRK10528 92 Q-TEQTLR---QIIQDVKAANAQPLLMQIRLP--ANYGRRY----NE-AFSAIYPKLAKEFDIPLLPFFMEEVYLKPQWM 160 (191) T ss_pred H-HHHHHH---HHHHHHHHCCCCEEEEECCCC--CCCCHHH----HH-HHHHHHHHHHHHHCCCEEHHHHHHHCCCHHHC T ss_conf 9-999999---999999988997899706567--5512667----88-89999999999809974137887563597552 Q ss_pred CCCCCCC-CCCHHHHHHHHHHHHH Q ss_conf 5764443-5898899877799985 Q gi|254780652|r 223 LPGKSAP-QVSDEEIASNTIKIFK 245 (339) Q Consensus 223 ~pG~~~~-~~~~eevA~~T~~~l~ 245 (339) .+---.| ....+-+|+.-...|+ T Consensus 161 q~DgiHPN~~G~~~ia~~v~~~L~ 184 (191) T PRK10528 161 QDDGIHPNRDAQPFIADWMAKQLQ 184 (191) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 888899598899999999999999 No 45 >TIGR02087 LEUD_arch 3-isopropylmalate dehydratase, small subunit; InterPro: IPR011827 3-isopropylmalate dehydratase (or isopropylmalate isomerase; 4.2.1.33 from EC) catalyses the stereo-specific isomerisation of 2-isopropylmalate and 3-isopropylmalate, via the formation of 2-isopropylmaleate. This enzyme performs the second step in the biosynthesis of leucine, and is present in most prokaryotes and many fungal species. The prokaryotic enzyme is a heterodimer composed of a large (LeuC) and small (LeuD) subunit, while the fungal form is a monomeric enzyme. Both forms of isopropylmalate are related and are part of the larger aconitase family . Aconitases are mostly monomeric proteins which share four domains in common and contain a single, labile [4Fe-4S] cluster. Three structural domains (1, 2 and 3) are tightly packed around the iron-sulphur cluster, while a fourth domain (4) forms a deep active-site cleft. The prokaryotic enzyme is encoded by two adjacent genes, leuC and leuD, corresponding to aconitase domains 1-3 and 4 respectively , . LeuC does not bind an iron-sulphur cluster. It is thought that some prokaryotic isopropylamalate dehydrogenases can also function as homoaconitase 4.2.1.36 from EC, converting cis-homoaconitate to homoisocitric acid in lysine biosynthesis . Homoaconitase has been identified in higher fungi (mitochondria) and several archaea and one thermophilic species of bacteria, Thermus thermophilus . This entry is most closely related to the 3-isopropylmalate dehydratase IPR004431 from INTERPRO. It also includes members, which cluster with other genes of leucine biosynthesis. The structure of the Pyrococcus horikoshii small subunit (O59393 from SWISSPROT) has recently been determined . As expected the structure of this polypeptide is similar to that of aconitase domain 4, though one alpha helix is replaced by a short loop with relatively high temperature factor values. This loop region is thought to be important for substrate recognition. Unlike other aconitase family proteins, this subunit formed a tetramer through disulphide linkages, though it is not expected to interfere with its interaction with the large subunit. These disulphide linkages would be expected to confer thermostability on the enzyme, reflecting the thermophilic lifestyle of the organism.; GO: 0003861 3-isopropylmalate dehydratase activity, 0009098 leucine biosynthetic process, 0009316 3-isopropylmalate dehydratase complex. Probab=28.04 E-value=45 Score=14.43 Aligned_cols=68 Identities=24% Similarity=0.394 Sum_probs=43.3 Q ss_pred CCCCCCCCCCCCC-HHHHHHHHHHHHHHCCCCCC-CEEEEECC-----CCCHHHHHHHHHHHHCCCCCCCEEEEEHHHH Q ss_conf 5565764443589-88998777999850287438-62586058-----8889999998999862789980599851078 Q gi|254780652|r 220 NMILPGKSAPQVS-DEEIASNTIKIFKRVVPCAV-QGIAFLSG-----GQSEQDATARLSAMNSMGDCPWRLSFSYGRA 291 (339) Q Consensus 220 nMv~pG~~~~~~~-~eevA~~T~~~l~~~vP~~V-pgIvFLSG-----Gqs~eeAt~~LnaiNk~~~~pW~lsfSygRA 291 (339) -|++||..-...+ +++.|.|-..-++---|..| +|.|---| |-|+|+|-.-|. ..+- -=-+-=||+|- T Consensus 13 D~IIPGrYLR~~~D~~eLA~haM~g~~PeF~~kvr~GDviVAGkNFGcGSSREQA~~ALK---~aGv-~aviA~SFARI 87 (159) T TIGR02087 13 DVIIPGRYLRTTDDPDELASHAMEGIDPEFAKKVREGDVIVAGKNFGCGSSREQAALALK---AAGV-AAVIAESFARI 87 (159) T ss_pred CEECCCCCCCCCCCHHHHHHHCCCCCCCCHHHHCCCCCEEEECCCCCCCCCHHHHHHHHH---HCCE-EEEEEEECCCH T ss_conf 704585225754777889987055899631340789978871416477887788999997---4790-48985201660 No 46 >TIGR01536 asn_synth_AEB asparagine synthase (glutamine-hydrolyzing); InterPro: IPR006426 These sequences represent glutamine-hydrolysing asparagine synthase. The group have a poorly conserved C-terminal extension while bacterial members of the family tend to have a long, poorly conserved insert lacking from archaeal and eukaryotic sequences. Multiple isozymes have been demonstrated, such as in Bacillus subtilis .; GO: 0004066 asparagine synthase (glutamine-hydrolyzing) activity, 0006529 asparagine biosynthetic process. Probab=27.70 E-value=46 Score=14.39 Aligned_cols=182 Identities=23% Similarity=0.288 Sum_probs=93.6 Q ss_pred CCHHHHHHHHHH--HEECCCCHHEEEEEECCHHHHHHC--CCCC-CCHHHHHHHCCCEEEEEECC------------CEE Q ss_conf 986788754564--100778023046434677884100--0488-74679899579878994278------------542 Q gi|254780652|r 40 STENSRRDYREM--LFRAKDAMQYISSILLYEETLYQN--AQDG-TSFVDLISSSGVLVGIKVDR------------GLK 102 (339) Q Consensus 40 nteenr~~~R~~--l~tsp~~~~~IsGvILfeeTl~q~--~~~g-~~~~~~l~~kGIvpgIKVDk------------Gl~ 102 (339) |.+|-|+.+++. =|+|..=++=|+- +|++.+ +. .-|| ..+++.| .|+=.|.=.|+ |.+ T Consensus 105 N~~eLR~~L~~~G~~F~T~SDtEViL~--~Y~~wf-r~~~~~~g~e~~~~~L--~GmFAFa~~D~~~~~l~~aRDrfGiK 179 (646) T TIGR01536 105 NHEELREELEAKGYTFETRSDTEVILH--AYEEWF-RNLDLEDGIEECVDRL--DGMFAFALWDKKKGELFLARDRFGIK 179 (646) T ss_pred CHHHHHHHHHHCCCEECCCCCHHHHHH--HHHHHH-CCCCHHHHHHHHHHHH--CCCEEEEEEECCCCEEEEEECCCCCC T ss_conf 558999999867942616884899999--999973-2788637899999973--89789999974777189984788623 Q ss_pred ECC-CCC-CCCCCCCHHHHHHHHHHHHHCCCEEECCEECCCCCC--CCCHHHHHHHHHHHHHHHHHHHHHCCCCC--EEC Q ss_conf 178-999-862157835799999985433955501000233677--87408888998999999999998447872--011 Q gi|254780652|r 103 PYP-LYP-GEHITVGLDDLDVRLKQYKDAGARVAKWRVVLSVSD--VLPSMTVVKANMHILARYAVLCQNAGLVP--IVE 176 (339) Q Consensus 103 ~l~-g~~-~e~~t~GLD~L~eR~~~y~~~Ga~faKwRaVi~i~~--~~Ps~~~I~~na~~larYA~i~Q~~GLVP--IVE 176 (339) ||- +.. +..+ ++.--.-|.+...+ ..+....- +.+.+--++++ -.+ T Consensus 180 PLYy~~~~~g~~------------------~F~SE~Kal~~~~~~~~~~~~~~~----------~~l~~~~~~~~~~~~~ 231 (646) T TIGR01536 180 PLYYAYDEGGQL------------------YFASEIKALLAHPREENIKPEPDG----------AALADYLGFQFSLVPT 231 (646) T ss_pred CCEEEEECCCEE------------------EEEEHHHHHHCCCCCCCCCCCCCH----------HHHCCCEEEEEECCCC T ss_conf 520134428626------------------873104454227875676545771----------1303120355304688 Q ss_pred HHHCCCCCCCHHHHHHHHHH-------HHHHHHHHHHHCCCCCCCCCCCCCCCCCC------CCCCCCCHHHHHHHHHHH Q ss_conf 01103775798999999999-------99999999876385311100002556576------444358988998777999 Q gi|254780652|r 177 PEVLMEGDHTIERCAEVTEF-------VLRTLFNELYTMRVCFEGLVLKSNMILPG------KSAPQVSDEEIASNTIKI 243 (339) Q Consensus 177 PEVl~dG~H~ie~~~~vte~-------vL~~lf~~L~~~~V~leg~lLKpnMv~pG------~~~~~~~~eevA~~T~~~ 243 (339) |+....|=+++..+...|-+ ....+.... .-|.+ +++.-++ .++.+..++++=..-..+ T Consensus 232 ~~~~~~~~~~~~p~~~~t~~~df~~~~~~~~~~~~~---~~YW~-----~~~~~~~a~~g~~~~~~~~~~~~lR~ll~~a 303 (646) T TIGR01536 232 PKTIFAGVFKLEPGHALTVKADFFELDGLEKLERKI---RRYWE-----VERDEPEALKGEHADSEEDLVEELRSLLEDA 303 (646) T ss_pred CCCCCCCCHHCCCCCCCCCCCEEEEECCHHHHHHHH---HHHHC-----CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 762146400102200013320013106424577888---87504-----5554555667877777578899999999999 Q ss_pred HHHCCCCCCCEEEEECCCC Q ss_conf 8502874386258605888 Q gi|254780652|r 244 FKRVVPCAVQGIAFLSGGQ 262 (339) Q Consensus 244 l~~~vP~~VpgIvFLSGGq 262 (339) .+.++-+.||==+|||||- T Consensus 304 V~~rl~aDvpvG~~LSGGl 322 (646) T TIGR01536 304 VKRRLVADVPVGVLLSGGL 322 (646) T ss_pred HHHHCCCCCCEEEEEECHH T ss_conf 8853200560688871716 No 47 >TIGR02907 spore_VI_D stage VI sporulation protein D; InterPro: IPR014256 This entry represents SpoVID, the stage VI sporulation protein D, which is restricted to endospore-forming bacteria, all of which are found among the Firmicutes. It is widely distributed but not quite universal in this group. Between well-conserved N-terminal and C-terminal domains is a poorly conserved, low-complexity region of variable length, rich in glutamic acid. SpoVID is involved in spore coat assembly in the mother cell compartment late in the process of sporulation.. Probab=26.96 E-value=23 Score=16.36 Aligned_cols=13 Identities=15% Similarity=0.376 Sum_probs=8.7 Q ss_pred EEECCE-ECCCCCC Q ss_conf 550100-0233677 Q gi|254780652|r 132 RVAKWR-VVLSVSD 144 (339) Q Consensus 132 ~faKwR-aVi~i~~ 144 (339) -||||| |+|.-++ T Consensus 321 ~FTKlk~civQe~d 334 (371) T TIGR02907 321 EFTKLKMCIVQEGD 334 (371) T ss_pred CCCCEEEEEEECCC T ss_conf 10003888985288 No 48 >pfam00218 IGPS Indole-3-glycerol phosphate synthase. Probab=26.76 E-value=31 Score=15.52 Aligned_cols=83 Identities=24% Similarity=0.282 Sum_probs=46.1 Q ss_pred HHHHHHHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 99999999984478720110110377579899999999999999999876385311100002556576444358988998 Q gi|254780652|r 158 ILARYAVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKSNMILPGKSAPQVSDEEIA 237 (339) Q Consensus 158 ~larYA~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~~~~~~~eevA 237 (339) +|..+...+.+-||-|+||= |+.+. + +++ + +.+-.+-|..+-.-..-+|- T Consensus 146 ~l~~l~~~a~~lgl~~LvEv-------h~~~E---l-~~a-------l------------~~~a~iIGINNRnL~tf~vd 195 (254) T pfam00218 146 LLEELYEYARSLGMEPLVEV-------HNEEE---L-ERA-------L------------ALGAKLIGVNNRNLKTFEVD 195 (254) T ss_pred HHHHHHHHHHHHCCEEEEEE-------CCHHH---H-HHH-------H------------HCCCCEEEECCCCHHHHHCC T ss_conf 99999999998488679886-------89999---9-999-------8------------48997896327884651005 Q ss_pred HHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHH Q ss_conf 7779998502874386258605888899999989 Q gi|254780652|r 238 SNTIKIFKRVVPCAVQGIAFLSGGQSEQDATARL 271 (339) Q Consensus 238 ~~T~~~l~~~vP~~VpgIvFLSGGqs~eeAt~~L 271 (339) ..|..-|...+|..+. ++.-||=.+.++..... T Consensus 196 ~~~t~~L~~~ip~~~~-~VsESGI~~~~di~~l~ 228 (254) T pfam00218 196 LNTTRRLAPMVPEDVL-LVAESGISTPEDVEKLA 228 (254) T ss_pred HHHHHHHHHHCCCCCE-EEECCCCCCHHHHHHHH T ss_conf 7999999955898987-99838999999999999 No 49 >KOG3153 consensus Probab=26.40 E-value=23 Score=16.35 Aligned_cols=119 Identities=15% Similarity=0.055 Sum_probs=62.8 Q ss_pred HHHCCCCCCCHH-HHHHHCCCEEEEEECCCEEECC-------CCCCCCCCCCHHHHHHHHHHHHHCCCEEECCEE-CCCC Q ss_conf 410004887467-9899579878994278542178-------999862157835799999985433955501000-2336 Q gi|254780652|r 72 LYQNAQDGTSFV-DLISSSGVLVGIKVDRGLKPYP-------LYPGEHITVGLDDLDVRLKQYKDAGARVAKWRV-VLSV 142 (339) Q Consensus 72 l~q~~~~g~~~~-~~l~~kGIvpgIKVDkGl~~l~-------g~~~e~~t~GLD~L~eR~~~y~~~Ga~faKwRa-Vi~i 142 (339) ++|+..--.++. -+|+...+ -+=+|.|.-.+- -.....++--+|.+.+.-..|| +||-. ||+- T Consensus 29 LN~~iq~p~~~f~~LWk~A~l--RvcaDGgaNrlyd~~~~~~~~~Pd~I~GDfDSi~~ev~~yy------~~~g~~vV~~ 100 (250) T KOG3153 29 LNQEIQIPDNRFRLLWKKAKL--RVCADGGANRLYDYLSDRTSEKPDYICGDFDSITEEVDDYY------KKNGVTVVHT 100 (250) T ss_pred ECCCCCCCHHHHHHHHHHHHE--EEECCCCHHHHHHHCCCCCCCCCCEEECCHHHHHHHHHHHH------HHCCCEEEEC T ss_conf 678887853789998764006--57516743344230354434487656536023109999999------8529736757 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 778740888899899999999999844787201101103775798999999999999999998763 Q gi|254780652|r 143 SDVLPSMTVVKANMHILARYAVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTM 208 (339) Q Consensus 143 ~~~~Ps~~~I~~na~~larYA~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~ 208 (339) |.+. ...+-+--+.||+.+.-|=.+-+++. --+.+.--+.-+-.-|..+|++.+.+ T Consensus 101 ----pdQd-----~TDftKcv~~i~~~~~~~e~~~~~Iv-vlgglgGRfDq~m~nln~Ly~~~~~~ 156 (250) T KOG3153 101 ----PDQD-----TTDFTKCVKWIQEHKTLTEWKFLNIV-VLGGLGGRFDQTMANLNTLYRAVDIQ 156 (250) T ss_pred ----CCCC-----CCHHHHHHHHHHHHCCCCCCEEEEEE-EECCCCCCHHHHHHHHHHHHEHHCCC T ss_conf ----8767-----21389999999886236442044589-96466762777777677661210356 No 50 >KOG0225 consensus Probab=25.94 E-value=32 Score=15.44 Aligned_cols=18 Identities=22% Similarity=0.357 Sum_probs=11.4 Q ss_pred HHHHHHHCCCEEEEEECC Q ss_conf 679899579878994278 Q gi|254780652|r 82 FVDLISSSGVLVGIKVDR 99 (339) Q Consensus 82 ~~~~l~~kGIvpgIKVDk 99 (339) .+++++.-+-|||||||. T Consensus 239 ~teyykRG~yiPGl~VdG 256 (394) T KOG0225 239 STEYYKRGDYIPGLKVDG 256 (394) T ss_pred CHHHHHCCCCCCCEEECC T ss_conf 827775068787557878 No 51 >pfam03932 CutC CutC family. Copper transport in Escherichia coli is mediated by the products of at least six genes, cutA, cutB, cutC, cutD, cutE, and cutF. A mutation in one or more of these genes results in an increased copper sensitivity. Members of this family are between 200 and 300 amino acids in length are found in both eukaryotes and bacteria. Probab=25.86 E-value=49 Score=14.17 Aligned_cols=22 Identities=18% Similarity=0.468 Sum_probs=9.4 Q ss_pred CCCEEEE--E-CCCCCHHHHHHHHH Q ss_conf 3862586--0-58888999999899 Q gi|254780652|r 251 AVQGIAF--L-SGGQSEQDATARLS 272 (339) Q Consensus 251 ~VpgIvF--L-SGGqs~eeAt~~Ln 272 (339) .+.|||| | +-|+=+.+++.+|- T Consensus 85 G~~GvV~G~L~~d~~iD~~~~~~li 109 (202) T pfam03932 85 GAPGVVLGALTADGEIDTKRMEKLI 109 (202) T ss_pred CCCEEEEEEECCCCCCCHHHHHHHH T ss_conf 9897899888899982999999999 No 52 >cd03272 ABC_SMC3_euk Eukaryotic SMC3 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (fo Probab=25.69 E-value=20 Score=16.71 Aligned_cols=40 Identities=33% Similarity=0.310 Sum_probs=30.8 Q ss_pred CEEEEECCCCCHHHHHHHHHHHHCCCCCCCEEEEEHHHHH Q ss_conf 6258605888899999989998627899805998510786 Q gi|254780652|r 253 QGIAFLSGGQSEQDATARLSAMNSMGDCPWRLSFSYGRAL 292 (339) Q Consensus 253 pgIvFLSGGqs~eeAt~~LnaiNk~~~~pW~lsfSygRAL 292 (339) -.|.=|||||-.--|.+.+=||+++.+.|+-|==-.--|| T Consensus 154 r~i~~~~~~~k~~~alalifai~~~~PaPfyllDEidaaL 193 (243) T cd03272 154 QEMQQLSGGQKSLVALALIFAIQKCDPAPFYLFDEIDAAL 193 (243) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCC T ss_conf 9999971346779999999998455998378871344015 No 53 >KOG2311 consensus Probab=24.67 E-value=52 Score=14.04 Aligned_cols=102 Identities=22% Similarity=0.330 Sum_probs=65.0 Q ss_pred CCCEEEEE--CCCCCCHHHHHHHCCCCCCHHHH---------------HHH----H-----------------HHHEECC Q ss_conf 79909991--46887757878857899986788---------------754----5-----------------6410077 Q gi|254780652|r 15 AGKGILAA--DESNATIQKRFKAIHLESTENSR---------------RDY----R-----------------EMLFRAK 56 (339) Q Consensus 15 ~gKGilAa--DeS~gt~~Krl~~igventeenr---------------~~~----R-----------------~~l~tsp 56 (339) .|||.|-- |-=+|-|+|--+.-||..---|| +.| . .+++..| T Consensus 76 igKg~LmrEVDALdGl~~rvcD~s~vq~k~LNrs~GPAVwg~RAQiDR~lYkk~MQkei~st~nL~ire~~V~dliv~~~ 155 (679) T KOG2311 76 IGKGHLMREVDALDGLCSRVCDQSGVQYKVLNRSKGPAVWGLRAQIDRKLYKKNMQKEISSTPNLEIREGAVADLIVEDP 155 (679) T ss_pred CCCCEEEEEEHHHCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHEEECCC T ss_conf 66530244312002567654115555687750257972237087651999999999875268760012200133363168 Q ss_pred CCH-HEEEEEECCHHHHHHCCCCCCCHHHHHHHCCCEEEEEECCCEEECC-CCCCCCCCCCHHHHHHHH Q ss_conf 802-3046434677884100048874679899579878994278542178-999862157835799999 Q gi|254780652|r 57 DAM-QYISSILLYEETLYQNAQDGTSFVDLISSSGVLVGIKVDRGLKPYP-LYPGEHITVGLDDLDVRL 123 (339) Q Consensus 57 ~~~-~~IsGvILfeeTl~q~~~~g~~~~~~l~~kGIvpgIKVDkGl~~l~-g~~~e~~t~GLD~L~eR~ 123 (339) +-+ ..|.||.|++.|.-.. ..++-.-|.--.=++.-|++-.| |--||+.+.||-.-.+++ T Consensus 156 ~~~~~~~~gV~l~dgt~v~a-------~~VilTTGTFL~~~I~iGlk~~pAGRiGe~ps~~Lse~l~kl 217 (679) T KOG2311 156 DDGHCVVSGVVLVDGTVVYA-------ESVILTTGTFLRGQINIGLKTHPAGRIGEQPSIGLSETLQKL 217 (679) T ss_pred CCCCEEEEEEEEECCCEECC-------CEEEEEECCCEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 88734777899755727314-------349996132122378504311447444677435788999972 No 54 >TIGR00088 trmD tRNA (guanine-N1)-methyltransferase; InterPro: IPR002649 In transfer RNA many different modified nucleosides are found, especially in the anticodon region. tRNA (guanine-N1-)-methyltransferase 2.1.1.31 from EC is one of several nucleases operating together with the tRNA-modifying enzymes before the formation of the mature tRNA. It catalyses the reaction:S-adenosyl-L-methionine + tRNA -> S-adenosyl-L-homocysteine + tRNA containing N1-methylguanine methylating guanosine(G) to N1-methylguanine (1-methylguanosine (m1G)) at position 37 of tRNAs that read CUN (leucine), CCN(proline), and CGG (arginine) codons. The presence of m1G improves the cellular growth rate and the polypeptide steptime and also prevents the tRNA from shifting the reading frame . The mechanism of the trmD3-induced frameshift involving mutant tRNA(Pro) and tRNA(Leu) species has been investigated . It has been suggested that the conformation of the anticodon loop may be a major determining element for the formation of m1G37 in vivo . ; GO: 0003723 RNA binding, 0008175 tRNA methyltransferase activity, 0008033 tRNA processing. Probab=24.63 E-value=37 Score=14.99 Aligned_cols=125 Identities=18% Similarity=0.186 Sum_probs=53.3 Q ss_pred ECCHHHHHHCCCCCCCHHHHHHHCCCEEEEEECC---------CEEE--CCCCCCCCCC-CCHHHHHHHHHHHHHCCCEE Q ss_conf 4677884100048874679899579878994278---------5421--7899986215-78357999999854339555 Q gi|254780652|r 66 LLYEETLYQNAQDGTSFVDLISSSGVLVGIKVDR---------GLKP--YPLYPGEHIT-VGLDDLDVRLKQYKDAGARV 133 (339) Q Consensus 66 ILfeeTl~q~~~~g~~~~~~l~~kGIvpgIKVDk---------Gl~~--l~g~~~e~~t-~GLD~L~eR~~~y~~~Ga~f 133 (339) =||.+++.--..+|.. .=-.+++++-+==|+- -+.+ .-|..||... |++-+=.+.++.....||+ T Consensus 7 tLFP~~~~~~~~~sil--~~A~~k~l~~~e~~n~RdFs~dKH~tvDD~pYGGG~GMvLk~ePi~~Al~~~~~~~~~~~~- 83 (247) T TIGR00088 7 TLFPEMFKPYLESSIL--KKALEKNLVSFEVVNPRDFSKDKHKTVDDRPYGGGAGMVLKPEPIFDALESVKAAKKEGAK- 83 (247) T ss_pred ECCHHHHHHHHCCCHH--HHHHHCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCE- T ss_conf 2787773312032158--9997547056675322000356345256753378976504867899999852104325887- Q ss_pred ECCEECCCCC-CCCCHHHHHHHHHHHHHHHHHHHHHCC--------CCC-EECHHHCCCCCCCHHHHHHHHHH-HHHHHH Q ss_conf 0100023367-787408888998999999999998447--------872-01101103775798999999999-999999 Q gi|254780652|r 134 AKWRVVLSVS-DVLPSMTVVKANMHILARYAVLCQNAG--------LVP-IVEPEVLMEGDHTIERCAEVTEF-VLRTLF 202 (339) Q Consensus 134 aKwRaVi~i~-~~~Ps~~~I~~na~~larYA~i~Q~~G--------LVP-IVEPEVl~dG~H~ie~~~~vte~-vL~~lf 202 (339) ||-.+ .|.+-.. +-+..||.+-.++-=|| +.- -|.-||.| ||.=+- |=|+-.- +...|. T Consensus 84 -----villsP~G~~~~Q---~~~~~La~~~~lvLiCGrYEG~DeR~~~~~vD~e~Si-GDfVLt-GGEL~AL~lIDsv~ 153 (247) T TIGR00088 84 -----VILLSPQGEKFDQ---ALARELAKKEHLVLICGRYEGFDERIIELLVDEEISI-GDFVLT-GGELPALTLIDSVV 153 (247) T ss_pred -----EEEECCCCCCCCH---HHHHHHHHCCCEEEEECCCCCCHHHHHHHHCCEEEEC-CCEEEC-HHHHHHHHHHHHHH T ss_conf -----9997889964266---8999874128478995650360266453207867983-662001-04799999998898 Q ss_pred H Q ss_conf 9 Q gi|254780652|r 203 N 203 (339) Q Consensus 203 ~ 203 (339) + T Consensus 154 R 154 (247) T TIGR00088 154 R 154 (247) T ss_pred H T ss_conf 7 No 55 >pfam07395 Mig-14 Mig-14. This family contains a number of bacterial mig-14 proteins (approximately 270 residues long). In Salmonella, mig-14 contributes to resistance to antimicrobial peptides, although the mechanism is not fully understood. Probab=24.43 E-value=49 Score=14.20 Aligned_cols=24 Identities=38% Similarity=0.397 Sum_probs=16.4 Q ss_pred CHHHHHHHHHHHHCCCCCCCEEEEEHHHHH Q ss_conf 899999989998627899805998510786 Q gi|254780652|r 263 SEQDATARLSAMNSMGDCPWRLSFSYGRAL 292 (339) Q Consensus 263 s~eeAt~~LnaiNk~~~~pW~lsfSygRAL 292 (339) ....|.+.-.+.|| .|-|||||+= T Consensus 225 N~~~A~~~~~~~~K------~lrfSfGr~~ 248 (264) T pfam07395 225 NTQAAWEDARARGK------PLRFSFGRAD 248 (264) T ss_pred CHHHHHHHHHHHCC------CEEEEECCCC T ss_conf 17999999997299------2699807897 No 56 >KOG0416 consensus Probab=24.15 E-value=48 Score=14.26 Aligned_cols=73 Identities=19% Similarity=0.311 Sum_probs=54.2 Q ss_pred ECCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC-CEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 010002336778740888899899999999999844787-2011011037757989999999999999999987638531 Q gi|254780652|r 134 AKWRVVLSVSDVLPSMTVVKANMHILARYAVLCQNAGLV-PIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCF 212 (339) Q Consensus 134 aKwRaVi~i~~~~Ps~~~I~~na~~larYA~i~Q~~GLV-PIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~l 212 (339) ..||--+...+..|=... .-|+| -|..|-|..- +=-.|..|-.++|..+|+..+-...+| T Consensus 47 GvWkv~V~lPd~YP~KSP----------------SIGFvnKIfHPNIDe~---SGsVCLDViNQtWSp~yDL~NIfetfL 107 (189) T KOG0416 47 GVWKVRVELPDNYPFKSP----------------SIGFVNKIFHPNIDEA---SGSVCLDVINQTWSPLYDLVNIFETFL 107 (189) T ss_pred CEEEEEEECCCCCCCCCC----------------CCCCEEECCCCCCHHH---CCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 668998878877887798----------------5320310237880230---374279988624168888999999985 Q ss_pred CCCCCCCCCCCCC Q ss_conf 1100002556576 Q gi|254780652|r 213 EGLVLKSNMILPG 225 (339) Q Consensus 213 eg~lLKpnMv~pG 225 (339) .+.|+-||..-|= T Consensus 108 PQLL~YPNp~DPL 120 (189) T KOG0416 108 PQLLRYPNPSDPL 120 (189) T ss_pred HHHHCCCCCCCCC T ss_conf 9986389988876 No 57 >pfam04577 DUF563 Protein of unknown function (DUF563). Family of uncharacterized proteins. Probab=24.04 E-value=50 Score=14.15 Aligned_cols=12 Identities=8% Similarity=0.180 Sum_probs=4.6 Q ss_pred HHHHHHHHHHCC Q ss_conf 999999987638 Q gi|254780652|r 198 LRTLFNELYTMR 209 (339) Q Consensus 198 L~~lf~~L~~~~ 209 (339) ...+++.|.+|. T Consensus 132 y~~il~~LS~y~ 143 (246) T pfam04577 132 YGDVLRQLSDYD 143 (246) T ss_pred HHHHHHHHCCCC T ss_conf 999999856798 No 58 >COG5400 Uncharacterized protein conserved in bacteria [Function unknown] Probab=24.03 E-value=42 Score=14.60 Aligned_cols=102 Identities=18% Similarity=0.263 Sum_probs=59.7 Q ss_pred HHHHHHHHCCCEEEEE-CCCC-CCHHHHHHHCCCCCCHHHHHHHHHHHEECCCCHHEEEEEECCHHHHHHCCCCCCCHHH Q ss_conf 9999997479909991-4688-7757878857899986788754564100778023046434677884100048874679 Q gi|254780652|r 7 NSVAMMLQAGKGILAA-DESN-ATIQKRFKAIHLESTENSRRDYREMLFRAKDAMQYISSILLYEETLYQNAQDGTSFVD 84 (339) Q Consensus 7 ~~a~~~~~~gKGilAa-DeS~-gt~~Krl~~igventeenr~~~R~~l~tsp~~~~~IsGvILfeeTl~q~~~~g~~~~~ 84 (339) -+|+.|+..|.||+-. -++- .-+.|.|++||++|-. |.--.+-+-+|.|.--=|-|+|-|-.. .. .- T Consensus 48 YtaqEivdagh~FfG~Tsggla~vvEkaF~~yGlPnGY---------ilGeEGSGAfIaGltYGeG~LytKn~g-~h-~v 116 (205) T COG5400 48 YTAQEIVDAGHGFFGETSGGLAKVVEKAFQSYGLPNGY---------ILGEEGSGAFIAGLTYGEGTLYTKNAG-DH-KV 116 (205) T ss_pred CCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCE---------EECCCCCCCEEEEEEECCCEEEECCCC-CC-CE T ss_conf 00999987324520235503899999999964898733---------742556662575455235267744888-75-23 Q ss_pred HHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 899579878994278542178999862157835799999985 Q gi|254780652|r 85 LISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQY 126 (339) Q Consensus 85 ~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y 126 (339) +|.- |-|--|=|= .|+--+..+-.||+...--..| T Consensus 117 FWQG----PslGwD~GG---qgsRvmmLvYnL~~v~aly~Ry 151 (205) T COG5400 117 FWQG----PSLGWDWGG---QGSRVMMLVYNLDDVDALYRRY 151 (205) T ss_pred EEEC----CCCCCCCCC---CCEEEEEEEECCCCHHHHHHHC T ss_conf 7407----854643378---7407999994487778998645 No 59 >pfam02463 SMC_N RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination. Probab=24.00 E-value=48 Score=14.22 Aligned_cols=31 Identities=29% Similarity=0.355 Sum_probs=23.6 Q ss_pred EEEECCCCCHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 5860588889999998999862789980599 Q gi|254780652|r 255 IAFLSGGQSEQDATARLSAMNSMGDCPWRLS 285 (339) Q Consensus 255 IvFLSGGqs~eeAt~~LnaiNk~~~~pW~ls 285 (339) +--||||+-.--|.+-|=||.++.|.||-+= T Consensus 1076 l~~LSGGEKtL~ALaLlFAi~~~~PaPFyiL 1106 (1162) T pfam02463 1076 LDNLSGGEKTLVALALIFAIQKYRPAPFYLL 1106 (1162) T ss_pred HHHHCHHHHHHHHHHHHHHHHHCCCCCEEEE T ss_conf 6884878899999999999986289837875 No 60 >pfam10058 DUF2296 Predicted integral membrane metal-binding protein (DUF2296). This domain, found in various hypothetical bacterial and eukaryotic metal-binding proteins, has no known function. Probab=23.83 E-value=13 Score=17.90 Aligned_cols=20 Identities=45% Similarity=0.786 Sum_probs=15.7 Q ss_pred HHHHHHHHH----CCCCCEECHHH Q ss_conf 999999984----47872011011 Q gi|254780652|r 160 ARYAVLCQN----AGLVPIVEPEV 179 (339) Q Consensus 160 arYA~i~Q~----~GLVPIVEPEV 179 (339) -|||.||.+ +|++|-.|+|- T Consensus 16 ~r~aLIC~~C~~hNG~a~~~~~~~ 39 (51) T pfam10058 16 NRYALICSKCHSHNGLAPKEEFEE 39 (51) T ss_pred CCEEEECHHHHHHCCCCCCCCCHH T ss_conf 852148523453288678886021 No 61 >pfam04015 DUF362 Domain of unknown function (DUF362). Sometimes present in iron-sulphur proteins. Probab=23.73 E-value=54 Score=13.92 Aligned_cols=21 Identities=14% Similarity=0.134 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHHCCCCCEEC Q ss_conf 8999999999998447872011 Q gi|254780652|r 155 NMHILARYAVLCQNAGLVPIVE 176 (339) Q Consensus 155 na~~larYA~i~Q~~GLVPIVE 176 (339) .+....-|..+|++.|. ++++ T Consensus 88 ~v~~~sGi~~va~~~gv-~~v~ 108 (262) T pfam04015 88 AVLKTMGILDLARELDI-TVVE 108 (262) T ss_pred HHHHHHCHHHHHHHCCC-EEEE T ss_conf 99999799999997499-7985 No 62 >pfam03140 DUF247 Plant protein of unknown function. The function of the plant proteins constituting this family is unknown. Probab=23.59 E-value=10 Score=18.61 Aligned_cols=17 Identities=18% Similarity=0.362 Sum_probs=12.8 Q ss_pred HHHHHHHHHHCCCEEEC Q ss_conf 99999985433955501 Q gi|254780652|r 119 LDVRLKQYKDAGARVAK 135 (339) Q Consensus 119 L~eR~~~y~~~Ga~faK 135 (339) .--++.+++..|.+|-| T Consensus 223 ~~~sA~eL~~aGVkFk~ 239 (390) T pfam03140 223 LIPSATELREAGVKFKR 239 (390) T ss_pred CCCCHHHHHHCCCEEEE T ss_conf 68689999971968876 No 63 >pfam02329 HDC Histidine carboxylase PI chain. Histidine carboxylase catalyses the formation of histamine from histidine. Cleavage of the proenzyme PI chain yields two subunits, alpha and beta, which arrange as a hexamer (alpha beta)6. Probab=23.56 E-value=23 Score=16.30 Aligned_cols=48 Identities=19% Similarity=0.250 Sum_probs=32.7 Q ss_pred CCCCCHHHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 4887467989957987899427854217899986215783579999998543395 Q gi|254780652|r 77 QDGTSFVDLISSSGVLVGIKVDRGLKPYPLYPGEHITVGLDDLDVRLKQYKDAGA 131 (339) Q Consensus 77 ~~g~~~~~~l~~kGIvpgIKVDkGl~~l~g~~~e~~t~GLD~L~eR~~~y~~~Ga 131 (339) .+|.-.|--. -.|-|.+|||+.|+++.- +++ .|||..+|...|...-| T Consensus 26 c~GY~~PGa~-GnGYVs~lKvs~Gtvdkt--dd~----~lDg~~~~IVsyDRaE~ 73 (306) T pfam02329 26 CRGYMNPGAI-GNGYVTTLKLSVGTVDKT--DDD----VLDAITERIVSYDRCET 73 (306) T ss_pred HHCCCCCCCC-CCCEEEEEEEECCEEECC--CCH----HHCCCCCCEEEECHHHC T ss_conf 0124898653-672589999732426537--632----21354123256531230 No 64 >COG4598 HisP ABC-type histidine transport system, ATPase component [Amino acid transport and metabolism] Probab=23.53 E-value=36 Score=15.09 Aligned_cols=20 Identities=55% Similarity=0.635 Sum_probs=13.2 Q ss_pred EEECCCCCHHHHHHHHHHHH Q ss_conf 86058888999999899986 Q gi|254780652|r 256 AFLSGGQSEQDATARLSAMN 275 (339) Q Consensus 256 vFLSGGqs~eeAt~~LnaiN 275 (339) ++|||||-+--|.++--||+ T Consensus 151 ~~LSGGQQQR~aIARaLame 170 (256) T COG4598 151 AHLSGGQQQRVAIARALAME 170 (256) T ss_pred CCCCCHHHHHHHHHHHHHCC T ss_conf 11474178889999987359 No 65 >COG4909 PduC Propanediol dehydratase, large subunit [Secondary metabolites biosynthesis, transport, and catabolism] Probab=23.35 E-value=53 Score=13.96 Aligned_cols=39 Identities=33% Similarity=0.495 Sum_probs=21.4 Q ss_pred EEEECCHHHHHHCCCCCCCHHHHH-HHCCCEEEEEECCCEEECC Q ss_conf 643467788410004887467989-9579878994278542178 Q gi|254780652|r 63 SSILLYEETLYQNAQDGTSFVDLI-SSSGVLVGIKVDRGLKPYP 105 (339) Q Consensus 63 sGvILfeeTl~q~~~~g~~~~~~l-~~kGIvpgIKVDkGl~~l~ 105 (339) |.+.-|+..+..+..|+.-|.||+ -.+ -+|||.||.|++ T Consensus 365 sa~pnydnmfagsn~daedfddynvlqr----dlkvdggltpv~ 404 (554) T COG4909 365 SAIPNYDNMFAGSNIDAEDFDDYNVLQR----DLKVDGGLTPVP 404 (554) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHH----HCCCCCCCCCCC T ss_conf 4565753335766688555456788876----401478976586 No 66 >PRK00026 trmD tRNA (guanine-N(1)-)-methyltransferase; Reviewed Probab=23.25 E-value=55 Score=13.86 Aligned_cols=13 Identities=23% Similarity=0.304 Sum_probs=7.2 Q ss_pred EECCCCCCCCCCC Q ss_conf 2178999862157 Q gi|254780652|r 102 KPYPLYPGEHITV 114 (339) Q Consensus 102 ~~l~g~~~e~~t~ 114 (339) .|..|.+||.... T Consensus 52 ~PyGGG~GMVm~~ 64 (226) T PRK00026 52 RPYGGGAGMVMKP 64 (226) T ss_pred CCCCCCCCCEEEH T ss_conf 7578998217630 No 67 >TIGR02177 PorB_KorB 2-oxoacid:acceptor oxidoreductase, beta subunit, pyruvate/2-ketoisovalerate family; InterPro: IPR011896 A number of anaerobic and microaerophilic species lack pyruvate dehydrogenase and have instead a four subunit, oxygen-sensitive pyruvate oxidoreductase, with either ferredoxins or flavodoxins used as the acceptor. Several related four-subunit enzymes may exist in the same species. This entry describes the beta subunits, representing mostly pyruvate and 2-ketoisovalerate specific enzymes.. Probab=23.14 E-value=34 Score=15.23 Aligned_cols=16 Identities=25% Similarity=0.206 Sum_probs=5.8 Q ss_pred HHHHHHHHHHCCCCCC Q ss_conf 9999999876385311 Q gi|254780652|r 198 LRTLFNELYTMRVCFE 213 (339) Q Consensus 198 L~~lf~~L~~~~V~le 213 (339) |.++.++|.+.+++=+ T Consensus 13 L~AL~~AL~EL~LdP~ 28 (302) T TIGR02177 13 LSALQKALAELNLDPE 28 (302) T ss_pred HHHHHHHHHHHCCCCC T ss_conf 9999999986288985 No 68 >KOG2229 consensus Probab=23.09 E-value=56 Score=13.84 Aligned_cols=84 Identities=18% Similarity=0.213 Sum_probs=60.5 Q ss_pred CCCHHHHHHHHHHHHHHCCCCCCCEE-------EEECCCCCHHHHHHHHHHHHCC-CCCCCEEEEEHHHHHHHHHHHHHC Q ss_conf 58988998777999850287438625-------8605888899999989998627-899805998510786378898818 Q gi|254780652|r 230 QVSDEEIASNTIKIFKRVVPCAVQGI-------AFLSGGQSEQDATARLSAMNSM-GDCPWRLSFSYGRALQESVLRAWN 301 (339) Q Consensus 230 ~~~~eevA~~T~~~l~~~vP~~VpgI-------vFLSGGqs~eeAt~~LnaiNk~-~~~pW~lsfSygRALQ~~~Lk~W~ 301 (339) +.+.-+|-..-..+-++.||+++.-. .|.+-+-|.|+.+.-+|++-.+ ...|-.|+= --||.-+..- + T Consensus 288 qrDvTqIl~~aaQa~Hd~VP~d~iEpl~k~Ian~FVtD~~spEv~~vgiN~iREic~RvplaMte---eLLqDLa~YK-~ 363 (616) T KOG2229 288 QRDVTQILAAAAQASHDLVPPDIIEPLLKTIANNFVTDENSPEVMAVGINAIREICERVPLAMTE---ELLQDLASYK-T 363 (616) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHHHCCHHHCH---HHHHHHHHHH-H T ss_conf 34699999999986657787677658999999875236778530001236899998625152057---8999999986-7 Q ss_pred CCHHHHHHHHHHHHHH Q ss_conf 9988999999999999 Q gi|254780652|r 302 GKQDNVVVAQRVLSHR 317 (339) Q Consensus 302 G~~eN~~~aQ~~~~~R 317 (339) .++++|..|+..+++. T Consensus 364 sk~K~V~maarSLI~L 379 (616) T KOG2229 364 SKKKVVMMAARSLIAL 379 (616) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 7788888889999999 No 69 >COG4277 Predicted DNA-binding protein with the Helix-hairpin-helix motif [General function prediction only] Probab=22.48 E-value=57 Score=13.76 Aligned_cols=13 Identities=46% Similarity=0.698 Sum_probs=6.9 Q ss_pred HHHHHHHHEECCC Q ss_conf 8754564100778 Q gi|254780652|r 45 RRDYREMLFRAKD 57 (339) Q Consensus 45 r~~~R~~l~tsp~ 57 (339) ||.|=+=||-|-| T Consensus 98 rRnYIeGLFLSSG 110 (404) T COG4277 98 RRNYIEGLFLSSG 110 (404) T ss_pred HHHHHHHHEECCC T ss_conf 8742433000246 No 70 >KOG0057 consensus Probab=22.27 E-value=36 Score=15.03 Aligned_cols=98 Identities=27% Similarity=0.383 Sum_probs=49.9 Q ss_pred HHHHHHHHHHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 89999999999984478720110110377579899999999999999999876385311100002556576444358988 Q gi|254780652|r 155 NMHILARYAVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRTLFNELYTMRVCFEGLVLKSNMILPGKSAPQVSDE 234 (339) Q Consensus 155 na~~larYA~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~lf~~L~~~~V~leg~lLKpnMv~pG~~~~~~~~e 234 (339) ....|+|+-- ..| +|++||-. | .+++-+-+..+....-...+.+..|+|-- =...|+.++.| T Consensus 394 ilr~LlrF~d---~sG-------~I~IdG~d-i---k~~~~~SlR~~Ig~VPQd~~LFndTIl~N----I~YGn~sas~e 455 (591) T KOG0057 394 ILRLLLRFFD---YSG-------SILIDGQD-I---KEVSLESLRQSIGVVPQDSVLFNDTILYN----IKYGNPSASDE 455 (591) T ss_pred HHHHHHHHHC---CCC-------CEEECCEE-H---HHHCHHHHHHHEEEECCCCCCCCHHHHHH----HHCCCCCCCHH T ss_conf 9999999744---688-------59998733-7---65075776522167677664300659988----63289876889 Q ss_pred HHHHHHHHH-HHH---CCCCC----C-CEEEEECCCCCHHHHHHH Q ss_conf 998777999-850---28743----8-625860588889999998 Q gi|254780652|r 235 EIASNTIKI-FKR---VVPCA----V-QGIAFLSGGQSEQDATAR 270 (339) Q Consensus 235 evA~~T~~~-l~~---~vP~~----V-pgIvFLSGGqs~eeAt~~ 270 (339) +|-+.--++ +++ ..|.. | -+=.+|||||-+--|-.| T Consensus 456 eV~e~~k~a~~hd~i~~l~~GY~T~VGerG~~LSGGekQrvslaR 500 (591) T KOG0057 456 EVVEACKRAGLHDVISRLPDGYQTLVGERGLMLSGGEKQRVSLAR 500 (591) T ss_pred HHHHHHHHCCCHHHHHHCCCCCHHHHHHCCCCCCCCHHHHHHHHH T ss_conf 999999972837888736663032675344425640678999999 No 71 >COG0363 NagB 6-phosphogluconolactonase/Glucosamine-6-phosphate isomerase/deaminase [Carbohydrate transport and metabolism] Probab=22.19 E-value=58 Score=13.72 Aligned_cols=18 Identities=22% Similarity=0.425 Sum_probs=9.5 Q ss_pred CEEEEECCCCCHHHHHHH Q ss_conf 625860588889999998 Q gi|254780652|r 253 QGIAFLSGGQSEQDATAR 270 (339) Q Consensus 253 pgIvFLSGGqs~eeAt~~ 270 (339) ..|+||--|..-.++... T Consensus 186 k~v~llv~G~~Ka~al~~ 203 (238) T COG0363 186 KEVLLLVTGEEKADALKQ 203 (238) T ss_pred CEEEEEEECHHHHHHHHH T ss_conf 879999708178999999 No 72 >PRK13689 hypothetical protein; Provisional Probab=21.64 E-value=39 Score=14.86 Aligned_cols=32 Identities=28% Similarity=0.310 Sum_probs=20.9 Q ss_pred HHHHHHHHHHHHHHHHHCC--CCCCCCCCCCCCCC Q ss_conf 9999999999999987638--53111000025565 Q gi|254780652|r 191 AEVTEFVLRTLFNELYTMR--VCFEGLVLKSNMIL 223 (339) Q Consensus 191 ~~vte~vL~~lf~~L~~~~--V~leg~lLKpnMv~ 223 (339) .+--|.++.+++..|..|+ ++|..+.| -|||| T Consensus 9 ~~qvE~il~el~~vLeKH~Ap~DLSLMvL-GN~vT 42 (75) T PRK13689 9 DEQVEQLLAELLAVLEKHKAPTDLSLMVL-GNMVT 42 (75) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHH-HHHHH T ss_conf 89999999999999986599941599999-79999 No 73 >TIGR00445 mraY phospho-N-acetylmuramoyl-pentapeptide-transferase; InterPro: IPR003524 Phospho-N-acetylmuramoyl-pentapeptide-transferase (2.7.8.13 from EC) (mraY) is a bacterial enzyme responsible for the formation of the first lipid intermediate of the cell wall peptidoglycan synthesis. It catalyzes the formation of undecaprenyl-pyrophosphoryl-N-acetylmuramoyl-pentapeptide from UDP-MurNAc-pentapeptide and undecaprenyl-phosphate. It is an integral membrane protein with probably ten transmembrane domains. It belongs to family 4 of glycosyl transferases. Homologs of mraY have been found in the archaebacteria Methanobacterium thermoautotrophicum and in Arabidopsis thaliana (Mouse-ear cress).; GO: 0008963 phospho-N-acetylmuramoyl-pentapeptide-transferase activity, 0009252 peptidoglycan biosynthetic process, 0016020 membrane. Probab=20.98 E-value=40 Score=14.78 Aligned_cols=19 Identities=26% Similarity=0.272 Sum_probs=14.8 Q ss_pred CCCCCCCCCCCHHHHHHHH Q ss_conf 8999862157835799999 Q gi|254780652|r 105 PLYPGEHITVGLDDLDVRL 123 (339) Q Consensus 105 ~g~~~e~~t~GLD~L~eR~ 123 (339) ..+|--..|+|||||+.-. T Consensus 155 ~~SNAvNLTDGLDGLA~~~ 173 (336) T TIGR00445 155 GTSNAVNLTDGLDGLAIGP 173 (336) T ss_pred HCCCCCHHHHHHHHHHHHH T ss_conf 6045310000013889999 No 74 >KOG3935 consensus Probab=20.85 E-value=62 Score=13.55 Aligned_cols=29 Identities=34% Similarity=0.651 Sum_probs=23.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEECCCCC Q ss_conf 8899877799985028743862586058888 Q gi|254780652|r 233 DEEIASNTIKIFKRVVPCAVQGIAFLSGGQS 263 (339) Q Consensus 233 ~eevA~~T~~~l~~~vP~~VpgIvFLSGGqs 263 (339) .++.|..-..+|+|.+|+ .-+.|||+|-. T Consensus 348 nQelaL~~~~aL~r~~~~--~d~tFLSaGTD 376 (446) T KOG3935 348 NQELALSCLDALKRRVPA--HDFTFLSAGTD 376 (446) T ss_pred CHHHHHHHHHHHCCCCCC--CCEEEECCCCC T ss_conf 188999999986688885--32157626777 No 75 >TIGR00960 3a0501s02 Type II (General) Secretory Pathway (IISP) Family protein; InterPro: IPR013505 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). FtsE is an ABC transporter ATP-binding protein. This protein and its permease partner FtsX, localize to the cell division site. In a number of species, the ftsEX gene pair is located next to ftsY, which encodes the signal recognition particle-docki ng protein.; GO: 0005524 ATP binding, 0051301 cell division. Probab=20.82 E-value=56 Score=13.81 Aligned_cols=10 Identities=30% Similarity=0.438 Sum_probs=4.8 Q ss_pred CEEEEECCCC Q ss_conf 6258605888 Q gi|254780652|r 253 QGIAFLSGGQ 262 (339) Q Consensus 253 pgIvFLSGGq 262 (339) .+++-||-|+ T Consensus 206 ~R~L~L~~G~ 215 (216) T TIGR00960 206 HRVLTLSKGR 215 (216) T ss_pred CCEEEECCCC T ss_conf 9669840788 No 76 >TIGR02111 PQQ_syn_pqqC coenzyme PQQ biosynthesis protein C; InterPro: IPR011845 This entry describes the coenzyme PQQ (pyrrolo-quinoline-quinone) biosynthesis protein PqqC. Pyrroloquinoline quinone (PQQ) is the prosthetic group of several bacterial enzymes, including methanol dehydrogenase of methylotrophs and the glucose dehydrogenase of a number of bacteria . PQQC has been found to be required in the synthesis of PQQ but its function is unclear. This entry does not include related proteins likely to be functionally distinct from PqqC, such as homologs found in the Chlamydias.; GO: 0018189 pyrroloquinoline quinone biosynthetic process. Probab=20.79 E-value=62 Score=13.54 Aligned_cols=54 Identities=22% Similarity=0.229 Sum_probs=36.4 Q ss_pred CCHHHHHHHHHHHEECCCCHHEEEEEECCHHHHHHCCCCCCCHHHHHHHCCCEEEEE Q ss_conf 986788754564100778023046434677884100048874679899579878994 Q gi|254780652|r 40 STENSRRDYREMLFRAKDAMQYISSILLYEETLYQNAQDGTSFVDLISSSGVLVGIK 96 (339) Q Consensus 40 nteenr~~~R~~l~tsp~~~~~IsGvILfeeTl~q~~~~g~~~~~~l~~kGIvpgIK 96 (339) ...+.||.||+||+--.|-.+-=+|. +.=|+=...=|.+-..+...++|+||.. T Consensus 64 ~D~~~Rr~W~qRi~DhDG~~~~dGGi---~rWLrLaeAvGL~r~~v~S~~~vLpg~R 117 (239) T TIGR02111 64 PDPELRRIWRQRILDHDGEVEEDGGI---ERWLRLAEAVGLDREYVESERGVLPGTR 117 (239) T ss_pred CCHHHHHHHHHHHHCCCCCCCCCCHH---HHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 98888999987442048678887428---9999876316887787411121100467 No 77 >smart00832 C8 C8 domain. This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. Probab=20.40 E-value=63 Score=13.49 Aligned_cols=20 Identities=35% Similarity=0.526 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHHHHHCCCCC Q ss_conf 98999999999998447872 Q gi|254780652|r 154 ANMHILARYAVLCQNAGLVP 173 (339) Q Consensus 154 ~na~~larYA~i~Q~~GLVP 173 (339) .-...|+-||..||+.|..+ T Consensus 47 ~lC~al~~Ya~~C~~~gv~i 66 (76) T smart00832 47 CLCDALAAYAAACAEAGVCI 66 (76) T ss_pred HHHHHHHHHHHHHHHCCCCC T ss_conf 67799999999999972978 No 78 >TIGR00452 TIGR00452 methyltransferase, putative; InterPro: IPR010017 Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalyzed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order . Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI ), shared by other AdoMet-Mtases , is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments , although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases , , , . The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences . This is a set of proteobacterial proteins, which have homology in their central region to a large number of methyltransferases active on a variety of substrates. . Probab=20.37 E-value=63 Score=13.48 Aligned_cols=62 Identities=24% Similarity=0.254 Sum_probs=44.0 Q ss_pred CCEEECCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEECHHHCCCCCCCHHHHHHHHHHHHHH-HHHHHHHC Q ss_conf 95550100023367787408888998999999999998447872011011037757989999999999999-99998763 Q gi|254780652|r 130 GARVAKWRVVLSVSDVLPSMTVVKANMHILARYAVLCQNAGLVPIVEPEVLMEGDHTIERCAEVTEFVLRT-LFNELYTM 208 (339) Q Consensus 130 Ga~faKwRaVi~i~~~~Ps~~~I~~na~~larYA~i~Q~~GLVPIVEPEVl~dG~H~ie~~~~vte~vL~~-lf~~L~~~ 208 (339) +-|+||||.|+-| ||..+... =|...|++-|-=-+|.. +-..++.-|+-++.+ +.+.|+.+ T Consensus 241 ~~~YAkMkNVYFI----PSv~aL~~----------Wl~kvGF~~~~i~~V~~----TT~~EQRkTd~~~~~SL~DFLD~~ 302 (316) T TIGR00452 241 KDRYAKMKNVYFI----PSVSALKN----------WLEKVGFENVRILDVLK----TTLEEQRKTDWLLGESLEDFLDPT 302 (316) T ss_pred CCCCCCEEEEEEH----HHHHHHHH----------HHHHCCCCEEEEEEEEC----CCHHHHHHHHHHHHCCHHHHCCCC T ss_conf 5444220001420----23899988----------88744953147875100----372242445677520066624888 Q ss_pred C Q ss_conf 8 Q gi|254780652|r 209 R 209 (339) Q Consensus 209 ~ 209 (339) + T Consensus 303 D 303 (316) T TIGR00452 303 D 303 (316) T ss_pred C T ss_conf 7 No 79 >KOG1344 consensus Probab=20.24 E-value=63 Score=13.46 Aligned_cols=80 Identities=25% Similarity=0.293 Sum_probs=35.8 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHCC----CCCCCCCCCCCCCCCCCCC--CCCCHHHHHHHHHHHHHHCCCCCCC Q ss_conf 037757989999999999999999987638----5311100002556576444--3589889987779998502874386 Q gi|254780652|r 180 LMEGDHTIERCAEVTEFVLRTLFNELYTMR----VCFEGLVLKSNMILPGKSA--PQVSDEEIASNTIKIFKRVVPCAVQ 253 (339) Q Consensus 180 l~dG~H~ie~~~~vte~vL~~lf~~L~~~~----V~leg~lLKpnMv~pG~~~--~~~~~eevA~~T~~~l~~~vP~~Vp 253 (339) ++.|..+-|-- .++-..+.+.|.++. |+--||= ++.|.-- -.-||+-|-+.--.+++-----.+| T Consensus 223 l~~gTeddeYL----rkl~r~l~~sl~ef~Pd~VvYNAGTD-----iLeGDpLG~L~ISp~Gi~~RDelVFr~~R~~~iP 293 (324) T KOG1344 223 LRNGTEDDEYL----RKLKRCLMQSLAEFRPDMVVYNAGTD-----ILEGDPLGNLAISPEGIIERDELVFRTFRALGIP 293 (324) T ss_pred EECCCCCHHHH----HHHHHHHHHHHHHHCCCEEEEECCCC-----CCCCCCCCCEEECCCCCCHHHHHHHHHHHHCCCC T ss_conf 41477746999----99999999999861986799807874-----2058877772346233502568999999872986 Q ss_pred EEEEECCCCCHHHHH Q ss_conf 258605888899999 Q gi|254780652|r 254 GIAFLSGGQSEQDAT 268 (339) Q Consensus 254 gIvFLSGGqs~eeAt 268 (339) -++.+||||-..-|. T Consensus 294 vvMltSGGY~K~sAr 308 (324) T KOG1344 294 VVMLTSGGYLKASAR 308 (324) T ss_pred EEEEECCCEEHHHHH T ss_conf 899914861111003 Done!