Query gi|254780185|ref|YP_003064598.1| glutamine synthetase protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 458 No_of_seqs 147 out of 2723 Neff 7.4 Searched_HMMs 39220 Date Mon May 23 07:58:26 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780185.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00653 GlnA glutamine synth 100.0 0 0 1080.3 26.2 452 6-457 1-486 (486) 2 PRK09469 glnA glutamine synthe 100.0 0 0 942.3 43.4 455 4-458 2-469 (469) 3 TIGR03105 gln_synth_III glutam 100.0 0 0 906.3 39.1 420 8-457 1-435 (435) 4 COG0174 GlnA Glutamine synthet 100.0 0 0 899.1 38.6 428 4-458 2-443 (443) 5 pfam00120 Gln-synt_C Glutamine 100.0 0 0 594.9 20.3 255 103-372 1-257 (257) 6 KOG0683 consensus 100.0 8.9E-42 0 280.3 8.3 341 16-412 28-374 (380) 7 COG3968 Uncharacterized protei 99.9 8.9E-24 2.3E-28 168.0 14.6 278 125-416 215-574 (724) 8 pfam03951 Gln-synt_N Glutamine 99.7 3.8E-17 9.7E-22 126.7 8.9 81 16-96 1-84 (84) 9 PRK13517 carboxylate-amine lig 98.9 7.8E-08 2E-12 68.6 14.1 199 125-381 8-266 (371) 10 TIGR02050 gshA_cyan_rel unchar 98.8 5.2E-08 1.3E-12 69.7 10.8 211 128-389 2-275 (297) 11 PRK13515 carboxylate-amine lig 98.8 7E-07 1.8E-11 62.6 14.8 199 125-381 5-263 (381) 12 PRK13518 carboxylate-amine lig 98.7 1.1E-06 2.8E-11 61.4 13.8 161 200-378 47-267 (357) 13 PRK13516 carboxylate-amine lig 98.7 1.7E-06 4.3E-11 60.3 14.5 192 125-374 11-262 (372) 14 pfam04107 GCS2 Glutamate-cyste 98.2 1.4E-05 3.6E-10 54.5 9.4 160 201-376 37-258 (291) 15 COG2170 Uncharacterized conser 97.7 0.00068 1.7E-08 44.0 10.8 158 206-381 43-260 (369) 16 KOG0683 consensus 97.1 0.00033 8.5E-09 45.9 2.4 257 60-375 93-369 (380) 17 TIGR03444 gshA_related glutama 85.7 1.9 4.8E-05 22.5 4.9 30 347-376 290-320 (390) 18 COG3572 GshA Gamma-glutamylcys 84.8 0.63 1.6E-05 25.5 2.1 23 348-370 318-341 (456) 19 pfam08759 DUF1792 Domain of un 82.3 0.99 2.5E-05 24.3 2.3 77 237-318 76-157 (225) 20 KOG0558 consensus 77.0 2.4 6.1E-05 21.9 2.9 56 181-252 246-302 (474) 21 PRK05784 phosphoribosylamine-- 59.1 13 0.00034 17.2 5.5 97 125-226 203-316 (485) 22 COG4456 VagC Virulence-associa 43.5 13 0.00032 17.3 1.4 29 330-358 7-35 (74) 23 PRK00885 phosphoribosylamine-- 40.8 26 0.00066 15.4 6.4 198 3-225 49-294 (424) 24 PRK06395 phosphoribosylamine-- 38.7 28 0.00072 15.2 6.8 96 125-225 188-299 (435) 25 cd04865 LigD_Pol_like_2 LigD_P 38.3 28 0.00072 15.2 5.1 122 195-342 80-210 (228) 26 cd06184 flavohem_like_fad_nad_ 37.9 23 0.00058 15.8 2.0 28 214-244 143-170 (247) 27 pfam12224 Amidoligase_2 Putati 36.7 30 0.00077 15.0 5.7 14 349-362 224-237 (250) 28 TIGR01108 oadA oxaloacetate de 35.8 31 0.0008 14.9 5.3 63 172-251 138-201 (616) 29 TIGR00871 zwf glucose-6-phosph 35.6 13 0.00032 17.3 0.4 39 321-361 366-405 (498) 30 PRK09632 ATP-dependent DNA lig 34.7 32 0.00083 14.8 5.6 18 89-107 131-148 (753) 31 TIGR01393 lepA GTP-binding pro 34.4 33 0.00084 14.8 3.4 53 64-122 272-324 (598) 32 PRK13289 nitric oxide dioxygen 34.2 28 0.00072 15.2 2.0 12 232-243 306-317 (399) 33 COG1602 Uncharacterized conser 33.4 32 0.00082 14.8 2.2 52 188-240 183-239 (402) 34 COG4710 Predicted DNA-binding 33.3 19 0.00048 16.3 1.0 47 399-451 11-57 (80) 35 COG3867 Arabinogalactan endo-1 33.0 35 0.00088 14.6 3.2 23 4-26 64-87 (403) 36 pfam07745 Glyco_hydro_53 Glyco 33.0 35 0.00088 14.6 9.1 26 4-29 25-51 (332) 37 PRK11930 putative bifunctional 32.8 35 0.00089 14.6 3.9 93 322-414 241-358 (824) 38 TIGR00750 lao LAO/AO transport 32.3 18 0.00047 16.3 0.8 41 184-230 122-162 (333) 39 PRK13895 conjugal transfer pro 30.5 33 0.00084 14.7 1.9 17 232-248 5-21 (144) 40 cd07952 ED_3B_like Uncharacter 28.2 42 0.0011 14.1 7.8 100 184-300 80-198 (256) 41 pfam11657 Activator-TraM Trans 27.8 39 0.001 14.3 1.9 17 232-248 5-21 (144) 42 pfam06877 DUF1260 Protein of u 27.8 42 0.0011 14.1 6.6 90 110-245 5-96 (103) 43 PRK13789 phosphoribosylamine-- 27.3 43 0.0011 14.0 5.0 96 125-225 192-301 (426) 44 TIGR00397 mauM_napG MauM/NapG 27.1 25 0.00064 15.5 0.8 30 103-132 103-132 (213) 45 pfam09904 DUF2131 Uncharacteri 26.0 29 0.00075 15.1 1.0 24 105-136 10-33 (67) 46 TIGR01793 cit_synth_euk citrat 25.6 26 0.00065 15.4 0.6 52 5-58 16-70 (430) 47 pfam03851 UvdE UV-endonuclease 25.0 48 0.0012 13.8 5.8 67 186-258 45-113 (275) 48 KOG2438 consensus 24.9 48 0.0012 13.8 5.1 164 209-411 170-342 (505) 49 KOG1016 consensus 24.8 28 0.00072 15.2 0.7 11 184-194 420-430 (1387) 50 cd04862 PaeLigD_Pol_like PaeLi 24.5 49 0.0012 13.7 7.5 126 191-342 75-209 (227) 51 COG1702 PhoH Phosphate starvat 21.8 55 0.0014 13.4 2.4 48 357-408 153-200 (348) 52 TIGR02768 TraA_Ti Ti-type conj 21.4 56 0.0014 13.3 3.5 53 213-267 99-152 (888) 53 TIGR00631 uvrb excinuclease AB 21.1 49 0.0013 13.7 1.3 27 184-210 457-483 (667) 54 pfam03484 B5 tRNA synthetase B 20.5 58 0.0015 13.2 2.7 46 188-245 22-69 (69) No 1 >TIGR00653 GlnA glutamine synthetase, type I; InterPro: IPR004809 Glutamine synthetase type I (or glutamate-ammonia ligase) has a dodecameric form, which can be subdivided into 1-alpha and 1-beta forms. The phylogeny of the 1-alpha and 1-beta forms appears polyphyletic. Escherichia coli, Synechocystis sp. (strain PCC 6803) , Aquifex aeolicus, and the crenarcheon Sulfolobus acidocaldarius have form 1-beta, while Bacillus subtilis, Thermotoga maritima, and various euryarchaea has form 1-alpha. The 1-beta dodecamer from the crenarcheon S. acidocaldarius differs from that in E. coli in that it is not regulated by adenylation. ; GO: 0004356 glutamate-ammonia ligase activity, 0009399 nitrogen fixation, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=1080.28 Aligned_cols=452 Identities=44% Similarity=0.748 Sum_probs=433.6 Q ss_pred HHHHHHHHH-CCCCEEEEEEECCCCCEEEEEEEHHHHH-----HHHHHCCCCCCCCCHHCCCC-CCCCEEEEEECCCCEE Q ss_conf 999999985-7983999999789982523787787830-----55884598456651002468-9988899964110587 Q gi|254780185|r 6 TSIIQKIAQ-EKVKFIDLRFTDLQGKFRHISMNVSLFN-----ENTLLNGITFDSSSIDGWHS-KHPNLLLIPDIETMHM 78 (458) Q Consensus 6 ~~i~~~l~~-~~I~~V~~~~~Dl~G~~R~k~ip~~~f~-----~~~~~~G~~fd~s~i~g~~~-~~~D~~l~PD~~T~~~ 78 (458) +++++.+|+ ++||||+++|+|+.|.+++.+||++.+. +.++++|+.||||||.||.. ++|||+|+|||+|+++ T Consensus 1 ~~~~~~~keG~~VkfV~l~F~Di~G~~~~~~iP~~~~~~i~~~~~~fe~G~~FDGSSi~Gf~~IeeSDM~l~PDp~T~~i 80 (486) T TIGR00653 1 EEVLKLIKEGENVKFVDLRFTDIKGKLQHVEIPASALDDIDELEEAFEEGIMFDGSSIRGFQGIEESDMLLKPDPSTAVI 80 (486) T ss_pred CCHHHHHCCCCCEEEEEEEEECCCCCEEEEEEEEECCHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCCCCCCCCEEE T ss_conf 92023650788057999998637884101688831321178888886426344661205746611102201027363167 Q ss_pred CCCCCCC--EEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 7568972--27998520147544853025553235567998730345225773133089970677777865477666765 Q gi|254780185|r 79 DPFYAQS--TVAFICNVYDPITLQPYNRDPRYTAQKAIDYLQTTDIGDTLLLGITTEFFVFDNVHCTISPIKSGFALEST 156 (458) Q Consensus 79 ~Pw~~~~--~~~v~~d~~~~~~g~P~~~~PR~~Lk~~~~~l~~~~~G~~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~ 156 (458) +||.+++ |++|+||+|++.+++|++.|||++|||+++.|+++++|.+.++|.|.||||||++..........+..++. T Consensus 81 ~Pw~~~~~~~~~~ic~vY~p~~~~~y~rDPR~~A~rae~~L~~~~~g~~~~fGPEpEFFlFd~~~~~~~~n~~~~~~d~~ 160 (486) T TIGR00653 81 DPWRAEKKKTLRVICDVYEPVTGEPYERDPRSIAKRAEEYLKSTGIGDTAYFGPEPEFFLFDSIEFGSLSNSSYYEVDSE 160 (486) T ss_pred CCCCCCCCCEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCEEEECCCCCCCCCCCCCEEEEEEE T ss_conf 45316888579999887307777327888478999999986214677377865873121005465786555442114444 Q ss_pred CCCCCC------C---CCEECCCCCCCC-CCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCH-HHCCCCCCCCHHHH Q ss_conf 457777------7---750036554556-355244468999999999986054112221335621-21036555668887 Q gi|254780185|r 157 EFLQNG------N---NKGYDSHAKSSY-TLPPQDKLHDMRSEIVSALNNIGVQITKYHNQINDA-QHSFGLQSESLLHA 225 (458) Q Consensus 157 ~~~~~~------~---~~g~~~~~~~~y-~~~~~~~~~~~~~ei~~~l~~~Gi~ie~~h~E~gpG-Q~Ei~~~~~~~l~a 225 (458) ++.||. + |.+|....+++| .+.|.|...+++++|+..|+++|+.||..|||+|+| |.||+++|..++++ T Consensus 161 eG~WN~~r~~e~g~~~n~g~~~~~kgGYF~v~P~D~~~D~R~e~v~~L~~~G~~~E~~HHEVA~aGQ~Eid~kf~~l~~~ 240 (486) T TIGR00653 161 EGAWNTGREYEGGSELNSGYKPRDKGGYFPVPPTDTAVDIRREMVLTLEQLGLDVEVHHHEVATAGQHEIDFKFDTLLKT 240 (486) T ss_pred ECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCEEEECCHHHHHHH T ss_conf 35558684468888612460347777775876898888999999999986598288860101478742530464789874 Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHHCC----CCCCCEECCC--CCCCCHHHHHHHHHHHHHH Q ss_conf 769999999999998744975444121014568601201111114----6763122155--5657899999999999886 Q gi|254780185|r 226 SDNLQKYKYSVHQVANSYCKIATFMPKPIASHNGSGMYLNMSIHK----GEKAIFTGNQ--HDRVSLKGLYYLGGIIKHA 299 (458) Q Consensus 226 AD~~~~~k~~ik~iA~~~G~~ATFmpKP~~~~~GsG~Hih~Sl~~----~~~n~f~~~~--~~~ls~~~~~fiaGil~h~ 299 (458) ||++++||++||.||++||++|||||||+.|+||||||+|+|||+ +|.|+|.+++ ..+||++|+|||||||+|+ T Consensus 241 AD~~~~yKyvvK~vA~~~Gk~ATFMPKPlfGdNGSGMH~H~SLWK~ePf~GenlF~g~~~~y~GLSe~ALyyIGGiLkHA 320 (486) T TIGR00653 241 ADDIQTYKYVVKNVAKKHGKTATFMPKPLFGDNGSGMHCHQSLWKGEPFDGENLFAGEEGEYAGLSETALYYIGGILKHA 320 (486) T ss_pred HCCEEEHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCEEEEEEEECCCCCCCEECCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 05045147899899998388797458788772668554835563277888840006858876543588888752087777 Q ss_pred HHCCHHCCCCCHHHHHHCCCCCCCEEEEECCCCCCCEEECCCCC-CCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 41000006670014431888521201011368876401025788-76541244046753212799999999999987885 Q gi|254780185|r 300 KSLNALTNASTNSYKRLLTDSQSPTKLTYSTHNHSASCRIPYEN-KSDNKSIEIRFPDLSANPYLAPAAILMAGLDGIAK 378 (458) Q Consensus 300 ~al~a~~~PtvNSYkRl~p~~~ap~~~~WG~~NR~a~iRip~~~-~~~~~riE~R~~da~aNPYL~lAailaAgl~Gi~~ 378 (458) |||+|||||||||||||+||++||++++||..||||+||||+.. +++++|||+|+||++||||||+||+|+||||||+| T Consensus 321 ~AL~A~tNPTvNSYKRLVPGYEAPv~lays~~NRSA~iRIP~~~~n~kA~R~E~R~PDPs~NPYLAFAA~LmAGLDGIKn 400 (486) T TIGR00653 321 KALAAFTNPTVNSYKRLVPGYEAPVYLAYSARNRSALIRIPASRGNPKATRIEVRFPDPSANPYLAFAALLMAGLDGIKN 400 (486) T ss_pred HHHHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCCCCEEEECCEECCCCEEEEEEECCCCCCCHHHHHHHHHHHHCCCCCC T ss_conf 88889765660552236886121023021677875258740164088336998707887656789999999862012001 Q ss_pred CCCCCCCCC----CCCHHHHHC--CCCCCCCHHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHH-HHHHHHHCCCCHHH Q ss_conf 599986655----788777713--20001798999999873989898413378899999999999-99999965999789 Q gi|254780185|r 379 KIHPGKNID----ELSLEEQNT--IPRICISLRESLENLDKDREFLKVGNVFDDDQIDAFINLKM-KEVLKLESSPSPVE 451 (458) Q Consensus 379 ~l~p~~~~~----~~~~~~~~~--~~~LP~~L~eAl~~l~~d~~~~~~~~~lG~~~v~~~~~~K~-~E~~~~~~~vt~wE 451 (458) ||+||+|++ .++.||... |+.||.||+|||++|++|++|.+.+++||+++++.|+.+|| .||++++..||+|| T Consensus 401 KI~PgeP~d~n~Y~l~peE~~e~GI~~lP~~L~eAL~~Le~D~Ef~~~~~vfg~~~~e~fie~Kr~~E~~~~r~~phP~E 480 (486) T TIGR00653 401 KIDPGEPVDKNLYELSPEELRELGIPQLPGSLEEALDELESDHEFLVLKGVFGEEFIEAFIELKREKEWDDYRLRPHPWE 480 (486) T ss_pred CCCCCCCCCCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHCCCCCHHHCCCCHHHHHHHHHHHHHHCCCCHHCCCCCCHH T ss_conf 45878731043335898999738901105679999999731462002203203789999987653025100036774012 Q ss_pred HHHHCC Q ss_conf 986357 Q gi|254780185|r 452 FEMYCS 457 (458) Q Consensus 452 ~~~Y~~ 457 (458) +++||+ T Consensus 481 f~~Y~~ 486 (486) T TIGR00653 481 FEKYYD 486 (486) T ss_pred HHCCCC T ss_conf 201159 No 2 >PRK09469 glnA glutamine synthetase; Provisional Probab=100.00 E-value=0 Score=942.32 Aligned_cols=455 Identities=43% Similarity=0.751 Sum_probs=417.2 Q ss_pred CHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEEEHHHHHHHHHHCCCCCCCCCHHCCCC-CCCCEEEEEECCCCEECCCC Q ss_conf 98999999985798399999978998252378778783055884598456651002468-99888999641105877568 Q gi|254780185|r 4 KATSIIQKIAQEKVKFIDLRFTDLQGKFRHISMNVSLFNENTLLNGITFDSSSIDGWHS-KHPNLLLIPDIETMHMDPFY 82 (458) Q Consensus 4 ~~~~i~~~l~~~~I~~V~~~~~Dl~G~~R~k~ip~~~f~~~~~~~G~~fd~s~i~g~~~-~~~D~~l~PD~~T~~~~Pw~ 82 (458) +.++|+++|++++|+||+++|+|++|++|+|.||++.|.+..+++|+.||+|++.||.. +++|++++||++|++++||. T Consensus 2 s~eevl~~i~e~~I~~V~~~f~Dl~G~~r~k~Ip~~~~~~~~~~~G~~fdGSsv~g~~~i~~sD~~l~PD~~T~~v~Pw~ 81 (469) T PRK09469 2 SAEHVLTMLNEHEVKFVDLRFTDTKGKEQHVTIPAHQVNADFFEEGKMFDGSSIGGWKGINESDMVLMPDASTAVLDPFF 81 (469) T ss_pred CHHHHHHHHHHCCCEEEEEEEECCCCCCCEEEECHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEECCCCCEECCCC T ss_conf 98999999998898599999987998854069848995099985897535544035566777777999877767888665 Q ss_pred CCCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 97227998520147544853025553235567998730345225773133089970677777865477666765457777 Q gi|254780185|r 83 AQSTVAFICNVYDPITLQPYNRDPRYTAQKAIDYLQTTDIGDTLLLGITTEFFVFDNVHCTISPIKSGFALESTEFLQNG 162 (458) Q Consensus 83 ~~~~~~v~~d~~~~~~g~P~~~~PR~~Lk~~~~~l~~~~~G~~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~ 162 (458) ++.++.++||++++.+.+|++.|||++|||+++++++.++++++.+|.|+|||||++.....++....+..+..+..|+. T Consensus 82 ~~~~~~v~cdi~~~~~~~~~~~dPR~iLkr~~~~l~~~G~~~~~~~g~E~EFyLfd~~~~~~~~~~~~~~id~~e~~~~~ 161 (469) T PRK09469 82 ADSTLIIRCDILEPGTMQGYDRDPRSIAKRAEDYLRSTGIADTVLFGPEPEFFLFDDIRFGTSISGSHVKIDDIEAAWNS 161 (469) T ss_pred CCCEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEECCCEEEEEECCCCCCCCCCCCCEECCCCCCCCCC T ss_conf 88619999999887887647889799999999999975996301231433899842644466666661202653234567 Q ss_pred ------CCCEECCCCCC-CCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC-CHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf ------77500365545-563552444689999999999860541122213356-2121036555668887769999999 Q gi|254780185|r 163 ------NNKGYDSHAKS-SYTLPPQDKLHDMRSEIVSALNNIGVQITKYHNQIN-DAQHSFGLQSESLLHASDNLQKYKY 234 (458) Q Consensus 163 ------~~~g~~~~~~~-~y~~~~~~~~~~~~~ei~~~l~~~Gi~ie~~h~E~g-pGQ~Ei~~~~~~~l~aAD~~~~~k~ 234 (458) ++.++.+..++ +|.+.+.|...+++.+++..|+++||+|++.|||+| |||+||++.|.++|+|||+++++|+ T Consensus 162 g~~~~~~~~~~~p~~~~gy~~~~~~d~~~d~~~~i~~~l~~~Gi~ve~~hhE~a~~GQ~Ei~~~~~~~l~aAD~~~~~k~ 241 (469) T PRK09469 162 GTQYEGGNKGHRPGVKGGYFPVPPVDSSQDIRSAMCLVMEEMGLVVEAHHHEVATAGQNEVATRFNTMTKKADEIQIYKY 241 (469) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEECCCCCCCCCEEEEECCCCCHHHHHHHHHHHHH T ss_conf 76445776676666567667688643277799999999998699715445567789704897157869999989999999 Q ss_pred HHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHCCHHCCCCCHHHH Q ss_conf 99999874497544412101456860120111111467631221555657899999999999886410000066700144 Q gi|254780185|r 235 SVHQVANSYCKIATFMPKPIASHNGSGMYLNMSIHKGEKAIFTGNQHDRVSLKGLYYLGGIIKHAKSLNALTNASTNSYK 314 (458) Q Consensus 235 ~ik~iA~~~G~~ATFmpKP~~~~~GsG~Hih~Sl~~~~~n~f~~~~~~~ls~~~~~fiaGil~h~~al~a~~~PtvNSYk 314 (458) +||+||++||+.|||||||+.+.+|||||+|+|||++|+|+|.++++++||+.+++||||||+|+|+++||+|||||||| T Consensus 242 ~vk~iA~~~g~~atFmpKP~~~~~Gsg~H~H~Sl~~~g~n~f~~~~~~~lS~~~~~~iaGll~h~~~l~a~~~PtvNSYk 321 (469) T PRK09469 242 VVHNVAHAFGKTATFMPKPMVGDNGSGMHCHMSLSKNGVNLFAGDKYAGLSEQALYYIGGIIKHAKAINALANPTTNSYK 321 (469) T ss_pred HHHHHHHHCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 99999997498788821667999957415898842799436689987775799999999999876865657658805764 Q ss_pred HHCCCCCCCEEEEECCCCCCCEEECCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCH--- Q ss_conf 31888521201011368876401025788765412440467532127999999999999878855999866557887--- Q gi|254780185|r 315 RLLTDSQSPTKLTYSTHNHSASCRIPYENKSDNKSIEIRFPDLSANPYLAPAAILMAGLDGIAKKIHPGKNIDELSL--- 391 (458) Q Consensus 315 Rl~p~~~ap~~~~WG~~NR~a~iRip~~~~~~~~riE~R~~da~aNPYL~lAailaAgl~Gi~~~l~p~~~~~~~~~--- 391 (458) ||+|++|||++++||.+||+++||||..++++++|||+|++|++|||||++||+|+||||||+++|+||+|+.++.+ T Consensus 322 Rl~p~~~aP~~~~Wg~~NR~~~iRip~~~~~~~~riE~R~~d~~aNPYL~~Aa~l~Agl~Gi~~~l~p~~~~~~~~y~~~ 401 (469) T PRK09469 322 RLVPGYEAPVMLAYSARNRSASIRIPVVPSPKARRIEVRFPDPAANPYLCFAALLMAGLDGIKNKIHPGEAMDKNLYDLP 401 (469) T ss_pred HCCCCCCCCCCCEEECCCCCCEEECCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC T ss_conf 14898768874322068886214357678987633674079988798999999999998788638999987777602199 Q ss_pred -HHHHCCCCCCCCHHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCC Q ss_conf -7771320001798999999873989898413378899999999999999999659997899863579 Q gi|254780185|r 392 -EEQNTIPRICISLRESLENLDKDREFLKVGNVFDDDQIDAFINLKMKEVLKLESSPSPVEFEMYCSI 458 (458) Q Consensus 392 -~~~~~~~~LP~~L~eAl~~l~~d~~~~~~~~~lG~~~v~~~~~~K~~E~~~~~~~vt~wE~~~Y~~~ 458 (458) ++..++++||.||.|||++|++|+++++..++||++||+.|+++||+||++|++.||+||+++||+| T Consensus 402 ~~~~~~~p~lP~sl~eAl~~le~d~~~~~~~g~fg~~~v~~~~~~Kr~E~~~~~~~vt~wE~~~Y~~~ 469 (469) T PRK09469 402 PEEAAEIPQVAGSLEEALNALDADREFLTAGGVFTDDAIDAYIALRREEVDRVRMTPHPVEFELYYSV 469 (469) T ss_pred HHHHCCCCCCCCCHHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCC T ss_conf 77863677784479999999872989898655275999999999999999999778998999761489 No 3 >TIGR03105 gln_synth_III glutamine synthetase, type III. This family consists of the type III isozyme of glutamine synthetase, originally described in Rhizobium meliloti, where types I and II also occur. Probab=100.00 E-value=0 Score=906.29 Aligned_cols=420 Identities=26% Similarity=0.416 Sum_probs=377.3 Q ss_pred HHHHHHHCCCCEEEEEEECCCCCEEEEEEEHHHHHHHHHHCCCCCCCCCHHCC--CCCCCCEEEEEECCCCEECCCCCCC Q ss_conf 99999857983999999789982523787787830558845984566510024--6899888999641105877568972 Q gi|254780185|r 8 IIQKIAQEKVKFIDLRFTDLQGKFRHISMNVSLFNENTLLNGITFDSSSIDGW--HSKHPNLLLIPDIETMHMDPFYAQS 85 (458) Q Consensus 8 i~~~l~~~~I~~V~~~~~Dl~G~~R~k~ip~~~f~~~~~~~G~~fd~s~i~g~--~~~~~D~~l~PD~~T~~~~Pw~~~~ 85 (458) |.+++++++|+||+++|+|++|++|||.||+++|. +++.+|++|+++++.++ ..+++|++++||++|++++||. +. T Consensus 1 ~~~~i~~~~I~~v~~~~~D~~G~~RgK~vp~~~~~-~~~~~G~~f~~~~~~~~~~~~~~~D~~~~PD~~Tl~~lPw~-p~ 78 (435) T TIGR03105 1 LQALARDKGIKYFLASFVDLHGVQKAKLVPAEAID-HMATGGAGFAGFAAWGLGQSPADPDLMAIPDLSSLTQLPWQ-PG 78 (435) T ss_pred CHHHHHHCCCEEEEEEEECCCCCEEEEEEEHHHHH-HHHHCCCCCEEECHHHCCCCCCCCCEEEEECCCCEEECCCC-CC T ss_conf 96688558985999999999995789998699999-99858997101243213789998868999757830778888-98 Q ss_pred EEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 27998520147544853025553235567998730345225773133089970677777865477666765457777775 Q gi|254780185|r 86 TVAFICNVYDPITLQPYNRDPRYTAQKAIDYLQTTDIGDTLLLGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNK 165 (458) Q Consensus 86 ~~~v~~d~~~~~~g~P~~~~PR~~Lk~~~~~l~~~~~G~~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 165 (458) +++|+||++ .+|+|++.|||++||++++++++ .|+++++|+|+|||||+++..... . +.++. T Consensus 79 ~a~v~~d~~--~dG~p~~~~PR~~L~~~~~~~~~--~G~~~~~g~ElEF~l~~~~~~~~~-----~---------~~~~~ 140 (435) T TIGR03105 79 VAWVAADLH--VNGKPYPQAPRVVLKRQLAEAAE--LGLTLNTGVECEFFLLRRDEDGSL-----S---------IADRA 140 (435) T ss_pred EEEEEEEEC--CCCCCCCCCHHHHHHHHHHHHHH--CCCCEEECCCEEEEEEECCCCCCC-----C---------CCCCC T ss_conf 489999974--08988886979999999999996--599289767627999821688985-----5---------67877 Q ss_pred EECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 00365545563552444689999999999860541122213356212103655566888776999999999999874497 Q gi|254780185|r 166 GYDSHAKSSYTLPPQDKLHDMRSEIVSALNNIGVQITKYHNQINDAQHSFGLQSESLLHASDNLQKYKYSVHQVANSYCK 245 (458) Q Consensus 166 g~~~~~~~~y~~~~~~~~~~~~~ei~~~l~~~Gi~ie~~h~E~gpGQ~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G~ 245 (458) .....++|++...++..+++++|+++|+++||+|+++|+|+|||||||++.|+++|+|||+++++|++||+||.+||+ T Consensus 141 --~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~l~~~~~l~aAD~~~~~K~~ik~iA~~~G~ 218 (435) T TIGR03105 141 --DTLAKPCYDQRGLMRRYDVLTEISDAMNALGWDPYQNDHEDANGQFEMNFTYADALTTADRHAFFRYMVKEIAEKHGM 218 (435) T ss_pred --CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf --767788667200101679999999999977997188664657981699726799899999999999999999998099 Q ss_pred EEEEEEEECCCCCCCHHHHHHHHCC-CCCCCEECC-CCC--CCCHHHHHHHHHHHHHHHHCCHHCCCCCHHHHHHCCC-- Q ss_conf 5444121014568601201111114-676312215-556--5789999999999988641000006670014431888-- Q gi|254780185|r 246 IATFMPKPIASHNGSGMYLNMSIHK-GEKAIFTGN-QHD--RVSLKGLYYLGGIIKHAKSLNALTNASTNSYKRLLTD-- 319 (458) Q Consensus 246 ~ATFmpKP~~~~~GsG~Hih~Sl~~-~~~n~f~~~-~~~--~ls~~~~~fiaGil~h~~al~a~~~PtvNSYkRl~p~-- 319 (458) +|||||||+.+.+|||||+|+|||+ +|+|+|.++ +++ +||+.++|||||||+|+|++|||+|||+||||||+|+ T Consensus 219 ~ATFMpKP~~~~~GsG~HiH~SL~d~~g~n~f~d~~~~~~~glS~~~~~fiaGlL~~~~~l~a~~~Pt~NSYkRl~~~~~ 298 (435) T TIGR03105 219 RATFMPKPFADLTGNGCHFHLSLWDEDGRNLFADDSDPNGLGLSKLAYHFIGGILHHAPALCAVLAPTVNSYKRLNAPRT 298 (435) T ss_pred EEEECCCCCCCCCCCEEEEEEEEECCCCCEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCCCCCC T ss_conf 89970313789998700389867759997603689876655545999999999998613231501477652111568865 Q ss_pred ----CCCCEEEEECCCCCCCEEECCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHH-- Q ss_conf ----52120101136887640102578876541244046753212799999999999987885599986655788777-- Q gi|254780185|r 320 ----SQSPTKLTYSTHNHSASCRIPYENKSDNKSIEIRFPDLSANPYLAPAAILMAGLDGIAKKIHPGKNIDELSLEE-- 393 (458) Q Consensus 320 ----~~ap~~~~WG~~NR~a~iRip~~~~~~~~riE~R~~da~aNPYL~lAailaAgl~Gi~~~l~p~~~~~~~~~~~-- 393 (458) +|||++++||.+||+++||||. ++|||+|++|++|||||++||+|+||++||+++++||+|+.++.+.+ T Consensus 299 ~~~~~waP~~~~WG~~NR~~~iRip~-----~~riE~R~~da~aNPYL~lAailaAGl~Gi~~~l~p~~p~~~~~~~~~~ 373 (435) T TIGR03105 299 TSGATWAPNFISYGGNNRTHMVRIPD-----PGRFELRLADGAANPYLAQAAILAAGLDGIERKLDPGPPRDINLYAEEL 373 (435) T ss_pred CCCCCCCCCEEEECCCCCCCEEEECC-----CCEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCH T ss_conf 65765576202313577655078237-----8713644899887989999999998887886589999977788655630 Q ss_pred -HHCCCCCCCCHHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCC Q ss_conf -7132000179899999987398989841337889999999999999999965999789986357 Q gi|254780185|r 394 -QNTIPRICISLRESLENLDKDREFLKVGNVFDDDQIDAFINLKMKEVLKLESSPSPVEFEMYCS 457 (458) Q Consensus 394 -~~~~~~LP~~L~eAl~~l~~d~~~~~~~~~lG~~~v~~~~~~K~~E~~~~~~~vt~wE~~~Y~~ 457 (458) ..++++||+||+|||++|++|+.+ +++||++|++.|+++|++||++|++.||+||+++||| T Consensus 374 ~~~~~~~LP~sL~eAl~~l~~~~~l---~~~lg~~~v~~y~~~K~~E~~~~~~~vt~wE~~~Yle 435 (435) T TIGR03105 374 AARGVETLPQNLLEALRALEADPLL---AEALGAEFVDEFLKLKRQEWEEYHRHVSDWEIDRYLD 435 (435) T ss_pred HHCCCCCCCCCHHHHHHHHHCCHHH---HHHCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCC T ss_conf 1268566688999999999759098---7854999999999999999999977999999985149 No 4 >COG0174 GlnA Glutamine synthetase [Amino acid transport and metabolism] Probab=100.00 E-value=0 Score=899.13 Aligned_cols=428 Identities=37% Similarity=0.635 Sum_probs=388.5 Q ss_pred CHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEEEHH----HHHHHHHHCCCCCCCCCHHCCCC-CCCCEEEEEECCCCEE Q ss_conf 989999999857983999999789982523787787----83055884598456651002468-9988899964110587 Q gi|254780185|r 4 KATSIIQKIAQEKVKFIDLRFTDLQGKFRHISMNVS----LFNENTLLNGITFDSSSIDGWHS-KHPNLLLIPDIETMHM 78 (458) Q Consensus 4 ~~~~i~~~l~~~~I~~V~~~~~Dl~G~~R~k~ip~~----~f~~~~~~~G~~fd~s~i~g~~~-~~~D~~l~PD~~T~~~ 78 (458) +.+++.+.+++++|++|+++|+|+.|++|+|++|++ .+. ..+..|+.||+||+.||.. +++||+|+||++|+++ T Consensus 2 ~~~~~~~~~~~~~v~~v~~~f~D~~G~~r~k~~p~~~~~~~~~-~~~~~g~~fdgss~~g~~~i~~sDm~l~Pd~~T~~~ 80 (443) T COG0174 2 TKEDVLKLLKENGVKFVDLRFTDLNGVLRGKTIPAEKPVSVLA-QLFEGGVVFDGSSIAGFEGIGESDMVLKPDLSTLVV 80 (443) T ss_pred CHHHHHHHHHHCCCEEEEEEEECCCCCEEEEEEECCCCHHHHH-HHHCCCCCCCCCCCCCCCCCCCCCEEEEECCCCEEE T ss_conf 1788999987289369999997799976226986655326777-643058676886412467789888699626675276 Q ss_pred CCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCE-EEEEECCCEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 75689722799852014754485302555323556799873034522-57731330899706777778654776667654 Q gi|254780185|r 79 DPFYAQSTVAFICNVYDPITLQPYNRDPRYTAQKAIDYLQTTDIGDT-LLLGITTEFFVFDNVHCTISPIKSGFALESTE 157 (458) Q Consensus 79 ~Pw~~~~~~~v~~d~~~~~~g~P~~~~PR~~Lk~~~~~l~~~~~G~~-~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~ 157 (458) +||..++|++|+||++++ +|+|++.|||++|||+++++++. |+. +++|+|+|||||+.+...... T Consensus 81 ~Pw~~~~ta~v~cdv~~~-~g~p~~~dPR~vlkr~~~~l~~~--G~~~~~~g~E~EFfLfd~~~~~~~~----------- 146 (443) T COG0174 81 DPWREGPTARVLCDVYDP-DGTPYPRDPRSVLKRALARLKDE--GLAPAVVGPELEFFLFDRDGRDPDG----------- 146 (443) T ss_pred CCCCCCCCEEEEEEEECC-CCCCCCCCHHHHHHHHHHHHHHC--CCCCCCEECCEEEEEEECCCCCCCC----------- T ss_conf 777788727999998879-99958988399999999999866--8854302024169996036677666----------- Q ss_pred CCCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 57777775003655455635524446899999999998605411222133562121036555668887769999999999 Q gi|254780185|r 158 FLQNGNNKGYDSHAKSSYTLPPQDKLHDMRSEIVSALNNIGVQITKYHNQINDAQHSFGLQSESLLHASDNLQKYKYSVH 237 (458) Q Consensus 158 ~~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~ei~~~l~~~Gi~ie~~h~E~gpGQ~Ei~~~~~~~l~aAD~~~~~k~~ik 237 (458) ++.....++|++.+.+...+++++|+..|+++||+++++|||+||||+||++.+.++|++||+++++|++|| T Consensus 147 --------~~~~~~g~yf~~~~~~~~~d~~~~i~~~l~~~Gi~ie~~hhEva~gQ~EI~~~~~~~l~~AD~~~~~K~vvk 218 (443) T COG0174 147 --------GRPADKGGYFDVAPLDEAEDFRRDIVEALEAAGIEIEAIHHEVAPGQFEINLRFDDALKAADQIVIFKYVVK 218 (443) T ss_pred --------CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHH T ss_conf --------766887764676542227899999999999879982630145058855783378788999899999999999 Q ss_pred HHHHHCCCEEEEEEEECCCCCCCHHHHHHHHCC-CCCCCEECC-CCCCCCHHHHHHHHHHHHHHHHCCHHCCCCCHHHHH Q ss_conf 998744975444121014568601201111114-676312215-556578999999999998864100000667001443 Q gi|254780185|r 238 QVANSYCKIATFMPKPIASHNGSGMYLNMSIHK-GEKAIFTGN-QHDRVSLKGLYYLGGIIKHAKSLNALTNASTNSYKR 315 (458) Q Consensus 238 ~iA~~~G~~ATFmpKP~~~~~GsG~Hih~Sl~~-~~~n~f~~~-~~~~ls~~~~~fiaGil~h~~al~a~~~PtvNSYkR 315 (458) +||++||++|||||||+.+.+|||||+|+|||+ +|+|+|.++ +..+||+.++|||||||+|+++++||+||||||||| T Consensus 219 ~vA~~hG~~aTFMpKP~~g~~GsGmH~H~Sl~~~dg~n~F~d~~~~~~lS~~~~~~igGilkha~~~~ai~~PtvNSYkR 298 (443) T COG0174 219 EVAEKHGLTATFMPKPFFGDNGSGMHVHQSLWDKDGGNLFADEDGYAGLSETALHFIGGILKHAPALTAITAPTVNSYKR 298 (443) T ss_pred HHHHHHCCEEEEECCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH T ss_conf 99999598599947877899977304898776489975204888852579999999999998789867675777510421 Q ss_pred H-CCCCCCCEEEEECCCCCCCEEECCCCC-CCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHH- Q ss_conf 1-888521201011368876401025788-7654124404675321279999999999998788559998665578877- Q gi|254780185|r 316 L-LTDSQSPTKLTYSTHNHSASCRIPYEN-KSDNKSIEIRFPDLSANPYLAPAAILMAGLDGIAKKIHPGKNIDELSLE- 392 (458) Q Consensus 316 l-~p~~~ap~~~~WG~~NR~a~iRip~~~-~~~~~riE~R~~da~aNPYL~lAailaAgl~Gi~~~l~p~~~~~~~~~~- 392 (458) | +|..|||++++||.+||||+||||.+. +++++|||+|+||++|||||++||+|+||||||++||+|++|+..+.++ T Consensus 299 l~vp~e~AP~~~~wg~~NRsa~iRIP~~~~~~~~~RiE~R~pdp~aNPYLa~AaiL~agldGI~~ki~p~~p~~~n~y~~ 378 (443) T COG0174 299 LGVPYEWAPTYIAWGVRNRSASVRIPASGANGKARRVEFRVPDPDANPYLAFAAILAAGLDGIENKLEPGEPVDGNLYEL 378 (443) T ss_pred CCCCCCCCCCHHCCCCCCCCEEEEECCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 48985558611100347751579957888777764457647999889899999999999977754799987445643228 Q ss_pred ---HHHCCCCCCCCHHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCC Q ss_conf ---771320001798999999873989898413378899999999999999999659997899863579 Q gi|254780185|r 393 ---EQNTIPRICISLRESLENLDKDREFLKVGNVFDDDQIDAFINLKMKEVLKLESSPSPVEFEMYCSI 458 (458) Q Consensus 393 ---~~~~~~~LP~~L~eAl~~l~~d~~~~~~~~~lG~~~v~~~~~~K~~E~~~~~~~vt~wE~~~Y~~~ 458 (458) ++.. +.||.||.|||++|+ +.++ ++++||++|++.|+++|+.||++|++.||+||+++|+++ T Consensus 379 ~~~e~~~-~~LP~sL~eAl~~l~-~~~~--~~~~lge~~~~~yi~~K~~E~~~~~~~v~~wE~~~yl~~ 443 (443) T COG0174 379 SPEERKE-PTLPASLREALDALE-DSEF--LREALGEDFIDAYIALKRAEWEEFRSRVTPWEFERYLSV 443 (443) T ss_pred CHHHHCC-CCCCCCHHHHHHHHH-HCHH--HHHHCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHC T ss_conf 9545233-567626999999862-0568--887628499999999999999999725999999988529 No 5 >pfam00120 Gln-synt_C Glutamine synthetase, catalytic domain. Probab=100.00 E-value=0 Score=594.93 Aligned_cols=255 Identities=36% Similarity=0.571 Sum_probs=230.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHH Q ss_conf 02555323556799873034522577313308997067777786547766676545777777500365545563552444 Q gi|254780185|r 103 NRDPRYTAQKAIDYLQTTDIGDTLLLGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNKGYDSHAKSSYTLPPQDK 182 (458) Q Consensus 103 ~~~PR~~Lk~~~~~l~~~~~G~~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~y~~~~~~~ 182 (458) +.|||++|||+++++++ .|+++++|+|+|||||+++.....+... .+........++|++.+.+. T Consensus 1 P~~PR~~Lk~~~~~~~~--~Gl~~~~g~E~EF~l~~~~~~~~~~~~~-------------~~~~~~~~~~~~y~~~~~~~ 65 (257) T pfam00120 1 PRDPRSILKRALARLAE--LGYTPYFGPEQEFFLFDDVRPGGGPPGG-------------YPDPRGPYYGGYFPVAPLDE 65 (257) T ss_pred CCCHHHHHHHHHHHHHH--CCCEEEEEECEEEEEEECCCCCCCCCCC-------------CCCCCCCCCCCCCCCCCCHH T ss_conf 98979999999999997--6995979956687998577777677668-------------87988876777577763031 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHH Q ss_conf 68999999999986054112221335621210365556688877699999999999987449754441210145686012 Q gi|254780185|r 183 LHDMRSEIVSALNNIGVQITKYHNQINDAQHSFGLQSESLLHASDNLQKYKYSVHQVANSYCKIATFMPKPIASHNGSGM 262 (458) Q Consensus 183 ~~~~~~ei~~~l~~~Gi~ie~~h~E~gpGQ~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpKP~~~~~GsG~ 262 (458) .++|+++|++.|+.+||+|+++|+|+|||||||++.|.++|+|||+++++|++||+||++||+.|||||||+.+.+|||| T Consensus 66 ~~~~~~~i~~~l~~~Gi~ve~~~~E~gpgQ~Ei~l~~~~~l~aAD~~~~~k~~ik~vA~~~G~~atFmpKP~~~~~GsG~ 145 (257) T pfam00120 66 ARDIRRDIVKALEAAGIPVEGIHHEVAPGQHEIDFRFADALEAADNLQLFKYVVKRVAEKHGLTATFMPKPIFGDNGSGM 145 (257) T ss_pred HHHHHHHHHHHHHHCCCCCEEEECCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEE T ss_conf 89999999999997799818866575798579962546498886999999999999999839679970354579998521 Q ss_pred HHHHHHCC-CCCCCEEC-CCCCCCCHHHHHHHHHHHHHHHHCCHHCCCCCHHHHHHCCCCCCCEEEEECCCCCCCEEECC Q ss_conf 01111114-67631221-55565789999999999988641000006670014431888521201011368876401025 Q gi|254780185|r 263 YLNMSIHK-GEKAIFTG-NQHDRVSLKGLYYLGGIIKHAKSLNALTNASTNSYKRLLTDSQSPTKLTYSTHNHSASCRIP 340 (458) Q Consensus 263 Hih~Sl~~-~~~n~f~~-~~~~~ls~~~~~fiaGil~h~~al~a~~~PtvNSYkRl~p~~~ap~~~~WG~~NR~a~iRip 340 (458) |+|+|||+ +|+|+|.+ +++.+||+.+++||||||+|++++++|++||+||||||+|++|||++++||++||+++|||| T Consensus 146 HiH~SL~d~~g~n~f~~~~~~~~lS~~~~~fiaGlL~h~~~l~a~~~Pt~NSYkRl~p~~~ap~~~~WG~~NR~a~iRip 225 (257) T pfam00120 146 HTHQSLWDKDGKNLFADGDGYAGLSETARHFIGGILKHAPAITAYTAPTVNSYKRLVPGTEAPVYIAWGVRNRSASIRIP 225 (257) T ss_pred EEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHCCCCCCCCCCCEEECCCCCCCEEEC T ss_conf 79986675889762468887422309999997638775687625368997656515799748666776156786117867 Q ss_pred CCCCCCCCEEEECCCCCCCCHHHHHHHHHHHH Q ss_conf 78876541244046753212799999999999 Q gi|254780185|r 341 YENKSDNKSIEIRFPDLSANPYLAPAAILMAG 372 (458) Q Consensus 341 ~~~~~~~~riE~R~~da~aNPYL~lAailaAg 372 (458) ..++++++|||+|++|++|||||++||+|+|| T Consensus 226 ~~~~~~~~riE~R~~da~aNPYL~lAailaAG 257 (257) T pfam00120 226 RGGGPKAGRVEDRLPDPDANPYLALAALLAAG 257 (257) T ss_pred CCCCCCCCEEEECCCCCCCCHHHHHHHHHHCC T ss_conf 78998774678678878878999999998668 No 6 >KOG0683 consensus Probab=100.00 E-value=8.9e-42 Score=280.33 Aligned_cols=341 Identities=18% Similarity=0.219 Sum_probs=261.6 Q ss_pred CCCEEEEEEECCCC-CEEEEEEEHHHHHHHHHH-CCCCCCCCCHHCCCCCCCCEEEEEECCCCEECCCCCCCEEEEEEEE Q ss_conf 98399999978998-252378778783055884-5984566510024689988899964110587756897227998520 Q gi|254780185|r 16 KVKFIDLRFTDLQG-KFRHISMNVSLFNENTLL-NGITFDSSSIDGWHSKHPNLLLIPDIETMHMDPFYAQSTVAFICNV 93 (458) Q Consensus 16 ~I~~V~~~~~Dl~G-~~R~k~ip~~~f~~~~~~-~G~~fd~s~i~g~~~~~~D~~l~PD~~T~~~~Pw~~~~~~~v~~d~ 93 (458) +--.+...|+|-.| .+|+|+-....-.+...+ .-..||+++.......++|.+|+|. .+.+.|+..++..+|+|++ T Consensus 28 ~kv~a~YVwidg~ge~~rsk~rt~d~~~~~~~~lp~wnydgsst~QA~g~nSd~~l~Pv--a~~~dPfr~g~Nilv~c~~ 105 (380) T KOG0683 28 KKVQAEYVWIDGTGENLRSKTRTLDAEPSSISELPIWNYDGSSTGQAPGENSDVYLRPV--AIYPDPFRNGNNILVMCDT 105 (380) T ss_pred CEEEEEEEEECCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEE--HCCCCCCCCCCCEEEEEEC T ss_conf 62799999965756530431322467756600076630468636656678875178630--0027855589977999433 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEECCC-CCCCCCCCCCCCCCCCCCCCCCCCCEECCCCC Q ss_conf 14754485302555323556799873034522577313308997067-77778654776667654577777750036554 Q gi|254780185|r 94 YDPITLQPYNRDPRYTAQKAIDYLQTTDIGDTLLLGITTEFFVFDNV-HCTISPIKSGFALESTEFLQNGNNKGYDSHAK 172 (458) Q Consensus 94 ~~~~~g~P~~~~PR~~Lk~~~~~l~~~~~G~~~~~g~E~EF~Lf~~~-~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~ 172 (458) ++ .++.|.+..-|+...+.+.+..-. .-.++.|.|.|+.+++.. .... .|+ ..|+..+.. T Consensus 106 ~~-~~~~P~~tn~R~~c~~~~~~~~~~--~~~PWfg~Eqeyt~l~~~~~~p~--------------gwp--~~GFp~Pqg 166 (380) T KOG0683 106 YD-FDGKPTETNKRVACARIMPKLSTK--DTEPWFGMEQEYTLLDALDGHPF--------------GWP--KGGFPGPQG 166 (380) T ss_pred CC-CCCCCCCCCCHHHHHHHHCCCCCC--CCCCCHHHHHHHHHHCCCCCCCC--------------CCC--CCCCCCCCC T ss_conf 66-789966422103478874345523--45875335678776504568846--------------477--668888877 Q ss_pred CCCCCCHHHH--HHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 5563552444--68999999999986054112221335621210365556688877699999999999987449754441 Q gi|254780185|r 173 SSYTLPPQDK--LHDMRSEIVSALNNIGVQITKYHNQINDAQHSFGLQSESLLHASDNLQKYKYSVHQVANSYCKIATFM 250 (458) Q Consensus 173 ~~y~~~~~~~--~~~~~~ei~~~l~~~Gi~ie~~h~E~gpGQ~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFm 250 (458) .+|..-..++ ..++.+..+.+|..+||.+...+.|++||||||.+.|+.++.|+|+++++|+++++||+++|+.|||. T Consensus 167 pyyc~VGad~~~~rdiveahy~acLyaGl~i~G~N~EvmPgQwEfqvGp~~GI~~gD~lw~aR~il~rVae~~Gviasf~ 246 (380) T KOG0683 167 PYYCGVGADRVFGRDIVEAHYRACLYAGLNISGINVEVMPGQWEFQVGPCEGISMGDQLWMARYILHRVAEKFGVIASFD 246 (380) T ss_pred CCEECCCCCCCCCCHHHHHHHHHHHHHHHEECCCCCCCCCCEEEEEECCHHCCCCCHHHHHHHHHHHHHHHHHCEEEEEC T ss_conf 70000266301241234306888774030233632313475048854450011220047899999999999849168725 Q ss_pred EEECCCC-CCCHHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHCCHHCCCCCHHHHHHCCCCCCCEEEEEC Q ss_conf 2101456-860120111111467631221555657899999999999886410000066700144318885212010113 Q gi|254780185|r 251 PKPIASH-NGSGMYLNMSIHKGEKAIFTGNQHDRVSLKGLYYLGGIIKHAKSLNALTNASTNSYKRLLTDSQSPTKLTYS 329 (458) Q Consensus 251 pKP~~~~-~GsG~Hih~Sl~~~~~n~f~~~~~~~ls~~~~~fiaGil~h~~al~a~~~PtvNSYkRl~p~~~ap~~~~WG 329 (458) |||..++ +|+|+|.+.|.-+ -+.+ |..++....+|.+++|+|..+..|. .+|| T Consensus 247 pKp~~g~WngaG~Htn~ST~~-------------------mr~~----~g~~~i~~a~~~ls~rh~~hi~~yd---p~~G 300 (380) T KOG0683 247 PKPILGDWNGAGCHTNFSTKE-------------------MREA----GGLKIIEEAIPKLSKRHREHIAAYD---PKGG 300 (380) T ss_pred CCCCCCCCCCCCCCCCCCHHH-------------------HHHC----CCHHHHHHHHHHHHHHHHHHHHHCC---CCCC T ss_conf 888777666765223202166-------------------6605----7789999986420023044233128---5678 Q ss_pred CCCCCCEEECCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCCCCCCCCHHHHHH Q ss_conf 68876401025788765412440467532127999999999999878855999866557887777132000179899999 Q gi|254780185|r 330 THNHSASCRIPYENKSDNKSIEIRFPDLSANPYLAPAAILMAGLDGIAKKIHPGKNIDELSLEEQNTIPRICISLRESLE 409 (458) Q Consensus 330 ~~NR~a~iRip~~~~~~~~riE~R~~da~aNPYL~lAailaAgl~Gi~~~l~p~~~~~~~~~~~~~~~~~LP~~L~eAl~ 409 (458) .+|+....+ . ...+.+.++...-|++|||+.++.+++|+.+|..+.-.|..+++. +.... -.+|.+|.+++. T Consensus 301 ~dN~rrltg--~--hEt~~i~~Fs~GvAnr~~siri~r~va~~~~Gy~edrrP~sN~Dp--y~Vt~--~~~~t~l~~~~~ 372 (380) T KOG0683 301 KDNERRLTG--R--HETGSIDNFSWGVANRNPSIRIPRTVAAEGKGYFEDRRPSSNCDP--YAVTL--MIIPTTLLEAVE 372 (380) T ss_pred CCCHHHHCC--C--CCCCCCCCCCCCCCCCCCEEEECHHHHCCCCCCCCCCCCCCCCCC--CEEEH--HHHHHHHHCCCC T ss_conf 633455318--8--865552200010346786166213121366531245898678873--20107--776677753420 Q ss_pred HHH Q ss_conf 987 Q gi|254780185|r 410 NLD 412 (458) Q Consensus 410 ~l~ 412 (458) ++ T Consensus 373 -~~ 374 (380) T KOG0683 373 -LE 374 (380) T ss_pred -HH T ss_conf -33 No 7 >COG3968 Uncharacterized protein related to glutamine synthetase [General function prediction only] Probab=99.92 E-value=8.9e-24 Score=168.02 Aligned_cols=278 Identities=17% Similarity=0.201 Sum_probs=188.8 Q ss_pred EEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 25773133089970677777865477666765457777775003655455635524446899999999998605411222 Q gi|254780185|r 125 TLLLGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNKGYDSHAKSSYTLPPQDKLHDMRSEIVSALNNIGVQITKY 204 (458) Q Consensus 125 ~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~ei~~~l~~~Gi~ie~~ 204 (458) ...+|.|+||||.+.......|+-.. ...+..+.++...+-.+++..+ .-.++...|++++-..|.++||+..+- T Consensus 215 ~s~~GaEQEYFlvd~~~~~~RpDLi~-tGRTLFGa~ppkGQEldDHYFG----aipeRV~~FM~Dve~~LyaLGIpaKTr 289 (724) T COG3968 215 FSNVGAEQEYFLVDKKSYDERPDLIF-TGRTLFGAPPPKGQELDDHYFG----AIPERVSAFMKDVEKELYALGIPAKTR 289 (724) T ss_pred CCCCCCCCEEEEECHHHCCCCCCEEE-ECHHHCCCCCCCCCCCCCHHCC----CCHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 36778431368861101034856266-2402128999987612200012----447999999988999999808853334 Q ss_pred CCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHHC-CCCCCCEECCCCCC Q ss_conf 13356212103655566888776999999999999874497544412101456860120111111-46763122155565 Q gi|254780185|r 205 HNQINDAQHSFGLQSESLLHASDNLQKYKYSVHQVANSYCKIATFMPKPIASHNGSGMYLNMSIH-KGEKAIFTGNQHDR 283 (458) Q Consensus 205 h~E~gpGQ~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpKP~~~~~GsG~Hih~Sl~-~~~~n~f~~~~~~~ 283 (458) |.|++||||||..-+.++--|+|+..+..+++|.+|+|||+.+-..-|||.|.+|||-|+|+|+- +.|.|+.++.+-.. T Consensus 290 HNEVAPgQfEIApife~~N~A~DhqQL~M~vLk~tA~KhGlVCLLHEKPFAGiNGSGKH~NWSmGtd~g~NLLdPgD~Ph 369 (724) T COG3968 290 HNEVAPGQFEIAPIFESGNLATDHQQLVMEVLKKTALKHGLVCLLHEKPFAGINGSGKHNNWSMGTDDGLNLLDPGDMPH 369 (724) T ss_pred CCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 56558775244100115653320799999999998876255888615875675788775645414687756579999987 Q ss_pred CCHHHHHHHH----HHHHHHHHCCHHCCCCCHHHHHHCCCCCCC-EEE-------------------------------- Q ss_conf 7899999999----999886410000066700144318885212-010-------------------------------- Q gi|254780185|r 284 VSLKGLYYLG----GIIKHAKSLNALTNASTNSYKRLLTDSQSP-TKL-------------------------------- 326 (458) Q Consensus 284 ls~~~~~fia----Gil~h~~al~a~~~PtvNSYkRl~p~~~ap-~~~-------------------------------- 326 (458) -.....-|.+ ++-++.+-+ -..+.+..-=.||..+ +|| ..+ T Consensus 370 dN~QFL~Fc~AvIkaVdkY~~Ll-Ra~~a~AsNDhRLGAN-EAPPAI~SVflGdqLedifEqi~~G~~~ssk~~g~mdLg 447 (724) T COG3968 370 DNKQFLLFCTAVIKAVDKYADLL-RASAANASNDHRLGAN-EAPPAIISVFLGDQLEDIFEQIEKGKATSSKGNGKMDLG 447 (724) T ss_pred CCCEEEHHHHHHHHHHHHHHHHH-HHHHHCCCCCCCCCCC-CCCCCEEEEECCCHHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 56000021588998788879999-9987515775345567-799612676403069999999964877666667632443 Q ss_pred ---------EECCCCCCCEEECCCCCCCCCCEEEECCCCCC---CCHHHHHHHHHHHHHHHHHHCCC----CCCCC---- Q ss_conf ---------11368876401025788765412440467532---12799999999999987885599----98665---- Q gi|254780185|r 327 ---------TYSTHNHSASCRIPYENKSDNKSIEIRFPDLS---ANPYLAPAAILMAGLDGIAKKIH----PGKNI---- 386 (458) Q Consensus 327 ---------~WG~~NR~a~iRip~~~~~~~~riE~R~~da~---aNPYL~lAailaAgl~Gi~~~l~----p~~~~---- 386 (458) .-|..||+.++-. .+.+||+|.++++ +-|-.+|-+++|-.+.-|..+|+ -+..+ T Consensus 448 ~~vlP~v~kdAgDRNRTSPFAF------TGNkFEFRavgSSqSvs~P~tVLN~~vAesl~~i~drLe~kl~kg~dld~ai 521 (724) T COG3968 448 ISVLPAVEKDAGDRNRTSPFAF------TGNKFEFRAVGSSQSVSEPNTVLNVIVAESLSEIADRLEGKLRKGSDLDAAI 521 (724) T ss_pred HHHCCCCCCCCCCCCCCCCCEE------CCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHH T ss_conf 1205566445565567887303------0661368527865445664578889999999999899887763246578899 Q ss_pred ----------------CCCCHH--------HHHCCCCCCCCHHHHHHHHHHCHH Q ss_conf ----------------578877--------771320001798999999873989 Q gi|254780185|r 387 ----------------DELSLE--------EQNTIPRICISLRESLENLDKDRE 416 (458) Q Consensus 387 ----------------~~~~~~--------~~~~~~~LP~~L~eAl~~l~~d~~ 416 (458) .++.|. +..+++.|- +-.+|+.+|.+++- T Consensus 522 ~~vl~et~k~h~~vvFnGnGYSeeWhkeAekeRGL~Nlr-TT~dA~~~L~~~kn 574 (724) T COG3968 522 EAVLTETVKDHKKVVFNGNGYSEEWHKEAEKERGLPNLR-TTVDATKALIADKN 574 (724) T ss_pred HHHHHHHHHHHCEEEECCCCCCHHHHHHHHHHCCCCHHH-HHHHHHHHHCCHHH T ss_conf 999999998616687579875489999999853863035-46778887503267 No 8 >pfam03951 Gln-synt_N Glutamine synthetase, beta-Grasp domain. Probab=99.71 E-value=3.8e-17 Score=126.67 Aligned_cols=81 Identities=42% Similarity=0.838 Sum_probs=74.6 Q ss_pred CCCEEEEEEECCCCCEEEEEEEHHHHHHHHHHCCCCCCCCCHHCCCC-CCCCEEEEEECCCCEECCCCCCC--EEEEEEE Q ss_conf 98399999978998252378778783055884598456651002468-99888999641105877568972--2799852 Q gi|254780185|r 16 KVKFIDLRFTDLQGKFRHISMNVSLFNENTLLNGITFDSSSIDGWHS-KHPNLLLIPDIETMHMDPFYAQS--TVAFICN 92 (458) Q Consensus 16 ~I~~V~~~~~Dl~G~~R~k~ip~~~f~~~~~~~G~~fd~s~i~g~~~-~~~D~~l~PD~~T~~~~Pw~~~~--~~~v~~d 92 (458) +|++|+++|+|+.|.+++.++|++++.+.++.+|+.|||||+.||.. ..+||+|+||++|+.++||++++ +++|+|| T Consensus 1 ~v~~v~~~F~D~~G~~~~~~~~~~~~~~~~~~~g~~FDGSSi~G~~~i~~SDm~L~Pd~~t~~~dP~~~~~~~~~~v~Cd 80 (84) T pfam03951 1 GVKFVDLRFTDILGRLKHVRIPATELDEDAFEEGIGFDGSSIEGFAPINESDMYLKPDPSTAYIDPFRPDPGKTARVICD 80 (84) T ss_pred CCEEEEEEEECCCCCEEEEEEEHHHHCHHHHHCCCEEECHHEECCCCCCCCCEEEEECCCCEEECCCCCCCCCEEEEEEE T ss_conf 97899999978999738999868992877875795073000320003787766996579971883305899878999997 Q ss_pred ECCC Q ss_conf 0147 Q gi|254780185|r 93 VYDP 96 (458) Q Consensus 93 ~~~~ 96 (458) +|++ T Consensus 81 v~~p 84 (84) T pfam03951 81 VYDP 84 (84) T ss_pred CCCC T ss_conf 3088 No 9 >PRK13517 carboxylate-amine ligase; Provisional Probab=98.93 E-value=7.8e-08 Score=68.58 Aligned_cols=199 Identities=16% Similarity=0.222 Sum_probs=120.5 Q ss_pred EEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 25773133089970677777865477666765457777775003655455635524446899999999998605411222 Q gi|254780185|r 125 TLLLGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNKGYDSHAKSSYTLPPQDKLHDMRSEIVSALNNIGVQITKY 204 (458) Q Consensus 125 ~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~ei~~~l~~~Gi~ie~~ 204 (458) .+-+|+|.||+|+|.......| .-.++...+...+. -..+ T Consensus 8 ~~TlGvEeE~~LVD~~t~~l~~---------------------------------------~a~~vl~~~~~~~~-~~~v 47 (371) T PRK13517 8 RPTLGVEEEFLLVDPVTGELSP---------------------------------------RATEVLAEAGENGP-EPRL 47 (371) T ss_pred CCCCEEEEEEEEECCCCCCCCC---------------------------------------CHHHHHHHHHHCCC-CCCC T ss_conf 9961603214334478678566---------------------------------------44999997343047-7675 Q ss_pred CCCCCCHHHCCCCCCC-CHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEE-ECCCCC------------------------ Q ss_conf 1335621210365556-68887769999999999998744975444121-014568------------------------ Q gi|254780185|r 205 HNQINDAQHSFGLQSE-SLLHASDNLQKYKYSVHQVANSYCKIATFMPK-PIASHN------------------------ 258 (458) Q Consensus 205 h~E~gpGQ~Ei~~~~~-~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpK-P~~~~~------------------------ 258 (458) ++|...+|.|++..++ +.-.+.+++.-.|..+...|.++|+...-... |+.... T Consensus 48 ~~El~~~~iEi~t~~~~~~~el~~~L~~~r~~l~~aA~~~G~~l~a~GthP~~~~~~~~~t~~~RY~~l~~~~~~~~~~~ 127 (371) T PRK13517 48 QKELLRNTVEVVTGVCDTVGEAREDLRRTRAVARRAADRRGARLAAAGTHPFSPWSDQPLTDNPRYRELIERTQWWAREQ 127 (371) T ss_pred CHHHHHHEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 75655016886687779999999999999999999999859978722677888865577896478999999999999876 Q ss_pred -CCHHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHCCHHCCCC---------CHHHHHHC----CCCC-CC Q ss_conf -601201111114676312215556578999999999998864100000667---------00144318----8852-12 Q gi|254780185|r 259 -GSGMYLNMSIHKGEKAIFTGNQHDRVSLKGLYYLGGIIKHAKSLNALTNAS---------TNSYKRLL----TDSQ-SP 323 (458) Q Consensus 259 -GsG~Hih~Sl~~~~~n~f~~~~~~~ls~~~~~fiaGil~h~~al~a~~~Pt---------vNSYkRl~----p~~~-ap 323 (458) =.|||||+.+-+. +.+-..+..+.-++|-+.|+.+.+ .-|||..+ |..- .| T Consensus 128 ~i~G~HVHVgv~d~--------------e~av~v~n~lr~~lP~lLALSAnSPf~~G~DTG~aS~R~~~f~~~P~~G~p~ 193 (371) T PRK13517 128 LICGVHVHVGVPSR--------------EKAVPVLNRLRPWLPHLLALSANSPFWRGADTGYASNRAMLFQQWPTAGPPP 193 (371) T ss_pred HHHEEEEEECCCCH--------------HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCC T ss_conf 14025798347988--------------9999999999999899999843895306866763457999997488889996 Q ss_pred EEEEECC---------------CCCCC--EEECCCCCCCCCCEEEECCCCCCCCHH--HHHHHHHHHHHHHHHHCCC Q ss_conf 0101136---------------88764--010257887654124404675321279--9999999999987885599 Q gi|254780185|r 324 TKLTYST---------------HNHSA--SCRIPYENKSDNKSIEIRFPDLSANPY--LAPAAILMAGLDGIAKKIH 381 (458) Q Consensus 324 ~~~~WG~---------------~NR~a--~iRip~~~~~~~~riE~R~~da~aNPY--L~lAailaAgl~Gi~~~l~ 381 (458) .+-+|.. +++.. -+| | ++.-.++|+|+.|+..++= +++||++.|-..-+-+.++ T Consensus 194 ~f~s~~~y~~~v~~l~~~G~i~d~~~iwWdvR-P---s~~~~TvE~Rv~D~~~~~~d~~aiaal~~aLv~~~~~~~~ 266 (371) T PRK13517 194 QFGSWAEYEAFVADLLKTGVILDEGMVYWDIR-P---SPNFGTVEVRVADVPLTLRETVALAALVRCLVVTADRRLD 266 (371) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEC-C---CCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 54898999999999986397287882798616-6---7888852354058889999999999999999999998887 No 10 >TIGR02050 gshA_cyan_rel uncharacterized enzyme; InterPro: IPR011793 This family represents a division of a larger family, the other branch of which is predicted to act as glutamate--cysteine ligase (the first of two enzymes in glutathione biosynthesis) in the cyanobacteria. Species containing this protein, however, are generally not believed to make glutathione, and the function is unknown.. Probab=98.85 E-value=5.2e-08 Score=69.66 Aligned_cols=211 Identities=17% Similarity=0.226 Sum_probs=135.6 Q ss_pred EEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 73133089970677777865477666765457777775003655455635524446899999999998605411222133 Q gi|254780185|r 128 LGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNKGYDSHAKSSYTLPPQDKLHDMRSEIVSALNNIGVQITKYHNQ 207 (458) Q Consensus 128 ~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~ei~~~l~~~Gi~ie~~h~E 207 (458) +|+|.||+|+|.......+.. +..-.++.+.-+.+..- =.-..+.+| T Consensus 2 lGvEeE~~LVD~~~~d~~~~~--------------------------------~~~~~~~~~~~~~~~~~-PGGy~~~~E 48 (297) T TIGR02050 2 LGVEEELLLVDPHTYDLAASS--------------------------------DEAAALVGECREKLGED-PGGYEFKKE 48 (297) T ss_pred CHHEEEHHHCCCCCCCCCCCC--------------------------------HHHHHHHHCCCCCCCCC-CCCCCCCHH T ss_conf 402000212055678876661--------------------------------04899986077411368-886320422 Q ss_pred CCCHHHCCCCCC-CCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEE-EECCCCC-----------------C--------C Q ss_conf 562121036555-66888776999999999999874497544412-1014568-----------------6--------0 Q gi|254780185|r 208 INDAQHSFGLQS-ESLLHASDNLQKYKYSVHQVANSYCKIATFMP-KPIASHN-----------------G--------S 260 (458) Q Consensus 208 ~gpGQ~Ei~~~~-~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmp-KP~~~~~-----------------G--------s 260 (458) .=-.|+|+...+ .+.-+|.-.+--.|..+++.|..+|+...=.+ -||+... | . T Consensus 49 ~~~~~vE~at~v~~~~~eA~~~~~~~r~~l~~~A~~~G~~~~~~gtHPf~~W~~~~~~~~~rY~~~~e~~gy~~~~~Lv~ 128 (297) T TIGR02050 49 LFESQVELATPVCTTLAEALAELREVRARLVEAASDLGLRIAAAGTHPFARWREQELAENPRYQRLLERYGYVARQSLVF 128 (297) T ss_pred HHHCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCCCCHHHHHHHHHCCEEEHHEEEC T ss_conf 22144446568447878999999999999999986648636513467753331477786555899998618000001321 Q ss_pred HHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHCCHHCCCCC---------HHHHHHC----CCCC-CCEEE Q ss_conf 12011111146763122155565789999999999988641000006670---------0144318----8852-12010 Q gi|254780185|r 261 GMYLNMSIHKGEKAIFTGNQHDRVSLKGLYYLGGIIKHAKSLNALTNAST---------NSYKRLL----TDSQ-SPTKL 326 (458) Q Consensus 261 G~Hih~Sl~~~~~n~f~~~~~~~ls~~~~~fiaGil~h~~al~a~~~Ptv---------NSYkRl~----p~~~-ap~~~ 326 (458) |+|||+++- +. +.+-.-|-+|+.++|-|.||-|.+. -|||+-. |..- .+... T Consensus 129 G~HVHVgv~--------s~------~~~~~vl~rl~~~lP~LlALSAsSPfw~G~DTgyAS~R~~~~~~~P~~G~p~~f~ 194 (297) T TIGR02050 129 GLHVHVGVP--------SP------DDAVAVLNRLLPWLPHLLALSASSPFWQGRDTGYASYRANIFQEWPTAGLPPAFQ 194 (297) T ss_pred CEEEEECCC--------CH------HHHHHHHHHHHHHHHHHHHHHHHCHHHCCCCCHHHHHHHHHHHHCCCCCCCHHHC T ss_conf 134770327--------64------3588998656667899999986151115825237788999998466678742220 Q ss_pred EE--------------CCCCCCC---EEECCCCCCCCCCEEEECCCCCCCCHH--HHHHHHHHHHHHHHHH---CCCCCC Q ss_conf 11--------------3688764---010257887654124404675321279--9999999999987885---599986 Q gi|254780185|r 327 TY--------------STHNHSA---SCRIPYENKSDNKSIEIRFPDLSANPY--LAPAAILMAGLDGIAK---KIHPGK 384 (458) Q Consensus 327 ~W--------------G~~NR~a---~iRip~~~~~~~~riE~R~~da~aNPY--L~lAailaAgl~Gi~~---~l~p~~ 384 (458) +| ..++..- .||. ++.-.-||+|++|+..|+= +++||++-|-.+=.-+ ..+-|. T Consensus 195 ~w~ay~~~~a~l~~tG~i~~~g~l~wDiRP----s~~~gTvEvRv~D~~~~l~~~~~~aaL~rALv~~~~~rYD~~~~p~ 270 (297) T TIGR02050 195 SWDAYEAYVADLRETGVIEDDGDLYWDIRP----SSHYGTVEVRVADACLNLEEALALAALIRALVEELLRRYDEWDAPF 270 (297) T ss_pred CHHHHHHHHHHHHHCCCCCCCCCEEEEEEE----CCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 589999999999854892478866883450----6998826887148998899999999999999999997515122779 Q ss_pred CCCCC Q ss_conf 65578 Q gi|254780185|r 385 NIDEL 389 (458) Q Consensus 385 ~~~~~ 389 (458) ++.+. T Consensus 271 ~~~~~ 275 (297) T TIGR02050 271 PVPED 275 (297) T ss_pred CCCCC T ss_conf 98764 No 11 >PRK13515 carboxylate-amine ligase; Provisional Probab=98.78 E-value=7e-07 Score=62.61 Aligned_cols=199 Identities=17% Similarity=0.260 Sum_probs=124.0 Q ss_pred EEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 25773133089970677777865477666765457777775003655455635524446899999999998605411222 Q gi|254780185|r 125 TLLLGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNKGYDSHAKSSYTLPPQDKLHDMRSEIVSALNNIGVQITKY 204 (458) Q Consensus 125 ~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~ei~~~l~~~Gi~ie~~ 204 (458) .+-+|+|.||+|.|.......+ ...+. +.+.+... .-+.+ T Consensus 5 ~~TlGVEeE~~LvD~~t~~l~~-----------------------------------~a~~~---l~~~~~~~--~~~~v 44 (381) T PRK13515 5 EFTFGIEEEYLLVDAETLDLAR-----------------------------------SAPAA---LMEACRDA--LGEQF 44 (381) T ss_pred CCCCEEEEEEEEECCCCCCCCC-----------------------------------CHHHH---HHHHHHHH--CCCCC T ss_conf 9874523446667588656675-----------------------------------23899---99853331--25666 Q ss_pred CCCCCCHHHCCCCCC-CCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEE-ECCCC------------------------- Q ss_conf 133562121036555-668887769999999999998744975444121-01456------------------------- Q gi|254780185|r 205 HNQINDAQHSFGLQS-ESLLHASDNLQKYKYSVHQVANSYCKIATFMPK-PIASH------------------------- 257 (458) Q Consensus 205 h~E~gpGQ~Ei~~~~-~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpK-P~~~~------------------------- 257 (458) ++|.-.+|.|++..+ .+.-.+.+++...|..+.+.|.++|+...--.. ||... T Consensus 45 ~~El~~~qiEi~T~v~~~~~e~~~~L~~~r~~l~~aA~~~g~~l~a~GthP~~~w~~~~~t~~~RY~~l~~~~~~~a~~~ 124 (381) T PRK13515 45 APEMLQSQIEVGTPVCATLAEAREELGRLRQRLAELAAQFGLRIIAAGTHPFADWRRQRPTPKERYAQLFEDLGGVARRS 124 (381) T ss_pred CHHHHHCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 75666237994687779999999999999999999999869889830789888876687896078999999999999864 Q ss_pred CCCHHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHCCHHCCCCC---------HHHHHHC----CCCC-CC Q ss_conf 86012011111146763122155565789999999999988641000006670---------0144318----8852-12 Q gi|254780185|r 258 NGSGMYLNMSIHKGEKAIFTGNQHDRVSLKGLYYLGGIIKHAKSLNALTNAST---------NSYKRLL----TDSQ-SP 323 (458) Q Consensus 258 ~GsG~Hih~Sl~~~~~n~f~~~~~~~ls~~~~~fiaGil~h~~al~a~~~Ptv---------NSYkRl~----p~~~-ap 323 (458) .=.|||||+-+-|. +.+-.-+..+.-++|.|.|+.+.+. .|||..+ |..- .| T Consensus 125 ~i~G~HVHVgv~d~--------------e~av~v~n~lr~~LP~LLALSAsSPFw~G~DTG~aS~R~~~~~~~P~aG~P~ 190 (381) T PRK13515 125 LICGMHVHVGVPDS--------------EDRMALMNQVLYFLPHLLALSTSSPFWGGRDTGLKSYRRAIFGELPRTGLPP 190 (381) T ss_pred HHEEEEEEECCCCH--------------HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCC T ss_conf 32423898368988--------------9999999999988899999970893104787764416876886488889997 Q ss_pred EEEEECC--------------CCCCC---EEECCCCCCCCCCEEEECCCCCCCCHH--HHHHHHHHHHHHHHHHCCC Q ss_conf 0101136--------------88764---010257887654124404675321279--9999999999987885599 Q gi|254780185|r 324 TKLTYST--------------HNHSA---SCRIPYENKSDNKSIEIRFPDLSANPY--LAPAAILMAGLDGIAKKIH 381 (458) Q Consensus 324 ~~~~WG~--------------~NR~a---~iRip~~~~~~~~riE~R~~da~aNPY--L~lAailaAgl~Gi~~~l~ 381 (458) .+-+|.. .+... -|| | +++-..+|+|++|+..+.= +++||++.|-..-+-+..+ T Consensus 191 ~f~s~~~y~~~v~~l~~tG~i~d~~~i~WdvR-P---s~~~pTlEvRv~D~~~~~~~~~~iAal~raLv~~~~~~~~ 263 (381) T PRK13515 191 LFPSWAEYQRYVALLVRTGCIDDDGKIWWDLR-P---SPRFPTLELRICDVCPRLDDALALAALFRALVRHLYRLRR 263 (381) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCEEEEEEC-C---CCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 76899999999999986599687880589825-7---8999962575068879999999999999999999998787 No 12 >PRK13518 carboxylate-amine ligase; Provisional Probab=98.70 E-value=1.1e-06 Score=61.43 Aligned_cols=161 Identities=17% Similarity=0.191 Sum_probs=104.4 Q ss_pred CCCCCCCCCCCHHHCCCCCCC-CHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEE-ECCCC-------------------- Q ss_conf 112221335621210365556-68887769999999999998744975444121-01456-------------------- Q gi|254780185|r 200 QITKYHNQINDAQHSFGLQSE-SLLHASDNLQKYKYSVHQVANSYCKIATFMPK-PIASH-------------------- 257 (458) Q Consensus 200 ~ie~~h~E~gpGQ~Ei~~~~~-~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpK-P~~~~-------------------- 257 (458) .-+.+++|.-.+|.|++..++ +.-++.+++...|..+.+.|.++|+..-=... ||... T Consensus 47 ~~~~v~~El~~~qvEi~T~vc~~~~e~~~~L~~~R~~l~~aA~~~G~~l~a~GThP~~~w~~~~it~~~RY~~l~~~~~~ 126 (357) T PRK13518 47 LAGRLDHELFKFVIETQTPTLNDVAAAPDALREVRDALVAHAEEHGFRIAAAGLHPGAKWRELEHAEKPRYRSQLDRIQY 126 (357) T ss_pred CCCCCCHHHHCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 45655746655479945867699999999999999999999998698899626898888776888997789999999889 Q ss_pred -----CCCHHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHCCHHCCCCC---------HHHHHHC----CC Q ss_conf -----86012011111146763122155565789999999999988641000006670---------0144318----88 Q gi|254780185|r 258 -----NGSGMYLNMSIHKGEKAIFTGNQHDRVSLKGLYYLGGIIKHAKSLNALTNAST---------NSYKRLL----TD 319 (458) Q Consensus 258 -----~GsG~Hih~Sl~~~~~n~f~~~~~~~ls~~~~~fiaGil~h~~al~a~~~Ptv---------NSYkRl~----p~ 319 (458) .=.|||||+.+-+. +.+-.-+..+.-++|.|.||.+.+. .|||-.. |. T Consensus 127 ~ar~~li~G~HVHVGv~d~--------------e~av~v~n~lr~~LP~LlALSAnSPFw~G~DTG~aS~R~~~~~~~P~ 192 (357) T PRK13518 127 PQHRNTTAGLHIHVGVDDP--------------DKAVWVANELRWHMPVLLALSANSPYWNGFDTGLASARAKIFEALPN 192 (357) T ss_pred HHHHHHHEEEEEEECCCCH--------------HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCC T ss_conf 9998746103799617978--------------99999999888778999999717942048888751089999972764 Q ss_pred CC-CCEEEEECC--------------CCCCC---EEECCCCCCCCCCEEEECCCCCCCCHH--HHHHHHHHHHHHHHHH Q ss_conf 52-120101136--------------88764---010257887654124404675321279--9999999999987885 Q gi|254780185|r 320 SQ-SPTKLTYST--------------HNHSA---SCRIPYENKSDNKSIEIRFPDLSANPY--LAPAAILMAGLDGIAK 378 (458) Q Consensus 320 ~~-ap~~~~WG~--------------~NR~a---~iRip~~~~~~~~riE~R~~da~aNPY--L~lAailaAgl~Gi~~ 378 (458) .- .|.+-+|.. ++.+- -|| | +++-.-+|+|++|...+.= +++||++.|-..=+-+ T Consensus 193 aG~P~~f~s~~~y~~~v~~L~~tG~I~d~~~i~WDiR-P---s~~~pTvEvRV~D~~~~~~~~~~lAaL~raLv~~~~~ 267 (357) T PRK13518 193 TGMPTAFEDYEAFQAFERRMVETGSIEDRGELWYDVR-P---HSGHGTVEVRAPDAQADPDVVLAFVEYAHALVTDLAA 267 (357) T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEC-C---CCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 7999544899999999999997398277884589944-2---6999972576578889989999999999999999999 No 13 >PRK13516 carboxylate-amine ligase; Provisional Probab=98.69 E-value=1.7e-06 Score=60.26 Aligned_cols=192 Identities=17% Similarity=0.174 Sum_probs=120.8 Q ss_pred EEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 25773133089970677777865477666765457777775003655455635524446899999999998605411222 Q gi|254780185|r 125 TLLLGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNKGYDSHAKSSYTLPPQDKLHDMRSEIVSALNNIGVQITKY 204 (458) Q Consensus 125 ~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~ei~~~l~~~Gi~ie~~ 204 (458) .+-+|+|-||+|+|.......+ ....++.++.. . ..-..+ T Consensus 11 ~~T~GVEeE~~LvD~~t~~l~~-----------------------------------~a~~vl~~~~~----~-~~~~~v 50 (372) T PRK13516 11 PFTLGVELELQVVNPHDYDLTQ-----------------------------------ASSDLLRAVKN----Q-ITAGEI 50 (372) T ss_pred CCCCEEEEEEEEECCCCCCCCC-----------------------------------CHHHHHHHHCC----C-CCCCCC T ss_conf 9821656668776489756675-----------------------------------42999997113----4-356656 Q ss_pred CCCCCCHHHCCCCCC-CCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEE-ECCCC------------------------- Q ss_conf 133562121036555-668887769999999999998744975444121-01456------------------------- Q gi|254780185|r 205 HNQINDAQHSFGLQS-ESLLHASDNLQKYKYSVHQVANSYCKIATFMPK-PIASH------------------------- 257 (458) Q Consensus 205 h~E~gpGQ~Ei~~~~-~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpK-P~~~~------------------------- 257 (458) ++|.-.+|.|++..+ .+.=++.+++...|..+.++|.++|+...--.. ||... T Consensus 51 ~~El~~~qvEi~T~v~~~~~e~~~~L~~~R~~l~~~A~~~g~~l~a~GthP~~~~~~~~it~~~RY~~l~~~~~~~a~~~ 130 (372) T PRK13516 51 KPEITESMIEIATGVCRDIDQALGQLSAMRDVLVQAADDLNLGICGGGTHPFQQWQRQRICDNPRFQYLSELYGYLAKQF 130 (372) T ss_pred CHHHHHHEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 74765134784683669999999999999999999999829877633789888755477896278999999999999988 Q ss_pred CCCHHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHCCHHCCCCC---------HHHHHHC----CCCC-CC Q ss_conf 86012011111146763122155565789999999999988641000006670---------0144318----8852-12 Q gi|254780185|r 258 NGSGMYLNMSIHKGEKAIFTGNQHDRVSLKGLYYLGGIIKHAKSLNALTNAST---------NSYKRLL----TDSQ-SP 323 (458) Q Consensus 258 ~GsG~Hih~Sl~~~~~n~f~~~~~~~ls~~~~~fiaGil~h~~al~a~~~Ptv---------NSYkRl~----p~~~-ap 323 (458) .=.|||||+.+-+. +.+-..+..|.-++|.|.||.+.+. -|||... |..- .| T Consensus 131 ~i~G~HVHVgv~d~--------------e~av~v~n~lr~~LP~LLALSAnSPFw~G~DTG~aS~R~~~~~~~P~aG~pp 196 (372) T PRK13516 131 TVFGQHVHIGCPSG--------------DDAIYLLHGLSRYVPHFIALSASSPYVQGVDTGFASARLNSVSAFPLSGRAP 196 (372) T ss_pred HHHCEEEEECCCCH--------------HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCC T ss_conf 46102688457978--------------8899999998888789999842893205775875002465786287889899 Q ss_pred EEEEECC--------------CCCCC---EEECCCCCCCCCCEEEECCCCCCCCHH--HHHHHHHHHHHH Q ss_conf 0101136--------------88764---010257887654124404675321279--999999999998 Q gi|254780185|r 324 TKLTYST--------------HNHSA---SCRIPYENKSDNKSIEIRFPDLSANPY--LAPAAILMAGLD 374 (458) Q Consensus 324 ~~~~WG~--------------~NR~a---~iRip~~~~~~~~riE~R~~da~aNPY--L~lAailaAgl~ 374 (458) .+-+|.. .|..- -|| | +++-..+|+|++|+..++= +++||++-|-.. T Consensus 197 ~f~s~~~y~~~v~~L~~tG~i~d~~~i~WdvR-P---s~~~pTvEvRv~D~~~~~~~~~~iAaL~raLv~ 262 (372) T PRK13516 197 FVLNWQEFEAYFRKMSYTGVIDSMKDFYWDIR-P---KPEFGTVEVRVMDTPLTVERAAAIAGYIQALAR 262 (372) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEC-C---CCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHH T ss_conf 86999999999999987187376782589833-6---899996035403477999999999999999999 No 14 >pfam04107 GCS2 Glutamate-cysteine ligase family 2(GCS2). Also known as gamma-glutamylcysteine synthetase and gamma-ECS (EC:6.3.2.2). This enzyme catalyses the first and rate limiting step in de novo glutathione biosynthesis. Members of this family are found in archaea, bacteria and plants. May and Leaver discuss the possible evolutionary origins of glutamate-cysteine ligase enzymes in different organisms and suggest that it evolved independently in different eukaryotes, from an ancestral bacterial enzyme. They also state that Arabidopsis thaliana gamma-glutamylcysteine synthetase is structurally unrelated to mammalian, yeast and Escherichia coli homologues. In plants, there are separate cytosolic and chloroplast forms of the enzyme. Probab=98.23 E-value=1.4e-05 Score=54.52 Aligned_cols=160 Identities=20% Similarity=0.200 Sum_probs=91.7 Q ss_pred CCCCCCCCCCHHHCCCCCC-CCHHHHHHHHHHHHHHHHHHHHHCCCEEE--------------EEEEECC-------C-- Q ss_conf 1222133562121036555-66888776999999999999874497544--------------4121014-------5-- Q gi|254780185|r 201 ITKYHNQINDAQHSFGLQS-ESLLHASDNLQKYKYSVHQVANSYCKIAT--------------FMPKPIA-------S-- 256 (458) Q Consensus 201 ie~~h~E~gpGQ~Ei~~~~-~~~l~aAD~~~~~k~~ik~iA~~~G~~AT--------------FmpKP~~-------~-- 256 (458) -....+|--.||+|++..+ .+.-.+.+.+...+..++++|.++|+... ++|||-. . T Consensus 37 ~~~~~~e~~~~qiEist~p~~~~~e~~~~l~~~~~~l~~~a~~~g~~l~~~Gt~P~~~~~~~~~~~~~RY~~m~~~~~~~ 116 (291) T pfam04107 37 GGGFVKELPGGQVELSTPPLESLAEAAEEISQHREELRHVADELGLGLLGLGTHPFALRSRDPVMPKGRYRRMYEYMPRV 116 (291) T ss_pred CCCEEECCCCCEEEECCHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 89653326898899714221899999999999999999999982980682277877887547768884257999999875 Q ss_pred -CCC-----CHHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHCCHHCCCC---------CHHHHHHC---- Q ss_conf -686-----01201111114676312215556578999999999998864100000667---------00144318---- Q gi|254780185|r 257 -HNG-----SGMYLNMSIHKGEKAIFTGNQHDRVSLKGLYYLGGIIKHAKSLNALTNAS---------TNSYKRLL---- 317 (458) Q Consensus 257 -~~G-----sG~Hih~Sl~~~~~n~f~~~~~~~ls~~~~~fiaGil~h~~al~a~~~Pt---------vNSYkRl~---- 317 (458) ..| +|||||+++-++.. ..-.-+.-+...+|-+.|+++-+ ..|||-.. T Consensus 117 g~~~~~mm~~g~hVhV~vd~~~e-------------~~~~~~~~~~~~~P~llALsAnSPf~~G~dtG~~S~R~~iw~~~ 183 (291) T pfam04107 117 GVYGRQMMVAGCHVQVNIDSSSE-------------AIMAVLRLVRALLPVLLALSANSPFWGGRDTGYASTRALIFTQT 183 (291) T ss_pred HHHHHHHHHCEEEEEECCCCCCH-------------HHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCHHHHHHHHHCC T ss_conf 33789897722258854788758-------------89999999988889999997298233588764131778987408 Q ss_pred CCC-CCCEEEEECC---------------CCCCC--EEECCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHH-HHHHH Q ss_conf 885-2120101136---------------88764--010257887654124404675321279999999999-99878 Q gi|254780185|r 318 TDS-QSPTKLTYST---------------HNHSA--SCRIPYENKSDNKSIEIRFPDLSANPYLAPAAILMA-GLDGI 376 (458) Q Consensus 318 p~~-~ap~~~~WG~---------------~NR~a--~iRip~~~~~~~~riE~R~~da~aNPYL~lAailaA-gl~Gi 376 (458) |.. --|...+|+. +-|.. .+|. .+. ..+.+|.|..|++.+|=..+++++-+ ++|.. T Consensus 184 p~~G~~p~~~~~~~~~~yv~~ll~~~~i~~~~~~~~d~r~--~~~-~~~t~E~r~~ds~~f~~vr~~~~~e~r~~d~~ 258 (291) T pfam04107 184 PQAGPLPLAFEDGAFERYARYALDTGIIFVRRRLWWDGRP--PGL-PGETLELRIHDTTAFPPVRLRALLEARLLDAQ 258 (291) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHCCCCEEECCCCEEECCC--CCC-CCCHHHHHHHCCCCCHHHHHHHHHHHHHHHCC T ss_conf 9999999879838999999999718976966982683178--989-97688999830469879999999999887418 No 15 >COG2170 Uncharacterized conserved protein [Function unknown] Probab=97.74 E-value=0.00068 Score=43.97 Aligned_cols=158 Identities=19% Similarity=0.223 Sum_probs=96.4 Q ss_pred CCCCCHHHCCCCCCCCHH-HHHHHHHHHHHHHHHHHHHCCCEEEE-EEEECCCC--------------------CC---- Q ss_conf 335621210365556688-87769999999999998744975444-12101456--------------------86---- Q gi|254780185|r 206 NQINDAQHSFGLQSESLL-HASDNLQKYKYSVHQVANSYCKIATF-MPKPIASH--------------------NG---- 259 (458) Q Consensus 206 ~E~gpGQ~Ei~~~~~~~l-~aAD~~~~~k~~ik~iA~~~G~~ATF-mpKP~~~~--------------------~G---- 259 (458) +|.=-.+.|+....++.+ +|+-..=-.|..+++.|..||+...= -.-||.+. .| T Consensus 43 ~e~~e~~vE~~t~vc~~~~eA~~~~r~~r~~l~q~a~d~gL~~~~~GtHPfadw~~~~~~~~prY~~~ie~~~y~~~q~~ 122 (369) T COG2170 43 HEITESTVELATGVCRLLAEAAAQLRALRDYLVQAASDHGLRICGGGTHPFADWRRQEVPDNPRYQRLIERTGYLGRQMT 122 (369) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHEE T ss_conf 88887743034036665999999999999999997442380332367885025565247887239999998516776215 Q ss_pred -CHHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHCCHHCCCCC---------HHHHHHC----CCC-CCCE Q ss_conf -012011111146763122155565789999999999988641000006670---------0144318----885-2120 Q gi|254780185|r 260 -SGMYLNMSIHKGEKAIFTGNQHDRVSLKGLYYLGGIIKHAKSLNALTNAST---------NSYKRLL----TDS-QSPT 324 (458) Q Consensus 260 -sG~Hih~Sl~~~~~n~f~~~~~~~ls~~~~~fiaGil~h~~al~a~~~Ptv---------NSYkRl~----p~~-~ap~ 324 (458) -|.|||+-+- + ++.+-..+-+|+.++|-+.|+.|.+. -|+|+-. |.. -.|- T Consensus 123 v~G~HVHVGi~----------~----~d~~~~~l~~l~~~~PhlLALSASSPf~~G~dTGyAS~R~~if~~~P~~g~pp~ 188 (369) T COG2170 123 VAGQHVHVGIP----------S----PDDAMYLLHRLLRYVPHLLALSASSPFWQGTDTGYASARANIFSQLPTNGLPPA 188 (369) T ss_pred EEEEEEEECCC----------C----HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCC T ss_conf 64577883689----------8----899999999887650588765037853167323135655766320776799965 Q ss_pred EEEEC--------------CCCCCC---EEECCCCCCCCCCEEEECCCCCCCCHH--HHHHHHHHHHHHHHHHCCC Q ss_conf 10113--------------688764---010257887654124404675321279--9999999999987885599 Q gi|254780185|r 325 KLTYS--------------THNHSA---SCRIPYENKSDNKSIEIRFPDLSANPY--LAPAAILMAGLDGIAKKIH 381 (458) Q Consensus 325 ~~~WG--------------~~NR~a---~iRip~~~~~~~~riE~R~~da~aNPY--L~lAailaAgl~Gi~~~l~ 381 (458) +.+|+ .+|..- .|| | +|.=.++|+|++|+..||= ++++|++-|-+.=+-...+ T Consensus 189 f~sw~~f~~~~~~~~~tG~I~~~~~lwwdIR-P---sph~gTlEvRi~D~~~~l~~~~aivaL~~Alv~~ll~~~d 260 (369) T COG2170 189 FQSWAAFEAFFRDQLETGTIDSMGDLWWDIR-P---SPHLGTLEVRICDTVLNLAELLAIVALIHALVVWLLRRRD 260 (369) T ss_pred CCCHHHHHHHHHHHHHHCCCCCCCCEEEECC-C---CCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 1688999999999887364144663378613-6---8877825888447879999999999999999999963889 No 16 >KOG0683 consensus Probab=97.07 E-value=0.00033 Score=45.92 Aligned_cols=257 Identities=13% Similarity=0.093 Sum_probs=121.4 Q ss_pred CCCCCCCEEEEEECCCCEECCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHH---CCCCCEEEEEECCCEEE Q ss_conf 468998889996411058775689722799852014754485302555323556799873---03452257731330899 Q gi|254780185|r 60 WHSKHPNLLLIPDIETMHMDPFYAQSTVAFICNVYDPITLQPYNRDPRYTAQKAIDYLQT---TDIGDTLLLGITTEFFV 136 (458) Q Consensus 60 ~~~~~~D~~l~PD~~T~~~~Pw~~~~~~~v~~d~~~~~~g~P~~~~PR~~Lk~~~~~l~~---~~~G~~~~~g~E~EF~L 136 (458) +..+ .+.+..=|-.++.--|- +. +-++.|.-.- .++++....|+...++....+.+ +-+|+ ++.||+-++++ T Consensus 93 fr~g-~Nilv~c~~~~~~~~P~-~t-n~R~~c~~~~-~~~~~~~~~PWfg~Eqeyt~l~~~~~~p~gw-p~~GFp~Pqgp 167 (380) T KOG0683 93 FRNG-NNILVMCDTYDFDGKPT-ET-NKRVACARIM-PKLSTKDTEPWFGMEQEYTLLDALDGHPFGW-PKGGFPGPQGP 167 (380) T ss_pred CCCC-CCEEEEEECCCCCCCCC-CC-CCHHHHHHHH-CCCCCCCCCCCHHHHHHHHHHCCCCCCCCCC-CCCCCCCCCCC T ss_conf 5589-97799943366789966-42-2103478874-3455234587533567877650456884647-76688888777 Q ss_pred ECCCCCCCCCCCCCCCCCCCC-------------CCCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 706777778654776667654-------------5777777500365545563552444689999999999860541122 Q gi|254780185|r 137 FDNVHCTISPIKSGFALESTE-------------FLQNGNNKGYDSHAKSSYTLPPQDKLHDMRSEIVSALNNIGVQITK 203 (458) Q Consensus 137 f~~~~~~~~p~~~~~~~~~~~-------------~~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~ei~~~l~~~Gi~ie~ 203 (458) +...-+. +.-.+....... .-....+..+--...+|..++.-|...-.+.=+-+.+++.|+ +-+ T Consensus 168 yyc~VGa--d~~~~rdiveahy~acLyaGl~i~G~N~EvmPgQwEfqvGp~~GI~~gD~lw~aR~il~rVae~~Gv-ias 244 (380) T KOG0683 168 YYCGVGA--DRVFGRDIVEAHYRACLYAGLNISGINVEVMPGQWEFQVGPCEGISMGDQLWMARYILHRVAEKFGV-IAS 244 (380) T ss_pred CEECCCC--CCCCCCHHHHHHHHHHHHHHHEECCCCCCCCCCEEEEEECCHHCCCCCHHHHHHHHHHHHHHHHHCE-EEE T ss_conf 0000266--3012412343068887740302336323134750488544500112200478999999999998491-687 Q ss_pred CCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHH-HHHHHCCCCCCCEECCCCC Q ss_conf 213356212103655566888776999999999999874497544412101456860120-1111114676312215556 Q gi|254780185|r 204 YHNQINDAQHSFGLQSESLLHASDNLQKYKYSVHQVANSYCKIATFMPKPIASHNGSGMY-LNMSIHKGEKAIFTGNQHD 282 (458) Q Consensus 204 ~h~E~gpGQ~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpKP~~~~~GsG~H-ih~Sl~~~~~n~f~~~~~~ 282 (458) ++.+.-+|.|--+=.|+.- ..++.-..||+.+..-+-|.. |--| .|++.|| +.... T Consensus 245 f~pKp~~g~WngaG~Htn~------------ST~~mr~~~g~~~i~~a~~~l----s~rh~~hi~~yd-------p~~G~ 301 (380) T KOG0683 245 FDPKPILGDWNGAGCHTNF------------STKEMREAGGLKIIEEAIPKL----SKRHREHIAAYD-------PKGGK 301 (380) T ss_pred ECCCCCCCCCCCCCCCCCC------------CHHHHHHCCCHHHHHHHHHHH----HHHHHHHHHHCC-------CCCCC T ss_conf 2588877766676522320------------216666057789999986420----023044233128-------56786 Q ss_pred CCCHHHHHHHHHHHHHHHHCCHHCCCCCHHHHHHCCCCCCC--EEEEECCCCCCCEEECCCCCC-CCCCEEEECCCCCCC Q ss_conf 57899999999999886410000066700144318885212--010113688764010257887-654124404675321 Q gi|254780185|r 283 RVSLKGLYYLGGIIKHAKSLNALTNASTNSYKRLLTDSQSP--TKLTYSTHNHSASCRIPYENK-SDNKSIEIRFPDLSA 359 (458) Q Consensus 283 ~ls~~~~~fiaGil~h~~al~a~~~PtvNSYkRl~p~~~ap--~~~~WG~~NR~a~iRip~~~~-~~~~riE~R~~da~a 359 (458) +. -+||....+.. +..+||.-||.+.||||.... .....+|.|-||+++ T Consensus 302 ---dN-------------------------~rrltg~hEt~~i~~Fs~GvAnr~~siri~r~va~~~~Gy~edrrP~sN~ 353 (380) T KOG0683 302 ---DN-------------------------ERRLTGRHETGSIDNFSWGVANRNPSIRIPRTVAAEGKGYFEDRRPSSNC 353 (380) T ss_pred ---CC-------------------------HHHHCCCCCCCCCCCCCCCCCCCCCEEEECHHHHCCCCCCCCCCCCCCCC T ss_conf ---33-------------------------45531888655522000103467861662131213665312458986788 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 2799999999999987 Q gi|254780185|r 360 NPYLAPAAILMAGLDG 375 (458) Q Consensus 360 NPYL~lAailaAgl~G 375 (458) +||+|-.+++--.|.. T Consensus 354 Dpy~Vt~~~~~t~l~~ 369 (380) T KOG0683 354 DPYAVTLMIIPTTLLE 369 (380) T ss_pred CCCEEEHHHHHHHHHC T ss_conf 7320107776677753 No 17 >TIGR03444 gshA_related glutamate--cysteine ligase family protein. Members of this bacterial protein family bear homology to glutamate--cysteine ligase, an enzyme in the two-step pathway of glutathione (GSH) biosynthesis, but are distinctly different. Among the bacterial genomes that carry the uncharacterized methyltransferase (TIGR03438) and conserved hypothetical protein TIGR03440, this protein is found in a subset, always in the vicinity of these other genes. Conserved hypothetical protein TIGR03442 is found in these same genomes. The role of this cassette is probably biosynthetic, but the product is unknown. Probab=85.74 E-value=1.9 Score=22.53 Aligned_cols=30 Identities=30% Similarity=0.321 Sum_probs=22.0 Q ss_pred CCEEEECCCCCCCCHHH-HHHHHHHHHHHHH Q ss_conf 41244046753212799-9999999999878 Q gi|254780185|r 347 NKSIEIRFPDLSANPYL-APAAILMAGLDGI 376 (458) Q Consensus 347 ~~riE~R~~da~aNPYL-~lAailaAgl~Gi 376 (458) .++||.|.+|+---.++ ++.|++.+-++.- T Consensus 290 K~yLE~R~aDa~p~~~~~a~~A~~~gLlyD~ 320 (390) T TIGR03444 290 RGWLELRYLDAQPGDGWPVPVATLTALLDDP 320 (390) T ss_pred CCEEEECCCCCCCCCCCCHHHHHHHHHHCCH T ss_conf 5634750167788130009999999984289 No 18 >COG3572 GshA Gamma-glutamylcysteine synthetase [Coenzyme metabolism] Probab=84.77 E-value=0.63 Score=25.48 Aligned_cols=23 Identities=22% Similarity=0.192 Sum_probs=12.1 Q ss_pred CEEEECCCCCCCCHHH-HHHHHHH Q ss_conf 1244046753212799-9999999 Q gi|254780185|r 348 KSIEIRFPDLSANPYL-APAAILM 370 (458) Q Consensus 348 ~riE~R~~da~aNPYL-~lAaila 370 (458) +.+|.|.+|+---+-+ ++.|... T Consensus 318 ~~lEmRgad~gP~~r~cAlpAfwv 341 (456) T COG3572 318 RFLEMRGADSGPWRRICALPAFWV 341 (456) T ss_pred CEEEEECCCCCCCHHHHHHHHHHH T ss_conf 146761356787123334277888 No 19 >pfam08759 DUF1792 Domain of unknown function (DUF1792). This putative domain is probably missannotated as a glycosyl transferase 8 family member. This domain is found at the C-terminus of proteins that also contain the glycosyl transferase domain at the N-terminus. Probab=82.31 E-value=0.99 Score=24.26 Aligned_cols=77 Identities=17% Similarity=0.236 Sum_probs=53.4 Q ss_pred HHHHHHCCCEEEEEEEECCCCC---CCHHHHHH--HHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHCCHHCCCCCH Q ss_conf 9998744975444121014568---60120111--111467631221555657899999999999886410000066700 Q gi|254780185|r 237 HQVANSYCKIATFMPKPIASHN---GSGMYLNM--SIHKGEKAIFTGNQHDRVSLKGLYYLGGIIKHAKSLNALTNASTN 311 (458) Q Consensus 237 k~iA~~~G~~ATFmpKP~~~~~---GsG~Hih~--Sl~~~~~n~f~~~~~~~ls~~~~~fiaGil~h~~al~a~~~PtvN 311 (458) |.+....=|..||+.+|+.+.. -++-++|. +||++..-++-.-.. ..+.. ---|++.+.++-=+.||+-| T Consensus 76 k~l~~~~~Y~~tfiSRpY~d~~dKs~~~~~F~~lK~iW~~rdIliVEG~~-sR~Gv----GNdLF~naksI~rIicPskn 150 (225) T pfam08759 76 KLLVPNRWYGSTFISRPYIDYKDKSKSARYFHKLKQIWQNRDILIVEGEK-SRSGV----GNDLFDNAKSIKRIICPSKN 150 (225) T ss_pred HHHCCCCEECCEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCC-CCCCC----CHHHHHCCCEEEEEECCCHH T ss_conf 97456774611042332212122257699999999984799089992365-11467----71556366407889578774 Q ss_pred HHHHHCC Q ss_conf 1443188 Q gi|254780185|r 312 SYKRLLT 318 (458) Q Consensus 312 SYkRl~p 318 (458) +|.++.. T Consensus 151 Ay~~yd~ 157 (225) T pfam08759 151 AYSKYDE 157 (225) T ss_pred HHHHHHH T ss_conf 7999999 No 20 >KOG0558 consensus Probab=76.98 E-value=2.4 Score=21.87 Aligned_cols=56 Identities=20% Similarity=0.249 Sum_probs=39.8 Q ss_pred HHHHHHHHHHHHHHHHH-CCCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 44689999999999860-5411222133562121036555668887769999999999998744975444121 Q gi|254780185|r 181 DKLHDMRSEIVSALNNI-GVQITKYHNQINDAQHSFGLQSESLLHASDNLQKYKYSVHQVANSYCKIATFMPK 252 (458) Q Consensus 181 ~~~~~~~~ei~~~l~~~-Gi~ie~~h~E~gpGQ~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpK 252 (458) .....|.+.++..|..+ -||--.+. |-+ -+|.++-+|+-.|++|...|..-||||- T Consensus 246 ~plrGf~rAMvKtMt~alkiPHF~y~---------------dEI-n~~sLvklr~elk~~a~e~~IKltfmPf 302 (474) T KOG0558 246 VPLRGFSRAMVKTMTEALKIPHFGYV---------------DEI-NCDSLVKLRQELKENAKERGIKLTFMPF 302 (474) T ss_pred EECHHHHHHHHHHHHHHHCCCCCCCC---------------CCC-CHHHHHHHHHHHHHHHHHCCCEEEEHHH T ss_conf 21234789999998887458854424---------------424-7688999999986666645732652378 No 21 >PRK05784 phosphoribosylamine--glycine ligase; Provisional Probab=59.06 E-value=13 Score=17.20 Aligned_cols=97 Identities=10% Similarity=0.132 Sum_probs=57.7 Q ss_pred EEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCC----H---H---HHHHHHHHHHHHHH Q ss_conf 257731330899706777778654776667654577777750036554556355----2---4---44689999999999 Q gi|254780185|r 125 TLLLGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNKGYDSHAKSSYTLP----P---Q---DKLHDMRSEIVSAL 194 (458) Q Consensus 125 ~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~y~~~----~---~---~~~~~~~~ei~~~l 194 (458) +.+.|.|+-|+.|-+.. ...|.... ........ +..|..+.++++|+-. | . .....+++..+..| T Consensus 203 EfL~G~E~S~~a~~DG~-~~~~lp~a---QDhKR~~d-gD~GPNTGGMGAysP~~~~~P~l~~~~~~~~~~Ii~pti~~l 277 (485) T PRK05784 203 EKVDGVEYTVQVLTDGE-SVFPLPPV---QDNPHAYE-YGIGPETGGMGSVSGPGELLPFIEEEEYEEAVEIVKATVEAI 277 (485) T ss_pred ECCCCCEEEEEEEEECC-EEEECCCC---CCCCCCCC-CCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 64468179999998299-69989222---05765568-999989998700678766775447999999987489999999 Q ss_pred HH-HCCCCCCC------CCCCCCHHHCCCCCCCCHHHHH Q ss_conf 86-05411222------1335621210365556688877 Q gi|254780185|r 195 NN-IGVQITKY------HNQINDAQHSFGLQSESLLHAS 226 (458) Q Consensus 195 ~~-~Gi~ie~~------h~E~gpGQ~Ei~~~~~~~l~aA 226 (458) .. +|++...+ .++.||--.|+|..+.||-..+ T Consensus 278 ~~e~g~~y~GvLY~GlMit~~GPkVlEfN~RfGDPEtQ~ 316 (485) T PRK05784 278 RKETGERYVGVLSGQMMLTALGPTVIEYYSRLGDPEALN 316 (485) T ss_pred HHCCCCCCEEEEEEEEEECCCCCEEEEEECCCCCCHHHH T ss_conf 861699727888767886189848999968878833888 No 22 >COG4456 VagC Virulence-associated protein and related proteins [Function unknown] Probab=43.49 E-value=13 Score=17.34 Aligned_cols=29 Identities=24% Similarity=0.249 Sum_probs=20.2 Q ss_pred CCCCCCEEECCCCCCCCCCEEEECCCCCC Q ss_conf 68876401025788765412440467532 Q gi|254780185|r 330 THNHSASCRIPYENKSDNKSIEIRFPDLS 358 (458) Q Consensus 330 ~~NR~a~iRip~~~~~~~~riE~R~~da~ 358 (458) ..||+-++|+|..-.-...++|++.-|.+ T Consensus 7 ~snrSQAVRLP~e~~f~~~~VeI~r~G~~ 35 (74) T COG4456 7 RSNRSQAVRLPKEFRFPEDRVEIIREGDR 35 (74) T ss_pred ECCCEEEEECCHHEECCCCEEEEEEECCE T ss_conf 04971027654421027858999995895 No 23 >PRK00885 phosphoribosylamine--glycine ligase; Provisional Probab=40.75 E-value=26 Score=15.39 Aligned_cols=198 Identities=12% Similarity=0.185 Sum_probs=96.0 Q ss_pred CCHHHHHHHHHHCCCCEEEEEEEC-----CCCC---EEEEEE----EHH---------HHHHHH-HHCCCCCCCCCH--- Q ss_conf 798999999985798399999978-----9982---523787----787---------830558-845984566510--- Q gi|254780185|r 3 DKATSIIQKIAQEKVKFIDLRFTD-----LQGK---FRHISM----NVS---------LFNENT-LLNGITFDSSSI--- 57 (458) Q Consensus 3 ~~~~~i~~~l~~~~I~~V~~~~~D-----l~G~---~R~k~i----p~~---------~f~~~~-~~~G~~fd~s~i--- 57 (458) +..++|.++.++++|++| ++. ..|+ ++..-+ |.+ .|.+.. ...+++-..... T Consensus 49 ~d~~~l~~~a~~~~idlv---iiGPE~pL~~Gi~D~l~~~gi~vfGP~k~aA~lE~SK~faK~~m~~~~IPTa~~~~f~~ 125 (424) T PRK00885 49 TDIEALVAFAKEEGIDLT---VVGPEAPLVAGIVDAFRAAGLKIFGPTKAAAQLEGSKAFAKDFMARYGIPTAAYEVFTD 125 (424) T ss_pred CCHHHHHHHHHHCCCCEE---EECCCHHHHHHHHHHHHHCCCCEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECC T ss_conf 799999999998499999---98966788735799995069946894978988766379999999982989875488689 Q ss_pred -----HCCCCCCCCEEEEEECCCCEECCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-----EEE Q ss_conf -----0246899888999641105877568972279985201475448530255532355679987303452-----257 Q gi|254780185|r 58 -----DGWHSKHPNLLLIPDIETMHMDPFYAQSTVAFICNVYDPITLQPYNRDPRYTAQKAIDYLQTTDIGD-----TLL 127 (458) Q Consensus 58 -----~g~~~~~~D~~l~PD~~T~~~~Pw~~~~~~~v~~d~~~~~~g~P~~~~PR~~Lk~~~~~l~~~~~G~-----~~~ 127 (458) .........+++++|= ...+...++|.-. ...+..|++++..-.--+.|- ..+ T Consensus 126 ~~~A~~~l~~~~~p~VIKaDG--------LAaGKGV~V~~~~---------~ea~~al~~i~~~~~fg~ag~~VvIEE~L 188 (424) T PRK00885 126 AEEAKAYLDEQGAPIVVKADG--------LAAGKGVVVAMTL---------EEAEAAVDDMLAGNKFGDAGARVVIEEFL 188 (424) T ss_pred HHHHHHHHHHCCCCEEEECCC--------CCCCCCEEEECCH---------HHHHHHHHHHHCCCCCCCCCCEEEEEECC T ss_conf 999999998579988996176--------5677751763789---------99999999985033234568728999705 Q ss_pred EEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCHH---HHH----HHHHHHHHHHHHHHCCC Q ss_conf 73133089970677777865477666765457777775003655455635524---446----89999999999860541 Q gi|254780185|r 128 LGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNKGYDSHAKSSYTLPPQ---DKL----HDMRSEIVSALNNIGVQ 200 (458) Q Consensus 128 ~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~y~~~~~---~~~----~~~~~ei~~~l~~~Gi~ 200 (458) .|.|+-++.|-+.. ...|... ......... +..|..+.++|+|+-.|+ +.. ..+++.++..|...|++ T Consensus 189 ~G~E~S~~a~~DG~-~~~~lp~---aqDhKR~~d-gD~GPNTGGMGa~sP~p~~~~~~~~~i~~~Ii~pt~~~l~~eg~~ 263 (424) T PRK00885 189 DGEEASFFALVDGE-NVLPLPT---AQDHKRAGD-GDTGPNTGGMGAYSPAPVVTEEVHDRVMEEIIEPTVKGMAAEGIP 263 (424) T ss_pred CCCEEEEEEEECCC-CCCCCCC---CCCCCCCCC-CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 88317999997899-7264702---114750568-999988998856678867999999999998799999999857984 Q ss_pred CCC------CCCCCCCHHHCCCCCCCCHHHH Q ss_conf 122------2133562121036555668887 Q gi|254780185|r 201 ITK------YHNQINDAQHSFGLQSESLLHA 225 (458) Q Consensus 201 ie~------~h~E~gpGQ~Ei~~~~~~~l~a 225 (458) ..+ +.++.||--.|+|..+.||-.. T Consensus 264 y~GiLy~GlMit~~GpkVlEyN~RfGDPE~q 294 (424) T PRK00885 264 YTGVLYAGLMITKDGPKVIEFNCRFGDPETQ 294 (424) T ss_pred EEEEEEEEEEEECCCCEEEEEECCCCCHHHH T ss_conf 7999877999867975589863134885888 No 24 >PRK06395 phosphoribosylamine--glycine ligase; Provisional Probab=38.68 E-value=28 Score=15.19 Aligned_cols=96 Identities=8% Similarity=0.085 Sum_probs=58.5 Q ss_pred EEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCC----H------HHHHHHHHHHHHHHH Q ss_conf 257731330899706777778654776667654577777750036554556355----2------444689999999999 Q gi|254780185|r 125 TLLLGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNKGYDSHAKSSYTLP----P------QDKLHDMRSEIVSAL 194 (458) Q Consensus 125 ~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~y~~~----~------~~~~~~~~~ei~~~l 194 (458) .++.|.|+-++.|-+.. ...+... ......... +..|..+.++|+|+-. + ......+++.+++.| T Consensus 188 E~L~G~E~S~~a~~dG~-~~~~lp~---aqDhKR~~d-gD~GPNTGGMGaysp~~~~~p~l~~~~~e~~~~Ii~pti~~l 262 (435) T PRK06395 188 KKMTGEEFSLQAFSDGK-HLSFMPI---VQDYKRAYE-GDHGPNTGGMGSISDRDFSLPFLSKDAPERAKHILNDIIRAM 262 (435) T ss_pred EEEECCEEEEEEEECCC-EEEECCC---CCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 99844168999997699-5585873---012533447-999999998745688888874327789999999999999999 Q ss_pred HHHCCCCCC------CCCCCCCHHHCCCCCCCCHHHH Q ss_conf 860541122------2133562121036555668887 Q gi|254780185|r 195 NNIGVQITK------YHNQINDAQHSFGLQSESLLHA 225 (458) Q Consensus 195 ~~~Gi~ie~------~h~E~gpGQ~Ei~~~~~~~l~a 225 (458) ...|++..+ +.++.||--.|+|..+.||-.. T Consensus 263 ~~eg~~y~GvLy~GlMit~~gpkVlEyN~RfGDPE~Q 299 (435) T PRK06395 263 KDENNPFKGIMYGQFMDTPNGVKVIEINARFADPEGI 299 (435) T ss_pred HHCCCCEEEEEEEEEEEECCCCEEEEEEECCCCCHHH T ss_conf 9649970899877898508996489986203880688 No 25 >cd04865 LigD_Pol_like_2 LigD_Pol_like_2: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD, subgroup 2. The LigD Pol domain belongs to the archaeal/eukaryal primase (AEP) superfamily. In prokaryotes, LigD along with Ku is required for non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB). NHEJ-mediated DNA DSB repair is error-prone. It has been suggested that LigD Pol contributes to NHEJ-mediated DNA DSB repair in vivo, by filling in short 5'-overhangs with ribonucleotides; the filled in termini would then be sealed by the associated LigD ligase domain, resulting in short stretches of RNA incorporated into the genomic DNA. The Pol domains of PaeLigD and Mycobacterium tuberculosis (Mt)LigD are stimulated by manganese, are error-prone, and prefer adding rNTPs to dNTPs in vitro; however PaeLigD and MtLigD belong to other subgroups, proteins in this subgroup await functional characterization. Probab=38.34 E-value=28 Score=15.16 Aligned_cols=122 Identities=12% Similarity=0.155 Sum_probs=65.2 Q ss_pred HHHCCCCCCCCCCC----CCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCC-CHHHHHHHHC Q ss_conf 86054112221335----621210365556688877699999999999987449754441210145686-0120111111 Q gi|254780185|r 195 NNIGVQITKYHNQI----NDAQHSFGLQSESLLHASDNLQKYKYSVHQVANSYCKIATFMPKPIASHNG-SGMYLNMSIH 269 (458) Q Consensus 195 ~~~Gi~ie~~h~E~----gpGQ~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpKP~~~~~G-sG~Hih~Sl~ 269 (458) +..-|++..+.+-. -|-+.-|.+.|.++..-++ ++-.=..+|++-.+.|+. .|...+| .|+||.+.|. T Consensus 80 n~g~iE~H~w~sr~~~~~~PD~lvfDLDP~~~~~f~~-v~~~A~~vr~~L~~lgL~------s~~KTSG~kGlHV~vPl~ 152 (228) T cd04865 80 NLGCIELHPWPSRAGDLDHPDELVIDLDPQPGTSFED-VVEVALLVREVLDELGLR------GYPKTSGARGLHIYVPIA 152 (228) T ss_pred HCCCEEEECCCCCCCCCCCCCEEEEECCCCCCCCHHH-HHHHHHHHHHHHHHCCCC------EEEECCCCCEEEEEEECC T ss_conf 6294674178777787789987998289998899999-999999999999976995------004747998579999679 Q ss_pred CCCCCCEECCCCCCCCHHHHHHHHHH----HHHHHHCCHHCCCCCHHHHHHCCCCCCCEEEEECCCCCCCEEECCCC Q ss_conf 46763122155565789999999999----98864100000667001443188852120101136887640102578 Q gi|254780185|r 270 KGEKAIFTGNQHDRVSLKGLYYLGGI----IKHAKSLNALTNASTNSYKRLLTDSQSPTKLTYSTHNHSASCRIPYE 342 (458) Q Consensus 270 ~~~~n~f~~~~~~~ls~~~~~fiaGi----l~h~~al~a~~~PtvNSYkRl~p~~~ap~~~~WG~~NR~a~iRip~~ 342 (458) .. +. -+..+.|.-.+ -+..|++.-. +|-....---.++.|..|+|...+=-|.+ T Consensus 153 ~~----~~-------~~~vr~fa~~vA~~l~~~~P~~~t~--------~~~k~~R~grIfiDylqN~~g~T~vapYS 210 (228) T cd04865 153 PR----YT-------FEEVRRFAELLAREVERRLPDLATT--------ERWKKERGGRVYLDYLQNARGKTLAAPYS 210 (228) T ss_pred CC----CC-------HHHHHHHHHHHHHHHHHHCCHHHHH--------HHHHHHCCCCEEEECCCCCCCCEEEECCC T ss_conf 99----99-------8999999999999999978322467--------76687479969998622699984771566 No 26 >cd06184 flavohem_like_fad_nad_binding FAD_NAD(P)H binding domain of flavohemoglobin. Flavohemoglobins have a globin domain containing a B-type heme fused with a ferredoxin reductase-like FAD/NAD-binding domain. Flavohemoglobins detoxify nitric oxide (NO) via an NO dioxygenase reaction. The hemoglobin domain adopts a globin fold with an embedded heme molecule. Flavohemoglobins also have a C-terminal reductase domain with bindiing sites for FAD and NAD(P)H. This domain catalyzes the conversion of NO + O2 + NAD(P)H to NO3- + NAD(P)+. Instead of the oxygen transport function of hemoglobins, flavohemoglobins seem to act in NO dioxygenation and NO signalling. Probab=37.93 E-value=23 Score=15.78 Aligned_cols=28 Identities=7% Similarity=0.068 Sum_probs=14.4 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 0365556688877699999999999987449 Q gi|254780185|r 214 SFGLQSESLLHASDNLQKYKYSVHQVANSYC 244 (458) Q Consensus 214 Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G 244 (458) ++.+-|+ .+..+++ .|+.-+.+++.+|. T Consensus 143 ~i~l~yg--~r~~~d~-~~~~eL~~l~~~~~ 170 (247) T cd06184 143 PVTFIHA--ARNSAVH-AFRDELEELAARLP 170 (247) T ss_pred CEEEEEE--CCCHHHH-HHHHHHHHHHHHCC T ss_conf 4899995--6997993-78999999997689 No 27 >pfam12224 Amidoligase_2 Putative amidoligase enzyme. This family of proteins are likely to act as amidoligase enzymes Protein in this family are found in conserved gene neighborhoods encoding a glutamine amidotransferase-like thiol peptidase (in proteobacteria) or an Aig2 family cyclotransferase protein (in firmicutes). Probab=36.75 E-value=30 Score=15.00 Aligned_cols=14 Identities=21% Similarity=0.501 Sum_probs=11.0 Q ss_pred EEEECCCCCCCCHH Q ss_conf 24404675321279 Q gi|254780185|r 349 SIEIRFPDLSANPY 362 (458) Q Consensus 349 riE~R~~da~aNPY 362 (458) -||+|+++++-+.= T Consensus 224 T~efRlpn~tld~~ 237 (250) T pfam12224 224 TVEFRLPNGTLDAP 237 (250) T ss_pred CEEEECCCCCCCHH T ss_conf 45865898865821 No 28 >TIGR01108 oadA oxaloacetate decarboxylase alpha subunit; InterPro: IPR005776 This family describes the bacterial oxaloacetate decarboxylase alpha subunit and its equivalents in archaea . The oxaloacetate decarboxylase Na+ pump is the paradigm of the family of Na+ transport decarboxylases that present in bacteria and archaea. It a multi subunit enzyme consisting of a peripheral alpha-subunit and integral membrane subunits beta and gamma. The energy released by the decarboxylation reaction of oxaloacetate is coupled to Na+ ion pumping across the membrane.; GO: 0008948 oxaloacetate decarboxylase activity, 0006814 sodium ion transport. Probab=35.78 E-value=31 Score=14.90 Aligned_cols=63 Identities=17% Similarity=0.170 Sum_probs=45.2 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC-CEEEEE Q ss_conf 4556355244468999999999986054112221335621210365556688877699999999999987449-754441 Q gi|254780185|r 172 KSSYTLPPQDKLHDMRSEIVSALNNIGVQITKYHNQINDAQHSFGLQSESLLHASDNLQKYKYSVHQVANSYC-KIATFM 250 (458) Q Consensus 172 ~~~y~~~~~~~~~~~~~ei~~~l~~~Gi~ie~~h~E~gpGQ~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G-~~ATFm 250 (458) .-||+++|....+.|+ |+.+-|..+||+.-++--=.| =|.|.- +=++++++=++.| +-+-.. T Consensus 138 ~iSYTtSPvHTl~~yl-~la~~L~~~G~DSI~IKDMaG------lLTP~~----------AYELV~alK~~~~n~pvhLH 200 (616) T TIGR01108 138 AISYTTSPVHTLEKYL-ELAKELLEMGVDSICIKDMAG------LLTPKV----------AYELVSALKKEFGNLPVHLH 200 (616) T ss_pred EEECCCCCHHHHHHHH-HHHHHHHHHCCCEEEEECCCC------CCCHHH----------HHHHHHHHHHHCCCEEEEEE T ss_conf 9712468436788899-999999981886055202004------644158----------99999999742397468863 Q ss_pred E Q ss_conf 2 Q gi|254780185|r 251 P 251 (458) Q Consensus 251 p 251 (458) . T Consensus 201 ~ 201 (616) T TIGR01108 201 S 201 (616) T ss_pred C T ss_conf 2 No 29 >TIGR00871 zwf glucose-6-phosphate 1-dehydrogenase; InterPro: IPR001282 Glucose-6-phosphate dehydrogenase (1.1.1.49 from EC) (G6PDH) is a ubiquitous protein, present in bacteria and all eukaryotic cell types . The enzyme catalyses the the first step in the pentose pathway, i.e. the conversion of glucose-6-phosphate to gluconolactone 6-phosphate in the presence of NADP, producing NADPH. The ubiquitous expression of the enzyme gives it a major role in the production of NADPH for the many NADPH-mediated reductive processes in all cells . Deficiency of G6PDH is a common genetic abnormality affecting millions of people worldwide. Many sequence variants, most caused by single point mutations, are known, exhibiting a wide variety of phenotypes .; GO: 0004345 glucose-6-phosphate dehydrogenase activity, 0006006 glucose metabolic process. Probab=35.56 E-value=13 Score=17.34 Aligned_cols=39 Identities=13% Similarity=0.144 Sum_probs=20.8 Q ss_pred CCCEEEEECCCCCCCEEEC-CCCCCCCCCEEEECCCCCCCCH Q ss_conf 2120101136887640102-5788765412440467532127 Q gi|254780185|r 321 QSPTKLTYSTHNHSASCRI-PYENKSDNKSIEIRFPDLSANP 361 (458) Q Consensus 321 ~ap~~~~WG~~NR~a~iRi-p~~~~~~~~riE~R~~da~aNP 361 (458) +..+...--.+|..--||| |. .+-.-+|..+.||++.+. T Consensus 366 F~~~~~~~~pE~N~LvIRiQPD--EGv~lkf~~K~PG~~~~~ 405 (498) T TIGR00871 366 FKANERDANPERNALVIRIQPD--EGVYLKFNAKKPGLAFET 405 (498) T ss_pred HHCCCCCCCCCCCCEEEEECCC--HHHHHHHHCCCCCCCCEE T ss_conf 0001234676676068986696--468785414688754423 No 30 >PRK09632 ATP-dependent DNA ligase; Reviewed Probab=34.70 E-value=32 Score=14.80 Aligned_cols=18 Identities=11% Similarity=0.117 Sum_probs=8.4 Q ss_pred EEEEECCCCCCCCCHHHHH Q ss_conf 9852014754485302555 Q gi|254780185|r 89 FICNVYDPITLQPYNRDPR 107 (458) Q Consensus 89 v~~d~~~~~~g~P~~~~PR 107 (458) ++.|+ +|..|.+|..|-+ T Consensus 131 lv~DL-DP~~g~~f~~v~~ 148 (753) T PRK09632 131 LVFDL-DPGEGVGMAQLAE 148 (753) T ss_pred EEEEC-CCCCCCCHHHHHH T ss_conf 99979-9876889999999 No 31 >TIGR01393 lepA GTP-binding protein LepA; InterPro: IPR006297 LepA (GUF1 in Saccaromyces) is a GTP-binding membrane protein related to EF-G and EF-Tu. Two types of phylogenetic tree, rooted by other GTP-binding proteins, suggest that eukaryotic homologs (including GUF1 of yeast) originated within the bacterial LepA family. The function of the proteins in this family are unknown. ; GO: 0005525 GTP binding. Probab=34.41 E-value=33 Score=14.77 Aligned_cols=53 Identities=15% Similarity=0.217 Sum_probs=31.5 Q ss_pred CCCEEEEEECCCCEECCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 98889996411058775689722799852014754485302555323556799873034 Q gi|254780185|r 64 HPNLLLIPDIETMHMDPFYAQSTVAFICNVYDPITLQPYNRDPRYTAQKAIDYLQTTDI 122 (458) Q Consensus 64 ~~D~~l~PD~~T~~~~Pw~~~~~~~v~~d~~~~~~g~P~~~~PR~~Lk~~~~~l~~~~~ 122 (458) -||-+=--+-.+--++|...+--..|+|.+| |.|.+-| .-||.+|+||+-.|. T Consensus 272 VGDTiT~~~~Pa~eplpGF~~~KP~VFaGlY-Pid~~~Y-----e~LrdALeKL~LNDA 324 (598) T TIGR01393 272 VGDTITSVKNPAKEPLPGFKEVKPMVFAGLY-PIDTEDY-----EDLRDALEKLKLNDA 324 (598) T ss_pred CCCEEECCCCCCCCCCCCCCCCCCEEEECCC-CCCCHHH-----HHHHHHHHHHHHHHH T ss_conf 0544525678737678886125765860125-8880346-----899999755544025 No 32 >PRK13289 nitric oxide dioxygenase; Provisional Probab=34.24 E-value=28 Score=15.17 Aligned_cols=12 Identities=17% Similarity=0.293 Sum_probs=4.8 Q ss_pred HHHHHHHHHHHC Q ss_conf 999999998744 Q gi|254780185|r 232 YKYSVHQVANSY 243 (458) Q Consensus 232 ~k~~ik~iA~~~ 243 (458) |+.-+++++.+| T Consensus 306 F~~eL~~La~~~ 317 (399) T PRK13289 306 FRDEVRALAARH 317 (399) T ss_pred HHHHHHHHHHHC T ss_conf 899999999758 No 33 >COG1602 Uncharacterized conserved protein [Function unknown] Probab=33.35 E-value=32 Score=14.82 Aligned_cols=52 Identities=17% Similarity=0.175 Sum_probs=35.2 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCHH----HCCCCCCC-CHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999860541122213356212----10365556-68887769999999999998 Q gi|254780185|r 188 SEIVSALNNIGVQITKYHNQINDAQ----HSFGLQSE-SLLHASDNLQKYKYSVHQVA 240 (458) Q Consensus 188 ~ei~~~l~~~Gi~ie~~h~E~gpGQ----~Ei~~~~~-~~l~aAD~~~~~k~~ik~iA 240 (458) .|.+-.|..-|++++++..=..-|- .+=-+-|+ ..++|+|..+ -+++.++|- T Consensus 183 ~eAv~~LY~~G~d~~~I~r~LS~G~lg~gk~RrlVPTRWSITAVD~~v-s~~L~~~i~ 239 (402) T COG1602 183 EEAVVELYRRGVDVYYIQRALSVGLLGLGKRRRLVPTRWSITAVDDTV-SDYLAREIR 239 (402) T ss_pred HHHHHHHHHCCCCHHHHHHHHHCCHHHCCCCCCCCCCCCEEEEHHHHH-HHHHHHHHH T ss_conf 999999998499899999986231320675664066630110068789-999999984 No 34 >COG4710 Predicted DNA-binding protein with an HTH domain [General function prediction only] Probab=33.32 E-value=19 Score=16.28 Aligned_cols=47 Identities=26% Similarity=0.422 Sum_probs=27.5 Q ss_pred CCCCCHHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 00179899999987398989841337889999999999999999965999789 Q gi|254780185|r 399 RICISLRESLENLDKDREFLKVGNVFDDDQIDAFINLKMKEVLKLESSPSPVE 451 (458) Q Consensus 399 ~LP~~L~eAl~~l~~d~~~~~~~~~lG~~~v~~~~~~K~~E~~~~~~~vt~wE 451 (458) +||..|.|-|++|.+...=. +.+.=.+-|+.||. |.++|.-.+...| T Consensus 11 Rlp~E~~eRL~~Ls~~tgrt--kayyvrEaIE~~ie----emED~ylA~~ale 57 (80) T COG4710 11 RLPLELKERLDNLSKNTGRT--KAYYVREAIEAYIE----EMEDFYLAVNALE 57 (80) T ss_pred ECCHHHHHHHHHHHHHCCCC--HHHHHHHHHHHHHH----HHHHHHHHHHHHH T ss_conf 27989999999998704885--46999999999999----9999999999999 No 35 >COG3867 Arabinogalactan endo-1,4-beta-galactosidase [Carbohydrate transport and metabolism] Probab=33.05 E-value=35 Score=14.63 Aligned_cols=23 Identities=22% Similarity=0.476 Sum_probs=18.0 Q ss_pred CHHHHHHHHHHCCCCEEEEE-EEC Q ss_conf 98999999985798399999-978 Q gi|254780185|r 4 KATSIIQKIAQEKVKFIDLR-FTD 26 (458) Q Consensus 4 ~~~~i~~~l~~~~I~~V~~~-~~D 26 (458) ..+++++.||+++|.|||+. |.| T Consensus 64 ~~qD~~~iLK~~GvNyvRlRvwnd 87 (403) T COG3867 64 VRQDALQILKNHGVNYVRLRVWND 87 (403) T ss_pred HHHHHHHHHHHCCCCEEEEEEECC T ss_conf 177899999974967699998538 No 36 >pfam07745 Glyco_hydro_53 Glycosyl hydrolase family 53. This domain belongs to family 53 of the glycosyl hydrolase classification. These enzymes are enzymes are endo-1,4- beta-galactanases (EC:3.2.1.89). The structure of this domain is known and has a TIM barrel fold. Probab=33.03 E-value=35 Score=14.63 Aligned_cols=26 Identities=12% Similarity=0.201 Sum_probs=20.4 Q ss_pred CHHHHHHHHHHCCCCEEEE-EEECCCC Q ss_conf 9899999998579839999-9978998 Q gi|254780185|r 4 KATSIIQKIAQEKVKFIDL-RFTDLQG 29 (458) Q Consensus 4 ~~~~i~~~l~~~~I~~V~~-~~~Dl~G 29 (458) +..++++.|++++|.+||+ .|+|=.+ T Consensus 25 ~~~d~~~iLk~~GvN~vRlRvWvnP~~ 51 (332) T pfam07745 25 QTQDLFTILKDAGVNSIRLRVWVNPYD 51 (332) T ss_pred CCCCHHHHHHHCCCCEEEEEEEECCCC T ss_conf 886899999976988699998657887 No 37 >PRK11930 putative bifunctional UDP-N-acetylmuramoyl-tripeptide:D-alanyl-D-alanine ligase/alanine racemase; Provisional Probab=32.81 E-value=35 Score=14.61 Aligned_cols=93 Identities=14% Similarity=0.071 Sum_probs=49.0 Q ss_pred CCEEEEECCCCCCCEEECCCC-CCCCCCEEEECCCC----------CCCCHHHHHHHHHHHHHHHH-----HHCCCCCCC Q ss_conf 120101136887640102578-87654124404675----------32127999999999999878-----855999866 Q gi|254780185|r 322 SPTKLTYSTHNHSASCRIPYE-NKSDNKSIEIRFPD----------LSANPYLAPAAILMAGLDGI-----AKKIHPGKN 385 (458) Q Consensus 322 ap~~~~WG~~NR~a~iRip~~-~~~~~~riE~R~~d----------a~aNPYL~lAailaAgl~Gi-----~~~l~p~~~ 385 (458) ....++|+..+..+.+++... .....+.|.++.++ +..|-+=++||+..|-..|+ .+.|.--.| T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~~G~h~v~NAlaA~ava~~lGv~~~~i~~~L~~~~p 320 (824) T PRK11930 241 TRREIAWSRTDSEAPLYISRSEKKEFHTHIQYRYLGMDRYFQIPFIDDASIENSIHCLAVALYLMYSADQITKRFARLEP 320 (824) T ss_pred CCCEEEEECCCCCCCEEEEEEEECCCCEEEEEEECCEEEEEEECCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCC T ss_conf 76289983478665346665430588169999968906999936998899999999999999839999999999841896 Q ss_pred CCCCC---------HHHHHCCCCCCCCHHHHHHHHHHC Q ss_conf 55788---------777713200017989999998739 Q gi|254780185|r 386 IDELS---------LEEQNTIPRICISLRESLENLDKD 414 (458) Q Consensus 386 ~~~~~---------~~~~~~~~~LP~~L~eAl~~l~~d 414 (458) +.+-. .--.+....-|.||..||+-|.+- T Consensus 321 ~~~Rlev~~g~~g~~iIdDsYNAnP~Sm~aALdtL~~~ 358 (824) T PRK11930 321 VEMRLEVKEGINNCTLINDSYNSDLASLDIALDFLERR 358 (824) T ss_pred CCCEEEEEECCCCCEEEECCCCCCHHHHHHHHHHHHHC T ss_conf 54513799979980999566669989999999999740 No 38 >TIGR00750 lao LAO/AO transport system ATPase; InterPro: IPR005129 Bacterial periplasmic transport systems require the function of a specific substrate-binding protein, located in the periplasm, and several cytoplasmic membrane transport components. In Escherichia coli K-12, the arginine-ornithine transport system requires an arginine-ornithine-binding protein and the lysine-arginine-ornithine (LAO) transport system includes a LAO-binding protein. Both periplasmic proteins can be phosphorylated by a single kinase, ArgK resulting in reduced levels of transport activity of the periplasmic transport systems that include each of the binding proteins. The ArgK protein acts as an ATPase enzyme and as a kinase.. Probab=32.28 E-value=18 Score=16.32 Aligned_cols=41 Identities=15% Similarity=0.298 Sum_probs=29.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHH Q ss_conf 89999999999860541122213356212103655566888776999 Q gi|254780185|r 184 HDMRSEIVSALNNIGVQITKYHNQINDAQHSFGLQSESLLHASDNLQ 230 (458) Q Consensus 184 ~~~~~ei~~~l~~~Gi~ie~~h~E~gpGQ~Ei~~~~~~~l~aAD~~~ 230 (458) ..=-.|++..|+++|.++-=+.+ +|.||-|+++ +++||..+ T Consensus 122 s~at~~~~~lldA~G~DVI~vET-VGVGQSEVdi-----~~~aDT~v 162 (333) T TIGR00750 122 SKATRELVKLLDAAGYDVILVET-VGVGQSEVDI-----INMADTFV 162 (333) T ss_pred HHHHHHHHHHHHHCCCCEEEEEE-ECCCHHHHHH-----HHHHCEEE T ss_conf 79999999999863898799984-1575248788-----73415058 No 39 >PRK13895 conjugal transfer protein TraM; Provisional Probab=30.51 E-value=33 Score=14.75 Aligned_cols=17 Identities=6% Similarity=-0.031 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHCCCEEE Q ss_conf 99999999874497544 Q gi|254780185|r 232 YKYSVHQVANSYCKIAT 248 (458) Q Consensus 232 ~k~~ik~iA~~~G~~AT 248 (458) +..+|++||.|||...+ T Consensus 5 i~e~I~~IA~KHGIals 21 (144) T PRK13895 5 IEELIREIAAKHGIAVG 21 (144) T ss_pred HHHHHHHHHHHCCCCCC T ss_conf 99999999998092148 No 40 >cd07952 ED_3B_like Uncharacterized class III extradiol dioxygenases. This subfamily is composed of proteins of unknown function with similarity to the catalytic B subunit of class III extradiol dioxygenases. Class III extradiol dioxygenases use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. They play key roles in the degradation of aromatic compounds. Probab=28.25 E-value=42 Score=14.12 Aligned_cols=100 Identities=10% Similarity=0.118 Sum_probs=63.3 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCH---HHCCCCCCCCH----------------HHHHHHHHHHHHHHHHHHHHCC Q ss_conf 8999999999986054112221335621---21036555668----------------8877699999999999987449 Q gi|254780185|r 184 HDMRSEIVSALNNIGVQITKYHNQINDA---QHSFGLQSESL----------------LHASDNLQKYKYSVHQVANSYC 244 (458) Q Consensus 184 ~~~~~ei~~~l~~~Gi~ie~~h~E~gpG---Q~Ei~~~~~~~----------------l~aAD~~~~~k~~ik~iA~~~G 244 (458) .++.++|...+...||++-.++.....| .++++++..-| +.-.+.+..|-..|++++.+.+ T Consensus 80 ~~La~~I~~~a~~~gipv~~i~~~~~~~~~~~~~LDhG~~VPL~Fl~k~~~V~is~~~~~~~e~ly~fG~~I~~a~e~~~ 159 (256) T cd07952 80 RELANEIYKSARADGIPVLGINFATSSGDNSDFPLDWGELIPLSFLKKRPIVLITPPRLLPREELVEFGRALGKALEGYE 159 (256) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 99999999976555986687403223566777666752033363567788799835778998999999999999999769 Q ss_pred CEEEEEEEECCCCCCCHHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 75444121014568601201111114676312215556578999999999998864 Q gi|254780185|r 245 KIATFMPKPIASHNGSGMYLNMSIHKGEKAIFTGNQHDRVSLKGLYYLGGIIKHAK 300 (458) Q Consensus 245 ~~ATFmpKP~~~~~GsG~Hih~Sl~~~~~n~f~~~~~~~ls~~~~~fiaGil~h~~ 300 (458) .++.|-+ +|--.|-| .+++|++.+..+..|=.=|++.+. T Consensus 160 rrv~~Ia------SGDlSH~l-----------~~dgPyGy~p~g~~fD~~i~~~l~ 198 (256) T cd07952 160 KRVAVII------SADHAHTH-----------DPDGPYGYSPDAAEYDAAIVEAIE 198 (256) T ss_pred CCEEEEE------ECCCCCCC-----------CCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 8679999------45766566-----------999998889871578999999997 No 41 >pfam11657 Activator-TraM Transcriptional activator TraM. TraM is required for quorum dependence. It binds to and in-activates TraR which controls the replication of the tumour-inducing virulence plasmid. TraM interacts in a two-step process with DNA-TraR to form a large, stable anti-activation complex. Probab=27.85 E-value=39 Score=14.27 Aligned_cols=17 Identities=6% Similarity=-0.031 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHCCCEEE Q ss_conf 99999999874497544 Q gi|254780185|r 232 YKYSVHQVANSYCKIAT 248 (458) Q Consensus 232 ~k~~ik~iA~~~G~~AT 248 (458) +..+|++||.|||...+ T Consensus 5 i~e~I~~IA~KHGIal~ 21 (144) T pfam11657 5 IEELIKEIAAKHGIAVG 21 (144) T ss_pred HHHHHHHHHHHCCCCCC T ss_conf 99999999998192148 No 42 >pfam06877 DUF1260 Protein of unknown function (DUF1260). This family consists of several hypothetical bacterial proteins of around 120 residues in length. The function of this family is unknown. Probab=27.77 E-value=42 Score=14.07 Aligned_cols=90 Identities=12% Similarity=0.168 Sum_probs=57.5 Q ss_pred HHHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHH Q ss_conf 35567998730345225773133089970677777865477666765457777775003655455635524446899999 Q gi|254780185|r 110 AQKAIDYLQTTDIGDTLLLGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNKGYDSHAKSSYTLPPQDKLHDMRSE 189 (458) Q Consensus 110 Lk~~~~~l~~~~~G~~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~e 189 (458) -.++++.|+.. |..+-.-.++||++.-.+ ..-++. T Consensus 5 ~~~vl~~L~e~--G~dl~~~~~Ieh~~~f~~-------------------------------------------~~~~e~ 39 (103) T pfam06877 5 TGEVLAALLED--GDDLSAPYEIEHHFYFED-------------------------------------------EDDLEK 39 (103) T ss_pred HHHHHHHHHHC--CCCCCCCEEEEEEEEECC-------------------------------------------HHHHHH T ss_conf 79999999975--998678877779998099-------------------------------------------999999 Q ss_pred HHHHHHHHCCCCCCCCC--CCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99999860541122213--356212103655566888776999999999999874497 Q gi|254780185|r 190 IVSALNNIGVQITKYHN--QINDAQHSFGLQSESLLHASDNLQKYKYSVHQVANSYCK 245 (458) Q Consensus 190 i~~~l~~~Gi~ie~~h~--E~gpGQ~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G~ 245 (458) +...+.+.|..|+.... |-+.+-|.+.+.....+... .+.-.-.-+-.+|.++|- T Consensus 40 ~a~~~~~~gy~v~~~~~~~ed~~~~~~~~~~~~~~~~~~-~I~~~~~~l~~lA~~~~g 96 (103) T pfam06877 40 AAEEAFKLGYEVEDAEELEEDGDGVWCLDVSREMTLDYE-DIDAQEEELEALAEKFGG 96 (103) T ss_pred HHHHHHHCCCEEEECCCCCCCCCCEEEEEEEEECCCCHH-HHHHHHHHHHHHHHHHCC T ss_conf 999999769836664414668984189999995568989-999999999999999395 No 43 >PRK13789 phosphoribosylamine--glycine ligase; Provisional Probab=27.31 E-value=43 Score=14.02 Aligned_cols=96 Identities=8% Similarity=0.097 Sum_probs=55.4 Q ss_pred EEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHH------H-HHHHHHHHHHHHHHH Q ss_conf 257731330899706777778654776667654577777750036554556355244------4-689999999999860 Q gi|254780185|r 125 TLLLGITTEFFVFDNVHCTISPIKSGFALESTEFLQNGNNKGYDSHAKSSYTLPPQD------K-LHDMRSEIVSALNNI 197 (458) Q Consensus 125 ~~~~g~E~EF~Lf~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~y~~~~~~------~-~~~~~~ei~~~l~~~ 197 (458) ..+.|.|+-++.|-+.. ...|... ......... +..|..+.++|+|+-.+.- . ...+++.+++.|.+. T Consensus 192 E~L~G~E~S~~a~~dG~-~~~~lp~---aQDhKR~~d-gD~GPNTGGMGaysP~p~~~~~~~~~i~~~ii~pt~~~l~~~ 266 (426) T PRK13789 192 EFMEGQEASIFAISDGD-SYFLLPA---AQDHKRAFD-GDQGPNTGGMGAYCPAPVITEAILQKVKERIFDPMFDDFRKK 266 (426) T ss_pred ECCCCCCEEEEEEECCC-EEEECCC---CCCCCCCCC-CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 74589844799997299-7786886---444421337-999999998767777866899999999986048999999858 Q ss_pred CCCCCC------CCCCCC-CHHHCCCCCCCCHHHH Q ss_conf 541122------213356-2121036555668887 Q gi|254780185|r 198 GVQITK------YHNQIN-DAQHSFGLQSESLLHA 225 (458) Q Consensus 198 Gi~ie~------~h~E~g-pGQ~Ei~~~~~~~l~a 225 (458) |++... +.++.| |--.|+|..+.||-.. T Consensus 267 g~~y~GvLy~GlMlt~~G~pkVlEfN~RfGDPE~q 301 (426) T PRK13789 267 GHPYRGLLYAGLMISPEGEPKVVEFNCRFGDPETQ 301 (426) T ss_pred CCCCEEEEEEEEEECCCCCEEEEEEEEECCCCHHH T ss_conf 99988676532899269976999988531891688 No 44 >TIGR00397 mauM_napG MauM/NapG family ferredoxin-type protein; InterPro: IPR004494 MauM ferredoxin-type protein is involved in methylamine utilization. NapG ferredoxin-type protein is associated with nitrate reductase activity.; GO: 0006118 electron transport. Probab=27.10 E-value=25 Score=15.51 Aligned_cols=30 Identities=20% Similarity=0.327 Sum_probs=27.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf 025553235567998730345225773133 Q gi|254780185|r 103 NRDPRYTAQKAIDYLQTTDIGDTLLLGITT 132 (458) Q Consensus 103 ~~~PR~~Lk~~~~~l~~~~~G~~~~~g~E~ 132 (458) -.||-..|.+.+...++.|||..++++.|. T Consensus 103 RACPTGAL~~~l~~Ir~ADMGVAVLVgHET 132 (213) T TIGR00397 103 RACPTGALDPLLTDIRKADMGVAVLVGHET 132 (213) T ss_pred CCCCCCCCCCCHHHHHHHCCCEEEEECCEE T ss_conf 568865454110124541471278825601 No 45 >pfam09904 DUF2131 Uncharacterized protein conserved in bacteria (DUF2131). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=26.00 E-value=29 Score=15.07 Aligned_cols=24 Identities=50% Similarity=0.670 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECCCEEE Q ss_conf 55532355679987303452257731330899 Q gi|254780185|r 105 DPRYTAQKAIDYLQTTDIGDTLLLGITTEFFV 136 (458) Q Consensus 105 ~PR~~Lk~~~~~l~~~~~G~~~~~g~E~EF~L 136 (458) =||..++.+++.|. + .|+|+||.- T Consensus 10 mPRRT~QDti~AL~--~------l~I~~~FvQ 33 (67) T pfam09904 10 MPRRTAQDTIAALA--D------IGIECEFVQ 33 (67) T ss_pred CCHHHHHHHHHHHC--C------CCEEEEEEE T ss_conf 97337899998600--3------870899975 No 46 >TIGR01793 cit_synth_euk citrate (Si)-synthase, eukaryotic; InterPro: IPR010109 Citrate synthase 2.3.3.1 from EC is a member of a small family of enzymes that can directly form a carbon-carbon bond without the presence of metal ion cofactors. It catalyses the first reaction in the Krebs' cycle, namely the conversion of oxaloacetate and acetyl-coenzyme A into citrate and coenzyme A. This reaction is important for energy generation and for carbon assimilation. The reaction proceeds via a non-covalently bound citryl-coenzyme A intermediate in a 2-step process (aldol-Claisen condensation followed by the hydrolysis of citryl-CoA). Citrate synthase enzymes are found in two distinct structural types: type I enzymes (found in eukaryotes, Gram-positive bacteria and archaea) form homodimers and have shorter sequences than type II enzymes, which are found in Gram-negative bacteria and are hexameric in structure. In both types, the monomer is composed of two domains: a large alpha-helical domain consisting of two structural repeats, where the second repeat is interrupted by a small alpha-helical domain. The cleft between these domains forms the active site, where both citrate and acetyl-coenzyme A bind. The enzyme undergoes a conformational change upon binding of the oxaloacetate ligand, whereby the active site cleft closes over in order to form the acetyl-CoA binding site . The energy required for domain closure comes from the interaction of the enzyme with the substrate. Type II enzymes possess an extra N-terminal beta-sheet domain, and some type II enzymes are allosterically inhibited by NADH . This entry includes both mitochondrial and peroxisomal forms of citrate synthase. Peroxisomal forms of the enzyme, recognized by the C-terminal targeting motif SKL, act in the glyoxylate cycle. Eukaryotic homologs include a Tetrahymena thermophila citrate synthase that doubles as a filament protein, a putative citrate synthase from Plasmodium falciparum (no TCA cycle), and a methylcitrate synthase from Aspergillus nidulans (Emericella nidulans).; GO: 0004108 citrate (Si)-synthase activity, 0044262 cellular carbohydrate metabolic process. Probab=25.63 E-value=26 Score=15.44 Aligned_cols=52 Identities=19% Similarity=0.157 Sum_probs=37.1 Q ss_pred HHHHHHHHHHCCCCEEEEEEECC-CCCEEEEEEEHH--HHHHHHHHCCCCCCCCCHH Q ss_conf 89999999857983999999789-982523787787--8305588459845665100 Q gi|254780185|r 5 ATSIIQKIAQEKVKFIDLRFTDL-QGKFRHISMNVS--LFNENTLLNGITFDSSSID 58 (458) Q Consensus 5 ~~~i~~~l~~~~I~~V~~~~~Dl-~G~~R~k~ip~~--~f~~~~~~~G~~fd~s~i~ 58 (458) .+.|+++=+|||-+.|.=+-+|+ -|-+||-.-=+. .+... ++||.|-|-||+ T Consensus 16 ~ekvKklr~EHG~~v~gn~t~D~~yGGmRg~kglvwEtSvLDp--EEGIRFRGlsIP 70 (430) T TIGR01793 16 QEKVKKLRKEHGKKVVGNVTVDMVYGGMRGMKGLVWETSVLDP--EEGIRFRGLSIP 70 (430) T ss_pred HHHHHHHHHHCCCCEEEEEEHHHHHCCCCCCCEEEEECCCCCC--CCCEECCCCCCC T ss_conf 6668878752256033012110110443224100220430266--546012788641 No 47 >pfam03851 UvdE UV-endonuclease UvdE. Probab=24.97 E-value=48 Score=13.75 Aligned_cols=67 Identities=10% Similarity=0.060 Sum_probs=39.4 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCH--HHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCC Q ss_conf 99999999986054112221335621--21036555668887769999999999998744975444121014568 Q gi|254780185|r 186 MRSEIVSALNNIGVQITKYHNQINDA--QHSFGLQSESLLHASDNLQKYKYSVHQVANSYCKIATFMPKPIASHN 258 (458) Q Consensus 186 ~~~ei~~~l~~~Gi~ie~~h~E~gpG--Q~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpKP~~~~~ 258 (458) -+..|++.+.+-||.+--+.++.=|= .-|+.+. -.+.+.-.=.-|.++|+++|++.||.|-+|.-.+ T Consensus 45 ~l~~il~~n~~~~I~~yRiSS~lfP~~shp~~~~~------~~~~~~~~l~~iG~~a~~~~~Rls~HPgqf~vL~ 113 (275) T pfam03851 45 NLIRLLHYNIAHGIRLYRLSSSLFPLASHPDVGWK------YETFAAPELREAGELAKEHGHRLTFHPGQFTVLN 113 (275) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCH------HHHHHHHHHHHHHHHHHHCCCEEEECCCCCEECC T ss_conf 99999998887597669814020466778423620------4899999999999999982955874899722169 No 48 >KOG2438 consensus Probab=24.95 E-value=48 Score=13.75 Aligned_cols=164 Identities=11% Similarity=0.098 Sum_probs=87.4 Q ss_pred CCHHHCCCCCCC--CHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHHCCCCCCCEECCCCCCCCH Q ss_conf 621210365556--688877699999999999987449754441210145686012011111146763122155565789 Q gi|254780185|r 209 NDAQHSFGLQSE--SLLHASDNLQKYKYSVHQVANSYCKIATFMPKPIASHNGSGMYLNMSIHKGEKAIFTGNQHDRVSL 286 (458) Q Consensus 209 gpGQ~Ei~~~~~--~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpKP~~~~~GsG~Hih~Sl~~~~~n~f~~~~~~~ls~ 286 (458) |.+=.||.++|+ ++.+||--+.-++.+++..+...|...+ .+=-+.+|+|+...|.-+=....-.+|+- T Consensus 170 GvpLlEiVt~Pdf~~~~eaa~fv~elqlIlr~lgis~g~mee---------G~lRvDvNiSI~~~g~~lG~RvElKNLns 240 (505) T KOG2438 170 GVPLLEIVTKPDFSDGIEAAAFVKELQLILRHLGISSGNMEE---------GALRVDVNISIAPDGSELGTRVELKNLNS 240 (505) T ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHC---------CCEEEEEEEEECCCCCCCEEEEEECCCHH T ss_conf 985278762766212899999999999999996436442203---------84577544676677762323455315513 Q ss_pred HHHHHHHHHHHHHHHCCHHCCCCCHHHHHH-----CCCCCCCEEEEECCCC-CCCEEECCCCCCCCCCEEEECCCCCCCC Q ss_conf 999999999988641000006670014431-----8885212010113688-7640102578876541244046753212 Q gi|254780185|r 287 KGLYYLGGIIKHAKSLNALTNASTNSYKRL-----LTDSQSPTKLTYSTHN-HSASCRIPYENKSDNKSIEIRFPDLSAN 360 (458) Q Consensus 287 ~~~~fiaGil~h~~al~a~~~PtvNSYkRl-----~p~~~ap~~~~WG~~N-R~a~iRip~~~~~~~~riE~R~~da~aN 360 (458) . .+|.--++|= |.|- ..+.|.+.+-.|-..+ ||++.|= +.+-..+|-+--.-= T Consensus 241 i--rsI~~AI~yE-------------i~RQvel~~nGg~i~nETR~wd~~~~rTvsmRd------KEt~~DYRfmPEPnL 299 (505) T KOG2438 241 I--RSISNAIDYE-------------IQRQVELLNNGGAIQNETRGWDSEGGRTVSMRD------KETLQDYRFMPEPNL 299 (505) T ss_pred H--HHHHHHHHHH-------------HHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECC------CCCCCCEECCCCCCC T ss_conf 8--8999988799-------------999999996588255524454356772586135------443101101689999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHC-CCCCCCCHHHHHHHH Q ss_conf 799999999999987885599986655788777713-200017989999998 Q gi|254780185|r 361 PYLAPAAILMAGLDGIAKKIHPGKNIDELSLEEQNT-IPRICISLRESLENL 411 (458) Q Consensus 361 PYL~lAailaAgl~Gi~~~l~p~~~~~~~~~~~~~~-~~~LP~~L~eAl~~l 411 (458) |||.++.- -+++|+..|. +.+.+..+. .+..--++++|+--+ T Consensus 300 Ppl~L~~~---~v~~Vre~LP------eLP~~~~~~L~e~~gl~~k~a~~L~ 342 (505) T KOG2438 300 PPLILHGK---YVDSVREELP------ELPDATREKLVEQYGLSAKDAVTLA 342 (505) T ss_pred CCEEECHH---HHHHHHHHCC------CCCHHHHHHHHHHHCCCHHHHHHHH T ss_conf 87664488---9999997552------3768899999886266545534554 No 49 >KOG1016 consensus Probab=24.83 E-value=28 Score=15.17 Aligned_cols=11 Identities=45% Similarity=0.347 Sum_probs=3.6 Q ss_pred HHHHHHHHHHH Q ss_conf 89999999999 Q gi|254780185|r 184 HDMRSEIVSAL 194 (458) Q Consensus 184 ~~~~~ei~~~l 194 (458) ++++..+..+| T Consensus 420 qe~~~li~~AL 430 (1387) T KOG1016 420 QEAYSLIRSAL 430 (1387) T ss_pred HHHHHHHHHHH T ss_conf 99999999984 No 50 >cd04862 PaeLigD_Pol_like PaeLigD_Pol_like: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD. The LigD Pol domain belongs to the archaeal/eukaryal primase (AEP) superfamily. In prokaryotes, LigD along with Ku is required for non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB). NHEJ-mediated DNA DSB repair is error-prone. PaeLigD is monomeric, containing an N-terminal phosphoesterase module, a central polymerase (Pol) domain, and a C-terminal ATP-dependent ligase domain. It has been suggested that LigD Pol contributes to NHEJ-mediated DNA DSB repair in vivo, by filling in short 5'-overhangs with ribonucleotides; the filled in termini would then be sealed by the associated LigD ligase domain, resulting in short stretches of RNA incorporated into the genomic DNA. The PaeLigD Pol domain in vitro, in a manganese-dependent fashion, catalyzes templated extensions of 5'-overhang duplex DNA, and nontemplated single-nu Probab=24.55 E-value=49 Score=13.70 Aligned_cols=126 Identities=9% Similarity=0.057 Sum_probs=67.5 Q ss_pred HHHHHHHCCCCCCCCCCC----CCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCC-CHHHHH Q ss_conf 999986054112221335----621210365556688877699999999999987449754441210145686-012011 Q gi|254780185|r 191 VSALNNIGVQITKYHNQI----NDAQHSFGLQSESLLHASDNLQKYKYSVHQVANSYCKIATFMPKPIASHNG-SGMYLN 265 (458) Q Consensus 191 ~~~l~~~Gi~ie~~h~E~----gpGQ~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpKP~~~~~G-sG~Hih 265 (458) +-..+..-|++..+.+-. -|-+.-|++.|.+++.-.| ++-.=..+|++-.+.|+. .|...+| .|+||. T Consensus 75 v~lan~~~iE~H~w~s~~~~~~~PD~lv~DLDP~~~~~f~~-v~~~A~~~r~~L~~lgL~------~~~KTSG~kGlHV~ 147 (227) T cd04862 75 LALVQMGVLEFHTWGARIDRLERPDRIVFDLDPGPGVPWKA-VVEAALLVRELLDELGLE------SFVKTSGGKGLHVV 147 (227) T ss_pred HHHHHCCCEEECCCCCCCCCCCCCCEEEEECCCCCCCCHHH-HHHHHHHHHHHHHHCCCC------CCEECCCCCEEEEE T ss_conf 99987295774278777787889998999689987999999-999999999999976995------00042799757999 Q ss_pred HHHCCCCCCCEECCCCCCCCHHHHH---HHHHHHHH-HHHCCHHCCCCCHHHHHHCCCCCCCEEEEECCCCCCCEEECCC Q ss_conf 1111467631221555657899999---99999988-6410000066700144318885212010113688764010257 Q gi|254780185|r 266 MSIHKGEKAIFTGNQHDRVSLKGLY---YLGGIIKH-AKSLNALTNASTNSYKRLLTDSQSPTKLTYSTHNHSASCRIPY 341 (458) Q Consensus 266 ~Sl~~~~~n~f~~~~~~~ls~~~~~---fiaGil~h-~~al~a~~~PtvNSYkRl~p~~~ap~~~~WG~~NR~a~iRip~ 341 (458) +-|... .. -+..+. +||-.++. .|++.-. .|-....--=.++.|..|+|...+=-|. T Consensus 148 vPl~~~--------~~---~~~vr~fa~~~A~~l~~~~Pd~~t~--------~~~k~~R~grVfiDy~qN~~g~T~vapY 208 (227) T cd04862 148 VPLAPR--------AG---WDEVKAFAKALAQHLARTNPDRFVA--------TMGKAKRVGKIFIDYLRNGRGATAVAPY 208 (227) T ss_pred EECCCC--------CC---HHHHHHHHHHHHHHHHHHCCHHHHH--------HHHHHHCCCCEEEECCCCCCCCEEEECC T ss_conf 977899--------99---8999999999999999978334077--------7668738985999751158998377246 Q ss_pred C Q ss_conf 8 Q gi|254780185|r 342 E 342 (458) Q Consensus 342 ~ 342 (458) + T Consensus 209 S 209 (227) T cd04862 209 S 209 (227) T ss_pred C T ss_conf 5 No 51 >COG1702 PhoH Phosphate starvation-inducible protein PhoH, predicted ATPase [Signal transduction mechanisms] Probab=21.79 E-value=55 Score=13.37 Aligned_cols=48 Identities=23% Similarity=0.158 Sum_probs=21.4 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCCCCCCCCHHHHH Q ss_conf 3212799999999999987885599986655788777713200017989999 Q gi|254780185|r 357 LSANPYLAPAAILMAGLDGIAKKIHPGKNIDELSLEEQNTIPRICISLRESL 408 (458) Q Consensus 357 a~aNPYL~lAailaAgl~Gi~~~l~p~~~~~~~~~~~~~~~~~LP~~L~eAl 408 (458) .+-.|||+.|.-+-|--.|.-++|=+..|.- |...++.-||-+|.+-+ T Consensus 153 GTGKTyLava~av~al~~~~v~rIiLtRPaV----EAGEklGfLPGdl~eKv 200 (348) T COG1702 153 GTGKTYLAVAKAVDALGAGQVRRIILTRPAV----EAGEKLGFLPGDLREKV 200 (348) T ss_pred CCCCCHHHHHHHHHHHHCCCHHEEECCCCCH----HCCCCCCCCCCCHHHHC T ss_conf 5697111277665265406412010068313----03642370788234324 No 52 >TIGR02768 TraA_Ti Ti-type conjugative transfer relaxase TraA; InterPro: IPR014136 This entry represents the Ti-type conjugative transfer relaxase TraA. TraA contains domains distinctive of a single strand exonuclease (N-terminus, MobA/MobL, IPR005053 from INTERPRO) as well as a helicase domain (central region, homologous to the corresponding region of the F-type relaxase TraI, IPR014129 from INTERPRO). This protein likely fills the same role as TraI(F), nicking (at the oriT site) and unwinding the coiled plasmid prior to conjugative transfer .. Probab=21.45 E-value=56 Score=13.32 Aligned_cols=53 Identities=9% Similarity=0.063 Sum_probs=39.2 Q ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEE-ECCCCCCCHHHHHHH Q ss_conf 1036555668887769999999999998744975444121-014568601201111 Q gi|254780185|r 213 HSFGLQSESLLHASDNLQKYKYSVHQVANSYCKIATFMPK-PIASHNGSGMYLNMS 267 (458) Q Consensus 213 ~Ei~~~~~~~l~aAD~~~~~k~~ik~iA~~~G~~ATFmpK-P~~~~~GsG~Hih~S 267 (458) |+|.|.. =|..++++-++|.++++.+.++|+.|=|.=- |..+..=-=-|+|+= T Consensus 99 ~~~ALP~--EL~~e~~i~Lar~Fvr~~~~~kG~~aD~~iH~p~~~~~~gNpH~H~l 152 (888) T TIGR02768 99 LEIALPR--ELNEEQRIELARDFVRDHFVEKGMVADWAIHAPSEEDDDGNPHAHLL 152 (888) T ss_pred HHHHCHH--HCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEEEE T ss_conf 8664533--16886668999999999986089578889846777783988668873 No 53 >TIGR00631 uvrb excinuclease ABC, B subunit; InterPro: IPR004807 All proteins in this family for which functions are known are DNA helicases that function in the nucleotide excision repair and are endonucleases that make the 3' incision next to DNA damage. They are part of a pathway requiring UvrA, UvrB, UvrC, and UvrD homologs.; GO: 0009381 excinuclease ABC activity, 0006289 nucleotide-excision repair, 0005737 cytoplasm, 0009380 excinuclease repair complex. Probab=21.06 E-value=49 Score=13.68 Aligned_cols=27 Identities=19% Similarity=0.545 Sum_probs=20.3 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 899999999998605411222133562 Q gi|254780185|r 184 HDMRSEIVSALNNIGVQITKYHNQIND 210 (458) Q Consensus 184 ~~~~~ei~~~l~~~Gi~ie~~h~E~gp 210 (458) .-+-+++.+.+..+||.+.++|+|.-. T Consensus 457 KkMAEdLTdYl~E~Gikv~YLHSeIdt 483 (667) T TIGR00631 457 KKMAEDLTDYLKELGIKVRYLHSEIDT 483 (667) T ss_pred HHHHHHHHHHHHCCCCEEEEECCHHHH T ss_conf 677889999970588379871455789 No 54 >pfam03484 B5 tRNA synthetase B5 domain. This domain is found in phenylalanine-tRNA synthetase beta subunits. Probab=20.51 E-value=58 Score=13.20 Aligned_cols=46 Identities=24% Similarity=0.426 Sum_probs=23.0 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCHHHCCCCCC--CCHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99999998605411222133562121036555--66888776999999999999874497 Q gi|254780185|r 188 SEIVSALNNIGVQITKYHNQINDAQHSFGLQS--ESLLHASDNLQKYKYSVHQVANSYCK 245 (458) Q Consensus 188 ~ei~~~l~~~Gi~ie~~h~E~gpGQ~Ei~~~~--~~~l~aAD~~~~~k~~ik~iA~~~G~ 245 (458) +++.+.|..+|+.++. ..+.+.+.... .|....+| ++-+||+-+|+ T Consensus 22 ~~i~~~L~~lg~~~~~-----~~~~~~v~vPs~R~Di~~~~D-------liEEi~R~~Gy 69 (69) T pfam03484 22 EEIKKILKRLGFKVEE-----DEDTLKVTVPSYRPDILHEVD-------LIEEVARLYGY 69 (69) T ss_pred HHHHHHHHHCCCEEEE-----CCCEEEEECCCCCCCCCCCCH-------HHHHHHHHHCC T ss_conf 9999999986998994-----798699989998576686256-------99999998089 Done!