Query gi|254780503|ref|YP_003064916.1| putative glutamine synthetase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 461 No_of_seqs 147 out of 2566 Neff 7.4 Searched_HMMs 39220 Date Sun May 29 23:23:56 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780503.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00653 GlnA glutamine synth 100.0 0 0 1018.8 22.9 430 18-459 1-486 (486) 2 TIGR03105 gln_synth_III glutam 100.0 0 0 930.7 36.0 419 20-459 1-435 (435) 3 COG0174 GlnA Glutamine synthet 100.0 0 0 910.5 37.4 426 16-460 2-443 (443) 4 PRK09469 glnA glutamine synthe 100.0 0 0 892.3 38.2 433 15-459 1-468 (469) 5 pfam00120 Gln-synt_C Glutamine 100.0 0 0 608.7 17.2 254 124-380 1-257 (257) 6 KOG0683 consensus 100.0 5.8E-40 1.5E-44 282.6 11.6 365 1-417 4-374 (380) 7 COG3968 Uncharacterized protei 99.9 2.5E-22 6.3E-27 166.7 18.3 272 143-422 214-575 (724) 8 pfam03951 Gln-synt_N Glutamine 99.4 7.3E-13 1.9E-17 104.4 8.4 81 28-118 1-84 (84) 9 PRK13517 carboxylate-amine lig 98.7 2.2E-07 5.6E-12 68.4 9.3 198 144-388 8-265 (371) 10 PRK13516 carboxylate-amine lig 98.6 1.3E-06 3.3E-11 63.4 11.6 195 144-382 11-262 (372) 11 TIGR02050 gshA_cyan_rel unchar 98.6 1.2E-06 3.2E-11 63.5 11.3 210 146-396 1-274 (297) 12 PRK13515 carboxylate-amine lig 98.5 3.7E-06 9.3E-11 60.4 12.0 200 144-387 5-261 (381) 13 PRK13518 carboxylate-amine lig 98.4 7.1E-06 1.8E-10 58.5 11.3 195 144-382 12-263 (357) 14 pfam04107 GCS2 Glutamate-cyste 98.1 3.6E-05 9.1E-10 53.9 9.4 196 146-383 2-257 (291) 15 COG2170 Uncharacterized conser 98.0 6.7E-05 1.7E-09 52.1 9.5 214 144-391 2-262 (369) 16 KOG0683 consensus 97.5 0.00046 1.2E-08 46.6 7.6 243 87-386 97-372 (380) 17 COG3572 GshA Gamma-glutamylcys 95.7 0.097 2.5E-06 31.3 8.3 76 195-278 156-232 (456) 18 TIGR03444 gshA_related glutama 91.4 0.83 2.1E-05 25.2 6.3 58 319-383 253-319 (390) 19 pfam08759 DUF1792 Domain of un 81.9 1.6 4E-05 23.3 3.2 83 239-327 69-157 (225) 20 KOG0558 consensus 72.9 3.4 8.6E-05 21.2 2.8 57 188-260 245-302 (474) 21 pfam06877 DUF1260 Protein of u 65.6 10 0.00026 18.0 7.7 93 128-253 2-96 (103) 22 PRK09444 pntB pyridine nucleot 56.7 15 0.00038 17.0 4.2 52 222-277 305-357 (459) 23 TIGR01515 branching_enzym 1,4- 53.6 16 0.00042 16.6 7.3 107 83-209 85-217 (608) 24 PRK11410 hypothetical protein; 51.6 13 0.00032 17.4 2.5 30 108-138 128-160 (570) 25 cd01141 TroA_d Periplasmic bin 49.1 19 0.00047 16.3 3.0 28 260-288 159-186 (186) 26 TIGR01436 glu_cys_lig_pln glut 48.0 20 0.00051 16.1 6.9 180 128-332 5-223 (446) 27 TIGR02048 gshA_cyano glutamate 47.2 21 0.00053 16.0 4.9 269 92-442 26-320 (377) 28 KOG0257 consensus 46.8 17 0.00043 16.6 2.5 49 108-159 174-222 (420) 29 cd06184 flavohem_like_fad_nad_ 45.1 17 0.00045 16.5 2.4 28 222-252 143-170 (247) 30 pfam04468 PSP1 PSP1 C-terminal 44.7 23 0.00057 15.7 3.2 59 192-255 25-83 (88) 31 COG1702 PhoH Phosphate starvat 43.0 24 0.00061 15.6 3.0 72 363-435 151-224 (348) 32 PRK13289 nitric oxide dioxygen 41.9 20 0.0005 16.1 2.2 29 221-252 290-318 (399) 33 pfam11679 DUF3275 Protein of u 41.2 21 0.00055 15.9 2.3 36 109-145 163-201 (211) 34 pfam09909 DUF2138 Uncharacteri 38.5 20 0.0005 16.1 1.8 30 108-138 117-149 (552) 35 pfam03851 UvdE UV-endonuclease 37.6 29 0.00074 15.0 5.7 71 194-268 45-115 (275) 36 TIGR01434 glu_cys_ligase gluta 37.3 27 0.00068 15.3 2.3 16 356-373 324-339 (518) 37 pfam02233 PNTB NAD(P) transhyd 36.5 30 0.00077 14.9 4.0 53 222-278 310-363 (464) 38 PHA01807 hypothetical protein 35.6 22 0.00055 15.9 1.6 42 189-230 94-135 (145) 39 pfam03484 B5 tRNA synthetase B 34.0 33 0.00084 14.7 3.2 47 195-253 21-69 (69) 40 pfam00763 THF_DHG_CYH Tetrahyd 31.6 36 0.00092 14.4 3.5 97 118-264 3-99 (117) 41 PRK05996 motB flagellar motor 29.7 39 0.00098 14.2 2.6 46 268-316 311-359 (431) 42 TIGR01663 PNK-3'Pase polynucle 29.0 40 0.001 14.1 3.5 48 22-72 207-259 (526) 43 COG3364 Zn-ribbon containing p 28.4 16 0.00042 16.6 0.0 27 200-227 52-78 (112) 44 TIGR02812 fadR_gamma fatty aci 27.8 29 0.00073 15.1 1.2 24 123-146 39-63 (275) 45 COG1602 Uncharacterized conser 27.8 42 0.0011 14.0 2.5 135 196-335 183-345 (402) 46 smart00874 B5 tRNA synthetase 27.8 42 0.0011 14.0 3.3 48 196-253 22-71 (71) 47 TIGR01793 cit_synth_euk citrat 27.4 15 0.00039 16.8 -0.3 31 17-47 16-47 (430) 48 TIGR01647 ATPase-IIIA_H plasma 27.0 43 0.0011 13.9 5.7 101 124-261 434-538 (835) 49 COG4326 Spo0M Sporulation cont 26.1 45 0.0011 13.8 2.0 40 190-229 152-198 (270) 50 PHA02105 hypothetical protein 25.9 41 0.001 14.0 1.7 39 2-43 17-55 (68) 51 TIGR02793 nikR nickel-responsi 25.9 13 0.00033 17.3 -0.9 30 241-274 61-90 (130) 52 COG0296 GlgB 1,4-alpha-glucan 25.0 47 0.0012 13.7 7.8 109 84-210 94-231 (628) 53 PRK11920 rirA iron-responsive 24.9 36 0.00091 14.4 1.2 103 191-314 37-140 (153) 54 TIGR02630 xylose_isom_A xylose 23.8 49 0.0013 13.5 2.8 41 6-49 71-111 (437) 55 PRK10853 hypothetical protein; 23.7 49 0.0012 13.5 1.7 36 12-54 8-43 (118) 56 pfam09845 DUF2072 Zn-ribbon co 23.7 25 0.00064 15.4 0.3 12 216-227 82-93 (121) 57 cd00674 LysRS_core_class_I Thi 23.5 50 0.0013 13.5 5.8 147 109-267 20-176 (354) 58 COG4716 Myosin-crossreactive a 22.5 52 0.0013 13.4 3.5 83 186-269 221-314 (587) 59 PRK13895 conjugal transfer pro 22.4 52 0.0013 13.3 1.8 16 240-255 5-20 (144) 60 PTZ00326 phenylalanyl-tRNA syn 22.2 53 0.0013 13.3 4.1 41 118-158 231-278 (505) 61 COG1282 PntB NAD/NADP transhyd 22.1 53 0.0013 13.3 4.2 33 242-277 326-362 (463) 62 PTZ00260 glycosyl transferase 22.1 33 0.00084 14.7 0.6 25 267-291 156-180 (336) 63 COG3465 Uncharacterized conser 22.0 53 0.0014 13.3 4.1 86 123-247 26-116 (171) 64 KOG1534 consensus 22.0 51 0.0013 13.4 1.5 28 298-325 147-174 (273) 65 pfam08532 Glyco_hydro_42M Beta 21.6 54 0.0014 13.2 3.9 54 191-244 28-81 (207) 66 TIGR00151 ispF 2C-methyl-D-ery 21.3 55 0.0014 13.2 2.2 58 84-146 37-95 (159) 67 PRK02308 uvsE putative UV dama 21.1 55 0.0014 13.2 5.7 25 243-267 96-120 (316) 68 TIGR00631 uvrb excinuclease AB 21.1 35 0.00088 14.5 0.5 59 44-103 275-347 (667) 69 PRK05784 phosphoribosylamine-- 21.1 55 0.0014 13.2 5.0 91 144-234 203-316 (485) 70 TIGR00092 TIGR00092 GTP-bindin 21.0 56 0.0014 13.2 1.8 32 16-47 51-83 (390) 71 KOG4116 consensus 21.0 26 0.00066 15.3 -0.1 48 292-348 25-81 (90) 72 pfam08777 RRM_3 RNA binding mo 21.0 56 0.0014 13.2 2.4 35 13-47 11-46 (102) 73 cd00948 FBP_aldolase_I_a Fruct 20.0 58 0.0015 13.0 2.9 70 118-213 142-215 (330) No 1 >TIGR00653 GlnA glutamine synthetase, type I; InterPro: IPR004809 Glutamine synthetase type I (or glutamate-ammonia ligase) has a dodecameric form, which can be subdivided into 1-alpha and 1-beta forms. The phylogeny of the 1-alpha and 1-beta forms appears polyphyletic. Escherichia coli, Synechocystis sp. (strain PCC 6803) , Aquifex aeolicus, and the crenarcheon Sulfolobus acidocaldarius have form 1-beta, while Bacillus subtilis, Thermotoga maritima, and various euryarchaea has form 1-alpha. The 1-beta dodecamer from the crenarcheon S. acidocaldarius differs from that in E. coli in that it is not regulated by adenylation. ; GO: 0004356 glutamate-ammonia ligase activity, 0009399 nitrogen fixation, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=1018.82 Aligned_cols=430 Identities=31% Similarity=0.484 Sum_probs=398.9 Q ss_pred HHHHHHHHH-CCCCEEEEEEECCCCCEEEEEEEHHHHH------HCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 999999971-7983999999788983312676198873------015677361000002427743466555778887789 Q gi|254780503|r 18 EQAAKWLKD-NRIEDVECITPDLAGIPRGKMMPSSKFI------SNIPLTVPSAIYRHTISGESLHSPYLFHNERDGDLR 90 (461) Q Consensus 18 ~~v~~~l~~-~~I~~V~v~~~Dl~G~~RgK~vp~~~f~------~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~D~~ 90 (461) +++.+++++ ++||||+++|+|+.|.++.-.||++.+. +.... +..|||+++.| +...++|||+ T Consensus 1 ~~~~~~~keG~~VkfV~l~F~Di~G~~~~~~iP~~~~~~i~~~~~~fe~-------G~~FDGSSi~G---f~~IeeSDM~ 70 (486) T TIGR00653 1 EEVLKLIKEGENVKFVDLRFTDIKGKLQHVEIPASALDDIDELEEAFEE-------GIMFDGSSIRG---FQGIEESDML 70 (486) T ss_pred CCHHHHHCCCCCEEEEEEEEECCCCCEEEEEEEEECCHHHHHHHHHHHC-------CCCCCCCCCCC---CCCHHHHCCC T ss_conf 9202365078805799999863788410168883132117888888642-------63446612057---4661110220 Q ss_pred EEEECCCCEECCCCCCC--EEEEEEECCCC-CCCCCCCCHHHHHHHHHHHHH--CCCCCEEECCCCCEEEEECCCCCCCC Q ss_conf 99600124538788877--09999740335-752123425777899999861--02764023224417998216788887 Q gi|254780503|r 91 LSPDLSTLSIVPWEMDP--TAQIVCDVVDY-NGKEVTYTPRNVLKRVLNFYA--QNELKPIIAPEIEFYLIAKNEDPDYP 165 (461) Q Consensus 91 l~PD~~Tl~~lPw~~~~--~a~vl~~~~~~-dG~p~~~~PR~iLkr~i~~l~--~~G~~~~~g~E~EF~L~~~~~~~~~~ 165 (461) |+||++|++++||..++ +++|+|++|++ +++|++.|||++|||++++|+ ..|.+..+|.|.||||||+.+..... T Consensus 71 l~PDp~T~~i~Pw~~~~~~~~~~ic~vY~p~~~~~y~rDPR~~A~rae~~L~~~~~g~~~~fGPEpEFFlFd~~~~~~~~ 150 (486) T TIGR00653 71 LKPDPSTAVIDPWRAEKKKTLRVICDVYEPVTGEPYERDPRSIAKRAEEYLKSTGIGDTAYFGPEPEFFLFDSIEFGSLS 150 (486) T ss_pred CCCCCCCEEECCCCCCCCCEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCEEEECCCCCCCCCC T ss_conf 10273631674531688857999988730777732788847899999998621467737786587312100546578655 Q ss_pred -----------------CCCCCC----CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCCCC-CCCE Q ss_conf -----------------778778----888776678777766544289999989976786311022101012786-4201 Q gi|254780503|r 166 -----------------LQPPKG----RSGRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEGPA-QFEI 223 (461) Q Consensus 166 -----------------~~~~~~----~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpG-Q~Ei 223 (461) ...+.+ ....+..+++||.+.+.|...+++++|+..|+++|+.||..|||+|+| |.|| T Consensus 151 n~~~~~~d~~eG~WN~~r~~e~g~~~n~g~~~~~kgGYF~v~P~D~~~D~R~e~v~~L~~~G~~~E~~HHEVA~aGQ~Ei 230 (486) T TIGR00653 151 NSSYYEVDSEEGAWNTGREYEGGSELNSGYKPRDKGGYFPVPPTDTAVDIRREMVLTLEQLGLDVEVHHHEVATAGQHEI 230 (486) T ss_pred CCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCEEE T ss_conf 54421144443555868446888861246034777777587689888899999999998659828886010147874253 Q ss_pred EECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCHHHHHHHHHHHCC----CCCCCCCCCCC---CCHHH Q ss_conf 112665211378999999999999863066033112321135662689998763136----87423358865---52156 Q gi|254780503|r 224 NIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMAKPMQNYPGSAMHIHQSVLDIK----SNQNIFSNPDG---SETDN 296 (461) Q Consensus 224 ~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmpKP~~~~~GsG~H~h~SL~d~~----~g~n~f~~~~~---~ls~~ 296 (461) +++|..++++||++++||++||.||++||++|||||||+.|+||||||+|+||| | +|+|+|+|+++ +||++ T Consensus 231 d~kf~~l~~~AD~~~~yKyvvK~vA~~~Gk~ATFMPKPlfGdNGSGMH~H~SLW--K~ePf~GenlF~g~~~~y~GLSe~ 308 (486) T TIGR00653 231 DFKFDTLLKTADDIQTYKYVVKNVAKKHGKTATFMPKPLFGDNGSGMHCHQSLW--KGEPFDGENLFAGEEGEYAGLSET 308 (486) T ss_pred ECCHHHHHHHHCCEEEHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCEEEEEEE--ECCCCCCCEECCCCCCCCCCHHHH T ss_conf 046478987405045147899899998388797458788772668554835563--277888840006858876543588 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCCCCCCCCEEEEECCC--CCCCCEEEEECCCCCCCHHHHHH Q ss_conf 6654577764223212220766101344204655754334243345003650223--88652035303587799899999 Q gi|254780503|r 297 FRYFIGGLQKYIPNALVMLAPYGNSYRRLVLDMGCPVNNAWGYDNRTTAFRIPFS--DAKTKRIENRLPSSDTNPYLVLA 374 (461) Q Consensus 297 ~~~fiaGil~h~~al~a~~~PtvNSYkRl~p~~~ap~~~~WG~~NR~~~iRip~~--~~~~~riE~R~~d~~aNPYL~la 374 (461) |+|||||||+|||||+||+||||||||||+||++||++++||..||||+||||.+ +++++|||+|+|||+||||||+| T Consensus 309 ALyyIGGiLkHA~AL~A~tNPTvNSYKRLVPGYEAPv~lays~~NRSA~iRIP~~~~n~kA~R~E~R~PDPs~NPYLAFA 388 (486) T TIGR00653 309 ALYYIGGILKHAKALAAFTNPTVNSYKRLVPGYEAPVYLAYSARNRSALIRIPASRGNPKATRIEVRFPDPSANPYLAFA 388 (486) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCCCCEEEECCEECCCCEEEEEEECCCCCCCHHHHHH T ss_conf 88875208777788889765660552236886121023021677875258740164088336998707887656789999 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCC---------CHHCCCCCHHHHHHHHHCC---HHHHHHCCHHHHHHHHHHHH-HH Q ss_conf 99999999898468887433453000---------3110478989999998659---89887355899999999999-99 Q gi|254780503|r 375 ASLACGLLGILQKIEPTPPTMKSANQ---------EAINLPRGLLEAVTLLENT---SHFKDIFGSQFIDMYANLKR-NE 441 (461) Q Consensus 375 ail~agl~Gi~~~l~p~~~~~~~~~~---------~~~~LP~sL~eAl~~l~~d---~~l~~~lg~~~v~~~~~~k~-~E 441 (461) |+|+||||||+|||+||+|++.|.|+ +.+.||.||.|||++|++| .+++++||++|++.|+++|| .| T Consensus 389 A~LmAGLDGIKnKI~PgeP~d~n~Y~l~peE~~e~GI~~lP~~L~eAL~~Le~D~Ef~~~~~vfg~~~~e~fie~Kr~~E 468 (486) T TIGR00653 389 ALLMAGLDGIKNKIDPGEPVDKNLYELSPEELRELGIPQLPGSLEEALDELESDHEFLVLKGVFGEEFIEAFIELKREKE 468 (486) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHCCCCCHHHCCCCHHHHHHHHHHHHHHCC T ss_conf 99986201200145878731043335898999738901105679999999731462002203203789999987653025 Q ss_pred HHHHHCCCCHHHHHHHHH Q ss_conf 999863899789987631 Q gi|254780503|r 442 FETFMQVISPWEREFLLL 459 (461) Q Consensus 442 ~~~~~~~vt~wE~~~yl~ 459 (461) |++|+..|||||+.+||. T Consensus 469 ~~~~r~~phP~Ef~~Y~~ 486 (486) T TIGR00653 469 WDDYRLRPHPWEFEKYYD 486 (486) T ss_pred CCHHCCCCCCHHHHCCCC T ss_conf 100036774012201159 No 2 >TIGR03105 gln_synth_III glutamine synthetase, type III. This family consists of the type III isozyme of glutamine synthetase, originally described in Rhizobium meliloti, where types I and II also occur. Probab=100.00 E-value=0 Score=930.70 Aligned_cols=419 Identities=32% Similarity=0.470 Sum_probs=378.7 Q ss_pred HHHHHHHCCCCEEEEEEECCCCCEEEEEEEHHHHHHCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCE Q ss_conf 99999717983999999788983312676198873015677361000002427743466555778887789996001245 Q gi|254780503|r 20 AAKWLKDNRIEDVECITPDLAGIPRGKMMPSSKFISNIPLTVPSAIYRHTISGESLHSPYLFHNERDGDLRLSPDLSTLS 99 (461) Q Consensus 20 v~~~l~~~~I~~V~v~~~Dl~G~~RgK~vp~~~f~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~D~~l~PD~~Tl~ 99 (461) +.+++++++|+||+++|+|++|++|||+||+++|.+....|+... +.++.+ ....++++|++++||++|++ T Consensus 1 ~~~~i~~~~I~~v~~~~~D~~G~~RgK~vp~~~~~~~~~~G~~f~-------~~~~~~--~~~~~~~~D~~~~PD~~Tl~ 71 (435) T TIGR03105 1 LQALARDKGIKYFLASFVDLHGVQKAKLVPAEAIDHMATGGAGFA-------GFAAWG--LGQSPADPDLMAIPDLSSLT 71 (435) T ss_pred CHHHHHHCCCEEEEEEEECCCCCEEEEEEEHHHHHHHHHCCCCCE-------EECHHH--CCCCCCCCCEEEEECCCCEE T ss_conf 966885589859999999999957899986999999985899710-------124321--37899988689997578307 Q ss_pred ECCCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 38788877099997403357521234257778999998610276402322441799821678888777877888877667 Q gi|254780503|r 100 IVPWEMDPTAQIVCDVVDYNGKEVTYTPRNVLKRVLNFYAQNELKPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILG 179 (461) Q Consensus 100 ~lPw~~~~~a~vl~~~~~~dG~p~~~~PR~iLkr~i~~l~~~G~~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~ 179 (461) ++||. +++++|+|+++ .||+|++.|||++|++++++++++|+++++|+|+|||||+++.+...... . ...... T Consensus 72 ~lPw~-p~~a~v~~d~~-~dG~p~~~~PR~~L~~~~~~~~~~G~~~~~g~ElEF~l~~~~~~~~~~~~---~--~~~~~~ 144 (435) T TIGR03105 72 QLPWQ-PGVAWVAADLH-VNGKPYPQAPRVVLKRQLAEAAELGLTLNTGVECEFFLLRRDEDGSLSIA---D--RADTLA 144 (435) T ss_pred ECCCC-CCEEEEEEEEC-CCCCCCCCCHHHHHHHHHHHHHHCCCCEEECCCEEEEEEECCCCCCCCCC---C--CCCCCC T ss_conf 78888-98489999974-08988886979999999999996599289767627999821688985567---8--777677 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECC Q ss_conf 87777665442899999899767863110221010127864201112665211378999999999999863066033112 Q gi|254780503|r 180 GQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMA 259 (461) Q Consensus 180 ~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmp 259 (461) .++|+...+++.++++++|++.|+++||+||++|||+|||||||++.|+++|+|||+++++|++||+||++||++||||| T Consensus 145 ~~~~~~~~~~~~~~~~~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~l~~~~~l~aAD~~~~~K~~ik~iA~~~G~~ATFMp 224 (435) T TIGR03105 145 KPCYDQRGLMRRYDVLTEISDAMNALGWDPYQNDHEDANGQFEMNFTYADALTTADRHAFFRYMVKEIAEKHGMRATFMP 224 (435) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECC T ss_conf 88667200101679999999999977997188664657981699726799899999999999999999998099899703 Q ss_pred CCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC----CCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCC------C Q ss_conf 32113566268999876313687423358865----5215666545777642232122207661013442046------5 Q gi|254780503|r 260 KPMQNYPGSAMHIHQSVLDIKSNQNIFSNPDG----SETDNFRYFIGGLQKYIPNALVMLAPYGNSYRRLVLD------M 329 (461) Q Consensus 260 KP~~~~~GsG~H~h~SL~d~~~g~n~f~~~~~----~ls~~~~~fiaGil~h~~al~a~~~PtvNSYkRl~p~------~ 329 (461) ||+.+.+|||||+|+|||| ++|+|+|+++++ +||+.+++||||||+|+|++++|+|||+||||||+|+ + T Consensus 225 KP~~~~~GsG~HiH~SL~d-~~g~n~f~d~~~~~~~glS~~~~~fiaGlL~~~~~l~a~~~Pt~NSYkRl~~~~~~~~~~ 303 (435) T TIGR03105 225 KPFADLTGNGCHFHLSLWD-EDGRNLFADDSDPNGLGLSKLAYHFIGGILHHAPALCAVLAPTVNSYKRLNAPRTTSGAT 303 (435) T ss_pred CCCCCCCCCEEEEEEEEEC-CCCCEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCCCCCCCCCCC T ss_conf 1378999870038986775-999760368987665554599999999999861323150147765211156886565765 Q ss_pred CCCCCCCCCCCCCCEEEEECCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC------HH Q ss_conf 575433424334500365022388652035303587799899999999999998984688874334530003------11 Q gi|254780503|r 330 GCPVNNAWGYDNRTTAFRIPFSDAKTKRIENRLPSSDTNPYLVLAASLACGLLGILQKIEPTPPTMKSANQE------AI 403 (461) Q Consensus 330 ~ap~~~~WG~~NR~~~iRip~~~~~~~riE~R~~d~~aNPYL~laail~agl~Gi~~~l~p~~~~~~~~~~~------~~ 403 (461) |||++++||++||+++||||.+ +|||+|++|++|||||++||+|+|||+||+++++|++|+++|++.+ .+ T Consensus 304 waP~~~~WG~~NR~~~iRip~~----~riE~R~~da~aNPYL~lAailaAGl~Gi~~~l~p~~p~~~~~~~~~~~~~~~~ 379 (435) T TIGR03105 304 WAPNFISYGGNNRTHMVRIPDP----GRFELRLADGAANPYLAQAAILAAGLDGIERKLDPGPPRDINLYAEELAARGVE 379 (435) T ss_pred CCCCEEEECCCCCCCEEEECCC----CEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHCCCC T ss_conf 5762023135776550782378----713644899887989999999998887886589999977788655630126856 Q ss_pred CCCCCHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 04789899999986598988735589999999999999999863899789987631 Q gi|254780503|r 404 NLPRGLLEAVTLLENTSHFKDIFGSQFIDMYANLKRNEFETFMQVISPWEREFLLL 459 (461) Q Consensus 404 ~LP~sL~eAl~~l~~d~~l~~~lg~~~v~~~~~~k~~E~~~~~~~vt~wE~~~yl~ 459 (461) +||+||.|||++|++|++++++||++||+.|+++|++||++|++.|||||+++||+ T Consensus 380 ~LP~sL~eAl~~l~~~~~l~~~lg~~~v~~y~~~K~~E~~~~~~~vt~wE~~~Yle 435 (435) T TIGR03105 380 TLPQNLLEALRALEADPLLAEALGAEFVDEFLKLKRQEWEEYHRHVSDWEIDRYLD 435 (435) T ss_pred CCCCCHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCC T ss_conf 66889999999997590987854999999999999999999977999999985149 No 3 >COG0174 GlnA Glutamine synthetase [Amino acid transport and metabolism] Probab=100.00 E-value=0 Score=910.45 Aligned_cols=426 Identities=39% Similarity=0.636 Sum_probs=389.7 Q ss_pred CHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEEEHHH----HHHCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 9899999997179839999997889833126761988----730156773610000024277434665557788877899 Q gi|254780503|r 16 NWEQAAKWLKDNRIEDVECITPDLAGIPRGKMMPSSK----FISNIPLTVPSAIYRHTISGESLHSPYLFHNERDGDLRL 91 (461) Q Consensus 16 ~~~~v~~~l~~~~I~~V~v~~~Dl~G~~RgK~vp~~~----f~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~D~~l 91 (461) +.+++.+++++++|++|+++|+|++|++|||++|+++ +...... +..|+|+++.+ +...++||+++ T Consensus 2 ~~~~~~~~~~~~~v~~v~~~f~D~~G~~r~k~~p~~~~~~~~~~~~~~-------g~~fdgss~~g---~~~i~~sDm~l 71 (443) T COG0174 2 TKEDVLKLLKENGVKFVDLRFTDLNGVLRGKTIPAEKPVSVLAQLFEG-------GVVFDGSSIAG---FEGIGESDMVL 71 (443) T ss_pred CHHHHHHHHHHCCCEEEEEEEECCCCCEEEEEEECCCCHHHHHHHHCC-------CCCCCCCCCCC---CCCCCCCCEEE T ss_conf 178899998728936999999779997622698665532677764305-------86768864124---67789888699 Q ss_pred EEECCCCEECCCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCC-EEECCCCCEEEEECCCCCCCCCCCCC Q ss_conf 96001245387888770999974033575212342577789999986102764-02322441799821678888777877 Q gi|254780503|r 92 SPDLSTLSIVPWEMDPTAQIVCDVVDYNGKEVTYTPRNVLKRVLNFYAQNELK-PIIAPEIEFYLIAKNEDPDYPLQPPK 170 (461) Q Consensus 92 ~PD~~Tl~~lPw~~~~~a~vl~~~~~~dG~p~~~~PR~iLkr~i~~l~~~G~~-~~~g~E~EF~L~~~~~~~~~~~~~~~ 170 (461) +||++|++++||..+++|+|+|++++++|+|++.|||++|+|++++++++|+. +++|+|+|||||+.+...... T Consensus 72 ~Pd~~T~~~~Pw~~~~ta~v~cdv~~~~g~p~~~dPR~vlkr~~~~l~~~G~~~~~~g~E~EFfLfd~~~~~~~~----- 146 (443) T COG0174 72 KPDLSTLVVDPWREGPTARVLCDVYDPDGTPYPRDPRSVLKRALARLKDEGLAPAVVGPELEFFLFDRDGRDPDG----- 146 (443) T ss_pred EECCCCEEECCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEECCEEEEEEECCCCCCCC----- T ss_conf 626675276777788727999998879999589883999999999998668854302024169996036677666----- Q ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 88887766787777665442899999899767863110221010127864201112665211378999999999999863 Q gi|254780503|r 171 GRSGRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFK 250 (461) Q Consensus 171 ~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~ 250 (461) +++..++++|++.++++.++++++|+..|+++||++|++|||+||||+||++++++++++||++++||+++|+||++ T Consensus 147 ---~~~~~~g~yf~~~~~~~~~d~~~~i~~~l~~~Gi~ie~~hhEva~gQ~EI~~~~~~~l~~AD~~~~~K~vvk~vA~~ 223 (443) T COG0174 147 ---GRPADKGGYFDVAPLDEAEDFRRDIVEALEAAGIEIEAIHHEVAPGQFEINLRFDDALKAADQIVIFKYVVKEVAEK 223 (443) T ss_pred ---CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf ---76688776467654222789999999999987998263014505885578337878899989999999999999999 Q ss_pred HCCCEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH-HC Q ss_conf 06603311232113566268999876313687423358865--5215666545777642232122207661013442-04 Q gi|254780503|r 251 HDIYATFMAKPMQNYPGSAMHIHQSVLDIKSNQNIFSNPDG--SETDNFRYFIGGLQKYIPNALVMLAPYGNSYRRL-VL 327 (461) Q Consensus 251 ~G~~atFmpKP~~~~~GsG~H~h~SL~d~~~g~n~f~~~~~--~ls~~~~~fiaGil~h~~al~a~~~PtvNSYkRl-~p 327 (461) ||++|||||||+.+.+|||||+|+||||.+|+ |+|+++++ ++|+.++|||||||+|+++++||+|||||||||| +| T Consensus 224 hG~~aTFMpKP~~g~~GsGmH~H~Sl~~~dg~-n~F~d~~~~~~lS~~~~~~igGilkha~~~~ai~~PtvNSYkRl~vp 302 (443) T COG0174 224 HGLTATFMPKPFFGDNGSGMHVHQSLWDKDGG-NLFADEDGYAGLSETALHFIGGILKHAPALTAITAPTVNSYKRLGVP 302 (443) T ss_pred HCCEEEEECCCCCCCCCCCEEEEEEEECCCCC-CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHCCCC T ss_conf 59859994787789997730489877648997-52048888525799999999999987898676757775104214898 Q ss_pred CCCCCCCCCCCCCCCCEEEEECCC--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCH--- Q ss_conf 655754334243345003650223--886520353035877998999999999999989846888743345300031--- Q gi|254780503|r 328 DMGCPVNNAWGYDNRTTAFRIPFS--DAKTKRIENRLPSSDTNPYLVLAASLACGLLGILQKIEPTPPTMKSANQEA--- 402 (461) Q Consensus 328 ~~~ap~~~~WG~~NR~~~iRip~~--~~~~~riE~R~~d~~aNPYL~laail~agl~Gi~~~l~p~~~~~~~~~~~~--- 402 (461) ..|||++++||.+|||++||||.+ +++++|||+|+||++|||||++||+|+|||+||++|++|++|+++|.|+.. T Consensus 303 ~e~AP~~~~wg~~NRsa~iRIP~~~~~~~~~RiE~R~pdp~aNPYLa~AaiL~agldGI~~ki~p~~p~~~n~y~~~~~e 382 (443) T COG0174 303 YEWAPTYIAWGVRNRSASVRIPASGANGKARRVEFRVPDPDANPYLAFAAILAAGLDGIENKLEPGEPVDGNLYELSPEE 382 (443) T ss_pred CCCCCCHHCCCCCCCCEEEEECCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHH T ss_conf 55586111003477515799578887777644576479998898999999999999777547999874456432289545 Q ss_pred ---HCCCCCHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHC Q ss_conf ---1047898999999865989887355899999999999999998638997899876317 Q gi|254780503|r 403 ---INLPRGLLEAVTLLENTSHFKDIFGSQFIDMYANLKRNEFETFMQVISPWEREFLLLN 460 (461) Q Consensus 403 ---~~LP~sL~eAl~~l~~d~~l~~~lg~~~v~~~~~~k~~E~~~~~~~vt~wE~~~yl~~ 460 (461) +.||.||.|||++|+.+++++++||++|++.|+++|+.||++|...||+||+++||.. T Consensus 383 ~~~~~LP~sL~eAl~~l~~~~~~~~~lge~~~~~yi~~K~~E~~~~~~~v~~wE~~~yl~~ 443 (443) T COG0174 383 RKEPTLPASLREALDALEDSEFLREALGEDFIDAYIALKRAEWEEFRSRVTPWEFERYLSV 443 (443) T ss_pred HCCCCCCCCHHHHHHHHHHCHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHC T ss_conf 2335676269999998620568887628499999999999999999725999999988529 No 4 >PRK09469 glnA glutamine synthetase; Provisional Probab=100.00 E-value=0 Score=892.29 Aligned_cols=433 Identities=28% Similarity=0.427 Sum_probs=383.6 Q ss_pred CCHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEEEHHHHHHC-CCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 998999999971798399999978898331267619887301-5677361000002427743466555778887789996 Q gi|254780503|r 15 KNWEQAAKWLKDNRIEDVECITPDLAGIPRGKMMPSSKFISN-IPLTVPSAIYRHTISGESLHSPYLFHNERDGDLRLSP 93 (461) Q Consensus 15 ~~~~~v~~~l~~~~I~~V~v~~~Dl~G~~RgK~vp~~~f~~~-~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~D~~l~P 93 (461) .++|+|+++|++++|+||+++|+|++|++|+|.||+++|.+. ...| ..|+|+++.+ +...+++|++++| T Consensus 1 ms~eevl~~i~e~~I~~V~~~f~Dl~G~~r~k~Ip~~~~~~~~~~~G-------~~fdGSsv~g---~~~i~~sD~~l~P 70 (469) T PRK09469 1 MSAEHVLTMLNEHEVKFVDLRFTDTKGKEQHVTIPAHQVNADFFEEG-------KMFDGSSIGG---WKGINESDMVLMP 70 (469) T ss_pred CCHHHHHHHHHHCCCEEEEEEEECCCCCCCEEEECHHHHHHHHHHCC-------CCCCCCCCCC---CCCCCCCCEEEEE T ss_conf 99899999999889859999998799885406984899509998589-------7535544035---5667777779998 Q ss_pred ECCCCEECCCCCCCEEEEEEECCCCCC-CCCCCCHHHHHHHHHHHHHCCCC--CEEECCCCCEEEEECCCCCCC------ Q ss_conf 001245387888770999974033575-21234257778999998610276--402322441799821678888------ Q gi|254780503|r 94 DLSTLSIVPWEMDPTAQIVCDVVDYNG-KEVTYTPRNVLKRVLNFYAQNEL--KPIIAPEIEFYLIAKNEDPDY------ 164 (461) Q Consensus 94 D~~Tl~~lPw~~~~~a~vl~~~~~~dG-~p~~~~PR~iLkr~i~~l~~~G~--~~~~g~E~EF~L~~~~~~~~~------ 164 (461) |++|++++||..+.++.|+||++++++ .|++.|||++|||++++++++|+ ++++|.|+|||||+....... T Consensus 71 D~~T~~v~Pw~~~~~~~v~cdi~~~~~~~~~~~dPR~iLkr~~~~l~~~G~~~~~~~g~E~EFyLfd~~~~~~~~~~~~~ 150 (469) T PRK09469 71 DASTAVLDPFFADSTLIIRCDILEPGTMQGYDRDPRSIAKRAEDYLRSTGIADTVLFGPEPEFFLFDDIRFGTSISGSHV 150 (469) T ss_pred CCCCCEECCCCCCCEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEECCCEEEEEECCCCCCCCCCCCCE T ss_conf 77767888665886199999998878876478897999999999999759963012314338998426444666666612 Q ss_pred CC-----------CCCCCCCC-CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCC-CCCCCEEECCCCCH Q ss_conf 77-----------78778888-7766787777665442899999899767863110221010127-86420111266521 Q gi|254780503|r 165 PL-----------QPPKGRSG-RSILGGQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEG-PAQFEINIRHGDPL 231 (461) Q Consensus 165 ~~-----------~~~~~~~~-~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~g-pGQ~Ei~~~~~~~l 231 (461) .. ....++.+ ++...+++|.+.+.+..++++.+++..|+++||+||+.|||+| |||+||+++|+++| T Consensus 151 ~id~~e~~~~~g~~~~~~~~~~~p~~~~gy~~~~~~d~~~d~~~~i~~~l~~~Gi~ve~~hhE~a~~GQ~Ei~~~~~~~l 230 (469) T PRK09469 151 KIDDIEAAWNSGTQYEGGNKGHRPGVKGGYFPVPPVDSSQDIRSAMCLVMEEMGLVVEAHHHEVATAGQNEVATRFNTMT 230 (469) T ss_pred ECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEECCCCCCCCCEEEEECCCCCHH T ss_conf 02653234567764457766766665676676886432777999999999986997154455677897048971578699 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCC-CCCHHHHHHHHHHHHHHHHH Q ss_conf 13789999999999998630660331123211356626899987631368742335886-55215666545777642232 Q gi|254780503|r 232 KLADQVFLFKRTIREAAFKHDIYATFMAKPMQNYPGSAMHIHQSVLDIKSNQNIFSNPD-GSETDNFRYFIGGLQKYIPN 310 (461) Q Consensus 232 ~aaD~~~~~k~~ik~vA~~~G~~atFmpKP~~~~~GsG~H~h~SL~d~~~g~n~f~~~~-~~ls~~~~~fiaGil~h~~a 310 (461) ++||++++||.+||+||++||+.|||||||+.+.+|||||+|+||| ++|+|+|.+++ +++|+++++||||||+|+++ T Consensus 231 ~aAD~~~~~k~~vk~iA~~~g~~atFmpKP~~~~~Gsg~H~H~Sl~--~~g~n~f~~~~~~~lS~~~~~~iaGll~h~~~ 308 (469) T PRK09469 231 KKADEIQIYKYVVHNVAHAFGKTATFMPKPMVGDNGSGMHCHMSLS--KNGVNLFAGDKYAGLSEQALYYIGGIIKHAKA 308 (469) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCCEEEEEEC--CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 9998999999999999997498788821667999957415898842--79943668998777579999999999987686 Q ss_pred HHHHCCCCCHHHHHHHCCCCCCCCCCCCCCCCCEEEEECC-CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCC Q ss_conf 1222076610134420465575433424334500365022-388652035303587799899999999999998984688 Q gi|254780503|r 311 ALVMLAPYGNSYRRLVLDMGCPVNNAWGYDNRTTAFRIPF-SDAKTKRIENRLPSSDTNPYLVLAASLACGLLGILQKIE 389 (461) Q Consensus 311 l~a~~~PtvNSYkRl~p~~~ap~~~~WG~~NR~~~iRip~-~~~~~~riE~R~~d~~aNPYL~laail~agl~Gi~~~l~ 389 (461) +++|+||||||||||+|++|||++++||.+||+++||||. ++++++|||+|+||++|||||++||+|+|||+||+++++ T Consensus 309 l~a~~~PtvNSYkRl~p~~~aP~~~~Wg~~NR~~~iRip~~~~~~~~riE~R~~d~~aNPYL~~Aa~l~Agl~Gi~~~l~ 388 (469) T PRK09469 309 INALANPTTNSYKRLVPGYEAPVMLAYSARNRSASIRIPVVPSPKARRIEVRFPDPAANPYLCFAALLMAGLDGIKNKIH 388 (469) T ss_pred HHHHHCCCHHHHHHCCCCCCCCCCCEEECCCCCCEEECCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 56576588057641489876887432206888621435767898763367407998879899999999999878863899 Q ss_pred CCCCCCCCCCC-------CHHCCCCCHHHHHHHHHCCHHHHH---HCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 87433453000-------311047898999999865989887---35589999999999999999863899789987631 Q gi|254780503|r 390 PTPPTMKSANQ-------EAINLPRGLLEAVTLLENTSHFKD---IFGSQFIDMYANLKRNEFETFMQVISPWEREFLLL 459 (461) Q Consensus 390 p~~~~~~~~~~-------~~~~LP~sL~eAl~~l~~d~~l~~---~lg~~~v~~~~~~k~~E~~~~~~~vt~wE~~~yl~ 459 (461) |++|+++|+|+ +.++||.||.|||++|++|+.+.. +||++||++|+++||+||++|++.|||||+++||. T Consensus 389 p~~~~~~~~y~~~~~~~~~~p~lP~sl~eAl~~le~d~~~~~~~g~fg~~~v~~~~~~Kr~E~~~~~~~vt~wE~~~Y~~ 468 (469) T PRK09469 389 PGEAMDKNLYDLPPEEAAEIPQVAGSLEEALNALDADREFLTAGGVFTDDAIDAYIALRREEVDRVRMTPHPVEFELYYS 468 (469) T ss_pred CCCCCCCCCCCCCHHHHCCCCCCCCCHHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCC T ss_conf 99877776021997786367778447999999987298989865527599999999999999999977899899976148 No 5 >pfam00120 Gln-synt_C Glutamine synthetase, catalytic domain. Probab=100.00 E-value=0 Score=608.65 Aligned_cols=254 Identities=40% Similarity=0.611 Sum_probs=235.0 Q ss_pred CCCHHHHHHHHHHHHHCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 34257778999998610276402322441799821678888777877888877667877776654428999998997678 Q gi|254780503|r 124 TYTPRNVLKRVLNFYAQNELKPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWKFSE 203 (461) Q Consensus 124 ~~~PR~iLkr~i~~l~~~G~~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~ 203 (461) +.|||++|||++++++++|+++++|+|+|||||+.+.+...+... .........+++|++...+..++|+++|++.|+ T Consensus 1 P~~PR~~Lk~~~~~~~~~Gl~~~~g~E~EF~l~~~~~~~~~~~~~--~~~~~~~~~~~~y~~~~~~~~~~~~~~i~~~l~ 78 (257) T pfam00120 1 PRDPRSILKRALARLAELGYTPYFGPEQEFFLFDDVRPGGGPPGG--YPDPRGPYYGGYFPVAPLDEARDIRRDIVKALE 78 (257) T ss_pred CCCHHHHHHHHHHHHHHCCCEEEEEECEEEEEEECCCCCCCCCCC--CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 989799999999999976995979956687998577777677668--879888767775777630318999999999999 Q ss_pred HHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 63110221010127864201112665211378999999999999863066033112321135662689998763136874 Q gi|254780503|r 204 KQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMAKPMQNYPGSAMHIHQSVLDIKSNQ 283 (461) Q Consensus 204 ~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmpKP~~~~~GsG~H~h~SL~d~~~g~ 283 (461) .+||+||++|+|+|||||||++.|+++|+|||+++++|++||+||++||+.|||||||+.+.+|||||+|+|||| ++|+ T Consensus 79 ~~Gi~ve~~~~E~gpgQ~Ei~l~~~~~l~aAD~~~~~k~~ik~vA~~~G~~atFmpKP~~~~~GsG~HiH~SL~d-~~g~ 157 (257) T pfam00120 79 AAGIPVEGIHHEVAPGQHEIDFRFADALEAADNLQLFKYVVKRVAEKHGLTATFMPKPIFGDNGSGMHTHQSLWD-KDGK 157 (257) T ss_pred HCCCCCEEEECCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEEEEEEEEC-CCCC T ss_conf 779981886657579857996254649888699999999999999983967997035457999852179986675-8897 Q ss_pred CCCCCCCC--CCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCCCCCCCCEEEEECCC-CCCCCEEEE Q ss_conf 23358865--521566654577764223212220766101344204655754334243345003650223-886520353 Q gi|254780503|r 284 NIFSNPDG--SETDNFRYFIGGLQKYIPNALVMLAPYGNSYRRLVLDMGCPVNNAWGYDNRTTAFRIPFS-DAKTKRIEN 360 (461) Q Consensus 284 n~f~~~~~--~ls~~~~~fiaGil~h~~al~a~~~PtvNSYkRl~p~~~ap~~~~WG~~NR~~~iRip~~-~~~~~riE~ 360 (461) |+|+++++ ++|+++++||||||+|++++++|++||+||||||+|++|||++++||++||+++||||.+ +++++|||+ T Consensus 158 n~f~~~~~~~~lS~~~~~fiaGlL~h~~~l~a~~~Pt~NSYkRl~p~~~ap~~~~WG~~NR~a~iRip~~~~~~~~riE~ 237 (257) T pfam00120 158 NLFADGDGYAGLSETARHFIGGILKHAPAITAYTAPTVNSYKRLVPGTEAPVYIAWGVRNRSASIRIPRGGGPKAGRVED 237 (257) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHCCCCCCCCCCCEEECCCCCCCEEECCCCCCCCCEEEE T ss_conf 62468887422309999997638775687625368997656515799748666776156786117867789987746786 Q ss_pred ECCCCCCCHHHHHHHHHHHH Q ss_conf 03587799899999999999 Q gi|254780503|r 361 RLPSSDTNPYLVLAASLACG 380 (461) Q Consensus 361 R~~d~~aNPYL~laail~ag 380 (461) |++|++|||||++||||+|| T Consensus 238 R~~da~aNPYL~lAailaAG 257 (257) T pfam00120 238 RLPDPDANPYLALAALLAAG 257 (257) T ss_pred CCCCCCCCHHHHHHHHHHCC T ss_conf 78878878999999998668 No 6 >KOG0683 consensus Probab=100.00 E-value=5.8e-40 Score=282.64 Aligned_cols=365 Identities=19% Similarity=0.234 Sum_probs=285.6 Q ss_pred CCCCCCCCCCHHHCCCHHHHHHHHHH-CCCCEEEEEEECCCC-CEEEEEEEHHHHHHCCCCCCCCCEEECCCCCCCCCCC Q ss_conf 98777777895558998999999971-798399999978898-3312676198873015677361000002427743466 Q gi|254780503|r 1 MNKNDQSPMNVRGVKNWEQAAKWLKD-NRIEDVECITPDLAG-IPRGKMMPSSKFISNIPLTVPSAIYRHTISGESLHSP 78 (461) Q Consensus 1 ~~~~~~~~m~~~~~~~~~~v~~~l~~-~~I~~V~v~~~Dl~G-~~RgK~vp~~~f~~~~~~g~~~~~~~~~~~g~~~~~~ 78 (461) |+++++++++ +......-.|.. .+-..+.++|+|-.| .+|+|.-..+.-.+ .+.-+-.+.++|++..-. T Consensus 4 l~~~~~~~~~----~~~~~~~~~L~~~~~kv~a~YVwidg~ge~~rsk~rt~d~~~~-----~~~~lp~wnydgsst~QA 74 (380) T KOG0683 4 LAQEKRTRLS----KTAILEYLYLRAKRKKVQAEYVWIDGTGENLRSKTRTLDAEPS-----SISELPIWNYDGSSTGQA 74 (380) T ss_pred HHCCCCCCCC----HHHHHHHCCCCCCCCEEEEEEEEECCCCCCCHHHCCCCCCCCC-----CCCCCCCCCCCCCCCCCC T ss_conf 2217321220----3344554045566762799999965756530431322467756-----600076630468636656 Q ss_pred CCCCCCCCCCEEEEEECCCCEECCCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCCEEEEEC Q ss_conf 55577888778999600124538788877099997403357521234257778999998610276402322441799821 Q gi|254780503|r 79 YLFHNERDGDLRLSPDLSTLSIVPWEMDPTAQIVCDVVDYNGKEVTYTPRNVLKRVLNFYAQNELKPIIAPEIEFYLIAK 158 (461) Q Consensus 79 ~~~~~~~~~D~~l~PD~~Tl~~lPw~~~~~a~vl~~~~~~dG~p~~~~PR~iLkr~i~~l~~~G~~~~~g~E~EF~L~~~ 158 (461) ...+||.+++|. .+.+.|+..++.-+|+|++++.++.|.+..-|....+++...+-..-.+++|.|.|+.+++. T Consensus 75 ----~g~nSd~~l~Pv--a~~~dPfr~g~Nilv~c~~~~~~~~P~~tn~R~~c~~~~~~~~~~~~~PWfg~Eqeyt~l~~ 148 (380) T KOG0683 75 ----PGENSDVYLRPV--AIYPDPFRNGNNILVMCDTYDFDGKPTETNKRVACARIMPKLSTKDTEPWFGMEQEYTLLDA 148 (380) T ss_pred ----CCCCCCEEEEEE--HCCCCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCC T ss_conf ----678875178630--00278555899779994336678996642210347887434552345875335678776504 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHH--HHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHH Q ss_conf 678888777877888877667877776654428--999998997678631102210101278642011126652113789 Q gi|254780503|r 159 NEDPDYPLQPPKGRSGRSILGGQSYSIMGINEF--DEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQ 236 (461) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~--~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~ 236 (461) . ..++..++ ..+++.+++.+|+..+.++. .++.+.-+.+|...||.+-..+.|++|||||+.+.|+.++.|+|| T Consensus 149 ~--~~~p~gwp--~~GFp~Pqgpyyc~VGad~~~~rdiveahy~acLyaGl~i~G~N~EvmPgQwEfqvGp~~GI~~gD~ 224 (380) T KOG0683 149 L--DGHPFGWP--KGGFPGPQGPYYCGVGADRVFGRDIVEAHYRACLYAGLNISGINVEVMPGQWEFQVGPCEGISMGDQ 224 (380) T ss_pred C--CCCCCCCC--CCCCCCCCCCCEECCCCCCCCCCHHHHHHHHHHHHHHHEECCCCCCCCCCEEEEEECCHHCCCCCHH T ss_conf 5--68846477--6688888777000026630124123430688877403023363231347504885445001122004 Q ss_pred HHHHHHHHHHHHHHHCCCEEECCCCCCCC-CCHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99999999999863066033112321135-66268999876313687423358865521566654577764223212220 Q gi|254780503|r 237 VFLFKRTIREAAFKHDIYATFMAKPMQNY-PGSAMHIHQSVLDIKSNQNIFSNPDGSETDNFRYFIGGLQKYIPNALVML 315 (461) Q Consensus 237 ~~~~k~~ik~vA~~~G~~atFmpKP~~~~-~GsG~H~h~SL~d~~~g~n~f~~~~~~ls~~~~~fiaGil~h~~al~a~~ 315 (461) ++++|+++++||+++|+.|||-|||..+. +|+|||.++|--.+ +.+ |..++.... T Consensus 225 lw~aR~il~rVae~~Gviasf~pKp~~g~WngaG~Htn~ST~~m--------------------r~~----~g~~~i~~a 280 (380) T KOG0683 225 LWMARYILHRVAEKFGVIASFDPKPILGDWNGAGCHTNFSTKEM--------------------REA----GGLKIIEEA 280 (380) T ss_pred HHHHHHHHHHHHHHHCEEEEECCCCCCCCCCCCCCCCCCCHHHH--------------------HHC----CCHHHHHHH T ss_conf 78999999999998491687258887776667652232021666--------------------605----778999998 Q ss_pred CCCCHHHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 76610134420465575433424334500365022388652035303587799899999999999998984688874334 Q gi|254780503|r 316 APYGNSYRRLVLDMGCPVNNAWGYDNRTTAFRIPFSDAKTKRIENRLPSSDTNPYLVLAASLACGLLGILQKIEPTPPTM 395 (461) Q Consensus 316 ~PtvNSYkRl~p~~~ap~~~~WG~~NR~~~iRip~~~~~~~riE~R~~d~~aNPYL~laail~agl~Gi~~~l~p~~~~~ 395 (461) +|.+++|+|..+..|. .+||.+|+....+ .++.+.+.++-..-+++||||.++.+.+++.+|..+.-.| ... T Consensus 281 ~~~ls~rh~~hi~~yd---p~~G~dN~rrltg---~hEt~~i~~Fs~GvAnr~~siri~r~va~~~~Gy~edrrP--~sN 352 (380) T KOG0683 281 IPKLSKRHREHIAAYD---PKGGKDNERRLTG---RHETGSIDNFSWGVANRNPSIRIPRTVAAEGKGYFEDRRP--SSN 352 (380) T ss_pred HHHHHHHHHHHHHHCC---CCCCCCCHHHHCC---CCCCCCCCCCCCCCCCCCCEEEECHHHHCCCCCCCCCCCC--CCC T ss_conf 6420023044233128---5678633455318---8865552200010346786166213121366531245898--678 Q ss_pred CCCCC-CHHCCCCCHHHHHHHHH Q ss_conf 53000-31104789899999986 Q gi|254780503|r 396 KSANQ-EAINLPRGLLEAVTLLE 417 (461) Q Consensus 396 ~~~~~-~~~~LP~sL~eAl~~l~ 417 (461) -+.|. ....+|.+|.+++. ++ T Consensus 353 ~Dpy~Vt~~~~~t~l~~~~~-~~ 374 (380) T KOG0683 353 CDPYAVTLMIIPTTLLEAVE-LE 374 (380) T ss_pred CCCCEEEHHHHHHHHHCCCC-HH T ss_conf 87320107776677753420-33 No 7 >COG3968 Uncharacterized protein related to glutamine synthetase [General function prediction only] Probab=99.91 E-value=2.5e-22 Score=166.73 Aligned_cols=272 Identities=24% Similarity=0.257 Sum_probs=193.4 Q ss_pred CCEEECCCCCEEEEECCCCCCCCCCCCCCCC--CCCCCCCCC----CCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCC Q ss_conf 6402322441799821678888777877888--877667877----7766544289999989976786311022101012 Q gi|254780503|r 143 LKPIIAPEIEFYLIAKNEDPDYPLQPPKGRS--GRSILGGQS----YSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEE 216 (461) Q Consensus 143 ~~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~--~~~~~~~~~----y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~ 216 (461) .-..+|.|+|+||++++.....+...-.|++ +.++..|+- |--.-.++...|++++-..|.++||++.+-|.|+ T Consensus 214 V~s~~GaEQEYFlvd~~~~~~RpDLi~tGRTLFGa~ppkGQEldDHYFGaipeRV~~FM~Dve~~LyaLGIpaKTrHNEV 293 (724) T COG3968 214 VFSNVGAEQEYFLVDKKSYDERPDLIFTGRTLFGAPPPKGQELDDHYFGAIPERVSAFMKDVEKELYALGIPAKTRHNEV 293 (724) T ss_pred HCCCCCCCCEEEEECHHHCCCCCCEEEECHHHCCCCCCCCCCCCCHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 13677843136886110103485626624021289999876122000124479999999889999998088533345655 Q ss_pred CCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCHHH Q ss_conf 78642011126652113789999999999998630660331123211356626899987631368742335886552156 Q gi|254780503|r 217 GPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMAKPMQNYPGSAMHIHQSVLDIKSNQNIFSNPDGSETDN 296 (461) Q Consensus 217 gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmpKP~~~~~GsG~H~h~SL~d~~~g~n~f~~~~~~ls~~ 296 (461) +||||||+.-+.++..|+|+..+..++.|.+|++||+..-.+-|||.|.+|||-|.|.|+-. .+|.|++... +..++. T Consensus 294 APgQfEIApife~~N~A~DhqQL~M~vLk~tA~KhGlVCLLHEKPFAGiNGSGKH~NWSmGt-d~g~NLLdPg-D~PhdN 371 (724) T COG3968 294 APGQFEIAPIFESGNLATDHQQLVMEVLKKTALKHGLVCLLHEKPFAGINGSGKHNNWSMGT-DDGLNLLDPG-DMPHDN 371 (724) T ss_pred CCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCCCCC-CCCCCC T ss_conf 87752441001156533207999999999988762558886158756757887756454146-8775657999-998756 Q ss_pred HH--HHHH----HHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCC----------------------------------- Q ss_conf 66--5457----776422321222076610134420465575433----------------------------------- Q gi|254780503|r 297 FR--YFIG----GLQKYIPNALVMLAPYGNSYRRLVLDMGCPVNN----------------------------------- 335 (461) Q Consensus 297 ~~--~fia----Gil~h~~al~a~~~PtvNSYkRl~p~~~ap~~~----------------------------------- 335 (461) .+ -|.+ ++-++.+-+-+-.+.--|. .||..+--.|.-+ T Consensus 372 ~QFL~Fc~AvIkaVdkY~~LlRa~~a~AsND-hRLGANEAPPAI~SVflGdqLedifEqi~~G~~~ssk~~g~mdLg~~v 450 (724) T COG3968 372 KQFLLFCTAVIKAVDKYADLLRASAANASND-HRLGANEAPPAIISVFLGDQLEDIFEQIEKGKATSSKGNGKMDLGISV 450 (724) T ss_pred CEEEHHHHHHHHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCEEEEECCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHH T ss_conf 0000215889987888799999987515775-345567799612676403069999999964877666667632443120 Q ss_pred ------CCCCCCCCEEEEECCCCCCCCEEEEECCCCC---CCHHHHHHHHHHHHHHHHHHCCC--------CCCC----- Q ss_conf ------4243345003650223886520353035877---99899999999999998984688--------8743----- Q gi|254780503|r 336 ------AWGYDNRTTAFRIPFSDAKTKRIENRLPSSD---TNPYLVLAASLACGLLGILQKIE--------PTPP----- 393 (461) Q Consensus 336 ------~WG~~NR~~~iRip~~~~~~~riE~R~~d~~---aNPYL~laail~agl~Gi~~~l~--------p~~~----- 393 (461) .-|..||+.++-. ...+||+|.++++ +-|-.+|.++++-.|.-|..+++ +... T Consensus 451 lP~v~kdAgDRNRTSPFAF-----TGNkFEFRavgSSqSvs~P~tVLN~~vAesl~~i~drLe~kl~kg~dld~ai~~vl 525 (724) T COG3968 451 LPAVEKDAGDRNRTSPFAF-----TGNKFEFRAVGSSQSVSEPNTVLNVIVAESLSEIADRLEGKLRKGSDLDAAIEAVL 525 (724) T ss_pred CCCCCCCCCCCCCCCCCEE-----CCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 5566445565567887303-----06613685278654456645788899999999998998877632465788999999 Q ss_pred -----------CCCCCCCC--------HHCCC--CCHHHHHHHHHCCHHH Q ss_conf -----------34530003--------11047--8989999998659898 Q gi|254780503|r 394 -----------TMKSANQE--------AINLP--RGLLEAVTLLENTSHF 422 (461) Q Consensus 394 -----------~~~~~~~~--------~~~LP--~sL~eAl~~l~~d~~l 422 (461) ..||.|.+ ..-|| ++--+|+.+|.+++-+ T Consensus 526 ~et~k~h~~vvFnGnGYSeeWhkeAekeRGL~NlrTT~dA~~~L~~~kn~ 575 (724) T COG3968 526 TETVKDHKKVVFNGNGYSEEWHKEAEKERGLPNLRTTVDATKALIADKNV 575 (724) T ss_pred HHHHHHHCEEEECCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHCCHHHH T ss_conf 99998616687579875489999999853863035467788875032678 No 8 >pfam03951 Gln-synt_N Glutamine synthetase, beta-Grasp domain. Probab=99.43 E-value=7.3e-13 Score=104.44 Aligned_cols=81 Identities=28% Similarity=0.465 Sum_probs=69.1 Q ss_pred CCCEEEEEEECCCCCEEEEEEEHHHHH-HCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCEECCCCCC Q ss_conf 983999999788983312676198873-0156773610000024277434665557788877899960012453878887 Q gi|254780503|r 28 RIEDVECITPDLAGIPRGKMMPSSKFI-SNIPLTVPSAIYRHTISGESLHSPYLFHNERDGDLRLSPDLSTLSIVPWEMD 106 (461) Q Consensus 28 ~I~~V~v~~~Dl~G~~RgK~vp~~~f~-~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~D~~l~PD~~Tl~~lPw~~~ 106 (461) +|++|+++|+|+.|.++..++|++++. +....|+ .|||+++.+ +...++||++|+||++|+.++||..+ T Consensus 1 ~v~~v~~~F~D~~G~~~~~~~~~~~~~~~~~~~g~-------~FDGSSi~G---~~~i~~SDm~L~Pd~~t~~~dP~~~~ 70 (84) T pfam03951 1 GVKFVDLRFTDILGRLKHVRIPATELDEDAFEEGI-------GFDGSSIEG---FAPINESDMYLKPDPSTAYIDPFRPD 70 (84) T ss_pred CCEEEEEEEECCCCCEEEEEEEHHHHCHHHHHCCC-------EEECHHEEC---CCCCCCCCEEEEECCCCEEECCCCCC T ss_conf 97899999978999738999868992877875795-------073000320---00378776699657997188330589 Q ss_pred C--EEEEEEECCCC Q ss_conf 7--09999740335 Q gi|254780503|r 107 P--TAQIVCDVVDY 118 (461) Q Consensus 107 ~--~a~vl~~~~~~ 118 (461) . +++|+|+++++ T Consensus 71 ~~~~~~v~Cdv~~p 84 (84) T pfam03951 71 PGKTARVICDVYDP 84 (84) T ss_pred CCCEEEEEEECCCC T ss_conf 98789999973088 No 9 >PRK13517 carboxylate-amine ligase; Provisional Probab=98.66 E-value=2.2e-07 Score=68.44 Aligned_cols=198 Identities=16% Similarity=0.177 Sum_probs=118.8 Q ss_pred CEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCE Q ss_conf 40232244179982167888877787788887766787777665442899999899767863110221010127864201 Q gi|254780503|r 144 KPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEI 223 (461) Q Consensus 144 ~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei 223 (461) .+.+|+|.||+|++..... ......+++..+..... -..+++|....|.|+ T Consensus 8 ~~TlGvEeE~~LVD~~t~~------------------------l~~~a~~vl~~~~~~~~-----~~~v~~El~~~~iEi 58 (371) T PRK13517 8 RPTLGVEEEFLLVDPVTGE------------------------LSPRATEVLAEAGENGP-----EPRLQKELLRNTVEV 58 (371) T ss_pred CCCCEEEEEEEEECCCCCC------------------------CCCCHHHHHHHHHHCCC-----CCCCCHHHHHHEEEE T ss_conf 9961603214334478678------------------------56644999997343047-----767575655016886 Q ss_pred EECCC-CCHHHHHHHHHHHHHHHHHHHHHCCCEEECCC-CCCCC-------------------------CCHHHHHHHHH Q ss_conf 11266-52113789999999999998630660331123-21135-------------------------66268999876 Q gi|254780503|r 224 NIRHG-DPLKLADQVFLFKRTIREAAFKHDIYATFMAK-PMQNY-------------------------PGSAMHIHQSV 276 (461) Q Consensus 224 ~~~~~-~~l~aaD~~~~~k~~ik~vA~~~G~~atFmpK-P~~~~-------------------------~GsG~H~h~SL 276 (461) +..++ +.-++.+++...|..+.+.|.++|+...-..- |+... .=+|||||+.+ T Consensus 59 ~t~~~~~~~el~~~L~~~r~~l~~aA~~~G~~l~a~GthP~~~~~~~~~t~~~RY~~l~~~~~~~~~~~~i~G~HVHVgv 138 (371) T PRK13517 59 VTGVCDTVGEAREDLRRTRAVARRAADRRGARLAAAGTHPFSPWSDQPLTDNPRYRELIERTQWWAREQLICGVHVHVGV 138 (371) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHEEEEEECC T ss_conf 68777999999999999999999999985997872267788886557789647899999999999987614025798347 Q ss_pred HHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC---------CHHHHHHH----CCCC-CCCCCCCCCCCC Q ss_conf 313687423358865521566654577764223212220766---------10134420----4655-754334243345 Q gi|254780503|r 277 LDIKSNQNIFSNPDGSETDNFRYFIGGLQKYIPNALVMLAPY---------GNSYRRLV----LDMG-CPVNNAWGYDNR 342 (461) Q Consensus 277 ~d~~~g~n~f~~~~~~ls~~~~~fiaGil~h~~al~a~~~Pt---------vNSYkRl~----p~~~-ap~~~~WG~~NR 342 (461) -| + +.+-..+-.+.-++|.+.|+.+.+ .-|||..+ |..- .|.+-+|..-.+ T Consensus 139 ~d----------~-----e~av~v~n~lr~~lP~lLALSAnSPf~~G~DTG~aS~R~~~f~~~P~~G~p~~f~s~~~y~~ 203 (371) T PRK13517 139 PS----------R-----EKAVPVLNRLRPWLPHLLALSANSPFWRGADTGYASNRAMLFQQWPTAGPPPQFGSWAEYEA 203 (371) T ss_pred CC----------H-----HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCHHHHHH T ss_conf 98----------8-----99999999999998999998438953068667634579999974888899965489899999 Q ss_pred CE-----------------EEEECCCCCCCCEEEEECCCCCCCHH--HHHHHHHHHHHHHHHHCC Q ss_conf 00-----------------36502238865203530358779989--999999999999898468 Q gi|254780503|r 343 TT-----------------AFRIPFSDAKTKRIENRLPSSDTNPY--LVLAASLACGLLGILQKI 388 (461) Q Consensus 343 ~~-----------------~iRip~~~~~~~riE~R~~d~~aNPY--L~laail~agl~Gi~~~l 388 (461) -. -|| .+++-.++|+|++|+...+= +++||++.|-..-+.+.+ T Consensus 204 ~v~~l~~~G~i~d~~~iwWdvR---Ps~~~~TvE~Rv~D~~~~~~d~~aiaal~~aLv~~~~~~~ 265 (371) T PRK13517 204 FVADLLKTGVILDEGMVYWDIR---PSPNFGTVEVRVADVPLTLRETVALAALVRCLVVTADRRL 265 (371) T ss_pred HHHHHHHCCCCCCCCCEEEEEC---CCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999986397287882798616---6788885235405888999999999999999999999888 No 10 >PRK13516 carboxylate-amine ligase; Provisional Probab=98.60 E-value=1.3e-06 Score=63.38 Aligned_cols=195 Identities=14% Similarity=0.124 Sum_probs=123.5 Q ss_pred CEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCE Q ss_conf 40232244179982167888877787788887766787777665442899999899767863110221010127864201 Q gi|254780503|r 144 KPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEI 223 (461) Q Consensus 144 ~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei 223 (461) .+.+|+|-||.|++..... ......+++..+... ..-..+++|.-..|.|+ T Consensus 11 ~~T~GVEeE~~LvD~~t~~------------------------l~~~a~~vl~~~~~~-----~~~~~v~~El~~~qvEi 61 (372) T PRK13516 11 PFTLGVELELQVVNPHDYD------------------------LTQASSDLLRAVKNQ-----ITAGEIKPEITESMIEI 61 (372) T ss_pred CCCCEEEEEEEEECCCCCC------------------------CCCCHHHHHHHHCCC-----CCCCCCCHHHHHHEEEE T ss_conf 9821656668776489756------------------------675429999971134-----35665674765134784 Q ss_pred EECCC-CCHHHHHHHHHHHHHHHHHHHHHCCCEEECC-CCCCC-------------------------CCCHHHHHHHHH Q ss_conf 11266-5211378999999999999863066033112-32113-------------------------566268999876 Q gi|254780503|r 224 NIRHG-DPLKLADQVFLFKRTIREAAFKHDIYATFMA-KPMQN-------------------------YPGSAMHIHQSV 276 (461) Q Consensus 224 ~~~~~-~~l~aaD~~~~~k~~ik~vA~~~G~~atFmp-KP~~~-------------------------~~GsG~H~h~SL 276 (461) +..++ +.-++.+++...|..+.++|.++|+...--. .||.. ..=+|||||+.+ T Consensus 62 ~T~v~~~~~e~~~~L~~~R~~l~~~A~~~g~~l~a~GthP~~~~~~~~it~~~RY~~l~~~~~~~a~~~~i~G~HVHVgv 141 (372) T PRK13516 62 ATGVCRDIDQALGQLSAMRDVLVQAADDLNLGICGGGTHPFQQWQRQRICDNPRFQYLSELYGYLAKQFTVFGQHVHIGC 141 (372) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCEEEEECC T ss_conf 68366999999999999999999999982987763378988875547789627899999999999998846102688457 Q ss_pred HHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC---------CHHHHHHH----CCCC-CCCCCCCCCCCC Q ss_conf 313687423358865521566654577764223212220766---------10134420----4655-754334243345 Q gi|254780503|r 277 LDIKSNQNIFSNPDGSETDNFRYFIGGLQKYIPNALVMLAPY---------GNSYRRLV----LDMG-CPVNNAWGYDNR 342 (461) Q Consensus 277 ~d~~~g~n~f~~~~~~ls~~~~~fiaGil~h~~al~a~~~Pt---------vNSYkRl~----p~~~-ap~~~~WG~~NR 342 (461) -| .+.+-..+..|.-++|.|.||.+.+ .-|||... |..- .|.+-+|..=.+ T Consensus 142 ~d---------------~e~av~v~n~lr~~LP~LLALSAnSPFw~G~DTG~aS~R~~~~~~~P~aG~pp~f~s~~~y~~ 206 (372) T PRK13516 142 PS---------------GDDAIYLLHGLSRYVPHFIALSASSPYVQGVDTGFASARLNSVSAFPLSGRAPFVLNWQEFEA 206 (372) T ss_pred CC---------------HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHH T ss_conf 97---------------888999999988887899998428932057758750024657862878898998699999999 Q ss_pred CE-------EEEEC-------CCCCCCCEEEEECCCCCCCHH--HHHHHHHHHHHH Q ss_conf 00-------36502-------238865203530358779989--999999999999 Q gi|254780503|r 343 TT-------AFRIP-------FSDAKTKRIENRLPSSDTNPY--LVLAASLACGLL 382 (461) Q Consensus 343 ~~-------~iRip-------~~~~~~~riE~R~~d~~aNPY--L~laail~agl~ 382 (461) .. .|.=+ -.+++-..+|+|++|...++= +++||++.|-.. T Consensus 207 ~v~~L~~tG~i~d~~~i~WdvRPs~~~pTvEvRv~D~~~~~~~~~~iAaL~raLv~ 262 (372) T PRK13516 207 YFRKMSYTGVIDSMKDFYWDIRPKPEFGTVEVRVMDTPLTVERAAAIAGYIQALAR 262 (372) T ss_pred HHHHHHHCCCCCCCCCEEEEECCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHH T ss_conf 99999871873767825898336899996035403477999999999999999999 No 11 >TIGR02050 gshA_cyan_rel uncharacterized enzyme; InterPro: IPR011793 This family represents a division of a larger family, the other branch of which is predicted to act as glutamate--cysteine ligase (the first of two enzymes in glutathione biosynthesis) in the cyanobacteria. Species containing this protein, however, are generally not believed to make glutathione, and the function is unknown.. Probab=98.59 E-value=1.2e-06 Score=63.47 Aligned_cols=210 Identities=18% Similarity=0.189 Sum_probs=137.2 Q ss_pred EECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-HHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCEE Q ss_conf 23224417998216788887778778888776678777766544-28999998997678631102210101278642011 Q gi|254780503|r 146 IIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGIN-EFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEIN 224 (461) Q Consensus 146 ~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~-~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~ 224 (461) .+|+|.||+|+|..+. ++.+.. +.-.++.+..+.+..- =.=..+.+|.--.|+|+. T Consensus 1 TlGvEeE~~LVD~~~~----------------------d~~~~~~~~~~~~~~~~~~~~~~-PGGy~~~~E~~~~~vE~a 57 (297) T TIGR02050 1 TLGVEEELLLVDPHTY----------------------DLAASSDEAAALVGECREKLGED-PGGYEFKKELFESQVELA 57 (297) T ss_pred CCHHEEEHHHCCCCCC----------------------CCCCCCHHHHHHHHCCCCCCCCC-CCCCCCCHHHHHCEEEEE T ss_conf 9402000212055678----------------------87666104899986077411368-886320422221444465 Q ss_pred ECCC-CCHHHHHHHHHHHHHHHHHHHHHCCCEEECC-CCCCC-----------------CCC--------HHHHHHHHHH Q ss_conf 1266-5211378999999999999863066033112-32113-----------------566--------2689998763 Q gi|254780503|r 225 IRHG-DPLKLADQVFLFKRTIREAAFKHDIYATFMA-KPMQN-----------------YPG--------SAMHIHQSVL 277 (461) Q Consensus 225 ~~~~-~~l~aaD~~~~~k~~ik~vA~~~G~~atFmp-KP~~~-----------------~~G--------sG~H~h~SL~ 277 (461) ...+ +.-+|.-.+.-.|..+++.|..+|+...=.+ -||.. .-| .|+|||+++- T Consensus 58 t~v~~~~~eA~~~~~~~r~~l~~~A~~~G~~~~~~gtHPf~~W~~~~~~~~~rY~~~~e~~gy~~~~~Lv~G~HVHVgv~ 137 (297) T TIGR02050 58 TPVCTTLAEALAELREVRARLVEAASDLGLRIAAAGTHPFARWREQELAENPRYQRLLERYGYVARQSLVFGLHVHVGVP 137 (297) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCCCCHHHHHHHHHCCEEEHHEEECCEEEEECCC T ss_conf 68447878999999999999999986648636513467753331477786555899998618000001321134770327 Q ss_pred HCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC---------HHHHHHH----CCCC-CCCCCCCCC---- Q ss_conf 136874233588655215666545777642232122207661---------0134420----4655-754334243---- Q gi|254780503|r 278 DIKSNQNIFSNPDGSETDNFRYFIGGLQKYIPNALVMLAPYG---------NSYRRLV----LDMG-CPVNNAWGY---- 339 (461) Q Consensus 278 d~~~g~n~f~~~~~~ls~~~~~fiaGil~h~~al~a~~~Ptv---------NSYkRl~----p~~~-ap~~~~WG~---- 339 (461) + .+.+-+-|-+++.++|-|.||.|.+. -|||+-. |..- .+..-+|-- T Consensus 138 s---------------~~~~~~vl~rl~~~lP~LlALSAsSPfw~G~DTgyAS~R~~~~~~~P~~G~p~~f~~w~ay~~~ 202 (297) T TIGR02050 138 S---------------PDDAVAVLNRLLPWLPHLLALSASSPFWQGRDTGYASYRANIFQEWPTAGLPPAFQSWDAYEAY 202 (297) T ss_pred C---------------HHHHHHHHHHHHHHHHHHHHHHHHCHHHCCCCCHHHHHHHHHHHHCCCCCCCHHHCCHHHHHHH T ss_conf 6---------------4358899865666789999998615111582523778899999846667874222058999999 Q ss_pred ---CCCCEE----------EEECCCCCCCCEEEEECCCCCCCHH--HHHHHHHHHHHHHHHH---CCCCCCCCCC Q ss_conf ---345003----------6502238865203530358779989--9999999999998984---6888743345 Q gi|254780503|r 340 ---DNRTTA----------FRIPFSDAKTKRIENRLPSSDTNPY--LVLAASLACGLLGILQ---KIEPTPPTMK 396 (461) Q Consensus 340 ---~NR~~~----------iRip~~~~~~~riE~R~~d~~aNPY--L~laail~agl~Gi~~---~l~p~~~~~~ 396 (461) -.+|.+ ||. +++-.=||+|++|+..|+= +++||++-|=..-..+ ..+-|.+... T Consensus 203 ~a~l~~tG~i~~~g~l~wDiRP---s~~~gTvEvRv~D~~~~l~~~~~~aaL~rALv~~~~~rYD~~~~p~~~~~ 274 (297) T TIGR02050 203 VADLRETGVIEDDGDLYWDIRP---SSHYGTVEVRVADACLNLEEALALAALIRALVEELLRRYDEWDAPFPVPE 274 (297) T ss_pred HHHHHHCCCCCCCCCEEEEEEE---CCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 9999854892478866883450---69988268871489988999999999999999999975151227799876 No 12 >PRK13515 carboxylate-amine ligase; Provisional Probab=98.51 E-value=3.7e-06 Score=60.39 Aligned_cols=200 Identities=15% Similarity=0.147 Sum_probs=122.4 Q ss_pred CEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCE Q ss_conf 40232244179982167888877787788887766787777665442899999899767863110221010127864201 Q gi|254780503|r 144 KPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEI 223 (461) Q Consensus 144 ~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei 223 (461) .+.+|+|.||+|+|.....-. ....+.+.+.... . .-+.+++|.-..|.|+ T Consensus 5 ~~TlGVEeE~~LvD~~t~~l~------------------------~~a~~~l~~~~~~---~--~~~~v~~El~~~qiEi 55 (381) T PRK13515 5 EFTFGIEEEYLLVDAETLDLA------------------------RSAPAALMEACRD---A--LGEQFAPEMLQSQIEV 55 (381) T ss_pred CCCCEEEEEEEEECCCCCCCC------------------------CCHHHHHHHHHHH---H--CCCCCCHHHHHCEEEE T ss_conf 987452344666758865667------------------------5238999985333---1--2566675666237994 Q ss_pred EECCC-CCHHHHHHHHHHHHHHHHHHHHHCCCEEECC-CCCCC-------------------------CCCHHHHHHHHH Q ss_conf 11266-5211378999999999999863066033112-32113-------------------------566268999876 Q gi|254780503|r 224 NIRHG-DPLKLADQVFLFKRTIREAAFKHDIYATFMA-KPMQN-------------------------YPGSAMHIHQSV 276 (461) Q Consensus 224 ~~~~~-~~l~aaD~~~~~k~~ik~vA~~~G~~atFmp-KP~~~-------------------------~~GsG~H~h~SL 276 (461) +..++ +.-++.+++...|..+.++|.++|+...--. .||.. ..=+|||||+.+ T Consensus 56 ~T~v~~~~~e~~~~L~~~r~~l~~aA~~~g~~l~a~GthP~~~w~~~~~t~~~RY~~l~~~~~~~a~~~~i~G~HVHVgv 135 (381) T PRK13515 56 GTPVCATLAEAREELGRLRQRLAELAAQFGLRIIAAGTHPFADWRRQRPTPKERYAQLFEDLGGVARRSLICGMHVHVGV 135 (381) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHEEEEEEECC T ss_conf 68777999999999999999999999986988983078988887668789607899999999999986432423898368 Q ss_pred HHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC---------CHHHHHHH----CCCC-CCCCCCCCCCCC Q ss_conf 313687423358865521566654577764223212220766---------10134420----4655-754334243345 Q gi|254780503|r 277 LDIKSNQNIFSNPDGSETDNFRYFIGGLQKYIPNALVMLAPY---------GNSYRRLV----LDMG-CPVNNAWGYDNR 342 (461) Q Consensus 277 ~d~~~g~n~f~~~~~~ls~~~~~fiaGil~h~~al~a~~~Pt---------vNSYkRl~----p~~~-ap~~~~WG~~NR 342 (461) -| .+.+-.-+..|.-++|.|.|+.+.+ ..|||..+ |..- .|.+-+|..=.+ T Consensus 136 ~d---------------~e~av~v~n~lr~~LP~LLALSAsSPFw~G~DTG~aS~R~~~~~~~P~aG~P~~f~s~~~y~~ 200 (381) T PRK13515 136 PD---------------SEDRMALMNQVLYFLPHLLALSTSSPFWGGRDTGLKSYRRAIFGELPRTGLPPLFPSWAEYQR 200 (381) T ss_pred CC---------------HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCHHHHHH T ss_conf 98---------------899999999999888999999708931047877644168768864888899977689999999 Q ss_pred -------------CEEEEE-CCCCCCCCEEEEECCCCCCCHH--HHHHHHHHHHHHHHHHC Q ss_conf -------------003650-2238865203530358779989--99999999999989846 Q gi|254780503|r 343 -------------TTAFRI-PFSDAKTKRIENRLPSSDTNPY--LVLAASLACGLLGILQK 387 (461) Q Consensus 343 -------------~~~iRi-p~~~~~~~riE~R~~d~~aNPY--L~laail~agl~Gi~~~ 387 (461) ..+|.- .-.+++-..+|+|++|...+.= +++||++.|-..-+-.. T Consensus 201 ~v~~l~~tG~i~d~~~i~WdvRPs~~~pTlEvRv~D~~~~~~~~~~iAal~raLv~~~~~~ 261 (381) T PRK13515 201 YVALLVRTGCIDDDGKIWWDLRPSPRFPTLELRICDVCPRLDDALALAALFRALVRHLYRL 261 (381) T ss_pred HHHHHHHCCCCCCCCEEEEEECCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999986599687880589825789999625750688799999999999999999999987 No 13 >PRK13518 carboxylate-amine ligase; Provisional Probab=98.40 E-value=7.1e-06 Score=58.51 Aligned_cols=195 Identities=18% Similarity=0.165 Sum_probs=118.7 Q ss_pred CEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCE Q ss_conf 40232244179982167888877787788887766787777665442899999899767863110221010127864201 Q gi|254780503|r 144 KPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEI 223 (461) Q Consensus 144 ~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei 223 (461) -..+|+|-||.|.+...... .+ ..+.+ ......+..-+.+++|.-.+|.|+ T Consensus 12 m~TlGVEEE~~LVD~~t~~~----------------------~~---~~~~v----~~~~~~~~~~~~v~~El~~~qvEi 62 (357) T PRK13518 12 MGTLGVEEEFFVVDEHGRPT----------------------AG---TDELV----YEGEPPEPLAGRLDHELFKFVIET 62 (357) T ss_pred CCCCEEEEEEEEECCCCCCC----------------------CC---CHHHH----HHCCCCCCCCCCCCHHHHCCEEEE T ss_conf 78641446788877997656----------------------67---46899----823555324565574665547994 Q ss_pred EECCC-CCHHHHHHHHHHHHHHHHHHHHHCCCEEECC-CCCCC-------------------------CCCHHHHHHHHH Q ss_conf 11266-5211378999999999999863066033112-32113-------------------------566268999876 Q gi|254780503|r 224 NIRHG-DPLKLADQVFLFKRTIREAAFKHDIYATFMA-KPMQN-------------------------YPGSAMHIHQSV 276 (461) Q Consensus 224 ~~~~~-~~l~aaD~~~~~k~~ik~vA~~~G~~atFmp-KP~~~-------------------------~~GsG~H~h~SL 276 (461) +..++ +.-++.+++...|..+.++|.++|+...=.. -||.. ..=+|||||+.+ T Consensus 63 ~T~vc~~~~e~~~~L~~~R~~l~~aA~~~G~~l~a~GThP~~~w~~~~it~~~RY~~l~~~~~~~ar~~li~G~HVHVGv 142 (357) T PRK13518 63 QTPTLNDVAAAPDALREVRDALVAHAEEHGFRIAAAGLHPGAKWRELEHAEKPRYRSQLDRIQYPQHRNTTAGLHIHVGV 142 (357) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHEEEEEEECC T ss_conf 58676999999999999999999999986988996268988887768889977899999998899998746103799617 Q ss_pred HHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC---------HHHHHHH----CCCC-CCCCCCCCCCCC Q ss_conf 3136874233588655215666545777642232122207661---------0134420----4655-754334243345 Q gi|254780503|r 277 LDIKSNQNIFSNPDGSETDNFRYFIGGLQKYIPNALVMLAPYG---------NSYRRLV----LDMG-CPVNNAWGYDNR 342 (461) Q Consensus 277 ~d~~~g~n~f~~~~~~ls~~~~~fiaGil~h~~al~a~~~Ptv---------NSYkRl~----p~~~-ap~~~~WG~~NR 342 (461) -| -+.+-..+-.+..++|.|.||.+.+. .|||-.. |..- .|.+-+|..=.+ T Consensus 143 ~d---------------~e~av~v~n~lr~~LP~LlALSAnSPFw~G~DTG~aS~R~~~~~~~P~aG~P~~f~s~~~y~~ 207 (357) T PRK13518 143 DD---------------PDKAVWVANELRWHMPVLLALSANSPYWNGFDTGLASARAKIFEALPNTGMPTAFEDYEAFQA 207 (357) T ss_pred CC---------------HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHH T ss_conf 97---------------899999999888778999999717942048888751089999972764799954489999999 Q ss_pred C-------------EEEEE-CCCCCCCCEEEEECCCCCCCHH--HHHHHHHHHHHH Q ss_conf 0-------------03650-2238865203530358779989--999999999999 Q gi|254780503|r 343 T-------------TAFRI-PFSDAKTKRIENRLPSSDTNPY--LVLAASLACGLL 382 (461) Q Consensus 343 ~-------------~~iRi-p~~~~~~~riE~R~~d~~aNPY--L~laail~agl~ 382 (461) . .+|.= .-.+++-.-+|+|++|...+.= +++||++.|-.. T Consensus 208 ~v~~L~~tG~I~d~~~i~WDiRPs~~~pTvEvRV~D~~~~~~~~~~lAaL~raLv~ 263 (357) T PRK13518 208 FERRMVETGSIEDRGELWYDVRPHSGHGTVEVRAPDAQADPDVVLAFVEYAHALVT 263 (357) T ss_pred HHHHHHHCCCCCCCCCEEEEECCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHH T ss_conf 99999973982778845899442699997257657888998999999999999999 No 14 >pfam04107 GCS2 Glutamate-cysteine ligase family 2(GCS2). Also known as gamma-glutamylcysteine synthetase and gamma-ECS (EC:6.3.2.2). This enzyme catalyses the first and rate limiting step in de novo glutathione biosynthesis. Members of this family are found in archaea, bacteria and plants. May and Leaver discuss the possible evolutionary origins of glutamate-cysteine ligase enzymes in different organisms and suggest that it evolved independently in different eukaryotes, from an ancestral bacterial enzyme. They also state that Arabidopsis thaliana gamma-glutamylcysteine synthetase is structurally unrelated to mammalian, yeast and Escherichia coli homologues. In plants, there are separate cytosolic and chloroplast forms of the enzyme. Probab=98.10 E-value=3.6e-05 Score=53.87 Aligned_cols=196 Identities=15% Similarity=0.054 Sum_probs=109.4 Q ss_pred EECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCEEE Q ss_conf 23224417998216788887778778888776678777766544289999989976786311022101012786420111 Q gi|254780503|r 146 IIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEINI 225 (461) Q Consensus 146 ~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~ 225 (461) .+|+|.||+|+|.......+ ....+.......+ ..-..+.+|--.||.|++. T Consensus 2 t~GvEeE~~lvD~~t~~l~~------------------------~~~~~~~~~~~~~----~~~~~~~~e~~~~qiEist 53 (291) T pfam04107 2 TLGVEEEFGVVDLLGGDLRG------------------------WSPILEDEAKIGL----SLGGGFVKELPGGQVELST 53 (291) T ss_pred CEEEEEEEEEECCCCCCCCC------------------------CCHHHHHHHCCCC----CCCCCEEECCCCCEEEECC T ss_conf 63778878777589878788------------------------6678874200255----6689653326898899714 Q ss_pred CCC-CCHHHHHHHHHHHHHHHHHHHHHCCCEE--------------ECCCCCC-------C---CCC-----HHHHHHHH Q ss_conf 266-5211378999999999999863066033--------------1123211-------3---566-----26899987 Q gi|254780503|r 226 RHG-DPLKLADQVFLFKRTIREAAFKHDIYAT--------------FMAKPMQ-------N---YPG-----SAMHIHQS 275 (461) Q Consensus 226 ~~~-~~l~aaD~~~~~k~~ik~vA~~~G~~at--------------FmpKP~~-------~---~~G-----sG~H~h~S 275 (461) .++ +.-++++.+..++..++++|.++|+... ++|||-- . ..| +|||+|++ T Consensus 54 ~p~~~~~e~~~~l~~~~~~l~~~a~~~g~~l~~~Gt~P~~~~~~~~~~~~~RY~~m~~~~~~~g~~~~~mm~~g~hVhV~ 133 (291) T pfam04107 54 PPLESLAEAAEEISQHREELRHVADELGLGLLGLGTHPFALRSRDPVMPKGRYRRMYEYMPRVGVYGRQMMVAGCHVQVN 133 (291) T ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCEEEEEEC T ss_conf 22189999999999999999999998298068227787788754776888425799999987533789897722258854 Q ss_pred HHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC---------CHHHHHHH----CCC-CCCCCCCCCCCC Q ss_conf 6313687423358865521566654577764223212220766---------10134420----465-575433424334 Q gi|254780503|r 276 VLDIKSNQNIFSNPDGSETDNFRYFIGGLQKYIPNALVMLAPY---------GNSYRRLV----LDM-GCPVNNAWGYDN 341 (461) Q Consensus 276 L~d~~~g~n~f~~~~~~ls~~~~~fiaGil~h~~al~a~~~Pt---------vNSYkRl~----p~~-~ap~~~~WG~~N 341 (461) +=+. + +..-.-+.-+..++|-+.|++|.+ ..|||-.. |.. -.|...+|+.=. T Consensus 134 vd~~---------~-----e~~~~~~~~~~~~~P~llALsAnSPf~~G~dtG~~S~R~~iw~~~p~~G~~p~~~~~~~~~ 199 (291) T pfam04107 134 IDSS---------S-----EAIMAVLRLVRALLPVLLALSANSPFWGGRDTGYASTRALIFTQTPQAGPLPLAFEDGAFE 199 (291) T ss_pred CCCC---------C-----HHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCHHHHHHHHHCCCCCCCCCCCCCHHHHH T ss_conf 7887---------5-----8899999999888899999972982335887641317789874089999999879838999 Q ss_pred -------CCEEEEEC--------CCCCCCCEEEEECCCCCCCHHHHHHHHHHHH-HHH Q ss_conf -------50036502--------2388652035303587799899999999999-998 Q gi|254780503|r 342 -------RTTAFRIP--------FSDAKTKRIENRLPSSDTNPYLVLAASLACG-LLG 383 (461) Q Consensus 342 -------R~~~iRip--------~~~~~~~riE~R~~d~~aNPYL~laail~ag-l~G 383 (461) ++-+|.+. .+....+.+|+|..|++.+|=..+++++-+. +|. T Consensus 200 ~yv~~ll~~~~i~~~~~~~~d~r~~~~~~~t~E~r~~ds~~f~~vr~~~~~e~r~~d~ 257 (291) T pfam04107 200 RYARYALDTGIIFVRRRLWWDGRPPGLPGETLELRIHDTTAFPPVRLRALLEARLLDA 257 (291) T ss_pred HHHHHHHCCCCEEECCCCEEECCCCCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHC T ss_conf 9999997189769669826831789899768899983046987999999999988741 No 15 >COG2170 Uncharacterized conserved protein [Function unknown] Probab=98.02 E-value=6.7e-05 Score=52.08 Aligned_cols=214 Identities=20% Similarity=0.286 Sum_probs=121.6 Q ss_pred CEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCE Q ss_conf 40232244179982167888877787788887766787777665442899999899767863110221010127864201 Q gi|254780503|r 144 KPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEI 223 (461) Q Consensus 144 ~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei 223 (461) .+..|+|.||||.+..+ |++.+. ...+++.+... +.-.-+.+|.--.+.|+ T Consensus 2 ~ltlGvE~E~~lvd~~g----------------------~dl~~~--s~~ii~~~~~~-----~~~~~~~~e~~e~~vE~ 52 (369) T COG2170 2 RLTLGVELEFQLVDPQG----------------------YDLVGS--SDAIIEALKGK-----VTAGHLKHEITESTVEL 52 (369) T ss_pred CCCCCEEEEEEECCCCC----------------------CCCCCC--CHHHHHHCCCC-----CCCCCHHHHHHHHHHCC T ss_conf 96653379999607777----------------------666454--17899853788-----88751168888774303 Q ss_pred EECCCC-CHHHHHHHHHHHHHHHHHHHHHCCCEEE-CCCCCCCC-----CCHHHHHHHHHHHCC---CCCCCCCCC---C Q ss_conf 112665-2113789999999999998630660331-12321135-----662689998763136---874233588---6 Q gi|254780503|r 224 NIRHGD-PLKLADQVFLFKRTIREAAFKHDIYATF-MAKPMQNY-----PGSAMHIHQSVLDIK---SNQNIFSNP---D 290 (461) Q Consensus 224 ~~~~~~-~l~aaD~~~~~k~~ik~vA~~~G~~atF-mpKP~~~~-----~GsG~H~h~SL~d~~---~g~n~f~~~---~ 290 (461) .-.+++ .-+|+-..--.|..+++.|..||+..+- -.-||.+. +-|. |.| ++.+.. +.+++-++- - T Consensus 53 ~t~vc~~~~eA~~~~r~~r~~l~q~a~d~gL~~~~~GtHPfadw~~~~~~~~p-rY~-~~ie~~~y~~~q~~v~G~HVHV 130 (369) T COG2170 53 ATGVCRLLAEAAAQLRALRDYLVQAASDHGLRICGGGTHPFADWRRQEVPDNP-RYQ-RLIERTGYLGRQMTVAGQHVHV 130 (369) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCCCCCCHHHHHHCCCCCCH-HHH-HHHHHHHHHHHHEEEEEEEEEE T ss_conf 40366659999999999999999974423803323678850255652478872-399-9999851677621564577883 Q ss_pred C-CCHHHHHHHHHHHHHHHHHHHHHCCCCC---------HHHHHHH----CCC-CCCCCCCCCC--------------CC Q ss_conf 5-5215666545777642232122207661---------0134420----465-5754334243--------------34 Q gi|254780503|r 291 G-SETDNFRYFIGGLQKYIPNALVMLAPYG---------NSYRRLV----LDM-GCPVNNAWGY--------------DN 341 (461) Q Consensus 291 ~-~ls~~~~~fiaGil~h~~al~a~~~Ptv---------NSYkRl~----p~~-~ap~~~~WG~--------------~N 341 (461) | ..++.+-..+-+++.++|-+.|+.|++. -|+|+.. |.. -.|.+-+|+. +| T Consensus 131 Gi~~~d~~~~~l~~l~~~~PhlLALSASSPf~~G~dTGyAS~R~~if~~~P~~g~pp~f~sw~~f~~~~~~~~~tG~I~~ 210 (369) T COG2170 131 GIPSPDDAMYLLHRLLRYVPHLLALSASSPFWQGTDTGYASARANIFSQLPTNGLPPAFQSWAAFEAFFRDQLETGTIDS 210 (369) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCC T ss_conf 68988999999998876505887650378531673231356557663207767999651688999999999887364144 Q ss_pred CC---EEEEECCCCCCCCEEEEECCCCCCCHH--HHHHHHHHHHHHHHHHCCCCC Q ss_conf 50---036502238865203530358779989--999999999999898468887 Q gi|254780503|r 342 RT---TAFRIPFSDAKTKRIENRLPSSDTNPY--LVLAASLACGLLGILQKIEPT 391 (461) Q Consensus 342 R~---~~iRip~~~~~~~riE~R~~d~~aNPY--L~laail~agl~Gi~~~l~p~ 391 (461) .. -.|| | ++.-.++|+|++|+-.||= ++++|++-|-+.-+....+.. T Consensus 211 ~~~lwwdIR-P--sph~gTlEvRi~D~~~~l~~~~aivaL~~Alv~~ll~~~d~~ 262 (369) T COG2170 211 MGDLWWDIR-P--SPHLGTLEVRICDTVLNLAELLAIVALIHALVVWLLRRRDAD 262 (369) T ss_pred CCCEEEECC-C--CCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 663378613-6--887782588844787999999999999999999996388999 No 16 >KOG0683 consensus Probab=97.55 E-value=0.00046 Score=46.59 Aligned_cols=243 Identities=17% Similarity=0.130 Sum_probs=118.0 Q ss_pred CCEEEEEECCCCEECCCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHC-CCC---CEEECCCCCEEEEECCCCC Q ss_conf 778999600124538788877099997403357521234257778999998610-276---4023224417998216788 Q gi|254780503|r 87 GDLRLSPDLSTLSIVPWEMDPTAQIVCDVVDYNGKEVTYTPRNVLKRVLNFYAQ-NEL---KPIIAPEIEFYLIAKNEDP 162 (461) Q Consensus 87 ~D~~l~PD~~Tl~~lPw~~~~~a~vl~~~~~~dG~p~~~~PR~iLkr~i~~l~~-~G~---~~~~g~E~EF~L~~~~~~~ 162 (461) +..+..=|-.++.-.|- +.+-++.|.-...++++....|+...++..-.+.. -|- -+..||+-+++++...-.. T Consensus 97 ~Nilv~c~~~~~~~~P~--~tn~R~~c~~~~~~~~~~~~~PWfg~Eqeyt~l~~~~~~p~gwp~~GFp~Pqgpyyc~VGa 174 (380) T KOG0683 97 NNILVMCDTYDFDGKPT--ETNKRVACARIMPKLSTKDTEPWFGMEQEYTLLDALDGHPFGWPKGGFPGPQGPYYCGVGA 174 (380) T ss_pred CCEEEEEECCCCCCCCC--CCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCEECCCC T ss_conf 97799943366789966--4221034788743455234587533567877650456884647766888887770000266 Q ss_pred CC--CCC--------------CCCCCC------CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCC Q ss_conf 88--777--------------877888------87766787777665442899999899767863110221010127864 Q gi|254780503|r 163 DY--PLQ--------------PPKGRS------GRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQ 220 (461) Q Consensus 163 ~~--~~~--------------~~~~~~------~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ 220 (461) .. ..+ ...+.. ..-..-+++..+..-|...-.+.-|.+..+..|+ +-+++.+.-+|. T Consensus 175 d~~~~rdiveahy~acLyaGl~i~G~N~EvmPgQwEfqvGp~~GI~~gD~lw~aR~il~rVae~~Gv-iasf~pKp~~g~ 253 (380) T KOG0683 175 DRVFGRDIVEAHYRACLYAGLNISGINVEVMPGQWEFQVGPCEGISMGDQLWMARYILHRVAEKFGV-IASFDPKPILGD 253 (380) T ss_pred CCCCCCHHHHHHHHHHHHHHHEECCCCCCCCCCEEEEEECCHHCCCCCHHHHHHHHHHHHHHHHHCE-EEEECCCCCCCC T ss_conf 3012412343068887740302336323134750488544500112200478999999999998491-687258887776 Q ss_pred CCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEE--CCCCCCCCCCHHHH-HHHHHHHCCCCCCCCCCCCCCCHHHH Q ss_conf 2011126652113789999999999998630660331--12321135662689-99876313687423358865521566 Q gi|254780503|r 221 FEINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATF--MAKPMQNYPGSAMH-IHQSVLDIKSNQNIFSNPDGSETDNF 297 (461) Q Consensus 221 ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atF--mpKP~~~~~GsG~H-~h~SL~d~~~g~n~f~~~~~~ls~~~ 297 (461) +--+=-|+. ...++.-..||+.+.. +||+ |--| .|++.||.++|+. + . T Consensus 254 WngaG~Htn------------~ST~~mr~~~g~~~i~~a~~~l------s~rh~~hi~~ydp~~G~d-----N------~ 304 (380) T KOG0683 254 WNGAGCHTN------------FSTKEMREAGGLKIIEEAIPKL------SKRHREHIAAYDPKGGKD-----N------E 304 (380) T ss_pred CCCCCCCCC------------CCHHHHHHCCCHHHHHHHHHHH------HHHHHHHHHHCCCCCCCC-----C------H T ss_conf 667652232------------0216666057789999986420------023044233128567863-----3------4 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCC--CCCCCCCCCCCCEEEEECCC--CCCCCEEEEECCCCCCCHHHHH Q ss_conf 6545777642232122207661013442046557--54334243345003650223--8865203530358779989999 Q gi|254780503|r 298 RYFIGGLQKYIPNALVMLAPYGNSYRRLVLDMGC--PVNNAWGYDNRTTAFRIPFS--DAKTKRIENRLPSSDTNPYLVL 373 (461) Q Consensus 298 ~~fiaGil~h~~al~a~~~PtvNSYkRl~p~~~a--p~~~~WG~~NR~~~iRip~~--~~~~~riE~R~~d~~aNPYL~l 373 (461) |...+ ++ +. -+..+||.-||.+.||||.. ......+|.|-|++.+.||++- T Consensus 305 rrltg------------------------~h-Et~~i~~Fs~GvAnr~~siri~r~va~~~~Gy~edrrP~sN~Dpy~Vt 359 (380) T KOG0683 305 RRLTG------------------------RH-ETGSIDNFSWGVANRNPSIRIPRTVAAEGKGYFEDRRPSSNCDPYAVT 359 (380) T ss_pred HHHCC------------------------CC-CCCCCCCCCCCCCCCCCEEEECHHHHCCCCCCCCCCCCCCCCCCCEEE T ss_conf 55318------------------------88-655522000103467861662131213665312458986788732010 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999998984 Q gi|254780503|r 374 AASLACGLLGILQ 386 (461) Q Consensus 374 aail~agl~Gi~~ 386 (461) .+++-.-|..... T Consensus 360 ~~~~~t~l~~~~~ 372 (380) T KOG0683 360 LMIIPTTLLEAVE 372 (380) T ss_pred HHHHHHHHHCCCC T ss_conf 7776677753420 No 17 >COG3572 GshA Gamma-glutamylcysteine synthetase [Coenzyme metabolism] Probab=95.71 E-value=0.097 Score=31.29 Aligned_cols=76 Identities=9% Similarity=0.134 Sum_probs=32.7 Q ss_pred HHHHHHHHHHHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCHHHHH-H Q ss_conf 998997678631102210101278642011126652113789999999999998630660331123211356626899-9 Q gi|254780503|r 195 IDDIWKFSEKQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMAKPMQNYPGSAMHI-H 273 (461) Q Consensus 195 l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmpKP~~~~~GsG~H~-h 273 (461) ++-+.++|...|+.=-.+|+-...-|.-+.+.+-.... |.+=-..+.+-=.+|-|---||.+.-=||.-. - T Consensus 156 Y~iM~~Ympkvg~~glDMm~rtctiQVNLD~sse~dm~--------rk~rvs~~Lqp~ataLFANSpFtegkpng~lS~R 227 (456) T COG3572 156 YAIMTRYMPKVGVKGLDMMTRTCTIQVNLDFSSETDMR--------RKMRVSLALQPLATALFANSPFTEGKPNGLLSWR 227 (456) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHEEEEEECCCCCHHHH--------HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHH T ss_conf 99999987652883155553101158752367303499--------9999986541899999706976579876411066 Q ss_pred HHHHH Q ss_conf 87631 Q gi|254780503|r 274 QSVLD 278 (461) Q Consensus 274 ~SL~d 278 (461) -.+|. T Consensus 228 ~~iW~ 232 (456) T COG3572 228 GDIWR 232 (456) T ss_pred HHHHH T ss_conf 44542 No 18 >TIGR03444 gshA_related glutamate--cysteine ligase family protein. Members of this bacterial protein family bear homology to glutamate--cysteine ligase, an enzyme in the two-step pathway of glutathione (GSH) biosynthesis, but are distinctly different. Among the bacterial genomes that carry the uncharacterized methyltransferase (TIGR03438) and conserved hypothetical protein TIGR03440, this protein is found in a subset, always in the vicinity of these other genes. Conserved hypothetical protein TIGR03442 is found in these same genomes. The role of this cassette is probably biosynthetic, but the product is unknown. Probab=91.42 E-value=0.83 Score=25.19 Aligned_cols=58 Identities=22% Similarity=0.166 Sum_probs=31.9 Q ss_pred CHHHHHHHCC-----CCCCCCCCCCCCCCCEE---EEECCCCCCCCEEEEECCCCCCCHH-HHHHHHHHHHHHH Q ss_conf 1013442046-----55754334243345003---6502238865203530358779989-9999999999998 Q gi|254780503|r 319 GNSYRRLVLD-----MGCPVNNAWGYDNRTTA---FRIPFSDAKTKRIENRLPSSDTNPY-LVLAASLACGLLG 383 (461) Q Consensus 319 vNSYkRl~p~-----~~ap~~~~WG~~NR~~~---iRip~~~~~~~riE~R~~d~~aNPY-L~laail~agl~G 383 (461) --||+-+..| ..-|+...|.++ =|+. ||+ .+.||.|..|+.--.+ .++.|++.+-|+. T Consensus 253 g~tFrdf~~G~~~~~g~~PT~~Dw~~H-LsTlFPeVRl------K~yLE~R~aDa~p~~~~~a~~A~~~gLlyD 319 (390) T TIGR03444 253 GVTFADWVAGRDPVIGRRPTRADLDYH-LTTLFPPVRP------RGWLELRYLDAQPGDGWPVPVATLTALLDD 319 (390) T ss_pred CCCHHHHHCCCCCCCCCCCCHHHHHHH-HHCCCCCCCC------CCEEEECCCCCCCCCCCCHHHHHHHHHHCC T ss_conf 977999966887778999998999987-5237986555------563475016778813000999999998428 No 19 >pfam08759 DUF1792 Domain of unknown function (DUF1792). This putative domain is probably missannotated as a glycosyl transferase 8 family member. This domain is found at the C-terminus of proteins that also contain the glycosyl transferase domain at the N-terminus. Probab=81.92 E-value=1.6 Score=23.35 Aligned_cols=83 Identities=16% Similarity=0.175 Sum_probs=52.8 Q ss_pred HHHHHHHHHH-HHHCCCEEECCCCCCCCC---CHHHHHHH--HHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999998-630660331123211356---62689998--76313687423358865521566654577764223212 Q gi|254780503|r 239 LFKRTIREAA-FKHDIYATFMAKPMQNYP---GSAMHIHQ--SVLDIKSNQNIFSNPDGSETDNFRYFIGGLQKYIPNAL 312 (461) Q Consensus 239 ~~k~~ik~vA-~~~G~~atFmpKP~~~~~---GsG~H~h~--SL~d~~~g~n~f~~~~~~ls~~~~~fiaGil~h~~al~ 312 (461) .++..++... ...=|..||+.+|+.+.. -|+-++|. +||. |..-++-.+....+..+ --|...+..+- T Consensus 69 ~~~~~~~k~l~~~~~Y~~tfiSRpY~d~~dKs~~~~~F~~lK~iW~--~rdIliVEG~~sR~GvG----NdLF~naksI~ 142 (225) T pfam08759 69 RNGDFIKKLLVPNRWYGSTFISRPYIDYKDKSKSARYFHKLKQIWQ--NRDILIVEGEKSRSGVG----NDLFDNAKSIK 142 (225) T ss_pred HHHHHHHHHHCCCCEECCEEECCCCCCCCCCHHHHHHHHHHHHHHC--CCEEEEEECCCCCCCCC----HHHHHCCCEEE T ss_conf 8689999974567746110423322121222576999999999847--99089992365114677----15563664078 Q ss_pred HHCCCCCHHHHHHHC Q ss_conf 220766101344204 Q gi|254780503|r 313 VMLAPYGNSYRRLVL 327 (461) Q Consensus 313 a~~~PtvNSYkRl~p 327 (461) =+.||+-|+|.++.. T Consensus 143 rIicPsknAy~~yd~ 157 (225) T pfam08759 143 RIICPSKNAYSKYDE 157 (225) T ss_pred EEECCCHHHHHHHHH T ss_conf 895787747999999 No 20 >KOG0558 consensus Probab=72.88 E-value=3.4 Score=21.16 Aligned_cols=57 Identities=16% Similarity=0.158 Sum_probs=41.8 Q ss_pred CCHHHHHHHHHHHHHHHH-HCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCC Q ss_conf 442899999899767863-1102210101278642011126652113789999999999998630660331123 Q gi|254780503|r 188 INEFDEIIDDIWKFSEKQ-GLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMAK 260 (461) Q Consensus 188 ~~~~~~~l~~i~~~l~~~-Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmpK 260 (461) ......|-+.+++.|..+ -||- |.|.|-+ .+|.++.+|+-.|++|.+.|...||||- T Consensus 245 t~plrGf~rAMvKtMt~alkiPH---------------F~y~dEI-n~~sLvklr~elk~~a~e~~IKltfmPf 302 (474) T KOG0558 245 TVPLRGFSRAMVKTMTEALKIPH---------------FGYVDEI-NCDSLVKLRQELKENAKERGIKLTFMPF 302 (474) T ss_pred EEECHHHHHHHHHHHHHHHCCCC---------------CCCCCCC-CHHHHHHHHHHHHHHHHHCCCEEEEHHH T ss_conf 12123478999999888745885---------------4424424-7688999999986666645732652378 No 21 >pfam06877 DUF1260 Protein of unknown function (DUF1260). This family consists of several hypothetical bacterial proteins of around 120 residues in length. The function of this family is unknown. Probab=65.58 E-value=10 Score=17.98 Aligned_cols=93 Identities=15% Similarity=0.107 Sum_probs=64.7 Q ss_pred HHHHHHHHHHHHCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 77789999986102764023224417998216788887778778888776678777766544289999989976786311 Q gi|254780503|r 128 RNVLKRVLNFYAQNELKPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGL 207 (461) Q Consensus 128 R~iLkr~i~~l~~~G~~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi 207 (461) ...-+++++.+.+.|..+-.-.++||++.-.+ +.-++.+...+.+.|. T Consensus 2 ~~~~~~vl~~L~e~G~dl~~~~~Ieh~~~f~~--------------------------------~~~~e~~a~~~~~~gy 49 (103) T pfam06877 2 DDETGEVLAALLEDGDDLSAPYEIEHHFYFED--------------------------------EDDLEKAAEEAFKLGY 49 (103) T ss_pred CCHHHHHHHHHHHCCCCCCCCEEEEEEEEECC--------------------------------HHHHHHHHHHHHHCCC T ss_conf 54179999999975998678877779998099--------------------------------9999999999997698 Q ss_pred CHHHHHC--CCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 0221010--127864201112665211378999999999999863066 Q gi|254780503|r 208 EIDTLIH--EEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHDI 253 (461) Q Consensus 208 ~ve~~~~--E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~ 253 (461) .|+.... |.+.+-|.+.+.....+...+=.-.. .-+-.+|.++|- T Consensus 50 ~v~~~~~~~ed~~~~~~~~~~~~~~~~~~~I~~~~-~~l~~lA~~~~g 96 (103) T pfam06877 50 EVEDAEELEEDGDGVWCLDVSREMTLDYEDIDAQE-EELEALAEKFGG 96 (103) T ss_pred EEEECCCCCCCCCCEEEEEEEEECCCCHHHHHHHH-HHHHHHHHHHCC T ss_conf 36664414668984189999995568989999999-999999999395 No 22 >PRK09444 pntB pyridine nucleotide transhydrogenase; Provisional Probab=56.68 E-value=15 Score=16.96 Aligned_cols=52 Identities=17% Similarity=0.288 Sum_probs=31.3 Q ss_pred CEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCC-CCHHHHHHHHHH Q ss_conf 01112665211378999999999999863066033112321135-662689998763 Q gi|254780503|r 222 EINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMAKPMQNY-PGSAMHIHQSVL 277 (461) Q Consensus 222 Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmpKP~~~~-~GsG~H~h~SL~ 277 (461) .+-+-|.-++..|-.....|++.+.. +++|..+.|-=.|..|. || |+|+=|- T Consensus 305 ~ViIVPGYGmAVAqAQh~v~el~~~L-~~~g~~V~faIHPVAGRMPG---HMNVLLA 357 (459) T PRK09444 305 SVIITPGYGMAVAQAQYPVAEITEKL-RARGINVRFGIHPVAGRLPG---HMNVLLA 357 (459) T ss_pred EEEEECCHHHHHHHHHHHHHHHHHHH-HHCCCEEEEEECCCCCCCCC---CCEEEEE T ss_conf 49996680899998889999999999-97798589983646788886---4013665 No 23 >TIGR01515 branching_enzym 1,4-alpha-glucan branching enzyme; InterPro: IPR006407 This entry represents the core region of the glycogen branching enzymes, which are responsible for the transfer of chains of approximately seven alpha(1,4)-linked glucosyl residues to other similar chains (in new alpha-(1,6) linkages) in the biosynthesis of glycogen . The branching enzyme is responsible for the degree of alpha(1,6) branch linkages found in polysaccharides .; GO: 0003844 14-alpha-glucan branching enzyme activity, 0005978 glycogen biosynthetic process. Probab=53.61 E-value=16 Score=16.64 Aligned_cols=107 Identities=18% Similarity=0.287 Sum_probs=72.8 Q ss_pred CCCCCCEEEEEECCCCEECCCCCCCEEEEEEECCCCCC-------------CCC-----------CCCHHHHHHHHHHHH Q ss_conf 78887789996001245387888770999974033575-------------212-----------342577789999986 Q gi|254780503|r 83 NERDGDLRLSPDLSTLSIVPWEMDPTAQIVCDVVDYNG-------------KEV-----------TYTPRNVLKRVLNFY 138 (461) Q Consensus 83 ~~~~~D~~l~PD~~Tl~~lPw~~~~~a~vl~~~~~~dG-------------~p~-----------~~~PR~iLkr~i~~l 138 (461) ...+|.++|+-||=.+..- -+|.||.++|++- .|- +|+ ...=|.+..+.+-+. T Consensus 85 ~~~~G~~~lkaDPyA~y~e--vRP~tASlv~~l~-~d~~W~~~r~~~~~~~~PvsvYElHLGSW~~lSY~eL~~~L~pYv 161 (608) T TIGR01515 85 VDANGEIRLKADPYAFYAE--VRPETASLVADLE-QDEEWLEKRKAKDPYEKPVSVYELHLGSWRKLSYRELADQLIPYV 161 (608) T ss_pred ECCCCCEEECCCCCCCCCC--CCCCCEEEEECCH-HHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHH T ss_conf 7689875643675301246--8987402552585-358999985210220088289984000054212788863101578 Q ss_pred HCCCCCEEECCCCCEEEEECCCCCCCCCCCC-CCCCCCCCCCCCCC-CCCCCCHHHHHHHHHHHHHHHHHCCH Q ss_conf 1027640232244179982167888877787-78888776678777-76654428999998997678631102 Q gi|254780503|r 139 AQNELKPIIAPEIEFYLIAKNEDPDYPLQPP-KGRSGRSILGGQSY-SIMGINEFDEIIDDIWKFSEKQGLEI 209 (461) Q Consensus 139 ~~~G~~~~~g~E~EF~L~~~~~~~~~~~~~~-~~~~~~~~~~~~~y-~~~~~~~~~~~l~~i~~~l~~~Gi~v 209 (461) +++||+ =+|- + .....| .+.-|+.. +++| -++|+..-+||+. +++.|.++||.| T Consensus 162 K~~GFT-----HiEL--L-------Pv~EHPFDGSWGYQ~--TGyYAPTsRfG~P~Df~y-fvD~~H~~giGV 217 (608) T TIGR01515 162 KELGFT-----HIEL--L-------PVAEHPFDGSWGYQV--TGYYAPTSRFGTPDDFRY-FVDACHQAGIGV 217 (608) T ss_pred HHCCCE-----EEEE--C-------CCCCCCCCCCCCCCC--CCCCCCCCCCCCHHHHHH-HHHHHHHCCCCE T ss_conf 750871-----2541--4-------664278878655357--755278888877456899-999998658957 No 24 >PRK11410 hypothetical protein; Provisional Probab=51.59 E-value=13 Score=17.40 Aligned_cols=30 Identities=23% Similarity=0.228 Sum_probs=18.2 Q ss_pred EEEEEEECCCCCCCC---CCCCHHHHHHHHHHHH Q ss_conf 099997403357521---2342577789999986 Q gi|254780503|r 108 TAQIVCDVVDYNGKE---VTYTPRNVLKRVLNFY 138 (461) Q Consensus 108 ~a~vl~~~~~~dG~p---~~~~PR~iLkr~i~~l 138 (461) .|.|..+ ...||.. +..--|+-|.|+++-+ T Consensus 128 PA~VaLW-Rg~dGrL~~fv~~m~R~GLAklLE~l 160 (570) T PRK11410 128 PAQVALW-RGADGRLSHFVVLMQRSGLAKLLEPL 160 (570) T ss_pred CCCEEEE-ECCCCCHHHEEEEEECCHHHHHHHHH T ss_conf 4003656-57999812448888412089998766 No 25 >cd01141 TroA_d Periplasmic binding protein TroA_d. These proteins are predicted to function as initial receptors in the ABC metal ion uptake in eubacteria and archaea. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains. Probab=49.05 E-value=19 Score=16.30 Aligned_cols=28 Identities=14% Similarity=0.223 Sum_probs=16.2 Q ss_pred CCCCCCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 32113566268999876313687423358 Q gi|254780503|r 260 KPMQNYPGSAMHIHQSVLDIKSNQNIFSN 288 (461) Q Consensus 260 KP~~~~~GsG~H~h~SL~d~~~g~n~f~~ 288 (461) ++..-..|++..++- |...-||+|+|.| T Consensus 159 ~~~~~~~g~~s~~~~-li~~AGG~Ni~~D 186 (186) T cd01141 159 KGLWYMPGGNSYVAK-MLRDAGGRYLSAE 186 (186) T ss_pred CCCEEECCCCCHHHH-HHHHCCCEEECCC T ss_conf 895377189878999-9998499572589 No 26 >TIGR01436 glu_cys_lig_pln glutamate--cysteine ligase; InterPro: IPR011556 These sequences represent one of two highly dissimilar forms of glutamate--cysteine ligase (gamma-glutamylcysteine synthetase), an enzyme of glutathione biosynthesis. The other group is represented by IPR006335 from INTERPRO. This form is found in plants (with a probable transit peptide), root nodule and other bacteria, but not Escherichia coli and closely related species. ; GO: 0004357 glutamate-cysteine ligase activity, 0006750 glutathione biosynthetic process, 0009507 chloroplast. Probab=47.98 E-value=20 Score=16.07 Aligned_cols=180 Identities=17% Similarity=0.153 Sum_probs=87.6 Q ss_pred HHHHHHHHHHHHCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 77789999986102764023224417998216788887778778888776678777766544289999989976786311 Q gi|254780503|r 128 RNVLKRVLNFYAQNELKPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGL 207 (461) Q Consensus 128 R~iLkr~i~~l~~~G~~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi 207 (461) |.-|-+.++.=.+..=+-++|+|.|=+.|++++-... .|.-.+ -.-++|+-|.+.-.-+-| T Consensus 5 rddL~~yl~aGckpK~qwRiGTEHEKfgf~k~~l~p~-----------------~Ye~~~--GI~~LLng~a~RfgWq~v 65 (446) T TIGR01436 5 RDDLVAYLEAGCKPKEQWRIGTEHEKFGFKKRDLRPI-----------------KYEEKA--GIAELLNGIAERFGWQKV 65 (446) T ss_pred HHHHHHHHHCCCCCHHCCCCCCCCCCCCEECCCCCCC-----------------CCCCCC--CHHHHHHHHHHHCCCCEE T ss_conf 8899999973788203102464000022231466798-----------------855555--678998887644067245 Q ss_pred ----CHHHHHCCCC------CCCCCEEECCC-CCHHHHHHHHHHHHHHHHHHHHHCCC---EEE-----------CCCC- Q ss_conf ----0221010127------86420111266-52113789999999999998630660---331-----------1232- Q gi|254780503|r 208 ----EIDTLIHEEG------PAQFEINIRHG-DPLKLADQVFLFKRTIREAAFKHDIY---ATF-----------MAKP- 261 (461) Q Consensus 208 ----~ve~~~~E~g------pGQ~Ei~~~~~-~~l~aaD~~~~~k~~ik~vA~~~G~~---atF-----------mpKP- 261 (461) .|-......+ .||||++=.|. ..=+++|.+--==+=+++||.+.|+- .=| |||= T Consensus 66 ~E~dkiIgL~~d~~sisLEPgGQfELSGAPl~TiHet~dEin~Hl~Q~~ev~~emg~gFlGlG~qPkwrr~DiP~MPKGR 145 (446) T TIGR01436 66 EEDDKIIGLKKDDASISLEPGGQFELSGAPLETIHETYDEINEHLYQVKEVAEEMGLGFLGLGLQPKWRREDIPLMPKGR 145 (446) T ss_pred EECCEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCCCCCCCCCCCCCCCCCC T ss_conf 44887888626881476468642101556134666328999889999999987616201035546665447888898752 Q ss_pred ---CCCC---CCH-HHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC------CCCHHHHHHHCC Q ss_conf ---1135---662-689998763136874233588655215666545777642232122207------661013442046 Q gi|254780503|r 262 ---MQNY---PGS-AMHIHQSVLDIKSNQNIFSNPDGSETDNFRYFIGGLQKYIPNALVMLA------PYGNSYRRLVLD 328 (461) Q Consensus 262 ---~~~~---~Gs-G~H~h~SL~d~~~g~n~f~~~~~~ls~~~~~fiaGil~h~~al~a~~~------PtvNSYkRl~p~ 328 (461) +.+. .|. |+-+-+--+.- .-|+=++.++.+=...|--+| .=|=.||||| -..|.|--+..+ T Consensus 146 Y~IMr~YmPkvG~LgLDMMlRTCTv--Q~N~DfsSEADMv~KFR~slA----LQPlatALFANSPFlEGkPNGFlSyRsh 219 (446) T TIGR01436 146 YKIMRDYMPKVGKLGLDMMLRTCTV--QANLDFSSEADMVKKFRVSLA----LQPLATALFANSPFLEGKPNGFLSYRSH 219 (446) T ss_pred HHHHHHCCCCCCCCHHHHHHHHHHH--HCCCCCCCHHHHHHHHHHHHH----HHHHHHHHHCCCCCCCCCCCCCCCCCCC T ss_conf 5888624787786224566545244--305663405778899877655----5689999853786668787875435535 Q ss_pred CCCC Q ss_conf 5575 Q gi|254780503|r 329 MGCP 332 (461) Q Consensus 329 ~~ap 332 (461) .|.- T Consensus 220 IWtd 223 (446) T TIGR01436 220 IWTD 223 (446) T ss_pred CCCC T ss_conf 4668 No 27 >TIGR02048 gshA_cyano glutamate--cysteine ligase, putative; InterPro: IPR011792 This family consists of proteins believed to be the glutamate--cysteine ligases of several cyanobacteria, which are known to make glutathione .. Probab=47.20 E-value=21 Score=16.00 Aligned_cols=269 Identities=17% Similarity=0.188 Sum_probs=131.1 Q ss_pred EEECCCCEECCCCCCCEEEEEEECC-CCCCCC-CCCCHHHHHHHHHHHHH-CC----CCCEEECCCCCEEEEECCCCCCC Q ss_conf 9600124538788877099997403-357521-23425777899999861-02----76402322441799821678888 Q gi|254780503|r 92 SPDLSTLSIVPWEMDPTAQIVCDVV-DYNGKE-VTYTPRNVLKRVLNFYA-QN----ELKPIIAPEIEFYLIAKNEDPDY 164 (461) Q Consensus 92 ~PD~~Tl~~lPw~~~~~a~vl~~~~-~~dG~p-~~~~PR~iLkr~i~~l~-~~----G~~~~~g~E~EF~L~~~~~~~~~ 164 (461) +-|++-|+.-|-. .......+.. ..|..- --.-||..|+.-++.+. +. |-.+-.|-.-.|| T Consensus 26 v~~l~GfvrePds--rnveytt~P~~~y~~~lC~~l~Pr~~lr~yl~~lG~d~tl~PGs~l~lG~~~~fy---------- 93 (377) T TIGR02048 26 VKDLDGFVREPDS--RNVEYTTPPLYSYDRLLCDLLRPRRRLRDYLKQLGEDYTLIPGSTLSLGDTDRFY---------- 93 (377) T ss_pred HHHCCCCEECCCC--CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCCC---------- T ss_conf 7433653107776--5322336347899999999863489999999972655031475300037765310---------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHH Q ss_conf 77787788887766787777665442899999899767863110221010127864201112665211378999999999 Q gi|254780503|r 165 PLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTI 244 (461) Q Consensus 165 ~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~i 244 (461) -..+.+.++.++++-+.. .+-.-++|-. -++.-...++.+..+| T Consensus 94 -------------------rsdP~nPyh~yieqty~t----~vvtas~hin-------------~G~~~~e~l~raCrl~ 137 (377) T TIGR02048 94 -------------------RSDPQNPYHTYIEQTYGT----KVVTASVHIN-------------IGIDDPEALMRACRLV 137 (377) T ss_pred -------------------CCCCCCHHHHHHHHHCCC----EEEEEEEEEE-------------CCCCCHHHHHHHHHHH T ss_conf -------------------158886246887642054----4888888886-------------0664278898663333 Q ss_pred HHHHHHHCCCEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC-CCC-CCHHHHHHHHHHHHHHHHHHHHHCCCCCHHH Q ss_conf 99986306603311232113566268999876313687423358-865-5215666545777642232122207661013 Q gi|254780503|r 245 REAAFKHDIYATFMAKPMQNYPGSAMHIHQSVLDIKSNQNIFSN-PDG-SETDNFRYFIGGLQKYIPNALVMLAPYGNSY 322 (461) Q Consensus 245 k~vA~~~G~~atFmpKP~~~~~GsG~H~h~SL~d~~~g~n~f~~-~~~-~ls~~~~~fiaGil~h~~al~a~~~PtvNSY 322 (461) |--|--+ .|--..-||.+..-.|-| -+-| .+|-. |.. ++-..--|||-=.-+.+. .-|...- T Consensus 138 r~eaPl~--lalsa~sPf~~G~~tGyh--s~rW------~~fP~tP~~vPlftsh~h~~~W~e~ql~------~Gtmqnv 201 (377) T TIGR02048 138 RLEAPLF--LALSASSPFLDGQVTGYH--STRW------AVFPKTPTEVPLFTSHAHFIEWTEEQLA------LGTMQNV 201 (377) T ss_pred HCCCHHH--HHHHHCCCCCCCCCCEEE--CCCC------CCCCCCCCCCCCCCCHHHHHHHHHHHHH------HHHHHHH T ss_conf 0111367--776521632266100011--1001------1047888757411020578888898873------1137666 Q ss_pred HHHHCCCCCCCCCCCCCCCCCEEEEECCCCC---CCCEEEEECCCCCCCHH--HHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 4420465575433424334500365022388---65203530358779989--999999999999898468887433453 Q gi|254780503|r 323 RRLVLDMGCPVNNAWGYDNRTTAFRIPFSDA---KTKRIENRLPSSDTNPY--LVLAASLACGLLGILQKIEPTPPTMKS 397 (461) Q Consensus 323 kRl~p~~~ap~~~~WG~~NR~~~iRip~~~~---~~~riE~R~~d~~aNPY--L~laail~agl~Gi~~~l~p~~~~~~~ 397 (461) | +-|+ ++| |.++. .-.|+|.|+.|--.||- |++.|+|-|-|.-+-..-++.|-..+. T Consensus 202 r----hlW~-------------~vr-PnGd~rP~~lnrlelriCdlvt~P~~llai~allearl~ql~~~P~ldPl~~~~ 263 (377) T TIGR02048 202 R----HLWS-------------AVR-PNGDRRPYDLNRLELRICDLVTDPISLLAITALLEARLLQLLEDPDLDPLRQSK 263 (377) T ss_pred H----HHHH-------------HCC-CCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHCC T ss_conf 7----6664-------------106-676778630114555565564044899999999999999997188867211104 Q ss_pred CCC-CH---------HCCCCCHHHHHHHHHCCHHH--HHHCCHHHHHHHHHHHHHHH Q ss_conf 000-31---------10478989999998659898--87355899999999999999 Q gi|254780503|r 398 ANQ-EA---------INLPRGLLEAVTLLENTSHF--KDIFGSQFIDMYANLKRNEF 442 (461) Q Consensus 398 ~~~-~~---------~~LP~sL~eAl~~l~~d~~l--~~~lg~~~v~~~~~~k~~E~ 442 (461) ... +. ..--.+|+..|...+....+ ++++.+-+-+.+...|.+-. T Consensus 264 ~~~~~l~~l~~~ne~~aa~~sl~~~l~hW~d~r~~~a~~W~~e~y~~v~~~ak~~Gf 320 (377) T TIGR02048 264 LSPEELVALADKNEAAAAKASLDAVLTHWRDGRAIIAREWIEELYQEVWAIAKKQGF 320 (377) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 688999999886778999887877776544356789999999999999999875485 No 28 >KOG0257 consensus Probab=46.85 E-value=17 Score=16.60 Aligned_cols=49 Identities=22% Similarity=0.259 Sum_probs=42.0 Q ss_pred EEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCCEEEEECC Q ss_conf 0999974033575212342577789999986102764023224417998216 Q gi|254780503|r 108 TAQIVCDVVDYNGKEVTYTPRNVLKRVLNFYAQNELKPIIAPEIEFYLIAKN 159 (461) Q Consensus 108 ~a~vl~~~~~~dG~p~~~~PR~iLkr~i~~l~~~G~~~~~g~E~EF~L~~~~ 159 (461) -+.++|+-.++.|+-+ +|..|.++.+-+++.|+-+.+---+|+++++.+ T Consensus 174 k~Ii~ntPhNPtGkvf---sReeLe~ia~l~~k~~~lvisDevYe~~v~d~~ 222 (420) T KOG0257 174 KAIILNTPHNPTGKVF---SREELERIAELCKKHGLLVISDEVYEWLVYDGN 222 (420) T ss_pred CEEEEECCCCCCCCCC---CHHHHHHHHHHHHHCCEEEEEHHHHHHHHHCCC T ss_conf 4799948999867305---799999999999878979998257478821787 No 29 >cd06184 flavohem_like_fad_nad_binding FAD_NAD(P)H binding domain of flavohemoglobin. Flavohemoglobins have a globin domain containing a B-type heme fused with a ferredoxin reductase-like FAD/NAD-binding domain. Flavohemoglobins detoxify nitric oxide (NO) via an NO dioxygenase reaction. The hemoglobin domain adopts a globin fold with an embedded heme molecule. Flavohemoglobins also have a C-terminal reductase domain with bindiing sites for FAD and NAD(P)H. This domain catalyzes the conversion of NO + O2 + NAD(P)H to NO3- + NAD(P)+. Instead of the oxygen transport function of hemoglobins, flavohemoglobins seem to act in NO dioxygenation and NO signalling. Probab=45.13 E-value=17 Score=16.47 Aligned_cols=28 Identities=14% Similarity=0.215 Sum_probs=11.8 Q ss_pred CEEECCCCCHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 0111266521137899999999999986306 Q gi|254780503|r 222 EINIRHGDPLKLADQVFLFKRTIREAAFKHD 252 (461) Q Consensus 222 Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G 252 (461) ++.+-|. .+..+.+ .|+.-+.+++.++. T Consensus 143 ~i~l~yg--~r~~~d~-~~~~eL~~l~~~~~ 170 (247) T cd06184 143 PVTFIHA--ARNSAVH-AFRDELEELAARLP 170 (247) T ss_pred CEEEEEE--CCCHHHH-HHHHHHHHHHHHCC T ss_conf 4899995--6997993-78999999997689 No 30 >pfam04468 PSP1 PSP1 C-terminal conserved region. This region is present in both eukaryotes and eubacteria. The yeast PSP1 protein is involved in suppressing mutations in the DNA polymerase alpha subunit in yeast. Probab=44.68 E-value=23 Score=15.75 Aligned_cols=59 Identities=12% Similarity=0.104 Sum_probs=47.9 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCE Q ss_conf 9999989976786311022101012786420111266521137899999999999986306603 Q gi|254780503|r 192 DEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHDIYA 255 (461) Q Consensus 192 ~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~a 255 (461) .+.+....+.....|++.+-+..|+-.+.--+++-|+ ||.-+=||.++|++++.++.+. T Consensus 25 ~~a~~~c~~~v~~~~L~Mkivd~e~~~D~~kl~fyy~-----a~~RVDFR~Lvk~L~~~f~~RI 83 (88) T pfam04468 25 EEALEICREKIREHGLPMKLVDVEYQFDRSKLTFYYT-----AEGRVDFRELVKDLAREFKTRI 83 (88) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEEEECCCCEEEEEEE-----CCCCCCHHHHHHHHHHHHCCEE T ss_conf 9999999999998299938999999469968999997-----6984238999999999849738 No 31 >COG1702 PhoH Phosphate starvation-inducible protein PhoH, predicted ATPase [Signal transduction mechanisms] Probab=42.99 E-value=24 Score=15.58 Aligned_cols=72 Identities=25% Similarity=0.341 Sum_probs=29.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHCCCCCHHHHHH-HHHC-CHHHHHHCCHHHHHHHH Q ss_conf 5877998999999999999989846888743345300031104789899999-9865-98988735589999999 Q gi|254780503|r 363 PSSDTNPYLVLAASLACGLLGILQKIEPTPPTMKSANQEAINLPRGLLEAVT-LLEN-TSHFKDIFGSQFIDMYA 435 (461) Q Consensus 363 ~d~~aNPYL~laail~agl~Gi~~~l~p~~~~~~~~~~~~~~LP~sL~eAl~-~l~~-d~~l~~~lg~~~v~~~~ 435 (461) |..+--|||+.|+-+-|-..|.-+++-+..| .-++.++.--||-+|.+-++ .|+- .+.+.+.+|.+.+.... T Consensus 151 pAGTGKTyLava~av~al~~~~v~rIiLtRP-aVEAGEklGfLPGdl~eKvdPylRPLyDAl~d~l~~~~~~~~~ 224 (348) T COG1702 151 PAGTGKTYLAVAKAVDALGAGQVRRIILTRP-AVEAGEKLGFLPGDLREKVDPYLRPLYDALYDILGAERVEALD 224 (348) T ss_pred CCCCCCCHHHHHHHHHHHHCCCHHEEECCCC-CHHCCCCCCCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 6456971112776652654064120100683-1303642370788234324611041789999986489773055 No 32 >PRK13289 nitric oxide dioxygenase; Provisional Probab=41.89 E-value=20 Score=16.14 Aligned_cols=29 Identities=21% Similarity=0.346 Sum_probs=16.2 Q ss_pred CCEEECCCCCHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 20111266521137899999999999986306 Q gi|254780503|r 221 FEINIRHGDPLKLADQVFLFKRTIREAAFKHD 252 (461) Q Consensus 221 ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G 252 (461) -++.|-|+. +..+. ..|+.-+++++.+|. T Consensus 290 r~V~liyga--R~~~~-~~F~~eL~~La~~~p 318 (399) T PRK13289 290 RPVHFIHAA--ENGDV-HAFRDEVRALAARHP 318 (399) T ss_pred CCEEEEEEC--CCHHH-HHHHHHHHHHHHHCC T ss_conf 857999956--98798-888999999997589 No 33 >pfam11679 DUF3275 Protein of unknown function (DUF3275). This family of proteins with unknown function appear to be restricted to Proteobacteria. Probab=41.15 E-value=21 Score=15.90 Aligned_cols=36 Identities=19% Similarity=0.214 Sum_probs=26.7 Q ss_pred EEEEEECCCCCCCCCCCCH---HHHHHHHHHHHHCCCCCE Q ss_conf 9999740335752123425---777899999861027640 Q gi|254780503|r 109 AQIVCDVVDYNGKEVTYTP---RNVLKRVLNFYAQNELKP 145 (461) Q Consensus 109 a~vl~~~~~~dG~p~~~~P---R~iLkr~i~~l~~~G~~~ 145 (461) +.+|+. +|+=|+++.+|| |..|+.|+.++.+.||.+ T Consensus 163 a~LFG~-lWplge~VKLD~TVDRr~LR~Q~~rLg~LGY~~ 201 (211) T pfam11679 163 AELFGE-LWPAGEPVKLDPTVDRSRLRQQCARLGQLGYEL 201 (211) T ss_pred HHHHHH-HCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 888751-112788301555112899999999986437111 No 34 >pfam09909 DUF2138 Uncharacterized protein conserved in bacteria (DUF2138). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=38.50 E-value=20 Score=16.13 Aligned_cols=30 Identities=27% Similarity=0.240 Sum_probs=16.7 Q ss_pred EEEEEEECCCCCCCC---CCCCHHHHHHHHHHHH Q ss_conf 099997403357521---2342577789999986 Q gi|254780503|r 108 TAQIVCDVVDYNGKE---VTYTPRNVLKRVLNFY 138 (461) Q Consensus 108 ~a~vl~~~~~~dG~p---~~~~PR~iLkr~i~~l 138 (461) .|.|..+ ...||.. +..--|+-|.|+++-+ T Consensus 117 PA~VaLW-R~~dGrL~~fv~~m~R~GLAklLE~l 149 (552) T pfam09909 117 PAQVALW-RGADGRLRHFVVSMQRNGLAKLLEPL 149 (552) T ss_pred CCCEEEE-ECCCCCHHHEEEEEECCCHHHHHHHH T ss_conf 3012556-57999812358888423089998766 No 35 >pfam03851 UvdE UV-endonuclease UvdE. Probab=37.57 E-value=29 Score=15.04 Aligned_cols=71 Identities=14% Similarity=0.041 Sum_probs=37.8 Q ss_pred HHHHHHHHHHHHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCH Q ss_conf 999899767863110221010127864201112665211378999999999999863066033112321135662 Q gi|254780503|r 194 IIDDIWKFSEKQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMAKPMQNYPGS 268 (461) Q Consensus 194 ~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmpKP~~~~~Gs 268 (461) -+..|.+.+.+-||.+--+.++.-|= .-.|.-...-.+.+.-.=.-|.+.|+++|.+.||.|-+|.-.+.. T Consensus 45 ~l~~il~~n~~~~I~~yRiSS~lfP~----~shp~~~~~~~~~~~~~l~~iG~~a~~~~~Rls~HPgqf~vL~S~ 115 (275) T pfam03851 45 NLIRLLHYNIAHGIRLYRLSSSLFPL----ASHPDVGWKYETFAAPELREAGELAKEHGHRLTFHPGQFTVLNSP 115 (275) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCCC----CCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCEECCCC T ss_conf 99999998887597669814020466----778423620489999999999999998295587489972216999 No 36 >TIGR01434 glu_cys_ligase glutamate--cysteine ligase; InterPro: IPR006334 These sequences represent glutamate--cysteine ligase, also known as gamma-glutamylcysteine synthetase, an enzyme in the biosynthesis of glutathione (GSH). GSH is one of several low molecular weight cysteine derivatives that can serve to protect against oxidative damage and participate in biosynthetic or detoxification reactions.; GO: 0004357 glutamate-cysteine ligase activity, 0006750 glutathione biosynthetic process. Probab=37.28 E-value=27 Score=15.27 Aligned_cols=16 Identities=25% Similarity=0.407 Sum_probs=9.5 Q ss_pred CEEEEECCCCCCCHHHHH Q ss_conf 203530358779989999 Q gi|254780503|r 356 KRIENRLPSSDTNPYLVL 373 (461) Q Consensus 356 ~riE~R~~d~~aNPYL~l 373 (461) -.||+|+-| .||.+=+ T Consensus 324 eYvEvRsLD--iNPFsPi 339 (518) T TIGR01434 324 EYVEVRSLD--INPFSPI 339 (518) T ss_pred EEEEEEEEC--CCCCCCC T ss_conf 089985313--6889876 No 37 >pfam02233 PNTB NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide transhydrogenase catalyses the reduction of NAD+ to NADPH. A complete loss of activity occurs upon mutation of Gly314 in E. coli. Probab=36.50 E-value=30 Score=14.93 Aligned_cols=53 Identities=15% Similarity=0.253 Sum_probs=35.9 Q ss_pred CEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCC-CCHHHHHHHHHHH Q ss_conf 01112665211378999999999999863066033112321135-6626899987631 Q gi|254780503|r 222 EINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMAKPMQNY-PGSAMHIHQSVLD 278 (461) Q Consensus 222 Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmpKP~~~~-~GsG~H~h~SL~d 278 (461) .+-+-|.-++..|-.....|++.+.. +++|..+.|-=.|..|. || |+|+=|-. T Consensus 310 ~ViIVPGYGmAvAqAQ~~v~el~~~L-~~~g~~V~faIHPVAGRMPG---HMNVLLAE 363 (464) T pfam02233 310 SVIIVPGYGMAVAQAQHPVADLAKLL-EERGVNVRFAIHPVAGRMPG---HMNVLLAE 363 (464) T ss_pred EEEEECCCHHHHHHHHHHHHHHHHHH-HHCCCEEEEEECCCCCCCCC---CHHHHHHH T ss_conf 39997780899998789999999999-97898699985766888873---05552231 No 38 >PHA01807 hypothetical protein Probab=35.61 E-value=22 Score=15.87 Aligned_cols=42 Identities=19% Similarity=0.419 Sum_probs=36.1 Q ss_pred CHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCEEECCCCC Q ss_conf 428999998997678631102210101278642011126652 Q gi|254780503|r 189 NEFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEINIRHGDP 230 (461) Q Consensus 189 ~~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~ 230 (461) .....|++.+.+.....+|++-.+.|-.|+|-|+|++.-..| T Consensus 94 Gv~~~f~R~l~~lA~~~n~p~va~tHR~g~grY~i~Yrr~kp 135 (145) T PHA01807 94 GVAREFIRELHRLAGWGNIPLVAWTHREGDGRYTIHYRRAKP 135 (145) T ss_pred CCCHHHHHHHHHHHHCCCCCEEEEEEECCCCEEEEEEEECCC T ss_conf 424999999999750368725898740388457888765177 No 39 >pfam03484 B5 tRNA synthetase B5 domain. This domain is found in phenylalanine-tRNA synthetase beta subunits. Probab=33.99 E-value=33 Score=14.67 Aligned_cols=47 Identities=19% Similarity=0.282 Sum_probs=27.9 Q ss_pred HHHHHHHHHHHHCCHHHHHCCCCCCCCCEEECC--CCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 998997678631102210101278642011126--65211378999999999999863066 Q gi|254780503|r 195 IDDIWKFSEKQGLEIDTLIHEEGPAQFEINIRH--GDPLKLADQVFLFKRTIREAAFKHDI 253 (461) Q Consensus 195 l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~--~~~l~aaD~~~~~k~~ik~vA~~~G~ 253 (461) .+++.+.|..+|+.++. ..+.+.+.... .|....+| ++-+||+-+|+ T Consensus 21 ~~~i~~~L~~lg~~~~~-----~~~~~~v~vPs~R~Di~~~~D-------liEEi~R~~Gy 69 (69) T pfam03484 21 PEEIKKILKRLGFKVEE-----DEDTLKVTVPSYRPDILHEVD-------LIEEVARLYGY 69 (69) T ss_pred HHHHHHHHHHCCCEEEE-----CCCEEEEECCCCCCCCCCCCH-------HHHHHHHHHCC T ss_conf 99999999986998994-----798699989998576686256-------99999998089 No 40 >pfam00763 THF_DHG_CYH Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain. Probab=31.57 E-value=36 Score=14.41 Aligned_cols=97 Identities=18% Similarity=0.248 Sum_probs=60.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 57521234257778999998610276402322441799821678888777877888877667877776654428999998 Q gi|254780503|r 118 YNGKEVTYTPRNVLKRVLNFYAQNELKPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDD 197 (461) Q Consensus 118 ~dG~p~~~~PR~iLkr~i~~l~~~G~~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~ 197 (461) .||+++...-+.-++..++.+++.|..|..+. .+.. +++ ...-|.+. T Consensus 3 LdGk~vA~~i~~~l~~~i~~l~~~g~~P~Lai----i~vg--~~~---------------------------~s~~Yv~~ 49 (117) T pfam00763 3 LDGKALAKKIREELKEEVAKLKEKGVTPKLAV----ILVG--DDP---------------------------ASQVYVRS 49 (117) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCCCEEEE----EEEC--CCH---------------------------HHHHHHHH T ss_conf 32699999999999999999985799971899----9848--975---------------------------67999999 Q ss_pred HHHHHHHHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCC Q ss_conf 9976786311022101012786420111266521137899999999999986306603311232113 Q gi|254780503|r 198 IWKFSEKQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMAKPMQN 264 (461) Q Consensus 198 i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmpKP~~~ 264 (461) ..+.|+..||..+.++=.....| --+...++++......+.-+.-.|.+. T Consensus 50 k~k~~~~~Gi~~~~~~l~~~~s~-----------------~~l~~~I~~lN~D~~V~GIlvqlPLP~ 99 (117) T pfam00763 50 KRKAAEELGIESELIRLPEDITE-----------------EELLALIEKLNADPSVHGILVQLPLPK 99 (117) T ss_pred HHHHHHHCCCEEEEEECCCCCCH-----------------HHHHHHHHHHHCCCCCCEEEEECCCCC T ss_conf 99999972871699877763349-----------------999999999967887887988388988 No 41 >PRK05996 motB flagellar motor protein MotB; Validated Probab=29.72 E-value=39 Score=14.21 Aligned_cols=46 Identities=17% Similarity=0.233 Sum_probs=29.7 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCHHHHH---HHHHHHHHHHHHHHHHCC Q ss_conf 2689998763136874233588655215666---545777642232122207 Q gi|254780503|r 268 SAMHIHQSVLDIKSNQNIFSNPDGSETDNFR---YFIGGLQKYIPNALVMLA 316 (461) Q Consensus 268 sG~H~h~SL~d~~~g~n~f~~~~~~ls~~~~---~fiaGil~h~~al~a~~~ 316 (461) -| +-++|.| +.++..|..+...+....+ .-||.+|...|.-..|.- T Consensus 311 eG--LrIqI~D-~~~~~MF~~GSA~p~p~~~~lL~~Ia~vL~~~pn~I~I~G 359 (431) T PRK05996 311 GG--LLISLTD-QVDYGMFAIGSAVPRAELVLAMEKIGQLLAEQPGAVIIRG 359 (431) T ss_pred CE--EEEEEEE-CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 92--7999985-8999884779775688999999999999963898179987 No 42 >TIGR01663 PNK-3'Pase polynucleotide kinase 3'-phosphatase; InterPro: IPR006550 These sequences represent the metazoan 5'-polynucleotide-kinase-3'-phosphatase, PNKP, which is believed to be involved in repair of oxidative DNA damage. Removal of 3'-phosphates is essential for the further processing of the break by DNA polymerases , . The central phosphatase domain is a member of the IIIA subfamily (IPR006549 from INTERPRO) of the haloacid dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. As is common in this superfamily, the enzyme is magnesium dependent. A difference between this enzyme and other HAD-superfamily phosphatases is in the third conserved catalytic motif which usually contains two conserved aspartate residues believed to be involved in binding the magnesium ion. Here, the second aspartate is replaced by a conserved arginine residue which may indicate an interaction with the phosphate backbone of the substrate. Very close relatives of this domain are also found separate from the N- and C-terminal domains seen here, as in the 3'-phosphatase found in plants. The larger family of these domains is described by IPR006551 from INTERPRO. Outside of the phosphatase domain is a P-loop ATP-binding motif associated with the kinase activity. The entry for the mouse homolog appears to be missing a large piece of sequence corresponding to the first conserved catalytic motif of the phosphatase domain as well as the conserved threonine of the second motif. Either this is a sequencing artifact or this may represent a pseudo- or non-functional gene. . Probab=29.03 E-value=40 Score=14.13 Aligned_cols=48 Identities=23% Similarity=0.317 Sum_probs=25.8 Q ss_pred HHHHHCCCCEEEEEEECCCCCEEEEEEEHHHHHHCC-----CCCCCCCEEECCCCC Q ss_conf 999717983999999788983312676198873015-----677361000002427 Q gi|254780503|r 22 KWLKDNRIEDVECITPDLAGIPRGKMMPSSKFISNI-----PLTVPSAIYRHTISG 72 (461) Q Consensus 22 ~~l~~~~I~~V~v~~~Dl~G~~RgK~vp~~~f~~~~-----~~g~~~~~~~~~~~g 72 (461) +-|.+.|-|. +.||..-|+.|||+ ++++|.-++ ..|+|..++...-+| T Consensus 207 ~~L~~~GyK~--~~FTNQ~~I~RGK~-~~~~FK~K~~~~V~KLGVP~QVl~A~~~G 259 (526) T TIGR01663 207 KELEKDGYKL--VLFTNQGSIARGKL-NLDDFKVKVKAVVEKLGVPIQVLVAIGDG 259 (526) T ss_pred HHHHHCCCEE--EEEECCCCEECCCC-CHHHHHHHHHHHHHHCCCCEEEEEECCCC T ss_conf 8874188279--99845773001555-65676668888898618976887650687 No 43 >COG3364 Zn-ribbon containing protein [General function prediction only] Probab=28.45 E-value=16 Score=16.65 Aligned_cols=27 Identities=30% Similarity=0.391 Sum_probs=19.4 Q ss_pred HHHHHHHCCHHHHHCCCCCCCCCEEECC Q ss_conf 7678631102210101278642011126 Q gi|254780503|r 200 KFSEKQGLEIDTLIHEEGPAQFEINIRH 227 (461) Q Consensus 200 ~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~ 227 (461) +...++|-+||++.-+ +|||||||+.- T Consensus 52 de~~a~ge~ietIrI~-~pG~YeiNl~~ 78 (112) T COG3364 52 DEDGAQGEPIETIRIL-RPGVYEINLES 78 (112) T ss_pred CHHHCCCCCCEEEEEE-CCCEEEEEHHH T ss_conf 1122036750489994-49648974456 No 44 >TIGR02812 fadR_gamma fatty acid metabolism transcriptional regulator FadR; InterPro: IPR014178 Members of this entry are transcriptional regulators of fatty acid metabolism (FadR), including both biosynthesis and beta-oxidation reactions, which are found exclusively in a subset of Gammaproteobacteria with strictly one copy per genome. Proteins have an N-terminal DNA-binding domain and a less well conserved C-terminal long chain acyl-CoA-binding domain. FadR when expressed in Escherichia coli shows differences in regulatory response and fatty acid binding profiles. All members of this entry are expected to have the same function.; GO: 0000062 acyl-CoA binding, 0003677 DNA binding, 0003700 transcription factor activity, 0019217 regulation of fatty acid metabolic process. Probab=27.84 E-value=29 Score=15.07 Aligned_cols=24 Identities=21% Similarity=0.206 Sum_probs=19.1 Q ss_pred CCCCHHHHHHHHHHHHHCCCC-CEE Q ss_conf 234257778999998610276-402 Q gi|254780503|r 123 VTYTPRNVLKRVLNFYAQNEL-KPI 146 (461) Q Consensus 123 ~~~~PR~iLkr~i~~l~~~G~-~~~ 146 (461) .-+=+|+.||-|++||+..|+ +.+ T Consensus 39 lIGVTRTTLREVLQRLARDGWLTIQ 63 (275) T TIGR02812 39 LIGVTRTTLREVLQRLARDGWLTIQ 63 (275) T ss_pred HCCCCCCHHHHHHHHHHHHCCCCCC T ss_conf 2376630378999887641134365 No 45 >COG1602 Uncharacterized conserved protein [Function unknown] Probab=27.77 E-value=42 Score=13.99 Aligned_cols=135 Identities=10% Similarity=0.035 Sum_probs=78.0 Q ss_pred HHHHHHHHHHHCCHHHHHCCCCCCCC----CEEECCC-CCHHHHHHHHHHHHHHHHHHH---------------HHCCCE Q ss_conf 98997678631102210101278642----0111266-521137899999999999986---------------306603 Q gi|254780503|r 196 DDIWKFSEKQGLEIDTLIHEEGPAQF----EINIRHG-DPLKLADQVFLFKRTIREAAF---------------KHDIYA 255 (461) Q Consensus 196 ~~i~~~l~~~Gi~ve~~~~E~gpGQ~----Ei~~~~~-~~l~aaD~~~~~k~~ik~vA~---------------~~G~~a 255 (461) .+.+-.|..-|++++++..-..-|-. +=-+-|+ -.++|+|..+ -+++.++|-. .+-+.+ T Consensus 183 ~eAv~~LY~~G~d~~~I~r~LS~G~lg~gk~RrlVPTRWSITAVD~~v-s~~L~~~i~~~p~Id~~~v~~~e~~gN~f~v 261 (402) T COG1602 183 EEAVVELYRRGVDVYYIQRALSVGLLGLGKRRRLVPTRWSITAVDDTV-SDYLAREIRDYPEIDKYEVYSGEYLGNHFLV 261 (402) T ss_pred HHHHHHHHHCCCCHHHHHHHHHCCHHHCCCCCCCCCCCCEEEEHHHHH-HHHHHHHHHCCCCCCCEEEEEEEECCCEEEE T ss_conf 999999998499899999986231320675664066630110068789-9999999843887672899977761868999 Q ss_pred EECCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC-----CCHHHHHHHHH---HHHHHHHHHHHHCCCCCHHHHHHHC Q ss_conf 311232113566268999876313687423358865-----52156665457---7764223212220766101344204 Q gi|254780503|r 256 TFMAKPMQNYPGSAMHIHQSVLDIKSNQNIFSNPDG-----SETDNFRYFIG---GLQKYIPNALVMLAPYGNSYRRLVL 327 (461) Q Consensus 256 tFmpKP~~~~~GsG~H~h~SL~d~~~g~n~f~~~~~-----~ls~~~~~fia---Gil~h~~al~a~~~PtvNSYkRl~p 327 (461) -.+|.|+.-..+-..|-| |+|+. +.--+..|.++ +-++++-.+-| ++|+|+..+-== .++-=.|...| T Consensus 262 IL~P~~~~~E~~E~w~pg-s~W~~-~~~~i~~d~E~~~Gr~~Y~~~gG~YyAARLaVlE~L~~~~Rq--a~aiv~REi~p 337 (402) T COG1602 262 ILLPRPWSFELVEAWEPG-SLWSP-GDLVISVDSEGYRGRKGYPDTGGAYYAARLAVLEHLSRIGRQ--ASAIVLREIYP 337 (402) T ss_pred EECCCCCCHHHHHHHCCC-CCCCC-CCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC--CCEEEEEEECC T ss_conf 990687631073464468-75678-885589842455576577666651267889999999984165--65699998157 Q ss_pred CCCCCCCC Q ss_conf 65575433 Q gi|254780503|r 328 DMGCPVNN 335 (461) Q Consensus 328 ~~~ap~~~ 335 (461) ++|||+-+ T Consensus 338 ~Y~aPvGv 345 (402) T COG1602 338 GYYAPVGV 345 (402) T ss_pred CCCCCEEE T ss_conf 75233033 No 46 >smart00874 B5 tRNA synthetase B5 domain. This domain is found in phenylalanine-tRNA synthetase beta subunits. Probab=27.75 E-value=42 Score=13.99 Aligned_cols=48 Identities=23% Similarity=0.275 Sum_probs=21.2 Q ss_pred HHHHHHHHHHHCCHHHHHCCCCCCCCCEEECC--CCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 98997678631102210101278642011126--65211378999999999999863066 Q gi|254780503|r 196 DDIWKFSEKQGLEIDTLIHEEGPAQFEINIRH--GDPLKLADQVFLFKRTIREAAFKHDI 253 (461) Q Consensus 196 ~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~--~~~l~aaD~~~~~k~~ik~vA~~~G~ 253 (461) +++.+.|..+|+.++. ....+++++...+ .|....+| ++-+||+-+|| T Consensus 22 ~~i~~~L~~lg~~~~~---~~~~~~~~v~~Ps~R~Di~~~~D-------liEEi~R~~Gy 71 (71) T smart00874 22 EEIEEILKRLGFEVEV---SGDDDTLEVTVPSYRFDILIEAD-------LIEEVARIYGY 71 (71) T ss_pred HHHHHHHHHCCCEEEE---CCCCCEEEEECCCCCCCCCCCCH-------HHHHHHHHHCC T ss_conf 9999999986998996---57897599978997065597346-------99988898079 No 47 >TIGR01793 cit_synth_euk citrate (Si)-synthase, eukaryotic; InterPro: IPR010109 Citrate synthase 2.3.3.1 from EC is a member of a small family of enzymes that can directly form a carbon-carbon bond without the presence of metal ion cofactors. It catalyses the first reaction in the Krebs' cycle, namely the conversion of oxaloacetate and acetyl-coenzyme A into citrate and coenzyme A. This reaction is important for energy generation and for carbon assimilation. The reaction proceeds via a non-covalently bound citryl-coenzyme A intermediate in a 2-step process (aldol-Claisen condensation followed by the hydrolysis of citryl-CoA). Citrate synthase enzymes are found in two distinct structural types: type I enzymes (found in eukaryotes, Gram-positive bacteria and archaea) form homodimers and have shorter sequences than type II enzymes, which are found in Gram-negative bacteria and are hexameric in structure. In both types, the monomer is composed of two domains: a large alpha-helical domain consisting of two structural repeats, where the second repeat is interrupted by a small alpha-helical domain. The cleft between these domains forms the active site, where both citrate and acetyl-coenzyme A bind. The enzyme undergoes a conformational change upon binding of the oxaloacetate ligand, whereby the active site cleft closes over in order to form the acetyl-CoA binding site . The energy required for domain closure comes from the interaction of the enzyme with the substrate. Type II enzymes possess an extra N-terminal beta-sheet domain, and some type II enzymes are allosterically inhibited by NADH . This entry includes both mitochondrial and peroxisomal forms of citrate synthase. Peroxisomal forms of the enzyme, recognized by the C-terminal targeting motif SKL, act in the glyoxylate cycle. Eukaryotic homologs include a Tetrahymena thermophila citrate synthase that doubles as a filament protein, a putative citrate synthase from Plasmodium falciparum (no TCA cycle), and a methylcitrate synthase from Aspergillus nidulans (Emericella nidulans).; GO: 0004108 citrate (Si)-synthase activity, 0044262 cellular carbohydrate metabolic process. Probab=27.43 E-value=15 Score=16.83 Aligned_cols=31 Identities=29% Similarity=0.236 Sum_probs=24.7 Q ss_pred HHHHHHHHHHCCCCEEEEEEEC-CCCCEEEEE Q ss_conf 8999999971798399999978-898331267 Q gi|254780503|r 17 WEQAAKWLKDNRIEDVECITPD-LAGIPRGKM 47 (461) Q Consensus 17 ~~~v~~~l~~~~I~~V~v~~~D-l~G~~RgK~ 47 (461) -+.|+++=+|||=+.|.=+-+| +-|-+||-. T Consensus 16 ~ekvKklr~EHG~~v~gn~t~D~~yGGmRg~k 47 (430) T TIGR01793 16 QEKVKKLRKEHGKKVVGNVTVDMVYGGMRGMK 47 (430) T ss_pred HHHHHHHHHHCCCCEEEEEEHHHHHCCCCCCC T ss_conf 66688787522560330121101104432241 No 48 >TIGR01647 ATPase-IIIA_H plasma-membrane proton-efflux P-type ATPase; InterPro: IPR006534 These sequences represent the plasma membrane proton efflux P-type ATPase found in plants, fungi, protozoa, slime molds and archaea. The best studied representative is from yeast . ; GO: 0016887 ATPase activity, 0015992 proton transport, 0016021 integral to membrane. Probab=26.97 E-value=43 Score=13.90 Aligned_cols=101 Identities=12% Similarity=0.074 Sum_probs=51.3 Q ss_pred CCCHHHHHHHHHHHHHCCCCCEE-ECCC--CCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 34257778999998610276402-3224--41799821678888777877888877667877776654428999998997 Q gi|254780503|r 124 TYTPRNVLKRVLNFYAQNELKPI-IAPE--IEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWK 200 (461) Q Consensus 124 ~~~PR~iLkr~i~~l~~~G~~~~-~g~E--~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~ 200 (461) .-.-|+.....++.|+++||+.+ |+.+ +|=+- ++......--.-. +.|+=.|-| =.+-++ T Consensus 434 ~~~v~~~Ve~~v~~lA~RG~RaLGVA~~evP~g~~----~~~~g~W~~~Gll--------plfDPPR~D-----T~eTI~ 496 (835) T TIGR01647 434 EEEVREKVEEKVEELASRGYRALGVARTEVPSGEK----ADEKGRWHFLGLL--------PLFDPPRHD-----TKETIE 496 (835) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCC----CCCCCCEEEEEEC--------CCCCCCCCC-----HHHHHH T ss_conf 88999888999999970787142143201778655----5788735888720--------027836987-----389999 Q ss_pred HHHHHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHC-CCEEECCCC Q ss_conf 6786311022101012786420111266521137899999999999986306-603311232 Q gi|254780503|r 201 FSEKQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHD-IYATFMAKP 261 (461) Q Consensus 201 ~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G-~~atFmpKP 261 (461) .+.++||.|..+ ..|++..+|+++|++-.--+ +.++=.+|- T Consensus 497 ~A~~~GV~VKMv--------------------TGDhlaIAKEtaR~LGlGtni~~~~~L~~~ 538 (835) T TIGR01647 497 RARELGVEVKMV--------------------TGDHLAIAKETARRLGLGTNIYNADVLLKK 538 (835) T ss_pred HHHHCCCEEEEE--------------------EHHHHHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 998769769897--------------------002678776653215798753474100467 No 49 >COG4326 Spo0M Sporulation control protein [General function prediction only] Probab=26.11 E-value=45 Score=13.80 Aligned_cols=40 Identities=13% Similarity=0.124 Sum_probs=32.6 Q ss_pred HHHHHHHHHHHHHHHHHCCHHHHHCCCCCCC-------CCEEECCCC Q ss_conf 2899999899767863110221010127864-------201112665 Q gi|254780503|r 190 EFDEIIDDIWKFSEKQGLEIDTLIHEEGPAQ-------FEINIRHGD 229 (461) Q Consensus 190 ~~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ-------~Ei~~~~~~ 229 (461) +.+++++.+..+++.+|+...+...|.+++- .|+.|.|+. T Consensus 152 r~hP~m~~vl~AiE~lGfrL~~vdCEqa~yF~~a~PFVQEfEFvPTt 198 (270) T COG4326 152 RPHPMMDGVLSAIEALGFRLRQVDCEQAKYFGGALPFVQEFEFVPTT 198 (270) T ss_pred ECCHHHHHHHHHHHHHCCEEEECCCCCCCCCCCCCCCEEEEEEECCC T ss_conf 36667988999998516373101313154335556423678872158 No 50 >PHA02105 hypothetical protein Probab=25.95 E-value=41 Score=14.04 Aligned_cols=39 Identities=15% Similarity=0.206 Sum_probs=27.2 Q ss_pred CCCCCCCCCHHHCCCHHHHHHHHHHCCCCEEEEEEECCCCCE Q ss_conf 877777789555899899999997179839999997889833 Q gi|254780503|r 2 NKNDQSPMNVRGVKNWEQAAKWLKDNRIEDVECITPDLAGIP 43 (461) Q Consensus 2 ~~~~~~~m~~~~~~~~~~v~~~l~~~~I~~V~v~~~Dl~G~~ 43 (461) |++.+.|.+.+ .+++++....-..|+||.+.|-..+.++ T Consensus 17 ndq~eyp~~~e---~~~ql~svfsipqi~yvyls~~e~~si~ 55 (68) T PHA02105 17 NDQNEYPVELE---LFDQLKTVFSIPQIKYVYLSYEEFNSIM 55 (68) T ss_pred CCCCCCCCCHH---HHHHHHHHCCCCCEEEEEEEHHHHCCCC T ss_conf 76544652078---8998887613553478997688851513 No 51 >TIGR02793 nikR nickel-responsive transcriptional regulator NikR; InterPro: IPR014160 The members of this entry from Escherichia coli, Pseudomonas putida, and Brucella melitensis, are found associated with a nickel ABC transporter operon that acts to import nickel for use as a cofactor in urease or hydrogenase. These proteins, with characterised nickel-binding and DNA-binding domains, act as nickel-responsive transcriptional regulators. In the larger family of full-length homologues, most others both lack proximity to the nickel ABC transporter operon and form a separate clade. Several of the homologues not in this entry have been shown to bind nickel, copper, or both, and to regulate genes in response to nickel.. Probab=25.90 E-value=13 Score=17.30 Aligned_cols=30 Identities=13% Similarity=0.325 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHCCCEEECCCCCCCCCCHHHHHHH Q ss_conf 9999999863066033112321135662689998 Q gi|254780503|r 241 KRTIREAAFKHDIYATFMAKPMQNYPGSAMHIHQ 274 (461) Q Consensus 241 k~~ik~vA~~~G~~atFmpKP~~~~~GsG~H~h~ 274 (461) -.-.|++++| ++-||.. --+.+-+.||||+ T Consensus 61 dHs~RdL~~r--L~~t~H~--HHdLs~atlHVHL 90 (130) T TIGR02793 61 DHSKRDLPRR--LTQTLHH--HHDLSRATLHVHL 90 (130) T ss_pred CCCCCHHHHH--HHHCCCC--CCHHHHHHHHEEC T ss_conf 1753113455--5101432--0011142120123 No 52 >COG0296 GlgB 1,4-alpha-glucan branching enzyme [Carbohydrate transport and metabolism] Probab=24.99 E-value=47 Score=13.67 Aligned_cols=109 Identities=17% Similarity=0.146 Sum_probs=59.5 Q ss_pred CCCCCEEEEEECCCCEECCCCCCCEEEEEEECC---CCCC--------CC-----------------CCCCHHHHHHHHH Q ss_conf 888778999600124538788877099997403---3575--------21-----------------2342577789999 Q gi|254780503|r 84 ERDGDLRLSPDLSTLSIVPWEMDPTAQIVCDVV---DYNG--------KE-----------------VTYTPRNVLKRVL 135 (461) Q Consensus 84 ~~~~D~~l~PD~~Tl~~lPw~~~~~a~vl~~~~---~~dG--------~p-----------------~~~~PR~iLkr~i 135 (461) ...|..+.+|||-+.+.-+ .+.++.++++.. +.|. .+ ....-+....+.+ T Consensus 94 ~~~g~~~~~~DP~a~~~~~--~p~~aS~v~~~~~y~W~d~~~~~~~~~~~~e~~vIYElHvGs~~~~~~~~~~e~a~~ll 171 (628) T COG0296 94 DPSGQLRLKADPYARRQEV--GPHTASQVVDLPDYEWQDERWDRAWRGRFWEPIVIYELHVGSFTPDRFLGYFELAIELL 171 (628) T ss_pred CCCCCEEECCCCHHHCCCC--CCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCHHHHHHHHH T ss_conf 8988644226831420588--99985066477773146544442346877787269999765035887767899999875 Q ss_pred HHHHCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHHHHHCCHH Q ss_conf 98610276402322441799821678888777877888877667877776-6544289999989976786311022 Q gi|254780503|r 136 NFYAQNELKPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSI-MGINEFDEIIDDIWKFSEKQGLEID 210 (461) Q Consensus 136 ~~l~~~G~~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~-~~~~~~~~~l~~i~~~l~~~Gi~ve 210 (461) .+++++|++. +|+.=+.+ .+.. ..-|+. ...+|.. ++...-++ ++.++++|.++||.|- T Consensus 172 pYl~elG~T~-----IELMPv~e-----~p~~---~sWGYq--~~g~yAp~sryGtPed-fk~fVD~aH~~GIgVi 231 (628) T COG0296 172 PYLKELGITH-----IELMPVAE-----HPGD---RSWGYQ--GTGYYAPTSRYGTPED-FKALVDAAHQAGIGVI 231 (628) T ss_pred HHHHHHCCCE-----EEECCCCC-----CCCC---CCCCCC--CCEECCCCCCCCCHHH-HHHHHHHHHHCCCEEE T ss_conf 8999709877-----99714435-----7988---887777--4300156556799899-9999999987699899 No 53 >PRK11920 rirA iron-responsive transcriptional regulator; Reviewed Probab=24.87 E-value=36 Score=14.45 Aligned_cols=103 Identities=17% Similarity=0.147 Sum_probs=62.7 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCC-CCCCCHH Q ss_conf 899999899767863110221010127864201112665211378999999999999863066033112321-1356626 Q gi|254780503|r 191 FDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMAKPM-QNYPGSA 269 (461) Q Consensus 191 ~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmpKP~-~~~~GsG 269 (461) -.+|+..|...|...|+ |++.-.- -|=|.++-.|.+ +....+++.+=-...+..||...+. .. ..+. T Consensus 37 s~~~l~kIl~~L~~~Gl-v~S~RG~--~GGy~Lar~p~~--------Isl~dIi~a~eg~~~~~~C~~~~~~~C~-~~~~ 104 (153) T PRK11920 37 SELFLFKILQPLVEAGL-VETVRGR--NGGVRLGKPAAD--------ISLFDVVRVTEDNFSMAECFENDASECP-LVDS 104 (153) T ss_pred CHHHHHHHHHHHHHCCC-EEECCCC--CCCEEECCCHHH--------CCHHHHHHHHCCCCCEEECCCCCCCCCC-CCCC T ss_conf 99999999999998898-7845889--886354489898--------9999999997489881236889999999-9998 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 899987631368742335886552156665457776422321222 Q gi|254780503|r 270 MHIHQSVLDIKSNQNIFSNPDGSETDNFRYFIGGLQKYIPNALVM 314 (461) Q Consensus 270 ~H~h~SL~d~~~g~n~f~~~~~~ls~~~~~fiaGil~h~~al~a~ 314 (461) |.+| ++|. + -+|.|.+ .....-|+-|++..|.+-.+ T Consensus 105 C~~~-~~w~-~-~~~~~~~------~L~~~TLaDLi~~~~~~r~l 140 (153) T PRK11920 105 CGLN-AALR-K-ALNAFFA------VLSKYSIADLVKARPNVRFL 140 (153) T ss_pred CCCH-HHHH-H-HHHHHHH------HHHCCCHHHHHCCCHHHHHH T ss_conf 7567-9999-9-9999999------99159199997488879987 No 54 >TIGR02630 xylose_isom_A xylose isomerase; InterPro: IPR013452 Xylose isomerase (5.3.1.5 from EC) is an enzyme which interconverts D-xylose and D-xylulose.; GO: 0009045 xylose isomerase activity. Probab=23.84 E-value=49 Score=13.52 Aligned_cols=41 Identities=15% Similarity=0.221 Sum_probs=33.2 Q ss_pred CCCCCHHHCCCHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEEE Q ss_conf 77789555899899999997179839999997889833126761 Q gi|254780503|r 6 QSPMNVRGVKNWEQAAKWLKDNRIEDVECITPDLAGIPRGKMMP 49 (461) Q Consensus 6 ~~~m~~~~~~~~~~v~~~l~~~~I~~V~v~~~Dl~G~~RgK~vp 49 (461) +.||+.-. ..++.+.+|.++=+|+| +||.|.+=.+=|+++. T Consensus 71 ~~~~~~A~-~kvDA~FEf~~KLgv~Y--yCFHD~D~aPEG~~L~ 111 (437) T TIGR02630 71 TDPMDKAK-AKVDAAFEFFEKLGVPY--YCFHDVDIAPEGASLK 111 (437) T ss_pred CCHHHHHH-HHHHHHHHHHHHCCCCE--EEECCCCCCCCCCCHH T ss_conf 44899998-87899999998718980--5544610257887789 No 55 >PRK10853 hypothetical protein; Provisional Probab=23.74 E-value=49 Score=13.54 Aligned_cols=36 Identities=22% Similarity=0.347 Sum_probs=24.0 Q ss_pred HHCCCHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEEEHHHHH Q ss_conf 5589989999999717983999999788983312676198873 Q gi|254780503|r 12 RGVKNWEQAAKWLKDNRIEDVECITPDLAGIPRGKMMPSSKFI 54 (461) Q Consensus 12 ~~~~~~~~v~~~l~~~~I~~V~v~~~Dl~G~~RgK~vp~~~f~ 54 (461) .+..+...+.+||++++|+| +|+|+ |-..++.+.+. T Consensus 8 knCdTckKA~kwL~~~~i~y---~f~D~----k~~~~~~~~l~ 43 (118) T PRK10853 8 KNCDTIKKARRWLEENGIDY---RFHDY----RVDGLDSELLN 43 (118) T ss_pred CCCHHHHHHHHHHHHCCCCC---EEEEC----CCCCCCHHHHH T ss_conf 99889999999999879981---89631----14698999999 No 56 >pfam09845 DUF2072 Zn-ribbon containing protein (DUF2072). This archaeal protein has no known function. Probab=23.72 E-value=25 Score=15.42 Aligned_cols=12 Identities=33% Similarity=0.866 Sum_probs=9.6 Q ss_pred CCCCCCCEEECC Q ss_conf 278642011126 Q gi|254780503|r 216 EGPAQFEINIRH 227 (461) Q Consensus 216 ~gpGQ~Ei~~~~ 227 (461) .+||+||+|+.- T Consensus 82 ~~pG~YElnL~~ 93 (121) T pfam09845 82 LGPGSYELNLER 93 (121) T ss_pred CCCCEEEECHHH T ss_conf 279726746899 No 57 >cd00674 LysRS_core_class_I This is the catalytic core domain of lysyl tRNA synthetase (LysRS). This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding. The class I LysRS is found only in archaea and some bacteria and has evolved separately from class II LysRS, as the two do not share structural or sequence similarity. Probab=23.52 E-value=50 Score=13.49 Aligned_cols=147 Identities=12% Similarity=0.044 Sum_probs=79.7 Q ss_pred EEEEEECCCCCCCCCCCCHHHHHHH--HHHHHHCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9999740335752123425777899--99986102764023224417998216788887778778888776678777766 Q gi|254780503|r 109 AQIVCDVVDYNGKEVTYTPRNVLKR--VLNFYAQNELKPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIM 186 (461) Q Consensus 109 a~vl~~~~~~dG~p~~~~PR~iLkr--~i~~l~~~G~~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~ 186 (461) -.|+.+=+.+.|.|+-+.-|.+++- +.+.|+++| .+.+|..+-.+-|+-..+-.........-.+.+...+. T Consensus 20 ~~v~~tG~~PSG~~HIGn~rEv~~~~~V~ral~~~g------~~~~~i~~~DD~D~lRKVp~~lpe~~~~ylg~PL~~IP 93 (354) T cd00674 20 KYVVASGISPSGLIHIGNFREVVTADAVRRALRDLG------FEVRLIYSWDDYDPLRKVPDNVPESLEQYIGMPLSSVP 93 (354) T ss_pred CEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC------CCEEEEEEEECCCHHCCCCCCCCHHHHHHCCCCCEECC T ss_conf 589947879998863205501788999999999759------96799998405551003755787999986299630068 Q ss_pred -----CCCHHHHHHHHHHHHHHHHHCCHHHH--HCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEECC Q ss_conf -----54428999998997678631102210--10127864201112665211378999999999999863066033112 Q gi|254780503|r 187 -----GINEFDEIIDDIWKFSEKQGLEIDTL--IHEEGPAQFEINIRHGDPLKLADQVFLFKRTIREAAFKHDIYATFMA 259 (461) Q Consensus 187 -----~~~~~~~~l~~i~~~l~~~Gi~ve~~--~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~ik~vA~~~G~~atFmp 259 (461) ..+-.+.+...+.+.|+..||++|.+ +.-|-.|.|.-. ...+.++.-..+.++..+-.+ -+..++.| T Consensus 94 dP~G~~~SyaeHf~~~f~~~L~~~gi~~e~~s~te~Y~sG~f~~~-----i~~~l~~~~~I~~Il~~~~g~-e~~~~y~P 167 (354) T cd00674 94 DPFGCHESYAEHFNAPFEESLDRFGIEVEFISASEMYKSGLYDEQ-----IKRALEKRDEIMEILNEYRGR-ELQETWYP 167 (354) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEHHHCCCCCCCHHH-----HHHHHHHHHHHHHHHHHHCCC-CCCCCEEE T ss_conf 977775459999999999999984982899885660225762799-----999998289999999985475-55786477 Q ss_pred -CCCCCCCC Q ss_conf -32113566 Q gi|254780503|r 260 -KPMQNYPG 267 (461) Q Consensus 260 -KP~~~~~G 267 (461) .|+....| T Consensus 168 ~~piC~~cG 176 (354) T cd00674 168 YMPYCEKCG 176 (354) T ss_pred EEEECCCCC T ss_conf 764058888 No 58 >COG4716 Myosin-crossreactive antigen [Function unknown] Probab=22.50 E-value=52 Score=13.36 Aligned_cols=83 Identities=12% Similarity=0.177 Sum_probs=39.7 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCC------HHHHHCCCCCCCCCEEEC--CCC---CHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 65442899999899767863110------221010127864201112--665---2113789999999999998630660 Q gi|254780503|r 186 MGINEFDEIIDDIWKFSEKQGLE------IDTLIHEEGPAQFEINIR--HGD---PLKLADQVFLFKRTIREAAFKHDIY 254 (461) Q Consensus 186 ~~~~~~~~~l~~i~~~l~~~Gi~------ve~~~~E~gpGQ~Ei~~~--~~~---~l~aaD~~~~~k~~ik~vA~~~G~~ 254 (461) ...++.+.++..++..|..-|++ ||.+.-+..|||--..-- +.+ .-...|++++.-..+-.-..-+|-. T Consensus 221 tkyNQYeSlvlPli~yL~~H~Vdf~~~~~Vedi~v~~t~gkkvA~aih~~~d~~~ieLt~dDlVfvTNgsiTeSsT~Gd~ 300 (587) T COG4716 221 TKYNQYESLVLPLITYLKSHGVDFTYDQKVEDIDVDDTPGKKVAKAIHVLGDAETIELTPDDLVFVTNGSITESSTYGDM 300 (587) T ss_pred CCCCHHHHHHHHHHHHHHHCCCCEEECCEEEEEEECCCCCHHHHHHHHHHCCCCEEECCCCCEEEEECCEECCCCCCCCC T ss_conf 44304788899999999873975472567765431147636688999873575525437876599855600233336888 Q ss_pred EEECCCCCCCCCCHH Q ss_conf 331123211356626 Q gi|254780503|r 255 ATFMAKPMQNYPGSA 269 (461) Q Consensus 255 atFmpKP~~~~~GsG 269 (461) -|=-||| ....||+ T Consensus 301 ~tpAp~t-~elggSw 314 (587) T COG4716 301 NTPAPKT-EELGGSW 314 (587) T ss_pred CCCCCCH-HHCCCCH T ss_conf 9989982-4329717 No 59 >PRK13895 conjugal transfer protein TraM; Provisional Probab=22.40 E-value=52 Score=13.34 Aligned_cols=16 Identities=44% Similarity=0.503 Sum_probs=7.8 Q ss_pred HHHHHHHHHHHHCCCE Q ss_conf 9999999986306603 Q gi|254780503|r 240 FKRTIREAAFKHDIYA 255 (461) Q Consensus 240 ~k~~ik~vA~~~G~~a 255 (461) +..+|++||.|||... T Consensus 5 i~e~I~~IA~KHGIal 20 (144) T PRK13895 5 IEELIREIAAKHGIAV 20 (144) T ss_pred HHHHHHHHHHHCCCCC T ss_conf 9999999999809214 No 60 >PTZ00326 phenylalanyl-tRNA synthetase; Provisional Probab=22.24 E-value=53 Score=13.32 Aligned_cols=41 Identities=17% Similarity=0.175 Sum_probs=23.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHH----CCCCC-EEEC--CCCCEEEEEC Q ss_conf 5752123425777899999861----02764-0232--2441799821 Q gi|254780503|r 118 YNGKEVTYTPRNVLKRVLNFYA----QNELK-PIIA--PEIEFYLIAK 158 (461) Q Consensus 118 ~dG~p~~~~PR~iLkr~i~~l~----~~G~~-~~~g--~E~EF~L~~~ 158 (461) ..|.+...--++-|.+++++.+ ++||+ +..+ +|.+|+-|+. T Consensus 231 a~g~~~~~g~~hp~~~~~~~~r~if~~mGF~E~~~~~~ves~fwNfDa 278 (505) T PTZ00326 231 ASGKKVSMGNLHPLLKVRQEFREIFMEMGFEEMPTNRWVESSFWNFDA 278 (505) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEECCCCCEEEECCCCCHH T ss_conf 988888877668899999999999997797676689713313213111 No 61 >COG1282 PntB NAD/NADP transhydrogenase beta subunit [Energy production and conversion] Probab=22.15 E-value=53 Score=13.31 Aligned_cols=33 Identities=18% Similarity=0.404 Sum_probs=18.4 Q ss_pred HHHHHHH---HHHCCCEEECCCCCCCC-CCHHHHHHHHHH Q ss_conf 9999998---63066033112321135-662689998763 Q gi|254780503|r 242 RTIREAA---FKHDIYATFMAKPMQNY-PGSAMHIHQSVL 277 (461) Q Consensus 242 ~~ik~vA---~~~G~~atFmpKP~~~~-~GsG~H~h~SL~ 277 (461) +.+|+++ ++.|..+.|-=.|..+. || |+|+=|- T Consensus 326 h~v~E~~~~L~~~Gv~VrfaIHPVAGRmPG---HMNVLLA 362 (463) T COG1282 326 HPVAEITEKLRARGVNVRFAIHPVAGRMPG---HMNVLLA 362 (463) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCCCCC---CHHHHHH T ss_conf 489999999986697157862213567885---1666212 No 62 >PTZ00260 glycosyl transferase group 2; Provisional Probab=22.11 E-value=33 Score=14.66 Aligned_cols=25 Identities=16% Similarity=0.341 Sum_probs=11.1 Q ss_pred CHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 6268999876313687423358865 Q gi|254780503|r 267 GSAMHIHQSVLDIKSNQNIFSNPDG 291 (461) Q Consensus 267 GsG~H~h~SL~d~~~g~n~f~~~~~ 291 (461) |-|.-+-.-+..++|.--+|.|.++ T Consensus 156 GKG~AVr~Gml~ArG~~ILfaDADg 180 (336) T PTZ00260 156 GKGGAVKLGMLASAGKYQLMMDADG 180 (336) T ss_pred CCHHHHHHHHHHHCCCEEEEEECCC T ss_conf 8469999999981798899994788 No 63 >COG3465 Uncharacterized conserved protein [Function unknown] Probab=21.96 E-value=53 Score=13.29 Aligned_cols=86 Identities=16% Similarity=0.159 Sum_probs=51.5 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 23425777899999861027640232244179982167888877787788887766787777665442899999899767 Q gi|254780503|r 123 VTYTPRNVLKRVLNFYAQNELKPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGINEFDEIIDDIWKFS 202 (461) Q Consensus 123 ~~~~PR~iLkr~i~~l~~~G~~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~l~~i~~~l 202 (461) .-.|-|-.|++++=-.++.|+. +..-++|.+.. . .++-+.+.+..| T Consensus 26 n~~d~R~KlQKlVYi~Kkl~~~--~~~~Y~FnlYG------------------------P--------YS~eLt~~v~~L 71 (171) T COG3465 26 NYGDGRKKLQKLVYIAKKLGFP--FSLDYDFNLYG------------------------P--------YSEELTDDVEEL 71 (171) T ss_pred CCCCHHHHHHHHHHHHHHHCCC--CHHHCCCCCCC------------------------C--------CCHHHHHHHHHH T ss_conf 1310588889898788764577--34314753358------------------------8--------658888999998 Q ss_pred HHHHCCHHHHHCCCCCC-CCCEEEC---CCC-CHHHHHHHHHHHHHHHHH Q ss_conf 86311022101012786-4201112---665-211378999999999999 Q gi|254780503|r 203 EKQGLEIDTLIHEEGPA-QFEINIR---HGD-PLKLADQVFLFKRTIREA 247 (461) Q Consensus 203 ~~~Gi~ve~~~~E~gpG-Q~Ei~~~---~~~-~l~aaD~~~~~k~~ik~v 247 (461) ...|. .+|.+.| |+|+.+. ..+ .....|.--+..++...+ T Consensus 72 ~N~G~-----i~E~~~G~q~e~~lte~~~~~L~~~~~dkd~~~l~I~a~l 116 (171) T COG3465 72 SNLGF-----IIEVKEGQQYEISLTEKGVSFLAITESDKDALKLEIAATL 116 (171) T ss_pred HCCCE-----EEEECCCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 05642-----3554265046875107778899998728606889999999 No 64 >KOG1534 consensus Probab=21.96 E-value=51 Score=13.43 Aligned_cols=28 Identities=18% Similarity=0.282 Sum_probs=22.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 6545777642232122207661013442 Q gi|254780503|r 298 RYFIGGLQKYIPNALVMLAPYGNSYRRL 325 (461) Q Consensus 298 ~~fiaGil~h~~al~a~~~PtvNSYkRl 325 (461) ..||+|+|..+.|+..+--|.||--..+ T Consensus 147 ~KfiSG~lsAlsAMi~lE~P~INvlsKM 174 (273) T KOG1534 147 TKFISGCLSALSAMISLEVPHINVLSKM 174 (273) T ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 8899999999999997367602244687 No 65 >pfam08532 Glyco_hydro_42M Beta-galactosidase trimerisation domain. This is non catalytic domain B of beta-galactosidase enzymes belong to the glycosyl hydrolase 42 family. This domain is related to glutamine amidotransferase enzymes, but the catalytic residues are replaced by non functional amino acids. This domain is involved in trimerisation. Probab=21.59 E-value=54 Score=13.24 Aligned_cols=54 Identities=6% Similarity=0.174 Sum_probs=39.1 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCEEECCCCCHHHHHHHHHHHHHH Q ss_conf 899999899767863110221010127864201112665211378999999999 Q gi|254780503|r 191 FDEIIDDIWKFSEKQGLEIDTLIHEEGPAQFEINIRHGDPLKLADQVFLFKRTI 244 (461) Q Consensus 191 ~~~~l~~i~~~l~~~Gi~ve~~~~E~gpGQ~Ei~~~~~~~l~aaD~~~~~k~~i 244 (461) ..+.+.+.+++|..+||+|.-++.+.--.+|.+-+.|.-.+-.-+-.-..+..+ T Consensus 28 Y~~~v~~~Y~aL~~~gv~vDiv~~~~dls~YklvvvP~l~~~~~~~~~~L~~yv 81 (207) T pfam08532 28 YLDLVRDWYRALRRLGIPVDVVPPDADLSGYKLVVAPMLYMVSEELAKKLEAYV 81 (207) T ss_pred HHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCEEEEEEEEEECHHHHHHHHHHH T ss_conf 899999999999986996777047688121959998557880999999999999 No 66 >TIGR00151 ispF 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; InterPro: IPR003526 This entry represents MECDP (2-C-methyl-D-erythritol 2,4-cyclodiphosphate) synthetase, an enzyme in the non-mevalonate pathway of isoprenoid synthesis, isoprenoids being essential in all organisms. Isoprenoids can also be synthesized through the mevalonate pathway. The non-mevolante route is used by many bacteria and human pathogens, including Mycobacterium tuberculosis and Plasmodium falciparum. This route appears to involve seven enzymes. MECDP synthetase catalyses the intramolecular attack by a phosphate group on a diphosphate, with cytidine monophosphate (CMP) acting as the leaving group to give the cyclic diphosphate product MEDCP. The enzyme is a trimer with three active sites shared between adjacent copies of the protein. The enzyme also has two metal binding sites, the metals playing key roles in catalysis. A number of proteins from eukaryotes and prokaryotes share this common N-terminal signature and appear to be involved in terpenoid biosynthesis. The ygbB protein is a putative enzyme of this type .; GO: 0008685 2-C-methyl-D-erythritol 24-cyclodiphosphate synthase activity, 0016114 terpenoid biosynthetic process. Probab=21.33 E-value=55 Score=13.20 Aligned_cols=58 Identities=12% Similarity=0.056 Sum_probs=33.5 Q ss_pred CCCCCEEEEEECCCCEECCCCCCCEEEEEEECCCCCCCCC-CCCHHHHHHHHHHHHHCCCCCEE Q ss_conf 8887789996001245387888770999974033575212-34257778999998610276402 Q gi|254780503|r 84 ERDGDLRLSPDLSTLSIVPWEMDPTAQIVCDVVDYNGKEV-TYTPRNVLKRVLNFYAQNELKPI 146 (461) Q Consensus 84 ~~~~D~~l~PD~~Tl~~lPw~~~~~a~vl~~~~~~dG~p~-~~~PR~iLkr~i~~l~~~G~~~~ 146 (461) --|+|..|.---+.+ +--.. .-=++..+..+-..| ..|.|.+||++++..+++||++- T Consensus 37 HSDGDvllHAl~DAl--LGA~g---lGDIG~~FPdtd~~wKgaDS~~lL~~~~~~~~~~gy~~~ 95 (159) T TIGR00151 37 HSDGDVLLHALTDAL--LGALG---LGDIGKHFPDTDPRWKGADSRVLLRKAVELVKEKGYRIG 95 (159) T ss_pred ECHHHHHHHHHHHHH--HHHHH---HCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHHHCCCCEE T ss_conf 162788999999888--87752---000224378623345688778999999999985588045 No 67 >PRK02308 uvsE putative UV damage endonuclease; Provisional Probab=21.13 E-value=55 Score=13.18 Aligned_cols=25 Identities=12% Similarity=0.161 Sum_probs=13.9 Q ss_pred HHHHHHHHHCCCEEECCCCCCCCCC Q ss_conf 9999986306603311232113566 Q gi|254780503|r 243 TIREAAFKHDIYATFMAKPMQNYPG 267 (461) Q Consensus 243 ~ik~vA~~~G~~atFmpKP~~~~~G 267 (461) -|.+.|+++|.+.||+|-+|.-.+. T Consensus 96 ~iG~~ak~~~iRls~HPgqf~vL~S 120 (316) T PRK02308 96 KIGALIKEHNMRLSFHPDQFVVLNS 120 (316) T ss_pred HHHHHHHHHCCEEEECCCCCEECCC T ss_conf 9999999829458627976232689 No 68 >TIGR00631 uvrb excinuclease ABC, B subunit; InterPro: IPR004807 All proteins in this family for which functions are known are DNA helicases that function in the nucleotide excision repair and are endonucleases that make the 3' incision next to DNA damage. They are part of a pathway requiring UvrA, UvrB, UvrC, and UvrD homologs.; GO: 0009381 excinuclease ABC activity, 0006289 nucleotide-excision repair, 0005737 cytoplasm, 0009380 excinuclease repair complex. Probab=21.11 E-value=35 Score=14.52 Aligned_cols=59 Identities=19% Similarity=0.177 Sum_probs=32.2 Q ss_pred EEEEEEHHHHHH--------CCCCCCCCCEEECC--CCCCCCCCCCCCCCCCC---C-CEEEEEECCCCEECCC Q ss_conf 126761988730--------15677361000002--42774346655577888---7-7899960012453878 Q gi|254780503|r 44 RGKMMPSSKFIS--------NIPLTVPSAIYRHT--ISGESLHSPYLFHNERD---G-DLRLSPDLSTLSIVPW 103 (461) Q Consensus 44 RgK~vp~~~f~~--------~~~~g~~~~~~~~~--~~g~~~~~~~~~~~~~~---~-D~~l~PD~~Tl~~lPw 103 (461) .+|.+-+.++.. -...|+|..+-|.+ ++|..-..|+...-+-. | |++|+-| +|=+-+|- T Consensus 275 ~~kllEaQRL~qRT~yDLEMl~e~G~C~GIENYSRhl~GR~~GEpP~tLlDYF~gfP~DfLlviD-ESHVT~PQ 347 (667) T TIGR00631 275 QGKLLEAQRLKQRTEYDLEMLREMGYCSGIENYSRHLSGRKPGEPPYTLLDYFIGFPEDFLLVID-ESHVTLPQ 347 (667) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHCCCCCCCCCCCCCCCHHHHCCCCCCCEEEEEE-ECCCCHHH T ss_conf 79517888898868877888750673377000000216777631596477650468865089884-02575022 No 69 >PRK05784 phosphoribosylamine--glycine ligase; Provisional Probab=21.10 E-value=55 Score=13.17 Aligned_cols=91 Identities=24% Similarity=0.189 Sum_probs=55.5 Q ss_pred CEEECCCCCEEEEECCCCCC--CCCC----CCCCCCCCCCCCCCCCCCC----C-CCH-----HHHHHHHHHHHHHH-HH Q ss_conf 40232244179982167888--8777----8778888776678777766----5-442-----89999989976786-31 Q gi|254780503|r 144 KPIIAPEIEFYLIAKNEDPD--YPLQ----PPKGRSGRSILGGQSYSIM----G-INE-----FDEIIDDIWKFSEK-QG 206 (461) Q Consensus 144 ~~~~g~E~EF~L~~~~~~~~--~~~~----~~~~~~~~~~~~~~~y~~~----~-~~~-----~~~~l~~i~~~l~~-~G 206 (461) +++.|.|.-|+.|-...+-. .+.+ --.+..+.-+.++++|+-. + +.+ ...+++..+..|.. +| T Consensus 203 EfL~G~E~S~~a~~DG~~~~~lp~aQDhKR~~dgD~GPNTGGMGAysP~~~~~P~l~~~~~~~~~~Ii~pti~~l~~e~g 282 (485) T PRK05784 203 EKVDGVEYTVQVLTDGESVFPLPPVQDNPHAYEYGIGPETGGMGSVSGPGELLPFIEEEEYEEAVEIVKATVEAIRKETG 282 (485) T ss_pred ECCCCCEEEEEEEEECCEEEECCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 64468179999998299699892220576556899998999870067876677544799999998748999999986169 Q ss_pred CCHHHH------HCCCCCCCCCEEECCCCCHHHH Q ss_conf 102210------1012786420111266521137 Q gi|254780503|r 207 LEIDTL------IHEEGPAQFEINIRHGDPLKLA 234 (461) Q Consensus 207 i~ve~~------~~E~gpGQ~Ei~~~~~~~l~aa 234 (461) ++...+ .++.||--.|+|.++.||-..+ T Consensus 283 ~~y~GvLY~GlMit~~GPkVlEfN~RfGDPEtQ~ 316 (485) T PRK05784 283 ERYVGVLSGQMMLTALGPTVIEYYSRLGDPEALN 316 (485) T ss_pred CCCEEEEEEEEEECCCCCEEEEEECCCCCCHHHH T ss_conf 9727888767886189848999968878833888 No 70 >TIGR00092 TIGR00092 GTP-binding protein YchF; InterPro: IPR004396 This is a family of conserved hypothetical proteins found in both prokaryotes and eukaryotes. While the function of these proteins is not known, the crystal structure of P44681 from SWISSPROT from Haemophilus influenzae has been determined . This protein consists of three domains: an N-terminal domain which has a mononucleotide binding fold typical for the P-loop NTPases, a central domain which forms an alpha-helical coiled coil, and a C-terminal domain composed of a six-stranded half-barrel curved around an alpha helix. The central and C-terminal domains are topologically similar to RNA-binding proteins, while the N-terminal region contains the features typical of GTP-dependent molecular switches. The purified protein was capable of binding both double-stranded nucleic acid and GTP. It was suggested, therefore, that this protein might be part of a nucleoprotein complex and could function as a GTP-dependent translation factor.; GO: 0005525 GTP binding. Probab=21.05 E-value=56 Score=13.17 Aligned_cols=32 Identities=19% Similarity=0.290 Sum_probs=26.5 Q ss_pred CHHHHHHHHHHCCC-CEEEEEEECCCCCEEEEE Q ss_conf 98999999971798-399999978898331267 Q gi|254780503|r 16 NWEQAAKWLKDNRI-EDVECITPDLAGIPRGKM 47 (461) Q Consensus 16 ~~~~v~~~l~~~~I-~~V~v~~~Dl~G~~RgK~ 47 (461) .++.+.+.++...+ -+=++.|+|+.|..||=+ T Consensus 51 RLd~La~i~k~~k~evptt~~fvDIAGLvkGAS 83 (390) T TIGR00092 51 RLDKLAEIYKPEKVEVPTTLEFVDIAGLVKGAS 83 (390) T ss_pred CHHHHHHHHCCCEEECCEEEEEEEECCCCCCCC T ss_conf 333477640642041140489986223410001 No 71 >KOG4116 consensus Probab=20.99 E-value=26 Score=15.35 Aligned_cols=48 Identities=19% Similarity=0.232 Sum_probs=35.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCC---CCCCC------CCCCCCCCEEEEE Q ss_conf 521566654577764223212220766101344204655---75433------4243345003650 Q gi|254780503|r 292 SETDNFRYFIGGLQKYIPNALVMLAPYGNSYRRLVLDMG---CPVNN------AWGYDNRTTAFRI 348 (461) Q Consensus 292 ~ls~~~~~fiaGil~h~~al~a~~~PtvNSYkRl~p~~~---ap~~~------~WG~~NR~~~iRi 348 (461) .||+..+...+|+.+ ...+|++||+....| .|..| +||.+-.+++-|- T Consensus 25 aLSPfeQra~~g~F~---------~~~~n~fRr~~~~~~y~~iP~~Iv~yliy~wg~e~ne~l~rK 81 (90) T KOG4116 25 ALSPFEQRAYAGFFD---------KAFPNMFRRFRSDQLYVVIPQFIVAYLIYDWGKETNEALNRK 81 (90) T ss_pred ECCCHHHCCCCCHHH---------HHHHHHHHHHHHCCEEEEECCCEEEEEEEECCHHHHHHHHCC T ss_conf 518113021123565---------540799999554457998344018998886351676887611 No 72 >pfam08777 RRM_3 RNA binding motif. This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif. Probab=20.97 E-value=56 Score=13.15 Aligned_cols=35 Identities=17% Similarity=0.180 Sum_probs=26.3 Q ss_pred HCCCHHHHHHHHHHC-CCCEEEEEEECCCCCEEEEE Q ss_conf 589989999999717-98399999978898331267 Q gi|254780503|r 13 GVKNWEQAAKWLKDN-RIEDVECITPDLAGIPRGKM 47 (461) Q Consensus 13 ~~~~~~~v~~~l~~~-~I~~V~v~~~Dl~G~~RgK~ 47 (461) ...+.+++...+++. .|+||.+.--|..|.+|-+. T Consensus 11 ~~~sRediK~~f~~~g~V~yVD~~~Gd~eg~vRf~~ 46 (102) T pfam08777 11 KPTSREDIKEAFSQHGEVKYVDFLEGDKEGYVRFKT 46 (102) T ss_pred CCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEECC T ss_conf 888999999999835975689842788369999679 No 73 >cd00948 FBP_aldolase_I_a Fructose-1,6-bisphosphate aldolase. The enzyme catalyzes the cleavage of fructose 1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate (DHAP). This family includes proteins found in vertebrates, plants, and bacterial plant pathogens. Mutations in the aldolase genes in humans cause hemolytic anemia and hereditary fructose intolerance. The enzyme is a member of the class I aldolase family, which utilizes covalent catalysis through a Schiff base formed between a lysine residue of the enzyme and ketose substrates. Probab=20.02 E-value=58 Score=13.03 Aligned_cols=70 Identities=16% Similarity=0.278 Sum_probs=46.4 Q ss_pred CCCCCCCCCHHHHHHHHHHH---HHCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHH Q ss_conf 57521234257778999998---610276402322441799821678888777877888877667877776654-42899 Q gi|254780503|r 118 YNGKEVTYTPRNVLKRVLNF---YAQNELKPIIAPEIEFYLIAKNEDPDYPLQPPKGRSGRSILGGQSYSIMGI-NEFDE 193 (461) Q Consensus 118 ~dG~p~~~~PR~iLkr~i~~---l~~~G~~~~~g~E~EF~L~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~-~~~~~ 193 (461) .+|.|...|-+......-++ .++.|+.|.| |+|- |++.+ +++.+. ..-+. T Consensus 142 ~~~~PS~~~I~~Na~~LArYA~icQ~~glVPIV--EPEV-LmdG~-----------------------H~ie~c~~vt~~ 195 (330) T cd00948 142 GNGTPSELAIKENAHGLARYAAICQENGLVPIV--EPEV-LMDGD-----------------------HDIERCQEVTEK 195 (330) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCC--CCCC-CCCCC-----------------------CCHHHHHHHHHH T ss_conf 999986999999899999999999985981321--5010-01676-----------------------347999999999 Q ss_pred HHHHHHHHHHHHHCCHHHHH Q ss_conf 99989976786311022101 Q gi|254780503|r 194 IIDDIWKFSEKQGLEIDTLI 213 (461) Q Consensus 194 ~l~~i~~~l~~~Gi~ve~~~ 213 (461) ++..+++.|...||.+|... T Consensus 196 vL~~vf~~L~~~~V~leG~l 215 (330) T cd00948 196 VLAAVYKALNDHHVLLEGTL 215 (330) T ss_pred HHHHHHHHHHHCCCCCCCCE T ss_conf 99999999998297657744 Done!