Query gi|254780504|ref|YP_003064917.1| glucose-6-phosphate 1-dehydrogenase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 492 No_of_seqs 158 out of 1448 Neff 6.0 Searched_HMMs 39220 Date Sun May 29 23:25:56 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780504.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00871 zwf glucose-6-phosph 100.0 0 0 1547.3 40.8 480 9-491 1-498 (498) 2 COG0364 Zwf Glucose-6-phosphat 100.0 0 0 1365.9 49.2 478 7-491 4-483 (483) 3 PRK05722 glucose-6-phosphate 1 100.0 0 0 1354.9 54.3 482 7-491 6-495 (495) 4 PRK12853 glucose-6-phosphate 1 100.0 0 0 1352.6 54.3 481 5-492 3-485 (486) 5 PTZ00309 glucose-6-phosphate d 100.0 0 0 1329.4 52.1 470 9-492 67-554 (556) 6 PRK12854 glucose-6-phosphate 1 100.0 0 0 1312.4 50.6 463 5-492 6-475 (475) 7 KOG0563 consensus 100.0 0 0 1176.8 43.1 472 9-490 16-498 (499) 8 pfam02781 G6PD_C Glucose-6-pho 100.0 0 0 862.0 29.8 292 192-491 1-294 (294) 9 pfam00479 G6PD_N Glucose-6-pho 100.0 0 0 456.2 20.0 175 14-190 1-183 (183) 10 pfam01408 GFO_IDH_MocA Oxidore 96.9 0.035 9E-07 35.6 11.3 49 117-170 64-112 (120) 11 COG0673 MviM Predicted dehydro 95.6 0.24 6E-06 29.7 12.2 186 10-251 3-194 (342) 12 PRK10206 putative dehydrogenas 94.5 0.47 1.2E-05 27.5 21.5 69 116-189 66-139 (345) 13 PRK11579 putative oxidoreducta 94.2 0.17 4.3E-06 30.7 6.1 109 12-170 6-114 (346) 14 TIGR03649 ergot_EASG ergot alk 79.7 4.9 0.00012 20.2 7.7 78 12-126 1-78 (285) 15 KOG0890 consensus 79.5 4.7 0.00012 20.3 5.0 84 183-277 821-906 (2382) 16 KOG2741 consensus 79.3 5 0.00013 20.1 10.5 184 22-259 16-209 (351) 17 KOG0946 consensus 74.3 3.7 9.5E-05 21.1 3.3 113 158-294 231-351 (970) 18 PRK05989 cobN cobaltochelatase 65.6 10 0.00026 17.9 4.0 99 186-296 912-1031(1251) 19 pfam02514 CobN-Mg_chel CobN/Ma 62.1 9 0.00023 18.3 3.2 97 186-296 739-857 (1064) 20 pfam02446 Glyco_hydro_77 4-alp 55.4 3.2 8.2E-05 21.5 -0.1 23 164-186 270-292 (494) 21 COG5649 Uncharacterized conser 54.3 16 0.0004 16.6 4.7 30 325-357 47-81 (132) 22 PRK12321 cobN cobaltochelatase 53.2 10 0.00027 17.9 2.3 129 151-296 726-887 (1101) 23 pfam05542 DUF760 Protein of un 52.9 7.7 0.0002 18.8 1.5 16 237-252 121-136 (328) 24 pfam01474 DAHP_synth_2 Class-I 52.7 17 0.00042 16.4 4.4 16 55-70 18-33 (437) 25 TIGR02025 BchH magnesium chela 52.0 15 0.00039 16.7 2.9 97 232-350 1087-1195(1384) 26 pfam05368 NmrA NmrA-like famil 51.0 18 0.00045 16.2 9.6 98 13-149 1-100 (232) 27 KOG3363 consensus 49.7 17 0.00044 16.3 2.9 31 40-71 109-139 (196) 28 PRK09072 short chain dehydroge 49.6 18 0.00047 16.1 5.3 74 7-110 2-75 (262) 29 pfam04989 CmcI Cephalosporin h 48.6 19 0.00049 16.0 9.2 91 87-184 81-186 (202) 30 COG1640 MalQ 4-alpha-glucanotr 48.5 6.3 0.00016 19.4 0.5 23 163-185 286-308 (520) 31 PRK10525 cytochrome o ubiquino 47.3 20 0.00051 15.8 4.6 11 199-209 76-86 (305) 32 PRK13405 bchH magnesium chelat 46.7 20 0.00052 15.8 3.0 23 419-441 482-504 (1207) 33 TIGR01214 rmlD dTDP-4-dehydror 46.3 21 0.00052 15.7 4.7 32 12-53 1-32 (317) 34 PRK00092 hypothetical protein; 43.3 23 0.00058 15.4 3.2 34 145-178 35-68 (153) 35 cd06842 PLPDE_III_Y4yA_like Ty 42.4 23 0.0006 15.3 4.2 34 151-184 144-177 (423) 36 TIGR00565 trpE_proteo anthrani 42.2 13 0.00032 17.3 1.3 118 53-197 228-359 (505) 37 COG0779 Uncharacterized protei 42.0 24 0.0006 15.3 3.0 31 145-175 39-69 (153) 38 TIGR02778 ligD_pol DNA polymer 40.9 23 0.00058 15.4 2.4 49 337-390 86-136 (251) 39 pfam09861 DUF2088 Uncharacteri 38.7 27 0.00068 14.9 5.9 115 123-245 68-194 (203) 40 KOG4128 consensus 38.4 23 0.00058 15.4 2.1 73 147-261 160-234 (457) 41 TIGR01915 npdG NADPH-dependent 38.4 27 0.00068 14.9 4.8 50 13-70 3-57 (233) 42 cd02964 TryX_like_family Trypa 38.1 27 0.00069 14.9 5.6 55 11-66 19-73 (132) 43 TIGR03346 chaperone_ClpB ATP-d 38.1 27 0.00069 14.9 3.0 31 23-66 290-320 (852) 44 COG4785 NlpI Lipoprotein NlpI, 38.0 18 0.00046 16.2 1.5 10 179-188 206-215 (297) 45 TIGR02788 VirB11 P-type DNA tr 37.7 15 0.00039 16.7 1.1 32 11-49 159-190 (328) 46 TIGR00873 gnd 6-phosphoglucona 37.1 20 0.00051 15.8 1.6 15 44-58 22-36 (480) 47 PRK05653 fabG 3-ketoacyl-(acyl 36.6 28 0.00073 14.7 5.2 74 8-109 3-76 (246) 48 pfam07505 Gp37_Gp68 Phage prot 36.4 18 0.00045 16.2 1.3 40 449-490 205-251 (253) 49 CHL00194 ycf39 Ycf39; Provisio 36.4 29 0.00073 14.7 9.6 105 12-156 2-114 (319) 50 pfam08877 MepB MepB protein. M 36.2 29 0.00074 14.7 8.1 71 317-392 5-75 (123) 51 KOG3437 consensus 36.0 27 0.00069 14.9 2.2 23 338-360 93-115 (184) 52 cd01067 globin_like superfamil 35.8 29 0.00075 14.6 3.8 34 56-89 56-90 (117) 53 COG1429 CobN Cobalamin biosynt 33.5 31 0.00078 14.5 2.1 48 201-249 919-968 (1388) 54 pfam08659 KR KR domain. This e 33.3 32 0.00082 14.4 7.5 73 12-109 2-75 (181) 55 pfam05320 Pox_RNA_Pol_19 Poxvi 32.7 33 0.00083 14.3 3.4 41 15-70 82-122 (167) 56 TIGR03159 cas_Csc1 CRISPR-asso 32.6 33 0.00084 14.3 2.5 47 302-353 112-159 (223) 57 pfam08644 SPT16 FACT complex s 32.3 33 0.00085 14.2 2.4 27 333-359 18-48 (152) 58 TIGR00763 lon ATP-dependent pr 29.8 25 0.00065 15.1 1.2 57 155-216 533-597 (941) 59 KOG4278 consensus 29.6 26 0.00065 15.1 1.2 57 199-258 282-339 (1157) 60 KOG0722 consensus 29.5 37 0.00094 13.9 2.5 131 22-168 11-150 (329) 61 PRK07452 DNA polymerase III su 29.5 37 0.00094 13.9 7.7 13 7-19 29-41 (326) 62 TIGR02776 NHEJ_ligase_prk DNA 29.1 34 0.00086 14.2 1.7 159 83-245 96-310 (645) 63 COG4884 Uncharacterized protei 29.0 19 0.00049 16.0 0.5 67 150-237 39-107 (176) 64 COG3240 Phospholipase/lecithin 28.7 34 0.00086 14.2 1.7 40 161-200 240-282 (370) 65 TIGR01085 murE UDP-N-acetylmur 27.4 27 0.0007 14.9 1.0 15 40-54 83-97 (494) 66 pfam07993 NAD_binding_4 Male s 27.2 40 0.001 13.7 6.5 77 16-107 2-84 (245) 67 TIGR02088 LEU3_arch isopropylm 27.2 31 0.00079 14.5 1.2 15 10-24 61-75 (350) 68 pfam02670 DXP_reductoisom 1-de 27.1 40 0.001 13.6 6.6 66 13-89 1-66 (129) 69 COG0542 clpA ATP-binding subun 26.9 41 0.001 13.6 3.4 31 23-66 287-317 (786) 70 COG3935 DnaD Putative primosom 26.9 41 0.001 13.6 2.1 18 446-463 178-195 (246) 71 KOG1151 consensus 26.6 29 0.00073 14.7 1.0 146 167-350 402-571 (775) 72 COG3487 IrpA Uncharacterized i 26.1 35 0.00089 14.1 1.4 38 153-202 66-120 (446) 73 PRK05854 short chain dehydroge 25.8 42 0.0011 13.5 5.8 129 3-172 7-139 (314) 74 COG3875 Uncharacterized conser 25.7 43 0.0011 13.5 6.6 46 24-69 82-129 (423) 75 TIGR01849 PHB_depoly_PhaZ poly 25.4 28 0.00072 14.7 0.8 52 1-56 197-258 (414) 76 KOG1384 consensus 25.2 31 0.00078 14.5 0.9 59 10-73 7-94 (348) 77 TIGR02746 TraC-F-type type-IV 25.1 41 0.001 13.6 1.5 39 435-474 336-374 (900) 78 cd05567 PTS_IIB_mannitol PTS_I 24.7 44 0.0011 13.3 2.1 34 159-192 51-84 (87) 79 KOG2733 consensus 24.6 45 0.0011 13.3 6.7 49 10-67 5-53 (423) 80 pfam04271 consensus 24.5 45 0.0011 13.3 2.1 19 445-463 48-66 (73) 81 KOG1774 consensus 24.4 27 0.0007 14.9 0.5 41 183-233 6-49 (88) 82 pfam03699 UPF0182 Uncharacteri 24.1 14 0.00035 17.0 -1.0 44 424-477 597-640 (771) 83 PRK13245 hetR heterocyst diffe 24.0 28 0.00071 14.8 0.5 24 437-460 256-279 (299) 84 PRK07024 short chain dehydroge 23.6 46 0.0012 13.2 4.6 70 10-109 2-71 (256) 85 COG2607 Predicted ATPase (AAA+ 23.6 46 0.0012 13.2 4.0 91 11-121 86-179 (287) 86 PTZ00213 asparagine synthetase 23.5 6.6 0.00017 19.3 -2.7 41 259-307 182-222 (350) 87 cd03009 TryX_like_TryX_NRX Try 23.5 47 0.0012 13.2 8.6 43 11-53 20-62 (131) 88 COG0300 DltE Short-chain dehyd 23.2 47 0.0012 13.1 7.2 196 8-254 4-225 (265) 89 COG3268 Uncharacterized conser 23.2 39 0.001 13.7 1.2 123 10-150 6-131 (382) 90 TIGR00570 cdk7 CDK-activating 23.2 44 0.0011 13.4 1.4 14 419-432 303-316 (322) 91 PRK09537 pylS pyrolysyl-tRNA s 22.9 48 0.0012 13.1 1.6 40 47-89 47-86 (420) 92 pfam09093 Lyase_catalyt Lyase, 22.5 49 0.0012 13.1 3.4 55 234-303 187-241 (356) 93 COG2382 Fes Enterochelin ester 22.3 49 0.0013 13.0 3.8 107 120-248 58-166 (299) 94 PRK11052 malQ 4-alpha-glucanot 22.3 30 0.00075 14.6 0.4 27 156-185 428-454 (694) 95 TIGR02371 ala_DH_arch alanine 21.9 45 0.0011 13.3 1.3 66 15-92 112-179 (327) 96 TIGR02769 nickel_nikE nickel i 21.8 50 0.0013 13.0 1.5 35 328-362 68-102 (267) 97 TIGR00154 ispE 4-diphosphocyti 21.7 39 0.00098 13.8 0.9 29 332-360 147-177 (322) 98 PRK12493 magnesium chelatase; 21.6 51 0.0013 12.9 3.6 40 202-244 993-1036(1318) 99 PRK01060 endonuclease IV; Prov 21.3 35 0.00088 14.1 0.6 21 322-342 242-262 (281) 100 KOG3559 consensus 21.2 40 0.001 13.7 0.9 14 207-220 317-330 (598) 101 COG1947 IspE 4-diphosphocytidy 20.9 48 0.0012 13.1 1.2 26 332-357 136-161 (289) 102 COG3564 Uncharacterized protei 20.7 53 0.0013 12.8 2.8 67 287-364 27-94 (116) 103 PRK02509 hypothetical protein; 20.7 19 0.00049 16.0 -0.9 44 424-477 725-768 (980) 104 pfam04076 BOF Bacterial OB fol 20.7 41 0.001 13.6 0.8 15 322-336 84-98 (126) 105 pfam03767 Acid_phosphat_B HAD 20.6 53 0.0013 12.8 5.0 100 76-185 104-204 (230) 106 PRK08313 acetyl-CoA acetyltran 20.5 53 0.0014 12.8 1.9 45 42-86 3-51 (391) 107 PRK11034 clpA ATP-dependent Cl 20.5 53 0.0014 12.8 3.0 22 425-446 548-569 (758) 108 KOG0672 consensus 20.4 53 0.0014 12.8 3.9 41 8-48 17-57 (218) 109 pfam02576 DUF150 Uncharacteriz 20.3 54 0.0014 12.7 3.2 33 145-177 27-59 (141) 110 COG2502 AsnA Asparagine synthe 20.3 19 0.00048 16.0 -0.9 161 107-305 46-218 (330) 111 TIGR00587 nfo apurinic endonuc 20.3 9.4 0.00024 18.2 -2.5 23 319-341 271-293 (318) No 1 >TIGR00871 zwf glucose-6-phosphate 1-dehydrogenase; InterPro: IPR001282 Glucose-6-phosphate dehydrogenase (1.1.1.49 from EC) (G6PDH) is a ubiquitous protein, present in bacteria and all eukaryotic cell types . The enzyme catalyses the the first step in the pentose pathway, i.e. the conversion of glucose-6-phosphate to gluconolactone 6-phosphate in the presence of NADP, producing NADPH. The ubiquitous expression of the enzyme gives it a major role in the production of NADPH for the many NADPH-mediated reductive processes in all cells . Deficiency of G6PDH is a common genetic abnormality affecting millions of people worldwide. Many sequence variants, most caused by single point mutations, are known, exhibiting a wide variety of phenotypes .; GO: 0004345 glucose-6-phosphate dehydrogenase activity, 0006006 glucose metabolic process. Probab=100.00 E-value=0 Score=1547.33 Aligned_cols=480 Identities=44% Similarity=0.763 Sum_probs=463.0 Q ss_pred CCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 89878998644356676577999999972898757817999988869988999999999987423467998999989851 Q gi|254780504|r 9 HNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSL 88 (492) Q Consensus 9 ~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~ 88 (492) ++|.|||||||||||||||+||||+|++.|+||+ +|.|||+||++||+++||+.+++++.++.+. +.+...|++|+++ T Consensus 1 ~~~~lVIFGAsGDLA~rKl~PaLy~L~~~g~L~~-~~~ivG~AR~~~s~~~fr~~~~~a~~~~~~~-~~~~~~~d~F~~~ 78 (498) T TIGR00871 1 DPCILVIFGASGDLARRKLIPALYRLFREGLLPP-DFRIVGVARRDLSDEEFRKVVREALIKFETE-EIDEQLLDEFAQR 78 (498) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCC-CCEEEEEECCCCCHHHHHHHHHHHHHCCCCC-CCCHHHHHHHHCC T ss_conf 9960798167502034334468899987065989-8479987457688789999997433105786-6442259998607 Q ss_pred CCEEECCCCCHHHHHHHHHHHCC-----CCCCCEEEEECCCHHHHHHHHHHHHHHCCCC-----C--CCEEEEECCCCCC Q ss_conf 65786467897789988887405-----7678469984069568666777777725876-----4--4357760566789 Q gi|254780504|r 89 IFYINLDVEKNYGWELLGKLLDI-----DKTRIRVFYLAMSSAFFGKISQRIHANNLVT-----E--HTRVVLEKPIGSN 156 (492) Q Consensus 89 ~~Y~~~d~~~~~~y~~L~~~l~~-----~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~-----~--~~RiViEKPFG~D 156 (492) ++|+++|++|+++|..|++.|+. ....|||||||+||++|++||++|+++||+. + |+||||||||||| T Consensus 79 ~~Y~~g~~dd~~~y~~L~~~L~~l~~~~~~~gNr~fYLa~PP~~F~~i~~~L~~~~l~~~nq~~~~~w~Rvv~EKPFG~D 158 (498) T TIGR00871 79 LSYVSGDVDDDESYKKLAELLEKLDKTYQTEGNRLFYLATPPSLFGTIIKQLKKHGLNERNQEQGKTWSRVVIEKPFGHD 158 (498) T ss_pred CEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCHHHHHHHHHHHCCCCCCCCCCCEEEEEEECCCCCC T ss_conf 35774467876889999999989998518889745650478441026788776531000135578844799987889998 Q ss_pred HHHHHHHHHHHHHHCCHHHEEECCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHH Q ss_conf 88999999999873896883512420125687777887747752002528033307999998467702455553220017 Q gi|254780504|r 157 LSSAQEIHAITRKIFKESQIFRIDHYLGKEAVQGLMVFRFANTFYESLWNNKYIDHVQITTAETIGVEDRVDYYNNTGAL 236 (492) Q Consensus 157 l~SA~eLn~~l~~~f~E~qiyRIDHYLGKe~vqnil~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~yyD~~Gai 236 (492) |+||++||+.|.++|+|+||||||||||||||||||+|||||.+|||+|||+|||+||||++|++||||||||||++||| T Consensus 159 L~SA~~Ln~~~~~~F~E~qiYRIDHYLGKEtVqNllvlRFAN~~Fep~WNr~~Id~VqIT~aE~~GvEgRggYyD~~Gal 238 (498) T TIGR00871 159 LASAQELNKQLRAVFKEDQIYRIDHYLGKETVQNLLVLRFANQIFEPLWNRRYIDHVQITVAESFGVEGRGGYYDKSGAL 238 (498) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEECCCCCCCHHHHHCCHH T ss_conf 78999999998865484644674101264578899999647887532115776444899766504658630154540426 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECCCCCCCCCEECCCCCCCCCCCC-CC Q ss_conf 88765589999999871854458858999999999973068881122361365213667768734334101479886-64 Q gi|254780504|r 237 RDMIQNHLLQLLCLVAMEMPPSVEAKSIKNEKIKVLQALQMITPENVQKLTVRGQYQSGIINGIPVKGYLEEIPLGI-SD 315 (492) Q Consensus 237 RDmvQNHllQlL~lvAMe~P~~~~~~~ir~eK~kvlk~~r~~~~~~v~~~~v~GQY~~~~~~~~~v~gY~~e~~v~~-S~ 315 (492) ||||||||||||||||||||.+++||+|||||||||+||||++++||..++|||||.+|+++|++||||+||+||++ |+ T Consensus 239 RDmvQNHLLQlL~L~AME~P~~~~a~~iRdEKVKVL~a~r~~~~~dv~~~~VrGQY~ag~~~G~~v~GY~eE~gV~~dS~ 318 (498) T TIGR00871 239 RDMVQNHLLQLLALVAMEPPVSFDADSIRDEKVKVLKALRPIDPDDVENNVVRGQYGAGEIGGVSVPGYLEEEGVDKDSN 318 (498) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHCCCEEEEECCCCCCCEECCCCCCCCCCCCCCC T ss_conf 67889899999999825888756867752036668864151483213104020032255558872256337788874548 Q ss_pred CCCEEEEEEEECCCCCCCCEEEEEECCCCCCCEEEEEEEEECCCCCCCCCCCCCCCC--CEEEEEECCCCCEEEEEEECC Q ss_conf 874389999970858678577755133468851799999854886437666547898--569999717965044433126 Q gi|254780504|r 316 TETFVAIKANIDNPRWSGIPFYLRTGKYLAKHISEIVIALKPAPCTIFDHQMSKIET--NKLILRLQNNGEIEQFITTKD 393 (492) Q Consensus 316 TeTfaa~~l~idn~RW~gVPF~lrtGK~l~~k~~eI~i~FK~~p~~~f~~~~~~~~~--N~Lvi~iqP~~~i~l~~~~K~ 393 (492) ||||||+|++||||||+||||||||||||++|+|||+|+||++|+++|.....+..| |.|||||||+|||.|+|++|. T Consensus 319 TeTF~A~~~~IdN~RW~GVPFYlRtGKrL~~k~tEi~~~Fk~~p~~~F~~~~~~~~pE~N~LvIRiQPDEGv~lkf~~K~ 398 (498) T TIGR00871 319 TETFAALKLEIDNWRWAGVPFYLRTGKRLPEKVTEIRIQFKTVPLNLFKANERDANPERNALVIRIQPDEGVYLKFNAKK 398 (498) T ss_pred CCEEEEEEEEECCCCCCCCCHHHHCCCCCCCCCCEEEEEEECCCCHHHHCCCCCCCCCCCCEEEEECCCHHHHHHHHCCC T ss_conf 21136778676143268782341026311576304899830665111000123467667606898669646878541468 Q ss_pred CCCCCCEEEEEEEEEEECCCCCCC-CCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCC--CCCCCCC Q ss_conf 888774454666655302234568-9862489999998798335869899998458846999988734898--7457988 Q gi|254780504|r 394 HSTSNMKLKTTALRICCCSKRIKR-TPDGYERLLMDIIHANQTLFMGYDEVEETWKWSDSILKSWQIINQK--VDYYAAG 470 (492) Q Consensus 394 pg~~~~~~~~~~l~~~~~~~~~~~-~pdaYe~Ll~D~i~Gd~tlF~r~dEve~sWrivdpiL~~~~~~~~~--~~~Y~~G 470 (492) ||. ++.+++++|||+|...+... .|+||||||+|||.||+|||+|+|||++|||||||||++|+..+.+ |..|++| T Consensus 399 PG~-~~~~~~~~lDf~Y~~~~~~~~~peAYERLllD~l~GD~tLF~r~DEv~~aW~~v~P~L~~w~~~~~~~~~~~Y~aG 477 (498) T TIGR00871 399 PGL-AFETRPVKLDFSYASRFGETKLPEAYERLLLDALLGDQTLFARDDEVEEAWRIVTPILEAWAANKGPSSPPNYEAG 477 (498) T ss_pred CCC-CCEEEEEEEEECCCHHHCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 875-4423786542020011022577344579999985357777405689987414522589998487788887778888 Q ss_pred CCCHHHHHHHHHHCCCCCCCC Q ss_conf 997188999999818844778 Q gi|254780504|r 471 TWGPDQSDVLLQKDGRRWHNN 491 (492) Q Consensus 471 S~GP~~a~~l~~~~g~~W~~~ 491 (492) ||||++|++|++|+|++|++. T Consensus 478 S~GP~~ad~l~~~~G~~W~~~ 498 (498) T TIGR00871 478 SWGPKEADELIEKDGRSWRKP 498 (498) T ss_pred CCCHHHHHHHHHCCCCCCCCC T ss_conf 734589999974278645789 No 2 >COG0364 Zwf Glucose-6-phosphate 1-dehydrogenase [Carbohydrate transport and metabolism] Probab=100.00 E-value=0 Score=1365.94 Aligned_cols=478 Identities=47% Similarity=0.796 Sum_probs=459.9 Q ss_pred CCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 89898789986443566765779999999728987578179999888699889999999999874234679989999898 Q gi|254780504|r 7 NIHNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFL 86 (492) Q Consensus 7 ~~~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~ 86 (492) ...+|++||||||||||+||||||||+|+..|++++ +|+|||+||++|++++|++.+++++ .+.+..+.+++.|++|+ T Consensus 4 ~~~~~~lvIFGatGDLA~RKL~PALy~L~~~g~l~~-~~~IiG~aR~~~s~e~f~~~~~~~i-~~~~~~~~~~~~~~~F~ 81 (483) T COG0364 4 TVEPFDLVIFGATGDLARRKLFPALYRLYKEGLLPE-DFRIIGVARSKWSNEEFRALVREAI-EFAKTEEIDEAVWEEFA 81 (483) T ss_pred CCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCC-CCEEEEEECCCCCHHHHHHHHHHHH-HHCCCCCCCHHHHHHHH T ss_conf 667725999756316665068899999997588899-8659988567478699999999876-41355544689999998 Q ss_pred HHCCEEECCCCCHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHH Q ss_conf 51657864678977899888874057678469984069568666777777725876443577605667898899999999 Q gi|254780504|r 87 SLIFYINLDVEKNYGWELLGKLLDIDKTRIRVFYLAMSSAFFGKISQRIHANNLVTEHTRVVLEKPIGSNLSSAQEIHAI 166 (492) Q Consensus 87 ~~~~Y~~~d~~~~~~y~~L~~~l~~~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~~~~RiViEKPFG~Dl~SA~eLn~~ 166 (492) ++++|+++|++++++|+.|++.++..+. +++||||+||++|++||++|+++|++.+..|||||||||+||+||++||+. T Consensus 82 ~~~~Y~~~d~~~~~~~~~L~~~l~~~~~-~~vfYLa~pP~~f~~i~~~L~~~~l~~~~~RlviEKPfG~dL~SA~~Ln~~ 160 (483) T COG0364 82 SRLSYVSGDYDDPESFDELKDLLGELEG-NRVFYLAVPPSLFGTIAENLAKAGLNEGNGRLVIEKPFGHDLASARELNDQ 160 (483) T ss_pred HCEEEEECCCCCHHHHHHHHHHHHCCCC-CEEEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCHHHHHHHHHH T ss_conf 5017971378987899999999850357-559999538388899999998756888881499968888877879999999 Q ss_pred HHHHCCHHHEEECCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98738968835124201256877778877477520025280333079999984677024555532200178876558999 Q gi|254780504|r 167 TRKIFKESQIFRIDHYLGKEAVQGLMVFRFANTFYESLWNNKYIDHVQITTAETIGVEDRVDYYNNTGALRDMIQNHLLQ 246 (492) Q Consensus 167 l~~~f~E~qiyRIDHYLGKe~vqnil~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~yyD~~GaiRDmvQNHllQ 246 (492) |+++|+|+|||||||||||||||||++|||||.+|||+|||+||+|||||++|++||||||+|||++||||||||||||| T Consensus 161 i~~~F~E~qIyRIDHYLGKetVQNllalRFaN~~fE~lWNr~~Id~VqIt~aE~~GvEgRggYYD~~GalRDMvQNHlLQ 240 (483) T COG0364 161 ISAVFKEEQIYRIDHYLGKETVQNLLALRFANAIFEPLWNRNYIDHVQITVAETLGVEGRGGYYDKAGALRDMVQNHLLQ 240 (483) T ss_pred HHHHCCHHHEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEECCCCCCCCCHHCCCHHHHHHHHHHHH T ss_conf 98747841367631025789988899998715666545332551058999863035365422111146199999999999 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECCCCCCCCCEECCCCCCCCCCC-CCCCCCEEEEEEE Q ss_conf 999987185445885899999999997306888112236136521366776873433410147988-6648743899999 Q gi|254780504|r 247 LLCLVAMEMPPSVEAKSIKNEKIKVLQALQMITPENVQKLTVRGQYQSGIINGIPVKGYLEEIPLG-ISDTETFVAIKAN 325 (492) Q Consensus 247 lL~lvAMe~P~~~~~~~ir~eK~kvlk~~r~~~~~~v~~~~v~GQY~~~~~~~~~v~gY~~e~~v~-~S~TeTfaa~~l~ 325 (492) ||||+|||||+++++++||+||+|||||+||++.++|.+++|||||++|.++|++||||++|+||+ +|+||||||+|++ T Consensus 241 lL~LvAME~P~~~~ad~irdEKvKvLkal~p~~~~~~~~~~VrGQY~ag~~~g~~v~gY~eE~gv~~dS~tETFvA~k~~ 320 (483) T COG0364 241 LLCLVAMEPPASFSADDIRDEKVKVLKALRPISEENVKEDTVRGQYTAGEIDGKKVPGYLEEEGVAKDSNTETFVAIKLE 320 (483) T ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHCEEECCEECCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEE T ss_conf 99998317998899799999999999752889701112305631200243278556763057789999985435899998 Q ss_pred ECCCCCCCCEEEEEECCCCCCCEEEEEEEEECCCCCCCCCCCC-CCCCCEEEEEECCCCCEEEEEEECCCCCCCCEEEEE Q ss_conf 7085867857775513346885179999985488643766654-789856999971796504443312688877445466 Q gi|254780504|r 326 IDNPRWSGIPFYLRTGKYLAKHISEIVIALKPAPCTIFDHQMS-KIETNKLILRLQNNGEIEQFITTKDHSTSNMKLKTT 404 (492) Q Consensus 326 idn~RW~gVPF~lrtGK~l~~k~~eI~i~FK~~p~~~f~~~~~-~~~~N~Lvi~iqP~~~i~l~~~~K~pg~~~~~~~~~ 404 (492) ||||||+||||||||||||++|.|||+|+||++|+++|..... .+.+|.|||||||+|+|.+++++|.||. +++..++ T Consensus 321 IdnwRW~GVPFylRtGKrl~~k~teI~i~FK~~p~~lF~~~~~~~~~~N~LviriQPdegI~l~~~~K~PG~-~~~~~~l 399 (483) T COG0364 321 IDNWRWAGVPFYLRTGKRLPKKVTEIVIHFKRVPHNLFSDPSRSSLEQNRLVIRIQPDEGISLKFNVKVPGL-GLQTRPL 399 (483) T ss_pred ECCCCCCCCCEEEECCCCCCCCEEEEEEEECCCCHHHCCCCCCCCCCCCEEEEEECCCCCEEEEEECCCCCC-CCCCCEE T ss_conf 558751799889973788887736999997778844227755576667679999779886489995368987-6410014 Q ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 66553022345689862489999998798335869899998458846999988734898745798899718899999981 Q gi|254780504|r 405 ALRICCCSKRIKRTPDGYERLLMDIIHANQTLFMGYDEVEETWKWSDSILKSWQIINQKVDYYAAGTWGPDQSDVLLQKD 484 (492) Q Consensus 405 ~l~~~~~~~~~~~~pdaYe~Ll~D~i~Gd~tlF~r~dEve~sWrivdpiL~~~~~~~~~~~~Y~~GS~GP~~a~~l~~~~ 484 (492) +|++.+.... ...|+||||||+|||+||+|||+|+||||+||+||||||++|+ ..+|+.|++|||||++|++|++++ T Consensus 400 ~l~f~~~~~~-~~~~~AYErLllD~i~Gd~tlF~r~DEve~aWk~vdpIl~~w~--~~~~~~Y~aGswGP~~a~~li~~~ 476 (483) T COG0364 400 DLDFSYDSKF-MRIPEAYERLLLDAIRGDQTLFVRRDEVEAAWKIVDPILEAWK--NDPPPPYEAGSWGPKAADELLERD 476 (483) T ss_pred EEECCCCCCC-CCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCC--CCCCCCCCCCCCCHHHHHHHHHCC T ss_conf 5202004455-6681379999999986986446767999988886058876226--678877678886869899886235 Q ss_pred CCCCCCC Q ss_conf 8844778 Q gi|254780504|r 485 GRRWHNN 491 (492) Q Consensus 485 g~~W~~~ 491 (492) |+.||.. T Consensus 477 g~~W~~~ 483 (483) T COG0364 477 GREWRRP 483 (483) T ss_pred CCCCCCC T ss_conf 8787899 No 3 >PRK05722 glucose-6-phosphate 1-dehydrogenase; Validated Probab=100.00 E-value=0 Score=1354.93 Aligned_cols=482 Identities=48% Similarity=0.823 Sum_probs=465.7 Q ss_pred CCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 89898789986443566765779999999728987578179999888699889999999999874234679989999898 Q gi|254780504|r 7 NIHNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFL 86 (492) Q Consensus 7 ~~~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~ 86 (492) ..+||+|||||||||||+||||||||+|+++|+||+ +++|||+||+++++++|++.+++++.++.+. ..+++.|++|+ T Consensus 6 ~~~p~~~VifGatGDLa~rkL~PaL~~L~~~g~l~~-~~~Iig~aR~~~~~e~f~~~v~~~l~~~~~~-~~~~~~~~~F~ 83 (495) T PRK05722 6 TAEPCDLVIFGATGDLARRKLLPALYQLYKAGLLPE-DFRIIGVARREWSDEDFREVVREALKEFARE-PFDEEVWDRFL 83 (495) T ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCC-CCEEEEEECCCCCHHHHHHHHHHHHHHHCCC-CCCHHHHHHHH T ss_conf 999979999777138877479999999997599999-9789999899899899999999999975568-77999999999 Q ss_pred HHCCEEECCCCCHHHHHHHHHHHCC-----CCCCCEEEEECCCHHHHHHHHHHHHHHCCCC--CCCEEEEECCCCCCHHH Q ss_conf 5165786467897789988887405-----7678469984069568666777777725876--44357760566789889 Q gi|254780504|r 87 SLIFYINLDVEKNYGWELLGKLLDI-----DKTRIRVFYLAMSSAFFGKISQRIHANNLVT--EHTRVVLEKPIGSNLSS 159 (492) Q Consensus 87 ~~~~Y~~~d~~~~~~y~~L~~~l~~-----~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~--~~~RiViEKPFG~Dl~S 159 (492) ++++|+++|++++++|..|++.++. ....+|+||||+||++|..||++|+++||+. +|+|||||||||+||+| T Consensus 84 ~~~~Y~~~d~~~~~~y~~L~~~l~~~~~~~~~~~~~~fYLA~pP~~f~~i~~~l~~~~l~~~~~~~RvvvEKPFG~Dl~S 163 (495) T PRK05722 84 ERLFYVSGDVTDPESYEALKELLEELDEKYGTGGNRVFYLATPPSLFGTICENLAAAGLNATGGWSRVVIEKPFGHDLAS 163 (495) T ss_pred HHCEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCHHH T ss_conf 75889952899867899999999976653078875599995787999999999997489878885037885478787677 Q ss_pred HHHHHHHHHHHCCHHHEEECCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 99999999873896883512420125687777887747752002528033307999998467702455553220017887 Q gi|254780504|r 160 AQEIHAITRKIFKESQIFRIDHYLGKEAVQGLMVFRFANTFYESLWNNKYIDHVQITTAETIGVEDRVDYYNNTGALRDM 239 (492) Q Consensus 160 A~eLn~~l~~~f~E~qiyRIDHYLGKe~vqnil~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~yyD~~GaiRDm 239 (492) |++||+.|+++|+|+||||||||||||||||||+|||||++|||+|||+||++||||++|++||||||+|||++|||||| T Consensus 164 A~~Ln~~l~~~f~E~qIyRIDHYLGKe~VqNll~lRFaN~ifeplWN~~~I~~VqIt~~E~~GvegR~~yYD~~GalRDm 243 (495) T PRK05722 164 ARELNDQVGEVFKEEQIYRIDHYLGKETVQNILALRFANSLFEPLWNRNYIDHVQITVAETVGVEGRGGYYDKAGALRDM 243 (495) T ss_pred HHHHHHHHHHHCCHHHEEEEEHHCCHHHHHHHHHHHHCCHHHHHHHCCCCEEEEEEEEECCCCCCHHHHHHCCCCHHHHH T ss_conf 99999999986685346652000567899989999844434454544034116899985277746165564045409999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECCCCCCCCCEECCCCCCCCCCCC-CCCCC Q ss_conf 65589999999871854458858999999999973068881122361365213667768734334101479886-64874 Q gi|254780504|r 240 IQNHLLQLLCLVAMEMPPSVEAKSIKNEKIKVLQALQMITPENVQKLTVRGQYQSGIINGIPVKGYLEEIPLGI-SDTET 318 (492) Q Consensus 240 vQNHllQlL~lvAMe~P~~~~~~~ir~eK~kvlk~~r~~~~~~v~~~~v~GQY~~~~~~~~~v~gY~~e~~v~~-S~TeT 318 (492) |||||||||||||||+|.++++++||+||+||||++||+++++|.+++|||||++|.++|+++|||++|+||++ |+||| T Consensus 244 vQNHLLQlL~lvAME~P~~~~~~~ir~eK~kVLksir~i~~~~v~~~~VrGQY~~g~~~g~~~~gY~~e~gV~~~S~TeT 323 (495) T PRK05722 244 VQNHLLQLLALVAMEPPASLDADSIRDEKVKVLRALRPITPEDVKENTVRGQYTAGWIGGKPVPGYREEEGVNPDSNTET 323 (495) T ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHEEEEEECCCCCCCEECCCCCCCCCCCCCCCCCC T ss_conf 99999999999962798899989999999999971789996886430431000256658821676446689998998751 Q ss_pred EEEEEEEECCCCCCCCEEEEEECCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEEEECCCCCCC Q ss_conf 38999997085867857775513346885179999985488643766654789856999971796504443312688877 Q gi|254780504|r 319 FVAIKANIDNPRWSGIPFYLRTGKYLAKHISEIVIALKPAPCTIFDHQMSKIETNKLILRLQNNGEIEQFITTKDHSTSN 398 (492) Q Consensus 319 faa~~l~idn~RW~gVPF~lrtGK~l~~k~~eI~i~FK~~p~~~f~~~~~~~~~N~Lvi~iqP~~~i~l~~~~K~pg~~~ 398 (492) |||++++||||||+||||||||||+|+++.|||+|+||++|+++|...+....+|+|||+|||+++|.+++++|.||. + T Consensus 324 Faa~~l~Idn~RW~GVPF~lrtGK~L~~~~teI~i~FK~~p~~~f~~~~~~~~~N~Lvi~iqP~e~i~l~~~~K~pG~-~ 402 (495) T PRK05722 324 FVALKLFIDNWRWAGVPFYLRTGKRLPKKVTEIVIHFKPVPHNLFGESAQNLPPNKLVIRIQPDEGISLRFNAKVPGE-G 402 (495) T ss_pred CEEEEEEECCCCCCCCCEEEECCCCCCCCEEEEEEEEECCCHHHCCCCCCCCCCCEEEEEECCCCCEEEEEEECCCCC-C T ss_conf 014689962775578878997056667660599999826981105776678889879999668874579998637898-8 Q ss_pred CEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH Q ss_conf 44546666553022345689862489999998798335869899998458846999988734898745798899718899 Q gi|254780504|r 399 MKLKTTALRICCCSKRIKRTPDGYERLLMDIIHANQTLFMGYDEVEETWKWSDSILKSWQIINQKVDYYAAGTWGPDQSD 478 (492) Q Consensus 399 ~~~~~~~l~~~~~~~~~~~~pdaYe~Ll~D~i~Gd~tlF~r~dEve~sWrivdpiL~~~~~~~~~~~~Y~~GS~GP~~a~ 478 (492) +.+.++.|+++|...+..+.|+|||+||+|||+||+++|+|+||||+||+||||||++|+..+.+|.+|++|||||++|+ T Consensus 403 ~~~~~~~l~~~~~~~~~~~~~~aYErLl~D~~~Gd~tlF~r~dEve~sW~ivdpiL~~w~~~~~~~~~Y~~GS~GP~~a~ 482 (495) T PRK05722 403 MELRPVKLDFSYSEAFGEESPDAYERLLLDVMRGDQTLFVRRDEVEAAWKWVDPILEAWAENNDPPPPYPAGTWGPEAAD 482 (495) T ss_pred CCEEEEEEEEECHHCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH T ss_conf 70379888866120169989774899999997699666789899999999879999999757999878899997989999 Q ss_pred HHHHHCCCCCCCC Q ss_conf 9999818844778 Q gi|254780504|r 479 VLLQKDGRRWHNN 491 (492) Q Consensus 479 ~l~~~~g~~W~~~ 491 (492) +|++++|++||+. T Consensus 483 ~l~~~~g~~W~~~ 495 (495) T PRK05722 483 ALLARDGRSWREP 495 (495) T ss_pred HHHHHCCCCCCCC T ss_conf 9997159906898 No 4 >PRK12853 glucose-6-phosphate 1-dehydrogenase; Provisional Probab=100.00 E-value=0 Score=1352.63 Aligned_cols=481 Identities=44% Similarity=0.739 Sum_probs=463.9 Q ss_pred CCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHH Q ss_conf 89898987899864435667657799999997289875781799998886998899999999998742346799899998 Q gi|254780504|r 5 TPNIHNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQK 84 (492) Q Consensus 5 tp~~~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~ 84 (492) ++..+||+|||||||||||+||||||||+|++.|++|+ +++|||+||+++++++|++.++++++++.++ ..+++.|++ T Consensus 3 ~~~~~p~~~VIfGaTGDLa~RKL~PAL~~L~~~g~lp~-~~~Iig~aR~~~~~e~f~~~v~~~l~~~~~~-~~~~~~~~~ 80 (486) T PRK12853 3 TAPAPPCTLVIFGATGDLARRKLLPALYRLARAGLLPE-DLRIIGVGRDDWSDEQWRKRVRESLRAFGAD-KFDDAVWDR 80 (486) T ss_pred CCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCC-CCEEEEEECCCCCHHHHHHHHHHHHHHHCCC-CCCHHHHHH T ss_conf 88899826999766338877479999999997699999-9889999799899899999999999985357-689999999 Q ss_pred HHHHCCEEECCCCCHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHH Q ss_conf 98516578646789778998888740576784699840695686667777777258764435776056678988999999 Q gi|254780504|r 85 FLSLIFYINLDVEKNYGWELLGKLLDIDKTRIRVFYLAMSSAFFGKISQRIHANNLVTEHTRVVLEKPIGSNLSSAQEIH 164 (492) Q Consensus 85 F~~~~~Y~~~d~~~~~~y~~L~~~l~~~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~~~~RiViEKPFG~Dl~SA~eLn 164 (492) |+++++|+++|++++++|..|++.++ ...+|+||||+||++|..|+++|+++|++.+++|||||||||+||+||++|| T Consensus 81 F~~~~~Y~~~d~~~~~~~~~L~~~l~--~~~~~ifYLAvpP~~f~~i~~~L~~~~l~~~~~RvvvEKPFG~Dl~SA~~Ln 158 (486) T PRK12853 81 LAERLSYVAGDVTDPADYARLAEALE--SLGNPVFYLAVPPSLFAPIVENLAAAGLLEGTARVVLEKPFGHDLASARALN 158 (486) T ss_pred HHHHCEEEECCCCCHHHHHHHHHHHC--CCCCEEEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCHHHHHHHH T ss_conf 98648899648998789999999863--6675489994587998999999997089876632788547778768899999 Q ss_pred HHHHHHCCHHHEEECCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99987389688351242012568777788774775200252803330799999846770245555322001788765589 Q gi|254780504|r 165 AITRKIFKESQIFRIDHYLGKEAVQGLMVFRFANTFYESLWNNKYIDHVQITTAETIGVEDRVDYYNNTGALRDMIQNHL 244 (492) Q Consensus 165 ~~l~~~f~E~qiyRIDHYLGKe~vqnil~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~yyD~~GaiRDmvQNHl 244 (492) +.|+++|+|+||||||||||||||||||+|||||++|||+|||+||++||||++|++||||||+|||++||||||||||| T Consensus 159 ~~l~~~f~E~qIyRIDHYLGKe~VqNil~lRFaN~~fe~lWN~~~I~~VqIt~~E~~GvegR~~YYD~~GalRDmvQNHL 238 (486) T PRK12853 159 ATLAAVFDEDQIYRIDHFLGKETVQNLLALRFANALFEPLWNRNYIDHVQITVAETLGVEGRGGFYDATGALRDMVQNHL 238 (486) T ss_pred HHHHHHCCHHHEEEEECCCCHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEECCCCCCHHHHHHCCCCHHHHHHHHHH T ss_conf 99998668545675300066899998999986353330244201625789998457774636666504550999999999 Q ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECCCCCCCCCEECCCCCCCCCCCC-CCCCCEEEEE Q ss_conf 999999871854458858999999999973068881122361365213667768734334101479886-6487438999 Q gi|254780504|r 245 LQLLCLVAMEMPPSVEAKSIKNEKIKVLQALQMITPENVQKLTVRGQYQSGIINGIPVKGYLEEIPLGI-SDTETFVAIK 323 (492) Q Consensus 245 lQlL~lvAMe~P~~~~~~~ir~eK~kvlk~~r~~~~~~v~~~~v~GQY~~~~~~~~~v~gY~~e~~v~~-S~TeTfaa~~ 323 (492) ||||||||||||.++++++||+||+||||++||++++++.. |||||++|.++|++||||++|+||++ |+||||||++ T Consensus 239 lQlL~LvAME~P~~~~a~~ir~eKvkVLkalr~i~~~~v~~--VrGQY~~g~~~g~~v~gY~~e~gV~~dS~TeTf~a~~ 316 (486) T PRK12853 239 LQLLALVAMEPPASFDADAVRDEKAKVLRALRPLDPDDVAR--VRGQYTAGAVGGEPVPGYRDEPGVDPDSRTETFVALK 316 (486) T ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHH--HHCCCCCCCCCCEECCCCCCCCCCCCCCCCCCEEEEE T ss_conf 99999997489778998999999999997078999788645--4402335664882167733578999988875112468 Q ss_pred EEECCCCCCCCEEEEEECCCCCCCEEEEEEEEECCCCCCCCCCC-CCCCCCEEEEEECCCCCEEEEEEECCCCCCCCEEE Q ss_conf 99708586785777551334688517999998548864376665-47898569999717965044433126888774454 Q gi|254780504|r 324 ANIDNPRWSGIPFYLRTGKYLAKHISEIVIALKPAPCTIFDHQM-SKIETNKLILRLQNNGEIEQFITTKDHSTSNMKLK 402 (492) Q Consensus 324 l~idn~RW~gVPF~lrtGK~l~~k~~eI~i~FK~~p~~~f~~~~-~~~~~N~Lvi~iqP~~~i~l~~~~K~pg~~~~~~~ 402 (492) ++||||||+||||||||||+|++|.|||+|+||++|+.+|.... ....+|+|+|+|||+++|.+++++|.||. .+.++ T Consensus 317 l~Idn~RW~GVPF~lrtGK~L~~~~teI~i~Fk~~p~~~f~~~~~~~~~~N~Lvi~iqP~e~i~l~~~~K~PG~-~~~~~ 395 (486) T PRK12853 317 LEIDNWRWAGVPFYLRTGKRLAERRTEIVITFKPVPHALFRPTGVGGLEPNRLVIRLQPDEGISLELNVKRPGP-GMRLE 395 (486) T ss_pred EEECCCCCCCCCEEEECCCCCCCCEEEEEEEEECCCHHHCCCCCCCCCCCCEEEEEECCCCEEEEEEEECCCCC-CCEEE T ss_conf 88627755788789970566676615999998168822136876678888879999669982789997048899-87058 Q ss_pred EEEEEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 66665530223456898624899999987983358698999984588469999887348987457988997188999999 Q gi|254780504|r 403 TTALRICCCSKRIKRTPDGYERLLMDIIHANQTLFMGYDEVEETWKWSDSILKSWQIINQKVDYYAAGTWGPDQSDVLLQ 482 (492) Q Consensus 403 ~~~l~~~~~~~~~~~~pdaYe~Ll~D~i~Gd~tlF~r~dEve~sWrivdpiL~~~~~~~~~~~~Y~~GS~GP~~a~~l~~ 482 (492) +++|+++|...+....|+||||||+|||.||+|+|+|+||||+||+||||||++|+.++.+|++|++|||||++|++|++ T Consensus 396 ~~~l~~~~~~~~~~~~~~aYErLl~d~~~Gd~tlF~r~dEve~aW~ivdpil~~w~~~~~~~~~Y~~GS~GP~~a~~ll~ 475 (486) T PRK12853 396 PVSLDFDLADAFDERPLEAYERLLLDVLRGDPTLFVRADEVEAAWRIVDPILDAWAQDPVPPREYPAGSWGPAAADALLA 475 (486) T ss_pred EEEEEEECHHHCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 98744141431599998808999999976996666998999999998799999997589998888999979899999997 Q ss_pred HCCCCCCCCC Q ss_conf 8188447789 Q gi|254780504|r 483 KDGRRWHNNT 492 (492) Q Consensus 483 ~~g~~W~~~~ 492 (492) ++|++||++. T Consensus 476 ~~g~~W~~p~ 485 (486) T PRK12853 476 RDGRAWREPA 485 (486) T ss_pred HCCCCCCCCC T ss_conf 4599358999 No 5 >PTZ00309 glucose-6-phosphate dehydrogenase; Provisional Probab=100.00 E-value=0 Score=1329.43 Aligned_cols=470 Identities=35% Similarity=0.580 Sum_probs=439.3 Q ss_pred CCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 89878998644356676577999999972898757817999988869988999999999987423467998999989851 Q gi|254780504|r 9 HNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSL 88 (492) Q Consensus 9 ~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~ 88 (492) .+-||||||||||||+||||||||+||.+|+||+ +++|||+||+++++.+ ...++.+.++......+++.|++|+++ T Consensus 67 ~~LTiVIFGAtGDLA~RKL~PAL~~L~~~g~Lp~-~~~IiG~aR~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~F~~~ 143 (556) T PTZ00309 67 RALTIIVLGASGDLAKKKTFPALFDLYCNGLLPP-EVNILGYARSKVEDVE--KWKKDTLARYFPNLKERECHAEDFLKH 143 (556) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCC-CCEEEEEECCCCCHHH--HHHHHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 9765999754228776069999999997599999-9889999899886489--999999999603577678999999962 Q ss_pred CCEEECCCCCHHHHHHHHHHHCC--------CCCCCEEEEECCCHHHHHHHHHHHHHHCCCC---CCCEEEEECCCCCCH Q ss_conf 65786467897789988887405--------7678469984069568666777777725876---443577605667898 Q gi|254780504|r 89 IFYINLDVEKNYGWELLGKLLDI--------DKTRIRVFYLAMSSAFFGKISQRIHANNLVT---EHTRVVLEKPIGSNL 157 (492) Q Consensus 89 ~~Y~~~d~~~~~~y~~L~~~l~~--------~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~---~~~RiViEKPFG~Dl 157 (492) ++|+++|++++++|..|++.+.. ....|||||||+||++|+.||++|+++|++. +|+|||||||||+|| T Consensus 144 l~Y~~gd~~~~~~y~~L~~~l~~~~~~~~~~~~~~nrifYLAvpP~~f~~i~~~L~~~~l~~~~~g~~RiVvEKPFG~Dl 223 (556) T PTZ00309 144 ISYISGSYDEADDFKRLNKVILEKEEAFQGPEKGGNRLFYLALPPSVFAEVCEGIHKGAMPKEVGGWVRVIVEKPFGRDT 223 (556) T ss_pred CEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCC T ss_conf 90880689987899999999998775312566777558999248889999999999846888888841477525788850 Q ss_pred HHHHHHHHHHHHHCCHHHEEECCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHH Q ss_conf 89999999998738968835124201256877778877477520025280333079999984677024555532200178 Q gi|254780504|r 158 SSAQEIHAITRKIFKESQIFRIDHYLGKEAVQGLMVFRFANTFYESLWNNKYIDHVQITTAETIGVEDRVDYYNNTGALR 237 (492) Q Consensus 158 ~SA~eLn~~l~~~f~E~qiyRIDHYLGKe~vqnil~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~yyD~~GaiR 237 (492) +||++||+.|+++|+|+||||||||||||||||||+|||||++|||+|||+||+|||||++|++||||||+|||++|||| T Consensus 224 ~SA~~Ln~~L~~~F~E~QIyRIDHYLGKE~VQNil~lRFaN~ifEplWNr~~Id~VQIt~aE~lGvegRg~YYD~~GalR 303 (556) T PTZ00309 224 ESSAELSSALEPLFDESQLFRIDHYLGKEMVQNIIVTRFANRVFSALWNSSNIACVQITFKEDIGTEGRGGYFDSIGIIR 303 (556) T ss_pred HHHHHHHHHHHHHCCHHHEEECCCCCCHHHHHHHHHHHHCCCCHHHHHCCCCEEEEEEEEECCCCCCHHHHHHCCCCHHH T ss_conf 33999999999657997844001003489999999998613302435363662478999826888653665750320699 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECCCCCCCCCEECCCCCCCCCCCC-CCC Q ss_conf 8765589999999871854458858999999999973068881122361365213667768734334101479886-648 Q gi|254780504|r 238 DMIQNHLLQLLCLVAMEMPPSVEAKSIKNEKIKVLQALQMITPENVQKLTVRGQYQSGIINGIPVKGYLEEIPLGI-SDT 316 (492) Q Consensus 238 DmvQNHllQlL~lvAMe~P~~~~~~~ir~eK~kvlk~~r~~~~~~v~~~~v~GQY~~~~~~~~~v~gY~~e~~v~~-S~T 316 (492) |||||||||||||||||||.++++++||+||+|||||+||++++++ |||||++|..+ .||||+||+||++ |+| T Consensus 304 DMvQNHLLQlLaLvAMEpP~s~~a~~IRdEKvkVLrslrpi~~edv----VrGQY~a~~~g--~v~GY~dE~gV~~dS~T 377 (556) T PTZ00309 304 DVMQNHLTQILSLLAMEKPRSLSAEDIRDEKVKVLRCIEPITKEEC----VLGQYTASADG--SIPGYLDDETVPKGSTC 377 (556) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHCE----EEEEECCCCCC--CCCCCCCCCCCCCCCCC T ss_conf 9999999999999971897779989999999999963688983115----66454367788--88761146899988998 Q ss_pred CCEEEEEEEECCCCCCCCEEEEEECCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEEEECCCCC Q ss_conf 74389999970858678577755133468851799999854886437666547898569999717965044433126888 Q gi|254780504|r 317 ETFVAIKANIDNPRWSGIPFYLRTGKYLAKHISEIVIALKPAPCTIFDHQMSKIETNKLILRLQNNGEIEQFITTKDHST 396 (492) Q Consensus 317 eTfaa~~l~idn~RW~gVPF~lrtGK~l~~k~~eI~i~FK~~p~~~f~~~~~~~~~N~Lvi~iqP~~~i~l~~~~K~pg~ 396 (492) |||||++++||||||+||||||||||+|++|.|||+|+||++|+.+|.. ..+|+|||+|||+|+|.+++++|.||. T Consensus 378 ETFaA~kl~IDN~RW~GVPFyLRTGKrL~~k~tEI~I~FK~~p~~~~~~----~~~N~LviriQP~e~I~l~~~~K~PG~ 453 (556) T PTZ00309 378 PTFAVARLHINNDRWRGVPFILKAGKALEERYVEIRIQFKDEVHPYGDA----TQRNELVIRAQPSEAMYLKLTTKVPGL 453 (556) T ss_pred CCCEEEEEEECCCCCCCCCEEEECCCCCCCCCEEEEEEECCCCCCCCCC----CCCCEEEEEECCCCEEEEEEEECCCCC T ss_conf 6105789997288427888799747787878259999968898544678----889879999889980799998138988 Q ss_pred CCCEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 77445466665530223456898624899999987983358698999984588469999887348987457988997188 Q gi|254780504|r 397 SNMKLKTTALRICCCSKRIKRTPDGYERLLMDIIHANQTLFMGYDEVEETWKWSDSILKSWQIINQKVDYYAAGTWGPDQ 476 (492) Q Consensus 397 ~~~~~~~~~l~~~~~~~~~~~~pdaYe~Ll~D~i~Gd~tlF~r~dEve~sWrivdpiL~~~~~~~~~~~~Y~~GS~GP~~ 476 (492) .+.+.+++|+++|...+....||||||||+|||.||+|||+|+||||+||+||||||++|+..+.+|++|++|||||++ T Consensus 454 -~~~~~~~~ld~~y~~~~~~~~pdAYErLLlDal~Gd~tLF~r~DEve~aWrivdPiL~~w~~~~~~~~~Y~aGS~GP~~ 532 (556) T PTZ00309 454 -SNDTHQTELDLSYHHRYNVRLPDAYESLINDALLGNSTNFVRKDELDVAWRIFTPLLHQIDRGEIKPIPYSAGSRGPKE 532 (556) T ss_pred -CCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH T ss_conf -8855687630205112699687649999999977997678988999999998799999998579998888999979899 Q ss_pred HHHHHHHCCC------CCCCCC Q ss_conf 9999998188------447789 Q gi|254780504|r 477 SDVLLQKDGR------RWHNNT 492 (492) Q Consensus 477 a~~l~~~~g~------~W~~~~ 492 (492) |++|++|+|+ .|++++ T Consensus 533 Ad~Ll~r~G~~~~~~~~w~~~~ 554 (556) T PTZ00309 533 ADEFIKKNGFKRSLGYNWPQSN 554 (556) T ss_pred HHHHHHHCCCCCCCCCCCCCCC T ss_conf 9999996496578875588989 No 6 >PRK12854 glucose-6-phosphate 1-dehydrogenase; Provisional Probab=100.00 E-value=0 Score=1312.38 Aligned_cols=463 Identities=40% Similarity=0.647 Sum_probs=438.9 Q ss_pred CCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHH Q ss_conf 89898987899864435667657799999997289875781799998886998899999999998742346799899998 Q gi|254780504|r 5 TPNIHNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQK 84 (492) Q Consensus 5 tp~~~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~ 84 (492) .|...||+|||||||||||+||||||||+|++.|+||+ +++|||+||+++++++||+.+++++.++... ..+++.|++ T Consensus 6 ~p~~~P~~~VIfGatGDLa~rKL~PaL~~L~~~g~Lp~-~~~IiG~aR~~~~~~~fr~~v~~~l~~~~~~-~~~~~~~~~ 83 (475) T PRK12854 6 IPAAGPTVFVLFGATGDLAKRLLMPALFNLARDGLLAR-NWRIVGTGRNPLSADEFRARLRDAVDEFGAH-CLDDELWEG 83 (475) T ss_pred CCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCC-CCEEEEEECCCCCHHHHHHHHHHHHHHHCCC-CCCHHHHHH T ss_conf 99999979999534108866069999999997599999-9789999899899899999999999975678-889999999 Q ss_pred HHHHCCEEECCCCCHHHHHHHHHHHCC-----CCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHH Q ss_conf 985165786467897789988887405-----767846998406956866677777772587644357760566789889 Q gi|254780504|r 85 FLSLIFYINLDVEKNYGWELLGKLLDI-----DKTRIRVFYLAMSSAFFGKISQRIHANNLVTEHTRVVLEKPIGSNLSS 159 (492) Q Consensus 85 F~~~~~Y~~~d~~~~~~y~~L~~~l~~-----~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~~~~RiViEKPFG~Dl~S 159 (492) |+++++|+++|++ +++|..|++.++. ....+|+||||+||++|+.||++|+++||+ +++|||||||||+||+| T Consensus 84 f~~~~~y~~~d~~-~~~~~~L~~~l~~~~~~~~~~~~~ifYLA~pP~~f~~i~~~l~~~~l~-~~~RvvvEKPFG~dl~S 161 (475) T PRK12854 84 FAKRLDYAGGDFT-PAGPGALAAAVRAARAELGPDAQLVHYLATPPRAFPEVTTMLGAAGLV-DGSRVVYEKPFGTDLAS 161 (475) T ss_pred HHHCCEEEEECCC-CHHHHHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHHHHHHCCC-CCCCEEEECCCCCCCHH T ss_conf 9852889951588-132999999999877640887646999935869889999999972789-87714773688887152 Q ss_pred HHHHHHHHHHHCCHHHEEECCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 99999999873896883512420125687777887747752002528033307999998467702455553220017887 Q gi|254780504|r 160 AQEIHAITRKIFKESQIFRIDHYLGKEAVQGLMVFRFANTFYESLWNNKYIDHVQITTAETIGVEDRVDYYNNTGALRDM 239 (492) Q Consensus 160 A~eLn~~l~~~f~E~qiyRIDHYLGKe~vqnil~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~yyD~~GaiRDm 239 (492) |++||+.|+++|+|+||||||||||||||||||+|||||++|||+|||+||+|||||++|++||||||+|||++|||||| T Consensus 162 a~~Ln~~l~~~f~E~qiyRIDHYLGKE~VQNll~lRFaN~ifeplWNr~~I~~VqIt~~E~~GvegR~~YYD~~GalRDm 241 (475) T PRK12854 162 AEELNAFVHKVFDESQIFRIDHFLGKEAVQNILAFRFANGLFEPIWNREHIDHVQIDIPETLGVDTRAAFYDSTGALRDM 241 (475) T ss_pred HHHHHHHHHHHCCHHHEEECCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEECCCCCCHHHHHHCCCCHHHHH T ss_conf 99999999975786552410101248999989999853555122434077248999984677756677675044469999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECCCCCCCCCEECCCCCCCCCCCC-CCCCC Q ss_conf 65589999999871854458858999999999973068881122361365213667768734334101479886-64874 Q gi|254780504|r 240 IQNHLLQLLCLVAMEMPPSVEAKSIKNEKIKVLQALQMITPENVQKLTVRGQYQSGIINGIPVKGYLEEIPLGI-SDTET 318 (492) Q Consensus 240 vQNHllQlL~lvAMe~P~~~~~~~ir~eK~kvlk~~r~~~~~~v~~~~v~GQY~~~~~~~~~v~gY~~e~~v~~-S~TeT 318 (492) |||||||||||||||||.++++++|||||+|||||+||+++++| |||||+ ||++|+||++ |+||| T Consensus 242 vQNHLLQlL~LvAMEpP~~~~~~~iRdeKvkvL~alrp~~~~~v----vrGQY~----------gY~~E~gV~~~S~TeT 307 (475) T PRK12854 242 VVTHLFQVLAMVAMEPPAALGPDAIREEKNKVLGAFRPLDPANV----VRGQYS----------GYREEDGVAPDSTTET 307 (475) T ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHE----EECCCC----------CCCCCCCCCCCCCCCC T ss_conf 99999999999973898899989999999999985578988981----102435----------6455789899898865 Q ss_pred EEEEEEEECCCCCCCCEEEEEECCCCCCCEEEEEEEEECCCCCCCCCCC-CCCCCCEEEEEECCCCCEEEEEEECCCCCC Q ss_conf 3899999708586785777551334688517999998548864376665-478985699997179650444331268887 Q gi|254780504|r 319 FVAIKANIDNPRWSGIPFYLRTGKYLAKHISEIVIALKPAPCTIFDHQM-SKIETNKLILRLQNNGEIEQFITTKDHSTS 397 (492) Q Consensus 319 faa~~l~idn~RW~gVPF~lrtGK~l~~k~~eI~i~FK~~p~~~f~~~~-~~~~~N~Lvi~iqP~~~i~l~~~~K~pg~~ 397 (492) |||++++||||||+||||||||||+|++|+|||+|+||++|+.+|.... +...||+|||+|||+|+|.+++++|.||. T Consensus 308 faa~kl~IdN~RW~GVPFylRtGK~L~~~~teI~I~Fk~~p~~~f~~~~~~~~~pN~LviriqP~e~i~l~~~~K~PG~- 386 (475) T PRK12854 308 FVALKVWIDNWRWAGVPFYLRTGKRMAERDQEISIVFKEAPRSPFPESGLGRPGPNHLTFDLQPNEGVSLSFYAKRPGP- 386 (475) T ss_pred CEEEEEEECCCCCCCCCEEEECCCCCCCCEEEEEEEEECCCHHHCCCCCCCCCCCCEEEEEECCCCEEEEEEEEECCCC- T ss_conf 4679999728734788889982467788717999998579733147865467889879998479985899999778999- Q ss_pred CCEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 74454666655302234568986248999999879833586989999845884699998873489874579889971889 Q gi|254780504|r 398 NMKLKTTALRICCCSKRIKRTPDGYERLLMDIIHANQTLFMGYDEVEETWKWSDSILKSWQIINQKVDYYAAGTWGPDQS 477 (492) Q Consensus 398 ~~~~~~~~l~~~~~~~~~~~~pdaYe~Ll~D~i~Gd~tlF~r~dEve~sWrivdpiL~~~~~~~~~~~~Y~~GS~GP~~a 477 (492) ++.+.+++|+++|...+....++||||||+|||+||+|||+|+||||+||+||||||+. ..+|.+|++|||||++| T Consensus 387 ~~~~~~~~l~~~~~~~~~~~~~~aYErLLlD~l~Gd~tlF~r~DEve~aW~ivdpil~~----~~~~~~Y~aGS~GP~~A 462 (475) T PRK12854 387 GMKLEKGSLGFSLQEFGKATPALGYERLILDVLRGDRTLFTRADGIESAWRRVQPLLDD----PPPVKPYQPGTWGPEAA 462 (475) T ss_pred CCCEEEEECCCCCHHCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCC----CCCCCCCCCCCCCHHHH T ss_conf 87246874455100016898987389999999779966779879999999962898759----99998879999798999 Q ss_pred HHHHHHCCCCCCCCC Q ss_conf 999998188447789 Q gi|254780504|r 478 DVLLQKDGRRWHNNT 492 (492) Q Consensus 478 ~~l~~~~g~~W~~~~ 492 (492) ++|++++| ||+.. T Consensus 463 ~~Ll~~dg--w~~pg 475 (475) T PRK12854 463 HELLAPDA--WHLPG 475 (475) T ss_pred HHHHHCCC--CCCCC T ss_conf 99986658--77999 No 7 >KOG0563 consensus Probab=100.00 E-value=0 Score=1176.80 Aligned_cols=472 Identities=38% Similarity=0.648 Sum_probs=448.2 Q ss_pred CCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 89878998644356676577999999972898757817999988869988999999999987423467998999989851 Q gi|254780504|r 9 HNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSL 88 (492) Q Consensus 9 ~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~ 88 (492) .+.+|||||||||||+||+|||||+||..|.+|+ +|.|+|+||++++++++|+.+.+.+.+..... .+.+++++|.++ T Consensus 16 ~~~~iiVfGASGDLAKKK~fPaLf~L~~~g~lp~-~~~i~GYARSklt~ee~~~~~~~~l~~~~~~~-~~~~k~~~F~~~ 93 (499) T KOG0563 16 STLSIIVFGASGDLAKKKIFPALFALYREGLLPE-DFKIFGYARSKLTDEELRKSISETLKCRKDEK-NCGEKLEDFLKR 93 (499) T ss_pred CEEEEEEEECCCHHHHCCHHHHHHHHHHHCCCCC-CEEEEEEECCCCCHHHHHHHHHHHCCCCCCHH-HHHHHHHHHHHH T ss_conf 4179999944730542431589999987254777-60799886323776999998774157884145-565649999987 Q ss_pred CCEEECCCCCHHHHHHHHHHHCC-----CCCCCEEEEECCCHHHHHHHHHHHHHHCCC-CCCCEEEEECCCCCCHHHHHH Q ss_conf 65786467897789988887405-----767846998406956866677777772587-644357760566789889999 Q gi|254780504|r 89 IFYINLDVEKNYGWELLGKLLDI-----DKTRIRVFYLAMSSAFFGKISQRIHANNLV-TEHTRVVLEKPIGSNLSSAQE 162 (492) Q Consensus 89 ~~Y~~~d~~~~~~y~~L~~~l~~-----~~~~~rifYLAiPP~~f~~i~~~L~~~~l~-~~~~RiViEKPFG~Dl~SA~e 162 (492) ++|++|.|+++++|+.|++.++. +...|||||||+||++|.+|+++|++.|.. .+|+|||||||||+|++||++ T Consensus 94 ~sY~~G~YD~~e~f~~Ln~~i~~~e~~~~~~a~RiFYlalPPsvy~~V~~~I~~~~~~~~GwtRvIVEKPFG~d~~Sa~~ 173 (499) T KOG0563 94 VSYVSGQYDTAEGFQELNKHIEEHEKEANSEANRIFYLALPPSVYVDVAKNIKKSCSSVNGWTRVIVEKPFGRDLESAQE 173 (499) T ss_pred HEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCHHHHHH T ss_conf 41437877977889999999999872245654428999548288799999876512688984479985578876686999 Q ss_pred HHHHHHHHCCHHHEEECCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 99999873896883512420125687777887747752002528033307999998467702455553220017887655 Q gi|254780504|r 163 IHAITRKIFKESQIFRIDHYLGKEAVQGLMVFRFANTFYESLWNNKYIDHVQITTAETIGVEDRVDYYNNTGALRDMIQN 242 (492) Q Consensus 163 Ln~~l~~~f~E~qiyRIDHYLGKe~vqnil~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~yyD~~GaiRDmvQN 242 (492) |.+.|+++|+|+||||||||||||||||+++|||+|.+|||+|||+||++|||+++|++|+|||+||||++|+||||||| T Consensus 174 L~~~l~~~f~E~qiyRIDHYLGKemV~nl~~lRf~N~i~~~lWNR~~I~sV~I~fkE~fGtEGRggYfD~~GIIRDvvQN 253 (499) T KOG0563 174 LSSELGKLFDEEQIYRIDHYLGKELVQNLLVLRFANRIFEPLWNRDYIESVQIVFKEDFGTEGRGGYFDEYGIIRDVVQN 253 (499) T ss_pred HHHHHHHHCCCHHEEEEHHHHHHHHHHHHHHHEECCHHHCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 99888764572021540155119999654662102034312015003468999970567766756411356509999999 Q ss_pred HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECCCCCCCCCEECCCCCCCCCCC-CCCCCCEEE Q ss_conf 8999999987185445885899999999997306888112236136521366776873433410147988-664874389 Q gi|254780504|r 243 HLLQLLCLVAMEMPPSVEAKSIKNEKIKVLQALQMITPENVQKLTVRGQYQSGIINGIPVKGYLEEIPLG-ISDTETFVA 321 (492) Q Consensus 243 HllQlL~lvAMe~P~~~~~~~ir~eK~kvlk~~r~~~~~~v~~~~v~GQY~~~~~~~~~v~gY~~e~~v~-~S~TeTfaa 321 (492) ||||||||+|||.|.|++|++||+||||||||+||++.+|+ |+|||+++..++.+ ||+|.++|+ +|.|||||| T Consensus 254 HLlQiL~LvAME~P~s~~aedir~eKVkvLks~~~v~~~dv----VlGQY~~~~~g~~~--gy~dd~~V~~dS~tpTfaa 327 (499) T KOG0563 254 HLLQILTLVAMEKPKSLDAEDIRDEKVKVLKSIRPVDLEDV----VLGQYKSSSDGKVP--GYLDDKTVPKDSLTPTFAA 327 (499) T ss_pred HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCHHHE----EEEEECCCCCCCCC--CCCCCCCCCCCCCCCCEEE T ss_conf 89999999741788789989999999999986167865434----67622056557877--6446888878998851056 Q ss_pred EEEEECCCCCCCCEEEEEECCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEEEECCCCCCCCEE Q ss_conf 99997085867857775513346885179999985488643766654789856999971796504443312688877445 Q gi|254780504|r 322 IKANIDNPRWSGIPFYLRTGKYLAKHISEIVIALKPAPCTIFDHQMSKIETNKLILRLQNNGEIEQFITTKDHSTSNMKL 401 (492) Q Consensus 322 ~~l~idn~RW~gVPF~lrtGK~l~~k~~eI~i~FK~~p~~~f~~~~~~~~~N~Lvi~iqP~~~i~l~~~~K~pg~~~~~~ 401 (492) +.++|||+||+||||+||+||+|+++++||+|+||.+|+++|.... ++.+|+||||+||+|.|.+++++|.||. ++.+ T Consensus 328 ~~l~Idn~RW~GVPFil~aGKal~e~~~eiriqFk~v~g~lf~~~~-~~~~neLVirvqP~eavylk~~~k~Pgl-~~~~ 405 (499) T KOG0563 328 VALHIDNERWEGVPFILRAGKALNERKAEIRIQFKAVPGGLFSDVR-DCKRNELVIRVQPDEAVYLKINIKQPGL-GMQP 405 (499) T ss_pred EEEEECCCCCCCCCEEEECCCCCCCCEEEEEEEEECCCCCCCCCCC-CCCCCEEEEEECCCCEEEEEEECCCCCC-CCCC T ss_conf 7776358653688789972644465506999996036876304755-5665269999468800047762378876-6776 Q ss_pred EEEEEEEEECCCCC-CCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH Q ss_conf 46666553022345-68986248999999879833586989999845884699998873489874579889971889999 Q gi|254780504|r 402 KTTALRICCCSKRI-KRTPDGYERLLMDIIHANQTLFMGYDEVEETWKWSDSILKSWQIINQKVDYYAAGTWGPDQSDVL 480 (492) Q Consensus 402 ~~~~l~~~~~~~~~-~~~pdaYe~Ll~D~i~Gd~tlF~r~dEve~sWrivdpiL~~~~~~~~~~~~Y~~GS~GP~~a~~l 480 (492) ....+++.|++.+. ...|||||+||+|+|+|||++|+|+||+++||+||||+|+..++..++|.+|++||.||.+|+++ T Consensus 406 ~~~eldl~y~~ry~d~~~pdaYE~Li~d~i~G~~~~FvrsDEl~~aW~iftPlL~~i~~~~~~p~~Y~~GsrGP~~a~e~ 485 (499) T KOG0563 406 DESELDLLYSDRYKDVEIPDAYERLILDAIRGDQTHFVRSDELEAAWKIFTPLLEEIEKKKPKPEPYPYGSRGPKEADEL 485 (499) T ss_pred CHHHCCCCHHHHCCCCCCCCHHHHHHHHHHCCCCCCEEEHHHHHHHHHHCCHHHHHHHCCCCCCCCCCCCCCCCHHHHHH T ss_conf 51221773466556664762499999998748865210228899888754488887513678875557788885669999 Q ss_pred HHH---CCCCCCC Q ss_conf 998---1884477 Q gi|254780504|r 481 LQK---DGRRWHN 490 (492) Q Consensus 481 ~~~---~g~~W~~ 490 (492) +++ ++..|.. T Consensus 486 ~~k~~~~~~~w~~ 498 (499) T KOG0563 486 MKKHGTHGYKWPD 498 (499) T ss_pred HHHCCCCCCCCCC T ss_conf 9743678853899 No 8 >pfam02781 G6PD_C Glucose-6-phosphate dehydrogenase, C-terminal domain. Probab=100.00 E-value=0 Score=862.05 Aligned_cols=292 Identities=41% Similarity=0.698 Sum_probs=282.0 Q ss_pred HHHHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 88774775200252803330799999846770245555322001788765589999999871854458858999999999 Q gi|254780504|r 192 MVFRFANTFYESLWNNKYIDHVQITTAETIGVEDRVDYYNNTGALRDMIQNHLLQLLCLVAMEMPPSVEAKSIKNEKIKV 271 (492) Q Consensus 192 l~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~yyD~~GaiRDmvQNHllQlL~lvAMe~P~~~~~~~ir~eK~kv 271 (492) |+|||||++|||+|||+||+|||||++|++||||||+|||++|||||||||||||||||+|||||.++++++||+||+|| T Consensus 1 l~lRFaN~ifeplWN~~~I~~VqIt~~E~~GvegR~~yYd~~GalRDmvQNHllQll~lvaMe~P~~~~~~~ir~eK~kv 80 (294) T pfam02781 1 LVLRFGNEIFNPLWNRDNIDNVQITFKEPFGTEGRGGYFDEFGIIRDMMQNHLLQILCLVAMEKPVSFSAEDIRDEKVKV 80 (294) T ss_pred CCEEECHHHHHHHHCCCCEEEEEEEECCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 94631234414354725503789986066774715544404672888777099999999974997889999999999999 Q ss_pred HHHHCCCCHHHCCCCEEEECCCCCCCCCEECCCCCCCCCCCC-CCCCCEEEEEEEECCCCCCCCEEEEEECCCCCCCEEE Q ss_conf 973068881122361365213667768734334101479886-6487438999997085867857775513346885179 Q gi|254780504|r 272 LQALQMITPENVQKLTVRGQYQSGIINGIPVKGYLEEIPLGI-SDTETFVAIKANIDNPRWSGIPFYLRTGKYLAKHISE 350 (492) Q Consensus 272 lk~~r~~~~~~v~~~~v~GQY~~~~~~~~~v~gY~~e~~v~~-S~TeTfaa~~l~idn~RW~gVPF~lrtGK~l~~k~~e 350 (492) |+|+||+++++| |||||.+|.++|+.++||++|+||++ |+||||||+|++||||||+||||||||||+|++|+|| T Consensus 81 L~~lr~~~~~~v----vrGQY~~g~~~g~~~~gY~~e~gV~~~S~TeTf~A~kl~IDn~RW~GVPFylRtGK~L~~k~te 156 (294) T pfam02781 81 LKALRPIDEDDV----VLGQYGAGEDGGKAKPGYLEDPTVPADSNTETFAAGVLHIDNERWEGVPFYLRAGKALNERKAE 156 (294) T ss_pred HHCCCCCCHHCE----EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCEEEEECCCCCCCEEE T ss_conf 962578974426----9811367777980046765678989999975013678998388207961798706676877379 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEEEECCCCCCCCEEEEEEEEEEECCCCC-CCCCCHHHHHHHHH Q ss_conf 99998548864376665478985699997179650444331268887744546666553022345-68986248999999 Q gi|254780504|r 351 IVIALKPAPCTIFDHQMSKIETNKLILRLQNNGEIEQFITTKDHSTSNMKLKTTALRICCCSKRI-KRTPDGYERLLMDI 429 (492) Q Consensus 351 I~i~FK~~p~~~f~~~~~~~~~N~Lvi~iqP~~~i~l~~~~K~pg~~~~~~~~~~l~~~~~~~~~-~~~pdaYe~Ll~D~ 429 (492) |+|+||++|+++|... +++|+|+|+|||+++|.+++++|.||. ++.++++.|+++|.+.+. ...||||||||+|| T Consensus 157 I~I~FK~~p~~~f~~~---~~~N~Lii~iqP~e~i~l~~~~K~PG~-~~~~~~~~L~~~~~~~~~~~~~pdaYErLl~Da 232 (294) T pfam02781 157 IRIQFKDVPGDIFSEQ---APRNELVIRVQPDEAIYLKMNAKSPGL-SFNTRETELDLTYSDRYKDFKIPEAYERLILDT 232 (294) T ss_pred EEEEEECCCCHHCCCC---CCCCEEEEEECCCCEEEEEEECCCCCC-CCEEEEEEEEEECCCCCCCCCCCCHHHHHHHHH T ss_conf 9999707880112567---789879999679893899996578888-852678755555032357886851599999999 Q ss_pred HCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCC Q ss_conf 87983358698999984588469999887348987457988997188999999818844778 Q gi|254780504|r 430 IHANQTLFMGYDEVEETWKWSDSILKSWQIINQKVDYYAAGTWGPDQSDVLLQKDGRRWHNN 491 (492) Q Consensus 430 i~Gd~tlF~r~dEve~sWrivdpiL~~~~~~~~~~~~Y~~GS~GP~~a~~l~~~~g~~W~~~ 491 (492) |+||+|||+|+||||+||+|+||||++|+.++.+|++|++|||||++|++|++++|+.||+- T Consensus 233 ~~Gd~tlF~r~dEve~aW~ivdpIl~~w~~~~~~~~~Y~~GS~GP~~a~~Ll~~~g~~W~~~ 294 (294) T pfam02781 233 MCGDQSNFVRRDELEAAWKIFTPILHAIEADKVKPEPYPYGSRGPKEADKLLARNGYVWNDK 294 (294) T ss_pred HCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCC T ss_conf 76995556887999999987589999997679988787989979899999998609903689 No 9 >pfam00479 G6PD_N Glucose-6-phosphate dehydrogenase, NAD binding domain. Probab=100.00 E-value=0 Score=456.19 Aligned_cols=175 Identities=41% Similarity=0.704 Sum_probs=165.7 Q ss_pred EEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCEEE Q ss_conf 99864435667657799999997289875781799998886998899999999998742346799899998985165786 Q gi|254780504|r 14 IIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIFYIN 93 (492) Q Consensus 14 VIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~Y~~ 93 (492) ||||||||||+||||||||+|+++|++|+ +++|||+||+++++++|++.|+++++++... +.+++.|++|+++++|++ T Consensus 1 VIfGatGDLa~RKL~PAL~~L~~~g~Lp~-~~~Iig~aR~~~~~e~f~~~v~~~l~~~~~~-~~~~~~~~~F~~~~~Y~~ 78 (183) T pfam00479 1 VIFGATGDLAKRKLFPALYNLYREGLLPE-DFRIVGVARSEWSDEDFRERVREALKKFVRT-ELDEEVWEEFLSRLSYVS 78 (183) T ss_pred CEECCCCHHHHHHHHHHHHHHHHCCCCCC-CCEEEEEECCCCCHHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHCEEEE T ss_conf 98846248887479999999998099998-9479998478799899999999999975577-579999999997387995 Q ss_pred CCCCCHHHHHHHHHHHCC-----CCCCCEEEEECCCHHHHHHHHHHHHHHCCCC---CCCEEEEECCCCCCHHHHHHHHH Q ss_conf 467897789988887405-----7678469984069568666777777725876---44357760566789889999999 Q gi|254780504|r 94 LDVEKNYGWELLGKLLDI-----DKTRIRVFYLAMSSAFFGKISQRIHANNLVT---EHTRVVLEKPIGSNLSSAQEIHA 165 (492) Q Consensus 94 ~d~~~~~~y~~L~~~l~~-----~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~---~~~RiViEKPFG~Dl~SA~eLn~ 165 (492) +|++++++|+.|++.++. ....+|+||||+||++|+.||++|+++||+. +|+|||||||||+||+||++||+ T Consensus 79 ~d~~~~~~~~~L~~~l~~~~~~~~~~~~rifYLA~pP~~f~~i~~~L~~~~L~~~~~g~~RiVvEKPfG~Dl~Sa~~ln~ 158 (183) T pfam00479 79 GDYDDPEGYERLAERLEELEKKYGTGGNRLFYLALPPSVFGTVAENLKKAGLNKEEGGWTRVVIEKPFGHDLESARELND 158 (183) T ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHH T ss_conf 37998036999999999887640888755899955889999999999982899888882479985788897788999999 Q ss_pred HHHHHCCHHHEEECCCHHHHHHHHH Q ss_conf 9987389688351242012568777 Q gi|254780504|r 166 ITRKIFKESQIFRIDHYLGKEAVQG 190 (492) Q Consensus 166 ~l~~~f~E~qiyRIDHYLGKe~vqn 190 (492) .|+++|+|+|||||||||||||||| T Consensus 159 ~l~~~f~E~qIyRIDHYLGKe~VQN 183 (183) T pfam00479 159 QLASVFDEDQIYRIDHYLGKETVQN 183 (183) T ss_pred HHHHHCCHHHEEEECCCCCHHHHCC T ss_conf 9994479978412333256653168 No 10 >pfam01408 GFO_IDH_MocA Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilize NADP or NAD. This family is called the GFO/IDH/MOCA family in swiss-prot. Probab=96.87 E-value=0.035 Score=35.58 Aligned_cols=49 Identities=16% Similarity=0.292 Sum_probs=40.2 Q ss_pred EEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHH Q ss_conf 699840695686667777777258764435776056678988999999999873 Q gi|254780504|r 117 RVFYLAMSSAFFGKISQRIHANNLVTEHTRVVLEKPIGSNLSSAQEIHAITRKI 170 (492) Q Consensus 117 rifYLAiPP~~f~~i~~~L~~~~l~~~~~RiViEKPFG~Dl~SA~eLn~~l~~~ 170 (492) -+.+.++||+.-..++..+-++| .-|.+|||++.+++.+++|.+...+. T Consensus 64 D~v~I~tp~~~H~~~~~~~l~~g-----~~v~~EKP~~~~~~e~~~l~~~a~~~ 112 (120) T pfam01408 64 DAVSVATPPGLHFELALAALEAG-----KHVLVEKPLATTVEEAKELVELAEKK 112 (120) T ss_pred CEEEECCCHHHHHHHHHHHHHHC-----CEEEEECCCCCCHHHHHHHHHHHHHC T ss_conf 98999087461899999999819-----98999689819999999999999982 No 11 >COG0673 MviM Predicted dehydrogenases and related proteins [General function prediction only] Probab=95.58 E-value=0.24 Score=29.67 Aligned_cols=186 Identities=19% Similarity=0.177 Sum_probs=98.0 Q ss_pred CCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHC Q ss_conf 98789986443566765779999999728987578179999888699889999999999874234679989999898516 Q gi|254780504|r 10 NSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLI 89 (492) Q Consensus 10 ~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~ 89 (492) ...+-|.|+. --+.+..+|++-.+ +. .+.++++.-.+. +..++|.+.+ T Consensus 3 ~irvgiiG~G-~~~~~~~~~~~~~~------~~-~~~~vav~d~~~------------------------~~a~~~a~~~ 50 (342) T COG0673 3 MIRVGIIGAG-GIAGKAHLPALAAL------GG-GLELVAVVDRDP------------------------ERAEAFAEEF 50 (342) T ss_pred CCEEEEECCC-HHHHHHHHHHHHHC------CC-CEEEEEEECCCH------------------------HHHHHHHHHC T ss_conf 3279998987-67888889999738------87-469999964998------------------------9999999981 Q ss_pred CEEECCCCCHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHH Q ss_conf 57864678977899888874057678469984069568666777777725876443577605667898899999999987 Q gi|254780504|r 90 FYINLDVEKNYGWELLGKLLDIDKTRIRVFYLAMSSAFFGKISQRIHANNLVTEHTRVVLEKPIGSNLSSAQEIHAITRK 169 (492) Q Consensus 90 ~Y~~~d~~~~~~y~~L~~~l~~~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~~~~RiViEKPFG~Dl~SA~eLn~~l~~ 169 (492) ..- ..|..+.+++...+ .=++|.++||.+=..++..--++ +.=|++|||++.+++-|++|-+.-.+ T Consensus 51 ~~~-------~~~~~~~~ll~~~~--iD~V~Iatp~~~H~~~a~~AL~a-----GkhVl~EKPla~t~~ea~~l~~~a~~ 116 (342) T COG0673 51 GIA-------KAYTDLEELLADPD--IDAVYIATPNALHAELALAALEA-----GKHVLCEKPLALTLEEAEELVELARK 116 (342) T ss_pred CCC-------CCCCCHHHHHCCCC--CCEEEEECCCHHHHHHHHHHHHC-----CCEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 997-------45299999945999--98899969806779999999977-----99699928998999999999999997 Q ss_pred HCCHHHEEECCCHH-HHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEEECCCC-HH-HHHHH---HHHHHHHHHHHHH Q ss_conf 38968835124201-2568777788774775200252803330799999846770-24-55553---2200178876558 Q gi|254780504|r 170 IFKESQIFRIDHYL-GKEAVQGLMVFRFANTFYESLWNNKYIDHVQITTAETIGV-ED-RVDYY---NNTGALRDMIQNH 243 (492) Q Consensus 170 ~f~E~qiyRIDHYL-GKe~vqnil~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gv-eg-R~~yy---D~~GaiRDmvQNH 243 (492) . ..+.-+.|=. --+.+|.+-.+--++.+= .|-++++...-...- .. +.-+. +..|++-||--.+ T Consensus 117 ~---g~~l~v~~~~Rf~p~~~~~k~li~~g~iG-------~i~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~l~d~giH~ 186 (342) T COG0673 117 A---GVKLMVGFNRRFDPAVQALKELIDSGALG-------EVVSVQASFSRDRPNPPPPPWWRFDRADGGGALLDLGIHD 186 (342) T ss_pred C---CCEEEEEEHHHCCHHHHHHHHHHHCCCCC-------CEEEEEEEECCCCCCCCCCCCCCCCCCCCCCEEECCCCCH T ss_conf 5---99499988465498999999998659874-------1599999711465555664322243235884461477249 Q ss_pred HHHHHHHH Q ss_conf 99999998 Q gi|254780504|r 244 LLQLLCLV 251 (492) Q Consensus 244 llQlL~lv 251 (492) +=+++.|+ T Consensus 187 lD~~~~l~ 194 (342) T COG0673 187 LDLLRFLL 194 (342) T ss_pred HHHHHHHH T ss_conf 99999985 No 12 >PRK10206 putative dehydrogenase; Provisional Probab=94.46 E-value=0.47 Score=27.54 Aligned_cols=69 Identities=20% Similarity=0.321 Sum_probs=46.4 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHH---CCHHHEEECCC--HHHHHHHH Q ss_conf 4699840695686667777777258764435776056678988999999999873---89688351242--01256877 Q gi|254780504|r 116 IRVFYLAMSSAFFGKISQRIHANNLVTEHTRVVLEKPIGSNLSSAQEIHAITRKI---FKESQIFRIDH--YLGKEAVQ 189 (492) Q Consensus 116 ~rifYLAiPP~~f~~i~~~L~~~~l~~~~~RiViEKPFG~Dl~SA~eLn~~l~~~---f~E~qiyRIDH--YLGKe~vq 189 (492) .-+.+.++|+.+=...+..--++| .-|++||||+.+++.|++|-+.-.+. +-.-|-+|-|- ..-|+.++ T Consensus 66 id~V~i~tP~~~H~~~a~~al~aG-----khV~~EKP~~~~~~ea~~l~~~a~~~g~~l~v~~nrR~~~~~~~~k~~i~ 139 (345) T PRK10206 66 VKLVVVCTHADSHFEYAKRALEAG-----KNVLVEKPFTPTLAQAKELFALAKSKGLTVTPYQNRRFDSCFLTAKKAIE 139 (345) T ss_pred CCEEEECCCCHHHHHHHHHHHHCC-----CCEEEECCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHH T ss_conf 999998799578999999999789-----92898036649899999999999970997999872017949999999866 No 13 >PRK11579 putative oxidoreductase; Provisional Probab=94.21 E-value=0.17 Score=30.69 Aligned_cols=109 Identities=15% Similarity=0.231 Sum_probs=67.8 Q ss_pred CEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCE Q ss_conf 78998644356676577999999972898757817999988869988999999999987423467998999989851657 Q gi|254780504|r 12 DFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIFY 91 (492) Q Consensus 12 ~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~Y 91 (492) .+=|.| .|--++..-.|+|- .+| ++.+++++-.+ .+. +.. ++ ..+.. T Consensus 6 rvgiiG-~G~~~~~~h~~~~~------~~~--~~~l~av~d~~--~~~--------~~a-----~~---------~~~~~ 52 (346) T PRK11579 6 RVGLIG-YGYASKTFHAPLIA------GTP--GLELAAVSSSD--ETK--------VKA-----DW---------PTVTV 52 (346) T ss_pred EEEEEC-CCHHHHHHHHHHHH------CCC--CEEEEEEECCC--HHH--------HHC-----CC---------CCCCE T ss_conf 599993-62999999999996------299--91999997989--999--------950-----25---------89953 Q ss_pred EECCCCCHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHH Q ss_conf 8646789778998888740576784699840695686667777777258764435776056678988999999999873 Q gi|254780504|r 92 INLDVEKNYGWELLGKLLDIDKTRIRVFYLAMSSAFFGKISQRIHANNLVTEHTRVVLEKPIGSNLSSAQEIHAITRKI 170 (492) Q Consensus 92 ~~~d~~~~~~y~~L~~~l~~~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~~~~RiViEKPFG~Dl~SA~eLn~~l~~~ 170 (492) +++ | .++++..+ .=+.|.++|+.+=..++...-++| .-|.+|||+..+++.|++|-+.-.+. T Consensus 53 ----~~~---~---~~ll~~~~--id~V~i~tp~~~H~~~~~~al~aG-----khv~~EKP~a~~~~~a~~l~~~a~~~ 114 (346) T PRK11579 53 ----VSE---P---KHLFNDPN--IDLIVIPTPNDTHFPLAKAALEAG-----KHVVVDKPFTVTLSQARELDALAKSL 114 (346) T ss_pred ----ECC---H---HHHHCCCC--CCEEEECCCCHHHHHHHHHHHHCC-----CCEEECCCCCCCHHHHHHHHHHHHHC T ss_conf ----899---9---99945999--999999799678999999999879-----94895387678799999999999872 No 14 >TIGR03649 ergot_EASG ergot alkaloid biosynthesis protein, AFUA_2G17970 family. This family consists of fungal proteins of unknown function associated with secondary metabolite biosynthesis, such as of the ergot alkaloids such as ergovaline. Nomenclature differs because gene order differs - this is EasG in Neotyphodium lolii but is designated ergot alkaloid biosynthetic protein A in several other fungi. Probab=79.70 E-value=4.9 Score=20.22 Aligned_cols=78 Identities=19% Similarity=0.160 Sum_probs=44.2 Q ss_pred CEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCE Q ss_conf 78998644356676577999999972898757817999988869988999999999987423467998999989851657 Q gi|254780504|r 12 DFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIFY 91 (492) Q Consensus 12 ~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~Y 91 (492) ||.|.||||.+-++ +...| ... +..+...+|+.-. .. ...... T Consensus 1 TIlVtGATG~iG~~-v~~~L---~~~------g~~v~~~~R~~~~-----------~~----------------~~~~~~ 43 (285) T TIGR03649 1 TILLTGGTGKTASR-IARLL---QAA------SVPFLVASRSSSS-----------SA----------------GPNEKH 43 (285) T ss_pred CEEEECCCCHHHHH-HHHHH---HHC------CCCEEEEECCHHH-----------CC----------------CCCCCE T ss_conf 98999899818999-99999---868------9978999588566-----------46----------------666753 Q ss_pred EECCCCCHHHHHHHHHHHCCCCCCCEEEEECCCHH Q ss_conf 86467897789988887405767846998406956 Q gi|254780504|r 92 INLDVEKNYGWELLGKLLDIDKTRIRVFYLAMSSA 126 (492) Q Consensus 92 ~~~d~~~~~~y~~L~~~l~~~~~~~rifYLAiPP~ 126 (492) +++|+.+++++.......+.-....-.+||..|+. T Consensus 44 v~~d~~d~~~~~~a~~~~d~~~~~v~~v~l~~p~~ 78 (285) T TIGR03649 44 VKFDWLDEDTWDNPFSSDDGMEPEISAVYLVAPPI 78 (285) T ss_pred EEEECCCCCCHHHHHHHHHHHHCCCEEEEECCCCC T ss_conf 68644481148889763532312741899838998 No 15 >KOG0890 consensus Probab=79.48 E-value=4.7 Score=20.31 Aligned_cols=84 Identities=25% Similarity=0.319 Sum_probs=59.7 Q ss_pred HHHHHHHHHH-HHHHCCCCCHH-HHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 1256877778-87747752002-528033307999998467702455553220017887655899999998718544588 Q gi|254780504|r 183 LGKEAVQGLM-VFRFANTFYES-LWNNKYIDHVQITTAETIGVEDRVDYYNNTGALRDMIQNHLLQLLCLVAMEMPPSVE 260 (492) Q Consensus 183 LGKe~vqnil-~lRFaN~~~e~-~WNr~~I~~VqIt~~E~~gvegR~~yyD~~GaiRDmvQNHllQlL~lvAMe~P~~~~ 260 (492) ++.|.-+++. .+||-|.-+.+ .|+.-.=..+.| .+++ ..=++.+.++|-+|||++-+++..-++. ++ T Consensus 821 ~~~e~~~~~~ell~~~~~~~~~~~~~~~~~~~~~~-------~~~~-~t~~~~e~~~~fl~~~llgil~~f~~~l---~~ 889 (2382) T KOG0890 821 LISETFALLVELLRFYNNNFTPEKRKFSSRKCDFI-------DEDI-RTLISSELFQDFLQNHLLGILAVFSSRL---LE 889 (2382) T ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH-------HCCC-HHCCHHHHHHHHHHHHHHHHHHHHHHHH---CC T ss_conf 87527889999999870267588875655400333-------1130-0001289999998999999999987750---47 Q ss_pred HHHHHHHHHHHHHHHCC Q ss_conf 58999999999973068 Q gi|254780504|r 261 AKSIKNEKIKVLQALQM 277 (492) Q Consensus 261 ~~~ir~eK~kvlk~~r~ 277 (492) +..+..+|.++++++.- T Consensus 890 ~~~~~~~k~~tl~~I~~ 906 (2382) T KOG0890 890 PSTIIEQKKKTLKGIKK 906 (2382) T ss_pred CCHHHHHHHHHHHHHHH T ss_conf 62279999999971899 No 16 >KOG2741 consensus Probab=79.34 E-value=5 Score=20.14 Aligned_cols=184 Identities=17% Similarity=0.161 Sum_probs=96.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCEEECCCCCHHH Q ss_conf 66765779999999728987578179999888699889999999999874234679989999898516578646789778 Q gi|254780504|r 22 LAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIFYINLDVEKNYG 101 (492) Q Consensus 22 LA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~Y~~~d~~~~~~ 101 (492) +..|+..-||. .+|+.++.|++++-... +...+|+++..+- ...--.+ T Consensus 16 ~ia~~f~~al~------~~p~s~~~Ivava~~s~------------------------~~A~~fAq~~~~~--~~k~y~s 63 (351) T KOG2741 16 RIARDFVRALH------TLPESNHQIVAVADPSL------------------------ERAKEFAQRHNIP--NPKAYGS 63 (351) T ss_pred HHHHHHHHHHC------CCCCCCCEEEEEECCCH------------------------HHHHHHHHHCCCC--CCCCCCC T ss_conf 77899999721------47545827999965527------------------------8899999865999--8742557 Q ss_pred HHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCHHHEEECCC Q ss_conf 99888874057678469984069568666777777725876443577605667898899999999987389688351242 Q gi|254780504|r 102 WELLGKLLDIDKTRIRVFYLAMSSAFFGKISQRIHANNLVTEHTRVVLEKPIGSNLSSAQEIHAITRKIFKESQIFRIDH 181 (492) Q Consensus 102 y~~L~~~l~~~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~~~~RiViEKPFG~Dl~SA~eLn~~l~~~f~E~qiyRIDH 181 (492) |+.|.+ +.. .-+.|+++|-..-..++.-+...+- -|.+|||...+.+-+++|-+.-.+. -+|-+|- T Consensus 64 yEeLak----d~~-vDvVyi~~~~~qH~evv~l~l~~~K-----~VL~EKPla~n~~e~~~iveaA~~r----gv~~meg 129 (351) T KOG2741 64 YEELAK----DPE-VDVVYISTPNPQHYEVVMLALNKGK-----HVLCEKPLAMNVAEAEEIVEAAEAR----GVFFMEG 129 (351) T ss_pred HHHHHC----CCC-CCEEEECCCCCCHHHHHHHHHHCCC-----CEEECCCCCCCHHHHHHHHHHHHHC----CCEEEEE T ss_conf 998824----987-6879967998008999999997599-----5786565538899999999999976----9488764 Q ss_pred HHH--HHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEEECCC-----CHHHHH-HHHHHHHHHHHHHHHHHHHHHHH-- Q ss_conf 012--56877778877477520025280333079999984677-----024555-53220017887655899999998-- Q gi|254780504|r 182 YLG--KEAVQGLMVFRFANTFYESLWNNKYIDHVQITTAETIG-----VEDRVD-YYNNTGALRDMIQNHLLQLLCLV-- 251 (492) Q Consensus 182 YLG--Ke~vqnil~lRFaN~~~e~~WNr~~I~~VqIt~~E~~g-----vegR~~-yyD~~GaiRDmvQNHllQlL~lv-- 251 (492) .-+ -+.++-+--+=+ +..| --|.+|+|++.=.+- ...|.. +-++-|++-|+.+=-+ |.--++ T Consensus 130 ~~~R~~P~~~~lke~l~-~~~~------Gdvk~v~~~~~f~~~~~~l~~~~r~~~~~~g~G~l~D~g~Y~i-~~~~~~~~ 201 (351) T KOG2741 130 LWWRFFPRYAKLKELLS-SGVL------GDVKSVEVEFGFPFPEDELPHKSRLRTGLLGGGALGDLGIYPI-QAALWVNN 201 (351) T ss_pred EEEECCCHHHHHHHHHH-CCCC------CCCEEEEEECCCCCCHHHCCCCCCHHEECCCCCEEHHHHHHHH-HHHHHHHH T ss_conf 44210738999999971-2655------5624899964777511211343210000026752006489888-88999998 Q ss_pred HCCCCCCC Q ss_conf 71854458 Q gi|254780504|r 252 AMEMPPSV 259 (492) Q Consensus 252 AMe~P~~~ 259 (492) .=++|... T Consensus 202 f~~~p~~~ 209 (351) T KOG2741 202 FQEPPEVI 209 (351) T ss_pred HCCCCCEE T ss_conf 06996044 No 17 >KOG0946 consensus Probab=74.27 E-value=3.7 Score=21.06 Aligned_cols=113 Identities=19% Similarity=0.340 Sum_probs=70.6 Q ss_pred HHHHHHHHHHHHHCCHHHEEECCCHHHHHHHHHHHHHHHCCC-CCHHHHCCCCEEEEE-------EEEEECCCCHHHHHH Q ss_conf 899999999987389688351242012568777788774775-200252803330799-------999846770245555 Q gi|254780504|r 158 SSAQEIHAITRKIFKESQIFRIDHYLGKEAVQGLMVFRFANT-FYESLWNNKYIDHVQ-------ITTAETIGVEDRVDY 229 (492) Q Consensus 158 ~SA~eLn~~l~~~f~E~qiyRIDHYLGKe~vqnil~lRFaN~-~~e~~WNr~~I~~Vq-------It~~E~~gvegR~~y 229 (492) +--.-||..|...-.-.++||=+-|+-+=. -=+..|-|.+. .| -|+..-|.+|. .++.-+--.+. T Consensus 231 DCL~ll~NLLK~N~SNQ~~FrE~~~i~rL~-klL~~f~~~d~Ev~--~W~~Qrv~Nv~~~Lqivr~lVsP~Nt~~~---- 303 (970) T KOG0946 231 DCLILLNNLLKNNISNQNFFREGSYIPRLL-KLLSVFEFGDGEVF--GWSTQRVQNVIEALQIVRSLVSPGNTSSI---- 303 (970) T ss_pred HHHHHHHHHHHHCCCHHHHHHCCCCHHHHH-HHCCCCCCCCCCCC--CCCHHHHHHHHHHHHHHHHHCCCCCCHHH---- T ss_conf 999999999861754156776245177887-40571001575102--55188888999999999986189983777---- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECCCC Q ss_conf 32200178876558999999987185445885899999999997306888112236136521366 Q gi|254780504|r 230 YNNTGALRDMIQNHLLQLLCLVAMEMPPSVEAKSIKNEKIKVLQALQMITPENVQKLTVRGQYQS 294 (492) Q Consensus 230 yD~~GaiRDmvQNHllQlL~lvAMe~P~~~~~~~ir~eK~kvlk~~r~~~~~~v~~~~v~GQY~~ 294 (492) ..-+-+=|+|||+|+.||.++|-+- -+.+|+.+-+- ...+ +|||-|.+ T Consensus 304 --~~q~qk~l~ss~ll~~Lc~il~~~~---vp~dIltesii--------tvAe----vVRgn~~n 351 (970) T KOG0946 304 --THQNQKALVSSHLLDVLCTILMHPG---VPADILTESII--------TVAE----VVRGNARN 351 (970) T ss_pred --HHHHHHHHHHCCHHHHHHHHHCCCC---CCHHHHHHHHH--------HHHH----HHHHCHHH T ss_conf --8899999987204999999983789---80767789999--------9999----99704677 No 18 >PRK05989 cobN cobaltochelatase; Reviewed Probab=65.61 E-value=10 Score=17.93 Aligned_cols=99 Identities=15% Similarity=0.304 Sum_probs=52.0 Q ss_pred HHHHHHHHHHHCCCCCHHHHCCCC--EEEEEEEEEECCCCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHC----CCCCC Q ss_conf 687777887747752002528033--307999998467702455-55322001788765589999999871----85445 Q gi|254780504|r 186 EAVQGLMVFRFANTFYESLWNNKY--IDHVQITTAETIGVEDRV-DYYNNTGALRDMIQNHLLQLLCLVAM----EMPPS 258 (492) Q Consensus 186 e~vqnil~lRFaN~~~e~~WNr~~--I~~VqIt~~E~~gvegR~-~yyD~~GaiRDmvQNHllQlL~lvAM----e~P~~ 258 (492) +++-.||.|== .+|+|+... |.-++|.=.|++|== |. --.--+|..||++.|-+--|=.-|.| ++|.. T Consensus 912 ~~iAqiL~LlG----VrPvWd~~~~RV~g~eiIPl~eLgRP-RIDVtvriSG~fRD~Fp~~i~LlD~Av~~vA~ldEp~~ 986 (1251) T PRK05989 912 DDIAQALALLG----VRPVWDDGSRRVTDFEIIPLAELGRP-RIDVTLRISGFFRDAFPNLMRLFDDAVRAVAALDEPEE 986 (1251) T ss_pred HHHHHHHHHHC----CCCCCCCCCCCEEEEEEECHHHCCCC-CEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 62999999828----72011489983201376068893999-74799851316676568999999999999983799844 Q ss_pred CCH--HHHHHHHHHHH------------HHHCCCCHHHCCCCEEEECCCCCC Q ss_conf 885--89999999999------------730688811223613652136677 Q gi|254780504|r 259 VEA--KSIKNEKIKVL------------QALQMITPENVQKLTVRGQYQSGI 296 (492) Q Consensus 259 ~~~--~~ir~eK~kvl------------k~~r~~~~~~v~~~~v~GQY~~~~ 296 (492) .+. ...+.+..+.. ...|.+.... |.|.+|. T Consensus 987 ~N~vr~h~~~~~~~l~~~G~~~~~A~~~a~~RIFg~~p-------G~YGaGv 1031 (1251) T PRK05989 987 DNPVRAHVRAELAELGARGLDAAEARRRATLRIFGSKP-------GAYGAGL 1031 (1251) T ss_pred CCHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCEECCCC-------CCHHHHH T ss_conf 27898989999999985589867787544350604999-------8423528 No 19 >pfam02514 CobN-Mg_chel CobN/Magnesium Chelatase. This family contains a domain common to the cobN protein and to magnesium protoporphyrin chelatase. CobN is implicated in the conversion of hydrogenobyrinic acid a,c-diamide to cobyrinic acid. Magnesium protoporphyrin chelatase is involved in chlorophyll biosynthesis. Probab=62.10 E-value=9 Score=18.30 Aligned_cols=97 Identities=20% Similarity=0.334 Sum_probs=50.8 Q ss_pred HHHHHHHHHHHCCCCCHHHHCC-CCEEEEEEEEEECCCCHHHHH---HHHHHHHHHHHHHHHHHHHHH---HHHC-CCCC Q ss_conf 6877778877477520025280-333079999984677024555---532200178876558999999---9871-8544 Q gi|254780504|r 186 EAVQGLMVFRFANTFYESLWNN-KYIDHVQITTAETIGVEDRVD---YYNNTGALRDMIQNHLLQLLC---LVAM-EMPP 257 (492) Q Consensus 186 e~vqnil~lRFaN~~~e~~WNr-~~I~~VqIt~~E~~gvegR~~---yyD~~GaiRDmvQNHllQlL~---lvAM-e~P~ 257 (492) +++-.||.|== .+|+|+. .-|..|+|.=.|++| |-. ----+|..|||+.|-+--|=- ++|. +.|. T Consensus 739 ~~iaqiL~LlG----VrPvwd~~grV~g~e~IPl~eLg---RPRIDV~v~iSG~fRD~fp~~i~LlD~Av~~va~ldEp~ 811 (1064) T pfam02514 739 EDIAQILALLG----VRPVWDASGRVTGVELIPLEELG---RPRIDVTVRISGLFRDAFPNLIELIDEAVRLVAALDEPD 811 (1064) T ss_pred HHHHHHHHHCC----CEEEECCCCCCCCEEEECHHHCC---CCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 75999999838----70556698966631775689929---998679983350368778999999999999997389983 Q ss_pred CCCH--HHHHHH------------HHHHHHHHCCCCHHHCCCCEEEECCCCCC Q ss_conf 5885--899999------------99999730688811223613652136677 Q gi|254780504|r 258 SVEA--KSIKNE------------KIKVLQALQMITPENVQKLTVRGQYQSGI 296 (492) Q Consensus 258 ~~~~--~~ir~e------------K~kvlk~~r~~~~~~v~~~~v~GQY~~~~ 296 (492) ..+. ...+.. ...-+...|.+.+.. |.|.+|. T Consensus 812 e~N~vr~h~~~~~~~l~~~G~~~~~A~~~A~~RIFg~~p-------G~YGtGv 857 (1064) T pfam02514 812 EMNYVRKHSLALEAELLEEGYDEEEARELATARVFGSAP-------GAYGAGV 857 (1064) T ss_pred CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCEECCCC-------CCCHHHH T ss_conf 205899999999999986589856677651141403899-------8500328 No 20 >pfam02446 Glyco_hydro_77 4-alpha-glucanotransferase. These enzymes EC:2.4.1.25 transfer a segment of a (1,4)-alpha-D-glucan to a new 4-position in an acceptor, which may be glucose or (1,4)-alpha-D-glucan. Probab=55.39 E-value=3.2 Score=21.51 Aligned_cols=23 Identities=30% Similarity=0.307 Sum_probs=14.8 Q ss_pred HHHHHHHCCHHHEEECCCHHHHH Q ss_conf 99998738968835124201256 Q gi|254780504|r 164 HAITRKIFKESQIFRIDHYLGKE 186 (492) Q Consensus 164 n~~l~~~f~E~qiyRIDHYLGKe 186 (492) -+.|...++-=.++||||++|=. T Consensus 270 ~~rl~~~~~~~d~lRIDH~~Gf~ 292 (494) T pfam02446 270 IERLRAALKYFDALRIDHFRGFF 292 (494) T ss_pred HHHHHHHHHHCCCHHHHHHHHHH T ss_conf 99999999865830022276664 No 21 >COG5649 Uncharacterized conserved protein [Function unknown] Probab=54.31 E-value=16 Score=16.59 Aligned_cols=30 Identities=17% Similarity=0.472 Sum_probs=19.5 Q ss_pred EECCCCCCCCEEE-----EEECCCCCCCEEEEEEEEEC Q ss_conf 9708586785777-----55133468851799999854 Q gi|254780504|r 325 NIDNPRWSGIPFY-----LRTGKYLAKHISEIVIALKP 357 (492) Q Consensus 325 ~idn~RW~gVPF~-----lrtGK~l~~k~~eI~i~FK~ 357 (492) -+..|.|.|+||| +.||-.+ +..|.++|-+ T Consensus 47 v~e~vKWrg~Pvweh~g~ictge~y---k~vvK~tFak 81 (132) T COG5649 47 VHEAVKWRGSPVWEHDGWICTGECY---KGVVKVTFAK 81 (132) T ss_pred HHHEEEECCCCCCCCCCEEEEEEEE---EEEEEEEEEC T ss_conf 5550352576320377569840345---2577777742 No 22 >PRK12321 cobN cobaltochelatase; Reviewed Probab=53.21 E-value=10 Score=17.85 Aligned_cols=129 Identities=16% Similarity=0.244 Sum_probs=68.3 Q ss_pred CCCCCCH----------HHHHHHHHHHHHHCCHHHEEECCC--HH---------------HHHHHHHHHHHHHCCCCCHH Q ss_conf 5667898----------899999999987389688351242--01---------------25687777887747752002 Q gi|254780504|r 151 KPIGSNL----------SSAQEIHAITRKIFKESQIFRIDH--YL---------------GKEAVQGLMVFRFANTFYES 203 (492) Q Consensus 151 KPFG~Dl----------~SA~eLn~~l~~~f~E~qiyRIDH--YL---------------GKe~vqnil~lRFaN~~~e~ 203 (492) -|=|+|+ .+|-++...+.+.+-+.. +=|| |. |=+++-.+|.| .=.+| T Consensus 726 LPTGRNfys~Dpr~iPT~aAw~~G~~~A~~ll~~y--~~~~G~yP~~v~~~lWgt~tmRt~G~~iAq~L~L----lGVrP 799 (1101) T PRK12321 726 LPTGRNLYTIDPRAVPTRAAHAQGVKAAEELLRRH--LQDHGDWPRGLVVDLWGSATMRTGGEEFAMALAL----MGVRP 799 (1101) T ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH--HHHCCCCCCEEEEEEEEHHHHHHCCHHHHHHHHH----CCCCC T ss_conf 78887654658232778999999999999999999--9872998503788886354443156759999997----19721 Q ss_pred HHC--CCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHH---HH-HCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 528--0333079999984677024555532200178876558999999---98-71854458858999999999973068 Q gi|254780504|r 204 LWN--NKYIDHVQITTAETIGVEDRVDYYNNTGALRDMIQNHLLQLLC---LV-AMEMPPSVEAKSIKNEKIKVLQALQM 277 (492) Q Consensus 204 ~WN--r~~I~~VqIt~~E~~gvegR~~yyD~~GaiRDmvQNHllQlL~---lv-AMe~P~~~~~~~ir~eK~kvlk~~r~ 277 (492) +|+ ..-|.-++|.=.|++|==-=---.--+|..||++.|-+--+=. ++ +-|+|...++ ++. ..-....|. T Consensus 800 vWd~~s~RV~g~eiIPl~eLgRPRIDVt~riSG~FRD~Fp~~i~Lld~Av~~vA~ldE~~~~Np--~~~--a~~~~~~RI 875 (1101) T PRK12321 800 VWDHASERVTGIEVLPLALLGRPRIDVTLRVSGLFRDVFPALIALFDQAARAVAARDEADDDNP--LAA--RRGERAARV 875 (1101) T ss_pred CCCCCCCCEEEEEEECHHHCCCCCEEEEEEECHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCH--HHH--HHCCCCCCC T ss_conf 4447888542247606889299987799961404787788999999999999983469954289--988--624575423 Q ss_pred CCHHHCCCCEEEECCCCCC Q ss_conf 8811223613652136677 Q gi|254780504|r 278 ITPENVQKLTVRGQYQSGI 296 (492) Q Consensus 278 ~~~~~v~~~~v~GQY~~~~ 296 (492) +.+.. |.|.+|. T Consensus 876 Fg~~p-------G~YGaGv 887 (1101) T PRK12321 876 FGPAP-------GSYGAGA 887 (1101) T ss_pred CCCCC-------CCHHHHH T ss_conf 04999-------8554628 No 23 >pfam05542 DUF760 Protein of unknown function (DUF760). This family contains several uncharacterized plant proteins. Probab=52.89 E-value=7.7 Score=18.79 Aligned_cols=16 Identities=44% Similarity=0.563 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 8876558999999987 Q gi|254780504|r 237 RDMIQNHLLQLLCLVA 252 (492) Q Consensus 237 RDmvQNHllQlL~lvA 252 (492) -+||||||..||..-+ T Consensus 121 lEMI~eHL~~ilG~r~ 136 (328) T pfam05542 121 LEMIQEHLSAILGLRV 136 (328) T ss_pred HHHHHHHHHHHHCCCC T ss_conf 9999999999957532 No 24 >pfam01474 DAHP_synth_2 Class-II DAHP synthetase family. Members of this family are aldolase enzymes that catalyse the first step of the shikimate pathway. Probab=52.71 E-value=17 Score=16.42 Aligned_cols=16 Identities=19% Similarity=-0.035 Sum_probs=6.5 Q ss_pred CCHHHHHHHHHHHHHH Q ss_conf 9988999999999987 Q gi|254780504|r 55 MTVESYRSFVDKELQR 70 (492) Q Consensus 55 ~~~e~fr~~v~~~l~~ 70 (492) |.|.+-.+.+.+.|.. T Consensus 18 ypD~~~l~~v~~~L~~ 33 (437) T pfam01474 18 YPDPAALAAVLAELAS 33 (437) T ss_pred CCCHHHHHHHHHHHHC T ss_conf 9599999999999961 No 25 >TIGR02025 BchH magnesium chelatase, H subunit; InterPro: IPR011771 This entry represents the H subunit of the magnesium chelatase complex responsible for magnesium insertion into the protoporphyrin IX ring in the biosynthesis of both chlorophyll and bacteriochlorophyll. In chlorophyll-utilizing species, this gene is known as ChlH, while in bacteriochlorophyll-utilizing spoecies it is called BchH. Subunit H is the largest (~140kDa) of the three subunits (the others being BchD/ChlD and BchI/ChlI), and is known to bind protoporphyrin IX. Subunit H is homologous to the CobN subunit of cobaltochelatase and by anology with that enzyme, subunit H is believed to also bind the magnesium ion which is inserted into the ring. In conjunction with the hydrolysis of ATP by subunits I and D, a conformation change is believed to happen in subunit H causing the magnesium ion insertion into the distorted protoporphyrin ring .; GO: 0016851 magnesium chelatase activity, 0015995 chlorophyll biosynthetic process. Probab=51.98 E-value=15 Score=16.70 Aligned_cols=97 Identities=25% Similarity=0.339 Sum_probs=51.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHC----CCCCCCC-----HHHHHHHH-HHHHH-HHCCCCHHHCCCCEEEECCCCCCCCCE Q ss_conf 2001788765589999999871----8544588-----58999999-99997-306888112236136521366776873 Q gi|254780504|r 232 NTGALRDMIQNHLLQLLCLVAM----EMPPSVE-----AKSIKNEK-IKVLQ-ALQMITPENVQKLTVRGQYQSGIINGI 300 (492) Q Consensus 232 ~~GaiRDmvQNHllQlL~lvAM----e~P~~~~-----~~~ir~eK-~kvlk-~~r~~~~~~v~~~~v~GQY~~~~~~~~ 300 (492) =+||.||+++|=|=+|=-+|=| |+|..++ +.+...|- +.|=. |.|.++..+ |-|.+.-.+ - T Consensus 1087 lSGIFRD~f~~q~~LLD~avk~AA~ADEP~emNfvRKHaL~~~~e~Gi~~e~Aa~R~FsNA~-------G~YGsnVN~-~ 1158 (1384) T TIGR02025 1087 LSGIFRDLFVNQMELLDEAVKLAAKADEPLEMNFVRKHALAQAEEGGIDVEEAAARVFSNAP-------GSYGSNVND-L 1158 (1384) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCCC-------CCCCCCCCC-C T ss_conf 28448778999999999999998512778446616788999997459767888622046889-------875533035-2 Q ss_pred ECCC-CCCCCCCCCCCCCCEEEEEEEECCCCCCCCEEEEEECCCCCCCEEE Q ss_conf 4334-1014798866487438999997085867857775513346885179 Q gi|254780504|r 301 PVKG-YLEEIPLGISDTETFVAIKANIDNPRWSGIPFYLRTGKYLAKHISE 350 (492) Q Consensus 301 ~v~g-Y~~e~~v~~S~TeTfaa~~l~idn~RW~gVPF~lrtGK~l~~k~~e 350 (492) -+-| .-+|. -..+||.-=|-|= |=+.+|+|-+...| T Consensus 1159 vE~s~WE~e~----eL~d~f~~RksFA----------Yg~~~~gmm~~~~E 1195 (1384) T TIGR02025 1159 VENSAWEDEG----ELADTFIKRKSFA----------YGRQGKGMMEQKPE 1195 (1384) T ss_pred EECCCCCCHH----HHHHHHHHCCEEE----------CCCCCCCHHHCCHH T ss_conf 2268888766----7987756403022----------07887871000358 No 26 >pfam05368 NmrA NmrA-like family. NmrA is a negative transcriptional regulator involved in the post-translational modification of the transcription factor AreA. NmrA is part of a system controlling nitrogen metabolite repression in fungi. This family only contains a few sequences as iteration results in significant matches to other Rossmann fold families. Probab=50.99 E-value=18 Score=16.24 Aligned_cols=98 Identities=11% Similarity=0.069 Sum_probs=51.9 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCEE Q ss_conf 89986443566765779999999728987578179999888699889999999999874234679989999898516578 Q gi|254780504|r 13 FIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIFYI 92 (492) Q Consensus 13 lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~Y~ 92 (492) +.|+||||.+-+. +..+|- . .++++.+..|..-+. .. +.+. ...+..+ T Consensus 1 IlV~GatG~iG~~-vv~~L~---~------~g~~Vr~l~R~~~~~-~~-----~~l~----------------~~gve~v 48 (232) T pfam05368 1 ILVFGATGYQGGS-VVRASL---K------AGHPVRALVRDPKSE-LA-----KSLK----------------AAGVELV 48 (232) T ss_pred EEEECCCHHHHHH-HHHHHH---H------CCCCEEEEECCCCHH-HH-----HHHH----------------HCCCEEE T ss_conf 0998968289999-999998---5------899389997187366-56-----6664----------------1798899 Q ss_pred ECCCCCHHHHHHHHHHHCCCCCCCEEEEECCCHHHHH--HHHHHHHHHCCCCCCCEEEE Q ss_conf 6467897789988887405767846998406956866--67777777258764435776 Q gi|254780504|r 93 NLDVEKNYGWELLGKLLDIDKTRIRVFYLAMSSAFFG--KISQRIHANNLVTEHTRVVL 149 (492) Q Consensus 93 ~~d~~~~~~y~~L~~~l~~~~~~~rifYLAiPP~~f~--~i~~~L~~~~l~~~~~RiVi 149 (492) ++|++++++. .+.++. ...+ |+.+|+..-. ....++-++....+=.|+|. T Consensus 49 ~gD~~d~~sl---~~al~g---vd~v-~~~~~~~~~~~~~~~~~~~~AA~~aGVk~~V~ 100 (232) T pfam05368 49 EGDLDDHESL---VEALKG---VDVV-FSVTGFWLSKEIEDGKKLADAAKEAGVKHFIP 100 (232) T ss_pred EECCCCHHHH---HHHHCC---CCEE-EEECCCCCCHHHHHHHHHHHHHHHCCCCCEEE T ss_conf 9068887899---999679---9889-99158874177999999999999739983455 No 27 >KOG3363 consensus Probab=49.70 E-value=17 Score=16.26 Aligned_cols=31 Identities=23% Similarity=0.219 Sum_probs=16.6 Q ss_pred CCCCCEEEEEEECCCCCHHHHHHHHHHHHHHH Q ss_conf 87578179999888699889999999999874 Q gi|254780504|r 40 ENQISSRIIGICRIKMTVESYRSFVDKELQRY 71 (492) Q Consensus 40 l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~ 71 (492) -|..+..|+|+||...- +..+..|++.++++ T Consensus 109 ePkidlLIvG~Gd~~~p-~~v~~~V~~F~k~~ 139 (196) T KOG3363 109 EPKIDLLIVGCGDKKHP-DKVRPSVRQFVKSH 139 (196) T ss_pred CCCCCEEEEECCCCCCC-HHCCHHHHHHHHHH T ss_conf 77765799944786881-22288899999983 No 28 >PRK09072 short chain dehydrogenase; Provisional Probab=49.58 E-value=18 Score=16.09 Aligned_cols=74 Identities=12% Similarity=0.244 Sum_probs=45.9 Q ss_pred CCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 89898789986443566765779999999728987578179999888699889999999999874234679989999898 Q gi|254780504|r 7 NIHNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFL 86 (492) Q Consensus 7 ~~~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~ 86 (492) ++.+-+.+|-|||+-+-+- +-..+. +.+.+++.++|++-..++. .+.+ .. T Consensus 2 ~l~~K~vlITGassGIG~a-----~A~~la-----~~G~~vil~~R~~~~L~~~----~~~l----------------~~ 51 (262) T PRK09072 2 DLKDKRVLLTGASGGIGEA-----LAEALC-----AAGARLLLVGRNAEKLEAL----AARP----------------YP 51 (262) T ss_pred CCCCCEEEEECCCCHHHHH-----HHHHHH-----HCCCEEEEEECCHHHHHHH----HHHH----------------CC T ss_conf 9899889994862399999-----999999-----8799899998989999999----9984----------------58 Q ss_pred HHCCEEECCCCCHHHHHHHHHHHC Q ss_conf 516578646789778998888740 Q gi|254780504|r 87 SLIFYINLDVEKNYGWELLGKLLD 110 (492) Q Consensus 87 ~~~~Y~~~d~~~~~~y~~L~~~l~ 110 (492) ..+.++.+|+++.++.+.+.+..+ T Consensus 52 ~~~~~~~~Dls~~~~~~~~~~~~~ 75 (262) T PRK09072 52 GRVRWVVADLTSEAGREAVLARAR 75 (262) T ss_pred CCEEEEEEECCCHHHHHHHHHHHH T ss_conf 976999971799999999999999 No 29 >pfam04989 CmcI Cephalosporin hydroxylase. Members of this family are about 220 amino acids long. The CmcI protein is presumed to represent the cephalosporin-7--hydroxylase. However this has not been experimentally verified. Probab=48.56 E-value=19 Score=15.98 Aligned_cols=91 Identities=19% Similarity=0.181 Sum_probs=55.4 Q ss_pred HHCCEEECCCCCHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHH-HCCCCCCCEEEEECCCCCCH-------- Q ss_conf 516578646789778998888740576784699840695686667777777-25876443577605667898-------- Q gi|254780504|r 87 SLIFYINLDVEKNYGWELLGKLLDIDKTRIRVFYLAMSSAFFGKISQRIHA-NNLVTEHTRVVLEKPIGSNL-------- 157 (492) Q Consensus 87 ~~~~Y~~~d~~~~~~y~~L~~~l~~~~~~~rifYLAiPP~~f~~i~~~L~~-~~l~~~~~RiViEKPFG~Dl-------- 157 (492) .++.+++||-.+.+.+....++.... ...+.-|- ..+....|.+-|.. +.+.+.++-+|||--.=.++ T Consensus 81 ~~I~lieg~s~d~~~~~~v~~~~~~~--~~vlVilD-s~Ht~~hVl~eL~~y~~lv~~GsY~IVeDTii~~~p~~~~~~r 157 (202) T pfam04989 81 PRITFIQGSSTDPEIIEQVRSLAEPP--HPVLVILD-SDHTHEHVLAELRAYAPLVSEGSYLVVEDTIIEDLPESPFPDR 157 (202) T ss_pred CCEEEEECCCCCHHHHHHHHHHHCCC--CCEEEEEC-CCCCHHHHHHHHHHHHHHCCCCCEEEEECCCHHHCCCCCCCCC T ss_conf 87699976856788998899985379--85899964-8865688999999875215768899995762431755667888 Q ss_pred ------HHHHHHHHHHHHHCCHHHEEECCCHHH Q ss_conf ------899999999987389688351242012 Q gi|254780504|r 158 ------SSAQEIHAITRKIFKESQIFRIDHYLG 184 (492) Q Consensus 158 ------~SA~eLn~~l~~~f~E~qiyRIDHYLG 184 (492) --..++++-|.+. .=|.||+++- T Consensus 158 ~w~~gnnP~~Av~eFL~~~----~~F~iD~~~~ 186 (202) T pfam04989 158 PWGKGNNPKTAVTEFLAEH----PEFEIDTAIE 186 (202) T ss_pred CCCCCCCHHHHHHHHHHHC----CCCEECCCCC T ss_conf 6878989799999999878----9938653202 No 30 >COG1640 MalQ 4-alpha-glucanotransferase [Carbohydrate transport and metabolism] Probab=48.48 E-value=6.3 Score=19.42 Aligned_cols=23 Identities=35% Similarity=0.379 Sum_probs=15.6 Q ss_pred HHHHHHHHCCHHHEEECCCHHHH Q ss_conf 99999873896883512420125 Q gi|254780504|r 163 IHAITRKIFKESQIFRIDHYLGK 185 (492) Q Consensus 163 Ln~~l~~~f~E~qiyRIDHYLGK 185 (492) --+.|+..++--+++||||+.|- T Consensus 286 wierlr~~~~~~~~lRIDHf~Gl 308 (520) T COG1640 286 WIERLRANLKLYGILRIDHFRGL 308 (520) T ss_pred HHHHHHHHHHHCCEEEEEEECCH T ss_conf 99999999875680665100013 No 31 >PRK10525 cytochrome o ubiquinol oxidase subunit II; Provisional Probab=47.26 E-value=20 Score=15.84 Aligned_cols=11 Identities=18% Similarity=0.537 Sum_probs=6.1 Q ss_pred CCCHHHHCCCC Q ss_conf 52002528033 Q gi|254780504|r 199 TFYESLWNNKY 209 (492) Q Consensus 199 ~~~e~~WNr~~ 209 (492) .-++|-|.+.+ T Consensus 76 a~y~P~w~hs~ 86 (305) T PRK10525 76 AKYSPNWSHSN 86 (305) T ss_pred CCCCCCCCCCC T ss_conf 88799755787 No 32 >PRK13405 bchH magnesium chelatase subunit H; Provisional Probab=46.68 E-value=20 Score=15.78 Aligned_cols=23 Identities=17% Similarity=0.249 Sum_probs=9.0 Q ss_pred CCHHHHHHHHHHCCCCCCCCCHH Q ss_conf 86248999999879833586989 Q gi|254780504|r 419 PDGYERLLMDIIHANQTLFMGYD 441 (492) Q Consensus 419 pdaYe~Ll~D~i~Gd~tlF~r~d 441 (492) |+.=+.|+..++.|+...+.... T Consensus 482 P~~~~~L~~~i~~~~~~~~~~~~ 504 (1207) T PRK13405 482 PESVDALREAILGGNASRYGTPA 504 (1207) T ss_pred CCCHHHHHHHHHCCCCCCCCCCH T ss_conf 99999999999748854347823 No 33 >TIGR01214 rmlD dTDP-4-dehydrorhamnose reductase; InterPro: IPR005913 dTDP-4-dehydrorhamnose reductase (1.1.1.133 from EC) catalyzes the last of 4 steps in making dTDP-rhamnose, a precursor of LPS molecules such as core antigen and O-antigen.dTDP-6-deoxy-L-mannose + NADP+ = dTDP-4-dehydro-6-deoxy-L-mannose + NADPH ; GO: 0008831 dTDP-4-dehydrorhamnose reductase activity, 0045226 extracellular polysaccharide biosynthetic process. Probab=46.27 E-value=21 Score=15.74 Aligned_cols=32 Identities=16% Similarity=0.342 Sum_probs=23.8 Q ss_pred CEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECC Q ss_conf 789986443566765779999999728987578179999888 Q gi|254780504|r 12 DFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRI 53 (492) Q Consensus 12 ~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~ 53 (492) .|.|+||+|=|++. |.. .+++.....+.+.|+ T Consensus 1 rilitGa~GQlG~~--------L~~--~l~~~g~~~~~~~~~ 32 (317) T TIGR01214 1 RILITGANGQLGRE--------LVQ--QLSKPGRVVVALTRS 32 (317) T ss_pred CEEEECCCCHHHHH--------HHH--HCCCCCCEEEEECCC T ss_conf 97887387567999--------999--707888278643687 No 34 >PRK00092 hypothetical protein; Reviewed Probab=43.26 E-value=23 Score=15.42 Aligned_cols=34 Identities=9% Similarity=0.216 Sum_probs=29.3 Q ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHCCHHHEEE Q ss_conf 3577605667898899999999987389688351 Q gi|254780504|r 145 TRVVLEKPIGSNLSSAQEIHAITRKIFKESQIFR 178 (492) Q Consensus 145 ~RiViEKPFG~Dl~SA~eLn~~l~~~f~E~qiyR 178 (492) .||.|+||=|-+++-+..+++.+...++.+.++- T Consensus 35 lrI~ID~~~gv~lddc~~vSr~is~~LD~~d~i~ 68 (153) T PRK00092 35 LRIYIDSDGGITLDDCEDVSRQLSAVLDVEDPIP 68 (153) T ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCC T ss_conf 9999988999189999998899887526365678 No 35 >cd06842 PLPDE_III_Y4yA_like Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Y4yA. This subfamily is composed of the hypothetical Rhizobium sp. protein Y4yA and similar uncharacterized bacterial proteins. These proteins are homologous to eukaryotic ornithine decarboxylase (ODC) and diaminopimelate decarboxylase (DapDC). ODC and DapDC are fold type III PLP-dependent enzymes that contain an N-terminal PLP-binding TIM-barrel domain and a C-terminal beta-sandwich domain, similar to bacterial alanine racemases. ODC participates in the formation of putrescine by catalyzing the decarboxylation of ornithine, the first step in polyamine biosynthesis. DapDC participates in the last step of lysine biosynthesis, the conversion of meso-2,6-diaminoheptanedioate to L-lysine. Proteins in this subfamily may function as PLP-dependent decarboxylases. Probab=42.38 E-value=23 Score=15.33 Aligned_cols=34 Identities=6% Similarity=0.075 Sum_probs=21.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCHHHEEECCCHHH Q ss_conf 5667898899999999987389688351242012 Q gi|254780504|r 151 KPIGSNLSSAQEIHAITRKIFKESQIFRIDHYLG 184 (492) Q Consensus 151 KPFG~Dl~SA~eLn~~l~~~f~E~qiyRIDHYLG 184 (492) -.||-+.+.+.++-+.+.+.-+-=++.-|--++| T Consensus 144 sKFGi~~~~~~~~l~~~~~~~~~l~~~GlH~HiG 177 (423) T cd06842 144 SRFGMPAAEVRTALERLAQLRERVRLVGFHFHLD 177 (423) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEECC T ss_conf 9998999999999999996399983689888648 No 36 >TIGR00565 trpE_proteo anthranilate synthase component I; InterPro: IPR005257 This family represents anthranilate/para-aminobenzoate synthase component I from proteobacteria and actinobacteria. This enzyme resembles some other chorismate-binding enzymes, including para-aminobenzoate synthase (pabB) and isochorismate synthase. There is a fairly deep split between two sets, seen in the pattern of gaps as well as in amino acid sequence differences. This group includes proteobacteria such as Escherichia coli and Helicobacter pylori but also the Gram-positive organism Corynebacterium glutamicum. The second group (IPR005256 from INTERPRO) includes eukaryotes, archaea, and most other bacterial lineages; sequences from the second group may resemble pabB more closely than other trpE from this group. ; GO: 0004049 anthranilate synthase activity, 0009058 biosynthetic process. Probab=42.19 E-value=13 Score=17.25 Aligned_cols=118 Identities=27% Similarity=0.380 Sum_probs=55.3 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCEEECCCCCHHHHHHHHHHHCCCCCCCEEEEE---------CC Q ss_conf 869988999999999987423467998999989851657864678977899888874057678469984---------06 Q gi|254780504|r 53 IKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIFYINLDVEKNYGWELLGKLLDIDKTRIRVFYL---------AM 123 (492) Q Consensus 53 ~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~Y~~~d~~~~~~y~~L~~~l~~~~~~~rifYL---------Ai 123 (492) ...+|+||...| .+|+++++.|++ ++--.+|-+|..+. +.-.+|-.|++ .-+.+=+||+ |= T Consensus 228 ~N~sD~EF~g~V-~~Lq~~I~~Gei----FQ~VpSRrFsl~Cp-~~LaaYy~LK~----~NPSPYMFy~~D~DFiLFGAS 297 (505) T TIGR00565 228 ANFSDEEFEGVV-KALQKEIKKGEI----FQVVPSRRFSLECP-EGLAAYYQLKK----SNPSPYMFYMKDEDFILFGAS 297 (505) T ss_pred CCCCCCHHHHHH-HHHHHHCCCCCE----EEEEECCEECCCCC-CHHHHHHHHHH----CCCCCCEEEEECCCEEEECCC T ss_conf 588961156799-999984215855----78840311003366-55479998751----485960323204753554687 Q ss_pred CHHHH--HHHHHHHHHHCCCCCCCEE---EEECCCCCCHHHHHHHHHHHHHHCCHHHEEECCCHHHHHHHHHHHHHHHC Q ss_conf 95686--6677777772587644357---76056678988999999999873896883512420125687777887747 Q gi|254780504|r 124 SSAFF--GKISQRIHANNLVTEHTRV---VLEKPIGSNLSSAQEIHAITRKIFKESQIFRIDHYLGKEAVQGLMVFRFA 197 (492) Q Consensus 124 PP~~f--~~i~~~L~~~~l~~~~~Ri---ViEKPFG~Dl~SA~eLn~~l~~~f~E~qiyRIDHYLGKe~vqnil~lRFa 197 (492) |-|.- .+..+.+.---. -++|. =-.=-+-+||+|=-||. -|.|| ||.-.-||-.=.| T Consensus 298 PESaLKY~a~~r~~EIYPI--AGtRpRG~da~GniD~dLDsr~Ele------------lr~Dh---KE~aEHlMLVDLA 359 (505) T TIGR00565 298 PESALKYDALSRQLEIYPI--AGTRPRGKDADGNIDRDLDSRLELE------------LRLDH---KERAEHLMLVDLA 359 (505) T ss_pred CCHHHHCCCCCCCEEEECC--CCCCCCCCCCCCCCCHHHHHHHHHH------------HHCCC---HHHHHHHHHHHHH T ss_conf 0012310312582777035--5788888778889784334466355------------30251---2345556677764 No 37 >COG0779 Uncharacterized protein conserved in bacteria [Function unknown] Probab=42.03 E-value=24 Score=15.29 Aligned_cols=31 Identities=13% Similarity=0.303 Sum_probs=26.9 Q ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHCCHHH Q ss_conf 3577605667898899999999987389688 Q gi|254780504|r 145 TRVVLEKPIGSNLSSAQEIHAITRKIFKESQ 175 (492) Q Consensus 145 ~RiViEKPFG~Dl~SA~eLn~~l~~~f~E~q 175 (492) .||.++||.|-+++-+.++.+.++..|+.+. T Consensus 39 lrI~id~~g~v~ldDC~~vSr~is~~LD~ed 69 (153) T COG0779 39 LRIYIDKEGGVTLDDCADVSRAISALLDVED 69 (153) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHCCCC T ss_conf 9999679999888999999999898736587 No 38 >TIGR02778 ligD_pol DNA polymerase LigD, polymerase domain; InterPro: IPR014145 DNA repair of double-stranded breaks by non-homologous end joining (NHEJ) is accomplished by a two-protein system that is present in a minority of prokaryotes. One component is the Ku protein (see IPR009187 from INTERPRO), which binds DNA ends. The other is a DNA ligase, a protein that is a multidomain polypeptide in most of those bacteria that have NheJ, a permuted polypeptide in Mycobacterium tuberculosis and a few other species, and the product of tandem genes in some other bacteria. This region represents the C-terminal polymerase domain.. Probab=40.90 E-value=23 Score=15.42 Aligned_cols=49 Identities=6% Similarity=0.032 Sum_probs=34.3 Q ss_pred EEEECCCCCCCEEEEEEEEECCCCCCCCCCCCCC--CCCEEEEEECCCCCEEEEEE Q ss_conf 7551334688517999998548864376665478--98569999717965044433 Q gi|254780504|r 337 YLRTGKYLAKHISEIVIALKPAPCTIFDHQMSKI--ETNKLILRLQNNGEIEQFIT 390 (492) Q Consensus 337 ~lrtGK~l~~k~~eI~i~FK~~p~~~f~~~~~~~--~~N~Lvi~iqP~~~i~l~~~ 390 (492) ++..=.+|-.=...=+|+|+..-+.+ ... +|.+|||.|.|.+++.+... T Consensus 86 ~~~d~~gL~~LvQ~g~lEfH~w~~~~-----~~~P~~PDriVFDLDPgp~~~~~~v 136 (251) T TIGR02778 86 VIEDLEGLLWLVQMGALEFHIWGARI-----DAPPEKPDRIVFDLDPGPGVAWKLV 136 (251) T ss_pred EEECHHHHHHHHHCCCEEEECCCCCC-----CCCCCCCCEEEEECCCCCHHHHHHH T ss_conf 77168999999872805771788878-----8872488668881588886579999 No 39 >pfam09861 DUF2088 Uncharacterized conserved protein (DUF2088). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=38.68 E-value=27 Score=14.94 Aligned_cols=115 Identities=13% Similarity=0.196 Sum_probs=56.1 Q ss_pred CCHHHH-HHHHHHHHHHCCCCCCCEEEEECCCC-CCHHHHHHHHHHHHH-HCCHHHEEECCCHH-HHHHHHHHHHHHHCC Q ss_conf 695686-66777777725876443577605667-898899999999987-38968835124201-256877778877477 Q gi|254780504|r 123 MSSAFF-GKISQRIHANNLVTEHTRVVLEKPIG-SNLSSAQEIHAITRK-IFKESQIFRIDHYL-GKEAVQGLMVFRFAN 198 (492) Q Consensus 123 iPP~~f-~~i~~~L~~~~l~~~~~RiViEKPFG-~Dl~SA~eLn~~l~~-~f~E~qiyRIDHYL-GKe~vqnil~lRFaN 198 (492) +|...+ +.+.+.|.+.|...+.-.||+ .-| |--.+-.|+...+.+ .+++-+ -|+|.= -++....+=..... T Consensus 68 ~p~~~il~~ll~~L~~~Gv~~~~I~iv~--A~G~Hr~~t~eE~~~~lG~~i~~~~~--v~~Hd~~d~~~lv~lG~T~~G- 142 (203) T pfam09861 68 TPSDIILPLLLEELNAVGIPDEDITILV--ATGTHRPMTEEELKKILGEEVVKRIR--IVNHDADDPDELVYLGTTSRG- 142 (203) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCEEEEE--ECCCCCCCCHHHHHHHHHHHHHCCEE--EEECCCCCHHHCEEECCCCCC- T ss_conf 8567559999999997599812589998--26879999989999986088663868--995799985783781207999- Q ss_pred CCCHHHH-CCCCEEEEEEEEEEC------CCC-HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5200252-803330799999846------770-2455553220017887655899 Q gi|254780504|r 199 TFYESLW-NNKYIDHVQITTAET------IGV-EDRVDYYNNTGALRDMIQNHLL 245 (492) Q Consensus 199 ~~~e~~W-Nr~~I~~VqIt~~E~------~gv-egR~~yyD~~GaiRDmvQNHll 245 (492) .|+| ||..+++=-+..--. .|- +||-..+=..+-.+-+..||.+ T Consensus 143 ---tpV~inr~~~eaD~~I~~G~I~pH~~aGfsGG~K~i~PGia~~~tI~~nH~~ 194 (203) T pfam09861 143 ---TPVYVNRLVAEADLVILTGFIEPHYFAGFSGGRKSILPGIASYETIMANHSL 194 (203) T ss_pred ---CEEEEEHHHHHCCEEEEECCCCCCCCCCCCCCCCEEEECCCCHHHHHHHHHH T ss_conf ---8899846786388689978723434566587663044054788999999876 No 40 >KOG4128 consensus Probab=38.38 E-value=23 Score=15.40 Aligned_cols=73 Identities=14% Similarity=0.291 Sum_probs=51.5 Q ss_pred EEEECCCCCC--HHHHHHHHHHHHHHCCHHHEEECCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEEECCCCH Q ss_conf 7760566789--88999999999873896883512420125687777887747752002528033307999998467702 Q gi|254780504|r 147 VVLEKPIGSN--LSSAQEIHAITRKIFKESQIFRIDHYLGKEAVQGLMVFRFANTFYESLWNNKYIDHVQITTAETIGVE 224 (492) Q Consensus 147 iViEKPFG~D--l~SA~eLn~~l~~~f~E~qiyRIDHYLGKe~vqnil~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gve 224 (492) ||=-|=||+. .+-.+..|..|.....|=- +|..|.+.+|-+ + T Consensus 160 viPKkcy~~sysT~atrkmN~iL~~KlREfa----------~~Lr~~~~~~~~--------------------------~ 203 (457) T KOG4128 160 VIPKKCYLHSYSTQATRKMNLILKSKLREFA----------SMLRAQFTFNGN--------------------------G 203 (457) T ss_pred CCCHHHCCCCCHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHCCCC--------------------------C T ss_conf 7847764542103677899999999999999----------999999852788--------------------------9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCH Q ss_conf 4555532200178876558999999987185445885 Q gi|254780504|r 225 DRVDYYNNTGALRDMIQNHLLQLLCLVAMEMPPSVEA 261 (492) Q Consensus 225 gR~~yyD~~GaiRDmvQNHllQlL~lvAMe~P~~~~~ 261 (492) +|. .-.++.||| |++-++|+..=+||..|+= T Consensus 204 ~~i-----~~t~~emm~-eiFrviciclg~PPe~FTW 234 (457) T KOG4128 204 CRI-----PDTIQEMMP-EIFRVICICLGEPPEVFTW 234 (457) T ss_pred CCC-----HHHHHHHHH-HHHHHHHHHCCCCCCEEEE T ss_conf 821-----279999999-9999974104889650147 No 41 >TIGR01915 npdG NADPH-dependent F420 reductase; InterPro: IPR010185 Members of this family are found only in species with evidence of coenzyme F420. All members of this family are believed to act as NADPH-dependent F420 reductase.. Probab=38.35 E-value=27 Score=14.91 Aligned_cols=50 Identities=30% Similarity=0.215 Sum_probs=29.5 Q ss_pred EEEECCCCH----HHHHHHHHHHHHHHHCCCCCCC-CEEEEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 899864435----6676577999999972898757-817999988869988999999999987 Q gi|254780504|r 13 FIIFGGTGD----LAKRKLFPALYNNYTNREENQI-SSRIIGICRIKMTVESYRSFVDKELQR 70 (492) Q Consensus 13 lVIFGATGD----LA~RKL~PALf~L~~~~~l~~~-~~~IiG~aR~~~~~e~fr~~v~~~l~~ 70 (492) +=|+||||| ||-| |..+|++|+- +-.||=-+|+.=.-++--....+.+.. T Consensus 3 IAvLGGTGdqG~GLALR--------lA~~glmPeG~~~~iIIGSR~~EkA~EaA~ka~e~l~~ 57 (233) T TIGR01915 3 IAVLGGTGDQGKGLALR--------LAKAGLMPEGVDNEIIIGSRDKEKAEEAAAKALEELGD 57 (233) T ss_pred EEEECCCCCHHHHHHHH--------HHHCCCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHC T ss_conf 78844888402569999--------98707787877755577048845699999999999970 No 42 >cd02964 TryX_like_family Tryparedoxin (TryX)-like family; composed of TryX and related proteins including nucleoredoxin (NRX), rod-derived cone viability factor (RdCVF) and the nematode homolog described as a 16-kD class of TRX. Most members of this family, except RdCVF, are protein disulfide oxidoreductases containing an active site CXXC motif, similar to TRX. Probab=38.10 E-value=27 Score=14.88 Aligned_cols=55 Identities=18% Similarity=0.237 Sum_probs=38.3 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHH Q ss_conf 87899864435667657799999997289875781799998886998899999999 Q gi|254780504|r 11 SDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDK 66 (492) Q Consensus 11 ~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~ 66 (492) ..+|-|+||===.=|+-+|.|=.+|..-.-...++.||++++.+ +.+++.+.+.+ T Consensus 19 ~vll~FWAsWC~pCr~e~P~L~~ly~~~~~~~~~~~vi~is~D~-~~~~~~~~~~~ 73 (132) T cd02964 19 TVGLYFSASWCPPCRAFTPKLVEFYEKLKEEGKNFEIVFVSRDR-SEESFNEYFSE 73 (132) T ss_pred EEEEEEECCCCHHHHHHCHHHHHHHHHHHHCCCCEEEEEEECCC-CHHHHHHHHHH T ss_conf 89999984568204310589999999987418868999976789-99999999996 No 43 >TIGR03346 chaperone_ClpB ATP-dependent chaperone ClpB. Members of this protein family are the bacterial ATP-dependent chaperone ClpB. This protein belongs to the AAA family, ATPases associated with various cellular activities (pfam00004). This molecular chaperone does not act as a protease, but rather serves to disaggregate misfolded and aggregated proteins. Probab=38.09 E-value=27 Score=14.88 Aligned_cols=31 Identities=35% Similarity=0.568 Sum_probs=22.0 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHH Q ss_conf 67657799999997289875781799998886998899999999 Q gi|254780504|r 23 AKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDK 66 (492) Q Consensus 23 A~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~ 66 (492) |...|=|||- .| .+++||.. +.++||.++.. T Consensus 290 AaNlLKPaLa----rG-----elr~IgAT----T~~EYrk~iEk 320 (852) T TIGR03346 290 AGNMLKPALA----RG-----ELHCIGAT----TLDEYRKYIEK 320 (852) T ss_pred HHHHHHHHHH----CC-----CCEEEEEC----CHHHHHHHHHC T ss_conf 6777437874----79-----85599827----89999988322 No 44 >COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only] Probab=37.97 E-value=18 Score=16.16 Aligned_cols=10 Identities=40% Similarity=0.793 Sum_probs=3.5 Q ss_pred CCCHHHHHHH Q ss_conf 2420125687 Q gi|254780504|r 179 IDHYLGKEAV 188 (492) Q Consensus 179 IDHYLGKe~v 188 (492) +--||||=.+ T Consensus 206 V~~yLgkiS~ 215 (297) T COG4785 206 VEFYLGKISE 215 (297) T ss_pred HHHHHHHCCH T ss_conf 9999704039 No 45 >TIGR02788 VirB11 P-type DNA transfer ATPase VirB11; InterPro: IPR014155 This entry contains VirB11, a protein that is found in the vir locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for DNA transfer . VirB11 is believed to be an ATPase and is a homologue of the P-like conjugation system TrbB protein and the Flp pilus system protein TadA .. Probab=37.70 E-value=15 Score=16.66 Aligned_cols=32 Identities=38% Similarity=0.614 Sum_probs=16.5 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEE Q ss_conf 878998644356676577999999972898757817999 Q gi|254780504|r 11 SDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIG 49 (492) Q Consensus 11 ~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG 49 (492) -|+||+||||= .|-=++=||- +.+|. +=|||. T Consensus 159 knIii~GGTgS-GKTTf~kal~-----~~IP~-~ER~iT 190 (328) T TIGR02788 159 KNIIISGGTGS-GKTTFLKALV-----KEIPK-DERLIT 190 (328) T ss_pred CEEEEEECCCC-HHHHHHHHHH-----HCCCC-CCCEEE T ss_conf 91999906897-1899999997-----32762-252788 No 46 >TIGR00873 gnd 6-phosphogluconate dehydrogenase, decarboxylating; InterPro: IPR006113 6-Phosphogluconate dehydrogenase (1.1.1.44 from EC) (6PGD) is an oxidative carboxylase that catalyses the decarboxylating reduction of 6-phosphogluconate into ribulose 5-phosphate in the presence of NADP. This reaction is a component of the hexose mono-phosphate shunt and pentose phosphate pathways (PPP) , . Prokaryotic and eukaryotic 6PGD are proteins of about 470 amino acids whose sequence are highly conserved . The protein is a homodimer in which the monomers act independently : each contains a large, mainly alpha-helical domain and a smaller beta-alpha-beta domain, containing a mixed parallel and anti-parallel 6-stranded beta sheet . NADP is bound in a cleft in the small domain, the substrate binding in an adjacent pocket . This model does not specify whether the cofactor is NADP only, NAD only, or both.; GO: 0004616 phosphogluconate dehydrogenase (decarboxylating) activity, 0050661 NADP binding, 0006098 pentose-phosphate shunt. Probab=37.05 E-value=20 Score=15.81 Aligned_cols=15 Identities=7% Similarity=-0.037 Sum_probs=6.1 Q ss_pred CEEEEEEECCCCCHH Q ss_conf 817999988869988 Q gi|254780504|r 44 SSRIIGICRIKMTVE 58 (492) Q Consensus 44 ~~~IiG~aR~~~~~e 58 (492) .|.|.-+-|+.--+| T Consensus 22 GF~V~VyNRt~~Ktd 36 (480) T TIGR00873 22 GFTVAVYNRTPEKTD 36 (480) T ss_pred CCCEEEEECCHHHHH T ss_conf 982799726847999 No 47 >PRK05653 fabG 3-ketoacyl-(acyl-carrier-protein) reductase; Validated Probab=36.62 E-value=28 Score=14.72 Aligned_cols=74 Identities=11% Similarity=0.107 Sum_probs=44.9 Q ss_pred CCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 98987899864435667657799999997289875781799998886998899999999998742346799899998985 Q gi|254780504|r 8 IHNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLS 87 (492) Q Consensus 8 ~~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~ 87 (492) +..-+.+|-||+|.+-+- +-..++ +.+..++.++|++-..++ +.+.+.. .-. T Consensus 3 L~~Kv~lITGgs~GIG~a-----~a~~la-----~~G~~V~~~~r~~~~l~~----~~~~~~~--------------~~~ 54 (246) T PRK05653 3 LQGKTALVTGASRGIGRA-----IALRLA-----ADGARVVIYDSNEEAAEA----LAEELRA--------------AGG 54 (246) T ss_pred CCCCEEEEECCCCHHHHH-----HHHHHH-----HCCCEEEEEECCHHHHHH----HHHHHHH--------------CCC T ss_conf 899889993897589999-----999999-----879999999799999999----9999996--------------599 Q ss_pred HCCEEECCCCCHHHHHHHHHHH Q ss_conf 1657864678977899888874 Q gi|254780504|r 88 LIFYINLDVEKNYGWELLGKLL 109 (492) Q Consensus 88 ~~~Y~~~d~~~~~~y~~L~~~l 109 (492) .+.++++|++++++.+.+-+.+ T Consensus 55 ~~~~~~~Dl~~~~~~~~~~~~~ 76 (246) T PRK05653 55 EAALLVFDVTDEAAVRALIEAA 76 (246) T ss_pred CEEEEEEECCCHHHHHHHHHHH T ss_conf 4899997289999999999999 No 48 >pfam07505 Gp37_Gp68 Phage protein Gp37/Gp68. Homologues of phage proteins Gp37 and Gp68 are found in several bacteria. Probab=36.43 E-value=18 Score=16.20 Aligned_cols=40 Identities=23% Similarity=0.414 Sum_probs=23.3 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCH-------HHHHHHHHHCCCCCCC Q ss_conf 84699998873489874579889971-------8899999981884477 Q gi|254780504|r 449 WSDSILKSWQIINQKVDYYAAGTWGP-------DQSDVLLQKDGRRWHN 490 (492) Q Consensus 449 ivdpiL~~~~~~~~~~~~Y~~GS~GP-------~~a~~l~~~~g~~W~~ 490 (492) ++..|.+.++...++.+.=.-|+||| +++-++| +|+.|.+ T Consensus 205 Wvr~irdQC~~agvpFfFKQwG~~~~~~~~~~kk~~Gr~l--dg~~~~~ 251 (253) T pfam07505 205 WVRDIRDQCTAAGVPFFFKQWGGHGKDLVGRGKKATGREL--DGRTWDE 251 (253) T ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCC--CCEECCC T ss_conf 9999999999809988998579998654434743445635--8806375 No 49 >CHL00194 ycf39 Ycf39; Provisional Probab=36.40 E-value=29 Score=14.70 Aligned_cols=105 Identities=10% Similarity=0.067 Sum_probs=53.6 Q ss_pred CEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCE Q ss_conf 78998644356676577999999972898757817999988869988999999999987423467998999989851657 Q gi|254780504|r 12 DFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIFY 91 (492) Q Consensus 12 ~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~Y 91 (492) ++.|+||||-+-+. +...| .. .+..+.+..|++-... .++ . ..+.. T Consensus 2 ~ILV~GATG~lGr~-vVr~L---l~------~G~~Vr~lvRnp~ka~--------~l~----~------------~Gve~ 47 (319) T CHL00194 2 SLLVIGATGTLGRQ-IVRRA---LD------EGYQVKCLVRNLRKAA--------FLK----E------------WGAEL 47 (319) T ss_pred EEEEECCCCHHHHH-HHHHH---HH------CCCCEEEEECCHHHHH--------HHH----H------------CCCEE T ss_conf 79998998589999-99999---96------8890899957867632--------342----1------------59679 Q ss_pred EECCCCCHHHHHHHHHHHCCCCCCCEEEEECCC-HH---HHHHH----HHHHHHHCCCCCCCEEEEECCCCCC Q ss_conf 864678977899888874057678469984069-56---86667----7777772587644357760566789 Q gi|254780504|r 92 INLDVEKNYGWELLGKLLDIDKTRIRVFYLAMS-SA---FFGKI----SQRIHANNLVTEHTRVVLEKPIGSN 156 (492) Q Consensus 92 ~~~d~~~~~~y~~L~~~l~~~~~~~rifYLAiP-P~---~f~~i----~~~L~~~~l~~~~~RiViEKPFG~D 156 (492) +.+|++++++.. ..++. ...++..+.. |. -+..+ ..++-+++...+=.|+|.==-.|-| T Consensus 48 v~gDl~dpesl~---~Al~G---vdaVi~~~~~~~~~~~~~~~vd~~g~~~li~AAk~aGVkr~V~lS~lga~ 114 (319) T CHL00194 48 VYGDLSLPETIP---PALEG---ITAIIDASTSRPSDLNNAYQIDLEGKLALIEAAKAAKVKRFIFFSILNAE 114 (319) T ss_pred EEECCCCHHHHH---HHHCC---CCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 994278877899---99659---96799945667788620889889889999999998499889996135666 No 50 >pfam08877 MepB MepB protein. MepB is a functionally uncharacterized protein in the mepRAB gene cluster of Staphylococcus aureus. Probab=36.15 E-value=29 Score=14.67 Aligned_cols=71 Identities=14% Similarity=0.049 Sum_probs=51.1 Q ss_pred CCEEEEEEEECCCCCCCCEEEEEECCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEEEEC Q ss_conf 7438999997085867857775513346885179999985488643766654789856999971796504443312 Q gi|254780504|r 317 ETFVAIKANIDNPRWSGIPFYLRTGKYLAKHISEIVIALKPAPCTIFDHQMSKIETNKLILRLQNNGEIEQFITTK 392 (492) Q Consensus 317 eTfaa~~l~idn~RW~gVPF~lrtGK~l~~k~~eI~i~FK~~p~~~f~~~~~~~~~N~Lvi~iqP~~~i~l~~~~K 392 (492) .-|.|+.+.+++.+ +..|.+|.-++|..+-+..-|+.+.+.-..-...-.++.|||.+.-+...-+++--| T Consensus 5 ~eY~a~~f~l~~~~-----i~~R~AK~TP~K~G~FVt~Wkr~~~g~~~Pf~~~d~~d~lvI~v~d~~~~G~FiFpk 75 (123) T pfam08877 5 SDYEAGLFQLNGHT-----IRFRLAKKTPTKPGYFVAFWEKDENGQNQPFDYDDAPDLLIIVVIDDNRRGQFIFPK 75 (123) T ss_pred CCCCEEEEEECCEE-----EEEEECCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEECCCEEEEEEEEH T ss_conf 62324999999979-----999842468998506999998789998478743347897999997289067799039 No 51 >KOG3437 consensus Probab=35.98 E-value=27 Score=14.87 Aligned_cols=23 Identities=13% Similarity=0.107 Sum_probs=12.0 Q ss_pred EEECCCCCCCEEEEEEEEECCCC Q ss_conf 55133468851799999854886 Q gi|254780504|r 338 LRTGKYLAKHISEIVIALKPAPC 360 (492) Q Consensus 338 lrtGK~l~~k~~eI~i~FK~~p~ 360 (492) +|+|-+.++=.-.-.++|+.|.+ T Consensus 93 I~~G~g~~dl~~~~~~el~ep~G 115 (184) T KOG3437 93 IRAGNGFNDLWEIQSVELVEPKG 115 (184) T ss_pred EEECCCHHHEEEEEEEEEECCCC T ss_conf 99169854304356789856773 No 52 >cd01067 globin_like superfamily containing globins and truncated hemoglobins Probab=35.80 E-value=29 Score=14.63 Aligned_cols=34 Identities=15% Similarity=0.172 Sum_probs=26.2 Q ss_pred CHHHHHHHHHHHHHHHCCCCC-CCHHHHHHHHHHC Q ss_conf 988999999999987423467-9989999898516 Q gi|254780504|r 56 TVESYRSFVDKELQRYLKDGE-YNPLKVQKFLSLI 89 (492) Q Consensus 56 ~~e~fr~~v~~~l~~~~~~~~-~~~~~~~~F~~~~ 89 (492) ++++|+..+++-+..|.+.+- .+++.|.+|++.+ T Consensus 56 dd~~F~a~~rel~~rH~~~~v~l~~~~w~~Fw~~~ 90 (117) T cd01067 56 DDREFNAKTRELASRHKRDHVHMPPEVFTAFWKLL 90 (117) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 51989999999998988635779989999999999 No 53 >COG1429 CobN Cobalamin biosynthesis protein CobN and related Mg-chelatases [Coenzyme metabolism] Probab=33.53 E-value=31 Score=14.50 Aligned_cols=48 Identities=21% Similarity=0.380 Sum_probs=28.8 Q ss_pred CHHHHCCC-CEEEEEEEEEECCCCHHHH-HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00252803-3307999998467702455-5532200178876558999999 Q gi|254780504|r 201 YESLWNNK-YIDHVQITTAETIGVEDRV-DYYNNTGALRDMIQNHLLQLLC 249 (492) Q Consensus 201 ~e~~WNr~-~I~~VqIt~~E~~gvegR~-~yyD~~GaiRDmvQNHllQlL~ 249 (492) .+|+|+.. .|.-|++.-.|++| --|. --.-.+|+.||++.|-+--|-- T Consensus 919 v~Pvwd~~grV~gveviPl~eL~-RPRIDV~v~~SG~fRD~fp~~i~lld~ 968 (1388) T COG1429 919 VRPVWDAGGRVTGVEVIPLEELG-RPRIDVVVRISGLFRDAFPNQIELLDE 968 (1388) T ss_pred CEEEECCCCCCCCEEECCHHHCC-CCCEEEEEEEECCCCCCCHHHHHHHHH T ss_conf 70540589866224725888929-984138998401120014789999999 No 54 >pfam08659 KR KR domain. This enzymatic domain is part of bacterial polyketide synthases and catalyses the first step in the reductive modification of the beta-carbonyl centres in the growing polyketide chain. It uses NADPH to reduce the keto group to a hydroxy group. Probab=33.29 E-value=32 Score=14.36 Aligned_cols=73 Identities=14% Similarity=0.164 Sum_probs=46.5 Q ss_pred CEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCE-EEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCC Q ss_conf 7899864435667657799999997289875781-799998886998899999999998742346799899998985165 Q gi|254780504|r 12 DFIIFGGTGDLAKRKLFPALYNNYTNREENQISS-RIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIF 90 (492) Q Consensus 12 ~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~-~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~ 90 (492) |.+|-||++.+-+- +-..+.. ... .++-++|++-..++..+.+.+ +.. .-.++. T Consensus 2 tvlITGas~GIG~a-----ia~~la~-----~Ga~~vvl~~r~~~~~~~~~~~~~~-l~~--------------~g~~~~ 56 (181) T pfam08659 2 TYLVTGGLGGLGLE-----LARWLAE-----RGARHLVLLSRSGAPDPEAEALLAE-LEA--------------RGAEVT 56 (181) T ss_pred EEEEECCCCHHHHH-----HHHHHHH-----CCCCEEEEEECCCCCCHHHHHHHHH-HHH--------------CCCEEE T ss_conf 89996878789999-----9999998-----7997899986897662999999999-996--------------599699 Q ss_pred EEECCCCCHHHHHHHHHHH Q ss_conf 7864678977899888874 Q gi|254780504|r 91 YINLDVEKNYGWELLGKLL 109 (492) Q Consensus 91 Y~~~d~~~~~~y~~L~~~l 109 (492) ++++|++++++.+.+-+.+ T Consensus 57 ~~~~Dv~d~~~v~~~~~~~ 75 (181) T pfam08659 57 VVACDVSDRDAVAALLAEI 75 (181) T ss_pred EEECCCCCHHHHHHHHHHH T ss_conf 9975689999999888657 No 55 >pfam05320 Pox_RNA_Pol_19 Poxvirus DNA-directed RNA polymerase 19 kDa subunit. This family contains several DNA-directed RNA polymerase 19 kDa polypeptides. The Poxvirus DNA-directed RNA polymerase (EC: 2.7.7.6) catalyses DNA-template-directed extension of the 3'-end of an RNA strand by one nucleotide at a time. Probab=32.73 E-value=33 Score=14.30 Aligned_cols=41 Identities=20% Similarity=0.285 Sum_probs=18.2 Q ss_pred EECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 98644356676577999999972898757817999988869988999999999987 Q gi|254780504|r 15 IFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQR 70 (492) Q Consensus 15 IFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~ 70 (492) +|--||=+|-- |+|++.|+||-. .+++++.|++.+..-+-+ T Consensus 82 LfEiTGIiaES------yNLLqRGRlPLv---------~dlsdeT~k~n~l~vvi~ 122 (167) T pfam05320 82 LFEITGIIAES------YNLLQRGRLPLV---------SDLSDETFKDNILHVVIR 122 (167) T ss_pred HHHHHHHHHHH------HHHHHCCCCCCC---------CCCCHHHHHHHHHHHHHH T ss_conf 78877899988------889873756321---------445567888769999999 No 56 >TIGR03159 cas_Csc1 CRISPR-associated protein Csc1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) is a widespread family of prokaryotic direct repeats with spacers of unique sequence between consecutive repeats. This protein family is a CRISPR-associated (Cas) family strictly associated with the Cyano subtype of CRISPR/Cas locus, found in several species of Cyanobacteria and several archaeal species. This family is designated Csc1 for CRISPR/Cas Subtype Cyano protein 1, as it is often the first gene upstream of the core cas genes, cas3-cas4-cas1-cas2. Probab=32.59 E-value=33 Score=14.28 Aligned_cols=47 Identities=15% Similarity=0.162 Sum_probs=27.5 Q ss_pred CCCCCCCCCCCC-CCCCCEEEEEEEECCCCCCCCEEEEEECCCCCCCEEEEEE Q ss_conf 334101479886-6487438999997085867857775513346885179999 Q gi|254780504|r 302 VKGYLEEIPLGI-SDTETFVAIKANIDNPRWSGIPFYLRTGKYLAKHISEIVI 353 (492) Q Consensus 302 v~gY~~e~~v~~-S~TeTfaa~~l~idn~RW~gVPF~lrtGK~l~~k~~eI~i 353 (492) .|.|--+..+++ |.=+||+.. .-.=...|-|||-||.+.+-+.++.+ T Consensus 112 ~p~fgr~k~ia~gn~F~~yv~~-----~~~~~~LP~yIRLGK~~sK~~V~~~~ 159 (223) T TIGR03159 112 YPNFGRAKEIAPGNKFYFYVFT-----DQGEEKLPVYIRLGKKRSKAKVTVKE 159 (223) T ss_pred CCCCCEEEEECCCCEEEEEEEC-----CCCCCCCCCEEEECCEEEEEEEEEEE T ss_conf 8864412562468889999955-----88756776216733601301899998 No 57 >pfam08644 SPT16 FACT complex subunit (SPT16/CDC68). Proteins in this family are subunits the FACT complex. The FACT complex plays a role in transcription initiation and promotes binding of TATA-binding protein (TBP) to a TATA box in chromatin. Probab=32.30 E-value=33 Score=14.25 Aligned_cols=27 Identities=22% Similarity=0.331 Sum_probs=21.1 Q ss_pred CCEEEEEECCCCCCC----EEEEEEEEECCC Q ss_conf 857775513346885----179999985488 Q gi|254780504|r 333 GIPFYLRTGKYLAKH----ISEIVIALKPAP 359 (492) Q Consensus 333 gVPF~lrtGK~l~~k----~~eI~i~FK~~p 359 (492) =|||.+.|=|....+ .++++|.|..|. T Consensus 18 ~VPFHIstIKNvs~~~Eg~~t~lRINF~~Pg 48 (152) T pfam08644 18 PVPFHISTIKNVSKSDEGNYTYLRINFNSPG 48 (152) T ss_pred CCCEEEEEEEEEEEECCCCEEEEEEEECCCC T ss_conf 1146857476007604698189999964898 No 58 >TIGR00763 lon ATP-dependent protease La; InterPro: IPR004815 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This signature defines the bacterial and eukaryotic lon proteases, which are ATP-dependent serine peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SF). This family of sequences does not include the archaeal lon homologs, IPR004663 from INTERPRO. In the eukaryotes the majority of the proteins are located in the mitochondrial matrix , . In yeast, Pim1, is located in the mitochondrial matrix, is required for mitochondrial function, is constitutively expressed but is increased after thermal stress, suggesting that Pim1 may play a role in the heat shock response .; GO: 0004176 ATP-dependent peptidase activity, 0005524 ATP binding, 0006510 ATP-dependent proteolysis. Probab=29.78 E-value=25 Score=15.08 Aligned_cols=57 Identities=19% Similarity=0.375 Sum_probs=38.9 Q ss_pred CCHHHHHHHHHHHHHHCCHHH--EEECCCHHHHHH-HHHHH--HHHHCC---CCCHHHHCCCCEEEEEEE Q ss_conf 898899999999987389688--351242012568-77778--877477---520025280333079999 Q gi|254780504|r 155 SNLSSAQEIHAITRKIFKESQ--IFRIDHYLGKEA-VQGLM--VFRFAN---TFYESLWNNKYIDHVQIT 216 (492) Q Consensus 155 ~Dl~SA~eLn~~l~~~f~E~q--iyRIDHYLGKe~-vqnil--~lRFaN---~~~e~~WNr~~I~~VqIt 216 (492) .|..- .=-++|.+++|.|| -|+ ||||.=+- +-+.| -+==|| +|-+||.+| -+=|+|+ T Consensus 533 ~~~~G--DPaSALLEvLDPEQN~~F~-DHYldvp~DLS~V~CyFi~TAN~~d~IP~PLLDR--MEvI~ls 597 (941) T TIGR00763 533 SSFRG--DPASALLEVLDPEQNNAFS-DHYLDVPFDLSKVLCYFIATANSIDTIPRPLLDR--MEVIELS 597 (941) T ss_pred CCCCC--CHHHHHHHHCCHHHCCCCC-CCCCCCCCCHHHHHHHEEECCCCCCCCCCCCCCC--EEEEECC T ss_conf 88655--6378886412864360425-5300234004200210002447576777221374--0245238 No 59 >KOG4278 consensus Probab=29.59 E-value=26 Score=15.06 Aligned_cols=57 Identities=23% Similarity=0.441 Sum_probs=39.1 Q ss_pred CCCHHHHCCCCEEEEEEEEEEC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 5200252803330799999846-77024555532200178876558999999987185445 Q gi|254780504|r 199 TFYESLWNNKYIDHVQITTAET-IGVEDRVDYYNNTGALRDMIQNHLLQLLCLVAMEMPPS 258 (492) Q Consensus 199 ~~~e~~WNr~~I~~VqIt~~E~-~gvegR~~yyD~~GaiRDmvQNHllQlL~lvAMe~P~~ 258 (492) -++|.+|-+....----|++|+ .-|| .|.-.+-+++|+-.-.|.|||....-|||-- T Consensus 282 eVYeGvWKkyslTvAVKtLKEDtMeve---EFLkEAAvMKeikHpNLVqLLGVCT~EpPFY 339 (1157) T KOG4278 282 EVYEGVWKKYSLTVAVKTLKEDTMEVE---EFLKEAAVMKEIKHPNLVQLLGVCTHEPPFY 339 (1157) T ss_pred CEEEEEEECCEEEEEHHHHHHHHHHHH---HHHHHHHHHHHHCCCCHHHHHHHHCCCCCEE T ss_conf 022322211204532333310005699---9999989887604810877750112689707 No 60 >KOG0722 consensus Probab=29.54 E-value=37 Score=13.93 Aligned_cols=131 Identities=17% Similarity=0.201 Sum_probs=67.8 Q ss_pred HHHHHHHHHHHHHHHC----CCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCEEECCCC Q ss_conf 6676577999999972----898757817999988869988999999999987423467998999989851657864678 Q gi|254780504|r 22 LAKRKLFPALYNNYTN----REENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIFYINLDVE 97 (492) Q Consensus 22 LA~RKL~PALf~L~~~----~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~Y~~~d~~ 97 (492) |.---|+|+|+--+.. |.-+ -..++|++|+. +..+...--++--.+|-++-..+++.-+.|... T Consensus 11 Lvl~~Llp~l~vgl~egLYCG~en--CYdVLgV~Rea-~KseIakAYRqLARrhHPDr~r~~e~k~~F~~i--------- 78 (329) T KOG0722 11 LVLILLLPSLFVGLSEGLYCGAEN--CYDVLGVAREA-NKSEIAKAYRQLARRHHPDRNRDPESKKLFVKI--------- 78 (329) T ss_pred HHHHHHHHHHHHHHHHHHCCCCHH--HHHHHHHHHHC-CHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH--------- T ss_conf 999999999998643211046064--89996111321-678999999999987397656992243356422--------- Q ss_pred CHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCE-EEEECCCCCC----HHHHHHHHHHHH Q ss_conf 9778998888740576784699840695686667777777258764435-7760566789----889999999998 Q gi|254780504|r 98 KNYGWELLGKLLDIDKTRIRVFYLAMSSAFFGKISQRIHANNLVTEHTR-VVLEKPIGSN----LSSAQEIHAITR 168 (492) Q Consensus 98 ~~~~y~~L~~~l~~~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~~~~R-iViEKPFG~D----l~SA~eLn~~l~ 168 (492) ...|+.|..- ....+-.++|+-|-..|....+-+...+...-..| |+|--|-|-+ +.+...+|+.+. T Consensus 79 -Atayeilkd~---e~rt~ydyaldhpd~~fynyyqyyr~r~apkvd~raviVGvl~i~s~Fqyls~~ary~eAI~ 150 (329) T KOG0722 79 -ATAYEILKDN---ETRTQYDYALDHPDEVFYNYYQYYRARYAPKVDPRAVIVGVLVILSAFQYLSNVARYNEAIA 150 (329) T ss_pred -HCCCCCCCCH---HHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHH T ss_conf -0132326650---66776788861826899999999999845336875799840534107887788999999999 No 61 >PRK07452 DNA polymerase III subunit delta; Validated Probab=29.52 E-value=37 Score=13.93 Aligned_cols=13 Identities=8% Similarity=0.248 Sum_probs=6.1 Q ss_pred CCCCCCEEEECCC Q ss_conf 8989878998644 Q gi|254780504|r 7 NIHNSDFIIFGGT 19 (492) Q Consensus 7 ~~~~~~lVIFGAT 19 (492) ....++++.|-|. T Consensus 29 ~~~~fn~~~~dg~ 41 (326) T PRK07452 29 EWKSFNYSRLDGD 41 (326) T ss_pred CCCCCCEEEECCC T ss_conf 4467623787375 No 62 >TIGR02776 NHEJ_ligase_prk DNA ligase D; InterPro: IPR014143 Members of this entry are DNA ligases involved in the repair of DNA double-stranded breaks by non-homologous end joining (NheJ). The system of the bacterial Ku protein (IPR009187 from INTERPRO) plus this DNA ligase is seen in about 200f bacterial genomes to date and at least one archaeon (Archeoglobus fulgidus). This entry describes a central and C-terminal domain. These two domains may be permuted, as in genus Mycobacterium, or divided into tandem ORFs. An additional N-terminal 3 -phosphoesterase (PE) domain (IPR014144 from INTERPRO) is present in some members of this ligase. Most examples of genes for this ligase are adjacent to the gene for Ku.. Probab=29.11 E-value=34 Score=14.21 Aligned_cols=159 Identities=13% Similarity=0.143 Sum_probs=76.7 Q ss_pred HHHHHHCCEEECCCCCHHHH-----------HHHHHHHCCCCC--------CCEEE-EECCCH---HHHHHHHHH-H--- Q ss_conf 98985165786467897789-----------988887405767--------84699-840695---686667777-7--- Q gi|254780504|r 83 QKFLSLIFYINLDVEKNYGW-----------ELLGKLLDIDKT--------RIRVF-YLAMSS---AFFGKISQR-I--- 135 (492) Q Consensus 83 ~~F~~~~~Y~~~d~~~~~~y-----------~~L~~~l~~~~~--------~~rif-YLAiPP---~~f~~i~~~-L--- 135 (492) +.-...|.|+..|+-.-++- +.|.++|+.... .-++. ++.-+. .+...+|+. | T Consensus 96 ~~~~~~~~~~aFDLl~l~G~dL~~lPL~~RK~~L~~LL~~~~~~~~~~~~~~~~~~r~~~~~~~~~~ll~~a~~~~lEGv 175 (645) T TIGR02776 96 EGASEPLLYYAFDLLFLSGEDLRALPLEERKKRLKELLKAQDGPAEPDPAIRYSDHRVFEEDGDGQALLESACRLGLEGV 175 (645) T ss_pred HHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEECCCCCCCCHHHHHHHHHHHHCCCEE T ss_conf 72189808999715668806778624899999999999844687567741200001100685158999999998579738 Q ss_pred -H-------HHC-CCCCCC--------EEEEECCCCCCHHH-HHH-HHHHHHHHCC---HHH---EEECCCHHHHHHHHH Q ss_conf -7-------725-876443--------57760566789889-999-9999987389---688---351242012568777 Q gi|254780504|r 136 -H-------ANN-LVTEHT--------RVVLEKPIGSNLSS-AQE-IHAITRKIFK---ESQ---IFRIDHYLGKEAVQG 190 (492) Q Consensus 136 -~-------~~~-l~~~~~--------RiViEKPFG~Dl~S-A~e-Ln~~l~~~f~---E~q---iyRIDHYLGKe~vqn 190 (492) + ..| .+.+|. -+|| =|++--. |+. |-..+-.+++ +.| +=++=|=++.++++. T Consensus 176 v~Kr~~s~Y~sgkRs~~W~K~Kc~~~~~fvi---~GY~~~~gar~gfg~L~~gv~~~~d~G~L~y~G~VgtGF~~~~~~~ 252 (645) T TIGR02776 176 VSKRLDSPYRSGKRSKDWLKLKCRRRQEFVI---TGYTPPNGARRGFGALLVGVYEQEDGGQLRYAGKVGTGFGADTLKT 252 (645) T ss_pred EEECCCCCCCCCCCCCCHHHHHHCCCCCEEE---EEECCCHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHH T ss_conf 9854777301677686024112026863699---8652874568999988776514688894278886027888899999 Q ss_pred HHH----HHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 788----7747752002528033307999998467702455553220017887655899 Q gi|254780504|r 191 LMV----FRFANTFYESLWNNKYIDHVQITTAETIGVEDRVDYYNNTGALRDMIQNHLL 245 (492) Q Consensus 191 il~----lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~yyD~~GaiRDmvQNHll 245 (492) |+. ++-...=|+..--+.-...|+-.-.+=..-=.=++ ...-|+||-.+=-+|- T Consensus 253 l~~~L~~~~~~~sPf~~~~~~~~~~g~~wv~P~l~~ev~y~~-~t~dg~lR~a~F~GLR 310 (645) T TIGR02776 253 LLARLKALGAKASPFAGKPAGAKKRGVHWVRPSLVAEVEYAG-ITRDGILRQASFKGLR 310 (645) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCEEEECCEEEEEEEEEC-CCCCCEEEHHHHHCHH T ss_conf 999999874307787788887767745885630188774406-3668815125021200 No 63 >COG4884 Uncharacterized protein conserved in bacteria [Function unknown] Probab=29.04 E-value=19 Score=15.96 Aligned_cols=67 Identities=30% Similarity=0.572 Sum_probs=34.9 Q ss_pred ECCCCCCHHHHHHHHHHHHHHCCHHHEEECCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHHH Q ss_conf 05667898899999999987389688351242012568777788774775200252803330799999846770245555 Q gi|254780504|r 150 EKPIGSNLSSAQEIHAITRKIFKESQIFRIDHYLGKEAVQGLMVFRFANTFYESLWNNKYIDHVQITTAETIGVEDRVDY 229 (492) Q Consensus 150 EKPFG~Dl~SA~eLn~~l~~~f~E~qiyRIDHYLGKe~vqnil~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~y 229 (492) |-|||.| +.+..|+. ...|+-|+.-.|+ +-|---+.|.+|.-.||.- .+.+.-..|..+ T Consensus 39 esPFGsd-eG~d~L~~-------------Lee~~~kk~dld~--adFPk~~iEt~w~mkYi~~-----d~~lee~~r~q~ 97 (176) T COG4884 39 ESPFGSD-EGMDVLHR-------------LEEYFSKKSDLDL--ADFPKYMIETVWHMKYIPP-----DAKLEEVDRLQQ 97 (176) T ss_pred CCCCCCC-CHHHHHHH-------------HHHHHHCCCCCCH--HHHHHHHHHHHHCCCCCCC-----HHHHHHHHHHHH T ss_conf 6998762-05789999-------------9998721657885--5607999999960533584-----366778889999 Q ss_pred --HHHHHHHH Q ss_conf --32200178 Q gi|254780504|r 230 --YNNTGALR 237 (492) Q Consensus 230 --yD~~GaiR 237 (492) -..-|++| T Consensus 98 ~~l~~e~~~~ 107 (176) T COG4884 98 LLLEYEGALR 107 (176) T ss_pred HHHHCCCHHH T ss_conf 8761223011 No 64 >COG3240 Phospholipase/lecithinase/hemolysin [Lipid metabolism / General function prediction only] Probab=28.71 E-value=34 Score=14.19 Aligned_cols=40 Identities=33% Similarity=0.355 Sum_probs=32.6 Q ss_pred HHHHHHHHHHCCHHH--EEECCCH-HHHHHHHHHHHHHHCCCC Q ss_conf 999999987389688--3512420-125687777887747752 Q gi|254780504|r 161 QEIHAITRKIFKESQ--IFRIDHY-LGKEAVQGLMVFRFANTF 200 (492) Q Consensus 161 ~eLn~~l~~~f~E~q--iyRIDHY-LGKe~vqnil~lRFaN~~ 200 (492) ..+|..|.+...+-+ |-|||+| |-||+..|=-.|-|+|.- T Consensus 240 ~~~Na~L~~~L~~~g~nIi~iD~~~llk~im~nPa~fGlant~ 282 (370) T COG3240 240 IAFNASLTSQLEQLGGNIIRIDTYTLLKEIMTNPAEFGLANTT 282 (370) T ss_pred HHHHHHHHHHHHHHCCCEEEEEHHHHHHHHHHCHHHCCCCCCC T ss_conf 9987889999997248679967489999998498863845687 No 65 >TIGR01085 murE UDP-N-acetylmuramyl-tripeptide synthetases; InterPro: IPR005761 The bacterial cell wall provides strength and rigidity to counteract internal osmotic pressure, and protection against the environment. The peptidoglycan layer gives the cell wall its strength, and helps maintain the overall shape of the cell. The basic peptidoglycan structure of both Gram-positive and Gram-negative bacteria is comprised of a sheet of glycan chains connected by short cross-linking polypeptides. Biosynthesis of peptidoglycan is a multi-step (11-12 steps) process comprising three main stages: (1) formation of UDP-N-acetylmuramic acid (UDPMurNAc) from N-acetylglucosamine (GlcNAc). (2) addition of a short polypeptide chain to the UDPMurNAc. (3) addition of a second GlcNAc to the disaccharide-pentapeptide building block and transport of this unit through the cytoplasmic membrane and incorporation into the growing peptidoglycan layer. Stage two involves four key Mur ligase enzymes: MurC (6.3.2.8 from EC) , MurD (6.3.2.9 from EC) , MurE (6.3.2.13 from EC) and MurF (6.3.2.10 from EC) . These four Mur ligases are responsible for the successive additions of L-alanine, D-glutamate, meso-diaminopimelate or L-lysine, and D-alanyl-D-alanine to UDP-N-acetylmuramic acid. All four Mur ligases are topologically similar to one another, even though they display low sequence identity. They are each composed of three domains: an N-terminal Rossmann-fold domain responsible for binding the UDPMurNAc substrate; a central domain (similar to ATP-binding domains of several ATPases and GTPases); and a C-terminal domain (similar to dihydrofolate reductase fold) that appears to be associated with binding the incoming amino acid. The conserved sequence motifs found in the four Mur enzymes also map to other members of the Mur ligase family, including folylpolyglutamate synthetase, cyanophycin synthetase and the capB enzyme from Bacillales . This entry represents DP-N-acetylmuramoylalanyl-D-glutamate-2,6-diaminopimelate ligases (MurE; 6.3.2.13 from EC). An exception is found with Staphylococcus aureus, in which diaminopimelate is replaced by lysine in the peptidoglycan and MurE is (6.3.2.7 from EC). The Mycobacteria, part of the closest neighbouring branch outside of the low-GC Gram-positive bacteria, use diaminopimelate.; GO: 0005524 ATP binding, 0016881 acid-amino acid ligase activity, 0008360 regulation of cell shape, 0009252 peptidoglycan biosynthetic process, 0009273 peptidoglycan-based cell wall biogenesis, 0051301 cell division, 0005737 cytoplasm. Probab=27.39 E-value=27 Score=14.85 Aligned_cols=15 Identities=13% Similarity=0.308 Sum_probs=10.5 Q ss_pred CCCCCEEEEEEECCC Q ss_conf 875781799998886 Q gi|254780504|r 40 ENQISSRIIGICRIK 54 (492) Q Consensus 40 l~~~~~~IiG~aR~~ 54 (492) .|.....+||+.=++ T Consensus 83 ~Ps~~l~viGvTGTN 97 (494) T TIGR01085 83 HPSKKLKVIGVTGTN 97 (494) T ss_pred CCCCCEEEEEEEECC T ss_conf 855251689997128 No 66 >pfam07993 NAD_binding_4 Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included. Probab=27.20 E-value=40 Score=13.65 Aligned_cols=77 Identities=25% Similarity=0.187 Sum_probs=45.9 Q ss_pred ECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCEEECC Q ss_conf 86443566765779999999728987578179999888699889999999999874234679989999898516578646 Q gi|254780504|r 16 FGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIFYINLD 95 (492) Q Consensus 16 FGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~Y~~~d 95 (492) =||||=|-+.-|- +|...+ + ..+|++.-|.+-+ +.-++.+.+.+..+... + .....+++.++.+| T Consensus 2 TGaTGFlG~~ll~----~Ll~~~---~-~~~V~~LvR~~~~-~~~~~r~~~~~~~~~~~---~---~~~~~~ri~~v~gD 66 (245) T pfam07993 2 TGATGFLGKVLLE----KLLRSC---P-EVKIYCLVRAKDG-ESALERLRQELLKYGLF---D---RLKALERIIPVAGD 66 (245) T ss_pred CCCHHHHHHHHHH----HHHHCC---C-CCEEEEEECCCCC-CCHHHHHHHHHHHCCCC---C---HHHCCCCEEEEECC T ss_conf 3843599999999----999579---9-9789999678984-05899999999856753---1---01034777999561 Q ss_pred CCC------HHHHHHHHH Q ss_conf 789------778998888 Q gi|254780504|r 96 VEK------NYGWELLGK 107 (492) Q Consensus 96 ~~~------~~~y~~L~~ 107 (492) +++ .+.|+.|.+ T Consensus 67 l~~~~lGL~~~~~~~l~~ 84 (245) T pfam07993 67 LSEPNLGLSDEDFQELAE 84 (245) T ss_pred CCCCCCCCCHHHHHHHHH T ss_conf 688657989999999983 No 67 >TIGR02088 LEU3_arch isopropylmalate/isohomocitrate dehydrogenases; InterPro: IPR011828 This entry represents a group of archaeal decarboxylating dehydrogenases which include the leucine biosynthesis enzyme 3-isopropylmalate dehydrogenase (LeuB, LEU3) and the methanogenic cofactor CoB biosynthesis enzyme isohomocitrate dehydrogenase (AksF). Both of these have been characterised in Methanococcus janaschii . Sequences from the non-methanogenic archaea are also represented in this entry, presumably LeuB, although phylogenetic trees cannot establish which gene is which in the methanogens.; GO: 0003862 3-isopropylmalate dehydrogenase activity, 0051287 NAD binding, 0009098 leucine biosynthetic process. Probab=27.17 E-value=31 Score=14.48 Aligned_cols=15 Identities=33% Similarity=0.612 Sum_probs=8.0 Q ss_pred CCCEEEECCCCHHHH Q ss_conf 987899864435667 Q gi|254780504|r 10 NSDFIIFGGTGDLAK 24 (492) Q Consensus 10 ~~~lVIFGATGDLA~ 24 (492) -++.|+|||+|.-+. T Consensus 61 k~DaiLFGA~t~~~~ 75 (350) T TIGR02088 61 KADAILFGAVTETAA 75 (350) T ss_pred HCCEEEECCCCCCCC T ss_conf 279475378788887 No 68 >pfam02670 DXP_reductoisom 1-deoxy-D-xylulose 5-phosphate reductoisomerase. This is a family of 1-deoxy-D-xylulose 5-phosphate reductoisomerases. This enzyme catalyses the formation of 2-C-methyl-D-erythritol 4-phosphate from 1-deoxy-D-xylulose-5-phosphate in the presence of NADPH. This reaction is part of the terpenoid biosynthesis pathway. Probab=27.06 E-value=40 Score=13.64 Aligned_cols=66 Identities=14% Similarity=0.269 Sum_probs=34.3 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHC Q ss_conf 89986443566765779999999728987578179999888699889999999999874234679989999898516 Q gi|254780504|r 13 FIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLI 89 (492) Q Consensus 13 lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~ 89 (492) ++|||+||---+-- | ... ...|+ .|+|+|.+- .-+.+...+.+++.--++.- -.++...+.+...+ T Consensus 1 I~IlGsTGSIG~~t----L-~Vi--~~~~~-~f~v~~Lsa-~~N~~~L~~q~~~f~p~~v~--i~~~~~~~~l~~~~ 66 (129) T pfam02670 1 ITILGSTGSIGTQT----L-DVI--RRNPD-RFEVVALSA-GRNVELLAEQIKEFKPKYVA--VADEEAAEELKEAL 66 (129) T ss_pred CEEECCCCHHHHHH----H-HHH--HHCCC-CEEEEEEEE-CCCHHHHHHHHHHCCCCEEE--ECCHHHHHHHHHHC T ss_conf 98976786889999----9-999--95956-718999983-47899999999973997999--95899999999863 No 69 >COG0542 clpA ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones] Probab=26.94 E-value=41 Score=13.62 Aligned_cols=31 Identities=39% Similarity=0.611 Sum_probs=20.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHH Q ss_conf 67657799999997289875781799998886998899999999 Q gi|254780504|r 23 AKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDK 66 (492) Q Consensus 23 A~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~ 66 (492) |...|-|||- .| ..+.||. -+.++||.++.. T Consensus 287 AaNiLKPaLA----RG-----eL~~IGA----TT~~EYRk~iEK 317 (786) T COG0542 287 AANLLKPALA----RG-----ELRCIGA----TTLDEYRKYIEK 317 (786) T ss_pred HHHHHHHHHH----CC-----CEEEEEE----CCHHHHHHHHHH T ss_conf 5664677874----58-----7379973----558999887330 No 70 >COG3935 DnaD Putative primosome component and related proteins [DNA replication, recombination, and repair] Probab=26.91 E-value=41 Score=13.62 Aligned_cols=18 Identities=33% Similarity=0.739 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHHHHCCCC Q ss_conf 458846999988734898 Q gi|254780504|r 446 TWKWSDSILKSWQIINQK 463 (492) Q Consensus 446 sWrivdpiL~~~~~~~~~ 463 (492) .|++|+.||..|.+.+.+ T Consensus 178 n~~YI~~IL~nW~k~gvk 195 (246) T COG3935 178 NFKYIDAILRNWKKNGVK 195 (246) T ss_pred CHHHHHHHHHHHHHCCCC T ss_conf 689999999999983875 No 71 >KOG1151 consensus Probab=26.64 E-value=29 Score=14.71 Aligned_cols=146 Identities=20% Similarity=0.402 Sum_probs=76.4 Q ss_pred HHHHCCHHHEE--ECCCHHHH-----------HHHHHHHH--H-HHCC---CCC--HHHHCCCCEEEEEEEEEECCCCHH Q ss_conf 98738968835--12420125-----------68777788--7-7477---520--025280333079999984677024 Q gi|254780504|r 167 TRKIFKESQIF--RIDHYLGK-----------EAVQGLMV--F-RFAN---TFY--ESLWNNKYIDHVQITTAETIGVED 225 (492) Q Consensus 167 l~~~f~E~qiy--RIDHYLGK-----------e~vqnil~--l-RFaN---~~~--e~~WNr~~I~~VqIt~~E~~gveg 225 (492) +.+|-.-++|| |..|.--. |-|.|+.+ | |..| .-| -|+.|..|. -+-+-| T Consensus 402 laEYhEqeEIfKLRlgHLkKEEaeiqaElERLErvrnlHiRELKRi~NEdnSQFkDHptLn~RYL---------lLhLLG 472 (775) T KOG1151 402 LAEYHEQEEIFKLRLGHLKKEEAEIQAELERLERVRNLHIRELKRIHNEDNSQFKDHPTLNDRYL---------LLHLLG 472 (775) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCHHHHHH---------HHHHHC T ss_conf 99888689999998865455677888899999998877899988642442333045741677899---------999861 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH--HHHCCCCHH-HCCCCEEEECCCCCCCCCEEC Q ss_conf 55553220017887655899999998718544588589999999999--730688811-223613652136677687343 Q gi|254780504|r 226 RVDYYNNTGALRDMIQNHLLQLLCLVAMEMPPSVEAKSIKNEKIKVL--QALQMITPE-NVQKLTVRGQYQSGIINGIPV 302 (492) Q Consensus 226 R~~yyD~~GaiRDmvQNHllQlL~lvAMe~P~~~~~~~ir~eK~kvl--k~~r~~~~~-~v~~~~v~GQY~~~~~~~~~v 302 (492) ||||-+-+-|. |.-.|. .||..-.- + -.+-||||-.-. .++|-.+.. .+++.-+.-||. T Consensus 473 rGGFSEVyKAF-Dl~EqR------YvAvKIHq-l-NK~WrdEKKeNYhKHAcREyrIHKeLDHpRIVKlYD--------- 534 (775) T KOG1151 473 RGGFSEVYKAF-DLTEQR------YVAVKIHQ-L-NKNWRDEKKENYHKHACREYRIHKELDHPRIVKLYD--------- 534 (775) T ss_pred CCCHHHHHHHC-CCCHHH------EEEEEEEH-H-CCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEE--------- T ss_conf 56478899860-301101------33565011-0-201566766658887777776654047531024010--------- Q ss_pred CCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCEEEEEECCCCCCCEEE Q ss_conf 341014798866487438999997085867857775513346885179 Q gi|254780504|r 303 KGYLEEIPLGISDTETFVAIKANIDNPRWSGIPFYLRTGKYLAKHISE 350 (492) Q Consensus 303 ~gY~~e~~v~~S~TeTfaa~~l~idn~RW~gVPF~lrtGK~l~~k~~e 350 (492) |.. -+|.+||.|--|+.- .+.-|||+--|-|.+|.+. T Consensus 535 --yfs------lDtdsFCTVLEYceG---NDLDFYLKQhklmSEKEAR 571 (775) T KOG1151 535 --YFS------LDTDSFCTVLEYCEG---NDLDFYLKQHKLMSEKEAR 571 (775) T ss_pred --EEE------ECCCCCEEEEEECCC---CCHHHHHHHHHHHHHHHHH T ss_conf --245------226653343453478---8504787751100078889 No 72 >COG3487 IrpA Uncharacterized iron-regulated protein [Inorganic ion transport and metabolism] Probab=26.12 E-value=35 Score=14.09 Aligned_cols=38 Identities=32% Similarity=0.496 Sum_probs=20.9 Q ss_pred CCCCHHHHHHHHHHHHHHCCHHHEEECCCHHHHHHH-----------------HHHHHHHHCCCCCH Q ss_conf 678988999999999873896883512420125687-----------------77788774775200 Q gi|254780504|r 153 IGSNLSSAQEIHAITRKIFKESQIFRIDHYLGKEAV-----------------QGLMVFRFANTFYE 202 (492) Q Consensus 153 FG~Dl~SA~eLn~~l~~~f~E~qiyRIDHYLGKe~v-----------------qnil~lRFaN~~~e 202 (492) |-..|.+|+.|.+. ||-+|.|+.- |.--++||.|.|.. T Consensus 66 yeDsl~TA~aL~Ka------------iDalla~PsaetLkaareAW~aARvPYqqsEvyRFGN~ivD 120 (446) T COG3487 66 YEDSLTTAQALQKA------------IDALLAKPSAETLKAAREAWLAARVPYQQSEVYRFGNAIVD 120 (446) T ss_pred HHHHHHHHHHHHHH------------HHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHCCCCHHH T ss_conf 88777789999999------------99997198688999999999985385126567760671034 No 73 >PRK05854 short chain dehydrogenase; Provisional Probab=25.80 E-value=42 Score=13.48 Aligned_cols=129 Identities=16% Similarity=0.151 Sum_probs=72.0 Q ss_pred CCCCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 88898989878998644356676577999999972898757817999988869988999999999987423467998999 Q gi|254780504|r 3 NHTPNIHNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKV 82 (492) Q Consensus 3 ~~tp~~~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~ 82 (492) ...|++..-++||=|||.-+-.-- | -.|.. .+.+||-.+|+.-.-++-.+.++. .... T Consensus 7 ~~iPdL~GK~~vITGa~sGIG~~~---a-~~La~------~Ga~Vil~~R~~~k~~~a~~~i~~----~~~~-------- 64 (314) T PRK05854 7 ATVPDLSGKLAVVTGASSGLGFGL---A-RRLAA------AGADVILPVRNRAKGEAAVAAIRT----AVPD-------- 64 (314) T ss_pred CCCCCCCCCEEEECCCCCHHHHHH---H-HHHHH------CCCEEEEEECCHHHHHHHHHHHHH----HCCC-------- T ss_conf 789899999899906882999999---9-99997------849899997999999999999998----6899-------- Q ss_pred HHHHHHCCEEECCCCCHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCC-CC---CEEEEECCCCCCHH Q ss_conf 989851657864678977899888874057678469984069568666777777725876-44---35776056678988 Q gi|254780504|r 83 QKFLSLIFYINLDVEKNYGWELLGKLLDIDKTRIRVFYLAMSSAFFGKISQRIHANNLVT-EH---TRVVLEKPIGSNLS 158 (492) Q Consensus 83 ~~F~~~~~Y~~~d~~~~~~y~~L~~~l~~~~~~~rifYLAiPP~~f~~i~~~L~~~~l~~-~~---~RiViEKPFG~Dl~ 158 (492) ..+.++.+|+.+.++-..+.+.+..... +|-.| +..+|... .. +.=-+|.-|+.|.- T Consensus 65 ----~~v~~~~lDLs~l~sVr~~a~~~~~~~~--~lDiL-------------InNAGv~~~~~~~~T~dG~E~~f~vN~L 125 (314) T PRK05854 65 ----AKLTIRALDLSSLASVAALGEQLLAEGR--PIHLL-------------INNAGVMTPPERQTTADGFELQFGTNHL 125 (314) T ss_pred ----CCEEEEECCCCCHHHHHHHHHHHHHCCC--CCCEE-------------EECCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf ----8569996463168999999998753068--75278-------------7267666588654057763665553457 Q ss_pred HHHHHHHHHHHHCC Q ss_conf 99999999987389 Q gi|254780504|r 159 SAQEIHAITRKIFK 172 (492) Q Consensus 159 SA~eLn~~l~~~f~ 172 (492) +---|...|..... T Consensus 126 ghflLt~~Llp~l~ 139 (314) T PRK05854 126 GHFALTAHLLPLLR 139 (314) T ss_pred HHHHHHHHHHHHHC T ss_conf 78889887787632 No 74 >COG3875 Uncharacterized conserved protein [Function unknown] Probab=25.69 E-value=43 Score=13.47 Aligned_cols=46 Identities=17% Similarity=0.163 Sum_probs=32.9 Q ss_pred HHHHHHHHHHHHHCCCCCCCCEEEEEE-E-CCCCCHHHHHHHHHHHHH Q ss_conf 765779999999728987578179999-8-886998899999999998 Q gi|254780504|r 24 KRKLFPALYNNYTNREENQISSRIIGI-C-RIKMTVESYRSFVDKELQ 69 (492) Q Consensus 24 ~RKL~PALf~L~~~~~l~~~~~~IiG~-a-R~~~~~e~fr~~v~~~l~ 69 (492) .+++.|.||.-...|..++.+..||.. | .+.++.|+|.+.+-+.+. T Consensus 82 t~~Iap~L~reL~~gg~~d~ni~ii~A~G~Hrt~t~EEf~kklGeei~ 129 (423) T COG3875 82 TKKIAPNLLRELYLGGAKDENITIIFALGLHRTQTEEEFEKKLGEEIV 129 (423) T ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHH T ss_conf 444209999999965788654799871256676539999998717777 No 75 >TIGR01849 PHB_depoly_PhaZ polyhydroxyalkanoate depolymerase, intracellular; InterPro: IPR010915 This entry represents an intracellular depolymerase for polyhydroxyalkanoate (PHA), a carbon and energy storing polyester that accumulates in granules in many bacterial species when carbon sources are abundant but other nutrients are limiting.. Probab=25.39 E-value=28 Score=14.73 Aligned_cols=52 Identities=17% Similarity=0.208 Sum_probs=34.1 Q ss_pred CCCCCCCCCCCCEEEECC----------CCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCC Q ss_conf 998889898987899864----------43566765779999999728987578179999888699 Q gi|254780504|r 1 MQNHTPNIHNSDFIIFGG----------TGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMT 56 (492) Q Consensus 1 m~~~tp~~~~~~lVIFGA----------TGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~ 56 (492) |....++..|.|+|+.|+ +-|||++|-+==| ++..+-...+.==|+||.=+- T Consensus 197 mae~~~P~~P~sm~lmGGPIDaR~~PT~VN~lA~~~~~eWF----~~n~i~~VP~~YpG~GR~VYP 258 (414) T TIGR01849 197 MAENEEPAQPRSMTLMGGPIDARASPTVVNELAKEKPIEWF----QENLIMRVPFPYPGAGRKVYP 258 (414) T ss_pred HHHCCCCCCCCEEEEECCCCCCCCCCHHHHHHHCCCCHHHH----HHCCEEECCCCCCCCCCCCCC T ss_conf 87457888884057627776645552134430012672788----538764084267778854371 No 76 >KOG1384 consensus Probab=25.18 E-value=31 Score=14.50 Aligned_cols=59 Identities=22% Similarity=0.442 Sum_probs=32.4 Q ss_pred CCCEEEECCCC--------HHHHHHHHHHHH------HHHH-------------CCCCCCCCEEEEEEEC--CCCCHHHH Q ss_conf 98789986443--------566765779999------9997-------------2898757817999988--86998899 Q gi|254780504|r 10 NSDFIIFGGTG--------DLAKRKLFPALY------NNYT-------------NREENQISSRIIGICR--IKMTVESY 60 (492) Q Consensus 10 ~~~lVIFGATG--------DLA~RKL~PALf------~L~~-------------~~~l~~~~~~IiG~aR--~~~~~e~f 60 (492) .-.+||-|+|| |||.| ||+=- ++|. ....|. +++|.-- .+++..+| T Consensus 7 ~KVvvI~G~TGsGKSrLaVdLA~r--f~~EIINsDkmQvYkGldivTnK~t~~e~~gVPH---HLlg~l~~~~e~t~~~F 81 (348) T KOG1384 7 DKVVVIMGATGAGKSRLAVDLATR--FPGEIINSDKMQVYKGLDIVTNKITLQERKGVPH---HLLGHLHPEAEYTAGEF 81 (348) T ss_pred CEEEEEECCCCCCHHHHHHHHHHH--CCCEEECCCCEEEECCCCCCCCCCCHHHCCCCCH---HHHCCCCHHHHCCHHHH T ss_conf 359999557777704667888975--7864651563356327662016687554079876---77076886764269999 Q ss_pred HHHHHHHHHHHCC Q ss_conf 9999999987423 Q gi|254780504|r 61 RSFVDKELQRYLK 73 (492) Q Consensus 61 r~~v~~~l~~~~~ 73 (492) +.....+++.-.. T Consensus 82 ~~~a~~aie~I~~ 94 (348) T KOG1384 82 EDDASRAIEEIHS 94 (348) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999985 No 77 >TIGR02746 TraC-F-type type-IV secretion system protein TraC; InterPro: IPR014117 The proteins in this entry are found in the F, P and I-like type IV secretion systems. Gene symbols include TraC (F-type), TrbE/VirB4 (P-type) and TraU (I-type). The proteins contain the Walker A and B motifs and so are putative nucleotide triphosphatases.. Probab=25.06 E-value=41 Score=13.59 Aligned_cols=39 Identities=8% Similarity=-0.121 Sum_probs=19.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH Q ss_conf 3586989999845884699998873489874579889971 Q gi|254780504|r 435 TLFMGYDEVEETWKWSDSILKSWQIINQKVDYYAAGTWGP 474 (492) Q Consensus 435 tlF~r~dEve~sWrivdpiL~~~~~~~~~~~~Y~~GS~GP 474 (492) .++..-.+=.+-|+.+-.-|..- +.+.....|....|.| T Consensus 336 ~~~P~~~~k~~e~~~~~~~i~~~-~~~~~~~~~~v~lf~~ 374 (900) T TIGR02746 336 KWVPSLEKKLKEIKELREEISSG-GDSLVSMFYNVLLFTK 374 (900) T ss_pred HHHHHHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHCCC T ss_conf 63130788898899999998658-9511131332530068 No 78 >cd05567 PTS_IIB_mannitol PTS_IIB_mannitol: subunit IIB of enzyme II (EII) of the mannitol-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In this system, EII is a mannitol-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. The IIA, IIB, and IIC domains are expressed from the mtlA gene as a single protein, also known as the mannitol PTS permease, the mtl transporter, or MtlA. MtlA is only functional as a dimer with the dimer contacts occuring between the IIC domains. MtlA takes up exogenous mannitol releasing the phosphate ester into the cytoplasm in preparation for oxidation to fructose-6-phosphate by the NAD-dependent mannitol-P dehydrogenase (MtlD). The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include mannitol, chitobiose/lichenan, ascorbate, lactose, galactitol, fructose, and a s Probab=24.66 E-value=44 Score=13.34 Aligned_cols=34 Identities=15% Similarity=0.302 Sum_probs=29.4 Q ss_pred HHHHHHHHHHHHCCHHHEEECCCHHHHHHHHHHH Q ss_conf 9999999998738968835124201256877778 Q gi|254780504|r 159 SAQEIHAITRKIFKESQIFRIDHYLGKEAVQGLM 192 (492) Q Consensus 159 SA~eLn~~l~~~f~E~qiyRIDHYLGKe~vqnil 192 (492) +-++|.+...+..+..+++-||+||+.+..+++. T Consensus 51 t~~~L~~rAk~~~p~a~~i~v~nfL~~~~yd~lv 84 (87) T cd05567 51 THASLTDRAKKKAPQAQHLSVDNFLNTPEYDELI 84 (87) T ss_pred EHHHHHHHHHHHCCCCEEEEHHHHCCCHHHHHHH T ss_conf 6699999999768998899658725887799999 No 79 >KOG2733 consensus Probab=24.59 E-value=45 Score=13.33 Aligned_cols=49 Identities=16% Similarity=0.193 Sum_probs=24.8 Q ss_pred CCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 9878998644356676577999999972898757817999988869988999999999 Q gi|254780504|r 10 NSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKE 67 (492) Q Consensus 10 ~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~ 67 (492) ..++|||||||=-.+--..-++ ..+.++ +..+-=.||+ .+-.++.+++. T Consensus 5 ~yDvVIyGASGfTG~yivee~v----~~~~~~--~~slavAGRn---~~KL~~vL~~~ 53 (423) T KOG2733 5 RYDVVIYGASGFTGKYIVEEAV----SSQVFE--GLSLAVAGRN---EKKLQEVLEKV 53 (423) T ss_pred EEEEEEECCCCCCCEEEHHHHH----HHHCCC--CCEEEEECCC---HHHHHHHHHHH T ss_conf 3359998156665302489886----430246--7547873278---88999999998 No 80 >pfam04271 consensus Probab=24.51 E-value=45 Score=13.32 Aligned_cols=19 Identities=26% Similarity=0.615 Sum_probs=15.4 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 8458846999988734898 Q gi|254780504|r 445 ETWKWSDSILKSWQIINQK 463 (492) Q Consensus 445 ~sWrivdpiL~~~~~~~~~ 463 (492) .+|+.++.||..|.+++.+ T Consensus 48 ~~~~Yi~~IL~~W~~~gi~ 66 (73) T pfam04271 48 ANFRYIQAILKNWKNNGIK 66 (73) T ss_pred CCHHHHHHHHHHHHHCCCC T ss_conf 9879999999999995999 No 81 >KOG1774 consensus Probab=24.40 E-value=27 Score=14.85 Aligned_cols=41 Identities=34% Similarity=0.514 Sum_probs=30.0 Q ss_pred HHHHHHHHH-HHHHHCCCCCHHHHCCCCEEEEEEEEEECCCC--HHHHHHHHHH Q ss_conf 125687777-88774775200252803330799999846770--2455553220 Q gi|254780504|r 183 LGKEAVQGL-MVFRFANTFYESLWNNKYIDHVQITTAETIGV--EDRVDYYNNT 233 (492) Q Consensus 183 LGKe~vqni-l~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gv--egR~~yyD~~ 233 (492) ++|.|||-| +.|||- =|| ..|||-..|..|+ |||.-=||.+ T Consensus 6 v~kvmv~Pin~Ifr~L-------q~~---t~VqIWl~eq~~~rieG~IvGFDEy 49 (88) T KOG1774 6 VQKVMVQPINLIFRFL-------QNR---TRVQIWLFEQVGLRIEGRIVGFDEY 49 (88) T ss_pred CCCEECCCHHHHHHHH-------HCC---CCEEEEEEECCCCEEEEEEECHHHH T ss_conf 1614527699999998-------627---8638998713683776789625884 No 82 >pfam03699 UPF0182 Uncharacterized protein family (UPF0182). This family contains uncharacterized integral membrane proteins. Probab=24.14 E-value=14 Score=16.97 Aligned_cols=44 Identities=23% Similarity=0.366 Sum_probs=33.2 Q ss_pred HHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 999999879833586989999845884699998873489874579889971889 Q gi|254780504|r 424 RLLMDIIHANQTLFMGYDEVEETWKWSDSILKSWQIINQKVDYYAAGTWGPDQS 477 (492) Q Consensus 424 ~Ll~D~i~Gd~tlF~r~dEve~sWrivdpiL~~~~~~~~~~~~Y~~GS~GP~~a 477 (492) +..-||.+|+-++++-.++ ||||++|+ +.-|-...+.|..|.+- T Consensus 597 Kv~VDAY~G~v~~Y~~D~~--------DPil~t~~--kiFP~lf~p~~~mp~~L 640 (771) T pfam03699 597 KAVVDAYDGTVDFYVVDPE--------DPILKTWQ--KIFPGLFKPLSAMPDDL 640 (771) T ss_pred EEEEECCCCEEEEEECCCC--------CHHHHHHH--HHCCCCCCCHHHCCHHH T ss_conf 8999877880799972898--------86999999--86960159957789999 No 83 >PRK13245 hetR heterocyst differentiation control protein; Reviewed Probab=24.02 E-value=28 Score=14.78 Aligned_cols=24 Identities=17% Similarity=0.221 Sum_probs=14.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 869899998458846999988734 Q gi|254780504|r 437 FMGYDEVEETWKWSDSILKSWQII 460 (492) Q Consensus 437 F~r~dEve~sWrivdpiL~~~~~~ 460 (492) =+-.+.++.+-.=.|.|+..|.+. T Consensus 256 dI~p~~~~qA~~ELdeiir~WAdr 279 (299) T PRK13245 256 DIPPERWEQAMEELDEIIRHWADK 279 (299) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 898799999999999999999998 No 84 >PRK07024 short chain dehydrogenase; Provisional Probab=23.63 E-value=46 Score=13.21 Aligned_cols=70 Identities=11% Similarity=0.145 Sum_probs=42.8 Q ss_pred CCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHC Q ss_conf 98789986443566765779999999728987578179999888699889999999999874234679989999898516 Q gi|254780504|r 10 NSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLI 89 (492) Q Consensus 10 ~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~ 89 (492) |-+++|-|||..+-+- +-..|.. .+.+++.++|+.-..++.. +.+ . ...+ T Consensus 2 ~~~VlITGassGIG~a-----~A~~la~-----~G~~v~l~~R~~~~L~~~~----~~~----~------------~~~~ 51 (256) T PRK07024 2 PLKVFITGASSGIGQA-----LAREYAR-----QGATLGLVARRTDALQAFA----ARL----P------------KARV 51 (256) T ss_pred CCEEEEECCCHHHHHH-----HHHHHHH-----CCCEEEEEECCHHHHHHHH----HHC----C------------CCCE T ss_conf 9989998460299999-----9999998-----8998999989889999999----976----7------------9976 Q ss_pred CEEECCCCCHHHHHHHHHHH Q ss_conf 57864678977899888874 Q gi|254780504|r 90 FYINLDVEKNYGWELLGKLL 109 (492) Q Consensus 90 ~Y~~~d~~~~~~y~~L~~~l 109 (492) .++++|+.+.++.+.+.+.+ T Consensus 52 ~~~~~Dv~d~~~~~~~~~~~ 71 (256) T PRK07024 52 SVYAADVRDADALAAAAADF 71 (256) T ss_pred EEEEECCCCHHHHHHHHHHH T ss_conf 99981179999999999999 No 85 >COG2607 Predicted ATPase (AAA+ superfamily) [General function prediction only] Probab=23.55 E-value=46 Score=13.19 Aligned_cols=91 Identities=19% Similarity=0.361 Sum_probs=52.7 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCC Q ss_conf 87899864435667657799999997289875781799998886998899999999998742346799899998985165 Q gi|254780504|r 11 SDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIF 90 (492) Q Consensus 11 ~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~ 90 (492) -+..+.||.|- -|--|.-|+++-|... ..++|-+.|.++.+ +-. +-+.++. .+.++--||..++ T Consensus 86 NnVLLwGaRGt-GKSSLVKA~~~e~~~~-----glrLVEV~k~dl~~--Lp~-l~~~Lr~-------~~~kFIlFcDDLS 149 (287) T COG2607 86 NNVLLWGARGT-GKSSLVKALLNEYADE-----GLRLVEVDKEDLAT--LPD-LVELLRA-------RPEKFILFCDDLS 149 (287) T ss_pred CCEEEECCCCC-CHHHHHHHHHHHHHHC-----CCEEEEECHHHHHH--HHH-HHHHHHC-------CCCEEEEEECCCC T ss_conf 23677637777-7479999999998741-----77079976888865--799-9999961-------8860899956777 Q ss_pred EEECCCCCHHHHHHHHHHHCCC---CCCCEEEEE Q ss_conf 7864678977899888874057---678469984 Q gi|254780504|r 91 YINLDVEKNYGWELLGKLLDID---KTRIRVFYL 121 (492) Q Consensus 91 Y~~~d~~~~~~y~~L~~~l~~~---~~~~rifYL 121 (492) |-. +.++|+.|+..++++ .+.|.+||- T Consensus 150 Fe~----gd~~yK~LKs~LeG~ve~rP~NVl~YA 179 (287) T COG2607 150 FEE----GDDAYKALKSALEGGVEGRPANVLFYA 179 (287) T ss_pred CCC----CCHHHHHHHHHHCCCCCCCCCEEEEEE T ss_conf 777----813899999985388556887079997 No 86 >PTZ00213 asparagine synthetase A; Provisional Probab=23.50 E-value=6.6 Score=19.26 Aligned_cols=41 Identities=20% Similarity=0.242 Sum_probs=19.5 Q ss_pred CCHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECCCCCCCCCEECCCCCC Q ss_conf 8858999999999973068881122361365213667768734334101 Q gi|254780504|r 259 VEAKSIKNEKIKVLQALQMITPENVQKLTVRGQYQSGIINGIPVKGYLE 307 (492) Q Consensus 259 ~~~~~ir~eK~kvlk~~r~~~~~~v~~~~v~GQY~~~~~~~~~v~gY~~ 307 (492) +++..=-++-+|-..|+=.+. +=|.-.+|+.++...|.|-| T Consensus 182 Lt~keRE~~i~ke~gAVFl~g--------IG~~L~~G~~HD~RAPDYDD 222 (350) T PTZ00213 182 LDPKEREREIVKKYGAVFLIG--------IGCKLSSGDPHDARAPDYDD 222 (350) T ss_pred CCHHHHHHHHHHHHCCEEEEE--------CCCCCCCCCCCCCCCCCCCC T ss_conf 987889999999839599994--------37867898957777998764 No 87 >cd03009 TryX_like_TryX_NRX Tryparedoxin (TryX)-like family, TryX and nucleoredoxin (NRX) subfamily; TryX and NRX are thioredoxin (TRX)-like protein disulfide oxidoreductases that alter the redox state of target proteins via the reversible oxidation of an active center CXXC motif. TryX is involved in the regulation of oxidative stress in parasitic trypanosomatids by reducing TryX peroxidase, which in turn catalyzes the reduction of hydrogen peroxide and organic hydroperoxides. TryX derives reducing equivalents from reduced trypanothione, a polyamine peptide conjugate unique to trypanosomatids, which is regenerated by the NADPH-dependent flavoprotein trypanothione reductase. Vertebrate NRX is a 400-amino acid nuclear protein with one redox active TRX domain containing a CPPC active site motif followed by one redox inactive TRX-like domain. Mouse NRX transcripts are expressed in all adult tissues but is restricted to the nervous system and limb buds in embryos. Plant NRX, longer than the Probab=23.49 E-value=47 Score=13.19 Aligned_cols=43 Identities=19% Similarity=0.102 Sum_probs=32.4 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECC Q ss_conf 8789986443566765779999999728987578179999888 Q gi|254780504|r 11 SDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRI 53 (492) Q Consensus 11 ~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~ 53 (492) ..+|-|.||-==-=|+-+|.|-.+|..-.-...++.||+++.. T Consensus 20 ~vll~FWATWC~PC~~e~P~L~~ly~~~~~~~~~~~vi~Is~d 62 (131) T cd03009 20 TVGLYFSASWCPPCRAFTPKLVEFYEKLKESGKNFEIVFISWD 62 (131) T ss_pred EEEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECC T ss_conf 8999998757866652658999999998613887799998558 No 88 >COG0300 DltE Short-chain dehydrogenases of various substrate specificities [General function prediction only] Probab=23.21 E-value=47 Score=13.15 Aligned_cols=196 Identities=15% Similarity=0.148 Sum_probs=93.3 Q ss_pred CCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 98987899864435667657799999997289875781799998886998899999999998742346799899998985 Q gi|254780504|r 8 IHNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLS 87 (492) Q Consensus 8 ~~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~ 87 (492) +..-+.||=|||+.+-+- +-..++ ..+..+|-+||++---++..+.+++ .+.- T Consensus 4 ~~~~~~lITGASsGIG~~-----~A~~lA-----~~g~~liLvaR~~~kL~~la~~l~~-----------------~~~v 56 (265) T COG0300 4 MKGKTALITGASSGIGAE-----LAKQLA-----RRGYNLILVARREDKLEALAKELED-----------------KTGV 56 (265) T ss_pred CCCCEEEEECCCCHHHHH-----HHHHHH-----HCCCEEEEEECCHHHHHHHHHHHHH-----------------HHCC T ss_conf 778679997788648999-----999999-----7799799996769999999999987-----------------3086 Q ss_pred HCCEEECCCCCHHHHHHHHHHHCCCCCC-CEEEEECCCHHHHHHHHHHHHHHCCCC-------CCCEEEEECCCCCCHHH Q ss_conf 1657864678977899888874057678-469984069568666777777725876-------44357760566789889 Q gi|254780504|r 88 LIFYINLDVEKNYGWELLGKLLDIDKTR-IRVFYLAMSSAFFGKISQRIHANNLVT-------EHTRVVLEKPIGSNLSS 159 (492) Q Consensus 88 ~~~Y~~~d~~~~~~y~~L~~~l~~~~~~-~rifYLAiPP~~f~~i~~~L~~~~l~~-------~~~RiViEKPFG~Dl~S 159 (492) .+..++.|++++++-.++...+...... ..+ ++.+|... .|.+ +|+=|--|..+ T Consensus 57 ~v~vi~~DLs~~~~~~~l~~~l~~~~~~IdvL----------------VNNAG~g~~g~f~~~~~~~--~~~mi~lN~~a 118 (265) T COG0300 57 EVEVIPADLSDPEALERLEDELKERGGPIDVL----------------VNNAGFGTFGPFLELSLDE--EEEMIQLNILA 118 (265) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHCCCCCCEE----------------EECCCCCCCCCHHHCCHHH--HHHHHHHHHHH T ss_conf 27999776788367999999998248852389----------------9778747766542188589--99999999999 Q ss_pred HHHHHHHHHHHCCH---HHEEECCCHHHHHHHHHH--------HHHHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHH Q ss_conf 99999999873896---883512420125687777--------8877477520025280333079999984677024555 Q gi|254780504|r 160 AQEIHAITRKIFKE---SQIFRIDHYLGKEAVQGL--------MVFRFANTFYESLWNNKYIDHVQITTAETIGVEDRVD 228 (492) Q Consensus 160 A~eLn~~l~~~f~E---~qiyRIDHYLGKe~vqni--------l~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~ 228 (492) .-.|...+...+.+ -+|--|----|+.+.-+. .++-|+|.+-+.+ + =..|.++.-..-.+ |.+ T Consensus 119 ~~~LT~~~lp~m~~~~~G~IiNI~S~ag~~p~p~~avY~ATKa~v~~fSeaL~~EL--~--~~gV~V~~v~PG~~--~T~ 192 (265) T COG0300 119 LTRLTKAVLPGMVERGAGHIINIGSAAGLIPTPYMAVYSATKAFVLSFSEALREEL--K--GTGVKVTAVCPGPT--RTE 192 (265) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEECHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH--C--CCCEEEEEEECCCC--CCC T ss_conf 99999999999986589669998434532888632799999999999999999983--5--89849999965733--355 Q ss_pred HHHHH--HHHHH-----HHHHHHHHHHHHHHCC Q ss_conf 53220--01788-----7655899999998718 Q gi|254780504|r 229 YYNNT--GALRD-----MIQNHLLQLLCLVAME 254 (492) Q Consensus 229 yyD~~--GaiRD-----mvQNHllQlL~lvAMe 254 (492) |+|.. +.-.. |+.-|=..-.++.+++ T Consensus 193 f~~~~~~~~~~~~~~~~~~~~~~va~~~~~~l~ 225 (265) T COG0300 193 FFDAKGSDVYLLSPGELVLSPEDVAEAALKALE 225 (265) T ss_pred CCCCCCCCCCCCCCHHHCCCHHHHHHHHHHHHH T ss_conf 333444432112321230699999999999985 No 89 >COG3268 Uncharacterized conserved protein [Function unknown] Probab=23.19 E-value=39 Score=13.72 Aligned_cols=123 Identities=15% Similarity=0.074 Sum_probs=50.3 Q ss_pred CCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCC-HHHHHHHHHH Q ss_conf 987899864435667657799999997289875781799998886998899999999998742346799-8999989851 Q gi|254780504|r 10 NSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYN-PLKVQKFLSL 88 (492) Q Consensus 10 ~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~-~~~~~~F~~~ 88 (492) ++++||.||||=-.+ .+--. |...+.-+ .=.||+.--. +-++..|.+....-+.. +...++.+++ T Consensus 6 e~d~iiYGAtGy~G~-lvae~---l~~~g~~~------aLAgRs~~kl----~~l~~~LG~~~~~~p~~~p~~~~~~~~~ 71 (382) T COG3268 6 EYDIIIYGATGYAGG-LVAEY---LAREGLTA------ALAGRSSAKL----DALRASLGPEAAVFPLGVPAALEAMASR 71 (382) T ss_pred CEEEEEECCCCCHHH-HHHHH---HHHCCCCH------HHCCCCHHHH----HHHHHHCCCCCCCCCCCCHHHHHHHHHC T ss_conf 403899746552158-99999---99748864------3216888999----8899850966444677888999999742 Q ss_pred CCEEECCCCCHHHHHHHHHHHCCCCCCCEEEEECCC--HHHHHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 657864678977899888874057678469984069--56866677777772587644357760 Q gi|254780504|r 89 IFYINLDVEKNYGWELLGKLLDIDKTRIRVFYLAMS--SAFFGKISQRIHANNLVTEHTRVVLE 150 (492) Q Consensus 89 ~~Y~~~d~~~~~~y~~L~~~l~~~~~~~rifYLAiP--P~~f~~i~~~L~~~~l~~~~~RiViE 150 (492) ..-+- +....|..-..-+-..-..+...|+-+- ..+|..+....++.- ...+.|||-- T Consensus 72 ~~VVl---ncvGPyt~~g~plv~aC~~~GTdY~DiTGEi~~fe~~i~~yh~~A-~~~Ga~Ii~~ 131 (382) T COG3268 72 TQVVL---NCVGPYTRYGEPLVAACAAAGTDYADITGEIMFFENSIDLYHAQA-ADAGARIIPG 131 (382) T ss_pred CEEEE---ECCCCCCCCCCHHHHHHHHHCCCEEECCCCHHHHHHHHHHHHHHH-HHCCCEEECC T ss_conf 66899---614661202647999999719872413561799999988877788-7559789666 No 90 >TIGR00570 cdk7 CDK-activating kinase assembly factor MAT1; InterPro: IPR004575 MAT1 (menage a trois 1) is a RING finger protein with a characteristic C3HC4 motif located in the N-terminal domain. MAT1 stabilises the cyclin H-CDK7 complex to form a functional CDK-activating kinase (CAK) enzymatic complex which then goes on to activate many of the CDK enzymes intimately involved in the cell cycle . CDK7 forms a stable complex with cyclin H and MAT1 in vivo only when phosphorylated on either one of two residues (Ser164 or Thr170) in its T-loop. The requirement for MAT1 for the activation of CAK can be by-passed by the phosphorylation of CDK7 on the T-loop. The two mechanisms for CDK7 complex stabilisation and activation (MAT1 addition and T-loop phosphorylation), which can operate independently in vitro, actually cooperate under physiological conditions to maintain complex integrity. With prolonged exposure to elevated temperature, dissociation to monomeric subunits occurs in vivo when CDK7 is dephosphorylated, even in the presence of MAT1 . The Cyclin H-MAT1-CDK7 complex also forms part of TFIIH, a multiprotein complex required for both transcription and DNA repair.; GO: 0007049 cell cycle, 0005634 nucleus. Probab=23.18 E-value=44 Score=13.37 Aligned_cols=14 Identities=21% Similarity=0.202 Sum_probs=12.5 Q ss_pred CCHHHHHHHHHHCC Q ss_conf 86248999999879 Q gi|254780504|r 419 PDGYERLLMDIIHA 432 (492) Q Consensus 419 pdaYe~Ll~D~i~G 432 (492) --||+|.|.|||.| T Consensus 303 ~lAc~RaLqeAf~G 316 (322) T TIGR00570 303 NLACERALQEAFSG 316 (322) T ss_pred HHHHHHHHHHHHHC T ss_conf 89999999998511 No 91 >PRK09537 pylS pyrolysyl-tRNA synthetase; Reviewed Probab=22.94 E-value=48 Score=13.11 Aligned_cols=40 Identities=15% Similarity=0.184 Sum_probs=20.0 Q ss_pred EEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHC Q ss_conf 9999888699889999999999874234679989999898516 Q gi|254780504|r 47 IIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLI 89 (492) Q Consensus 47 IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~ 89 (492) ++--+|+--+.-++|.+-....- +.....++.++.|+.+. T Consensus 47 vvnnsrs~rtaralrhhkyrk~c---k~crvsdedin~fltkt 86 (420) T PRK09537 47 VVNNSRSSRTARALRHHKYRKTC---KRCRVSDEDINNFLTKT 86 (420) T ss_pred EEECCCCHHHHHHHHHHHHHCCC---CCCCCCHHHHHHHHHHH T ss_conf 87156322778887621110422---22576778899999874 No 92 >pfam09093 Lyase_catalyt Lyase, catalytic. Members of this family are predominantly found in chondroitin ABC lyase I, and adopt a helical structure, with fifteen alpha-helices which are at least two turns long and several short helical turns. The bulk of the domain is formed by ten alpha-helices forming five hairpin-like pairs and arranged into an incomplete toroid, the (alpha/alpha)5 fold. Additionally, two long and two short alpha-helices at the N terminus of the domain wrap around the toroid. At the C-terminal end of the toroid there is one additional short alpha-helix. This domain is required for degradation of polysaccharides containing 1,4-beta-D-hexosaminyl and 1,3-beta-D-glucoronosyl or 1,3-alpha-L-iduronosyl linkages to disaccharides containing 4-deoxy-beta-D-gluc-4-enuronosyl groups. Probab=22.55 E-value=49 Score=13.06 Aligned_cols=55 Identities=25% Similarity=0.255 Sum_probs=28.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECCCCCCCCCEECC Q ss_conf 0178876558999999987185445885899999999997306888112236136521366776873433 Q gi|254780504|r 234 GALRDMIQNHLLQLLCLVAMEMPPSVEAKSIKNEKIKVLQALQMITPENVQKLTVRGQYQSGIINGIPVK 303 (492) Q Consensus 234 GaiRDmvQNHllQlL~lvAMe~P~~~~~~~ir~eK~kvlk~~r~~~~~~v~~~~v~GQY~~~~~~~~~v~ 303 (492) |+.-|.+-.-+++.|+.+-|++- -.||+..|+++..+=. .++ ||..|..||-++. T Consensus 187 ~~~~D~~nT~~~~~l~siLl~~D--------~~ek~~~L~~fs~wl~-----~~l--~~spG~~ggfKpD 241 (356) T pfam09093 187 SSDLDSFNTLSMQHLASLLLEPD--------DPEKIQLLNSLSRWIT-----GAL--QVAPGGAGGIKVD 241 (356) T ss_pred CCHHHHHHHHHHHHHHHHHCCCC--------CHHHHHHHHHHHHHHH-----CCC--CCCCCCCCCCCCC T ss_conf 41148888889999999983898--------1999999999999984-----616--7698877764788 No 93 >COG2382 Fes Enterochelin esterase and related enzymes [Inorganic ion transport and metabolism] Probab=22.30 E-value=49 Score=13.03 Aligned_cols=107 Identities=20% Similarity=0.247 Sum_probs=74.9 Q ss_pred EECCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCHHHEEECCCHHHHHHHHHHHHHHHCC- Q ss_conf 8406956866677777772587644357760566789889999999998738968835124201256877778877477- Q gi|254780504|r 120 YLAMSSAFFGKISQRIHANNLVTEHTRVVLEKPIGSNLSSAQEIHAITRKIFKESQIFRIDHYLGKEAVQGLMVFRFAN- 198 (492) Q Consensus 120 YLAiPP~~f~~i~~~L~~~~l~~~~~RiViEKPFG~Dl~SA~eLn~~l~~~f~E~qiyRIDHYLGKe~vqnil~lRFaN- 198 (492) |...+..--...-.-+-.+-++ +..|.||.=|-|+.-. .++ .-+|-.| ..--+|+.+ T Consensus 58 ~~~~~~~~~~~~~~~~~~~~l~-~~~~~vv~lppgy~~~--~k~----------pvl~~~D---------G~~~~~~g~i 115 (299) T COG2382 58 YWDCERTPGGPVEEILYSSELL-SERRRVVYLPPGYNPL--EKY----------PVLYLQD---------GQDWFRSGRI 115 (299) T ss_pred CCCCCCCCCCCHHHHHHHHHHC-CCEEEEEEECCCCCCC--CCC----------CEEEEEC---------CHHHHHCCCH T ss_conf 2455545688524420022202-6516999968998866--435----------4799963---------2888863876 Q ss_pred -CCCHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf -52002528033307999998467702455553220017887655899999 Q gi|254780504|r 199 -TFYESLWNNKYIDHVQITTAETIGVEDRVDYYNNTGALRDMIQNHLLQLL 248 (492) Q Consensus 199 -~~~e~~WNr~~I~~VqIt~~E~~gvegR~~yyD~~GaiRDmvQNHllQlL 248 (492) ++++.+--..-|.-+-+..-..+.+..|..=|-...+.-+++|+-||=.+ T Consensus 116 ~~~~dsli~~g~i~pai~vgid~~d~~~R~~~~~~n~~~~~~L~~eLlP~v 166 (299) T COG2382 116 PRILDSLIAAGEIPPAILVGIDYIDVKKRREELHCNEAYWRFLAQELLPYV 166 (299) T ss_pred HHHHHHHHHCCCCCCCEEEECCCCCHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 899999997087888469811877789999882560899999998866642 No 94 >PRK11052 malQ 4-alpha-glucanotransferase; Provisional Probab=22.25 E-value=30 Score=14.61 Aligned_cols=27 Identities=19% Similarity=0.316 Sum_probs=17.1 Q ss_pred CHHHHHHHHHHHHHHCCHHHEEECCCHHHH Q ss_conf 988999999999873896883512420125 Q gi|254780504|r 156 NLSSAQEIHAITRKIFKESQIFRIDHYLGK 185 (492) Q Consensus 156 Dl~SA~eLn~~l~~~f~E~qiyRIDHYLGK 185 (492) .++-.+++ |...+.---.+||||-||- T Consensus 428 gy~~f~~~---lra~mr~aG~lRIDHvmGL 454 (694) T PRK11052 428 AYQPFIDL---LRANMQNCGALRIDHVMSL 454 (694) T ss_pred CCHHHHHH---HHHHHHHCCCEEHHHHHHH T ss_conf 74899999---9999876596210456642 No 95 >TIGR02371 ala_DH_arch alanine dehydrogenase; InterPro: IPR012742 This enzyme, a homologue of bacterial ornithine cyclodeaminases and marsupial mu-crystallins, is a homodimeric NAD-dependent alanine dehydrogenase found in Archaeoglobus fulgidus and several other archaea.. Probab=21.91 E-value=45 Score=13.29 Aligned_cols=66 Identities=20% Similarity=0.084 Sum_probs=39.9 Q ss_pred EECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCC--EE Q ss_conf 9864435667657799999997289875781799998886998899999999998742346799899998985165--78 Q gi|254780504|r 15 IFGGTGDLAKRKLFPALYNNYTNREENQISSRIIGICRIKMTVESYRSFVDKELQRYLKDGEYNPLKVQKFLSLIF--YI 92 (492) Q Consensus 15 IFGATGDLA~RKL~PALf~L~~~~~l~~~~~~IiG~aR~~~~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~~~~~--Y~ 92 (492) ==||.|-.|-|+ |.+ +-.+ -+-|||+||+.|+.=+-...|-+- ..++-....++.-++|+++++ |. T Consensus 112 RTGAAGGvaaKY-------LAr--kdss-v~G~iGaG~QA~tQL~Al~rVfd~--eeV~~y~rt~~~~~kF~~~~skd~~ 179 (327) T TIGR02371 112 RTGAAGGVAAKY-------LAR--KDSS-VLGLIGAGRQAYTQLEALSRVFDL--EEVKVYSRTKEAAEKFVKRASKDYE 179 (327) T ss_pred HCCCCHHHHHHH-------HHH--CCCC-EEEEEECCHHHHHHHHHHHHHCCC--CEEEEEECCHHHHHHHHHHHHCCCC T ss_conf 003202455554-------430--1663-355763685799999998762554--5069986176889999987520246 No 96 >TIGR02769 nickel_nikE nickel import ATP-binding protein NikE; InterPro: IPR014137 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family represents the NikE subunit of a multisubunit nickel import ABC transporter complex. Nickel, once imported, may be used in urease and in certain classes of hydrogenase and superoxide dismutase.. Probab=21.82 E-value=50 Score=12.96 Aligned_cols=35 Identities=9% Similarity=0.185 Sum_probs=24.3 Q ss_pred CCCCCCCEEEEEECCCCCCCEEEEEEEEECCCCCC Q ss_conf 85867857775513346885179999985488643 Q gi|254780504|r 328 NPRWSGIPFYLRTGKYLAKHISEIVIALKPAPCTI 362 (492) Q Consensus 328 n~RW~gVPF~lrtGK~l~~k~~eI~i~FK~~p~~~ 362 (492) +=+|+|+|.+=--++.+..=+-+|=++|.+.|+.+ T Consensus 68 ~v~F~G~dl~~L~~~~~rafrR~vQlvFQD~~sav 102 (267) T TIGR02769 68 EVSFRGQDLSKLDRKQRRAFRRDVQLVFQDSPSAV 102 (267) T ss_pred EEEECCCCHHHCCHHHCCEEEEEEEEEEECCCCCC T ss_conf 04762564555081102111213678987474311 No 97 >TIGR00154 ispE 4-diphosphocytidyl-2C-methyl-D-erythritol kinase; InterPro: IPR004424 4-diphosphocytidyl-2C-methyl-D-erythritol kinase is a member of the family of GHMP kinases that were previously designated as conserved hypothetical protein YchB or as isopentenyl monophosphate kinase. In Lycopersicon esculentum (tomato) and Escherichia coli the protein has been indentified as 4-diphosphocytidyl-2C-methyl-D-erythritol kinase, an enzyme of the deoxyxylulose phosphate pathway of terpenoid biosynthesis.; GO: 0050515 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity, 0016114 terpenoid biosynthetic process. Probab=21.73 E-value=39 Score=13.78 Aligned_cols=29 Identities=28% Similarity=0.576 Sum_probs=22.2 Q ss_pred CCCEEEEEE-CCCCCCCEEEEEEEEE-CCCC Q ss_conf 785777551-3346885179999985-4886 Q gi|254780504|r 332 SGIPFYLRT-GKYLAKHISEIVIALK-PAPC 360 (492) Q Consensus 332 ~gVPF~lrt-GK~l~~k~~eI~i~FK-~~p~ 360 (492) ++|||.++- |=++++.+-||.=.|. +||+ T Consensus 147 aDvpffv~GY~~A~AtG~GE~i~~~~e~P~e 177 (322) T TIGR00154 147 ADVPFFVSGYGTAFATGRGEIITPLEEEPPE 177 (322) T ss_pred CCCEEEECCCCEEEECCCCCCCCCCCCCCCC T ss_conf 9830786363312112344250578889874 No 98 >PRK12493 magnesium chelatase; Provisional Probab=21.58 E-value=51 Score=12.93 Aligned_cols=40 Identities=20% Similarity=0.420 Sum_probs=19.9 Q ss_pred HHHHCC-CCEEEEEEEEEECCCCHHHHH---HHHHHHHHHHHHHHHH Q ss_conf 025280-333079999984677024555---5322001788765589 Q gi|254780504|r 202 ESLWNN-KYIDHVQITTAETIGVEDRVD---YYNNTGALRDMIQNHL 244 (492) Q Consensus 202 e~~WNr-~~I~~VqIt~~E~~gvegR~~---yyD~~GaiRDmvQNHl 244 (492) +|+|+. .-|..++|.=.|++| |-. -.--+|+.||++.|-+ T Consensus 993 rPvwd~~GRV~g~elIPL~ELG---RPRIDVvv~iSGiFRD~fp~~i 1036 (1318) T PRK12493 993 RPVPDELGRVNKVELIPLEELG---RPRIDVVVTCSGIFRDLFINQM 1036 (1318) T ss_pred CCCCCCCCCCCCEEEECHHHCC---CCCEEEEEEECCHHHHHHHHHH T ss_conf 4110688867733886689919---9985799960504476678999 No 99 >PRK01060 endonuclease IV; Provisional Probab=21.25 E-value=35 Score=14.12 Aligned_cols=21 Identities=33% Similarity=0.635 Sum_probs=17.3 Q ss_pred EEEEECCCCCCCCEEEEEECC Q ss_conf 999970858678577755133 Q gi|254780504|r 322 IKANIDNPRWSGIPFYLRTGK 342 (492) Q Consensus 322 ~~l~idn~RW~gVPF~lrtGK 342 (492) .+..++.+|++++|++|.|=+ T Consensus 242 f~~~~~~~~~~~iP~ILETP~ 262 (281) T PRK01060 242 LRYIVHDPRFDGIPKILETPY 262 (281) T ss_pred HHHHHCCHHHCCCCEEEECCC T ss_conf 999980952549988996899 No 100 >KOG3559 consensus Probab=21.18 E-value=40 Score=13.66 Aligned_cols=14 Identities=21% Similarity=0.268 Sum_probs=7.6 Q ss_pred CCCEEEEEEEEEEC Q ss_conf 03330799999846 Q gi|254780504|r 207 NKYIDHVQITTAET 220 (492) Q Consensus 207 r~~I~~VqIt~~E~ 220 (492) .++|.+|.-+..|. T Consensus 317 ~~fivSvnyVls~~ 330 (598) T KOG3559 317 PHFIVSVNYVLSEL 330 (598) T ss_pred CEEEEEEEEEEEHH T ss_conf 40577665652001 No 101 >COG1947 IspE 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase [Lipid metabolism] Probab=20.88 E-value=48 Score=13.09 Aligned_cols=26 Identities=27% Similarity=0.498 Sum_probs=20.0 Q ss_pred CCCEEEEEECCCCCCCEEEEEEEEEC Q ss_conf 78577755133468851799999854 Q gi|254780504|r 332 SGIPFYLRTGKYLAKHISEIVIALKP 357 (492) Q Consensus 332 ~gVPF~lrtGK~l~~k~~eI~i~FK~ 357 (492) ++|||.|..|-++++.+-|+...+-+ T Consensus 136 aDVPffl~g~tA~a~G~GE~l~~~~~ 161 (289) T COG1947 136 ADVPFFLSGGTAFAEGRGEKLEPLED 161 (289) T ss_pred CCCCEEEECCCEEEEECCCEEEECCC T ss_conf 98671431885698873635147777 No 102 >COG3564 Uncharacterized protein conserved in bacteria [Function unknown] Probab=20.74 E-value=53 Score=12.81 Aligned_cols=67 Identities=22% Similarity=0.400 Sum_probs=33.2 Q ss_pred EEEECCCCCCCCCEECCCCC-CCCCCCCCCCCCEEEEEEEECCCCCCCCEEEEEECCCCCCCEEEEEEEEECCCCCCCC Q ss_conf 36521366776873433410-1479886648743899999708586785777551334688517999998548864376 Q gi|254780504|r 287 TVRGQYQSGIINGIPVKGYL-EEIPLGISDTETFVAIKANIDNPRWSGIPFYLRTGKYLAKHISEIVIALKPAPCTIFD 364 (492) Q Consensus 287 ~v~GQY~~~~~~~~~v~gY~-~e~~v~~S~TeTfaa~~l~idn~RW~gVPF~lrtGK~l~~k~~eI~i~FK~~p~~~f~ 364 (492) +.+-|-...-.++.+. -|- .+--|.+-+ ..--.|+ ||||||..-.--.=|.|+..|---+--+.+|. T Consensus 27 vmFHQSGGCCDGSsPM-CYP~~~fivGd~D-----vlLG~i~-----gvPvyIs~~QyeaWKHTqLIIDVVpGRGGmFS 94 (116) T COG3564 27 VMFHQSGGCCDGSSPM-CYPRADFIVGDND-----VLLGEID-----GVPVYISGPQYEAWKHTQLIIDVVPGRGGMFS 94 (116) T ss_pred EEEECCCCCCCCCCCC-CCCCCCEEECCCC-----EEEEEEC-----CEEEEECCCHHHHHHCCEEEEEEECCCCCEEE T ss_conf 7883268845899874-0145445664773-----5886537-----87788647077521001789998659886457 No 103 >PRK02509 hypothetical protein; Provisional Probab=20.74 E-value=19 Score=15.98 Aligned_cols=44 Identities=11% Similarity=0.321 Sum_probs=32.9 Q ss_pred HHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 999999879833586989999845884699998873489874579889971889 Q gi|254780504|r 424 RLLMDIIHANQTLFMGYDEVEETWKWSDSILKSWQIINQKVDYYAAGTWGPDQS 477 (492) Q Consensus 424 ~Ll~D~i~Gd~tlF~r~dEve~sWrivdpiL~~~~~~~~~~~~Y~~GS~GP~~a 477 (492) +..-||.+|+-++++-.++ |||+++|+ +.-|..+.+-|..|.+- T Consensus 725 KavVDAYdG~V~~Y~~D~~--------DPil~tw~--kiFP~lfkp~s~mp~~L 768 (980) T PRK02509 725 KVVIDAYNGDVKFYIVDPN--------DPIIQTWS--KIFPQLFKPLSAMPPSL 768 (980) T ss_pred EEEEECCCCEEEEEECCCC--------CHHHHHHH--HHCCHHCCCHHHCCHHH T ss_conf 8999755880799970898--------85999999--86844039846689999 No 104 >pfam04076 BOF Bacterial OB fold (BOF) protein. Proteins in this family form an OB-fold. Analysis of the predicted binding site of BOF family proteins implies that they lack nucleic acid-binding properties. They contain an predicted N-terminal signal peptide which indicates that they localize in the periplasm where they may function to bind proteins, small molecules, or other typical OB-fold ligands. As hypothesized for the distantly related OB-fold containing bacterial enterotoxins, the loss of nucleotide-binding function and the rapid evolution of the BOF ligand-binding site may be associated with the presence of BOF proteins in mobile genetic elements and their potential role in bacterial pathogenicity. Probab=20.73 E-value=41 Score=13.62 Aligned_cols=15 Identities=40% Similarity=0.820 Sum_probs=8.9 Q ss_pred EEEEECCCCCCCCEE Q ss_conf 999970858678577 Q gi|254780504|r 322 IKANIDNPRWSGIPF 336 (492) Q Consensus 322 ~~l~idn~RW~gVPF 336 (492) +++.||+.+|.|.++ T Consensus 84 I~VeId~~~w~G~~v 98 (126) T pfam04076 84 IKVDIDDRVWNGQEV 98 (126) T ss_pred EEEEECHHHCCCCCC T ss_conf 999968888389817 No 105 >pfam03767 Acid_phosphat_B HAD superfamily, subfamily IIIB (Acid phosphatase). This family proteins includes acid phosphatases and a number of vegetative storage proteins. Probab=20.65 E-value=53 Score=12.80 Aligned_cols=100 Identities=15% Similarity=0.197 Sum_probs=51.3 Q ss_pred CCCHHHHHHHHHHCCEEECCCCCHHHHHHHHHHHCCCCCCCEEEEECCCHHH-HHHHHHHHHHHCCCCCCCEEEEECCCC Q ss_conf 7998999989851657864678977899888874057678469984069568-666777777725876443577605667 Q gi|254780504|r 76 EYNPLKVQKFLSLIFYINLDVEKNYGWELLGKLLDIDKTRIRVFYLAMSSAF-FGKISQRIHANNLVTEHTRVVLEKPIG 154 (492) Q Consensus 76 ~~~~~~~~~F~~~~~Y~~~d~~~~~~y~~L~~~l~~~~~~~rifYLAiPP~~-f~~i~~~L~~~~l~~~~~RiViEKPFG 154 (492) .+++..|++++..-.--.+ +...+.++.+.+ ....|||++=-+.- .....+||.+.|. .+|.++++-- .+ T Consensus 104 ~~~~~~~~~wv~~~~apai----pg~l~l~~~~~~---~Gvkif~lT~R~e~~r~~T~~NL~~~G~-~~w~~Lilr~-~~ 174 (230) T pfam03767 104 PFDPEKFDEWVNKGEAPAL----PGALELYNYLVE---LGVKIFFVSGRSEDLRAATVENLKKAGF-HGWEKLILRG-KK 174 (230) T ss_pred CCCHHHHHHHHHCCCCCCC----HHHHHHHHHHHH---CCCEEEEECCCCHHHHHHHHHHHHHCCC-CCCHHHHCCC-CC T ss_conf 5780658999961768677----359999999997---7927999728737888987999997599-9845663668-65 Q ss_pred CCHHHHHHHHHHHHHHCCHHHEEECCCHHHH Q ss_conf 8988999999999873896883512420125 Q gi|254780504|r 155 SNLSSAQEIHAITRKIFKESQIFRIDHYLGK 185 (492) Q Consensus 155 ~Dl~SA~eLn~~l~~~f~E~qiyRIDHYLGK 185 (492) ..-.||.+-...-.+. -+++=|||=-..|- T Consensus 175 ~~~~~~~~yKs~~R~~-l~~~GYrIvg~iGD 204 (230) T pfam03767 175 DSNKSATSYKSERRKK-LVKKGYNIVGNIGD 204 (230) T ss_pred CCCCCCCCCCHHHHHH-HHHCCCEEEEEECC T ss_conf 4688540000899999-99769879998678 No 106 >PRK08313 acetyl-CoA acetyltransferase; Provisional Probab=20.54 E-value=53 Score=12.78 Aligned_cols=45 Identities=16% Similarity=0.206 Sum_probs=34.6 Q ss_pred CCCEEEEEEECCCC----CHHHHHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 57817999988869----9889999999999874234679989999898 Q gi|254780504|r 42 QISSRIIGICRIKM----TVESYRSFVDKELQRYLKDGEYNPLKVQKFL 86 (492) Q Consensus 42 ~~~~~IiG~aR~~~----~~e~fr~~v~~~l~~~~~~~~~~~~~~~~F~ 86 (492) +....|||+|.+++ .+..+++.+.+++.....+...+...++.++ T Consensus 3 ~r~vAIvGvG~t~f~~~~~~~s~~~L~~eA~~~Al~DAGl~~~dID~~~ 51 (391) T PRK08313 3 KRLAAVLGTGQTKYVAKRQDVSMAGLVREAIDRALADAGLTFDDIDAVV 51 (391) T ss_pred CCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCEEE T ss_conf 8988999986588888999979999999999999997499989989999 No 107 >PRK11034 clpA ATP-dependent Clp protease ATP-binding subunit; Provisional Probab=20.47 E-value=53 Score=12.77 Aligned_cols=22 Identities=27% Similarity=0.404 Sum_probs=8.8 Q ss_pred HHHHHHCCCCCCCCCHHHHHHH Q ss_conf 9999987983358698999984 Q gi|254780504|r 425 LLMDIIHANQTLFMGYDEVEET 446 (492) Q Consensus 425 Ll~D~i~Gd~tlF~r~dEve~s 446 (492) +|.++++..+--.+=.||+|.+ T Consensus 548 ~Lte~Vr~~PysVvL~DEIEKA 569 (758) T PRK11034 548 LLTDAVIKHPHAVLLLDEIEKA 569 (758) T ss_pred CCCHHHHHCCCEEEEEHHHHHH T ss_conf 0128787398779973367563 No 108 >KOG0672 consensus Probab=20.38 E-value=53 Score=12.76 Aligned_cols=41 Identities=20% Similarity=0.194 Sum_probs=24.5 Q ss_pred CCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEE Q ss_conf 98987899864435667657799999997289875781799 Q gi|254780504|r 8 IHNSDFIIFGGTGDLAKRKLFPALYNNYTNREENQISSRII 48 (492) Q Consensus 8 ~~~~~lVIFGATGDLA~RKL~PALf~L~~~~~l~~~~~~Ii 48 (492) .+.---|+|||||-+|.-|+=+-.=.|+.-..-.....++| T Consensus 17 ~d~K~hvL~gaTGSvA~iK~~~li~kL~ei~G~dki~iqvv 57 (218) T KOG0672 17 DDGKFHVLLGATGSVAVIKLPLLIKKLEEIYGRDKISIQVV 57 (218) T ss_pred CCCCEEEEEEECCCCCEEEHHHHHHHHHHHCCCCCEEEEEE T ss_conf 78864699971363000025889999998617762369999 No 109 >pfam02576 DUF150 Uncharacterized BCR, YhbC family COG0779. Probab=20.31 E-value=54 Score=12.75 Aligned_cols=33 Identities=15% Similarity=0.294 Sum_probs=26.9 Q ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHCCHHHEE Q ss_conf 357760566789889999999998738968835 Q gi|254780504|r 145 TRVVLEKPIGSNLSSAQEIHAITRKIFKESQIF 177 (492) Q Consensus 145 ~RiViEKPFG~Dl~SA~eLn~~l~~~f~E~qiy 177 (492) .||.|+++=|-+++-+..+++.+...++++..+ T Consensus 27 l~I~iD~~~gv~iddc~~~Sr~i~~~Ld~~d~~ 59 (141) T pfam02576 27 LRIYIDKDGGVTLDDCEEVSRAISALLDVEDPI 59 (141) T ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCC T ss_conf 999998999978999999999998775126666 No 110 >COG2502 AsnA Asparagine synthetase A [Amino acid transport and metabolism] Probab=20.30 E-value=19 Score=16.00 Aligned_cols=161 Identities=17% Similarity=0.196 Sum_probs=69.7 Q ss_pred HHHCCCCCCCEEEEECCCHHHHHHHHH-------HHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCHHHEE-- Q ss_conf 874057678469984069568666777-------77772587644357760566789889999999998738968835-- Q gi|254780504|r 107 KLLDIDKTRIRVFYLAMSSAFFGKISQ-------RIHANNLVTEHTRVVLEKPIGSNLSSAQEIHAITRKIFKESQIF-- 177 (492) Q Consensus 107 ~~l~~~~~~~rifYLAiPP~~f~~i~~-------~L~~~~l~~~~~RiViEKPFG~Dl~SA~eLn~~l~~~f~E~qiy-- 177 (492) ..++..+...-+.-.++|-+.|+.|-. .|...+...+-.-.+==|-.-.|-+|-..+.+..-..|+=|.+- T Consensus 46 DnLsg~ekaVsv~vk~~p~a~fEvVhSLAKWKR~tL~r~~f~~~eGlythM~AlRpDeD~ld~~HSvYVDQWDWEkvi~~ 125 (330) T COG2502 46 DNLSGTEKAVSVKVKKLPDATFEVVHSLAKWKRHTLARYGFSAGEGLYTHMKALRPDEDRLDPIHSVYVDQWDWEKVIPD 125 (330) T ss_pred CCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCCCHHEEEECCCCHHHHCCC T ss_conf 55557514358887536873446768889998888986387678724652110378733445302278521136442587 Q ss_pred --ECCCHHHHHHHHHHH-HHHHCCCCCHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf --124201256877778-87747752002528033307999998467702455553220017887655899999998718 Q gi|254780504|r 178 --RIDHYLGKEAVQGLM-VFRFANTFYESLWNNKYIDHVQITTAETIGVEDRVDYYNNTGALRDMIQNHLLQLLCLVAME 254 (492) Q Consensus 178 --RIDHYLGKe~vqnil-~lRFaN~~~e~~WNr~~I~~VqIt~~E~~gvegR~~yyD~~GaiRDmvQNHllQlL~lvAMe 254 (492) |==-|| |++|..|. ++|-+-.-.+...+-.-+=--+||+--+--.+.| | T Consensus 126 g~rNl~yL-K~tV~kIY~~ir~te~av~~~~~~~~~LP~~ItFihseeL~~r--y------------------------- 177 (330) T COG2502 126 GDRNLAYL-KSTVEKIYAAIRETELAVSAEFGLAPFLPDQITFIHSQELVAR--Y------------------------- 177 (330) T ss_pred CCCCHHHH-HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEHHHHHHH--C------------------------- T ss_conf 54037999-9999999999999999998652876458761579636999976--8------------------------- Q ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECCCCCCCCCEECCCC Q ss_conf 544588589999999999730688811223613652136677687343341 Q gi|254780504|r 255 MPPSVEAKSIKNEKIKVLQALQMITPENVQKLTVRGQYQSGIINGIPVKGY 305 (492) Q Consensus 255 ~P~~~~~~~ir~eK~kvlk~~r~~~~~~v~~~~v~GQY~~~~~~~~~v~gY 305 (492) ..+++..=-++-.|-+.|+=.+. +=|--..|..++-..|.| T Consensus 178 --P~L~~k~RE~ai~Ke~gAvFlig--------IGg~LsdG~~hd~RaPdY 218 (330) T COG2502 178 --PDLDPKGRERAIAKELGAVFLIG--------IGGKLSDGKPHDVRAPDY 218 (330) T ss_pred --CCCCCCHHHHHHHHHHCCEEEEE--------CCCCCCCCCCCCCCCCCC T ss_conf --89993024678998638389984--------165069999577889986 No 111 >TIGR00587 nfo apurinic endonuclease (APN1); InterPro: IPR001719 DNA damaging agents such as the anti-tumour drugs bleomycin and neocarzinostatin or those that generate oxygen radicals produce a variety of lesions in DNA. Amongst these is base-loss which forms apurinic/apyrimidinic (AP) sites or strand breaks with atypical 3' termini. DNA repair at the AP sites is initiated by specific endonuclease cleavage of the phosphodiester backbone. Such endonucleases are also generally capable of removing blocking groups from the 3' terminus of DNA strand breaks. AP endonucleases can be classified into two families based on sequence similarity. Family 2 groups the enzymes listed below , . Bacterial endonuclease IV (3.1.21.2 from EC) (gene nfo). Mycobacterium leprae probable endonuclease (gene end) . Yeast apurinic endonuclease APN1 (4.2.99.18 from EC). Caenorhabditis elegans hypothetical protein APN-1 or T05H10.2. Escherichia coli endonuclease IV and its Saccharomyces cerevisiae homologue Apn1 have been shown to be transition metalloproteins that require zinc and manganese for activity .; GO: 0003677 DNA binding, 0004519 endonuclease activity, 0006281 DNA repair, 0005622 intracellular. Probab=20.27 E-value=9.4 Score=18.17 Aligned_cols=23 Identities=35% Similarity=0.566 Sum_probs=18.7 Q ss_pred EEEEEEEECCCCCCCCEEEEEEC Q ss_conf 38999997085867857775513 Q gi|254780504|r 319 FVAIKANIDNPRWSGIPFYLRTG 341 (492) Q Consensus 319 faa~~l~idn~RW~gVPF~lrtG 341 (492) |.+.+..+..+||+|.|.||-|= T Consensus 271 ~d~f~~~~~d~~~~~iP~~LETP 293 (318) T TIGR00587 271 FDAFRLLMQDERLKGIPIILETP 293 (318) T ss_pred HHHHHHHHCCCCCCCCCCEEECC T ss_conf 68899974373101588366468 Done!