Query gi|254780848|ref|YP_003065261.1| co-chaperonin GroES [Candidatus Liberibacter asiaticus str. psy62] Match_columns 111 No_of_seqs 103 out of 1601 Neff 6.2 Searched_HMMs 39220 Date Sun May 29 23:42:04 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780848.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG0234 GroS Co-chaperonin Gro 100.0 1.3E-33 3.2E-38 207.4 10.3 96 7-106 1-96 (96) 2 PRK00364 groES co-chaperonin G 100.0 2.5E-32 6.3E-37 200.2 10.8 95 7-105 1-95 (95) 3 cd00320 cpn10 Chaperonin 10 Kd 100.0 7E-30 1.8E-34 186.6 10.0 93 8-104 1-93 (93) 4 pfam00166 Cpn10 Chaperonin 10 100.0 3.3E-29 8.4E-34 182.8 10.1 92 8-104 1-92 (92) 5 KOG1641 consensus 99.9 8.1E-27 2.1E-31 169.5 7.3 100 2-104 4-103 (104) 6 smart00829 PKS_ER Enoylreducta 87.3 1.1 2.9E-05 24.1 4.5 42 46-103 32-73 (288) 7 KOG1197 consensus 86.3 0.56 1.4E-05 25.8 2.5 28 45-81 72-99 (336) 8 COG1062 AdhC Zn-dependent alco 82.0 1.2 3E-05 24.1 2.6 23 46-77 64-86 (366) 9 TIGR01751 crot-CoA-red crotony 80.5 1.5 3.9E-05 23.4 2.7 48 46-102 93-142 (409) 10 PRK10754 quinone oxidoreductas 80.0 4.1 0.0001 21.0 4.8 76 8-102 23-108 (327) 11 TIGR03366 HpnZ_proposed putati 78.0 2.9 7.3E-05 21.9 3.5 29 46-78 7-36 (280) 12 KOG0022 consensus 76.6 4 0.0001 21.0 4.0 21 46-75 70-90 (375) 13 KOG0025 consensus 75.7 4.8 0.00012 20.6 4.1 63 10-84 45-116 (354) 14 TIGR03451 mycoS_dep_FDH mycoth 74.5 3.1 7.9E-05 21.7 2.9 23 46-77 63-85 (358) 15 COG0604 Qor NADPH:quinone redu 71.1 6.4 0.00016 19.9 3.9 75 10-102 24-110 (326) 16 TIGR02817 adh_fam_1 zinc-bindi 70.4 3.1 7.9E-05 21.7 2.1 58 10-78 27-92 (338) 17 PRK05396 tdh L-threonine 3-deh 70.2 4 0.0001 21.0 2.7 23 46-77 66-88 (341) 18 PRK10083 putative dehydrogenas 70.0 4.4 0.00011 20.8 2.8 24 46-78 62-85 (339) 19 PRK09422 alcohol dehydrogenase 69.7 4 0.0001 21.0 2.6 23 46-77 62-84 (338) 20 PRK10309 galactitol-1-phosphat 69.1 4.4 0.00011 20.8 2.7 46 46-103 62-131 (347) 21 pfam08240 ADH_N Alcohol dehydr 67.2 5.1 0.00013 20.5 2.7 25 46-79 39-63 (108) 22 TIGR00692 tdh L-threonine 3-de 67.1 4.5 0.00011 20.8 2.4 80 13-103 23-113 (341) 23 COG1064 AdhP Zn-dependent alco 63.6 7.3 0.00019 19.6 2.9 22 46-76 66-87 (339) 24 PRK13771 putative alcohol dehy 62.7 7.7 0.0002 19.5 2.9 54 10-76 22-84 (332) 25 KOG3209 consensus 62.5 4.5 0.00012 20.8 1.7 70 15-106 756-825 (984) 26 cd02558 PSRA_1 PSRA_1: Pseudou 61.7 8.8 0.00022 19.2 3.0 18 17-37 85-102 (246) 27 TIGR02824 quinone_pig3 putativ 60.8 6 0.00015 20.1 2.1 85 7-101 21-111 (334) 28 TIGR01525 ATPase-IB_hvy heavy 59.2 11 0.00029 18.5 3.3 31 45-75 90-137 (639) 29 TIGR01512 ATPase-IB2_Cd cadmiu 57.7 12 0.00031 18.4 3.2 23 53-75 92-117 (494) 30 PRK10671 copA copper exporting 53.1 19 0.0005 17.2 3.7 34 46-84 358-394 (834) 31 pfam01055 Glyco_hydro_31 Glyco 51.7 18 0.00047 17.4 3.3 60 11-73 371-434 (435) 32 KOG0024 consensus 48.9 8.5 0.00022 19.2 1.2 55 12-78 28-93 (354) 33 KOG2107 consensus 48.0 23 0.00058 16.9 3.3 41 46-86 95-138 (179) 34 cd05792 S1_eIF1AD_like S1_eIF1 47.0 19 0.00049 17.3 2.8 44 49-100 25-69 (78) 35 COG3264 Small-conductance mech 46.7 17 0.00044 17.5 2.5 16 67-82 660-675 (835) 36 pfam10794 DUF2606 Protein of u 46.7 25 0.00063 16.7 5.6 56 41-104 72-127 (131) 37 pfam08140 Cuticle_1 Crustacean 46.3 19 0.00049 17.3 2.7 34 29-63 3-37 (40) 38 TIGR02178 yeiP elongation fact 45.5 13 0.00034 18.2 1.8 69 13-99 15-87 (187) 39 pfam10377 ATG11 Autophagy-rela 44.5 27 0.00068 16.5 3.3 36 67-102 42-83 (128) 40 TIGR01852 lipid_A_lpxA acyl-[a 43.4 28 0.00071 16.4 4.8 46 47-95 99-146 (257) 41 PRK06763 F0F1 ATP synthase sub 42.7 17 0.00043 17.6 1.9 25 30-56 107-131 (213) 42 TIGR02754 sod_Ni_protease nick 42.5 29 0.00074 16.3 5.7 85 2-105 4-89 (90) 43 cd06814 PLPDE_III_DSD_D-TA_lik 42.1 25 0.00064 16.6 2.8 36 67-104 339-375 (379) 44 PRK11033 zntA zinc/cadmium/mer 42.1 26 0.00066 16.6 2.8 34 46-84 275-311 (739) 45 KOG2241 consensus 41.7 26 0.00066 16.6 2.8 44 39-83 159-204 (255) 46 TIGR03201 dearomat_had 6-hydro 39.9 26 0.00067 16.5 2.6 23 46-78 62-84 (349) 47 TIGR00575 dnlj DNA ligase, NAD 39.4 32 0.00082 16.0 3.8 47 31-81 341-388 (706) 48 TIGR00293 TIGR00293 prefoldin, 39.4 27 0.00068 16.5 2.6 37 11-59 47-84 (132) 49 PRK02268 hypothetical protein; 38.2 34 0.00086 15.9 4.8 33 60-95 27-60 (141) 50 cd04466 S1_YloQ_GTPase S1_YloQ 36.9 36 0.00091 15.8 4.2 14 67-80 37-50 (68) 51 pfam07602 DUF1565 Protein of u 36.2 32 0.00082 16.0 2.6 30 46-75 27-57 (255) 52 COG4384 Mu-like prophage prote 35.9 37 0.00094 15.7 5.3 24 59-82 87-110 (203) 53 pfam01079 Hint Hint module. Th 35.1 38 0.00097 15.6 4.9 17 65-81 103-119 (214) 54 TIGR01393 lepA GTP-binding pro 34.6 39 0.00099 15.6 2.8 64 13-83 221-285 (598) 55 COG1329 Transcriptional regula 34.0 28 0.00071 16.4 2.0 40 66-105 3-51 (166) 56 TIGR02390 RNA_pol_rpoA1 DNA-di 33.4 41 0.001 15.5 3.2 28 67-94 432-467 (901) 57 cd03699 lepA_II lepA_II: This 30.8 39 0.001 15.5 2.3 38 31-75 48-85 (86) 58 pfam06890 Phage_Mu_Gp45 Bacter 30.6 45 0.0012 15.2 4.0 35 45-86 58-92 (162) 59 COG2217 ZntA Cation transport 29.9 47 0.0012 15.1 3.1 29 46-75 247-278 (713) 60 pfam01959 DHQS 3-dehydroquinat 28.1 34 0.00087 15.9 1.6 19 59-77 305-324 (343) 61 TIGR02272 gentisate_1_2 gentis 28.0 37 0.00095 15.7 1.8 46 32-79 88-136 (343) 62 PRK13211 N-acetylglucosamine-b 28.0 34 0.00088 15.9 1.6 26 55-80 199-225 (475) 63 TIGR01511 ATPase-IB1_Cu copper 27.6 51 0.0013 14.9 3.9 63 7-75 75-142 (545) 64 COG2140 Thermophilic glucose-6 26.3 55 0.0014 14.7 4.1 33 58-91 120-152 (209) 65 TIGR01073 pcrA ATP-dependent D 25.8 37 0.00095 15.7 1.4 22 66-87 760-782 (811) 66 TIGR02228 sigpep_I_arch signal 25.5 53 0.0014 14.8 2.2 61 38-105 45-110 (175) 67 pfam02559 CarD_TRCF CarD-like/ 24.9 34 0.00088 15.9 1.1 13 68-80 2-14 (98) 68 PRK10838 spr putative outer me 24.8 17 0.00044 17.5 -0.4 34 31-79 107-140 (188) 69 COG0298 HypC Hydrogenase matur 24.3 60 0.0015 14.5 2.4 17 62-78 33-49 (82) 70 smart00696 DM9 Repeats found i 23.9 61 0.0015 14.5 3.9 52 34-87 12-63 (71) 71 cd04456 S1_IF1A_like S1_IF1A_l 23.4 46 0.0012 15.2 1.6 16 65-80 36-51 (78) 72 PRK10413 hydrogenase 2 accesso 23.4 62 0.0016 14.4 3.4 12 67-78 42-53 (82) 73 smart00652 eIF1a eukaryotic tr 22.6 57 0.0014 14.7 1.9 30 49-83 30-59 (83) 74 TIGR03404 bicupin_oxalic bicup 22.6 64 0.0016 14.3 4.4 62 25-86 224-311 (367) 75 PRK13914 invasion associated s 22.5 53 0.0014 14.8 1.7 13 67-80 420-432 (475) 76 TIGR02922 TIGR02922 conserved 22.2 66 0.0017 14.3 3.2 38 26-75 31-68 (69) 77 TIGR01941 nqrF NADH:ubiquinone 21.9 67 0.0017 14.3 3.4 28 67-95 261-289 (425) 78 PRK02290 3-dehydroquinate synt 20.9 59 0.0015 14.6 1.7 19 59-77 309-328 (347) 79 PRK04012 translation initiatio 20.3 59 0.0015 14.6 1.6 15 66-80 57-71 (99) 80 TIGR02818 adh_III_F_hyde S-(hy 20.2 72 0.0018 14.1 2.3 58 10-79 23-87 (368) 81 TIGR00074 hypC_hupF hydrogenas 20.2 47 0.0012 15.1 1.1 18 67-84 44-61 (88) 82 pfam03079 ARD ARD/ARD' family. 20.1 73 0.0019 14.0 5.6 43 46-88 94-139 (157) No 1 >COG0234 GroS Co-chaperonin GroES (HSP10) [Posttranslational modification, protein turnover, chaperones] Probab=100.00 E-value=1.3e-33 Score=207.41 Aligned_cols=96 Identities=48% Similarity=0.847 Sum_probs=90.7 Q ss_pred CCCCCCCCEEEEEECCCCCCCCCCEEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCCCEEEE Q ss_conf 24251784799997475421225579945765546530247999823870867887902203137889970568608897 Q gi|254780848|r 7 NYLRPTRGRVVVRRLQSEIKTATGNILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSGTEIKL 86 (111) Q Consensus 7 ~~ikPl~drVLVk~~~~e~~T~~GgI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g~~v~~ 86 (111) |+|+||+|||||++.+.|++|++ ||+||++++|++ ++|+|||||+|..+++|+.+|++||+||+|+|++|+|++++. T Consensus 1 m~ikPL~DRVlVk~~e~EekT~g-GIvlpdsakeK~--~~g~VvAVG~G~~~~~g~~~~~~VkvGD~Vlf~ky~G~evk~ 77 (96) T COG0234 1 MKIKPLGDRVLVKRVEEEEKTAG-GIVLPDSAKEKP--QEGEVVAVGPGRRDENGELVPLDVKVGDRVLFGKYAGTEVKI 77 (96) T ss_pred CCCEECCCEEEEEECHHHCCCCC-CEEECCCCCCCC--CCEEEEEECCCCCCCCCCEECCCCCCCCEEEECCCCCCEEEE T ss_conf 97265677799998034021057-489568522477--631899972661068997815222339999987547708987 Q ss_pred CCCCEEEEEEHHHEEEEEEC Q ss_conf 69977999853017899845 Q gi|254780848|r 87 NDGEEYLVMQESDIMGIVVE 106 (111) Q Consensus 87 ~dg~~y~i~~e~dIlavi~~ 106 (111) ||++|+||+++||||++++ T Consensus 78 -dgeeylil~e~DILAiv~~ 96 (96) T COG0234 78 -DGEEYLILSESDILAIVEK 96 (96) T ss_pred -CCEEEEEECHHHEEEEECC T ss_conf -9968999645764677339 No 2 >PRK00364 groES co-chaperonin GroES; Reviewed Probab=99.98 E-value=2.5e-32 Score=200.19 Aligned_cols=95 Identities=49% Similarity=0.864 Sum_probs=89.2 Q ss_pred CCCCCCCCEEEEEECCCCCCCCCCEEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCCCEEEE Q ss_conf 24251784799997475421225579945765546530247999823870867887902203137889970568608897 Q gi|254780848|r 7 NYLRPTRGRVVVRRLQSEIKTATGNILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSGTEIKL 86 (111) Q Consensus 7 ~~ikPl~drVLVk~~~~e~~T~~GgI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g~~v~~ 86 (111) |+|+||||||||++.+++++|+ |||+||+++++++ +.|+|||+|||+++++|+++|+++++||+|+|++|+|++|++ T Consensus 1 MkikPlgdrVLVk~~e~e~kT~-gGIiLpd~~~ek~--~~G~VvavG~g~~~~~g~~~~~~vk~GD~V~~~~~~g~~v~~ 77 (95) T PRK00364 1 MNIKPLGDRVLVKRLEEEEKTA-GGIVLPDSAKEKP--QEGEVVAVGPGRRLENGELVPLDVKVGDKVLFGKYAGTEVKI 77 (95) T ss_pred CCCEECCCEEEEEECCCCCEEE-EEEEECCCCCCCC--CEEEEEEECCCEECCCCCCCCCCCCCCCEEEECCCCCEEEEE T ss_conf 9722869889999854365341-1399886436674--078999968837978888967877879999985768759998 Q ss_pred CCCCEEEEEEHHHEEEEEE Q ss_conf 6997799985301789984 Q gi|254780848|r 87 NDGEEYLVMQESDIMGIVV 105 (111) Q Consensus 87 ~dg~~y~i~~e~dIlavi~ 105 (111) ||++|++++++||+|+++ T Consensus 78 -~g~~y~li~~~dIlavie 95 (95) T PRK00364 78 -DGEEYLILRESDILAIVE 95 (95) T ss_pred -CCEEEEEEEHHHEEEEEC T ss_conf -992899999799699949 No 3 >cd00320 cpn10 Chaperonin 10 Kd subunit (cpn10 or GroES); Cpn10 cooperates with chaperonin 60 (cpn60 or GroEL), an ATPase, to assist the folding and assembly of proteins and is found in eubacterial cytosol, as well as in the matrix of mitochondria and chloroplasts. It forms heptameric rings with a dome-like structure, forming a lid to the large cavity of the tetradecameric cpn60 cylinder and thereby tightly regulating release and binding of proteins to the cpn60 surface. Probab=99.96 E-value=7e-30 Score=186.57 Aligned_cols=93 Identities=45% Similarity=0.848 Sum_probs=86.7 Q ss_pred CCCCCCCEEEEEECCCCCCCCCCEEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCCCEEEEC Q ss_conf 42517847999974754212255799457655465302479998238708678879022031378899705686088976 Q gi|254780848|r 8 YLRPTRGRVVVRRLQSEIKTATGNILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSGTEIKLN 87 (111) Q Consensus 8 ~ikPl~drVLVk~~~~e~~T~~GgI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g~~v~~~ 87 (111) +|+||||||||++.+++++|+ |||+||+++++++ .+|+|||||||+++++|+..|++|++||+|+|++|+|+++++ T Consensus 1 kikPlgdrVLV~~~~~e~~T~-gGIiLp~~~~e~~--~~G~VvavG~g~~~~~g~~~~~~vk~GD~Vl~~~~~g~~i~~- 76 (93) T cd00320 1 KIKPLGDRVLVKRIEAEEKTK-GGIILPDSAKEKP--QEGKVVAVGPGRRNENGERVPLSVKVGDKVLFPKYAGTEVKL- 76 (93) T ss_pred CCEECCCEEEEEECCCCCEEE-EEEEECCCCCCCC--EEEEEEEECCCEECCCCCCCCCCCCCCCEEEECCCCCEEEEE- T ss_conf 941348889998854255363-1599862006475--032899978955947787987766579999986768609999- Q ss_pred CCCEEEEEEHHHEEEEE Q ss_conf 99779998530178998 Q gi|254780848|r 88 DGEEYLVMQESDIMGIV 104 (111) Q Consensus 88 dg~~y~i~~e~dIlavi 104 (111) ||++|++++++||+|++ T Consensus 77 ~~~~y~ii~~~dIlavi 93 (93) T cd00320 77 DGEEYLILRESDILAVI 93 (93) T ss_pred CCEEEEEEEHHHEEEEC T ss_conf 99999999978969979 No 4 >pfam00166 Cpn10 Chaperonin 10 Kd subunit. This family contains GroES and Gp31-like chaperonins. Gp31 is a functional co-chaperonin that is required for the folding and assembly of Gp23, a major capsid protein, during phage morphogenesis. Probab=99.96 E-value=3.3e-29 Score=182.83 Aligned_cols=92 Identities=46% Similarity=0.864 Sum_probs=84.9 Q ss_pred CCCCCCCEEEEEECCCCCCCCCCEEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCCCEEEEC Q ss_conf 42517847999974754212255799457655465302479998238708678879022031378899705686088976 Q gi|254780848|r 8 YLRPTRGRVVVRRLQSEIKTATGNILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSGTEIKLN 87 (111) Q Consensus 8 ~ikPl~drVLVk~~~~e~~T~~GgI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g~~v~~~ 87 (111) +|+||||||||++.+.+++|. |||+||+++++++ ++|+|||||||+++ +|+.+|++|++||+|+|++|+|+++++ T Consensus 1 kikPlgdrVLV~~~~~e~kT~-gGIiLp~~~~e~~--~~G~VvavG~g~~~-~g~~~p~~vk~GD~Vl~~~~~g~~i~~- 75 (92) T pfam00166 1 KIKPLGDRVLVKRVEEEEKTK-GGIILPDSAKEKP--NQGEVVAVGPGARN-NGERVPLEVKVGDKVLFPKYAGTEVKI- 75 (92) T ss_pred CCEECCCEEEEEECCCCCEEE-EEEEECCCCCCCC--EEEEEEEECCCCCC-CCCCCCCCCCCCCEEEECCCCCEEEEE- T ss_conf 943248889999864165120-0499865236564--28999998796244-997815755419999987758759999- Q ss_pred CCCEEEEEEHHHEEEEE Q ss_conf 99779998530178998 Q gi|254780848|r 88 DGEEYLVMQESDIMGIV 104 (111) Q Consensus 88 dg~~y~i~~e~dIlavi 104 (111) ||++|++++++||+|++ T Consensus 76 ~~~~y~ii~~~dIlavi 92 (92) T pfam00166 76 DGEEYLILKEKDILAVI 92 (92) T ss_pred CCEEEEEEEHHHEEEEC T ss_conf 99999999978969989 No 5 >KOG1641 consensus Probab=99.94 E-value=8.1e-27 Score=169.52 Aligned_cols=100 Identities=32% Similarity=0.553 Sum_probs=90.5 Q ss_pred CCCCCCCCCCCCCEEEEEECCCCCCCCCCEEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCC Q ss_conf 86523242517847999974754212255799457655465302479998238708678879022031378899705686 Q gi|254780848|r 2 VGEHKNYLRPTRGRVVVRRLQSEIKTATGNILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSG 81 (111) Q Consensus 2 ~~~~~~~ikPl~drVLVk~~~~e~~T~~GgI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g 81 (111) ++....++.|+.|||||++.+++++|+ |||+||+++++| ..+|+|+|||||.++..|+.+|.+|++||+|||++|+| T Consensus 4 ~~~~~kk~vPl~DRVLVqr~~a~~KT~-gGilLPEks~~K--~~~g~VvavGpG~~~~~G~~v~~~Vk~Gd~VLlpeygG 80 (104) T KOG1641 4 TSWEIKKVVPLLDRVLVQRIEAPTKTA-GGILLPEKSVGK--LLQGTVVAVGPGSRDKGGEIVPVSVKVGDRVLLPEYGG 80 (104) T ss_pred HHHHHHHCCCCCCEEEEEEEECCCCCC-CEEEECCCCCCC--CCEEEEEEECCCCCCCCCCCCCCCCCCCCEEEEECCCC T ss_conf 214365401410102456430354114-406841334465--21489999747742678879375116798788513487 Q ss_pred CEEEECCCCEEEEEEHHHEEEEE Q ss_conf 08897699779998530178998 Q gi|254780848|r 82 TEIKLNDGEEYLVMQESDIMGIV 104 (111) Q Consensus 82 ~~v~~~dg~~y~i~~e~dIlavi 104 (111) ++|+..|..+|.++++.|+|+++ T Consensus 81 t~V~l~~~~~~~~fr~e~~l~~~ 103 (104) T KOG1641 81 TKVKLGDEDEYHLFRDEDDLLAI 103 (104) T ss_pred CEEECCCCCEEEEECCHHHHHHH T ss_conf 17862677216896242454320 No 6 >smart00829 PKS_ER Enoylreductase. Enoylreductase in Polyketide synthases. Probab=87.26 E-value=1.1 Score=24.08 Aligned_cols=42 Identities=24% Similarity=0.265 Sum_probs=28.7 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEEEECCCCCEEEECCCCEEEEEEHHHEEEE Q ss_conf 4799982387086788790220313788997056860889769977999853017899 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSGTEIKLNDGEEYLVMQESDIMGI 103 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g~~v~~~dg~~y~i~~e~dIlav 103 (111) .|+|+++|++.. .+++||+|+.--.++ -.+|.+++++.++-+ T Consensus 32 ~G~V~~vG~~V~---------~~~vGdrV~~~~~G~-------~aey~~v~~~~~~~i 73 (288) T smart00829 32 AGVVTRVGPGVT---------GLAVGDRVMGLAPGS-------FATYVRTDARLVVPI 73 (288) T ss_pred EEEEEEECCCCC---------CCCCCCEEEEECCCC-------CCCEEEECHHHEEEC T ss_conf 999999999999---------999999999987985-------617699763688997 No 7 >KOG1197 consensus Probab=86.35 E-value=0.56 Score=25.82 Aligned_cols=28 Identities=36% Similarity=0.476 Sum_probs=21.3 Q ss_pred EEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCC Q ss_conf 2479998238708678879022031378899705686 Q gi|254780848|r 45 SSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSG 81 (111) Q Consensus 45 ~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g 81 (111) ..|.|+|||.|..+ -++||+|.|-.-.| T Consensus 72 aaGvVvAvG~gvtd---------rkvGDrVayl~~~g 99 (336) T KOG1197 72 AAGVVVAVGEGVTD---------RKVGDRVAYLNPFG 99 (336) T ss_pred CCEEEEEECCCCCC---------CCCCCEEEEECCCH T ss_conf 42279995377555---------40255789952403 No 8 >COG1062 AdhC Zn-dependent alcohol dehydrogenases, class III [Energy production and conversion] Probab=82.03 E-value=1.2 Score=24.05 Aligned_cols=23 Identities=39% Similarity=0.550 Sum_probs=18.9 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEEEE Q ss_conf 47999823870867887902203137889970 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFG 77 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~ 77 (111) .|.|.+||+|.. +||+||+|+.. T Consensus 64 AGiVe~VG~gVt---------~vkpGDhVI~~ 86 (366) T COG1062 64 AGIVEAVGEGVT---------SVKPGDHVILL 86 (366) T ss_pred CCEEEEECCCCC---------CCCCCCEEEEE T ss_conf 417998469865---------52889989970 No 9 >TIGR01751 crot-CoA-red crotonyl-CoA reductase; InterPro: IPR010085 Crotonyl-CoA reductase is responsible for the conversion of crotonyl-CoA to butyryl-CoA. In serine cycle methylotrophic bacteria this enzyme is involved in the process of acetyl-CoA to glyoxylate , . In other bacteria the enzyme is used to produce butyrate for incorporation into polyketides such as tylosin from Streptomyces fradiae and coronatine from Pseudomonas syringae .. Probab=80.53 E-value=1.5 Score=23.39 Aligned_cols=48 Identities=29% Similarity=0.372 Sum_probs=31.0 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEEEEC--CCCCEEEECCCCEEEEEEHHHEEE Q ss_conf 479998238708678879022031378899705--686088976997799985301789 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGK--WSGTEIKLNDGEEYLVMQESDIMG 102 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~--~~g~~v~~~dg~~y~i~~e~dIla 102 (111) .|.|.+||+|.. .+++||.|+.+= +..++...++|-+=.|-.+..|-+ T Consensus 93 sGvV~~VG~gV~---------~w~~GDeVV~sC~q~d~~~~~~~~g~DpM~~~~QrIWG 142 (409) T TIGR01751 93 SGVVWRVGEGVT---------RWKVGDEVVVSCLQVDETAPESRAGADPMLSSEQRIWG 142 (409) T ss_pred CCEEEEECCCCC---------EECCCCEEEEEECCCCCCCHHHCCCCCCCCCCCCEEEE T ss_conf 612788458741---------52488868996056664346562688888898733877 No 10 >PRK10754 quinone oxidoreductase, NADPH-dependent; Provisional Probab=80.00 E-value=4.1 Score=21.02 Aligned_cols=76 Identities=20% Similarity=0.300 Sum_probs=45.2 Q ss_pred CCCCCCCEEEEEECCC-----CCCCCCCEEEECCCCCCCCC----CEEEEEEEECCCEECCCCCEECCEECCCCEEEEEC Q ss_conf 4251784799997475-----42122557994576554653----02479998238708678879022031378899705 Q gi|254780848|r 8 YLRPTRGRVVVRRLQS-----EIKTATGNILIPDTVSEKPS----ASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGK 78 (111) Q Consensus 8 ~ikPl~drVLVk~~~~-----e~~T~~GgI~lp~~~~ek~~----~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~ 78 (111) .-+|-.+.||||..-. .-....| .+-|. .-|. -..|+|+++|++.. ..++||+|+|.. T Consensus 23 ~p~p~~~eVlVkV~a~gin~~D~~~r~G-~~~~~---~~P~~~G~E~~G~V~~vG~~V~---------~~~vGdrV~~~~ 89 (327) T PRK10754 23 PADPAENEIQVENKAIGINFIDTYIRSG-LYPPP---SLPSGLGTEAAGIVSKVGSGVK---------HIKVGDRVVYAQ 89 (327) T ss_pred CCCCCCCEEEEEEEEECCCHHHHHHHCC-CCCCC---CCCCCCCCEEEEEEEEECCCCC---------CCCCCCEEEECC T ss_conf 9999999899999999279899999789-99987---6787476326999999779986---------478999886666 Q ss_pred CC-CCEEEECCCCEEEEEEHHHEEE Q ss_conf 68-6088976997799985301789 Q gi|254780848|r 79 WS-GTEIKLNDGEEYLVMQESDIMG 102 (111) Q Consensus 79 ~~-g~~v~~~dg~~y~i~~e~dIla 102 (111) .. |. --+|..+.++.+.- T Consensus 90 ~~~G~------~aey~~v~~~~~~~ 108 (327) T PRK10754 90 SALGA------YSSVHNVPADKAAI 108 (327) T ss_pred CCCCC------CEEEEEECHHHEEE T ss_conf 78736------54699961799387 No 11 >TIGR03366 HpnZ_proposed putative phosphonate catabolism associated alcohol dehydrogenase. This clade of zinc-binding alcohol dehydrogenases (members of pfam00107) are repeatedly associated with genes proposed to be involved with the catabolism of phosphonate compounds. Probab=78.01 E-value=2.9 Score=21.87 Aligned_cols=29 Identities=21% Similarity=0.402 Sum_probs=20.5 Q ss_pred EEEEEEECCCEEC-CCCCEECCEECCCCEEEEEC Q ss_conf 4799982387086-78879022031378899705 Q gi|254780848|r 46 SGEIMWVGAGVMD-QSGKVIEPEVSKGDIVLFGK 78 (111) Q Consensus 46 ~G~VvavG~g~~~-~~G~~~p~~vkvGD~Vl~~~ 78 (111) .|+|+++|++... ..|+ .+++||+|.... T Consensus 7 ~G~V~~vG~~V~~~~~g~----~~~vGdrV~~~~ 36 (280) T TIGR03366 7 VGEVVALRGGFTPADDGV----PLRLGQRVVWSV 36 (280) T ss_pred EEEEEEECCCCCCCCCCC----CCCCCCEEEECC T ss_conf 799999899986575698----556899998897 No 12 >KOG0022 consensus Probab=76.64 E-value=4 Score=21.03 Aligned_cols=21 Identities=43% Similarity=0.640 Sum_probs=16.7 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEE Q ss_conf 479998238708678879022031378899 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVL 75 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl 75 (111) .|.|.+||+|.. .+++||+|+ T Consensus 70 aGIVESvGegV~---------~vk~GD~Vi 90 (375) T KOG0022 70 AGIVESVGEGVT---------TVKPGDHVI 90 (375) T ss_pred EEEEEEECCCCC---------CCCCCCEEE T ss_conf 047887257745---------258999986 No 13 >KOG0025 consensus Probab=75.74 E-value=4.8 Score=20.61 Aligned_cols=63 Identities=29% Similarity=0.349 Sum_probs=32.9 Q ss_pred CCCCCEEEEEECCCCCCC----CCCEEEECCCCCCCCC----CEEEEEEEECCCEECCCCCEECCEECCCCEEEEEC-CC Q ss_conf 517847999974754212----2557994576554653----02479998238708678879022031378899705-68 Q gi|254780848|r 10 RPTRGRVVVRRLQSEIKT----ATGNILIPDTVSEKPS----ASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGK-WS 80 (111) Q Consensus 10 kPl~drVLVk~~~~e~~T----~~GgI~lp~~~~ek~~----~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~-~~ 80 (111) .|..| ||||-..+...- .--|.|.-- -+.|. --.|+||+||.... | .|+||||+-.. .. T Consensus 45 ~~~s~-v~Vk~LAaPINPsDIN~IQGvYpvr--P~~PAVgGnEGv~eVv~vGs~vk---g------fk~Gd~VIp~~a~l 112 (354) T KOG0025 45 VPGSD-VLVKMLAAPINPSDINQIQGVYPVR--PELPAVGGNEGVGEVVAVGSNVK---G------FKPGDWVIPLSANL 112 (354) T ss_pred CCCCC-EEEEEEECCCCHHHHHHHCCCCCCC--CCCCCCCCCCCEEEEEEECCCCC---C------CCCCCEEEECCCCC T ss_conf 98776-2665530678867753333655778--98774247765489998368767---6------68777575558888 Q ss_pred CCEE Q ss_conf 6088 Q gi|254780848|r 81 GTEI 84 (111) Q Consensus 81 g~~v 84 (111) |++- T Consensus 113 GtW~ 116 (354) T KOG0025 113 GTWR 116 (354) T ss_pred CCCE T ss_conf 6324 No 14 >TIGR03451 mycoS_dep_FDH mycothiol-dependent formaldehyde dehydrogenase. Members of this protein family are mycothiol-dependent formaldehyde dehydrogenase (EC 1.2.1.66). This protein is found, so far, only in the Actinobacteria (Mycobacterium sp., Streptomyces sp., Corynebacterium sp., and related species), where mycothione replaces glutathione. Probab=74.47 E-value=3.1 Score=21.67 Aligned_cols=23 Identities=39% Similarity=0.689 Sum_probs=18.3 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEEEE Q ss_conf 47999823870867887902203137889970 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFG 77 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~ 77 (111) .|+|+++|++.. .+++||+|... T Consensus 63 ~G~V~evG~~V~---------~~kvGDrV~v~ 85 (358) T TIGR03451 63 AGVVEAVGEGVT---------DVAPGDYVVLN 85 (358) T ss_pred EEEEEEECCCCC---------CCCCCCEEEEC T ss_conf 999999899998---------77779989976 No 15 >COG0604 Qor NADPH:quinone reductase and related Zn-dependent oxidoreductases [Energy production and conversion / General function prediction only] Probab=71.11 E-value=6.4 Score=19.93 Aligned_cols=75 Identities=27% Similarity=0.351 Sum_probs=40.9 Q ss_pred CCCCCEEEEEECCC-----CCCCCCCEEEECCCCCCCCC----CEEEEEEEECCCEECCCCCEECCEECCCCEEEEEC-C Q ss_conf 51784799997475-----42122557994576554653----02479998238708678879022031378899705-6 Q gi|254780848|r 10 RPTRGRVVVRRLQS-----EIKTATGNILIPDTVSEKPS----ASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGK-W 79 (111) Q Consensus 10 kPl~drVLVk~~~~-----e~~T~~GgI~lp~~~~ek~~----~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~-~ 79 (111) +|-.+.|||+.... ......|. ..|. .+.|. -..|+|+++|++... .++||+|++.. . T Consensus 24 ~p~~geVlV~V~aagvN~~D~~~r~G~-~~~~--~~~P~i~G~d~aG~V~avG~~V~~---------~~~GdrV~~~~~~ 91 (326) T COG0604 24 EPGPGEVLVRVKAAGVNPIDVLVRQGL-APPV--RPLPFIPGSEAAGVVVAVGSGVTG---------FKVGDRVAALGGV 91 (326) T ss_pred CCCCCEEEEEEEEEECCHHHHHHHCCC-CCCC--CCCCCCCCEEEEEEEEECCCCCCC---------CCCCCEEEECCCC T ss_conf 999996999998986174888873388-7876--889975401578999963789887---------7899889985888 Q ss_pred --CCCEEEECCCCEEEEEEHHHEEE Q ss_conf --86088976997799985301789 Q gi|254780848|r 80 --SGTEIKLNDGEEYLVMQESDIMG 102 (111) Q Consensus 80 --~g~~v~~~dg~~y~i~~e~dIla 102 (111) .|. --+|..+.++.+.- T Consensus 92 ~~~G~------~AEy~~v~a~~~~~ 110 (326) T COG0604 92 GRDGG------YAEYVVVPADWLVP 110 (326) T ss_pred CCCCC------CEEEEECCHHHEEE T ss_conf 88986------51267705567287 No 16 >TIGR02817 adh_fam_1 zinc-binding alcohol dehydrogenase family protein; InterPro: IPR014182 Members of this entry form a distinct subset of the larger family of oxidoreductases that includes zinc-binding alcohol dehydrogenases and NADPH:quinone reductases. Some current members of this entry carry designations as putative alginate lyase, however none have been characterised as such.. Probab=70.41 E-value=3.1 Score=21.68 Aligned_cols=58 Identities=21% Similarity=0.291 Sum_probs=30.3 Q ss_pred CCCCCEEEEEECC-----CCCCCCCCEEEECCC-CCCCCC--CEEEEEEEECCCEECCCCCEECCEECCCCEEEEEC Q ss_conf 5178479999747-----542122557994576-554653--02479998238708678879022031378899705 Q gi|254780848|r 10 RPTRGRVVVRRLQ-----SEIKTATGNILIPDT-VSEKPS--ASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGK 78 (111) Q Consensus 10 kPl~drVLVk~~~-----~e~~T~~GgI~lp~~-~~ek~~--~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~ 78 (111) +|-+-=+||+... +..|-..+ . -|+. .|-|-- =.-|.|.|||+-.. -.||||.|.|.- T Consensus 27 ~p~grDlLVEVKAvSVNPVDtKVR~~-~-~p~~Ggq~KILGWDAAGVV~avG~~Vt---------LFKpGDeVwYAG 92 (338) T TIGR02817 27 KPGGRDLLVEVKAVSVNPVDTKVRAR-V-APEAGGQPKILGWDAAGVVKAVGDEVT---------LFKPGDEVWYAG 92 (338) T ss_pred CCCCCCEEEEEEEEEECCCCCCCCCC-C-CCCCCCCCCEEEEEEEEEEECCCCCCC---------EECCCCEEEECC T ss_conf 88985306888888726886001277-8-768888742310342212602188100---------102787557543 No 17 >PRK05396 tdh L-threonine 3-dehydrogenase; Validated Probab=70.18 E-value=4 Score=21.03 Aligned_cols=23 Identities=35% Similarity=0.358 Sum_probs=18.0 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEEEE Q ss_conf 47999823870867887902203137889970 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFG 77 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~ 77 (111) .|+|+++|++.. .+++||+|... T Consensus 66 ~G~V~~vG~~V~---------~~~vGdrV~~~ 88 (341) T PRK05396 66 VGEVVEVGSEVT---------GFKVGDRVSGE 88 (341) T ss_pred EEEEEEECCCCC---------CCCCCCEEEEE T ss_conf 999988289688---------68998899872 No 18 >PRK10083 putative dehydrogenase; Provisional Probab=69.98 E-value=4.4 Score=20.82 Aligned_cols=24 Identities=33% Similarity=0.310 Sum_probs=17.8 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEEEEC Q ss_conf 479998238708678879022031378899705 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGK 78 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~ 78 (111) .|+|+++|++... .++||+|.... T Consensus 62 ~G~V~~vG~~V~~---------~k~GdrV~~~~ 85 (339) T PRK10083 62 FGVIDAVGDGVDA---------ARIGERVAVDP 85 (339) T ss_pred EEEEEEECCCCCC---------CCCCCEEEEEC T ss_conf 8999996787566---------65688999933 No 19 >PRK09422 alcohol dehydrogenase; Provisional Probab=69.75 E-value=4 Score=21.04 Aligned_cols=23 Identities=35% Similarity=0.455 Sum_probs=18.2 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEEEE Q ss_conf 47999823870867887902203137889970 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFG 77 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~ 77 (111) .|+|+++|++.. .+++||+|... T Consensus 62 ~G~V~~vG~~v~---------~~~vGdrV~v~ 84 (338) T PRK09422 62 IGIVKEVGPGVT---------SLKVGDRVSIA 84 (338) T ss_pred EEEEEECCCCCC---------CCCCCCEEEEC T ss_conf 999998688666---------66428878874 No 20 >PRK10309 galactitol-1-phosphate dehydrogenase; Provisional Probab=69.12 E-value=4.4 Score=20.82 Aligned_cols=46 Identities=33% Similarity=0.593 Sum_probs=29.6 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEEEECC----------------------CCCEEEECCC--CEEEEEEHHHEE Q ss_conf 4799982387086788790220313788997056----------------------8608897699--779998530178 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKW----------------------SGTEIKLNDG--EEYLVMQESDIM 101 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~----------------------~g~~v~~~dg--~~y~i~~e~dIl 101 (111) .|+|+++|++.. .+++||+|....+ .|... +| -+|.+++++.++ T Consensus 62 ~G~V~~vG~~V~---------~~~vGdrV~~~~~~~Cg~C~~C~~G~~~~C~~~~~~G~~~---~Gg~Aey~~v~~~~~~ 129 (347) T PRK10309 62 SGYVEAVGSGVT---------DLHPGDAVACVPLLPCFTCPECLRGFYSLCAKYDFIGSRR---DGGNAEYIVVKRKNLF 129 (347) T ss_pred EEEEEEECCCCC---------CCCCCCEEEECCCCCCCCCHHHCCCCCCCCCCCCCCCCCC---CCCCCCEEEECHHHCC T ss_conf 999999899998---------6999998998840688888233389866354555233688---8765104785489866 Q ss_pred EE Q ss_conf 99 Q gi|254780848|r 102 GI 103 (111) Q Consensus 102 av 103 (111) -+ T Consensus 130 ~i 131 (347) T PRK10309 130 AL 131 (347) T ss_pred CC T ss_conf 78 No 21 >pfam08240 ADH_N Alcohol dehydrogenase GroES-like domain. This is the catalytic domain of alcohol dehydrogenases. Many of them contain an inserted zinc binding domain. This domain has a GroES-like structure. Probab=67.21 E-value=5.1 Score=20.47 Aligned_cols=25 Identities=32% Similarity=0.399 Sum_probs=18.5 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEEEECC Q ss_conf 4799982387086788790220313788997056 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKW 79 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~ 79 (111) .|+|+++|++.. .+++||+|....+ T Consensus 39 ~G~V~~~G~~v~---------~~~~GdrV~~~~~ 63 (108) T pfam08240 39 AGIVEEVGPGVT---------GLKVGDRVVVYPL 63 (108) T ss_pred EEEEEECCCCCC---------CCCCCCEEEECCC T ss_conf 799711381713---------2678899998673 No 22 >TIGR00692 tdh L-threonine 3-dehydrogenase; InterPro: IPR004627 L-threonine 3-dehydrogenase (1.1.1.103 from EC) is a tetrameric, zinc-binding, NAD-dependent enzyme of threonine catabolism. It catalyses the conversion of L-threonine and NAD+ to L-2-amino-3-oxobutanoate and NADH. In Escherichia coli His-90 modulates substrate specificity and is believed part of the active site. Closely related proteins include sorbitol dehydrogenase, xylitol dehydrogenase, and benzyl alcohol dehydrogenase. Eukaryotic examples of this enzyme have been demonstrated experimentally but are not detected by this HMM. ; GO: 0008270 zinc ion binding, 0008743 L-threonine 3-dehydrogenase activity, 0006567 threonine catabolic process. Probab=67.11 E-value=4.5 Score=20.77 Aligned_cols=80 Identities=21% Similarity=0.295 Sum_probs=45.4 Q ss_pred CCEEEEEECCC-CCCCCCCEEEECCC-CCCCCC---CE----EEEEEEECCCEECCCCCEECCEECCCCEEEEECCC--C Q ss_conf 84799997475-42122557994576-554653---02----47999823870867887902203137889970568--6 Q gi|254780848|r 13 RGRVVVRRLQS-EIKTATGNILIPDT-VSEKPS---AS----SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWS--G 81 (111) Q Consensus 13 ~drVLVk~~~~-e~~T~~GgI~lp~~-~~ek~~---~~----~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~--g 81 (111) .+-||+|...+ =.=|... ||==|. +|++-+ .. .|+|+.+|++.. .+|+||+|=-.-+= | T Consensus 23 ~~evLIKv~atsICGTDvH-IYnWdeWaq~rIk~P~v~GHE~~G~VvgiG~~V~---------g~kvGdyVS~EtHIvCG 92 (341) T TIGR00692 23 KNEVLIKVKATSICGTDVH-IYNWDEWAQKRIKVPYVVGHEYAGEVVGIGEEVK---------GVKVGDYVSAETHIVCG 92 (341) T ss_pred CCCEEEEEEEEECCCCCCC-EECCCHHHHCCCCCCEEEEEEEEEEEEEECCCCC---------EEEECCEEECCCEEECC T ss_conf 8741655543002477430-4432426434589982760258889987789824---------58847666136114436 Q ss_pred CEEEECCCCEEEEEEHHHEEEE Q ss_conf 0889769977999853017899 Q gi|254780848|r 82 TEIKLNDGEEYLVMQESDIMGI 103 (111) Q Consensus 82 ~~v~~~dg~~y~i~~e~dIlav 103 (111) .=..=..| +|-+++..-|++| T Consensus 93 ~C~~Cr~G-~~HvC~NT~i~GV 113 (341) T TIGR00692 93 KCVACRRG-KSHVCRNTKIVGV 113 (341) T ss_pred EECCCCCC-CCEEEECCEEEEE T ss_conf 00368888-8134524547886 No 23 >COG1064 AdhP Zn-dependent alcohol dehydrogenases [General function prediction only] Probab=63.61 E-value=7.3 Score=19.60 Aligned_cols=22 Identities=36% Similarity=0.496 Sum_probs=16.5 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEEE Q ss_conf 4799982387086788790220313788997 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLF 76 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~ 76 (111) .|+|++||++.. .+|+||+|-. T Consensus 66 vG~V~~vG~~V~---------~~k~GDrVgV 87 (339) T COG1064 66 VGTVVEVGEGVT---------GLKVGDRVGV 87 (339) T ss_pred EEEEEEECCCCC---------CCCCCCEEEE T ss_conf 799998658986---------4778999885 No 24 >PRK13771 putative alcohol dehydrogenase; Provisional Probab=62.72 E-value=7.7 Score=19.48 Aligned_cols=54 Identities=28% Similarity=0.462 Sum_probs=30.2 Q ss_pred CCCCCEEEEEECCCC-----CCCCCCEEEECCCCCCCCC----CEEEEEEEECCCEECCCCCEECCEECCCCEEEE Q ss_conf 517847999974754-----2122557994576554653----024799982387086788790220313788997 Q gi|254780848|r 10 RPTRGRVVVRRLQSE-----IKTATGNILIPDTVSEKPS----ASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLF 76 (111) Q Consensus 10 kPl~drVLVk~~~~e-----~~T~~GgI~lp~~~~ek~~----~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~ 76 (111) .|-.+-||||..-.- -.+-.| .+ |.. .-|. -..|+|+++|++.. .+++||+|.. T Consensus 22 ~p~~~eVlVkv~a~gic~sD~~~~~G-~~-~~~--~~P~i~GhE~~G~V~~vG~~v~---------~~~vGdrV~~ 84 (332) T PRK13771 22 KPGKGEVVIKVVYAGLCYRDLLQLQG-FY-PRV--KYPVILGHEVVGVVEEVGEGVE---------GFKPGDRVAS 84 (332) T ss_pred CCCCCEEEEEEEEEEECHHHHHHHCC-CC-CCC--CCCCCCCEEEEEEEEEECCCEE---------EEECCCEEEE T ss_conf 89989899999999378899998668-99-988--8983585579999998167224---------6630888999 No 25 >KOG3209 consensus Probab=62.50 E-value=4.5 Score=20.76 Aligned_cols=70 Identities=26% Similarity=0.459 Sum_probs=38.3 Q ss_pred EEEEEECCCCCCCCCCEEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCCCEEEECCCCEEEE Q ss_conf 79999747542122557994576554653024799982387086788790220313788997056860889769977999 Q gi|254780848|r 15 RVVVRRLQSEIKTATGNILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSGTEIKLNDGEEYLV 94 (111) Q Consensus 15 rVLVk~~~~e~~T~~GgI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g~~v~~~dg~~y~i 94 (111) -|++.+.+-|- -|.+++. ++.|+..-.|+|+.=.|..+ .|+ +||||+|+- + +|..-+- T Consensus 756 DV~lhR~ENeG---FGFVi~s--S~~kp~sgiGrIieGSPAdR--Cgk-----LkVGDrilA---------V-NG~sI~~ 813 (984) T KOG3209 756 DVVLHRKENEG---FGFVIMS--SQNKPESGIGRIIEGSPADR--CGK-----LKVGDRILA---------V-NGQSILN 813 (984) T ss_pred EEEEECCCCCC---EEEEEEE--CCCCCCCCCCCCCCCCHHHH--HCC-----CCCCCEEEE---------E-CCEEEEC T ss_conf 06986046776---0289984--36689877431256981675--055-----432656887---------5-4703303 Q ss_pred EEHHHEEEEEEC Q ss_conf 853017899845 Q gi|254780848|r 95 MQESDIMGIVVE 106 (111) Q Consensus 95 ~~e~dIlavi~~ 106 (111) +...||+-.|.| T Consensus 814 lsHadiv~LIKd 825 (984) T KOG3209 814 LSHADIVSLIKD 825 (984) T ss_pred CCCHHHHHHHHH T ss_conf 672568888873 No 26 >cd02558 PSRA_1 PSRA_1: Pseudouridine synthase, a subgroup of the RluA family. This group is comprised of bacterial proteins assigned to the RluA family of pseudouridine synthases. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). No cofactors are required. The RluA family is comprised of proteins related to Escherichia coli RluA. Probab=61.69 E-value=8.8 Score=19.16 Aligned_cols=18 Identities=28% Similarity=0.390 Sum_probs=10.2 Q ss_pred EEEECCCCCCCCCCEEEECCC Q ss_conf 999747542122557994576 Q gi|254780848|r 17 VVRRLQSEIKTATGNILIPDT 37 (111) Q Consensus 17 LVk~~~~e~~T~~GgI~lp~~ 37 (111) +|-+.+.+ | ||.++++-+ T Consensus 85 ~VHRLDr~--T-SGllv~AK~ 102 (246) T cd02558 85 PAHRLDRL--T-AGLVLFSKR 102 (246) T ss_pred EECCCCCC--C-CEEEEEECC T ss_conf 73467998--7-469999889 No 27 >TIGR02824 quinone_pig3 putative NAD(P)H quinone oxidoreductase, PIG3 family; InterPro: IPR014189 Members of this family are putative quinone oxidoreductases that belong to the broader superfamily of zinc-dependent alcohol (of medium chain length) dehydrogenases and quinone oxidoreductases. An alignment shows no motif of conserved Cys residues as are found in zinc-binding members of the superfamily, consequently members of this entry are likely to be quinone oxidoreductases. A member of this family in Homo sapiens (Human), PIG3, is induced by p53 but is otherwise uncharacterised.. Probab=60.84 E-value=6 Score=20.06 Aligned_cols=85 Identities=21% Similarity=0.263 Sum_probs=46.3 Q ss_pred CCCCCCCCEEEEEECCC----CCCCCCCEEEECCCC-CCCCCC-EEEEEEEECCCEECCCCCEECCEECCCCEEEEECCC Q ss_conf 24251784799997475----421225579945765-546530-247999823870867887902203137889970568 Q gi|254780848|r 7 NYLRPTRGRVVVRRLQS----EIKTATGNILIPDTV-SEKPSA-SSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWS 80 (111) Q Consensus 7 ~~ikPl~drVLVk~~~~----e~~T~~GgI~lp~~~-~ek~~~-~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~ 80 (111) -.-+|=-+-||||...+ .+-...-|.|.|.-. .+=+.+ ..|+|+++|++.-.+ +..- .++||+|+=-=-+ T Consensus 21 p~P~p~~GEVLi~V~AAGVNRPD~lQR~G~YPpPPGAS~IlGLE~AG~v~a~G~~V~~~---~L~~-f~vGDrVcAL~~G 96 (334) T TIGR02824 21 PLPVPKAGEVLIRVAAAGVNRPDVLQRQGKYPPPPGASDILGLEVAGEVVAVGEGVSRE---HLAK-FKVGDRVCALVAG 96 (334) T ss_pred CCCCCCCCEEEEEEEEECCCHHHHHHHCCCCCCCCCCCCCCCCEEEEEEEECCCCCCHH---HHHH-CCCCCEEEEECCC T ss_conf 88855886258999984268279986467721011128888712536899838887768---8963-5887579987589 Q ss_pred CCEEEECCCCEEEEEEHHHEE Q ss_conf 608897699779998530178 Q gi|254780848|r 81 GTEIKLNDGEEYLVMQESDIM 101 (111) Q Consensus 81 g~~v~~~dg~~y~i~~e~dIl 101 (111) |= | -||..+.++.+| T Consensus 97 GG---Y---AEy~~V~A~q~L 111 (334) T TIGR02824 97 GG---Y---AEYVAVPAGQVL 111 (334) T ss_pred CC---E---EEEEEECCCEEE T ss_conf 83---3---767875144155 No 28 >TIGR01525 ATPase-IB_hvy heavy metal translocating P-type ATPase; InterPro: IPR006416 This group encompasses the copper and cadmium-type heavy metal transporting P-type ATPases as well as those which cannot be assigned to one or other of the groups. ; GO: 0015662 ATPase activity coupled to transmembrane movement of ions phosphorylative mechanism, 0030001 metal ion transport, 0016021 integral to membrane. Probab=59.16 E-value=11 Score=18.51 Aligned_cols=31 Identities=32% Similarity=0.566 Sum_probs=21.9 Q ss_pred EEEEEEEECCCE--------------ECC---CCCEECCEECCCCEEE Q ss_conf 247999823870--------------867---8879022031378899 Q gi|254780848|r 45 SSGEIMWVGAGV--------------MDQ---SGKVIEPEVSKGDIVL 75 (111) Q Consensus 45 ~~G~VvavG~g~--------------~~~---~G~~~p~~vkvGD~Vl 75 (111) +.|.+|.|.||- .++ +||-+|..+++||.|+ T Consensus 90 ~~GD~v~V~pGe~~PvDG~v~~G~s~~Des~lTGEs~PV~k~~G~~V~ 137 (639) T TIGR01525 90 QVGDIVVVRPGERIPVDGEVISGESEVDESALTGESMPVEKKEGDEVF 137 (639) T ss_pred CCCCEEEECCCCCCCCCCEEECCCCCEEHHHHCCCCCCEECCCCCEEE T ss_conf 667689975688121056786586303003214765221037887331 No 29 >TIGR01512 ATPase-IB2_Cd cadmium-translocating P-type ATPase; InterPro: IPR006404 These sequences describe the P-type ATPase primarily responsible for translocating cadmium ions (and other closely-related divalent heavy metals such as cobalt, mercury, lead and zinc) across biological membranes. These transporters are found in prokaryotes and plants. Experimentally characterised members include P37617 from SWISSPROT from Escherichia coli, Q10866 from SWISSPROT from Mycobacterium tuberculosis and Q59998 from SWISSPROT from Synechocystis PCC6803. The cadmium P-type ATPases have been characterised as Type IB based on a phylogenetic analysis which combines the copper-translocating ATPases with the cadmium-translocating species . This family and that describing the copper-ATPases (IPR006403 from INTERPRO) are well separated, and thus the copper-ATPases can be typed as IB1 and the cadmium-ATPases as IB2. ; GO: 0005524 ATP binding, 0016818 hydrolase activity acting on acid anhydrides in phosphorus-containing anhydrides, 0046872 metal ion binding, 0046873 metal ion transmembrane transporter activity, 0030001 metal ion transport, 0016020 membrane. Probab=57.67 E-value=12 Score=18.40 Aligned_cols=23 Identities=30% Similarity=0.615 Sum_probs=17.6 Q ss_pred CCCEECC---CCCEECCEECCCCEEE Q ss_conf 3870867---8879022031378899 Q gi|254780848|r 53 GAGVMDQ---SGKVIEPEVSKGDIVL 75 (111) Q Consensus 53 G~g~~~~---~G~~~p~~vkvGD~Vl 75 (111) |.+..++ +||.+|..+++||.|+ T Consensus 92 G~~~~Des~lTGEs~PV~k~~G~~V~ 117 (494) T TIGR01512 92 GEALVDESALTGESVPVEKAPGDEVF 117 (494) T ss_pred CCCCEECCCCCCCCEEEECCCCCEEC T ss_conf 86322100015654003137777001 No 30 >PRK10671 copA copper exporting ATPase; Provisional Probab=53.09 E-value=19 Score=17.24 Aligned_cols=34 Identities=35% Similarity=0.588 Sum_probs=25.8 Q ss_pred EEEEEEECCCEECC---CCCEECCEECCCCEEEEECCCCCEE Q ss_conf 47999823870867---8879022031378899705686088 Q gi|254780848|r 46 SGEIMWVGAGVMDQ---SGKVIEPEVSKGDIVLFGKWSGTEI 84 (111) Q Consensus 46 ~G~VvavG~g~~~~---~G~~~p~~vkvGD~Vl~~~~~g~~v 84 (111) -|.|+. |.+..++ +|+..|...++||.|+ +|+-. T Consensus 358 DG~vv~-G~~~vDES~LTGES~Pv~K~~Gd~V~----aGtin 394 (834) T PRK10671 358 DGEITQ-GEAWLDEAMLTGEPIPQQKGEGDSVH----AGTVV 394 (834) T ss_pred CEEEEE-CCEEEECCCCCCCCCCEECCCCCEEC----CCCEE T ss_conf 369997-66257522025998765558998722----57564 No 31 >pfam01055 Glyco_hydro_31 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases. Probab=51.71 E-value=18 Score=17.38 Aligned_cols=60 Identities=22% Similarity=0.199 Sum_probs=34.8 Q ss_pred CCCCEEEEEECCCCCCCCCCEEEECCCCCCCCCCEEEEEEEECCCEE----CCCCCEECCEECCCCE Q ss_conf 17847999974754212255799457655465302479998238708----6788790220313788 Q gi|254780848|r 11 PTRGRVVVRRLQSEIKTATGNILIPDTVSEKPSASSGEIMWVGAGVM----DQSGKVIEPEVSKGDI 73 (111) Q Consensus 11 Pl~drVLVk~~~~e~~T~~GgI~lp~~~~ek~~~~~G~VvavG~g~~----~~~G~~~p~~vkvGD~ 73 (111) -+|+.+||-|.-.+..|+. -||||... =-....|++...+.|.. ...=+.+|+-||.|-. T Consensus 371 m~G~~lLVAPV~~~~~~~r-~VyLP~G~--W~d~~tg~~~~~~~g~~~~~~~~pl~~~PvFvr~GaI 434 (435) T pfam01055 371 LLGDDLLVAPVLEEGATSR-DVYLPEGT--WYDFWTGEVYGGGGGGQTVTVDAPLDRIPVFVRGGSI 434 (435) T ss_pred EECCCEEEEECCCCCCEEE-EEECCCCC--EEECCCCCEEEECCCCEEEEEECCCCCCCEEEECCEE T ss_conf 8429879950114895089-99879952--7977799787517999899995788755878748856 No 32 >KOG0024 consensus Probab=48.88 E-value=8.5 Score=19.24 Aligned_cols=55 Identities=29% Similarity=0.218 Sum_probs=30.8 Q ss_pred CCCEEEEEEC-------CCCCCCCCCEEEECCCCCCCCC----CEEEEEEEECCCEECCCCCEECCEECCCCEEEEEC Q ss_conf 7847999974-------7542122557994576554653----02479998238708678879022031378899705 Q gi|254780848|r 12 TRGRVVVRRL-------QSEIKTATGNILIPDTVSEKPS----ASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGK 78 (111) Q Consensus 12 l~drVLVk~~-------~~e~~T~~GgI~lp~~~~ek~~----~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~ 78 (111) -.+.|||+.. +.+.-+ .|.|-.+- ..+|- -..|.|..||++.. .+|+||+|.... T Consensus 28 ~p~eVlv~i~a~GICGSDvHy~~-~G~ig~~v--~k~PmvlGHEssGiV~evG~~Vk---------~LkVGDrVaiEp 93 (354) T KOG0024 28 DPDEVLVAIKAVGICGSDVHYYT-HGRIGDFV--VKKPMVLGHESSGIVEEVGDEVK---------HLKVGDRVAIEP 93 (354) T ss_pred CCCEEEEEEEEEEECCCCCHHHC-CCCCCCCC--CCCCCCCCCCCCCEEHHHCCCCC---------CCCCCCEEEECC T ss_conf 98779999667786276203023-69767521--34551215424434055356643---------034688478648 No 33 >KOG2107 consensus Probab=48.02 E-value=23 Score=16.88 Aligned_cols=41 Identities=20% Similarity=0.238 Sum_probs=32.7 Q ss_pred EEEEEEECCCEEC---CCCCEECCEECCCCEEEEECCCCCEEEE Q ss_conf 4799982387086---7887902203137889970568608897 Q gi|254780848|r 46 SGEIMWVGAGVMD---QSGKVIEPEVSKGDIVLFGKWSGTEIKL 86 (111) Q Consensus 46 ~G~VvavG~g~~~---~~G~~~p~~vkvGD~Vl~~~~~g~~v~~ 86 (111) .-..+--|.|.++ .++..+-.+|+.||.++++..--..|+. T Consensus 95 eiR~il~GtgYfDVrd~dd~WIRi~vekGDlivlPaGiyHRFTt 138 (179) T KOG2107 95 EIRYILEGTGYFDVRDKDDQWIRIFVEKGDLIVLPAGIYHRFTT 138 (179) T ss_pred HEEEEEECCEEEEECCCCCCEEEEEEECCCEEEECCCCEEEEEC T ss_conf 14778606357761147887799999439889966862254242 No 34 >cd05792 S1_eIF1AD_like S1_eIF1AD_like: eukaryotic translation initiation factor 1A domain containing protein (eIF1AD)-like, S1-like RNA-binding domain. eIF1AD is also known as MGC11102 protein. Little is known about the function of eIF1AD. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins, including translation initiation factor IF1A (also referred to as eIF1A in eukaryotes). eIF1A is essential for translation initiation. eIF1A acts synergistically with eIF1 to mediate assembly of ribosomal initiation complexes at the initiation codon and maintain the accuracy of this process by recognizing and destabilizing aberrant preinitiation complexes from the mRNA. Without eIF1A and eIF1, 43S ribosomal preinitiation complexes can bind to the cap-proximal region, but are unable to reach the initiation codon. eIF1a also enhances the formation of 5'-terminal complexes in the presence of other translation initiation factors. Probab=47.00 E-value=19 Score=17.28 Aligned_cols=44 Identities=16% Similarity=0.237 Sum_probs=27.3 Q ss_pred EEEECCCEECCCCCEECCEECCCCEEEEECC-CCCEEEECCCCEEEEEEHHHE Q ss_conf 9982387086788790220313788997056-860889769977999853017 Q gi|254780848|r 49 IMWVGAGVMDQSGKVIEPEVSKGDIVLFGKW-SGTEIKLNDGEEYLVMQESDI 100 (111) Q Consensus 49 VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~-~g~~v~~~dg~~y~i~~e~dI 100 (111) +++-=|.++- -.+|+|-||+|+.... .|.+++ |+=..++.++.| T Consensus 25 ~Lv~mP~KFR-----k~iWiKRG~fVlvd~~~eg~Kv~---geIv~Il~~e~v 69 (78) T cd05792 25 YLVSMPTKFR-----KNIWIKRGDFVLVEPIEEGDKVK---AEIVKILTRDHV 69 (78) T ss_pred EEEEECHHHH-----CCEEEECCCEEEEEECCCCCEEE---EEEEEEECHHHH T ss_conf 9996013343-----16899718999997604798078---999999657898 No 35 >COG3264 Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane] Probab=46.74 E-value=17 Score=17.54 Aligned_cols=16 Identities=50% Similarity=0.636 Sum_probs=10.9 Q ss_pred EECCCCEEEEECCCCC Q ss_conf 0313788997056860 Q gi|254780848|r 67 EVSKGDIVLFGKWSGT 82 (111) Q Consensus 67 ~vkvGD~Vl~~~~~g~ 82 (111) .+|+||+|-++.+.|+ T Consensus 660 pvkvGD~It~g~~~G~ 675 (835) T COG3264 660 PVKVGDTVTIGTVSGT 675 (835) T ss_pred CCCCCCEEEECCCEEE T ss_conf 7356777998782489 No 36 >pfam10794 DUF2606 Protein of unknown function, DUF2606. Family of bacterial proteins with unknown function. Probab=46.66 E-value=25 Score=16.66 Aligned_cols=56 Identities=23% Similarity=0.332 Sum_probs=34.7 Q ss_pred CCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCCCEEEECCCCEEEEEEHHHEEEEE Q ss_conf 6530247999823870867887902203137889970568608897699779998530178998 Q gi|254780848|r 41 KPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSGTEIKLNDGEEYLVMQESDIMGIV 104 (111) Q Consensus 41 k~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g~~v~~~dg~~y~i~~e~dIlavi 104 (111) .|..-.|..+ |..+++|+.+.-.+..|++.+|-+..++.+.- ...+.++++..+++ T Consensus 72 ~Pskeig~~I----GKTd~eGkiiWk~~rKG~Yiv~lp~~et~~~r----~isl~~dr~~~~vi 127 (131) T pfam10794 72 QPSKEIGEII----GKTDHEGKIIWKSGRKGKYIVVLPKNETPETR----NISLIEDRKDHKVI 127 (131) T ss_pred CCCHHHHEEE----CCCCCCCEEEEECCCCCEEEEEECCCCCEEEE----EEEEECCCCCCEEE T ss_conf 9742121144----55476770887337772089996389850577----54210267763589 No 37 >pfam08140 Cuticle_1 Crustacean cuticle protein repeat. This family consists of the cuticle proteins from the Cancer pagurus and the Homarus americanus. These proteins are isolated from the calcified regions of the crustacean and they contain two copies of an 18 residue sequence motif, which thus far has been found only in crustacean calcified exoskeletons. Probab=46.32 E-value=19 Score=17.25 Aligned_cols=34 Identities=24% Similarity=0.432 Sum_probs=18.5 Q ss_pred CCEEEECCCCCCCCCCEEEEEEEECCC-EECCCCCE Q ss_conf 557994576554653024799982387-08678879 Q gi|254780848|r 29 TGNILIPDTVSEKPSASSGEIMWVGAG-VMDQSGKV 63 (111) Q Consensus 29 ~GgI~lp~~~~ek~~~~~G~VvavG~g-~~~~~G~~ 63 (111) || |+.||-....-..-.+.|+.+||. ....+|+. T Consensus 3 SG-ii~pdG~~~Q~~~~~a~ivl~GpSG~V~sdG~n 37 (40) T pfam08140 3 SG-IITPDGTNVQFPHGVANIVLIGPSGAVLSDGKN 37 (40) T ss_pred CC-EECCCCCEEECCCCCCEEEEECCCEEEEECCCE T ss_conf 76-698899487548675218997684259768838 No 38 >TIGR02178 yeiP elongation factor P-like protein YeiP; InterPro: IPR011897 This entry represents homologs of elongation factor P (IPR011768 from INTERPRO) and probably are translation factors. Member of this family are found only in some gammaproteobacteria, including Escherichia coli (YeiP) and Vibrio cholerae.. Probab=45.50 E-value=13 Score=18.18 Aligned_cols=69 Identities=20% Similarity=0.249 Sum_probs=33.5 Q ss_pred CCEEEEEECCCCCCCCCCEEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEEC---CCCEE-EEECCCCCEEEECC Q ss_conf 847999974754212255799457655465302479998238708678879022031---37889-97056860889769 Q gi|254780848|r 13 RGRVVVRRLQSEIKTATGNILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVS---KGDIV-LFGKWSGTEIKLND 88 (111) Q Consensus 13 ~drVLVk~~~~e~~T~~GgI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vk---vGD~V-l~~~~~g~~v~~~d 88 (111) |...|||=++...-+..||= ... +=..-.|-.|.+-+. ..+ +=|.| |..++. .|-+-| T Consensus 15 gktllikDI~~~sP~gRGg~-------~~y---k~r~~dv~tG~kvee------rfk~Dd~ldtVel~rR~~--sFSY~D 76 (187) T TIGR02178 15 GKTLLIKDIEVSSPQGRGGA-------VLY---KLRMSDVRTGSKVEE------RFKADDVLDTVELLRREV--SFSYVD 76 (187) T ss_pred CCEEEEEEEEECCCCCCCCC-------EEE---EEEEEECCCCCEEEE------CCCCCCEEEEEEEEECEE--EEEEEC T ss_conf 98788986554289987623-------388---899865687852420------107672231112110004--677763 Q ss_pred CCEEEEEEHHH Q ss_conf 97799985301 Q gi|254780848|r 89 GEEYLVMQESD 99 (111) Q Consensus 89 g~~y~i~~e~d 99 (111) |++|+||+++| T Consensus 77 G~eyVFmd~eD 87 (187) T TIGR02178 77 GEEYVFMDEED 87 (187) T ss_pred CCEEEEECCCC T ss_conf 75068305878 No 39 >pfam10377 ATG11 Autophagy-related protein 11. This is a family of proteins involved in telomere maintenance. In Schizosaccharomyces pombe this protein is called taf1 (taz1 interacting factor). Taz1 is part of the telomere cap complex. In Saccharomyces cerevisiae this protein is called ATG11 and is known to be involved in vacuolar targeting and peroxisome degradation. Probab=44.50 E-value=27 Score=16.46 Aligned_cols=36 Identities=22% Similarity=0.242 Sum_probs=20.2 Q ss_pred EECCCCEEEEECCCCC------EEEECCCCEEEEEEHHHEEE Q ss_conf 0313788997056860------88976997799985301789 Q gi|254780848|r 67 EVSKGDIVLFGKWSGT------EIKLNDGEEYLVMQESDIMG 102 (111) Q Consensus 67 ~vkvGD~Vl~~~~~g~------~v~~~dg~~y~i~~e~dIla 102 (111) .+++||.|||-+-... ..-++.|--.+++++++..| T Consensus 42 ~F~~GDLaLFLpTr~~~~~~~pwaaFn~~~phYFL~~~~~~~ 83 (128) T pfam10377 42 NFKVGDLVLFLPTRRHAQNKQPWAAFNVGAPHYFLKEESSAA 83 (128) T ss_pred CCCCCCEEEEEECCCCCCCCCCEEEEECCCCEEEECCCCCCC T ss_conf 777776899900467778856239996699717864764211 No 40 >TIGR01852 lipid_A_lpxA acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase; InterPro: IPR010137 This entry describes LpxA, an enzyme for the biosynthesis of lipid A, a component of lipopolysaccharide (LPS) in the outer membrane outer leaflet of most Gram-negative bacteria. Some differences are found between lipid A of different species, but this protein represents the first step (from UDP-N-acetyl-D-glucosamine) and appears to be conserved in function. Proteins from this family contain many copies of the bacterial transferase hexapeptide repeat.; GO: 0008780 acyl-[acyl-carrier-protein]-UDP-N-acetylglucosamine O-acyltransferase activity, 0008610 lipid biosynthetic process. Probab=43.39 E-value=28 Score=16.37 Aligned_cols=46 Identities=13% Similarity=0.316 Sum_probs=30.3 Q ss_pred EEEEEECCCEECCCCCEECCEECCCCEEEEECCC--CCEEEECCCCEEEEE Q ss_conf 7999823870867887902203137889970568--608897699779998 Q gi|254780848|r 47 GEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWS--GTEIKLNDGEEYLVM 95 (111) Q Consensus 47 G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~--g~~v~~~dg~~y~i~ 95 (111) +.+..||.-.+.--.--+.=+|.+||.|+|+.++ +=.+++.| |.++ T Consensus 99 ~g~T~iG~~nllMAysHvAHDC~vGn~vv~aN~a~LAGHV~vgD---~a~i 146 (257) T TIGR01852 99 GGVTSIGNNNLLMAYSHVAHDCVVGNHVVLANNATLAGHVEVGD---YAII 146 (257) T ss_pred CCEEEECCCCEECCCCEEEECCEECCEEEEECCCEECEEEEECC---EEEE T ss_conf 88899888765720535612436888169813631222799888---9997 No 41 >PRK06763 F0F1 ATP synthase subunit alpha; Validated Probab=42.66 E-value=17 Score=17.61 Aligned_cols=25 Identities=16% Similarity=0.090 Sum_probs=14.3 Q ss_pred CEEEECCCCCCCCCCEEEEEEEECCCE Q ss_conf 579945765546530247999823870 Q gi|254780848|r 30 GNILIPDTVSEKPSASSGEIMWVGAGV 56 (111) Q Consensus 30 GgI~lp~~~~ek~~~~~G~VvavG~g~ 56 (111) -|+++-.. ..|.+..|+|-.+|... T Consensus 107 ~G~~~~~~--g~p~yviG~I~kv~~~~ 131 (213) T PRK06763 107 LGMHMKED--GSPDYVIGEVSKVYTME 131 (213) T ss_pred CCEEECCC--CCCCEEEEEEEEECCCC T ss_conf 53242115--89765889999960432 No 42 >TIGR02754 sod_Ni_protease nickel-type superoxide dismutase maturation protease; InterPro: IPR014124 Members of this protein family are predicted proteases that are encoded adjacent to the genes for a nickel-type superoxide dismutase (IPR014123 from INTERPRO). This family of predicted peptidases belong to MEROPS peptidase subfamily S26A (signal peptidase I), which have a Ser/Lys catalytic dyad.. Probab=42.50 E-value=29 Score=16.29 Aligned_cols=85 Identities=19% Similarity=0.250 Sum_probs=54.8 Q ss_pred CCCCCCCCCCCCCEEEEEECCCCCCCCC-CEEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECCC Q ss_conf 8652324251784799997475421225-579945765546530247999823870867887902203137889970568 Q gi|254780848|r 2 VGEHKNYLRPTRGRVVVRRLQSEIKTAT-GNILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWS 80 (111) Q Consensus 2 ~~~~~~~ikPl~drVLVk~~~~e~~T~~-GgI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~ 80 (111) .|+++...-+=+|+|||+|.....+-.. |-|+.- -.|....+.+|.== . .+.-+-+-+..+|. T Consensus 4 ~G~SM~P~L~~~D~~~V~P~~~~~r~~~~G~v~v~----~HP~~p~~~~iKRL-----------~-~~~~nG~~~LGDnp 67 (90) T TIGR02754 4 TGESMSPTLKPGDRILVRPLLKIARVPPIGEVVVV----RHPLKPSLLIIKRL-----------A-AVDDNGLFLLGDNP 67 (90) T ss_pred CCCCCCCCCCCCCEEEEEECCCCCCCCCCCEEEEE----ECCCCCCEEEEEEE-----------E-EECCCCCEEECCCC T ss_conf 33215761398886988301102348778808998----56989974789861-----------3-23178838856737 Q ss_pred CCEEEECCCCEEEEEEHHHEEEEEE Q ss_conf 6088976997799985301789984 Q gi|254780848|r 81 GTEIKLNDGEEYLVMQESDIMGIVV 105 (111) Q Consensus 81 g~~v~~~dg~~y~i~~e~dIlavi~ 105 (111) ..+. |-+.+=+++.+.++|.+. T Consensus 68 ~ASt---DSR~~G~v~~~~L~G~v~ 89 (90) T TIGR02754 68 KAST---DSRQLGPVPRELLLGKVL 89 (90) T ss_pred CCCC---CCHHHCCCCCHHEEEEEE T ss_conf 7876---712105888202544785 No 43 >cd06814 PLPDE_III_DSD_D-TA_like_3 Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzymes Similar to D-Serine Dehydratase and D-Threonine Aldolase, Unknown Group 3. This subfamily is composed of uncharacterized bacterial proteins with similarity to eukaryotic D-serine dehydratases (DSD) and D-threonine aldolases (D-TA). DSD catalyzes the dehydration of D-serine to aminoacrylate, which is rapidly hydrolyzed to pyruvate and ammonia. D-TA reversibly catalyzes the aldol cleavage of D-threonine into glycine and acetaldehyde, and the synthesis of D-threonine from glycine and acetaldehyde. DSD and D-TA are fold type III PLP-dependent enzymes, similar to bacterial alanine racemase (AR), which contains an N-terminal PLP-binding TIM barrel domain and a C-terminal beta-sandwich domain. AR exists as homodimers with active sites that lie at the interface between the TIM barrel domain of one subunit and the beta-sandwich domain of the other subunit. Based on their similarity to AR, it is possible mem Probab=42.14 E-value=25 Score=16.63 Aligned_cols=36 Identities=17% Similarity=0.178 Sum_probs=27.0 Q ss_pred EECCCCEEEEEC-CCCCEEEECCCCEEEEEEHHHEEEEE Q ss_conf 031378899705-68608897699779998530178998 Q gi|254780848|r 67 EVSKGDIVLFGK-WSGTEIKLNDGEEYLVMQESDIMGIV 104 (111) Q Consensus 67 ~vkvGD~Vl~~~-~~g~~v~~~dg~~y~i~~e~dIlavi 104 (111) .+++||+|.|.. +++.-+.-. .+|++++...|.+.. T Consensus 339 ~L~vGD~V~lrp~ha~~~~~~f--~~l~vvrgg~vv~~W 375 (379) T cd06814 339 QLKVDDYVFLRPTQSEAVLLQF--GDILVVRGGQIIDRW 375 (379) T ss_pred CCCCCCEEEEECCCCCHHHHHC--CEEEEEECCEEEEEE T ss_conf 7999999999557533666724--879999999999996 No 44 >PRK11033 zntA zinc/cadmium/mercury/lead-transporting ATPase; Provisional Probab=42.08 E-value=26 Score=16.56 Aligned_cols=34 Identities=26% Similarity=0.430 Sum_probs=24.1 Q ss_pred EEEEEEECCCEECC---CCCEECCEECCCCEEEEECCCCCEE Q ss_conf 47999823870867---8879022031378899705686088 Q gi|254780848|r 46 SGEIMWVGAGVMDQ---SGKVIEPEVSKGDIVLFGKWSGTEI 84 (111) Q Consensus 46 ~G~VvavG~g~~~~---~G~~~p~~vkvGD~Vl~~~~~g~~v 84 (111) -|.|++ |....++ +|+..|...++||.|+ +|+-. T Consensus 275 DG~vv~-G~s~vDES~LTGES~PV~K~~Gd~V~----aGt~n 311 (739) T PRK11033 275 DGKLLS-PFASFDESALTGESIPVERATGDKVP----AGATS 311 (739) T ss_pred CEEEEC-CEEEEECCCCCCCCCCEECCCCCCCC----CCCEE T ss_conf 279970-56888846869998715888998246----78643 No 45 >KOG2241 consensus Probab=41.67 E-value=26 Score=16.55 Aligned_cols=44 Identities=16% Similarity=0.177 Sum_probs=24.4 Q ss_pred CCCCCCEEEEEEEECCCEECCCCCE-E-CCEECCCCEEEEECCCCCE Q ss_conf 5465302479998238708678879-0-2203137889970568608 Q gi|254780848|r 39 SEKPSASSGEIMWVGAGVMDQSGKV-I-EPEVSKGDIVLFGKWSGTE 83 (111) Q Consensus 39 ~ek~~~~~G~VvavG~g~~~~~G~~-~-p~~vkvGD~Vl~~~~~g~~ 83 (111) +.+.-..+|-|......-... -+. . |-.-++||+|.|..|.|.. T Consensus 159 kmRgv~S~gMvlcaSs~d~~~-VE~l~pP~gs~pGdRv~fegfegeP 204 (255) T KOG2241 159 KMRGVKSQGMVLCASSPDKSV-VEPLAPPAGSKPGDRVTFEGFEGEP 204 (255) T ss_pred CCCCCCCCEEEEECCCCCCCE-EEECCCCCCCCCCCEEEECCCCCCC T ss_conf 246622314688637866550-4651599989998746656889996 No 46 >TIGR03201 dearomat_had 6-hydroxycyclohex-1-ene-1-carbonyl-CoA dehydrogenase. Members of this protein family are 6-hydroxycyclohex-1-ene-1-carbonyl-CoA dehydrogenase, an enzyme in the anaerobic metabolism of aromatic enzymes by way of benzoyl-CoA, as seen in Thauera aromatica, Geobacter metallireducens, and Azoarcus sp. The experimentally characterized form from T. aromatica uses only NAD+, not NADP+. Note that Rhodopseudomonas palustris uses a different pathway to perform a similar degradation of benzoyl-CoA to 3-hydroxpimelyl-CoA. Probab=39.95 E-value=26 Score=16.50 Aligned_cols=23 Identities=30% Similarity=0.486 Sum_probs=15.8 Q ss_pred EEEEEEECCCEECCCCCEECCEECCCCEEEEEC Q ss_conf 479998238708678879022031378899705 Q gi|254780848|r 46 SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGK 78 (111) Q Consensus 46 ~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~ 78 (111) .|+|+++|++..+- +||+|.... T Consensus 62 ~G~Vv~vG~~v~~~----------vGdrV~v~~ 84 (349) T TIGR03201 62 SGRVIQAGAGAASW----------IGKAVIVPA 84 (349) T ss_pred EEEEEEECCCCCCC----------CCCEEEECC T ss_conf 99999848998776----------799199944 No 47 >TIGR00575 dnlj DNA ligase, NAD-dependent; InterPro: IPR001679 DNA ligase (polydeoxyribonucleotide synthase) is the enzyme that joins two DNA fragments by catalyzing the formation of an internucleotide ester bond between phosphate and deoxyribose. It is active during DNA replication, DNA repair and DNA recombination. There are two forms of DNA ligase: one requires ATP (6.5.1.1 from EC), the other NAD (6.5.1.2 from EC). This family is predominantly composed of NAD-dependent bacterial DNA ligases. They are proteins of about 75 to 85 Kd whose sequence is well conserved , . They also show similarity to yicF, an Escherichia coli hypothetical protein of 63 Kd.; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair. Probab=39.41 E-value=32 Score=16.01 Aligned_cols=47 Identities=19% Similarity=0.245 Sum_probs=25.2 Q ss_pred EEEECCCCCCCCCCEEEEEEEECCCEECCCCCEE-CCEECCCCEEEEECCCC Q ss_conf 7994576554653024799982387086788790-22031378899705686 Q gi|254780848|r 31 NILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVI-EPEVSKGDIVLFGKWSG 81 (111) Q Consensus 31 gI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~-p~~vkvGD~Vl~~~~~g 81 (111) |.|.|= |+-.|=...|.+|+--+ ..|.+.+ -.++++||+|+..+=+. T Consensus 341 G~iTPv-A~LePV~vaG~~Vs~AT---LHN~D~I~~kdiriGD~Vvv~kAGd 388 (706) T TIGR00575 341 GAITPV-AKLEPVFVAGTTVSRAT---LHNEDEIEEKDIRIGDTVVVRKAGD 388 (706) T ss_pred EEECCE-EEECCEEECCEEEEEEE---ECCHHHHHCCCCCCCCEEEEEECCC T ss_conf 776003-67565587207862344---0160024202766588899984486 No 48 >TIGR00293 TIGR00293 prefoldin, alpha subunit; InterPro: IPR011599 Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 1423 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue , . This family contains the archaeal prefoldin alpha subunit. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex. Probab=39.37 E-value=27 Score=16.46 Aligned_cols=37 Identities=14% Similarity=0.282 Sum_probs=22.3 Q ss_pred CCCCEEEEEECCCCCCCCCCEEEECCCCCCC-CCCEEEEEEEECCCEECC Q ss_conf 1784799997475421225579945765546-530247999823870867 Q gi|254780848|r 11 PTRGRVVVRRLQSEIKTATGNILIPDTVSEK-PSASSGEIMWVGAGVMDQ 59 (111) Q Consensus 11 Pl~drVLVk~~~~e~~T~~GgI~lp~~~~ek-~~~~~G~VvavG~g~~~~ 59 (111) +-|.-+|| |. ++++|++....+. .+ -+|.||.|++-+ T Consensus 47 ~~g~E~LV-P~-------G~~~fv~a~v~d~i~k----v~V~~G~Gy~vE 84 (132) T TIGR00293 47 NEGKEVLV-PV-------GAGLFVKAKVKDAIDK----VLVDIGSGYYVE 84 (132) T ss_pred CCCCEEEE-EE-------CCCCEEEEEEEECCCE----EEECCCCCEEEE T ss_conf 78976887-61-------6442777676512000----454048865245 No 49 >PRK02268 hypothetical protein; Provisional Probab=38.19 E-value=34 Score=15.90 Aligned_cols=33 Identities=15% Similarity=0.381 Sum_probs=17.4 Q ss_pred CCCEECC-EECCCCEEEEECCCCCEEEECCCCEEEEE Q ss_conf 8879022-03137889970568608897699779998 Q gi|254780848|r 60 SGKVIEP-EVSKGDIVLFGKWSGTEIKLNDGEEYLVM 95 (111) Q Consensus 60 ~G~~~p~-~vkvGD~Vl~~~~~g~~v~~~dg~~y~i~ 95 (111) .|+.-|. -+++||+|+| |+.... +.+++.+.-+ T Consensus 27 HGK~apL~Rm~~GD~ivY--YSPk~~-~~~~~~lqaF 60 (141) T PRK02268 27 HGKAAPLRRMKPGDWIIY--YSPKTT-FGGKDKLQAF 60 (141) T ss_pred CCCCCHHHCCCCCCEEEE--ECCEEE-CCCCCEEEEE T ss_conf 784015532899998999--578410-4999656899 No 50 >cd04466 S1_YloQ_GTPase S1_YloQ_GTPase: YloQ GTase family (also known as YjeQ and CpgA), S1-like RNA-binding domain. Proteins in the YloQ GTase family bind the ribosome and have GTPase activity. The precise role of this family is unknown. The protein structure is composed of three domains: an N-terminal S1 domain, a central GTPase domain, and a C-terminal zinc finger domain. This N-terminal S1 domain binds ssRNA. The central GTPase domain contains nucleotide-binding signature motifs: G1 (walker A), G3 (walker B) and G4 motifs. Experiments show that the bacterial YloQ and YjeQ proteins have low intrinsic GTPase activity. The C-terminal zinc-finger domain has structural similarity to a portion of the DNA-repair protein Rad51. This suggests a possible role for this GTPase as a regulator of translation, perhaps as a translation initiation factor. This family is classified based on the N-terminal S1 domain. Probab=36.88 E-value=36 Score=15.78 Aligned_cols=14 Identities=29% Similarity=0.202 Sum_probs=10.1 Q ss_pred EECCCCEEEEECCC Q ss_conf 03137889970568 Q gi|254780848|r 67 EVSKGDIVLFGKWS 80 (111) Q Consensus 67 ~vkvGD~Vl~~~~~ 80 (111) .+-+||+|.|..-. T Consensus 37 ~~~vGD~V~~~~~~ 50 (68) T cd04466 37 PPAVGDRVEFEPED 50 (68) T ss_pred CCCCCEEEEEEECC T ss_conf 98642399999998 No 51 >pfam07602 DUF1565 Protein of unknown function (DUF1565). These proteins share a region of homology in their N termini, and are found in several phylogenetically diverse bacteria and in the archaeon Methanosarcina acetivorans. Some of these proteins also contain characterized domains such as pfam00395 and pfam03422. Probab=36.23 E-value=32 Score=16.02 Aligned_cols=30 Identities=17% Similarity=0.362 Sum_probs=22.4 Q ss_pred EEEEEEECCCEEC-CCCCEECCEECCCCEEE Q ss_conf 4799982387086-78879022031378899 Q gi|254780848|r 46 SGEIMWVGAGVMD-QSGKVIEPEVSKGDIVL 75 (111) Q Consensus 46 ~G~VvavG~g~~~-~~G~~~p~~vkvGD~Vl 75 (111) .|.+|.+.||.|+ .+|+..|..++.|-+++ T Consensus 27 ~~t~I~lapGtYs~~tGE~FPl~ip~gv~l~ 57 (255) T pfam07602 27 AGTIIQLAPGTYSEATGETFPIIIPDGVTLQ 57 (255) T ss_pred CCCEEEECCCCCCCCCCCEEEEEECCCCEEE T ss_conf 9988995996212446855126807983897 No 52 >COG4384 Mu-like prophage protein gp45 [Function unknown] Probab=35.91 E-value=37 Score=15.69 Aligned_cols=24 Identities=17% Similarity=0.300 Sum_probs=18.5 Q ss_pred CCCCEECCEECCCCEEEEECCCCC Q ss_conf 788790220313788997056860 Q gi|254780848|r 59 QSGKVIEPEVSKGDIVLFGKWSGT 82 (111) Q Consensus 59 ~~G~~~p~~vkvGD~Vl~~~~~g~ 82 (111) +++...|+-++.||+|+|+.++.. T Consensus 87 ~~~syR~~GL~aGeT~iY~~eG~~ 110 (203) T COG4384 87 QHGSYRITGLKAGETVIYNHEGAK 110 (203) T ss_pred CCCCCCCCCCCCCCEEEEECCCCE T ss_conf 388611136567746888225748 No 53 >pfam01079 Hint Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module. Probab=35.09 E-value=38 Score=15.61 Aligned_cols=17 Identities=35% Similarity=0.524 Sum_probs=11.9 Q ss_pred CCEECCCCEEEEECCCC Q ss_conf 22031378899705686 Q gi|254780848|r 65 EPEVSKGDIVLFGKWSG 81 (111) Q Consensus 65 p~~vkvGD~Vl~~~~~g 81 (111) .-.|++||+|+..+..+ T Consensus 103 As~V~~Gd~v~v~~~~~ 119 (214) T pfam01079 103 ASDVRPGDYVLVQDNGG 119 (214) T ss_pred CCCCCCCCEEEEEECCC T ss_conf 22277898899997899 No 54 >TIGR01393 lepA GTP-binding protein LepA; InterPro: IPR006297 LepA (GUF1 in Saccaromyces) is a GTP-binding membrane protein related to EF-G and EF-Tu. Two types of phylogenetic tree, rooted by other GTP-binding proteins, suggest that eukaryotic homologs (including GUF1 of yeast) originated within the bacterial LepA family. The function of the proteins in this family are unknown. ; GO: 0005525 GTP binding. Probab=34.62 E-value=39 Score=15.57 Aligned_cols=64 Identities=22% Similarity=0.283 Sum_probs=37.9 Q ss_pred CCEEEEEECC-CCCCCCCCEEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCCCE Q ss_conf 8479999747-5421225579945765546530247999823870867887902203137889970568608 Q gi|254780848|r 13 RGRVVVRRLQ-SEIKTATGNILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSGTE 83 (111) Q Consensus 13 ~drVLVk~~~-~e~~T~~GgI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g~~ 83 (111) ||+|..=-.. .-+-+.-| ++.|........+..|.|=-+-.|-.+-. +++|||+|--.++...+ T Consensus 221 gD~I~~Mstgk~y~V~evG-~~~P~~~~~~~~L~aGeVGy~~AgIK~v~------D~~VGDTiT~~~~Pa~e 285 (598) T TIGR01393 221 GDKIRFMSTGKEYEVDEVG-VFTPKLEVKTQELSAGEVGYIIAGIKDVS------DVKVGDTITSVKNPAKE 285 (598) T ss_pred CCEEEEEECCCEEEEEEEE-EECCCCCCCCCCEECCCEEEEEEEEEECC------CEECCCEEECCCCCCCC T ss_conf 9889995348766675500-34345201466200163059998653104------11205445256787376 No 55 >COG1329 Transcriptional regulators, similar to M. xanthus CarD [Transcription] Probab=34.00 E-value=28 Score=16.38 Aligned_cols=40 Identities=25% Similarity=0.341 Sum_probs=23.5 Q ss_pred CEECCCCEEEEECCCCCEEE------EC-CCCEEEEE--EHHHEEEEEE Q ss_conf 20313788997056860889------76-99779998--5301789984 Q gi|254780848|r 66 PEVSKGDIVLFGKWSGTEIK------LN-DGEEYLVM--QESDIMGIVV 105 (111) Q Consensus 66 ~~vkvGD~Vl~~~~~g~~v~------~~-dg~~y~i~--~e~dIlavi~ 105 (111) |..++||+|+||.++.-.|+ ++ +..+|+.+ ..+|....|. T Consensus 3 ~~FkiGd~VVYP~HGvG~I~~Ieeke~~Ge~~~yyVI~f~~~dm~v~VP 51 (166) T COG1329 3 MAFKIGDHVVYPAHGVGIIQAIEEKEIAGETLEYYVIDFPQSDMTVMVP 51 (166) T ss_pred CCCCCCCEEEECCCCCEEEEHHHHHHHCCCEEEEEEEEECCCCCEEEEE T ss_conf 6342799888358766530056677625852489999973788289856 No 56 >TIGR02390 RNA_pol_rpoA1 DNA-directed RNA polymerase subunit A'; InterPro: IPR012758 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This family consists of the archaeal A' subunit of the DNA-directed RNA polymerase. The example from Methanococcus jannaschii contains an intein.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0008270 zinc ion binding, 0006350 transcription. Probab=33.42 E-value=41 Score=15.46 Aligned_cols=28 Identities=32% Similarity=0.662 Sum_probs=21.1 Q ss_pred EECCCCEEEEEC--------CCCCEEEECCCCEEEE Q ss_conf 031378899705--------6860889769977999 Q gi|254780848|r 67 EVSKGDIVLFGK--------WSGTEIKLNDGEEYLV 94 (111) Q Consensus 67 ~vkvGD~Vl~~~--------~~g~~v~~~dg~~y~i 94 (111) .+.-||.|||++ .-|+.|++-=|++|++ T Consensus 432 HL~dGDiVLFNRQPSLHRmSmMgH~VkVLPgkTFRL 467 (901) T TIGR02390 432 HLIDGDIVLFNRQPSLHRMSMMGHKVKVLPGKTFRL 467 (901) T ss_pred EECCCCEEEECCCCCHHHHHHCCCEEEECCCCCCCC T ss_conf 712688876658864202101144546788885213 No 57 >cd03699 lepA_II lepA_II: This subfamily represents the domain II of LepA, a GTP-binding protein localized in the cytoplasmic membrane. The N-terminal domain of LepA shares regions of homology to translation factors. In terms of interaction with the ribosome, EF-G, EF-Tu and IF2 have all been demonstrated to interact at overlapping sites on the ribosome. Chemical protection studies demonstrate that they all include the universally conserved alpha-sarcin loop as part of their binding site. These data indicate that LepA may bind to this location on the ribosome as well. LepA has never been observed in archaea, and eukaryl LepA is organellar. LepA is therefore a true bacterial GTPase, found only in the bacterial lineage. Probab=30.81 E-value=39 Score=15.54 Aligned_cols=38 Identities=21% Similarity=0.422 Sum_probs=16.9 Q ss_pred EEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEE Q ss_conf 799457655465302479998238708678879022031378899 Q gi|254780848|r 31 NILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVL 75 (111) Q Consensus 31 gI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl 75 (111) |++.|+... ...+..|+|=.+-.|..+ +-++++||++- T Consensus 48 G~~~P~~~~-~~~L~aGeVGyii~gik~------~~d~~vGDTit 85 (86) T cd03699 48 GIFRPEMTP-TDELSAGQVGYIIAGIKT------VKDARVGDTIT 85 (86) T ss_pred EECCCCCCC-CCEECCCCCCEEEECCCC------CCCCCCCCEEE T ss_conf 871799763-884947840489973420------15573447875 No 58 >pfam06890 Phage_Mu_Gp45 Bacteriophage Mu Gp45 protein. This family consists of Bacteriophage Mu Gp45 related proteins from both phages and bacteria. The function of this family is unknown although it has been suggested that family members may be involved in baseplate assembly. Probab=30.62 E-value=45 Score=15.19 Aligned_cols=35 Identities=20% Similarity=0.287 Sum_probs=21.3 Q ss_pred EEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCCCEEEE Q ss_conf 247999823870867887902203137889970568608897 Q gi|254780848|r 45 SSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSGTEIKL 86 (111) Q Consensus 45 ~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g~~v~~ 86 (111) .+|.||++...+| .|..++.|+..+|...+ ..|.+ T Consensus 58 sh~Vviaved~~y------R~~~L~~GEvalY~~~G-~~I~L 92 (162) T pfam06890 58 SHGVAVVVEDRRY------RPKGLEAGDVALYHDEG-HRVRL 92 (162) T ss_pred CCEEEEEECCCCC------CCCCCCCCCEEEEECCC-CEEEE T ss_conf 1019999768553------12578888589980689-99999 No 59 >COG2217 ZntA Cation transport ATPase [Inorganic ion transport and metabolism] Probab=29.94 E-value=47 Score=15.12 Aligned_cols=29 Identities=28% Similarity=0.528 Sum_probs=20.5 Q ss_pred EEEEEEECCCEECC---CCCEECCEECCCCEEE Q ss_conf 47999823870867---8879022031378899 Q gi|254780848|r 46 SGEIMWVGAGVMDQ---SGKVIEPEVSKGDIVL 75 (111) Q Consensus 46 ~G~VvavG~g~~~~---~G~~~p~~vkvGD~Vl 75 (111) -|+|++ |.+..++ +|+.+|...++||.|+ T Consensus 247 DG~V~~-G~s~vDeS~iTGEs~PV~k~~Gd~V~ 278 (713) T COG2217 247 DGVVVS-GSSSVDESMLTGESLPVEKKPGDEVF 278 (713) T ss_pred CEEEEE-CCEEEECCHHHCCCCCEECCCCCEEE T ss_conf 759996-70773301311888776669989886 No 60 >pfam01959 DHQS 3-dehydroquinate synthase (EC 4.6.1.3). 3-Dehydroquinate synthase is an enzyme in the common pathway of aromatic amino acid biosynthesis that catalyses the conversion of 3-deoxy-D-arabino-heptulosonic acid 7-phosphate (DAHP) into 3-dehydroquinic acid. This synthesis of aromatic amino acids is an essential metabolic function for most prokaryotic as well as lower eukaryotic cells, including plants. The pathway is absent in humans; therefore, DHQS represents a potential target for the development of novel and selective antimicrobial agents. Owing to the threat posed by the spread of pathogenic bacteria resistant to many currently used antimicrobial drugs, there is clearly a need to develop new anti-infective drugs acting at novel targets. A further potential use for DHQS inhibitors is as herbicides. Probab=28.10 E-value=34 Score=15.89 Aligned_cols=19 Identities=37% Similarity=0.443 Sum_probs=9.9 Q ss_pred CCCCEECC-EECCCCEEEEE Q ss_conf 78879022-03137889970 Q gi|254780848|r 59 QSGKVIEP-EVSKGDIVLFG 77 (111) Q Consensus 59 ~~G~~~p~-~vkvGD~Vl~~ 77 (111) ++|+.++. ++|+||.||.. T Consensus 305 ~~G~~vsVt~Lk~GD~vL~~ 324 (343) T pfam01959 305 PDGTPISVTELKPGDKVLVY 324 (343) T ss_pred CCCCEEEEEECCCCCEEEEE T ss_conf 99968861545899999998 No 61 >TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase; InterPro: IPR011960 This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases may be excluded from this entry.. Probab=28.00 E-value=37 Score=15.68 Aligned_cols=46 Identities=20% Similarity=0.468 Sum_probs=30.8 Q ss_pred EEECCCCC--CCCCCEEEEEEEECCCEEC-CCCCEECCEECCCCEEEEECC Q ss_conf 99457655--4653024799982387086-788790220313788997056 Q gi|254780848|r 32 ILIPDTVS--EKPSASSGEIMWVGAGVMD-QSGKVIEPEVSKGDIVLFGKW 79 (111) Q Consensus 32 I~lp~~~~--ek~~~~~G~VvavG~g~~~-~~G~~~p~~vkvGD~Vl~~~~ 79 (111) ++||.... .+.+.+-=+=|-=|.|+++ -+|++-+| .+||.|+=+.+ T Consensus 88 LilPGevAP~HRH~QsALRFiveG~GA~TAVDGErt~M--~~gDfiiTP~~ 136 (343) T TIGR02272 88 LILPGEVAPSHRHTQSALRFIVEGKGAYTAVDGERTTM--HPGDFIITPSW 136 (343) T ss_pred EEECCCCCCCCCCCCCEEEEEEECCCCEEEECCCCCCC--CCCCEEECCCC T ss_conf 54167647766665120236773784065125753657--77775677898 No 62 >PRK13211 N-acetylglucosamine-binding protein A; Reviewed Probab=27.99 E-value=34 Score=15.86 Aligned_cols=26 Identities=23% Similarity=0.365 Sum_probs=17.5 Q ss_pred CEECCCCCEEC-CEECCCCEEEEECCC Q ss_conf 70867887902-203137889970568 Q gi|254780848|r 55 GVMDQSGKVIE-PEVSKGDIVLFGKWS 80 (111) Q Consensus 55 g~~~~~G~~~p-~~vkvGD~Vl~~~~~ 80 (111) +.+...|.+.| +++++||.|+++=|. T Consensus 199 ~~W~~~G~I~p~~dL~~GD~V~~RvFd 225 (475) T PRK13211 199 GEWTQVGQINPSMDLKVGDKVMTRVFD 225 (475) T ss_pred CCCEECCCCCCCCCCCCCCEEEEEEEC T ss_conf 313545648556778989999999831 No 63 >TIGR01511 ATPase-IB1_Cu copper-translocating P-type ATPase; InterPro: IPR006403 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles . P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H^+, Na^+, K^+, Mg^2+, Ca^2+, Ag^+ and Ag^2+, Zn^2+, Co^2+, Pb^2+, Ni^2+, Cd^2+, Cu^+ and Cu^2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents P-type ATPases that specialise in cation and copper transport; those transporting copper (3.6.3.4 from EC) have been classified as belonging to the IB subfamily , , . These proteins are involved in a variety of processes in both prokaryotes and eukaryotes. Some of these P-ATPases import copper into cells, such as in Arabidopsis sp., which require copper to create functional hormone receptors for ethylene signalling. Other P-ATPases are involved in copper export out of cells, such as occurs with the efflux of hepatic copper into the bile in eukaryotes. In bacteria, some of these P-ATPases play a role in osmotic adaptation. Certain copper ATPases have been implicated in both Wilson and Menkes diseases . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0004008 copper-exporting ATPase activity, 0005524 ATP binding, 0046872 metal ion binding, 0046873 metal ion transmembrane transporter activity, 0030001 metal ion transport, 0016020 membrane. Probab=27.64 E-value=51 Score=14.89 Aligned_cols=63 Identities=19% Similarity=0.358 Sum_probs=39.3 Q ss_pred CCCCCCCCEEEEEECCCCCCCCCC--EEEECCCCCCCCCCEEEEEEEECCCEECC---CCCEECCEECCCCEEE Q ss_conf 242517847999974754212255--79945765546530247999823870867---8879022031378899 Q gi|254780848|r 7 NYLRPTRGRVVVRRLQSEIKTATG--NILIPDTVSEKPSASSGEIMWVGAGVMDQ---SGKVIEPEVSKGDIVL 75 (111) Q Consensus 7 ~~ikPl~drVLVk~~~~e~~T~~G--gI~lp~~~~ek~~~~~G~VvavG~g~~~~---~G~~~p~~vkvGD~Vl 75 (111) .+|+|-.=++|.+..+.++-. .| ..+.|.+ |. ..-|+||. |....++ +||-.|...|+||.|+ T Consensus 75 ~~L~P~~A~~~~~~v~v~~l~-~GD~~~V~pG~---k~-PvDG~v~~-G~S~vDES~~TGEs~PV~K~~Gd~Vi 142 (545) T TIGR01511 75 AKLQPETATLLTDDVPVAELQ-PGDIVKVLPGE---KI-PVDGTVIE-GESEVDESLVTGESLPVPKKVGDTVI 142 (545) T ss_pred HHCCCCCEEEECCCCCHHHCC-CCCEEEECCCC---CC-CCCCEEEC-CCCEEHHHHHCCCCCCEECCCCCEEE T ss_conf 840852238731672364418-88889975789---56-40407851-32022245555888665027888799 No 64 >COG2140 Thermophilic glucose-6-phosphate isomerase and related metalloenzymes [Carbohydrate transport and metabolism / General function prediction only] Probab=26.27 E-value=55 Score=14.75 Aligned_cols=33 Identities=24% Similarity=0.368 Sum_probs=25.5 Q ss_pred CCCCCEECCEECCCCEEEEECCCCCEEEECCCCE Q ss_conf 6788790220313788997056860889769977 Q gi|254780848|r 58 DQSGKVIEPEVSKGDIVLFGKWSGTEIKLNDGEE 91 (111) Q Consensus 58 ~~~G~~~p~~vkvGD~Vl~~~~~g~~v~~~dg~~ 91 (111) ..+|+.+-+.++.||.++.+++.|+-+ .|.|.+ T Consensus 120 ~~~G~~~v~~~~~Gd~iyVPp~~gH~t-~N~Gd~ 152 (209) T COG2140 120 KPEGEARVIAVRAGDVIYVPPGYGHYT-INTGDE 152 (209) T ss_pred CCCCCEEEEEECCCCEEEECCCCCEEE-ECCCCC T ss_conf 589957999825897899489764374-658999 No 65 >TIGR01073 pcrA ATP-dependent DNA helicase PcrA; InterPro: IPR005751 The DNA helicase PcrA is found mainly in Gram-positive bacteria and belongs to a superfamily of helicases, together with Rep and UvrD helicases from Escherichia coli. The action of the 3'5' bacterial DNA helicase is based upon two separate but coupled activities, ssDNA translocation and duplex destabilisation, and is driven by energy derived from the continuous ATP-binding and hydrolysis events that take place in the active-site cleft. The resulting conformational changes that accompany these events underpin the coupling process and allow the helicase to translocate along the DNA, destabilising the duplex and separating the two strands in an active process .; GO: 0004003 ATP-dependent DNA helicase activity, 0006268 DNA unwinding during replication, 0005737 cytoplasm. Probab=25.79 E-value=37 Score=15.67 Aligned_cols=22 Identities=27% Similarity=0.363 Sum_probs=17.7 Q ss_pred CEECCCCEEEEECCC-CCEEEEC Q ss_conf 203137889970568-6088976 Q gi|254780848|r 66 PEVSKGDIVLFGKWS-GTEIKLN 87 (111) Q Consensus 66 ~~vkvGD~Vl~~~~~-g~~v~~~ 87 (111) ...++||+|.+.+|+ ||-+.+. T Consensus 760 ~~w~vGDrv~H~~fG~GTVv~v~ 782 (811) T TIGR01073 760 LSWAVGDRVSHKKFGVGTVVSVK 782 (811) T ss_pred EEEECCCCCCCCCCCEEEEEEEE T ss_conf 24405861034300005898885 No 66 >TIGR02228 sigpep_I_arch signal peptidase I; InterPro: IPR001733 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidases belong to MEROPS peptidase family S26 (signal peptidase I family, clan SF), subfamily S26B. Eukaryotic microsomal signal peptidase is involved in the removal of signal peptides from secretory proteins as they pass into the endoplasmic reticulum lumen . The peptidase is more complex than its mitochondrial and bacterial counterparts, containing a number of subunits, ranging from two in the chicken oviduct peptidase, to five in the dog pancreas protein . They share sequence similarity with the bacterial leader peptidases (family S26A), although activity here is mediated by a serine/histidine dyad rather than a serine/lysine dyad . Archaeal signal peptidases also belong to this group. ; GO: 0008233 peptidase activity, 0006465 signal peptide processing, 0006508 proteolysis, 0016020 membrane. Probab=25.54 E-value=53 Score=14.82 Aligned_cols=61 Identities=18% Similarity=0.373 Sum_probs=36.4 Q ss_pred CCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCC----CEEEEC-CCCEEEEEEHHHEEEEEE Q ss_conf 55465302479998238708678879022031378899705686----088976-997799985301789984 Q gi|254780848|r 38 VSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSG----TEIKLN-DGEEYLVMQESDIMGIVV 105 (111) Q Consensus 38 ~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g----~~v~~~-dg~~y~i~~e~dIlavi~ 105 (111) ....|..+.|-++.+-+ ...|-++|+||-|.|..-.. .-+... =+.+|..++.++=+..++ T Consensus 45 gSMEP~f~~Gd~~~~~~-------~~~~~~~~~GDvI~y~~~~~~WyG~~v~HRv~~~~yiyi~~~~~~~fiT 110 (175) T TIGR02228 45 GSMEPTFNTGDLILVTG-------KVDPEDIQVGDVIVYKSEGKRWYGTPVIHRVIEIKYIYINSGGELGFIT 110 (175) T ss_pred EEECCEEECCEEEEEEC-------CCCHHHCCCCCEEEEEECCCCCCCEEEEEEEEEEEEEEEECCCEEEEEE T ss_conf 54306252130999804-------2176663015679992469952353899999988999996598789994 No 67 >pfam02559 CarD_TRCF CarD-like/TRCF domain. CarD is a Myxococcus xanthus protein required for the activation of light- and starvation-inducible genes. This family includes the presumed N-terminal domain. CarD interacts with the zinc-binding protein CarG, to form a complex that regulates multiple processes in Myxococcus xanthus. This family also includes a domain to the N-terminal side of the DEAD helicase of TRCF proteins. TRCF displaces RNA polymerase stalled at a lesion, binds to the damage recognition protein UvrA, and increases the template strand repair rate during transcription. This domain is involved in binding to the stalled RNA polymerase. Probab=24.86 E-value=34 Score=15.85 Aligned_cols=13 Identities=23% Similarity=0.371 Sum_probs=9.4 Q ss_pred ECCCCEEEEECCC Q ss_conf 3137889970568 Q gi|254780848|r 68 VSKGDIVLFGKWS 80 (111) Q Consensus 68 vkvGD~Vl~~~~~ 80 (111) .++||+|+++.|+ T Consensus 2 ~~~GD~VVh~~~G 14 (98) T pfam02559 2 LKVGDYVVHPDHG 14 (98) T ss_pred CCCCCEEECCCCC T ss_conf 9999999777996 No 68 >PRK10838 spr putative outer membrane lipoprotein; Provisional Probab=24.81 E-value=17 Score=17.52 Aligned_cols=34 Identities=24% Similarity=0.415 Sum_probs=20.2 Q ss_pred EEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECC Q ss_conf 7994576554653024799982387086788790220313788997056 Q gi|254780848|r 31 NILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKW 79 (111) Q Consensus 31 gI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~ 79 (111) ||-||-++.+. ...|+-|+. -++++||.|+|... T Consensus 107 Gi~LPRts~~Q--~~~G~~V~~-------------~~l~pGDLVFF~~~ 140 (188) T PRK10838 107 GLELPRSTYEQ--QEMGKSVSR-------------SNLRTGDLVLFRAG 140 (188) T ss_pred CCCCCCCHHHH--HHCCCCCCH-------------HHCCCCCEEEECCC T ss_conf 80189438999--973976377-------------87899888996699 No 69 >COG0298 HypC Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones] Probab=24.32 E-value=60 Score=14.53 Aligned_cols=17 Identities=35% Similarity=0.481 Sum_probs=11.5 Q ss_pred CEECCEECCCCEEEEEC Q ss_conf 79022031378899705 Q gi|254780848|r 62 KVIEPEVSKGDIVLFGK 78 (111) Q Consensus 62 ~~~p~~vkvGD~Vl~~~ 78 (111) ..++-+++.||||+..- T Consensus 33 ~Lv~~~v~~GdyVLVHv 49 (82) T COG0298 33 DLVGEEVKVGDYVLVHV 49 (82) T ss_pred EEECCCCCCCCEEEEEE T ss_conf 33047533477899995 No 70 >smart00696 DM9 Repeats found in Drosophila proteins. Probab=23.90 E-value=61 Score=14.49 Aligned_cols=52 Identities=23% Similarity=0.431 Sum_probs=35.8 Q ss_pred ECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEEEECCCCCEEEEC Q ss_conf 457655465302479998238708678879022031378899705686088976 Q gi|254780848|r 34 IPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSGTEIKLN 87 (111) Q Consensus 34 lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g~~v~~~ 87 (111) +|..|-.......|+.+=||...+ +|...|-.+.+.-.+.|-.|.|.++... T Consensus 12 vP~~Av~~G~~~~Ge~lYvgR~~~--~g~~~pGKv~p~~~~~yi~~gG~E~~~~ 63 (71) T smart00696 12 IPPNAVVGGTDSDGEPLYVGRAYY--EGSLLPGKVVPSHGCAYIPYGGQEVRLD 63 (71) T ss_pred CCCCCEEEEECCCCCEEEEEEEEE--CCEEEEEEEECCCCEEEEEECCEEEECC T ss_conf 899849942079999999999999--9868899998068999981399997647 No 71 >cd04456 S1_IF1A_like S1_IF1A_like: Translation initiation factor IF1A-like, S1-like RNA-binding domain. IF1A is also referred to as eIF1A in eukaryotes and aIF1A in archaea. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. IF1A is essential for translation initiation. eIF1A acts synergistically with eIF1 to mediate assembly of ribosomal initiation complexes at the initiation codon and maintain the accuracy of this process by recognizing and destabilizing aberrant preinitiation complexes from the mRNA. Without eIF1A and eIF1, 43S ribosomal preinitiation complexes can bind to the cap-proximal region, but are unable to reach the initiation codon. eIF1a also enhances the formation of 5'-terminal complexes in the presence of other translation initiation factors. This protein family is only found in eukaryotes and archaea. Probab=23.44 E-value=46 Score=15.16 Aligned_cols=16 Identities=13% Similarity=0.310 Sum_probs=13.5 Q ss_pred CCEECCCCEEEEECCC Q ss_conf 2203137889970568 Q gi|254780848|r 65 EPEVSKGDIVLFGKWS 80 (111) Q Consensus 65 p~~vkvGD~Vl~~~~~ 80 (111) -+|++.||.|+...+. T Consensus 36 rvWI~~gD~VlV~~~e 51 (78) T cd04456 36 NIWIKRGDFLIVDPIE 51 (78) T ss_pred EEEEECCCEEEEEEHH T ss_conf 0898369999998812 No 72 >PRK10413 hydrogenase 2 accessory protein HypG; Provisional Probab=23.43 E-value=62 Score=14.44 Aligned_cols=12 Identities=25% Similarity=0.086 Sum_probs=7.9 Q ss_pred EECCCCEEEEEC Q ss_conf 031378899705 Q gi|254780848|r 67 EVSKGDIVLFGK 78 (111) Q Consensus 67 ~vkvGD~Vl~~~ 78 (111) ..++|||||..- T Consensus 42 ~~~vGDyVLVHv 53 (82) T PRK10413 42 ADLLGQWVLVHV 53 (82) T ss_pred CCCCCCEEEEEE T ss_conf 666588999980 No 73 >smart00652 eIF1a eukaryotic translation initiation factor 1A. Probab=22.63 E-value=57 Score=14.66 Aligned_cols=30 Identities=27% Similarity=0.403 Sum_probs=18.9 Q ss_pred EEEECCCEECCCCCEECCEECCCCEEEEECCCCCE Q ss_conf 99823870867887902203137889970568608 Q gi|254780848|r 49 IMWVGAGVMDQSGKVIEPEVSKGDIVLFGKWSGTE 83 (111) Q Consensus 49 VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~~g~~ 83 (111) .++.=||++-. -+|++.||+|+.+.....+ T Consensus 30 rl~~i~gk~rk-----r~wi~~Gd~VlV~~r~~~~ 59 (83) T smart00652 30 RLARIPGKMRK-----KVWIRRGDIVLVDPWDFQD 59 (83) T ss_pred EEEEECHHCEE-----EEEEECCCEEEEEECCCCC T ss_conf 99998303030-----2798479999998136778 No 74 >TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins. Probab=22.62 E-value=64 Score=14.34 Aligned_cols=62 Identities=18% Similarity=0.244 Sum_probs=37.5 Q ss_pred CCCCCCEEEECCCCCCC-CCCEEEEEEEECCCEE-------------------------CCCCCEECCEECCCCEEEEEC Q ss_conf 21225579945765546-5302479998238708-------------------------678879022031378899705 Q gi|254780848|r 25 IKTATGNILIPDTVSEK-PSASSGEIMWVGAGVM-------------------------DQSGKVIEPEVSKGDIVLFGK 78 (111) Q Consensus 25 ~~T~~GgI~lp~~~~ek-~~~~~G~VvavG~g~~-------------------------~~~G~~~p~~vkvGD~Vl~~~ 78 (111) .++.+|.+-+.||..-. .+...+-+|.+-||.. ...|.....+...||.-++++ T Consensus 224 ~~~~GG~v~ivDs~nFpis~tiaaa~v~~~PGamRElHWHPnadEWqyfi~G~aR~TVF~~~g~arTfd~~aGDVGyvP~ 303 (367) T TIGR03404 224 KQVPGGTVRIADSTNFPVSKTIAAAIVTVEPGAMRELHWHPNADEWQYFIQGQARMTVFAAGGNARTFDYQAGDVGYVPR 303 (367) T ss_pred EECCCCEEEEECCCCCCHHHHHHHEEEEECCCCCCCCCCCCCCHHHEEEEECCEEEEEECCCCCEEEEECCCCCEEEECC T ss_conf 76589639983177675146402005773378501345389715630577364789997268832576315886689789 Q ss_pred CCCCEEEE Q ss_conf 68608897 Q gi|254780848|r 79 WSGTEIKL 86 (111) Q Consensus 79 ~~g~~v~~ 86 (111) ..|.-|+- T Consensus 304 ~~GHyIEN 311 (367) T TIGR03404 304 NMGHYVEN 311 (367) T ss_pred CCCEEEEE T ss_conf 87515564 No 75 >PRK13914 invasion associated secreted endopeptidase; Provisional Probab=22.49 E-value=53 Score=14.81 Aligned_cols=13 Identities=31% Similarity=0.830 Sum_probs=10.6 Q ss_pred EECCCCEEEEECCC Q ss_conf 03137889970568 Q gi|254780848|r 67 EVSKGDIVLFGKWS 80 (111) Q Consensus 67 ~vkvGD~Vl~~~~~ 80 (111) ++++||.|+|. |. T Consensus 420 ~~~pGDlvFF~-~~ 432 (475) T PRK13914 420 QAKPGDLVFFD-YG 432 (475) T ss_pred CCCCCCEEEEC-CC T ss_conf 19987889978-89 No 76 >TIGR02922 TIGR02922 conserved hypothetical protein TIGR02922; InterPro: IPR014271 Two members of this family are found in Colwellia psychrerythraea (strain 34H / ATCC BAA-681) and one each in various other species of Colwellia and Shewanella. One member from C. psychrerythraea is of special interest because it is preceded by the same cis-regulatory site as a number of genes that have the PEP-CTERM domain described by IPR013424 from INTERPRO.. Probab=22.17 E-value=66 Score=14.29 Aligned_cols=38 Identities=32% Similarity=0.422 Sum_probs=24.1 Q ss_pred CCCCCEEEECCCCCCCCCCEEEEEEEECCCEECCCCCEECCEECCCCEEE Q ss_conf 12255799457655465302479998238708678879022031378899 Q gi|254780848|r 26 KTATGNILIPDTVSEKPSASSGEIMWVGAGVMDQSGKVIEPEVSKGDIVL 75 (111) Q Consensus 26 ~T~~GgI~lp~~~~ek~~~~~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl 75 (111) ..++|-++||.+=+ |.+ .+|||=.|--+= -=++||+|+ T Consensus 31 ~~~~Grv~iP~eFk-~GK----sIiAV~EG~i~i-------lNsvGdr~~ 68 (69) T TIGR02922 31 VNESGRVVIPQEFK-KGK----SIIAVCEGEITI-------LNSVGDRVL 68 (69) T ss_pred CCCCCCEECCCCCC-CCC----EEEEEECCCEEE-------ECCCCCEEC T ss_conf 66625776067566-898----689982486688-------503551605 No 77 >TIGR01941 nqrF NADH:ubiquinone oxidoreductase, Na(+)-translocating, F subunit; InterPro: IPR010205 This entry represents the NqrF subunit of the six-protein, Na(+)-pumping NADH-quinone reductase of a number of marine and pathogenic Gram-negative bacteria.; GO: 0016655 oxidoreductase activity acting on NADH or NADPH quinone or similar compound as acceptor, 0051537 2 iron 2 sulfur cluster binding, 0006814 sodium ion transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=21.85 E-value=67 Score=14.25 Aligned_cols=28 Identities=21% Similarity=0.254 Sum_probs=16.1 Q ss_pred EECCCCEEEEE-CCCCCEEEECCCCEEEEE Q ss_conf 03137889970-568608897699779998 Q gi|254780848|r 67 EVSKGDIVLFG-KWSGTEIKLNDGEEYLVM 95 (111) Q Consensus 67 ~vkvGD~Vl~~-~~~g~~v~~~dg~~y~i~ 95 (111) ++|+||+|+.+ +|+-.=++..| .+.+++ T Consensus 261 sLKpGDKv~~sGPfGefF~kDtd-aemvFi 289 (425) T TIGR01941 261 SLKPGDKVTVSGPFGEFFAKDTD-AEMVFI 289 (425) T ss_pred ECCCCCEEEEECCCCCCEEEECC-CCEEEE T ss_conf 44798777997788853024578-643897 No 78 >PRK02290 3-dehydroquinate synthase; Provisional Probab=20.87 E-value=59 Score=14.56 Aligned_cols=19 Identities=26% Similarity=0.319 Sum_probs=10.0 Q ss_pred CCCCEECC-EECCCCEEEEE Q ss_conf 78879022-03137889970 Q gi|254780848|r 59 QSGKVIEP-EVSKGDIVLFG 77 (111) Q Consensus 59 ~~G~~~p~-~vkvGD~Vl~~ 77 (111) ++|+.++. ++|+||+|+.. T Consensus 309 ~~G~~vsVt~Lk~GD~vL~~ 328 (347) T PRK02290 309 PDGTPVSVVDLKPGDKVLGY 328 (347) T ss_pred CCCCEEEEEECCCCCEEEEE T ss_conf 99968852545899999998 No 79 >PRK04012 translation initiation factor IF-1A; Provisional Probab=20.25 E-value=59 Score=14.57 Aligned_cols=15 Identities=27% Similarity=0.727 Sum_probs=13.2 Q ss_pred CEECCCCEEEEECCC Q ss_conf 203137889970568 Q gi|254780848|r 66 PEVSKGDIVLFGKWS 80 (111) Q Consensus 66 ~~vkvGD~Vl~~~~~ 80 (111) +|+++||.||.+.+. T Consensus 57 ~WI~~gDiVLV~~re 71 (99) T PRK04012 57 MWIREGDVVIVAPWD 71 (99) T ss_pred EEEECCCEEEEEEEC T ss_conf 497279999998554 No 80 >TIGR02818 adh_III_F_hyde S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase; InterPro: IPR014183 The members of this protein family show dual function. First, they remove formaldehyde, a toxic metabolite, by acting as S-(hydroxymethyl)glutathione dehydrogenase (1.1.1.284 from EC). S-(hydroxymethyl)glutathione can form spontaneously from formaldehyde and glutathione, and so this enzyme previously was designated glutathione-dependent formaldehyde dehydrogenase. These same proteins are also designated as class III alcohol dehydrogenases (1.1.1.1 from EC), i.e. alcohol dehydrogenases that do not require glutathione and they tend to show poor activity for ethanol among their various substrate alcohols.. Probab=20.23 E-value=72 Score=14.06 Aligned_cols=58 Identities=28% Similarity=0.426 Sum_probs=30.8 Q ss_pred CCCCCEEEEEECCC-CCCCCCCEEEECCCCCCC--CCCE----EEEEEEECCCEECCCCCEECCEECCCCEEEEECC Q ss_conf 51784799997475-421225579945765546--5302----4799982387086788790220313788997056 Q gi|254780848|r 10 RPTRGRVVVRRLQS-EIKTATGNILIPDTVSEK--PSAS----SGEIMWVGAGVMDQSGKVIEPEVSKGDIVLFGKW 79 (111) Q Consensus 10 kPl~drVLVk~~~~-e~~T~~GgI~lp~~~~ek--~~~~----~G~VvavG~g~~~~~G~~~p~~vkvGD~Vl~~~~ 79 (111) .|--..|||+.... -..|. ..-|.....|. |... .|.|.+||.|.. .++.||+|+ +-| T Consensus 23 ~P~~Gevl~~~~~tGvChtd--~ftlsG~d~eG~fP~~lGheG~G~v~~vGeGv~---------~~~~Gdhvi-Ply 87 (368) T TIGR02818 23 LPKKGEVLVRIVATGVCHTD--AFTLSGADPEGVFPVILGHEGAGIVEAVGEGVT---------SLKVGDHVI-PLY 87 (368) T ss_pred CCCCCCEEEEEEECCEEECC--CEEECCCCCCCCCEEEECCCCCCEEEECCCCEE---------EEECCCEEE-EEE T ss_conf 88777168987302301011--013237886652005771368704533067434---------530586366-410 No 81 >TIGR00074 hypC_hupF hydrogenase assembly chaperone HypC/HupF; InterPro: IPR001109 The large subunit of [NiFe]-hydrogenase, as well as other nickel metalloenzymes, is synthesised as a precursor devoid of the metalloenzyme active site. This precursor then undergoes a complex post-translational maturation process that requires a number of accessory proteins. The hydrogenase expression/formation proteins (HUPF/HYPC) form a family of small proteins that are hydrogenase precursor-specific chaperones required for this maturation process . They are believed to keep the hydrogenase precursor in a conformation accessible for metal incorporation , . . Probab=20.20 E-value=47 Score=15.11 Aligned_cols=18 Identities=33% Similarity=0.313 Sum_probs=11.8 Q ss_pred EECCCCEEEEECCCCCEE Q ss_conf 031378899705686088 Q gi|254780848|r 67 EVSKGDIVLFGKWSGTEI 84 (111) Q Consensus 67 ~vkvGD~Vl~~~~~g~~v 84 (111) .+++|||||..=-...++ T Consensus 44 ~~~~GdyvLvH~G~A~~~ 61 (88) T TIGR00074 44 EVKVGDYVLVHVGFAISV 61 (88) T ss_pred CCCCCCEEEEEEEHHHHH T ss_conf 999987763200046632 No 82 >pfam03079 ARD ARD/ARD' family. The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+. ARD and ARD' can be experimentally interconverted by removal of the bound metal ion and reconstitution with the appropriate metal ion. The two enzymes share the same substrate, 1,2-dihydroxy-3-keto-5-(methylthio)pentene, but yield different products. ARD' yields the alpha-keto precursor of methionine (and formate), thus forming part of the ubiquitous methionine salvage pathway that converts 5'-methylthioadenosine (MTA) to methionine. This pathway is responsible for the tight control of the concentration of MTA, which is a powerful inhibitor of polyamine biosynthesis and transmethylation reactions. ARD yields methylthiopropanoate, carbon monoxide and formate, and thus prevents the conversion of MTA to methionine. The role of the ARD catalysed Probab=20.11 E-value=73 Score=14.04 Aligned_cols=43 Identities=19% Similarity=0.162 Sum_probs=33.2 Q ss_pred EEEEEEECCCEEC---CCCCEECCEECCCCEEEEECCCCCEEEECC Q ss_conf 4799982387086---788790220313788997056860889769 Q gi|254780848|r 46 SGEIMWVGAGVMD---QSGKVIEPEVSKGDIVLFGKWSGTEIKLND 88 (111) Q Consensus 46 ~G~VvavG~g~~~---~~G~~~p~~vkvGD~Vl~~~~~g~~v~~~d 88 (111) .-+.+--|.|.++ .++..+-+.|+.||.+.++..--..|...+ T Consensus 94 EiR~i~~G~g~fdvr~~~d~wiri~~~~GDli~lPag~~HrFtl~~ 139 (157) T pfam03079 94 EIRYIVEGTGYFDVRDKDDVWIRVFVEKGDLISLPAGIYHRFTTTP 139 (157) T ss_pred EEEEEEEEEEEEEEECCCCEEEEEEECCCCEEEECCCCCEEEECCC T ss_conf 8999995349999991897399999858998996999848677589 Done!