Query gi|254780231|ref|YP_003064644.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 98 No_of_seqs 104 out of 559 Neff 5.4 Searched_HMMs 39220 Date Mon May 23 21:19:47 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780231.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK01310 hypothetical protein; 100.0 6.1E-28 1.6E-32 187.5 12.0 94 2-97 11-104 (106) 2 PRK00647 hypothetical protein; 99.9 1.6E-25 4.1E-30 173.2 9.6 88 3-97 7-95 (96) 3 PRK01530 hypothetical protein; 99.9 1.5E-24 3.7E-29 167.4 10.8 86 2-90 14-99 (105) 4 COG1872 Uncharacterized conser 99.9 2.8E-24 7.1E-29 165.8 10.7 87 2-93 13-100 (102) 5 PRK04021 hypothetical protein; 99.9 3.4E-24 8.7E-29 165.2 10.7 82 2-91 9-91 (92) 6 PRK05090 hypothetical protein; 99.9 2.7E-24 7E-29 165.8 10.2 74 2-82 11-84 (95) 7 pfam02594 DUF167 Uncharacteriz 99.9 2.4E-23 6.2E-28 160.2 9.6 74 2-81 5-78 (78) 8 TIGR00251 TIGR00251 conserved 99.8 7E-21 1.8E-25 145.5 6.7 80 2-86 8-87 (91) 9 KOG3276 consensus 99.7 3.7E-17 9.4E-22 123.3 8.9 86 2-94 35-122 (125) 10 pfam08968 DUF1885 Domain of un 80.4 3.6 9.1E-05 22.2 4.6 40 28-71 78-122 (128) 11 pfam01187 MIF Macrophage migra 76.9 7 0.00018 20.5 5.6 58 3-68 36-98 (114) 12 pfam09581 Spore_III_AF Stage I 62.2 9.3 0.00024 19.7 3.2 28 40-67 157-184 (185) 13 pfam02738 Ald_Xan_dh_C2 Molybd 58.6 17 0.00044 18.2 5.4 45 29-73 327-371 (543) 14 cd02577 PSTD1 PSTD1: Pseudouri 54.1 20 0.00052 17.7 4.7 44 28-75 23-66 (319) 15 cd02576 PseudoU_synth_ScPUS7 P 42.8 28 0.00072 16.9 3.1 43 41-84 37-83 (371) 16 TIGR03313 Se_sel_red_Mo probab 41.9 32 0.00081 16.6 6.0 45 29-73 670-714 (951) 17 TIGR03194 4hydrxCoA_A 4-hydrox 40.7 33 0.00084 16.4 4.9 46 29-74 454-499 (746) 18 cd02552 PseudoU_synth_TruD_lik 40.6 31 0.00078 16.6 3.0 44 28-75 29-72 (232) 19 pfam04551 GcpE GcpE protein. I 39.2 35 0.00089 16.3 3.4 62 29-93 222-283 (345) 20 PRK09800 putative selenate red 38.1 36 0.00093 16.2 6.0 45 29-73 646-690 (928) 21 KOG1759 consensus 38.0 28 0.0007 16.9 2.5 31 38-68 69-99 (115) 22 PRK00984 truD tRNA pseudouridi 35.4 34 0.00087 16.4 2.6 44 28-75 40-83 (344) 23 pfam01142 TruD tRNA pseudourid 35.2 41 0.001 15.9 3.1 43 28-74 39-81 (336) 24 TIGR00713 hemL glutamate-1-sem 33.5 10 0.00026 19.5 -0.3 21 62-82 121-141 (434) 25 PHA00659 V structural protein 31.3 40 0.001 16.0 2.4 26 37-63 74-99 (138) 26 cd01804 midnolin_N midnolin_N 31.2 47 0.0012 15.5 3.2 39 29-71 11-49 (78) 27 TIGR02416 CO_dehy_Mo_lg carbon 30.0 36 0.00091 16.2 2.0 39 33-73 494-532 (775) 28 TIGR00747 fabH 3-oxoacyl-(acyl 29.3 39 0.001 16.0 2.1 25 43-67 255-279 (329) 29 pfam02567 PhzC-PhzF Phenazine 28.1 54 0.0014 15.2 3.4 50 30-79 198-256 (280) 30 PHA02512 minor structural prot 27.8 51 0.0013 15.3 2.5 26 37-62 313-338 (844) 31 TIGR00151 ispF 2C-methyl-D-ery 27.3 54 0.0014 15.2 2.5 28 46-75 112-139 (159) 32 TIGR01941 nqrF NADH:ubiquinone 26.1 36 0.00091 16.2 1.4 19 29-47 226-245 (425) 33 COG0384 Predicted epimerase, P 25.5 60 0.0015 14.9 4.5 46 35-80 218-265 (291) 34 pfam12116 SpoIIID Stage III sp 24.5 35 0.0009 16.3 1.1 20 46-65 18-37 (82) 35 KOG4079 consensus 23.9 52 0.0013 15.3 1.9 19 41-59 91-109 (169) 36 pfam01361 Tautomerase Tautomer 22.9 67 0.0017 14.6 2.8 27 45-71 19-45 (60) 37 pfam12500 DUF3706 Protein of u 20.2 76 0.0019 14.3 2.9 30 43-72 5-34 (199) 38 PRK12880 3-oxoacyl-(acyl carri 20.1 60 0.0015 14.9 1.6 21 44-64 269-289 (353) No 1 >PRK01310 hypothetical protein; Validated Probab=99.95 E-value=6.1e-28 Score=187.54 Aligned_cols=94 Identities=34% Similarity=0.474 Sum_probs=88.6 Q ss_pred EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC Q ss_conf 05999994186653023101456667776799999638767978899999999982998444799835577806999829 Q gi|254780231|r 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 (98) Q Consensus 2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~ 81 (98) +.+.|||||+|++++|.|++.+.||. .+|+|+|+|||+|||||++|++|||++|+||||+|+|++|++||+|+|+|+| T Consensus 11 ~~l~Vrv~P~Asr~~i~G~~~d~dG~--~~LkvrV~ApP~dGkAN~al~~~LAk~lgvpks~v~i~~G~tSR~K~i~I~G 88 (106) T PRK01310 11 LRVAVRVTPRGGRDAIDGIETLADGR--SVLKVRVRAIAEGGEANRAVTELLAKALGVPKSSVRVLSGATSRLKQVAIDG 88 (106) T ss_pred EEEEEEECCCCCCCCEEEEECCCCCC--EEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCCEEEEEEC T ss_conf 89999985488756220030277776--0589999649987758999999999985998333899707877845999967 Q ss_pred CHHHHHHHHHHHHHCC Q ss_conf 9899999997523015 Q gi|254780231|r 82 DCKEITELLQNNDSLT 97 (98) Q Consensus 82 ~~~~i~~~L~~~~s~t 97 (98) +++++.+.|+...+.| T Consensus 89 d~~~l~~~L~~l~~~~ 104 (106) T PRK01310 89 DPEDLGEALRALTAAK 104 (106) T ss_pred CHHHHHHHHHHHHCCC T ss_conf 9999999999974469 No 2 >PRK00647 hypothetical protein; Validated Probab=99.93 E-value=1.6e-25 Score=173.15 Aligned_cols=88 Identities=32% Similarity=0.468 Sum_probs=80.0 Q ss_pred EEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECCC Q ss_conf 59999941866530231014566677767999996387679788999999999829984447998355778069998299 Q gi|254780231|r 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD 82 (98) Q Consensus 3 ~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~~ 82 (98) -|.|+|+|+|++++|.|++ +++|+|+|+|||+|||||+||++|||++|++|||+|+|++|++||+|+|+|+++ T Consensus 7 il~V~VqP~A~r~~i~G~~-------~~~LkV~vtApPvdGKAN~ali~~LAk~l~vpks~I~Ii~G~tSR~K~I~I~~~ 79 (96) T PRK00647 7 ILEVKVTPKARENKIVGFE-------GGILKVRVTEVPEKGKANDAVIALLAKALSLPKRDVTLIAGETSRKKKVLLPRS 79 (96) T ss_pred EEEEEEECCCCCCCEECCC-------CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCCEEEECCCC T ss_conf 9999986298736250002-------998999995599887189999999999969983447996067757338977996 Q ss_pred -HHHHHHHHHHHHHCC Q ss_conf -899999997523015 Q gi|254780231|r 83 -CKEITELLQNNDSLT 97 (98) Q Consensus 83 -~~~i~~~L~~~~s~t 97 (98) ...+.+.+-+..|.| T Consensus 80 ~~~~~~~~~~~~~~~~ 95 (96) T PRK00647 80 IKAIIFEQFPDVSSST 95 (96) T ss_pred CHHHHHHHCCCCCCCC T ss_conf 3358887597644466 No 3 >PRK01530 hypothetical protein; Reviewed Probab=99.92 E-value=1.5e-24 Score=167.43 Aligned_cols=86 Identities=28% Similarity=0.464 Sum_probs=77.0 Q ss_pred EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC Q ss_conf 05999994186653023101456667776799999638767978899999999982998444799835577806999829 Q gi|254780231|r 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 (98) Q Consensus 2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~ 81 (98) +-|.|+|+|+|++|+|.|+....+ +++|+|+|+|||+|||||+|+++|||++|++|||+|+|++|+|||.|+++|++ T Consensus 14 vll~vkVkP~Ak~N~I~G~~~~~~---~~~LKv~VtApPedGKAN~ali~~LAk~l~l~kS~i~Iv~G~tsR~K~i~I~n 90 (105) T PRK01530 14 ALLNLKVKPNAKQNLISNFVIINN---IPYLKLSIKAIPEQGKANEEIINYLAKEWKLSRSNIEIIKGHTHSLKTILIKN 90 (105) T ss_pred EEEEEEEECCCCCCCCCCEEECCC---CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEECCCCCCCEEEEEEC T ss_conf 999999703867031133661179---87389999749988807899999999995998222799507777722899915 Q ss_pred CHHHHHHHH Q ss_conf 989999999 Q gi|254780231|r 82 DCKEITELL 90 (98) Q Consensus 82 ~~~~i~~~L 90 (98) ..++....+ T Consensus 91 i~~~~l~~i 99 (105) T PRK01530 91 INEDYLNLI 99 (105) T ss_pred CCHHHHHHH T ss_conf 798999999 No 4 >COG1872 Uncharacterized conserved protein [Function unknown] Probab=99.91 E-value=2.8e-24 Score=165.76 Aligned_cols=87 Identities=33% Similarity=0.516 Sum_probs=79.8 Q ss_pred EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC Q ss_conf 05999994186653023101456667776799999638767978899999999982998444799835577806999829 Q gi|254780231|r 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 (98) Q Consensus 2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~ 81 (98) +.|.|+|+|+|+++.|.|++ .|+++|+|+|+|||++||||++|++|||+.|++|||+|+|++|++||.|+++|.+ T Consensus 13 ~~l~V~V~P~a~~~~i~g~~-----~~~~~Lkv~i~apP~~GKAN~~li~~Lak~~~v~kS~V~ivsGetsR~K~v~i~~ 87 (102) T COG1872 13 VLLRVRVKPKAKRDSIVGLD-----EWRKRLKVRITAPPVDGKANEELIKFLAKTFGVPKSSVEIVSGETSRLKTVLIKN 87 (102) T ss_pred EEEEEEECCCCCCCCCCCEE-----CCCCEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCEEEEECC T ss_conf 69999977788657533250-----6755599998469877613699999999995998541799846755530799548 Q ss_pred -CHHHHHHHHHHH Q ss_conf -989999999752 Q gi|254780231|r 82 -DCKEITELLQNN 93 (98) Q Consensus 82 -~~~~i~~~L~~~ 93 (98) +++++.+.+... T Consensus 88 i~~d~~~~~~~~~ 100 (102) T COG1872 88 IDPDQLPERLAAL 100 (102) T ss_pred CCHHHHHHHHHHH T ss_conf 8989999998864 No 5 >PRK04021 hypothetical protein; Reviewed Probab=99.91 E-value=3.4e-24 Score=165.22 Aligned_cols=82 Identities=26% Similarity=0.416 Sum_probs=76.6 Q ss_pred EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC Q ss_conf 05999994186653023101456667776799999638767978899999999982998444799835577806999829 Q gi|254780231|r 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 (98) Q Consensus 2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~ 81 (98) +-|.|+|+|||++++|.|++ .|+++|+|+++|||+|||||+|||+|||+.|+ |+|+|++|+|||+|+++|.| T Consensus 9 v~l~v~vqP~A~~~~i~G~~-----~~~~~Lkv~l~ApPv~GkAN~ali~flak~lg---s~V~i~~G~tSR~K~v~i~g 80 (92) T PRK04021 9 VIISVIVQPNAKENKIEGVD-----EWRGRIKVNIKAPPVKGKANKELIKFFSKLLG---AEVEIIRGETSREKDLLVKG 80 (92) T ss_pred EEEEEEEECCCCCCEEECCC-----CCCCEEEEEEECCCCCCHHHHHHHHHHHHHHC---CCEEEEECCCCCCCEEEEEC T ss_conf 99999981099855460313-----77885899987799877689999999999859---97899705777853899906 Q ss_pred -CHHHHHHHHH Q ss_conf -9899999997 Q gi|254780231|r 82 -DCKEITELLQ 91 (98) Q Consensus 82 -~~~~i~~~L~ 91 (98) +.+++.++|+ T Consensus 81 is~e~v~~~L~ 91 (92) T PRK04021 81 ISLEEVKKKLK 91 (92) T ss_pred CCHHHHHHHHC T ss_conf 99999999865 No 6 >PRK05090 hypothetical protein; Validated Probab=99.91 E-value=2.7e-24 Score=165.79 Aligned_cols=74 Identities=24% Similarity=0.345 Sum_probs=69.8 Q ss_pred EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC Q ss_conf 05999994186653023101456667776799999638767978899999999982998444799835577806999829 Q gi|254780231|r 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 (98) Q Consensus 2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~ 81 (98) +.|.|+|||||++++|.|++ ++.|+|+|+|||+||+||++||+|||++|+||||+|+|++|++||+|+|+|++ T Consensus 11 ~~l~V~V~P~A~r~~i~g~~-------~~~Lkv~v~ApPvdGkAN~ali~~LA~~l~vpks~V~i~~G~tsR~K~i~I~~ 83 (95) T PRK05090 11 LVLRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFLAKQFKVAKSQVVIEKGELGRHKQVRIIN 83 (95) T ss_pred EEEEEEEEECCCCCEECCCC-------CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCCEEEEEEC T ss_conf 99999995288804580103-------99899999659978728999999999985997565899525876843999838 Q ss_pred C Q ss_conf 9 Q gi|254780231|r 82 D 82 (98) Q Consensus 82 ~ 82 (98) + T Consensus 84 p 84 (95) T PRK05090 84 P 84 (95) T ss_pred C T ss_conf 1 No 7 >pfam02594 DUF167 Uncharacterized ACR, YggU family COG1872. Probab=99.90 E-value=2.4e-23 Score=160.17 Aligned_cols=74 Identities=36% Similarity=0.547 Sum_probs=69.1 Q ss_pred EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC Q ss_conf 05999994186653023101456667776799999638767978899999999982998444799835577806999829 Q gi|254780231|r 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 (98) Q Consensus 2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~ 81 (98) +-|.|+|+|+|++++|.|++. +++|+|+|+|||+|||||+||++|||++|++|||+|+|++|++||.|+|+|+| T Consensus 5 v~i~i~V~P~a~~~~i~g~~~------~~~l~V~v~a~p~~GkAN~ali~~la~~l~v~ks~I~Iv~G~~sr~K~i~I~g 78 (78) T pfam02594 5 VLLRVRVKPNAKKNSIVGVEE------WGELKVRITAPPVDGKANAELIKFLAKTLGVPKSDVEIVSGETSRSKVLLIEG 78 (78) T ss_pred EEEEEEEEECCCCCEECCCCC------CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEC T ss_conf 999999974886251840269------84799999649977748999999999985997000899865877862899819 No 8 >TIGR00251 TIGR00251 conserved hypothetical protein TIGR00251; InterPro: IPR005228 This family of conserved hypothetical proteins has no known function.. Probab=99.83 E-value=7e-21 Score=145.51 Aligned_cols=80 Identities=24% Similarity=0.383 Sum_probs=74.0 Q ss_pred EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC Q ss_conf 05999994186653023101456667776799999638767978899999999982998444799835577806999829 Q gi|254780231|r 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 (98) Q Consensus 2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~ 81 (98) .-|.|+|+|+|+|+.|.|++ .|+.+++|+++|||.|||||.+|++||.+.|+..||+|+|++|+.||+|+|+|.+ T Consensus 8 ll~~~~v~p~a~k~~i~g~~-----eWR~~v~v~I~aPp~~GkAN~el~K~f~~~F~~aks~VeivsG~~~R~K~iki~~ 82 (91) T TIGR00251 8 LLIRIEVQPKASKDSIVGYN-----EWRKRVEVKIKAPPKEGKANRELIKFFKEIFGKAKSDVEIVSGELSRQKTIKIKN 82 (91) T ss_pred EEEEEEECCCCCCCCCCCCC-----CCCCEEEEEEECCCCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEEECC T ss_conf 58998866777865147824-----5311467877077988840158898888644734330788606228831687046 Q ss_pred CHHHH Q ss_conf 98999 Q gi|254780231|r 82 DCKEI 86 (98) Q Consensus 82 ~~~~i 86 (98) ...|+ T Consensus 83 ~~~~~ 87 (91) T TIGR00251 83 IKRDI 87 (91) T ss_pred CCCCC T ss_conf 47211 No 9 >KOG3276 consensus Probab=99.71 E-value=3.7e-17 Score=123.34 Aligned_cols=86 Identities=28% Similarity=0.427 Sum_probs=76.6 Q ss_pred EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC Q ss_conf 05999994186653023101456667776799999638767978899999999982998444799835577806999829 Q gi|254780231|r 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 (98) Q Consensus 2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~ 81 (98) ++|+|+++|+|++++|+++. .+.+.|.|.|||.+|+||+||++||++.|++++|++++.+|++||.|++.|+. T Consensus 35 V~i~IhakpgaK~s~It~v~-------~e~V~V~IaApp~eGeANaeLl~ylskvLgLRksdv~ldkG~kSrsKvv~i~~ 107 (125) T KOG3276 35 VQIAIHAKPGAKQSAITDVG-------DEAVGVAIAAPPREGEANAELLEYLSKVLGLRKSDVTLDKGWKSRSKVVVIED 107 (125) T ss_pred EEEEEEECCCCCCCCEEECC-------CCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHEEECCCCCCCCEEEEEEC T ss_conf 89999846874312255126-------43104577359766533699999999986012431365036565432899962 Q ss_pred --CHHHHHHHHHHHH Q ss_conf --9899999997523 Q gi|254780231|r 82 --DCKEITELLQNND 94 (98) Q Consensus 82 --~~~~i~~~L~~~~ 94 (98) ...+..+.|+... T Consensus 108 ~~~~~e~i~~l~ak~ 122 (125) T KOG3276 108 LSATQEYIELLEAKV 122 (125) T ss_pred CCCCHHHHHHHHCCC T ss_conf 744099999974458 No 10 >pfam08968 DUF1885 Domain of unknown function (DUF1885). This domain is found in a set of hypothetical proteins produced by bacteria of the Bacillus genus. Probab=80.40 E-value=3.6 Score=22.21 Aligned_cols=40 Identities=33% Similarity=0.535 Sum_probs=27.8 Q ss_pred CCCEEEEEEECCC-----CCCHHHHHHHHHHHHHHCCCHHHEEEEECCC Q ss_conf 7767999996387-----6797889999999998299844479983557 Q gi|254780231|r 28 DTIHMKIKVTATP-----QKGKANKAMLAMLAKKLALSKSSLRMLSKQS 71 (98) Q Consensus 28 ~~~~lkv~v~ApP-----v~GkAN~ali~~Lak~l~v~ks~I~I~~G~~ 71 (98) .++.--|.|.-|| .+|||| |.|+||||.|. ....+-.|.+ T Consensus 78 e~~~sfIqi~LP~~AT~GDKgKAN-EfckfLAK~l~---geL~LFNGRt 122 (128) T pfam08968 78 EGEQTFIQISLPPTATFGDKGKAN-EFSKFLAKKLE---GELQLFNGRT 122 (128) T ss_pred CCCCEEEEEECCCCCCCCCCHHHH-HHHHHHHHHHH---CEEEEECCEE T ss_conf 787157999889998777622276-89999999741---0157754707 No 11 >pfam01187 MIF Macrophage migration inhibitory factor (MIF). Probab=76.94 E-value=7 Score=20.47 Aligned_cols=58 Identities=14% Similarity=0.149 Sum_probs=38.7 Q ss_pred EEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCC-CCC----HHHHHHHHHHHHHHCCCHHHEEEEE Q ss_conf 59999941866530231014566677767999996387-679----7889999999998299844479983 Q gi|254780231|r 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATP-QKG----KANKAMLAMLAKKLALSKSSLRMLS 68 (98) Q Consensus 3 ~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApP-v~G----kAN~ali~~Lak~l~v~ks~I~I~~ 68 (98) -|.|.|.|+..-.- . |..+.+.-+.|++-- .+. +--+++++||.+.|+||+.+|-|.. T Consensus 36 yv~V~v~~~~~m~f-g-------Gs~eP~a~~~l~siG~l~~~~n~~~s~~l~~~l~~~LgI~~~RiyI~F 98 (114) T pfam01187 36 RIAVHIRPGQAMVF-G-------GSTDPCAVCSIKSIGVVGAEQNRSHSALLFKFLTKELGLPKDRIYIRF 98 (114) T ss_pred EEEEEEECCCEEEE-C-------CCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEE T ss_conf 29999928971896-6-------889977999999867899899999999999999986297946499999 No 12 >pfam09581 Spore_III_AF Stage III sporulation protein AF (Spore_III_AF). This family represents the stage III sporulation protein AF (Spore_III_AF) of the bacterial endospore formation program, which exists in some but not all members of the Firmicutes (formerly called low-GC Gram-positives). The C-terminal region of these proteins is poorly conserved. Probab=62.16 E-value=9.3 Score=19.73 Aligned_cols=28 Identities=11% Similarity=0.306 Sum_probs=24.1 Q ss_pred CCCCHHHHHHHHHHHHHHCCCHHHEEEE Q ss_conf 7679788999999999829984447998 Q gi|254780231|r 40 PQKGKANKAMLAMLAKKLALSKSSLRML 67 (98) Q Consensus 40 Pv~GkAN~ali~~Lak~l~v~ks~I~I~ 67 (98) ..+..-++.+..+||..|++|+.+|++. T Consensus 157 ~~~~~~~~~I~~~la~~~~I~~e~I~V~ 184 (185) T pfam09581 157 EEESEEAEKIKNFLADEYNIPKEQIEVY 184 (185) T ss_pred CCCHHHHHHHHHHHHHHHCCCHHHCEEE T ss_conf 2004679999999998839989991772 No 13 >pfam02738 Ald_Xan_dh_C2 Molybdopterin-binding domain of aldehyde dehydrogenase. Probab=58.62 E-value=17 Score=18.16 Aligned_cols=45 Identities=9% Similarity=0.158 Sum_probs=39.2 Q ss_pred CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCC Q ss_conf 767999996387679788999999999829984447998355778 Q gi|254780231|r 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSP 73 (98) Q Consensus 29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr 73 (98) ++.+.|.+...+..-=...-+..+.|+.|++|..+|+++.|-|.+ T Consensus 327 dG~v~v~~g~~~~GqG~~T~~~qivAe~LGi~~d~V~v~~gDT~~ 371 (543) T pfam02738 327 DGSVTVSTGGIEIGQGLETKVAQIAAEELGIPLDDIRVISGDTDK 371 (543) T ss_pred CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCHHHEEEECCCCCC T ss_conf 997788734667788616589999999959986779997278887 No 14 >cd02577 PSTD1 PSTD1: Pseudouridine synthase, a subgroup of the TruD family. This group consists of several hypothetical archeal pseudouridine synthases assigned to the TruD family of psuedouridine synthases. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). The TruD family is comprised of proteins related to Escherichia coli TruD. Probab=54.11 E-value=20 Score=17.71 Aligned_cols=44 Identities=11% Similarity=0.095 Sum_probs=35.9 Q ss_pred CCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEE Q ss_conf 776799999638767978899999999982998444799835577806 Q gi|254780231|r 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLK 75 (98) Q Consensus 28 ~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K 75 (98) .|+++-+.|. -.|.-...+++.||+.++||.++|.. .|.+.|+= T Consensus 23 ~Geh~~~~i~---K~g~~T~~~~~~LAr~lgv~~~~igy-AGlKDk~A 66 (319) T cd02577 23 KGKYLIYLLE---KKNWDTLDAVRRIAKALGISRKRIGY-AGTKDKHA 66 (319) T ss_pred CCCEEEEEEE---ECCCCHHHHHHHHHHHHCCCHHHEEE-CCCCCCCC T ss_conf 8788999999---88999999999999982998777515-43466752 No 15 >cd02576 PseudoU_synth_ScPUS7 PseudoU_synth_ScPUS7: Pseudouridine synthase, TruD family. This group consists of eukaryotic pseudouridine synthases similar to Saccharomyces cerevisiae Pus7. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). Saccharomyces cerevisiae Pus7 makes psi35 in U2 small nuclear RNA (U2 snRNA), psi13 in cytoplasmic tRNAs and psi35 in pre-tRNATyr. Psi35 in yeast U2 snRNA and psi13 in tRNAs are highly phylogenetically conserved. Psi34 is the mammalian U2 snRNA counterpart of yeast U2 snRNA psi35. Probab=42.77 E-value=28 Score=16.86 Aligned_cols=43 Identities=14% Similarity=0.157 Sum_probs=33.0 Q ss_pred CCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEE----EEEECCCHH Q ss_conf 67978899999999982998444799835577806----999829989 Q gi|254780231|r 41 QKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLK----IIYIDKDCK 84 (98) Q Consensus 41 v~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K----~i~I~~~~~ 84 (98) -.|.-..++++.||+.+++|.++|.. +|.+.|+= .+.|.+..+ T Consensus 37 K~~~~T~~a~~~lak~~gi~~~~igy-AGlKDk~AvT~Q~~Sv~~~~~ 83 (371) T cd02576 37 KENKDTMEAINKIAKLLRVKPSDFSY-AGTKDKRAVTVQRVSVKKVTA 83 (371) T ss_pred ECCCCHHHHHHHHHHHHCCCHHHEEE-CCCCCCCEEEEEEEEEECCCH T ss_conf 88999999999999983997788427-636448777899999978886 No 16 >TIGR03313 Se_sel_red_Mo probable selenate reductase, molybdenum-binding subunit. Our comparative genomics suggests this protein family to be a subunit of a selenium-dependent molybdenum hydroxylase, although the substrate is not specified. This protein is suggested by Bebien, et al., to be the molybdenum-binding subunit of a molydbopterin-containing selenate reductase. Xi, et al, however, show that mutation of this gene in E. coli conferred sensitivity to adenine, suggesting a defect in purine interconversion. This finding, plus homology of nearby genes in a 23-gene purine catabolism region in E. coli to xanthine dehydrogase subunits suggests xanthine dehydrogenase activity. Probab=41.91 E-value=32 Score=16.56 Aligned_cols=45 Identities=7% Similarity=0.051 Sum_probs=39.2 Q ss_pred CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCC Q ss_conf 767999996387679788999999999829984447998355778 Q gi|254780231|r 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSP 73 (98) Q Consensus 29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr 73 (98) ++.+.|++.+....-=.+..+..+.|+.|++|..+|+++.|-|+. T Consensus 670 DG~v~v~~g~~d~GQG~~T~~aQiaAe~LGi~~d~I~v~~~DT~~ 714 (951) T TIGR03313 670 DGTFIVHSGGADIGTGLDTVVSKLTAEVLHCPMDDVHVISGDTDH 714 (951) T ss_pred CCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCHHHEEEEECCCCC T ss_conf 977999978978887789999999999968984539999679888 No 17 >TIGR03194 4hydrxCoA_A 4-hydroxybenzoyl-CoA reductase, alpha subunit. This model represents the largest chain, alpha, of the enzyme 4-hydroxybenzoyl-CoA reductase. In species capable of degrading various aromatic compounds by way of benzoyl-CoA, this enzyme can convert 4-hydroxybenzoyl-CoA to benzoyl-CoA. Probab=40.72 E-value=33 Score=16.44 Aligned_cols=46 Identities=15% Similarity=0.107 Sum_probs=39.6 Q ss_pred CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCE Q ss_conf 7679999963876797889999999998299844479983557780 Q gi|254780231|r 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPL 74 (98) Q Consensus 29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~ 74 (98) ++.+.|........-=.+..+....|+.|++|..+|.++.|-|... T Consensus 454 dGsv~v~~g~~e~GqG~~T~~~qiaAe~LGip~e~V~v~~~DT~~~ 499 (746) T TIGR03194 454 DGGITLLTGAADIGQGSSTIASQVAAEVLGVRLSRIRVISADSALT 499 (746) T ss_pred CCEEEEEECCCCCCCCCCHHHHHHHHHHCCCCHHHEEEECCCCCCC T ss_conf 9769999786446888533899999872299889969973678878 No 18 >cd02552 PseudoU_synth_TruD_like PseudoU_synth_TruD_like: Pseudouridine synthase, TruD family. This group consists of eukaryotic, bacterial and archeal pseudouridine synthases similar to Escherichia coli TruD and Saccharomyces cerevisiae Pus7. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). E. coli TruD and S. cerevisiae Pus7 make psi13 in cytoplasmic tRNAs. In addition S. cerevisiae Pus7 makes psi35 in U2 small nuclear RNA (U2 snRNA) and psi35 in pre-tRNATyr. Psi35 in U2 snRNA and psi13 in tRNAs are highly phylogenetically conserved. Psi34 is the mammalian U2 snRNA counterpart of yeast U2 snRNA psi35. Probab=40.65 E-value=31 Score=16.65 Aligned_cols=44 Identities=11% Similarity=0.141 Sum_probs=36.3 Q ss_pred CCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEE Q ss_conf 776799999638767978899999999982998444799835577806 Q gi|254780231|r 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLK 75 (98) Q Consensus 28 ~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K 75 (98) .|+++.+.|.- .|.-..+++..||+.++||.++|.. .|.+.|+= T Consensus 29 ~G~~~~~~l~K---~~~~T~~a~~~lA~~l~i~~~~igy-AG~KDr~A 72 (232) T cd02552 29 EGEYLHFTLYK---ENKDTMEALREIAKALGVPPRDIGY-AGTKDKRA 72 (232) T ss_pred CCCEEEEEEEE---CCCCHHHHHHHHHHHHCCCHHHEEE-CCCCCCCE T ss_conf 77889999998---7899999999999981998666304-43466761 No 19 >pfam04551 GcpE GcpE protein. In a variety of organisms, including plants and several eubacteria, isoprenoids are synthesized by the mevalonate-independent 2-C-methyl-D-erythritol 4-phosphate (MEP) pathway. Although different enzymes of this pathway have been described, the terminal biosynthetic steps of the MEP pathway have not been fully elucidated. GcpE gene of Escherichia coli is involved in this pathway. Probab=39.17 E-value=35 Score=16.30 Aligned_cols=62 Identities=18% Similarity=0.236 Sum_probs=44.3 Q ss_pred CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECCCHHHHHHHHHHH Q ss_conf 76799999638767978899999999982998444799835577806999829989999999752 Q gi|254780231|r 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 (98) Q Consensus 29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~~~~~i~~~L~~~ 93 (98) |+.++|++++.|++ --.+..-+=+.|++++..+++++=-+--.-.+.+....+++.+.|+.. T Consensus 222 GDTIRVSLt~dP~~---EV~v~~~IL~sl~lR~~g~~iISCPtCGR~~~dl~~~a~~ve~~l~~~ 283 (345) T pfam04551 222 GDTIRVSLTEDPVE---EVKVAFEILQSLGLRKRGVEIISCPTCGRTLFDLIKVAKEVEERLSHL 283 (345) T ss_pred CCEEEEECCCCCHH---HHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 65399967899547---899999999972877578648758874661155999999999997379 No 20 >PRK09800 putative selenate reductase subunit YgfN; Provisional Probab=38.09 E-value=36 Score=16.20 Aligned_cols=45 Identities=7% Similarity=0.062 Sum_probs=39.2 Q ss_pred CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCC Q ss_conf 767999996387679788999999999829984447998355778 Q gi|254780231|r 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSP 73 (98) Q Consensus 29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr 73 (98) ++.+.|++.+....-=.+..+....|+.|++|..+|+++.|-|+. T Consensus 646 DG~v~v~~g~~d~GQG~~T~~aQiaAe~LGip~d~I~v~~~DT~~ 690 (928) T PRK09800 646 DGTFIVHSGGADIGTGLDTVVTKLAAEVLHCPPQDVHVISGDTDH 690 (928) T ss_pred CCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCHHHEEEEECCCCC T ss_conf 977999978978676788999999999968985559999579888 No 21 >KOG1759 consensus Probab=37.99 E-value=28 Score=16.92 Aligned_cols=31 Identities=16% Similarity=0.184 Sum_probs=21.7 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCHHHEEEEE Q ss_conf 3876797889999999998299844479983 Q gi|254780231|r 38 ATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 (98) Q Consensus 38 ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~ 68 (98) .+-+..+=-+++++||.++|+++++++-|-. T Consensus 69 ~~~~N~~~sa~l~~il~~~L~l~~~rv~I~f 99 (115) T KOG1759 69 GAIVNRSYSAALTEILEKELSLDPDRVYIKF 99 (115) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 8667188999999999987088978699999 No 22 >PRK00984 truD tRNA pseudouridine synthase D; Reviewed Probab=35.43 E-value=34 Score=16.37 Aligned_cols=44 Identities=16% Similarity=0.202 Sum_probs=35.8 Q ss_pred CCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEE Q ss_conf 776799999638767978899999999982998444799835577806 Q gi|254780231|r 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLK 75 (98) Q Consensus 28 ~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K 75 (98) .|+++-+.|.- .|.-..++++.||+.++||.++|.. +|.+.|+= T Consensus 40 ~G~~~~~~l~K---~~~~T~~~~~~lA~~l~i~~~~igy-aGlKDr~A 83 (344) T PRK00984 40 EGEHLLLRIRK---RGWNTLFAARELAKFLGISRREVSY-AGLKDRHA 83 (344) T ss_pred CCCEEEEEEEE---CCCCHHHHHHHHHHHCCCCHHHCCC-CCCCCCCC T ss_conf 87889999998---8999999999999980998888242-56666750 No 23 >pfam01142 TruD tRNA pseudouridine synthase D (TruD). TruD is responsible for synthesis of pseudouridine from uracil-13 in transfer RNAs. The structure of TruD reveals an overall V-shaped molecule which contains an RNA-binding cleft. Probab=35.20 E-value=41 Score=15.92 Aligned_cols=43 Identities=14% Similarity=0.210 Sum_probs=32.7 Q ss_pred CCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCE Q ss_conf 77679999963876797889999999998299844479983557780 Q gi|254780231|r 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPL 74 (98) Q Consensus 28 ~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~ 74 (98) .|+++-+.+.- .|.-..+++..||+.|+++.++|.. .|.+.|+ T Consensus 39 ~G~~~~~~l~K---~~~~T~~a~~~lAk~l~i~~~~i~y-aGlKDk~ 81 (336) T pfam01142 39 EGEHLILRIEK---RGWNTLDVARELARFLGISPRDVGY-AGLKDRH 81 (336) T ss_pred CCCEEEEEEEE---CCCCHHHHHHHHHHHHCCCHHHEEE-CCCCCCC T ss_conf 87889999998---6899999999999982998888124-5566575 No 24 >TIGR00713 hemL glutamate-1-semialdehyde-2,1-aminomutase; InterPro: IPR004639 Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway . The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin . The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA. The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III. Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) . This entry represents glutamate-1-semialdehyde (GSA) aminotransferase (5.4.3.8 from EC), which catalyse a transamination reaction to produce 5-aminoaevulinic acid during the first stage of tetrapyrrole biosynthesis by the C5 pathway , . This enzyme has a high degree of similarity to amino acid transaminases, and is classed as a class III aminotransferase.; GO: 0042286 glutamate-1-semialdehyde 21-aminomutase activity, 0033014 tetrapyrrole biosynthetic process. Probab=33.45 E-value=10 Score=19.46 Aligned_cols=21 Identities=24% Similarity=0.208 Sum_probs=18.0 Q ss_pred HHEEEEECCCCCEEEEEECCC Q ss_conf 447998355778069998299 Q gi|254780231|r 62 SSLRMLSKQSSPLKIIYIDKD 82 (98) Q Consensus 62 s~I~I~~G~~sr~K~i~I~~~ 82 (98) |.|+|-+|=|+|+|+|..+|. T Consensus 121 sAvRLARG~TGR~~IiKFeGc 141 (434) T TIGR00713 121 SAVRLARGFTGRDKIIKFEGC 141 (434) T ss_pred HHHHHHHHCCCCCCEEEECCC T ss_conf 999986310588727886176 No 25 >PHA00659 V structural protein P5 Probab=31.31 E-value=40 Score=15.96 Aligned_cols=26 Identities=12% Similarity=0.245 Sum_probs=21.9 Q ss_pred ECCCCCCHHHHHHHHHHHHHHCCCHHH Q ss_conf 638767978899999999982998444 Q gi|254780231|r 37 TATPQKGKANKAMLAMLAKKLALSKSS 63 (98) Q Consensus 37 ~ApPv~GkAN~ali~~Lak~l~v~ks~ 63 (98) =|||.|+- .++-+.+.|+.+||++.+ T Consensus 74 wAP~~ENn-T~aYi~~Va~~~Gv~~~~ 99 (138) T PHA00659 74 WAPSNEND-TRAYATAVARAMGVPPQA 99 (138) T ss_pred HCCCCCCC-HHHHHHHHHHHHCCCCCC T ss_conf 19998768-999999999986879987 No 26 >cd01804 midnolin_N midnolin_N Midnolin (midbrain nucleolar protein) is expressed in the nucleolus and is thought to regulate genes involved in neurogenesis. Midnolin contains an amino-terminal ubiquitin-like domain. Probab=31.18 E-value=47 Score=15.52 Aligned_cols=39 Identities=15% Similarity=0.418 Sum_probs=28.2 Q ss_pred CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCC Q ss_conf 7679999963876797889999999998299844479983557 Q gi|254780231|r 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQS 71 (98) Q Consensus 29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~ 71 (98) |+.+.+.| +|. ..=+.+-+.+|+.+++||.+|.++.-++ T Consensus 11 G~~f~l~V--~~~--etVe~LKk~IskrLkvpkdRi~Llhre~ 49 (78) T cd01804 11 GTRFDLSV--PPD--ETVEGLKKRISQRLKVPKERLALLHRET 49 (78) T ss_pred CCCEEEEE--CCC--CCHHHHHHHHHHHHCCCHHHEEEEEEEE T ss_conf 87468870--765--4089999999988489867746775411 No 27 >TIGR02416 CO_dehy_Mo_lg carbon-monoxide dehydrogenase, large subunit; InterPro: IPR012780 Carbon monoxide dehydrogenases catalyse the reversible oxidation of carbon monoxide to carbon dioxide as shown below , :CO + H(2)O + A == CO(2) + AH(2) A variety of electron acceptors can be used by these enzymes including ferredoxin, methyl viologen and benzyl viologen. Under anaerobic conditions, carbon monoxide is utilised by a number of methanogenic archaea and acetogenic, sulphate-reducing or phototrophic bacteria. Under aerobic conditions carbon monoxide can be used as sole carbon and energy source by the carboxidotrophic bacteria, a taxonomically diverse group of obligate or facultative chemolithoautotrophic species. The anaerobic dehydrogenases are mostly nickel-containing iron-sulphur enzymes, while the The aerobic enzymes categorised so far are molybdenum-containing iron-sulphur proteins. This entry represents the large subunit of the molybdenum-containing carbon monoxide dehydrogenase found in aerobic bacteria. This enzyme is a dimer of heterotrimers, where each heterotrimer consists of a small iron-sulphur subunit, a medium FAD-containing flavoprotein subunit and a large molybdopterin-containign subunit which contains the active site , . The large subunit is divided into two domains. The N-terminal domain is composed of mainly of a five-and four-stranded mixed beta-sheet and interacts with the small and medium subunits. The C-terminal domain contains a region which interacts with its counterpart in the overall dimer, providing the main dimer contact, and a beta-sheet surrounded by alpha helices. The active site located in this subunit contains a novel dinuclear heterometal [CuSMo(=O)OH] cluster.; GO: 0005507 copper ion binding, 0018492 carbon-monoxide dehydrogenase (acceptor) activity, 0030151 molybdenum ion binding. Probab=29.97 E-value=36 Score=16.25 Aligned_cols=39 Identities=8% Similarity=0.212 Sum_probs=27.4 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCC Q ss_conf 99996387679788999999999829984447998355778 Q gi|254780231|r 33 KIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSP 73 (98) Q Consensus 33 kv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr 73 (98) +..+.++=+.- ..---..+|.+||||..+|.|..|.|+. T Consensus 494 R~g~~sQGQGH--~TTyAQIiA~ELGIP~e~I~v~~GdTd~ 532 (775) T TIGR02416 494 RLGTKSQGQGH--ETTYAQIIATELGIPAEDIDVEEGDTDT 532 (775) T ss_pred EEECCCCCCCC--CCHHHHHHHHHCCCCCCCEEEECCCCCC T ss_conf 86302369864--3046666555527880110672078788 No 28 >TIGR00747 fabH 3-oxoacyl-(acyl-carrier-protein) synthase III; InterPro: IPR004655 Beta-ketoacyl-acyl carrier protein synthase III (FabH), in general, initiates elongation in type II fatty acid synthase systems found in bacteria and plants. It is responsible for producing the multitude of fatty acid structures found in bacterial membranes . The two members of this subfamily from Bacillus subtilis differ from each other, and from FabH from Escherichia coli, in acyl group specificity. Active site residues include Cys112, His244 and Asn274 of Escherichia coli FabH. Cys-112 is the site of acyl group attachment. ; GO: 0004315 3-oxoacyl-[acyl-carrier-protein] synthase activity, 0006633 fatty acid biosynthetic process. Probab=29.29 E-value=39 Score=16.00 Aligned_cols=25 Identities=32% Similarity=0.390 Sum_probs=21.7 Q ss_pred CHHHHHHHHHHHHHHCCCHHHEEEE Q ss_conf 9788999999999829984447998 Q gi|254780231|r 43 GKANKAMLAMLAKKLALSKSSLRML 67 (98) Q Consensus 43 GkAN~ali~~Lak~l~v~ks~I~I~ 67 (98) =.||..+++.++|.|.++.++|-.. T Consensus 255 HQAN~RIi~a~ak~L~~~~~~Vv~T 279 (329) T TIGR00747 255 HQANIRIIEALAKRLELKESKVVKT 279 (329) T ss_pred CCHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 4111689999999708980316863 No 29 >pfam02567 PhzC-PhzF Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family also contains a putative thymidilate synthase from Mycobacterium tuberculosis, though there is no significant sequence similarity to pfam00303 members. This family appears to be distantly related to pfam01678, including containing a weak internal duplication. However members of this family do not contain the conserved cysteines that are hypothesized to be active site residues (Bateman A pers obs). Probab=28.06 E-value=54 Score=15.20 Aligned_cols=50 Identities=16% Similarity=0.171 Sum_probs=34.1 Q ss_pred CEEEEEEE-------CCCCCCHHHHHHHHHHHHHHCCCHH-HEEEEECCCC-CEEEEEE Q ss_conf 67999996-------3876797889999999998299844-4799835577-8069998 Q gi|254780231|r 30 IHMKIKVT-------ATPQKGKANKAMLAMLAKKLALSKS-SLRMLSKQSS-PLKIIYI 79 (98) Q Consensus 30 ~~lkv~v~-------ApPv~GkAN~ali~~Lak~l~v~ks-~I~I~~G~~s-r~K~i~I 79 (98) ..+..|.= +=|+-|-||-+|..||++.++.++. .+.|..|+.+ |.=.+.+ T Consensus 198 ~~~~~R~FaP~~Gi~EDpaTGSA~~aLa~yl~~~~~~~~~~~~~i~QG~~~gR~g~i~v 256 (280) T pfam02567 198 RDYHARMFAPALGIVEDPATGSAAGALGAYLSRHGNKPQFGNIAIRQGQASGRPGTLEV 256 (280) T ss_pred CEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCEEEEE T ss_conf 56999816456797776762565799999999846998774599988736089989999 No 30 >PHA02512 minor structural protein; Provisional Probab=27.78 E-value=51 Score=15.31 Aligned_cols=26 Identities=19% Similarity=0.299 Sum_probs=23.7 Q ss_pred ECCCCCCHHHHHHHHHHHHHHCCCHH Q ss_conf 63876797889999999998299844 Q gi|254780231|r 37 TATPQKGKANKAMLAMLAKKLALSKS 62 (98) Q Consensus 37 ~ApPv~GkAN~ali~~Lak~l~v~ks 62 (98) =|||.|+--.++-|+|.++.|+|+.. T Consensus 313 WAPp~D~NDT~aYI~~Vs~~LgV~~d 338 (844) T PHA02512 313 WAPPADKNDTDAYIKAVCAQLGVSAD 338 (844) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCC T ss_conf 48998887489999999998299989 No 31 >TIGR00151 ispF 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; InterPro: IPR003526 This entry represents MECDP (2-C-methyl-D-erythritol 2,4-cyclodiphosphate) synthetase, an enzyme in the non-mevalonate pathway of isoprenoid synthesis, isoprenoids being essential in all organisms. Isoprenoids can also be synthesized through the mevalonate pathway. The non-mevolante route is used by many bacteria and human pathogens, including Mycobacterium tuberculosis and Plasmodium falciparum. This route appears to involve seven enzymes. MECDP synthetase catalyses the intramolecular attack by a phosphate group on a diphosphate, with cytidine monophosphate (CMP) acting as the leaving group to give the cyclic diphosphate product MEDCP. The enzyme is a trimer with three active sites shared between adjacent copies of the protein. The enzyme also has two metal binding sites, the metals playing key roles in catalysis. A number of proteins from eukaryotes and prokaryotes share this common N-terminal signature and appear to be involved in terpenoid biosynthesis. The ygbB protein is a putative enzyme of this type .; GO: 0008685 2-C-methyl-D-erythritol 24-cyclodiphosphate synthase activity, 0016114 terpenoid biosynthetic process. Probab=27.31 E-value=54 Score=15.18 Aligned_cols=28 Identities=18% Similarity=0.195 Sum_probs=22.1 Q ss_pred HHHHHHHHHHHHCCCHHHEEEEECCCCCEE Q ss_conf 899999999982998444799835577806 Q gi|254780231|r 46 NKAMLAMLAKKLALSKSSLRMLSKQSSPLK 75 (98) Q Consensus 46 N~ali~~Lak~l~v~ks~I~I~~G~~sr~K 75 (98) =.++-+.+|+.|++|..+|+|- +|+..| T Consensus 112 i~~mR~~iA~~L~i~~d~vnvK--ATT~Ek 139 (159) T TIGR00151 112 IEAMRENIAELLGIPLDQVNVK--ATTTEK 139 (159) T ss_pred HHHHHHHHHHHHCCCCCCEEEE--EECCCC T ss_conf 6899999988846870211325--876767 No 32 >TIGR01941 nqrF NADH:ubiquinone oxidoreductase, Na(+)-translocating, F subunit; InterPro: IPR010205 This entry represents the NqrF subunit of the six-protein, Na(+)-pumping NADH-quinone reductase of a number of marine and pathogenic Gram-negative bacteria.; GO: 0016655 oxidoreductase activity acting on NADH or NADPH quinone or similar compound as acceptor, 0051537 2 iron 2 sulfur cluster binding, 0006814 sodium ion transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=26.09 E-value=36 Score=16.24 Aligned_cols=19 Identities=37% Similarity=0.746 Sum_probs=13.5 Q ss_pred CCEEEEEEECCCCC-CHHHH Q ss_conf 76799999638767-97889 Q gi|254780231|r 29 TIHMKIKVTATPQK-GKANK 47 (98) Q Consensus 29 ~~~lkv~v~ApPv~-GkAN~ 47 (98) .-.|.|||+.||-. ++||+ T Consensus 226 ~i~lNvRIAtPP~~n~~~~~ 245 (425) T TIGR01941 226 IIKLNVRIATPPFDNGKANK 245 (425) T ss_pred EEEEEEEECCCCCCCCCCCC T ss_conf 77999884188887676432 No 33 >COG0384 Predicted epimerase, PhzC/PhzF homolog [General function prediction only] Probab=25.46 E-value=60 Score=14.91 Aligned_cols=46 Identities=28% Similarity=0.256 Sum_probs=36.5 Q ss_pred EEECCCCCCHHHHHHHHHHHHHHCCC-HHHEEEEECCC-CCEEEEEEC Q ss_conf 99638767978899999999982998-44479983557-780699982 Q gi|254780231|r 35 KVTATPQKGKANKAMLAMLAKKLALS-KSSLRMLSKQS-SPLKIIYID 80 (98) Q Consensus 35 ~v~ApPv~GkAN~ali~~Lak~l~v~-ks~I~I~~G~~-sr~K~i~I~ 80 (98) -|.+=|+-|-||.+|-.+|.+....+ +.++.+..|.. +|.=.|.+. T Consensus 218 Gi~EDPaTGSA~~~La~yl~~~~~~~~~~~~~~~QG~~~gR~s~i~~~ 265 (291) T COG0384 218 GVVEDPATGSAAGALAAYLAKHGGLPDKLRLLIEQGTELGRPSRIEVR 265 (291) T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCEEEEE T ss_conf 887777764546589999998688887653899823236998489999 No 34 >pfam12116 SpoIIID Stage III sporulation protein D. This stage III sporulation protein is a small DNA-binding family that is essential for gene expression of the mother-cell compartment during sporulation. The domain is found in bacteria and viruses, and is about 40 amino acids in length. It has a conserved RGG sequence motif. Probab=24.46 E-value=35 Score=16.27 Aligned_cols=20 Identities=40% Similarity=0.459 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHCCCHHHEE Q ss_conf 89999999998299844479 Q gi|254780231|r 46 NKAMLAMLAKKLALSKSSLR 65 (98) Q Consensus 46 N~ali~~Lak~l~v~ks~I~ 65 (98) |++-++--|+.|||+||-|- T Consensus 18 ~~aTVR~tAk~FGvSKSTVH 37 (82) T pfam12116 18 NKATVRQAAKVFGVSKSTVH 37 (82) T ss_pred CCHHHHHHHHHHCCCHHHHH T ss_conf 26289999999696098898 No 35 >KOG4079 consensus Probab=23.86 E-value=52 Score=15.27 Aligned_cols=19 Identities=26% Similarity=0.270 Sum_probs=17.5 Q ss_pred CCCHHHHHHHHHHHHHHCC Q ss_conf 6797889999999998299 Q gi|254780231|r 41 QKGKANKAMLAMLAKKLAL 59 (98) Q Consensus 41 v~GkAN~ali~~Lak~l~v 59 (98) .||+-|+++++.|+|.+|- T Consensus 91 ld~~~r~eI~~hl~K~lGK 109 (169) T KOG4079 91 LDGMKREEIEKHLAKTLGK 109 (169) T ss_pred CCCCCHHHHHHHHHHHHCC T ss_conf 4555579999999998582 No 36 >pfam01361 Tautomerase Tautomerase enzyme. This family includes the enzyme 4-oxalocrotonate tautomerase that catalyses the ketonisation of 2-hydroxymuconate to 2-oxo-3-hexenedioate. Probab=22.95 E-value=67 Score=14.62 Aligned_cols=27 Identities=4% Similarity=0.112 Sum_probs=21.1 Q ss_pred HHHHHHHHHHHHHCCCHHHEEEEECCC Q ss_conf 889999999998299844479983557 Q gi|254780231|r 45 ANKAMLAMLAKKLALSKSSLRMLSKQS 71 (98) Q Consensus 45 AN~ali~~Lak~l~v~ks~I~I~~G~~ 71 (98) -=+++.+.+++.||+|++.|.++--+- T Consensus 19 l~~~iT~~~~~~lg~~~~~v~V~i~Ev 45 (60) T pfam01361 19 LIRRVTEAIVEALGAPPEAIFVVIEEV 45 (60) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEEEC T ss_conf 999999999998495933599999981 No 37 >pfam12500 DUF3706 Protein of unknown function (DUF3706). This domain family is found in bacteria, and is typically between 174 and 217 amino acids in length. There is a conserved TRSP sequence motif. Probab=20.19 E-value=76 Score=14.28 Aligned_cols=30 Identities=27% Similarity=0.327 Sum_probs=21.4 Q ss_pred CHHHHHHHHHHHHHHCCCHHHEEEEECCCC Q ss_conf 978899999999982998444799835577 Q gi|254780231|r 43 GKANKAMLAMLAKKLALSKSSLRMLSKQSS 72 (98) Q Consensus 43 GkAN~ali~~Lak~l~v~ks~I~I~~G~~s 72 (98) +.++.+-...|++.|+++-.-|++++|.-+ T Consensus 5 s~~~r~~~~~l~~elgi~I~~VSLl~G~~~ 34 (199) T pfam12500 5 SGAARARFAALEAELGIPIDVVSLLSGSVS 34 (199) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEEEEEEEE T ss_conf 989999999999870990779998677898 No 38 >PRK12880 3-oxoacyl-(acyl carrier protein) synthase III; Reviewed Probab=20.13 E-value=60 Score=14.91 Aligned_cols=21 Identities=14% Similarity=0.373 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHCCCHHHE Q ss_conf 788999999999829984447 Q gi|254780231|r 44 KANKAMLAMLAKKLALSKSSL 64 (98) Q Consensus 44 kAN~ali~~Lak~l~v~ks~I 64 (98) .||..+++.+++.|++|..++ T Consensus 269 QAN~riie~i~k~Lgl~~ek~ 289 (353) T PRK12880 269 QSNAYLVDCIKEELKLNDDKV 289 (353) T ss_pred CCCHHHHHHHHHHCCCCHHHH T ss_conf 688999999999809897870 Done!