Query 537021.9.peg.1142_1 Match_columns 218 No_of_seqs 32 out of 34 Neff 3.8 Searched_HMMs 39220 Date Wed May 25 07:31:27 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_1142.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PHA01972 structural protein 100.0 0 0 410.6 16.8 204 2-209 589-806 (828) 2 PHA02006 virion protein 94.6 0.22 5.5E-06 27.4 7.3 165 2-179 1149-1328(1337) 3 PHA00368 internal virion prote 80.7 4.3 0.00011 20.2 5.3 56 22-78 1091-1148(1316) 4 TIGR01800 cit_synth_II 2-methy 66.5 10 0.00026 18.2 7.1 133 29-214 120-260 (386) 5 TIGR02715 amido_AtzE amidohydr 45.8 11 0.00027 18.0 1.3 53 112-174 190-246 (454) 6 PRK13455 F0F1 ATP synthase sub 34.5 35 0.00089 15.2 2.5 21 83-103 13-33 (184) 7 TIGR00985 3a0801s04tom mitocho 32.4 38 0.00098 14.9 3.3 38 78-115 66-107 (155) 8 PRK05462 S-adenosylmethionine 30.9 30 0.00077 15.5 1.7 48 162-209 202-254 (266) 9 pfam02064 MAS20 MAS20 protein 30.3 42 0.0011 14.7 2.8 15 99-113 91-105 (184) 10 KOG4056 consensus 29.6 42 0.0011 14.7 2.3 25 89-115 67-95 (143) 11 pfam08635 ox_reductase_C Putat 28.8 34 0.00087 15.2 1.7 41 78-118 5-62 (142) 12 COG3962 Acetolactate synthase 27.5 41 0.001 14.8 1.9 41 2-44 397-438 (617) 13 TIGR03331 SAM_DCase_Eco S-aden 27.3 34 0.00087 15.2 1.5 48 162-209 198-250 (259) 14 PHA01752 hypothetical protein 25.8 43 0.0011 14.6 1.8 18 168-185 355-372 (462) 15 pfam01090 Ribosomal_S19e Ribos 24.5 54 0.0014 14.1 3.9 12 74-85 11-22 (140) 16 TIGR00838 argH argininosuccina 24.1 55 0.0014 14.1 3.4 27 18-44 290-317 (469) 17 TIGR02197 heptose_epim ADP-L-g 22.9 45 0.0011 14.6 1.4 32 163-194 149-182 (353) 18 PHA00665 major capsid protein 22.2 36 0.00093 15.1 0.8 66 48-115 64-134 (329) 19 TIGR02523 type_IV_pilV type IV 22.1 60 0.0015 13.9 3.4 29 64-92 10-38 (163) No 1 >PHA01972 structural protein Probab=100.00 E-value=0 Score=410.63 Aligned_cols=204 Identities=21% Similarity=0.285 Sum_probs=185.9 Q ss_pred CHHHHHHHHHHHHCCCCCCC--CCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH---HHHHHHHHHHHHHHH Q ss_conf 33476786542201321136--87617-8999999998631199999999985630322123---699999999999999 Q 537021.9.peg.1 2 QEHARGSVGSTIQDKRWITG--KDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS---QVYRAKALVIGILGE 75 (218) Q Consensus 2 ~~~~~~~~~~~~~~~~~~tg--~~Gt~-ge~~R~~~QFK~fPis~~~m~~~~r~~~~~g~~s---~~~~aaaL~at~tl~ 75 (218) +|...+|...++|++..+++ ++||+ ||+.|+++|||+|||+++ ++||.|++.+.|.++ ++.|+++|++++|++ T Consensus 589 de~~~AV~tp~~r~rai~~~g~q~GT~~GE~~Rs~~qFKSFpia~~-~~~~~R~~~~~G~~~~~~~~~y~a~l~~~tTl~ 667 (828) T PHA01972 589 GETMMAVPTPDARTRAIMAGGTKSGTFGGELHRSFFLFKSFPIATI-MRHWRRAFTGKGYPGAFDRMAYAAIMVASTTLL 667 (828) T ss_pred CCCCEECCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHH-HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 1121101588536789882788898488999999999841489999-999999986068886046788999999999999 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHH--HHHCC-----CCCCCCC-CCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 89999999985489877786523589999852233--43020-----0002342-0002524676899998777865541 Q 537021.9.peg.1 76 ELIRKTLVPLISGKEPQLDFSDPTEYIKALINGIT--HYERF-----SPFNSSG-WDVLGPWSSQAGKLAIAGKEAVWDE 147 (218) Q Consensus 76 Gal~~qlk~LL~GkdP~~d~tdp~fw~~a~l~Ggg--~fgD~-----t~ygSsg-~~vlGP~~s~~~~L~~~~~~~~~~e 147 (218) |++++|+|+||+||||| |++||+||++|++|||| ||||+ .+||++. .+++||++|++++|+.+++++..+. T Consensus 668 G~~s~q~K~ll~GrdPr-~~~~~~fw~~a~lqGGGlGi~GDfL~~~~~r~G~~~~~~~aGP~~s~~~~l~~l~~~~~~~a 746 (828) T PHA01972 668 GAGSIQAKDLLNGKDPR-SMTDPKFWIEALLKGGSFGYYGDFLRNAHTGYGSDMTSYIGGPVLSYGDQLTKWVAMNPQDA 746 (828) T ss_pred HHHHHHHHHHHCCCCCC-CCCCCHHHHHHHHCCCCCHHHHHHHHHHHCCCCCHHHHHHCCCCHHHHHHHHHHHHCCHHHH T ss_conf 99999999997799999-99862499999973687125799987653143510788862860567999999862138888 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 13101483145779999862186144899999999999999999728414689999999999 Q 537021.9.peg.1 148 GTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQK 209 (218) Q Consensus 148 ~~~~~~~~~~~~~~~k~v~~~~P~~NLWYtK~a~d~li~~qlqe~~~Pg~~~r~e~~~~R~~ 209 (218) .+|++++++...+ +|++++|||+||||+|+++||||+|||||++||||++|+|++.+|+. T Consensus 747 ~~G~~~~~g~d~~--~~~k~~tP~~NLWy~k~a~d~Li~dqlqe~~~PGy~~R~~~R~~k~~ 806 (828) T PHA01972 747 AQGKAESAGADLL--KFATQQTPFANLWYTKAATDRLIMDQLQELSSPGYDKKQLNRSRREF 806 (828) T ss_pred HCCCCHHHHHHHH--HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 5699503489999--99985696066999999999999999998839865899999999981 No 2 >PHA02006 virion protein Probab=94.62 E-value=0.22 Score=27.39 Aligned_cols=165 Identities=16% Similarity=0.149 Sum_probs=95.9 Q ss_pred CHHHHHHHHHHHH--CCCCCCCCCCC----H-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH Q ss_conf 3347678654220--13211368761----7-899999999863119999999998563032212369999999999999 Q 537021.9.peg.1 2 QEHARGSVGSTIQ--DKRWITGKDGS----V-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILG 74 (218) Q Consensus 2 ~~~~~~~~~~~~~--~~~~~tg~~Gt----~-ge~~R~~~QFK~fPis~~~m~~~~r~~~~~g~~s~~~~aaaL~at~tl 74 (218) |+.+-+++.+++| |-+.|-|..|- . -.+..-++||.||- |.-.-++.+--+..-|..+ .|.|++-.-- T Consensus 1149 q~dVd~vmN~alRiMDDsiLYGR~GQgssFaRSqvGQILGQFRSFV-sFAhNKLLRGT~~nsG~~G----lAsLlAfQYP 1223 (1337) T PHA02006 1149 QADVNTVMNTALRIMDDSLLYGRVGQNSGFARSPVGQILGQFRSFV-AFAHNKLLRGTYENSGVLG----VASLLAFQYP 1223 (1337) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHH-HHHHHHHHHCCHHCCCHHH----HHHHHHHHCC T ss_conf 7888899999888731003012457887520306889999999999-9874077742010367024----9999975322 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCHHHH---HHHHH--HHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHH---HHHH Q ss_conf 989999999985489877786523589---99985--223343020000234200025246768999987778---6554 Q 537021.9.peg.1 75 EELIRKTLVPLISGKEPQLDFSDPTEY---IKALI--NGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKE---AVWD 146 (218) Q Consensus 75 ~Gal~~qlk~LL~GkdP~~d~tdp~fw---~~a~l--~Ggg~fgD~t~ygSsg~~vlGP~~s~~~~L~~~~~~---~~~~ 146 (218) |-.|.+..|++.+|+-- -+|.+.- ..+.= .|.||.+|...-...-..+.-||++.+.......++ ++++ T Consensus 1224 lT~lMv~anearkG~~d---~Se~Gi~kmA~~aIGYTAglGF~gD~~GiV~G~~~mSAPV~GLA~~~~~i~~G~~~L~~G 1300 (1337) T PHA02006 1224 LTALMMGAKAAINGKFD---TSDEGIRKMAIDGIGYTAGLGFTADMWGVVTGHSRMSAPVFGLAEHSNEVFRGVKDLVTG 1300 (1337) T ss_pred HHHHHHHHHHHHCCCCC---CCHHHHHHHHHHHCCCCCCCCCCHHHHHCCCCCCCCCCCCCCHHCCCCCHHHCCCHHCCC T ss_conf 89999988998658763---206779999886403000024203222001477876764300211896054153401068 Q ss_pred HCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 113101483145779999862186144899999 Q 537021.9.peg.1 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARG 179 (218) Q Consensus 147 e~~~~~~~~~~~~~~~k~v~~~~P~~NLWYtK~ 179 (218) +...+.++. -+ ....+-.||.|.+-.-. T Consensus 1301 ~~ps~A~~D-~~----~~A~~ALPFV~V~P~T~ 1328 (1337) T PHA02006 1301 DDPAAATGD-IV----SGAAGALPFVNVFPATK 1328 (1337) T ss_pred CCHHHHHHH-HH----HHHCCCCCCCCCCCCHH T ss_conf 884778889-99----85402486234463089 No 3 >PHA00368 internal virion protein D Probab=80.66 E-value=4.3 Score=20.19 Aligned_cols=56 Identities=16% Similarity=0.088 Sum_probs=36.7 Q ss_pred CCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH-HHHHHHHHHHHHHHHHH Q ss_conf 87617-89999999986311999999999856303221236-99999999999999899 Q 537021.9.peg.1 22 KDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQ-VYRAKALVIGILGEELI 78 (218) Q Consensus 22 ~~Gt~-ge~~R~~~QFK~fPis~~~m~~~~r~~~~~g~~s~-~~~aaaL~at~tl~Gal 78 (218) ++--. |-++.++.|||+|-|--.+-|.+ |.+-..--+.| .-.|.+.++|+.++|.- T Consensus 1091 QDskA~Ga~~km~lQFKsF~iKSlN~rfi-Rs~ye~tKN~RaiD~Alt~viS~GlAggy 1148 (1316) T PHA00368 1091 QDSKAFGALVKMVLQFKSFVIKSLNSRFI-RSFYEATKNNRAIDAALTIVISMGLAGGY 1148 (1316) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHCCCHHHHHHHHHHHHHHHHHHH T ss_conf 01244415789988777667774023889-99987502643899999999983134688 No 4 >TIGR01800 cit_synth_II 2-methylcitrate synthase/citrate synthase II; InterPro: IPR011278 Citrate synthase 2.3.3.1 from EC is a member of a small family of enzymes that can directly form a carbon-carbon bond without the presence of metal ion cofactors. It catalyses the first reaction in the Krebs' cycle, namely the conversion of oxaloacetate and acetyl-coenzyme A into citrate and coenzyme A. This reaction is important for energy generation and for carbon assimilation. The reaction proceeds via a non-covalently bound citryl-coenzyme A intermediate in a 2-step process (aldol-Claisen condensation followed by the hydrolysis of citryl-CoA). Citrate synthase enzymes are found in two distinct structural types: type I enzymes (found in eukaryotes, Gram-positive bacteria and archaea) form homodimers and have shorter sequences than type II enzymes, which are found in Gram-negative bacteria and are hexameric in structure. In both types, the monomer is composed of two domains: a large alpha-helical domain consisting of two structural repeats, where the second repeat is interrupted by a small alpha-helical domain. The cleft between these domains forms the active site, where both citrate and acetyl-coenzyme A bind. The enzyme undergoes a conformational change upon binding of the oxaloacetate ligand, whereby the active site cleft closes over in order to form the acetyl-CoA binding site . The energy required for domain closure comes from the interaction of the enzyme with the substrate. Type II enzymes possess an extra N-terminal beta-sheet domain, and some type II enzymes are allosterically inhibited by NADH . This entry represents type I homodimeric citrate synthase enzymes (predominantly from Gram-positive bacteria and archaea), as well as 2-methylcitrate synthase; some enzyme in this entry may be bifunctional citrate synthase/2-methylcitrate synthase enzymes. Members of this family appear as a second citrate synthase isozyme but typically are associated with propionate metabolism and synthesise 2-methylcitrate from propionyl-CoA; citrate synthase activity may be incidental. A number of species, including Thermoplasma acidophilum, Pyrococcus furiosus, and the Antarctic bacterium DS2-3R have a bifunctional member of this family as the only citrate synthase isozyme . 2-methylcitrate (2.3.3.5 from EC) synthase catalyses the conversion of oxaloacetate and propanoyl-CoA into (2R,3S)-2-hydroxybutane-1,2,3-tricarboxylate and coenzyme A. This enzyme is induced during bacterial growth on propionate, while type II hexameric citrate synthase is constitutive . ; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0044262 cellular carbohydrate metabolic process, 0005737 cytoplasm. Probab=66.53 E-value=10 Score=18.16 Aligned_cols=133 Identities=19% Similarity=0.201 Sum_probs=88.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-----CCCCCCCCHHHHHH Q ss_conf 9999999863119999999998563032212369999999999999989999999985489-----87778652358999 Q 537021.9.peg.1 29 LARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGK-----EPQLDFSDPTEYIK 103 (218) Q Consensus 29 ~~R~~~QFK~fPis~~~m~~~~r~~~~~g~~s~~~~aaaL~at~tl~Gal~~qlk~LL~Gk-----dP~~d~tdp~fw~~ 103 (218) ..|+.+||=+-=..|.|.+.. .-.=+|-.+|.-++|+.|+ .+|+|+ +| ++.|.+ T Consensus 120 a~~l~A~~P~Iva~~~R~~~g-~~~v~P~~~s~~s~A~nFL--------------~mL~GeddGsv~P------~~~~~~ 178 (386) T TIGR01800 120 AKRLLAKLPTIVAAWYRIRNG-LEIVAPREDSDLSIAANFL--------------YMLHGEDDGSVEP------SKEEEK 178 (386) T ss_pred HHHHHHHHHHHHHHHHHHHCC-CCCCCCCCCCCCCHHHHHH--------------HHHCCCCCCCCCC------CHHHHH T ss_conf 788774444999999976436-8856884355653567788--------------7641787788788------979998 Q ss_pred HHHHHHHHHHCCCCCCC-CCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 98522334302000023-42000252467689999877786554113101483145779999862186144899999999 Q 537021.9.peg.1 104 ALINGITHYERFSPFNS-SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFN 182 (218) Q Consensus 104 a~l~Ggg~fgD~t~ygS-sg~~vlGP~~s~~~~L~~~~~~~~~~e~~~~~~~~~~~~~~~k~v~~~~P~~NLWYtK~a~d 182 (218) +|=.=+=++=||-=..| ..+-|..=.+||.=+=+..+.+.+.+=.-| ++... T Consensus 179 a~~~aLiLyAeHe~NASTFaarv~aSTLSD~YSaitaAIGALkGPLHG-----GAnE~---------------------- 231 (386) T TIGR01800 179 AMDIALILYAEHELNASTFAARVIASTLSDIYSAITAAIGALKGPLHG-----GANEA---------------------- 231 (386) T ss_pred HHCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH-----HHHHH---------------------- T ss_conf 613001134223885667999999988888999999999970687503-----18999---------------------- Q ss_pred HHHHHHHHHHHCC-CHHHHHHHHHHHHHHH-HHH Q ss_conf 9999999997284-1468999999999999-865 Q 537021.9.peg.1 183 HFVRNSIDDVLNP-GGRARAEVYRQRQKYK-KQR 214 (218) Q Consensus 183 ~li~~qlqe~~~P-g~~~r~e~~~~R~~~k-~~r 214 (218) +..+++|.-+| . ++|.++++.-.+ ++| T Consensus 232 --vm~~l~ei~~pd~---~aE~~~~~~l~~GKEr 260 (386) T TIGR01800 232 --VMKMLDEIGDPDD---KAEAWIRKKLENGKER 260 (386) T ss_pred --HHHHHHHHCCHHH---HHHHHHHHHHHHCCCE T ss_conf --9999975238444---5889999999708966 No 5 >TIGR02715 amido_AtzE amidohydrolase, AtzE family; InterPro: IPR014087 Members of this protein family are aminohydrolases related to, but distinct from, glutamyl-tRNA(Gln) amidotransferase subunit A. The best-characterised member is from Pseudomonas sp., which hydrolyses ammonia from the three-nitrogen compound biuret to yield allophanate. Allophanate is also an intermediate in urea degradation by the urea carboxylase/allophanate hydrolase pathway, an alternative to urease.. Probab=45.82 E-value=11 Score=18.02 Aligned_cols=53 Identities=21% Similarity=0.223 Sum_probs=38.9 Q ss_pred HHCCCCCCCC---CC-CCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCHHH Q ss_conf 3020000234---20-002524676899998777865541131014831457799998621861448 Q 537021.9.peg.1 112 YERFSPFNSS---GW-DVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNL 174 (218) Q Consensus 112 fgD~t~ygSs---g~-~vlGP~~s~~~~L~~~~~~~~~~e~~~~~~~~~~~~~~~k~v~~~~P~~NL 174 (218) ||+..+-|+. .+ |.+||..-.+.||. .+++..+|.|+...--.. +...|-+++ T Consensus 190 yGRLSR~GvfPFV~SlDH~GPfArsV~DLA-----~~YDvmQG~D~~D~~~~~-----r~~~P~~~~ 246 (454) T TIGR02715 190 YGRLSRRGVFPFVASLDHVGPFARSVEDLA-----LAYDVMQGPDPQDPICTA-----RPAEPTVPL 246 (454) T ss_pred CCCCCCCCCCCHHHHHHCCCCCCCCHHHHH-----HHHHHHCCCCCCCCCCCC-----CCCCCCCCC T ss_conf 655865555512554422586001477899-----999987088877675334-----778777552 No 6 >PRK13455 F0F1 ATP synthase subunit B; Provisional Probab=34.46 E-value=35 Score=15.16 Aligned_cols=21 Identities=14% Similarity=0.327 Sum_probs=14.8 Q ss_pred HHHHCCCCCCCCCCCHHHHHH Q ss_conf 998548987778652358999 Q 537021.9.peg.1 83 VPLISGKEPQLDFSDPTEYIK 103 (218) Q Consensus 83 k~LL~GkdP~~d~tdp~fw~~ 103 (218) .+-+.--.|-.+++||.||+. T Consensus 13 ~~~~~~~~~~~~~~d~~FWv~ 33 (184) T PRK13455 13 SPALAAGGPFFSLSNTDFIVT 33 (184) T ss_pred CHHHHCCCCCCCCCCCHHHHH T ss_conf 868764799888889549999 No 7 >TIGR00985 3a0801s04tom mitochondrial import receptor subunit; InterPro: IPR016043 Virtually all mitochondrial precursors are imported via the same mechanism : precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 .. Probab=32.41 E-value=38 Score=14.93 Aligned_cols=38 Identities=16% Similarity=0.182 Sum_probs=19.2 Q ss_pred HHHHHHHHHCCCCCCCCCCCH----HHHHHHHHHHHHHHHCC Q ss_conf 999999985489877786523----58999985223343020 Q 537021.9.peg.1 78 IRKTLVPLISGKEPQLDFSDP----TEYIKALINGITHYERF 115 (218) Q Consensus 78 l~~qlk~LL~GkdP~~d~tdp----~fw~~a~l~Ggg~fgD~ 115 (218) +...+.+++.|.+-.++++|| .||.+-...|=-++.+- T Consensus 66 ~~~~~r~~~~~~~~~P~p~d~~e~e~fFl~~V~~GE~L~~q~ 107 (155) T TIGR00985 66 LRKVVREAVAELAKEPDPSDAEEKEAFFLEEVQLGEELLAQQ 107 (155) T ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCC T ss_conf 989999999876407899988789879999999877850489 No 8 >PRK05462 S-adenosylmethionine decarboxylase; Provisional Probab=30.88 E-value=30 Score=15.51 Aligned_cols=48 Identities=10% Similarity=0.221 Sum_probs=31.7 Q ss_pred HHHHHHCCCCHHHHHHHHHHH-----HHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 999862186144899999999-----999999999728414689999999999 Q 537021.9.peg.1 162 KELVNTFVPFQNLWYARGAFN-----HFVRNSIDDVLNPGGRARAEVYRQRQK 209 (218) Q Consensus 162 ~k~v~~~~P~~NLWYtK~a~d-----~li~~qlqe~~~Pg~~~r~e~~~~R~~ 209 (218) +.++--|++-.|+|-||+... .++|..=.+-++|.-.++.+.+++|+- T Consensus 202 Y~miDvNvyQeNiFHTKM~lkefdl~nYLF~~~~~~ls~~e~~~i~~~L~~EM 254 (266) T PRK05462 202 YDMIDVNVYQENIFHTKMLLKEFDLDNYLFNTKPEDLSAEERQEITALLKKEM 254 (266) T ss_pred CEEEEECCCCHHHHHHHHHHHHCCHHHCCCCCCCHHCCHHHHHHHHHHHHHHH T ss_conf 21687424412468888887643722202379900289899999999999999 No 9 >pfam02064 MAS20 MAS20 protein import receptor. Probab=30.34 E-value=42 Score=14.73 Aligned_cols=15 Identities=7% Similarity=0.080 Sum_probs=8.4 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 589999852233430 Q 537021.9.peg.1 99 TEYIKALINGITHYE 113 (218) Q Consensus 99 ~fw~~a~l~Ggg~fg 113 (218) .||.+-...|-.+.. T Consensus 91 ~fFmqqV~~GE~L~~ 105 (184) T pfam02064 91 ATFLENVEAGEILGM 105 (184) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999998889871 No 10 >KOG4056 consensus Probab=29.56 E-value=42 Score=14.72 Aligned_cols=25 Identities=20% Similarity=0.189 Sum_probs=14.2 Q ss_pred CCCCCCCCCH----HHHHHHHHHHHHHHHCC Q ss_conf 9877786523----58999985223343020 Q 537021.9.peg.1 89 KEPQLDFSDP----TEYIKALINGITHYERF 115 (218) Q Consensus 89 kdP~~d~tdp----~fw~~a~l~Ggg~fgD~ 115 (218) +.| |.+|+ +||++-...|--++.+- T Consensus 67 ~~p--d~~d~~~~E~~Fmqqv~lGE~L~~qg 95 (143) T KOG4056 67 NIP--DPSDAEEVEKFFMQQVQLGEELLAQG 95 (143) T ss_pred CCC--CCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 699--98989999999999998689999765 No 11 >pfam08635 ox_reductase_C Putative oxidoreductase C terminal. This is the C terminal of a family of putative oxidoreductases. Probab=28.78 E-value=34 Score=15.22 Aligned_cols=41 Identities=17% Similarity=0.263 Sum_probs=24.3 Q ss_pred HHHHHHHHHCCCCCCCCC------------CCHHHHHHHHHHH-----HHHHHCCCCC Q ss_conf 999999985489877786------------5235899998522-----3343020000 Q 537021.9.peg.1 78 IRKTLVPLISGKEPQLDF------------SDPTEYIKALING-----ITHYERFSPF 118 (218) Q Consensus 78 l~~qlk~LL~GkdP~~d~------------tdp~fw~~a~l~G-----gg~fgD~t~y 118 (218) .+..+|++|++|+-+.+. .+|--|.....+| +.+|-|+.+| T Consensus 5 ~Vq~ak~~l~~nnl~v~~t~ary~~aY~~~~kp~WW~k~~sgGpvVEQaTHf~DLsRY 62 (142) T pfam08635 5 AVQKMKQIIKENNLTVMATNARYVSAYEHNAKPFWWNKSISGGPVVEQGTHFCDLSRY 62 (142) T ss_pred HHHHHHHHHHCCCCEEEEEEEEEEEEECCCCCHHHHHCCCCCCCEEEHHHHHHHHHHH T ss_conf 9999999985498258865100034301469817876056689601036789899998 No 12 >COG3962 Acetolactate synthase [Amino acid transport and metabolism] Probab=27.50 E-value=41 Score=14.77 Aligned_cols=41 Identities=27% Similarity=0.388 Sum_probs=28.9 Q ss_pred CHHHHHHHHHHHHCCCCCCCCCCCH-HHHHHHHHHHHHHHHHHH Q ss_conf 3347678654220132113687617-899999999863119999 Q 537021.9.peg.1 2 QEHARGSVGSTIQDKRWITGKDGSV-NNLARLMGQFLVMPISWS 44 (218) Q Consensus 2 ~~~~~~~~~~~~~~~~~~tg~~Gt~-ge~~R~~~QFK~fPis~~ 44 (218) |.++.|.|--++.+..-+..+.|+. |||-|.-. -..|.+|- T Consensus 397 q~~vigav~~~~~~~svvvcAAGsLPGdLhkLW~--~~~p~~YH 438 (617) T COG3962 397 QTQVIGAVQRTISDDSVVVCAAGSLPGDLHKLWR--AGVPGTYH 438 (617) T ss_pred CHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHC--CCCCCCEE T ss_conf 1357888885248885799837899678999860--48998542 No 13 >TIGR03331 SAM_DCase_Eco S-adenosylmethionine decarboxylase proenzyme, Escherichia coli form. Members of this protein family are the single chain precursor of the S-adenosylmethionine decarboxylase as found in Escherichia coli. This form shows a substantially different architecture from the form shared by the Archaea, Bacillus, and many other species (TIGR03330). It shows little or no similarity to the form found in eukaryotes (TIGR00535). Probab=27.35 E-value=34 Score=15.21 Aligned_cols=48 Identities=10% Similarity=0.247 Sum_probs=33.5 Q ss_pred HHHHHHCCCCHHHHHHHHHHH-----HHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 999862186144899999999-----999999999728414689999999999 Q 537021.9.peg.1 162 KELVNTFVPFQNLWYARGAFN-----HFVRNSIDDVLNPGGRARAEVYRQRQK 209 (218) Q Consensus 162 ~k~v~~~~P~~NLWYtK~a~d-----~li~~qlqe~~~Pg~~~r~e~~~~R~~ 209 (218) +.++--|++-.|+|-||+... .++|..=.+-+.|.-.++.+++++|+- T Consensus 198 Y~~iDvNvyqeNiFHTKm~lke~dL~nYLF~~~~~~ls~~e~~~I~~~L~~EM 250 (259) T TIGR03331 198 YQMIDVNVYQENIFHTKMMLKDFDLDNYLFGTAKEDLSPEERREITARLKKEM 250 (259) T ss_pred CCEEEECCCHHHHHHHHHHHHHCCHHHCCCCCCCHHCCHHHHHHHHHHHHHHH T ss_conf 32587524304568888876315700014579802389999999999999999 No 14 >PHA01752 hypothetical protein Probab=25.75 E-value=43 Score=14.64 Aligned_cols=18 Identities=22% Similarity=0.536 Sum_probs=13.5 Q ss_pred CCCCHHHHHHHHHHHHHH Q ss_conf 186144899999999999 Q 537021.9.peg.1 168 FVPFQNLWYARGAFNHFV 185 (218) Q Consensus 168 ~~P~~NLWYtK~a~d~li 185 (218) +--..|.||+|++-+-.| T Consensus 355 NrKYsNiWYAKmVasA~i 372 (462) T PHA01752 355 NKKYSNIWYAKMVASATI 372 (462) T ss_pred HHCCCCHHHHHHHHHHHH T ss_conf 540010279999888899 No 15 >pfam01090 Ribosomal_S19e Ribosomal protein S19e. Probab=24.48 E-value=54 Score=14.13 Aligned_cols=12 Identities=8% Similarity=-0.172 Sum_probs=5.8 Q ss_pred HHHHHHHHHHHH Q ss_conf 998999999998 Q 537021.9.peg.1 74 GEELIRKTLVPL 85 (218) Q Consensus 74 l~Gal~~qlk~L 85 (218) |.-+++..||.- T Consensus 11 ~I~~lA~~LK~~ 22 (140) T pfam01090 11 LIKALAEFLKKS 22 (140) T ss_pred HHHHHHHHHHHC T ss_conf 999999999647 No 16 >TIGR00838 argH argininosuccinate lyase; InterPro: IPR009049 This entry represents argininosuccinate lyase, but may include examples of avian delta crystallins, in which argininosuccinate lyase activity may or may not be present and the biological role is to provide the optically clear cellular protein of the eye lens. ; GO: 0004056 argininosuccinate lyase activity, 0042450 arginine biosynthetic process via ornithine. Probab=24.05 E-value=55 Score=14.08 Aligned_cols=27 Identities=33% Similarity=0.662 Sum_probs=23.2 Q ss_pred CCCCCCCCH-HHHHHHHHHHHHHHHHHH Q ss_conf 113687617-899999999863119999 Q 537021.9.peg.1 18 WITGKDGSV-NNLARLMGQFLVMPISWS 44 (218) Q Consensus 18 ~~tg~~Gt~-ge~~R~~~QFK~fPis~~ 44 (218) +|=||-|.+ |+|+=+|+-.|+-|.+|= T Consensus 290 LiRGK~Grv~G~L~g~L~~lKalP~aYN 317 (469) T TIGR00838 290 LIRGKTGRVIGNLTGLLTILKALPLAYN 317 (469) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 9987552689999999999711233235 No 17 >TIGR02197 heptose_epim ADP-L-glycero-D-manno-heptose-6-epimerase; InterPro: IPR011912 Lipopolysaccharides (LPS) are glycolipids that consitutes the outer monolayer of the outer membranes of most Gram-negative bacteria . They consist of lipid A (endotoxin) which anchors LPS to the outer membrane, a non-repeating core oligosachharide, and an immunogenic O-antigen repeat polymer, which is an oligosaccharide of 1-40 units that variesbetween different strains of bacteria. Although the O-antigen and most of the core domain are not necessary for growth in the lab, they appear to help bacteria resist environmental stresses including the complement system and antibiotics. This family consists of examples of ADP-L-glycero-D-mannoheptose-6-epimerase, an enzyme involved in biosynthesis of the inner core of LPS in Gram-negative bacteria . This enzyme is homologous to UDP-glucose 4-epimerase (IPR005886 from INTERPRO) and belongs to the NAD dependent epimerase/dehydratase family. It participates in the biosynthetic pathway leading to incorporation of heptose, a conserved sugar, into the core region of LPS, performing the reaction shown below:ADP-D-glycero-D-manno-heptose = ADP-L-glycero-D-manno-heptose It is a homopentameric enzyme with each monomer composed of two domains: an N-terminal modified Rossman fold domain for NADP binding, and a C-terminal substrate binding domain.; GO: 0008712 ADP-glyceromanno-heptose 6-epimerase activity, 0050661 NADP binding, 0005975 carbohydrate metabolic process. Probab=22.92 E-value=45 Score=14.57 Aligned_cols=32 Identities=22% Similarity=0.416 Sum_probs=22.5 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHHHHH--HHC Q ss_conf 99862186144899999999999999999--728 Q 537021.9.peg.1 163 ELVNTFVPFQNLWYARGAFNHFVRNSIDD--VLN 194 (218) Q Consensus 163 k~v~~~~P~~NLWYtK~a~d~li~~qlqe--~~~ 194 (218) +-++.+-|..===|.|..||.++..+++. -.. T Consensus 149 e~L~kLrPlN~YGySK~lFD~~v~~~~~~~~~~~ 182 (353) T TIGR02197 149 EELQKLRPLNVYGYSKFLFDQYVRRRVLPGEALS 182 (353) T ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCC T ss_conf 8897518788612216789899999860124798 No 18 >PHA00665 major capsid protein Probab=22.24 E-value=36 Score=15.06 Aligned_cols=66 Identities=14% Similarity=0.162 Sum_probs=45.5 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HCCCCCCCCCCCHHHHHHHHHHHHH---HHHCC Q ss_conf 99856303221236999999999999998999999998--5489877786523589999852233---43020 Q 537021.9.peg.1 48 LIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPL--ISGKEPQLDFSDPTEYIKALINGIT---HYERF 115 (218) Q Consensus 48 ~~~r~~~~~g~~s~~~~aaaL~at~tl~Gal~~qlk~L--L~GkdP~~d~tdp~fw~~a~l~Ggg---~fgD~ 115 (218) .|+|-++ |+...-+..++-.-++.++.+.+.-=|+| |+|++-..-.++-..|+.++-|.-- ||||. T Consensus 64 ~wR~ln~--Gv~p~kstt~qv~d~~gmlE~~seVD~~La~lnGN~aafR~SEd~aflEgmnQ~~a~~lfYGd~ 134 (329) T PHA00665 64 TWRKLNY--GVQPEKSRTVQVTDSMGMLETYAEVDKALADLNGNTAAFRLSEDRAFIEGMNQTVAQTLFYGDS 134 (329) T ss_pred HHHHHHC--CCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHCCCHHHHHHCCCHHHHHHHHHHHHHHEEECCC T ss_conf 7998617--8898775357741111678889887299998659867754223178999887998875250788 No 19 >TIGR02523 type_IV_pilV type IV pilus modification protein PilV; InterPro: IPR013362 Pilus systems categorized as type IV pilins differ greatly from one another, with some showing greater similarity to type II or type III secretion systems than to each other. Members of this protein family represent the PilV protein of type IV pilus systems found in Pseudomonas aeruginosa PAO1, Pseudomonas syringae DC3000, Neisseria meningitidis MC58, Xylella fastidiosa 9a5c, and other species.. Probab=22.11 E-value=60 Score=13.86 Aligned_cols=29 Identities=17% Similarity=0.024 Sum_probs=23.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999999989999999985489877 Q 537021.9.peg.1 64 RAKALVIGILGEELIRKTLVPLISGKEPQ 92 (218) Q Consensus 64 ~aaaL~at~tl~Gal~~qlk~LL~GkdP~ 92 (218) =+|=|++++.++|.+.+|++.+-.=++=+ T Consensus 10 LvAlL~LaiGvLg~~alql~~~~~t~~A~ 38 (163) T TIGR02523 10 LVALLVLAIGVLGMAALQLKAVRYTKSAS 38 (163) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999986888768899 Done!