Query         537021.9.peg.1142_1
Match_columns 218
No_of_seqs    32 out of 34
Neff          3.8 
Searched_HMMs 39220
Date          Wed May 25 07:31:27 2011
Command       /home/congqian_1/programs/hhpred/hhsearch -i peg_1142.hhm -d /home/congqian_1/database/cdd/Cdd.hhm 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PHA01972 structural protein    100.0       0       0  410.6  16.8  204    2-209   589-806 (828)
  2 PHA02006 virion protein         94.6    0.22 5.5E-06   27.4   7.3  165    2-179  1149-1328(1337)
  3 PHA00368 internal virion prote  80.7     4.3 0.00011   20.2   5.3   56   22-78   1091-1148(1316)
  4 TIGR01800 cit_synth_II 2-methy  66.5      10 0.00026   18.2   7.1  133   29-214   120-260 (386)
  5 TIGR02715 amido_AtzE amidohydr  45.8      11 0.00027   18.0   1.3   53  112-174   190-246 (454)
  6 PRK13455 F0F1 ATP synthase sub  34.5      35 0.00089   15.2   2.5   21   83-103    13-33  (184)
  7 TIGR00985 3a0801s04tom mitocho  32.4      38 0.00098   14.9   3.3   38   78-115    66-107 (155)
  8 PRK05462 S-adenosylmethionine   30.9      30 0.00077   15.5   1.7   48  162-209   202-254 (266)
  9 pfam02064 MAS20 MAS20 protein   30.3      42  0.0011   14.7   2.8   15   99-113    91-105 (184)
 10 KOG4056 consensus               29.6      42  0.0011   14.7   2.3   25   89-115    67-95  (143)
 11 pfam08635 ox_reductase_C Putat  28.8      34 0.00087   15.2   1.7   41   78-118     5-62  (142)
 12 COG3962 Acetolactate synthase   27.5      41   0.001   14.8   1.9   41    2-44    397-438 (617)
 13 TIGR03331 SAM_DCase_Eco S-aden  27.3      34 0.00087   15.2   1.5   48  162-209   198-250 (259)
 14 PHA01752 hypothetical protein   25.8      43  0.0011   14.6   1.8   18  168-185   355-372 (462)
 15 pfam01090 Ribosomal_S19e Ribos  24.5      54  0.0014   14.1   3.9   12   74-85     11-22  (140)
 16 TIGR00838 argH argininosuccina  24.1      55  0.0014   14.1   3.4   27   18-44    290-317 (469)
 17 TIGR02197 heptose_epim ADP-L-g  22.9      45  0.0011   14.6   1.4   32  163-194   149-182 (353)
 18 PHA00665 major capsid protein   22.2      36 0.00093   15.1   0.8   66   48-115    64-134 (329)
 19 TIGR02523 type_IV_pilV type IV  22.1      60  0.0015   13.9   3.4   29   64-92     10-38  (163)

No 1  
>PHA01972 structural protein
Probab=100.00  E-value=0  Score=410.63  Aligned_cols=204  Identities=21%  Similarity=0.285  Sum_probs=185.9

Q ss_pred             CHHHHHHHHHHHHCCCCCCC--CCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH---HHHHHHHHHHHHHHH
Q ss_conf             33476786542201321136--87617-8999999998631199999999985630322123---699999999999999
Q 537021.9.peg.1    2 QEHARGSVGSTIQDKRWITG--KDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS---QVYRAKALVIGILGE   75 (218)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~tg--~~Gt~-ge~~R~~~QFK~fPis~~~m~~~~r~~~~~g~~s---~~~~aaaL~at~tl~   75 (218)
                      +|...+|...++|++..+++  ++||+ ||+.|+++|||+|||+++ ++||.|++.+.|.++   ++.|+++|++++|++
T Consensus       589 de~~~AV~tp~~r~rai~~~g~q~GT~~GE~~Rs~~qFKSFpia~~-~~~~~R~~~~~G~~~~~~~~~y~a~l~~~tTl~  667 (828)
T PHA01972        589 GETMMAVPTPDARTRAIMAGGTKSGTFGGELHRSFFLFKSFPIATI-MRHWRRAFTGKGYPGAFDRMAYAAIMVASTTLL  667 (828)
T ss_pred             CCCCEECCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHH-HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH
T ss_conf             1121101588536789882788898488999999999841489999-999999986068886046788999999999999


Q ss_pred             HHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHH--HHHCC-----CCCCCCC-CCCCCHHHHHHHHHHHHHHHHHHHH
Q ss_conf             89999999985489877786523589999852233--43020-----0002342-0002524676899998777865541
Q 537021.9.peg.1   76 ELIRKTLVPLISGKEPQLDFSDPTEYIKALINGIT--HYERF-----SPFNSSG-WDVLGPWSSQAGKLAIAGKEAVWDE  147 (218)
Q Consensus        76 Gal~~qlk~LL~GkdP~~d~tdp~fw~~a~l~Ggg--~fgD~-----t~ygSsg-~~vlGP~~s~~~~L~~~~~~~~~~e  147 (218)
                      |++++|+|+||+||||| |++||+||++|++||||  ||||+     .+||++. .+++||++|++++|+.+++++..+.
T Consensus       668 G~~s~q~K~ll~GrdPr-~~~~~~fw~~a~lqGGGlGi~GDfL~~~~~r~G~~~~~~~aGP~~s~~~~l~~l~~~~~~~a  746 (828)
T PHA01972        668 GAGSIQAKDLLNGKDPR-SMTDPKFWIEALLKGGSFGYYGDFLRNAHTGYGSDMTSYIGGPVLSYGDQLTKWVAMNPQDA  746 (828)
T ss_pred             HHHHHHHHHHHCCCCCC-CCCCCHHHHHHHHCCCCCHHHHHHHHHHHCCCCCHHHHHHCCCCHHHHHHHHHHHHCCHHHH
T ss_conf             99999999997799999-99862499999973687125799987653143510788862860567999999862138888


Q ss_pred             CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
Q ss_conf             13101483145779999862186144899999999999999999728414689999999999
Q 537021.9.peg.1  148 GTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQK  209 (218)
Q Consensus       148 ~~~~~~~~~~~~~~~k~v~~~~P~~NLWYtK~a~d~li~~qlqe~~~Pg~~~r~e~~~~R~~  209 (218)
                      .+|++++++...+  +|++++|||+||||+|+++||||+|||||++||||++|+|++.+|+.
T Consensus       747 ~~G~~~~~g~d~~--~~~k~~tP~~NLWy~k~a~d~Li~dqlqe~~~PGy~~R~~~R~~k~~  806 (828)
T PHA01972        747 AQGKAESAGADLL--KFATQQTPFANLWYTKAATDRLIMDQLQELSSPGYDKKQLNRSRREF  806 (828)
T ss_pred             HCCCCHHHHHHHH--HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
T ss_conf             5699503489999--99985696066999999999999999998839865899999999981


No 2  
>PHA02006 virion protein
Probab=94.62  E-value=0.22  Score=27.39  Aligned_cols=165  Identities=16%  Similarity=0.149  Sum_probs=95.9

Q ss_pred             CHHHHHHHHHHHH--CCCCCCCCCCC----H-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH
Q ss_conf             3347678654220--13211368761----7-899999999863119999999998563032212369999999999999
Q 537021.9.peg.1    2 QEHARGSVGSTIQ--DKRWITGKDGS----V-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILG   74 (218)
Q Consensus         2 ~~~~~~~~~~~~~--~~~~~tg~~Gt----~-ge~~R~~~QFK~fPis~~~m~~~~r~~~~~g~~s~~~~aaaL~at~tl   74 (218)
                      |+.+-+++.+++|  |-+.|-|..|-    . -.+..-++||.||- |.-.-++.+--+..-|..+    .|.|++-.--
T Consensus      1149 q~dVd~vmN~alRiMDDsiLYGR~GQgssFaRSqvGQILGQFRSFV-sFAhNKLLRGT~~nsG~~G----lAsLlAfQYP 1223 (1337)
T PHA02006       1149 QADVNTVMNTALRIMDDSLLYGRVGQNSGFARSPVGQILGQFRSFV-AFAHNKLLRGTYENSGVLG----VASLLAFQYP 1223 (1337)
T ss_pred             HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHH-HHHHHHHHHCCHHCCCHHH----HHHHHHHHCC
T ss_conf             7888899999888731003012457887520306889999999999-9874077742010367024----9999975322


Q ss_pred             HHHHHHHHHHHHCCCCCCCCCCCHHHH---HHHHH--HHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHH---HHHH
Q ss_conf             989999999985489877786523589---99985--223343020000234200025246768999987778---6554
Q 537021.9.peg.1   75 EELIRKTLVPLISGKEPQLDFSDPTEY---IKALI--NGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKE---AVWD  146 (218)
Q Consensus        75 ~Gal~~qlk~LL~GkdP~~d~tdp~fw---~~a~l--~Ggg~fgD~t~ygSsg~~vlGP~~s~~~~L~~~~~~---~~~~  146 (218)
                      |-.|.+..|++.+|+--   -+|.+.-   ..+.=  .|.||.+|...-...-..+.-||++.+.......++   ++++
T Consensus      1224 lT~lMv~anearkG~~d---~Se~Gi~kmA~~aIGYTAglGF~gD~~GiV~G~~~mSAPV~GLA~~~~~i~~G~~~L~~G 1300 (1337)
T PHA02006       1224 LTALMMGAKAAINGKFD---TSDEGIRKMAIDGIGYTAGLGFTADMWGVVTGHSRMSAPVFGLAEHSNEVFRGVKDLVTG 1300 (1337)
T ss_pred             HHHHHHHHHHHHCCCCC---CCHHHHHHHHHHHCCCCCCCCCCHHHHHCCCCCCCCCCCCCCHHCCCCCHHHCCCHHCCC
T ss_conf             89999988998658763---206779999886403000024203222001477876764300211896054153401068


Q ss_pred             HCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHH
Q ss_conf             113101483145779999862186144899999
Q 537021.9.peg.1  147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARG  179 (218)
Q Consensus       147 e~~~~~~~~~~~~~~~k~v~~~~P~~NLWYtK~  179 (218)
                      +...+.++. -+    ....+-.||.|.+-.-.
T Consensus      1301 ~~ps~A~~D-~~----~~A~~ALPFV~V~P~T~ 1328 (1337)
T PHA02006       1301 DDPAAATGD-IV----SGAAGALPFVNVFPATK 1328 (1337)
T ss_pred             CCHHHHHHH-HH----HHHCCCCCCCCCCCCHH
T ss_conf             884778889-99----85402486234463089


No 3  
>PHA00368 internal virion protein D
Probab=80.66  E-value=4.3  Score=20.19  Aligned_cols=56  Identities=16%  Similarity=0.088  Sum_probs=36.7

Q ss_pred             CCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH-HHHHHHHHHHHHHHHHH
Q ss_conf             87617-89999999986311999999999856303221236-99999999999999899
Q 537021.9.peg.1   22 KDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQ-VYRAKALVIGILGEELI   78 (218)
Q Consensus        22 ~~Gt~-ge~~R~~~QFK~fPis~~~m~~~~r~~~~~g~~s~-~~~aaaL~at~tl~Gal   78 (218)
                      ++--. |-++.++.|||+|-|--.+-|.+ |.+-..--+.| .-.|.+.++|+.++|.-
T Consensus      1091 QDskA~Ga~~km~lQFKsF~iKSlN~rfi-Rs~ye~tKN~RaiD~Alt~viS~GlAggy 1148 (1316)
T PHA00368       1091 QDSKAFGALVKMVLQFKSFVIKSLNSRFI-RSFYEATKNNRAIDAALTIVISMGLAGGY 1148 (1316)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHCCCHHHHHHHHHHHHHHHHHHH
T ss_conf             01244415789988777667774023889-99987502643899999999983134688


No 4  
>TIGR01800 cit_synth_II 2-methylcitrate synthase/citrate synthase II; InterPro: IPR011278   Citrate synthase 2.3.3.1 from EC is a member of a small family of enzymes that can directly form a carbon-carbon bond without the presence of metal ion cofactors. It catalyses the first reaction in the Krebs' cycle, namely the conversion of oxaloacetate and acetyl-coenzyme A into citrate and coenzyme A. This reaction is important for energy generation and for carbon assimilation. The reaction proceeds via a non-covalently bound citryl-coenzyme A intermediate in a 2-step process (aldol-Claisen condensation followed by the hydrolysis of citryl-CoA).    Citrate synthase enzymes are found in two distinct structural types: type I enzymes (found in eukaryotes, Gram-positive bacteria and archaea) form homodimers and have shorter sequences than type II enzymes, which are found in Gram-negative bacteria and are hexameric in structure. In both types, the monomer is composed of two domains: a large alpha-helical domain consisting of two structural repeats, where the second repeat is interrupted by a small alpha-helical domain. The cleft between these domains forms the active site, where both citrate and acetyl-coenzyme A bind. The enzyme undergoes a conformational change upon binding of the oxaloacetate ligand, whereby the active site cleft closes over in order to form the acetyl-CoA binding site . The energy required for domain closure comes from the interaction of the enzyme with the substrate. Type II enzymes possess an extra N-terminal beta-sheet domain, and some type II enzymes are allosterically inhibited by NADH .   This entry represents type I homodimeric citrate synthase enzymes (predominantly from Gram-positive bacteria and archaea), as well as 2-methylcitrate synthase; some enzyme in this entry may be bifunctional citrate synthase/2-methylcitrate synthase enzymes. Members of this family appear as a second citrate synthase isozyme but typically are associated with propionate metabolism and synthesise 2-methylcitrate from propionyl-CoA; citrate synthase activity may be incidental. A number of species, including Thermoplasma acidophilum, Pyrococcus furiosus, and the Antarctic bacterium DS2-3R have a bifunctional member of this family as the only citrate synthase isozyme . 2-methylcitrate (2.3.3.5 from EC) synthase catalyses the conversion of oxaloacetate and propanoyl-CoA into (2R,3S)-2-hydroxybutane-1,2,3-tricarboxylate and coenzyme A. This enzyme is induced during bacterial growth on propionate, while type II hexameric citrate synthase is constitutive . ; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0044262 cellular carbohydrate metabolic process, 0005737 cytoplasm.
Probab=66.53  E-value=10  Score=18.16  Aligned_cols=133  Identities=19%  Similarity=0.201  Sum_probs=88.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-----CCCCCCCCHHHHHH
Q ss_conf             9999999863119999999998563032212369999999999999989999999985489-----87778652358999
Q 537021.9.peg.1   29 LARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGK-----EPQLDFSDPTEYIK  103 (218)
Q Consensus        29 ~~R~~~QFK~fPis~~~m~~~~r~~~~~g~~s~~~~aaaL~at~tl~Gal~~qlk~LL~Gk-----dP~~d~tdp~fw~~  103 (218)
                      ..|+.+||=+-=..|.|.+.. .-.=+|-.+|.-++|+.|+              .+|+|+     +|      ++.|.+
T Consensus       120 a~~l~A~~P~Iva~~~R~~~g-~~~v~P~~~s~~s~A~nFL--------------~mL~GeddGsv~P------~~~~~~  178 (386)
T TIGR01800       120 AKRLLAKLPTIVAAWYRIRNG-LEIVAPREDSDLSIAANFL--------------YMLHGEDDGSVEP------SKEEEK  178 (386)
T ss_pred             HHHHHHHHHHHHHHHHHHHCC-CCCCCCCCCCCCCHHHHHH--------------HHHCCCCCCCCCC------CHHHHH
T ss_conf             788774444999999976436-8856884355653567788--------------7641787788788------979998


Q ss_pred             HHHHHHHHHHCCCCCCC-CCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHH
Q ss_conf             98522334302000023-42000252467689999877786554113101483145779999862186144899999999
Q 537021.9.peg.1  104 ALINGITHYERFSPFNS-SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFN  182 (218)
Q Consensus       104 a~l~Ggg~fgD~t~ygS-sg~~vlGP~~s~~~~L~~~~~~~~~~e~~~~~~~~~~~~~~~k~v~~~~P~~NLWYtK~a~d  182 (218)
                      +|=.=+=++=||-=..| ..+-|..=.+||.=+=+..+.+.+.+=.-|     ++...                      
T Consensus       179 a~~~aLiLyAeHe~NASTFaarv~aSTLSD~YSaitaAIGALkGPLHG-----GAnE~----------------------  231 (386)
T TIGR01800       179 AMDIALILYAEHELNASTFAARVIASTLSDIYSAITAAIGALKGPLHG-----GANEA----------------------  231 (386)
T ss_pred             HHCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH-----HHHHH----------------------
T ss_conf             613001134223885667999999988888999999999970687503-----18999----------------------


Q ss_pred             HHHHHHHHHHHCC-CHHHHHHHHHHHHHHH-HHH
Q ss_conf             9999999997284-1468999999999999-865
Q 537021.9.peg.1  183 HFVRNSIDDVLNP-GGRARAEVYRQRQKYK-KQR  214 (218)
Q Consensus       183 ~li~~qlqe~~~P-g~~~r~e~~~~R~~~k-~~r  214 (218)
                        +..+++|.-+| .   ++|.++++.-.+ ++|
T Consensus       232 --vm~~l~ei~~pd~---~aE~~~~~~l~~GKEr  260 (386)
T TIGR01800       232 --VMKMLDEIGDPDD---KAEAWIRKKLENGKER  260 (386)
T ss_pred             --HHHHHHHHCCHHH---HHHHHHHHHHHHCCCE
T ss_conf             --9999975238444---5889999999708966


No 5  
>TIGR02715 amido_AtzE amidohydrolase, AtzE family; InterPro: IPR014087   Members of this protein family are aminohydrolases related to, but distinct from, glutamyl-tRNA(Gln) amidotransferase subunit A. The best-characterised member is from Pseudomonas sp., which hydrolyses ammonia from the three-nitrogen compound biuret to yield allophanate. Allophanate is also an intermediate in urea degradation by the urea carboxylase/allophanate hydrolase pathway, an alternative to urease..
Probab=45.82  E-value=11  Score=18.02  Aligned_cols=53  Identities=21%  Similarity=0.223  Sum_probs=38.9

Q ss_pred             HHCCCCCCCC---CC-CCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCHHH
Q ss_conf             3020000234---20-002524676899998777865541131014831457799998621861448
Q 537021.9.peg.1  112 YERFSPFNSS---GW-DVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNL  174 (218)
Q Consensus       112 fgD~t~ygSs---g~-~vlGP~~s~~~~L~~~~~~~~~~e~~~~~~~~~~~~~~~k~v~~~~P~~NL  174 (218)
                      ||+..+-|+.   .+ |.+||..-.+.||.     .+++..+|.|+...--..     +...|-+++
T Consensus       190 yGRLSR~GvfPFV~SlDH~GPfArsV~DLA-----~~YDvmQG~D~~D~~~~~-----r~~~P~~~~  246 (454)
T TIGR02715       190 YGRLSRRGVFPFVASLDHVGPFARSVEDLA-----LAYDVMQGPDPQDPICTA-----RPAEPTVPL  246 (454)
T ss_pred             CCCCCCCCCCCHHHHHHCCCCCCCCHHHHH-----HHHHHHCCCCCCCCCCCC-----CCCCCCCCC
T ss_conf             655865555512554422586001477899-----999987088877675334-----778777552


No 6  
>PRK13455 F0F1 ATP synthase subunit B; Provisional
Probab=34.46  E-value=35  Score=15.16  Aligned_cols=21  Identities=14%  Similarity=0.327  Sum_probs=14.8

Q ss_pred             HHHHCCCCCCCCCCCHHHHHH
Q ss_conf             998548987778652358999
Q 537021.9.peg.1   83 VPLISGKEPQLDFSDPTEYIK  103 (218)
Q Consensus        83 k~LL~GkdP~~d~tdp~fw~~  103 (218)
                      .+-+.--.|-.+++||.||+.
T Consensus        13 ~~~~~~~~~~~~~~d~~FWv~   33 (184)
T PRK13455         13 SPALAAGGPFFSLSNTDFIVT   33 (184)
T ss_pred             CHHHHCCCCCCCCCCCHHHHH
T ss_conf             868764799888889549999


No 7  
>TIGR00985 3a0801s04tom mitochondrial import receptor subunit; InterPro: IPR016043 Virtually all mitochondrial precursors are imported via the same mechanism : precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 ..
Probab=32.41  E-value=38  Score=14.93  Aligned_cols=38  Identities=16%  Similarity=0.182  Sum_probs=19.2

Q ss_pred             HHHHHHHHHCCCCCCCCCCCH----HHHHHHHHHHHHHHHCC
Q ss_conf             999999985489877786523----58999985223343020
Q 537021.9.peg.1   78 IRKTLVPLISGKEPQLDFSDP----TEYIKALINGITHYERF  115 (218)
Q Consensus        78 l~~qlk~LL~GkdP~~d~tdp----~fw~~a~l~Ggg~fgD~  115 (218)
                      +...+.+++.|.+-.++++||    .||.+-...|=-++.+-
T Consensus        66 ~~~~~r~~~~~~~~~P~p~d~~e~e~fFl~~V~~GE~L~~q~  107 (155)
T TIGR00985        66 LRKVVREAVAELAKEPDPSDAEEKEAFFLEEVQLGEELLAQQ  107 (155)
T ss_pred             HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCC
T ss_conf             989999999876407899988789879999999877850489


No 8  
>PRK05462 S-adenosylmethionine decarboxylase; Provisional
Probab=30.88  E-value=30  Score=15.51  Aligned_cols=48  Identities=10%  Similarity=0.221  Sum_probs=31.7

Q ss_pred             HHHHHHCCCCHHHHHHHHHHH-----HHHHHHHHHHHCCCHHHHHHHHHHHHH
Q ss_conf             999862186144899999999-----999999999728414689999999999
Q 537021.9.peg.1  162 KELVNTFVPFQNLWYARGAFN-----HFVRNSIDDVLNPGGRARAEVYRQRQK  209 (218)
Q Consensus       162 ~k~v~~~~P~~NLWYtK~a~d-----~li~~qlqe~~~Pg~~~r~e~~~~R~~  209 (218)
                      +.++--|++-.|+|-||+...     .++|..=.+-++|.-.++.+.+++|+-
T Consensus       202 Y~miDvNvyQeNiFHTKM~lkefdl~nYLF~~~~~~ls~~e~~~i~~~L~~EM  254 (266)
T PRK05462        202 YDMIDVNVYQENIFHTKMLLKEFDLDNYLFNTKPEDLSAEERQEITALLKKEM  254 (266)
T ss_pred             CEEEEECCCCHHHHHHHHHHHHCCHHHCCCCCCCHHCCHHHHHHHHHHHHHHH
T ss_conf             21687424412468888887643722202379900289899999999999999


No 9  
>pfam02064 MAS20 MAS20 protein import receptor.
Probab=30.34  E-value=42  Score=14.73  Aligned_cols=15  Identities=7%  Similarity=0.080  Sum_probs=8.4

Q ss_pred             HHHHHHHHHHHHHHH
Q ss_conf             589999852233430
Q 537021.9.peg.1   99 TEYIKALINGITHYE  113 (218)
Q Consensus        99 ~fw~~a~l~Ggg~fg  113 (218)
                      .||.+-...|-.+..
T Consensus        91 ~fFmqqV~~GE~L~~  105 (184)
T pfam02064        91 ATFLENVEAGEILGM  105 (184)
T ss_pred             HHHHHHHHHHHHHHH
T ss_conf             999999998889871


No 10 
>KOG4056 consensus
Probab=29.56  E-value=42  Score=14.72  Aligned_cols=25  Identities=20%  Similarity=0.189  Sum_probs=14.2

Q ss_pred             CCCCCCCCCH----HHHHHHHHHHHHHHHCC
Q ss_conf             9877786523----58999985223343020
Q 537021.9.peg.1   89 KEPQLDFSDP----TEYIKALINGITHYERF  115 (218)
Q Consensus        89 kdP~~d~tdp----~fw~~a~l~Ggg~fgD~  115 (218)
                      +.|  |.+|+    +||++-...|--++.+-
T Consensus        67 ~~p--d~~d~~~~E~~Fmqqv~lGE~L~~qg   95 (143)
T KOG4056          67 NIP--DPSDAEEVEKFFMQQVQLGEELLAQG   95 (143)
T ss_pred             CCC--CCCCHHHHHHHHHHHHHHHHHHHHCC
T ss_conf             699--98989999999999998689999765


No 11 
>pfam08635 ox_reductase_C Putative oxidoreductase C terminal. This is the C terminal of a family of putative oxidoreductases.
Probab=28.78  E-value=34  Score=15.22  Aligned_cols=41  Identities=17%  Similarity=0.263  Sum_probs=24.3

Q ss_pred             HHHHHHHHHCCCCCCCCC------------CCHHHHHHHHHHH-----HHHHHCCCCC
Q ss_conf             999999985489877786------------5235899998522-----3343020000
Q 537021.9.peg.1   78 IRKTLVPLISGKEPQLDF------------SDPTEYIKALING-----ITHYERFSPF  118 (218)
Q Consensus        78 l~~qlk~LL~GkdP~~d~------------tdp~fw~~a~l~G-----gg~fgD~t~y  118 (218)
                      .+..+|++|++|+-+.+.            .+|--|.....+|     +.+|-|+.+|
T Consensus         5 ~Vq~ak~~l~~nnl~v~~t~ary~~aY~~~~kp~WW~k~~sgGpvVEQaTHf~DLsRY   62 (142)
T pfam08635         5 AVQKMKQIIKENNLTVMATNARYVSAYEHNAKPFWWNKSISGGPVVEQGTHFCDLSRY   62 (142)
T ss_pred             HHHHHHHHHHCCCCEEEEEEEEEEEEECCCCCHHHHHCCCCCCCEEEHHHHHHHHHHH
T ss_conf             9999999985498258865100034301469817876056689601036789899998


No 12 
>COG3962 Acetolactate synthase [Amino acid transport and metabolism]
Probab=27.50  E-value=41  Score=14.77  Aligned_cols=41  Identities=27%  Similarity=0.388  Sum_probs=28.9

Q ss_pred             CHHHHHHHHHHHHCCCCCCCCCCCH-HHHHHHHHHHHHHHHHHH
Q ss_conf             3347678654220132113687617-899999999863119999
Q 537021.9.peg.1    2 QEHARGSVGSTIQDKRWITGKDGSV-NNLARLMGQFLVMPISWS   44 (218)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~tg~~Gt~-ge~~R~~~QFK~fPis~~   44 (218)
                      |.++.|.|--++.+..-+..+.|+. |||-|.-.  -..|.+|-
T Consensus       397 q~~vigav~~~~~~~svvvcAAGsLPGdLhkLW~--~~~p~~YH  438 (617)
T COG3962         397 QTQVIGAVQRTISDDSVVVCAAGSLPGDLHKLWR--AGVPGTYH  438 (617)
T ss_pred             CHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHC--CCCCCCEE
T ss_conf             1357888885248885799837899678999860--48998542


No 13 
>TIGR03331 SAM_DCase_Eco S-adenosylmethionine decarboxylase proenzyme, Escherichia coli form. Members of this protein family are the single chain precursor of the S-adenosylmethionine decarboxylase as found in Escherichia coli. This form shows a substantially different architecture from the form shared by the Archaea, Bacillus, and many other species (TIGR03330). It shows little or no similarity to the form found in eukaryotes (TIGR00535).
Probab=27.35  E-value=34  Score=15.21  Aligned_cols=48  Identities=10%  Similarity=0.247  Sum_probs=33.5

Q ss_pred             HHHHHHCCCCHHHHHHHHHHH-----HHHHHHHHHHHCCCHHHHHHHHHHHHH
Q ss_conf             999862186144899999999-----999999999728414689999999999
Q 537021.9.peg.1  162 KELVNTFVPFQNLWYARGAFN-----HFVRNSIDDVLNPGGRARAEVYRQRQK  209 (218)
Q Consensus       162 ~k~v~~~~P~~NLWYtK~a~d-----~li~~qlqe~~~Pg~~~r~e~~~~R~~  209 (218)
                      +.++--|++-.|+|-||+...     .++|..=.+-+.|.-.++.+++++|+-
T Consensus       198 Y~~iDvNvyqeNiFHTKm~lke~dL~nYLF~~~~~~ls~~e~~~I~~~L~~EM  250 (259)
T TIGR03331       198 YQMIDVNVYQENIFHTKMMLKDFDLDNYLFGTAKEDLSPEERREITARLKKEM  250 (259)
T ss_pred             CCEEEECCCHHHHHHHHHHHHHCCHHHCCCCCCCHHCCHHHHHHHHHHHHHHH
T ss_conf             32587524304568888876315700014579802389999999999999999


No 14 
>PHA01752 hypothetical protein
Probab=25.75  E-value=43  Score=14.64  Aligned_cols=18  Identities=22%  Similarity=0.536  Sum_probs=13.5

Q ss_pred             CCCCHHHHHHHHHHHHHH
Q ss_conf             186144899999999999
Q 537021.9.peg.1  168 FVPFQNLWYARGAFNHFV  185 (218)
Q Consensus       168 ~~P~~NLWYtK~a~d~li  185 (218)
                      +--..|.||+|++-+-.|
T Consensus       355 NrKYsNiWYAKmVasA~i  372 (462)
T PHA01752        355 NKKYSNIWYAKMVASATI  372 (462)
T ss_pred             HHCCCCHHHHHHHHHHHH
T ss_conf             540010279999888899


No 15 
>pfam01090 Ribosomal_S19e Ribosomal protein S19e.
Probab=24.48  E-value=54  Score=14.13  Aligned_cols=12  Identities=8%  Similarity=-0.172  Sum_probs=5.8

Q ss_pred             HHHHHHHHHHHH
Q ss_conf             998999999998
Q 537021.9.peg.1   74 GEELIRKTLVPL   85 (218)
Q Consensus        74 l~Gal~~qlk~L   85 (218)
                      |.-+++..||.-
T Consensus        11 ~I~~lA~~LK~~   22 (140)
T pfam01090        11 LIKALAEFLKKS   22 (140)
T ss_pred             HHHHHHHHHHHC
T ss_conf             999999999647


No 16 
>TIGR00838 argH argininosuccinate lyase; InterPro: IPR009049   This entry represents argininosuccinate lyase, but may include examples of avian delta crystallins, in which argininosuccinate lyase activity may or may not be present and the biological role is to provide the optically clear cellular protein of the eye lens. ; GO: 0004056 argininosuccinate lyase activity, 0042450 arginine biosynthetic process via ornithine.
Probab=24.05  E-value=55  Score=14.08  Aligned_cols=27  Identities=33%  Similarity=0.662  Sum_probs=23.2

Q ss_pred             CCCCCCCCH-HHHHHHHHHHHHHHHHHH
Q ss_conf             113687617-899999999863119999
Q 537021.9.peg.1   18 WITGKDGSV-NNLARLMGQFLVMPISWS   44 (218)
Q Consensus        18 ~~tg~~Gt~-ge~~R~~~QFK~fPis~~   44 (218)
                      +|=||-|.+ |+|+=+|+-.|+-|.+|=
T Consensus       290 LiRGK~Grv~G~L~g~L~~lKalP~aYN  317 (469)
T TIGR00838       290 LIRGKTGRVIGNLTGLLTILKALPLAYN  317 (469)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHCCCCCC
T ss_conf             9987552689999999999711233235


No 17 
>TIGR02197 heptose_epim ADP-L-glycero-D-manno-heptose-6-epimerase; InterPro: IPR011912   Lipopolysaccharides (LPS) are glycolipids that consitutes the outer monolayer of the outer membranes of most Gram-negative bacteria . They consist of lipid A (endotoxin) which anchors LPS to the outer membrane, a non-repeating core oligosachharide, and an immunogenic O-antigen repeat polymer, which is an oligosaccharide of 1-40 units that variesbetween different strains of bacteria. Although the O-antigen and most of the core domain are not necessary for growth in the lab, they appear to help bacteria resist environmental stresses including the complement system and antibiotics.   This family consists of examples of ADP-L-glycero-D-mannoheptose-6-epimerase, an enzyme involved in biosynthesis of the inner core of LPS in Gram-negative bacteria . This enzyme is homologous to UDP-glucose 4-epimerase (IPR005886 from INTERPRO) and belongs to the NAD dependent epimerase/dehydratase family. It participates in the biosynthetic pathway leading to incorporation of heptose, a conserved sugar, into the core region of LPS, performing the reaction shown below: ADP-D-glycero-D-manno-heptose = ADP-L-glycero-D-manno-heptose It is a homopentameric enzyme with each monomer composed of two domains: an N-terminal modified Rossman fold domain for NADP binding, and a C-terminal substrate binding domain.; GO: 0008712 ADP-glyceromanno-heptose 6-epimerase activity, 0050661 NADP binding, 0005975 carbohydrate metabolic process.
Probab=22.92  E-value=45  Score=14.57  Aligned_cols=32  Identities=22%  Similarity=0.416  Sum_probs=22.5

Q ss_pred             HHHHHCCCCHHHHHHHHHHHHHHHHHHHH--HHC
Q ss_conf             99862186144899999999999999999--728
Q 537021.9.peg.1  163 ELVNTFVPFQNLWYARGAFNHFVRNSIDD--VLN  194 (218)
Q Consensus       163 k~v~~~~P~~NLWYtK~a~d~li~~qlqe--~~~  194 (218)
                      +-++.+-|..===|.|..||.++..+++.  -..
T Consensus       149 e~L~kLrPlN~YGySK~lFD~~v~~~~~~~~~~~  182 (353)
T TIGR02197       149 EELQKLRPLNVYGYSKFLFDQYVRRRVLPGEALS  182 (353)
T ss_pred             HHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCC
T ss_conf             8897518788612216789899999860124798


No 18 
>PHA00665 major capsid protein
Probab=22.24  E-value=36  Score=15.06  Aligned_cols=66  Identities=14%  Similarity=0.162  Sum_probs=45.5

Q ss_pred             HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HCCCCCCCCCCCHHHHHHHHHHHHH---HHHCC
Q ss_conf             99856303221236999999999999998999999998--5489877786523589999852233---43020
Q 537021.9.peg.1   48 LIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPL--ISGKEPQLDFSDPTEYIKALINGIT---HYERF  115 (218)
Q Consensus        48 ~~~r~~~~~g~~s~~~~aaaL~at~tl~Gal~~qlk~L--L~GkdP~~d~tdp~fw~~a~l~Ggg---~fgD~  115 (218)
                      .|+|-++  |+...-+..++-.-++.++.+.+.-=|+|  |+|++-..-.++-..|+.++-|.--   ||||.
T Consensus        64 ~wR~ln~--Gv~p~kstt~qv~d~~gmlE~~seVD~~La~lnGN~aafR~SEd~aflEgmnQ~~a~~lfYGd~  134 (329)
T PHA00665         64 TWRKLNY--GVQPEKSRTVQVTDSMGMLETYAEVDKALADLNGNTAAFRLSEDRAFIEGMNQTVAQTLFYGDS  134 (329)
T ss_pred             HHHHHHC--CCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHCCCHHHHHHCCCHHHHHHHHHHHHHHEEECCC
T ss_conf             7998617--8898775357741111678889887299998659867754223178999887998875250788


No 19 
>TIGR02523 type_IV_pilV type IV pilus modification protein PilV; InterPro: IPR013362    Pilus systems categorized as type IV pilins differ greatly from one another, with some showing greater similarity to type II or type III secretion systems than to each other. Members of this protein family represent the PilV protein of type IV pilus systems found in Pseudomonas aeruginosa PAO1, Pseudomonas syringae DC3000, Neisseria meningitidis MC58, Xylella fastidiosa 9a5c, and other species..
Probab=22.11  E-value=60  Score=13.86  Aligned_cols=29  Identities=17%  Similarity=0.024  Sum_probs=23.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHCCCCCC
Q ss_conf             99999999999989999999985489877
Q 537021.9.peg.1   64 RAKALVIGILGEELIRKTLVPLISGKEPQ   92 (218)
Q Consensus        64 ~aaaL~at~tl~Gal~~qlk~LL~GkdP~   92 (218)
                      =+|=|++++.++|.+.+|++.+-.=++=+
T Consensus        10 LvAlL~LaiGvLg~~alql~~~~~t~~A~   38 (163)
T TIGR02523        10 LVALLVLAIGVLGMAALQLKAVRYTKSAS   38 (163)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             99999999999999999986888768899


Done!