Query         gi|254780231|ref|YP_003064644.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter asiaticus str. psy62]
Match_columns 98
No_of_seqs    104 out of 559
Neff          5.4 
Searched_HMMs 39220
Date          Mon May 23 21:19:47 2011
Command       /home/congqian_1/programs/hhpred/hhsearch -i 254780231.hhm -d /home/congqian_1/database/cdd/Cdd.hhm 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PRK01310 hypothetical protein; 100.0 6.1E-28 1.6E-32  187.5  12.0   94    2-97     11-104 (106)
  2 PRK00647 hypothetical protein;  99.9 1.6E-25 4.1E-30  173.2   9.6   88    3-97      7-95  (96)
  3 PRK01530 hypothetical protein;  99.9 1.5E-24 3.7E-29  167.4  10.8   86    2-90     14-99  (105)
  4 COG1872 Uncharacterized conser  99.9 2.8E-24 7.1E-29  165.8  10.7   87    2-93     13-100 (102)
  5 PRK04021 hypothetical protein;  99.9 3.4E-24 8.7E-29  165.2  10.7   82    2-91      9-91  (92)
  6 PRK05090 hypothetical protein;  99.9 2.7E-24   7E-29  165.8  10.2   74    2-82     11-84  (95)
  7 pfam02594 DUF167 Uncharacteriz  99.9 2.4E-23 6.2E-28  160.2   9.6   74    2-81      5-78  (78)
  8 TIGR00251 TIGR00251 conserved   99.8   7E-21 1.8E-25  145.5   6.7   80    2-86      8-87  (91)
  9 KOG3276 consensus               99.7 3.7E-17 9.4E-22  123.3   8.9   86    2-94     35-122 (125)
 10 pfam08968 DUF1885 Domain of un  80.4     3.6 9.1E-05   22.2   4.6   40   28-71     78-122 (128)
 11 pfam01187 MIF Macrophage migra  76.9       7 0.00018   20.5   5.6   58    3-68     36-98  (114)
 12 pfam09581 Spore_III_AF Stage I  62.2     9.3 0.00024   19.7   3.2   28   40-67    157-184 (185)
 13 pfam02738 Ald_Xan_dh_C2 Molybd  58.6      17 0.00044   18.2   5.4   45   29-73    327-371 (543)
 14 cd02577 PSTD1 PSTD1: Pseudouri  54.1      20 0.00052   17.7   4.7   44   28-75     23-66  (319)
 15 cd02576 PseudoU_synth_ScPUS7 P  42.8      28 0.00072   16.9   3.1   43   41-84     37-83  (371)
 16 TIGR03313 Se_sel_red_Mo probab  41.9      32 0.00081   16.6   6.0   45   29-73    670-714 (951)
 17 TIGR03194 4hydrxCoA_A 4-hydrox  40.7      33 0.00084   16.4   4.9   46   29-74    454-499 (746)
 18 cd02552 PseudoU_synth_TruD_lik  40.6      31 0.00078   16.6   3.0   44   28-75     29-72  (232)
 19 pfam04551 GcpE GcpE protein. I  39.2      35 0.00089   16.3   3.4   62   29-93    222-283 (345)
 20 PRK09800 putative selenate red  38.1      36 0.00093   16.2   6.0   45   29-73    646-690 (928)
 21 KOG1759 consensus               38.0      28  0.0007   16.9   2.5   31   38-68     69-99  (115)
 22 PRK00984 truD tRNA pseudouridi  35.4      34 0.00087   16.4   2.6   44   28-75     40-83  (344)
 23 pfam01142 TruD tRNA pseudourid  35.2      41   0.001   15.9   3.1   43   28-74     39-81  (336)
 24 TIGR00713 hemL glutamate-1-sem  33.5      10 0.00026   19.5  -0.3   21   62-82    121-141 (434)
 25 PHA00659 V structural protein   31.3      40   0.001   16.0   2.4   26   37-63     74-99  (138)
 26 cd01804 midnolin_N midnolin_N   31.2      47  0.0012   15.5   3.2   39   29-71     11-49  (78)
 27 TIGR02416 CO_dehy_Mo_lg carbon  30.0      36 0.00091   16.2   2.0   39   33-73    494-532 (775)
 28 TIGR00747 fabH 3-oxoacyl-(acyl  29.3      39   0.001   16.0   2.1   25   43-67    255-279 (329)
 29 pfam02567 PhzC-PhzF Phenazine   28.1      54  0.0014   15.2   3.4   50   30-79    198-256 (280)
 30 PHA02512 minor structural prot  27.8      51  0.0013   15.3   2.5   26   37-62    313-338 (844)
 31 TIGR00151 ispF 2C-methyl-D-ery  27.3      54  0.0014   15.2   2.5   28   46-75    112-139 (159)
 32 TIGR01941 nqrF NADH:ubiquinone  26.1      36 0.00091   16.2   1.4   19   29-47    226-245 (425)
 33 COG0384 Predicted epimerase, P  25.5      60  0.0015   14.9   4.5   46   35-80    218-265 (291)
 34 pfam12116 SpoIIID Stage III sp  24.5      35  0.0009   16.3   1.1   20   46-65     18-37  (82)
 35 KOG4079 consensus               23.9      52  0.0013   15.3   1.9   19   41-59     91-109 (169)
 36 pfam01361 Tautomerase Tautomer  22.9      67  0.0017   14.6   2.8   27   45-71     19-45  (60)
 37 pfam12500 DUF3706 Protein of u  20.2      76  0.0019   14.3   2.9   30   43-72      5-34  (199)
 38 PRK12880 3-oxoacyl-(acyl carri  20.1      60  0.0015   14.9   1.6   21   44-64    269-289 (353)

No 1  
>PRK01310 hypothetical protein; Validated
Probab=99.95  E-value=6.1e-28  Score=187.54  Aligned_cols=94  Identities=34%  Similarity=0.474  Sum_probs=88.6

Q ss_pred             EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC
Q ss_conf             05999994186653023101456667776799999638767978899999999982998444799835577806999829
Q gi|254780231|r    2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK   81 (98)
Q Consensus         2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~   81 (98)
                      +.+.|||||+|++++|.|++.+.||.  .+|+|+|+|||+|||||++|++|||++|+||||+|+|++|++||+|+|+|+|
T Consensus        11 ~~l~Vrv~P~Asr~~i~G~~~d~dG~--~~LkvrV~ApP~dGkAN~al~~~LAk~lgvpks~v~i~~G~tSR~K~i~I~G   88 (106)
T PRK01310         11 LRVAVRVTPRGGRDAIDGIETLADGR--SVLKVRVRAIAEGGEANRAVTELLAKALGVPKSSVRVLSGATSRLKQVAIDG   88 (106)
T ss_pred             EEEEEEECCCCCCCCEEEEECCCCCC--EEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCCEEEEEEC
T ss_conf             89999985488756220030277776--0589999649987758999999999985998333899707877845999967


Q ss_pred             CHHHHHHHHHHHHHCC
Q ss_conf             9899999997523015
Q gi|254780231|r   82 DCKEITELLQNNDSLT   97 (98)
Q Consensus        82 ~~~~i~~~L~~~~s~t   97 (98)
                      +++++.+.|+...+.|
T Consensus        89 d~~~l~~~L~~l~~~~  104 (106)
T PRK01310         89 DPEDLGEALRALTAAK  104 (106)
T ss_pred             CHHHHHHHHHHHHCCC
T ss_conf             9999999999974469


No 2  
>PRK00647 hypothetical protein; Validated
Probab=99.93  E-value=1.6e-25  Score=173.15  Aligned_cols=88  Identities=32%  Similarity=0.468  Sum_probs=80.0

Q ss_pred             EEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECCC
Q ss_conf             59999941866530231014566677767999996387679788999999999829984447998355778069998299
Q gi|254780231|r    3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD   82 (98)
Q Consensus         3 ~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~~   82 (98)
                      -|.|+|+|+|++++|.|++       +++|+|+|+|||+|||||+||++|||++|++|||+|+|++|++||+|+|+|+++
T Consensus         7 il~V~VqP~A~r~~i~G~~-------~~~LkV~vtApPvdGKAN~ali~~LAk~l~vpks~I~Ii~G~tSR~K~I~I~~~   79 (96)
T PRK00647          7 ILEVKVTPKARENKIVGFE-------GGILKVRVTEVPEKGKANDAVIALLAKALSLPKRDVTLIAGETSRKKKVLLPRS   79 (96)
T ss_pred             EEEEEEECCCCCCCEECCC-------CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCCEEEECCCC
T ss_conf             9999986298736250002-------998999995599887189999999999969983447996067757338977996


Q ss_pred             -HHHHHHHHHHHHHCC
Q ss_conf             -899999997523015
Q gi|254780231|r   83 -CKEITELLQNNDSLT   97 (98)
Q Consensus        83 -~~~i~~~L~~~~s~t   97 (98)
                       ...+.+.+-+..|.|
T Consensus        80 ~~~~~~~~~~~~~~~~   95 (96)
T PRK00647         80 IKAIIFEQFPDVSSST   95 (96)
T ss_pred             CHHHHHHHCCCCCCCC
T ss_conf             3358887597644466


No 3  
>PRK01530 hypothetical protein; Reviewed
Probab=99.92  E-value=1.5e-24  Score=167.43  Aligned_cols=86  Identities=28%  Similarity=0.464  Sum_probs=77.0

Q ss_pred             EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC
Q ss_conf             05999994186653023101456667776799999638767978899999999982998444799835577806999829
Q gi|254780231|r    2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK   81 (98)
Q Consensus         2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~   81 (98)
                      +-|.|+|+|+|++|+|.|+....+   +++|+|+|+|||+|||||+|+++|||++|++|||+|+|++|+|||.|+++|++
T Consensus        14 vll~vkVkP~Ak~N~I~G~~~~~~---~~~LKv~VtApPedGKAN~ali~~LAk~l~l~kS~i~Iv~G~tsR~K~i~I~n   90 (105)
T PRK01530         14 ALLNLKVKPNAKQNLISNFVIINN---IPYLKLSIKAIPEQGKANEEIINYLAKEWKLSRSNIEIIKGHTHSLKTILIKN   90 (105)
T ss_pred             EEEEEEEECCCCCCCCCCEEECCC---CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEECCCCCCCEEEEEEC
T ss_conf             999999703867031133661179---87389999749988807899999999995998222799507777722899915


Q ss_pred             CHHHHHHHH
Q ss_conf             989999999
Q gi|254780231|r   82 DCKEITELL   90 (98)
Q Consensus        82 ~~~~i~~~L   90 (98)
                      ..++....+
T Consensus        91 i~~~~l~~i   99 (105)
T PRK01530         91 INEDYLNLI   99 (105)
T ss_pred             CCHHHHHHH
T ss_conf             798999999


No 4  
>COG1872 Uncharacterized conserved protein [Function unknown]
Probab=99.91  E-value=2.8e-24  Score=165.76  Aligned_cols=87  Identities=33%  Similarity=0.516  Sum_probs=79.8

Q ss_pred             EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC
Q ss_conf             05999994186653023101456667776799999638767978899999999982998444799835577806999829
Q gi|254780231|r    2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK   81 (98)
Q Consensus         2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~   81 (98)
                      +.|.|+|+|+|+++.|.|++     .|+++|+|+|+|||++||||++|++|||+.|++|||+|+|++|++||.|+++|.+
T Consensus        13 ~~l~V~V~P~a~~~~i~g~~-----~~~~~Lkv~i~apP~~GKAN~~li~~Lak~~~v~kS~V~ivsGetsR~K~v~i~~   87 (102)
T COG1872          13 VLLRVRVKPKAKRDSIVGLD-----EWRKRLKVRITAPPVDGKANEELIKFLAKTFGVPKSSVEIVSGETSRLKTVLIKN   87 (102)
T ss_pred             EEEEEEECCCCCCCCCCCEE-----CCCCEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCEEEEECC
T ss_conf             69999977788657533250-----6755599998469877613699999999995998541799846755530799548


Q ss_pred             -CHHHHHHHHHHH
Q ss_conf             -989999999752
Q gi|254780231|r   82 -DCKEITELLQNN   93 (98)
Q Consensus        82 -~~~~i~~~L~~~   93 (98)
                       +++++.+.+...
T Consensus        88 i~~d~~~~~~~~~  100 (102)
T COG1872          88 IDPDQLPERLAAL  100 (102)
T ss_pred             CCHHHHHHHHHHH
T ss_conf             8989999998864


No 5  
>PRK04021 hypothetical protein; Reviewed
Probab=99.91  E-value=3.4e-24  Score=165.22  Aligned_cols=82  Identities=26%  Similarity=0.416  Sum_probs=76.6

Q ss_pred             EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC
Q ss_conf             05999994186653023101456667776799999638767978899999999982998444799835577806999829
Q gi|254780231|r    2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK   81 (98)
Q Consensus         2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~   81 (98)
                      +-|.|+|+|||++++|.|++     .|+++|+|+++|||+|||||+|||+|||+.|+   |+|+|++|+|||+|+++|.|
T Consensus         9 v~l~v~vqP~A~~~~i~G~~-----~~~~~Lkv~l~ApPv~GkAN~ali~flak~lg---s~V~i~~G~tSR~K~v~i~g   80 (92)
T PRK04021          9 VIISVIVQPNAKENKIEGVD-----EWRGRIKVNIKAPPVKGKANKELIKFFSKLLG---AEVEIIRGETSREKDLLVKG   80 (92)
T ss_pred             EEEEEEEECCCCCCEEECCC-----CCCCEEEEEEECCCCCCHHHHHHHHHHHHHHC---CCEEEEECCCCCCCEEEEEC
T ss_conf             99999981099855460313-----77885899987799877689999999999859---97899705777853899906


Q ss_pred             -CHHHHHHHHH
Q ss_conf             -9899999997
Q gi|254780231|r   82 -DCKEITELLQ   91 (98)
Q Consensus        82 -~~~~i~~~L~   91 (98)
                       +.+++.++|+
T Consensus        81 is~e~v~~~L~   91 (92)
T PRK04021         81 ISLEEVKKKLK   91 (92)
T ss_pred             CCHHHHHHHHC
T ss_conf             99999999865


No 6  
>PRK05090 hypothetical protein; Validated
Probab=99.91  E-value=2.7e-24  Score=165.79  Aligned_cols=74  Identities=24%  Similarity=0.345  Sum_probs=69.8

Q ss_pred             EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC
Q ss_conf             05999994186653023101456667776799999638767978899999999982998444799835577806999829
Q gi|254780231|r    2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK   81 (98)
Q Consensus         2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~   81 (98)
                      +.|.|+|||||++++|.|++       ++.|+|+|+|||+||+||++||+|||++|+||||+|+|++|++||+|+|+|++
T Consensus        11 ~~l~V~V~P~A~r~~i~g~~-------~~~Lkv~v~ApPvdGkAN~ali~~LA~~l~vpks~V~i~~G~tsR~K~i~I~~   83 (95)
T PRK05090         11 LVLRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFLAKQFKVAKSQVVIEKGELGRHKQVRIIN   83 (95)
T ss_pred             EEEEEEEEECCCCCEECCCC-------CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCCEEEEEEC
T ss_conf             99999995288804580103-------99899999659978728999999999985997565899525876843999838


Q ss_pred             C
Q ss_conf             9
Q gi|254780231|r   82 D   82 (98)
Q Consensus        82 ~   82 (98)
                      +
T Consensus        84 p   84 (95)
T PRK05090         84 P   84 (95)
T ss_pred             C
T ss_conf             1


No 7  
>pfam02594 DUF167 Uncharacterized ACR, YggU family COG1872.
Probab=99.90  E-value=2.4e-23  Score=160.17  Aligned_cols=74  Identities=36%  Similarity=0.547  Sum_probs=69.1

Q ss_pred             EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC
Q ss_conf             05999994186653023101456667776799999638767978899999999982998444799835577806999829
Q gi|254780231|r    2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK   81 (98)
Q Consensus         2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~   81 (98)
                      +-|.|+|+|+|++++|.|++.      +++|+|+|+|||+|||||+||++|||++|++|||+|+|++|++||.|+|+|+|
T Consensus         5 v~i~i~V~P~a~~~~i~g~~~------~~~l~V~v~a~p~~GkAN~ali~~la~~l~v~ks~I~Iv~G~~sr~K~i~I~g   78 (78)
T pfam02594         5 VLLRVRVKPNAKKNSIVGVEE------WGELKVRITAPPVDGKANAELIKFLAKTLGVPKSDVEIVSGETSRSKVLLIEG   78 (78)
T ss_pred             EEEEEEEEECCCCCEECCCCC------CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEC
T ss_conf             999999974886251840269------84799999649977748999999999985997000899865877862899819


No 8  
>TIGR00251 TIGR00251 conserved hypothetical protein TIGR00251; InterPro: IPR005228    This family of conserved hypothetical proteins has no known function..
Probab=99.83  E-value=7e-21  Score=145.51  Aligned_cols=80  Identities=24%  Similarity=0.383  Sum_probs=74.0

Q ss_pred             EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC
Q ss_conf             05999994186653023101456667776799999638767978899999999982998444799835577806999829
Q gi|254780231|r    2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK   81 (98)
Q Consensus         2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~   81 (98)
                      .-|.|+|+|+|+|+.|.|++     .|+.+++|+++|||.|||||.+|++||.+.|+..||+|+|++|+.||+|+|+|.+
T Consensus         8 ll~~~~v~p~a~k~~i~g~~-----eWR~~v~v~I~aPp~~GkAN~el~K~f~~~F~~aks~VeivsG~~~R~K~iki~~   82 (91)
T TIGR00251         8 LLIRIEVQPKASKDSIVGYN-----EWRKRVEVKIKAPPKEGKANRELIKFFKEIFGKAKSDVEIVSGELSRQKTIKIKN   82 (91)
T ss_pred             EEEEEEECCCCCCCCCCCCC-----CCCCEEEEEEECCCCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEEECC
T ss_conf             58998866777865147824-----5311467877077988840158898888644734330788606228831687046


Q ss_pred             CHHHH
Q ss_conf             98999
Q gi|254780231|r   82 DCKEI   86 (98)
Q Consensus        82 ~~~~i   86 (98)
                      ...|+
T Consensus        83 ~~~~~   87 (91)
T TIGR00251        83 IKRDI   87 (91)
T ss_pred             CCCCC
T ss_conf             47211


No 9  
>KOG3276 consensus
Probab=99.71  E-value=3.7e-17  Score=123.34  Aligned_cols=86  Identities=28%  Similarity=0.427  Sum_probs=76.6

Q ss_pred             EEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECC
Q ss_conf             05999994186653023101456667776799999638767978899999999982998444799835577806999829
Q gi|254780231|r    2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK   81 (98)
Q Consensus         2 ~~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~   81 (98)
                      ++|+|+++|+|++++|+++.       .+.+.|.|.|||.+|+||+||++||++.|++++|++++.+|++||.|++.|+.
T Consensus        35 V~i~IhakpgaK~s~It~v~-------~e~V~V~IaApp~eGeANaeLl~ylskvLgLRksdv~ldkG~kSrsKvv~i~~  107 (125)
T KOG3276          35 VQIAIHAKPGAKQSAITDVG-------DEAVGVAIAAPPREGEANAELLEYLSKVLGLRKSDVTLDKGWKSRSKVVVIED  107 (125)
T ss_pred             EEEEEEECCCCCCCCEEECC-------CCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHEEECCCCCCCCEEEEEEC
T ss_conf             89999846874312255126-------43104577359766533699999999986012431365036565432899962


Q ss_pred             --CHHHHHHHHHHHH
Q ss_conf             --9899999997523
Q gi|254780231|r   82 --DCKEITELLQNND   94 (98)
Q Consensus        82 --~~~~i~~~L~~~~   94 (98)
                        ...+..+.|+...
T Consensus       108 ~~~~~e~i~~l~ak~  122 (125)
T KOG3276         108 LSATQEYIELLEAKV  122 (125)
T ss_pred             CCCCHHHHHHHHCCC
T ss_conf             744099999974458


No 10 
>pfam08968 DUF1885 Domain of unknown function (DUF1885). This domain is found in a set of hypothetical proteins produced by bacteria of the Bacillus genus.
Probab=80.40  E-value=3.6  Score=22.21  Aligned_cols=40  Identities=33%  Similarity=0.535  Sum_probs=27.8

Q ss_pred             CCCEEEEEEECCC-----CCCHHHHHHHHHHHHHHCCCHHHEEEEECCC
Q ss_conf             7767999996387-----6797889999999998299844479983557
Q gi|254780231|r   28 DTIHMKIKVTATP-----QKGKANKAMLAMLAKKLALSKSSLRMLSKQS   71 (98)
Q Consensus        28 ~~~~lkv~v~ApP-----v~GkAN~ali~~Lak~l~v~ks~I~I~~G~~   71 (98)
                      .++.--|.|.-||     .+|||| |.|+||||.|.   ....+-.|.+
T Consensus        78 e~~~sfIqi~LP~~AT~GDKgKAN-EfckfLAK~l~---geL~LFNGRt  122 (128)
T pfam08968        78 EGEQTFIQISLPPTATFGDKGKAN-EFSKFLAKKLE---GELQLFNGRT  122 (128)
T ss_pred             CCCCEEEEEECCCCCCCCCCHHHH-HHHHHHHHHHH---CEEEEECCEE
T ss_conf             787157999889998777622276-89999999741---0157754707


No 11 
>pfam01187 MIF Macrophage migration inhibitory factor (MIF).
Probab=76.94  E-value=7  Score=20.47  Aligned_cols=58  Identities=14%  Similarity=0.149  Sum_probs=38.7

Q ss_pred             EEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEECCC-CCC----HHHHHHHHHHHHHHCCCHHHEEEEE
Q ss_conf             59999941866530231014566677767999996387-679----7889999999998299844479983
Q gi|254780231|r    3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATP-QKG----KANKAMLAMLAKKLALSKSSLRMLS   68 (98)
Q Consensus         3 ~i~v~V~P~a~~~~I~g~~~~~d~~~~~~lkv~v~ApP-v~G----kAN~ali~~Lak~l~v~ks~I~I~~   68 (98)
                      -|.|.|.|+..-.- .       |..+.+.-+.|++-- .+.    +--+++++||.+.|+||+.+|-|..
T Consensus        36 yv~V~v~~~~~m~f-g-------Gs~eP~a~~~l~siG~l~~~~n~~~s~~l~~~l~~~LgI~~~RiyI~F   98 (114)
T pfam01187        36 RIAVHIRPGQAMVF-G-------GSTDPCAVCSIKSIGVVGAEQNRSHSALLFKFLTKELGLPKDRIYIRF   98 (114)
T ss_pred             EEEEEEECCCEEEE-C-------CCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEE
T ss_conf             29999928971896-6-------889977999999867899899999999999999986297946499999


No 12 
>pfam09581 Spore_III_AF Stage III sporulation protein AF (Spore_III_AF). This family represents the stage III sporulation protein AF (Spore_III_AF) of the bacterial endospore formation program, which exists in some but not all members of the Firmicutes (formerly called low-GC Gram-positives). The C-terminal region of these proteins is poorly conserved.
Probab=62.16  E-value=9.3  Score=19.73  Aligned_cols=28  Identities=11%  Similarity=0.306  Sum_probs=24.1

Q ss_pred             CCCCHHHHHHHHHHHHHHCCCHHHEEEE
Q ss_conf             7679788999999999829984447998
Q gi|254780231|r   40 PQKGKANKAMLAMLAKKLALSKSSLRML   67 (98)
Q Consensus        40 Pv~GkAN~ali~~Lak~l~v~ks~I~I~   67 (98)
                      ..+..-++.+..+||..|++|+.+|++.
T Consensus       157 ~~~~~~~~~I~~~la~~~~I~~e~I~V~  184 (185)
T pfam09581       157 EEESEEAEKIKNFLADEYNIPKEQIEVY  184 (185)
T ss_pred             CCCHHHHHHHHHHHHHHHCCCHHHCEEE
T ss_conf             2004679999999998839989991772


No 13 
>pfam02738 Ald_Xan_dh_C2 Molybdopterin-binding domain of aldehyde dehydrogenase.
Probab=58.62  E-value=17  Score=18.16  Aligned_cols=45  Identities=9%  Similarity=0.158  Sum_probs=39.2

Q ss_pred             CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCC
Q ss_conf             767999996387679788999999999829984447998355778
Q gi|254780231|r   29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSP   73 (98)
Q Consensus        29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr   73 (98)
                      ++.+.|.+...+..-=...-+..+.|+.|++|..+|+++.|-|.+
T Consensus       327 dG~v~v~~g~~~~GqG~~T~~~qivAe~LGi~~d~V~v~~gDT~~  371 (543)
T pfam02738       327 DGSVTVSTGGIEIGQGLETKVAQIAAEELGIPLDDIRVISGDTDK  371 (543)
T ss_pred             CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCHHHEEEECCCCCC
T ss_conf             997788734667788616589999999959986779997278887


No 14 
>cd02577 PSTD1 PSTD1: Pseudouridine synthase, a subgroup of the TruD family. This group consists of several hypothetical archeal pseudouridine synthases assigned to the TruD family of psuedouridine synthases.  Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi).  The TruD family is comprised of proteins related to Escherichia coli TruD.
Probab=54.11  E-value=20  Score=17.71  Aligned_cols=44  Identities=11%  Similarity=0.095  Sum_probs=35.9

Q ss_pred             CCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEE
Q ss_conf             776799999638767978899999999982998444799835577806
Q gi|254780231|r   28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLK   75 (98)
Q Consensus        28 ~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K   75 (98)
                      .|+++-+.|.   -.|.-...+++.||+.++||.++|.. .|.+.|+=
T Consensus        23 ~Geh~~~~i~---K~g~~T~~~~~~LAr~lgv~~~~igy-AGlKDk~A   66 (319)
T cd02577          23 KGKYLIYLLE---KKNWDTLDAVRRIAKALGISRKRIGY-AGTKDKHA   66 (319)
T ss_pred             CCCEEEEEEE---ECCCCHHHHHHHHHHHHCCCHHHEEE-CCCCCCCC
T ss_conf             8788999999---88999999999999982998777515-43466752


No 15 
>cd02576 PseudoU_synth_ScPUS7 PseudoU_synth_ScPUS7: Pseudouridine synthase, TruD family. This group consists of eukaryotic pseudouridine synthases similar to Saccharomyces cerevisiae Pus7.  Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi).  Saccharomyces cerevisiae Pus7 makes psi35 in U2 small nuclear RNA (U2 snRNA), psi13 in cytoplasmic tRNAs and psi35 in pre-tRNATyr. Psi35 in yeast U2 snRNA and psi13 in tRNAs are highly phylogenetically conserved.  Psi34 is the mammalian U2 snRNA counterpart of yeast U2 snRNA psi35.
Probab=42.77  E-value=28  Score=16.86  Aligned_cols=43  Identities=14%  Similarity=0.157  Sum_probs=33.0

Q ss_pred             CCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEE----EEEECCCHH
Q ss_conf             67978899999999982998444799835577806----999829989
Q gi|254780231|r   41 QKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLK----IIYIDKDCK   84 (98)
Q Consensus        41 v~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K----~i~I~~~~~   84 (98)
                      -.|.-..++++.||+.+++|.++|.. +|.+.|+=    .+.|.+..+
T Consensus        37 K~~~~T~~a~~~lak~~gi~~~~igy-AGlKDk~AvT~Q~~Sv~~~~~   83 (371)
T cd02576          37 KENKDTMEAINKIAKLLRVKPSDFSY-AGTKDKRAVTVQRVSVKKVTA   83 (371)
T ss_pred             ECCCCHHHHHHHHHHHHCCCHHHEEE-CCCCCCCEEEEEEEEEECCCH
T ss_conf             88999999999999983997788427-636448777899999978886


No 16 
>TIGR03313 Se_sel_red_Mo probable selenate reductase, molybdenum-binding subunit. Our comparative genomics suggests this protein family to be a subunit of a selenium-dependent molybdenum hydroxylase, although the substrate is not specified. This protein is suggested by Bebien, et al., to be the molybdenum-binding subunit of a molydbopterin-containing selenate reductase. Xi, et al, however, show that mutation of this gene in E. coli conferred sensitivity to adenine, suggesting a defect in purine interconversion. This finding, plus homology of nearby genes in a 23-gene purine catabolism region in E. coli to xanthine dehydrogase subunits suggests xanthine dehydrogenase activity.
Probab=41.91  E-value=32  Score=16.56  Aligned_cols=45  Identities=7%  Similarity=0.051  Sum_probs=39.2

Q ss_pred             CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCC
Q ss_conf             767999996387679788999999999829984447998355778
Q gi|254780231|r   29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSP   73 (98)
Q Consensus        29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr   73 (98)
                      ++.+.|++.+....-=.+..+..+.|+.|++|..+|+++.|-|+.
T Consensus       670 DG~v~v~~g~~d~GQG~~T~~aQiaAe~LGi~~d~I~v~~~DT~~  714 (951)
T TIGR03313       670 DGTFIVHSGGADIGTGLDTVVSKLTAEVLHCPMDDVHVISGDTDH  714 (951)
T ss_pred             CCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCHHHEEEEECCCCC
T ss_conf             977999978978887789999999999968984539999679888


No 17 
>TIGR03194 4hydrxCoA_A 4-hydroxybenzoyl-CoA reductase, alpha subunit. This model represents the largest chain, alpha, of the enzyme 4-hydroxybenzoyl-CoA reductase. In species capable of degrading various aromatic compounds by way of benzoyl-CoA, this enzyme can convert 4-hydroxybenzoyl-CoA to benzoyl-CoA.
Probab=40.72  E-value=33  Score=16.44  Aligned_cols=46  Identities=15%  Similarity=0.107  Sum_probs=39.6

Q ss_pred             CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCE
Q ss_conf             7679999963876797889999999998299844479983557780
Q gi|254780231|r   29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPL   74 (98)
Q Consensus        29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~   74 (98)
                      ++.+.|........-=.+..+....|+.|++|..+|.++.|-|...
T Consensus       454 dGsv~v~~g~~e~GqG~~T~~~qiaAe~LGip~e~V~v~~~DT~~~  499 (746)
T TIGR03194       454 DGGITLLTGAADIGQGSSTIASQVAAEVLGVRLSRIRVISADSALT  499 (746)
T ss_pred             CCEEEEEECCCCCCCCCCHHHHHHHHHHCCCCHHHEEEECCCCCCC
T ss_conf             9769999786446888533899999872299889969973678878


No 18 
>cd02552 PseudoU_synth_TruD_like PseudoU_synth_TruD_like: Pseudouridine synthase, TruD family. This group consists of eukaryotic, bacterial and archeal pseudouridine synthases similar to Escherichia coli TruD and Saccharomyces cerevisiae Pus7.  Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi).  E. coli TruD and S. cerevisiae Pus7 make psi13 in cytoplasmic tRNAs. In addition S. cerevisiae Pus7 makes psi35 in U2 small nuclear RNA (U2 snRNA) and psi35 in pre-tRNATyr.  Psi35 in U2 snRNA and psi13 in tRNAs are highly phylogenetically conserved.  Psi34 is the mammalian U2 snRNA counterpart of yeast U2 snRNA psi35.
Probab=40.65  E-value=31  Score=16.65  Aligned_cols=44  Identities=11%  Similarity=0.141  Sum_probs=36.3

Q ss_pred             CCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEE
Q ss_conf             776799999638767978899999999982998444799835577806
Q gi|254780231|r   28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLK   75 (98)
Q Consensus        28 ~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K   75 (98)
                      .|+++.+.|.-   .|.-..+++..||+.++||.++|.. .|.+.|+=
T Consensus        29 ~G~~~~~~l~K---~~~~T~~a~~~lA~~l~i~~~~igy-AG~KDr~A   72 (232)
T cd02552          29 EGEYLHFTLYK---ENKDTMEALREIAKALGVPPRDIGY-AGTKDKRA   72 (232)
T ss_pred             CCCEEEEEEEE---CCCCHHHHHHHHHHHHCCCHHHEEE-CCCCCCCE
T ss_conf             77889999998---7899999999999981998666304-43466761


No 19 
>pfam04551 GcpE GcpE protein. In a variety of organisms, including plants and several eubacteria, isoprenoids are synthesized by the mevalonate-independent 2-C-methyl-D-erythritol 4-phosphate (MEP) pathway. Although different enzymes of this pathway have been described, the terminal biosynthetic steps of the MEP pathway have not been fully elucidated. GcpE gene of Escherichia coli is involved in this pathway.
Probab=39.17  E-value=35  Score=16.30  Aligned_cols=62  Identities=18%  Similarity=0.236  Sum_probs=44.3

Q ss_pred             CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEEEEEECCCHHHHHHHHHHH
Q ss_conf             76799999638767978899999999982998444799835577806999829989999999752
Q gi|254780231|r   29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELLQNN   93 (98)
Q Consensus        29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K~i~I~~~~~~i~~~L~~~   93 (98)
                      |+.++|++++.|++   --.+..-+=+.|++++..+++++=-+--.-.+.+....+++.+.|+..
T Consensus       222 GDTIRVSLt~dP~~---EV~v~~~IL~sl~lR~~g~~iISCPtCGR~~~dl~~~a~~ve~~l~~~  283 (345)
T pfam04551       222 GDTIRVSLTEDPVE---EVKVAFEILQSLGLRKRGVEIISCPTCGRTLFDLIKVAKEVEERLSHL  283 (345)
T ss_pred             CCEEEEECCCCCHH---HHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHCC
T ss_conf             65399967899547---899999999972877578648758874661155999999999997379


No 20 
>PRK09800 putative selenate reductase subunit YgfN; Provisional
Probab=38.09  E-value=36  Score=16.20  Aligned_cols=45  Identities=7%  Similarity=0.062  Sum_probs=39.2

Q ss_pred             CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCC
Q ss_conf             767999996387679788999999999829984447998355778
Q gi|254780231|r   29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSP   73 (98)
Q Consensus        29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr   73 (98)
                      ++.+.|++.+....-=.+..+....|+.|++|..+|+++.|-|+.
T Consensus       646 DG~v~v~~g~~d~GQG~~T~~aQiaAe~LGip~d~I~v~~~DT~~  690 (928)
T PRK09800        646 DGTFIVHSGGADIGTGLDTVVTKLAAEVLHCPPQDVHVISGDTDH  690 (928)
T ss_pred             CCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCHHHEEEEECCCCC
T ss_conf             977999978978676788999999999968985559999579888


No 21 
>KOG1759 consensus
Probab=37.99  E-value=28  Score=16.92  Aligned_cols=31  Identities=16%  Similarity=0.184  Sum_probs=21.7

Q ss_pred             CCCCCCHHHHHHHHHHHHHHCCCHHHEEEEE
Q ss_conf             3876797889999999998299844479983
Q gi|254780231|r   38 ATPQKGKANKAMLAMLAKKLALSKSSLRMLS   68 (98)
Q Consensus        38 ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~   68 (98)
                      .+-+..+=-+++++||.++|+++++++-|-.
T Consensus        69 ~~~~N~~~sa~l~~il~~~L~l~~~rv~I~f   99 (115)
T KOG1759          69 GAIVNRSYSAALTEILEKELSLDPDRVYIKF   99 (115)
T ss_pred             CHHHHHHHHHHHHHHHHHHHCCCCCEEEEEE
T ss_conf             8667188999999999987088978699999


No 22 
>PRK00984 truD tRNA pseudouridine synthase D; Reviewed
Probab=35.43  E-value=34  Score=16.37  Aligned_cols=44  Identities=16%  Similarity=0.202  Sum_probs=35.8

Q ss_pred             CCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCEE
Q ss_conf             776799999638767978899999999982998444799835577806
Q gi|254780231|r   28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLK   75 (98)
Q Consensus        28 ~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~K   75 (98)
                      .|+++-+.|.-   .|.-..++++.||+.++||.++|.. +|.+.|+=
T Consensus        40 ~G~~~~~~l~K---~~~~T~~~~~~lA~~l~i~~~~igy-aGlKDr~A   83 (344)
T PRK00984         40 EGEHLLLRIRK---RGWNTLFAARELAKFLGISRREVSY-AGLKDRHA   83 (344)
T ss_pred             CCCEEEEEEEE---CCCCHHHHHHHHHHHCCCCHHHCCC-CCCCCCCC
T ss_conf             87889999998---8999999999999980998888242-56666750


No 23 
>pfam01142 TruD tRNA pseudouridine synthase D (TruD). TruD is responsible for synthesis of pseudouridine from uracil-13 in transfer RNAs. The structure of TruD reveals an overall V-shaped molecule which contains an RNA-binding cleft.
Probab=35.20  E-value=41  Score=15.92  Aligned_cols=43  Identities=14%  Similarity=0.210  Sum_probs=32.7

Q ss_pred             CCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCCE
Q ss_conf             77679999963876797889999999998299844479983557780
Q gi|254780231|r   28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPL   74 (98)
Q Consensus        28 ~~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr~   74 (98)
                      .|+++-+.+.-   .|.-..+++..||+.|+++.++|.. .|.+.|+
T Consensus        39 ~G~~~~~~l~K---~~~~T~~a~~~lAk~l~i~~~~i~y-aGlKDk~   81 (336)
T pfam01142        39 EGEHLILRIEK---RGWNTLDVARELARFLGISPRDVGY-AGLKDRH   81 (336)
T ss_pred             CCCEEEEEEEE---CCCCHHHHHHHHHHHHCCCHHHEEE-CCCCCCC
T ss_conf             87889999998---6899999999999982998888124-5566575


No 24 
>TIGR00713 hemL glutamate-1-semialdehyde-2,1-aminomutase; InterPro: IPR004639   Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway . The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin .   The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA.   The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III.   Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) .   This entry represents glutamate-1-semialdehyde (GSA) aminotransferase (5.4.3.8 from EC), which catalyse a transamination reaction to produce 5-aminoaevulinic acid during the first stage of tetrapyrrole biosynthesis by the C5 pathway , . This enzyme has a high degree of similarity to amino acid transaminases, and is classed as a class III aminotransferase.; GO: 0042286 glutamate-1-semialdehyde 21-aminomutase activity, 0033014 tetrapyrrole biosynthetic process.
Probab=33.45  E-value=10  Score=19.46  Aligned_cols=21  Identities=24%  Similarity=0.208  Sum_probs=18.0

Q ss_pred             HHEEEEECCCCCEEEEEECCC
Q ss_conf             447998355778069998299
Q gi|254780231|r   62 SSLRMLSKQSSPLKIIYIDKD   82 (98)
Q Consensus        62 s~I~I~~G~~sr~K~i~I~~~   82 (98)
                      |.|+|-+|=|+|+|+|..+|.
T Consensus       121 sAvRLARG~TGR~~IiKFeGc  141 (434)
T TIGR00713       121 SAVRLARGFTGRDKIIKFEGC  141 (434)
T ss_pred             HHHHHHHHCCCCCCEEEECCC
T ss_conf             999986310588727886176


No 25 
>PHA00659 V structural protein P5
Probab=31.31  E-value=40  Score=15.96  Aligned_cols=26  Identities=12%  Similarity=0.245  Sum_probs=21.9

Q ss_pred             ECCCCCCHHHHHHHHHHHHHHCCCHHH
Q ss_conf             638767978899999999982998444
Q gi|254780231|r   37 TATPQKGKANKAMLAMLAKKLALSKSS   63 (98)
Q Consensus        37 ~ApPv~GkAN~ali~~Lak~l~v~ks~   63 (98)
                      =|||.|+- .++-+.+.|+.+||++.+
T Consensus        74 wAP~~ENn-T~aYi~~Va~~~Gv~~~~   99 (138)
T PHA00659         74 WAPSNEND-TRAYATAVARAMGVPPQA   99 (138)
T ss_pred             HCCCCCCC-HHHHHHHHHHHHCCCCCC
T ss_conf             19998768-999999999986879987


No 26 
>cd01804 midnolin_N midnolin_N   Midnolin (midbrain nucleolar protein) is expressed in the nucleolus and is thought to regulate genes involved in neurogenesis.  Midnolin contains an amino-terminal ubiquitin-like domain.
Probab=31.18  E-value=47  Score=15.52  Aligned_cols=39  Identities=15%  Similarity=0.418  Sum_probs=28.2

Q ss_pred             CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCC
Q ss_conf             7679999963876797889999999998299844479983557
Q gi|254780231|r   29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQS   71 (98)
Q Consensus        29 ~~~lkv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~   71 (98)
                      |+.+.+.|  +|.  ..=+.+-+.+|+.+++||.+|.++.-++
T Consensus        11 G~~f~l~V--~~~--etVe~LKk~IskrLkvpkdRi~Llhre~   49 (78)
T cd01804          11 GTRFDLSV--PPD--ETVEGLKKRISQRLKVPKERLALLHRET   49 (78)
T ss_pred             CCCEEEEE--CCC--CCHHHHHHHHHHHHCCCHHHEEEEEEEE
T ss_conf             87468870--765--4089999999988489867746775411


No 27 
>TIGR02416 CO_dehy_Mo_lg carbon-monoxide dehydrogenase, large subunit; InterPro: IPR012780   Carbon monoxide dehydrogenases catalyse the reversible oxidation of carbon monoxide to carbon dioxide as shown below , : CO + H(2)O + A == CO(2) + AH(2) A variety of electron acceptors can be used by these enzymes including ferredoxin, methyl viologen and benzyl viologen. Under anaerobic conditions, carbon monoxide is utilised by a number of methanogenic archaea and acetogenic, sulphate-reducing or phototrophic bacteria. Under aerobic conditions carbon monoxide can be used as sole carbon and energy source by the carboxidotrophic bacteria, a taxonomically diverse group of obligate or facultative chemolithoautotrophic species. The anaerobic dehydrogenases are mostly nickel-containing iron-sulphur enzymes, while the The aerobic enzymes categorised so far are molybdenum-containing iron-sulphur proteins.   This entry represents the large subunit of the molybdenum-containing carbon monoxide dehydrogenase found in aerobic bacteria. This enzyme is a dimer of heterotrimers, where each heterotrimer consists of a small iron-sulphur subunit, a medium FAD-containing flavoprotein subunit and a large molybdopterin-containign subunit which contains the active site , . The large subunit is divided into two domains. The N-terminal domain is composed of mainly of a five-and four-stranded mixed beta-sheet and interacts with the small and medium subunits. The C-terminal domain contains a region which interacts with its counterpart in the overall dimer, providing the main dimer contact, and a beta-sheet surrounded by alpha helices. The active site located in this subunit contains a novel dinuclear heterometal [CuSMo(=O)OH] cluster.; GO: 0005507 copper ion binding, 0018492 carbon-monoxide dehydrogenase (acceptor) activity, 0030151 molybdenum ion binding.
Probab=29.97  E-value=36  Score=16.25  Aligned_cols=39  Identities=8%  Similarity=0.212  Sum_probs=27.4

Q ss_pred             EEEEECCCCCCHHHHHHHHHHHHHHCCCHHHEEEEECCCCC
Q ss_conf             99996387679788999999999829984447998355778
Q gi|254780231|r   33 KIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSP   73 (98)
Q Consensus        33 kv~v~ApPv~GkAN~ali~~Lak~l~v~ks~I~I~~G~~sr   73 (98)
                      +..+.++=+.-  ..---..+|.+||||..+|.|..|.|+.
T Consensus       494 R~g~~sQGQGH--~TTyAQIiA~ELGIP~e~I~v~~GdTd~  532 (775)
T TIGR02416       494 RLGTKSQGQGH--ETTYAQIIATELGIPAEDIDVEEGDTDT  532 (775)
T ss_pred             EEECCCCCCCC--CCHHHHHHHHHCCCCCCCEEEECCCCCC
T ss_conf             86302369864--3046666555527880110672078788


No 28 
>TIGR00747 fabH 3-oxoacyl-(acyl-carrier-protein) synthase III; InterPro: IPR004655   Beta-ketoacyl-acyl carrier protein synthase III (FabH), in general, initiates elongation in type II fatty acid synthase systems found in bacteria and plants. It is responsible for producing the multitude of fatty acid structures found in bacterial membranes . The two members of this subfamily from Bacillus subtilis differ from each other, and from FabH from Escherichia coli, in acyl group specificity. Active site residues include Cys112, His244 and Asn274 of Escherichia coli FabH. Cys-112 is the site of acyl group attachment. ; GO: 0004315 3-oxoacyl-[acyl-carrier-protein] synthase activity, 0006633 fatty acid biosynthetic process.
Probab=29.29  E-value=39  Score=16.00  Aligned_cols=25  Identities=32%  Similarity=0.390  Sum_probs=21.7

Q ss_pred             CHHHHHHHHHHHHHHCCCHHHEEEE
Q ss_conf             9788999999999829984447998
Q gi|254780231|r   43 GKANKAMLAMLAKKLALSKSSLRML   67 (98)
Q Consensus        43 GkAN~ali~~Lak~l~v~ks~I~I~   67 (98)
                      =.||..+++.++|.|.++.++|-..
T Consensus       255 HQAN~RIi~a~ak~L~~~~~~Vv~T  279 (329)
T TIGR00747       255 HQANIRIIEALAKRLELKESKVVKT  279 (329)
T ss_pred             CCHHHHHHHHHHHHHCCCCCCEEEE
T ss_conf             4111689999999708980316863


No 29 
>pfam02567 PhzC-PhzF Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family also contains a putative thymidilate synthase from Mycobacterium tuberculosis, though there is no significant sequence similarity to pfam00303 members. This family appears to be distantly related to pfam01678, including containing a weak internal duplication. However members of this family do not contain the conserved cysteines that are hypothesized to be active site residues (Bateman A pers obs).
Probab=28.06  E-value=54  Score=15.20  Aligned_cols=50  Identities=16%  Similarity=0.171  Sum_probs=34.1

Q ss_pred             CEEEEEEE-------CCCCCCHHHHHHHHHHHHHHCCCHH-HEEEEECCCC-CEEEEEE
Q ss_conf             67999996-------3876797889999999998299844-4799835577-8069998
Q gi|254780231|r   30 IHMKIKVT-------ATPQKGKANKAMLAMLAKKLALSKS-SLRMLSKQSS-PLKIIYI   79 (98)
Q Consensus        30 ~~lkv~v~-------ApPv~GkAN~ali~~Lak~l~v~ks-~I~I~~G~~s-r~K~i~I   79 (98)
                      ..+..|.=       +=|+-|-||-+|..||++.++.++. .+.|..|+.+ |.=.+.+
T Consensus       198 ~~~~~R~FaP~~Gi~EDpaTGSA~~aLa~yl~~~~~~~~~~~~~i~QG~~~gR~g~i~v  256 (280)
T pfam02567       198 RDYHARMFAPALGIVEDPATGSAAGALGAYLSRHGNKPQFGNIAIRQGQASGRPGTLEV  256 (280)
T ss_pred             CEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCEEEEE
T ss_conf             56999816456797776762565799999999846998774599988736089989999


No 30 
>PHA02512 minor structural protein; Provisional
Probab=27.78  E-value=51  Score=15.31  Aligned_cols=26  Identities=19%  Similarity=0.299  Sum_probs=23.7

Q ss_pred             ECCCCCCHHHHHHHHHHHHHHCCCHH
Q ss_conf             63876797889999999998299844
Q gi|254780231|r   37 TATPQKGKANKAMLAMLAKKLALSKS   62 (98)
Q Consensus        37 ~ApPv~GkAN~ali~~Lak~l~v~ks   62 (98)
                      =|||.|+--.++-|+|.++.|+|+..
T Consensus       313 WAPp~D~NDT~aYI~~Vs~~LgV~~d  338 (844)
T PHA02512        313 WAPPADKNDTDAYIKAVCAQLGVSAD  338 (844)
T ss_pred             CCCCCCCCCHHHHHHHHHHHHCCCCC
T ss_conf             48998887489999999998299989


No 31 
>TIGR00151 ispF 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; InterPro: IPR003526   This entry represents MECDP (2-C-methyl-D-erythritol 2,4-cyclodiphosphate) synthetase, an enzyme in the non-mevalonate pathway of isoprenoid synthesis, isoprenoids being essential in all organisms. Isoprenoids can also be synthesized through the mevalonate pathway. The non-mevolante route is used by many bacteria and human pathogens, including Mycobacterium tuberculosis and Plasmodium falciparum. This route appears to involve seven enzymes. MECDP synthetase catalyses the intramolecular attack by a phosphate group on a diphosphate, with cytidine monophosphate (CMP) acting as the leaving group to give the cyclic diphosphate product MEDCP. The enzyme is a trimer with three active sites shared between adjacent copies of the protein. The enzyme also has two metal binding sites, the metals playing key roles in catalysis.   A number of proteins from eukaryotes and prokaryotes share this common N-terminal signature and appear to be involved in terpenoid biosynthesis. The ygbB protein is a putative enzyme of this type .; GO: 0008685 2-C-methyl-D-erythritol 24-cyclodiphosphate synthase activity, 0016114 terpenoid biosynthetic process.
Probab=27.31  E-value=54  Score=15.18  Aligned_cols=28  Identities=18%  Similarity=0.195  Sum_probs=22.1

Q ss_pred             HHHHHHHHHHHHCCCHHHEEEEECCCCCEE
Q ss_conf             899999999982998444799835577806
Q gi|254780231|r   46 NKAMLAMLAKKLALSKSSLRMLSKQSSPLK   75 (98)
Q Consensus        46 N~ali~~Lak~l~v~ks~I~I~~G~~sr~K   75 (98)
                      =.++-+.+|+.|++|..+|+|-  +|+..|
T Consensus       112 i~~mR~~iA~~L~i~~d~vnvK--ATT~Ek  139 (159)
T TIGR00151       112 IEAMRENIAELLGIPLDQVNVK--ATTTEK  139 (159)
T ss_pred             HHHHHHHHHHHHCCCCCCEEEE--EECCCC
T ss_conf             6899999988846870211325--876767


No 32 
>TIGR01941 nqrF NADH:ubiquinone oxidoreductase, Na(+)-translocating, F subunit; InterPro: IPR010205   This entry represents the NqrF subunit of the six-protein, Na(+)-pumping NADH-quinone reductase of a number of marine and pathogenic Gram-negative bacteria.; GO: 0016655 oxidoreductase activity acting on NADH or NADPH quinone or similar compound as acceptor, 0051537 2 iron 2 sulfur cluster binding, 0006814 sodium ion transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane.
Probab=26.09  E-value=36  Score=16.24  Aligned_cols=19  Identities=37%  Similarity=0.746  Sum_probs=13.5

Q ss_pred             CCEEEEEEECCCCC-CHHHH
Q ss_conf             76799999638767-97889
Q gi|254780231|r   29 TIHMKIKVTATPQK-GKANK   47 (98)
Q Consensus        29 ~~~lkv~v~ApPv~-GkAN~   47 (98)
                      .-.|.|||+.||-. ++||+
T Consensus       226 ~i~lNvRIAtPP~~n~~~~~  245 (425)
T TIGR01941       226 IIKLNVRIATPPFDNGKANK  245 (425)
T ss_pred             EEEEEEEECCCCCCCCCCCC
T ss_conf             77999884188887676432


No 33 
>COG0384 Predicted epimerase, PhzC/PhzF homolog [General function prediction only]
Probab=25.46  E-value=60  Score=14.91  Aligned_cols=46  Identities=28%  Similarity=0.256  Sum_probs=36.5

Q ss_pred             EEECCCCCCHHHHHHHHHHHHHHCCC-HHHEEEEECCC-CCEEEEEEC
Q ss_conf             99638767978899999999982998-44479983557-780699982
Q gi|254780231|r   35 KVTATPQKGKANKAMLAMLAKKLALS-KSSLRMLSKQS-SPLKIIYID   80 (98)
Q Consensus        35 ~v~ApPv~GkAN~ali~~Lak~l~v~-ks~I~I~~G~~-sr~K~i~I~   80 (98)
                      -|.+=|+-|-||.+|-.+|.+....+ +.++.+..|.. +|.=.|.+.
T Consensus       218 Gi~EDPaTGSA~~~La~yl~~~~~~~~~~~~~~~QG~~~gR~s~i~~~  265 (291)
T COG0384         218 GVVEDPATGSAAGALAAYLAKHGGLPDKLRLLIEQGTELGRPSRIEVR  265 (291)
T ss_pred             CCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCEEEEE
T ss_conf             887777764546589999998688887653899823236998489999


No 34 
>pfam12116 SpoIIID Stage III sporulation protein D. This stage III sporulation protein is a small DNA-binding family that is essential for gene expression of the mother-cell compartment during sporulation. The domain is found in bacteria and viruses, and is about 40 amino acids in length. It has a conserved RGG sequence motif.
Probab=24.46  E-value=35  Score=16.27  Aligned_cols=20  Identities=40%  Similarity=0.459  Sum_probs=16.7

Q ss_pred             HHHHHHHHHHHHCCCHHHEE
Q ss_conf             89999999998299844479
Q gi|254780231|r   46 NKAMLAMLAKKLALSKSSLR   65 (98)
Q Consensus        46 N~ali~~Lak~l~v~ks~I~   65 (98)
                      |++-++--|+.|||+||-|-
T Consensus        18 ~~aTVR~tAk~FGvSKSTVH   37 (82)
T pfam12116        18 NKATVRQAAKVFGVSKSTVH   37 (82)
T ss_pred             CCHHHHHHHHHHCCCHHHHH
T ss_conf             26289999999696098898


No 35 
>KOG4079 consensus
Probab=23.86  E-value=52  Score=15.27  Aligned_cols=19  Identities=26%  Similarity=0.270  Sum_probs=17.5

Q ss_pred             CCCHHHHHHHHHHHHHHCC
Q ss_conf             6797889999999998299
Q gi|254780231|r   41 QKGKANKAMLAMLAKKLAL   59 (98)
Q Consensus        41 v~GkAN~ali~~Lak~l~v   59 (98)
                      .||+-|+++++.|+|.+|-
T Consensus        91 ld~~~r~eI~~hl~K~lGK  109 (169)
T KOG4079          91 LDGMKREEIEKHLAKTLGK  109 (169)
T ss_pred             CCCCCHHHHHHHHHHHHCC
T ss_conf             4555579999999998582


No 36 
>pfam01361 Tautomerase Tautomerase enzyme. This family includes the enzyme 4-oxalocrotonate tautomerase that catalyses the ketonisation of 2-hydroxymuconate to 2-oxo-3-hexenedioate.
Probab=22.95  E-value=67  Score=14.62  Aligned_cols=27  Identities=4%  Similarity=0.112  Sum_probs=21.1

Q ss_pred             HHHHHHHHHHHHHCCCHHHEEEEECCC
Q ss_conf             889999999998299844479983557
Q gi|254780231|r   45 ANKAMLAMLAKKLALSKSSLRMLSKQS   71 (98)
Q Consensus        45 AN~ali~~Lak~l~v~ks~I~I~~G~~   71 (98)
                      -=+++.+.+++.||+|++.|.++--+-
T Consensus        19 l~~~iT~~~~~~lg~~~~~v~V~i~Ev   45 (60)
T pfam01361        19 LIRRVTEAIVEALGAPPEAIFVVIEEV   45 (60)
T ss_pred             HHHHHHHHHHHHHCCCCCCEEEEEEEC
T ss_conf             999999999998495933599999981


No 37 
>pfam12500 DUF3706 Protein of unknown function (DUF3706). This domain family is found in bacteria, and is typically between 174 and 217 amino acids in length. There is a conserved TRSP sequence motif.
Probab=20.19  E-value=76  Score=14.28  Aligned_cols=30  Identities=27%  Similarity=0.327  Sum_probs=21.4

Q ss_pred             CHHHHHHHHHHHHHHCCCHHHEEEEECCCC
Q ss_conf             978899999999982998444799835577
Q gi|254780231|r   43 GKANKAMLAMLAKKLALSKSSLRMLSKQSS   72 (98)
Q Consensus        43 GkAN~ali~~Lak~l~v~ks~I~I~~G~~s   72 (98)
                      +.++.+-...|++.|+++-.-|++++|.-+
T Consensus         5 s~~~r~~~~~l~~elgi~I~~VSLl~G~~~   34 (199)
T pfam12500         5 SGAARARFAALEAELGIPIDVVSLLSGSVS   34 (199)
T ss_pred             CHHHHHHHHHHHHHHCCCEEEEEEEEEEEE
T ss_conf             989999999999870990779998677898


No 38 
>PRK12880 3-oxoacyl-(acyl carrier protein) synthase III; Reviewed
Probab=20.13  E-value=60  Score=14.91  Aligned_cols=21  Identities=14%  Similarity=0.373  Sum_probs=14.4

Q ss_pred             HHHHHHHHHHHHHHCCCHHHE
Q ss_conf             788999999999829984447
Q gi|254780231|r   44 KANKAMLAMLAKKLALSKSSL   64 (98)
Q Consensus        44 kAN~ali~~Lak~l~v~ks~I   64 (98)
                      .||..+++.+++.|++|..++
T Consensus       269 QAN~riie~i~k~Lgl~~ek~  289 (353)
T PRK12880        269 QSNAYLVDCIKEELKLNDDKV  289 (353)
T ss_pred             CCCHHHHHHHHHHCCCCHHHH
T ss_conf             688999999999809897870


Done!