Query         044511
Match_columns 343
No_of_seqs    151 out of 733
Neff          5.7 
Searched_HMMs 46136
Date          Fri Mar 29 03:56:17 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/044511.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/044511hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd00684 Terpene_cyclase_plant_ 100.0 1.5E-95  3E-100  751.9  30.1  319   17-343     1-363 (542)
  2 PLN02279 ent-kaur-16-ene synth 100.0 1.5E-88 3.3E-93  719.9  26.5  294   45-343   245-597 (784)
  3 PLN02592 ent-copalyl diphospha 100.0 1.5E-86 3.2E-91  703.2  27.6  295   45-343   285-658 (800)
  4 PF01397 Terpene_synth:  Terpen 100.0 1.6E-35 3.5E-40  267.6   8.9  135   27-169     1-183 (183)
  5 PF03936 Terpene_synth_C:  Terp  99.9 9.6E-23 2.1E-27  189.7  10.5  134  199-333     1-134 (270)
  6 cd00868 Terpene_cyclase_C1 Ter  99.8 3.2E-19   7E-24  167.2  12.7  122  213-335     1-122 (284)
  7 cd00687 Terpene_cyclase_nonpla  98.2 2.2E-06 4.8E-11   82.0   5.0  107  220-328    18-126 (303)
  8 cd00385 Isoprenoid_Biosyn_C1 I  96.6  0.0015 3.3E-08   57.8   3.2   75  247-327     1-75  (243)
  9 PF14165 YtzH:  YtzH-like prote  48.9      68  0.0015   26.1   6.1   48  270-319     9-57  (87)
 10 COG3063 PilF Tfp pilus assembl  45.5 1.5E+02  0.0032   28.6   8.8  125   52-198    55-208 (250)
 11 KOG3951 Uncharacterized conser  30.2      82  0.0018   30.7   4.5   38  106-145   260-307 (321)
 12 COG2976 Uncharacterized protei  29.2      39 0.00084   31.7   2.1   34  215-258     8-41  (207)
 13 KOG3906 Tryptophan 2,3-dioxyge  24.6 1.7E+02  0.0036   28.9   5.6   29   53-85     87-115 (399)
 14 KOG1914 mRNA cleavage and poly  23.6 1.7E+02  0.0036   31.6   5.8  103  207-316   223-333 (656)
 15 PF10828 DUF2570:  Protein of u  21.7 1.2E+02  0.0027   25.1   3.6   34   44-83     72-108 (110)
 16 TIGR00636 PduO_Nterm ATP:cob(I  21.1 1.7E+02  0.0037   26.5   4.7   21  276-296    22-42  (171)
 17 PF12626 PolyA_pol_arg_C:  Poly  20.1 1.3E+02  0.0027   25.9   3.4   28  203-230    65-92  (124)

No 1  
>cd00684 Terpene_cyclase_plant_C1 Plant Terpene Cyclases, Class 1. This CD includes a diverse group of monomeric plant terpene cyclases (Tpsa-Tpsf) that convert the acyclic isoprenoid diphosphates, geranyl diphosphate (GPP), farnesyl diphosphate (FPP), or geranylgeranyl diphosphate (GGPP) into cyclic monoterpenes, sesquiterpenes, or diterpenes, respectively; a few form acyclic species. Terpenoid cyclases are soluble enzymes localized to the cytosol (sesquiterpene synthases) or plastids (mono- and diterpene synthases). All monoterpene and diterpene synthases have restricted substrate specificity; however, some sesquiterpene synthases can accept both FPP and GPP. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl diphosphates, via bridging Mg2+ ions (K+ preferred by gymnosperm cyclases), inducing conformational changes such that an N-terminal regi
Probab=100.00  E-value=1.5e-95  Score=751.93  Aligned_cols=319  Identities=50%  Similarity=0.822  Sum_probs=299.0

Q ss_pred             CCCCCCCCCCCCCCccccCCCccchh-HHHHHHhhHHHHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCCCC--------
Q 044511           17 RRSADYGPTIWSFDYIQSLDSKYKGE-SYAKQLEKPKEQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGNTN--------   87 (343)
Q Consensus        17 r~~~~~~ps~W~~~fl~~~~~~~~~~-~~~~~~~~Lk~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI~~--------   87 (343)
                      |++++|+||+|||+++++.++++... .+.+++++||++||+|+....  ||.|++++|++||+||||||++        
T Consensus         1 r~~~~~~~~~w~~~~~~s~~~~~~~~~~~~~~~~~lk~~v~~~~~~~~--~~~~~~~~l~liD~lqrLGi~~hF~~EI~~   78 (542)
T cd00684           1 RPSANFPPSLWGDDHFLSLSSDYSEEDELEEEIEELKEEVRKMLEDSE--YPVDLFERLWLIDRLQRLGISYHFEDEIKE   78 (542)
T ss_pred             CCCCCCCCCcCCCcceeecCCCcchhHHHHHHHHHHHHHHHHHHHhcc--cCCCHHHHHHHHHHHHHcCchhhhHHHHHH
Confidence            78999999999995555555544433 688999999999999998652  5689999999999999999931        


Q ss_pred             -----------------CchhhhhHHHhhhccc------ccccccccccccccCCCCCCChHHHHHH------------H
Q 044511           88 -----------------KSLYATALKFRVLRQY------ETFSRFMDEKGRFKSSGHSDDGKGMLAL------------I  132 (343)
Q Consensus        88 -----------------~dL~~~AL~FRLLR~h------DvF~~F~d~~G~F~~~~l~~dv~glLsL------------i  132 (343)
                                       .||++|||+|||||||      |||++|+|++|+|+++ +.+||+|||||            |
T Consensus        79 ~L~~i~~~~~~~~~~~~~dl~~~al~FRlLR~~Gy~vs~dvf~~F~~~~g~f~~~-~~~d~~g~l~Ly~As~l~~~gE~i  157 (542)
T cd00684          79 ILDYIYRYWTERGESNEDDLYTTALGFRLLRQHGYNVSSDVFKKFKDEDGKFKES-LTQDVKGMLSLYEASHLSFPGEDI  157 (542)
T ss_pred             HHHHHHHhhcccccccCCCHHHHHHHHHHHHHcCCCcCHHHHhhhcCCCCCcCch-hhhhhHHHHHHHHHhhcCCCCcHH
Confidence                             4999999999999999      9999999999999999 99999999999            9


Q ss_pred             HHHHHHHHHHHHHHHhhccCCCCCCCchhHHHHHHhcchhhhhhhhhHHHHHHHHHhcCCCCCchHHHHHHHhhhHHHhh
Q 044511          133 FRDATSFTTAYLKEWVIKHDSNKNDDEHLCTLVNHALELPLHWRMLRLEARWFIDVYENGPDMNPILLELAKVDFNIVQA  212 (343)
Q Consensus       133 LdeA~~Fs~~~L~~~~~~~~~~~~~~~~l~~eV~~aL~~P~~~~l~Rlear~yI~~Y~~~~~~n~~lLelAKlDFn~~Qs  212 (343)
                      ||||++||++||++.++++   +.++++|+++|++||++|||+++||||||+||++|++++++|++||||||+|||+||+
T Consensus       158 LdeA~~ft~~~L~~~~~~~---~~~~~~l~~~V~~aL~~P~~~~~~rlear~yi~~Y~~~~~~n~~lLelAkldfn~~Q~  234 (542)
T cd00684         158 LDEALSFTTKHLEEKLESN---WIIDPDLSGEIEYALEIPLHASLPRLEARWYIEFYEQEDDHNETLLELAKLDFNILQA  234 (542)
T ss_pred             HHHHHHHHHHHHHHHhhcc---CCCCchHHHHHHHHccCchhcCCchHHHHHHHHHhCCCccccHHHHHHHHHHHHHHhH
Confidence            9999999999999999864   2367899999999999999999999999999999999999999999999999999999


Q ss_pred             hhHhHHHHHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHHHHHHHHHHHHHHhhccCCHHHHHHHH
Q 044511          213 VHQENLKYASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMSTMVNALITAIDDVYDVYGTLEELEIFT  292 (343)
Q Consensus       213 ~hq~EL~~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~tK~~~litviDD~yD~yGTleEl~~ft  292 (343)
                      +||+||++++|||+++||. ++|||+|+|++|||||++|++|||++|.+|+++||++++++++||+||+|||+|||+.||
T Consensus       235 ~hq~El~~~~rWwk~~gL~-~~l~~aRdr~ve~yf~~~a~~feP~~s~~Rl~~aK~~~l~~~iDD~fD~~gt~eEl~~ft  313 (542)
T cd00684         235 LHQEELKILSRWWKDLDLA-SKLPFARDRLVECYFWAAGTYFEPQYSLARIALAKTIALITVIDDTYDVYGTLEELELFT  313 (542)
T ss_pred             hHHHHHHHHhHHHHhcCCc-ccCCcccchhHHHHHHHHhcccCccchHHHHHHHHHHHHHhhhHhhhccCCCHHHHHHHH
Confidence            9999999999999999998 888999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHhhcCchhhccCChHHHHHHHHHHHHHHHHHHHHHHccCchhhhhhhhC
Q 044511          293 DAVERWDATAVEQLPHYMKLCFHALRNSINEMTFDVLRDQGVDILISYLKK  343 (343)
Q Consensus       293 ~averWD~~~~~~Lpeymk~~f~al~~t~~ei~~~~~~~~g~~~~~~~lk~  343 (343)
                      +||+|||.+++++||+|||+||.+|+++++|+++++.+.+|.+ +++|+++
T Consensus       314 ~ai~rwd~~~~~~lPe~mk~~~~al~~~~~ei~~~~~~~~~~~-~~~~~~~  363 (542)
T cd00684         314 EAVERWDISAIDQLPEYMKIVFKALLNTVNEIEEELLKEGGSY-VVPYLKE  363 (542)
T ss_pred             HHHHhccccchhhccHHHHHHHHHHHHHHHHHHHHHHHhcCcc-hHHHHHH
Confidence            9999999999999999999999999999999999999999988 8888763


No 2  
>PLN02279 ent-kaur-16-ene synthase
Probab=100.00  E-value=1.5e-88  Score=719.92  Aligned_cols=294  Identities=29%  Similarity=0.430  Sum_probs=276.5

Q ss_pred             HHHHhhHHHHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCCCC-------------------------CchhhhhHHHhh
Q 044511           45 AKQLEKPKEQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGNTN-------------------------KSLYATALKFRV   99 (343)
Q Consensus        45 ~~~~~~Lk~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI~~-------------------------~dL~~~AL~FRL   99 (343)
                      .++.++|..-|++.-+++|++||.+.++++|+||+||||||++                         .|+++|||+|||
T Consensus       245 ~~~~~yL~~~~~~~~g~vP~~yp~~~fe~l~lvd~L~rlGi~~hF~~EI~~~L~~~~~~~~~~~~~~~~Dl~~tAl~FRL  324 (784)
T PLN02279        245 AGCLRYLRSLLQKFGNAVPTVYPLDQYARLSMVDTLERLGIDRHFRKEIKSVLDETYRYWLQGEEEIFLDLATCALAFRI  324 (784)
T ss_pred             hHHHHHHHHHHHhcCCCCCCCCcccHHHHhHHHHHHHHhCCccccHHHHHHHHHHHHHhhcccccCCCCCHHHHHHHHHH
Confidence            4788999999999888899999999999999999999999932                         499999999999


Q ss_pred             hccc------ccccccccccccccCC--CCCCChHHHHHH------------HHHHHHHHHHHHHHHHhhccCC-CCCCC
Q 044511          100 LRQY------ETFSRFMDEKGRFKSS--GHSDDGKGMLAL------------IFRDATSFTTAYLKEWVIKHDS-NKNDD  158 (343)
Q Consensus       100 LR~h------DvF~~F~d~~G~F~~~--~l~~dv~glLsL------------iLdeA~~Fs~~~L~~~~~~~~~-~~~~~  158 (343)
                      ||||      |||++|+|+ + |+++  |..+||+|||||            |||||+.||++||++.++++.+ ++.++
T Consensus       325 LR~hGy~VS~dvf~~F~~~-~-F~~~l~~~~~dv~gmL~LY~AS~l~~~gE~iLdeA~~Fs~~~L~~~~~~~~~~~~~~~  402 (784)
T PLN02279        325 LRLNGYDVSSDPLKQFAED-H-FSDSLGGYLKDTGAVLELFRASQISYPDESLLEKQNSWTSHFLEQGLSNWSKTADRLR  402 (784)
T ss_pred             HHHcCCCCChhHHhhcCCC-c-ccchhcccchhhHHHHHHHHHHhcCCCccHHHHHHHHHHHHHHHHHHhcccccccccC
Confidence            9999      999999975 4 9887  236999999999            9999999999999999886543 34467


Q ss_pred             chhHHHHHHhcchhhhhhhhhHHHHHHHHHhcCCCC------------CchHHHHHHHhhhHHHhhhhHhHHHHHHHHHH
Q 044511          159 EHLCTLVNHALELPLHWRMLRLEARWFIDVYENGPD------------MNPILLELAKVDFNIVQAVHQENLKYASRWWK  226 (343)
Q Consensus       159 ~~l~~eV~~aL~~P~~~~l~Rlear~yI~~Y~~~~~------------~n~~lLelAKlDFn~~Qs~hq~EL~~lsrWwk  226 (343)
                      ++|+++|+|||++|||+++||||||+||++|++++.            +|++||||||+|||+||++||+||++|+|||+
T Consensus       403 ~~L~~eV~~AL~~P~~~~l~RlEaR~yI~~Y~~~~~~i~Kt~yr~~~~~n~~lLeLAklDFN~~Qs~hq~EL~~l~rWwk  482 (784)
T PLN02279        403 KYIKKEVEDALNFPYYANLERLANRRSIENYAVDDTRILKTSYRCSNICNQDFLKLAVEDFNFCQSIHREELKQLERWIV  482 (784)
T ss_pred             ccHHHHHHHHhcCchhcCccHHHHHHHHHHhccccchhccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhCeeHH
Confidence            889999999999999999999999999999998885            89999999999999999999999999999999


Q ss_pred             HhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHhhcCch-hhcc
Q 044511          227 KTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMSTMVNALITAIDDVYDVYGTLEELEIFTDAVERWDAT-AVEQ  305 (343)
Q Consensus       227 e~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~tK~~~litviDD~yD~yGTleEl~~ft~averWD~~-~~~~  305 (343)
                      ++||.  +|||||||+||||||++|++||||||.+|++|||+++|+|++||+||+|||+|||++||+||+|||++ .++.
T Consensus       483 e~~L~--~L~faRdr~ve~Yf~aaa~~fEPe~S~aRi~~aK~~~L~tviDD~fD~yGt~eEL~~ft~aVeRWD~~~~~~~  560 (784)
T PLN02279        483 ENRLD--KLKFARQKLAYCYFSAAATLFSPELSDARLSWAKNGVLTTVVDDFFDVGGSEEELENLIQLVEKWDVNGSPDF  560 (784)
T ss_pred             hcCCc--cCCchhhHHHHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHHHhccccchhh
Confidence            99996  99999999999999999999999999999999999999999999999999999999999999999998 6699


Q ss_pred             CChHHHHHHHHHHHHHHHHHHHHHHccCchhhhhhhhC
Q 044511          306 LPHYMKLCFHALRNSINEMTFDVLRDQGVDILISYLKK  343 (343)
Q Consensus       306 Lpeymk~~f~al~~t~~ei~~~~~~~~g~~~~~~~lk~  343 (343)
                      ||+|||+||.+|++|+|||+.++.+.||++ +++|+++
T Consensus       561 lpeymki~f~aL~~t~nei~~~~~~~qGr~-v~~~l~~  597 (784)
T PLN02279        561 CSEQVEIIFSALRSTISEIGDKAFTWQGRN-VTSHIIK  597 (784)
T ss_pred             CcHHHHHHHHHHHHHHHHHHHHHHHHcCch-HHHHHHH
Confidence            999999999999999999999999999999 9999875


No 3  
>PLN02592 ent-copalyl diphosphate synthase
Probab=100.00  E-value=1.5e-86  Score=703.25  Aligned_cols=295  Identities=25%  Similarity=0.385  Sum_probs=276.1

Q ss_pred             HHHHhhHHHHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCCCC------------------------------Cchhhhh
Q 044511           45 AKQLEKPKEQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGNTN------------------------------KSLYATA   94 (343)
Q Consensus        45 ~~~~~~Lk~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI~~------------------------------~dL~~~A   94 (343)
                      .++.++|...|++.-+++|++||.+++++|++||+||||||++                              .|+++||
T Consensus       285 ~~cl~YL~~~~~k~~GgVP~vyP~d~fE~LwlVDtLqRLGIs~hF~~EI~~iLd~iy~~w~~~g~~~a~~~~~~Dld~TA  364 (800)
T PLN02592        285 ENCLEYLNKAVQRFNGGVPNVYPVDLFEHIWAVDRLQRLGISRYFEPEIKECIDYVHRYWTENGICWARNSHVHDIDDTA  364 (800)
T ss_pred             hHHHHHHHHHHHHcCCCCCCCCCCcHHHHHHHHHHHHHcCCccccHHHHHHHHHHHHHHHhhcCcccccCCCcCCHHHHH
Confidence            4788999999999878899999999999999999999999921                              4899999


Q ss_pred             HHHhhhccc------ccccccccccccccCC--CCCCChHHHHHH------------HHHHHHHHHHHHHHHHhhccCC-
Q 044511           95 LKFRVLRQY------ETFSRFMDEKGRFKSS--GHSDDGKGMLAL------------IFRDATSFTTAYLKEWVIKHDS-  153 (343)
Q Consensus        95 L~FRLLR~h------DvF~~F~d~~G~F~~~--~l~~dv~glLsL------------iLdeA~~Fs~~~L~~~~~~~~~-  153 (343)
                      |+|||||||      |||++|++ +|+|++.  +..+|++|||+|            |||+|+.||+++|++.++.+.+ 
T Consensus       365 LaFRLLRqhGy~VS~DvF~~F~~-~g~F~~~~ge~~~Dv~glL~LYeAS~l~~~gE~iLdeA~~Fs~~~L~~~~~~~~l~  443 (800)
T PLN02592        365 MGFRLLRLHGHQVSADVFKHFEK-GGEFFCFAGQSTQAVTGMFNLYRASQVLFPGEKILENAKEFSSKFLREKQEANELL  443 (800)
T ss_pred             HHHHHHHHcCCCCChHHHHhhcC-CCCccccccccccchHHHHHHHHHHhcCCCcchHHHHHHHHHHHHHHHHhhccccc
Confidence            999999999      99999987 8999855  258999999999            9999999999999998753322 


Q ss_pred             CC-CCCchhHHHHHHhcchhhhhhhhhHHHHHHHHHhcCCCCC-------------chHHHHHHHhhhHHHhhhhHhHHH
Q 044511          154 NK-NDDEHLCTLVNHALELPLHWRMLRLEARWFIDVYENGPDM-------------NPILLELAKVDFNIVQAVHQENLK  219 (343)
Q Consensus       154 ~~-~~~~~l~~eV~~aL~~P~~~~l~Rlear~yI~~Y~~~~~~-------------n~~lLelAKlDFn~~Qs~hq~EL~  219 (343)
                      ++ .++++|+++|+|||++|||+++||+|||+||++|++++++             |++||||||+|||+||++||+||+
T Consensus       444 d~~~~~~~L~~eV~~AL~~P~~~~l~RlEaR~yI~~Y~~~~~~~i~Kt~yr~~~~~n~~lLeLAklDFn~~Qs~hq~EL~  523 (800)
T PLN02592        444 DKWIIMKDLPGEVGFALEIPWYASLPRVETRFYIEQYGGEDDVWIGKTLYRMPYVNNNEYLELAKLDYNNCQALHQLEWD  523 (800)
T ss_pred             cccccCccHHHHHHHhccChhhcCcchHHHHHHHHHhcCCcccchhhhhccccccCCHHHHHHHHHHHHHHHHHhHHHHH
Confidence            33 2578899999999999999999999999999999987764             999999999999999999999999


Q ss_pred             HHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHh---
Q 044511          220 YASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMSTMVNALITAIDDVYDVYGTLEELEIFTDAVE---  296 (343)
Q Consensus       220 ~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~tK~~~litviDD~yD~yGTleEl~~ft~ave---  296 (343)
                      +++||||++||+  +|||||||++|||||++|++||||||.+|++|||+++|+|++||+||+|||+|||++||++|+   
T Consensus       524 ~lsrWwke~~L~--~L~faRdr~ve~Yfwa~~~~feP~~s~~Ri~~aK~~~LitviDD~fD~yGt~eEl~~ft~~v~~~~  601 (800)
T PLN02592        524 NFQKWYEECNLG--EFGVSRSELLLAYFLAAASIFEPERSHERLAWAKTTVLVEAISSYFNKETSSKQRRAFLHEFGYGY  601 (800)
T ss_pred             HHhHHHHhcCCC--cCCcchhHHHHHHHHHHHhhcCccchHHHHHHHHHHHHHHhhcccccCCCCHHHHHHHHHHHHhcc
Confidence            999999999997  899999999999999999999999999999999999999999999999999999999999997   


Q ss_pred             -----hcCchhhccCCh------HHHHHHHHHHHHHHHHHHHHHHccCchhhhhhhhC
Q 044511          297 -----RWDATAVEQLPH------YMKLCFHALRNSINEMTFDVLRDQGVDILISYLKK  343 (343)
Q Consensus       297 -----rWD~~~~~~Lpe------ymk~~f~al~~t~~ei~~~~~~~~g~~~~~~~lk~  343 (343)
                           |||.+++++||+      |||+||.|||||+|||+.++.++||++ +++||++
T Consensus       602 ~~~~~rWd~~~~~~lp~~~~~~~~mki~f~aLy~tineia~~a~~~qGr~-v~~~L~~  658 (800)
T PLN02592        602 KINGRRSDHHFNDRNMRRSGSVKTGEELVGLLLGTLNQLSLDALEAHGRD-ISHLLRH  658 (800)
T ss_pred             cccccccCchhhhcccccccchhHHHHHHHHHHHHHHHHHHHHHHHhCcc-HHHHHHH
Confidence                 999999999988      999999999999999999999999999 9999985


No 4  
>PF01397 Terpene_synth:  Terpene synthase, N-terminal domain;  InterPro: IPR001906 Sequences containing this domain belong to the terpene synthase family. It has been suggested that this gene family be designated tps (for terpene synthase). Sequence comparisons reveal similarities between the monoterpene (C10) synthases, sesquiterpene (C15) synthases and the diterpene (C20) synthases. It has been split into six subgroups on the basis of phylogeny, called Tpsa-Tpsf [].   Tpsa includes vetispiridiene synthase Q39979 from SWISSPROT, 5-epi-aristolochene synthase, Q40577 from SWISSPROT and (+)-delta-cadinene synthase P93665 from SWISSPROT.  Tpsb includes (-)-limonene synthase, Q40322 from SWISSPROT. Tpsc includes copalyl diphosphate synthase (kaurene synthase A), O04408 from SWISSPROT. Tpsd includes taxadiene synthase, Q41594 from SWISSPROT, pinene synthase, O24475 from SWISSPROT and myrcene synthase, O24474 from SWISSPROT.  Tpse includes ent-kaurene synthase B Q39548 from SWISSPROT. Tpsf includes linalool synthase Q9ZPN5 from SWISSPROT.  In the fungus Phaeosphaeria sp. (strain L487) the synthesis of ent-kaurene from geranylgeranyl diphosphate is promoted by a single bifunctional protein [].; GO: 0016829 lyase activity, 0008152 metabolic process; PDB: 2ONH_A 2ONG_B 3P5R_A 3P5P_A 3N0F_A 3N0G_B 3PYB_A 3PYA_A 3G4F_A 3G4D_B ....
Probab=100.00  E-value=1.6e-35  Score=267.62  Aligned_cols=135  Identities=47%  Similarity=0.729  Sum_probs=119.3

Q ss_pred             CCCCccccCCCccc------hhHHHHHHhhHHHHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCCCC-------------
Q 044511           27 WSFDYIQSLDSKYK------GESYAKQLEKPKEQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGNTN-------------   87 (343)
Q Consensus        27 W~~~fl~~~~~~~~------~~~~~~~~~~Lk~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI~~-------------   87 (343)
                      |||+|++++++.++      .+.+.+++++||++||+|+..+.    .+++++|+|||+||||||++             
T Consensus         1 W~d~fl~s~s~~~~~~~~~~~~~~~~~~~~Lk~~v~~~l~~~~----~d~~~~L~lID~lqRLGi~yhFe~EI~~~L~~i   76 (183)
T PF01397_consen    1 WGDDFLQSLSPSYTACMQSEDEKCKERAEELKEEVRNMLPASY----PDPLEKLELIDTLQRLGISYHFEDEIKEILDSI   76 (183)
T ss_dssp             TTHHHHHHTBHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHSSS----SHHHHHHHHHHHHHHTTCGGGGHHHHHHHHHHH
T ss_pred             CCCceecCCCCcchhccchhHHHHHHHHHHHHHHHHHHHhhcC----CCHHHHHHHHHHHHHcCCcHHHHHHHHHHHHHH
Confidence            99999987655543      36788999999999999998873    38999999999999999942             


Q ss_pred             -----------CchhhhhHHHhhhccc------ccccccccccccccCCCCCCChHHHHHH------------HHHHHHH
Q 044511           88 -----------KSLYATALKFRVLRQY------ETFSRFMDEKGRFKSSGHSDDGKGMLAL------------IFRDATS  138 (343)
Q Consensus        88 -----------~dL~~~AL~FRLLR~h------DvF~~F~d~~G~F~~~~l~~dv~glLsL------------iLdeA~~  138 (343)
                                 .||++|||+|||||||      |||++|+|++|+|+.+ +++||+|||||            |||||+.
T Consensus        77 ~~~~~~~~~~~~dL~~~AL~FRLLRqhGy~VS~DvF~~F~d~~g~F~~~-l~~Dv~glLsLYeAS~l~~~gE~iLdeA~~  155 (183)
T PF01397_consen   77 YRSWDEDNEEIDDLYTTALRFRLLRQHGYYVSSDVFNKFKDEKGNFKES-LSNDVKGLLSLYEASHLRFHGEDILDEARA  155 (183)
T ss_dssp             HHTTTTTSHTSSCHHHHHHHHHHHHHTT----GGGGGGGBETTSSBSGG-GGGHHHHHHHHHHHHTT--TT-HHHHHHHH
T ss_pred             hhhccccccccCchhHHHHHHHHHHHcCCcccHHHHhCcccCCCccchh-hhHhHHHHHHHHHHHHccCCChHHHHHHHH
Confidence                       3999999999999999      9999999999999998 99999999999            9999999


Q ss_pred             HHHHHHHHHhhccCCCCCCCchhHHHHHHhc
Q 044511          139 FTTAYLKEWVIKHDSNKNDDEHLCTLVNHAL  169 (343)
Q Consensus       139 Fs~~~L~~~~~~~~~~~~~~~~l~~eV~~aL  169 (343)
                      ||+++|+++++++..   .+++|+++|+|||
T Consensus       156 Ft~~~L~~~~~~~~~---~~~~L~~~V~~AL  183 (183)
T PF01397_consen  156 FTTKHLKSLLSNLSI---PDPHLAKEVKHAL  183 (183)
T ss_dssp             HHHHHHHHHHTTTCT---TSCHHHHHHHHHH
T ss_pred             HHHHHHHHHhccCCC---CcHHHHHHHHHhC
Confidence            999999999986310   1346999999997


No 5  
>PF03936 Terpene_synth_C:  Terpene synthase family, metal binding domain;  InterPro: IPR005630 Sequences containing this domain belong to the terpene synthase family. It has been suggested that this gene family be designated tps (for terpene synthase). Sequence comparisons reveal similarities between the monoterpene (C10) synthases, sesquiterpene (C15) synthases and the diterpene (C20) synthases. It has been split into six subgroups on the basis of phylogeny, called Tpsa-Tpsf [].  Tpsa includes vetispiridiene synthase Q39979 from SWISSPROT, 5-epi-aristolochene synthase, Q40577 from SWISSPROT and (+)-delta-cadinene synthase P93665 from SWISSPROT.  Tpsb includes (-)-limonene synthase, Q40322 from SWISSPROT. Tpsc includes copalyl diphosphate synthase (kaurene synthase A), O04408 from SWISSPROT. Tpsd includes taxadiene synthase, Q41594 from SWISSPROT, pinene synthase, O24475 from SWISSPROT and myrcene synthase, O24474 from SWISSPROT.  Tpse includes ent-kaurene synthase B Q39548 from SWISSPROT. Tpsf includes linalool synthase Q9ZPN5 from SWISSPROT.  In the fungus Phaeosphaeria sp. (strain L487) the synthesis of ent-kaurene from geranylgeranyl diphosphate is promoted by a single bifunctional protein [].; GO: 0000287 magnesium ion binding, 0016829 lyase activity; PDB: 3PYB_A 3PYA_A 3G4F_A 3G4D_B 3CKE_A 2OA6_D 2E4O_B 3BNY_B 3BNX_A 3LG5_A ....
Probab=99.88  E-value=9.6e-23  Score=189.74  Aligned_cols=134  Identities=35%  Similarity=0.453  Sum_probs=128.1

Q ss_pred             HHHHHHhhhHHHhhhhHhHHHHHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHHHHHHHHHHHHHH
Q 044511          199 LLELAKVDFNIVQAVHQENLKYASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMSTMVNALITAIDDV  278 (343)
Q Consensus       199 lLelAKlDFn~~Qs~hq~EL~~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~tK~~~litviDD~  278 (343)
                      +|+|||+|||+||++||+|++++++||+++|+. .+.+.+|+|++.+|||.++.++.|..+..|+++||..+++.++||+
T Consensus         1 ~~~la~~~~~~~~~~~~~e~~~~~~W~~~~~l~-~~~~~~~~~~~~~~~~~~aa~~~P~~~~~l~~~a~~~~w~f~~DD~   79 (270)
T PF03936_consen    1 YLELAKRDFPHCQALHQQELEEIDRWVKEFGLF-DEDKAARQRFRQAYFGLLAARFYPDSSDELLAAADWMAWLFIFDDF   79 (270)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCTHH-HHHTTSHHHHHHHHHHHHHHHHSGCGHHHHHHHHHHHHHHHHHHHH
T ss_pred             CcccchhhcHhhHHHHHHHHHHHHHHHHHcCCc-cccccchhhhhHhHHhhhhheeCCCcHHHHHHHHhhchheeeeeec
Confidence            689999999999999999999999999999994 4888899999999999999999999888899999999999999999


Q ss_pred             hhccCCHHHHHHHHHHHhhcCchhhccCChHHHHHHHHHHHHHHHHHHHHHHccC
Q 044511          279 YDVYGTLEELEIFTDAVERWDATAVEQLPHYMKLCFHALRNSINEMTFDVLRDQG  333 (343)
Q Consensus       279 yD~yGTleEl~~ft~averWD~~~~~~Lpeymk~~f~al~~t~~ei~~~~~~~~g  333 (343)
                      ||..|+.++++.|+++++||++...+.+|++++.++.++.+++++++..+.+.++
T Consensus        80 ~D~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~d~~~r~~~~~~~~~~  134 (270)
T PF03936_consen   80 FDDGGSAEELEALTDAVERWDPNSGDPLPDPDKPLFRALADIWNRIAARMSPAQR  134 (270)
T ss_dssp             HHTTSHHHHHHHHHHHHHHTSSGGGGGSTHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             cccccchHHHHHHHHHHhcccccccccccchhHHHHHHHHHHHHHHHHHhhhhhc
Confidence            9999999999999999999999888899999999999999999999999888764


No 6  
>cd00868 Terpene_cyclase_C1 Terpene cyclases, Class 1. Terpene cyclases, Class 1 (C1) of the class 1 family of isoprenoid biosynthesis enzymes, which share the 'isoprenoid synthase fold' and convert linear, all-trans, isoprenoids, geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate into numerous cyclic forms of monoterpenes, diterpenes, and sesquiterpenes. Also included in this CD are the cis-trans terpene cyclases such as trichodiene synthase. The class I terpene cyclization reactions proceed via electrophilic alkylations in which a new carbon-carbon single bond is generated through interaction between a highly reactive electron-deficient allylic carbocation and an electron-rich carbon-carbon double bond. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl phosphates via bridging Mg2+ ions, inducing proposed conformational ch
Probab=99.80  E-value=3.2e-19  Score=167.22  Aligned_cols=122  Identities=52%  Similarity=0.939  Sum_probs=116.0

Q ss_pred             hhHhHHHHHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHHHHHHHHHHHHHHhhccCCHHHHHHHH
Q 044511          213 VHQENLKYASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMSTMVNALITAIDDVYDVYGTLEELEIFT  292 (343)
Q Consensus       213 ~hq~EL~~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~tK~~~litviDD~yD~yGTleEl~~ft  292 (343)
                      .||+|++++++||+++||. ...+++|.+..++|+|+++++|+|+.+..|+++||.++++.++||.||.+|+.+|+..|+
T Consensus         1 ~~~~e~~~~~~W~~~~~l~-~~~~~~r~~~~~~~~~~a~~~p~~~~~~~l~~~a~~~~~~f~~DD~~D~~~~~~~~~~~~   79 (284)
T cd00868           1 LHQEELKELSRWWKELGLQ-EKLPFARDRLVECYFWAAGSYFEPQYSEARIALAKTIALLTVIDDTYDDYGTLEELELFT   79 (284)
T ss_pred             CCHHHHHHHHHHHHHhCCc-ccCCchhhHhHHHHHHHHHhhcCccchHHHHHHHHHHHHHHHHHhccccCCCHHHHHHHH
Confidence            4999999999999999998 555699999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHhhcCchhhccCChHHHHHHHHHHHHHHHHHHHHHHccCch
Q 044511          293 DAVERWDATAVEQLPHYMKLCFHALRNSINEMTFDVLRDQGVD  335 (343)
Q Consensus       293 ~averWD~~~~~~Lpeymk~~f~al~~t~~ei~~~~~~~~g~~  335 (343)
                      ++++||+...++.+|++++.++.+++++.++++..+.+.+|..
T Consensus        80 ~~~~~~~~~~~~~~p~~~~~~~~~l~d~~~r~~~~~~~~~~~~  122 (284)
T cd00868          80 EAVERWDISAIDELPEYMKPVFKALYDLVNEIEEELAKEGGSE  122 (284)
T ss_pred             HHHHhcChhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHhcCch
Confidence            9999999999999999999999999999999999998877654


No 7  
>cd00687 Terpene_cyclase_nonplant_C1 Non-plant Terpene Cyclases, Class 1. This CD includes terpenoid cyclases such as pentalenene synthase and aristolochene synthase which, using an all-trans pathway, catalyze the ionization of farnesyl diphosphate, followed by the formation of a macrocyclic intermediate by bond formation between C1 and either C10 (aristolochene synthase) or C11 (pentalenene synthase), resulting in production of tricyclic hydrocarbon pentalenene or bicyclic hydrocarbon aristolochene. As with other enzymes with the 'terpenoid synthase fold', they have two conserved metal binding motifs, proposed to coordinate Mg2+ ion-bridged binding of the diphosphate moiety of FPP to the enzymes. Metal-triggered substrate ionization initiates catalysis, and the alpha-barrel active site serves as a template to channel and stabilize the conformations of reactive carbocation intermediates through a complex cyclization cascade. These enzymes function in the monomeric form and are found in
Probab=98.15  E-value=2.2e-06  Score=81.99  Aligned_cols=107  Identities=15%  Similarity=0.002  Sum_probs=85.0

Q ss_pred             HHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHH-HHHHHHHHHHHHhhcc-CCHHHHHHHHHHHhh
Q 044511          220 YASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMST-MVNALITAIDDVYDVY-GTLEELEIFTDAVER  297 (343)
Q Consensus       220 ~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~t-K~~~litviDD~yD~y-GTleEl~~ft~aver  297 (343)
                      +...|.++.|+-  .=+.+|++.++++|+.++.++.|+.+..|+.++ +.++++.++||.||.. ++++++..+++.+.+
T Consensus        18 ~~~~w~~~~~l~--~~~~~~~~~~~~~~~~~~a~~~P~a~~~~l~l~~~~~~w~f~~DD~~D~~~~~~~~~~~~~~~~~~   95 (303)
T cd00687          18 EYLEWVLEEMLI--PSEKAEKRFLSADFGDLAALFYPDADDERLMLAADLMAWLFVFDDLLDRDQKSPEDGEAGVTRLLD   95 (303)
T ss_pred             HHHHHHHHcCCC--CcchhHHHHhcCCHHHHHhhcCCCCCHHHHHHHHHHHHHHHHhcccCCccccCHHHHHHHHHHHHh
Confidence            467788888664  345899999999999999999999999999655 9999999999999997 599999999998888


Q ss_pred             cCchhhccCChHHHHHHHHHHHHHHHHHHHH
Q 044511          298 WDATAVEQLPHYMKLCFHALRNSINEMTFDV  328 (343)
Q Consensus       298 WD~~~~~~Lpeymk~~f~al~~t~~ei~~~~  328 (343)
                      |.......-|.....+..++.++...+...+
T Consensus        96 ~~~~~~~~~~~~~~p~~~~~~d~~~r~~~~~  126 (303)
T cd00687          96 ILRGDGLDSPDDATPLEFGLADLWRRTLARM  126 (303)
T ss_pred             ccCCCCCCCCCCCCHHHHHHHHHHHHhccCC
Confidence            7654221114666777777777777665543


No 8  
>cd00385 Isoprenoid_Biosyn_C1 Isoprenoid Biosynthesis enzymes, Class 1. Superfamily of trans-isoprenyl diphosphate synthases (IPPS) and class I terpene cyclases which either synthesize geranyl/farnesyl diphosphates (GPP/FPP) or longer chained products from isoprene precursors, isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP), or use geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate as substrate. These enzymes produce a myriad of precursors for such end products as steroids, cholesterol, sesquiterpenes, heme, carotenoids, retinoids, and diterpenes; and are widely distributed among archaea, bacteria, and eukaryota. The enzymes in this superfamily share the same 'isoprenoid synthase fold' and include several subgroups. The head-to-tail (HT) IPPS catalyze the successive 1'-4 condensation of the 5-carbon IPP to the growing isoprene chain to form linear, all-trans, C10-, C15-, C20-, C25-, C30-, C35-, C40-, C45-, or C50-isoprenoid diphosphates. Cyclic monoter
Probab=96.64  E-value=0.0015  Score=57.78  Aligned_cols=75  Identities=24%  Similarity=0.263  Sum_probs=58.3

Q ss_pred             HHhhccccCCCcchhhHHHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHhhcCchhhccCChHHHHHHHHHHHHHHHHHH
Q 044511          247 FWSVGEKFEPQFGYFRRMSTMVNALITAIDDVYDVYGTLEELEIFTDAVERWDATAVEQLPHYMKLCFHALRNSINEMTF  326 (343)
Q Consensus       247 fw~~~~~fEP~~s~~Ri~~tK~~~litviDD~yD~yGTleEl~~ft~averWD~~~~~~Lpeymk~~f~al~~t~~ei~~  326 (343)
                      +|+++++++|+++..|..+++..++..++||++|..++..+.......+      .....|.++...+..+...+.++..
T Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~DDi~D~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~   74 (243)
T cd00385           1 FRPLAVLLEPEASRLRAAVEKLHAASLVHDDIVDDSGTRRGLPTAHLAV------AIDGLPEAILAGDLLLADAFEELAR   74 (243)
T ss_pred             CcHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCchhhhhhH------HhcCchHHHHHHHHHHHHHHHHHHh
Confidence            3567889999999999999999999999999999998887776655544      2344667777777777777666654


Q ss_pred             H
Q 044511          327 D  327 (343)
Q Consensus       327 ~  327 (343)
                      .
T Consensus        75 ~   75 (243)
T cd00385          75 E   75 (243)
T ss_pred             C
Confidence            3


No 9  
>PF14165 YtzH:  YtzH-like protein
Probab=48.94  E-value=68  Score=26.13  Aligned_cols=48  Identities=19%  Similarity=0.219  Sum_probs=33.3

Q ss_pred             HHHHHHHHH-hhccCCHHHHHHHHHHHhhcCchhhccCChHHHHHHHHHHH
Q 044511          270 ALITAIDDV-YDVYGTLEELEIFTDAVERWDATAVEQLPHYMKLCFHALRN  319 (343)
Q Consensus       270 ~litviDD~-yD~yGTleEl~~ft~averWD~~~~~~Lpeymk~~f~al~~  319 (343)
                      .|--++++- -|.+||..|++.+...|+.  .-+-+.++..+|-+..-+|+
T Consensus         9 LLkDIL~~hq~DccgTvsEcEQieRLvks--Lm~n~~i~~~ik~~L~~Iy~   57 (87)
T PF14165_consen    9 LLKDILSNHQLDCCGTVSECEQIERLVKS--LMANPNIDADIKQTLEEIYS   57 (87)
T ss_pred             HHHHHHHhhhhhccCcHHHHHHHHHHHHH--HHcCCCcCHHHHHHHHHHHH
Confidence            333444444 4889999999999888877  55556677887766655544


No 10 
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=45.47  E-value=1.5e+02  Score=28.61  Aligned_cols=125  Identities=17%  Similarity=0.195  Sum_probs=76.2

Q ss_pred             HHHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCCCC--CchhhhhHHHhhhccc----ccccccccccccccCCCCCCCh
Q 044511           52 KEQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGNTN--KSLYATALKFRVLRQY----ETFSRFMDEKGRFKSSGHSDDG  125 (343)
Q Consensus        52 k~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI~~--~dL~~~AL~FRLLR~h----DvF~~F~d~~G~F~~~~l~~dv  125 (343)
                      |..+++.|...|    +.....+-+--.-|++|-.+  .+-|..||.     .+    ||.|+|    |.|-|+ ...-.
T Consensus        55 ~~nlekAL~~DP----s~~~a~~~~A~~Yq~~Ge~~~A~e~YrkAls-----l~p~~GdVLNNY----G~FLC~-qg~~~  120 (250)
T COG3063          55 KKNLEKALEHDP----SYYLAHLVRAHYYQKLGENDLADESYRKALS-----LAPNNGDVLNNY----GAFLCA-QGRPE  120 (250)
T ss_pred             HHHHHHHHHhCc----ccHHHHHHHHHHHHHcCChhhHHHHHHHHHh-----cCCCccchhhhh----hHHHHh-CCChH
Confidence            456666666554    67778888888899999855  356666652     23    999998    667655 33444


Q ss_pred             HHHHHH--------------HHHHHHHH---------HHHHHHHHhhccCCCCCCCchhHHHHHHhcchhhhhhhhhHHH
Q 044511          126 KGMLAL--------------IFRDATSF---------TTAYLKEWVIKHDSNKNDDEHLCTLVNHALELPLHWRMLRLEA  182 (343)
Q Consensus       126 ~glLsL--------------iLdeA~~F---------s~~~L~~~~~~~~~~~~~~~~l~~eV~~aL~~P~~~~l~Rlea  182 (343)
                      .+|--+              .++++.-.         ++.+|+..+...   ....+.+.     .+--+.+..=.+.+|
T Consensus       121 eA~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~raL~~d---p~~~~~~l-----~~a~~~~~~~~y~~A  192 (250)
T COG3063         121 EAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKRALELD---PQFPPALL-----ELARLHYKAGDYAPA  192 (250)
T ss_pred             HHHHHHHHHHhCCCCCCcchhhhhhHHHHhhcCCchhHHHHHHHHHHhC---cCCChHHH-----HHHHHHHhcccchHH
Confidence            444333              34443332         235666666541   01222222     233355566778899


Q ss_pred             HHHHHHhcCCCCCchH
Q 044511          183 RWFIDVYENGPDMNPI  198 (343)
Q Consensus       183 r~yI~~Y~~~~~~n~~  198 (343)
                      |.|++.|.+....+..
T Consensus       193 r~~~~~~~~~~~~~A~  208 (250)
T COG3063         193 RLYLERYQQRGGAQAE  208 (250)
T ss_pred             HHHHHHHHhcccccHH
Confidence            9999999987765544


No 11 
>KOG3951 consensus Uncharacterized conserved protein [Function unknown]
Probab=30.18  E-value=82  Score=30.70  Aligned_cols=38  Identities=34%  Similarity=0.447  Sum_probs=30.3

Q ss_pred             ccccccccccccCCCCCCChHHHHHH----------HHHHHHHHHHHHHH
Q 044511          106 FSRFMDEKGRFKSSGHSDDGKGMLAL----------IFRDATSFTTAYLK  145 (343)
Q Consensus       106 F~~F~d~~G~F~~~~l~~dv~glLsL----------iLdeA~~Fs~~~L~  145 (343)
                      |++- +.+|-|... ..-|++|-..+          -|=.|..||++||.
T Consensus       260 yDHV-hp~GAFv~~-s~iDmkgcvrllk~q~p~~~e~LLnaLRfTTKHlN  307 (321)
T KOG3951|consen  260 YDHV-HPNGAFVSN-SSIDMKGCVRLLKLQPPEQSECLLNALRFTTKHLN  307 (321)
T ss_pred             eecc-ccccccccc-CcCcHHHHHHHHHcCCchhhHHHHHHHHHHHhhcC
Confidence            4433 678999766 78899998887          57789999999983


No 12 
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.15  E-value=39  Score=31.65  Aligned_cols=34  Identities=24%  Similarity=0.627  Sum_probs=23.4

Q ss_pred             HhHHHHHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCc
Q 044511          215 QENLKYASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQF  258 (343)
Q Consensus       215 q~EL~~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~  258 (343)
                      |+|+..|.+||++.|-.          ++-....++|.+|.=+|
T Consensus         8 ~qql~~ik~wwkeNGk~----------li~gviLg~~~lfGW~y   41 (207)
T COG2976           8 QQQLEAIKDWWKENGKA----------LIVGVILGLGGLFGWRY   41 (207)
T ss_pred             HHHHHHHHHHHHHCCch----------hHHHHHHHHHHHHHHHH
Confidence            78999999999999965          33344455555554443


No 13 
>KOG3906 consensus Tryptophan 2,3-dioxygenase [Amino acid transport and metabolism]
Probab=24.58  E-value=1.7e+02  Score=28.93  Aligned_cols=29  Identities=21%  Similarity=0.447  Sum_probs=21.8

Q ss_pred             HHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCC
Q 044511           53 EQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGN   85 (343)
Q Consensus        53 ~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI   85 (343)
                      ..||+||.+..    .|-...|.+|-.|.|.-+
T Consensus        87 DsvR~~l~~~v----~DEtktLkiVsrl~Rv~~  115 (399)
T KOG3906|consen   87 DSVRKLLNNTV----VDETKTLKIVSRLDRVTK  115 (399)
T ss_pred             HHHHHHhcchh----hcchhHHHHHHhHHHHHH
Confidence            78999998753    465677888888877665


No 14 
>KOG1914 consensus mRNA cleavage and polyadenylation factor I complex, subunit RNA14 [RNA processing and modification]
Probab=23.58  E-value=1.7e+02  Score=31.64  Aligned_cols=103  Identities=17%  Similarity=0.252  Sum_probs=66.9

Q ss_pred             hHHHhhhhHhHHHHHHHH--HHHhCCCCCCCC-----cccchhhHHHHHhh-ccccCCCcchhhHHHHHHHHHHHHHHHH
Q 044511          207 FNIVQAVHQENLKYASRW--WKKTGLGGENLN-----FVRDRIVENFFWSV-GEKFEPQFGYFRRMSTMVNALITAIDDV  278 (343)
Q Consensus       207 Fn~~Qs~hq~EL~~lsrW--wke~~l~~~~L~-----faRdr~ve~Yfw~~-~~~fEP~~s~~Ri~~tK~~~litviDD~  278 (343)
                      +-+=|---..|.+++++|  |-++.-. ..|.     ---.|++.+|=-++ .+.|-|+      +|--.+..+.-+.|+
T Consensus       223 ~~vp~~~T~~e~~qv~~W~n~I~wEks-NpL~t~~~~~~~~Rv~yayeQ~ll~l~~~pe------iWy~~s~yl~~~s~l  295 (656)
T KOG1914|consen  223 PAVPPKGTKDEIQQVELWKNWIKWEKS-NPLRTLDGTMLTRRVMYAYEQCLLYLGYHPE------IWYDYSMYLIEISDL  295 (656)
T ss_pred             CCCCCCCChHHHHHHHHHHHHHHHHhc-CCcccccccHHHHHHHHHHHHHHHHHhcCHH------HHHHHHHHHHHhhHH
Confidence            334455667789999999  6654432 2222     22358887775443 3456664      677778888889999


Q ss_pred             hhccCCHHHHHHHHHHHhhcCchhhccCChHHHHHHHH
Q 044511          279 YDVYGTLEELEIFTDAVERWDATAVEQLPHYMKLCFHA  316 (343)
Q Consensus       279 yD~yGTleEl~~ft~averWD~~~~~~Lpeymk~~f~a  316 (343)
                      ++.+|..++..+||+-...-=..+++.+...-+..|-+
T Consensus       296 ~~~~~d~~~a~~~t~e~~~~yEr~I~~l~~~~~~Ly~~  333 (656)
T KOG1914|consen  296 LTEKGDVPDAKSLTDEAASIYERAIEGLLKENKLLYFA  333 (656)
T ss_pred             HHHhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            99999999999999876543334555554444444433


No 15 
>PF10828 DUF2570:  Protein of unknown function (DUF2570);  InterPro: IPR022538 This entry is represented by Bacteriophage IME08, pseT.3. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.  This is a family of proteins with unknown function. 
Probab=21.66  E-value=1.2e+02  Score=25.10  Aligned_cols=34  Identities=32%  Similarity=0.450  Sum_probs=23.8

Q ss_pred             HHHHHhhHHHHHHHHhhcCCC---cCCCCchhhHHHHHHHHHh
Q 044511           44 YAKQLEKPKEQVSAMLQQDDK---VVDLDPLHQLELIDNLHRL   83 (343)
Q Consensus        44 ~~~~~~~Lk~~Vk~~l~~~~~---~~~~d~~~~L~lID~LqRL   83 (343)
                      ...+.++.++.+|..+.+.+|   .-|.+      .||.|+||
T Consensus        72 ~r~~~e~~~e~ik~~lk~d~Ca~~~~P~~------V~d~L~~~  108 (110)
T PF10828_consen   72 LRQQSEERRESIKTALKDDPCANTAVPDA------VIDSLRRL  108 (110)
T ss_pred             HHHHHHHHHHHHHHHHccCccccCCCCHH------HHHHHHHh
Confidence            345667777888888887555   34443      78888887


No 16 
>TIGR00636 PduO_Nterm ATP:cob(I)alamin adenosyltransferase. This model represents an ATP:cob(I)alamin adenosyltransferase family corresponding to the N-terminal half of Salmonella PduO, a 1,2-propanediol utilization protein that probably is bifunctional. PduO represents one of at least three families of ATP:corrinoid adenosyltransferase: others are CobA (which partially complements PduO) and EutT. It was not clear originally whether ATP:cob(I)alamin adenosyltransferase activity resides in the N-terminal region of PduO, modeled here, but this has now become clear from the characterization of MeaD from Methylobacterium extorquens.
Probab=21.06  E-value=1.7e+02  Score=26.46  Aligned_cols=21  Identities=33%  Similarity=0.536  Sum_probs=17.3

Q ss_pred             HHHhhccCCHHHHHHHHHHHh
Q 044511          276 DDVYDVYGTLEELEIFTDAVE  296 (343)
Q Consensus       276 DD~yD~yGTleEl~~ft~ave  296 (343)
                      |...+.|||+|||..++-.+.
T Consensus        22 d~riea~Gt~DElns~iGl~~   42 (171)
T TIGR00636        22 SPRVEAYGTLDELNSFIGVAL   42 (171)
T ss_pred             CccceehhhHHHHHHHHHHHH
Confidence            445789999999999887764


No 17 
>PF12626 PolyA_pol_arg_C:  Polymerase A arginine-rich C-terminus; PDB: 3AQN_A 3AQK_A 3AQM_B 3AQL_B.
Probab=20.07  E-value=1.3e+02  Score=25.87  Aligned_cols=28  Identities=29%  Similarity=0.528  Sum_probs=24.0

Q ss_pred             HHhhhHHHhhhhHhHHHHHHHHHHHhCC
Q 044511          203 AKVDFNIVQAVHQENLKYASRWWKKTGL  230 (343)
Q Consensus       203 AKlDFn~~Qs~hq~EL~~lsrWwke~~l  230 (343)
                      |-.||=.|.+.--+++.+|..||.+.--
T Consensus        65 AAyDFL~LR~~~ge~~~~l~~WW~~fq~   92 (124)
T PF12626_consen   65 AAYDFLLLRAEAGEELSELAEWWTEFQE   92 (124)
T ss_dssp             HHHHHHHHHHHH-HHHHHHHHHHHHHTT
T ss_pred             HHHHHHHHHHHhCCCcHHHHHHHHHHHh
Confidence            6789999988889999999999999754


Done!