Query 044511
Match_columns 343
No_of_seqs 151 out of 733
Neff 5.7
Searched_HMMs 46136
Date Fri Mar 29 03:56:17 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/044511.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/044511hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00684 Terpene_cyclase_plant_ 100.0 1.5E-95 3E-100 751.9 30.1 319 17-343 1-363 (542)
2 PLN02279 ent-kaur-16-ene synth 100.0 1.5E-88 3.3E-93 719.9 26.5 294 45-343 245-597 (784)
3 PLN02592 ent-copalyl diphospha 100.0 1.5E-86 3.2E-91 703.2 27.6 295 45-343 285-658 (800)
4 PF01397 Terpene_synth: Terpen 100.0 1.6E-35 3.5E-40 267.6 8.9 135 27-169 1-183 (183)
5 PF03936 Terpene_synth_C: Terp 99.9 9.6E-23 2.1E-27 189.7 10.5 134 199-333 1-134 (270)
6 cd00868 Terpene_cyclase_C1 Ter 99.8 3.2E-19 7E-24 167.2 12.7 122 213-335 1-122 (284)
7 cd00687 Terpene_cyclase_nonpla 98.2 2.2E-06 4.8E-11 82.0 5.0 107 220-328 18-126 (303)
8 cd00385 Isoprenoid_Biosyn_C1 I 96.6 0.0015 3.3E-08 57.8 3.2 75 247-327 1-75 (243)
9 PF14165 YtzH: YtzH-like prote 48.9 68 0.0015 26.1 6.1 48 270-319 9-57 (87)
10 COG3063 PilF Tfp pilus assembl 45.5 1.5E+02 0.0032 28.6 8.8 125 52-198 55-208 (250)
11 KOG3951 Uncharacterized conser 30.2 82 0.0018 30.7 4.5 38 106-145 260-307 (321)
12 COG2976 Uncharacterized protei 29.2 39 0.00084 31.7 2.1 34 215-258 8-41 (207)
13 KOG3906 Tryptophan 2,3-dioxyge 24.6 1.7E+02 0.0036 28.9 5.6 29 53-85 87-115 (399)
14 KOG1914 mRNA cleavage and poly 23.6 1.7E+02 0.0036 31.6 5.8 103 207-316 223-333 (656)
15 PF10828 DUF2570: Protein of u 21.7 1.2E+02 0.0027 25.1 3.6 34 44-83 72-108 (110)
16 TIGR00636 PduO_Nterm ATP:cob(I 21.1 1.7E+02 0.0037 26.5 4.7 21 276-296 22-42 (171)
17 PF12626 PolyA_pol_arg_C: Poly 20.1 1.3E+02 0.0027 25.9 3.4 28 203-230 65-92 (124)
No 1
>cd00684 Terpene_cyclase_plant_C1 Plant Terpene Cyclases, Class 1. This CD includes a diverse group of monomeric plant terpene cyclases (Tpsa-Tpsf) that convert the acyclic isoprenoid diphosphates, geranyl diphosphate (GPP), farnesyl diphosphate (FPP), or geranylgeranyl diphosphate (GGPP) into cyclic monoterpenes, sesquiterpenes, or diterpenes, respectively; a few form acyclic species. Terpenoid cyclases are soluble enzymes localized to the cytosol (sesquiterpene synthases) or plastids (mono- and diterpene synthases). All monoterpene and diterpene synthases have strict substrate specificity; however, some sesquiterpene synthases can accept both FPP and GPP. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl diphosphates, via bridging Mg2+ ions (K+ preferred by gymnosperm cyclases), inducing conformational changes such that an N-terminal regi
Probab=100.00 E-value=1.5e-95 Score=751.93 Aligned_cols=319 Identities=50% Similarity=0.822 Sum_probs=299.0
Q ss_pred CCCCCCCCCCCCCCccccCCCccchh-HHHHHHhhHHHHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCCCC--------
Q 044511 17 RRSADYGPTIWSFDYIQSLDSKYKGE-SYAKQLEKPKEQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGNTN-------- 87 (343)
Q Consensus 17 r~~~~~~ps~W~~~fl~~~~~~~~~~-~~~~~~~~Lk~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI~~-------- 87 (343)
|++++|+||+|||+++++.++++... .+.+++++||++||+|+.... ||.|++++|++||+||||||++
T Consensus 1 r~~~~~~~~~w~~~~~~s~~~~~~~~~~~~~~~~~lk~~v~~~~~~~~--~~~~~~~~l~liD~lqrLGi~~hF~~EI~~ 78 (542)
T cd00684 1 RPSANFPPSLWGDDHFLSLSSDYSEEDELEEEIEELKEEVRKMLEDSE--YPVDLFERLWLIDRLQRLGISYHFEDEIKE 78 (542)
T ss_pred CCCCCCCCCcCCCcceeecCCCcchhHHHHHHHHHHHHHHHHHHHhcc--cCCCHHHHHHHHHHHHHcCchhhhHHHHHH
Confidence 78999999999995555555544433 688999999999999998652 5689999999999999999931
Q ss_pred -----------------CchhhhhHHHhhhccc------ccccccccccccccCCCCCCChHHHHHH------------H
Q 044511 88 -----------------KSLYATALKFRVLRQY------ETFSRFMDEKGRFKSSGHSDDGKGMLAL------------I 132 (343)
Q Consensus 88 -----------------~dL~~~AL~FRLLR~h------DvF~~F~d~~G~F~~~~l~~dv~glLsL------------i 132 (343)
.||++|||+||||||| |||++|+|++|+|+++ +.+||+||||| |
T Consensus 79 ~L~~i~~~~~~~~~~~~~dl~~~al~FRlLR~~Gy~vs~dvf~~F~~~~g~f~~~-~~~d~~g~l~Ly~As~l~~~gE~i 157 (542)
T cd00684 79 ILDYIYRYWTERGESNEDDLYTTALGFRLLRQHGYNVSSDVFKKFKDEDGKFKES-LTQDVKGMLSLYEASHLSFPGEDI 157 (542)
T ss_pred HHHHHHHhhcccccccCCCHHHHHHHHHHHHHcCCCcCHHHHhhhcCCCCCcCch-hhhhhHHHHHHHHHhhcCCCCcHH
Confidence 4999999999999999 9999999999999999 99999999999 9
Q ss_pred HHHHHHHHHHHHHHHhhccCCCCCCCchhHHHHHHhcchhhhhhhhhHHHHHHHHHhcCCCCCchHHHHHHHhhhHHHhh
Q 044511 133 FRDATSFTTAYLKEWVIKHDSNKNDDEHLCTLVNHALELPLHWRMLRLEARWFIDVYENGPDMNPILLELAKVDFNIVQA 212 (343)
Q Consensus 133 LdeA~~Fs~~~L~~~~~~~~~~~~~~~~l~~eV~~aL~~P~~~~l~Rlear~yI~~Y~~~~~~n~~lLelAKlDFn~~Qs 212 (343)
||||++||++||++.++++ +.++++|+++|++||++|||+++||||||+||++|++++++|++||||||+|||+||+
T Consensus 158 LdeA~~ft~~~L~~~~~~~---~~~~~~l~~~V~~aL~~P~~~~~~rlear~yi~~Y~~~~~~n~~lLelAkldfn~~Q~ 234 (542)
T cd00684 158 LDEALSFTTKHLEEKLESN---WIIDPDLSGEIEYALEIPLHASLPRLEARWYIEFYEQEDDHNETLLELAKLDFNILQA 234 (542)
T ss_pred HHHHHHHHHHHHHHHhhcc---CCCCchHHHHHHHHccCchhcCCchHHHHHHHHHhCCCccccHHHHHHHHHHHHHHhH
Confidence 9999999999999999864 2367899999999999999999999999999999999999999999999999999999
Q ss_pred hhHhHHHHHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHHHHHHHHHHHHHHhhccCCHHHHHHHH
Q 044511 213 VHQENLKYASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMSTMVNALITAIDDVYDVYGTLEELEIFT 292 (343)
Q Consensus 213 ~hq~EL~~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~tK~~~litviDD~yD~yGTleEl~~ft 292 (343)
+||+||++++|||+++||. ++|||+|+|++|||||++|++|||++|.+|+++||++++++++||+||+|||+|||+.||
T Consensus 235 ~hq~El~~~~rWwk~~gL~-~~l~~aRdr~ve~yf~~~a~~feP~~s~~Rl~~aK~~~l~~~iDD~fD~~gt~eEl~~ft 313 (542)
T cd00684 235 LHQEELKILSRWWKDLDLA-SKLPFARDRLVECYFWAAGTYFEPQYSLARIALAKTIALITVIDDTYDVYGTLEELELFT 313 (542)
T ss_pred hHHHHHHHHhHHHHhcCCc-ccCCcccchhHHHHHHHHhcccCccchHHHHHHHHHHHHHhhhHhhhccCCCHHHHHHHH
Confidence 9999999999999999998 888999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhhcCchhhccCChHHHHHHHHHHHHHHHHHHHHHHccCchhhhhhhhC
Q 044511 293 DAVERWDATAVEQLPHYMKLCFHALRNSINEMTFDVLRDQGVDILISYLKK 343 (343)
Q Consensus 293 ~averWD~~~~~~Lpeymk~~f~al~~t~~ei~~~~~~~~g~~~~~~~lk~ 343 (343)
+||+|||.+++++||+|||+||.+|+++++|+++++.+.+|.+ +++|+++
T Consensus 314 ~ai~rwd~~~~~~lPe~mk~~~~al~~~~~ei~~~~~~~~~~~-~~~~~~~ 363 (542)
T cd00684 314 EAVERWDISAIDQLPEYMKIVFKALLNTVNEIEEELLKEGGSY-VVPYLKE 363 (542)
T ss_pred HHHHhccccchhhccHHHHHHHHHHHHHHHHHHHHHHHhcCcc-hHHHHHH
Confidence 9999999999999999999999999999999999999999988 8888763
No 2
>PLN02279 ent-kaur-16-ene synthase
Probab=100.00 E-value=1.5e-88 Score=719.92 Aligned_cols=294 Identities=29% Similarity=0.430 Sum_probs=276.5
Q ss_pred HHHHhhHHHHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCCCC-------------------------CchhhhhHHHhh
Q 044511 45 AKQLEKPKEQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGNTN-------------------------KSLYATALKFRV 99 (343)
Q Consensus 45 ~~~~~~Lk~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI~~-------------------------~dL~~~AL~FRL 99 (343)
.++.++|..-|++.-+++|++||.+.++++|+||+||||||++ .|+++|||+|||
T Consensus 245 ~~~~~yL~~~~~~~~g~vP~~yp~~~fe~l~lvd~L~rlGi~~hF~~EI~~~L~~~~~~~~~~~~~~~~Dl~~tAl~FRL 324 (784)
T PLN02279 245 AGCLRYLRSLLQKFGNAVPTVYPLDQYARLSMVDTLERLGIDRHFRKEIKSVLDETYRYWLQGEEEIFLDLATCALAFRI 324 (784)
T ss_pred hHHHHHHHHHHHhcCCCCCCCCcccHHHHhHHHHHHHHhCCccccHHHHHHHHHHHHHhhcccccCCCCCHHHHHHHHHH
Confidence 4788999999999888899999999999999999999999932 499999999999
Q ss_pred hccc------ccccccccccccccCC--CCCCChHHHHHH------------HHHHHHHHHHHHHHHHhhccCC-CCCCC
Q 044511 100 LRQY------ETFSRFMDEKGRFKSS--GHSDDGKGMLAL------------IFRDATSFTTAYLKEWVIKHDS-NKNDD 158 (343)
Q Consensus 100 LR~h------DvF~~F~d~~G~F~~~--~l~~dv~glLsL------------iLdeA~~Fs~~~L~~~~~~~~~-~~~~~ 158 (343)
|||| |||++|+|+ + |+++ |..+||+||||| |||||+.||++||++.++++.+ ++.++
T Consensus 325 LR~hGy~VS~dvf~~F~~~-~-F~~~l~~~~~dv~gmL~LY~AS~l~~~gE~iLdeA~~Fs~~~L~~~~~~~~~~~~~~~ 402 (784)
T PLN02279 325 LRLNGYDVSSDPLKQFAED-H-FSDSLGGYLKDTGAVLELFRASQISYPDESLLEKQNSWTSHFLEQGLSNWSKTADRLR 402 (784)
T ss_pred HHHcCCCCChhHHhhcCCC-c-ccchhcccchhhHHHHHHHHHHhcCCCccHHHHHHHHHHHHHHHHHHhcccccccccC
Confidence 9999 999999975 4 9887 236999999999 9999999999999999886543 34467
Q ss_pred chhHHHHHHhcchhhhhhhhhHHHHHHHHHhcCCCC------------CchHHHHHHHhhhHHHhhhhHhHHHHHHHHHH
Q 044511 159 EHLCTLVNHALELPLHWRMLRLEARWFIDVYENGPD------------MNPILLELAKVDFNIVQAVHQENLKYASRWWK 226 (343)
Q Consensus 159 ~~l~~eV~~aL~~P~~~~l~Rlear~yI~~Y~~~~~------------~n~~lLelAKlDFn~~Qs~hq~EL~~lsrWwk 226 (343)
++|+++|+|||++|||+++||||||+||++|++++. +|++||||||+|||+||++||+||++|+|||+
T Consensus 403 ~~L~~eV~~AL~~P~~~~l~RlEaR~yI~~Y~~~~~~i~Kt~yr~~~~~n~~lLeLAklDFN~~Qs~hq~EL~~l~rWwk 482 (784)
T PLN02279 403 KYIKKEVEDALNFPYYANLERLANRRSIENYAVDDTRILKTSYRCSNICNQDFLKLAVEDFNFCQSIHREELKQLERWIV 482 (784)
T ss_pred ccHHHHHHHHhcCchhcCccHHHHHHHHHHhccccchhccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhCeeHH
Confidence 889999999999999999999999999999998885 89999999999999999999999999999999
Q ss_pred HhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHhhcCch-hhcc
Q 044511 227 KTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMSTMVNALITAIDDVYDVYGTLEELEIFTDAVERWDAT-AVEQ 305 (343)
Q Consensus 227 e~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~tK~~~litviDD~yD~yGTleEl~~ft~averWD~~-~~~~ 305 (343)
++||. +|||||||+||||||++|++||||||.+|++|||+++|+|++||+||+|||+|||++||+||+|||++ .++.
T Consensus 483 e~~L~--~L~faRdr~ve~Yf~aaa~~fEPe~S~aRi~~aK~~~L~tviDD~fD~yGt~eEL~~ft~aVeRWD~~~~~~~ 560 (784)
T PLN02279 483 ENRLD--KLKFARQKLAYCYFSAAATLFSPELSDARLSWAKNGVLTTVVDDFFDVGGSEEELENLIQLVEKWDVNGSPDF 560 (784)
T ss_pred hcCCc--cCCchhhHHHHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHHHhccccchhh
Confidence 99996 99999999999999999999999999999999999999999999999999999999999999999998 6699
Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHccCchhhhhhhhC
Q 044511 306 LPHYMKLCFHALRNSINEMTFDVLRDQGVDILISYLKK 343 (343)
Q Consensus 306 Lpeymk~~f~al~~t~~ei~~~~~~~~g~~~~~~~lk~ 343 (343)
||+|||+||.+|++|+|||+.++.+.||++ +++|+++
T Consensus 561 lpeymki~f~aL~~t~nei~~~~~~~qGr~-v~~~l~~ 597 (784)
T PLN02279 561 CSEQVEIIFSALRSTISEIGDKAFTWQGRN-VTSHIIK 597 (784)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHcCch-HHHHHHH
Confidence 999999999999999999999999999999 9999875
No 3
>PLN02592 ent-copalyl diphosphate synthase
Probab=100.00 E-value=1.5e-86 Score=703.25 Aligned_cols=295 Identities=25% Similarity=0.385 Sum_probs=276.1
Q ss_pred HHHHhhHHHHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCCCC------------------------------Cchhhhh
Q 044511 45 AKQLEKPKEQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGNTN------------------------------KSLYATA 94 (343)
Q Consensus 45 ~~~~~~Lk~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI~~------------------------------~dL~~~A 94 (343)
.++.++|...|++.-+++|++||.+++++|++||+||||||++ .|+++||
T Consensus 285 ~~cl~YL~~~~~k~~GgVP~vyP~d~fE~LwlVDtLqRLGIs~hF~~EI~~iLd~iy~~w~~~g~~~a~~~~~~Dld~TA 364 (800)
T PLN02592 285 ENCLEYLNKAVQRFNGGVPNVYPVDLFEHIWAVDRLQRLGISRYFEPEIKECIDYVHRYWTENGICWARNSHVHDIDDTA 364 (800)
T ss_pred hHHHHHHHHHHHHcCCCCCCCCCCcHHHHHHHHHHHHHcCCccccHHHHHHHHHHHHHHHhhcCcccccCCCcCCHHHHH
Confidence 4788999999999878899999999999999999999999921 4899999
Q ss_pred HHHhhhccc------ccccccccccccccCC--CCCCChHHHHHH------------HHHHHHHHHHHHHHHHhhccCC-
Q 044511 95 LKFRVLRQY------ETFSRFMDEKGRFKSS--GHSDDGKGMLAL------------IFRDATSFTTAYLKEWVIKHDS- 153 (343)
Q Consensus 95 L~FRLLR~h------DvF~~F~d~~G~F~~~--~l~~dv~glLsL------------iLdeA~~Fs~~~L~~~~~~~~~- 153 (343)
|+||||||| |||++|++ +|+|++. +..+|++|||+| |||+|+.||+++|++.++.+.+
T Consensus 365 LaFRLLRqhGy~VS~DvF~~F~~-~g~F~~~~ge~~~Dv~glL~LYeAS~l~~~gE~iLdeA~~Fs~~~L~~~~~~~~l~ 443 (800)
T PLN02592 365 MGFRLLRLHGHQVSADVFKHFEK-GGEFFCFAGQSTQAVTGMFNLYRASQVLFPGEKILENAKEFSSKFLREKQEANELL 443 (800)
T ss_pred HHHHHHHHcCCCCChHHHHhhcC-CCCccccccccccchHHHHHHHHHHhcCCCcchHHHHHHHHHHHHHHHHhhccccc
Confidence 999999999 99999987 8999855 258999999999 9999999999999998753322
Q ss_pred CC-CCCchhHHHHHHhcchhhhhhhhhHHHHHHHHHhcCCCCC-------------chHHHHHHHhhhHHHhhhhHhHHH
Q 044511 154 NK-NDDEHLCTLVNHALELPLHWRMLRLEARWFIDVYENGPDM-------------NPILLELAKVDFNIVQAVHQENLK 219 (343)
Q Consensus 154 ~~-~~~~~l~~eV~~aL~~P~~~~l~Rlear~yI~~Y~~~~~~-------------n~~lLelAKlDFn~~Qs~hq~EL~ 219 (343)
++ .++++|+++|+|||++|||+++||+|||+||++|++++++ |++||||||+|||+||++||+||+
T Consensus 444 d~~~~~~~L~~eV~~AL~~P~~~~l~RlEaR~yI~~Y~~~~~~~i~Kt~yr~~~~~n~~lLeLAklDFn~~Qs~hq~EL~ 523 (800)
T PLN02592 444 DKWIIMKDLPGEVGFALEIPWYASLPRVETRFYIEQYGGEDDVWIGKTLYRMPYVNNNEYLELAKLDYNNCQALHQLEWD 523 (800)
T ss_pred cccccCccHHHHHHHhccChhhcCcchHHHHHHHHHhcCCcccchhhhhccccccCCHHHHHHHHHHHHHHHHHhHHHHH
Confidence 33 2578899999999999999999999999999999987764 999999999999999999999999
Q ss_pred HHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHh---
Q 044511 220 YASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMSTMVNALITAIDDVYDVYGTLEELEIFTDAVE--- 296 (343)
Q Consensus 220 ~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~tK~~~litviDD~yD~yGTleEl~~ft~ave--- 296 (343)
+++||||++||+ +|||||||++|||||++|++||||||.+|++|||+++|+|++||+||+|||+|||++||++|+
T Consensus 524 ~lsrWwke~~L~--~L~faRdr~ve~Yfwa~~~~feP~~s~~Ri~~aK~~~LitviDD~fD~yGt~eEl~~ft~~v~~~~ 601 (800)
T PLN02592 524 NFQKWYEECNLG--EFGVSRSELLLAYFLAAASIFEPERSHERLAWAKTTVLVEAISSYFNKETSSKQRRAFLHEFGYGY 601 (800)
T ss_pred HHhHHHHhcCCC--cCCcchhHHHHHHHHHHHhhcCccchHHHHHHHHHHHHHHhhcccccCCCCHHHHHHHHHHHHhcc
Confidence 999999999997 899999999999999999999999999999999999999999999999999999999999997
Q ss_pred -----hcCchhhccCCh------HHHHHHHHHHHHHHHHHHHHHHccCchhhhhhhhC
Q 044511 297 -----RWDATAVEQLPH------YMKLCFHALRNSINEMTFDVLRDQGVDILISYLKK 343 (343)
Q Consensus 297 -----rWD~~~~~~Lpe------ymk~~f~al~~t~~ei~~~~~~~~g~~~~~~~lk~ 343 (343)
|||.+++++||+ |||+||.|||||+|||+.++.++||++ +++||++
T Consensus 602 ~~~~~rWd~~~~~~lp~~~~~~~~mki~f~aLy~tineia~~a~~~qGr~-v~~~L~~ 658 (800)
T PLN02592 602 KINGRRSDHHFNDRNMRRSGSVKTGEELVGLLLGTLNQLSLDALEAHGRD-ISHLLRH 658 (800)
T ss_pred cccccccCchhhhcccccccchhHHHHHHHHHHHHHHHHHHHHHHHhCcc-HHHHHHH
Confidence 999999999988 999999999999999999999999999 9999985
No 4
>PF01397 Terpene_synth: Terpene synthase, N-terminal domain; InterPro: IPR001906 Sequences containing this domain belong to the terpene synthase family. It has been suggested that this gene family be designated tps (for terpene synthase). Sequence comparisons reveal similarities between the monoterpene (C10) synthases, sesquiterpene (C15) synthases and the diterpene (C20) synthases. It has been split into six subgroups on the basis of phylogeny, called Tpsa-Tpsf. Tpsa includes vetispiridiene synthase Q39979 from SWISSPROT, 5-epi-aristolochene synthase, Q40577 from SWISSPROT and (+)-delta-cadinene synthase P93665 from SWISSPROT. Tpsb includes (-)-limonene synthase, Q40322 from SWISSPROT. Tpsc includes copalyl diphosphate synthase (kaurene synthase A), O04408 from SWISSPROT. Tpsd includes taxadiene synthase, Q41594 from SWISSPROT, pinene synthase, O24475 from SWISSPROT and myrcene synthase, O24474 from SWISSPROT. Tpse includes ent-kaurene synthase B Q39548 from SWISSPROT. Tpsf includes linalool synthase Q9ZPN5 from SWISSPROT. In the fungus Phaeosphaeria sp. (strain L487) the synthesis of ent-kaurene from geranylgeranyl diphosphate is promoted by a single bifunctional protein.; GO: 0016829 lyase activity, 0008152 metabolic process; PDB: 2ONH_A 2ONG_B 3P5R_A 3P5P_A 3N0F_A 3N0G_B 3PYB_A 3PYA_A 3G4F_A 3G4D_B ....
Probab=100.00 E-value=1.6e-35 Score=267.62 Aligned_cols=135 Identities=47% Similarity=0.729 Sum_probs=119.3
Q ss_pred CCCCccccCCCccc------hhHHHHHHhhHHHHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCCCC-------------
Q 044511 27 WSFDYIQSLDSKYK------GESYAKQLEKPKEQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGNTN------------- 87 (343)
Q Consensus 27 W~~~fl~~~~~~~~------~~~~~~~~~~Lk~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI~~------------- 87 (343)
|||+|++++++.++ .+.+.+++++||++||+|+..+. .+++++|+|||+||||||++
T Consensus 1 W~d~fl~s~s~~~~~~~~~~~~~~~~~~~~Lk~~v~~~l~~~~----~d~~~~L~lID~lqRLGi~yhFe~EI~~~L~~i 76 (183)
T PF01397_consen 1 WGDDFLQSLSPSYTACMQSEDEKCKERAEELKEEVRNMLPASY----PDPLEKLELIDTLQRLGISYHFEDEIKEILDSI 76 (183)
T ss_dssp TTHHHHHHTBHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHSSS----SHHHHHHHHHHHHHHTTCGGGGHHHHHHHHHHH
T ss_pred CCCceecCCCCcchhccchhHHHHHHHHHHHHHHHHHHHhhcC----CCHHHHHHHHHHHHHcCCcHHHHHHHHHHHHHH
Confidence 99999987655543 36788999999999999998873 38999999999999999942
Q ss_pred -----------CchhhhhHHHhhhccc------ccccccccccccccCCCCCCChHHHHHH------------HHHHHHH
Q 044511 88 -----------KSLYATALKFRVLRQY------ETFSRFMDEKGRFKSSGHSDDGKGMLAL------------IFRDATS 138 (343)
Q Consensus 88 -----------~dL~~~AL~FRLLR~h------DvF~~F~d~~G~F~~~~l~~dv~glLsL------------iLdeA~~ 138 (343)
.||++|||+||||||| |||++|+|++|+|+.+ +++||+||||| |||||+.
T Consensus 77 ~~~~~~~~~~~~dL~~~AL~FRLLRqhGy~VS~DvF~~F~d~~g~F~~~-l~~Dv~glLsLYeAS~l~~~gE~iLdeA~~ 155 (183)
T PF01397_consen 77 YRSWDEDNEEIDDLYTTALRFRLLRQHGYYVSSDVFNKFKDEKGNFKES-LSNDVKGLLSLYEASHLRFHGEDILDEARA 155 (183)
T ss_dssp HHTTTTTSHTSSCHHHHHHHHHHHHHTT----GGGGGGGBETTSSBSGG-GGGHHHHHHHHHHHHTT--TT-HHHHHHHH
T ss_pred hhhccccccccCchhHHHHHHHHHHHcCCcccHHHHhCcccCCCccchh-hhHhHHHHHHHHHHHHccCCChHHHHHHHH
Confidence 3999999999999999 9999999999999998 99999999999 9999999
Q ss_pred HHHHHHHHHhhccCCCCCCCchhHHHHHHhc
Q 044511 139 FTTAYLKEWVIKHDSNKNDDEHLCTLVNHAL 169 (343)
Q Consensus 139 Fs~~~L~~~~~~~~~~~~~~~~l~~eV~~aL 169 (343)
||+++|+++++++.. .+++|+++|+|||
T Consensus 156 Ft~~~L~~~~~~~~~---~~~~L~~~V~~AL 183 (183)
T PF01397_consen 156 FTTKHLKSLLSNLSI---PDPHLAKEVKHAL 183 (183)
T ss_dssp HHHHHHHHHHTTTCT---TSCHHHHHHHHHH
T ss_pred HHHHHHHHHhccCCC---CcHHHHHHHHHhC
Confidence 999999999986310 1346999999997
No 5
>PF03936 Terpene_synth_C: Terpene synthase family, metal binding domain; InterPro: IPR005630 Sequences containing this domain belong to the terpene synthase family. It has been suggested that this gene family be designated tps (for terpene synthase). Sequence comparisons reveal similarities between the monoterpene (C10) synthases, sesquiterpene (C15) synthases and the diterpene (C20) synthases. It has been split into six subgroups on the basis of phylogeny, called Tpsa-Tpsf. Tpsa includes vetispiridiene synthase Q39979 from SWISSPROT, 5-epi-aristolochene synthase, Q40577 from SWISSPROT and (+)-delta-cadinene synthase P93665 from SWISSPROT. Tpsb includes (-)-limonene synthase, Q40322 from SWISSPROT. Tpsc includes copalyl diphosphate synthase (kaurene synthase A), O04408 from SWISSPROT. Tpsd includes taxadiene synthase, Q41594 from SWISSPROT, pinene synthase, O24475 from SWISSPROT and myrcene synthase, O24474 from SWISSPROT. Tpse includes ent-kaurene synthase B Q39548 from SWISSPROT. Tpsf includes linalool synthase Q9ZPN5 from SWISSPROT. In the fungus Phaeosphaeria sp. (strain L487) the synthesis of ent-kaurene from geranylgeranyl diphosphate is promoted by a single bifunctional protein.; GO: 0000287 magnesium ion binding, 0016829 lyase activity; PDB: 3PYB_A 3PYA_A 3G4F_A 3G4D_B 3CKE_A 2OA6_D 2E4O_B 3BNY_B 3BNX_A 3LG5_A ....
Probab=99.88 E-value=9.6e-23 Score=189.74 Aligned_cols=134 Identities=35% Similarity=0.453 Sum_probs=128.1
Q ss_pred HHHHHHhhhHHHhhhhHhHHHHHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHHHHHHHHHHHHHH
Q 044511 199 LLELAKVDFNIVQAVHQENLKYASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMSTMVNALITAIDDV 278 (343)
Q Consensus 199 lLelAKlDFn~~Qs~hq~EL~~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~tK~~~litviDD~ 278 (343)
+|+|||+|||+||++||+|++++++||+++|+. .+.+.+|+|++.+|||.++.++.|..+..|+++||..+++.++||+
T Consensus 1 ~~~la~~~~~~~~~~~~~e~~~~~~W~~~~~l~-~~~~~~~~~~~~~~~~~~aa~~~P~~~~~l~~~a~~~~w~f~~DD~ 79 (270)
T PF03936_consen 1 YLELAKRDFPHCQALHQQELEEIDRWVKEFGLF-DEDKAARQRFRQAYFGLLAARFYPDSSDELLAAADWMAWLFIFDDF 79 (270)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCTHH-HHHTTSHHHHHHHHHHHHHHHHSGCGHHHHHHHHHHHHHHHHHHHH
T ss_pred CcccchhhcHhhHHHHHHHHHHHHHHHHHcCCc-cccccchhhhhHhHHhhhhheeCCCcHHHHHHHHhhchheeeeeec
Confidence 689999999999999999999999999999994 4888899999999999999999999888899999999999999999
Q ss_pred hhccCCHHHHHHHHHHHhhcCchhhccCChHHHHHHHHHHHHHHHHHHHHHHccC
Q 044511 279 YDVYGTLEELEIFTDAVERWDATAVEQLPHYMKLCFHALRNSINEMTFDVLRDQG 333 (343)
Q Consensus 279 yD~yGTleEl~~ft~averWD~~~~~~Lpeymk~~f~al~~t~~ei~~~~~~~~g 333 (343)
||..|+.++++.|+++++||++...+.+|++++.++.++.+++++++..+.+.++
T Consensus 80 ~D~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~d~~~r~~~~~~~~~~ 134 (270)
T PF03936_consen 80 FDDGGSAEELEALTDAVERWDPNSGDPLPDPDKPLFRALADIWNRIAARMSPAQR 134 (270)
T ss_dssp HHTTSHHHHHHHHHHHHHHTSSGGGGGSTHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cccccchHHHHHHHHHHhcccccccccccchhHHHHHHHHHHHHHHHHHhhhhhc
Confidence 9999999999999999999999888899999999999999999999999888764
No 6
>cd00868 Terpene_cyclase_C1 Terpene cyclases, Class 1. Terpene cyclases, Class 1 (C1) of the class 1 family of isoprenoid biosynthesis enzymes, which share the 'isoprenoid synthase fold' and convert linear, all-trans, isoprenoids, geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate into numerous cyclic forms of monoterpenes, diterpenes, and sesquiterpenes. Also included in this CD are the cis-trans terpene cyclases such as trichodiene synthase. The class I terpene cyclization reactions proceed via electrophilic alkylations in which a new carbon-carbon single bond is generated through interaction between a highly reactive electron-deficient allylic carbocation and an electron-rich carbon-carbon double bond. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl phosphates via bridging Mg2+ ions, inducing proposed conformational ch
Probab=99.80 E-value=3.2e-19 Score=167.22 Aligned_cols=122 Identities=52% Similarity=0.939 Sum_probs=116.0
Q ss_pred hhHhHHHHHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHHHHHHHHHHHHHHhhccCCHHHHHHHH
Q 044511 213 VHQENLKYASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMSTMVNALITAIDDVYDVYGTLEELEIFT 292 (343)
Q Consensus 213 ~hq~EL~~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~tK~~~litviDD~yD~yGTleEl~~ft 292 (343)
.||+|++++++||+++||. ...+++|.+..++|+|+++++|+|+.+..|+++||.++++.++||.||.+|+.+|+..|+
T Consensus 1 ~~~~e~~~~~~W~~~~~l~-~~~~~~r~~~~~~~~~~a~~~p~~~~~~~l~~~a~~~~~~f~~DD~~D~~~~~~~~~~~~ 79 (284)
T cd00868 1 LHQEELKELSRWWKELGLQ-EKLPFARDRLVECYFWAAGSYFEPQYSEARIALAKTIALLTVIDDTYDDYGTLEELELFT 79 (284)
T ss_pred CCHHHHHHHHHHHHHhCCc-ccCCchhhHhHHHHHHHHHhhcCccchHHHHHHHHHHHHHHHHHhccccCCCHHHHHHHH
Confidence 4999999999999999998 555699999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhhcCchhhccCChHHHHHHHHHHHHHHHHHHHHHHccCch
Q 044511 293 DAVERWDATAVEQLPHYMKLCFHALRNSINEMTFDVLRDQGVD 335 (343)
Q Consensus 293 ~averWD~~~~~~Lpeymk~~f~al~~t~~ei~~~~~~~~g~~ 335 (343)
++++||+...++.+|++++.++.+++++.++++..+.+.+|..
T Consensus 80 ~~~~~~~~~~~~~~p~~~~~~~~~l~d~~~r~~~~~~~~~~~~ 122 (284)
T cd00868 80 EAVERWDISAIDELPEYMKPVFKALYDLVNEIEEELAKEGGSE 122 (284)
T ss_pred HHHHhcChhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHhcCch
Confidence 9999999999999999999999999999999999998877654
No 7
>cd00687 Terpene_cyclase_nonplant_C1 Non-plant Terpene Cyclases, Class 1. This CD includes terpenoid cyclases such as pentalenene synthase and aristolochene synthase which, using an all-trans pathway, catalyze the ionization of farnesyl diphosphate, followed by the formation of a macrocyclic intermediate by bond formation between C1 and either C10 (aristolochene synthase) or C11 (pentalenene synthase), resulting in production of tricyclic hydrocarbon pentalenene or bicyclic hydrocarbon aristolochene. As with other enzymes with the 'terpenoid synthase fold', they have two conserved metal binding motifs, proposed to coordinate Mg2+ ion-bridged binding of the diphosphate moiety of FPP to the enzymes. Metal-triggered substrate ionization initiates catalysis, and the alpha-barrel active site serves as a template to channel and stabilize the conformations of reactive carbocation intermediates through a complex cyclization cascade. These enzymes function in the monomeric form and are found in
Probab=98.15 E-value=2.2e-06 Score=81.99 Aligned_cols=107 Identities=15% Similarity=0.002 Sum_probs=85.0
Q ss_pred HHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCcchhhHHHH-HHHHHHHHHHHHhhcc-CCHHHHHHHHHHHhh
Q 044511 220 YASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQFGYFRRMST-MVNALITAIDDVYDVY-GTLEELEIFTDAVER 297 (343)
Q Consensus 220 ~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~s~~Ri~~t-K~~~litviDD~yD~y-GTleEl~~ft~aver 297 (343)
+...|.++.|+- .=+.+|++.++++|+.++.++.|+.+..|+.++ +.++++.++||.||.. ++++++..+++.+.+
T Consensus 18 ~~~~w~~~~~l~--~~~~~~~~~~~~~~~~~~a~~~P~a~~~~l~l~~~~~~w~f~~DD~~D~~~~~~~~~~~~~~~~~~ 95 (303)
T cd00687 18 EYLEWVLEEMLI--PSEKAEKRFLSADFGDLAALFYPDADDERLMLAADLMAWLFVFDDLLDRDQKSPEDGEAGVTRLLD 95 (303)
T ss_pred HHHHHHHHcCCC--CcchhHHHHhcCCHHHHHhhcCCCCCHHHHHHHHHHHHHHHHhcccCCccccCHHHHHHHHHHHHh
Confidence 467788888664 345899999999999999999999999999655 9999999999999997 599999999998888
Q ss_pred cCchhhccCChHHHHHHHHHHHHHHHHHHHH
Q 044511 298 WDATAVEQLPHYMKLCFHALRNSINEMTFDV 328 (343)
Q Consensus 298 WD~~~~~~Lpeymk~~f~al~~t~~ei~~~~ 328 (343)
|.......-|.....+..++.++...+...+
T Consensus 96 ~~~~~~~~~~~~~~p~~~~~~d~~~r~~~~~ 126 (303)
T cd00687 96 ILRGDGLDSPDDATPLEFGLADLWRRTLARM 126 (303)
T ss_pred ccCCCCCCCCCCCCHHHHHHHHHHHHhccCC
Confidence 7654221114666777777777777665543
No 8
>cd00385 Isoprenoid_Biosyn_C1 Isoprenoid Biosynthesis enzymes, Class 1. Superfamily of trans-isoprenyl diphosphate synthases (IPPS) and class I terpene cyclases which either synthesize geranyl/farnesyl diphosphates (GPP/FPP) or longer chained products from isoprene precursors, isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP), or use geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate as substrate. These enzymes produce a myriad of precursors for such end products as steroids, cholesterol, sesquiterpenes, heme, carotenoids, retinoids, and diterpenes; and are widely distributed among archaea, bacteria, and eukaryota. The enzymes in this superfamily share the same 'isoprenoid synthase fold' and include several subgroups. The head-to-tail (HT) IPPS catalyze the successive 1'-4 condensation of the 5-carbon IPP to the growing isoprene chain to form linear, all-trans, C10-, C15-, C20-, C25-, C30-, C35-, C40-, C45-, or C50-isoprenoid diphosphates. Cyclic monoter
Probab=96.64 E-value=0.0015 Score=57.78 Aligned_cols=75 Identities=24% Similarity=0.263 Sum_probs=58.3
Q ss_pred HHhhccccCCCcchhhHHHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHhhcCchhhccCChHHHHHHHHHHHHHHHHHH
Q 044511 247 FWSVGEKFEPQFGYFRRMSTMVNALITAIDDVYDVYGTLEELEIFTDAVERWDATAVEQLPHYMKLCFHALRNSINEMTF 326 (343)
Q Consensus 247 fw~~~~~fEP~~s~~Ri~~tK~~~litviDD~yD~yGTleEl~~ft~averWD~~~~~~Lpeymk~~f~al~~t~~ei~~ 326 (343)
+|+++++++|+++..|..+++..++..++||++|..++..+.......+ .....|.++...+..+...+.++..
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~DDi~D~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (243)
T cd00385 1 FRPLAVLLEPEASRLRAAVEKLHAASLVHDDIVDDSGTRRGLPTAHLAV------AIDGLPEAILAGDLLLADAFEELAR 74 (243)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCchhhhhhH------HhcCchHHHHHHHHHHHHHHHHHHh
Confidence 3567889999999999999999999999999999998887776655544 2344667777777777777666654
Q ss_pred H
Q 044511 327 D 327 (343)
Q Consensus 327 ~ 327 (343)
.
T Consensus 75 ~ 75 (243)
T cd00385 75 E 75 (243)
T ss_pred C
Confidence 3
No 9
>PF14165 YtzH: YtzH-like protein
Probab=48.94 E-value=68 Score=26.13 Aligned_cols=48 Identities=19% Similarity=0.219 Sum_probs=33.3
Q ss_pred HHHHHHHHH-hhccCCHHHHHHHHHHHhhcCchhhccCChHHHHHHHHHHH
Q 044511 270 ALITAIDDV-YDVYGTLEELEIFTDAVERWDATAVEQLPHYMKLCFHALRN 319 (343)
Q Consensus 270 ~litviDD~-yD~yGTleEl~~ft~averWD~~~~~~Lpeymk~~f~al~~ 319 (343)
.|--++++- -|.+||..|++.+...|+. .-+-+.++..+|-+..-+|+
T Consensus 9 LLkDIL~~hq~DccgTvsEcEQieRLvks--Lm~n~~i~~~ik~~L~~Iy~ 57 (87)
T PF14165_consen 9 LLKDILSNHQLDCCGTVSECEQIERLVKS--LMANPNIDADIKQTLEEIYS 57 (87)
T ss_pred HHHHHHHhhhhhccCcHHHHHHHHHHHHH--HHcCCCcCHHHHHHHHHHHH
Confidence 333444444 4889999999999888877 55556677887766655544
No 10
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=45.47 E-value=1.5e+02 Score=28.61 Aligned_cols=125 Identities=17% Similarity=0.195 Sum_probs=76.2
Q ss_pred HHHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCCCC--CchhhhhHHHhhhccc----ccccccccccccccCCCCCCCh
Q 044511 52 KEQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGNTN--KSLYATALKFRVLRQY----ETFSRFMDEKGRFKSSGHSDDG 125 (343)
Q Consensus 52 k~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI~~--~dL~~~AL~FRLLR~h----DvF~~F~d~~G~F~~~~l~~dv 125 (343)
|..+++.|...| +.....+-+--.-|++|-.+ .+-|..||. .+ ||.|+| |.|-|+ ...-.
T Consensus 55 ~~nlekAL~~DP----s~~~a~~~~A~~Yq~~Ge~~~A~e~YrkAls-----l~p~~GdVLNNY----G~FLC~-qg~~~ 120 (250)
T COG3063 55 KKNLEKALEHDP----SYYLAHLVRAHYYQKLGENDLADESYRKALS-----LAPNNGDVLNNY----GAFLCA-QGRPE 120 (250)
T ss_pred HHHHHHHHHhCc----ccHHHHHHHHHHHHHcCChhhHHHHHHHHHh-----cCCCccchhhhh----hHHHHh-CCChH
Confidence 456666666554 67778888888899999855 356666652 23 999998 667655 33444
Q ss_pred HHHHHH--------------HHHHHHHH---------HHHHHHHHhhccCCCCCCCchhHHHHHHhcchhhhhhhhhHHH
Q 044511 126 KGMLAL--------------IFRDATSF---------TTAYLKEWVIKHDSNKNDDEHLCTLVNHALELPLHWRMLRLEA 182 (343)
Q Consensus 126 ~glLsL--------------iLdeA~~F---------s~~~L~~~~~~~~~~~~~~~~l~~eV~~aL~~P~~~~l~Rlea 182 (343)
.+|--+ .++++.-. ++.+|+..+... ....+.+. .+--+.+..=.+.+|
T Consensus 121 eA~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~raL~~d---p~~~~~~l-----~~a~~~~~~~~y~~A 192 (250)
T COG3063 121 EAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKRALELD---PQFPPALL-----ELARLHYKAGDYAPA 192 (250)
T ss_pred HHHHHHHHHHhCCCCCCcchhhhhhHHHHhhcCCchhHHHHHHHHHHhC---cCCChHHH-----HHHHHHHhcccchHH
Confidence 444333 34443332 235666666541 01222222 233355566778899
Q ss_pred HHHHHHhcCCCCCchH
Q 044511 183 RWFIDVYENGPDMNPI 198 (343)
Q Consensus 183 r~yI~~Y~~~~~~n~~ 198 (343)
|.|++.|.+....+..
T Consensus 193 r~~~~~~~~~~~~~A~ 208 (250)
T COG3063 193 RLYLERYQQRGGAQAE 208 (250)
T ss_pred HHHHHHHHhcccccHH
Confidence 9999999987765544
No 11
>KOG3951 consensus Uncharacterized conserved protein [Function unknown]
Probab=30.18 E-value=82 Score=30.70 Aligned_cols=38 Identities=34% Similarity=0.447 Sum_probs=30.3
Q ss_pred ccccccccccccCCCCCCChHHHHHH----------HHHHHHHHHHHHHH
Q 044511 106 FSRFMDEKGRFKSSGHSDDGKGMLAL----------IFRDATSFTTAYLK 145 (343)
Q Consensus 106 F~~F~d~~G~F~~~~l~~dv~glLsL----------iLdeA~~Fs~~~L~ 145 (343)
|++- +.+|-|... ..-|++|-..+ -|=.|..||++||.
T Consensus 260 yDHV-hp~GAFv~~-s~iDmkgcvrllk~q~p~~~e~LLnaLRfTTKHlN 307 (321)
T KOG3951|consen 260 YDHV-HPNGAFVSN-SSIDMKGCVRLLKLQPPEQSECLLNALRFTTKHLN 307 (321)
T ss_pred eecc-ccccccccc-CcCcHHHHHHHHHcCCchhhHHHHHHHHHHHhhcC
Confidence 4433 678999766 78899998887 57789999999983
No 12
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.15 E-value=39 Score=31.65 Aligned_cols=34 Identities=24% Similarity=0.627 Sum_probs=23.4
Q ss_pred HhHHHHHHHHHHHhCCCCCCCCcccchhhHHHHHhhccccCCCc
Q 044511 215 QENLKYASRWWKKTGLGGENLNFVRDRIVENFFWSVGEKFEPQF 258 (343)
Q Consensus 215 q~EL~~lsrWwke~~l~~~~L~faRdr~ve~Yfw~~~~~fEP~~ 258 (343)
|+|+..|.+||++.|-. ++-....++|.+|.=+|
T Consensus 8 ~qql~~ik~wwkeNGk~----------li~gviLg~~~lfGW~y 41 (207)
T COG2976 8 QQQLEAIKDWWKENGKA----------LIVGVILGLGGLFGWRY 41 (207)
T ss_pred HHHHHHHHHHHHHCCch----------hHHHHHHHHHHHHHHHH
Confidence 78999999999999965 33344455555554443
No 13
>KOG3906 consensus Tryptophan 2,3-dioxygenase [Amino acid transport and metabolism]
Probab=24.58 E-value=1.7e+02 Score=28.93 Aligned_cols=29 Identities=21% Similarity=0.447 Sum_probs=21.8
Q ss_pred HHHHHHhhcCCCcCCCCchhhHHHHHHHHHhCC
Q 044511 53 EQVSAMLQQDDKVVDLDPLHQLELIDNLHRLGN 85 (343)
Q Consensus 53 ~~Vk~~l~~~~~~~~~d~~~~L~lID~LqRLGI 85 (343)
..||+||.+.. .|-...|.+|-.|.|.-+
T Consensus 87 DsvR~~l~~~v----~DEtktLkiVsrl~Rv~~ 115 (399)
T KOG3906|consen 87 DSVRKLLNNTV----VDETKTLKIVSRLDRVTK 115 (399)
T ss_pred HHHHHHhcchh----hcchhHHHHHHhHHHHHH
Confidence 78999998753 465677888888877665
No 14
>KOG1914 consensus mRNA cleavage and polyadenylation factor I complex, subunit RNA14 [RNA processing and modification]
Probab=23.58 E-value=1.7e+02 Score=31.64 Aligned_cols=103 Identities=17% Similarity=0.252 Sum_probs=66.9
Q ss_pred hHHHhhhhHhHHHHHHHH--HHHhCCCCCCCC-----cccchhhHHHHHhh-ccccCCCcchhhHHHHHHHHHHHHHHHH
Q 044511 207 FNIVQAVHQENLKYASRW--WKKTGLGGENLN-----FVRDRIVENFFWSV-GEKFEPQFGYFRRMSTMVNALITAIDDV 278 (343)
Q Consensus 207 Fn~~Qs~hq~EL~~lsrW--wke~~l~~~~L~-----faRdr~ve~Yfw~~-~~~fEP~~s~~Ri~~tK~~~litviDD~ 278 (343)
+-+=|---..|.+++++| |-++.-. ..|. ---.|++.+|=-++ .+.|-|+ +|--.+..+.-+.|+
T Consensus 223 ~~vp~~~T~~e~~qv~~W~n~I~wEks-NpL~t~~~~~~~~Rv~yayeQ~ll~l~~~pe------iWy~~s~yl~~~s~l 295 (656)
T KOG1914|consen 223 PAVPPKGTKDEIQQVELWKNWIKWEKS-NPLRTLDGTMLTRRVMYAYEQCLLYLGYHPE------IWYDYSMYLIEISDL 295 (656)
T ss_pred CCCCCCCChHHHHHHHHHHHHHHHHhc-CCcccccccHHHHHHHHHHHHHHHHHhcCHH------HHHHHHHHHHHhhHH
Confidence 334455667789999999 6654432 2222 22358887775443 3456664 677778888889999
Q ss_pred hhccCCHHHHHHHHHHHhhcCchhhccCChHHHHHHHH
Q 044511 279 YDVYGTLEELEIFTDAVERWDATAVEQLPHYMKLCFHA 316 (343)
Q Consensus 279 yD~yGTleEl~~ft~averWD~~~~~~Lpeymk~~f~a 316 (343)
++.+|..++..+||+-...-=..+++.+...-+..|-+
T Consensus 296 ~~~~~d~~~a~~~t~e~~~~yEr~I~~l~~~~~~Ly~~ 333 (656)
T KOG1914|consen 296 LTEKGDVPDAKSLTDEAASIYERAIEGLLKENKLLYFA 333 (656)
T ss_pred HHHhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999876543334555554444444433
No 15
>PF10828 DUF2570: Protein of unknown function (DUF2570); InterPro: IPR022538 This entry is represented by Bacteriophage IME08, pseT.3. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a family of proteins with unknown function.
Probab=21.66 E-value=1.2e+02 Score=25.10 Aligned_cols=34 Identities=32% Similarity=0.450 Sum_probs=23.8
Q ss_pred HHHHHhhHHHHHHHHhhcCCC---cCCCCchhhHHHHHHHHHh
Q 044511 44 YAKQLEKPKEQVSAMLQQDDK---VVDLDPLHQLELIDNLHRL 83 (343)
Q Consensus 44 ~~~~~~~Lk~~Vk~~l~~~~~---~~~~d~~~~L~lID~LqRL 83 (343)
...+.++.++.+|..+.+.+| .-|.+ .||.|+||
T Consensus 72 ~r~~~e~~~e~ik~~lk~d~Ca~~~~P~~------V~d~L~~~ 108 (110)
T PF10828_consen 72 LRQQSEERRESIKTALKDDPCANTAVPDA------VIDSLRRL 108 (110)
T ss_pred HHHHHHHHHHHHHHHHccCccccCCCCHH------HHHHHHHh
Confidence 345667777888888887555 34443 78888887
No 16
>TIGR00636 PduO_Nterm ATP:cob(I)alamin adenosyltransferase. This model represents an ATP:cob(I)alamin adenosyltransferase family corresponding to the N-terminal half of Salmonella PduO, a 1,2-propanediol utilization protein that probably is bifunctional. PduO represents one of at least three families of ATP:corrinoid adenosyltransferase: others are CobA (which partially complements PduO) and EutT. It was not clear originally whether ATP:cob(I)alamin adenosyltransferase activity resides in the N-terminal region of PduO, modeled here, but this has now become clear from the characterization of MeaD from Methylobacterium extorquens.
Probab=21.06 E-value=1.7e+02 Score=26.46 Aligned_cols=21 Identities=33% Similarity=0.536 Sum_probs=17.3
Q ss_pred HHHhhccCCHHHHHHHHHHHh
Q 044511 276 DDVYDVYGTLEELEIFTDAVE 296 (343)
Q Consensus 276 DD~yD~yGTleEl~~ft~ave 296 (343)
|...+.|||+|||..++-.+.
T Consensus 22 d~riea~Gt~DElns~iGl~~ 42 (171)
T TIGR00636 22 SPRVEAYGTLDELNSFIGVAL 42 (171)
T ss_pred CccceehhhHHHHHHHHHHHH
Confidence 445789999999999887764
No 17
>PF12626 PolyA_pol_arg_C: Polymerase A arginine-rich C-terminus; PDB: 3AQN_A 3AQK_A 3AQM_B 3AQL_B.
Probab=20.07 E-value=1.3e+02 Score=25.87 Aligned_cols=28 Identities=29% Similarity=0.528 Sum_probs=24.0
Q ss_pred HHhhhHHHhhhhHhHHHHHHHHHHHhCC
Q 044511 203 AKVDFNIVQAVHQENLKYASRWWKKTGL 230 (343)
Q Consensus 203 AKlDFn~~Qs~hq~EL~~lsrWwke~~l 230 (343)
|-.||=.|.+.--+++.+|..||.+.--
T Consensus 65 AAyDFL~LR~~~ge~~~~l~~WW~~fq~ 92 (124)
T PF12626_consen 65 AAYDFLLLRAEAGEELSELAEWWTEFQE 92 (124)
T ss_dssp HHHHHHHHHHHH-HHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHhCCCcHHHHHHHHHHHh
Confidence 6789999988889999999999999754
Done!