Query 044478
Match_columns 232
No_of_seqs 159 out of 732
Neff 5.4
Searched_HMMs 46136
Date Fri Mar 29 03:37:27 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/044478.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/044478hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00684 Terpene_cyclase_plant_ 100.0 7E-69 1.5E-73 520.2 18.3 205 23-232 1-211 (542)
2 PF01397 Terpene_synth: Terpen 100.0 1.3E-62 2.8E-67 419.9 13.7 174 33-212 1-183 (183)
3 PLN02279 ent-kaur-16-ene synth 100.0 1.1E-49 2.4E-54 399.4 13.1 156 70-232 268-433 (784)
4 PLN02592 ent-copalyl diphospha 100.0 4.6E-48 1E-52 387.3 12.5 160 70-232 308-480 (800)
5 PF14164 YqzH: YqzH-like prote 42.7 13 0.00028 27.1 1.0 19 128-147 11-29 (64)
6 KOG0835 Cyclin L [General func 35.6 94 0.002 29.7 5.8 80 127-225 125-210 (367)
7 KOG3951 Uncharacterized conser 33.6 57 0.0012 30.2 3.9 45 149-195 264-308 (321)
8 PF08373 RAP: RAP domain; Int 32.7 32 0.00069 23.2 1.7 26 121-147 17-42 (58)
9 COG4738 Predicted transcriptio 31.7 69 0.0015 26.1 3.7 85 77-170 14-102 (124)
10 PF11848 DUF3368: Domain of un 29.8 24 0.00053 23.6 0.7 21 127-148 24-44 (48)
11 PF11000 DUF2840: Protein of u 28.5 33 0.00071 28.9 1.4 16 75-90 96-111 (149)
12 cd07604 BAR_ASAPs The Bin/Amph 27.0 1.2E+02 0.0025 26.8 4.7 86 49-150 13-100 (215)
13 KOG3906 Tryptophan 2,3-dioxyge 26.3 1.1E+02 0.0025 28.6 4.6 115 49-178 78-209 (399)
14 COG1309 AcrR Transcriptional r 24.6 1.5E+02 0.0033 22.2 4.5 36 138-173 44-79 (201)
15 PRK15008 HTH-type transcriptio 24.4 88 0.0019 26.3 3.4 44 128-171 28-83 (212)
16 PF14090 HTH_39: Helix-turn-he 22.5 44 0.00095 23.9 1.0 25 127-153 33-57 (70)
No 1
>cd00684 Terpene_cyclase_plant_C1 Plant Terpene Cyclases, Class 1. This CD includes a diverse group of monomeric plant terpene cyclases (Tspa-Tspf) that convert the acyclic isoprenoid diphosphates, geranyl diphosphate (GPP), farnesyl diphosphate (FPP), or geranylgeranyl diphosphate (GGPP) into cyclic monoterpenes, diterpenes, or sesquiterpenes, respectively; a few form acyclic species. Terpnoid cyclases are soluble enzymes localized to the cytosol (sesquiterpene synthases) or plastids (mono- and diterpene synthases). All monoterpene and diterpene synthases have restrict substrate specificity, however, some sesquiterpene synthases can accept both FPP and GPP. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl diphosphates, via bridging Mg2+ ions (K+ preferred by gymnosperm cyclases), inducing conformational changes such that an N-terminal regi
Probab=100.00 E-value=7e-69 Score=520.21 Aligned_cols=205 Identities=48% Similarity=0.804 Sum_probs=187.4
Q ss_pred ccCCCCCCCCccc-cccccCCCCccchhHHHHHHHHHHHHHHHHHhc--cCCccchhhhhHHHHHHhCcccccHHHHHHH
Q 044478 23 RRSANFDPTIWGD-YFLSYTGDFKESGDASVKHQELKKEIRTMLRAD--INKPSQTKLDLIDDIQRLGVSYHFESEIDEI 99 (232)
Q Consensus 23 R~~a~~~PsiW~d-~fl~~~~~~~~~~~~~~~~~~LkeeVr~ml~~~--~~d~~~~~L~lID~lqRLGI~yhFe~EI~~~ 99 (232)
|++++|+||+||| .|+++++++.+...+.+++++||++||+||... +.|++ ++|+|||+||||||+|||++||+++
T Consensus 1 r~~~~~~~~~w~~~~~~s~~~~~~~~~~~~~~~~~lk~~v~~~~~~~~~~~~~~-~~l~liD~lqrLGi~~hF~~EI~~~ 79 (542)
T cd00684 1 RPSANFPPSLWGDDHFLSLSSDYSEEDELEEEIEELKEEVRKMLEDSEYPVDLF-ERLWLIDRLQRLGISYHFEDEIKEI 79 (542)
T ss_pred CCCCCCCCCcCCCcceeecCCCcchhHHHHHHHHHHHHHHHHHHHhcccCCCHH-HHHHHHHHHHHcCchhhhHHHHHHH
Confidence 7899999999999 666766655432268899999999999999864 56899 9999999999999999999999999
Q ss_pred HHHHHhhccCCCCCCCCcccchhhHHHHHHHHhhcccccccccccccccccccccccchhhcHHHHHHHHHHhhcCCCCc
Q 044478 100 LQKMHEANQDCDLGDDENVQELYYISLQFRLLRQNGYKISAADVFNSFKDSNGNFKSFLKRDIRGMLSLYEAAHLRVHGE 179 (232)
Q Consensus 100 L~~i~~~~~~~~~~~~~~~~dL~~~AL~FRLLRqhGy~VS~~DvF~~Fkd~~G~Fk~~l~~Dv~glLsLYeAS~L~~~gE 179 (232)
|++||+.|.+. .+..+.|||+|||+|||||||||+|| ||||++|+|++|+|++++.+||+|||||||||||+++||
T Consensus 80 L~~i~~~~~~~---~~~~~~dl~~~al~FRlLR~~Gy~vs-~dvf~~F~~~~g~f~~~~~~d~~g~l~Ly~As~l~~~gE 155 (542)
T cd00684 80 LDYIYRYWTER---GESNEDDLYTTALGFRLLRQHGYNVS-SDVFKKFKDEDGKFKESLTQDVKGMLSLYEASHLSFPGE 155 (542)
T ss_pred HHHHHHhhccc---ccccCCCHHHHHHHHHHHHHcCCCcC-HHHHhhhcCCCCCcCchhhhhhHHHHHHHHHhhcCCCCc
Confidence 99999987632 11223499999999999999999999 999999999999999999999999999999999999999
Q ss_pred hHHHHHHHHHHHHHHHHhhc---CChhHHHHHHHHcCCCCCCCCChhHHhhcccCC
Q 044478 180 NILNEALTFTVTHLESFTSQ---SNTQLAAQVNRALNRPIRKSLPRLEAKHYMPIY 232 (232)
Q Consensus 180 ~iLdeA~~Ft~~~L~~~~~~---~~~~l~~~V~~aL~~P~h~~l~RleaR~yI~~Y 232 (232)
+|||||+.||++||++.+.. ++++|+++|++||++|+||++||||||+||++|
T Consensus 156 ~iLdeA~~ft~~~L~~~~~~~~~~~~~l~~~V~~aL~~P~~~~~~rlear~yi~~Y 211 (542)
T cd00684 156 DILDEALSFTTKHLEEKLESNWIIDPDLSGEIEYALEIPLHASLPRLEARWYIEFY 211 (542)
T ss_pred HHHHHHHHHHHHHHHHHhhccCCCCchHHHHHHHHccCchhcCCchHHHHHHHHHh
Confidence 99999999999999999886 789999999999999999999999999999988
No 2
>PF01397 Terpene_synth: Terpene synthase, N-terminal domain; InterPro: IPR001906 Sequences containing this domain belong to the terpene synthase family. It has been suggested that this gene family be designated tps (for terpene synthase). Sequence comparisons reveal similarities between the monoterpene (C10) synthases, sesquiterpene (C15) synthases and the diterpene (C20) synthases. It has been split into six subgroups on the basis of phylogeny, called Tpsa-Tpsf []. Tpsa includes vetispiridiene synthase Q39979 from SWISSPROT, 5-epi- aristolochene synthase, Q40577 from SWISSPROT and (+)-delta-cadinene synthase P93665 from SWISSPROT . Tpsb includes (-)-limonene synthase, Q40322 from SWISSPROT. Tpsc includes copalyl diphosphate synthase (kaurene synthase A), O04408 from SWISSPROT. Tpsd includes taxadiene synthase, Q41594 from SWISSPROT, pinene synthase, O24475 from SWISSPROT and myrcene synthase, O24474 from SWISSPROT. Tpse includes ent-kaurene synthase B Q39548 from SWISSPROT. Tpsf includes linalool synthase Q9ZPN5 from SWISSPROT. In the fungus Phaeosphaeria sp. (strain L487) the synthesis of ent-kaurene from geranylgeranyl dophosphate is promoted by a single bifunctional protein [].; GO: 0016829 lyase activity, 0008152 metabolic process; PDB: 2ONH_A 2ONG_B 3P5R_A 3P5P_A 3N0F_A 3N0G_B 3PYB_A 3PYA_A 3G4F_A 3G4D_B ....
Probab=100.00 E-value=1.3e-62 Score=419.89 Aligned_cols=174 Identities=50% Similarity=0.848 Sum_probs=147.2
Q ss_pred cccccccc-CCCCc----cc-hhHHHHHHHHHHHHHHHHHhccCCccchhhhhHHHHHHhCcccccHHHHHHHHHHHHhh
Q 044478 33 WGDYFLSY-TGDFK----ES-GDASVKHQELKKEIRTMLRADINKPSQTKLDLIDDIQRLGVSYHFESEIDEILQKMHEA 106 (232)
Q Consensus 33 W~d~fl~~-~~~~~----~~-~~~~~~~~~LkeeVr~ml~~~~~d~~~~~L~lID~lqRLGI~yhFe~EI~~~L~~i~~~ 106 (232)
|||+||++ ++.+. .. +++.+++++||++||.||.....|++ ++|+|||+||||||+|||++||+++|++||+.
T Consensus 1 W~d~fl~s~s~~~~~~~~~~~~~~~~~~~~Lk~~v~~~l~~~~~d~~-~~L~lID~lqRLGi~yhFe~EI~~~L~~i~~~ 79 (183)
T PF01397_consen 1 WGDDFLQSLSPSYTACMQSEDEKCKERAEELKEEVRNMLPASYPDPL-EKLELIDTLQRLGISYHFEDEIKEILDSIYRS 79 (183)
T ss_dssp TTHHHHHHTBHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHSSSSHHH-HHHHHHHHHHHTTCGGGGHHHHHHHHHHHHHT
T ss_pred CCCceecCCCCcchhccchhHHHHHHHHHHHHHHHHHHHhhcCCCHH-HHHHHHHHHHHcCCcHHHHHHHHHHHHHHhhh
Confidence 99999942 22221 11 38899999999999999987545799 99999999999999999999999999999998
Q ss_pred ccCCCCCCCCcccchhhHHHHHHHHhhcccccccccccccccccccccccchhhcHHHHHHHHHHhhcCCCCchHHHHHH
Q 044478 107 NQDCDLGDDENVQELYYISLQFRLLRQNGYKISAADVFNSFKDSNGNFKSFLKRDIRGMLSLYEAAHLRVHGENILNEAL 186 (232)
Q Consensus 107 ~~~~~~~~~~~~~dL~~~AL~FRLLRqhGy~VS~~DvF~~Fkd~~G~Fk~~l~~Dv~glLsLYeAS~L~~~gE~iLdeA~ 186 (232)
|... .....|||+|||+|||||||||+|| ||||++|+|++|+|++++.+||+|||+|||||||+++||+|||||+
T Consensus 80 ~~~~----~~~~~dL~~~AL~FRLLRqhGy~VS-~DvF~~F~d~~g~F~~~l~~Dv~glLsLYeAS~l~~~gE~iLdeA~ 154 (183)
T PF01397_consen 80 WDED----NEEIDDLYTTALRFRLLRQHGYYVS-SDVFNKFKDEKGNFKESLSNDVKGLLSLYEASHLRFHGEDILDEAR 154 (183)
T ss_dssp TTTT----SHTSSCHHHHHHHHHHHHHTT-----GGGGGGGBETTSSBSGGGGGHHHHHHHHHHHHTT--TT-HHHHHHH
T ss_pred cccc----ccccCchhHHHHHHHHHHHcCCccc-HHHHhCcccCCCccchhhhHhHHHHHHHHHHHHccCCChHHHHHHH
Confidence 7642 1111299999999999999999999 9999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHhhcCCh---hHHHHHHHHc
Q 044478 187 TFTVTHLESFTSQSNT---QLAAQVNRAL 212 (232)
Q Consensus 187 ~Ft~~~L~~~~~~~~~---~l~~~V~~aL 212 (232)
.||+++|++++++..+ +|+++|+|||
T Consensus 155 ~Ft~~~L~~~~~~~~~~~~~L~~~V~~AL 183 (183)
T PF01397_consen 155 AFTTKHLKSLLSNLSIPDPHLAKEVKHAL 183 (183)
T ss_dssp HHHHHHHHHHHTTTCTTSCHHHHHHHHHH
T ss_pred HHHHHHHHHHhccCCCCcHHHHHHHHHhC
Confidence 9999999999986533 4999999997
No 3
>PLN02279 ent-kaur-16-ene synthase
Probab=100.00 E-value=1.1e-49 Score=399.37 Aligned_cols=156 Identities=29% Similarity=0.487 Sum_probs=144.2
Q ss_pred CCccchhhhhHHHHHHhCcccccHHHHHHHHHHHHhhccCCCCCCCCcccchhhHHHHHHHHhhcccccccccccccccc
Q 044478 70 NKPSQTKLDLIDDIQRLGVSYHFESEIDEILQKMHEANQDCDLGDDENVQELYYISLQFRLLRQNGYKISAADVFNSFKD 149 (232)
Q Consensus 70 ~d~~~~~L~lID~lqRLGI~yhFe~EI~~~L~~i~~~~~~~~~~~~~~~~dL~~~AL~FRLLRqhGy~VS~~DvF~~Fkd 149 (232)
.+.+ +++++||+||||||+|||++||+++|+.+|+.|.. +.+..+.||++|||+|||||||||+|| ||||++|+|
T Consensus 268 ~~~f-e~l~lvd~L~rlGi~~hF~~EI~~~L~~~~~~~~~---~~~~~~~Dl~~tAl~FRLLR~hGy~VS-~dvf~~F~~ 342 (784)
T PLN02279 268 LDQY-ARLSMVDTLERLGIDRHFRKEIKSVLDETYRYWLQ---GEEEIFLDLATCALAFRILRLNGYDVS-SDPLKQFAE 342 (784)
T ss_pred ccHH-HHhHHHHHHHHhCCccccHHHHHHHHHHHHHhhcc---cccCCCCCHHHHHHHHHHHHHcCCCCC-hhHHhhcCC
Confidence 5789 99999999999999999999999999999998863 233345599999999999999999999 999999996
Q ss_pred cccccccch---hhcHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHhhc-------CChhHHHHHHHHcCCCCCCC
Q 044478 150 SNGNFKSFL---KRDIRGMLSLYEAAHLRVHGENILNEALTFTVTHLESFTSQ-------SNTQLAAQVNRALNRPIRKS 219 (232)
Q Consensus 150 ~~G~Fk~~l---~~Dv~glLsLYeAS~L~~~gE~iLdeA~~Ft~~~L~~~~~~-------~~~~l~~~V~~aL~~P~h~~ 219 (232)
+ + |++++ .+||+|||+|||||||+++||+|||||+.||++||++.+.. ++++|+++|++||++|||++
T Consensus 343 ~-~-F~~~l~~~~~dv~gmL~LY~AS~l~~~gE~iLdeA~~Fs~~~L~~~~~~~~~~~~~~~~~L~~eV~~AL~~P~~~~ 420 (784)
T PLN02279 343 D-H-FSDSLGGYLKDTGAVLELFRASQISYPDESLLEKQNSWTSHFLEQGLSNWSKTADRLRKYIKKEVEDALNFPYYAN 420 (784)
T ss_pred C-c-ccchhcccchhhHHHHHHHHHHhcCCCccHHHHHHHHHHHHHHHHHHhcccccccccCccHHHHHHHHhcCchhcC
Confidence 5 4 99988 59999999999999999999999999999999999998764 57889999999999999999
Q ss_pred CChhHHhhcccCC
Q 044478 220 LPRLEAKHYMPIY 232 (232)
Q Consensus 220 l~RleaR~yI~~Y 232 (232)
|||||||+||++|
T Consensus 421 l~RlEaR~yI~~Y 433 (784)
T PLN02279 421 LERLANRRSIENY 433 (784)
T ss_pred ccHHHHHHHHHHh
Confidence 9999999999988
No 4
>PLN02592 ent-copalyl diphosphate synthase
Probab=100.00 E-value=4.6e-48 Score=387.34 Aligned_cols=160 Identities=33% Similarity=0.536 Sum_probs=143.4
Q ss_pred CCccchhhhhHHHHHHhCcccccHHHHHHHHHHHHhhccCCCCC--CCCcccchhhHHHHHHHHhhcccccccccccccc
Q 044478 70 NKPSQTKLDLIDDIQRLGVSYHFESEIDEILQKMHEANQDCDLG--DDENVQELYYISLQFRLLRQNGYKISAADVFNSF 147 (232)
Q Consensus 70 ~d~~~~~L~lID~lqRLGI~yhFe~EI~~~L~~i~~~~~~~~~~--~~~~~~dL~~~AL~FRLLRqhGy~VS~~DvF~~F 147 (232)
.+++ ++|+|||+||||||+|||++||+++|+.+|+.|.....+ .+..+.|+++|||+|||||||||+|| ||||++|
T Consensus 308 ~d~f-E~LwlVDtLqRLGIs~hF~~EI~~iLd~iy~~w~~~g~~~a~~~~~~Dld~TALaFRLLRqhGy~VS-~DvF~~F 385 (800)
T PLN02592 308 VDLF-EHIWAVDRLQRLGISRYFEPEIKECIDYVHRYWTENGICWARNSHVHDIDDTAMGFRLLRLHGHQVS-ADVFKHF 385 (800)
T ss_pred CcHH-HHHHHHHHHHHcCCccccHHHHHHHHHHHHHHHhhcCcccccCCCcCCHHHHHHHHHHHHHcCCCCC-hHHHHhh
Confidence 6889 999999999999999999999999999999977531000 12235699999999999999999999 9999999
Q ss_pred cccccccccch---hhcHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHhh--c------CChhHHHHHHHHcCCCC
Q 044478 148 KDSNGNFKSFL---KRDIRGMLSLYEAAHLRVHGENILNEALTFTVTHLESFTS--Q------SNTQLAAQVNRALNRPI 216 (232)
Q Consensus 148 kd~~G~Fk~~l---~~Dv~glLsLYeAS~L~~~gE~iLdeA~~Ft~~~L~~~~~--~------~~~~l~~~V~~aL~~P~ 216 (232)
++ +|+|++.+ .+|++|||+|||||||+++||.|||+|+.||+++|++.+. + ++++|+++|+|||++||
T Consensus 386 ~~-~g~F~~~~ge~~~Dv~glL~LYeAS~l~~~gE~iLdeA~~Fs~~~L~~~~~~~~l~d~~~~~~~L~~eV~~AL~~P~ 464 (800)
T PLN02592 386 EK-GGEFFCFAGQSTQAVTGMFNLYRASQVLFPGEKILENAKEFSSKFLREKQEANELLDKWIIMKDLPGEVGFALEIPW 464 (800)
T ss_pred cC-CCCccccccccccchHHHHHHHHHHhcCCCcchHHHHHHHHHHHHHHHHhhccccccccccCccHHHHHHHhccChh
Confidence 97 89998665 7999999999999999999999999999999999999753 1 36789999999999999
Q ss_pred CCCCChhHHhhcccCC
Q 044478 217 RKSLPRLEAKHYMPIY 232 (232)
Q Consensus 217 h~~l~RleaR~yI~~Y 232 (232)
|++||||||||||++|
T Consensus 465 ~~~l~RlEaR~yI~~Y 480 (800)
T PLN02592 465 YASLPRVETRFYIEQY 480 (800)
T ss_pred hcCcchHHHHHHHHHh
Confidence 9999999999999988
No 5
>PF14164 YqzH: YqzH-like protein
Probab=42.68 E-value=13 Score=27.12 Aligned_cols=19 Identities=26% Similarity=0.408 Sum_probs=15.4
Q ss_pred HHHHhhcccccccccccccc
Q 044478 128 FRLLRQNGYKISAADVFNSF 147 (232)
Q Consensus 128 FRLLRqhGy~VS~~DvF~~F 147 (232)
=+-|||+||++. ++.+...
T Consensus 11 ~~~l~QYg~d~~-~~pls~~ 29 (64)
T PF14164_consen 11 INCLRQYGYDVE-CMPLSDE 29 (64)
T ss_pred HHHHHHhCCccc-CCCCCHH
Confidence 367999999999 8877663
No 6
>KOG0835 consensus Cyclin L [General function prediction only]
Probab=35.58 E-value=94 Score=29.68 Aligned_cols=80 Identities=18% Similarity=0.231 Sum_probs=54.0
Q ss_pred HHHHHhhcccccccccccccccccccccccchhhcHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHhhcC-Ch-hH
Q 044478 127 QFRLLRQNGYKISAADVFNSFKDSNGNFKSFLKRDIRGMLSLYEAAHLRVHGENILNEALTFTVTHLESFTSQS-NT-QL 204 (232)
Q Consensus 127 ~FRLLRqhGy~VS~~DvF~~Fkd~~G~Fk~~l~~Dv~glLsLYeAS~L~~~gE~iLdeA~~Ft~~~L~~~~~~~-~~-~l 204 (232)
.-|+||.-||.|+ | .++.++.--|=+.-=..++..|+..|+.|...-|.--+-.. .+ .+
T Consensus 125 e~~ILr~LGF~~H-------v------------~hPhklii~YLqtL~~~~~~~l~Q~~wNfmNDslRT~v~vry~pe~i 185 (367)
T KOG0835|consen 125 ERRILRELGFDVH-------V------------EHPHKLIIMYLQTLQLPPNLKLLQAAWNFMNDSLRTDVFVRYSPESI 185 (367)
T ss_pred HHHHHHHhCCeee-------e------------eccHHHHHHHHHHhcCCCchhHHHHHHHhhhhccccceeeecCHHHH
Confidence 3489999999999 3 34566666666664445788899999999887776543322 22 22
Q ss_pred H----HHHHHHcCCCCCCCCChhHH
Q 044478 205 A----AQVNRALNRPIRKSLPRLEA 225 (232)
Q Consensus 205 ~----~~V~~aL~~P~h~~l~Rlea 225 (232)
+ -.....++.|+..+-+++.+
T Consensus 186 ACaciyLaAR~~eIpLp~~P~Wf~~ 210 (367)
T KOG0835|consen 186 ACACIYLAARNLEIPLPFQPHWFKA 210 (367)
T ss_pred HHHHHHHHHhhhcCCCCCCccHHHH
Confidence 2 33456778888887777654
No 7
>KOG3951 consensus Uncharacterized conserved protein [Function unknown]
Probab=33.63 E-value=57 Score=30.18 Aligned_cols=45 Identities=31% Similarity=0.440 Sum_probs=34.4
Q ss_pred ccccccccchhhcHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHH
Q 044478 149 DSNGNFKSFLKRDIRGMLSLYEAAHLRVHGENILNEALTFTVTHLES 195 (232)
Q Consensus 149 d~~G~Fk~~l~~Dv~glLsLYeAS~L~~~gE~iLdeA~~Ft~~~L~~ 195 (232)
+.+|-|-..-.-|+||-..|-.+-- ....|.+ =.|..||++||..
T Consensus 264 hp~GAFv~~s~iDmkgcvrllk~q~-p~~~e~L-LnaLRfTTKHlNd 308 (321)
T KOG3951|consen 264 HPNGAFVSNSSIDMKGCVRLLKLQP-PEQSECL-LNALRFTTKHLND 308 (321)
T ss_pred cccccccccCcCcHHHHHHHHHcCC-chhhHHH-HHHHHHHHhhcCC
Confidence 4678888888899999999887753 3344544 4689999999974
No 8
>PF08373 RAP: RAP domain; InterPro: IPR013584 The ~60-residue RAP (an acronym for RNA-binding domain abundant in Apicomplexans) domain is found in various proteins in eukaryotes. It is particularly abundant in apicomplexans and might mediate a range of cellular functions through its potential interactions with RNA []. The RAP domain consists of multiple blocks of charged and aromatics residues and is predicted to be composed of alpha helical and beta strand structures. Two predicted loop regions that are dominated by glycine and tryptophan residues are found before and after the central beta sheet []. Some proteins known to contain a RAP domain are listed below: Human hypothetical protein MGC5297, Mammalian FAST kinase domain-containing proteins (FASTKDs), Chlamydomonas reinhardtii chloroplastic trans-splicing factor Raa3.
Probab=32.68 E-value=32 Score=23.16 Aligned_cols=26 Identities=23% Similarity=0.333 Sum_probs=20.2
Q ss_pred hhhHHHHHHHHhhcccccccccccccc
Q 044478 121 LYYISLQFRLLRQNGYKISAADVFNSF 147 (232)
Q Consensus 121 L~~~AL~FRLLRqhGy~VS~~DvF~~F 147 (232)
+-.++|.=|+|+..||+|= +=.|-.+
T Consensus 17 ~g~t~lk~r~L~~~G~~Vi-~Ip~~eW 42 (58)
T PF08373_consen 17 TGSTKLKHRHLKALGYKVI-SIPYYEW 42 (58)
T ss_pred chHHHHHHHHHHHCCCEEE-EecHHHH
Confidence 3579999999999999997 5444444
No 9
>COG4738 Predicted transcriptional regulator [Transcription]
Probab=31.73 E-value=69 Score=26.12 Aligned_cols=85 Identities=24% Similarity=0.342 Sum_probs=44.8
Q ss_pred hhhHHHHHHhCcccccHHHHHHHHHHHHhhccCCCCCCCCcccch--hhHHHHHHHHhhcccccccccccccccccccc-
Q 044478 77 LDLIDDIQRLGVSYHFESEIDEILQKMHEANQDCDLGDDENVQEL--YYISLQFRLLRQNGYKISAADVFNSFKDSNGN- 153 (232)
Q Consensus 77 L~lID~lqRLGI~yhFe~EI~~~L~~i~~~~~~~~~~~~~~~~dL--~~~AL~FRLLRqhGy~VS~~DvF~~Fkd~~G~- 153 (232)
-++|+.|+.|||++--- .+|-.+-....... .+-..+..| -.+|++.|.||.+|+= |+=+-=|-++|.
T Consensus 14 ~~~ie~L~~lgi~R~vA----~tlv~L~~~~E~sS-~~IE~~sgLRQPEVSiAMr~Lre~gWV----~~R~eKKkGKGRP 84 (124)
T COG4738 14 YEIIELLRILGIPRNVA----TTLVCLAKGDEASS-REIERVSGLRQPEVSIAMRYLRENGWV----DEREEKKKGKGRP 84 (124)
T ss_pred HHHHHHHHHcCCCchHH----HHHHHHhcCcchhh-hhhHHhhcCCCchhHHHHHHHHHcccc----chHHhcccCCCCC
Confidence 35899999999998642 22222222110000 000111122 2589999999999973 222222224454
Q ss_pred ccc-chhhcHHHHHHHHH
Q 044478 154 FKS-FLKRDIRGMLSLYE 170 (232)
Q Consensus 154 Fk~-~l~~Dv~glLsLYe 170 (232)
.|. .|+.+...+++-++
T Consensus 85 ik~Y~Lt~~~~eIvs~ie 102 (124)
T COG4738 85 IKLYRLTVPFDEIVSEIE 102 (124)
T ss_pred ceEEEecCcHHHHHHHHH
Confidence 222 45677777766554
No 10
>PF11848 DUF3368: Domain of unknown function (DUF3368); InterPro: IPR021799 This domain is functionally uncharacterised. This domain is found in bacteria and archaea. This presumed domain is about 50 amino acids in length.
Probab=29.83 E-value=24 Score=23.59 Aligned_cols=21 Identities=24% Similarity=0.477 Sum_probs=15.2
Q ss_pred HHHHHhhccccccccccccccc
Q 044478 127 QFRLLRQNGYKISAADVFNSFK 148 (232)
Q Consensus 127 ~FRLLRqhGy~VS~~DvF~~Fk 148 (232)
.+.-|+++||++| +++++.+.
T Consensus 24 ~l~~l~~~g~~is-~~l~~~~L 44 (48)
T PF11848_consen 24 LLDRLQQAGFRIS-PKLIEEIL 44 (48)
T ss_pred HHHHHHHcCcccC-HHHHHHHH
Confidence 3455588888888 88777654
No 11
>PF11000 DUF2840: Protein of unknown function (DUF2840); InterPro: IPR021263 This bacterial family of proteins have no known function.
Probab=28.54 E-value=33 Score=28.93 Aligned_cols=16 Identities=38% Similarity=0.551 Sum_probs=13.5
Q ss_pred hhhhhHHHHHHhCccc
Q 044478 75 TKLDLIDDIQRLGVSY 90 (232)
Q Consensus 75 ~~L~lID~lqRLGI~y 90 (232)
.-|..||.|+.|||+-
T Consensus 96 ~VL~~IDaiEalGidp 111 (149)
T PF11000_consen 96 RVLQAIDAIEALGIDP 111 (149)
T ss_pred HHHHHHhHHHHcCCCh
Confidence 4578899999999974
No 12
>cd07604 BAR_ASAPs The Bin/Amphiphysin/Rvs (BAR) domain of ArfGAP with SH3 domain, ANK repeat and PH domain containing proteins. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. This subfamily is composed of ASAPs (ArfGAP with SH3 domain, ANK repeat and PH domain containing proteins), which are Arf GTPase activating proteins (GAPs) with similarity to ACAPs (ArfGAP with Coiled-coil, ANK repeat and PH domain containing proteins) in that they contain an N-terminal BAR domain, followed by a Pleckstrin homology (PH) domain, an Arf GAP domain, and ankyrin (ANK) repeats. However, ASAPs contain an additional C-terminal SH3 domain. ASAPs function in regulating cell growth, migration, and invasion. Vertebrates contain at least three members, ASAP1, ASAP2, and ASAP3. ASAP1 and ASAP2 shows GTPase activating protein (GAP) activity towards Arf1 and Arf5. They do not show GAP activity towards Arf6, but is able to mediate
Probab=27.01 E-value=1.2e+02 Score=26.83 Aligned_cols=86 Identities=19% Similarity=0.199 Sum_probs=55.7
Q ss_pred hHHHHHHHHHHHHHHHHHhccCCccchhhhhHHHHHHhCcccc--cHHHHHHHHHHHHhhccCCCCCCCCcccchhhHHH
Q 044478 49 DASVKHQELKKEIRTMLRADINKPSQTKLDLIDDIQRLGVSYH--FESEIDEILQKMHEANQDCDLGDDENVQELYYISL 126 (232)
Q Consensus 49 ~~~~~~~~LkeeVr~ml~~~~~d~~~~~L~lID~lqRLGI~yh--Fe~EI~~~L~~i~~~~~~~~~~~~~~~~dL~~~AL 126 (232)
....+.++++.-+|.|..+ ..+.......++++|+.||=..- .+.+|..+|...-.. +-.++=
T Consensus 13 ~~~~~l~Kl~K~~k~~~~~-g~~~~~~~~~F~~aL~~~g~~~~~~~~~~i~~~l~kF~~~--------------l~El~~ 77 (215)
T cd07604 13 GDRVGLQKLKKAVKAIHNS-GLAHVENELQFAEALEKLGSKALSREEEDLGAAFLKFSVF--------------TKELAA 77 (215)
T ss_pred HHHHHHHHHHHHHHHHHHh-HHHHHHHHHHHHHHHHHHhccccCcccHHHHHHHHHHHHH--------------HHHHHH
Confidence 5677889999999999865 33433367889999999993222 233578887654221 223344
Q ss_pred HHHHHhhccccccccccccccccc
Q 044478 127 QFRLLRQNGYKISAADVFNSFKDS 150 (232)
Q Consensus 127 ~FRLLRqhGy~VS~~DvF~~Fkd~ 150 (232)
.+-.|++|=-++- ...+.+|..+
T Consensus 78 ~~~~L~~~~~~~i-~~pL~~f~k~ 100 (215)
T cd07604 78 LFKNLMQNLNNII-MFPLDSLLKG 100 (215)
T ss_pred HHHHHHHHHHHHH-HHHHHHHHHH
Confidence 4556666655554 5778888755
No 13
>KOG3906 consensus Tryptophan 2,3-dioxygenase [Amino acid transport and metabolism]
Probab=26.29 E-value=1.1e+02 Score=28.62 Aligned_cols=115 Identities=23% Similarity=0.281 Sum_probs=60.2
Q ss_pred hHHHHHHHHHHHHHHHHHhccCCccchhhhhHHHHHHhCcccccHHHHHHHHHHH-------HhhccCCCCCCCCcccch
Q 044478 49 DASVKHQELKKEIRTMLRADINKPSQTKLDLIDDIQRLGVSYHFESEIDEILQKM-------HEANQDCDLGDDENVQEL 121 (232)
Q Consensus 49 ~~~~~~~~LkeeVr~ml~~~~~d~~~~~L~lID~lqRLGI~yhFe~EI~~~L~~i-------~~~~~~~~~~~~~~~~dL 121 (232)
+.++-+=+| .-||+||.+...|.. ..|++|-.+.|.-+=-..--|--.+|+.+ |+.+.. -.+
T Consensus 78 WFKQIifel-DsvR~~l~~~v~DEt-ktLkiVsrl~Rv~~ILkLLv~Q~~iLeTMtpLDF~DFRkYL~----PAS----- 146 (399)
T KOG3906|consen 78 WFKQIIFEL-DSVRKLLNNTVVDET-KTLKIVSRLDRVTKILKLLVEQITILETMTPLDFVDFRKYLT----PAS----- 146 (399)
T ss_pred HHHHHHHhh-HHHHHHhcchhhcch-hHHHHHHhHHHHHHHHHHHHHHhhHHhhcCccchHHHHHhcC----ccc-----
Confidence 444444444 779999986555666 67787777766543221111212233332 121110 011
Q ss_pred hhHHHHHHHHhh---------cccccc-cccccccccccccccccchhhcHHHHHHHHHHhhcCCCC
Q 044478 122 YYISLQFRLLRQ---------NGYKIS-AADVFNSFKDSNGNFKSFLKRDIRGMLSLYEAAHLRVHG 178 (232)
Q Consensus 122 ~~~AL~FRLLRq---------hGy~VS-~~DvF~~Fkd~~G~Fk~~l~~Dv~glLsLYeAS~L~~~g 178 (232)
-.-+|.||||-- -.|+.- -.||| +|++-+-. -.++.-+.||+|-|+=--++||
T Consensus 147 GFQSLQFRllENKlGVlqe~Rv~YN~qhY~dvF---~dee~~~l-~~sE~eksLLeLve~WLERTPG 209 (399)
T KOG3906|consen 147 GFQSLQFRLLENKLGVLQERRVKYNAQHYKDVF---NDEELKTL-NVSEEEKSLLELVESWLERTPG 209 (399)
T ss_pred cchhhhHHHHhhhhhhhhhhhccccHHHhhhcc---CchhhHhh-hcccccchHHHHHHHHHhcCCC
Confidence 124899999832 223321 02554 44433211 1156678999999998777776
No 14
>COG1309 AcrR Transcriptional regulator [Transcription]
Probab=24.55 E-value=1.5e+02 Score=22.22 Aligned_cols=36 Identities=19% Similarity=0.392 Sum_probs=20.9
Q ss_pred cccccccccccccccccccchhhcHHHHHHHHHHhh
Q 044478 138 ISAADVFNSFKDSNGNFKSFLKRDIRGMLSLYEAAH 173 (232)
Q Consensus 138 VS~~DvF~~Fkd~~G~Fk~~l~~Dv~glLsLYeAS~ 173 (232)
||++-++..|.|+++-|..-.......+.....+..
T Consensus 44 vs~~~~Y~~f~~K~~l~~~~~~~~~~~~~~~~~~~~ 79 (201)
T COG1309 44 VSKGTLYRHFPSKEDLLLALLERALEELRALLEEAL 79 (201)
T ss_pred CCcchhHHHcCCHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 444478888988776555444444444555444444
No 15
>PRK15008 HTH-type transcriptional regulator RutR; Provisional
Probab=24.41 E-value=88 Score=26.30 Aligned_cols=44 Identities=16% Similarity=0.302 Sum_probs=26.1
Q ss_pred HHHHhhcccc------------cccccccccccccccccccchhhcHHHHHHHHHH
Q 044478 128 FRLLRQNGYK------------ISAADVFNSFKDSNGNFKSFLKRDIRGMLSLYEA 171 (232)
Q Consensus 128 FRLLRqhGy~------------VS~~DvF~~Fkd~~G~Fk~~l~~Dv~glLsLYeA 171 (232)
.+|++++||+ ||+.-++..|+|+++=|..-+..-...+....++
T Consensus 28 ~~lf~e~Gy~~~s~~dIA~~aGvs~gtiY~hF~sKe~L~~a~~~~~~~~~~~~~~~ 83 (212)
T PRK15008 28 LDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAVLRQILDIWLAPLKA 83 (212)
T ss_pred HHHHHHhCcccCCHHHHHHHhCcCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456666665 3333588999988876665444444455444444
No 16
>PF14090 HTH_39: Helix-turn-helix domain
Probab=22.52 E-value=44 Score=23.91 Aligned_cols=25 Identities=24% Similarity=0.496 Sum_probs=16.1
Q ss_pred HHHHHhhcccccccccccccccccccc
Q 044478 127 QFRLLRQNGYKISAADVFNSFKDSNGN 153 (232)
Q Consensus 127 ~FRLLRqhGy~VS~~DvF~~Fkd~~G~ 153 (232)
+=.=||++||.|. ..-....+.+|+
T Consensus 33 RI~eLR~~G~~I~--t~~~~~~~~~g~ 57 (70)
T PF14090_consen 33 RISELRKKGYPIV--TEWVTVPTEDGR 57 (70)
T ss_pred HHHHHHHcCCeee--EEEEEeeccCCC
Confidence 3344899999998 445445555554
Done!