Query gi|254781191|ref|YP_003065604.1| hypothetical protein CLIBASIA_05490 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 165 No_of_seqs 108 out of 117 Neff 5.2 Searched_HMMs 39220 Date Mon May 30 06:42:12 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781191.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam10926 DUF2800 Protein of u 100.0 0 0 372.9 15.2 147 2-153 211-363 (363) 2 COG1023 Gnd Predicted 6-phosph 84.0 2.7 6.8E-05 23.0 5.0 67 22-91 189-255 (300) 3 pfam05565 Sipho_Gp157 Siphovir 68.5 5.4 0.00014 21.1 3.0 31 36-66 61-92 (162) 4 TIGR00423 TIGR00423 conserved 51.4 12 0.00031 18.8 2.4 62 3-85 16-77 (331) 5 TIGR00872 gnd_rel 6-phosphoglu 49.9 9.3 0.00024 19.6 1.6 57 32-91 240-296 (341) 6 COG5175 MOT2 Transcriptional r 48.8 14 0.00035 18.5 2.3 89 4-100 39-141 (480) 7 pfam04577 DUF563 Protein of un 43.4 20 0.00051 17.5 2.4 30 70-100 211-240 (246) 8 TIGR01793 cit_synth_euk citrat 43.0 16 0.0004 18.1 1.8 70 33-104 275-348 (430) 9 PRK08173 DNA topoisomerase III 42.5 22 0.00056 17.2 2.6 31 106-142 657-687 (857) 10 KOG1150 consensus 40.7 12 0.00031 18.9 1.0 14 136-149 139-152 (250) 11 TIGR01823 PabB-fungal para-ami 39.5 14 0.00037 18.4 1.2 72 3-85 633-705 (851) 12 pfam00285 Citrate_synt Citrate 38.8 34 0.00086 16.0 4.2 49 35-94 218-266 (352) 13 pfam07211 consensus 30.7 45 0.0012 15.2 7.9 62 51-113 56-124 (162) 14 cd06109 BsCS-I_like Bacillus s 30.4 46 0.0012 15.2 4.2 50 34-94 207-256 (349) 15 COG4318 Uncharacterized protei 29.8 36 0.00091 15.8 1.9 78 31-112 30-108 (221) 16 KOG3379 consensus 28.5 49 0.0013 15.0 6.0 83 30-115 51-141 (150) 17 PRK06341 single-stranded DNA-b 27.8 32 0.00083 16.1 1.4 39 48-90 57-95 (165) 18 KOG0432 consensus 27.1 52 0.0013 14.8 4.1 50 47-96 446-501 (995) 19 TIGR02182 GRXB Glutaredoxin, G 26.8 17 0.00044 17.9 -0.1 19 44-62 83-101 (212) 20 PRK06245 cofG FO synthase subu 26.8 53 0.0013 14.8 2.8 11 3-13 23-33 (336) 21 KOG3300 consensus 26.0 55 0.0014 14.7 3.4 29 48-76 83-111 (146) 22 KOG3248 consensus 25.5 26 0.00067 16.7 0.6 13 64-76 251-263 (421) 23 TIGR00734 hisAF_rel hisA/hisF 25.4 56 0.0014 14.6 3.1 32 73-104 13-55 (230) 24 cd06112 citrate_synt_like_1_1 25.2 56 0.0014 14.6 4.2 50 35-95 223-272 (373) 25 TIGR02262 benz_CoA_lig benzoat 24.4 33 0.00085 16.0 1.0 87 64-151 341-474 (520) 26 cd06115 AthCS_per_like Arabido 24.3 59 0.0015 14.5 4.3 50 35-95 258-307 (410) 27 TIGR01806 CM_mono2 chorismate 23.3 35 0.00089 15.9 0.9 99 52-158 6-110 (116) 28 pfam03439 Spt5-NGN Early trans 22.9 62 0.0016 14.3 2.9 44 67-117 39-82 (82) 29 KOG4616 consensus 22.5 46 0.0012 15.2 1.4 26 133-158 61-86 (137) 30 PRK13627 carnitine operon prot 22.3 33 0.00084 16.1 0.6 72 63-152 121-195 (196) 31 cd06111 DsCS_like Cold-active 22.0 65 0.0017 14.2 4.3 41 40-88 221-261 (362) 32 PRK12351 methylcitrate synthas 21.7 66 0.0017 14.1 4.0 46 41-94 231-276 (377) 33 cd05580 STKc_PKA STKc_PKA: Ser 21.7 59 0.0015 14.5 1.8 61 90-150 225-285 (290) 34 PRK12710 flgJ peptidoglycan hy 21.3 49 0.0013 15.0 1.4 18 103-120 151-168 (291) 35 TIGR01631 Trypano_RHS trypanos 20.5 54 0.0014 14.7 1.4 23 65-88 190-214 (814) 36 KOG2672 consensus 20.5 34 0.00087 16.0 0.4 12 4-15 123-134 (360) 37 pfam09630 DUF2024 Domain of un 20.0 34 0.00086 16.0 0.3 11 89-99 66-76 (81) No 1 >pfam10926 DUF2800 Protein of unknown function (DUF2800). This is a family of uncharacterized proteins found in bacteria and viruses. Some members of this family are annotated as being Phi APSE P51-like proteins. Probab=100.00 E-value=0 Score=372.89 Aligned_cols=147 Identities=31% Similarity=0.564 Sum_probs=140.5 Q ss_pred CCCCCEECCCCCCCHHHHHHHHHHHHHCCC----CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECC Q ss_conf 764360634667488999999998630216----6898899999998599999999999999999988986886144158 Q gi|254781191|r 2 DENACRFCRAKPRCGALAVKALSTFSEHMQ----ILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEG 77 (165) Q Consensus 2 ~~d~CrFCkak~~CrArA~~~l~~~~~~~~----~Lt~eei~~il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlVeG 77 (165) ..+||||||||++|||||++|+++++++|. .||++||++||.++++|++|+++|++||++++++|++|||||||+| T Consensus 211 ~G~~CrFCkak~~C~Ara~~~l~la~~df~~~~~~Ls~~ei~~il~~~~~l~~W~~dv~~~a~~~~~~G~~~pG~KlV~G 290 (363) T pfam10926 211 PGEHCRFCKAKATCRARAQAVLNLAKYDFEDPTLDLTDEELAELLEKADLLEKWAKDVEAYALDEARNGKEVPGYKLVEG 290 (363) T ss_pred CCCCCCCCCCHHHCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECC T ss_conf 89875306322228999999998887621577768999999999971489999999999999999977998787266147 Q ss_pred CCCCCCCCCHHHHHHHHHHHCC--CCCCCCCCCCHHHHHHHHCHHCCCHHHHHHHHHHEECCCCCCEEEECCCCCCCC Q ss_conf 7788711798999999997167--431012568999999986521014445654321024679980780368875111 Q gi|254781191|r 78 RKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHL 153 (165) Q Consensus 78 Rs~~R~~~de~~a~~~l~~~g~--~~~y~~klls~t~~EKllgkk~f~e~~~~~l~~li~kp~GKPtLvp~sDkRpa~ 153 (165) || ||+|+|+++|+++|+.+|+ +++|+++|||||+|||+|||++|+++ |++||+||+|||||||+|||||++ T Consensus 291 Rs-~Rkw~de~~~~~~l~~~g~~~~~~y~~kli~pt~~ekllgkk~f~el----l~~li~k~~GkptLvp~sDkRpa~ 363 (363) T pfam10926 291 RS-NRKWTDEEAVAAALKAAGLKDEDIYKKKLISPTQAEKLLGKKKFKEL----LGALIVKPPGKPTLVPESDKRPAL 363 (363) T ss_pred CC-CCCCCCHHHHHHHHHHCCCCHHHCCCCCCCCHHHHHHHHCHHHHHHH----HHHHEECCCCCCEEEECCCCCCCC T ss_conf 67-77647889999999986996554210136898999988787889999----761263689961782478888999 No 2 >COG1023 Gnd Predicted 6-phosphogluconate dehydrogenase [Carbohydrate transport and metabolism] Probab=83.99 E-value=2.7 Score=23.03 Aligned_cols=67 Identities=10% Similarity=0.211 Sum_probs=47.0 Q ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHHH Q ss_conf 9998630216689889999999859999999999999999998898688614415877887117989999 Q gi|254781191|r 22 ALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVE 91 (165) Q Consensus 22 ~l~~~~~~~~~Lt~eei~~il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~~de~~a~ 91 (165) .+++.+...-+++-+++++++.....|.+|+=++-+.|++.--.=.++.|+ |.+ ||--+|+=++++. T Consensus 189 GfelL~~s~fD~D~~~VA~vW~hGSVIrSWLldLt~~Af~~d~~L~q~~g~--v~d-SGEGrWTv~~ald 255 (300) T COG1023 189 GFELLKNSPFDYDLEAVAEVWNHGSVIRSWLLDLTAEAFKKDPDLDQISGR--VSD-SGEGRWTVEEALD 255 (300) T ss_pred HHHHHHHCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHCCE--ECC-CCCCEEEHHHHHH T ss_conf 999987188887779999998276459999999999998548887776176--345-8875066687776 No 3 >pfam05565 Sipho_Gp157 Siphovirus Gp157. This family contains both viral and bacterial proteins which are related to the Gp157 protein of the Streptococcus thermophilus SFi bacteriophages. It is thought that bacteria possessing the gene coding for this protein have an increased resistance to the bacteriophage. Probab=68.51 E-value=5.4 Score=21.07 Aligned_cols=31 Identities=16% Similarity=0.241 Sum_probs=17.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CC Q ss_conf 89999999859999999999999999998-89 Q gi|254781191|r 36 RQLSQVMNVLPLIETWMKGVKEEALNVLS-SG 66 (165) Q Consensus 36 eei~~il~~~~~l~~W~~~v~~~A~~~~~-~G 66 (165) +|+-.+-.+..-+++-++.+++|....+. .| T Consensus 61 ~e~krl~erkk~~enk~~~LK~yl~~~m~~~g 92 (162) T pfam05565 61 AEIKRLAERKKSIENKVKRLKDYLEEAMEATG 92 (162) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999999999999899999999999999809 No 4 >TIGR00423 TIGR00423 conserved hypothetical protein TIGR00423; InterPro: IPR005244 This entry describes a family of conserved hypothetical proteins with no known function. . Probab=51.35 E-value=12 Score=18.81 Aligned_cols=62 Identities=19% Similarity=0.370 Sum_probs=40.5 Q ss_pred CCCCEECCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCC Q ss_conf 64360634667488999999998630216689889999999859999999999999999998898688614415877887 Q gi|254781191|r 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSR 82 (165) Q Consensus 3 ~d~CrFCkak~~CrArA~~~l~~~~~~~~~Lt~eei~~il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R 82 (165) ..+|+||-.++.=. ..+.-.||.|||.+--. ++++.=+.+-+.+|--.|.++ +.|-| -- T Consensus 16 ~~~C~FCAF~~~~k----------~~~~Y~Ls~eEI~~Kv~---------ea~~~G~tE~~iQGGlnP~~~-~nGss-l~ 74 (331) T TIGR00423 16 VGKCKFCAFRRREK----------DKDAYVLSLEEILRKVK---------EAVAKGATEICIQGGLNPQLD-INGSS-LE 74 (331) T ss_pred HHCCCCCCCCCCCC----------CCCCCCCCHHHHHHHHH---------HHHHCCCEEEEEECCCCCCCC-CCCCH-HH T ss_conf 32479633113468----------98881407799999999---------999719827885234278876-45414-99 Q ss_pred CCC Q ss_conf 117 Q gi|254781191|r 83 TYN 85 (165) Q Consensus 83 ~~~ 85 (165) .|- T Consensus 75 yy~ 77 (331) T TIGR00423 75 YYE 77 (331) T ss_pred HHH T ss_conf 999 No 5 >TIGR00872 gnd_rel 6-phosphogluconate dehydrogenase (decarboxylating); InterPro: IPR004849 This family resembles a larger family of bacterial and eukaryotic 6-phosphogluconate dehydrogenases (Gnd) but differs from it by a deep split in a UPGMA similarity clustering tree and the lack of a central region of about 140 residues. Among complete genomes, it is found is found in Bacillus subtilis and Mycobacterium tuberculosis, both of which also contain Gnd, and in Aquifex aeolicus. . Probab=49.88 E-value=9.3 Score=19.59 Aligned_cols=57 Identities=12% Similarity=0.246 Sum_probs=44.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHHH Q ss_conf 689889999999859999999999999999998898688614415877887117989999 Q gi|254781191|r 32 ILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVE 91 (165) Q Consensus 32 ~Lt~eei~~il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~~de~~a~ 91 (165) +.+-+|+|+++.+.+.|.+|+=|+-..|+..--.=.++.|+=-= ||-=+|+-++++. T Consensus 240 DfD~~eVArVw~~GSVIrSwL~dLt~~Af~~~~dL~~~~G~v~d---SGEGrWtv~~a~d 296 (341) T TIGR00872 240 DFDLEEVARVWRRGSVIRSWLLDLTAKAFEESADLEEVEGIVED---SGEGRWTVKEAVD 296 (341) T ss_pred CCCHHHHHHHHCCCCEEEHHHHHHHHHHHHCCCCCCCEEEEECC---CCCCHHHHHHHHH T ss_conf 68748899987089864126888999997507882132236702---7872789999886 No 6 >COG5175 MOT2 Transcriptional repressor [Transcription] Probab=48.79 E-value=14 Score=18.54 Aligned_cols=89 Identities=18% Similarity=0.150 Sum_probs=48.8 Q ss_pred CCCEECC------CCCCCHH-HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEE- Q ss_conf 4360634------6674889-9999999863021668988999999985999999999999999999889868861441- Q gi|254781191|r 4 NACRFCR------AKPRCGA-LAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK- 75 (165) Q Consensus 4 d~CrFCk------ak~~CrA-rA~~~l~~~~~~~~~Lt~eei~~il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlV- 75 (165) --||||- ..+.|+| |..+.-+-++ ...||.||+-.-|.+..+-+..-+ ++-..+...++.+.|..+| T Consensus 39 ~ic~fc~~~irq~lngrcpacrr~y~denv~--~~~~s~ee~kmel~rk~erk~rek---erke~e~~nrkhlsniRVvQ 113 (480) T COG5175 39 QICQFCYNNIRQNLNGRCPACRRKYDDENVR--YVTLSPEELKMELARKEERKMREK---ERKEAEGQNRKHLSNIRVVQ 113 (480) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHCCCCCEE--EEECCHHHHHHHHHHHHHHHCCHH---HHHHHHCCCCCCCCCCEEEE T ss_conf 8999999888765058883765423434513--786688998778876665422077---77665303666554525623 Q ss_pred ------CCCCCCCCCCCHHHHHHHHHHHCCC Q ss_conf ------5877887117989999999971674 Q gi|254781191|r 76 ------EGRKGSRTYNNDNQVEQLLMRELGD 100 (165) Q Consensus 76 ------eGRs~~R~~~de~~a~~~l~~~g~~ 100 (165) -|- +.++.||.. +-+|++..|. T Consensus 114 KNLvYVigi--~pkva~Ee~-~~vLk~~eyF 141 (480) T COG5175 114 KNLVYVIGI--PPKVADEEV-APVLKRHEYF 141 (480) T ss_pred CCEEEEECC--CCCCCCCCC-CCCCCCHHHH T ss_conf 153699627--987776322-2100233554 No 7 >pfam04577 DUF563 Protein of unknown function (DUF563). Family of uncharacterized proteins. Probab=43.44 E-value=20 Score=17.46 Aligned_cols=30 Identities=23% Similarity=0.267 Sum_probs=19.8 Q ss_pred CCCEEECCCCCCCCCCCHHHHHHHHHHHCCC Q ss_conf 8614415877887117989999999971674 Q gi|254781191|r 70 PNYELKEGRKGSRTYNNDNQVEQLLMRELGD 100 (165) Q Consensus 70 pG~KlVeGRs~~R~~~de~~a~~~l~~~g~~ 100 (165) |-.=++. |++.|++.++++++.+++..|++ T Consensus 211 PRL~iis-R~~~R~ilN~~evv~~ae~vGFe 240 (246) T pfam04577 211 PRLLIIS-RRGSRKILNEDEVVRAAEEVGFE 240 (246) T ss_pred CEEEEEE-CCCCCEECCHHHHHHHHHHCCCE T ss_conf 8699998-78962552899999999984987 No 8 >TIGR01793 cit_synth_euk citrate (Si)-synthase, eukaryotic; InterPro: IPR010109 Citrate synthase 2.3.3.1 from EC is a member of a small family of enzymes that can directly form a carbon-carbon bond without the presence of metal ion cofactors. It catalyses the first reaction in the Krebs' cycle, namely the conversion of oxaloacetate and acetyl-coenzyme A into citrate and coenzyme A. This reaction is important for energy generation and for carbon assimilation. The reaction proceeds via a non-covalently bound citryl-coenzyme A intermediate in a 2-step process (aldol-Claisen condensation followed by the hydrolysis of citryl-CoA). Citrate synthase enzymes are found in two distinct structural types: type I enzymes (found in eukaryotes, Gram-positive bacteria and archaea) form homodimers and have shorter sequences than type II enzymes, which are found in Gram-negative bacteria and are hexameric in structure. In both types, the monomer is composed of two domains: a large alpha-helical domain consisting of two structural repeats, where the second repeat is interrupted by a small alpha-helical domain. The cleft between these domains forms the active site, where both citrate and acetyl-coenzyme A bind. The enzyme undergoes a conformational change upon binding of the oxaloacetate ligand, whereby the active site cleft closes over in order to form the acetyl-CoA binding site . The energy required for domain closure comes from the interaction of the enzyme with the substrate. Type II enzymes possess an extra N-terminal beta-sheet domain, and some type II enzymes are allosterically inhibited by NADH . This entry includes both mitochondrial and peroxisomal forms of citrate synthase. Peroxisomal forms of the enzyme, recognized by the C-terminal targeting motif SKL, act in the glyoxylate cycle. Eukaryotic homologs include a Tetrahymena thermophila citrate synthase that doubles as a filament protein, a putative citrate synthase from Plasmodium falciparum (no TCA cycle), and a methylcitrate synthase from Aspergillus nidulans (Emericella nidulans).; GO: 0004108 citrate (Si)-synthase activity, 0044262 cellular carbohydrate metabolic process. Probab=43.04 E-value=16 Score=18.15 Aligned_cols=70 Identities=19% Similarity=0.192 Sum_probs=41.4 Q ss_pred CCHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHCCCCCCC Q ss_conf 898899999998599999----999999999999988986886144158778871179899999999716743101 Q gi|254781191|r 33 LSNRQLSQVMNVLPLIET----WMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYN 104 (165) Q Consensus 33 Lt~eei~~il~~~~~l~~----W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~~de~~a~~~l~~~g~~~~y~ 104 (165) |.|-|+---|..+-.... =-+.+|+|.=+.+++|+-||||==-==|..-.+|.-.-+. +|+..--|++|+ T Consensus 275 LANQEVL~wl~~~v~e~G~~n~t~eqlKdy~WktLnsG~VVPGYGHaVLRkTDPRY~cQREF--ALk~LPdDplFk 348 (430) T TIGR01793 275 LANQEVLVWLKEVVKEVGEDNLTEEQLKDYVWKTLNSGKVVPGYGHAVLRKTDPRYICQREF--ALKHLPDDPLFK 348 (430) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEECCCCCCCCCCCCCCCCCHHHH--HHHCCCCCHHHH T ss_conf 68899999999887751788774767899999760589542576652026889884312888--833178961568 No 9 >PRK08173 DNA topoisomerase III; Validated Probab=42.52 E-value=22 Score=17.17 Aligned_cols=31 Identities=23% Similarity=0.491 Sum_probs=18.0 Q ss_pred CCCCHHHHHHHHCHHCCCHHHHHHHHHHEECCCCCCE Q ss_conf 5689999999865210144456543210246799807 Q gi|254781191|r 106 TLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQV 142 (165) Q Consensus 106 klls~t~~EKllgkk~f~e~~~~~l~~li~kp~GKPt 142 (165) +-||..++++||.+++-.. |.+|.-|. |||- T Consensus 657 k~l~~~~~~~Ll~~~kt~~-----ikGF~sK~-Gk~F 687 (857) T PRK08173 657 RQFEIAEVEELLRKKEIGP-----LSGFRSKM-GRPF 687 (857) T ss_pred CCCCHHHHHHHHHCCCCCC-----CCCCCCCC-CCCE T ss_conf 3689999999997798677-----66752679-9830 No 10 >KOG1150 consensus Probab=40.66 E-value=12 Score=18.86 Aligned_cols=14 Identities=43% Similarity=0.717 Sum_probs=13.1 Q ss_pred CCCCCCEEEECCCC Q ss_conf 67998078036887 Q gi|254781191|r 136 RKDGKQVIVPCDLP 149 (165) Q Consensus 136 kp~GKPtLvp~sDk 149 (165) +..|+||++++||| T Consensus 139 kKegkpt~ieedDp 152 (250) T KOG1150 139 KKEGKPTIIEEDDP 152 (250) T ss_pred HHCCCCCCCCCCCH T ss_conf 73389987655688 No 11 >TIGR01823 PabB-fungal para-aminobenzoate synthase; InterPro: IPR010117 This entry represents the fungal clade of para-aminobenzoate synthase, which acts on chorismate to form para-aminobenzoic acid (PABA), a precursor of folate.. Probab=39.45 E-value=14 Score=18.38 Aligned_cols=72 Identities=13% Similarity=0.160 Sum_probs=53.2 Q ss_pred CCCCEECCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCCCCCEEECCCCCC Q ss_conf 64360634667488999999998630216689889999999859999999999999999998-89868861441587788 Q gi|254781191|r 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLS-SGEDLPNYELKEGRKGS 81 (165) Q Consensus 3 ~d~CrFCkak~~CrArA~~~l~~~~~~~~~Lt~eei~~il~~~~~l~~W~~~v~~~A~~~~~-~G~~~pG~KlVeGRs~~ 81 (165) ++||.|-+.|+|-+.-.+.+|+-|..- |.++-|++|.+--+|+|.+=+-. -+- .|-.||-+==||--+ T Consensus 633 d~~~~~~PiKGTvkk~~~M~lekAr~i--L~~~KemgEnlMI~DLIRnDLy~-------l~~~~~VhVeel~sVeeh~-- 701 (851) T TIGR01823 633 DDRLKLKPIKGTVKKEEDMSLEKARKI--LKSEKEMGENLMILDLIRNDLYE-------LVKKNDVHVEELLSVEEHA-- 701 (851) T ss_pred CCCEEECCCCCEECCCCCCCHHHHHHH--HCCCCHHHHHHHHHHHHHHHHHH-------HCCCCCEECCCCCEEHHHH-- T ss_conf 672232256302210788688999987--26841046787899988753898-------6078974225300001121-- Q ss_pred CCCC Q ss_conf 7117 Q gi|254781191|r 82 RTYN 85 (165) Q Consensus 82 R~~~ 85 (165) |.|. T Consensus 702 ~VYq 705 (851) T TIGR01823 702 RVYQ 705 (851) T ss_pred HHEE T ss_conf 2300 No 12 >pfam00285 Citrate_synt Citrate synthase. Probab=38.79 E-value=34 Score=16.02 Aligned_cols=49 Identities=24% Similarity=0.413 Sum_probs=30.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHHHHHH Q ss_conf 889999999859999999999999999998898688614415877887117989999999 Q gi|254781191|r 35 NRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL 94 (165) Q Consensus 35 ~eei~~il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~~de~~a~~~l 94 (165) ++.+.+.| ..|.+ .++++.|..+.+.+|+.+||| |-|.|++.|-=+..| T Consensus 218 n~~v~~ml---~~i~~-~~~~~~~i~~~l~~~~~i~GF-------GHrvYk~~DPRa~~l 266 (352) T pfam00285 218 NEAVLEML---EEIGS-PENVEEYIEKALDKGERLMGF-------GHRVYKTYDPRARIL 266 (352) T ss_pred HHHHHHHH---HHHCC-CHHHHHHHHHHHHCCCCCCCC-------CCCCCCCCCCHHHHH T ss_conf 89999999---99558-166899999998468977678-------976688899569999 No 13 >pfam07211 consensus Probab=30.70 E-value=45 Score=15.19 Aligned_cols=62 Identities=16% Similarity=0.062 Sum_probs=39.0 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCC------CCCHHHHHHHHHHHCCCCCCC-CCCCCHHHH Q ss_conf 999999999999988986886144158778871------179899999999716743101-256899999 Q gi|254781191|r 51 WMKGVKEEALNVLSSGEDLPNYELKEGRKGSRT------YNNDNQVEQLLMRELGDEAYN-RTLLSPTET 113 (165) Q Consensus 51 W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~------~~de~~a~~~l~~~g~~~~y~-~klls~t~~ 113 (165) .-.-|..|+..+....-.+. ++-.-|++..|+ +.|+++...-+..+|+++.-+ ..=++=+.+ T Consensus 56 fegli~~Y~~~~~~~DpK~K-~~tpyGkv~srK~k~~~~~~De~~Ll~~lk~n~~~~~Ik~t~k~~w~el 124 (162) T pfam07211 56 FEGLVTQYYLKQLTKDPKAK-LSTPYGKVKSRKRKKAPNASDKDQLLNHLKEAGFTEFIKVTEKVSWSDL 124 (162) T ss_pred HHHHHHHHHHHHHHCCCCCC-CCCCCCCEEEECCCCCCCCCCHHHHHHHHHHCCCHHHEECCCCCCHHHH T ss_conf 99999999999873187520-0566764111035788564799999999986583333001564567776 No 14 >cd06109 BsCS-I_like Bacillus subtilis (Bs) citrate synthase CS-I_like. CS catalyzes the condensation of acetyl coenzyme A (AcCoA) and oxalacetate (OAA) to form citrate and coenzyme A (CoA), the first step in the citric acid cycle (TCA or Krebs cycle). 2MCS catalyzes the condensation of propionyl-coenzyme A (PrCoA) and OAA to form 2-methylcitrate and coenzyme A (CoA) during propionate metabolism. The overall CS reaction is thought to proceed through three partial reactions and involves both closed and open conformational forms of the enzyme: a) the carbanion or equivalent is generated from AcCoA by base abstraction of a proton, b) the nucleophilic attack of this carbanion on OAA to generate citryl-CoA, and c) the hydrolysis of citryl-CoA to produce citrate and CoA. This group contains proteins similar to BsCS-I, one of two CS isozymes in the gram-positive B. subtilis. The majority of CS activity in B. subtilis is provided by the other isozyme, BsCS-II (not included in this group). BsCS- Probab=30.40 E-value=46 Score=15.16 Aligned_cols=50 Identities=20% Similarity=0.314 Sum_probs=32.4 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHHHHHH Q ss_conf 9889999999859999999999999999998898688614415877887117989999999 Q gi|254781191|r 34 SNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL 94 (165) Q Consensus 34 t~eei~~il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~~de~~a~~~l 94 (165) .++++...+ +.|.+ .+++++|..+.+.+|+.+||| |-|.|++.|-=...+ T Consensus 207 A~~~v~~~l---~ei~~-~~~~~~~i~~~l~~~~~i~Gf-------GHrvY~~~DPRa~~l 256 (349) T cd06109 207 APGPVLDML---DAIGT-PENAEAWLREALARGERLMGF-------GHRVYRVRDPRADVL 256 (349) T ss_pred HHHHHHHHH---HHHCC-HHHHHHHHHHHHHCCCCCCCC-------CCCCCCCCCCHHHHH T ss_conf 679999999---99588-346999999998357875378-------988789999449999 No 15 >COG4318 Uncharacterized protein conserved in bacteria [Function unknown] Probab=29.83 E-value=36 Score=15.85 Aligned_cols=78 Identities=17% Similarity=0.238 Sum_probs=51.8 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCC-CCHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 668988999999985999999999999999999889868861441587788711-7989999999971674310125689 Q gi|254781191|r 31 QILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTY-NNDNQVEQLLMRELGDEAYNRTLLS 109 (165) Q Consensus 31 ~~Lt~eei~~il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~-~de~~a~~~l~~~g~~~~y~~klls 109 (165) ..|-+.+|..=+..+..=.+-+..-+....++-+.-+++| +|-|- |+|-| +|-.-...+|.+.|..+.|..+.+. T Consensus 30 ~~LrPTQitVGm~eVe~kr~~la~~kpk~~~~fl~kh~iP---vVlGP-ggr~YltDhHHL~~Al~~~gvk~g~~~~va~ 105 (221) T COG4318 30 AALRPTQITVGMREVELKRAKLAEKKPKDGEAFLGKHEIP---VVLGP-GGRFYLTDHHHLSRALLREGVKQGLPVVVAD 105 (221) T ss_pred HHCCCCCEEHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCC---EEECC-CCCEEEECHHHHHHHHHHHCCCCCEEEEEEC T ss_conf 6648301110056888789999860911288996068777---68678-9722551048899999871743031577734 Q ss_pred HHH Q ss_conf 999 Q gi|254781191|r 110 PTE 112 (165) Q Consensus 110 ~t~ 112 (165) ..+ T Consensus 106 ~s~ 108 (221) T COG4318 106 LSP 108 (221) T ss_pred CCC T ss_conf 451 No 16 >KOG3379 consensus Probab=28.52 E-value=49 Score=14.96 Aligned_cols=83 Identities=11% Similarity=0.218 Sum_probs=57.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHCCCCCCCC--EEECCCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 166898899999998599999999999999------9999889868861--44158778871179899999999716743 Q gi|254781191|r 30 MQILSNRQLSQVMNVLPLIETWMKGVKEEA------LNVLSSGEDLPNY--ELKEGRKGSRTYNNDNQVEQLLMRELGDE 101 (165) Q Consensus 30 ~~~Lt~eei~~il~~~~~l~~W~~~v~~~A------~~~~~~G~~~pG~--KlVeGRs~~R~~~de~~a~~~l~~~g~~~ 101 (165) +.+||.+|.++++.-+..+.+-++.+..-. ..-...|+.||.. -++.-|.| -+.+.+.+-..|... .++ T Consensus 51 l~dLt~~E~aDlF~t~~~v~~~lek~~~~ts~ti~iQDG~~AGQTVpHvHvHIlPR~~g--Df~~Nd~IY~~L~~~-~~e 127 (150) T KOG3379 51 LTDLTAAETADLFTTVQKVQRVLEKHYNATSLTIAIQDGPEAGQTVPHVHVHILPRKAG--DFGDNDLIYDELDKH-EKE 127 (150) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCEEEEEECCCCCC--CCCCCHHHHHHHHHC-CCC T ss_conf 13598188999999999999999998266615898425600176565268987442025--666645999988752-455 Q ss_pred CCCCCCCCHHHHHH Q ss_conf 10125689999999 Q gi|254781191|r 102 AYNRTLLSPTETEQ 115 (165) Q Consensus 102 ~y~~klls~t~~EK 115 (165) +.+++.-|+.+|.+ T Consensus 128 ~~~r~~Rs~eEM~e 141 (150) T KOG3379 128 LEDRKPRSLEEMAE 141 (150) T ss_pred CCCCCCCHHHHHHH T ss_conf 33378324899999 No 17 >PRK06341 single-stranded DNA-binding protein; Provisional Probab=27.78 E-value=32 Score=16.11 Aligned_cols=39 Identities=21% Similarity=0.379 Sum_probs=26.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHH Q ss_conf 9999999999999999889868861441587788711798999 Q gi|254781191|r 48 IETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQV 90 (165) Q Consensus 48 l~~W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~~de~~a 90 (165) +..|-+.+.+.+.+-+..|..| +|+||--.|.|+|.+-. T Consensus 57 vv~~~~~lae~~~~yl~KG~~V----~VeGrL~~r~w~dk~G~ 95 (165) T PRK06341 57 VVIFNEGLCKVAEQYLKKGAKV----YIEGQLQTRKWTDQSGV 95 (165) T ss_pred HHHHHHHHHHHHHHHHCCCCEE----EEECCEEEEEEECCCCC T ss_conf 6665289999999973789879----88322687311879998 No 18 >KOG0432 consensus Probab=27.07 E-value=52 Score=14.79 Aligned_cols=50 Identities=14% Similarity=0.356 Sum_probs=37.6 Q ss_pred HHHHHHHHHHHHHHH-HHHCCCCCCCCEEEC--CCCCCCCC---CCHHHHHHHHHH Q ss_conf 999999999999999-998898688614415--87788711---798999999997 Q gi|254781191|r 47 LIETWMKGVKEEALN-VLSSGEDLPNYELKE--GRKGSRTY---NNDNQVEQLLMR 96 (165) Q Consensus 47 ~l~~W~~~v~~~A~~-~~~~G~~~pG~KlVe--GRs~~R~~---~de~~a~~~l~~ 96 (165) ....|++.+++.-.+ ++==|+++|-|.++. |+...++| .+++.|.+.+.+ T Consensus 446 ~w~~W~~~i~DWCiSRQLWWGHrIPAy~v~~~~~~~ee~~Wvv~rseeeA~ekaa~ 501 (995) T KOG0432 446 EWYRWLENIRDWCISRQLWWGHRIPAYFVNLSDGRAEEDYWVVARSEEEAREKAAE 501 (995) T ss_pred HHHHHHHHCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCEEEEECCHHHHHHHHHH T ss_conf 89999751302121344430454642562024677766616885889999999999 No 19 >TIGR02182 GRXB Glutaredoxin, GrxB family; InterPro: IPR011901 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This family includes the highly abundant Escherichia coli GrxB (Grx2) glutaredoxin which is notably longer than either GrxA or GrxC. Unlike the other two E. coli glutaredoxins, GrxB appears to be unable to reduce ribonucleotide reductase , and may have more to do with resistance to redox stress .; GO: 0009055 electron carrier activity, 0015035 protein disulfide oxidoreductase activity. Probab=26.82 E-value=17 Score=17.86 Aligned_cols=19 Identities=32% Similarity=0.550 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 8599999999999999999 Q gi|254781191|r 44 VLPLIETWMKGVKEEALNV 62 (165) Q Consensus 44 ~~~~l~~W~~~v~~~A~~~ 62 (165) ..+.|..|+..|-.|.-.- T Consensus 83 ~s~~I~~WL~~v~~y~~kL 101 (212) T TIGR02182 83 VSPEIEAWLRKVTGYLNKL 101 (212) T ss_pred CCHHHHHHHHHHCCHHHHH T ss_conf 8778999998731123221 No 20 >PRK06245 cofG FO synthase subunit 1; Reviewed Probab=26.82 E-value=53 Score=14.77 Aligned_cols=11 Identities=36% Similarity=1.165 Sum_probs=8.2 Q ss_pred CCCCEECCCCC Q ss_conf 64360634667 Q gi|254781191|r 3 ENACRFCRAKP 13 (165) Q Consensus 3 ~d~CrFCkak~ 13 (165) -+.|+||-.+. T Consensus 23 ~~~C~fCaF~~ 33 (336) T PRK06245 23 RNRCGYCTFRP 33 (336) T ss_pred CCCCCCCCCCC T ss_conf 26882586746 No 21 >KOG3300 consensus Probab=25.97 E-value=55 Score=14.67 Aligned_cols=29 Identities=10% Similarity=0.263 Sum_probs=23.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCEEEC Q ss_conf 99999999999999998898688614415 Q gi|254781191|r 48 IETWMKGVKEEALNVLSSGEDLPNYELKE 76 (165) Q Consensus 48 l~~W~~~v~~~A~~~~~~G~~~pG~KlVe 76 (165) =..|++.++.+-..++.-=+.|||||+=| T Consensus 83 Dr~~l~~lrkn~eeEaeiMKdVPgWkvGE 111 (146) T KOG3300 83 DRRFLSELRKNLEEEAEIMKDVPGWKVGE 111 (146) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 79999999986888999970499862376 No 22 >KOG3248 consensus Probab=25.45 E-value=26 Score=16.69 Aligned_cols=13 Identities=8% Similarity=0.217 Sum_probs=7.4 Q ss_pred HCCCCCCCCEEEC Q ss_conf 8898688614415 Q gi|254781191|r 64 SSGEDLPNYELKE 76 (165) Q Consensus 64 ~~G~~~pG~KlVe 76 (165) ++-+-+|||-.-+ T Consensus 251 lH~qlYP~WSARd 263 (421) T KOG3248 251 LHMQLYPGWSARD 263 (421) T ss_pred HHHHHCCCCCHHH T ss_conf 7777567852200 No 23 >TIGR00734 hisAF_rel hisA/hisF family protein; InterPro: IPR004650 This family of proteins has so far been found in three archaeal species: Methanobacterium thermoautotrophicum, Methanococcus jannaschii, and Archaeoglobus fulgidus. Proteins are homologous to phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase (HisA) and, with lower similarity, to the cyclase HisF, both of which are enzymes of histidine biosynthesis. Each species with this protein also encodes HisA. However, the function of proteins in this group is unknown.. Probab=25.36 E-value=56 Score=14.60 Aligned_cols=32 Identities=22% Similarity=0.295 Sum_probs=19.6 Q ss_pred EEECCCCCCC-CCC----------CHHHHHHHHHHHCCCCCCC Q ss_conf 4415877887-117----------9899999999716743101 Q gi|254781191|r 73 ELKEGRKGSR-TYN----------NDNQVEQLLMRELGDEAYN 104 (165) Q Consensus 73 KlVeGRs~~R-~~~----------de~~a~~~l~~~g~~~~y~ 104 (165) +.|.|+||+| +|. |-..+++++...|---+|- T Consensus 13 ~aV~GKsGeRe~Y~Plesv~~~ss~P~~~ak~~~e~gar~iYi 55 (230) T TIGR00734 13 IAVAGKSGEREEYKPLESVISKSSDPVDVAKALKEIGARSIYI 55 (230) T ss_pred EEEECCCCCCCCCCCCEEEEECCCCHHHHHHHHHHCCCCEEEE T ss_conf 2784778888888971278707888789999998638846863 No 24 >cd06112 citrate_synt_like_1_1 Citrate synthase (CS) catalyzes the condensation of acetyl coenzyme A (AcCoA) and oxalacetate (OAA) to form citrate and coenzyme A (CoA), the first step in the oxidative citric acid cycle (TCA or Krebs cycle). Peroxisomal CS is involved in the glyoxylate cycle. This group also includes CS proteins which functions as a 2-methylcitrate synthase (2MCS). 2MCS catalyzes the condensation of propionyl-CoA (PrCoA) and OAA to form 2-methylcitrate and CoA during propionate metabolism. This group contains proteins which functions exclusively as either a CS or a 2MCS, as well as those with relaxed specificity which have dual functions as both a CS and a 2MCS. The overall CS reaction is thought to proceed through three partial reactions and involves both closed and open conformational forms of the enzyme: a) the carbanion or equivalent is generated from AcCoA by base abstraction of a proton, b) the nucleophilic attack of this carbanion on OAA to generate citryl-CoA, an Probab=25.23 E-value=56 Score=14.58 Aligned_cols=50 Identities=18% Similarity=0.319 Sum_probs=31.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHHHHHHH Q ss_conf 8899999998599999999999999999988986886144158778871179899999999 Q gi|254781191|r 35 NRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM 95 (165) Q Consensus 35 ~eei~~il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~~de~~a~~~l~ 95 (165) ++++.+.|. +|.+ .+++++|..+.+.+|+.+||| |-|.|++.|-=...|+ T Consensus 223 n~~v~~ml~---ei~~-~~~~~~~i~~~l~~~~~i~GF-------GHrvYk~~DPRa~~L~ 272 (373) T cd06112 223 NEDVLEMLE---EIGS-PENVKAYLDKKLANKQKIWGF-------GHRVYKTKDPRATILQ 272 (373) T ss_pred HHHHHHHHH---HHCC-HHHHHHHHHHHHHCCCCCCCC-------CCCCCCCCCCHHHHHH T ss_conf 999999999---8499-888999999987268877888-------8765788994289999 No 25 >TIGR02262 benz_CoA_lig benzoate-CoA ligase family; InterPro: IPR011957 Characterised members of this protein family include benzoate-CoA ligase, 4-hydroxybenzoate-CoA ligase, 2-aminobenzoate-CoA ligase, etc. Members are related to fatty acid and acetate CoA ligases.. Probab=24.40 E-value=33 Score=16.05 Aligned_cols=87 Identities=23% Similarity=0.292 Sum_probs=48.1 Q ss_pred HCCCCCCCCEE--E------------------CCC-CCCCCCCCHHHHHHHHH----HHC---------C-------CCC Q ss_conf 88986886144--1------------------587-78871179899999999----716---------7-------431 Q gi|254781191|r 64 SSGEDLPNYEL--K------------------EGR-KGSRTYNNDNQVEQLLM----REL---------G-------DEA 102 (165) Q Consensus 64 ~~G~~~pG~Kl--V------------------eGR-s~~R~~~de~~a~~~l~----~~g---------~-------~~~ 102 (165) .+|++||||.| | .|= |+.-.|.+-++-..+.+ +.| + ||| T Consensus 341 tsG~PVPGY~~rL~~ed~G~~vA~Ge~GeL~v~GPSSA~~YWn~R~Ksr~TF~G~WTrsGDKY~r~~dG~YtYaGR~DDm 420 (520) T TIGR02262 341 TSGKPVPGYRLRLVGEDGGQDVADGEVGELLVSGPSSAAGYWNNREKSRATFEGEWTRSGDKYVRNDDGVYTYAGRTDDM 420 (520) T ss_pred CCCCCCCCCEEEEECCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCCCCCCCCCCCEEECCCCEEEEECCCCCC T ss_conf 74754887368874277886113776662046377545567988146132378895224871036378506870244352 Q ss_pred CCCC--CCCHHHHHHHHCHH-CCCHHH---HHHHHHHEECCCCCCEEEECCCCCC Q ss_conf 0125--68999999986521-014445---6543210246799807803688751 Q gi|254781191|r 103 YNRT--LLSPTETEQLVKRK-KVSETT---WEQLQKFITRKDGKQVIVPCDLPVN 151 (165) Q Consensus 103 y~~k--lls~t~~EKllgkk-~f~e~~---~~~l~~li~kp~GKPtLvp~sDkRp 151 (165) ++-+ -+||=+.|..|=.. .+-|-. |+. .+-.+||..=-+|.|-.|--| T Consensus 421 lKVSGIyVSPfEvE~AL~~H~~VLEAAVvg~~D-~~GL~KPKAFvVL~~G~~~~~ 474 (520) T TIGR02262 421 LKVSGIYVSPFEVESALIAHPAVLEAAVVGVED-EDGLIKPKAFVVLRPGADIDN 474 (520) T ss_pred EEECCEEECCHHHHHHHHCCHHHCEEEECCCCC-CCCCCCCCEEEEECCCCCCCH T ss_conf 355241678179998851351222042158748-888717711585078876451 No 26 >cd06115 AthCS_per_like Arabidopsis thaliana (Ath) peroxisomal (Per) CS_like. CS catalyzes the condensation of acetyl coenzyme A (AcCoA) and oxalacetate (OAA) to form citrate and coenzyme A (CoA), the first step in the citric acid cycle (TCA or Krebs cycle). The overall CS reaction is thought to proceed through three partial reactions and involves both closed and open conformational forms of the enzyme: a) the carbanion or equivalent is generated from AcCoA by base abstraction of a proton, b) the nucleophilic attack of this carbanion on OAA to generate citryl-CoA, and c) the hydrolysis of citryl-CoA to produce citrate and CoA. This group contains three Arabidopsis peroxisomal CS proteins, CYS1, -2, and -3 which are involved in the glyoxylate cycle. AthCYS1, in addition to a peroxisomal targeting sequence, has a predicted secretory signal peptide; it may be targeted to both the secretory pathway and the peroxisomes and is thought to be located in the extracellular matrix. AthCSY1 is expr Probab=24.35 E-value=59 Score=14.48 Aligned_cols=50 Identities=18% Similarity=0.331 Sum_probs=31.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHHHHHHH Q ss_conf 8899999998599999999999999999988986886144158778871179899999999 Q gi|254781191|r 35 NRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM 95 (165) Q Consensus 35 ~eei~~il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~~de~~a~~~l~ 95 (165) |+++.+.| ..|.+ .++|++|..+.+..|+.++|| |-|.|+..|-=+..++ T Consensus 258 ne~v~~ml---~ei~s-~~~v~~~i~~~l~~~~rimGf-------GHrVYk~~DPRa~ilk 307 (410) T cd06115 258 NEAVLRML---AEIGT-VENIPAFIEGVKNRKRKLSGF-------GHRVYKNYDPRAKIIK 307 (410) T ss_pred HHHHHHHH---HHHCC-HHHHHHHHHHHHCCCCCCCCC-------CCCCCCCCCCHHHHHH T ss_conf 89999999---99647-223799999987167876778-------9876888882779999 No 27 >TIGR01806 CM_mono2 chorismate mutase, putative; InterPro: IPR008240 Chorismate mutase (CM; 5.4.99.5 from EC) catalyses the reaction at the branch point of the biosynthetic pathway leading to the three aromatic amino acids, phenylalanine, tryptophan and tyrosine (chorismic acid is the last common intermediate, and CM leads to the L-phenylalanine/L-tyrosine branch). It is part of the shikimate pathway, which is present only in bacteria, fungi and plants. Members of this family contain a chorismate mutase domain of the AroQ class (Prokaryotic type) that has an all-helical structure. The three types of CM are AroQ class, Prokaryotic type (e.g., IPR008239 from INTERPRO and others); AroQ class, Eukaryotic type (IPR008238 from INTERPRO); and AroH class (IPR008243 from INTERPRO). They fall into two structural folds (AroQ class and AroH class) which are completely unrelated . The two types of the AroQ structural class (the Escherichia coli CM dimer and the yeast CM monomer) can be structurally superimposed, and the topology of the four-helix bundle forming the active site is conserved . Periplasmic chorismate mutases form a subclass of the AroQ class, and are twice the size of cytoplasmic AroQ proteins due to a carboxy-terminal domain of unknown function . This C-terminal domain is so far unique to members of this group. For additional information please see , , , .. Probab=23.31 E-value=35 Score=15.92 Aligned_cols=99 Identities=10% Similarity=0.050 Sum_probs=60.6 Q ss_pred HHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHH----HCCCCCCCCCC-CCHHHHHHHHCHHCCCHHH Q ss_conf 999999999999889868861441587788711798999999997----16743101256-8999999986521014445 Q gi|254781191|r 52 MKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMR----ELGDEAYNRTL-LSPTETEQLVKRKKVSETT 126 (165) Q Consensus 52 ~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~~de~~a~~~l~~----~g~~~~y~~kl-ls~t~~EKllgkk~f~e~~ 126 (165) +..+-+.+-+++.-++.|-+||...|.| .-=..=|+.+...+.. +|.+.=|=+.+ ---=.+-|++-...|+.+ T Consensus 6 l~~l~~~~~~RL~~~d~VA~yK~~~~~p-I~D~~RE~~vL~~~~~qA~~~gl~~~~v~~~f~aQi~A~K~iQy~~~~~W- 83 (116) T TIGR01806 6 LDQLVDLANERLQLADDVALYKAESNLP-IEDSPREEQVLDSLRAQAQSAGLDADYVERFFQAQINANKAIQYRLLSDW- 83 (116) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCC-CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH- T ss_conf 7889999999988637999889883899-77738899999999886774689867764899999999999999888866- Q ss_pred HHHHHHHEECCCCCCEEEECC-CCCCCCCCHHH Q ss_conf 654321024679980780368-87511100234 Q gi|254781191|r 127 WEQLQKFITRKDGKQVIVPCD-LPVNHLKANIS 158 (165) Q Consensus 127 ~~~l~~li~kp~GKPtLvp~s-DkRpa~~~~~~ 158 (165) . .-.+..-+...+.+ |-||.|....+ T Consensus 84 -~-----~~~~~p~~~~~DL~~~~R~~i~~l~~ 110 (116) T TIGR01806 84 -L-----ANPPEPPVQVPDLTKDLRSAIDQLNT 110 (116) T ss_pred -H-----HCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf -5-----25788778778887656788999999 No 28 >pfam03439 Spt5-NGN Early transcription elongation factor of RNA pol II, NGN section. Spt5p and prokaryotic NusG are shown to contain a novel 'NGN' domain. The combined NGN and KOW motif regions of Spt5 form the binding domain with Spt4. Spt5 complexes with Spt4 as a 1:1 heterodimer snf this Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The Schizosaccharomyces pombe core Spt5-Spt4 complex is a heterodimer bearing a trypsin-resistant Spt4-binding domain within the Spt5 subunit. Probab=22.91 E-value=62 Score=14.30 Aligned_cols=44 Identities=23% Similarity=0.222 Sum_probs=28.7 Q ss_pred CCCCCCEEECCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHH Q ss_conf 868861441587788711798999999997167431012568999999986 Q gi|254781191|r 67 EDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLV 117 (165) Q Consensus 67 ~~~pG~KlVeGRs~~R~~~de~~a~~~l~~~g~~~~y~~klls~t~~EKll 117 (165) ..+.||=.||.++ +..+.+++...-+-.....+++++.+|+++| T Consensus 39 ~~lkGyIyVEA~~-------~~~v~~ai~gi~~v~~~~~~~Vp~~E~~~~L 82 (82) T pfam03439 39 DGLKGYIYVEADR-------QADVERALEGLPHVRGVVMGLVPIKEMEHLL 82 (82) T ss_pred CCCCEEEEEEECC-------HHHHHHHHHHHHHHHCCCCCCCCHHHHHCCC T ss_conf 8984699998798-------8999999960377746577528889975109 No 29 >KOG4616 consensus Probab=22.51 E-value=46 Score=15.16 Aligned_cols=26 Identities=19% Similarity=0.441 Sum_probs=20.8 Q ss_pred HEECCCCCCEEEECCCCCCCCCCHHH Q ss_conf 02467998078036887511100234 Q gi|254781191|r 133 FITRKDGKQVIVPCDLPVNHLKANIS 158 (165) Q Consensus 133 li~kp~GKPtLvp~sDkRpa~~~~~~ 158 (165) -+++|+|....+|...||.-+.++.| T Consensus 61 kfi~pdgstimipaaeprk~fk~ald 86 (137) T KOG4616 61 KFIQPDGSTIMIPAAEPRKIFKLALD 86 (137) T ss_pred EEECCCCCEEEEECCCCHHHHHCCCC T ss_conf 58768997675124571778732001 No 30 >PRK13627 carnitine operon protein CaiE; Provisional Probab=22.28 E-value=33 Score=16.06 Aligned_cols=72 Identities=8% Similarity=0.040 Sum_probs=41.1 Q ss_pred HHCCCCCCCCEEECCCCCCCCC---CCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHCHHCCCHHHHHHHHHHEECCCC Q ss_conf 9889868861441587788711---7989999999971674310125689999999865210144456543210246799 Q gi|254781191|r 63 LSSGEDLPNYELKEGRKGSRTY---NNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDG 139 (165) Q Consensus 63 ~~~G~~~pG~KlVeGRs~~R~~---~de~~a~~~l~~~g~~~~y~~klls~t~~EKllgkk~f~e~~~~~l~~li~kp~G 139 (165) ...|+.+|...|+.|.. -|.. +|++......-...|-++- .+|.... ...++|-+..++ T Consensus 121 V~~g~~ip~~~L~~G~P-Ak~vR~lt~~ei~~~~~~a~~Y~~la----------------~r~~~~l-~~~~Pl~~~e~~ 182 (196) T PRK13627 121 VKAGFQGEKRQLLMGTP-ARAVRSVSDQELHWKRLNTKEYQDLA----------------GRCHASL-HETQPLTQMEEN 182 (196) T ss_pred ECCCCCCCCCCEEEECC-EEEEECCCHHHHHHHHHHHHHHHHHH----------------HHHHHHC-CCCCCCCCCCCC T ss_conf 57994938981998778-16811499999999887699999999----------------9998548-546886446799 Q ss_pred CCEEEECCCCCCC Q ss_conf 8078036887511 Q gi|254781191|r 140 KQVIVPCDLPVNH 152 (165) Q Consensus 140 KPtLvp~sDkRpa 152 (165) .|.|-..++.+|. T Consensus 183 r~~~~~~~~~~p~ 195 (196) T PRK13627 183 RPRLQGTTDVTPK 195 (196) T ss_pred CCCCCCCCCCCCC T ss_conf 8633576566879 No 31 >cd06111 DsCS_like Cold-active citrate synthase (CS) from an Antarctic bacterial strain DS2-3R (Ds)-like. CS catalyzes the condensation of acetyl coenzyme A (AcCoA) and oxalacetate (OAA) to form citrate and coenzyme A (CoA), the first step in the citric acid cycle (TCA or Krebs cycle). 2-methylcitrate synthase (2MCS) catalyzes the condensation of propionyl-coenzyme A (PrCoA) and OAA to form 2-methylcitrate and coenzyme A (CoA) during propionate metabolism. The overall CS reaction is thought to proceed through three partial reactions: a) the carbanion or equivalent is generated from AcCoA by base abstraction of a proton, b) the nucleophilic attack of this carbanion on OAA to generate citryl-CoA, and c) the hydrolysis of citryl-CoA to produce citrate and CoA. DsCS, compared with CS from the hyperthermophile Pyrococcus furiosus (not included in this group), has an increase in the size of surface loops, a higher proline content in the loop regions, a more accessible active site, and a highe Probab=22.00 E-value=65 Score=14.18 Aligned_cols=41 Identities=22% Similarity=0.414 Sum_probs=28.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHH Q ss_conf 9999859999999999999999998898688614415877887117989 Q gi|254781191|r 40 QVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDN 88 (165) Q Consensus 40 ~il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~~de~ 88 (165) .++..+.+|.+ .+++++|..+.+..|+.++|| |.|.|++.| T Consensus 221 ~v~~ml~ei~~-~~~~~~~i~~~l~~~~~i~GF-------GHrvYk~~D 261 (362) T cd06111 221 AVMHMMLEIDD-PEKAAQWMLDALARKEKVMGF-------GHRVYKSGD 261 (362) T ss_pred HHHHHHHHHCC-HHHHHHHHHHHHHCCCCCCCC-------CCCCCCCCC T ss_conf 99999998378-122699999998468877888-------875577889 No 32 >PRK12351 methylcitrate synthase; Provisional Probab=21.67 E-value=66 Score=14.14 Aligned_cols=46 Identities=22% Similarity=0.355 Sum_probs=26.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHHHHHH Q ss_conf 999859999999999999999998898688614415877887117989999999 Q gi|254781191|r 41 VMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL 94 (165) Q Consensus 41 il~~~~~l~~W~~~v~~~A~~~~~~G~~~pG~KlVeGRs~~R~~~de~~a~~~l 94 (165) ++....+|.+ .++|++|..+.+..|+.+||| |-|.|.+.|-=+..| T Consensus 231 v~~ml~eI~~-~~~v~~~i~~~l~~~~rl~GF-------GHrvYk~~DPRa~~l 276 (377) T PRK12351 231 VMEMLEEIGS-PEEAEAYLRKRLANKEKIMGF-------GHRVYKVSDPRAKIL 276 (377) T ss_pred HHHHHHHHCC-CHHHHHHHHHHHHCCCCCCCC-------CCCCCCCCCCHHHHH T ss_conf 9999999589-257799999998558976577-------876578999368999 No 33 >cd05580 STKc_PKA STKc_PKA: Serine/Threonine Kinases (STKs), cAMP-dependent protein kinase (PKA) subfamily, catalytic (c) subunit. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The PKA subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase (PI3K). This subfamily is composed of the cAMP-dependent proteins kinases, PKA and PRKX. The inactive PKA holoenzyme is a heterotetramer composed of two phosphorylated and active catalytic (C) subunits with a dimer of regulatory (R) subunits. Activation is achieved through the binding of the important second messenger cAMP to the R subunits, which leads to the dissociation of PKA into the R dimer and two active C subunits. PKA is present ubiquitously in cells and interacts with many different downstream targets. It pl Probab=21.66 E-value=59 Score=14.48 Aligned_cols=61 Identities=10% Similarity=-0.046 Sum_probs=40.3 Q ss_pred HHHHHHHHCCCCCCCCCCCCHHHHHHHHCHHCCCHHHHHHHHHHEECCCCCCEEEECCCCC Q ss_conf 9999997167431012568999999986521014445654321024679980780368875 Q gi|254781191|r 90 VEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPV 150 (165) Q Consensus 90 a~~~l~~~g~~~~y~~klls~t~~EKllgkk~f~e~~~~~l~~li~kp~GKPtLvp~sDkR 150 (165) +..++.+...-++.+|--...+.++.++...=|..+-|++|..-..+||=.|+|.-.+|++ T Consensus 225 ~~dLi~~mL~~dP~~Ri~~~~~t~~eil~Hpwf~~i~w~~l~~~~~~pP~~p~~~~~~d~~ 285 (290) T cd05580 225 AKDLLRNLLQVDLTKRLGNLKNGVNDIKNHPWFAGIDWIALLQKKIEAPFVPKVKGPGDTS 285 (290) T ss_pred HHHHHHHHCCCCHHHHCCCCCCCHHHHHCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCHHH T ss_conf 9999998766898996899999999997397808999889971899969744279976687 No 34 >PRK12710 flgJ peptidoglycan hydrolase; Provisional Probab=21.34 E-value=49 Score=14.95 Aligned_cols=18 Identities=11% Similarity=-0.023 Sum_probs=8.6 Q ss_pred CCCCCCCHHHHHHHHCHH Q ss_conf 012568999999986521 Q gi|254781191|r 103 YNRTLLSPTETEQLVKRK 120 (165) Q Consensus 103 y~~klls~t~~EKllgkk 120 (165) ...-+|.=+.+|.=-||. T Consensus 151 ~~~~~lAQAALEsGWGk~ 168 (291) T PRK12710 151 DPKLLVAQAALETGWGKF 168 (291) T ss_pred CHHHHHHHHHHHCCCCCC T ss_conf 989999999986476723 No 35 >TIGR01631 Trypano_RHS trypanosome RHS family; InterPro: IPR006518 These sequences are full-length and part-length members of the RHS (retrotransposon hot spot) family in Trypanosoma brucei and Trypanosoma cruzi. Members of this family are frequently interrupted by non-LTR retrotransposons inserted at exactly the same relative position. . Probab=20.48 E-value=54 Score=14.68 Aligned_cols=23 Identities=9% Similarity=0.325 Sum_probs=14.4 Q ss_pred CCCCCCCCEEECCC--CCCCCCCCHH Q ss_conf 89868861441587--7887117989 Q gi|254781191|r 65 SGEDLPNYELKEGR--KGSRTYNNDN 88 (165) Q Consensus 65 ~G~~~pG~KlVeGR--s~~R~~~de~ 88 (165) +|.+--|+|+|+|| - ...||.|+ T Consensus 190 ~~~~PLGMkV~~G~~~p-P~mWt~ee 214 (814) T TIGR01631 190 HDTEPLGMKVVDGRDNP-PHMWTEEE 214 (814) T ss_pred CCCCCCCEEEECCCCCC-CCCCEEEC T ss_conf 68898521761787968-66760300 No 36 >KOG2672 consensus Probab=20.46 E-value=34 Score=15.96 Aligned_cols=12 Identities=42% Similarity=1.007 Sum_probs=9.4 Q ss_pred CCCEECCCCCCC Q ss_conf 436063466748 Q gi|254781191|r 4 NACRFCRAKPRC 15 (165) Q Consensus 4 d~CrFCkak~~C 15 (165) -+||||-.|..= T Consensus 123 RGCRFCsVKTsR 134 (360) T KOG2672 123 RGCRFCSVKTSR 134 (360) T ss_pred CCCCEEEEECCC T ss_conf 675201210378 No 37 >pfam09630 DUF2024 Domain of unknown function (DUF2024). This protein of 86 residues is expressed in bacteria. It consists of four alpha helices and two beta strands. Its function is unknown. One UniProt entry gives the gene name as Traf5. Probab=20.04 E-value=34 Score=16.02 Aligned_cols=11 Identities=18% Similarity=0.108 Sum_probs=4.7 Q ss_pred HHHHHHHHHCC Q ss_conf 99999997167 Q gi|254781191|r 89 QVEQLLMRELG 99 (165) Q Consensus 89 ~a~~~l~~~g~ 99 (165) .++......|| T Consensus 66 ~V~~~I~~~Gy 76 (81) T pfam09630 66 EVEEAIKENGY 76 (81) T ss_pred HHHHHHHHCCE T ss_conf 99999985987 Done!