Query gi|254781152|ref|YP_003065565.1| hypothetical protein CLIBASIA_05295 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 155 No_of_seqs 17 out of 19 Neff 2.2 Searched_HMMs 39220 Date Mon May 30 06:19:13 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781152.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam07120 DUF1376 Protein of u 99.4 3.7E-13 9.4E-18 97.8 4.4 59 1-60 29-87 (88) 2 COG3756 Uncharacterized protei 97.9 1.4E-05 3.6E-10 53.7 3.9 104 1-117 30-133 (153) 3 COG2996 Predicted RNA-bindinin 82.7 2.5 6.4E-05 23.1 4.5 44 4-48 234-281 (287) 4 COG2188 PhnF Transcriptional r 78.2 6.5 0.00017 20.7 6.7 88 4-92 24-119 (236) 5 pfam08784 RPA_C Replication pr 77.3 4 0.0001 22.0 4.1 32 13-44 66-98 (103) 6 pfam00392 GntR Bacterial regul 69.7 11 0.00027 19.5 5.4 42 5-47 18-59 (64) 7 pfam09681 Phage_rep_org_N N-te 67.6 12 0.0003 19.2 6.6 52 8-59 49-101 (121) 8 cd07377 WHTH_GntR Winged helix 64.7 13 0.00034 18.9 5.3 41 5-46 19-59 (66) 9 COG1281 Disulfide bond chapero 62.8 1.1 2.9E-05 25.2 -1.4 36 3-38 217-252 (286) 10 smart00419 HTH_CRP helix_turn_ 56.3 19 0.00048 18.0 4.7 40 10-49 6-45 (48) 11 pfam10685 KGG Stress-induced b 53.2 5.2 0.00013 21.3 0.7 19 61-79 6-24 (26) 12 COG4533 ABC-type uncharacteriz 53.1 22 0.00055 17.7 7.4 65 7-71 18-90 (564) 13 cd00498 Hsp33 Heat shock prote 50.1 4.8 0.00012 21.5 0.1 33 4-36 214-246 (275) 14 PRK01402 hslO Hsp33-like chape 49.8 4.5 0.00012 21.6 -0.0 16 18-33 273-288 (327) 15 pfam04444 Dioxygenase_N Catech 47.6 26 0.00067 17.2 3.7 57 12-73 12-73 (74) 16 TIGR00118 acolac_lg acetolacta 46.9 20 0.0005 17.9 2.8 39 27-65 201-241 (593) 17 pfam00325 Crp Bacterial regula 45.8 27 0.00068 17.1 3.4 31 12-42 2-32 (32) 18 KOG2403 consensus 45.4 21 0.00053 17.8 2.8 26 36-61 538-566 (642) 19 pfam01430 HSP33 Hsp33 protein. 44.8 6.2 0.00016 20.8 0.0 24 12-35 220-243 (274) 20 COG3432 Predicted transcriptio 42.2 19 0.00047 18.1 2.1 34 15-48 34-67 (95) 21 COG3729 GsiB General stress pr 41.7 11 0.00027 19.5 0.8 15 63-77 29-43 (73) 22 PRK00114 hslO Hsp33-like chape 41.1 10 0.00026 19.6 0.6 31 4-34 216-246 (289) 23 pfam00205 TPP_enzyme_M Thiamin 40.0 35 0.00089 16.5 4.4 36 30-65 2-38 (138) 24 PRK11402 DNA-binding transcrip 39.1 36 0.00092 16.4 5.5 65 5-70 29-99 (243) 25 TIGR03337 phnR transcriptional 38.8 36 0.00093 16.4 5.4 43 5-48 19-62 (231) 26 KOG1141 consensus 35.6 8.6 0.00022 20.0 -0.4 109 5-118 309-434 (1262) 27 pfam10678 DUF2492 Protein of u 35.5 38 0.00097 16.2 2.9 46 4-49 13-69 (78) 28 TIGR01553 formate-DH-alph form 35.4 36 0.00092 16.4 2.7 28 14-41 382-409 (1043) 29 PRK09392 ftrB transcriptional 34.7 42 0.0011 16.0 4.6 46 10-56 171-216 (236) 30 PRK09764 DNA-binding transcrip 34.3 43 0.0011 15.9 5.4 45 5-50 23-69 (239) 31 COG3596 Predicted GTPase [Gene 34.1 16 0.0004 18.5 0.7 13 86-98 56-68 (296) 32 pfam08220 HTH_DeoR DeoR-like h 32.0 47 0.0012 15.7 4.6 39 14-52 16-55 (57) 33 PRK10079 putative transcriptio 31.3 48 0.0012 15.7 5.4 66 5-71 29-100 (241) 34 smart00345 HTH_GNTR helix_turn 30.4 50 0.0013 15.6 4.8 42 5-47 14-55 (60) 35 PRK11269 glyoxylate carboligas 30.3 50 0.0013 15.6 5.1 44 22-65 186-230 (591) 36 TIGR01458 HAD-SF-IIA-hyp3 HAD- 29.9 17 0.00044 18.3 0.3 28 112-139 231-258 (258) 37 PRK06725 acetolactate synthase 28.2 55 0.0014 15.3 4.6 42 24-65 199-241 (570) 38 pfam05158 RNA_pol_Rpc34 RNA po 27.8 56 0.0014 15.3 6.0 75 9-87 22-110 (313) 39 COG2406 Protein distantly rela 26.7 39 0.00098 16.2 1.7 20 5-24 74-93 (172) 40 TIGR02767 TraG-Ti Ti-type conj 26.7 33 0.00083 16.7 1.3 21 60-80 494-521 (641) 41 pfam08279 HTH_11 HTH domain. T 25.7 61 0.0016 15.1 4.3 39 6-46 11-49 (55) 42 PRK11753 cAMP-regulatory prote 25.4 62 0.0016 15.0 4.8 31 14-44 170-200 (211) 43 pfam02329 HDC Histidine carbox 25.3 44 0.0011 15.9 1.7 71 33-109 185-261 (306) 44 KOG4357 consensus 24.9 30 0.00077 16.9 0.8 14 109-122 38-51 (164) 45 KOG0774 consensus 24.8 37 0.00093 16.3 1.3 100 14-113 148-282 (334) 46 PRK08979 acetolactate synthase 24.4 64 0.0016 14.9 4.0 44 23-66 190-234 (572) 47 COG1349 GlpR Transcriptional r 24.3 65 0.0016 14.9 4.1 44 14-57 21-65 (253) 48 PRK06965 acetolactate synthase 23.4 68 0.0017 14.8 4.4 41 25-65 207-248 (587) 49 COG3960 Glyoxylate carboligase 22.9 69 0.0018 14.7 4.1 36 28-63 192-228 (592) 50 PRK13626 transcriptional regul 21.8 73 0.0019 14.6 5.1 36 8-43 19-54 (551) 51 COG0028 IlvB Thiamine pyrophos 21.5 74 0.0019 14.6 4.4 40 27-66 188-228 (550) 52 PRK07979 acetolactate synthase 21.1 75 0.0019 14.5 5.2 40 26-65 193-233 (574) 53 pfam07042 TrfA TrfA protein. T 20.2 79 0.002 14.4 4.0 22 82-103 187-208 (282) No 1 >pfam07120 DUF1376 Protein of unknown function (DUF1376). This family consists of several hypothetical bacterial proteins of around 95 residues in length. The function of this family is unknown. Probab=99.38 E-value=3.7e-13 Score=97.83 Aligned_cols=59 Identities=24% Similarity=0.418 Sum_probs=54.5 Q ss_pred CCCHHCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCEECHHHHHHHHH Q ss_conf 921000489875027999777199999999999999853918993892421899999997 Q gi|254781152|r 1 MISLYDRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIFIQEGNIYNVRVEKEIIK 60 (155) Q Consensus 1 lv~MYdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l~DG~LWN~RVEeEL~n 60 (155) |..||++++|||.|+++|||+||||.++|+++++.+++.-. .+.+|+.||.|||+|+-+ T Consensus 29 l~~yy~~~~plp~d~~~LaR~~~~s~~~~~~av~~vl~~ff-~~~~~~w~~~R~e~Ei~~ 87 (88) T pfam07120 29 LDAYWATEGPLPDDDARLARLAGCSTKEWAAAVEFVLSEFF-EELDGRWTNKRCEEELAA 87 (88) T ss_pred HHHHHHCCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHH-HCCCCCEEHHHHHHHHHC T ss_conf 99998758989724999999988992889999999999865-402687303989999864 No 2 >COG3756 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.87 E-value=1.4e-05 Score=53.67 Aligned_cols=104 Identities=25% Similarity=0.383 Sum_probs=73.2 Q ss_pred CCCHHCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCEECHHHHHHHHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 92100048987502799977719999999999999985391899389242189999999756778874220111245666 Q gi|254781152|r 1 MISLYDRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIFIQEGNIYNVRVEKEIIKASEERQENGRKGGIKSSQMR 80 (155) Q Consensus 1 lv~MYdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l~DG~LWN~RVEeEL~n~sEer~~agRkGg~~S~Q~R 80 (155) |..||..++|||-|.-.+|+.|+-|.++|+++--.|+.+= -+++++.-|.||++|+.-.-|....+. +-+-+|.|.+ T Consensus 30 L~~yw~t~~pip~d~~~~a~~a~~s~krw~av~~~L~~Ff--~l~~~~w~~KR~~eeiaa~~e~~a~~s-~a~~~~~~~~ 106 (153) T COG3756 30 LMAYWATEKPIPADKAALARAAALSKKRWRAVDAILDEFF--SLQGDGWHNKRCDEEIAAQVEKEAKAS-GAAGKSVQAL 106 (153) T ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH--HHCCCHHHHHHHHHHHHHHHHHHHHHH-HHHCCCCCCC T ss_conf 9999865999996389999999984888877079999998--524444988888899999998677776-6302555663 Q ss_pred HHHHHHHHHHHCCCCHHHHHHCCCCCCCCCCCCCHHH Q ss_conf 6777655664301564786308411256447642343 Q gi|254781152|r 81 ILSKKTNNLFQGTLKPAHVFQKPESINNYSAPVDFEK 117 (155) Q Consensus 81 ~~~kk~n~~fq~~~~~~~~~~~~~~~~~~~~~~~~~~ 117 (155) -++-...-|+.+-| |++.-..|+||.- T Consensus 107 --rtr~h~~~~t~~~~--------slt~~qk~~~fs~ 133 (153) T COG3756 107 --RTRAHGAAQTGALP--------SLTQSQKPSDFSP 133 (153) T ss_pred --CCHHHHHHHHCCCC--------CCCCCCCCCCCCC T ss_conf --21578988602363--------2013557867683 No 3 >COG2996 Predicted RNA-bindining protein (contains S1 and HTH domains) [General function prediction only] Probab=82.68 E-value=2.5 Score=23.12 Aligned_cols=44 Identities=34% Similarity=0.400 Sum_probs=38.4 Q ss_pred HHCCCCCCCCHHH----HHHHHHCCCHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 0004898750279----99777199999999999999853918993892 Q gi|254781152|r 4 LYDRGGSIPDNDK----YIAGVCGCSIRRWRNIRAILEKFNKIFIQEGN 48 (155) Q Consensus 4 MYdrGgPL~~N~r----~LAg~CGCSvKkFr~aLd~Ll~~GKI~l~DG~ 48 (155) |-.+||-+|.||+ -|-.+-|.|+..|-+|+-.|++.|+|.. +|- T Consensus 234 L~~~gG~mpf~DKSsPEdIk~~FgiSKg~FKrAiGgL~K~g~I~q-~g~ 281 (287) T COG2996 234 LESNGGFMPFNDKSSPEDIKATFGISKGQFKRAIGGLMKAGKIKQ-DGD 281 (287) T ss_pred HHHCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCEEEE-CCC T ss_conf 997499244678999899998868178899998888876775877-683 No 4 >COG2188 PhnF Transcriptional regulators [Transcription] Probab=78.19 E-value=6.5 Score=20.73 Aligned_cols=88 Identities=20% Similarity=0.154 Sum_probs=60.3 Q ss_pred HHCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEE-EEC-CCEECHH----HHHHHHHHHHHHHHHHHHCCHH-- Q ss_conf 0004898750279997771999999999999998539189-938-9242189----9999997567788742201112-- Q gi|254781152|r 4 LYDRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIF-IQE-GNIYNVR----VEKEIIKASEERQENGRKGGIK-- 75 (155) Q Consensus 4 MYdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~-l~D-G~LWN~R----VEeEL~n~sEer~~agRkGg~~-- 75 (155) -|.-|..|| .++-||..-|||--+-|+|++.|.+.|.|. ..+ |-....+ -..++..++|.-...|++-+.+ T Consensus 24 ~~~~G~~LP-sE~eLa~~f~VSR~TvRkAL~~L~~eGli~r~~G~GtfV~~~~~~~~~~~~~~f~e~~~~~g~~~~~~vl 102 (236) T COG2188 24 ELPPGDKLP-SERELAEQFGVSRMTVRKALDELVEEGLIVRRQGKGTFVASPKEQSPLLELTSFSEELKSQGLEPTTEVL 102 (236) T ss_pred CCCCCCCCC-CHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 999989595-9899999979889999999999998998899516678988775423366776499999974998636999 Q ss_pred HHHHHHHHHHHHHHHHC Q ss_conf 45666677765566430 Q gi|254781152|r 76 SSQMRILSKKTNNLFQG 92 (155) Q Consensus 76 S~Q~R~~~kk~n~~fq~ 92 (155) +...+.-+..+..+++= T Consensus 103 ~~~~~~a~~~~a~~L~l 119 (236) T COG2188 103 SFEIEPASPEVAELLGL 119 (236) T ss_pred EEEEECCCHHHHHHCCC T ss_conf 99873699899986298 No 5 >pfam08784 RPA_C Replication protein A C terminal. This domain corresponds to the C terminal of the single stranded DNA binding protein RPA (replication protein A). RPA is involved in many DNA metabolic pathways including DNA replication, DNA repair, recombination, cell cycle and DNA damage checkpoints. Probab=77.27 E-value=4 Score=21.96 Aligned_cols=32 Identities=16% Similarity=0.162 Sum_probs=27.0 Q ss_pred CHHHHHHHHH-CCCHHHHHHHHHHHHHCCCEEE Q ss_conf 0279997771-9999999999999985391899 Q gi|254781152|r 13 DNDKYIAGVC-GCSIRRWRNIRAILEKFNKIFI 44 (155) Q Consensus 13 ~N~r~LAg~C-GCSvKkFr~aLd~Ll~~GKI~l 44 (155) .....|+..+ |.|+...+.++|+|...|+|+- T Consensus 66 ~h~~~l~~~l~~~~~~~i~~a~d~L~~eG~IYs 98 (103) T pfam08784 66 VHVDELAAQLNGLPVNDIRQAVDFLSNEGHIYS 98 (103) T ss_pred CCHHHHHHHHCCCCHHHHHHHHHHHHHCCEEEE T ss_conf 279999999829999999999999984880664 No 6 >pfam00392 GntR Bacterial regulatory proteins, gntR family. This family of regulatory proteins consists of the N-terminal HTH region of GntR-like bacterial transcription factors. At the C-terminus there is usually an effector-binding/oligomerisation domain. The GntR-like proteins include the following sub-families: MocR, YtrR, FadR, AraR, HutC and PlmA, DevA, DasR. Many of these proteins have been shown experimentally to be autoregulatory, enabling the prediction of operator sites and the discovery of cis/trans relationships. The DasR regulator has been shown to be a global regulator of primary metabolism and development in Streptomyces coelicolor. Probab=69.69 E-value=11 Score=19.47 Aligned_cols=42 Identities=21% Similarity=0.252 Sum_probs=34.3 Q ss_pred HCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECC Q ss_conf 0048987502799977719999999999999985391899389 Q gi|254781152|r 5 YDRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIFIQEG 47 (155) Q Consensus 5 YdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l~DG 47 (155) |..|..|| .++.||...|+|.-+-+.|++.|.+.|-|....| T Consensus 18 ~~~G~~LP-s~~~La~~~~vSr~tvr~Al~~L~~~G~i~~~~g 59 (64) T pfam00392 18 LRPGDKLP-SERELAAEFGVSRTTVREALRRLEAEGLVERRPG 59 (64) T ss_pred CCCCCCCH-HHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECC T ss_conf 99929984-7999999979699999999999998893899779 No 7 >pfam09681 Phage_rep_org_N N-terminal phage replisome organiser (Phage_rep_org_N). This entry represents the N-terminal domain of a small family of phage proteins. The protein contains a region of low-complexity sequence that reflects DNA direct repeats able to function as an origin of phage replication. The region is N-terminal to the low-complexity region. Probab=67.56 E-value=12 Score=19.21 Aligned_cols=52 Identities=25% Similarity=0.275 Sum_probs=45.1 Q ss_pred CCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEE-ECCCEECHHHHHHHH Q ss_conf 8987502799977719999999999999985391899-389242189999999 Q gi|254781152|r 8 GGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIFI-QEGNIYNVRVEKEII 59 (155) Q Consensus 8 GgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l-~DG~LWN~RVEeEL~ 59 (155) .+-+|.+.+.||.+.+-++..-+.|+.++.+.|=|.+ ++|-++=...|+-.+ T Consensus 49 ~~~ipy~~e~LA~~~~~~~~~V~~Al~~f~~~glIe~~e~g~i~i~~~~~~~g 101 (121) T pfam09681 49 SGNIPYTAEMLATIFDRPLNTVRLALKVFEKLGMIEITENGVIKIPNWEKHQN 101 (121) T ss_pred CCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCEEEEECHHHHCC T ss_conf 58889879999999888878999999999988978980299388653887708 No 8 >cd07377 WHTH_GntR Winged helix-turn-helix (WHTH) DNA-binding domain of the GntR family of transcriptional regulators. This CD represents the winged HTH DNA-binding domain of the GntR (named after the gluconate operon repressor in Bacillus subtilis) family of bacterial transcriptional regulators and their putative homologs found in eukaryota and archaea. The GntR family has over 6000 members distributed among almost all bacterial species, which is comprised of FadR, HutC, MocR, YtrA, AraR, PlmA, and other subfamilies for the regulation of the most varied biological process. The monomeric proteins of the GntR family are characterized by two function domains: a small highly conserved winged helix-turn-helix prokaryotic DNA binding domain in the N-terminus, and a very diverse regulatory ligand-binding domain in the C-terminus for effector-binding/oligomerization, which provides the basis for the subfamily classifications. Binding of the effector to GntR-like transcriptional regulators is Probab=64.73 E-value=13 Score=18.89 Aligned_cols=41 Identities=20% Similarity=0.191 Sum_probs=33.8 Q ss_pred HCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEC Q ss_conf 004898750279997771999999999999998539189938 Q gi|254781152|r 5 YDRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIFIQE 46 (155) Q Consensus 5 YdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l~D 46 (155) |..|.+|| .++-||...|+|.-+-+.|+..|.+.|-|.... T Consensus 19 ~~~G~~LP-s~~~La~~~~vSr~tvr~Al~~L~~~G~i~~~~ 59 (66) T cd07377 19 LKPGDRLP-SERELAEELGVSRTTVREALRELEAEGLVERRP 59 (66) T ss_pred CCCCCCCC-CHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEC T ss_conf 99919992-799999997988899999999999889589973 No 9 >COG1281 Disulfide bond chaperones of the HSP33 family [Posttranslational modification, protein turnover, chaperones] Probab=62.81 E-value=1.1 Score=25.15 Aligned_cols=36 Identities=22% Similarity=0.242 Sum_probs=27.1 Q ss_pred CHHCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 100048987502799977719999999999999985 Q gi|254781152|r 3 SLYDRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEK 38 (155) Q Consensus 3 ~MYdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~ 38 (155) ++|+.-++-....+-+.-.|.||--||.++|..|=+ T Consensus 217 rL~~e~~v~ile~~~v~f~C~CSrEr~~~aL~~lg~ 252 (286) T COG1281 217 RLFHEEGVQLLEPQPVEFRCSCSRERVAAALLSLGK 252 (286) T ss_pred HHHCCCCCCCCCCCCCEEECCCCHHHHHHHHHHCCH T ss_conf 984656533057855138767988999999984489 No 10 >smart00419 HTH_CRP helix_turn_helix, cAMP Regulatory protein. Probab=56.33 E-value=19 Score=18.01 Aligned_cols=40 Identities=25% Similarity=0.195 Sum_probs=34.4 Q ss_pred CCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCE Q ss_conf 8750279997771999999999999998539189938924 Q gi|254781152|r 10 SIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIFIQEGNI 49 (155) Q Consensus 10 PL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l~DG~L 49 (155) .++..-+-||.++|++--+.-.++.-|.+.|-|....+.+ T Consensus 6 ~l~lt~~~iA~~lG~tretvsR~l~~l~~~GlI~~~~~~i 45 (48) T smart00419 6 RLPLTRQEIAELLGLTRETVSRTLKRLEKEGLISREGGRI 45 (48) T ss_pred EECCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEECCCEE T ss_conf 8337999999997997999999999999889799789789 No 11 >pfam10685 KGG Stress-induced bacterial acidophilic repeat motif. This repeat is found in proteins which are expressed under conditions of stress in bacteria. The repeat contains a highly conserved, characteristic sequence motif,KGG, that is also recognized by plants and lower eukaryotes and repeated in their LEA (late embryogenesis abundant) family of proteins, thereby rendering those proteins bacteriostatic. An example of such an LEA family is LEA_5, pfam00477. Further downstream from this motif is a Walker A, nucleotide binding, motif GXXXXGK(S,T), that in YciG of E coli is QSGGNKSGKS. YciG is expressed as part of a three-gene operon, yciGFE, and this operon is induced by stress and is regulated by RpoS, which controls the general stress-response in E coli. YciG was shown to be important for stationary-phase resistance to thermal stress and in particular to acid stress. Probab=53.18 E-value=5.2 Score=21.29 Aligned_cols=19 Identities=42% Similarity=0.590 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHCCHHHHHH Q ss_conf 5677887422011124566 Q gi|254781152|r 61 ASEERQENGRKGGIKSSQM 79 (155) Q Consensus 61 ~sEer~~agRkGg~~S~Q~ 79 (155) -.|+.+++|||||.-|++. T Consensus 6 D~e~~~eig~KGG~as~~~ 24 (26) T pfam10685 6 DPEKAREIGRKGGQASGGX 24 (26) T ss_pred CHHHHHHHHHCCCCCCCCC T ss_conf 8899999865037566787 No 12 >COG4533 ABC-type uncharacterized transport system, periplasmic component [General function prediction only] Probab=53.08 E-value=22 Score=17.69 Aligned_cols=65 Identities=18% Similarity=0.241 Sum_probs=49.0 Q ss_pred CCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEE-E-ECCC------EECHHHHHHHHHHHHHHHHHHHH Q ss_conf 4898750279997771999999999999998539189-9-3892------42189999999756778874220 Q gi|254781152|r 7 RGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIF-I-QEGN------IYNVRVEKEIIKASEERQENGRK 71 (155) Q Consensus 7 rGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~-l-~DG~------LWN~RVEeEL~n~sEer~~agRk 71 (155) .|.|.+.-..-||.+|.||.|..|.+|.-+.+.|.|. . ..|| .+-..-|.++-..-|+.-+-|.- T Consensus 18 ~g~~~~~tl~eLA~~lfCS~Rh~R~lL~q~q~~gWL~W~p~~GRGkrSrL~flr~~e~L~~q~a~~~le~g~~ 90 (564) T COG4533 18 AGQPVETTLNELADILFCSRRHARTLLRQMQEAGWLTWQPQAGRGKRSRLTFLRTPEALLLQLAEKLLEQGKI 90 (564) T ss_pred CCCCCEEEHHHHHHHHCCCHHHHHHHHHHHHHCCCEEECCCCCCCCCEEEEEECCHHHHHHHHHHHHHHCCCH T ss_conf 7987433699998886267789999999998759777246778876215888248699999999999963719 No 13 >cd00498 Hsp33 Heat shock protein 33 (Hsp33): Cytosolic protein that acts as a molecular chaperone under oxidative conditions. In normal (reducing) cytosolic conditions, four conserved Cys residues are coordinated by a Zn ion. Under oxidative stress (such as heat shock), the Cys are reversibly oxidized to disulfide bonds, which causes the chaperone activity to be turned on. Hsp33 is homodimeric in its functional form. Probab=50.08 E-value=4.8 Score=21.51 Aligned_cols=33 Identities=18% Similarity=0.175 Sum_probs=20.8 Q ss_pred HHCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 000489875027999777199999999999999 Q gi|254781152|r 4 LYDRGGSIPDNDKYIAGVCGCSIRRWRNIRAIL 36 (155) Q Consensus 4 MYdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~L 36 (155) +|+.-++-....+-+...|.||..|+..+|..| T Consensus 214 Lf~~~~~~i~~~~~v~f~C~CS~er~~~~L~~L 246 (275) T cd00498 214 LFHEEEVRILEKQPVRFRCDCSRERVAAALLTL 246 (275) T ss_pred HHCCCCCEECCCCCCCCCCCCCHHHHHHHHHHC T ss_conf 828898668756357878999999999999837 No 14 >PRK01402 hslO Hsp33-like chaperonin; Reviewed Probab=49.76 E-value=4.5 Score=21.64 Aligned_cols=16 Identities=19% Similarity=0.542 Sum_probs=6.5 Q ss_pred HHHHHCCCHHHHHHHH Q ss_conf 9777199999999999 Q gi|254781152|r 18 IAGVCGCSIRRWRNIR 33 (155) Q Consensus 18 LAg~CGCSvKkFr~aL 33 (155) +...|.||..|+..+| T Consensus 273 v~f~C~CSreRv~~~L 288 (327) T PRK01402 273 VIARCSCSREKIAGVL 288 (327) T ss_pred CCCCCCCCHHHHHHHH T ss_conf 7817899989999999 No 15 >pfam04444 Dioxygenase_N Catechol dioxygenase N terminus. This family consists of the N termini of catechol, chlorocatechol or hydroxyquinol 1,2-dioxygenase proteins. This region is always found adjacent to the dioxygenase domain (pfam00775). Probab=47.63 E-value=26 Score=17.18 Aligned_cols=57 Identities=11% Similarity=0.077 Sum_probs=38.0 Q ss_pred CCHHHHHHHHHCCCHHHHHHHHHHHHHCCC----EEEECCCEECHHHHHHHH-HHHHHHHHHHHHCC Q ss_conf 502799977719999999999999985391----899389242189999999-75677887422011 Q gi|254781152|r 12 PDNDKYIAGVCGCSIRRWRNIRAILEKFNK----IFIQEGNIYNVRVEKEII-KASEERQENGRKGG 73 (155) Q Consensus 12 ~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GK----I~l~DG~LWN~RVEeEL~-n~sEer~~agRkGg 73 (155) ..+.-.+++...-+..-|.+++++|.+.|. +.+.|+-- +|. |+ ...++.+.+||.|| T Consensus 12 v~hlh~~i~e~~lT~dE~~~av~~L~~~Gqr~E~~Llsd~LG----~e~-Ld~~~da~d~~ag~~gg 73 (74) T pfam04444 12 LSDLHKTIEDLDVTEDEYWAGVDYLNALGERGEFGLLLDVLG----LEH-LDLRVDAADAAAGRTGG 73 (74) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCHHHHHHCCCC----HHH-HHHHHHHHHHHHCCCCC T ss_conf 999999999938999999999999999744451778602005----899-99886156776065479 No 16 >TIGR00118 acolac_lg acetolactate synthase, large subunit, biosynthetic type; InterPro: IPR012846 Two groups of proteins form acetolactate from two molecules of pyruvate. The type of acetolactate synthase described in this entry also catalyzes the formation of acetohydroxybutyrate from pyruvate and 2-oxobutyrate, an early step in the branched chain amino acid biosynthesis; it is therefore also termed acetohydroxyacid synthase. In bacteria, this catalytic chain is associated with a smaller regulatory chain in an alpha2/beta2 heterotetramer. Acetolactate synthase is a thiamine pyrophosphate enzyme. In this type, FAD and Mg++ are also found. Several isozymes of this enzyme are found in E. coli K12, one of which contains a frameshift in the large subunit gene and is not expressed.; GO: 0000287 magnesium ion binding, 0003984 acetolactate synthase activity, 0030976 thiamin pyrophosphate binding, 0050660 FAD binding, 0009082 branched chain family amino acid biosynthetic process. Probab=46.90 E-value=20 Score=17.93 Aligned_cols=39 Identities=15% Similarity=0.298 Sum_probs=32.5 Q ss_pred HHHHHHHHHHHHCCC-EEEECCCEE-CHHHHHHHHHHHHHH Q ss_conf 999999999985391-899389242-189999999756778 Q gi|254781152|r 27 RRWRNIRAILEKFNK-IFIQEGNIY-NVRVEKEIIKASEER 65 (155) Q Consensus 27 KkFr~aLd~Ll~~GK-I~l~DG~LW-N~RVEeEL~n~sEer 65 (155) ..-.+|++.|.+.-| +++.+|++- -+.|.+||++++|.- T Consensus 201 qsI~kA~~li~~AkkPVilvGGGvin~a~as~~L~elae~~ 241 (593) T TIGR00118 201 QSIKKAAELIEKAKKPVILVGGGVINIAGASEELKELAERL 241 (593) T ss_pred HHHHHHHHHHHHHCCCEEEECCHHHHCCCCHHHHHHHHHHC T ss_conf 99999999999726977986413542055407999998540 No 17 >pfam00325 Crp Bacterial regulatory proteins, crp family. Probab=45.76 E-value=27 Score=17.15 Aligned_cols=31 Identities=19% Similarity=0.133 Sum_probs=26.1 Q ss_pred CCHHHHHHHHHCCCHHHHHHHHHHHHHCCCE Q ss_conf 5027999777199999999999999853918 Q gi|254781152|r 12 PDNDKYIAGVCGCSIRRWRNIRAILEKFNKI 42 (155) Q Consensus 12 ~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI 42 (155) |-.-+-||.++||++-+--..+..|.+.|-| T Consensus 2 pmsrqdIadylGlt~ETVsR~l~~L~~~glI 32 (32) T pfam00325 2 PMSRQEIADYLGLTRETVSRLLKRLREKGLI 32 (32) T ss_pred CCCHHHHHHHHCCHHHHHHHHHHHHHHCCCC T ss_conf 8658899998472599999999999985789 No 18 >KOG2403 consensus Probab=45.44 E-value=21 Score=17.78 Aligned_cols=26 Identities=15% Similarity=0.354 Sum_probs=15.0 Q ss_pred HHHCCCEEE-ECCCEECHHHHH--HHHHH Q ss_conf 985391899-389242189999--99975 Q gi|254781152|r 36 LEKFNKIFI-QEGNIYNVRVEK--EIIKA 61 (155) Q Consensus 36 Ll~~GKI~l-~DG~LWN~RVEe--EL~n~ 61 (155) --+..++.. +.|++||++.-| ||-|+ T Consensus 538 ~~~~k~lktfDrgmvWNsdLVETLELqNL 566 (642) T KOG2403 538 YGDFKDLKTFDRGMVWNSDLVETLELQNL 566 (642) T ss_pred HHHHHHHCCCCCCCEECHHHHHHHHHHHH T ss_conf 86776531013562302568999998889 No 19 >pfam01430 HSP33 Hsp33 protein. Hsp33 is a molecular chaperone, distinguished from all other known chaperones by its mode of functional regulation. Its activity is redox regulated. Hsp33 is a cytoplasmically localized protein with highly reactive cysteines that respond quickly to changes in the redox environment. Oxidising conditions like H2O2 cause disulfide bonds to form in Hsp33, a process that leads to the activation of its chaperone function. Probab=44.83 E-value=6.2 Score=20.85 Aligned_cols=24 Identities=25% Similarity=0.176 Sum_probs=11.9 Q ss_pred CCHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 502799977719999999999999 Q gi|254781152|r 12 PDNDKYIAGVCGCSIRRWRNIRAI 35 (155) Q Consensus 12 ~~N~r~LAg~CGCSvKkFr~aLd~ 35 (155) ..+.+-+...|.||..|+..+|.. T Consensus 220 i~~~~~v~f~C~CS~er~~~~L~~ 243 (274) T pfam01430 220 ILEPQPVRFGCSCSRERVANALAS 243 (274) T ss_pred EECCCCCCCCCCCCHHHHHHHHHH T ss_conf 800544788799999999999981 No 20 >COG3432 Predicted transcriptional regulator [Transcription] Probab=42.15 E-value=19 Score=18.07 Aligned_cols=34 Identities=15% Similarity=0.031 Sum_probs=27.2 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 7999777199999999999999853918993892 Q gi|254781152|r 15 DKYIAGVCGCSIRRWRNIRAILEKFNKIFIQEGN 48 (155) Q Consensus 15 ~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l~DG~ 48 (155) .-+|...||-+-+.|+..+++|.+.|-|+..+++ T Consensus 34 ~Tri~y~aNlny~~~~~yi~~L~~~Gli~~~~~~ 67 (95) T COG3432 34 ITRIIYGANLNYKRAQKYIEMLVEKGLIIKQDNG 67 (95) T ss_pred CEEEEEECCCCHHHHHHHHHHHHHCCCEEECCCC T ss_conf 6246643276779999999999867877861488 No 21 >COG3729 GsiB General stress protein [General function prediction only] Probab=41.71 E-value=11 Score=19.48 Aligned_cols=15 Identities=53% Similarity=0.742 Sum_probs=8.9 Q ss_pred HHHHHHHHHCCHHHH Q ss_conf 778874220111245 Q gi|254781152|r 63 EERQENGRKGGIKSS 77 (155) Q Consensus 63 Eer~~agRkGg~~S~ 77 (155) |..+++|||||.-|+ T Consensus 29 e~~~EigrkGGq~sg 43 (73) T COG3729 29 ERASEIGRKGGQHSG 43 (73) T ss_pred HHHHHHHHHCCCCCC T ss_conf 999999872155678 No 22 >PRK00114 hslO Hsp33-like chaperonin; Reviewed Probab=41.12 E-value=10 Score=19.62 Aligned_cols=31 Identities=23% Similarity=0.233 Sum_probs=16.5 Q ss_pred HHCCCCCCCCHHHHHHHHHCCCHHHHHHHHH Q ss_conf 0004898750279997771999999999999 Q gi|254781152|r 4 LYDRGGSIPDNDKYIAGVCGCSIRRWRNIRA 34 (155) Q Consensus 4 MYdrGgPL~~N~r~LAg~CGCSvKkFr~aLd 34 (155) +|+..++-....+-+...|.||..|+..+|. T Consensus 216 Lf~e~~~~i~~~~~v~f~C~CS~er~~~~L~ 246 (289) T PRK00114 216 LYHEEDVKILEPQPVEFKCDCSRERSANALK 246 (289) T ss_pred HHCCCCCEECCCCCCCCCCCCCHHHHHHHHH T ss_conf 8488984580253578789999999999998 No 23 >pfam00205 TPP_enzyme_M Thiamine pyrophosphate enzyme, central domain. The central domain of TPP enzymes contains a 2-fold Rossman fold. Probab=39.98 E-value=35 Score=16.48 Aligned_cols=36 Identities=11% Similarity=0.244 Sum_probs=29.9 Q ss_pred HHHHHHHHHCCC-EEEECCCEECHHHHHHHHHHHHHH Q ss_conf 999999985391-899389242189999999756778 Q gi|254781152|r 30 RNIRAILEKFNK-IFIQEGNIYNVRVEKEIIKASEER 65 (155) Q Consensus 30 r~aLd~Ll~~GK-I~l~DG~LWN~RVEeEL~n~sEer 65 (155) .++++.|.+.-+ +++.+++.|.....+|+.++.|.- T Consensus 2 ~~~~~~l~~AkrPvii~G~g~~~~~a~~~l~~lae~~ 38 (138) T pfam00205 2 EKAAELLAAAKRPVILVGGGVRRSGASEELRALAEKL 38 (138) T ss_pred HHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHHH T ss_conf 8999999968991999897835221899999999984 No 24 >PRK11402 DNA-binding transcriptional regulator FrlR; Provisional Probab=39.12 E-value=36 Score=16.40 Aligned_cols=65 Identities=18% Similarity=0.261 Sum_probs=43.9 Q ss_pred HCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEE-EEC-CCEE-CHHHHHHHHH---HHHHHHHHHH Q ss_conf 004898750279997771999999999999998539189-938-9242-1899999997---5677887422 Q gi|254781152|r 5 YDRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIF-IQE-GNIY-NVRVEKEIIK---ASEERQENGR 70 (155) Q Consensus 5 YdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~-l~D-G~LW-N~RVEeEL~n---~sEer~~agR 70 (155) |..|..|| .++-||-.-|+|--+-|.|++.|.+.|.|. ..+ |-.. ..+....+.. +++.-...|. T Consensus 29 ~~~G~~LP-sE~eLa~~y~VSR~TVR~Al~~L~~eGli~r~~G~GtfV~~~~~~~~l~~~~~f~~~~~~~g~ 99 (243) T PRK11402 29 YQAGQQIP-TENELCTQYNVSRITIRKAISDLVADGVLIRWQGKGTFVQSQKVENALLTVSGFTDFGVSQGK 99 (243) T ss_pred CCCCCCCC-CHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCEEEECCCCCCCCHHCCCCHHHHHHHCCC T ss_conf 99979893-999999997979999999999999789999977966899536354530014897999997699 No 25 >TIGR03337 phnR transcriptional regulator protein. This family of proteins are members of the GntR family (pfam00392) containing an N-terminal helix-turn-helix (HTH) motif. This clade is found adjacent to or inside of operons for the degradation of 2-aminoethylphosphonate (AEP) in Salmonella, Vibrio Aeromonas hydrophila, Hahella chejuensis and Psychromonas ingrahamii. Probab=38.78 E-value=36 Score=16.37 Aligned_cols=43 Identities=14% Similarity=0.155 Sum_probs=35.0 Q ss_pred HCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEE-EECCC Q ss_conf 004898750279997771999999999999998539189-93892 Q gi|254781152|r 5 YDRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIF-IQEGN 48 (155) Q Consensus 5 YdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~-l~DG~ 48 (155) |.-|..|| .++-||...|+|--+-|.|++.|.+.|-|. ..+.+ T Consensus 19 ~~~G~~LP-sE~eLa~~~~VSR~TvR~Al~~L~~eGli~r~~G~G 62 (231) T TIGR03337 19 LLPGDKLP-SERDLGERFNTTRVTIREALQQLEAEGLIYREDRRG 62 (231) T ss_pred CCCCCCCC-CHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCE T ss_conf 99959785-899999997979999999999999879969966866 No 26 >KOG1141 consensus Probab=35.56 E-value=8.6 Score=20.00 Aligned_cols=109 Identities=17% Similarity=0.106 Sum_probs=59.3 Q ss_pred HCCCCCCCCHHHHHHHHHCCCHHHHHHHHHH------HHHCCCEE--EECCCEECHHHHH-HHH-----HHHHHHHHHHH Q ss_conf 0048987502799977719999999999999------98539189--9389242189999-999-----75677887422 Q gi|254781152|r 5 YDRGGSIPDNDKYIAGVCGCSIRRWRNIRAI------LEKFNKIF--IQEGNIYNVRVEK-EII-----KASEERQENGR 70 (155) Q Consensus 5 YdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~------Ll~~GKI~--l~DG~LWN~RVEe-EL~-----n~sEer~~agR 70 (155) |--..||..|+.-+- --+|.+...+...++ |.+-|..+ ...|.+||+++|+ |.. ++...+-.-.- T Consensus 309 ~~~~~~l~~~~d~ve-~~~~~~~~~~~~~~~~~~~~~~~k~g~~v~~~~~~~~~~a~~~~~e~~~~~~~~i~d~~~~~~~ 387 (1262) T KOG1141 309 SNPTDDLLLNFDQVE-FSNNIIDTAVKYWDDQKENGAQDKIGRRVLIKLTTVLKNAVGSRNEIIQLVNEKIPDFDGTEAA 387 (1262) T ss_pred CCCCHHHHHHHHHHH-CCCCHHHHHHEEECCCCCCCHHHHHCCEEEEEEEEHHHHHCCCCCCCEEEEEEECCCCCCCEEE T ss_conf 787626664567652-0121112121120365775034320557775431023242024332126776414777762453 Q ss_pred HCCHHHHHHH---HHHHHHHHHHHCCCCHHHHHHCCCCCCCCCCCCCHHHC Q ss_conf 0111245666---67776556643015647863084112564476423435 Q gi|254781152|r 71 KGGIKSSQMR---ILSKKTNNLFQGTLKPAHVFQKPESINNYSAPVDFEKI 118 (155) Q Consensus 71 kGg~~S~Q~R---~~~kk~n~~fq~~~~~~~~~~~~~~~~~~~~~~~~~~~ 118 (155) -|--+-+.|| |-|.-+-.+.||.+|-+-. +.--+|.+|++|++. T Consensus 388 ~~~s~~~~~~t~~~~s~a~me~~k~emk~~~~----mg~k~~d~kt~~~~~ 434 (1262) T KOG1141 388 VNESFTSDQRTEIINSRAIMETLKAEMKLAIA----MGQKVYDTKTDFEKF 434 (1262) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHH----HCCCCCCCCCEEEEE T ss_conf 36533561566544279999987666522533----203035776213466 No 27 >pfam10678 DUF2492 Protein of unknown function (DUF2492). This is a bacterial family of uncharacterized proteins. Probab=35.54 E-value=38 Score=16.25 Aligned_cols=46 Identities=20% Similarity=0.224 Sum_probs=31.0 Q ss_pred HHCCCCCCCCHHHH---------HHHHHCCCHH--HHHHHHHHHHHCCCEEEECCCE Q ss_conf 00048987502799---------9777199999--9999999998539189938924 Q gi|254781152|r 4 LYDRGGSIPDNDKY---------IAGVCGCSIR--RWRNIRAILEKFNKIFIQEGNI 49 (155) Q Consensus 4 MYdrGgPL~~N~r~---------LAg~CGCSvK--kFr~aLd~Ll~~GKI~l~DG~L 49 (155) |.+.|.|.....=. =||.--||.. .+..+++.|.+.||++..+|++ T Consensus 13 ml~~~~~~t~~~L~~~i~~~FG~~arFhTCSae~mta~eLi~FL~~kgKfi~~~~g~ 69 (78) T pfam10678 13 MLASGEPYTREELKEAIAQEFGEEARFHTCSAEGLTADELIEFLLKKGKFIESEGGF 69 (78) T ss_pred HHHCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHCCCEEECCCCE T ss_conf 998589759999999999985866567665614598999999999769876528977 No 28 >TIGR01553 formate-DH-alph formate dehydrogenase, alpha subunit; InterPro: IPR006443 This family of sequences describe a subset of formate dehydrogenase alpha chains found mainly in proteobacteria but also in Aquifex aeolicus. The alpha chain contains domains for molybdopterin dinucleotide binding and molybdopterin oxidoreductase. The holo-enzyme also contains beta and gamma subunits of 32 and 20 kDa. The enzyme catalyzes the oxidation of formate (produced from pyruvate during anaerobic growth) to carbon dioxide with the concomitant release of two electrons and two protons. The electrons are utilised mainly in the nitrate respiration by nitrate reductase . In Escherichia coli and Salmonella typhi, there are two forms of the formate dehydrogenase, one induced by nitrate which is strictly anaerobic (fdn), and one incuced during the transition from aerobic to anaerobic growth (fdo). This subunit is one of only three proteins in Escherichia coli which contain selenocysteine .; GO: 0008863 formate dehydrogenase activity, 0045333 cellular respiration, 0005737 cytoplasm. Probab=35.44 E-value=36 Score=16.38 Aligned_cols=28 Identities=14% Similarity=0.345 Sum_probs=24.5 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCC Q ss_conf 2799977719999999999999985391 Q gi|254781152|r 14 NDKYIAGVCGCSIRRWRNIRAILEKFNK 41 (155) Q Consensus 14 N~r~LAg~CGCSvKkFr~aLd~Ll~~GK 41 (155) +-+.++++|||+++-|-++.|.....|+ T Consensus 382 tpe~vs~IcG~~kelfL~v~ee~~ktg~ 409 (1043) T TIGR01553 382 TPEVVSRICGVKKELFLKVAEEVAKTGK 409 (1043) T ss_pred CHHHHHHHCCCCHHHHHHHHHHHHCCCC T ss_conf 7334222058758899999999853687 No 29 >PRK09392 ftrB transcriptional activator FtrB; Provisional Probab=34.68 E-value=42 Score=15.98 Aligned_cols=46 Identities=13% Similarity=0.184 Sum_probs=33.6 Q ss_pred CCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCEECHHHHH Q ss_conf 87502799977719999999999999985391899389242189999 Q gi|254781152|r 10 SIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIFIQEGNIYNVRVEK 56 (155) Q Consensus 10 PL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l~DG~LWN~RVEe 56 (155) .++..-+-||.++|.|--+.-.++.-|.+.| |+..++.|.=.+.+. T Consensus 171 ~l~~t~~~lA~~lG~tretvsR~L~~L~~~G-I~~~~~~I~I~D~~~ 216 (236) T PRK09392 171 TLPYEKRTLASYLGMTPENLSRAFAALASHG-VHVQGSAVTITDKAA 216 (236) T ss_pred EECCCHHHHHHHHCCCHHHHHHHHHHHHHCC-CEEECCEEEECCHHH T ss_conf 8347899999987898999999999999799-689699999988999 No 30 >PRK09764 DNA-binding transcriptional repressor MngR; Provisional Probab=34.30 E-value=43 Score=15.95 Aligned_cols=45 Identities=20% Similarity=0.214 Sum_probs=35.5 Q ss_pred HCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEE-EEC-CCEE Q ss_conf 004898750279997771999999999999998539189-938-9242 Q gi|254781152|r 5 YDRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIF-IQE-GNIY 50 (155) Q Consensus 5 YdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~-l~D-G~LW 50 (155) |.-|..|| .++-||..-|+|--+-|.|++.|.+.|-|. ..+ |-.. T Consensus 23 ~~~G~~LP-sE~eLa~~~~VSR~TvR~Al~~L~~eGli~r~~G~GTfV 69 (239) T PRK09764 23 LKPGDALP-TESALQTEFGVSRVTVRQALRQLVEQQILESIQGSGTYV 69 (239) T ss_pred CCCCCCCC-CHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCEEEE T ss_conf 99959992-999999997979999999999999789999986877898 No 31 >COG3596 Predicted GTPase [General function prediction only] Probab=34.11 E-value=16 Score=18.47 Aligned_cols=13 Identities=46% Similarity=0.705 Sum_probs=5.8 Q ss_pred HHHHHHCCCCHHH Q ss_conf 5566430156478 Q gi|254781152|r 86 TNNLFQGTLKPAH 98 (155) Q Consensus 86 ~n~~fq~~~~~~~ 98 (155) +|++|||.++|.. T Consensus 56 iNALF~~~~~~v~ 68 (296) T COG3596 56 INALFQGEVKEVS 68 (296) T ss_pred HHHHHHCCCCEEE T ss_conf 9999702673421 No 32 >pfam08220 HTH_DeoR DeoR-like helix-turn-helix domain. Probab=32.05 E-value=47 Score=15.72 Aligned_cols=39 Identities=21% Similarity=0.129 Sum_probs=32.5 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCEE-EECCCEECH Q ss_conf 279997771999999999999998539189-938924218 Q gi|254781152|r 14 NDKYIAGVCGCSIRRWRNIRAILEKFNKIF-IQEGNIYNV 52 (155) Q Consensus 14 N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~-l~DG~LWN~ 52 (155) ...-||...+.|..+.|.=++.|.+.|+|. .-+|-+.|+ T Consensus 16 ~i~~La~~f~VS~~TiRRDl~~L~~~g~i~R~~GGA~~~~ 55 (57) T pfam08220 16 SVEELAELLGVSEMTIRRDLNELEEQGLLTRTHGGAVSNS 55 (57) T ss_pred EHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCEECCC T ss_conf 7999999989599999996999998899899749577588 No 33 >PRK10079 putative transcriptional regulator; Provisional Probab=31.33 E-value=48 Score=15.65 Aligned_cols=66 Identities=15% Similarity=0.181 Sum_probs=43.7 Q ss_pred HCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEE-EEC-CCEECH-HHHHHHH---HHHHHHHHHHHH Q ss_conf 004898750279997771999999999999998539189-938-924218-9999999---756778874220 Q gi|254781152|r 5 YDRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIF-IQE-GNIYNV-RVEKEII---KASEERQENGRK 71 (155) Q Consensus 5 YdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~-l~D-G~LWN~-RVEeEL~---n~sEer~~agRk 71 (155) |.-|..|| .++-||..-|.|--+-|.|++.|.+.|.|. ..+ |-.... .....++ .+++.-...|.. T Consensus 29 ~~~G~~LP-sE~eLa~~y~VSR~TVR~Al~~L~~eGli~r~~G~GtfV~~~~~~~~l~~~~~f~~~~~~~g~~ 100 (241) T PRK10079 29 YRCGDYLP-AEQQLAARFEVNRHTLRRAIDQLVERGWVQRRQGVGVLVLMRPFDYPLNAQARFSQNLLDQGSH 100 (241) T ss_pred CCCCCCCC-CHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCCEEEECCCCCCCCCCCCCHHHHHHHCCCC T ss_conf 88989196-9999999979889999999999997798799769625993573332643357769999865998 No 34 >smart00345 HTH_GNTR helix_turn_helix gluconate operon transcriptional repressor. Probab=30.42 E-value=50 Score=15.56 Aligned_cols=42 Identities=21% Similarity=0.274 Sum_probs=33.7 Q ss_pred HCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECC Q ss_conf 0048987502799977719999999999999985391899389 Q gi|254781152|r 5 YDRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIFIQEG 47 (155) Q Consensus 5 YdrGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l~DG 47 (155) |..|..|| .++.||...|+|.-+-+.|++.|.+.|-|.-..| T Consensus 14 ~~~G~~LP-s~~~la~~~~vSr~tvr~A~~~L~~~G~i~~~~g 55 (60) T smart00345 14 LRPGDKLP-SERELAAQLGVSRTTVREALSRLEAEGLVQRRPG 55 (60) T ss_pred CCCCCCCC-CHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECC T ss_conf 99839881-0999999989499999999999998897899668 No 35 >PRK11269 glyoxylate carboligase; Provisional Probab=30.32 E-value=50 Score=15.55 Aligned_cols=44 Identities=11% Similarity=0.217 Sum_probs=37.2 Q ss_pred HCCCHHHHHHHHHHHHHCCC-EEEECCCEECHHHHHHHHHHHHHH Q ss_conf 19999999999999985391-899389242189999999756778 Q gi|254781152|r 22 CGCSIRRWRNIRAILEKFNK-IFIQEGNIYNVRVEKEIIKASEER 65 (155) Q Consensus 22 CGCSvKkFr~aLd~Ll~~GK-I~l~DG~LWN~RVEeEL~n~sEer 65 (155) ...+.....++++.|.+..+ +++.+++.|+++..+||.++.|+- T Consensus 186 ~~~~~~~i~~a~~lL~~AkrPvIi~G~G~~~~~a~~~l~~lae~l 230 (591) T PRK11269 186 PAATRAQIEKALAMLNAAERPLIVAGGGVINADASDLLVEFAELT 230 (591) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHC T ss_conf 999999999999999727898799588803111599999999853 No 36 >TIGR01458 HAD-SF-IIA-hyp3 HAD-superfamily subfamily IIA hydrolase, TIGR01458; InterPro: IPR006355 These sequences are all members of the IIA subfamily of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. . Probab=29.93 E-value=17 Score=18.27 Aligned_cols=28 Identities=39% Similarity=0.493 Sum_probs=17.1 Q ss_pred CCCHHHCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 6423435898377889877655678766 Q gi|254781152|r 112 PVDFEKINPNPLNLVIALCCATDVLIHH 139 (155) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (155) |-||||++|.|--.+=.|.-|-|.+..| T Consensus 231 pSDeek~~v~pdlt~d~l~~AVdlilqh 258 (258) T TIGR01458 231 PSDEEKIEVKPDLTVDSLAEAVDLILQH 258 (258) T ss_pred CCCCCCCCCCCCCEECCHHHHHHHHHCC T ss_conf 8766686548761021668899997259 No 37 >PRK06725 acetolactate synthase 3 catalytic subunit; Validated Probab=28.24 E-value=55 Score=15.34 Aligned_cols=42 Identities=17% Similarity=0.274 Sum_probs=34.3 Q ss_pred CCHHHHHHHHHHHHHCCC-EEEECCCEECHHHHHHHHHHHHHH Q ss_conf 999999999999985391-899389242189999999756778 Q gi|254781152|r 24 CSIRRWRNIRAILEKFNK-IFIQEGNIYNVRVEKEIIKASEER 65 (155) Q Consensus 24 CSvKkFr~aLd~Ll~~GK-I~l~DG~LWN~RVEeEL~n~sEer 65 (155) .+....+.+.+.|.+..+ +++.+++++.++.++||.++.|+- T Consensus 199 p~~~~l~~a~~~L~~AkrPvil~G~Gv~~~~a~~~l~~lae~~ 241 (570) T PRK06725 199 PDSMKLREVAKAISKAKRPLLYIGGGVIHSGGSDELIKFAREN 241 (570) T ss_pred CCHHHHHHHHHHHHHCCCCEEECCCCCCCCCHHHHHHHHHHHH T ss_conf 9999999999999856898798367600464899999999986 No 38 >pfam05158 RNA_pol_Rpc34 RNA polymerase Rpc34 subunit. Subunit specific to RNA Pol III, the tRNA specific polymerase. The C34 subunit of yeast RNA Pol III is part of a subcomplex of three subunits which have no counterpart in the other two nuclear RNA polymerases. This subunit interacts with TFIIIB70 and is therefore participates in Pol III recruitment. Probab=27.81 E-value=56 Score=15.29 Aligned_cols=75 Identities=23% Similarity=0.293 Sum_probs=53.5 Q ss_pred CCCCCHHHHHHHH-HCCCHHHHHHHHHHHHHCCCEEE--EC-CCEECHHHHHHHHHH----HHHH------HHHHHHCCH Q ss_conf 9875027999777-19999999999999985391899--38-924218999999975----6778------874220111 Q gi|254781152|r 9 GSIPDNDKYIAGV-CGCSIRRWRNIRAILEKFNKIFI--QE-GNIYNVRVEKEIIKA----SEER------QENGRKGGI 74 (155) Q Consensus 9 gPL~~N~r~LAg~-CGCSvKkFr~aLd~Ll~~GKI~l--~D-G~LWN~RVEeEL~n~----sEer------~~agRkGg~ 74 (155) .|-....+.|... -+.++..-..++..|++.|.|.+ .+ +-+|-.+-.+|..++ .|++ +.+|++ || T Consensus 22 ~~~~~~q~dL~~~~~~~~~~~l~~~ln~Ll~~~~l~l~k~~~~l~yk~~~~e~a~k~~~l~~eE~lVY~~Ie~ag~~-GI 100 (313) T pfam05158 22 FPNLFSQQDLQALVPGLSLNLLMSAINILLDQGLLVLLKQNDGLVYKAVTREEAKKKATLSDEESLVYSIIEESGNE-GI 100 (313) T ss_pred CCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHCCCEEEEECCCEEEEEEECHHHHHHHCCCCHHHHHHHHHHHHCCCC-CE T ss_conf 89983799999765799999999999999746978999549988899848999876528988899999999862767-74 Q ss_pred HHHHHHHHHHHHH Q ss_conf 2456666777655 Q gi|254781152|r 75 KSSQMRILSKKTN 87 (155) Q Consensus 75 ~S~Q~R~~~kk~n 87 (155) -+...| +++| T Consensus 101 W~k~i~---~~tn 110 (313) T pfam05158 101 WTRTIK---NRTN 110 (313) T ss_pred EHHHHH---HHCC T ss_conf 499999---8809 No 39 >COG2406 Protein distantly related to bacterial ferritins [General function prediction only] Probab=26.72 E-value=39 Score=16.22 Aligned_cols=20 Identities=35% Similarity=0.929 Sum_probs=7.0 Q ss_pred HCCCCCCCCHHHHHHHHHCC Q ss_conf 00489875027999777199 Q gi|254781152|r 5 YDRGGSIPDNDKYIAGVCGC 24 (155) Q Consensus 5 YdrGgPL~~N~r~LAg~CGC 24 (155) |+-|+-+|+|++-+.-+-|| T Consensus 74 ~elg~~~Prd~~~l~dISgC 93 (172) T COG2406 74 YELGGDLPRDMKKLHDISGC 93 (172) T ss_pred HHHCCCCCHHHHHHHHHCCC T ss_conf 98289982669998764179 No 40 >TIGR02767 TraG-Ti Ti-type conjugative transfer system protien TraG; InterPro: IPR014135 This entry contains the Agrobacterium tumefaciens Ti-plasmid TraG, it is responsible for conjugative transfer of the entire plasmid among Agrobacterium strains . The protein is distantly related to the F-type conjugation system TraG protein. Both of these systems are examples of type IV secretion systems.. Probab=26.65 E-value=33 Score=16.65 Aligned_cols=21 Identities=38% Similarity=0.631 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHCCHH-------HHHHH Q ss_conf 7567788742201112-------45666 Q gi|254781152|r 60 KASEERQENGRKGGIK-------SSQMR 80 (155) Q Consensus 60 n~sEer~~agRkGg~~-------S~Q~R 80 (155) +..|+-+|+|||=|+- +||++ T Consensus 494 riLEeARDrGRKYG~sl~l~yQS~GQl~ 521 (641) T TIGR02767 494 RILEEARDRGRKYGVSLMLIYQSLGQLR 521 (641) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 8888774167613589999985467887 No 41 >pfam08279 HTH_11 HTH domain. This family includes helix-turn-helix domains in a wide variety of proteins. Probab=25.69 E-value=61 Score=15.06 Aligned_cols=39 Identities=21% Similarity=0.089 Sum_probs=31.9 Q ss_pred CCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEC Q ss_conf 04898750279997771999999999999998539189938 Q gi|254781152|r 6 DRGGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIFIQE 46 (155) Q Consensus 6 drGgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l~D 46 (155) ...+| ..-..||-.+|+|.++-.+-++.|.+.|..+... T Consensus 11 ~~~~~--vt~~~La~~l~VSr~TV~rdi~~L~~~G~~I~s~ 49 (55) T pfam08279 11 QAREP--ISGQELAEKLGVSRRTIRRDIKALEALGVPIESE 49 (55) T ss_pred HCCCC--CCHHHHHHHHCCCHHHHHHHHHHHHHCCCCEEEC T ss_conf 76999--1899999996988999999999999889978846 No 42 >PRK11753 cAMP-regulatory protein; Provisional Probab=25.40 E-value=62 Score=15.03 Aligned_cols=31 Identities=23% Similarity=0.282 Sum_probs=12.0 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCEEE Q ss_conf 2799977719999999999999985391899 Q gi|254781152|r 14 NDKYIAGVCGCSIRRWRNIRAILEKFNKIFI 44 (155) Q Consensus 14 N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l 44 (155) .-+-||..+|+|-.+.-.++.-|.+.|.|.. T Consensus 170 t~~eLA~~lG~sretvsR~L~~L~~~GlI~~ 200 (211) T PRK11753 170 TRQEIGRIVGCSREMVGRVLKMLEDQGLISA 200 (211) T ss_pred CHHHHHHHHCCCHHHHHHHHHHHHHCCCEEE T ss_conf 9999998869989999999999998899998 No 43 >pfam02329 HDC Histidine carboxylase PI chain. Histidine carboxylase catalyses the formation of histamine from histidine. Cleavage of the proenzyme PI chain yields two subunits, alpha and beta, which arrange as a hexamer (alpha beta)6. Probab=25.28 E-value=44 Score=15.89 Aligned_cols=71 Identities=17% Similarity=0.221 Sum_probs=45.2 Q ss_pred HHHHHHCCC---EEEECCCEE--CHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCC-CHHHHHHCCCCC Q ss_conf 999985391---899389242--18999999975677887422011124566667776556643015-647863084112 Q gi|254781152|r 33 RAILEKFNK---IFIQEGNIY--NVRVEKEIIKASEERQENGRKGGIKSSQMRILSKKTNNLFQGTL-KPAHVFQKPESI 106 (155) Q Consensus 33 Ld~Ll~~GK---I~l~DG~LW--N~RVEeEL~n~sEer~~agRkGg~~S~Q~R~~~kk~n~~fq~~~-~~~~~~~~~~~~ 106 (155) +..+-+..| ++++|.++| +..-|+||+...|+++.+--+--..-|| ..+.+|.-+. --+|++-+|-.| T Consensus 185 isi~~dr~~~a~lfiEDaG~w~~~~~ne~el~~~L~~~rk~v~~Si~~CG~------Dq~v~y~~~~ig~ay~~m~PGqi 258 (306) T pfam02329 185 LAILKDRTKGSSLFVEDAGVWGDENPTEEELKEFLEGRLRKVTKSIALCGQ------DQHVSYERIYIGYKYTMMEPGQV 258 (306) T ss_pred EEECCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCC------CCCEEEEEEEEEEEEEEECCCCC T ss_conf 502117776750144305520577997489999999999999999998657------76705788787778997157652 Q ss_pred CCC Q ss_conf 564 Q gi|254781152|r 107 NNY 109 (155) Q Consensus 107 ~~~ 109 (155) .|- T Consensus 259 G~A 261 (306) T pfam02329 259 GCA 261 (306) T ss_pred CCE T ss_conf 432 No 44 >KOG4357 consensus Probab=24.88 E-value=30 Score=16.85 Aligned_cols=14 Identities=50% Similarity=0.862 Sum_probs=9.8 Q ss_pred CCCCCCHHHCCCCH Q ss_conf 44764234358983 Q gi|254781152|r 109 YSAPVDFEKINPNP 122 (155) Q Consensus 109 ~~~~~~~~~~~~~~ 122 (155) -+||+|||+|.|.. T Consensus 38 ~~~~ld~eki~~ka 51 (164) T KOG4357 38 KPAPLDFEKIDAKA 51 (164) T ss_pred CCCCCCHHHCCCCC T ss_conf 99987865658553 No 45 >KOG0774 consensus Probab=24.85 E-value=37 Score=16.35 Aligned_cols=100 Identities=19% Similarity=0.193 Sum_probs=56.1 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCC---------CEEEEC--CCEECHHHHHHHH-----------HHHHHHHHHHHH Q ss_conf 279997771999999999999998539---------189938--9242189999999-----------756778874220 Q gi|254781152|r 14 NDKYIAGVCGCSIRRWRNIRAILEKFN---------KIFIQE--GNIYNVRVEKEII-----------KASEERQENGRK 71 (155) Q Consensus 14 N~r~LAg~CGCSvKkFr~aLd~Ll~~G---------KI~l~D--G~LWN~RVEeEL~-----------n~sEer~~agRk 71 (155) -++-+.+.-+---+||..+-..|-+.- .+.-++ -|=|....-+-|+ -+.|.+++-+++ T Consensus 148 ~~ke~e~m~~~i~~kF~~iq~~lkqstce~vmiLr~r~ldarRKRRNFsK~aTeiLneyF~~h~~nPYPSee~K~eLAkq 227 (334) T KOG0774 148 MPKEIERMVQIISKKFSHIQMQLKQSTCEAVMILRSRFLDARRKRRNFSKQATEILNEYFYSHLSNPYPSEEAKEELAKQ 227 (334) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 67999999999998889999998888899999999999987776516442179999999997337999837899999987 Q ss_pred CCHHHHH-------HHHHHHHHHHHHHCCC------CHHHHHHCCCCCCCCCCCC Q ss_conf 1112456-------6667776556643015------6478630841125644764 Q gi|254781152|r 72 GGIKSSQ-------MRILSKKTNNLFQGTL------KPAHVFQKPESINNYSAPV 113 (155) Q Consensus 72 Gg~~S~Q-------~R~~~kk~n~~fq~~~------~~~~~~~~~~~~~~~~~~~ 113 (155) .+|+-+| +||--||.-.+||..- |++++-..-..-+-||.|- T Consensus 228 CnItvsQvsnwfgnkrIrykK~~~k~~ee~~l~~~kk~~~~~~~g~~~~~~s~~p 282 (334) T KOG0774 228 CNITVSQVSNWFGNKRIRYKKNMGKNQEEANLYAAKKAVDATPPGAPGNSYSTPP 282 (334) T ss_pred CCCEEHHHCCCCCCCEEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC T ss_conf 2934056323224540321531002365554676426556899999889988899 No 46 >PRK08979 acetolactate synthase 3 catalytic subunit; Validated Probab=24.41 E-value=64 Score=14.92 Aligned_cols=44 Identities=16% Similarity=0.113 Sum_probs=35.7 Q ss_pred CCCHHHHHHHHHHHHHCCC-EEEECCCEECHHHHHHHHHHHHHHH Q ss_conf 9999999999999985391-8993892421899999997567788 Q gi|254781152|r 23 GCSIRRWRNIRAILEKFNK-IFIQEGNIYNVRVEKEIIKASEERQ 66 (155) Q Consensus 23 GCSvKkFr~aLd~Ll~~GK-I~l~DG~LWN~RVEeEL~n~sEer~ 66 (155) ..+......+++.|.+..+ +++.+++.+.++.++||.++.|+-. T Consensus 190 ~~~~~~i~~a~~~L~~A~rPvil~G~g~~~~~a~~~l~~lae~lg 234 (572) T PRK08979 190 SGHKGQIRRGLQALLAAKKPVLYVGGGAIISGCDKQILALAEKLN 234 (572) T ss_pred CCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHC T ss_conf 889999999999998279988997875353568999999998639 No 47 >COG1349 GlpR Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism] Probab=24.34 E-value=65 Score=14.91 Aligned_cols=44 Identities=16% Similarity=0.077 Sum_probs=35.0 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCEEE-ECCCEECHHHHHH Q ss_conf 2799977719999999999999985391899-3892421899999 Q gi|254781152|r 14 NDKYIAGVCGCSIRRWRNIRAILEKFNKIFI-QEGNIYNVRVEKE 57 (155) Q Consensus 14 N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~l-~DG~LWN~RVEeE 57 (155) ..+-||...|.|.-+.|.=|+.|.+.|++.- -+|-.-+...+.| T Consensus 21 ~v~eLa~~~~VS~~TIRRDL~~Le~~g~l~R~hGGa~~~~~~~~~ 65 (253) T COG1349 21 SVEELAELFGVSEMTIRRDLNELEEQGLLLRVHGGAVLPDSESEY 65 (253) T ss_pred EHHHHHHHHCCCHHHHHHHHHHHHHCCEEEEEECEEECCCCCCCC T ss_conf 799999885978999986199997789289960618837766553 No 48 >PRK06965 acetolactate synthase 3 catalytic subunit; Validated Probab=23.38 E-value=68 Score=14.80 Aligned_cols=41 Identities=10% Similarity=0.144 Sum_probs=33.8 Q ss_pred CHHHHHHHHHHHHHCCC-EEEECCCEECHHHHHHHHHHHHHH Q ss_conf 99999999999985391-899389242189999999756778 Q gi|254781152|r 25 SIRRWRNIRAILEKFNK-IFIQEGNIYNVRVEKEIIKASEER 65 (155) Q Consensus 25 SvKkFr~aLd~Ll~~GK-I~l~DG~LWN~RVEeEL~n~sEer 65 (155) +-...+++++.|.+..+ +++.+++++..+..+||.++.|+- T Consensus 207 ~~~~i~~a~~lL~~AkrPvIl~G~g~~~~~a~~~l~~lae~l 248 (587) T PRK06965 207 HSGQIRKAVSLLLSAKRPYIYTGGGVILANASRELRQLADLL 248 (587) T ss_pred CHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHH T ss_conf 999999999999856898897078820434899999999997 No 49 >COG3960 Glyoxylate carboligase [General function prediction only] Probab=22.87 E-value=69 Score=14.74 Aligned_cols=36 Identities=14% Similarity=0.237 Sum_probs=26.6 Q ss_pred HHHHHHHHHHHCCC-EEEECCCEECHHHHHHHHHHHH Q ss_conf 99999999985391-8993892421899999997567 Q gi|254781152|r 28 RWRNIRAILEKFNK-IFIQEGNIYNVRVEKEIIKASE 63 (155) Q Consensus 28 kFr~aLd~Ll~~GK-I~l~DG~LWN~RVEeEL~n~sE 63 (155) ...+++.+|.+..+ +++.+|++.|.+..+-+.++-| T Consensus 192 qaekalaml~~aerplivagggiinadaa~l~~efae 228 (592) T COG3960 192 QAEKALAMLIQAERPLIVAGGGVINADAAALLQEFAE 228 (592) T ss_pred HHHHHHHHHHHCCCCEEEECCCEECCCHHHHHHHHHH T ss_conf 8999999875302866983485534249999999998 No 50 >PRK13626 transcriptional regulator SgrR; Provisional Probab=21.78 E-value=73 Score=14.61 Aligned_cols=36 Identities=19% Similarity=0.231 Sum_probs=34.2 Q ss_pred CCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCEE Q ss_conf 898750279997771999999999999998539189 Q gi|254781152|r 8 GGSIPDNDKYIAGVCGCSIRRWRNIRAILEKFNKIF 43 (155) Q Consensus 8 GgPL~~N~r~LAg~CGCSvKkFr~aLd~Ll~~GKI~ 43 (155) |.|......-||-++.||-|..+.++.-|.+.|=|. T Consensus 19 g~~~~~tl~elA~~l~cS~Rn~r~lL~~m~~~gWl~ 54 (551) T PRK13626 19 GKSQETTLNELAELLNCSRRHMRTLLNTMQDRGWLT 54 (551) T ss_pred CCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEE T ss_conf 974674399999986888578999999999888878 No 51 >COG0028 IlvB Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] [Amino acid transport and metabolism / Coenzyme metabolism] Probab=21.46 E-value=74 Score=14.57 Aligned_cols=40 Identities=13% Similarity=0.277 Sum_probs=33.9 Q ss_pred HHHHHHHHHHHHCCC-EEEECCCEECHHHHHHHHHHHHHHH Q ss_conf 999999999985391-8993892421899999997567788 Q gi|254781152|r 27 RRWRNIRAILEKFNK-IFIQEGNIYNVRVEKEIIKASEERQ 66 (155) Q Consensus 27 KkFr~aLd~Ll~~GK-I~l~DG~LWN~RVEeEL~n~sEer~ 66 (155) ..-+++.+.|.+.-+ +++.+|+.|...+.+|+.++.|+-. T Consensus 188 ~~i~~aa~~L~~AkrPvIl~G~G~~~a~a~~~l~~lae~~~ 228 (550) T COG0028 188 EAIRKAAELLAEAKRPVILAGGGVRRAGASEELRELAEKLG 228 (550) T ss_pred HHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHC T ss_conf 99999999997389978998886564134999999999879 No 52 >PRK07979 acetolactate synthase 3 catalytic subunit; Validated Probab=21.10 E-value=75 Score=14.53 Aligned_cols=40 Identities=13% Similarity=0.120 Sum_probs=33.3 Q ss_pred HHHHHHHHHHHHHCCC-EEEECCCEECHHHHHHHHHHHHHH Q ss_conf 9999999999985391-899389242189999999756778 Q gi|254781152|r 26 IRRWRNIRAILEKFNK-IFIQEGNIYNVRVEKEIIKASEER 65 (155) Q Consensus 26 vKkFr~aLd~Ll~~GK-I~l~DG~LWN~RVEeEL~n~sEer 65 (155) .....++++.|....+ +++.+++.|.+..++|+.++.|+- T Consensus 193 ~~~i~~a~~~L~~AkrPvIi~G~g~~~~~a~~~l~~lae~l 233 (574) T PRK07979 193 KGQIKRALQTLVAAKKPVVYVGGGAINAACHQQLKELVEKL 233 (574) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHH T ss_conf 99999999999727998899827734354899999999871 No 53 >pfam07042 TrfA TrfA protein. This family consists of several bacterial TrfA proteins. The trfA operon of broad-host-range IncP plasmids is essential to activate the origin of vegetative replication in diverse species. The trfA operon encodes two ORFs. The first ORF is highly conserved and encodes a putative single-stranded DNA binding protein (Ssb). The second, trfA, contains two translational starts as in the IncP alpha plasmids, generating related polypeptides of 406 (TrfA1) and 282 (TrfA2) amino acids. TrfA2 is very similar to the IncP alpha product, whereas the N-terminal region of TrfA1 shows very little similarity to the equivalent region of IncP alpha TrfA1. This region has been implicated in the ability of IncP alpha plasmids to replicate efficiently in Pseudomonas aeruginosa. Probab=20.20 E-value=79 Score=14.42 Aligned_cols=22 Identities=18% Similarity=0.313 Sum_probs=11.9 Q ss_pred HHHHHHHHHHCCCCHHHHHHCC Q ss_conf 7776556643015647863084 Q gi|254781152|r 82 LSKKTNNLFQGTLKPAHVFQKP 103 (155) Q Consensus 82 ~~kk~n~~fq~~~~~~~~~~~~ 103 (155) +..++-.+|.|.--....+++- T Consensus 187 l~peiv~LF~~d~~T~~~We~R 208 (282) T pfam07042 187 IDAEIVVLFAGDHYTKFIWEKY 208 (282) T ss_pred ECHHHHHHCCCCCCEEEEHHHH T ss_conf 8858986507787112249987 Done!