Query gi|254781037|ref|YP_003065450.1| porphobilinogen deaminase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 307 No_of_seqs 169 out of 2086 Neff 5.8 Searched_HMMs 39220 Date Mon May 30 04:29:40 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781037.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00212 hemC porphobilinogen 100.0 0 0 814.4 24.8 294 6-300 1-312 (312) 2 PRK00072 hemC porphobilinogen 100.0 0 0 774.7 30.5 299 1-302 1-300 (300) 3 COG0181 HemC Porphobilinogen d 100.0 0 0 774.4 28.0 301 4-306 2-303 (307) 4 cd00494 HMBS Hydroxymethylbila 100.0 0 0 760.6 29.8 291 6-298 1-292 (292) 5 KOG2892 consensus 100.0 0 0 613.1 16.9 304 1-306 3-313 (320) 6 pfam01379 Porphobil_deam Porph 100.0 0 0 581.9 16.6 213 5-219 1-213 (213) 7 PRK01066 porphobilinogen deami 100.0 0 0 580.9 17.2 219 1-226 16-234 (234) 8 pfam03900 Porphobil_deamC Porp 99.8 7.7E-20 2E-24 148.8 10.6 74 227-300 1-74 (74) 9 PRK09861 cytoplasmic membrane 96.7 0.064 1.6E-06 32.8 12.6 195 3-228 31-258 (272) 10 PRK10797 glutamate and asparta 96.3 0.1 2.6E-06 31.5 13.4 184 3-219 39-250 (302) 11 PRK11260 cystine transporter s 95.9 0.11 2.8E-06 31.2 9.1 145 66-218 88-237 (264) 12 PRK10859 putative transglycosy 95.2 0.3 7.6E-06 28.5 11.0 146 3-173 42-204 (507) 13 PRK09959 hybrid sensory histid 94.8 0.36 9.2E-06 27.9 10.2 147 66-220 352-501 (1197) 14 pfam09084 NMT1 NMT1/THI5 like. 94.4 0.3 7.7E-06 28.4 7.7 121 34-174 19-149 (216) 15 PRK11063 metQ DL-methionine tr 94.0 0.26 6.7E-06 28.8 6.8 199 3-230 30-259 (271) 16 pfam03180 Lipoprotein_9 NLPA l 93.4 0.31 8E-06 28.3 6.3 193 6-230 1-224 (236) 17 cd00134 PBPb Bacterial peripla 92.8 0.85 2.2E-05 25.5 9.9 147 67-224 49-200 (218) 18 pfam00497 SBP_bac_3 Bacterial 92.4 0.84 2.1E-05 25.5 7.2 144 66-224 48-204 (224) 19 COG0834 HisJ ABC-type amino ac 92.3 1 2.5E-05 25.1 7.5 153 66-227 87-248 (275) 20 COG1464 NlpA ABC-type metal io 91.2 0.94 2.4E-05 25.2 6.4 196 4-231 29-257 (268) 21 PRK11553 alkanesulfonate trans 89.3 1.9 4.8E-05 23.3 7.5 149 4-179 27-190 (314) 22 PRK11480 tauA taurine transpor 88.8 2 5.2E-05 23.0 9.4 196 4-233 23-240 (320) 23 smart00062 PBPb Bacterial peri 88.3 2.2 5.5E-05 22.9 9.7 142 66-218 49-198 (219) 24 PRK11917 bifunctional adhesin/ 88.0 2.3 5.8E-05 22.7 10.6 142 65-219 90-240 (259) 25 COG0715 TauA ABC-type nitrate/ 86.1 2.9 7.4E-05 22.1 8.6 158 68-228 74-250 (335) 26 CHL00180 rbcR LysR transcripti 85.2 3.2 8.1E-05 21.8 10.9 178 4-212 95-287 (307) 27 PRK09495 glnH glutamine ABC tr 84.6 3.4 8.7E-05 21.6 9.1 146 3-175 24-182 (247) 28 pfam04069 OpuAC Substrate bind 83.6 3.8 9.6E-05 21.3 5.8 188 4-222 1-226 (256) 29 TIGR03431 PhnD phosphonate ABC 81.6 4.4 0.00011 20.9 8.7 154 4-180 27-203 (288) 30 PRK04694 Maf-like protein; Rev 78.4 0.79 2E-05 25.7 0.8 114 126-263 3-120 (190) 31 PRK12684 transcriptional regul 78.1 5.7 0.00015 20.1 6.3 133 65-211 131-278 (313) 32 PRK12683 transcriptional regul 74.5 7.1 0.00018 19.5 6.7 136 65-211 131-278 (308) 33 pfam03466 LysR_substrate LysR 71.3 8.5 0.00022 19.0 12.8 175 3-211 5-191 (209) 34 PRK00078 Maf-like protein; Rev 68.6 2.6 6.5E-05 22.4 1.4 29 126-156 4-32 (192) 35 PRK02676 consensus 64.0 4.1 0.00011 21.1 1.7 34 123-158 3-36 (196) 36 PRK12679 cbl transcriptional r 61.7 13 0.00033 17.8 5.9 161 64-237 130-305 (316) 37 PRK04425 Maf-like protein; Rev 61.6 4.9 0.00013 20.6 1.7 31 123-155 5-35 (196) 38 pfam05023 Phytochelatin Phytoc 61.6 2.5 6.3E-05 22.5 0.2 66 197-262 102-171 (208) 39 TIGR00060 L18_bact ribosomal p 61.3 7.1 0.00018 19.6 2.5 45 260-304 35-91 (118) 40 TIGR01027 proB glutamate 5-kin 60.7 3.9 1E-04 21.2 1.1 129 124-288 207-351 (379) 41 PRK12681 cysB transcriptional 59.8 14 0.00036 17.6 6.5 188 5-228 94-298 (324) 42 PRK02478 Maf-like protein; Rev 58.2 6.1 0.00015 20.0 1.7 26 234-259 96-121 (199) 43 PRK09508 leuO leucine transcri 57.8 15 0.00039 17.4 9.2 58 4-84 112-169 (314) 44 PRK00238 consensus 57.1 5.3 0.00013 20.4 1.3 24 126-149 5-28 (198) 45 PRK04719 consensus 56.9 5.6 0.00014 20.2 1.4 26 234-259 92-117 (188) 46 PRK01946 consensus 56.6 6.2 0.00016 19.9 1.6 26 125-150 11-36 (195) 47 PRK00234 Maf-like protein; Rev 56.6 5.7 0.00015 20.1 1.4 107 125-256 4-111 (192) 48 PRK00648 Maf-like protein; Rev 56.3 6.1 0.00016 19.9 1.5 114 125-260 5-118 (191) 49 PRK03797 consensus 55.9 3.9 0.0001 21.2 0.4 109 126-260 3-111 (186) 50 PRK03442 consensus 55.9 5.8 0.00015 20.1 1.3 25 125-149 4-28 (213) 51 PRK00884 Maf-like protein; Rev 54.9 6.3 0.00016 19.9 1.4 31 125-157 4-34 (194) 52 cd00555 Maf Nucleotide binding 54.6 6.1 0.00016 20.0 1.2 110 126-258 2-111 (180) 53 pfam02642 consensus 54.4 17 0.00044 17.0 7.4 100 65-171 37-140 (259) 54 TIGR03427 ABC_peri_uca ABC tra 53.7 18 0.00045 17.0 7.4 166 4-195 2-183 (328) 55 PRK00148 Maf-like protein; Rev 53.6 6.5 0.00017 19.8 1.3 32 125-158 4-35 (195) 56 PRK03415 consensus 52.2 7.5 0.00019 19.4 1.4 27 234-260 90-116 (197) 57 PRK02141 Maf-like protein; Rev 51.3 9.1 0.00023 18.8 1.7 33 122-156 8-40 (206) 58 PRK04056 Maf-like protein; Rev 50.7 8.3 0.00021 19.1 1.4 114 126-262 3-116 (180) 59 PRK00032 Maf-like protein; Rev 50.0 8.3 0.00021 19.1 1.3 25 235-259 91-115 (189) 60 PRK12682 transcriptional regul 49.9 20 0.00052 16.6 4.7 134 65-211 131-278 (309) 61 PRK11233 nitrogen assimilation 49.5 21 0.00052 16.6 11.7 134 5-170 92-238 (305) 62 cd00985 Maf_Ham1 Maf_Ham1. Maf 49.4 7.2 0.00018 19.5 0.9 108 126-257 2-109 (131) 63 COG0424 Maf Nucleotide-binding 48.8 11 0.00027 18.4 1.8 107 124-258 4-115 (193) 64 KOG2098 consensus 48.5 9.9 0.00025 18.6 1.5 16 14-29 8-23 (591) 65 pfam02545 Maf Maf-like protein 47.4 9.8 0.00025 18.6 1.4 27 234-260 91-117 (193) 66 PRK03411 consensus 46.8 10 0.00026 18.5 1.4 25 125-149 4-28 (194) 67 PRK01839 Maf-like protein; Rev 44.2 12 0.0003 18.1 1.4 32 123-156 10-41 (209) 68 PRK04222 consensus 43.8 12 0.0003 18.1 1.3 30 125-156 5-34 (191) 69 PRK10263 DNA translocase FtsK; 41.6 19 0.00049 16.8 2.1 38 142-179 1184-1224(1355) 70 PRK01526 Maf-like protein; Rev 41.3 17 0.00043 17.1 1.8 23 118-142 5-27 (205) 71 PRK05593 rplR 50S ribosomal pr 38.9 30 0.00076 15.5 5.7 45 260-304 36-88 (115) 72 PRK03098 consensus 37.3 15 0.00037 17.5 1.0 111 126-260 1-111 (185) 73 PRK11716 DNA-binding transcrip 35.5 34 0.00086 15.2 10.0 169 5-211 68-253 (269) 74 PRK01441 Maf-like protein; Rev 34.3 21 0.00053 16.5 1.4 23 234-256 99-121 (207) 75 COG0263 ProB Glutamate 5-kinas 33.9 13 0.00034 17.7 0.3 135 123-294 207-347 (369) 76 PRK09791 putative DNA-binding 33.4 36 0.00093 14.9 12.3 176 4-211 95-280 (302) 77 TIGR02114 coaB_strep phosphopa 33.0 37 0.00094 14.9 5.1 50 35-84 40-91 (253) 78 TIGR01382 PfpI intracellular p 32.4 8.2 0.00021 19.1 -1.0 20 153-172 64-83 (189) 79 KOG0605 consensus 31.7 39 0.00099 14.8 4.0 76 193-270 144-226 (550) 80 TIGR01915 npdG NADPH-dependent 31.1 40 0.001 14.7 3.5 29 2-30 30-58 (233) 81 TIGR00742 yjbN TIM-barrel prot 30.4 21 0.00054 16.5 0.9 119 117-240 51-178 (326) 82 PRK12680 transcriptional regul 30.2 41 0.001 14.6 6.6 175 5-218 94-284 (327) 83 COG3265 GntK Gluconate kinase 30.1 35 0.0009 15.0 2.0 42 123-164 68-111 (161) 84 TIGR00172 maf septum formation 29.7 34 0.00086 15.1 1.8 20 123-142 3-22 (212) 85 pfam10080 DUF2318 Predicted me 28.7 23 0.00058 16.3 0.8 29 234-263 61-92 (102) 86 TIGR00422 valS valyl-tRNA synt 26.8 47 0.0012 14.2 3.0 105 52-161 232-359 (970) 87 TIGR01313 therm_gnt_kin carboh 26.4 48 0.0012 14.2 2.5 44 124-167 79-130 (175) 88 PRK09986 DNA-binding transcrip 25.6 49 0.0013 14.1 15.6 170 4-211 81-264 (278) 89 COG3043 NapB Nitrate reductase 25.1 16 0.0004 17.3 -0.6 82 75-160 68-152 (155) 90 pfam02621 DUF178 Putative peri 24.3 52 0.0013 13.9 4.7 101 65-176 34-143 (248) 91 PRK05429 gamma-glutamyl kinase 23.9 51 0.0013 14.0 1.8 129 123-289 209-344 (372) 92 PRK05331 putative glycerol-3-p 23.8 33 0.00084 15.2 0.9 38 128-170 189-226 (317) 93 CHL00139 rpl18 ribosomal prote 23.7 53 0.0014 13.9 5.3 63 238-304 7-82 (109) 94 pfam04261 Dyp_perox Dyp-type p 23.4 54 0.0014 13.8 3.7 57 214-270 187-255 (311) 95 pfam07302 AroM AroM protein. T 22.9 55 0.0014 13.8 2.0 112 65-209 16-134 (221) 96 PRK11545 gntK gluconate kinase 21.9 58 0.0015 13.6 2.9 90 121-223 79-173 (177) 97 cd03575 NTR_WFIKKN NTR domain, 21.8 52 0.0013 13.9 1.6 25 117-141 82-106 (109) 98 COG3572 GshA Gamma-glutamylcys 21.7 46 0.0012 14.3 1.3 34 174-208 63-96 (456) 99 PRK07765 para-aminobenzoate sy 21.5 45 0.0011 14.3 1.2 25 62-86 10-34 (221) 100 TIGR03414 ABC_choline_bnd chol 21.2 60 0.0015 13.5 7.3 62 2-89 6-67 (290) 101 COG1732 OpuBC Periplasmic glyc 21.1 60 0.0015 13.5 5.8 201 3-233 32-281 (300) 102 KOG4134 consensus 20.7 61 0.0016 13.5 1.9 65 149-214 50-123 (253) No 1 >TIGR00212 hemC porphobilinogen deaminase; InterPro: IPR000860 Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway . The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin . The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA. The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III. Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) . This entry represents hydroxymethylbilane synthase (or porphobilinogen deaminase, 2.5.1.61 from EC), which functions during the second stage of tetrapyrrole biosynthesis. This enzyme catalyses the polymerisation of four PBG molecules into the tetrapyrrole structure, preuroporphyrinogen, with the concomitant release of four molecules of ammonia. This enzyme uses a unique dipyrro-methane cofactor made from two molecules of PBG, which is covalently attached to a cysteine side chain. The tetrapyrrole product is synthesized in an ordered, sequential fashion, by initial attachment of the first pyrrole unit (ring A) to the cofactor, followed by subsequent additions of the remaining pyrrole units (rings B, C, D) to the growing pyrrole chain . The link between the pyrrole ring and the cofactor is broken once all the pyrroles have been added. This enzyme is folded into three distinct domains that enclose a single, large active site that makes use of an aspartic acid as its one essential catalytic residue, acting as a general acid/base during catalysis , . A deficiency of hydroxymethylbilane synthase is implicated in the neuropathic disease, Acute Intermittent Porphyria (AIP) . ; GO: 0004418 hydroxymethylbilane synthase activity, 0033014 tetrapyrrole biosynthetic process. Probab=100.00 E-value=0 Score=814.43 Aligned_cols=294 Identities=46% Similarity=0.673 Sum_probs=286.2 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCC-HHHCCCHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 8997079888899999999999974089983389999973274115875-567486226999999998709836997403 Q gi|254781037|r 6 FRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRT-LTEIGGKGLFTEEIEKKLISGEIDCAVHSA 84 (307) Q Consensus 6 i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~-l~~~g~kg~Ftkele~aLl~g~iDiAVHSl 84 (307) ||||||+|+||||||++|.+.|+++|+ |.+++||++|+|+||+++++| |+++||||+||||||+|||+|+|||||||| T Consensus 1 l~IGTRgS~LAl~Qa~~V~~~l~~~~~-p~~~~ei~~~kT~GD~~~~~~~L~~~GgKGlFtKEle~aLL~geiD~AVHSl 79 (312) T TIGR00212 1 LRIGTRGSKLALAQANLVKEQLKEVYP-PELDTEIVIIKTTGDKIQDKPLLSDIGGKGLFTKELEQALLDGEIDLAVHSL 79 (312) T ss_pred CEEECCCCHHHHHHHHHHHHHHHHHCC-CCCCEEEEEEEECCCCCCCCCCEEECCCHHHHHHHHHHHHHCCCCCEEEECC T ss_conf 978404868999999999999997369-7402179999804861116100021375036688999996259844888430 Q ss_pred CCCCCCC-CCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHC Q ss_conf 4321110-122101222365683105665243243324567322047530288899842377310330767999998637 Q gi|254781037|r 85 KDMPTKL-LKGLQISAYLPREDIRDVFISHTAQSLKDLALHSVIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNN 163 (307) Q Consensus 85 KDlP~~~-~~~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~ 163 (307) ||||+.+ |+||.|+||++|+|||||||++++.+|.+||+||+|||||+||++||+.+||||+++++||||+|||+||++ T Consensus 80 KD~P~~~~~~gL~~~a~~~RedP~D~l~s~~~~~l~~LP~GA~vGTSS~RR~aql~~~RPDl~~~~lRGNi~TRl~Kl~~ 159 (312) T TIGR00212 80 KDVPTVLSPEGLEIAAVLKREDPRDVLVSRKYLSLEELPQGAKVGTSSLRRKAQLKALRPDLEIEPLRGNIDTRLRKLDE 159 (312) T ss_pred CCCCCCCCCCCCEEEEECCCCCCCCEEEECCCCCHHHCCCCCEEECCHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHCC T ss_conf 23576559888278635587886426886278885127798889745799999998308994499746987999998518 Q ss_pred -CCCCEEEEEHHHHHHCCCC-CCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHC Q ss_conf -8765563014545421621-12223368667358543207899985276124556776239788888788999998717 Q gi|254781037|r 164 -NKAHAILLAYAGIKRLKKE-KVIKEILNIEDFPPSPGQGAICIETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELD 241 (307) Q Consensus 164 -g~~DaiilA~Agl~RL~l~-~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~ 241 (307) |+|||||||+|||+|||+. +.++++|+++.|+|||||||||||||+||.++.++++.|||++|+.|+.|||+||+.|+ T Consensus 160 Gg~~DAiiLA~AGL~RLg~~~~~i~~~~~~~~~~PA~GQG~i~ve~R~dD~~~~~il~~i~h~~t~~~~~aER~fL~~L~ 239 (312) T TIGR00212 160 GGEYDAIILAAAGLKRLGLENDVITEVLDPEVMLPAVGQGAIAVECRKDDTEIKEILKEINHEETAVEVTAERAFLKELD 239 (312) T ss_pred CCCCHHHHHHHHHHHCCCCCCHHHHHCCCHHHCCCCCCCCEEEEEEECCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHC T ss_conf 88204899988875302541013431478114178866757999984587799998620288889999999999998728 Q ss_pred CCCCCCEEEEEEEECC-----EEEEEEEEECCCCC---EEEEEEEECCHH----HHHHHH--HHHHHHHHHHC Q ss_conf 7648852899999999-----89999999869987---899999843988----899999--99999999706 Q gi|254781037|r 242 GSCKSAIAGFAYCKGS-----TLYFYGIVLASDGK---IFHEVSRNGNRC----DAINIG--RDAAQYIRFIS 300 (307) Q Consensus 242 GgC~~PIGa~A~i~~~-----~l~l~a~v~s~DG~---~~i~~~~~g~~~----~a~~LG--~~la~~Ll~~g 300 (307) |||++||||||+++++ +|+|+|+|+++||+ ..++.+..++.+ +++++| .++|+++++++ T Consensus 240 GgC~~Pig~~a~~~~~~~~t~~l~l~a~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~A~~~~~~~ 312 (312) T TIGR00212 240 GGCQTPIGAYAEYEGDDNDTDKLKLKAVVLSLDGKIVREVIRAEKEGNIENKKEDAEELGNPLEVAEELLKRG 312 (312) T ss_pred CCCCCCEEEEEEEECCCCCCCEEEEEEEEECCCCCEEEEEHHHHHCCCHHHHHCCHHHCCCHHHHHHHHHHHC T ss_conf 8964331468898337788427999999971787164001001320313444105765067889999999719 No 2 >PRK00072 hemC porphobilinogen deaminase; Reviewed Probab=100.00 E-value=0 Score=774.73 Aligned_cols=299 Identities=43% Similarity=0.655 Sum_probs=293.0 Q ss_pred CCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEE Q ss_conf 99882899707988889999999999997408998338999997327411587556748622699999999870983699 Q gi|254781037|r 1 MEKKPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCA 80 (307) Q Consensus 1 M~~~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiA 80 (307) |+ ++||||||+|+||+|||++|+++|++.+ |++++||++|+|+||+++++||+++||||+||||||++|++|+|||| T Consensus 1 M~-~~irIgtR~S~LAl~Qa~~V~~~L~~~~--p~~~~ei~~i~T~GD~~~~~pL~~iGgkG~Ftkele~aLl~g~iDiA 77 (300) T PRK00072 1 MM-RKLRIGTRGSKLALWQAEWVKDRLKAAH--PGLEVELVPIKTKGDRILDVPLAKIGGKGLFVKELEEALLDGEIDIA 77 (300) T ss_pred CC-CCEEEEECCCHHHHHHHHHHHHHHHHHC--CCCEEEEEEEECCCCCCCCCCHHHCCCCEEEHHHHHHHHHHCCCCEE T ss_conf 99-7669997898999999999999999758--99718999996467445687578828950358999999982875525 Q ss_pred EECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHH Q ss_conf 74034321110122101222365683105665243243324567322047530288899842377310330767999998 Q gi|254781037|r 81 VHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTAQSLKDLALHSVIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNK 160 (307) Q Consensus 81 VHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~K 160 (307) ||||||||+++|+||.|+||++|+|||||||+++++++++||+|++|||||+||++||+++|||++|+++||||+|||+| T Consensus 78 VHSlKDlPt~~~~~l~i~avl~R~d~rDvLv~~~~~~l~~Lp~ga~IGTSS~RR~aql~~~~pdl~~~~iRGNv~TRl~K 157 (300) T PRK00072 78 VHSLKDVPTELPEGLVLAAIPEREDPRDAFVSNDYASLDDLPEGAVVGTSSLRRQAQLLALRPDLEIKDLRGNVDTRLRK 157 (300) T ss_pred EEECCCCCCCCCCCCEEEEEECCCCHHHHHHCCCCCCCCCCCCCCEEECCCHHHHHHHHHHCCCCCEEECCCCHHHHHHH T ss_conf 54134578758988558765547886785301466751338999888504650899999867999567766888999999 Q ss_pred HHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 63787655630145454216211222336866735854320789998527612455677623978888878899999871 Q gi|254781037|r 161 LNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQELVKVINHEDTWDSVNCERAFLAEL 240 (307) Q Consensus 161 l~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L 240 (307) |++|+|||+|||+|||+|||+.++++++|++++|+|||||||||||||++|.++.+++++|||++|+.|+.+||+||+.| T Consensus 158 l~~g~~DaiiLA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaiave~r~~d~~~~~~l~~in~~~t~~~~~aER~~L~~L 237 (300) T PRK00072 158 LDEGEYDAIILAAAGLKRLGLEDRITELLDPDEMLPAVGQGALGIECRADDEEILELLAPLNHAETRLCVTAERAFLRAL 237 (300) T ss_pred HHCCCCCEEEHHHHHHHHCCCCCCEEEEECHHHCCCCCCCCEEEEEEECCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHC T ss_conf 75678645420676786657854216985533357854630389999449988887766508988999999999999863 Q ss_pred CCCCCCCEEEEEEEECC-EEEEEEEEECCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHCCH Q ss_conf 77648852899999999-899999998699878999998439888999999999999970605 Q gi|254781037|r 241 DGSCKSAIAGFAYCKGS-TLYFYGIVLASDGKIFHEVSRNGNRCDAINIGRDAAQYIRFISNK 302 (307) Q Consensus 241 ~GgC~~PIGa~A~i~~~-~l~l~a~v~s~DG~~~i~~~~~g~~~~a~~LG~~la~~Ll~~g~k 302 (307) +|||++|||+||+++++ +++|+|+|+|+||+++++.+.+|+.+++.++|.++|++|+++||+ T Consensus 238 ~ggC~~PIg~~a~~~~~~~l~l~~~v~s~dG~~~i~~~~~g~~~~a~~lG~~la~~Ll~~Gak 300 (300) T PRK00072 238 EGGCQVPIGAYAEIEGDDELHLRGLVGSPDGSRIIREERIGNGADAEALGIELAEELLAQGAR 300 (300) T ss_pred CCCCCCCEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEECHHHHHHHHHHHHHHHHHCCCC T ss_conf 999998448899998898699999999899987999999857999999999999999975799 No 3 >COG0181 HemC Porphobilinogen deaminase [Coenzyme metabolism] Probab=100.00 E-value=0 Score=774.42 Aligned_cols=301 Identities=43% Similarity=0.652 Sum_probs=295.6 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 82899707988889999999999997408998338999997327411587556748622699999999870983699740 Q gi|254781037|r 4 KPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHS 83 (307) Q Consensus 4 ~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHS 83 (307) ++||||||+|+||+|||++|+++|++.+ |+++|||++|+|+||+++++||+++||||+||||||++|++|+||+|||| T Consensus 2 ~~irIgTR~S~LAl~Qa~~V~~~L~~~~--p~~~~ei~~i~T~GD~i~d~pL~~iGgkGlFtkEle~all~g~~DiAVHS 79 (307) T COG0181 2 MKLRIGTRGSKLALAQANEVIERLKAAY--PDLEVEIVTIKTKGDRILDKPLSKIGGKGLFTKELEQALLEGEIDIAVHS 79 (307) T ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHHC--CCCEEEEEEEECCCCHHCCCHHHHCCCCEEEHHHHHHHHHCCCCCEEEEE T ss_conf 7358851787889999999999999868--99538999985145142013488848818779999999976997789760 Q ss_pred CCCCCCCCCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHC Q ss_conf 34321110122101222365683105665243243324567322047530288899842377310330767999998637 Q gi|254781037|r 84 AKDMPTKLLKGLQISAYLPREDIRDVFISHTAQSLKDLALHSVIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNN 163 (307) Q Consensus 84 lKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~ 163 (307) |||||+++|++|.++||++|+||||+||++++.+|.+||+||+|||||+||+||++++|||++|+++||||||||+||.+ T Consensus 80 lKDvP~~~p~gL~laai~~R~dprDalVs~~~~~l~~LP~Ga~VGTSSlRR~aql~~~rPdl~i~~lRGNVdTRL~KL~~ 159 (307) T COG0181 80 LKDVPTELPEGLVLAAIPEREDPRDALVSRDGYDLEELPEGAVVGTSSLRRQAQLKALRPDLKIEPLRGNVDTRLRKLDE 159 (307) T ss_pred CCCCCCCCCCCCEEEEECCCCCHHHEEEECCCCCHHHCCCCCCCCCCHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHC T ss_conf 44487568999369996178995366888788856348998831452687999999869997699665768889998616 Q ss_pred CCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 87655630145454216211222336866735854320789998527612455677623978888878899999871776 Q gi|254781037|r 164 NKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGS 243 (307) Q Consensus 164 g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~Gg 243 (307) |+|||||||+|||+|||+.++++++|++++|+|||||||||||||++|.++.++++.|||.+|+.|+.|||+|++.|+|| T Consensus 160 g~yDAIILA~AGL~RLgl~~~~~~~l~p~~~~PA~gQGal~ie~R~~d~~~~~ll~~i~~~~t~~~v~aERa~l~~L~gg 239 (307) T COG0181 160 GEYDAIILAAAGLKRLGLENRITEILDPEEFLPAPGQGALAIECRAGDEKVLELLAELNDEDTRICVTAERAFLRELEGG 239 (307) T ss_pred CCCCHHHHHHHHHHHCCCCCCCEEECCHHHCCCCCCCCEEEEEEECCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 78529999999998628742220634754268988886699999639678999998615820889999999999962899 Q ss_pred CCCCEEEEEEEECC-EEEEEEEEECCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHCCHHHCC Q ss_conf 48852899999999-8999999986998789999984398889999999999999706056337 Q gi|254781037|r 244 CKSAIAGFAYCKGS-TLYFYGIVLASDGKIFHEVSRNGNRCDAINIGRDAAQYIRFISNKNVFN 306 (307) Q Consensus 244 C~~PIGa~A~i~~~-~l~l~a~v~s~DG~~~i~~~~~g~~~~a~~LG~~la~~Ll~~g~k~iLn 306 (307) |++|||+||.++++ +|+|+|.|+++||++.++.+..|+.+++.++|+++|++++++|++++|+ T Consensus 240 C~~PIg~~a~~~~~~~l~l~~~v~~~DG~~~~~~~~~g~~~~a~~lG~~~a~~l~~~ga~e~l~ 303 (307) T COG0181 240 CQVPIGAYAELTGGGELRLRALVGSPDGSQIIKAEIRGDAEDAEELGKELAEELLEKGAKEILE 303 (307) T ss_pred CCCCEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8771288899858973999999988887635788730656669999999999999987899987 No 4 >cd00494 HMBS Hydroxymethylbilane synthase (HMBS), also known as porphobilinogen deaminase (PBGD), is an intermediate enzyme in the biosynthetic pathway of tetrapyrrolic ring systems, such as heme, chlorophylls, and vitamin B12. HMBS catalyzes the conversion of porphobilinogen (PBG) into hydroxymethylbilane (HMB). HMBS consists of three domains, and is believed to bind substrate through a hinge-bending motion of domains I and II. HMBS is found in all organisms except viruses. Probab=100.00 E-value=0 Score=760.65 Aligned_cols=291 Identities=44% Similarity=0.625 Sum_probs=286.3 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 89970798888999999999999740899833899999732741158755674862269999999987098369974034 Q gi|254781037|r 6 FRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHSAK 85 (307) Q Consensus 6 i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHSlK 85 (307) ||||||+|+||+|||++|+++|++.+ |++++||++|+|+||+++++||+++||||+||||||++|++|++|||||||| T Consensus 1 IrIgTR~S~LAl~Qa~~V~~~L~~~~--p~~~~ei~~i~T~GD~~~~~~L~~~ggkG~Ftkele~aLl~g~iDiAVHSlK 78 (292) T cd00494 1 IRIGTRKSKLALIQTNKVIEKLKELC--PGIEVEIVIIKTTGDKILDKPLAKIGGKGLFTKELEEALLNGEIDLAVHSLK 78 (292) T ss_pred CEEEECCCHHHHHHHHHHHHHHHHHC--CCCCEEEEEEECCCCCCCCCCHHHCCCCCEEHHHHHHHHHCCCCCEEEEECC T ss_conf 98993798999999999999999758--9981899999665731367638774996144799999997098787986046 Q ss_pred CCCCCCCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCCC Q ss_conf 32111012210122236568310566524324332456732204753028889984237731033076799999863787 Q gi|254781037|r 86 DMPTKLLKGLQISAYLPREDIRDVFISHTAQSLKDLALHSVIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNNNK 165 (307) Q Consensus 86 DlP~~~~~~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~g~ 165 (307) |||+++|+||.|+|+++|+|||||||+++++++++||+|++|||||+||++||+++|||++|++|||||+|||+||++|+ T Consensus 79 DlPt~~~~gl~i~avl~R~dprDvlv~~~~~~l~~lp~ga~IGTSS~RR~aql~~~~pdl~i~~iRGNV~TRl~KL~~g~ 158 (292) T cd00494 79 DVPTELPEGLVLGAIPKREDPRDALVSRNGSSLEDLPAGSVVGTSSLRRQAQLKRKRPDLKFEPLRGNVDTRLRKLDEGE 158 (292) T ss_pred CCCCCCCCCCEEEEEECCCCCCCHHHCCCCHHHHCCCCCCEEEECCCCHHHHHHHHCCCCCEEECCCCHHHHHHHHHCCC T ss_conf 57774798746876634788222444064002212898987851474379999986899954676588999999851678 Q ss_pred CCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 65563014545421621122233686673585432078999852761245567762397888887889999987177648 Q gi|254781037|r 166 AHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCK 245 (307) Q Consensus 166 ~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~ 245 (307) |||||||+|||+|||+.++++++|++++|+|||||||||||||++|.++.+++++|||++|+.|+.+||+||+.|+|||+ T Consensus 159 ~DaiILA~AGL~RLgl~~~i~~~l~~~~~~PA~gQGaiave~r~~d~~~~~~l~~in~~~t~~~~~aER~~l~~l~ggC~ 238 (292) T cd00494 159 YDAIILAAAGLKRLGLEDRITQYLSPEVMLPAVGQGALAIECRKGDEELLALLKPLNHEETALCVLAERAFLRELEGGCQ 238 (292) T ss_pred CCEEEEHHHHHHHCCCCCCCCCCCCHHCCCCCCCCCEEEEEEECCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 74775446567554784202300363107887665306888724898888888775587889999999999998489999 Q ss_pred CCEEEEEEEECC-EEEEEEEEECCCCCEEEEEEEECCHHHHHHHHHHHHHHHHH Q ss_conf 852899999999-89999999869987899999843988899999999999997 Q gi|254781037|r 246 SAIAGFAYCKGS-TLYFYGIVLASDGKIFHEVSRNGNRCDAINIGRDAAQYIRF 298 (307) Q Consensus 246 ~PIGa~A~i~~~-~l~l~a~v~s~DG~~~i~~~~~g~~~~a~~LG~~la~~Ll~ 298 (307) +||||||+++++ +++|+|+|+|+||+++++.+.+|+.+++.++|.++|++|++ T Consensus 239 ~Piga~a~i~~~~~l~l~~~v~s~DG~~~~~~~~~g~~~~a~~lG~~lA~~Lla 292 (292) T cd00494 239 VPIGVYAELDGGEELKLKGLVGSPDGKRLIEGEVRGNIEDAEELGKKLAEELLA 292 (292) T ss_pred CCEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEECHHHHHHHHHHHHHHHHC T ss_conf 845889999699879999999988898799999997789999999999999719 No 5 >KOG2892 consensus Probab=100.00 E-value=0 Score=613.06 Aligned_cols=304 Identities=39% Similarity=0.564 Sum_probs=288.6 Q ss_pred CCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEE Q ss_conf 99882899707988889999999999997408998338999997327411587556748622699999999870983699 Q gi|254781037|r 1 MEKKPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCA 80 (307) Q Consensus 1 M~~~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiA 80 (307) |+.+.|+||||+|+||++|+++|.+.|++.| |+++|+|..++|+||+++++||++||+|++||||||++|.+|.+||+ T Consensus 3 ~~~~~irIGtRKSkLAvIQs~~v~~~Lek~Y--P~l~f~I~t~~T~GDkIl~k~L~~ig~KsLfTkELE~aL~~~~~div 80 (320) T KOG2892 3 PRTAVIRIGTRKSKLAVIQSYHVREKLEKKY--PELAFEIITMSTTGDKILSKPLAKIGGKSLFTKELEDALINGHVDIV 80 (320) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHHHC--CCCEEEEEEECCCCHHHHHCHHHHHCCCCHHHHHHHHHHHCCCCCEE T ss_conf 8654897147666101230799999998659--88635799831433677614476626541007889999853985289 Q ss_pred EECCCCCCCCCCCCCEEEEECCCCCCCEEEEECC---CCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHH Q ss_conf 7403432111012210122236568310566524---3243324567322047530288899842377310330767999 Q gi|254781037|r 81 VHSAKDMPTKLLKGLQISAYLPREDIRDVFISHT---AQSLKDLALHSVIGTSSLRRKALLLRWRSDISVIDFRGKIETR 157 (307) Q Consensus 81 VHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~---~~~l~~lp~~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TR 157 (307) |||||||||.+|+|+.|+|++||+||+|+||.+. |+++.+||+|++|||||+||+||+++.||+|.|.++|||++|| T Consensus 81 VHSLKDlPT~LP~G~~ig~i~kRedp~DalV~~~~s~~kTladLP~GsVVGTSSlRR~Aql~RkyPHL~f~~~rgn~~tr 160 (320) T KOG2892 81 VHSLKDLPTYLPEGMIIGCILKREDPRDALVFLTYSAYKTLADLPAGSVVGTSSLRRKAQILRKYPHLHVESIRGNLQTR 160 (320) T ss_pred EEECCCCCCCCCCCCEECCCCCCCCCCCEEEEECCCCCCCHHHCCCCCEECHHHHHHHHHHHHHCCCEEEEEECCHHHHH T ss_conf 87512363218997375032577786531787144332558668998631418876899987548961899834708777 Q ss_pred HHHHHCCC-CCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 99863787-65563014545421621122233686673585432078999852761245567762397888887889999 Q gi|254781037|r 158 LNKLNNNK-AHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQELVKVINHEDTWDSVNCERAF 236 (307) Q Consensus 158 l~Kl~~g~-~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~il~~Ind~~T~~~v~aER~f 236 (307) ++||+.+. |-+||||.|||.|+++.+++.+.|.+++|+||+||||++|+||++|..+..++..+||.+|..|+.+||+| T Consensus 161 l~kld~~~~~~~iilA~agl~rmg~~~ri~qil~p~~~~~avgqgalgie~r~~d~~m~~~l~~l~~~~t~~~~~~eraf 240 (320) T KOG2892 161 LSKLDGGKPFQAIILALAGLKRMGWTNRIAQILHPDEMLYAVGQGALGIECRADDDKMLTYLASLNDLNTTLRILAERAF 240 (320) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCEEEEEEECCHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 77625898402389988889874167677776185889987435306788705817799999873183778999999999 Q ss_pred HHHHCCCCCCCEEEEEEEECCEEEEEEEEECCCCCEEEEEEEECCHHHHH--HH-HHHHHHHHHHHCCHHHCC Q ss_conf 98717764885289999999989999999869987899999843988899--99-999999999706056337 Q gi|254781037|r 237 LAELDGSCKSAIAGFAYCKGSTLYFYGIVLASDGKIFHEVSRNGNRCDAI--NI-GRDAAQYIRFISNKNVFN 306 (307) Q Consensus 237 L~~L~GgC~~PIGa~A~i~~~~l~l~a~v~s~DG~~~i~~~~~g~~~~a~--~L-G~~la~~Ll~~g~k~iLn 306 (307) ++.|+|||++||++|+..+++++.|+|.|+|.||++.+..+..+.+.+.. .. |+++++.|...+++.+++ T Consensus 241 lrtl~Ggcs~piav~s~~~~g~l~L~g~v~s~dg~~~i~~~~~~~~~~~~~~~~~~~~a~~~L~~~~~~~~~~ 313 (320) T KOG2892 241 LRTLEGGCSVPIAVYSSKKEGNLILTGLVWSLDGTKVIETSRKGPIVYEKMVKMGGKEAGQQLLSRTAPGIFG 313 (320) T ss_pred HHHHCCCCCCCEEEEEECCCCCEEEEEEEECCCCCHHHHHHHHCCHHHHHHHHHCCCCCCHHHHHHCCCCCCC T ss_conf 9860798667335675225775799854764665267887762234555443221454238888524578888 No 6 >pfam01379 Porphobil_deam Porphobilinogen deaminase, dipyromethane cofactor binding domain. Probab=100.00 E-value=0 Score=581.94 Aligned_cols=213 Identities=46% Similarity=0.673 Sum_probs=209.0 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 28997079888899999999999974089983389999973274115875567486226999999998709836997403 Q gi|254781037|r 5 PFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHSA 84 (307) Q Consensus 5 ~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHSl 84 (307) +|+||||+|+||+|||++|.++|++. +|++++|+++|+|+||++++.||+++||||+||||||++|++|+|||||||| T Consensus 1 tIrIgtR~S~LAl~Qa~~v~~~L~~~--~p~~~~ei~~i~T~GD~~~~~~l~~~ggkG~Ftkele~aLl~~~iDiAVHS~ 78 (213) T pfam01379 1 TIRIGTRKSKLALIQTNHVIEKLKEL--CPDIEVEIVIIKTTGDKILDKPLAKIGGKGLFTKELEEALLNGEIDLAVHSL 78 (213) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHH--CCCCCEEEEEEECCCCCCCCCCCCCCCCCEEEHHHHHHHHHCCCCCEEEEEC T ss_conf 96999589999999999999999986--8998689999978784436665311698414269999999709878787604 Q ss_pred CCCCCCCCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCC Q ss_conf 43211101221012223656831056652432433245673220475302888998423773103307679999986378 Q gi|254781037|r 85 KDMPTKLLKGLQISAYLPREDIRDVFISHTAQSLKDLALHSVIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNNN 164 (307) Q Consensus 85 KDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~g 164 (307) ||||+++++||.|+||++|+|||||||+++++++.+||+|++|||||+||++|++++|||++++++||||+|||+||++| T Consensus 79 KDlP~~~~~~l~i~avl~R~d~rD~Li~~~~~~l~~lp~ga~IGTSS~RR~aql~~~~pdl~~~~iRGNV~TRl~KL~~g 158 (213) T pfam01379 79 KDVPTELPEGLVLGAIPKREDPRDALVSRNGKSLEDLPEGSVVGTSSLRRSAQLKRKRPDLKFEPLRGNVDTRLRKLDEG 158 (213) T ss_pred CCCCCCCCCCCEEEEEECCCCHHHHHHCCCCCCHHHCCCCCEEECCCHHHHHHHHHHCCCCCEEECCCCHHHHHHHHHCC T ss_conf 65776479874687664478866644302676665677676553253779999998689984677668889999997578 Q ss_pred CCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHHHH Q ss_conf 7655630145454216211222336866735854320789998527612455677 Q gi|254781037|r 165 KAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQELVK 219 (307) Q Consensus 165 ~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~il~ 219 (307) +|||+|||+|||+|||+.++++++|++++|+|||||||||||||++|.++.++++ T Consensus 159 ~~DaiiLA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaie~r~~d~~i~~llk 213 (213) T pfam01379 159 EYDAIILAAAGLKRLGLEDRITQSLPPEDMLPAVGQGALAIECRKGDEEILALLK 213 (213) T ss_pred CCCEEEEHHHHHHHCCCCCCCCEECCHHHCCCHHHHHHEEEEEECCCHHHHHHHC T ss_conf 9879863676786668731140671877648837430369988169899999749 No 7 >PRK01066 porphobilinogen deaminase; Provisional Probab=100.00 E-value=0 Score=580.87 Aligned_cols=219 Identities=32% Similarity=0.403 Sum_probs=212.1 Q ss_pred CCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEE Q ss_conf 99882899707988889999999999997408998338999997327411587556748622699999999870983699 Q gi|254781037|r 1 MEKKPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCA 80 (307) Q Consensus 1 M~~~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiA 80 (307) |++|+||||||+|+||+|||++|.++|++.+ |++++||++|+|+||+++++||+++||||+||||||++|++|+||+| T Consensus 16 ~~kk~IrIGTR~S~LAl~Qa~~V~~~L~~~~--p~~~~ei~~i~T~GD~~~~~~L~~iggkG~Ftkele~aLl~g~iDiA 93 (234) T PRK01066 16 LGKRPLRIASRKSPLAKAQVHECLRLLRSFF--PKLWFQISTTTTQGDLDQKTPLHLVENTGFFTDDVDFLVLSGKCDLA 93 (234) T ss_pred HCCCCEEEEECCCHHHHHHHHHHHHHHHHHC--CCCEEEEEEEECCCCCCCCCCHHHCCCCCEEHHHHHHHHHCCCCCEE T ss_conf 0688469995887999999999999999768--99739999996337314577687748974548999999976987789 Q ss_pred EECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHH Q ss_conf 74034321110122101222365683105665243243324567322047530288899842377310330767999998 Q gi|254781037|r 81 VHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTAQSLKDLALHSVIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNK 160 (307) Q Consensus 81 VHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~K 160 (307) ||||||||+ |+|+.|+|+++|+|||||||++++.++.+||+|++|||||+||++||+++|||++|+++||||+|||+| T Consensus 94 VHSlKDlP~--~~~l~i~av~~R~d~rDvLv~~~~~~l~~lp~~a~IGTSS~RR~aql~~~~pdl~i~~iRGNI~TRl~K 171 (234) T PRK01066 94 IHSAKDLPE--PPKLTVVAITAGLDPSDLLVYAEKYLLQPLPKRPRIGSSSLRREELLKLLFPSGIILDIRGTIEERLKL 171 (234) T ss_pred EEECCCCCC--CCCCEEEEEECCCCHHHHHHCCCCCCCCCCCCCCEEECCCHHHHHHHHHHCCCCCEEECCCCHHHHHHH T ss_conf 871578899--999768999657898888760467641228866877316661999999878999789640899999998 Q ss_pred HHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHHHHHHCCHHH Q ss_conf 637876556301454542162112223368667358543207899985276124556776239788 Q gi|254781037|r 161 LNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQELVKVINHEDT 226 (307) Q Consensus 161 l~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~il~~Ind~~T 226 (307) |.+|+|||||||+|||+|||+...+++++++ +++|||||||||||++|.++.+++++|||.++ T Consensus 172 l~~g~~DaiILA~AGL~RLgl~~~~~~~lp~---~~~pgQGalaIe~R~dD~~~~~ll~~ln~~e~ 234 (234) T PRK01066 172 LERGKYDAIVVAKAAVLRLGLRLPYTIELPP---PYHPLQGRLAITAKKHIAKWKTLFLPLNITEN 234 (234) T ss_pred HCCCCCCEEEHHHHHHHHCCCCCCCEEECCC---CCCCCCCCEEEEECCCCHHHHHHHHHHCCCCC T ss_conf 6178989533898999875897754585589---88788682889980797999999997378798 No 8 >pfam03900 Porphobil_deamC Porphobilinogen deaminase, C-terminal domain. Probab=99.82 E-value=7.7e-20 Score=148.79 Aligned_cols=74 Identities=32% Similarity=0.464 Sum_probs=72.0 Q ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEEEEECCEEEEEEEEECCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHC Q ss_conf 88878899999871776488528999999998999999986998789999984398889999999999999706 Q gi|254781037|r 227 WDSVNCERAFLAELDGSCKSAIAGFAYCKGSTLYFYGIVLASDGKIFHEVSRNGNRCDAINIGRDAAQYIRFIS 300 (307) Q Consensus 227 ~~~v~aER~fL~~L~GgC~~PIGa~A~i~~~~l~l~a~v~s~DG~~~i~~~~~g~~~~a~~LG~~la~~Ll~~g 300 (307) ++||.|||+||+.|+|||++|||+||++++++|+|+++|+|+||+++++.+.+++.+++.+||+++|++|+++| T Consensus 1 alcv~AER~fl~~L~GgC~~PIga~A~i~~~~i~l~~~v~s~DG~~~~~~~~~g~~~~a~~lG~~lA~~Ll~~G 74 (74) T pfam03900 1 ALCVLAERAFLRELEGGCQVPIGAYAVLKDGNLRLKGLVGSPDGTRVFEIEVRGDIEDAEELGKKLAEELLAQG 74 (74) T ss_pred CEEHHHHHHHHHHHCCCCCCCEEEEEEEECCEEEEEEEEECCCCCEEEEEEEEECHHHHHHHHHHHHHHHHHCC T ss_conf 95649899999986899998519999998999999999998989889999999858999999999999999683 No 9 >PRK09861 cytoplasmic membrane lipoprotein-28; Provisional Probab=96.66 E-value=0.064 Score=32.78 Aligned_cols=195 Identities=11% Similarity=0.119 Sum_probs=104.6 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEE- Q ss_conf 8828997079888899999999999974089983389999973274115875567486226999999998709836997- Q gi|254781037|r 3 KKPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAV- 81 (307) Q Consensus 3 ~~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAV- 81 (307) .++|+||+-..+-+-+- +.+...+++- .++++|++.+. |-.+ -..||-+|+||.-. T Consensus 31 ~k~ikVG~~~~p~~~i~-e~~~~~~~ek---~G~~leiv~Fs---Dy~~----------------PN~AL~~G~iDaN~f 87 (272) T PRK09861 31 AKHIKVGVINGAEQDVA-EVAKKVAKEK---YGLDVELVGFS---GSLL----------------PNDATNHGELDANVF 87 (272) T ss_pred CCEEEEEECCCCHHHHH-HHHHHHHHHH---CCCEEEEEEEC---CCCC----------------HHHHHHCCCCCCHHH T ss_conf 97389996789869999-9999888761---79768999946---8622----------------628976798360245 Q ss_pred -EC--CCCCCCCCCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEECCCC----CHHHHHHHH------HC----- Q ss_conf -40--3432111012210122236568310566524324332456732204753----028889984------23----- Q gi|254781037|r 82 -HS--AKDMPTKLLKGLQISAYLPREDIRDVFISHTAQSLKDLALHSVIGTSSL----RRKALLLRW------RS----- 143 (307) Q Consensus 82 -HS--lKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS~----RR~aql~~~------~p----- 143 (307) |- |+..- -..|..+.++-+-.-.-=.+-|++++++++||.|++|+-++- -|.-.++.. .| T Consensus 88 QH~pyL~~~n--~~~g~~L~~v~~~~~~P~glYS~K~ksl~dlp~Ga~IaIPnDpsN~~RAL~lL~~aGLI~Lk~~~~~~ 165 (272) T PRK09861 88 QHRPFLEQDN--QAHGYKLVAVGNTFVFPMAGYSKKIKTVAQIKEGATVAIPNDPTNLGRALLLLQKEKLITLKEGKGLL 165 (272) T ss_pred HHHHHHHHHH--HHCCCCEEEEEEEEEEEEEECCCCCCCHHHCCCCCEEEECCCHHHHHHHHHHHHHCCCEEECCCCCCC T ss_conf 5799999999--98699579973167830140344659875847999898047812699999999988978977999977 Q ss_pred -----------CCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEEC--- Q ss_conf -----------773103307679999986378765563014545421621122233686673585432078999852--- Q gi|254781037|r 144 -----------DISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHV--- 209 (307) Q Consensus 144 -----------~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~--- 209 (307) +++|+++...-- -+-|++.++|+.|.-..=...-|+...-...+-.+. . .+=+..|.+|+ T Consensus 166 ~t~~DI~~NPk~lk~~e~~aaql--~rsl~Dp~vD~avin~n~a~~agl~p~~dal~~E~~--~--~py~niiavr~~~~ 239 (272) T PRK09861 166 PTALDITDNPRHLQIMELEGAQL--PRVLDDPKVDVAIISTTYIQQTGLSPVHDSVFIEDK--N--SPYVNILVAREDNK 239 (272) T ss_pred CCHHHHHCCCCCCEEEECCHHHH--HHHHCCCCCCEEEECHHHHHHCCCCCCCCCEEECCC--C--CCEEEEEEECCCCC T ss_conf 77445760887767998267774--675057761689976057988777944002036689--9--98379999827676 Q ss_pred CCCHHHHHHHHHCCHHHHH Q ss_conf 7612455677623978888 Q gi|254781037|r 210 NNPKAQELVKVINHEDTWD 228 (307) Q Consensus 210 ~d~~i~~il~~Ind~~T~~ 228 (307) +|+.+..+.+....++... T Consensus 240 d~~~ik~lv~a~~S~evk~ 258 (272) T PRK09861 240 NAENVKEFLQSYQSPEVAK 258 (272) T ss_pred CCHHHHHHHHHHCCHHHHH T ss_conf 9989999999977999999 No 10 >PRK10797 glutamate and aspartate transporter subunit; Provisional Probab=96.33 E-value=0.1 Score=31.47 Aligned_cols=184 Identities=18% Similarity=0.206 Sum_probs=96.8 Q ss_pred CCCEEEEECCCHH--------------HHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHH Q ss_conf 8828997079888--------------89999999999997408998338999997327411587556748622699999 Q gi|254781037|r 3 KKPFRIGTRCSPL--------------ALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEI 68 (307) Q Consensus 3 ~~~i~IgtR~S~L--------------Al~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkel 68 (307) +.+|+||++.+-= ..=-++.+++.|...-+.|+++++++++.+ ..+ T Consensus 39 ~G~l~vG~~~~~pPf~~~d~~g~~vGfdvDla~~ia~~l~~~l~~~~~~v~~v~~~~--------------------~~r 98 (302) T PRK10797 39 NGVIVVGHRESSVPFSYYDNQQKVVGYSQDYSNAIVEAVKKKLNKPDLQVKLIPITS--------------------QNR 98 (302) T ss_pred CCEEEEEECCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCH--------------------HHH T ss_conf 998999989998997168999985154179999999999877446773499997578--------------------887 Q ss_pred HHHHHCCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCC--CCCCCCCCCCEE----ECCCCCHHHHHH--- Q ss_conf 99987098369974034321110122101222365683105665243--243324567322----047530288899--- Q gi|254781037|r 69 EKKLISGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTA--QSLKDLALHSVI----GTSSLRRKALLL--- 139 (307) Q Consensus 69 e~aLl~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~--~~l~~lp~~a~I----GTsS~RR~aql~--- 139 (307) -.+|.+|++|++++++=.-|. ..+...++ .+.=...--++|.++. +++.||. |-+| ||++- ..+. T Consensus 99 ip~L~~gk~Di~~~~~t~T~e-R~k~vdFS-~pY~~~~~~llv~k~s~Iks~~DL~-GK~V~V~~GTt~e---~~~~~~~ 172 (302) T PRK10797 99 IPLLQNGTFDFECGSTTNNLE-RQKQAAFS-DTIFVVGTRLLTKKGGDIKDFADLK-GKAVVVTSGTTSE---VLLNKLN 172 (302) T ss_pred HHHHHCCCCCEEEECCCCCHH-HHCCEEEC-CCCEECCEEEEEECCCCCCCHHHHC-CCEEEEECCCHHH---HHHHHHH T ss_conf 799978980889735756977-74124214-6627714689997788666756608-9979996787399---9999866 Q ss_pred -HHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHH---H-HHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHH Q ss_conf -84237731033076799999863787655630145---4-542162112223368667358543207899985276124 Q gi|254781037|r 140 -RWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYA---G-IKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKA 214 (307) Q Consensus 140 -~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~A---g-l~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i 214 (307) ...++.+++....+ .+-+..|.+|+.||.+.-.+ | ..+-.-.+ ..+.+. ....+--+||-.|++|++. T Consensus 173 ~~~~~~~~iv~~~~~-~~a~~~L~~GrvDA~~~D~~~l~~~~~~~~~~~-~~~iv~-----~~~~~e~ygia~rKgd~~L 245 (302) T PRK10797 173 EEQKMNMRIISAKDH-GDSFRTLESGRAVAFMMDDALLAGERAKAKKPD-NWEIVG-----KPQSQEAYGCMLRKDDPQF 245 (302) T ss_pred HHHCCCCEEEECCCH-HHHHHHHHCCCCCEEEECHHHHHHHHHHCCCCC-CEEEEC-----CCCCCCCEEEEEECCCHHH T ss_conf 750677237850888-999999985995099812298889998636997-549937-----8677760799996899999 Q ss_pred HHHHH Q ss_conf 55677 Q gi|254781037|r 215 QELVK 219 (307) Q Consensus 215 ~~il~ 219 (307) ++.+. T Consensus 246 ~~~Vn 250 (302) T PRK10797 246 KKLMD 250 (302) T ss_pred HHHHH T ss_conf 99999 No 11 >PRK11260 cystine transporter subunit; Provisional Probab=95.85 E-value=0.11 Score=31.21 Aligned_cols=145 Identities=14% Similarity=0.092 Sum_probs=76.6 Q ss_pred HHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECC----CCCCCCCCCCCEEECCCCCH-HHHHHH Q ss_conf 9999998709836997403432111012210122236568310566524----32433245673220475302-888998 Q gi|254781037|r 66 EEIEKKLISGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHT----AQSLKDLALHSVIGTSSLRR-KALLLR 140 (307) Q Consensus 66 kele~aLl~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~----~~~l~~lp~~a~IGTsS~RR-~aql~~ 140 (307) .++-.+|.+|++|++++++---|. ....+.+. .+.-...--+++.++ ..+++||. |.+||+..--- ..+++. T Consensus 88 ~~~i~~L~~g~~Di~~~~~t~T~e-R~~~~~Fs-~PY~~~~~~~~v~~~~~~~i~~~~dL~-Gk~V~v~~gs~~~~~l~~ 164 (264) T PRK11260 88 DGMLASLDSKRIDVVINQVTISDE-RKKKYDFS-TPYTVSGIQALVKKGNEGTIKTAADLK-GKKVGVGLGTNYEEWLRQ 164 (264) T ss_pred HHHHHHHHCCCCCEEECCCCCCHH-HHHHCCCC-CCEEEEEEEEEEECCCCCCCCCHHHHC-CCEEEEECCCHHHHHHHH T ss_conf 999999866994667257535888-88517767-743774459999878976668978966-871688569789999986 Q ss_pred HHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHHH Q ss_conf 423773103307679999986378765563014545421621122233686673585432078999852761245567 Q gi|254781037|r 141 WRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQELV 218 (307) Q Consensus 141 ~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~il 218 (307) ..|+.+++.+. |..+.+..|.+|+.||++.-......+--.......+. .++...--.++-.+++|+++.+.+ T Consensus 165 ~~~~~~i~~~~-~~~~~~~~l~~G~vDa~i~d~~~~~~~~~~~~~~~~~~----~~~~~~~~~~iav~k~~~~L~~~i 237 (264) T PRK11260 165 NVQGVDVRTYD-DDPTKYQDLRVGRIDAILVDRLAALDLVKKTNDTLAVT----GEAFSRQESGVALRKGNPDLLKAV 237 (264) T ss_pred HCCCCCEEEEC-CHHHHHHHHHCCCCCEEEECHHHHHHHHHHCCCCEEEC----CCCCCCCEEEEEEECCCHHHHHHH T ss_conf 08898168636-61899999875986799945999999998689978980----687666508999989999999999 No 12 >PRK10859 putative transglycosylase; Provisional Probab=95.16 E-value=0.3 Score=28.46 Aligned_cols=146 Identities=14% Similarity=0.126 Sum_probs=86.7 Q ss_pred CCCEEEEECCCHHHHHHH--------HHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHC Q ss_conf 882899707988889999--------999999997408998338999997327411587556748622699999999870 Q gi|254781037|r 3 KKPFRIGTRCSPLALAHA--------FETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLIS 74 (307) Q Consensus 3 ~~~i~IgtR~S~LAl~Qa--------~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~ 74 (307) .-.|||+||.|+..-.+. .+.++...+. -+++.++++..+ ..++=.+|-+ T Consensus 42 rG~LrV~T~~spttYy~~~~~~~GFEYELak~FA~~---LGV~L~i~~~~~-------------------~~el~~~L~~ 99 (507) T PRK10859 42 RGELRVSTIHSPLTYSTINGKPFGLDYELAKQFADY---LGVKLKVTVRQN-------------------ISQLFDDLDN 99 (507) T ss_pred CCEEEEEEECCCCEEEECCCCCCCHHHHHHHHHHHH---CCCEEEEEECCC-------------------HHHHHHHHHC T ss_conf 896899997589659941996477999999999998---099179997499-------------------9999999867 Q ss_pred CCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCC----CCCCCCCCCC-EE--ECCCCCHHHHHHHH-HCCCC Q ss_conf 98369974034321110122101222365683105665243----2433245673-22--04753028889984-23773 Q gi|254781037|r 75 GEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTA----QSLKDLALHS-VI--GTSSLRRKALLLRW-RSDIS 146 (307) Q Consensus 75 g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~----~~l~~lp~~a-~I--GTsS~RR~aql~~~-~p~l~ 146 (307) |++|||...|- ...+....+. .-++=-...-.||.+.+ +++.||.... .| |+|-..+-..++.. .|++. T Consensus 100 G~~DiaAAgLt-~t~~R~~~~~--~~p~Y~~v~q~LVyr~g~~rP~~l~DL~G~~I~V~~gSsy~e~L~~Lk~~~~P~L~ 176 (507) T PRK10859 100 GNADLLAAGLV-YNSERVKNYQ--PGPTYYSVSQQLVYRVGQYRPRTLGDLTGEQLTVAPGHVVVNDLQTLKETKFPELS 176 (507) T ss_pred CCCCEEECCCC-CCHHHHHCCC--CCCCCCCEEEEEEEECCCCCCCCHHHCCCCEEEECCCCHHHHHHHHHHHCCCCCEE T ss_conf 99888955787-7853674043--56865030039999589989889778379869981797599999998853489715 Q ss_pred CHHH-CCCHHHHHHHHHCCCCCEEEEEH Q ss_conf 1033-07679999986378765563014 Q gi|254781037|r 147 VIDF-RGKIETRLNKLNNNKAHAILLAY 173 (307) Q Consensus 147 ~~~i-RGNv~TRl~Kl~~g~~DaiilA~ 173 (307) .... ..+.+.=|+++.+|++|.+|.-. T Consensus 177 w~~~~~~~teeLL~~V~~g~IdyTVaDS 204 (507) T PRK10859 177 WKVDDKKGSAELLEQVIEGKLDYTIADS 204 (507) T ss_pred EEECCCCCHHHHHHHHHCCCCCEEEECC T ss_conf 9955899999999999759877688554 No 13 >PRK09959 hybrid sensory histidine kinase in two-component regulatory system with EvgA; Provisional Probab=94.84 E-value=0.36 Score=27.90 Aligned_cols=147 Identities=10% Similarity=0.103 Sum_probs=81.0 Q ss_pred HHHHHHHHCCCCEEEEECCCCCCCC-CCCCCEEEEECCCCCCCEEEEECCCC-CCCCCCCCCEEECCC-CCHHHHHHHHH Q ss_conf 9999998709836997403432111-01221012223656831056652432-433245673220475-30288899842 Q gi|254781037|r 66 EEIEKKLISGEIDCAVHSAKDMPTK-LLKGLQISAYLPREDIRDVFISHTAQ-SLKDLALHSVIGTSS-LRRKALLLRWR 142 (307) Q Consensus 66 kele~aLl~g~iDiAVHSlKDlP~~-~~~~l~i~a~~~R~d~rD~lv~~~~~-~l~~lp~~a~IGTsS-~RR~aql~~~~ 142 (307) .+..+++.+|++|+..--. ++. ....+.+.- +-=..| -|+|.++.. ....|..|.+|+..+ ..-..+++..+ T Consensus 352 ~~~~~~l~~g~~D~l~~~~---~s~~r~~~l~fT~-py~~~p-~ViV~r~~~~~~~~l~~gkrvai~~~~~~~~~l~~~y 426 (1197) T PRK09959 352 IHAGTQLNPGGWDILPGAI---YSEDRENNVLFAE-AFITTP-YVFVMQKAPDSEQTLKKGMKVAIPYYYELHSQLKEMY 426 (1197) T ss_pred HHHHHHHHCCCCHHCCCCC---CCCCCCCCCEECC-CCCCCC-EEEEEECCCCCCCCHHCCCEEEECCCCCHHHHHHHHC T ss_conf 9999998769812203235---7854434655413-235687-5999963786301142198899917850689999868 Q ss_pred CCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHHHHH Q ss_conf 377310330767999998637876556301454542162112223368667358543207899985276124556776 Q gi|254781037|r 143 SDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQELVKV 220 (307) Q Consensus 143 p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~il~~ 220 (307) |++.++.+ .|..+-++.+.+|++||.|-....-.-+ +.......+. ....|..-.-.++.-++++++++..++.+ T Consensus 427 P~i~~v~v-~s~~~al~~v~~G~aDa~v~~l~~a~y~-i~~~~~~~L~-~~~~~~~~~~~lafaV~~d~peL~sILnK 501 (1197) T PRK09959 427 PEVEWIKV-DNASAAFHKVKEGELDALVATQLNSRYM-IDHYYPNELY-HFLIPGVPNASLSFAFPRGEPELKDIINK 501 (1197) T ss_pred CCCEEEEE-CCHHHHHHHHHCCCCCEEEEHHHHHHHH-HHHHCCCCCE-EEECCCCCCCCEEEEECCCCHHHHHHHHH T ss_conf 98349980-9999999999778876674244778999-9874566504-87415877632699987998899999999 No 14 >pfam09084 NMT1 NMT1/THI5 like. This family contains the NMT1 and THI5 proteins. These proteins are proposed to be required for the biosynthesis of the pyrimidine moiety of thiamine. They are regulated by thiamine. Probab=94.43 E-value=0.3 Score=28.39 Aligned_cols=121 Identities=21% Similarity=0.141 Sum_probs=69.9 Q ss_pred CCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEE--EEECCCCCCCEEEE Q ss_conf 9833899999732741158755674862269999999987098369974034321110122101--22236568310566 Q gi|254781037|r 34 PPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHSAKDMPTKLLKGLQI--SAYLPREDIRDVFI 111 (307) Q Consensus 34 p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHSlKDlP~~~~~~l~i--~a~~~R~d~rD~lv 111 (307) .++++|++...+.++ .-++|..|++|+++-+.=-+-.....|..+ .+..-..++..+++ T Consensus 19 ~GL~Ve~~~~~~~~~-------------------~~~al~sG~~D~a~~~~~~~~~~~~~g~~~~~i~~~~~~~~~~~~v 79 (216) T pfam09084 19 EGLDVEIVEPADPSD-------------------AVQLVAAGKADFGVSYQPSLLLARAKGLPVVSVAALIQHPPNGLIS 79 (216) T ss_pred CCCCEEEEECCCCHH-------------------HHHHHHCCCCCEEECCCHHHHHHHHCCCEEEEEEECCCCCCEEEEE T ss_conf 498089996688278-------------------9999976986778537199999997898499999643589728999 Q ss_pred ECCC--CCCCCCCCCCEEECCCCC-HHHHHHHHH--CCC---CCHHHCCCHHHHHHHHHCCCCCEEEEEHH Q ss_conf 5243--243324567322047530-288899842--377---31033076799999863787655630145 Q gi|254781037|r 112 SHTA--QSLKDLALHSVIGTSSLR-RKALLLRWR--SDI---SVIDFRGKIETRLNKLNNNKAHAILLAYA 174 (307) Q Consensus 112 ~~~~--~~l~~lp~~a~IGTsS~R-R~aql~~~~--p~l---~~~~iRGNv~TRl~Kl~~g~~DaiilA~A 174 (307) .++. ++++|| +|.+||++..= -...+.+.. -.+ .+.-+.-+-.....-|.+|+.||.++... T Consensus 80 ~~~~~i~s~~DL-kGk~i~~~~~~~~~~~l~~~L~~~Gl~~~dv~~v~~~~~~~~~al~~G~vDa~~~~~~ 149 (216) T pfam09084 80 LKDSGIKSPKDL-KGKRIGYSGSPFEEALLKALLAKAGGDPSDVTLVNVGGTSLSPALLTGKVDAAIGGYY 149 (216) T ss_pred ECCCCCCCHHHH-CCCEEEEECCCHHHHHHHHHHHHCCCCHHHEEEEECCHHHHHHHHHCCCCCEEEECCC T ss_conf 788898997895-8988996379728999999999859998990899678488888985699558996246 No 15 >PRK11063 metQ DL-methionine transporter substrate-binding subunit; Provisional Probab=94.03 E-value=0.26 Score=28.80 Aligned_cols=199 Identities=14% Similarity=0.156 Sum_probs=104.6 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEE- Q ss_conf 8828997079888899999999999974089983389999973274115875567486226999999998709836997- Q gi|254781037|r 3 KKPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAV- 81 (307) Q Consensus 3 ~~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAV- 81 (307) .++|+||.-..+-+-+- +.+...+++- .++++|++.+. |-. ....||-+|+||.-. T Consensus 30 ~~~ikVG~~~gp~~ei~-e~~~~~~~ek---~G~~veiv~Fs---Dy~----------------~pN~AL~~G~iDaN~f 86 (271) T PRK11063 30 PNHIKVGVIVGAEQQVA-EVAQKVAKEK---YGLDVELVTFN---DYV----------------LPNEALSKGDIDANAF 86 (271) T ss_pred CCEEEEEECCCCHHHHH-HHHHHHHHHH---CCCEEEEEEEC---CCH----------------HHHHHHHCCCCCHHHH T ss_conf 98189984689879999-9988888860---79768999936---832----------------1558986798442345 Q ss_pred -EC--CCCCCCCCCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEECCC----CCHHHHHHHH------HCCC--- Q ss_conf -40--343211101221012223656831056652432433245673220475----3028889984------2377--- Q gi|254781037|r 82 -HS--AKDMPTKLLKGLQISAYLPREDIRDVFISHTAQSLKDLALHSVIGTSS----LRRKALLLRW------RSDI--- 145 (307) Q Consensus 82 -HS--lKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS----~RR~aql~~~------~p~l--- 145 (307) |- |.+-- -..|..+.++-+-.-.-=.|-|++++++++||.|++|+-+. .-|.-.++.. .|+. T Consensus 87 QH~~yL~~~n--k~~g~~L~~v~~~~~~P~glYS~K~ksl~elp~Ga~IaIPND~sN~~RAL~lL~~aGLIkLk~~~~~~ 164 (271) T PRK11063 87 QHKPYLDQQI--KDRGYKLVAVGNTFVYPIAGYSKKIKSLDELQDGSQVAVPNDPTNLGRSLLLLQKVGLIKLKDGVGLL 164 (271) T ss_pred HCHHHHHHHH--HHCCCCEEEEEEEEEEEEEEEECCCCCHHHCCCCCEEEECCCCCHHHHHHHHHHHCCCEEECCCCCCC T ss_conf 5799999999--97799579986677841586423658864756999998048856099999999988988977999987 Q ss_pred -CCHHHCCCH----------HHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEEC---CC Q ss_conf -310330767----------9999986378765563014545421621122233686673585432078999852---76 Q gi|254781037|r 146 -SVIDFRGKI----------ETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHV---NN 211 (307) Q Consensus 146 -~~~~iRGNv----------~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~---~d 211 (307) .+.+|..|- ..--+-|++.++|+.+.-..=....|+...-.-.+-.+ .+.| =+.-|.+|+ +| T Consensus 165 ~T~~DI~~Npk~l~~~e~daaql~rsl~D~dv~~aviN~~~a~~agl~p~kdal~~E~--~~~p--y~Nvivvr~~~kd~ 240 (271) T PRK11063 165 PTVLDIVENPKNLKIVELEAPQLPRSLDDAQIALAVINTTYASQIGLTPAKDGIFVED--KDSP--YVNLIVAREDNKDA 240 (271) T ss_pred CCHHHHHCCCCCCEEEECCHHHHHHHCCCCCEEEEEECHHHHHHCCCCCCCCCEEECC--CCCC--EEEEEEECCCCCCC T ss_conf 8966885488774899903776443135765108981477897779893425047778--9998--38999982777799 Q ss_pred CHHHHHHHHHCCHHHHHHH Q ss_conf 1245567762397888887 Q gi|254781037|r 212 PKAQELVKVINHEDTWDSV 230 (307) Q Consensus 212 ~~i~~il~~Ind~~T~~~v 230 (307) +.+..+.+....++...-+ T Consensus 241 ~~ik~lv~~~~S~evk~~I 259 (271) T PRK11063 241 ENVKKFVQAYQSDEVYEAA 259 (271) T ss_pred HHHHHHHHHHCCHHHHHHH T ss_conf 8999999997799999999 No 16 >pfam03180 Lipoprotein_9 NLPA lipoprotein. This family of bacterial lipoproteins contains several antigenic members, that may be involved in bacterial virulence. Their precise function is unknown. However they are probably distantly related to pfam00497 which are solute binding proteins. Probab=93.41 E-value=0.31 Score=28.30 Aligned_cols=193 Identities=17% Similarity=0.169 Sum_probs=101.9 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 89970798888999999999999740899833899999732741158755674862269999999987098369974034 Q gi|254781037|r 6 FRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHSAK 85 (307) Q Consensus 6 i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHSlK 85 (307) |+||.-..|-|-+- +.|...+++. ++++|++.+. |-.+ ...||-+|+||+-. .- T Consensus 1 lkvG~~~~p~~~i~-~~v~~~~~~~----Gi~veiv~F~---Dy~~----------------pN~AL~~GeIDaN~--fQ 54 (236) T pfam03180 1 LKVGATPGPHAEVL-EVAKPLAKKK----GLDLEIVEFT---DYVQ----------------PNTALADGEIDANA--FQ 54 (236) T ss_pred CEEEECCCCHHHHH-HHHHHHHHHC----CCEEEEEEEC---CCCC----------------HHHHHHCCCCCHHH--HC T ss_conf 98964389769999-9999999964----9879999816---8645----------------50797789955244--43 Q ss_pred CCCC----CCCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEECCC----CCHHHHHHHHH------CC----CCC Q ss_conf 3211----101221012223656831056652432433245673220475----30288899842------37----731 Q gi|254781037|r 86 DMPT----KLLKGLQISAYLPREDIRDVFISHTAQSLKDLALHSVIGTSS----LRRKALLLRWR------SD----ISV 147 (307) Q Consensus 86 DlP~----~~~~~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS----~RR~aql~~~~------p~----l~~ 147 (307) -.|- .-..|..+.++-+---.-=.|-|++++++++||.|++|+-+. .-|.-.++... ++ ..+ T Consensus 55 H~~yl~~~n~~~g~~L~~v~~~~~~p~glYS~k~ksl~~lp~Ga~IaIpnD~sN~~RAL~lL~~aGLI~Lk~~~~~~~t~ 134 (236) T pfam03180 55 HLPYLDQFNKEGGLDLVAVGNTHVEPIGLYSKKYKSLSELPDGATIAVPNDPSNEGRALLLLEKAGLIKLKDGAGLLATV 134 (236) T ss_pred CHHHHHHHHHHCCCCEEEEEEEEECCEEEEECCCCCHHHCCCCCEEEECCCCCHHHHHHHHHHHCCCEEECCCCCCCCCH T ss_conf 89999999997799679963046723795433768875857999898158844299999999988988976899987898 Q ss_pred HHHCCCH---------HHH-HHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEEC---CCCHH Q ss_conf 0330767---------999-9986378765563014545421621122233686673585432078999852---76124 Q gi|254781037|r 148 IDFRGKI---------ETR-LNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHV---NNPKA 214 (307) Q Consensus 148 ~~iRGNv---------~TR-l~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~---~d~~i 214 (307) .+|..|- ... -+-| .++|+.|+-..=...-|+...-.-.+.. +..-+=+..+.+|+ +|+++ T Consensus 135 ~DI~~Npk~l~~~ev~a~ql~~~l--~dvD~avin~n~a~~agl~p~~~~l~~e----~~~~~y~n~ivvr~~~~d~~~i 208 (236) T pfam03180 135 KDITENPKNLKIKELEAAQLPRAL--DDVDAAVINTNYALQAGLDPKKDALFEE----DKDSPYVNIIVVREDDKDDPAV 208 (236) T ss_pred HHHHHCCCCCEEEEECHHHHHHHC--CCCCEEEECHHHHHHCCCCHHHHHHHCC----CCCCCEEEEEEEECCCCCCHHH T ss_conf 889728677669991677755421--6657899865679887949212133147----8899837999981777699999 Q ss_pred HHHHHHHCCHHHHHHH Q ss_conf 5567762397888887 Q gi|254781037|r 215 QELVKVINHEDTWDSV 230 (307) Q Consensus 215 ~~il~~Ind~~T~~~v 230 (307) ..+.+....++...-+ T Consensus 209 k~l~e~~~s~~vk~~i 224 (236) T pfam03180 209 KKLVKAYQSEEVKAFI 224 (236) T ss_pred HHHHHHHCCHHHHHHH T ss_conf 9999998799999999 No 17 >cd00134 PBPb Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD. Probab=92.81 E-value=0.85 Score=25.49 Aligned_cols=147 Identities=16% Similarity=0.214 Sum_probs=78.2 Q ss_pred HHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCC---CCCCCCCCCCEEECCCC-CHHHHHHHHH Q ss_conf 9999987098369974034321110122101222365683105665243---24332456732204753-0288899842 Q gi|254781037|r 67 EIEKKLISGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTA---QSLKDLALHSVIGTSSL-RRKALLLRWR 142 (307) Q Consensus 67 ele~aLl~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~---~~l~~lp~~a~IGTsS~-RR~aql~~~~ 142 (307) ++-.+|.+|++|+++.++.--|.. ...+.... +--.. .-+++.+++ .++++|. |.+||+-.- --...+...+ T Consensus 49 ~~~~~l~~g~~D~~~~~~~~t~~R-~~~~~fs~-p~~~~-~~~~~~~~~~~~~~~~dl~-g~~i~v~~g~~~~~~l~~~~ 124 (218) T cd00134 49 GLITALKSGKVDLIAAGMTITPER-AKQVDFSD-PYYKS-GQVILVKKGSPIKSVKDLK-GKKVAVQKGSTAEKYLKKAL 124 (218) T ss_pred HHHHHHHCCCCCEEEEEECCCHHH-HCCCCCCC-CEECC-CEEEEEECCCCCCCHHHHC-CCEEEEECCCHHHHHHHHHH T ss_conf 999998579977688850258566-14145576-44415-7079998689999989977-96267958988999999731 Q ss_pred CCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCC-CCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHHHHHH Q ss_conf 377310330767999998637876556301454542162112-2233686673585432078999852761245567762 Q gi|254781037|r 143 SDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKV-IKEILNIEDFPPSPGQGAICIETHVNNPKAQELVKVI 221 (307) Q Consensus 143 p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~-i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~il~~I 221 (307) |+.++..+- |.+.-+++|..|+.|+.+.-...+..+-.... ....+.+. + ..-.-.+++-.++++.++ +..+ T Consensus 125 ~~~~~~~~~-~~~~~~~~l~~G~vD~~v~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~l---~~~~ 197 (218) T cd00134 125 PEAKVVSYD-DNAEALAALENGRADAVIVDEIALAALLKKHPPELKIVGPS-I--DLEPLGFGVAVGKDNKEL---LDAV 197 (218) T ss_pred CCCEEEEEC-CHHHHHHHHHCCCCEEEEECHHHHHHHHHHCCCCCEEECCC-C--CCCCEEEEEEECCCCHHH---HHHH T ss_conf 575499969-99999999985994699976999999998679981996356-6--767348999974999999---9999 Q ss_pred CCH Q ss_conf 397 Q gi|254781037|r 222 NHE 224 (307) Q Consensus 222 nd~ 224 (307) |.. T Consensus 198 n~~ 200 (218) T cd00134 198 NKA 200 (218) T ss_pred HHH T ss_conf 999 No 18 >pfam00497 SBP_bac_3 Bacterial extracellular solute-binding proteins, family 3. Probab=92.37 E-value=0.84 Score=25.54 Aligned_cols=144 Identities=15% Similarity=0.185 Sum_probs=74.8 Q ss_pred HHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCC-------CCCCCCCCCCEEECCCCCHHHH- Q ss_conf 99999987098369974034321110122101222365683105665243-------2433245673220475302888- Q gi|254781037|r 66 EEIEKKLISGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTA-------QSLKDLALHSVIGTSSLRRKAL- 137 (307) Q Consensus 66 kele~aLl~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~-------~~l~~lp~~a~IGTsS~RR~aq- 137 (307) ++.-.++.+|++|++.+++-.-|.. .+.+.... +--..+.-+++.++. +++.+|. |-+||+..--...+ T Consensus 48 ~~~~~~l~~g~~D~~~~~~~~t~~R-~~~~~~s~-p~~~~~~~~~~~~~~~~~~~~~~~~~dL~-~~~I~v~~g~~~~~~ 124 (224) T pfam00497 48 DGLIPALQSGKIDVIIAGMTITPER-KKKVDFSD-PYYYSGQVLVVRKDSPPKIKSIKDLADLK-GKKVGVQKGTTQEDL 124 (224) T ss_pred HHHHHHHHCCCCCEEECCCCCCHHH-HHHHCCCC-CEEECCEEEEEECCCCCCCCCCCCHHHHC-CCEEEEECCCHHHHH T ss_conf 9999999769987661367788555-10212575-34404717999989876545658989977-977999689779999 Q ss_pred HHHH-HCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHH----HHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCC Q ss_conf 9984-2377310330767999998637876556301454----5421621122233686673585432078999852761 Q gi|254781037|r 138 LLRW-RSDISVIDFRGKIETRLNKLNNNKAHAILLAYAG----IKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNP 212 (307) Q Consensus 138 l~~~-~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Ag----l~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~ 212 (307) +... .++.++..+. +.+.-+++|..|+.|+++.-... +++.+....+ ..|...+.-+++-+++++. T Consensus 125 l~~~~~~~~~i~~~~-~~~~~l~~L~~gr~D~~i~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~k~~~ 195 (224) T pfam00497 125 LKELAPKGAEIVLYD-DQAEALQALAAGRVDAVVADSPVAAYLIKKNPGLNLV--------VGEPLSGEPYGIAVRKGDP 195 (224) T ss_pred HHHHCCCCCEEEEEC-CHHHHHHHHHCCCEEEEEECHHHHHHHHHHCCCCEEE--------ECCCCCCCEEEEEEECCCH T ss_conf 986062445169954-2788888987497689995799999999868997299--------4367776149999979999 Q ss_pred HHHHHHHHHCCH Q ss_conf 245567762397 Q gi|254781037|r 213 KAQELVKVINHE 224 (307) Q Consensus 213 ~i~~il~~Ind~ 224 (307) .+ +..+|.. T Consensus 196 ~l---~~~~n~~ 204 (224) T pfam00497 196 EL---LAAVNKA 204 (224) T ss_pred HH---HHHHHHH T ss_conf 99---9999999 No 19 >COG0834 HisJ ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms] Probab=92.27 E-value=1 Score=25.05 Aligned_cols=153 Identities=16% Similarity=0.137 Sum_probs=89.5 Q ss_pred HHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCC----CCCCCCCC---CCEEECCCCCHHHHH Q ss_conf 99999987098369974034321110122101222365683105665243----24332456---732204753028889 Q gi|254781037|r 66 EEIEKKLISGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTA----QSLKDLAL---HSVIGTSSLRRKALL 138 (307) Q Consensus 66 kele~aLl~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~----~~l~~lp~---~a~IGTsS~RR~aql 138 (307) ..+-.+|..|++|+.+.++=-.| +..+.+.. ..+.-....-+++.+.. ++++||.- +...||+ .+-.... T Consensus 87 ~~~~~~l~~g~~D~~~~~~~~t~-~r~~~~~f-s~py~~~~~~~~~~~~~~~~~~~~~DL~gk~v~v~~gt~-~~~~~~~ 163 (275) T COG0834 87 DGLIPALKAGKVDIIIAGMTITP-ERKKKVDF-SDPYYYSGQVLLVKKDSDIGIKSLEDLKGKKVGVQLGTT-DEAEEKA 163 (275) T ss_pred HHHHHHHHCCCCCEEEECCCCCH-HHHHCCCC-CCCEEECCEEEEEECCCCCCCCCHHHHCCCEEEEECCCH-HHHHHHH T ss_conf 77638987799888985673687-77520056-777365465899976885455898895897699966845-8888776 Q ss_pred HHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHC--CCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHH Q ss_conf 98423773103307679999986378765563014545421--6211222336866735854320789998527612455 Q gi|254781037|r 139 LRWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRL--KKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQE 216 (307) Q Consensus 139 ~~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL--~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~ 216 (307) ....+..+++....+ ..-+..|.+|+.|+.+.-..-+..+ ..............+.+ ...+++..++++ . .+ T Consensus 164 ~~~~~~~~~~~~~~~-~~~~~al~~Gr~Da~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~-~-~~ 237 (275) T COG0834 164 KKPGPNAKIVAYDSN-AEALLALKNGRADAVVSDSAVLAGLKLLKKNPGLYVLLVFPGLS---VEYLGIALRKGD-D-PE 237 (275) T ss_pred HHCCCCCEEEECCCH-HHHHHHHHCCCCCEEEECHHHHHHHHHHHCCCCCCCCCCCCCCC---CCCEEEEECCCC-H-HH T ss_conf 404677449981888-99999998499889970688876666530155653211455665---432000124685-5-99 Q ss_pred HHHHHCCHHHH Q ss_conf 67762397888 Q gi|254781037|r 217 LVKVINHEDTW 227 (307) Q Consensus 217 il~~Ind~~T~ 227 (307) +...+|..-.. T Consensus 238 l~~~~n~~l~~ 248 (275) T COG0834 238 LLEAVNKALKE 248 (275) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 20 >COG1464 NlpA ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism] Probab=91.18 E-value=0.94 Score=25.22 Aligned_cols=196 Identities=16% Similarity=0.127 Sum_probs=92.9 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 82899707988889999999999997408998338999997327411587556748622699999999870983699740 Q gi|254781037|r 4 KPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHS 83 (307) Q Consensus 4 ~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHS 83 (307) ++|+||+..-+=|-+--..+...+++. +++++|+.++ +-.. -..||-+|++|.-. T Consensus 29 ~~I~vg~~~~p~a~ile~~~k~~~~k~----Gi~l~i~~Ft-----DY~~--------------PN~AL~~gdiDaN~-- 83 (268) T COG1464 29 KTIKVGATPGPHAEILEVVVKPALKKK----GLDLKIVEFT-----DYVQ--------------PNEALADGDIDANA-- 83 (268) T ss_pred CCEEEEECCCCHHHHHHHHHHHHHHHC----CCEEEEEEEC-----CCCC--------------HHHHHHCCCCCCHH-- T ss_conf 717996368965999999987788855----9659999814-----8766--------------55787668846001-- Q ss_pred CCCCCC---CCCC-CCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEECCC----CCHHHHHHHHH------C----CC Q ss_conf 343211---1012-21012223656831056652432433245673220475----30288899842------3----77 Q gi|254781037|r 84 AKDMPT---KLLK-GLQISAYLPREDIRDVFISHTAQSLKDLALHSVIGTSS----LRRKALLLRWR------S----DI 145 (307) Q Consensus 84 lKDlP~---~~~~-~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS----~RR~aql~~~~------p----~l 145 (307) +--.|. +.-+ +..+.++-.-.-.-=.+-|++++++++||.|++|+-+- .-|.-.++.-- + .- T Consensus 84 FQH~pyL~~~~k~~~~~Lv~vg~~~i~Pmg~YSkk~ksl~el~~GatIaiPNDpsN~gRAL~lL~~aGLIkLk~~~~~~a 163 (268) T COG1464 84 FQHKPYLDQFNKEHGGKLVAVGNTHIEPMGLYSKKYKSLAELKDGATIAIPNDPTNEGRALLLLQKAGLIKLKDGVNLLA 163 (268) T ss_pred HHCHHHHHHHHHHCCCCEEEEEEEEECCCEECCHHCCCHHHCCCCCEEECCCCCCCHHHHHHHHHHCCCEEECCCCCCCC T ss_conf 21468899999973997898755776142501220086846799998988789873567999999879679768876558 Q ss_pred CCHHHCCCHHHHHHHHHCCCCCEEEEEHH------------HHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEEC---C Q ss_conf 31033076799999863787655630145------------45421621122233686673585432078999852---7 Q gi|254781037|r 146 SVIDFRGKIETRLNKLNNNKAHAILLAYA------------GIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHV---N 210 (307) Q Consensus 146 ~~~~iRGNv~TRl~Kl~~g~~DaiilA~A------------gl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~---~ 210 (307) .+.+|..|- ++|.-.|.||-.|+.| =...-||+..-.-++.++ - ...+=+--|.+|+ + T Consensus 164 T~~DI~eNP----K~lki~EldAaqlpRaLddvD~AvIN~nyA~~AgL~p~kdai~~e~--~-~~spY~Niivvr~~d~d 236 (268) T COG1464 164 TPKDITENP----KNLKIKELEAAQLPRALDDVDAAVINTNYALQAGLNPKKDALFEED--K-DSSPYVNIIVVREEDKD 236 (268) T ss_pred CHHHHHHCC----CCCEEEECCHHHCCCCCCCCCEEEECCHHHHHCCCCCCCCCEECCC--C-CCCCCEEEEEECCCCCC T ss_conf 887886194----4575687056756532035677987430797759991204214146--6-67862479997155668 Q ss_pred CCHHHHHHHHHCCHHHHHHHH Q ss_conf 612455677623978888878 Q gi|254781037|r 211 NPKAQELVKVINHEDTWDSVN 231 (307) Q Consensus 211 d~~i~~il~~Ind~~T~~~v~ 231 (307) ++.+..+++....+++...+. T Consensus 237 ~~~ik~lv~a~qs~evk~~i~ 257 (268) T COG1464 237 DPAVKKLVEAYQSDEVKAFIE 257 (268) T ss_pred CHHHHHHHHHHCCHHHHHHHH T ss_conf 889999999974999999999 No 21 >PRK11553 alkanesulfonate transporter substrate-binding subunit; Provisional Probab=89.32 E-value=1.9 Score=23.29 Aligned_cols=149 Identities=17% Similarity=0.199 Sum_probs=87.8 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 82899707988889999999999997408998338999997327411587556748622699999999870983699740 Q gi|254781037|r 4 KPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHS 83 (307) Q Consensus 4 ~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHS 83 (307) +++|||.-++.+.+.= ..-...+++. +++..++.+.+.+ | -.+-++|..|.+|++ + T Consensus 27 ~~lrIgyq~~~~~~~~-~~~~~~~ek~--~~~~kV~w~~F~~-G------------------~~~~eAl~aG~iD~~--~ 82 (314) T PRK11553 27 EALRIGYQKGSIGLVL-AKSHQLLEKR--FPQTKISWVEFPA-G------------------PQMLEALNVGSIDLG--S 82 (314) T ss_pred CEEEEEECCCCHHHHH-HHHCCHHHHH--CCCCCEEEEECCC-C------------------HHHHHHHHCCCCCEE--E T ss_conf 7599996177415899-9852867661--7898358997897-4------------------899999976997545--1 Q ss_pred CCCCCCCC----CCCCEEEEECCCCCCCEEEEECCC---CCCCCCCCCCEEECC--CCCHHHHHHHH------HCCCCCH Q ss_conf 34321110----122101222365683105665243---243324567322047--53028889984------2377310 Q gi|254781037|r 84 AKDMPTKL----LKGLQISAYLPREDIRDVFISHTA---QSLKDLALHSVIGTS--SLRRKALLLRW------RSDISVI 148 (307) Q Consensus 84 lKDlP~~~----~~~l~i~a~~~R~d~rD~lv~~~~---~~l~~lp~~a~IGTs--S~RR~aql~~~------~p~l~~~ 148 (307) +=|.|... ...+.+.++.+.....++++.+.+ +++.|| +|-+|++. |---...++.+ ..|++++ T Consensus 83 ~g~~p~i~a~a~G~~~~~va~~~~~~~~~~i~V~~~S~I~s~aDL-kGKkVa~~~Gs~~~~~l~~aL~~aGL~~~DV~~v 161 (314) T PRK11553 83 TGDIPPIFAQAAGADLVYVGVEPPKPKAEVILVAENSPIKTVADL-KGHKVAFQKGSSSHNLLLRALRQAGLKFTDIQPT 161 (314) T ss_pred ECCHHHHHHHHCCCCEEEEEEECCCCCCEEEEEECCCCCCCHHHH-CCCEEEEECCCHHHHHHHHHHHHCCCCHHHEEEE T ss_conf 158589999866998699998637886418998489877888893-8998974179737999999999869988991898 Q ss_pred HHCCCHHHHHHHHHCCCCCEEEEEHHHHHHC Q ss_conf 3307679999986378765563014545421 Q gi|254781037|r 149 DFRGKIETRLNKLNNNKAHAILLAYAGIKRL 179 (307) Q Consensus 149 ~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL 179 (307) ++ .-..-..-|.+|+.||.+....-+.+. T Consensus 162 ~l--~p~d~~aAl~~G~VDA~~~w~P~~~~a 190 (314) T PRK11553 162 YL--TPADARAAFQQGNVDAWAIWDPYYSAA 190 (314) T ss_pred EC--CCHHHHHHHHCCCCCEEEECCHHHHHH T ss_conf 45--938899999669978899756789999 No 22 >PRK11480 tauA taurine transporter substrate binding subunit; Provisional Probab=88.77 E-value=2 Score=23.05 Aligned_cols=196 Identities=11% Similarity=0.150 Sum_probs=112.5 Q ss_pred CCEEEEECCC--HHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEE Q ss_conf 8289970798--88899999999999974089983389999973274115875567486226999999998709836997 Q gi|254781037|r 4 KPFRIGTRCS--PLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAV 81 (307) Q Consensus 4 ~~i~IgtR~S--~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAV 81 (307) +.+|||--.| |+.++|++-. +++. .++.++.+.+.+-. ++-+||..|.+||+ T Consensus 23 ~~v~igYq~~~~p~~~aka~g~---~ek~---~G~kV~W~~F~sG~-------------------~~~eAlasG~vDig- 76 (320) T PRK11480 23 VNVTVAYQTSAEPAKVAQADNT---FAKE---SGATVDWRKFDSGA-------------------SIVRALASGDVQIG- 76 (320) T ss_pred CEEEEEEECCCCCHHHHHHCCC---HHHH---CCCCEEEEECCCHH-------------------HHHHHHHCCCCCEE- T ss_conf 4599998069862899986384---5675---19812799779759-------------------99999966997564- Q ss_pred ECCCCCCCCC--CCC--CEEEEECCCCCCCEEEEECCC-CCCCCCCCCCEEECCCCCH--HHHHHHH------HCCCCCH Q ss_conf 4034321110--122--101222365683105665243-2433245673220475302--8889984------2377310 Q gi|254781037|r 82 HSAKDMPTKL--LKG--LQISAYLPREDIRDVFISHTA-QSLKDLALHSVIGTSSLRR--KALLLRW------RSDISVI 148 (307) Q Consensus 82 HSlKDlP~~~--~~~--l~i~a~~~R~d~rD~lv~~~~-~~l~~lp~~a~IGTsS~RR--~aql~~~------~p~l~~~ 148 (307) ++=+.|... ..| +.+..+.....-.+++|.+++ .+..|| +|-+|++.-.-- -..++.+ ..|++++ T Consensus 77 -~~G~~P~~~a~a~g~~ik~v~i~~~~~~~ealvv~~~I~s~~DL-kGKkVav~~gS~~hy~ll~aL~~~Gl~~~dV~iv 154 (320) T PRK11480 77 -NLGSSPLAVAASQQVPIEVFLLASKLGNSEALVVKKTISKPEDL-IGKRIAVPFISTTHYSLLAALKHWGIKPGQVEIV 154 (320) T ss_pred -CCCCCHHHHHHHCCCCEEEEEEECCCCCCEEEEECCCCCCHHHC-CCCEEECCCCCCHHHHHHHHHHHCCCCHHHEEEE T ss_conf -57784799998679985999985478974389956899996884-9998960788731899999999859998990788 Q ss_pred HHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCC--CCCCCCEEEEEEE-----CCCCHHHHHHHHH Q ss_conf 33076799999863787655630145454216211222336866735--8543207899985-----2761245567762 Q gi|254781037|r 149 DFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFP--PSPGQGAICIETH-----VNNPKAQELVKVI 221 (307) Q Consensus 149 ~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~--PA~gQGaIaIe~r-----~~d~~i~~il~~I 221 (307) ++.= ..-..-+..|+.||...=.-.+.++.-..+ ..++-.+.- ..|-=. + ...| ++-+.+..+++.+ T Consensus 155 ~m~p--~d~~aA~~~G~vDa~~~W~P~l~~l~~~G~--vL~~s~~~~~~G~pt~d-~-~vv~~~Fa~e~Pe~V~~flkv~ 228 (320) T PRK11480 155 NLQP--PAIIAAWQRGDIDGAYVWAPAVNALEKDGK--VLTDSEQVGQWGAPTLD-V-WVVRKDFAEKHPEVVKAFAKSA 228 (320) T ss_pred ECCC--HHHHHHHHCCCCCEEEECCHHHHHHHHCCC--EEEECHHHHHCCCCCCC-E-EEECHHHHHHCHHHHHHHHHHH T ss_conf 4492--889999976997879626779999984397--88641664204897455-6-8863999988989999999999 Q ss_pred CCHHHHHHHHHH Q ss_conf 397888887889 Q gi|254781037|r 222 NHEDTWDSVNCE 233 (307) Q Consensus 222 nd~~T~~~v~aE 233 (307) .+...+....-+ T Consensus 229 ~~A~~~~~~npd 240 (320) T PRK11480 229 IDAQQPYIANPD 240 (320) T ss_pred HHHHHHHHHCHH T ss_conf 999999995967 No 23 >smart00062 PBPb Bacterial periplasmic substrate-binding proteins. bacterial proteins, eukaryotic ones are in PBPe Probab=88.34 E-value=2.2 Score=22.86 Aligned_cols=142 Identities=16% Similarity=0.242 Sum_probs=78.3 Q ss_pred HHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCC---CCCCCCCCCCEEECCCCCH-HHHHHHH Q ss_conf 99999987098369974034321110122101222365683105665243---2433245673220475302-8889984 Q gi|254781037|r 66 EEIEKKLISGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTA---QSLKDLALHSVIGTSSLRR-KALLLRW 141 (307) Q Consensus 66 kele~aLl~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~---~~l~~lp~~a~IGTsS~RR-~aql~~~ 141 (307) ++.-.+|.+|++|++.-++---|.. .+.+.... +=-..+-+++.+.+ .++++| .|.+||+..--- ...+... T Consensus 49 ~r~~~~l~~g~~D~~~~~~~~~~~r-~~~~~~s~--p~~~~~~~~~~~~~~~~~~~~dL-~g~~i~~~~g~~~~~~l~~~ 124 (219) T smart00062 49 DNLLTALKSGKIDVVAAGMTITPER-AKQVDFSD--PYYKSGQVILVRKDSPIKSLEDL-KGKKVAVVAGTTGEELLKKL 124 (219) T ss_pred HHHHHHHHCCCCCEEEEECCCCHHH-HHCCCCCC--CEECCCEEEEEECCCCCCCHHHH-CCCEEEEECCCHHHHHHHHH T ss_conf 9999999758977053113578677-50022245--50024447999658997997996-79779995798799999972 Q ss_pred HCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHC----CCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHH Q ss_conf 23773103307679999986378765563014545421----62112223368667358543207899985276124556 Q gi|254781037|r 142 RSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRL----KKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQEL 217 (307) Q Consensus 142 ~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL----~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~i 217 (307) .++.++..+. |.+.-+++|.+|+.|+.|.-...+... +..+... .. .+..-+.-+++-.++++..+.+. T Consensus 125 ~~~~~~~~~~-~~~~~~~~l~~g~vD~~i~~~~~~~~~~~~~~~~~~~~-~~-----~~~~~~~~~~~~~~~~~~~l~~~ 197 (219) T smart00062 125 YPEAKIVSYD-SQAEALAALKAGRADAAVADAPALAALVKQHGLPELKI-VG-----DPLDTPEGYAFAVRKGDPELLDK 197 (219) T ss_pred CCCCEEEEEC-CHHHHHHHHHCCCEEEEEECHHHHHHHHHHCCCCCEEE-EC-----CCCCCCCEEEEEEECCCHHHHHH T ss_conf 8785289859-99999999983987999977999999998489998378-34-----67777641999995999999999 Q ss_pred H Q ss_conf 7 Q gi|254781037|r 218 V 218 (307) Q Consensus 218 l 218 (307) + T Consensus 198 ~ 198 (219) T smart00062 198 I 198 (219) T ss_pred H T ss_conf 9 No 24 >PRK11917 bifunctional adhesin/ABC transporter aspartate/glutamate-binding protein; Reviewed Probab=87.98 E-value=2.3 Score=22.72 Aligned_cols=142 Identities=18% Similarity=0.224 Sum_probs=74.9 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEEEC--CCCCCCEEEEECC--CCCCCCCCCCCEEEC--CCCCHHHH- Q ss_conf 9999999870983699740343211101221012223--6568310566524--324332456732204--75302888- Q gi|254781037|r 65 TEEIEKKLISGEIDCAVHSAKDMPTKLLKGLQISAYL--PREDIRDVFISHT--AQSLKDLALHSVIGT--SSLRRKAL- 137 (307) Q Consensus 65 tkele~aLl~g~iDiAVHSlKDlP~~~~~~l~i~a~~--~R~d~rD~lv~~~--~~~l~~lp~~a~IGT--sS~RR~aq- 137 (307) ...+-.+|.+|++|+.+.++=-- ++......+. .=.+..-+++.++ .++++|| +|.+||+ +|.-.... T Consensus 90 ~~~~~p~L~~gkvD~i~~~~tiT----~eR~k~vdFS~PY~~~~~~ilv~k~~~i~~~~DL-~gk~Vgv~~gst~~~~~~ 164 (259) T PRK11917 90 AKTRGPLLDNGSVDAVIATFTIT----PERKRIYNFSEPYYQDAIGLLVLKEKNYKSLADM-KGANIGVAQAATTKKAIG 164 (259) T ss_pred HHHHHHHHHCCCEEEEEECCCCC----CCHHCCEEECCCEEEEEEEEEEECCCCCCCHHHH-CCCCEEEEECCCHHHHHH T ss_conf 78776898779855998664767----6543057622476996559999899999998995-898179992784699999 Q ss_pred --HHHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHH Q ss_conf --998423773103307679999986378765563014545421621122233686673585432078999852761245 Q gi|254781037|r 138 --LLRWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQ 215 (307) Q Consensus 138 --l~~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~ 215 (307) .+....+.++..+. |...-...|.+|..||.+....-+.-. .......++ +.+-| --.||-.|++|++.. T Consensus 165 ~~~~~~~~~~~~~~~~-~~~~~~~~l~~grvDa~~~d~~~~~~~--~~~~~~~~~-~~~~~----~~~giavrKgd~~l~ 236 (259) T PRK11917 165 EAAKKIGIDVKFSEFP-DYPSIKAALDAKRVDAFSVDKSILLGY--VDDKSEILP-DSFEP----QSYGIVTKKDDPAFA 236 (259) T ss_pred HHHHHCCCCEEEEECC-CHHHHHHHHHCCCEEEEECCHHHHHHH--HHCCCCCCC-CCCCC----CEEEEEEECCCHHHH T ss_conf 9888608851699649-999999999849802998269999999--861980168-88875----258999948998999 Q ss_pred HHHH Q ss_conf 5677 Q gi|254781037|r 216 ELVK 219 (307) Q Consensus 216 ~il~ 219 (307) +.+. T Consensus 237 ~~vn 240 (259) T PRK11917 237 KYVD 240 (259) T ss_pred HHHH T ss_conf 9999 No 25 >COG0715 TauA ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components [Inorganic ion transport and metabolism] Probab=86.09 E-value=2.9 Score=22.05 Aligned_cols=158 Identities=15% Similarity=0.141 Sum_probs=92.0 Q ss_pred HHHHHHCCCCEEEEECCCCCCC--CCCC--CCEEEEECCCCCCCEEEEECCCC---CCCCCCCCCEEECCCCCH-HHHHH Q ss_conf 9999870983699740343211--1012--21012223656831056652432---433245673220475302-88899 Q gi|254781037|r 68 IEKKLISGEIDCAVHSAKDMPT--KLLK--GLQISAYLPREDIRDVFISHTAQ---SLKDLALHSVIGTSSLRR-KALLL 139 (307) Q Consensus 68 le~aLl~g~iDiAVHSlKDlP~--~~~~--~l~i~a~~~R~d~rD~lv~~~~~---~l~~lp~~a~IGTsS~RR-~aql~ 139 (307) ..++|..|.+|+|+...=+.|. .... .+.+.+.+.+.....+++.+... ++.|| +|-+||.+-+-. ...++ T Consensus 74 ~~~~l~~G~~D~a~~~~~~~~~~~~~~~g~pv~~va~~~~~~~~~~i~~~~~~~i~~~adl-kGk~vg~~~~~~~~~~~l 152 (335) T COG0715 74 VLEALAAGALDFAVYYTGDTPLIAAGAAGAPVKVVAALVQNGNGIALLVLKDSGIKSVADL-KGKKVGVPFGGSTSDFLL 152 (335) T ss_pred HHHHHHCCCCCEEEECCCCHHHHHHHCCCCCEEEEEEEEECCCCEEEEECCCCCCCCHHHC-CCCEEEEECCCCCHHHHH T ss_conf 9999976998887423785599996356999799998743797329997388776771333-899899818997079999 Q ss_pred HH-H--CCCCCH---HHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEEC---- Q ss_conf 84-2--377310---3307679999986378765563014545421621122233686673585432078999852---- Q gi|254781037|r 140 RW-R--SDISVI---DFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHV---- 209 (307) Q Consensus 140 ~~-~--p~l~~~---~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~---- 209 (307) +. . -.+... .+.-.-..-..-|..|..||.+.-.--..++.........++.....+ --+..+ +-.++ T Consensus 153 ~~~L~~~Gl~~~dv~~v~~~~~~~~~al~~g~vda~~~~ep~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~ 230 (335) T COG0715 153 RYALAKAGLDPDDVELVNLPPADAVAALAAGQVDAFVVWEPWNAAAEGEGGGRVLLDGADLWG-NHPELV-LVVRKEFIE 230 (335) T ss_pred HHHHHHCCCCHHHCEEEECCHHHHHHHHHCCCCCEEEECCHHHHHHHHCCCCEEEEECCCCCC-CCCEEE-EEECHHHHH T ss_conf 999998699989946873692999999975986889966679899731578579861566676-674269-995588776 Q ss_pred CC-CHHHHHHHHHCCHHHHH Q ss_conf 76-12455677623978888 Q gi|254781037|r 210 NN-PKAQELVKVINHEDTWD 228 (307) Q Consensus 210 ~d-~~i~~il~~Ind~~T~~ 228 (307) .+ ..+..+++.+...--+. T Consensus 231 ~~p~~~~~~v~a~~~a~~~~ 250 (335) T COG0715 231 ANPEAVKAFLKALAKATAWA 250 (335) T ss_pred HCHHHHHHHHHHHHHHHHHH T ss_conf 69999999999999999999 No 26 >CHL00180 rbcR LysR transcriptional regulator; Provisional Probab=85.25 E-value=3.2 Score=21.79 Aligned_cols=178 Identities=19% Similarity=0.203 Sum_probs=90.1 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 82899707988889999999999997408998338999997327411587556748622699999999870983699740 Q gi|254781037|r 4 KPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHS 83 (307) Q Consensus 4 ~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHS 83 (307) -+|+||+=.|- +...--.+...+.+. +|++++++..- -+.++.+.|.+|++|+|+-+ T Consensus 95 g~lrI~~~~s~-~~~~lp~~l~~f~~~--~P~v~i~l~~~--------------------~~~~~~~~l~~~~~D~~i~~ 151 (307) T CHL00180 95 GTLIIGASQTT-GTYLMPRLIGLFRQK--YPQIAVQLQVH--------------------STRRIAWSVANGQIDLAIIG 151 (307) T ss_pred CCCCCCCCCHH-HHHHHHHHHHHHHHH--CCCCCEEEEEC--------------------CHHHHHHHHHCCCEEEEEEC T ss_conf 86010104066-666435889999998--88997278977--------------------99999999987980099975 Q ss_pred CCCCCCCCCCCCEEEEECCCCCCCEEEEECCC-------CCCCCCCCCCEEE---CCCCCHHHH--HHHHHC---CCCCH Q ss_conf 34321110122101222365683105665243-------2433245673220---475302888--998423---77310 Q gi|254781037|r 84 AKDMPTKLLKGLQISAYLPREDIRDVFISHTA-------QSLKDLALHSVIG---TSSLRRKAL--LLRWRS---DISVI 148 (307) Q Consensus 84 lKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~-------~~l~~lp~~a~IG---TsS~RR~aq--l~~~~p---~l~~~ 148 (307) . .+|....+++...-+ .+++--+.+++++ .+++||..-.-|+ .++.||... +..... .+++. T Consensus 152 ~-~~~~~~~~~l~~~~l--~~~~~~lv~~~~hpla~~~~v~~~dL~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (307) T CHL00180 152 G-EVPTELKDNLQVTPY--AEDELALILPKSHPFAKLKKIQKEDLYRLRFIALDSQSTIRKVIDNILIQNGIDSSRFKIE 228 (307) T ss_pred C-CCCCCCCCCEEEEEE--ECCCEEEEECCCCHHHCCCCCCHHHHHCCCEEEECCCCCHHHHHHHHHHHCCCCCCCCEEE T ss_conf 7-788666785499994--3242799973898022399999999817984871799968999999999769984554279 Q ss_pred HHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCC Q ss_conf 3307679999986378765563014545421621122233686673585432078999852761 Q gi|254781037|r 149 DFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNP 212 (307) Q Consensus 149 ~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~ 212 (307) ---+++++=++-+.+|-.=+ +|....+.+---.... ..++++.... +=.+++..+++.. T Consensus 229 ~~~~~~~~l~~~v~~g~Gia-~lP~~~v~~~~~~g~l-~~~~~~~~~~---~r~i~lv~~~~r~ 287 (307) T CHL00180 229 MELNSIEAIKNAVQSGLGAA-FVSVSAIEKELELGTL-HWAKIENITI---KRTLSIITNPNRY 287 (307) T ss_pred EEECCHHHHHHHHHHCCEEE-ECCHHHHHHHHHCCCE-EEEECCCCCC---EEEEEEEEECCCC T ss_conf 99780999999999399499-8259999989877988-9997899985---0499999979597 No 27 >PRK09495 glnH glutamine ABC transporter periplasmic protein; Reviewed Probab=84.61 E-value=3.4 Score=21.61 Aligned_cols=146 Identities=16% Similarity=0.141 Sum_probs=80.3 Q ss_pred CCCEEEEECCC--HHHHHHHH-------HHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHH Q ss_conf 88289970798--88899999-------9999999740899833899999732741158755674862269999999987 Q gi|254781037|r 3 KKPFRIGTRCS--PLALAHAF-------ETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLI 73 (307) Q Consensus 3 ~~~i~IgtR~S--~LAl~Qa~-------~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl 73 (307) .++|+|||..+ |....+.. .+.+++.+. -++++|++++. | .++-.+|. T Consensus 24 ~~~l~Vg~~~~~pPf~f~~~g~~~GfdvDl~~~ia~~---lg~~~e~~~~~-------------------~-~~~i~~l~ 80 (247) T PRK09495 24 DKELVVATDTAFVPFEFKQGDKYVGFDVDLWAAIAKE---LKLDYTLKPMD-------------------F-SGIIPALQ 80 (247) T ss_pred CCEEEEEECCCCCCEEECCCCEEEEEHHHHHHHHHHH---HCCCEEEEECC-------------------H-HHHHHHHH T ss_conf 9949999899879805767992788299999999999---69946999778-------------------8-99999997 Q ss_pred CCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCC---CCCCCCCCCCEEECCCCC-HHHHHHHHHCCCCCHH Q ss_conf 098369974034321110122101222365683105665243---243324567322047530-2888998423773103 Q gi|254781037|r 74 SGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTA---QSLKDLALHSVIGTSSLR-RKALLLRWRSDISVID 149 (307) Q Consensus 74 ~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~---~~l~~lp~~a~IGTsS~R-R~aql~~~~p~l~~~~ 149 (307) .|++|+++.++---|. ...-+.. .-+--..+.-+++.++. .+++||. |-+||.-.-= -..+++...|+.+++. T Consensus 81 ~g~~D~~~~~~~~T~e-R~~~~~F-S~py~~~~~~i~v~~~~~~i~~~~dL~-Gk~v~v~~Gs~~~~~~~~~~~~~~i~~ 157 (247) T PRK09495 81 TKNIDLALAGITITDE-RKKAIDF-SDGYYKSGLLVMVKANNNDIKSVKDLD-GKVVAVKSGTGSVDYAKANIKTKDLRQ 157 (247) T ss_pred CCCCCEEECCCCCCHH-HHHHCCC-CCCCEEEEEEEEEECCCCCCCCHHHCC-CCEEEEECCCHHHHHHHHHCCCCCEEE T ss_conf 6995636466014789-9843334-777077246999989997779824548-988998469589999996388986696 Q ss_pred HCCCHHHHHHHHHCCCCCEEEEEHHH Q ss_conf 30767999998637876556301454 Q gi|254781037|r 150 FRGKIETRLNKLNNNKAHAILLAYAG 175 (307) Q Consensus 150 iRGNv~TRl~Kl~~g~~DaiilA~Ag 175 (307) -.|.+.-+..|.+|+.||.+.-..- T Consensus 158 -~~~~~~~~~aL~~GrvDa~v~d~~~ 182 (247) T PRK09495 158 -FPNIDNAYLELGTGRADAVLHDTPN 182 (247) T ss_pred -CCCHHHHHHHHHCCCEEEEEECHHH T ss_conf -5988999998765873499956899 No 28 >pfam04069 OpuAC Substrate binding domain of ABC-type glycine betaine transport system. Part of a high affinity multicomponent binding-protein-dependent transport system involved in bacterial osmoregulation. This domain is often fused to the permease component of the transporter complex. Family members are often integral membrane proteins or predicted to be attached to the membrane by a lipid anchor. Glycine betaine is involved in protection from high osmolarity environments for example in Bacillus subtilis. The family member OpuBC is closely related, and involved in choline transport. Choline is necessary for the biosynthesis of glycine betaine. L-carnitine is important for osmoregulation in Listeria monocytogenes. Family also contains proteins binding l-proline (ProX), histidine (HisX) and taurine (TauA). Probab=83.56 E-value=3.8 Score=21.32 Aligned_cols=188 Identities=14% Similarity=0.096 Sum_probs=92.6 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 82899707988889999999999997408998338999997327411587556748622699999999870983699740 Q gi|254781037|r 4 KPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHS 83 (307) Q Consensus 4 ~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHS 83 (307) |+|+||+-...=+.+.++.....|++. +++++.+.+.+ .. -+-.+|-+|+||+.... T Consensus 1 ktI~ig~~~w~e~~i~a~i~~~~Le~~----G~~V~~~~~~~---------------~~----~~~~al~~G~iDi~~~~ 57 (256) T pfam04069 1 KTIVIGSKNWTEQEILANIAAQLLEAL----GYVVELVGLGS---------------TA----VLFAALASGDIDLYPEE 57 (256) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHHC----CCCEEEEECCC---------------CH----HHHHHHHCCCCCEEEHH T ss_conf 929992688628999999999999976----98169854787---------------08----99999976997586333 Q ss_pred C-----CCCCCCCCCCCEEE--EECCCCCCCEEEEECCC-------CCCCCCCCCCE---------E----ECCCCCHHH Q ss_conf 3-----43211101221012--22365683105665243-------24332456732---------2----047530288 Q gi|254781037|r 84 A-----KDMPTKLLKGLQIS--AYLPREDIRDVFISHTA-------QSLKDLALHSV---------I----GTSSLRRKA 136 (307) Q Consensus 84 l-----KDlP~~~~~~l~i~--a~~~R~d~rD~lv~~~~-------~~l~~lp~~a~---------I----GTsS~RR~a 136 (307) + .++-..+.++..+. .... ...+..|..+.+ ++++||.+.+. + |.++.+-.. T Consensus 58 w~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~~Vp~~~a~~~~i~si~dL~~~~~~~~~~~~~~~~~~~G~~~~~~~~ 136 (256) T pfam04069 58 WTGTTYEAYKKAVEEKLGLLVLGPLG-AGNTYGLAVPKYVAEKPGIKSISDLAKPADDELGFKGEFIGRPDGWGCTRSTE 136 (256) T ss_pred CCCCHHHHHHHHHHCCCCEEECCCCC-CCCEEEEEECHHHHHHCCCCCHHHHHCCCCCCCCCCCCEECCCCCCCCCHHHH T ss_conf 06625799998751468738613577-78758999858888356998799972742102488865532788854006789 Q ss_pred HHHHHHCCCC-CHHHCCCHHHH----HHHHHCCCCCEEEE--EHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEEC Q ss_conf 8998423773-10330767999----99863787655630--14545421621122233686673585432078999852 Q gi|254781037|r 137 LLLRWRSDIS-VIDFRGKIETR----LNKLNNNKAHAILL--AYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHV 209 (307) Q Consensus 137 ql~~~~p~l~-~~~iRGNv~TR----l~Kl~~g~~Daiil--A~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~ 209 (307) .++..+ +|. +....|+...- -+.+.+|+.+.... .-.-+.+.++. ..=++..+.|++. -+....|+ T Consensus 137 ~~~~~y-Gl~~~~~~~~s~~~~~a~~~~A~~~g~~~v~~~w~p~~~~~~~dl~----~L~Dp~~~~~~~~--~v~~v~~~ 209 (256) T pfam04069 137 GLLKAY-GLDKYELVEGSEAAMDALLYAAIKRGEPDVVYAWTPDWMIKKYDLV----VLEDPKGLFPPAY--NVVPVVRK 209 (256) T ss_pred HHHHHC-CCCCEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHCCEE----ECCCCCCCCCCCC--EEEEEECH T ss_conf 999863-9764012358555459999999984998899963761655427969----9368864579765--58752104 Q ss_pred ----CCCHHHHHHHHHC Q ss_conf ----7612455677623 Q gi|254781037|r 210 ----NNPKAQELVKVIN 222 (307) Q Consensus 210 ----~d~~i~~il~~In 222 (307) +++.+.++|+++. T Consensus 210 ~~~~~~P~~~~~l~~~~ 226 (256) T pfam04069 210 GFAEKHPEVAAFLNKLS 226 (256) T ss_pred HHHHHCHHHHHHHHHCC T ss_conf 67777989999998678 No 29 >TIGR03431 PhnD phosphonate ABC transporter, periplasmic phosphonate binding protein. Note that this model does not identify all phnD-subfamily genes with evident phosphonate context, but all sequences above the trusted context may be inferred to bind phosphonate compounds even in the absence of such context. Furthermore, there is ample evidence to suggest that many other members of the TIGR01098 subfamily have a different primary function. Probab=81.62 E-value=4.4 Score=20.85 Aligned_cols=154 Identities=14% Similarity=0.099 Sum_probs=91.2 Q ss_pred CCEEEEEC---CCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEE Q ss_conf 82899707---988889999999999997408998338999997327411587556748622699999999870983699 Q gi|254781037|r 4 KPFRIGTR---CSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCA 80 (307) Q Consensus 4 ~~i~IgtR---~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiA 80 (307) +++++|-= +..-...+-.-..+.|.+.- +..+|++... | -.++.++|.+|++|+| T Consensus 27 ~~l~~gv~P~~~~~~~~~~~~Pl~~yL~~~l---g~~Ve~~~~~---~----------------y~~~~eal~~g~~Dia 84 (288) T TIGR03431 27 KELNFGIIPTENASDLKQRWEPLADYLSKKL---GVKVKLFFAT---D----------------YAGVIEGMRFGKVDIA 84 (288) T ss_pred CEEEEEEECCCCHHHHHHHHHHHHHHHHHHH---CCCEEEEECC---C----------------HHHHHHHHHCCCCEEE T ss_conf 3689998069999999999999999999987---8978999689---9----------------9999999985983099 Q ss_pred EECCCC-CCCCCCCCCEEEEECCCC--C--CCEEEEECCCC---CCCCCCCCCEEECCCCCH-------HHHHHH---HH Q ss_conf 740343-211101221012223656--8--31056652432---433245673220475302-------888998---42 Q gi|254781037|r 81 VHSAKD-MPTKLLKGLQISAYLPRE--D--IRDVFISHTAQ---SLKDLALHSVIGTSSLRR-------KALLLR---WR 142 (307) Q Consensus 81 VHSlKD-lP~~~~~~l~i~a~~~R~--d--~rD~lv~~~~~---~l~~lp~~a~IGTsS~RR-------~aql~~---~~ 142 (307) .-+--. +-.....+....+...|. . .+-++|.+... +++|| +|-+|+-.++.= ...|.. .. T Consensus 85 ~~~p~~yv~a~~~~~~~~la~~~~~~g~~~y~s~iivr~Ds~i~sl~DL-kGk~iaf~~~~StSG~l~P~~~L~~~~g~~ 163 (288) T TIGR03431 85 WYGPSSYAEAYQKANAEAFAIEVNADGSTGYYSVLIVKKDSPIKSLEDL-KGKTFGFVDPNSTSGFLVPSYYLFKKNGIK 163 (288) T ss_pred EECCHHHHHHHHCCCCEEEEEEECCCCCCCEEEEEEEECCCCCCCHHHH-CCCEEEECCCCCCHHHHHHHHHHHHHCCCC T ss_conf 9884788998652597488998515888645799999899988877895-798766028741111399999999865997 Q ss_pred CCCC--CHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCC Q ss_conf 3773--1033076799999863787655630145454216 Q gi|254781037|r 143 SDIS--VIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLK 180 (307) Q Consensus 143 p~l~--~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~ 180 (307) |+-. -+..-|+-+.-+.-+.+|++|+.......++++- T Consensus 164 ~~~~f~~v~~~gshd~~~~aV~~G~~Dag~~~~~~~~~~~ 203 (288) T TIGR03431 164 PKEYFKKVTFSGSHEAAILAVANGTVDAATTNDENLDRMI 203 (288) T ss_pred HHHHHHHEEECCCHHHHHHHHHCCCCCEEEECHHHHHHHH T ss_conf 2545531331578789999998599329983388899999 No 30 >PRK04694 Maf-like protein; Reviewed Probab=78.41 E-value=0.79 Score=25.69 Aligned_cols=114 Identities=14% Similarity=0.129 Sum_probs=52.5 Q ss_pred EEECCCCCHHHHHHHHHCCCCCHHHCCCHHH-HHHHHHCCCC---CEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCC Q ss_conf 2204753028889984237731033076799-9998637876---55630145454216211222336866735854320 Q gi|254781037|r 126 VIGTSSLRRKALLLRWRSDISVIDFRGKIET-RLNKLNNNKA---HAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQG 201 (307) Q Consensus 126 ~IGTsS~RR~aql~~~~p~l~~~~iRGNv~T-Rl~Kl~~g~~---DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQG 201 (307) +++++|+||+..|..+.-++++.+ =|++. +.......+| -|.-.|.+.+.++.-.+ + +.++=++=| T Consensus 3 ILAS~SprR~~lL~~~gi~f~v~~--~didE~~~~~~~p~~~v~~lA~~KA~~~~~~~~~~~------~-~~~VIgaDt- 72 (190) T PRK04694 3 YLASRSPRRRELLQRLDVPFQTLQ--LDVPEVRAADESPDHYVQRVALEKAHAGLALVQAAD------A-DAIVLGSDT- 72 (190) T ss_pred EECCCCHHHHHHHHHCCCCCEEEC--CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHC------C-CCEEEECCE- T ss_conf 998899999999987899839966--999877687779899999999999999889877538------9-988992382- Q ss_pred EEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCEEEEEE Q ss_conf 78999852761245567762397888887889999987177648852899999999899999 Q gi|254781037|r 202 AICIETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAYCKGSTLYFYG 263 (307) Q Consensus 202 aIaIe~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i~~~~l~l~a 263 (307) |.+. ++ +++.+=.+. .++ +..|+.|.|-.|.=+.+.+-+.++...... T Consensus 73 ---vv~~-~g----~ilgKP~~~---~~A---~~~L~~lsGk~H~v~T~v~l~~~~~~~~~~ 120 (190) T PRK04694 73 ---EVVL-GE----RVFGKPVDV---DDA---IAMLRALSGRTHQVLTAVVLVCAQRAPAQA 120 (190) T ss_pred ---EEEE-CC----EEECCCCCH---HHH---HHHHHHHCCCEEEEEEEEEEEECCCCEEEE T ss_conf ---9998-99----997898999---999---999999789938999999999789836999 No 31 >PRK12684 transcriptional regulator CysB-like protein; Reviewed Probab=78.14 E-value=5.7 Score=20.14 Aligned_cols=133 Identities=14% Similarity=0.094 Sum_probs=60.9 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEE--------EEECCCCCCCEEEEECCCCCCCCCCCCCEEECC---CCC Q ss_conf 999999987098369974034321110122101--------222365683105665243243324567322047---530 Q gi|254781037|r 65 TEEIEKKLISGEIDCAVHSAKDMPTKLLKGLQI--------SAYLPREDIRDVFISHTAQSLKDLALHSVIGTS---SLR 133 (307) Q Consensus 65 tkele~aLl~g~iDiAVHSlKDlP~~~~~~l~i--------~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTs---S~R 133 (307) ..++.+.|.+|++|+|+.+. |...++++.. .++.+.+.| +-.++..++++|.....|... +.| T Consensus 131 ~~~~~~~l~~g~~Dl~i~~~---~~~~~~~l~~~~l~~~~~~~v~~~~hp---la~~~~v~l~dL~~~p~I~~~~~~~~r 204 (313) T PRK12684 131 PTQIAEMVIHDQADLAIATE---AIADYKELVSLPCYQWNHAVVVPPDHP---LLERKPLTLEDLAQYPLITYDDAFAGR 204 (313) T ss_pred HHHHHHHHHCCCCCEEECCC---CCCCCCCEEEEEEEECCEEEEECCCCC---CCCCCCCCHHHHCCCCEEECCCCCCHH T ss_conf 89999999779966550435---777778738998032216899538985---446999999998599989628998589 Q ss_pred HHHH----HHHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEEC Q ss_conf 2888----998423773103307679999986378765563014545421621122233686673585432078999852 Q gi|254781037|r 134 RKAL----LLRWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHV 209 (307) Q Consensus 134 R~aq----l~~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~ 209 (307) +... -..+.|+..+. =+|.++=..-+..|- .-.+++.-.++.. .+.--..++.....| ...+.|..|+ T Consensus 205 ~~i~~~~~~~g~~p~~~~e--~~~~~~i~~~V~~Gl-Gi~ilP~~a~~~~--~~~~L~~lp~~~~~~---~~~~~v~~~k 276 (313) T PRK12684 205 SKINKAFALRGLKPDIVLE--AIDADVIKTYVELGL-GVGIVADMAFDPE--RDRNLRAIPAGHLFG---SNVTRVALKQ 276 (313) T ss_pred HHHHHHHHHCCCCCCEEEE--ECCHHHHHHHHHHCC-CHHHHHHHHHCHH--HCCCEEEEECCCCCC---CEEEEEEEEC T ss_conf 9999999977998747999--997999999999598-5725599984676--389879997846776---6059999979 Q ss_pred CC Q ss_conf 76 Q gi|254781037|r 210 NN 211 (307) Q Consensus 210 ~d 211 (307) +. T Consensus 277 ~~ 278 (313) T PRK12684 277 GA 278 (313) T ss_pred CC T ss_conf 88 No 32 >PRK12683 transcriptional regulator CysB-like protein; Reviewed Probab=74.49 E-value=7.1 Score=19.51 Aligned_cols=136 Identities=11% Similarity=0.050 Sum_probs=63.5 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCC-------CCCCCCCCCCEEEC---CCCCH Q ss_conf 999999987098369974034321110122101222365683105665243-------24332456732204---75302 Q gi|254781037|r 65 TEEIEKKLISGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTA-------QSLKDLALHSVIGT---SSLRR 134 (307) Q Consensus 65 tkele~aLl~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~-------~~l~~lp~~a~IGT---sS~RR 134 (307) +.++.+.|.+|++|+|+-+. |...++++. .++-..++--+++.+++ .++++|...--|.- ++.|+ T Consensus 131 ~~~~~~~L~~g~~Dl~i~~~---~~~~~~~l~--~~~l~~~~~~lv~~~~hpla~~~~v~l~dl~~~p~I~~~~~~~~r~ 205 (308) T PRK12683 131 PQEIAEMLLNGEADIGIATE---ALDREPDLV--SFPYYSWHHVVVVPKGHPLTGRENLTLEALAEYPIITYDQGFTGRS 205 (308) T ss_pred HHHHHHHHHCCCCCEEECCC---CCCCCCCCE--EEEECCCCEEEEECCCCCCCCCCCCCHHHHCCCCEEECCCCCCHHH T ss_conf 69999999779977880666---778887716--8761245158984699953369999999985999896189982999 Q ss_pred HHHHHHHHCC--CCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCC Q ss_conf 8889984237--7310330767999998637876556301454542162112223368667358543207899985276 Q gi|254781037|r 135 KALLLRWRSD--ISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNN 211 (307) Q Consensus 135 ~aql~~~~p~--l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d 211 (307) ...-...+-+ .++.---+++++=..-...|--=+ +|+...++..--.. . ..++.+...|. -.++|-.|++. T Consensus 206 ~i~~~~~~~g~~p~~~~e~~~~~~i~~~V~~g~Gi~-ilp~~~~~~~~~~~-l-v~l~~~~~~~~---~~~~i~~rk~~ 278 (308) T PRK12683 206 HIDQAFAEAGAVPDIVLTALDADVIKTYVELGMGVG-IVAAMAYDPQRDTG-L-VALDTQHLFEA---NTTRVALRRGA 278 (308) T ss_pred HHHHHHHHCCCCCCEEEEECCHHHHHHHHHHCCCHH-HHHHHHHHHHHCCC-E-EEEECCCCCCC---CEEEEEECCCC T ss_conf 999999977999857999998999999999798470-78999987886099-7-99989788766---17999990968 No 33 >pfam03466 LysR_substrate LysR substrate binding domain. The structure of this domain is known and is similar to the periplasmic binding proteins. Probab=71.27 E-value=8.5 Score=19.03 Aligned_cols=175 Identities=20% Similarity=0.110 Sum_probs=90.9 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 88289970798888999999999999740899833899999732741158755674862269999999987098369974 Q gi|254781037|r 3 KKPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVH 82 (307) Q Consensus 3 ~~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVH 82 (307) .-+|+||+=.|-...+=...+.+..++ +|++++++..-. ..++.++|.+|++|+|+- T Consensus 5 ~G~i~I~~~~~~~~~~lp~~l~~f~~~---~P~i~i~i~~~~--------------------~~~i~~~l~~g~~Di~i~ 61 (209) T pfam03466 5 RGRLRIGAPPTFAAYLLPPLLARFRER---YPDVELELREGD--------------------SEELLDLLAEGELDLAIR 61 (209) T ss_pred CEEEEEEEEHHHHHHHHHHHHHHHHHH---CCCCEEEEEECC--------------------CHHHHHHHHCCCCEEEEE T ss_conf 789999938899999999999999998---889289999898--------------------488999998698148873 Q ss_pred CCCCCCCCCCCCCEEEEECCCCCCCEEEEECCC-------CCCCCCCCCCEEE--CCCCCHHHHH---HHHHCCCCCHHH Q ss_conf 034321110122101222365683105665243-------2433245673220--4753028889---984237731033 Q gi|254781037|r 83 SAKDMPTKLLKGLQISAYLPREDIRDVFISHTA-------QSLKDLALHSVIG--TSSLRRKALL---LRWRSDISVIDF 150 (307) Q Consensus 83 SlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~-------~~l~~lp~~a~IG--TsS~RR~aql---~~~~p~l~~~~i 150 (307) .... .++++...-+ -.++.-+++++++ .++++|...--|. .++..+.... .......+..-- T Consensus 62 ~~~~----~~~~~~~~~l--~~~~~~~~~s~~~~l~~~~~i~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (209) T pfam03466 62 RGPP----DDPGLEARPL--FEEPLVLVAPPDHPLAAGEPVSLEDLADEPLILLEPGSGLRDLVDRALERAGLRPRVALE 135 (209) T ss_pred CCCC----CCCCEEEEEE--EEEEEEEEECCCCHHHCCCCCCHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCCCCEEEE T ss_conf 1789----9998089997--624136553367055428975435514665798569987137998778764876302467 Q ss_pred CCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCC Q ss_conf 0767999998637876556301454542162112223368667358543207899985276 Q gi|254781037|r 151 RGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNN 211 (307) Q Consensus 151 RGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d 211 (307) =.+.++-+.-+.+|..=|++-....-+-+. ...+. .++.++..| .-.+++-++++. T Consensus 136 ~~~~~~~~~lv~~g~Gi~~lp~~~~~~~~~-~~~L~-~i~~~~~~~---~~~~~li~~~~~ 191 (209) T pfam03466 136 VNSLEALLAAVAAGLGIALLPRSAVARELA-DGRLV-VLPLPDPPL---PRPIYLVYRKGR 191 (209) T ss_pred ECCHHHHHHHHHHCCEEEECCHHHHHHHHH-CCCEE-EEECCCCCC---CCEEEEEEECCC T ss_conf 478799999999499999913999998875-89989-995888887---447999998949 No 34 >PRK00078 Maf-like protein; Reviewed Probab=68.55 E-value=2.6 Score=22.40 Aligned_cols=29 Identities=31% Similarity=0.485 Sum_probs=21.6 Q ss_pred EEECCCCCHHHHHHHHHCCCCCHHHCCCHHH Q ss_conf 2204753028889984237731033076799 Q gi|254781037|r 126 VIGTSSLRRKALLLRWRSDISVIDFRGKIET 156 (307) Q Consensus 126 ~IGTsS~RR~aql~~~~p~l~~~~iRGNv~T 156 (307) +++++|+||+..|..+..++.+. --|||- T Consensus 4 ILAS~S~~R~~lL~~~g~~f~v~--~~~idE 32 (192) T PRK00078 4 ILASASERRQELLKRIVEDFQVI--VSDFDE 32 (192) T ss_pred EEECCCHHHHHHHHHCCCCCEEE--CCCCCC T ss_conf 99279999999997189997997--199897 No 35 >PRK02676 consensus Probab=63.97 E-value=4.1 Score=21.06 Aligned_cols=34 Identities=18% Similarity=0.104 Sum_probs=21.8 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHH Q ss_conf 673220475302888998423773103307679999 Q gi|254781037|r 123 LHSVIGTSSLRRKALLLRWRSDISVIDFRGKIETRL 158 (307) Q Consensus 123 ~~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl 158 (307) +.-++|++|+||+..|..+.-++.+.+- +||-+. T Consensus 3 ~~iILAS~SprR~~lL~~~g~~f~v~~~--~idE~~ 36 (196) T PRK02676 3 PPFILASASPARRRLLQTVGIDPIVQPS--HFDESQ 36 (196) T ss_pred CCEEEECCCHHHHHHHHHCCCCEEEECC--CCCCCC T ss_conf 9999947999999999977998399789--999888 No 36 >PRK12679 cbl transcriptional regulator Cbl; Reviewed Probab=61.74 E-value=13 Score=17.83 Aligned_cols=161 Identities=11% Similarity=0.027 Sum_probs=74.7 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECC-------CCCCCCCCCCCEEE--CCC-CC Q ss_conf 999999998709836997403432111012210122236568310566524-------32433245673220--475-30 Q gi|254781037|r 64 FTEEIEKKLISGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHT-------AQSLKDLALHSVIG--TSS-LR 133 (307) Q Consensus 64 Ftkele~aLl~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~-------~~~l~~lp~~a~IG--TsS-~R 133 (307) .+.++.+.|.+|++|+++=+. |...++++ ..++-..++..+++.++ ..++++|..-..|. .+| .| T Consensus 130 ~~~~i~~~l~~g~~Dl~i~~~---~~~~~p~l--~~~p~~~~~~~lvvp~~HpLa~~~~vsl~~L~~~plI~~~~g~~~R 204 (316) T PRK12679 130 TPQEIATLLQNGEADIGIASE---RLSNDPQL--VAFPWFRWHHSLLVPLDHPLTQITPLTLESIAKWPLITYRQGITGR 204 (316) T ss_pred CHHHHHHHHHCCCCCEEECCC---CCCCCCCC--EEEEEEECCEEEEECCCCCCCCCCCCCHHHHCCCCEEEECCCCHHH T ss_conf 879999999879988522246---77899760--7998055341356018995124899899998799989508998499 Q ss_pred HHH-HHHH-HHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCC Q ss_conf 288-8998-42377310330767999998637876556301454542162112223368667358543207899985276 Q gi|254781037|r 134 RKA-LLLR-WRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNN 211 (307) Q Consensus 134 R~a-ql~~-~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d 211 (307) +.. ++.. ..=..++.--=++.++=..-...|--=| |++.-..+-.. +..-..++.+. -..+....+-.|++- T Consensus 205 ~~id~~f~~~G~~p~i~~e~~~~~~i~~~V~~GlGva-ilp~~a~~~~~--~~~l~~~~~~~---~~~~~~~~~~~~r~~ 278 (316) T PRK12679 205 SRIDDAFARKGLLADIVLSAQDSDVIKTYVALGLGIG-LVAEQSSGEQE--EKNLIRLDTRH---LFDANTVWLGLKRGQ 278 (316) T ss_pred HHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHCCCEE-ECHHHHCCCCC--CCCEEEECCCC---CCCCCEEEEEEECCC T ss_conf 9999999977999977999998999999999899099-70144507546--89889932634---676653899996787 Q ss_pred ---CHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf ---12455677623978888878899999 Q gi|254781037|r 212 ---PKAQELVKVINHEDTWDSVNCERAFL 237 (307) Q Consensus 212 ---~~i~~il~~Ind~~T~~~v~aER~fL 237 (307) ..+.+.+...+..-|...+ ||+++ T Consensus 279 ~l~~~~~~Fi~~~~~~l~~~~~--~~~~~ 305 (316) T PRK12679 279 LQRNYVWRFLELCNAGLSVEDI--KRQVM 305 (316) T ss_pred CCHHHHHHHHHHHHCCCCHHHH--HHHHH T ss_conf 5569999999975202899999--99987 No 37 >PRK04425 Maf-like protein; Reviewed Probab=61.58 E-value=4.9 Score=20.56 Aligned_cols=31 Identities=26% Similarity=0.294 Sum_probs=21.5 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCHHHCCCHH Q ss_conf 673220475302888998423773103307679 Q gi|254781037|r 123 LHSVIGTSSLRRKALLLRWRSDISVIDFRGKIE 155 (307) Q Consensus 123 ~~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~ 155 (307) -.-+++++|+||+..|..+.-++++ +--||| T Consensus 5 ~~lILAS~SprR~~LL~~~gi~f~v--~~~~id 35 (196) T PRK04425 5 LPLVLGTSSVFRREQMERLGIAFQA--ASPDFD 35 (196) T ss_pred CCEEECCCCHHHHHHHHHCCCCCEE--ECCCCC T ss_conf 8799858999999999878999599--769999 No 38 >pfam05023 Phytochelatin Phytochelatin synthase. Phytochelatin synthase is the enzyme responsible for the synthesis of heavy-metal-binding peptides (phytochelatins) from glutathione and related thiols. The crystal structure of a member of this family shows it to possess a papain fold. The enzyme catalyses the deglycination of a GSH donor molecule. The enzyme contains a catalytic triad of cysteine, histidine and aspartate residues. Probab=61.58 E-value=2.5 Score=22.49 Aligned_cols=66 Identities=21% Similarity=0.360 Sum_probs=47.6 Q ss_pred CCCCCEEEEEEECCCCHH----HHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCEEEEE Q ss_conf 543207899985276124----556776239788888788999998717764885289999999989999 Q gi|254781037|r 197 SPGQGAICIETHVNNPKA----QELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAYCKGSTLYFY 262 (307) Q Consensus 197 A~gQGaIaIe~r~~d~~i----~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i~~~~l~l~ 262 (307) |-++|+-....+.+|.-+ ..+...+++...+..++--|..|..-|+|=.+|||+|-.-.+.-|.|. T Consensus 102 a~~~g~~~~~~~~~~~s~~~FR~~v~~~~~~~~~~vivny~R~~lgQtG~GHfSPIgaY~~~~D~vLilD 171 (208) T pfam05023 102 AKCNGAVVKTFHASDLSLDEFRKHLIRCLNDDGRFVIVNYDRKAIGQTGGGHFSPIGGYHEASDSVLILD 171 (208) T ss_pred HHHCCCEEEEEECCCCCHHHHHHHHHHHHCCCCCEEEEECCCHHHCCCCCCCCCCCCCCCCCCCEEEEEE T ss_conf 9857982699977869899999999998389997899977514317778875263102165678499984 No 39 >TIGR00060 L18_bact ribosomal protein L18; InterPro: IPR004389 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This is the ribosomal protein L18 family. The archaebacterial and eukaryotic type rpL18 is not included in this family.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome. Probab=61.29 E-value=7.1 Score=19.55 Aligned_cols=45 Identities=18% Similarity=0.230 Sum_probs=31.7 Q ss_pred EEEEEEECCCCCEEEE------------EEEECCHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 9999998699878999------------99843988899999999999997060563 Q gi|254781037|r 260 YFYGIVLASDGKIFHE------------VSRNGNRCDAINIGRDAAQYIRFISNKNV 304 (307) Q Consensus 260 ~l~a~v~s~DG~~~i~------------~~~~g~~~~a~~LG~~la~~Ll~~g~k~i 304 (307) +|.++|.++.+...+. .+..++.+++..+|+.+|++++++|-.++ T Consensus 35 hiYAQvIDd~~~~tlas~Stl~~~l~~~~~~~~n~~~a~~vG~~~A~rl~~kgi~~v 91 (118) T TIGR00060 35 HIYAQVIDDVKQVTLASASTLEKKLKEQLKSTGNKDAAKKVGKLVAERLIKKGINDV 91 (118) T ss_pred CEEEEEEECCCCEEEEEHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 178999985888478731556688776510378988999999999999997277226 No 40 >TIGR01027 proB glutamate 5-kinase; InterPro: IPR005715 L-glutamate 5-phosphotransferase, (gamma-glutamyl kinase, proB, 2.7.2.11 from EC), catalyzes the first step in proline biosynthesisATP + L-glutamate = ADP + L-glutamate 5-phosphate. the product of which rapidly cyclises to 5-oxoproline and phosphate. Bacterial ProB proteins hits the full length of this model, but the ProB-like domain of delta 1-pyrroline-5-carboxylate synthetase does not hit the C-terminal 100 residues. ; GO: 0004349 glutamate 5-kinase activity, 0006561 proline biosynthetic process, 0005737 cytoplasm. Probab=60.67 E-value=3.9 Score=21.20 Aligned_cols=129 Identities=17% Similarity=0.198 Sum_probs=68.1 Q ss_pred CCE-EECCCCCHHHHH--HHHHCCCCCHHHCCCHHHHHHHHHC----CCCCEEEEEHH-HHHHCCCCCCCCCCCCHHHCC Q ss_conf 732-204753028889--9842377310330767999998637----87655630145-454216211222336866735 Q gi|254781037|r 124 HSV-IGTSSLRRKALL--LRWRSDISVIDFRGKIETRLNKLNN----NKAHAILLAYA-GIKRLKKEKVIKEILNIEDFP 195 (307) Q Consensus 124 ~a~-IGTsS~RR~aql--~~~~p~l~~~~iRGNv~TRl~Kl~~----g~~DaiilA~A-gl~RL~l~~~i~~~l~~~~~~ 195 (307) |+. +||+-.|=+-+. ...+-.+.+.-.-|+=..-|..+-+ |+-+|+++..- .=.||.- +-.+.+ - T Consensus 207 g~~~~GTGGM~sK~~AA~~A~~~Gv~v~i~~g~~P~~i~~~~~hhfyg~~~GT~F~a~~~~~~L~~--rk~Wi~-----~ 279 (379) T TIGR01027 207 GSSKVGTGGMRSKIEAARLATRAGVPVIIASGSKPEKIADALEHHFYGAPVGTVFEAQEARRKLRA--RKQWIA-----F 279 (379) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHCCCCCCCCEECCCCCCHHHHH--HHHHHH-----H T ss_conf 788756646789999999998689958994488978999999974037787744335776502678--999998-----4 Q ss_pred CCCCCCEEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEE----ECCEEEEEEEEEC---C Q ss_conf 85432078999852761245567762397888887889999987177648852899999----9998999999986---9 Q gi|254781037|r 196 PSPGQGAICIETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAYC----KGSTLYFYGIVLA---S 268 (307) Q Consensus 196 PA~gQGaIaIe~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i----~~~~l~l~a~v~s---~ 268 (307) -++-+|.|-|---... .+.+ +|.--.|.|+..-- .++.+ .+.+ + T Consensus 280 ~~~~~G~i~vD~GA~~----Al~~---------------------~G~SLLp~Gi~~V~G~F~~GdvV----~i~~qrl~ 330 (379) T TIGR01027 280 ASEPKGKITVDAGAEE----ALKE---------------------RGKSLLPAGIVEVEGNFSRGDVV----EILNQRLP 330 (379) T ss_pred CCCCCCEEEECHHHHH----HHHH---------------------CCCCCCCCCEEEEEECCCCCCEE----EEECCCCC T ss_conf 3776746887525899----9983---------------------78711431136662014764569----86125340 Q ss_pred CC-CEEEEEEEECCHHHHHHH Q ss_conf 98-789999984398889999 Q gi|254781037|r 269 DG-KIFHEVSRNGNRCDAINI 288 (307) Q Consensus 269 DG-~~~i~~~~~g~~~~a~~L 288 (307) +| ...=+.....+.++-+.+ T Consensus 331 ~G~~~iG~G~v~Y~S~eL~~i 351 (379) T TIGR01027 331 EGKQLIGRGLVNYSSDELEKI 351 (379) T ss_pred CCCCEECCCCCCCCHHHHHHH T ss_conf 126201010245698999998 No 41 >PRK12681 cysB transcriptional regulator CysB; Reviewed Probab=59.80 E-value=14 Score=17.62 Aligned_cols=188 Identities=12% Similarity=0.069 Sum_probs=83.7 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 28997079888899999999999974089983389999973274115875567486226999999998709836997403 Q gi|254781037|r 5 PFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHSA 84 (307) Q Consensus 5 ~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHSl 84 (307) .++||+=.|.-.-+- -.+.....+. +|++++++.. | -..++.+.|.+|++|+|+=+- T Consensus 94 ~L~Ia~~~t~a~~~L-P~~i~~F~~~--~P~v~l~l~~----~----------------~~~~i~~~l~~g~~DlaI~~e 150 (324) T PRK12681 94 SLYIATTHTQARYAL-PPVIKGFIKR--YPRVSLHMHQ----G----------------SPTQIAEAAAKGEADFAIATE 150 (324) T ss_pred EEEEEECHHHHHHHH-HHHHHHHHHH--CCCCCEEEEE----C----------------CHHHHHHHHHCCCCCCCCCCC T ss_conf 499985358888772-5999999987--8998446531----7----------------879999999879987231246 Q ss_pred CCCCCCCCCCCEEEEECCCCCCCEEEEEC-------CCCCCCCCCCCCEE--ECC-CCCHHHH----HHHHHCCCCCHHH Q ss_conf 43211101221012223656831056652-------43243324567322--047-5302888----9984237731033 Q gi|254781037|r 85 KDMPTKLLKGLQISAYLPREDIRDVFISH-------TAQSLKDLALHSVI--GTS-SLRRKAL----LLRWRSDISVIDF 150 (307) Q Consensus 85 KDlP~~~~~~l~i~a~~~R~d~rD~lv~~-------~~~~l~~lp~~a~I--GTs-S~RR~aq----l~~~~p~l~~~~i 150 (307) +..+.+++... +-..+++-+++.+ +..++++|..-..| ..+ +.|+... -..+.|++.+. T Consensus 151 ---~~~~~~dl~~~--P~~~~~~~vvvp~~HPLa~~~~itl~dL~~~plI~~~~g~~~R~~id~~f~~~g~~P~iv~e-- 223 (324) T PRK12681 151 ---ALHLYDDLIML--PCYHWNRSVVVPPDHPLAKKKKLTIEELAQYPLVTYVFGFTGRSELDTAFNRAGLTPRIVFT-- 223 (324) T ss_pred ---CCCCCCCCEEE--EEEECCCCCCCCCCCCCCCCCCCCHHHHCCCCEEEECCCCCHHHHHHHHHHHCCCCCEEEEE-- T ss_conf ---66668874686--40003222457999965579999999986998596149982999999999988997607999-- Q ss_pred CCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCC---CHHHHHHHHHCCHHHH Q ss_conf 0767999998637876556301454542162112223368667358543207899985276---1245567762397888 Q gi|254781037|r 151 RGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNN---PKAQELVKVINHEDTW 227 (307) Q Consensus 151 RGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d---~~i~~il~~Ind~~T~ 227 (307) =.+.++=..-+..| ..-.|+|.-..+=. .+..-..++.+.+. ......|-.|++- ..+...+..+...-|+ T Consensus 224 ~~~~~~I~~~V~~G-lGV~ilp~mA~~~~--~d~~l~~~~~~~~f---~~~~t~i~~r~g~~l~~~~~~Fi~~~~p~l~~ 297 (324) T PRK12681 224 ATDADVIKTYVRLG-LGVGVIASMAVDPV--ADPDLVAIDASHLF---AHSTTKIGFRRGTFLRSYMYDFIERFAPHLTR 297 (324) T ss_pred ECCHHHHHHHHHHC-CEEEECHHHHCCCC--CCCCEEEEECCCCC---CCCEEEEEEECCCCHHHHHHHHHHHHHCCCCH T ss_conf 89899999999959-91397035551965--57881784365567---77647999979971579999999986120489 Q ss_pred H Q ss_conf 8 Q gi|254781037|r 228 D 228 (307) Q Consensus 228 ~ 228 (307) . T Consensus 298 ~ 298 (324) T PRK12681 298 D 298 (324) T ss_pred H T ss_conf 9 No 42 >PRK02478 Maf-like protein; Reviewed Probab=58.19 E-value=6.1 Score=19.98 Aligned_cols=26 Identities=15% Similarity=0.302 Sum_probs=18.0 Q ss_pred HHHHHHHCCCCCCCEEEEEEEECCEE Q ss_conf 99998717764885289999999989 Q gi|254781037|r 234 RAFLAELDGSCKSAIAGFAYCKGSTL 259 (307) Q Consensus 234 R~fL~~L~GgC~~PIGa~A~i~~~~l 259 (307) +.+|+.|-|..|.=+.+.+-+.+++. T Consensus 96 ~~mL~~lsGk~H~v~T~v~l~~~~~~ 121 (199) T PRK02478 96 RRHLQKLSGKTHQLNSAVVLVRDGKV 121 (199) T ss_pred HHHHHHHCCCEEEEEEEEEEEECCEE T ss_conf 99999976983999989999989989 No 43 >PRK09508 leuO leucine transcriptional activator; Reviewed Probab=57.77 E-value=15 Score=17.40 Aligned_cols=58 Identities=17% Similarity=0.256 Sum_probs=36.2 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 82899707988889999999999997408998338999997327411587556748622699999999870983699740 Q gi|254781037|r 4 KPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHS 83 (307) Q Consensus 4 ~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHS 83 (307) +.++||.-. .++..-.-.+...+.+. +|++.+++... ...++.+.|.+|++|+|+-+ T Consensus 112 ~~~~i~~~~-~~~~~llp~ll~~~~~~--~P~v~l~~~~~--------------------~~~~~~~~L~~g~iDl~I~~ 168 (314) T PRK09508 112 RVFNLCICS-PLDNRLASQIYNRIEQI--APNIHLVFKSS--------------------LNQNIEHQLRYQETEFVISY 168 (314) T ss_pred CEEEEEECC-HHHHHHHHHHHHHHHHH--CCCCEEEEEEC--------------------CCHHHHHHHHCCCCCEEEEC T ss_conf 369999630-78999999999999986--89977999857--------------------70649999976995299967 Q ss_pred C Q ss_conf 3 Q gi|254781037|r 84 A 84 (307) Q Consensus 84 l 84 (307) . T Consensus 169 ~ 169 (314) T PRK09508 169 E 169 (314) T ss_pred C T ss_conf 8 No 44 >PRK00238 consensus Probab=57.10 E-value=5.3 Score=20.36 Aligned_cols=24 Identities=29% Similarity=0.414 Sum_probs=14.0 Q ss_pred EEECCCCCHHHHHHHHHCCCCCHH Q ss_conf 220475302888998423773103 Q gi|254781037|r 126 VIGTSSLRRKALLLRWRSDISVID 149 (307) Q Consensus 126 ~IGTsS~RR~aql~~~~p~l~~~~ 149 (307) +.+++|+||+..|..+.-++.+.+ T Consensus 5 ILAS~Sp~R~~lL~~~gi~f~~~~ 28 (198) T PRK00238 5 YLASGSPRRRELLTQIGVPFSVLS 28 (198) T ss_pred EECCCCHHHHHHHHHCCCCCEEEE T ss_conf 983899999999987899829983 No 45 >PRK04719 consensus Probab=56.88 E-value=5.6 Score=20.18 Aligned_cols=26 Identities=12% Similarity=0.234 Sum_probs=18.7 Q ss_pred HHHHHHHCCCCCCCEEEEEEEECCEE Q ss_conf 99998717764885289999999989 Q gi|254781037|r 234 RAFLAELDGSCKSAIAGFAYCKGSTL 259 (307) Q Consensus 234 R~fL~~L~GgC~~PIGa~A~i~~~~l 259 (307) +..|+.|.|.-|.=+.+.|-+.++.. T Consensus 92 ~~~L~~lsGk~H~v~T~v~i~~~~~~ 117 (188) T PRK04719 92 KRMLLKLSGRQHQVMTAVTVVSREKQ 117 (188) T ss_pred HHHHHHHCCCCEEEEEEEEEEECCEE T ss_conf 99999857996799999999989978 No 46 >PRK01946 consensus Probab=56.61 E-value=6.2 Score=19.91 Aligned_cols=26 Identities=31% Similarity=0.377 Sum_probs=20.1 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCHHH Q ss_conf 32204753028889984237731033 Q gi|254781037|r 125 SVIGTSSLRRKALLLRWRSDISVIDF 150 (307) Q Consensus 125 a~IGTsS~RR~aql~~~~p~l~~~~i 150 (307) -++|++|+||+..|..+..++.+... T Consensus 11 iILAS~Sp~R~~lL~~~gi~f~v~~~ 36 (195) T PRK01946 11 IILASGSPRRKELLSGLGIDFEVRTL 36 (195) T ss_pred EEEECCCHHHHHHHHHCCCCCEEECC T ss_conf 99978999999999977999789438 No 47 >PRK00234 Maf-like protein; Reviewed Probab=56.58 E-value=5.7 Score=20.14 Aligned_cols=107 Identities=15% Similarity=0.171 Sum_probs=49.7 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCCC-CCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEE Q ss_conf 32204753028889984237731033076799999863787-65563014545421621122233686673585432078 Q gi|254781037|r 125 SVIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNNNK-AHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAI 203 (307) Q Consensus 125 a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~g~-~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaI 203 (307) -+.+++|+||+..|..+.-++++.+ -+||-.. ..++ ..-++...|-.+-.. +....+ +.++=++-| | T Consensus 4 iILAS~SprR~~lL~~~gi~f~v~~--~~idE~~---~~~~~p~~~~~~lA~~KA~~----v~~~~~-~~~VIgaDt--v 71 (192) T PRK00234 4 LLLASSSPYRRELLARLRLPFTCAS--PDIDESH---RPDESAEELVRRLARQKAEA----LAGSHP-QHLIIGSDQ--V 71 (192) T ss_pred EEECCCCHHHHHHHHHCCCCCEEEC--CCCCCCC---CCCCCHHHHHHHHHHHHHHH----HHHHCC-CCEEEECCE--E T ss_conf 8986899999999997799949978--9998888---99989999999999999986----476489-867873586--9 Q ss_pred EEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEC Q ss_conf 99985276124556776239788888788999998717764885289999999 Q gi|254781037|r 204 CIETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAYCKG 256 (307) Q Consensus 204 aIe~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i~~ 256 (307) . .| ++ +++.+=.+.+ + -|..|+.|.|..|.=+.+.|-+.. T Consensus 72 v-~~--~g----~il~KP~~~~---~---A~~~L~~lsGk~h~v~Tgv~v~~~ 111 (192) T PRK00234 72 A-VL--GE----QILGKPHTFE---R---ACEQLLAASGQSVTFLTGLALLNS 111 (192) T ss_pred E-EE--CC----EEECCCCCHH---H---HHHHHHHHCCCEEEEEEEEEEEEC T ss_conf 9-98--99----9977999999---9---999999828995999987999989 No 48 >PRK00648 Maf-like protein; Reviewed Probab=56.34 E-value=6.1 Score=19.95 Aligned_cols=114 Identities=18% Similarity=0.210 Sum_probs=52.1 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEE Q ss_conf 32204753028889984237731033076799999863787655630145454216211222336866735854320789 Q gi|254781037|r 125 SVIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAIC 204 (307) Q Consensus 125 a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIa 204 (307) -+++++|+||+..|..+.-++.+.+-. .++..+.+-.+ -..++...|-.+- ..+.....++.++=++=| ++ T Consensus 5 iILAS~S~~R~~lL~~~gi~f~v~~~~-~dEe~~~~~~~--p~~~~~~lA~~Ka----~~v~~~~~~~~~vI~aDt--vv 75 (191) T PRK00648 5 IVLASSSPRRKEILEGFRIPFEVIPSP-FVEESYPYSLD--PEEYTLELARLKA----EAVRSDHFPDELIITADT--IV 75 (191) T ss_pred EEEECCCHHHHHHHHHCCCCCEEECCC-CCCCCCCCCCC--HHHHHHHHHHHHH----HHHHHHHCCCCEEEEECE--EE T ss_conf 999689999999999779995997899-98776777899--9999999999999----999972189988999684--99 Q ss_pred EEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCEEE Q ss_conf 99852761245567762397888887889999987177648852899999999899 Q gi|254781037|r 205 IETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAYCKGSTLY 260 (307) Q Consensus 205 Ie~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i~~~~l~ 260 (307) .| ++ +++.+= .+..++ |.+|+.|.|..|.-+.+.|-+.++... T Consensus 76 -~~--~g----~i~~KP---~~~~eA---~~~L~~lsGk~h~v~T~v~v~~~~~~~ 118 (191) T PRK00648 76 -WY--DG----KVLGKP---KSEEEA---IEMLKTLSGKTHEVITGVCIIHNGKLL 118 (191) T ss_pred -EE--CC----EEECCC---CCHHHH---HHHHHHHCCCCCEEEEEEEEEECCEEE T ss_conf -98--99----996797---659999---999999789960999999999899999 No 49 >PRK03797 consensus Probab=55.90 E-value=3.9 Score=21.19 Aligned_cols=109 Identities=17% Similarity=0.276 Sum_probs=51.6 Q ss_pred EEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEE Q ss_conf 22047530288899842377310330767999998637876556301454542162112223368667358543207899 Q gi|254781037|r 126 VIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICI 205 (307) Q Consensus 126 ~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaI 205 (307) +.+++|+||+..|..+ +.|.-+--++|.+... .+--..++.-|-.+=. .+...++.+ ++=++=| + | T Consensus 3 ILAS~Sp~R~~lL~~~---i~f~v~~~~idE~~~~---~~p~~~~~~lA~~KA~----~v~~~~~~~-~VIgaDt--v-v 68 (186) T PRK03797 3 ILASSSPRRREILSRF---FEIKVYPSNVSERSSI---KDPREHALDLARKKAL----EVYSRFPKA-TIIAADT--I-V 68 (186) T ss_pred EECCCCHHHHHHHHCC---CCCEEECCCCCCCCCC---CCHHHHHHHHHHHHHH----HHHHHCCCC-EEEEECE--E-E T ss_conf 9868999999999757---9979978998988888---8999999999999999----999878998-6999862--9-9 Q ss_pred EEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCEEE Q ss_conf 9852761245567762397888887889999987177648852899999999899 Q gi|254781037|r 206 ETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAYCKGSTLY 260 (307) Q Consensus 206 e~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i~~~~l~ 260 (307) .| |.+ ++.+ +.+..++ +.+|+.|.|.-|.=+.+.|-+.++.+. T Consensus 69 ~~---~g~---i~gK---P~~~~eA---~~~L~~lsGk~h~v~T~v~i~~~~~~~ 111 (186) T PRK03797 69 SL---GGK---ILGK---PKNEEEA---RKMLKTLSGKVHLVITGYCIIHNGKII 111 (186) T ss_pred EE---CCE---EECC---CCCHHHH---HHHHHHHCCCCEEEEEEEEEEECCEEE T ss_conf 98---999---9679---9999999---999999789926999999999999999 No 50 >PRK03442 consensus Probab=55.87 E-value=5.8 Score=20.10 Aligned_cols=25 Identities=28% Similarity=0.224 Sum_probs=17.8 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCHH Q ss_conf 3220475302888998423773103 Q gi|254781037|r 125 SVIGTSSLRRKALLLRWRSDISVID 149 (307) Q Consensus 125 a~IGTsS~RR~aql~~~~p~l~~~~ 149 (307) -+.+++|+||+..|..+.-++.+.+ T Consensus 4 lILAS~SprR~~lL~~~g~~f~v~~ 28 (213) T PRK03442 4 LVLGSASPGRLKVLRQAGIDPLVVV 28 (213) T ss_pred EEECCCCHHHHHHHHHCCCCEEEEC T ss_conf 8986899999999997799819974 No 51 >PRK00884 Maf-like protein; Reviewed Probab=54.88 E-value=6.3 Score=19.86 Aligned_cols=31 Identities=16% Similarity=0.284 Sum_probs=20.6 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHH Q ss_conf 322047530288899842377310330767999 Q gi|254781037|r 125 SVIGTSSLRRKALLLRWRSDISVIDFRGKIETR 157 (307) Q Consensus 125 a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TR 157 (307) -++|++|+||+..|..+.-++.+.+ -|||-. T Consensus 4 iILAS~SprR~~lL~~~gi~f~v~~--~~idE~ 34 (194) T PRK00884 4 LILASTSPYRRALLEKLQLPFECAA--PEVDET 34 (194) T ss_pred EEECCCCHHHHHHHHHCCCCCEEEC--CCCCCC T ss_conf 8997899999999987899958979--999988 No 52 >cd00555 Maf Nucleotide binding protein Maf. Maf has been implicated in inhibition of septum formation in eukaryotes, bacteria and archaea, but homologs in B.subtilis and S.cerevisiae are nonessential for cell division. Maf has been predicted to be a nucleotide- or nucleic acid-binding protein with structural similarity to the hypoxanthine/xanthine NTP pyrophosphatase Ham1 from Methanococcus jannaschii, RNase H from Escherichia coli, and some other nucleotide or RNA-binding proteins. Probab=54.59 E-value=6.1 Score=19.97 Aligned_cols=110 Identities=17% Similarity=0.245 Sum_probs=52.0 Q ss_pred EEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEE Q ss_conf 22047530288899842377310330767999998637876556301454542162112223368667358543207899 Q gi|254781037|r 126 VIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICI 205 (307) Q Consensus 126 ~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaI 205 (307) ++|++|+||+..|+.+.-++.+. --|||-+.-+-.+..--+.-+|.+ +-.. +...++.+.++=++=| + + T Consensus 2 ILaS~S~~R~~lL~~~gi~f~~~--~~~iDE~~~~~~~p~~~~~~lA~~--KA~~----v~~~~~~~~iiI~aDt--v-v 70 (180) T cd00555 2 ILASASPRRRELLEQLGIPFEVV--PSDIDETPIKGESPEDYVLRLAEA--KAEA----VAARLPPDALVIGADT--V-V 70 (180) T ss_pred EECCCCHHHHHHHHHCCCCEEEE--CCCCCCCCCCCCCHHHHHHHHHHH--HHHH----HHHHCCCCCEEEECCE--E-E T ss_conf 87779999999999779984997--899998988899999999999999--9999----9974789988998180--9-9 Q ss_pred EEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCE Q ss_conf 98527612455677623978888878899999871776488528999999998 Q gi|254781037|r 206 ETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAYCKGST 258 (307) Q Consensus 206 e~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i~~~~ 258 (307) .| ++ + ++. -+.+..++ +.+|+.+.|-=|.=+.+.+-+.++. T Consensus 71 ~~--~g-~---i~~---KP~~~~~A---~~~L~~lsgk~h~v~T~v~v~~~~~ 111 (180) T cd00555 71 VL--DG-R---ILG---KPKDREEA---REMLKRLSGRTHEVYTGVALIDPGG 111 (180) T ss_pred EE--CC-E---ECC---CCCCHHHH---HHHHHHHCCCCEEEEEEEEEEECCC T ss_conf 98--99-9---847---98769999---9967753799479999999998996 No 53 >pfam02642 consensus Probab=54.38 E-value=17 Score=17.04 Aligned_cols=100 Identities=17% Similarity=0.165 Sum_probs=70.9 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEE--EEECCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCHHHHHHH-- Q ss_conf 999999987098369974034321110122101--2223656831056652432433245673220475302888998-- Q gi|254781037|r 65 TEEIEKKLISGEIDCAVHSAKDMPTKLLKGLQI--SAYLPREDIRDVFISHTAQSLKDLALHSVIGTSSLRRKALLLR-- 140 (307) Q Consensus 65 tkele~aLl~g~iDiAVHSlKDlP~~~~~~l~i--~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS~RR~aql~~-- 140 (307) +.+|.+.-++|+.|+..=|.-=.|...++=..+ ++-+-|. ..=.+|+++..+ + ++.+|+.+...-.|.++. T Consensus 37 IetLN~~A~~g~~dvskiS~~ayp~i~d~Y~lL~~G~alG~G-~GPlvva~~~~~---l-~~~~iAvPG~~TTA~lL~~l 111 (259) T pfam02642 37 IETLNREALKNRYEVSAISAHLYPLIADKYALLPVGTSLGYG-YGPIVVSKKEVS---L-KLKRLATPGEHTTAHLLCKL 111 (259) T ss_pred HHHHHHHHHCCCCCEEEECHHHHHHHHHCEEEEECHHHCCCC-CCCEEEECCCCC---C-CCCEEECCCHHHHHHHHHHH T ss_conf 999999998189997887597788686068875332230488-687478547777---7-89989787736799999998 Q ss_pred HHCCCCCHHHCCCHHHHHHHHHCCCCCEEEE Q ss_conf 4237731033076799999863787655630 Q gi|254781037|r 141 WRSDISVIDFRGKIETRLNKLNNNKAHAILL 171 (307) Q Consensus 141 ~~p~l~~~~iRGNv~TRl~Kl~~g~~Daiil 171 (307) ++|+.+++++ +.+.=+..+.+|++||-++ T Consensus 112 ~~~~~~~v~~--~FdeI~~aV~~G~vDaGvi 140 (259) T pfam02642 112 YYPDAELVPM--PFDRIIQAVLEGKVDAGAL 140 (259) T ss_pred HCCCCEEEEE--EHHHHHHHHHCCCCCCEEE T ss_conf 5588759995--6899899997599871079 No 54 >TIGR03427 ABC_peri_uca ABC transporter periplasmic binding protein, urea carboxylase region. Members of this family are ABC transporter periplasmic binding proteins associated with the urea carboxylase/allophanate hydrolase pathway, an alternative to urease for urea degradation. The protein is restricted to bacteria with the pathway, with its gene close to the urea carboxylase and allophanate hydrolase genes. The substrate for this transporter therefore is likely to be urea or a compound from which urea is easily derived. Probab=53.69 E-value=18 Score=16.97 Aligned_cols=166 Identities=14% Similarity=0.153 Sum_probs=94.6 Q ss_pred CCEEEEECC----CHHHHHHHHHHHH-HHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCE Q ss_conf 828997079----8888999999999-99974089983389999973274115875567486226999999998709836 Q gi|254781037|r 4 KPFRIGTRC----SPLALAHAFETRS-SLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEID 78 (307) Q Consensus 4 ~~i~IgtR~----S~LAl~Qa~~v~~-~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iD 78 (307) ++++||.-. -|+.++|-.=.-+ -.++ | +++++++.+..-. +--+++..|++| T Consensus 2 ~~~kig~S~w~Gw~Pw~~A~e~GifkKw~~k-~---Gi~Vel~~f~dy~-------------------~si~a~~AG~~D 58 (328) T TIGR03427 2 DKFKVCWSIYAGWMPWGYAAQQGIVDKWADK-Y---GITIEVVQINDYV-------------------ESINQYTAGKFD 58 (328) T ss_pred CCEEEEEECCCCHHHHHHHHHCCCHHHHHHH-C---CCEEEEEECCCHH-------------------HHHHHHHCCCCC T ss_conf 9339997268538889999974926555887-5---9737999828848-------------------889999758977 Q ss_pred EEEECCCCCC-CCCCCCCEEEE--ECCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCHHHHHHHH--------HCCCCC Q ss_conf 9974034321-11012210122--236568310566524324332456732204753028889984--------237731 Q gi|254781037|r 79 CAVHSAKDMP-TKLLKGLQISA--YLPREDIRDVFISHTAQSLKDLALHSVIGTSSLRRKALLLRW--------RSDISV 147 (307) Q Consensus 79 iAVHSlKDlP-~~~~~~l~i~a--~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS~RR~aql~~~--------~p~l~~ 147 (307) -+.-..=|.= ..-..|..+.+ +..--+-.|++|.+..+++.|| +|-+||....-=..+++.. ..|+++ T Consensus 59 g~~~Tn~Dal~i~aa~Gvd~~~vlv~D~SnG~D~Ivak~~~slaDL-KGKkVav~~~svshyLL~rALe~aGLs~~DV~v 137 (328) T TIGR03427 59 GCTMTNMDALTIPAAGGVDTTALIVGDFSNGNDGIVLKGGKSLADL-KGQKVNLVELSVSHYLLARALESVGLSEKDVKV 137 (328) T ss_pred EEEEECCHHHHHHHCCCCCEEEEEEECCCCCCCEEEECCCCCHHHH-CCCEEECCCCCCHHHHHHHHHHHCCCCHHHEEE T ss_conf 4776261566513117997289998525788747997899887884-898883247870799999999986998899078 Q ss_pred HHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCC Q ss_conf 033076799999863787655630145454216211222336866735 Q gi|254781037|r 148 IDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFP 195 (307) Q Consensus 148 ~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~ 195 (307) +++.- +.-..-+..|+.||.+-=.-=+..+.-..-...+|+..+++ T Consensus 138 Vn~~~--~d~~aAf~sg~VdA~vTWeP~ls~i~~~~ga~~iF~Ss~iP 183 (328) T TIGR03427 138 VNTSD--ADIVAAFITKDVTAVVTWNPQLSEIKAQPGANEVFDSSQIP 183 (328) T ss_pred EECCC--HHHHHHHHCCCCCEEEEECHHHHHHHHCCCCCEEEECCCCC T ss_conf 84795--46999974799878999785689997289982644257788 No 55 >PRK00148 Maf-like protein; Reviewed Probab=53.57 E-value=6.5 Score=19.77 Aligned_cols=32 Identities=19% Similarity=0.099 Sum_probs=18.5 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHH Q ss_conf 3220475302888998423773103307679999 Q gi|254781037|r 125 SVIGTSSLRRKALLLRWRSDISVIDFRGKIETRL 158 (307) Q Consensus 125 a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl 158 (307) -++|++|+||+..|..+.-++.+.+ =+||-+. T Consensus 4 lILAS~Sp~R~~lL~~~g~~f~v~~--~~idE~~ 35 (195) T PRK00148 4 LVLASASPARLKLLRLAGIPPLVVV--SHVDEDA 35 (195) T ss_pred EEECCCCHHHHHHHHHCCCCEEEEC--CCCCCCC T ss_conf 8985799999999997799849978--9989898 No 56 >PRK03415 consensus Probab=52.21 E-value=7.5 Score=19.37 Aligned_cols=27 Identities=15% Similarity=0.324 Sum_probs=18.4 Q ss_pred HHHHHHHCCCCCCCEEEEEEEECCEEE Q ss_conf 999987177648852899999999899 Q gi|254781037|r 234 RAFLAELDGSCKSAIAGFAYCKGSTLY 260 (307) Q Consensus 234 R~fL~~L~GgC~~PIGa~A~i~~~~l~ 260 (307) |..|+.|.|.-|.=+.+.|-.+++.+. T Consensus 90 ~~~L~~lsG~~H~v~T~v~l~~~~~~~ 116 (197) T PRK03415 90 AQMLRALSGQTHQVMTAVALADSQHIL 116 (197) T ss_pred HHHHHHHCCCCEEEEEEEEEEECCCEE T ss_conf 999999769952999999998299489 No 57 >PRK02141 Maf-like protein; Reviewed Probab=51.31 E-value=9.1 Score=18.84 Aligned_cols=33 Identities=27% Similarity=0.356 Sum_probs=22.9 Q ss_pred CCCCEEECCCCCHHHHHHHHHCCCCCHHHCCCHHH Q ss_conf 56732204753028889984237731033076799 Q gi|254781037|r 122 ALHSVIGTSSLRRKALLLRWRSDISVIDFRGKIET 156 (307) Q Consensus 122 p~~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~T 156 (307) |+--+++++|+||+..|..+.-++.+.+- +||- T Consensus 8 ~pklILAS~SprR~eLL~~~Gi~f~v~~~--~iDE 40 (206) T PRK02141 8 PPRLILASSSRYRRELLERLRLPFDVVSP--DIDE 40 (206) T ss_pred CCCEEEECCCHHHHHHHHHCCCCCEEECC--CCCC T ss_conf 99799937999999999878999599858--9998 No 58 >PRK04056 Maf-like protein; Reviewed Probab=50.73 E-value=8.3 Score=19.11 Aligned_cols=114 Identities=10% Similarity=0.067 Sum_probs=50.4 Q ss_pred EEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEE Q ss_conf 22047530288899842377310330767999998637876556301454542162112223368667358543207899 Q gi|254781037|r 126 VIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICI 205 (307) Q Consensus 126 ~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaI 205 (307) +.+++|+||+..|....-++.+ +.-|||-..-+-... -..+...|-.+-.. +...++.+..+=++=| + | T Consensus 3 ILAS~SprR~~lL~~~gi~f~~--~~~~iDE~~~~~~~p--~~~v~~la~~Ka~~----~~~~~~~~~~vi~aDt--v-v 71 (180) T PRK04056 3 ILASSSPTRANLLKEAGIEFEQ--KSVDFDEESIKKTSP--KEFVYLAVKGKLEQ----ALKLYGNNCNLLVADS--V-V 71 (180) T ss_pred EECCCCHHHHHHHHHCCCCCEE--ECCCCCCCCCCCCCH--HHHHHHHHHHHHHH----HHHHCCCCCEEEEECE--E-E T ss_conf 9978999999999977999699--718999988777899--99999999999999----9987199986999786--9-9 Q ss_pred EEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCEEEEE Q ss_conf 985276124556776239788888788999998717764885289999999989999 Q gi|254781037|r 206 ETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAYCKGSTLYFY 262 (307) Q Consensus 206 e~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i~~~~l~l~ 262 (307) .| ++ +++. -+.+..+ | +.+|+.|.|.=|.=+.+.+...++..... T Consensus 72 ~~--~g----~ilg---KP~~~~~--A-~~~L~~lsGk~h~v~T~v~~~~~~~~~~~ 116 (180) T PRK04056 72 SC--DG----KILR---KAKDKEE--A-REMLKLQSGNEISVLTCMIFISPEKEVLD 116 (180) T ss_pred EE--CC----EEEC---CCCCHHH--H-HHHHHHHCCCEEEEEEEEEEECCCEEEEE T ss_conf 98--99----9987---9999999--9-99999868993999999999729989999 No 59 >PRK00032 Maf-like protein; Reviewed Probab=50.03 E-value=8.3 Score=19.11 Aligned_cols=25 Identities=24% Similarity=0.428 Sum_probs=15.5 Q ss_pred HHHHHHCCCCCCCEEEEEEEECCEE Q ss_conf 9998717764885289999999989 Q gi|254781037|r 235 AFLAELDGSCKSAIAGFAYCKGSTL 259 (307) Q Consensus 235 ~fL~~L~GgC~~PIGa~A~i~~~~l 259 (307) .+|+.|.|--|.=+.+.+-+.++++ T Consensus 91 ~~L~~lsGk~h~v~T~v~v~~~~~~ 115 (189) T PRK00032 91 AMLAALSGRTHQVMTAVALADSQRI 115 (189) T ss_pred HHHHHHCCCCEEEEEEEEEEECCEE T ss_conf 9999858993899989999989989 No 60 >PRK12682 transcriptional regulator CysB-like protein; Reviewed Probab=49.93 E-value=20 Score=16.59 Aligned_cols=134 Identities=14% Similarity=0.089 Sum_probs=62.2 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECC-------CCCCCCCCCCCEEEC--CCCCHH Q ss_conf 99999998709836997403432111012210122236568310566524-------324332456732204--753028 Q gi|254781037|r 65 TEEIEKKLISGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHT-------AQSLKDLALHSVIGT--SSLRRK 135 (307) Q Consensus 65 tkele~aLl~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~-------~~~l~~lp~~a~IGT--sS~RR~ 135 (307) +.++.+.|.+|++|+|+-+- |....++ +.+.+-..++--+++.++ ..+++||.....|.- ++.-|. T Consensus 131 ~~~~~~~l~~g~~Dl~i~~~---~~~~~~~--l~~~~l~~~~~~~v~~~~h~la~~~~i~~~dL~~~p~I~~~~~~~~r~ 205 (309) T PRK12682 131 PDEIAQMVISGEADIGIATE---SLDDDPD--LVTLPCYDWTHAVIVPSDHPLAQKERITLEDLAEYPLITYHPGFTGRS 205 (309) T ss_pred HHHHHHHHHCCCCCHHHHHC---CCCCCCC--EEEEEEECCCEEEEECCCCHHHCCCCCCHHHHCCCCEEEECCCCCHHH T ss_conf 29999999769976456507---7778988--089993315236761699721149999999982899797279986799 Q ss_pred HHHHH-----HHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECC Q ss_conf 88998-----4237731033076799999863787655630145454216211222336866735854320789998527 Q gi|254781037|r 136 ALLLR-----WRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVN 210 (307) Q Consensus 136 aql~~-----~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~ 210 (307) .+-.. +.|+. .--=.|+++-+.-...|--=+ +++.-..+... ...... ++++...| +-.+.+..|++ T Consensus 206 ~~~~~~~~~g~~p~~--~~~~~~~~~~~~~v~~G~Gi~-~lp~~~~~~~~-~~~Lv~-v~~~~~~~---~~~~~l~~rk~ 277 (309) T PRK12682 206 AIDKAFAAAGLQPDI--VLEAIDSDVIKTYVRLGLGVG-IIAEMAYRPDR-DGDLVA-LPAGHLFG---PNTAWVAVKRG 277 (309) T ss_pred HHHHHHHHCCCCCCE--EEEECCHHHHHHHHHHCCEEE-HHHHHHHHHHH-CCCEEE-EECCCCCC---CCEEEEEEECC T ss_conf 999999977999626--999998999999999599424-22899833764-799899-97988777---61799999696 Q ss_pred C Q ss_conf 6 Q gi|254781037|r 211 N 211 (307) Q Consensus 211 d 211 (307) . T Consensus 278 ~ 278 (309) T PRK12682 278 A 278 (309) T ss_pred C T ss_conf 6 No 61 >PRK11233 nitrogen assimilation transcriptional regulator; Provisional Probab=49.50 E-value=21 Score=16.55 Aligned_cols=134 Identities=17% Similarity=0.135 Sum_probs=63.1 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 28997079888899999999999974089983389999973274115875567486226999999998709836997403 Q gi|254781037|r 5 PFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHSA 84 (307) Q Consensus 5 ~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHSl 84 (307) +++||+-.|..+-.-.--+...+.+. +|++++++..- .+.++.+.|.+|++|+|+-.. T Consensus 92 ~l~ig~~p~~~~~~l~~p~l~~f~~~--~P~v~v~l~~~--------------------~~~~l~~~l~~g~~Dlai~~~ 149 (305) T PRK11233 92 QVSIGLAPGTAASSITMPLLQAVRAE--FPEIVIYLHEN--------------------SGAVLNEKLINGQLDMAVIYE 149 (305) T ss_pred EEEEEEECCHHHHHHHHHHHHHHHHH--CCCEEEEEECC--------------------CHHHHHHHHHCCCEEEEEEEC T ss_conf 49999962155665369999999997--89429997238--------------------879999998769847999827 Q ss_pred CCCCCCCCCCCEE--------EEECCCCCCCEEEEECCCCCCCCCCCCCEEE---CCCCCHHH-HHHHH-HCCCCCHHHC Q ss_conf 4321110122101--------2223656831056652432433245673220---47530288-89984-2377310330 Q gi|254781037|r 85 KDMPTKLLKGLQI--------SAYLPREDIRDVFISHTAQSLKDLALHSVIG---TSSLRRKA-LLLRW-RSDISVIDFR 151 (307) Q Consensus 85 KDlP~~~~~~l~i--------~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IG---TsS~RR~a-ql~~~-~p~l~~~~iR 151 (307) ... ++++.. ..+.+++.|.+ ..++.+|.....|. .++.|+.. +.... .-..++.--= T Consensus 150 ~~~----~~~l~~~~l~~~~l~lv~~~~~p~~------~i~l~~l~~~~~i~~~~~~~~r~~~~~~~~~~~~~~~~~~e~ 219 (305) T PRK11233 150 HSP----VAGVSSQALLKEDLFLVGTQDCPGQ------SVDLNAIAQMNLFLPRDYSAVRLRVDEAFSLRRLTAKVIGEI 219 (305) T ss_pred CCC----CCCCEEEEEEECCEEEEECCCCCCC------CCCHHHHCCCCEEECCCCCHHHHHHHHHHHHCCCCCCEEEEE T ss_conf 989----8882899998535899984778878------789999808998987999849999999999779998479999 Q ss_pred CCHHHHHHHHHCCCCCEEE Q ss_conf 7679999986378765563 Q gi|254781037|r 152 GKIETRLNKLNNNKAHAIL 170 (307) Q Consensus 152 GNv~TRl~Kl~~g~~Daii 170 (307) +++++=..-+.+|..-+++ T Consensus 220 ~~~~~i~~~V~~G~Giail 238 (305) T PRK11233 220 ESIATLTAAIASGMGVTVL 238 (305) T ss_pred CCHHHHHHHHHHCCEEEEE T ss_conf 9699999999979978987 No 62 >cd00985 Maf_Ham1 Maf_Ham1. Maf, a nucleotide binding protein, has been implicated in inhibition of septum formation in eukaryotes, bacteria and archaea. A Ham1-related protein from Methanococcus jannaschii is a novel NTPase that has been shown to hydrolyze nonstandard nucleotides, such as hypoxanthine/xanthine NTP, but not standard nucleotides. Probab=49.44 E-value=7.2 Score=19.50 Aligned_cols=108 Identities=17% Similarity=0.154 Sum_probs=50.8 Q ss_pred EEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEE Q ss_conf 22047530288899842377310330767999998637876556301454542162112223368667358543207899 Q gi|254781037|r 126 VIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICI 205 (307) Q Consensus 126 ~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaI 205 (307) +++|+|+||...++.+.+ +.+.-+.-+++-...+-+.+ ..+..-| ++.- ..+.+.++ +.++=+.=||+. T Consensus 2 iLAS~S~~R~~~l~~~~~-~~~~~~~~~idE~~~~~~~~---~~~~~~A---~~KA-~~v~~~~~-~~~vi~~DT~v~-- 70 (131) T cd00985 2 ILASGSPRRLEELKQIGG-IEFEVLPSDIDETGLKGEPE---DTVEELA---LLKA-RAVAERLP-DAPVIADDTGLV-- 70 (131) T ss_pred EEECCCHHHHHHHHHCCC-CCEEEECCCCCCCCCCCCHH---HHHHHHH---HHHH-HHHHHHCC-CCEEEEECCEEE-- T ss_conf 996799899999996689-99599679989888888989---9999999---9999-88897779-965996163688-- Q ss_pred EEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECC Q ss_conf 9852761245567762397888887889999987177648852899999999 Q gi|254781037|r 206 ETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAYCKGS 257 (307) Q Consensus 206 e~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i~~~ 257 (307) + ++ ++ +-++-|..+. +.+|+.+.|--|.=+.+.|-+..+ T Consensus 71 -~--~g-~i--~~Kp~~~~~a-------~~~L~~lsgr~h~~~t~v~l~~~~ 109 (131) T cd00985 71 -V--DG-RP--GGKPARFAEA-------LEMLRGLSGRTAEFVTAVALVDPD 109 (131) T ss_pred -E--CC-EE--CCCCCCHHHH-------HHHHHHHCCCEEEEEEEEEEEECC T ss_conf -8--99-87--3489999999-------998876158869999999999489 No 63 >COG0424 Maf Nucleotide-binding protein implicated in inhibition of septum formation [Cell division and chromosome partitioning] Probab=48.78 E-value=11 Score=18.36 Aligned_cols=107 Identities=21% Similarity=0.280 Sum_probs=56.8 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCC--CCCCCCCHHHCCCCCCCC Q ss_conf 732204753028889984237731033076799999863787655630145454216211--222336866735854320 Q gi|254781037|r 124 HSVIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEK--VIKEILNIEDFPPSPGQG 201 (307) Q Consensus 124 ~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~--~i~~~l~~~~~~PA~gQG 201 (307) --+.+++||||+..|..+.= .|.-+--+||+...+-.. +..=..||-... .+...++ ++ T Consensus 4 ~LiLAS~SPrR~elL~~~gi--~f~~~~~~iDE~~~~~~~--------P~~~v~~LA~~KA~~va~~~~---------~~ 64 (193) T COG0424 4 RLILASSSPRRRELLEQLGI--PFEVIPSDIDEPLLKAEE--------PREYVLRLAEEKARAVAARLP---------PD 64 (193) T ss_pred CEEEECCCHHHHHHHHHCCC--CEEEECCCCCCCCCCCCC--------HHHHHHHHHHHHHHHHHHHCC---------CC T ss_conf 28994699899999998799--819806899987665789--------799999999999999998479---------99 Q ss_pred EEEEEEEC---CCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCE Q ss_conf 78999852---7612455677623978888878899999871776488528999999998 Q gi|254781037|r 202 AICIETHV---NNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAYCKGST 258 (307) Q Consensus 202 aIaIe~r~---~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i~~~~ 258 (307) ++-|-|-. -|- +++.+=.+. .+ =+.+|+.|.|-.|.=+-++|-++.++ T Consensus 65 ~~VigaDtvv~ldg---rilgKP~~~---~e---A~~~L~~lSG~~h~v~T~v~li~~~~ 115 (193) T COG0424 65 ALVIGADTVVVLDG---RILGKPKDE---EE---AREMLRKLSGRTHQVYTGVALIDPGK 115 (193) T ss_pred CEEEECCEEEEECC---EEECCCCCH---HH---HHHHHHHHCCCEEEEEEEEEEEECCC T ss_conf 87996274999899---982699999---99---99999984698699999999997797 No 64 >KOG2098 consensus Probab=48.50 E-value=9.9 Score=18.61 Aligned_cols=16 Identities=13% Similarity=-0.058 Sum_probs=7.5 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 8889999999999997 Q gi|254781037|r 14 PLALAHAFETRSSLMD 29 (307) Q Consensus 14 ~LAl~Qa~~v~~~L~~ 29 (307) .--..|.+..+.+|+. T Consensus 8 ~a~~t~~~~lr~rle~ 23 (591) T KOG2098 8 QAHKTQLDSLRERLER 23 (591) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 6689899999999987 No 65 >pfam02545 Maf Maf-like protein. Maf is a putative inhibitor of septum formation in eukaryotes, bacteria, and archaea. Probab=47.40 E-value=9.8 Score=18.62 Aligned_cols=27 Identities=26% Similarity=0.389 Sum_probs=21.2 Q ss_pred HHHHHHHCCCCCCCEEEEEEEECCEEE Q ss_conf 999987177648852899999999899 Q gi|254781037|r 234 RAFLAELDGSCKSAIAGFAYCKGSTLY 260 (307) Q Consensus 234 R~fL~~L~GgC~~PIGa~A~i~~~~l~ 260 (307) |.+|+.|.|--|.=+.+.|-+.++... T Consensus 91 ~~~L~~lsGk~h~v~Tav~l~~~~~~~ 117 (193) T pfam02545 91 REMLRKLSGNVHSVYTGLALISPEKNL 117 (193) T ss_pred HHHHHHHCCCCEEEEEEEEEEECCEEE T ss_conf 999999879966999999999899399 No 66 >PRK03411 consensus Probab=46.80 E-value=10 Score=18.53 Aligned_cols=25 Identities=20% Similarity=0.291 Sum_probs=16.3 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCHH Q ss_conf 3220475302888998423773103 Q gi|254781037|r 125 SVIGTSSLRRKALLLRWRSDISVID 149 (307) Q Consensus 125 a~IGTsS~RR~aql~~~~p~l~~~~ 149 (307) -+++++||||+..|..+.-++++.+ T Consensus 4 lILAS~SprR~~lL~~~gi~f~~~~ 28 (194) T PRK03411 4 LILASTSPWRRALLEKLQIPFECAA 28 (194) T ss_pred EEECCCCHHHHHHHHHCCCCCEEEC T ss_conf 8987899999999987899968979 No 67 >PRK01839 Maf-like protein; Reviewed Probab=44.16 E-value=12 Score=18.11 Aligned_cols=32 Identities=16% Similarity=0.182 Sum_probs=22.0 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCHHHCCCHHH Q ss_conf 6732204753028889984237731033076799 Q gi|254781037|r 123 LHSVIGTSSLRRKALLLRWRSDISVIDFRGKIET 156 (307) Q Consensus 123 ~~a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~T 156 (307) +.-+.+++|+||+..|..+.-++++ +--++|. T Consensus 10 p~IILAS~SprR~~lL~~~gi~f~~--~~~~~dE 41 (209) T PRK01839 10 PFLYLASQSPRRQELLQQLGVRYEL--LLPRPDE 41 (209) T ss_pred CCEEECCCCHHHHHHHHHCCCCCEE--ECCCCCC T ss_conf 8889857999999999878999589--7889987 No 68 >PRK04222 consensus Probab=43.75 E-value=12 Score=18.10 Aligned_cols=30 Identities=23% Similarity=0.405 Sum_probs=16.8 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCHHHCCCHHH Q ss_conf 32204753028889984237731033076799 Q gi|254781037|r 125 SVIGTSSLRRKALLLRWRSDISVIDFRGKIET 156 (307) Q Consensus 125 a~IGTsS~RR~aql~~~~p~l~~~~iRGNv~T 156 (307) -+.+++|+||+..|..+.-++.+. --|||- T Consensus 5 iILAS~SprR~~lL~~~gi~F~~~--~~~iDE 34 (191) T PRK04222 5 LILASTSAYRRELLGRLQLDFDTA--RPEVDE 34 (191) T ss_pred EEECCCCHHHHHHHHHCCCCCEEE--CCCCCC T ss_conf 898479999999999789980997--899998 No 69 >PRK10263 DNA translocase FtsK; Provisional Probab=41.56 E-value=19 Score=16.75 Aligned_cols=38 Identities=29% Similarity=0.399 Sum_probs=19.6 Q ss_pred HCCCCCH--HHCCCHHHHHHHHHCCCCC-EEEEEHHHHHHC Q ss_conf 2377310--3307679999986378765-563014545421 Q gi|254781037|r 142 RSDISVI--DFRGKIETRLNKLNNNKAH-AILLAYAGIKRL 179 (307) Q Consensus 142 ~p~l~~~--~iRGNv~TRl~Kl~~g~~D-aiilA~Agl~RL 179 (307) ||...|+ .|+-||.|||..--....| -+||-..|-++| T Consensus 1184 RPSVDVITGlIKANiPtRIAF~VsSkiDSRTILD~~GAE~L 1224 (1355) T PRK10263 1184 RPSVDVITGLIKANIPTRIAFTVSSKIDSRTILDQAGAESL 1224 (1355) T ss_pred CCCCCEECCCEECCCHHHHEECCCCCCCCCCCCCCCCHHHH T ss_conf 99867004624334323310003445663510787588885 No 70 >PRK01526 Maf-like protein; Reviewed Probab=41.32 E-value=17 Score=17.12 Aligned_cols=23 Identities=35% Similarity=0.411 Sum_probs=17.5 Q ss_pred CCCCCCCCEEECCCCCHHHHHHHHH Q ss_conf 3324567322047530288899842 Q gi|254781037|r 118 LKDLALHSVIGTSSLRRKALLLRWR 142 (307) Q Consensus 118 l~~lp~~a~IGTsS~RR~aql~~~~ 142 (307) .+.+| -++|++|+||+..|..+. T Consensus 5 ~~~~~--iILAS~SprR~eLL~~~g 27 (205) T PRK01526 5 RKNLP--IILASSSPARIELLNRIK 27 (205) T ss_pred CCCCC--EEEECCCHHHHHHHHHCC T ss_conf 78996--999079999999998779 No 71 >PRK05593 rplR 50S ribosomal protein L18; Reviewed Probab=38.88 E-value=30 Score=15.50 Aligned_cols=45 Identities=20% Similarity=0.175 Sum_probs=30.6 Q ss_pred EEEEEEECCCCCEEEEEE--------EECCHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 999999869987899999--------843988899999999999997060563 Q gi|254781037|r 260 YFYGIVLASDGKIFHEVS--------RNGNRCDAINIGRDAAQYIRFISNKNV 304 (307) Q Consensus 260 ~l~a~v~s~DG~~~i~~~--------~~g~~~~a~~LG~~la~~Ll~~g~k~i 304 (307) ++.+++.++++...+... ..++.+.+..+|+.+|+..++.|-+++ T Consensus 36 hiyaQiIDD~~~~tl~saST~~k~~~~~~n~~~A~~vG~~ia~~a~~~~I~~v 88 (115) T PRK05593 36 HIYAQVIDDVKGKTLASASTLEKDLKAGGNKEAAKKVGKLIAERAKAKGIKQV 88 (115) T ss_pred EEEEEEEECCCCCEEEEEECCCHHHHCCCCHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 27999998699979999741424441688499999999999999998799889 No 72 >PRK03098 consensus Probab=37.33 E-value=15 Score=17.50 Aligned_cols=111 Identities=15% Similarity=0.179 Sum_probs=51.8 Q ss_pred EEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEE Q ss_conf 22047530288899842377310330767999998637876556301454542162112223368667358543207899 Q gi|254781037|r 126 VIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICI 205 (307) Q Consensus 126 ~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaI 205 (307) +.+++|+||+..|..+.-++.+.+ -|||..+.+-.. -..++...|-.+-.. +...++ +.++=++=|-+ T Consensus 1 ILAS~S~~R~~lL~~~gi~f~~~~--~~iDE~~~~~~~--p~~~~~~lA~~KA~~----v~~~~~-~~~vI~aDtvv--- 68 (185) T PRK03098 1 ILASGSPRRKELLELAGVPFEIIV--SEVEETIGAYSS--PSEIVQSLALQKASA----VAENNP-DAIVLGADTIV--- 68 (185) T ss_pred CCCCCCHHHHHHHHHCCCCCEEEC--CCCCCCCCCCCC--HHHHHHHHHHHHHHH----HHHHCC-CCEEEEECEEE--- T ss_conf 979699999999986899959977--999999999969--999999999999999----998789-98799877189--- Q ss_pred EEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCEEE Q ss_conf 9852761245567762397888887889999987177648852899999999899 Q gi|254781037|r 206 ETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAYCKGSTLY 260 (307) Q Consensus 206 e~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i~~~~l~ 260 (307) .| |.+ ++ +-+.+..++ +.+|+.|.|-.|.=+.+.|-+.+++.. T Consensus 69 ~~---~g~---il---gKP~~~~eA---~~~L~~lsGk~h~v~T~v~v~~~~~~~ 111 (185) T PRK03098 69 TY---DGR---IL---GKPSDEEEA---KEMLQLLSGKTHEVYTGVAIIAKDKTV 111 (185) T ss_pred EE---CCE---EE---CCCCCHHHH---HHHHHHHCCCEEEEEEEEEEEECCEEE T ss_conf 71---997---84---697659999---999999779917999899999899899 No 73 >PRK11716 DNA-binding transcriptional regulator IlvY; Provisional Probab=35.47 E-value=34 Score=15.16 Aligned_cols=169 Identities=16% Similarity=0.164 Sum_probs=71.6 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 28997079888899999999999974089983389999973274115875567486226999999998709836997403 Q gi|254781037|r 5 PFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHSA 84 (307) Q Consensus 5 ~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHSl 84 (307) +++||+=-+ .+...--.+...+... +|++++++.. || +.++.+.|.+|++|+|+-+. T Consensus 68 ~lri~~~~t-~~~~~lp~~l~~f~~~--~P~v~~~l~~----~~----------------~~~~~~~l~~g~~D~ai~~~ 124 (269) T PRK11716 68 ELSLFCSVT-AAYSHLPPILDRFRAE--HPLVEIKLTT----GD----------------AADAVEKVQSGEADLAIAAK 124 (269) T ss_pred CEEEEECHH-HHHHHCCHHHHHHHHH--CCCCEEEEEC----CC----------------HHHHHHHHHCCCCCEEEEEC T ss_conf 269985148-8777546789999998--8894489842----89----------------99999999869955899937 Q ss_pred CCCCCCCCCCCEEEEECCCCCCCEEEEECCC------------CCCCCCCCCCEEE-CCCCCHHH-HHH-H--HHCCCCC Q ss_conf 4321110122101222365683105665243------------2433245673220-47530288-899-8--4237731 Q gi|254781037|r 85 KDMPTKLLKGLQISAYLPREDIRDVFISHTA------------QSLKDLALHSVIG-TSSLRRKA-LLL-R--WRSDISV 147 (307) Q Consensus 85 KDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~------------~~l~~lp~~a~IG-TsS~RR~a-ql~-~--~~p~l~~ 147 (307) |...++++....+ .++| -++|.+.+ ..+.++| .+.- +++.|+.. ++. . ..|+..+ T Consensus 125 ---p~~~~~~l~~~~l--~~~~-l~~v~p~~~~~~~~~~~~~~~dl~~~p--~il~~~~~~r~~~~~~~~~~~~~p~i~~ 196 (269) T PRK11716 125 ---PETLPSSLAFSPI--DEIP-LVLIAPALPCPVRQQVSQEKPDWSRIP--FILPEHGPARRRIDLWFRRHKIKPNIYA 196 (269) T ss_pred ---CCCCCCCEEEEEE--EEEE-EEEEEECCCCHHHCCCCCCHHHHHCCC--EEECCCCCHHHHHHHHHHHCCCCCCEEE T ss_conf ---8789997699996--5641-899997884154363568844683386--7815898279999999997699971799 Q ss_pred HHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCC Q ss_conf 0330767999998637876556301454542162112223368667358543207899985276 Q gi|254781037|r 148 IDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNN 211 (307) Q Consensus 148 ~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d 211 (307) .+ ...++=++-+..| +.-.++....+++.-....+. .++.+...| ---+++.+|++. T Consensus 197 -e~-~~~~~i~~~V~~G-~GiailP~~~~~~~~~~~~l~-~l~~~~~~~---~~~i~l~~~~~r 253 (269) T PRK11716 197 -QV-SGHEAIVSMVALG-CGVGILPEVVLDNSPVRDRVQ-RLEVPPELT---PFELGVCCQKKR 253 (269) T ss_pred -EE-CCHHHHHHHHHHC-CEEEEHHHHHHHCCCCCCCEE-EEECCCCCC---CEEEEEEEECCC T ss_conf -98-9799999999979-941720599983275579889-974899887---369999997998 No 74 >PRK01441 Maf-like protein; Reviewed Probab=34.32 E-value=21 Score=16.51 Aligned_cols=23 Identities=17% Similarity=0.213 Sum_probs=14.5 Q ss_pred HHHHHHHCCCCCCCEEEEEEEEC Q ss_conf 99998717764885289999999 Q gi|254781037|r 234 RAFLAELDGSCKSAIAGFAYCKG 256 (307) Q Consensus 234 R~fL~~L~GgC~~PIGa~A~i~~ 256 (307) +..|+.|.|--|.=+.+.|-+.. T Consensus 99 ~~~L~~lsGk~H~v~Tgv~l~~~ 121 (207) T PRK01441 99 SQCLRLLSGRNHRVYTGVCLVTP 121 (207) T ss_pred HHHHHHHCCCEEEEEEEEEEEEC T ss_conf 99999975991799989999989 No 75 >COG0263 ProB Glutamate 5-kinase [Amino acid transport and metabolism] Probab=33.93 E-value=13 Score=17.73 Aligned_cols=135 Identities=18% Similarity=0.151 Sum_probs=81.6 Q ss_pred CCCEEECCCCCHHHHHH--HHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCC Q ss_conf 67322047530288899--8423773103307679999986378765563014545421621122233686673585432 Q gi|254781037|r 123 LHSVIGTSSLRRKALLL--RWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQ 200 (307) Q Consensus 123 ~~a~IGTsS~RR~aql~--~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQ 200 (307) .|+.+||+-.|-+-+.- ..+-.+.++-..|+-...+..+-+++-.|+.+...+-.|++- +-.+.+ --++-+ T Consensus 207 sgs~~GTGGM~TKl~AA~iA~~aG~~~iI~~g~~~~~i~~~~~~~~~GT~F~~~~~~~~~a--rk~Wi~-----~~~~~~ 279 (369) T COG0263 207 SGSELGTGGMRTKLEAAKIATRAGVPVIIASGSKPDVILDALEGEAVGTLFEPQAKERLNA--RKQWIA-----GALPPA 279 (369) T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCCCCEEECCCCCHHHH--HHHHHH-----CCCCCC T ss_conf 7777775618889999999997699589946998315899971898851893588643341--688862-----478878 Q ss_pred CEEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEE---E-ECCEEEEEEEEECCCCCEEEEE Q ss_conf 07899985276124556776239788888788999998717764885289999---9-9998999999986998789999 Q gi|254781037|r 201 GAICIETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAY---C-KGSTLYFYGIVLASDGKIFHEV 276 (307) Q Consensus 201 GaIaIe~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~---i-~~~~l~l~a~v~s~DG~~~i~~ 276 (307) |.|-|-. -|++++++ .|.--.|.|+-.- + .++.+.+. ++|...-+. T Consensus 280 G~i~iD~-----------------------GA~~Al~~--~gkSLLpaGV~~V~G~F~rGdvV~i~-----~~g~~iarG 329 (369) T COG0263 280 GEITVDA-----------------------GAVEALLE--QGKSLLPAGVTSVEGNFSRGDVVRIR-----PQGGEIARG 329 (369) T ss_pred CEEEECC-----------------------CHHHHHHH--CCCCCCCCCCEEEEEEECCCCEEEEE-----CCCCEEEEE T ss_conf 5499884-----------------------58999985--69754544326766565598779992-----189516745 Q ss_pred EEECCHHHHHHHHHHHHH Q ss_conf 984398889999999999 Q gi|254781037|r 277 SRNGNRCDAINIGRDAAQ 294 (307) Q Consensus 277 ~~~g~~~~a~~LG~~la~ 294 (307) ....+.++.+.+.-.-.+ T Consensus 330 ~v~Y~s~el~~i~G~~s~ 347 (369) T COG0263 330 LVNYSSDELRKIAGHKSS 347 (369) T ss_pred ECCCCHHHHHHHHCCCHH T ss_conf 426799999988487666 No 76 >PRK09791 putative DNA-binding transcriptional regulator; Provisional Probab=33.40 E-value=36 Score=14.95 Aligned_cols=176 Identities=15% Similarity=0.141 Sum_probs=88.0 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 82899707988889999999999997408998338999997327411587556748622699999999870983699740 Q gi|254781037|r 4 KPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHS 83 (307) Q Consensus 4 ~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHS 83 (307) -.++||.=.| ++..---.+...+.+. +|++++++.. |+ +.++-+.|.+|++|++|.+ T Consensus 95 G~l~Ig~~~~-~~~~~lp~~l~~f~~~--~P~v~i~i~~----~~----------------~~~~~~~l~~g~~D~~i~~ 151 (302) T PRK09791 95 GQINIGMGAS-ISRSLMPAVISRFHQQ--HPQVKVRIME----GQ----------------LVSMINELRQGELDFTINT 151 (302) T ss_pred CEEEEEECHH-HHHHHHHHHHHHHHHH--CCCCEEEEEE----CC----------------HHHHHHHHHCCCCCEEEEC T ss_conf 3399986268-8987119999999998--8797799998----99----------------9999999987997789965 Q ss_pred CCCCCCCCCCCCEEEEECCCCCCCEEEEECCC-----CCCCCCCCCCEEE---CCCCCHHHHHH--HHHCCCCCHHHCCC Q ss_conf 34321110122101222365683105665243-----2433245673220---47530288899--84237731033076 Q gi|254781037|r 84 AKDMPTKLLKGLQISAYLPREDIRDVFISHTA-----QSLKDLALHSVIG---TSSLRRKALLL--RWRSDISVIDFRGK 153 (307) Q Consensus 84 lKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~-----~~l~~lp~~a~IG---TsS~RR~aql~--~~~p~l~~~~iRGN 153 (307) . .|...++++...- --+++.-+++.+++ .++++|....-+. .++.+|...=. ......++.-.=++ T Consensus 152 ~--~~~~~~~~l~~~~--l~~~~~~~v~~~~hpl~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~~~~~~ 227 (302) T PRK09791 152 Y--YQGPYDHEFTFEK--LLEKQFAIFCRPGHPAIGARSIKQLLDYSWTMPTPHGSYYKQLSELLDDQAQTPQVGVVCET 227 (302) T ss_pred C--CCCCCCCCEEEEE--EEEEEEEEEECCCCCCCCCCCHHHHHCCCEEEECCCCCHHHHHHHHHHHCCCCCCEEEEECC T ss_conf 6--7887887647999--77401599983898435798999984799387579997999999999967999856999992 Q ss_pred HHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCC Q ss_conf 7999998637876556301454542162112223368667358543207899985276 Q gi|254781037|r 154 IETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNN 211 (307) Q Consensus 154 v~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d 211 (307) +++=++-+..|.+-+++-....-..+.-... . .++.++..| .-.+++.+|++. T Consensus 228 ~~~~~~lv~~g~gv~ilp~~~~~~~~~~~~l-v-~lpl~~~~~---~~~~~lv~r~~~ 280 (302) T PRK09791 228 FSACISLVAKSDFLSILPEEMGCDPLHGQGL-V-MLPVSEILP---KAAYYLIQRRDS 280 (302) T ss_pred HHHHHHHHHHCCEEEEECHHHHHHHHHCCCE-E-EEECCCCCC---CEEEEEEEECCC T ss_conf 9999999997896796319999878748999-9-997999886---008999998909 No 77 >TIGR02114 coaB_strep phosphopantothenate--cysteine ligase; InterPro: IPR011848 In most bacteria, a single bifunctional protein catalyses phosphopantothenoylcysteine decarboxylase and phosphopantothenate--cysteine ligase activities, sequential steps in coenzyme A biosynthesis. These activities reside in separate proteins encoded by tandem genes in some bacterial lineages. This entry describes proteins from the genera Streptococcus and Enterococcus homologous to the C-terminal region of IPR005252 from INTERPRO, corresponding to phosphopantothenate--cysteine ligase activity .. Probab=33.00 E-value=37 Score=14.90 Aligned_cols=50 Identities=14% Similarity=0.188 Sum_probs=35.4 Q ss_pred CCEEEEEEEEEEC--CCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 8338999997327--4115875567486226999999998709836997403 Q gi|254781037|r 35 PHNIVIIPLSTKG--DRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHSA 84 (307) Q Consensus 35 ~~~~ei~~i~T~G--D~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHSl 84 (307) ++++-+|+-|+-= -...+-+..++-..-=...+|++.+-+..-|+-|||| T Consensus 40 Gh~VtlvTTK~A~kP~~~~~Lsi~Eie~~~~L~~~L~~~v~~kq~d~liHsM 91 (253) T TIGR02114 40 GHEVTLVTTKRALKPEPQLNLSIKEIETVKDLLTTLKELVAEKQHDILIHSM 91 (253) T ss_pred CCEEEEEECCCCCCCCCCCCCCEEEECCHHHHHHHHHHHCCCCCCCEEEEEE T ss_conf 8789787536121888888862486427789999988750125543888851 No 78 >TIGR01382 PfpI intracellular protease, PfpI family; InterPro: IPR006286 Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad . This group of cysteine peptidases belong to MEROPS peptidase family C56 (Pfp1 endopeptidase, clan PC(C)). The member of this family from Pyrococcus horikoshii has been solved to 2 Angstrom resolution. It is an ATP-independent intracellular protease that crystallises as a hexameric ring. Cys-101 is proposed as the active site residue in a catalytic triad with the adjacent His-102 and a Glu residue from an adjacent monomer. A member of this family from Bacillus subtilis, GSP18, has been shown to be expressed in response to several forms of stress. A role in the degradation of small peptides has been suggested. This family is contained in a larger one of the thiamine biosynthesis protein ThiJ and its homologs.; GO: 0016798 hydrolase activity acting on glycosyl bonds. Probab=32.37 E-value=8.2 Score=19.14 Aligned_cols=20 Identities=5% Similarity=0.411 Sum_probs=12.7 Q ss_pred CHHHHHHHHHCCCCCEEEEE Q ss_conf 67999998637876556301 Q gi|254781037|r 153 KIETRLNKLNNNKAHAILLA 172 (307) Q Consensus 153 Nv~TRl~Kl~~g~~DaiilA 172 (307) +||-++.+++..+|||++++ T Consensus 64 ~vd~~~~~v~~~~YDal~ip 83 (189) T TIGR01382 64 TVDATIDDVDPEDYDALVIP 83 (189) T ss_pred EEECCEECCCHHHCCEEEEE T ss_conf 87441214785256478971 No 79 >KOG0605 consensus Probab=31.67 E-value=39 Score=14.76 Aligned_cols=76 Identities=21% Similarity=0.301 Sum_probs=43.3 Q ss_pred HCCCCCCCCEEEEE--EECCC---CHHHHHHHH--HCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCEEEEEEEE Q ss_conf 73585432078999--85276---124556776--239788888788999998717764885289999999989999999 Q gi|254781037|r 193 DFPPSPGQGAICIE--THVNN---PKAQELVKV--INHEDTWDSVNCERAFLAELDGSCKSAIAGFAYCKGSTLYFYGIV 265 (307) Q Consensus 193 ~~~PA~gQGaIaIe--~r~~d---~~i~~il~~--Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~i~~~~l~l~a~v 265 (307) +.+.-.||||-|=+ ||+.| -...++|++ .--.+--.-|.|||.+|..-++.|-+-. -||.-+.+.|+|-=.. T Consensus 144 e~Lk~IgkGAfGeVrLarKk~Tg~iyAmK~LkKS~M~~~~Qv~hV~aERdiL~~~ds~~vVKL-yYsFQD~~~LYLiMEy 222 (550) T KOG0605 144 ELLKVIGKGAFGEVRLARKKDTGEIYAMKILKKSEMLKKNQVEHVRAERDILAEVDSPWVVKL-YYSFQDKEYLYLIMEY 222 (550) T ss_pred HHHEEECCCCCEEEEEEEECCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEE-EEEECCCCEEEEEEEE T ss_conf 333034466530689999726786786410027777764317778889887640477767888-8885078705999971 Q ss_pred ECCCC Q ss_conf 86998 Q gi|254781037|r 266 LASDG 270 (307) Q Consensus 266 ~s~DG 270 (307) + +-| T Consensus 223 l-PGG 226 (550) T KOG0605 223 L-PGG 226 (550) T ss_pred C-CCC T ss_conf 3-784 No 80 >TIGR01915 npdG NADPH-dependent F420 reductase; InterPro: IPR010185 Members of this family are found only in species with evidence of coenzyme F420. All members of this family are believed to act as NADPH-dependent F420 reductase.. Probab=31.12 E-value=40 Score=14.71 Aligned_cols=29 Identities=24% Similarity=0.131 Sum_probs=24.7 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHHHHHH Q ss_conf 98828997079888899999999999974 Q gi|254781037|r 2 EKKPFRIGTRCSPLALAHAFETRSSLMDV 30 (307) Q Consensus 2 ~~~~i~IgtR~S~LAl~Qa~~v~~~L~~~ 30 (307) ++++|+||||+..=|---|..+.+.|.+. T Consensus 30 ~~~~iIIGSR~~EkA~EaA~ka~e~l~~~ 58 (233) T TIGR01915 30 VDNEIIIGSRDKEKAEEAAAKALEELGDQ 58 (233) T ss_pred CCCCEEEECCCHHHHHHHHHHHHHHHHCC T ss_conf 77555770488456999999999999708 No 81 >TIGR00742 yjbN TIM-barrel protein, yjbN family; InterPro: IPR004653 This family represents one branch of COG0042 from COG (Predicted TIM-barrel enzymes, possibly dehydrogenases, nifR3 family) although NifR3 itself, a protein of unknown function associated with nitrogen regulation in Rhodobacter capsulatus, is not a member of this branch. Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archaea. Most dihydrouridines are found in the D loop of t-RNAs. The role of dihydrouridine in tRNA is currently unknown, but may increase conformational flexibility of the tRNA . They show a distant relationship to alpha/beta (TIM) barrel enzymes such as dihydroorotate dehydrogenase and glycolate oxidase.; GO: 0016491 oxidoreductase activity, 0050660 FAD binding, 0008033 tRNA processing. Probab=30.40 E-value=21 Score=16.46 Aligned_cols=119 Identities=14% Similarity=0.129 Sum_probs=77.2 Q ss_pred CCCCCCCCCEEECCCCCHHHHHHHHHCC-CCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCC Q ss_conf 4332456732204753028889984237-731033076799999863787655630145454216211222336866735 Q gi|254781037|r 117 SLKDLALHSVIGTSSLRRKALLLRWRSD-ISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFP 195 (307) Q Consensus 117 ~l~~lp~~a~IGTsS~RR~aql~~~~p~-l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~ 195 (307) +..+-|=---+||+|+...+++-.+-.+ ..+-++.=||.===.|+.+|.|.|+.||.|-+-= + +-+..-...-. T Consensus 51 ~~~E~PvAlQlgg~dp~~l~~ca~i~e~h~gydEiNLNVGCPSdrvQng~fGACLMg~a~lVa----~-cv~~M~~~v~i 125 (326) T TIGR00742 51 SPEEKPVALQLGGSDPNDLAKCAKIAEKHRGYDEINLNVGCPSDRVQNGNFGACLMGNADLVA----D-CVKAMQEAVEI 125 (326) T ss_pred CCCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCEEECCCCCCCHHHHCCCCCHHHCCCHHHHH----H-HHHHHHHHCCC T ss_conf 767786578507898899999999998645874221566883122204441111116823689----9-99998971578 Q ss_pred CCCCCCEEEEEEE----CCC--CHHHHHHHHHCCHH--HHHHHHHHHHHHHHH Q ss_conf 8543207899985----276--12455677623978--888878899999871 Q gi|254781037|r 196 PSPGQGAICIETH----VNN--PKAQELVKVINHED--TWDSVNCERAFLAEL 240 (307) Q Consensus 196 PA~gQGaIaIe~r----~~d--~~i~~il~~Ind~~--T~~~v~aER~fL~~L 240 (307) |-.=--=|||.-+ ++| +++.+.++.+.+.- -..-|-|=.+.|+-| T Consensus 126 PvtvK~RiGId~~ssdykndSYe~l~~Fv~~v~~~Gec~~FivHARkAwL~Gl 178 (326) T TIGR00742 126 PVTVKHRIGIDELSSDYKNDSYEELCDFVEIVSGKGECQNFIVHARKAWLSGL 178 (326) T ss_pred CCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCC T ss_conf 82242014756443322323378999999986178861134687899985788 No 82 >PRK12680 transcriptional regulator CysB-like protein; Reviewed Probab=30.21 E-value=41 Score=14.61 Aligned_cols=175 Identities=11% Similarity=0.036 Sum_probs=80.5 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 28997079888899999999999974089983389999973274115875567486226999999998709836997403 Q gi|254781037|r 5 PFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHSA 84 (307) Q Consensus 5 ~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHSl 84 (307) .++|||-.|.-.-+=. .+.....+. +|++++.+..- -..++.+.|.+|++|+|+=+. T Consensus 94 ~L~Ig~~~t~a~~~LP-~vi~~f~~~--yP~V~l~l~~~--------------------~~~~l~e~l~~g~~Diai~~~ 150 (327) T PRK12680 94 QLTLTTTHTQARFVLP-PAVAQIKQA--YPQVSVHLQQA--------------------AESAALDLLGQGDADIAVVST 150 (327) T ss_pred EEEEEEECHHHHHHHH-HHHHHHHHH--CCCCEEEEEEC--------------------CCHHHHHHHHCCCCCEEEECC T ss_conf 7999984227777568-999999986--89967999989--------------------828899999789940999447 Q ss_pred CCCCCCCCCC-CEEEEECCCCCCCEEEEECC--------CCCCCCCCCCCEEE--CCCCCHHH-----HHHHHHCCCCCH Q ss_conf 4321110122-10122236568310566524--------32433245673220--47530288-----899842377310 Q gi|254781037|r 85 KDMPTKLLKG-LQISAYLPREDIRDVFISHT--------AQSLKDLALHSVIG--TSSLRRKA-----LLLRWRSDISVI 148 (307) Q Consensus 85 KDlP~~~~~~-l~i~a~~~R~d~rD~lv~~~--------~~~l~~lp~~a~IG--TsS~RR~a-----ql~~~~p~l~~~ 148 (307) -. ..++. ..+-+ -.+++-+++.++ ..++++|-....|. .++.-|.. .-..+.|++.+ T Consensus 151 ~~---~~~~~~~~~pl---~~~~~~vvvP~~HpL~~~~~~i~L~~La~~plI~~~~g~~~r~~id~af~~~Gl~P~iv~- 223 (327) T PRK12680 151 AG---GEPSAGIAVPL---YRWRRLVVVPRGHALDTPRTAPDMAALAEHPLISYDSSTRPGSSLQRAFAQVGLEPSIAL- 223 (327) T ss_pred CC---CCCCCCEEEEE---EECCEEEEEECCCCCCCCCCCCCHHHHHCCCEEECCCCCCHHHHHHHHHHHCCCCCCEEE- T ss_conf 77---79976547675---223579999789810058999999998089889638998679999999997799764799- Q ss_pred HHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCCCHHHHHH Q ss_conf 3307679999986378765563014545421621122233686673585432078999852761245567 Q gi|254781037|r 149 DFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNNPKAQELV 218 (307) Q Consensus 149 ~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d~~i~~il 218 (307) =-.+.++=..-...|- .--|+|.-+..-. +.--..+|.+...| .-.--|-.|.+ ..++... T Consensus 224 -ea~~~d~Ik~~V~~Gl-GV~Ivp~mA~~~~---d~~l~~~~~~~l~~---~~~~~~~~~~~-~~lr~~~ 284 (327) T PRK12680 224 -TALDADLIKTYVRAGL-GVGLVAEMAVNAF---DEDLRAWPAPAPIA---ECIAWAVLPRD-RVLRDYA 284 (327) T ss_pred -EECCHHHHHHHHHHCC-EEEECHHHHHCCC---CCCEEEEECCCCCC---CCEEEEEEECC-CHHHHHH T ss_conf -9995999999999498-6188164762876---89838940645777---56059999788-4689999 No 83 >COG3265 GntK Gluconate kinase [Carbohydrate transport and metabolism] Probab=30.08 E-value=35 Score=15.01 Aligned_cols=42 Identities=24% Similarity=0.392 Sum_probs=36.2 Q ss_pred CCCEEECCCCCHHH--HHHHHHCCCCCHHHCCCHHHHHHHHHCC Q ss_conf 67322047530288--8998423773103307679999986378 Q gi|254781037|r 123 LHSVIGTSSLRRKA--LLLRWRSDISVIDFRGKIETRLNKLNNN 164 (307) Q Consensus 123 ~~a~IGTsS~RR~a--ql~~~~p~l~~~~iRGNv~TRl~Kl~~g 164 (307) ++.+|..|++.|.+ +|+.-.|++.|+-+-|.-++-+++|..- T Consensus 68 ~~~vi~CSALKr~YRD~LR~~~~~~~Fv~L~g~~~~i~~Rm~~R 111 (161) T COG3265 68 KHVVIACSALKRSYRDLLREANPGLRFVYLDGDFDLILERMKAR 111 (161) T ss_pred CCEEEECHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHC T ss_conf 81699627887889999854599738999638899999998760 No 84 >TIGR00172 maf septum formation protein Maf; InterPro: IPR003697 Maf is a putative inhibitor of septum formation in eukaryotes, bacteria, and archaea. The Maf protein shares substantial amino acid sequence identity with the Escherichia coli OrfE protein . . Probab=29.73 E-value=34 Score=15.15 Aligned_cols=20 Identities=30% Similarity=0.368 Sum_probs=12.5 Q ss_pred CCCEEECCCCCHHHHHHHHH Q ss_conf 67322047530288899842 Q gi|254781037|r 123 LHSVIGTSSLRRKALLLRWR 142 (307) Q Consensus 123 ~~a~IGTsS~RR~aql~~~~ 142 (307) ..-+.|+.||||+..|..+- T Consensus 3 ~~liLAS~SP~R~~LL~~~~ 22 (212) T TIGR00172 3 KELILASTSPRRKELLEELG 22 (212) T ss_pred CCCEECCCCHHHHHHHHHHC T ss_conf 87342478868999999847 No 85 >pfam10080 DUF2318 Predicted membrane protein (DUF2318). Members of this family of hypothetical bacterial proteins have no known function. Probab=28.75 E-value=23 Score=16.25 Aligned_cols=29 Identities=24% Similarity=0.410 Sum_probs=18.4 Q ss_pred HHHHHHHC---CCCCCCEEEEEEEECCEEEEEE Q ss_conf 99998717---7648852899999999899999 Q gi|254781037|r 234 RAFLAELD---GSCKSAIAGFAYCKGSTLYFYG 263 (307) Q Consensus 234 R~fL~~L~---GgC~~PIGa~A~i~~~~l~l~a 263 (307) |.++..+| |||+ ||=.-.++++++|.+.. T Consensus 61 ~~~~~~IG~~~GGCN-PiP~~~~~~~~~i~I~~ 92 (102) T pfam10080 61 RFFLPSIGPKSGGCN-PIPLPYTVDGGNIIIDK 92 (102) T ss_pred EEEHHHCCCCCCCCC-CCCCCCEECCCEEEEEH T ss_conf 975322278788868-64476187499899889 No 86 >TIGR00422 valS valyl-tRNA synthetase; InterPro: IPR002303 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=26.82 E-value=47 Score=14.23 Aligned_cols=105 Identities=20% Similarity=0.151 Sum_probs=71.7 Q ss_pred CCCHHHCCCHHHHHH---HHH-HHHH---------CCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCCC-- Q ss_conf 875567486226999---999-9987---------0983699740343211101221012223656831056652432-- Q gi|254781037|r 52 NRTLTEIGGKGLFTE---EIE-KKLI---------SGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTAQ-- 116 (307) Q Consensus 52 ~~~l~~~g~kg~Ftk---ele-~aLl---------~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~~-- 116 (307) +||-.-+|..+|+|+ |== ..|. ..+|-| ..|=-++.-=|+-+.-|+|=.|++|+-+++.+. T Consensus 232 TRPETm~gDtAvaVhP~DeRYYk~L~GK~v~~PL~~r~ipi----iaDe~VD~EFGTG~VKvTPAHD~ND~~~g~RH~LL 307 (970) T TIGR00422 232 TRPETMFGDTAVAVHPEDERYYKHLIGKKVILPLTGRKIPI----IADEYVDMEFGTGAVKVTPAHDFNDYEVGKRHNLL 307 (970) T ss_pred CCCHHHHHCEEEEECCCCCHHHHHHCCCEEEECCCCCEEEE----ECCCCCCCCCCCCEEEEECCCCCCCHHHCCEECCC T ss_conf 67225642828876789704578554988972456768437----61775462417512776087497760013412114 Q ss_pred CC-------CCCCCCC-EEECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHH Q ss_conf 43-------3245673-220475302888998423773103307679999986 Q gi|254781037|r 117 SL-------KDLALHS-VIGTSSLRRKALLLRWRSDISVIDFRGKIETRLNKL 161 (307) Q Consensus 117 ~l-------~~lp~~a-~IGTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl 161 (307) .+ ..|.++| +++|+.. ...-+-..|..|.+.+-|-+|=.+|++. T Consensus 308 e~i~~l~~dG~Lne~a~~~~tG~~-~~Y~iP~~y~GL~~~eAR~kiv~~L~~~ 359 (970) T TIGR00422 308 EFINILDEDGLLNENAEVIDTGKE-DVYAIPGEYQGLERFEARKKIVEDLKEE 359 (970) T ss_pred CHHHHCCCCCCCCCEECEEECCCC-CCCCCCHHCCCCCHHHHHHHHHHHHHHC T ss_conf 143331799750110011007772-3310652205888899999999999866 No 87 >TIGR01313 therm_gnt_kin carbohydrate kinase, thermoresistant glucokinase family; InterPro: IPR006001 This family of proteins includes thermoresistant and thermosensitve isozymes of gluconate kinase (gluconokinase) in E. coli and other related proteins; members of this family are often named by similarity to the thermostable isozyme. These proteins show homology to shikimate kinases and adenylate kinases but not to gluconate kinases from the FGGY family of carbohydrate kinases (IPR000577 from INTERPRO).; GO: 0016301 kinase activity, 0005975 carbohydrate metabolic process. Probab=26.39 E-value=48 Score=14.18 Aligned_cols=44 Identities=23% Similarity=0.335 Sum_probs=37.5 Q ss_pred CCEEECCCCCHHH--HHH----HHHCCCCCHHHCCCHHHHHHHHHC--CCCC Q ss_conf 7322047530288--899----842377310330767999998637--8765 Q gi|254781037|r 124 HSVIGTSSLRRKA--LLL----RWRSDISVIDFRGKIETRLNKLNN--NKAH 167 (307) Q Consensus 124 ~a~IGTsS~RR~a--ql~----~~~p~l~~~~iRGNv~TRl~Kl~~--g~~D 167 (307) +.+|+.|||-|++ .|+ .-.|.+.|+-+.|.-++=+++|.+ |.|= T Consensus 79 ~~~~~CSALKr~YRD~LR~s~~~~~~~~~FiyL~~~~~~~~~R~~~RkGHFM 130 (175) T TIGR01313 79 LVIITCSALKRKYRDILRSSLEEAEPELHFIYLSGSKEVILKRMKSRKGHFM 130 (175) T ss_pred CEEEEEECCHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHCCCCCC T ss_conf 4478840111355566542202689843788636898999999610789986 No 88 >PRK09986 DNA-binding transcriptional activator XapR; Provisional Probab=25.59 E-value=49 Score=14.09 Aligned_cols=170 Identities=15% Similarity=0.125 Sum_probs=83.1 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 82899707988889999999999997408998338999997327411587556748622699999999870983699740 Q gi|254781037|r 4 KPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVHS 83 (307) Q Consensus 4 ~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVHS 83 (307) -.|+||+=.|-..-. --.+.....+. +|++++++... .+.++.+.|.+|++|+|+=+ T Consensus 81 g~l~Ig~~~~~~~~~-l~~~l~~f~~~--~P~i~v~i~~~--------------------~~~~~~~~l~~~~~D~a~~~ 137 (278) T PRK09986 81 GRIELGVVGTALWGR-MRPAMRHFLKE--NPNVEVLFREK--------------------SPSMQMALLERRELDAGIWR 137 (278) T ss_pred CCEEEEECHHHHHHH-HHHHHHHHHHH--CCCEEEEEEEC--------------------CHHHHHHHHHCCCCCEEEEC T ss_conf 733687421688876-42899999987--89839999978--------------------86999999977997589963 Q ss_pred CCCCCCCCCCCCE--------EEEECCCCCCCEEEEECCCCCCCCCCCCCEEEC--CCCCHHHHHHHHH--CCC--CCHH Q ss_conf 3432111012210--------122236568310566524324332456732204--7530288899842--377--3103 Q gi|254781037|r 84 AKDMPTKLLKGLQ--------ISAYLPREDIRDVFISHTAQSLKDLALHSVIGT--SSLRRKALLLRWR--SDI--SVID 149 (307) Q Consensus 84 lKDlP~~~~~~l~--------i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGT--sS~RR~aql~~~~--p~l--~~~~ 149 (307) + |...++++. +..+.+++.| +-.++..++++|...-.|.. .+..+. ++.... .++ .+.- T Consensus 138 ~---~~~~~~~~~~~~l~~~~~~~~~~~~hp---la~~~~i~~~~l~~~~~i~~~~~~~~~~-~~~~~~~~~g~~~~~~~ 210 (278) T PRK09986 138 M---ALEPNPGFTSLRLHESAFAVAVPEEHL---LASKSSVPLKALRNEYFVTLPSVHSDWG-FLQRVCQQAGFSPQIIR 210 (278) T ss_pred C---CCCCCCCCEEEEEEECCCEEEECCCCC---CCCCCCCCHHHHCCCCEEEECCCCCHHH-HHHHHHHHCCCCCCEEE T ss_conf 7---777887742688774163799659873---3369999999985999798679986699-99999997799851799 Q ss_pred HCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEECCC Q ss_conf 30767999998637876556301454542162112223368667358543207899985276 Q gi|254781037|r 150 FRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETHVNN 211 (307) Q Consensus 150 iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r~~d 211 (307) -=.+.++=+.-+..|-.=+ +++. -+.+....+.....|+ ..+|. -+++.-|+++ T Consensus 211 ~~~~~~~~~~~V~~G~Gia-ilp~-~~~~~~~~~v~~~Pl~--~~~~~----~~~lv~~~~~ 264 (278) T PRK09986 211 EVNEPQTVLAMVSMGIGIT-LVAD-SYAQMPWPGVVFRPLK--ERIPA----DLYIVYHQQQ 264 (278) T ss_pred EECCHHHHHHHHHHCCEEE-EEHH-HHHCCCCCCEEEEECC--CCCCE----EEEEEECCCC T ss_conf 9998999999999799899-8417-7742378999999889--99803----8999984999 No 89 >COG3043 NapB Nitrate reductase cytochrome c-type subunit [Energy production and conversion] Probab=25.05 E-value=16 Score=17.32 Aligned_cols=82 Identities=20% Similarity=0.083 Sum_probs=57.6 Q ss_pred CCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEE---EEECCCCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCHHHC Q ss_conf 98369974034321110122101222365683105---665243243324567322047530288899842377310330 Q gi|254781037|r 75 GEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDV---FISHTAQSLKDLALHSVIGTSSLRRKALLLRWRSDISVIDFR 151 (307) Q Consensus 75 g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~---lv~~~~~~l~~lp~~a~IGTsS~RR~aql~~~~p~l~~~~iR 151 (307) +.=-+-=||.++.+...+..-++.| -.|+..+|. -|+.. -+-|- .|.+.+-.|+||-.-+.-+-|.-.++||= T Consensus 68 nQPP~IPHsi~~Yqvtkn~N~CLsC-H~~e~s~~tGAt~IS~T--Hfmdr-dg~v~a~vSPRRYFClQCHVPQaD~kPlV 143 (155) T COG3043 68 NQPPMIPHSIEGYQVTKNTNRCLSC-HSVENSRTTGATRISPT--HFMDR-DGKVLADVSPRRYFCLQCHVPQADVKPLV 143 (155) T ss_pred CCCCCCCCCCCCCEEECCCCHHHHC-CCHHHHHHCCCCCCCCC--EEECC-CCCCCCCCCCCCEEEEECCCCCCCCCCCC T ss_conf 7999776334671542253440213-68877866189756742--14656-88863555753034310346601266567 Q ss_pred CCHHHHHHH Q ss_conf 767999998 Q gi|254781037|r 152 GKIETRLNK 160 (307) Q Consensus 152 GNv~TRl~K 160 (307) ||-=.++.+ T Consensus 144 ~N~F~p~~~ 152 (155) T COG3043 144 GNTFKPMKG 152 (155) T ss_pred CCCCCCCHH T ss_conf 876665210 No 90 >pfam02621 DUF178 Putative periplasminc binding protein (DUF178). This family of proteins belong to the periplasmic binding protein clan. Probab=24.31 E-value=52 Score=13.93 Aligned_cols=101 Identities=19% Similarity=0.298 Sum_probs=54.9 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCC-----CCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCHHHHHH Q ss_conf 99999998709836997403432111-----0122101222365683105665243243324567322047530288899 Q gi|254781037|r 65 TEEIEKKLISGEIDCAVHSAKDMPTK-----LLKGLQISAYLPREDIRDVFISHTAQSLKDLALHSVIGTSSLRRKALLL 139 (307) Q Consensus 65 tkele~aLl~g~iDiAVHSlKDlP~~-----~~~~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a~IGTsS~RR~aql~ 139 (307) ..+|...|++|++|++.=|.=..|.. +-|++.| --.....-|++..+ ..+.+ ..+|..++--+.+.++ T Consensus 34 P~~Ln~~l~~g~lDv~~iSs~~y~~~~~~y~ilp~~~i---~s~G~v~Svll~s~-~p~~~---~~~Ial~~~S~TS~~L 106 (248) T pfam02621 34 PETLNRLLLEGELDVAAISSIAYARIADDYALLPDLSI---AALGRVYSVLLVSR-EELEG---KLRVALPGESTTSVLL 106 (248) T ss_pred HHHHHHHHHCCCCCEEECCHHHHHHCHHHHEECCCCEE---EEECCCCCEEEEEC-CCCCC---CCEEECCCCCHHHHHH T ss_conf 99999999869998875589999857202315777458---85279751389956-88676---8728769977899999 Q ss_pred H--HHCC-CCCHHHCCCHHHHHHHHHCCCCCE-EEEEHHHH Q ss_conf 8--4237-731033076799999863787655-63014545 Q gi|254781037|r 140 R--WRSD-ISVIDFRGKIETRLNKLNNNKAHA-ILLAYAGI 176 (307) Q Consensus 140 ~--~~p~-l~~~~iRGNv~TRl~Kl~~g~~Da-iilA~Agl 176 (307) . +.+. .++.+++.-+. .-+.++ .|| ++.-..+| T Consensus 107 lkill~~~~~~~~~~~~~~---~~~~~~-~da~LlIGD~Al 143 (248) T pfam02621 107 LKLLLPEAYKLVPMPDEIE---PAVLEG-EDAGLLIGDEAL 143 (248) T ss_pred HHHHHHHCCCEEECCCHHH---HHHHCC-CCEEEEECHHHC T ss_conf 9999864572431671466---787448-975899750434 No 91 >PRK05429 gamma-glutamyl kinase; Provisional Probab=23.87 E-value=51 Score=14.00 Aligned_cols=129 Identities=19% Similarity=0.170 Sum_probs=71.5 Q ss_pred CCCEEECCCCCHHHHHHH--HHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCC-CCCC Q ss_conf 673220475302888998--4237731033076799999863787655630145454216211222336866735-8543 Q gi|254781037|r 123 LHSVIGTSSLRRKALLLR--WRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFP-PSPG 199 (307) Q Consensus 123 ~~a~IGTsS~RR~aql~~--~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~-PA~g 199 (307) .++.+||+-.+-+-+.-+ ....+.+.-..|.-+.-|.++.+|+--|+++... -+++. .+ ..|+. -..- T Consensus 209 ~~s~~GtGGM~tKl~AA~~a~~~Gi~~iIa~G~~~~~l~~i~~g~~~GT~f~~~-~~~~~--~r------K~Wia~~~~~ 279 (372) T PRK05429 209 AGSGLGTGGMATKLEAARIATRAGIPVVIASGRKPDVLLRLLAGEAVGTLFLPQ-EKPLS--AR------KQWIAHALQP 279 (372) T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCEEECCCCCCCHHHHHHCCCCCEEEEECC-CCCCC--HH------HHHHHHCCCC T ss_conf 877887784377999999999779989992499965899997799726999438-88761--78------9999644778 Q ss_pred CCEEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEE---E-ECCEEEEEEEEECCCCCEEEE Q ss_conf 207899985276124556776239788888788999998717764885289999---9-999899999998699878999 Q gi|254781037|r 200 QGAICIETHVNNPKAQELVKVINHEDTWDSVNCERAFLAELDGSCKSAIAGFAY---C-KGSTLYFYGIVLASDGKIFHE 275 (307) Q Consensus 200 QGaIaIe~r~~d~~i~~il~~Ind~~T~~~v~aER~fL~~L~GgC~~PIGa~A~---i-~~~~l~l~a~v~s~DG~~~i~ 275 (307) +|.|-| |+ .|++++. =.|..-.|+|+... + .|+.+. +.+++|..+-+ T Consensus 280 ~G~i~i----D~-------------------GA~~AL~--~~g~SLL~~GI~~v~G~F~~GD~V~----i~~~~g~~iar 330 (372) T PRK05429 280 AGELVV----DA-------------------GAVKALL--ERGKSLLPAGVTAVEGDFSRGDVVR----IVDPDGREIAR 330 (372) T ss_pred CCEEEE----CH-------------------HHHHHHH--HCCCCCCCCCCEEEECEECCCCEEE----EECCCCCEEEE T ss_conf 708998----85-------------------2899997--5787216514189968765998799----99999989999 Q ss_pred EEEECCHHHHHHHH Q ss_conf 99843988899999 Q gi|254781037|r 276 VSRNGNRCDAINIG 289 (307) Q Consensus 276 ~~~~g~~~~a~~LG 289 (307) .-...+.++...+- T Consensus 331 G~~nyss~el~~i~ 344 (372) T PRK05429 331 GLVNYSSEELRRIA 344 (372) T ss_pred EECCCCHHHHHHHH T ss_conf 97348999999881 No 92 >PRK05331 putative glycerol-3-phosphate acyltransferase PlsX; Provisional Probab=23.77 E-value=33 Score=15.23 Aligned_cols=38 Identities=21% Similarity=0.368 Sum_probs=15.8 Q ss_pred ECCCCCHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEE Q ss_conf 0475302888998423773103307679999986378765563 Q gi|254781037|r 128 GTSSLRRKALLLRWRSDISVIDFRGKIETRLNKLNNNKAHAIL 170 (307) Q Consensus 128 GTsS~RR~aql~~~~p~l~~~~iRGNv~TRl~Kl~~g~~Daii 170 (307) |+...|+..+++.-.|+++| -|||+-| -+-+|++|-+| T Consensus 189 G~~~~k~a~~lL~~~~~~nf---~G~iEg~--di~~g~~DVvV 226 (317) T PRK05331 189 GNELVKEAYELLKAAPSINF---IGNVEGR--DILKGTADVVV 226 (317) T ss_pred CCHHHHHHHHHHHCCCCCCE---EEEECCC--CCCCCCCCEEE T ss_conf 67999999999964999805---8010401--14689997899 No 93 >CHL00139 rpl18 ribosomal protein L18; Validated Probab=23.74 E-value=53 Score=13.86 Aligned_cols=63 Identities=14% Similarity=0.219 Sum_probs=37.1 Q ss_pred HHHCCCCCCC-EEEEEEEECCEEEEEEEEECC-CCCEEEEEEE-----------ECCHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 8717764885-289999999989999999869-9878999998-----------43988899999999999997060563 Q gi|254781037|r 238 AELDGSCKSA-IAGFAYCKGSTLYFYGIVLAS-DGKIFHEVSR-----------NGNRCDAINIGRDAAQYIRFISNKNV 304 (307) Q Consensus 238 ~~L~GgC~~P-IGa~A~i~~~~l~l~a~v~s~-DG~~~i~~~~-----------~g~~~~a~~LG~~la~~Ll~~g~k~i 304 (307) ..+-|...-| +.+|- .+ -++.+++.++ .|......+. .++.+.+..+|+.+|+..+++|-+++ T Consensus 7 ~ki~gt~~rpRL~V~r--Sn--khiyaQiIDD~~g~tl~saST~~k~~~~~~~~~~n~e~A~~vG~~ia~~a~~~gI~~V 82 (109) T CHL00139 7 KKIQGTAERPRLSVFR--SN--KHIYAQIIDDTNGKTLVSCSTLDPDVKSNLSIGSTCDASKLVGQKLAKKSLKKGITKV 82 (109) T ss_pred HHCCCCCCCCEEEEEE--CC--CEEEEEEEECCCCCEEEEEECCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 2604899988899997--08--8189999987999899996233677765514786699999999999999998799999 No 94 >pfam04261 Dyp_perox Dyp-type peroxidase family. This family of dye-decolourising peroxidases lack a typical heme-binding region. Probab=23.36 E-value=54 Score=13.82 Aligned_cols=57 Identities=9% Similarity=0.037 Sum_probs=29.2 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHCCCC-----CCCEEEEEEEE-----CC--EEEEEEEEECCCC Q ss_conf 4556776239788888788999998717764-----88528999999-----99--8999999986998 Q gi|254781037|r 214 AQELVKVINHEDTWDSVNCERAFLAELDGSC-----KSAIAGFAYCK-----GS--TLYFYGIVLASDG 270 (307) Q Consensus 214 i~~il~~Ind~~T~~~v~aER~fL~~L~GgC-----~~PIGa~A~i~-----~~--~l~l~a~v~s~DG 270 (307) ++.+.-.++.++....-+-|+.|=+...-|- ..|.-+|.... ++ .|.=+++-|+..| T Consensus 187 vqri~h~l~~Wd~l~~~eQE~viGR~k~~gapL~~~~~p~~SHv~~a~~~~~~~~~~ilRr~~py~d~g 255 (311) T pfam04261 187 VQRILHDLEAWDRLPLEEQEDVIGRRKSDGAELDGDVKPADSHVRLANPKEDGKGLKILRRSLPYGDVG 255 (311) T ss_pred HHHHHHCHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCEEECCCCCCCCC T ss_conf 328765878671699989998619646889837878777310587567786688775786888989889 No 95 >pfam07302 AroM AroM protein. This family consists of several bacterial and archaeal AroM proteins. In Escherichia coli the aroM gene is cotranscribed with aroL. The function of this family is unknown. Probab=22.86 E-value=55 Score=13.75 Aligned_cols=112 Identities=19% Similarity=0.240 Sum_probs=67.9 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCC--EEECCCCCHHHHHHHHH Q ss_conf 9999999870983699740343211101221012223656831056652432433245673--22047530288899842 Q gi|254781037|r 65 TEEIEKKLISGEIDCAVHSAKDMPTKLLKGLQISAYLPREDIRDVFISHTAQSLKDLALHS--VIGTSSLRRKALLLRWR 142 (307) Q Consensus 65 tkele~aLl~g~iDiAVHSlKDlP~~~~~~l~i~a~~~R~d~rD~lv~~~~~~l~~lp~~a--~IGTsS~RR~aql~~~~ 142 (307) +-|| .+++...+++.=|++=|==+ ++ +|.++.|.++- |+|+++ |+.|. +++-+-+-| T Consensus 16 ~pei-~~~l~~~v~i~e~GaLDgls--~~--eI~~~aP~~ge-~~LvTR-------L~DG~~v~~s~~~v~~-------- 74 (221) T pfam07302 16 LPEI-TEHLGEPIEITEIGALDGLS--RE--EIAALAPGPGE-DVLVTR-------LRDGQQVTVSKQKIER-------- 74 (221) T ss_pred HHHH-HHHHCCCCCEEEECCCCCCC--HH--HHHHHCCCCCC-CEEEEE-------ECCCCEEEEEHHHHHH-------- T ss_conf 6999-98708997289843557999--99--99971999998-441687-------0699878988899889-------- Q ss_pred CCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHCCCC----C-CCCEEEEEEEC Q ss_conf 3773103307679999986378765563014545421621122233686673585----4-32078999852 Q gi|254781037|r 143 SDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDFPPS----P-GQGAICIETHV 209 (307) Q Consensus 143 p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~~PA----~-gQGaIaIe~r~ 209 (307) -++..|++|++..||.|+|...|--. ++..+.. .+.|+.++|. . +-+-+||-+-. T Consensus 75 ----------~lq~~i~~Le~~G~d~ilLlCTG~F~-~l~~~~~-ll~P~ril~~lV~a~~~~~~lGVvvP~ 134 (221) T pfam07302 75 ----------RLQQVIEKLDKEGVDVILLLCTGEFP-KLSARNL-LLEPQRILPPLVASIVDGHQLGVIVPL 134 (221) T ss_pred ----------HHHHHHHHHHHCCCCEEEEEECCCCC-CCCCCCC-EEEHHHHHHHHHHHHCCCCEEEEEECC T ss_conf ----------89999999986799899996147789-8656772-441188678999986789859999557 No 96 >PRK11545 gntK gluconate kinase 1; Provisional Probab=21.89 E-value=58 Score=13.63 Aligned_cols=90 Identities=17% Similarity=0.257 Sum_probs=52.5 Q ss_pred CCCCCEEECCCCCHHH--HHHHHHCCCCCHHHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCCCCCHHHC-CCC Q ss_conf 4567322047530288--8998423773103307679999986378765563014545421621122233686673-585 Q gi|254781037|r 121 LALHSVIGTSSLRRKA--LLLRWRSDISVIDFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKEILNIEDF-PPS 197 (307) Q Consensus 121 lp~~a~IGTsS~RR~a--ql~~~~p~l~~~~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~~l~~~~~-~PA 197 (307) -..+++|..|.++|.+ +|+...|++.|+-+.|+-+.=.++|..-+ +=.|... |.+ +| | +.+ .|. T Consensus 79 ~~~~~VlaCSALKr~YRd~Lr~~~~~~~fv~L~g~~~~i~~Rl~~R~--~HFmp~~------LL~--SQ-f--~tLE~P~ 145 (177) T PRK11545 79 TNKVSLIVCSALKKHYRDLLREGNPNLSFIYLKGDFDVIESRLKARK--GHFFKTQ------MLV--TQ-F--ETLQEPG 145 (177) T ss_pred CCCCEEEEECCCCHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHCC--CCCCCHH------HHH--HH-H--HHHCCCC T ss_conf 69966998701119999999806997599997299999999997464--6899878------999--89-9--9817999 Q ss_pred C-CCCEEEEEEECC-CCHHHHHHHHHCC Q ss_conf 4-320789998527-6124556776239 Q gi|254781037|r 198 P-GQGAICIETHVN-NPKAQELVKVINH 223 (307) Q Consensus 198 ~-gQGaIaIe~r~~-d~~i~~il~~Ind 223 (307) + -.+++.|-+... +.-+.++++.|.. T Consensus 146 ~~E~~~~~vdi~~~~e~iv~~il~~l~~ 173 (177) T PRK11545 146 ADETDVLVVDIDQPLEGVVASTIEVIKK 173 (177) T ss_pred CCCCCEEEEECCCCHHHHHHHHHHHHHC T ss_conf 8889869997899999999999999975 No 97 >cd03575 NTR_WFIKKN NTR domain, WFIKKN subfamily; WFIKKN proteins contain a C-terminal NTR domain and are putative secreted proteins which may be multivalent protease inhibitors that act on serine proteases as well as metalloproteases. Human WFIKKN and a related protein sharing the same domain architecture were observed to have distinct tissue expression patterns. WFIKKN is also referred to as growth and differentiation factor-associated serum protein-1 (GASP-1). It inhibits the activity of mature myostatin, a specific regulator of skeletal muscle mass and a member of the TGFbeta superfamily. Probab=21.77 E-value=52 Score=13.93 Aligned_cols=25 Identities=28% Similarity=0.159 Sum_probs=18.7 Q ss_pred CCCCCCCCCEEECCCCCHHHHHHHH Q ss_conf 4332456732204753028889984 Q gi|254781037|r 117 SLKDLALHSVIGTSSLRRKALLLRW 141 (307) Q Consensus 117 ~l~~lp~~a~IGTsS~RR~aql~~~ 141 (307) ...-|.++|-|++||.||..-|+.. T Consensus 82 Gmavl~pdSfv~~ss~rRv~klrE~ 106 (109) T cd03575 82 GMAVLQPDSYVRASSERRVRKLREV 106 (109) T ss_pred CEEEECCCCEECCCCHHHHHHHHHH T ss_conf 8577557762100438899999998 No 98 >COG3572 GshA Gamma-glutamylcysteine synthetase [Coenzyme metabolism] Probab=21.70 E-value=46 Score=14.26 Aligned_cols=34 Identities=38% Similarity=0.495 Sum_probs=15.7 Q ss_pred HHHHHCCCCCCCCCCCCHHHCCCCCCCCEEEEEEE Q ss_conf 54542162112223368667358543207899985 Q gi|254781037|r 174 AGIKRLKKEKVIKEILNIEDFPPSPGQGAICIETH 208 (307) Q Consensus 174 Agl~RL~l~~~i~~~l~~~~~~PA~gQGaIaIe~r 208 (307) -|..|+++. .|.+....=..+++.|||+|-+|-- T Consensus 63 ~g~~r~gwe-pI~d~gn~I~L~~p~g~~aIslEpg 96 (456) T COG3572 63 DGMQRLGWE-PIMDVGNIIGLVEPIGQGAISLEPG 96 (456) T ss_pred HHHHHCCCE-EECCCCCEEEECCCCCCCEEEECCC T ss_conf 556316840-2024565455315788842796468 No 99 >PRK07765 para-aminobenzoate synthase component II; Provisional Probab=21.50 E-value=45 Score=14.35 Aligned_cols=25 Identities=16% Similarity=0.141 Sum_probs=15.0 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 2699999999870983699740343 Q gi|254781037|r 62 GLFTEEIEKKLISGEIDCAVHSAKD 86 (307) Q Consensus 62 g~Ftkele~aLl~g~iDiAVHSlKD 86 (307) .-||.-|-..|-+-.++.-|.+-.+ T Consensus 10 DSFT~ni~~~lr~lg~~v~V~~~d~ 34 (221) T PRK07765 10 DSFVFNLVQYLGQLGVEAEVWRNDD 34 (221) T ss_pred CCHHHHHHHHHHHCCCEEEEEECCC T ss_conf 9339999999987799189998998 No 100 >TIGR03414 ABC_choline_bnd choline ABC transporter, periplasmic binding protein. Partial phylogenetic profiling (PMID:16930487) vs. the genome property of glycine betaine biosynthesis from choline consistently reveals a member of this ABC transporter periplasmic binding protein as the best match, save for the betaine biosynthesis enzymes themselves. Genomes often carry several paralogs, one encoded together with the permease and ATP-binding components and another encoded next to a choline-sulfatase gene, suggesting that different members of this protein family interact with shared components and give some flexibility in substrate. Of two members from Sinorhizobium meliloti 1021, one designated ChoX has been shown experimentally to bind choline (though not various related compounds such as betaine) and to be required for about 60 % of choline uptake. Members of this protein have an invariant Cys residue near the N-terminus and likely are lipoproteins. Probab=21.20 E-value=60 Score=13.54 Aligned_cols=62 Identities=18% Similarity=0.078 Sum_probs=44.6 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEE Q ss_conf 98828997079888899999999999974089983389999973274115875567486226999999998709836997 Q gi|254781037|r 2 EKKPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAV 81 (307) Q Consensus 2 ~~~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAV 81 (307) +.++|+||.=+=.=+.+.|+.+...|++. ++++|++...+ +. +=++|-+|++|+.+ T Consensus 6 ~~~~V~i~~~~W~s~~~~t~v~~~iLE~~----GY~Ve~~~~~~----------------~~----~~~~la~GdiDv~~ 61 (290) T TIGR03414 6 SCKTVRFADVGWTDITATTALASVLLEGL----GYQPKVTLLSV----------------PV----TYAGLKDGDLDVFL 61 (290) T ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHHHC----CCCEEEEECCH----------------HH----HHHHHHCCCCEEEE T ss_conf 09966996068648999999999999974----99648996774----------------99----99999769970787 Q ss_pred ECCCCCCC Q ss_conf 40343211 Q gi|254781037|r 82 HSAKDMPT 89 (307) Q Consensus 82 HSlKDlP~ 89 (307) -.+ +|+ T Consensus 62 e~W--~p~ 67 (290) T TIGR03414 62 GNW--MPA 67 (290) T ss_pred ECC--CCC T ss_conf 111--478 No 101 >COG1732 OpuBC Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) [Cell envelope biogenesis, outer membrane] Probab=21.12 E-value=60 Score=13.53 Aligned_cols=201 Identities=21% Similarity=0.241 Sum_probs=105.7 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 88289970798888999999999999740899833899999732741158755674862269999999987098369974 Q gi|254781037|r 3 KKPFRIGTRCSPLALAHAFETRSSLMDVHKIPPHNIVIIPLSTKGDRITNRTLTEIGGKGLFTEEIEKKLISGEIDCAVH 82 (307) Q Consensus 3 ~~~i~IgtR~S~LAl~Qa~~v~~~L~~~~~~p~~~~ei~~i~T~GD~~~~~~l~~~g~kg~Ftkele~aLl~g~iDiAVH 82 (307) .++|+|||-...=..+=++...+.|++. ++.++.+ ..+|+ |--+..||.+|+|||-+- T Consensus 32 ~~~I~VgsK~~tE~~IL~~m~~~lle~~----~~kv~~~--------------~~lG~----t~v~~~Al~~G~IDiYpE 89 (300) T COG1732 32 AKTIVVGSKIFTEQYILGNILKQLLEKN----GIKVEDK--------------TGLGG----TAVVRNALKSGDIDIYPE 89 (300) T ss_pred CCCEEEECCCCCHHHHHHHHHHHHHHHC----CCCEEEC--------------CCCCC----HHHHHHHHHCCCCCEEEE T ss_conf 7887983477838899999999999865----9914541--------------68885----299999997598776854 Q ss_pred C--------CC-CCCCCCCCCCEE------------EEECCCCCCCE--EEEECCC-------CCCCCCCCC---CEEEC Q ss_conf 0--------34-321110122101------------22236568310--5665243-------243324567---32204 Q gi|254781037|r 83 S--------AK-DMPTKLLKGLQI------------SAYLPREDIRD--VFISHTA-------QSLKDLALH---SVIGT 129 (307) Q Consensus 83 S--------lK-DlP~~~~~~l~i------------~a~~~R~d~rD--~lv~~~~-------~~l~~lp~~---a~IGT 129 (307) = +| |=|...++.-.. .-++++-+.++ +|..+.. +++.||.+. -+.|. T Consensus 90 YTGt~~~~~lk~~~~~~~dp~~~y~~vK~~~~k~~~l~wl~p~~~nNTyA~av~~~~A~~~~i~TiSDLak~~~~l~~g~ 169 (300) T COG1732 90 YTGTALFSFLKKDPPASKDPKKVYETVKKLDAKQFKLVWLKPAGFNNTYALAVRKDVAEKYNLETISDLAKHSNQLKLGA 169 (300) T ss_pred ECCHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCEEEEECHHHHHHHCCCCHHHHHHHHHHCEECC T ss_conf 32314566345676454798999999999888607978965468776058996198898819816999998665346227 Q ss_pred CCC--CHH----HHHHHHHCCCC--CHHHCCCHHHHHHHHHCCCCCEEEEEH---HHHHHCCCCCCCCCCCCHHHCCCCC Q ss_conf 753--028----88998423773--103307679999986378765563014---5454216211222336866735854 Q gi|254781037|r 130 SSL--RRK----ALLLRWRSDIS--VIDFRGKIETRLNKLNNNKAHAILLAY---AGIKRLKKEKVIKEILNIEDFPPSP 198 (307) Q Consensus 130 sS~--RR~----aql~~~~p~l~--~~~iRGNv~TRl~Kl~~g~~DaiilA~---Agl~RL~l~~~i~~~l~~~~~~PA~ 198 (307) ++- -|. +.-+.+.=++. +..+-+. -+..-+.+|+.|+ ++|+ +.|.+.+|.-. .-|...|+| T Consensus 170 ~~eF~~R~DG~~~l~k~Yg~~~~~~~~~m~~g--l~y~Al~~g~~d~-~~~YsTDg~I~~~~L~VL---kDDK~~fP~-- 241 (300) T COG1732 170 DSEFAERADGLPALQKAYGFDFKPDLRTMDGG--LTYQALKNGTVDA-ADAYSTDGRIAAYGLKVL---KDDKGFFPP-- 241 (300) T ss_pred CHHHHCCCCCCHHHHHHHCCCCCCCCEECCCH--HHHHHHHCCCCCE-EEECCCCCCCCCCCCEEE---ECCCCCCCC-- T ss_conf 84231365122899998487668874433815--8999987499776-763145522122685797---068767998-- Q ss_pred CCCEEEE--EEECCCCHHHHHHHHHCC---HHHHHHHHHH Q ss_conf 3207899--985276124556776239---7888887889 Q gi|254781037|r 199 GQGAICI--ETHVNNPKAQELVKVINH---EDTWDSVNCE 233 (307) Q Consensus 199 gQGaIaI--e~r~~d~~i~~il~~Ind---~~T~~~v~aE 233 (307) -|++.-| +..++.+++.+++.++.. .+|+...+++ T Consensus 242 Y~~apvvre~vlk~~Pel~~~l~~l~~kid~~tMq~LNy~ 281 (300) T COG1732 242 YQAAPVVREEVLKKHPELKTILNKLSGKIDTETMQALNYR 281 (300) T ss_pred CCCCCEECHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 7656402187776798899999887545899999998777 No 102 >KOG4134 consensus Probab=20.68 E-value=61 Score=13.47 Aligned_cols=65 Identities=20% Similarity=0.290 Sum_probs=42.9 Q ss_pred HHCCCHHHHHHHHHCCCCCEEEEEHHHHHHCCCCCCCCC---------CCCHHHCCCCCCCCEEEEEEECCCCHH Q ss_conf 330767999998637876556301454542162112223---------368667358543207899985276124 Q gi|254781037|r 149 DFRGKIETRLNKLNNNKAHAILLAYAGIKRLKKEKVIKE---------ILNIEDFPPSPGQGAICIETHVNNPKA 214 (307) Q Consensus 149 ~iRGNv~TRl~Kl~~g~~DaiilA~Agl~RL~l~~~i~~---------~l~~~~~~PA~gQGaIaIe~r~~d~~i 214 (307) .++-.++|-+-+..++ .||||||+-+|+-||-.-.++. .-+.-.|-|-+||=--|+.-......+ T Consensus 50 ~i~ehld~~vl~y~~~-l~GivLgydnIKvLg~~aki~~D~pf~hlwi~adfyVf~Pk~Gd~LeG~Vn~vS~sHI 123 (253) T KOG4134 50 LIEEHLDTKVLFYDSG-LDGIVLGYDNIKVLGQTAKIRADDPFMHLWINADFYVFRPKAGDILEGVVNHVSRSHI 123 (253) T ss_pred HHHHHHHHHHHHHCCC-CCCEEEEECCEEEECCCCCEECCCCCEEEEEEEEEEEECCCCCCEEEEEEEECCHHHH T ss_conf 9999866788650467-7725886436276211166203898237999655899778887744456553264352 Done!