RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780782|ref|YP_003065195.1| succinyl-diaminopimelate desuccinylase [Candidatus Liberibacter asiaticus str. psy62] (389 letters) >gnl|CDD|183838 PRK13009, PRK13009, succinyl-diaminopimelate desuccinylase; Reviewed. Length = 375 Score = 602 bits (1555), Expect = e-173 Identities = 191/385 (49%), Positives = 239/385 (62%), Gaps = 13/385 (3%) Query: 2 TPDCLEHLIQLIKCPSVTPQDGGAFFILVNTLKLLGFSIEEKDFQTKNTSIVKNLYARFG 61 D LE LI+ PSVTP D G +L L+ LGF+ E DF VKNL+AR G Sbjct: 1 MSDVLELAQDLIRRPSVTPDDAGCQDLLAERLEALGFTCERMDF-----GDVKNLWARRG 55 Query: 62 TEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFI 121 TE PHL FAGH DVVPPGD WT PPF TI +G +YGRG DMKGS+A F+ A RF+ Sbjct: 56 TEGPHLCFAGHTDVVPPGDLEAWTSPPFEPTIRDGMLYGRGAADMKGSLAAFVVAAERFV 115 Query: 122 PKYKNF-GSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGDTIKI 180 + + GSI+ LIT DEEGPAINGT K+L W++ +GEK D CIVGEPT +GD IK Sbjct: 116 AAHPDHKGSIAFLITSDEEGPAINGTVKVLEWLKARGEKIDYCIVGEPTSTERLGDVIKN 175 Query: 181 GRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNMEITT 240 GRRGSL+G++T+ G QGHVAYPHL +NPI P L +L +D GN F PT+++IT Sbjct: 176 GRRGSLTGKLTVKGVQGHVAYPHLADNPIHLAAPALAELAATEWDEGNEFFPPTSLQITN 235 Query: 241 IDVGNPSKNVIPAQVKMSFNIRFNDLWNEKTLKEEIRSRLIKGIQNVPKLSHTVHFSSPV 300 ID G + NVIP +++ FN RF+ ++LK + + L K L +T+ ++ Sbjct: 236 IDAGTGATNVIPGELEAQFNFRFSTEHTAESLKARVEAILDK-----HGLDYTLEWTLS- 289 Query: 301 SPVFLTHDRKLTSLLSKSIYNTTGNIPLLSTSGGTSDARFIKDYC-PVIEFGLVGRTMHA 359 FLT KL + +I TG P LSTSGGTSDARFI DY V+EFG V T+H Sbjct: 290 GEPFLTPPGKLVDAVVAAIEAVTGITPELSTSGGTSDARFIADYGAQVVEFGPVNATIHK 349 Query: 360 LNENASLQDLEDLTCIYENFLQNWF 384 +NE S+ DLE LT IYE L+ Sbjct: 350 VNECVSVADLEKLTRIYERILERLL 374 >gnl|CDD|162269 TIGR01246, dapE_proteo, succinyl-diaminopimelate desuccinylase, proteobacterial clade. This model describes a proteobacterial subset of succinyl-diaminopimelate desuccinylases. An experimentally confirmed Gram-positive lineage succinyl-diaminopimelate desuccinylase has been described for Corynebacterium glutamicum, and a neighbor-joining tree shows the seed members, SP:Q59284, and putative archaeal members such as TrEMBL:O58003 in a single clade. However, the archaeal members differ substantially, share a number of motifs with acetylornithine deacetylases rather than succinyl-diaminopimelate desuccinylases, and are not taken as trusted examples of succinyl-diaminopimelate desuccinylases. This model is limited to proteobacterial members for this reason. Length = 370 Score = 431 bits (1109), Expect = e-121 Identities = 175/379 (46%), Positives = 229/379 (60%), Gaps = 13/379 (3%) Query: 6 LEHLIQLIKCPSVTPQDGGAFFILVNTLKLLGFSIEEKDFQTKNTSIVKNLYARFGTEAP 65 E +LI PSVTP D G I+ L+ LGF IE F KNL+A GT P Sbjct: 2 TELAKELISRPSVTPNDAGCQDIIAERLEKLGFEIEWMHFGD-----TKNLWATRGTGEP 56 Query: 66 HLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFIPKYK 125 L FAGH DVVP G W+ PPF +GK+YGRG DMKGS+A FI A RF+ K Sbjct: 57 VLAFAGHTDVVPAGPEEQWSSPPFEPVERDGKLYGRGAADMKGSLAAFIVAAERFVKKNP 116 Query: 126 NF-GSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGDTIKIGRRG 184 + GSISLLIT DEEG AI+GTKK++ + + E D CIVGEP+ +GD IK GRRG Sbjct: 117 DHKGSISLLITSDEEGTAIDGTKKVVETLMARDELIDYCIVGEPSSVKKLGDVIKNGRRG 176 Query: 185 SLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNMEITTIDVG 244 S++G +TI G QGHVAYPHL NPI P L +LT I +D GN F PT+++IT I G Sbjct: 177 SITGNLTIKGIQGHVAYPHLANNPIHKAAPALAELTAIKWDEGNEFFPPTSLQITNIHAG 236 Query: 245 NPSKNVIPAQVKMSFNIRFNDLWNEKTLKEEIRSRLIKGIQNVPKLSHTVHFSSPVSPVF 304 + NVIP ++ + FN+RF+ +++ LK+ + + I + L + + +S P F Sbjct: 237 TGANNVIPGELYVQFNLRFSTEVSDEILKQRVEA-----ILDQHGLDYDLEWSLSGEP-F 290 Query: 305 LTHDRKLTSLLSKSIYNTTGNIPLLSTSGGTSDARFIKDY-CPVIEFGLVGRTMHALNEN 363 LT+D KL ++I T G P LST GGTSD RFI V+EFG V T+H +NE Sbjct: 291 LTNDGKLIDKAREAIEETNGIKPELSTGGGTSDGRFIALMGAEVVEFGPVNATIHKVNEC 350 Query: 364 ASLQDLEDLTCIYENFLQN 382 S++DLE L+ +Y++ L+N Sbjct: 351 VSIEDLEKLSDVYQDLLEN 369 >gnl|CDD|181522 PRK08651, PRK08651, succinyl-diaminopimelate desuccinylase; Reviewed. Length = 394 Score = 207 bits (528), Expect = 6e-54 Identities = 131/404 (32%), Positives = 184/404 (45%), Gaps = 42/404 (10%) Query: 1 MTPDCLEHLIQLIKCPSVTPQD---GGAFFILVNTLKLLGFSIE----EKDFQTKNTSIV 53 M D +E L LIK P+V P L +TL+ LGFS E ++ K+ Sbjct: 4 MMFDIVEFLKDLIKIPTVNPPGENYEEIAEFLRDTLEELGFSTEIIEVPNEYVKKHDGPR 63 Query: 54 KNLYARFGTEAPHLMFAGHIDVVPPGDFNHWTYP-PFSATIAEGKIYGRGIVDMKGSIAC 112 NL AR G+ PHL F GH DVVPPG+ W+ PF + +GK+YGRG DMKG IA Sbjct: 64 PNLIARRGSGNPHLHFNGHYDVVPPGE--GWSVNVPFEPKVKDGKVYGRGASDMKGGIAA 121 Query: 113 FIAAVARFIPKYKNFGSISLLITGDEE--GPAINGTKKMLSWIEKKGEKWDACIVGEPTC 170 +AA R P G+I L I DEE G GT ++ E+ D IVGEP+ Sbjct: 122 LLAAFERLDPAGD--GNIELAIVPDEETGG---TGTGYLV---EEGKVTPDYVIVGEPS- 172 Query: 171 NHIIG-DTIKIGRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFD-TGN 228 G D I IG RG + G + ++GKQ H + P L N + +L + Sbjct: 173 ----GLDNICIGHRGLVWGVVKVYGKQAHASTPWLGINAFEAAAKIAERLKSSLSTIKSK 228 Query: 229 TTFSPTNMEITTIDVGNP------SKNVIPAQVKMSFNIRFNDLWNEKTLKEEIRSRLIK 282 + T+ +G P N++P S + R + E EE+R L Sbjct: 229 YEYDDERGAKPTVTLGGPTVEGGTKTNIVPGYCAFSIDRRL--IPEETA--EEVRDELEA 284 Query: 283 GIQNV-PKLSHTVHFS-SPVSPVFLT-HDRKLTSLLSKSIYNTTGNIPLLSTSGGTSDAR 339 + V P+L V F +P S F+T D +L L ++I G P + S G +DAR Sbjct: 285 LLDEVAPELGIEVEFEITPFSEAFVTDPDSELVKALREAIREVLGVEPKKTISLGGTDAR 344 Query: 340 FIKDYC-PVIEFGLVGRTM-HALNENASLQDLEDLTCIYENFLQ 381 F P + +G + HA +E ++D+E +YE L+ Sbjct: 345 FFGAKGIPTVVYGPGELELAHAPDEYVEVKDVEKAAKVYEEVLK 388 >gnl|CDD|162596 TIGR01910, DapE-ArgE, acetylornithine deacetylase or succinyl-diaminopimelate desuccinylase. This group of sequences contains annotations for both acetylornithine deacetylase and succinyl-diaminopimelate desuccinylase, but does not contain any members with experimental characterization. Bacillus, Staphylococcus and Sulfolobus species contain multiple hits to this subfamily and each may have a separate activity. Determining which is which must await further laboratory research. Length = 375 Score = 180 bits (458), Expect = 6e-46 Identities = 109/391 (27%), Positives = 166/391 (42%), Gaps = 37/391 (9%) Query: 6 LEHLIQLIKCPSVTPQDGGAFFI---LVNTLKLLGFSIEEKDFQTKNTSIV-KNLYARFG 61 +E L LI PSV P G I + + L+ GFS + + ++ K + G Sbjct: 1 VELLKDLISIPSVNPPGGNEETIANYIKDLLREFGFSTDVIEITDDRLKVLGKVVVKEPG 60 Query: 62 T-EAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARF 120 L+F GH DVVP GD W PF +GK+YGRG DMKG + + A+ Sbjct: 61 NGNEKSLIFNGHYDVVPAGDLELWKTDPFKPVEKDGKLYGRGATDMKGGLVALLYALKAI 120 Query: 121 I-PKYKNFGSISLLITGDEEGPAINGTKKMLSWIEKKGEKW-DACIVGEPTCNHIIGDTI 178 K G+I L DEE +G L +++ K D ++ EP+ GD I Sbjct: 121 REAGIKPNGNIILQSVVDEE----SGEAGTLYLLQRGYFKDADGVLIPEPSG----GDNI 172 Query: 179 KIGRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTN----------IGFDTGN 228 IG +GS+ ++ + GKQ H ++P N I L L+ +L GF G Sbjct: 173 VIGHKGSIWFKLRVKGKQAHASFPQFGVNAIMKLAKLITELNELEEHIYARNSYGFIPGP 232 Query: 229 TTFSPTNMEITTIDVGNPSKNVIPAQVKMSFNIRFNDLWNEKTLKEEIRSRLIKGI--QN 286 TF+P I G+ N +P + S ++R N +K+ I ++K + + Sbjct: 233 ITFNP-----GVIKGGD-WVNSVPDYCEFSIDVRIIPEENLDEVKQIIED-VVKALSKSD 285 Query: 287 VPKLSHTVHFSSPVSPVFLTHDRKLTSLLSKSIYNTTGNIPLLSTSGGTSDARFIKD-YC 345 + P D +L L I G P + S G +DARF++ Sbjct: 286 GWLYENEPVVKWS-GPNETPPDSRLVKALEAIIKKVRGIEPEVLVSTGGTDARFLRKAGI 344 Query: 346 PVIEFGL-VGRTMHALNENASLQDLEDLTCI 375 P I +G T H +NE S+++L + T + Sbjct: 345 PSIVYGPGDLETAHQVNEYISIKNLVESTKV 375 >gnl|CDD|181490 PRK08588, PRK08588, succinyl-diaminopimelate desuccinylase; Reviewed. Length = 377 Score = 139 bits (352), Expect = 2e-33 Identities = 105/355 (29%), Positives = 159/355 (44%), Gaps = 53/355 (14%) Query: 55 NLYARFGTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFI 114 NL A G+ +P L +GH+DVV GD + WTY PF T +GK+YGRG DMK +A + Sbjct: 50 NLVAEIGSGSPVLALSGHMDVVAAGDVDKWTYDPFELTEKDGKLYGRGATDMKSGLAALV 109 Query: 115 AAVARF----IPKYKNFGSISLLITGDEEGPAINGTKKMLSWIEKKG--EKWDACIVGEP 168 A+ G+I LL T EE + G K++ +KG + DA I+GEP Sbjct: 110 IAMIELKEQGQLLN---GTIRLLATAGEEVGEL-GAKQL----TEKGYADDLDALIIGEP 161 Query: 169 TCNHIIGDTIKIGRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDT-- 226 + G I +GS+ ++T GK H + P L N I L+ ++ FD+ Sbjct: 162 S-----GHGIVYAHKGSMDYKVTSTGKAAHSSMPELGVNAIDPLLEFYNEQ-KEYFDSIK 215 Query: 227 -GNTTFSPTNMEITTIDVGNPSKNVIPAQVKMSFNIR----FNDLWNEKTLKEEIRSRLI 281 N +T I+ G N +P + ++ FNIR ++ N++ + S L Sbjct: 216 KHNPYLGGLTHVVTIINGGE-QVNSVPDEAELEFNIRTIPEYD---NDQ-----VISLLQ 266 Query: 282 KGIQNV-----PKLSHTVHFSSPVSPVFLTHDRKLTSL---LSKSIYNTTGNIPLLSTSG 333 + I V +LS ++ + PV D KL L ++KS IPL + G Sbjct: 267 EIINEVNQNGAAQLSLDIYSNHR--PVASDKDSKLVQLAKDVAKSYVGQD--IPLSAIPG 322 Query: 334 GTSDARFIK---DYCPVIEFGL-VGRTMHALNENASLQDLEDLTCIYENFLQNWF 384 T + F+K D+ PVI FG T H ++E IY+ + + Sbjct: 323 ATDASSFLKKKPDF-PVIIFGPGNNLTAHQVDEYVEKDMYLKFIDIYKEIIIQYL 376 >gnl|CDD|181015 PRK07522, PRK07522, acetylornithine deacetylase; Provisional. Length = 385 Score = 118 bits (298), Expect = 2e-27 Identities = 68/226 (30%), Positives = 102/226 (45%), Gaps = 37/226 (16%) Query: 55 NLYARFG-TEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACF 113 NL+A G + ++ +GH DVVP D WT PF T +G++YGRG DMKG IA Sbjct: 54 NLFATIGPADRGGIVLSGHTDVVPV-DGQAWTSDPFRLTERDGRLYGRGTCDMKGFIAAA 112 Query: 114 IAAVARFI------PKYKNFGSISLLITGDEE-GPAINGTKKMLSWIEKKGEKWDACIVG 166 +AAV P + L + DEE G G M++ + ++G K CIVG Sbjct: 113 LAAVPELAAAPLRRP-------LHLAFSYDEEVGCL--GVPSMIARLPERGVKPAGCIVG 163 Query: 167 EPTCNHIIGDTIKIGRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIG--- 223 EPT + +G +G + T+ G+ H + N I L+ L ++ Sbjct: 164 EPTSMRPV-----VGHKGKAAYRCTVRGRAAHSSLAPQGVNAIEYAARLIAHLRDLADRL 218 Query: 224 -----FDTGNTTFSP--TNMEITTIDVGNPSKNVIPAQVKMSFNIR 262 FD F P + ++ TI G + N++PA+ + F R Sbjct: 219 AAPGPFDAL---FDPPYSTLQTGTIQGGT-ALNIVPAECEFDFEFR 260 >gnl|CDD|183836 PRK13004, PRK13004, peptidase; Reviewed. Length = 399 Score = 116 bits (292), Expect = 1e-26 Identities = 76/308 (24%), Positives = 121/308 (39%), Gaps = 44/308 (14%) Query: 3 PDCLEHLIQLIKCPSVTPQDGGAFFILVNTLKLLGFSIEEKDFQTKNTSIVKNLYARFGT 62 D L LI+ PS + + + ++ +GF E D N+ G Sbjct: 15 ADMTRFLRDLIRIPSESGDEKRVVKRIKEEMEKVGFDKVEIDPM-------GNVLGYIGH 67 Query: 63 EAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFIP 122 + F HID V GD +W + PF +G+IYGRG D KG +A + A I Sbjct: 68 GKKLIAFDAHIDTVGIGDIKNWDFDPFEGEEDDGRIYGRGTSDQKGGMASMVYAAK-IIK 126 Query: 123 KYKNFGSISLLITG-----DEEGPAINGTKKMLSW---IEKKGEKWDACIVGEPT-CNHI 173 +L +TG D +G L W IE+ K D ++ EPT N Sbjct: 127 DLGLDDEYTLYVTGTVQEEDCDG---------LCWRYIIEEDKIKPDFVVITEPTDLN-- 175 Query: 174 IGDTIKIGRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGNTTFS- 232 I G+RG + + G H + P +N I + P+L++L + + F Sbjct: 176 ----IYRGQRGRMEIRVETKGVSCHGSAPERGDNAIYKMAPILNELEELNPNLKEDPFLG 231 Query: 233 PTNMEITTIDVGNPSKNVIPAQVKMSFNIRFNDLWNEKTLK---EEIRSRLIKGIQNVPK 289 + ++ I +PS+ +P +S + R L +T + EIR+ + V K Sbjct: 232 KGTLTVSDIFSTSPSRCAVPDSCAISIDRR---LTVGETWESVLAEIRA-----LPAVKK 283 Query: 290 LSHTVHFS 297 + V Sbjct: 284 ANAKVSMY 291 >gnl|CDD|183841 PRK13013, PRK13013, succinyl-diaminopimelate desuccinylase; Reviewed. Length = 427 Score = 109 bits (274), Expect = 1e-24 Identities = 79/313 (25%), Positives = 131/313 (41%), Gaps = 36/313 (11%) Query: 55 NLYARF--GTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIAC 112 NL AR + + F H DVV G + WT PF + +G+IYGRG DMKG +A Sbjct: 73 NLVARRQGARDGDCVHFNSHHDVVEVG--HGWTRDPFGGEVKDGRIYGRGACDMKGGLAA 130 Query: 113 FIAAVARFIPKYKNF-GSISLLITGDEEGPAINGTKKMLSWIEKKG----EKWDACIVGE 167 I A F+ Y +F GSI + T DEE G ++++ ++G ++ I+ E Sbjct: 131 SIIAAEAFLAVYPDFAGSIEISGTADEESGGFGG----VAYLAEQGRFSPDRVQHVIIPE 186 Query: 168 PTCNHIIGDTIKIGRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFD-- 225 P D I +G RG E+ G+ H + P L ++ IR + +L ++ F Sbjct: 187 PLNK----DRICLGHRGVWWAEVETRGRIAHGSMPFLGDSAIRHMGAVLAEIEERLFPLL 242 Query: 226 TGNTTFSP--------TNMEITTIDVGNPSKNV---------IPAQVKMSFNIRFNDLWN 268 T P + + I +I G P ++ + + ++ + RF + Sbjct: 243 ATRRTAMPVVPEGARQSTLNINSIHGGEPEQDPDYTGLPAPCVADRCRIVIDRRFLIEED 302 Query: 269 EKTLKEEIRSRLIKGIQNVPKLSHTVHFSSPVSPVFLTHDRKLTSLLSKSIYNTTGNIPL 328 +K EI + L + + P ++ + V P D + ++ +I G Sbjct: 303 LDEVKAEITALLERLKRARPGFAYEIRDLFEVLPTMTDRDAPVVRSVAAAIERVLGRQAD 362 Query: 329 LSTSGGTSDARFI 341 S GT D + I Sbjct: 363 YVVSPGTYDQKHI 375 >gnl|CDD|130947 TIGR01892, AcOrn-deacetyl, acetylornithine deacetylase (ArgE). This model represents a clade of acetylornithine deacetylases from proteobacteria. This enzyme is the final step of the "acetylated" ornithine biosynthesis pathway. The enzyme is closely related to dapE, succinyl-diaminopimelate desuccinylase, and outside of this clade annotation is very inaccurate as to which function should be ascribed to genes. Length = 364 Score = 106 bits (266), Expect = 1e-23 Identities = 80/239 (33%), Positives = 111/239 (46%), Gaps = 25/239 (10%) Query: 33 LKLLGFSIEEKDF-QTKNTSIVKNLYARFG-TEAPHLMFAGHIDVVPPGDFNHWTYPPFS 90 L+ LGFS+E + F S NL A G + A L +GH DVVP D WT PF Sbjct: 28 LEALGFSVEVQPFPDGAEKS---NLVAVIGPSGAGGLALSGHTDVVP-YDDAAWTRDPFR 83 Query: 91 ATIAEGKIYGRGIVDMKGSIACFIAAVARFIPKYKNFGSISLLITGDEEGPAINGTKKML 150 T +G++YGRG DMKG +AC +AA A + + + L +T DEE G KM Sbjct: 84 LTEKDGRLYGRGTCDMKGFLACALAA-APDLAAEQLKKPLHLALTADEE-VGCTGAPKM- 140 Query: 151 SWIEKKGEKWDACIVGEPTCNHIIGDTIKI-GRRGSLSGEITIHGKQGHVAYPHLTENPI 209 IE + I+GEPT I + +G S E+T+ G+ GH +YP N I Sbjct: 141 --IEAGAGRPRHAIIGEPT------RLIPVRAHKGYASAEVTVRGRSGHSSYPDSGVNAI 192 Query: 210 RGLIPLLHQLT----NIGFDTGNTTFSP--TNMEITTIDVGNPSKNVIPAQVKMSFNIR 262 L +L + + + F+P T + I I G + N+IP + F R Sbjct: 193 FRAGRFLQRLVHLADTLLREDLDEGFTPPYTTLNIGVIQ-GGKAVNIIPGACEFVFEWR 250 >gnl|CDD|184437 PRK13983, PRK13983, diaminopimelate aminotransferase; Provisional. Length = 400 Score = 103 bits (259), Expect = 9e-23 Identities = 87/290 (30%), Positives = 121/290 (41%), Gaps = 52/290 (17%) Query: 6 LEHLIQLIKCPSVTPQDGG------AFFILVNTLKLLGF-SIEEKDFQTKNTSIVK--NL 56 +E L +LI P+V P GG A ++ + LK GF +E D N+ Sbjct: 8 IELLSELIAIPAVNPDFGGEGEKEKAEYLE-SLLKEYGFDEVERYDAPDPRVIEGVRPNI 66 Query: 57 YAR--FGTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVD-MKGSIACF 113 A+ G L H+DVVPPGD + W PF + +GKIYGRG D +G ++ Sbjct: 67 VAKIPGGDGKRTLWIISHMDVVPPGDLSLWETDPFKPVVKDGKIYGRGSEDNGQGIVSSL 126 Query: 114 IAAVARF----IPKYKNFGSISLLITGDEEGPAINGTKKMLSWIEKKGE----KWDACIV 165 +A A PKY ++ L DEE G+K + ++ KK K D +V Sbjct: 127 LALKALMDLGIRPKY----NLGLAFVSDEE----TGSKYGIQYLLKKHPELFKKDDLILV 178 Query: 166 ---GEPTCNHIIGDTIKIGRRGSLSGEITIHGKQGHVAYPHLTENPIR-------GLIPL 215 G P G I+I + L + T+ GKQ H + P N R L Sbjct: 179 PDAGNPD-----GSFIEIAEKSILWLKFTVKGKQCHASTPENGINAHRAAADFALELDEA 233 Query: 216 LHQ---LTNIGFDTGNTTFSPTNMEITTIDVGNPSKNVIPAQVKMSFNIR 262 LH+ + FD +TF PT E V N N IP + F+ R Sbjct: 234 LHEKFNAKDPLFDPPYSTFEPTKKEAN---VDNI--NTIPGRDVFYFDCR 278 >gnl|CDD|180721 PRK06837, PRK06837, acetylornithine deacetylase; Provisional. Length = 427 Score = 98.2 bits (245), Expect = 3e-21 Identities = 83/343 (24%), Positives = 132/343 (38%), Gaps = 47/343 (13%) Query: 67 LMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMK-GSIACFIA----AVARFI 121 L+ GHIDVVP G + W+ PPF I +G +YGRG DMK G A A A Sbjct: 100 LILQGHIDVVPEGPLDLWSRPPFDPVIVDGWMYGRGAADMKAGLAAMLFALDALRAAGLA 159 Query: 122 PKYKNFGSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGDTIKIG 181 P + +EE NG LS ++ +G + DAC++ EPT G+ + Sbjct: 160 PA----ARVHFQSVIEEESTG-NGA---LSTLQ-RGYRADACLIPEPT-----GEKLVRA 205 Query: 182 RRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNI-----GFDTGNTTFS---- 232 + G + + + G HV N I L+ L + + F Sbjct: 206 QVGVIWFRLRVRGAPVHVREAGTGANAIDAAYHLIQALRELEAEWNARKASDPHFEDVPH 265 Query: 233 PTNMEITTIDVGNPSKNVIPAQVKMSFNIRFNDLWNEKTLKEEIRSRLIKGIQNVPKLSH 292 P N + I G+ + +V PA + I + EI + L ++ LS+ Sbjct: 266 PINFNVGIIKGGDWASSV-PAWCDLDCRIAIYPGVTAADAQAEIEACLAAAARDDRFLSN 324 Query: 293 TVHFSSPVSPVF---------LTHDRKLTSLLSKSIYNTTGNIPLLS-TSGGTSDARFIK 342 P V+ L + + L+++ + PL S + +D RF Sbjct: 325 N-----PPEVVWSGFLAEGYVLEPGSEAEAALARA-HAAVFGGPLRSFVTTAYTDTRFYG 378 Query: 343 DY--CPVIEFGLVGRTMHALNENASLQDLEDLTCIYENFLQNW 383 Y P + +G G +H +E L+ + +T F+ W Sbjct: 379 LYYGIPALCYGPSGEGIHGFDERVDLESVRKVTKTIALFVAEW 421 >gnl|CDD|116301 pfam07687, M20_dimer, Peptidase dimerization domain. This domain consists of 4 beta strands and two alpha helices which make up the dimerization surface of members of the M20 family of peptidases. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases. Length = 107 Score = 97.4 bits (243), Expect = 6e-21 Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 2/101 (1%) Query: 180 IGRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNMEIT 239 IG +G G +T+ GK GH P L N I+ L LL +L D G F T + IT Sbjct: 1 IGHKGLAGGHLTVKGKAGHSGAPGLGVNAIKLLARLLAELPAEYGDIGF-DFPRTTLNIT 59 Query: 240 TIDVGNPSKNVIPAQVKMSFNIRFNDLWNEKTLKEEIRSRL 280 I+ G + NVIPA+ + F+IR + + L +EI + L Sbjct: 60 GIEGGT-ATNVIPAEAEAKFDIRLLPGEDLEELLKEIEAIL 99 >gnl|CDD|132565 TIGR03526, selenium_YgeY, putative selenium metabolism hydrolase. SelD, selenophosphate synthase, is the selenium donor protein for both selenocysteine and selenouridine biosynthesis systems, but it occurs also in a few prokaryotes that have neither of those pathways. The method of partial phylogenetic profiling, starting from such orphan-selD genomes, identifies this protein as one of those most strongly correlated to SelD occurrence. Its distribution is also well correlated with that of family TIGR03309, a putative accessory protein of labile selenium (non-selenocysteine) enzyme maturation. This family includes the uncharacterized YgeY of Escherichia coli, and belongs to a larger family of metalloenzymes in which some are known peptidases, others enzymes of different types. Length = 395 Score = 90.6 bits (225), Expect = 6e-19 Identities = 72/283 (25%), Positives = 119/283 (42%), Gaps = 29/283 (10%) Query: 3 PDCLEHLIQLIKCPSVTPQDGGAFFILVNTLKLLGFSIEEKDFQTKNTSIVKNLYARFGT 62 D + L L+ PS + +G + ++ LGF E D N+ G Sbjct: 13 GDMIRFLRDLVAIPSESGDEGRVALRIKQEMEKLGFDKVEIDPM-------GNVLGYIGH 65 Query: 63 EAPHLM-FAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFI 121 P L+ HID V GD + W + P+ E IYGRG D +G IA + A + I Sbjct: 66 -GPKLIAMDAHIDTVGIGDMDQWQFDPYEGYEDEEIIYGRGASDQEGGIASMVYA-GKII 123 Query: 122 PKYKNFG---SISLLITGDEEGPAINGTKKMLSW---IEKKGEKWDACIVGEPTCNHIIG 175 K+ G +LL+TG + +G L W IE+ K + ++ EPT Sbjct: 124 ---KDLGLLDDYTLLVTGTVQEEDCDG----LCWQYIIEEDKIKPEFVVITEPT-----D 171 Query: 176 DTIKIGRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGNTTF-SPT 234 I G+RG + ++T+ G H + P +N I + P+L +L+ + + F Sbjct: 172 MNIYRGQRGRMEIKVTVKGVSCHGSAPERGDNAIYKMAPILKELSQLNANLVEDPFLGKG 231 Query: 235 NMEITTIDVGNPSKNVIPAQVKMSFNIRFNDLWNEKTLKEEIR 277 + ++ I +PS+ + +S + R + E+IR Sbjct: 232 TLTVSEIFFSSPSRCAVADGCTISIDRRLTWGETWEYALEQIR 274 >gnl|CDD|181333 PRK08262, PRK08262, hypothetical protein; Provisional. Length = 486 Score = 87.3 bits (217), Expect = 6e-18 Identities = 87/401 (21%), Positives = 141/401 (35%), Gaps = 109/401 (27%) Query: 63 EAPHLMFAGHIDVVP--PGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVAR- 119 ++ H DVVP PG WT+PPFS IA+G ++GRG +D KGS+ + A Sbjct: 110 SLKPIVLMAHQDVVPVAPGTEGDWTHPPFSGVIADGYVWGRGALDDKGSLVAILEAAEAL 169 Query: 120 ----FIPKYKNFGSISLLITGDEE--GPAINGTKKMLSWIEKKGEKWDACI--------- 164 F P+ +I L DEE G G + + ++++G + + Sbjct: 170 LAQGFQPR----RTIYLAFGHDEEVGGL---GARAIAELLKERGVRLAFVLDEGGAITEG 222 Query: 165 ----VGEPTCNHIIGDTIKIGRRGSLSGEITIHGKQGHVAYP--------------HLTE 206 V +P +IG + +G + E+T GH + P L + Sbjct: 223 VLPGVKKPVA--LIG----VAEKGYATLELTARATGGHSSMPPRQTAIGRLARALTRLED 276 Query: 207 NP----IRGLI--------------------------PLLHQLTNIGFDTG---NTTFSP 233 NP +RG + PLL ++ +T TT +P Sbjct: 277 NPLPMRLRGPVAEMFDTLAPEMSFAQRVVLANLWLFEPLLLRVLAKSPETAAMLRTTTAP 336 Query: 234 TNMEITTIDVGNPSKNVIPAQVKMSFNIRF--NDLWNEKTLKEEIRSRLIKGIQNVPKLS 291 T ++ G+P NV+P + + N R D +R R + + ++ Sbjct: 337 TMLK------GSPKDNVLPQRATATVNFRILPGDSVESVL--AHVR-RAVADDRVEIEVL 387 Query: 292 HTVHFSSPVSPVFLTHDRKLTSLLSKSIYNTTGN---IPLLSTSGGTSDARFIKDYCP-V 347 SPVS D LL+ +I + P L +D+R V Sbjct: 388 GGNSEPSPVSST----DSAAYKLLAATIREVFPDVVVAPYLVVGA--TDSRHYSGISDNV 441 Query: 348 IEF------GLVGRTMHALNENASLQDLEDLTCIYENFLQN 382 F H NE S+ + + Y ++N Sbjct: 442 YRFSPLRLSPEDLARFHGTNERISVANYARMIRFYYRLIEN 482 >gnl|CDD|180745 PRK06915, PRK06915, acetylornithine deacetylase; Validated. Length = 422 Score = 87.1 bits (216), Expect = 8e-18 Identities = 65/207 (31%), Positives = 97/207 (46%), Gaps = 28/207 (13%) Query: 9 LIQLIKCPSVTPQDGGAFFILVNTLKLLGFSIE--EKD---------FQTKNTSIVK--N 55 L +LI+ SV+ + GA I++ L+ LG ++ E F + TS N Sbjct: 23 LKRLIQEKSVSGDESGAQAIVIEKLRELGLDLDIWEPSFKKLKDHPYFVSPRTSFSDSPN 82 Query: 56 LYARF-GT-EAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMK-GSIAC 112 + A G+ ++ GHIDVVP GD N W + P+S + G+IYGRG DMK G++A Sbjct: 83 IVATLKGSGGGKSMILNGHIDVVPEGDVNQWDHHPYSGEVIGGRIYGRGTTDMKGGNVAL 142 Query: 113 FIAAVARFIPKYKNFGSISLLITGDEE-GPAINGTKKMLSWIEKKGEKWDACIVGEPTCN 171 +A A + G + +EE G A GT L+ I +G K D I+ EPT Sbjct: 143 LLAMEALIESGIELKGDVIFQSVIEEESGGA--GT---LAAIL-RGYKADGAIIPEPTNM 196 Query: 172 HIIGDTIKIGRRGSLSGEITIHGKQGH 198 ++GS+ + + GK H Sbjct: 197 KFF-----PKQQGSMWFRLHVKGKAAH 218 >gnl|CDD|180418 PRK06133, PRK06133, glutamate carboxypeptidase; Reviewed. Length = 410 Score = 86.6 bits (215), Expect = 1e-17 Identities = 92/360 (25%), Positives = 149/360 (41%), Gaps = 56/360 (15%) Query: 28 ILVNTLKLLGFSIEEKDFQTKNTSIVKNLYARF-GTEAPHLMFAGHIDVV-PPGDFNHWT 85 +L LK LG +E S + A F GT +M H+D V PG Sbjct: 65 LLAERLKALGAKVERAP---TPPSAGDMVVATFKGTGKRRIMLIAHMDTVYLPGMLAK-- 119 Query: 86 YPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFIPK---YKNFGSISLLITGDEEGPA 142 PF I + YG GI D KG +A + A+ I + +K++G++++L DEE + Sbjct: 120 -QPFR--IDGDRAYGPGIADDKGGVAVILHALK--ILQQLGFKDYGTLTVLFNPDEETGS 174 Query: 143 INGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGDTIKIGRRGSLSGEITIHGKQGHV-AY 201 G++++ I + + D EP D + + G + + + GK H A Sbjct: 175 P-GSREL---IAELAAQHDVVFSCEPG---RAKDALTLATSGIATALLEVKGKASHAGAA 227 Query: 202 PHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNMEITTIDVGNPSKNVIPAQVKMSFNI 261 P L N L L HQL + D G+ T + T G +NVIPA ++ Sbjct: 228 PELGRN---ALYELAHQLLQLR-DLGDPA-KGTTLNWTVAKAGTN-RNVIPASASAQADV 281 Query: 262 RFNDLWN----EKTLKEEIRSRLIKGIQNVPKLSHTVHF--------SSPVSPVFLTHDR 309 R+ D E L+E+++++L+ + T+ F ++ S H + Sbjct: 282 RYLDPAEFDRLEADLQEKVKNKLVPDTEV------TLRFERGRPPLEANAASRALAEHAQ 335 Query: 310 KLTSLLSKSIYNTTGNIPLLSTSGGTSDARFI--KDYCPVIE-FGLVGRTMHALNENASL 366 + L + + P+ +GG +DA F V+E FGLVG H+ +E L Sbjct: 336 GIYGELGRRL------EPIDMGTGGGTDAAFAAGSGKAAVLEGFGLVGFGAHSNDEYIEL 389 >gnl|CDD|183837 PRK13007, PRK13007, succinyl-diaminopimelate desuccinylase; Reviewed. Length = 352 Score = 86.1 bits (214), Expect = 1e-17 Identities = 56/212 (26%), Positives = 86/212 (40%), Gaps = 46/212 (21%) Query: 68 MFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFI-PKYKN 126 + AGH+D VP D P ++YG G DMK +A + A P + Sbjct: 65 VLAGHLDTVPVAD----NLPS---RREGDRLYGCGASDMKSGLAVMLHLAATLAEPAH-- 115 Query: 127 FGSISLLITGDEEGPAI-NGTKKMLSWIEKKGEKW---DACIVGEPTCNHIIGDTIKIGR 182 ++L+ EE A NG L + ++ +W D I+ EPT I+ G Sbjct: 116 --DLTLVFYDCEEVEAEANG----LGRLAREHPEWLAGDFAILLEPT-----DGVIEAGC 164 Query: 183 RGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNMEI---- 238 +G+L +T HG++ H A L EN I P+L +L + P + + Sbjct: 165 QGTLRVTVTFHGRRAHSARSWLGENAIHKAAPVLARLAA---------YEPREVVVDGLT 215 Query: 239 -------TTIDVGNPSKNVIPAQVKMSFNIRF 263 I G + NVIP + ++ N RF Sbjct: 216 YREGLNAVRIS-GGVAGNVIPDECVVNVNYRF 246 >gnl|CDD|132363 TIGR03320, ygeY, M20/DapE family protein YgeY. Members of this protein family, including the YgeY protein of Escherichia coli, typically are found in extended genomic regions associated with purine catabolism. Homologs include peptidases and deacylases of the M20/M25 /M40 and DapE/ArgE families. The function is unknown. Length = 395 Score = 86.0 bits (213), Expect = 1e-17 Identities = 72/283 (25%), Positives = 118/283 (41%), Gaps = 29/283 (10%) Query: 3 PDCLEHLIQLIKCPSVTPQDGGAFFILVNTLKLLGFSIEEKDFQTKNTSIVKNLYARFGT 62 D + L L+ PS + + + ++ LGF E D N+ G Sbjct: 13 GDMIRFLRDLVAIPSESGDEKRVAERIKEEMEKLGFDKVEIDPM-------GNVLGYIGH 65 Query: 63 EAPHLM-FAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFI 121 P L+ HID V GD W + P+ E IYGRG D +G IA + A + I Sbjct: 66 -GPKLIAMDAHIDTVGIGDSKQWQFDPYEGYEDEEIIYGRGASDQEGGIASMVYA-GKII 123 Query: 122 PKYKNFG---SISLLITGDEEGPAINGTKKMLSW---IEKKGEKWDACIVGEPTCNHIIG 175 K+ G +LL+TG + +G L W IE+ G K + ++ EPT Sbjct: 124 ---KDLGLLDDYTLLVTGTVQEEDCDG----LCWQYIIEEDGIKPEFVVITEPT-----D 171 Query: 176 DTIKIGRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGNTTF-SPT 234 I G+RG + ++T+ G H + P +N I + P+L +L+ + + F Sbjct: 172 MNIYRGQRGRMEIKVTVKGVSCHGSAPERGDNAIYKMAPILKELSQLNANLVEDPFLGKG 231 Query: 235 NMEITTIDVGNPSKNVIPAQVKMSFNIRFNDLWNEKTLKEEIR 277 + ++ I +PS+ + +S + R + E+IR Sbjct: 232 TLTVSEIFFSSPSRCAVADGCTISIDRRLTWGETWEYALEQIR 274 >gnl|CDD|179850 PRK04443, PRK04443, acetyl-lysine deacetylase; Provisional. Length = 348 Score = 85.0 bits (211), Expect = 3e-17 Identities = 79/350 (22%), Positives = 123/350 (35%), Gaps = 67/350 (19%) Query: 1 MTPDCLEHLIQLIKCPSVTPQDGGAFFILVNTLKLLGFS--IEEKDFQTKNTSIVKNLYA 58 + E L L++ PS + ++ A LV ++ G ++E N Sbjct: 4 SALEARELLKGLVEIPSPSGEEAAAAEFLVEFMESHGREAWVDE----------AGNARG 53 Query: 59 RFGTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVA 118 G P ++ GHID VP GD P + +G ++GRG VD KG +A F AA A Sbjct: 54 PAGDGPPLVLLLGHIDTVP-GDI-----PVR---VEDGVLWGRGSVDAKGPLAAFAAAAA 104 Query: 119 RFIPKYKNFGSISLLITG--DEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIG- 175 R + G +EE P + + + E+ DA I+GEP+ G Sbjct: 105 RLEAL----VRARVSFVGAVEEEAP----SSGGARLVADR-ERPDAVIIGEPS-----GW 150 Query: 176 DTIKIGRRGSLSGEITIHGKQGHVAYP------HLTE--NPIRGLIPLLHQLTNIGFDTG 227 D I +G +G L + H A P E + FD Sbjct: 151 DGITLGYKGRLLVTYVATSESFHSAGPEPNAAEDAIEWWLAVEAWFEANDGRER-VFDQV 209 Query: 228 NTTFSPTNMEITTIDVGNPSKNVIPAQVKMSFNIRFNDLWNEKTLKEEIRSRLIKGIQNV 287 + D + V + +M+ +R + + +E + + L G Sbjct: 210 TPK-------LVDFDSSSDGLTV---EAEMTVGLRLPPGLSPEEAREILDALLPTG---- 255 Query: 288 PKLSHTVHFSSPVSPVFLTHDRKLTSLLSKSIYNTTGNIPLLSTSGGTSD 337 TV F+ V ++ L +I G P L GTSD Sbjct: 256 -----TVTFTGAVPAYMVSKRTPLARAFRVAIREAGGT-PRLKRKTGTSD 299 >gnl|CDD|179939 PRK05111, PRK05111, acetylornithine deacetylase; Provisional. Length = 383 Score = 84.9 bits (211), Expect = 3e-17 Identities = 71/225 (31%), Positives = 100/225 (44%), Gaps = 28/225 (12%) Query: 3 PDCLEHLIQLIKCPSVTPQD-----GGAFFI--LVNTLKLLGFSIEEKDFQTKNTSIVKN 55 P +E LI PS++ D I L + LGF++E + T N Sbjct: 5 PSFIEMYRALIATPSISATDPALDQSNRAVIDLLAGWFEDLGFNVEIQ--PVPGTRGKFN 62 Query: 56 LYARFGTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIA 115 L A G+ L+ AGH D VP D WT PF+ T +GK+YG G DMKG A FI Sbjct: 63 LLASLGSGEGGLLLAGHTDTVP-FDEGRWTRDPFTLTEHDGKLYGLGTADMKGFFA-FIL 120 Query: 116 AVARFIPKYKNFGSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIG 175 R I K + +L T DEE ++ G + ++ E + D I+GEPT + Sbjct: 121 EALRDIDLTKLKKPLYILATADEE-TSMAGAR---AFAEATAIRPDCAIIGEPTSLKPV- 175 Query: 176 DTIKIGRRGSLSGEITIHGKQGHVAYPHLTENPIRGL--IPLLHQ 218 +G +S I I G+ GH + +P G+ I L+H Sbjct: 176 ----RAHKGHMSEAIRITGQSGH------SSDPALGVNAIELMHD 210 >gnl|CDD|181544 PRK08737, PRK08737, acetylornithine deacetylase; Provisional. Length = 364 Score = 82.2 bits (203), Expect = 2e-16 Identities = 56/209 (26%), Positives = 86/209 (41%), Gaps = 30/209 (14%) Query: 6 LEHLIQLIKCPSVTP----QDGGAFFILVNTLKLLGFSIEEKDFQTKNTSIVKNLYARFG 61 L+HL L+ + P GG F L +L GF +E D S LYA G Sbjct: 9 LDHLQALVSFDTRNPPRAITTGGIFDYL--RAQLPGFQVEVIDHGAGAVS----LYAVRG 62 Query: 62 TEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFI 121 T P +F H+D VP D HW+ P + ++ G G+ D+KG+ A +AA Sbjct: 63 T--PKYLFNVHLDTVP--DSPHWSADPHVMRRTDDRVIGLGVCDIKGAAAALLAAAN--- 115 Query: 122 PKYKNFGSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGDTIKIG 181 G + L + DEE L +G ++A +V EPT + + + Sbjct: 116 ---AGDGDAAFLFSSDEEANDPRCVAAFL----ARGIPYEAVLVAEPTMSEAV-----LA 163 Query: 182 RRGSLSGEITIHGKQGHVAYPH-LTENPI 209 RG S + G+ GH + + + + Sbjct: 164 HRGISSVLMRFAGRAGHASGKQDPSASAL 192 >gnl|CDD|166979 PRK00466, PRK00466, acetyl-lysine deacetylase; Validated. Length = 346 Score = 81.4 bits (201), Expect = 3e-16 Identities = 76/323 (23%), Positives = 127/323 (39%), Gaps = 49/323 (15%) Query: 65 PHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFIPKY 124 ++ A H+D VP G I IYGRG VD KG + I +A ++ Sbjct: 61 GDILLASHVDTVP-GYIE--------PKIEGEVIYGRGAVDAKGPLISMI--IAAWLLNE 109 Query: 125 KNFGSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGDTIKIGRRG 184 K + + DEE +I G K+++S KG + IVGEP+ G I + RG Sbjct: 110 KGI-KVMVSGLADEESTSI-GAKELVS----KGFNFKHIIVGEPSN----GTDIVVEYRG 159 Query: 185 SLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNMEITTIDVG 244 S+ +I G H + N I + + ++ + + PT + Sbjct: 160 SIQLDIMCEGTPEHSSSA--KSNLIVDISKKIIEVYKQPENYDKPSIVPTIIRAGE---- 213 Query: 245 NPSKNVIPAQVKMSFNIRFNDLWNEKTLKEEIRSRL----IKGIQNVPKLSHTVHFSSPV 300 S NV PA++ + F++R+ L EI+ + +K + P + V ++PV Sbjct: 214 --SYNVTPAKLYLHFDVRYAINNKRDDLISEIKDKFQECGLKIVDETPPVK--VSINNPV 269 Query: 301 SPVFLTHDRKLTSLLSKSIYNTTGNIPLLSTSGGTSDARFIKDYCPVIEFGLVGRTM--H 358 + +LL ++I P L GTSD ++ I G +M H Sbjct: 270 VKAL------MRALLKQNIK------PRLVRKAGTSDMNILQKITTSIATYGPGNSMLEH 317 Query: 359 ALNENASLQDLEDLTCIYENFLQ 381 E +L ++ Y ++ Sbjct: 318 TNQEKITLDEIYIAVKTYMLAIE 340 >gnl|CDD|162581 TIGR01887, dipeptidaselike, dipeptidase, putative. This model represents a clade of probable zinc dipeptidases, closely related to the characterized non-specific dipeptidase, PepV. Many enzymes in this clade have been given names including the terms "Xaa-His" and "carnosinase" due to the early mis-characterization of the Lactobacillus delbrueckii PepV enzyme. These names are likely too specific. Length = 447 Score = 79.7 bits (197), Expect = 1e-15 Identities = 37/81 (45%), Positives = 48/81 (59%), Gaps = 9/81 (11%) Query: 39 SIEEKD-FQTKNTSIVKNL--YARFGTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAE 95 + ++D F T+N V N YA +G +L GH+DVVP GD WT PPF A I + Sbjct: 42 ELAKRDGFTTEN---VDNYAGYAEYGQGEEYLGILGHLDVVPAGD--GWTSPPFEAEIKD 96 Query: 96 GKIYGRGIVDMKG-SIACFIA 115 G+IYGRG +D KG +IA A Sbjct: 97 GRIYGRGTLDDKGPTIAALYA 117 Score = 38.5 bits (90), Expect = 0.003 Identities = 46/210 (21%), Positives = 69/210 (32%), Gaps = 34/210 (16%) Query: 189 EITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTN-----------ME 237 IT+ GK H + P N L L QL G F ++ Sbjct: 246 TITLEGKSAHGSAPEKGINAATYLALFLAQLNLAGGAKAFLQFLAEYLHEDHYGEKLGID 305 Query: 238 ITTIDVGNPSKNV------IPAQVKMSFNIRFNDLWNEKTLKEEIRSRLIKGIQNVPKLS 291 G+ + NV + N+R+ N+ +K Sbjct: 306 FHDDVSGDLTMNVGVIDYENAEAGLIGLNVRYPVG-NDPDTM-------LKNELAKESGI 357 Query: 292 HTVHFSSPVSPVFLTHDRKLTSLLSKSIYNTTG-NIPLLSTSGGTSDARFIKDYCPVIEF 350 V + + P+++ D L L K TG ++ GGT AR +++ + F Sbjct: 358 VEVTENGYLKPLYVPKDDPLVQTLMKVYEKQTGDEGTPVAIGGGTY-ARLMEN---GVAF 413 Query: 351 G--LVGR--TMHALNENASLQDLEDLTCIY 376 G G TMH NE + DL T IY Sbjct: 414 GALFPGEEDTMHQANEYIMIDDLLLATAIY 443 >gnl|CDD|181523 PRK08652, PRK08652, acetylornithine deacetylase; Provisional. Length = 347 Score = 79.8 bits (197), Expect = 1e-15 Identities = 64/260 (24%), Positives = 100/260 (38%), Gaps = 34/260 (13%) Query: 5 CLEHLIQLIKCPSVTPQDGGAFFILVNTLKLLGFSI-EEKDFQTKNTSIVKNLYARFGTE 63 E L QL+K PS + Q+ ++ L+ LG+ + E D + N IV N Sbjct: 4 AKELLKQLVKIPSPSGQEDEIALHIMEFLESLGYDVHIESDGEVIN--IVVN-------S 54 Query: 64 APHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFIPK 123 L H D VP F + +YG G D KG +A + A+ + Sbjct: 55 KAELFVEVHYDTVPV------RAEFF---VDGVYVYGTGACDAKGGVAAILLALEELGKE 105 Query: 124 YKNFGSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGDTIKIGRR 183 +++ + + DEE G + + E+ K IV EPT + I Sbjct: 106 FEDLN-VGIAFVSDEE----EGGRGSALFAERYRPKM--AIVLEPT-----DLKVAIAHY 153 Query: 184 GSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNMEITTIDV 243 G+L + + GK H A P N I +L +L + F P ++ I I Sbjct: 154 GNLEAYVEVKGKPSHGACPESGVNAIEKAFEMLEKLKEL-LKALGKYFDP-HIGIQEIIG 211 Query: 244 GNPSKNVIPAQVKMSFNIRF 263 G+P IPA ++ + R Sbjct: 212 GSPEY-SIPALCRLRLDARI 230 >gnl|CDD|181495 PRK08596, PRK08596, acetylornithine deacetylase; Validated. Length = 421 Score = 79.3 bits (196), Expect = 2e-15 Identities = 48/173 (27%), Positives = 75/173 (43%), Gaps = 24/173 (13%) Query: 33 LKLLGFSIEEKDFQTKNTSIVKNLYARFGTEAPHLMFAGHIDVVPPGDFNHWTYPPFSAT 92 L+ LGFS+++ D + ++V L+ GH+DV W PF T Sbjct: 46 LRKLGFSVDKWDVYPNDPNVVGVKKGTESDAYKSLIINGHMDVAEVSADEAWETNPFEPT 105 Query: 93 IAEGKIYGRGIVDMKGSIACFIAAVARFIPKYKNFGSISL---LI----TGDEEGPAING 145 I +G +YGRG DMKG +A + A+ + I L LI G+E G A G Sbjct: 106 IKDGWLYGRGAADMKGGLAGALFAIQLL-----HEAGIELPGDLIFQSVIGEEVGEA--G 158 Query: 146 TKKMLSWIEKKGEKWDACIVGEPTCNHIIGDTIKIGRRGSLSGEITIHGKQGH 198 T + ++G D +V + + H+ G+ G ++G IT+ Q Sbjct: 159 TLQCC----ERGYDADFAVVVDTSDLHM------QGQGGVITGWITVKSPQTF 201 >gnl|CDD|180882 PRK07205, PRK07205, hypothetical protein; Provisional. Length = 444 Score = 76.3 bits (188), Expect = 1e-14 Identities = 114/456 (25%), Positives = 165/456 (36%), Gaps = 116/456 (25%) Query: 4 DCLEHLIQLIKCPSVTP--QDGGAF-----FILVNTLKL---LGFS--IEEKDFQTKNTS 51 C+ + L+ PSV ++G F +L TL L LGF ++ K + Sbjct: 12 ACVAAIKTLVSYPSVLNEGENGTPFGQAIQDVLEATLDLCQGLGFKTYLDPKGYYG---- 67 Query: 52 IVKNLYARFGTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKG-SI 110 YA G L H+DVVP GD + W PPF A +G ++GRG D KG S+ Sbjct: 68 -----YAEIGQGEELLAILCHLDVVPEGDLSDWQTPPFEAVEKDGCLFGRGTQDDKGPSM 122 Query: 111 ACFIAAVA------------RFIPKYKNFGS------------------ISLLITGDEEG 140 A A A RFI FG+ ++ D Sbjct: 123 AALYAVKALLDAGVQFNKRIRFI-----FGTDEETLWRCMNRYNEVEEQATMGFAPDSSF 177 Query: 141 PAINGTKKML-SWIEKKGEKWDACIVGE-------------PTCNHIIGDTIKIGRRGSL 186 P K +L + + G VG+ P + + K+G + Sbjct: 178 PLTYAEKGLLQAKLVGPGSDQLELEVGQAFNVVPAKASYQGPKLEAVKKELDKLGFEYVV 237 Query: 187 SG-EITIHGKQGHVA-YPHLTENPIRGLIPLLHQLTN---------IGFD-TGNTTFSPT 234 E+T+ GK H P IR L+ + IG D TG F Sbjct: 238 KENEVTVLGKSVHAKDAPQGINAVIRLAKALVVLEPHPALDFLANVIGEDATGLNIFGDI 297 Query: 235 NMEITTIDVGNPSKNVIPAQVKMSFNIRFNDLWNEKT-LKEEIR-SRLIKGIQNVPKLS- 291 E PS K+SFNI + EK+ ++ +IR L + V +LS Sbjct: 298 EDE--------PSG-------KLSFNIAGLTITKEKSEIRIDIRIPVLADKEKLVQQLSQ 342 Query: 292 -------HTVHFSSPVSPVFLTHDRKLTSLLSKSIYNTTGNIPLLSTSGGTSDARFIKDY 344 F ++P+++ D +L S L TG+ +SGG + AR + Sbjct: 343 KAQEYGLTYEEFDY-LAPLYVPLDSELVSTLMSVYQEKTGDDSPAQSSGGATFARTM-PN 400 Query: 345 CPVIEFG--LVGR--TMHALNENASLQDLEDLTCIY 376 C + FG G T H NE+ L+DL IY Sbjct: 401 C--VAFGALFPGAPQTEHQANEHIVLEDLYRAMDIY 434 >gnl|CDD|180564 PRK06446, PRK06446, hypothetical protein; Provisional. Length = 436 Score = 74.4 bits (183), Expect = 5e-14 Identities = 67/233 (28%), Positives = 100/233 (42%), Gaps = 35/233 (15%) Query: 4 DCLEHLIQLIKCPSVTPQ----DGGAFFILVNTLKLLGFSIEEKDFQTKNTSIVKNLYAR 59 + L LI+ +K PS++ + A + L +T++ LG I+ +TK +V Y Sbjct: 3 EELYTLIEFLKKPSISATGEGIEETANY-LKDTMEKLG--IKANIERTKGHPVV---YGE 56 Query: 60 FGTEAPH-LMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVA 118 A L+ H DV P + W PFSATI G+IY RG D KG++ + A+ Sbjct: 57 INVGAKKTLLIYNHYDVQPVDPLSEWKRDPFSATIENGRIYARGASDNKGTLMARLFAIK 116 Query: 119 RFIPKYKNFGSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIV----------GEP 168 I K+K ++ L G+EE + N + +IEK K A V G P Sbjct: 117 HLIDKHKLNVNVKFLYEGEEEIGSPN----LEDFIEKNKNKLKADSVIMEGAGLDPKGRP 172 Query: 169 TCNHIIGDTIKIGRRGSLSGEIT--IHGKQGHVAYPHLTENPIRGLIPLLHQL 219 I +G +G L E+ K H + + NP L+ LL L Sbjct: 173 --------QIVLGVKGLLYVELVLRTGTKDLHSSNAPIVRNPAWDLVKLLSTL 217 >gnl|CDD|180927 PRK07318, PRK07318, dipeptidase PepV; Reviewed. Length = 466 Score = 74.5 bits (184), Expect = 5e-14 Identities = 33/81 (40%), Positives = 43/81 (53%), Gaps = 9/81 (11%) Query: 39 SIEEKD-FQTKNTSIVKNL--YARFGTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAE 95 I E+D F+TKN V N + +G L GH+DVVP GD W P+ I + Sbjct: 54 EIAERDGFKTKN---VDNYAGHIEYGEGEEVLGILGHLDVVPAGD--GWDTDPYEPVIKD 108 Query: 96 GKIYGRGIVDMKG-SIACFIA 115 GKIY RG D KG ++A + A Sbjct: 109 GKIYARGTSDDKGPTMAAYYA 129 >gnl|CDD|181666 PRK09133, PRK09133, hypothetical protein; Provisional. Length = 472 Score = 74.3 bits (183), Expect = 6e-14 Identities = 40/117 (34%), Positives = 52/117 (44%), Gaps = 19/117 (16%) Query: 55 NLYARF---GTEAPHLMFAGHIDVV--PPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGS 109 NL AR + P ++ H+DVV D WT PF G YGRG D K Sbjct: 90 NLVARLRGTDPKKP-ILLLAHMDVVEAKRED---WTRDPFKLVEENGYFYGRGTSDDKAD 145 Query: 110 IACFIAAVARFIPKYKNFG---SISLLITGDEEGPAINGTKKMLSW-IEKKGEKWDA 162 A ++A + R K + F I L +TGDEEG +NG +W E + DA Sbjct: 146 AAIWVATLIRL--KREGFKPKRDIILALTGDEEGTPMNGV----AWLAENHRDLIDA 196 >gnl|CDD|169276 PRK08201, PRK08201, hypothetical protein; Provisional. Length = 456 Score = 68.6 bits (168), Expect = 2e-12 Identities = 66/237 (27%), Positives = 101/237 (42%), Gaps = 36/237 (15%) Query: 6 LEHLIQLIKCPSVTP-----QD-GGAFFILVNTLKLLGFSIEEKDFQTKNTSIVKNLYAR 59 LE L + ++ PS++ +D A L L+ G E +T IV YA Sbjct: 17 LEELKEFLRIPSISALSEHKEDVRKAAEWLAGALEKAGLEHVEI-METAGHPIV---YAD 72 Query: 60 F--GTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAV 117 + P ++ GH DV P N W PPF TI +GK+Y RG D KG + + AV Sbjct: 73 WLHAPGKPTVLIYGHYDVQPVDPLNLWETPPFEPTIRDGKLYARGASDDKGQVFMHLKAV 132 Query: 118 ARFIPKYKNFG-SISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGD 176 + ++ I G+EE G+ + S++E++ +K A +V +I D Sbjct: 133 EALLKVEGTLPVNVKFCIEGEEE----IGSPNLDSFVEEEKDKLAADVV-------LISD 181 Query: 177 T---------IKIGRRGSLSGEITIHGKQGHV---AYPHLTENPIRGLIPLLHQLTN 221 T I G RG + EI + G +G + Y N + L+ LL L + Sbjct: 182 TTLLGPGKPAICYGLRGLAALEIDVRGAKGDLHSGLYGGAVPNALHALVQLLASLHD 238 >gnl|CDD|169481 PRK08554, PRK08554, peptidase; Reviewed. Length = 438 Score = 67.9 bits (166), Expect = 4e-12 Identities = 36/101 (35%), Positives = 46/101 (45%), Gaps = 7/101 (6%) Query: 56 LYARFGTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIA 115 +Y G P L+F H DVVP W PF T+ K YGRG D KG++A + Sbjct: 55 VYGEIGEGKPKLLFMAHFDVVPVNP-EEWNTEPFKLTVKGDKAYGRGSADDKGNVASVML 113 Query: 116 AVARFIPKYKNFGSISLLITGDEEGPAINGTKKMLSWIEKK 156 A+ + N G + TGDEE I G M I +K Sbjct: 114 ALKELSKEPLN-GKVIFAFTGDEE---IGG--AMAMHIAEK 148 >gnl|CDD|181163 PRK07906, PRK07906, hypothetical protein; Provisional. Length = 426 Score = 67.2 bits (165), Expect = 7e-12 Identities = 30/70 (42%), Positives = 38/70 (54%), Gaps = 9/70 (12%) Query: 55 NLYARF-GT--EAPHLMFAGHIDVVP--PGDFNHWTYPPFSATIAEGKIYGRGIVDMKGS 109 N+ AR G P L+ GH+DVVP D W+ PFS I +G ++GRG VDMK Sbjct: 53 NVVARLPGADPSRPALLVHGHLDVVPAEAAD---WSVHPFSGEIRDGYVWGRGAVDMKDM 109 Query: 110 IACFIAAVAR 119 A + AV R Sbjct: 110 DA-MMLAVVR 118 >gnl|CDD|130955 TIGR01900, dapE-gram_pos, succinyl-diaminopimelate desuccinylase. This enzyme is involved in the biosynthesis of lysine, and is related to the enzyme acetylornithine deacetylase and other amidases and peptidases found within pfam01546. Length = 373 Score = 66.6 bits (162), Expect = 9e-12 Identities = 71/285 (24%), Positives = 112/285 (39%), Gaps = 56/285 (19%) Query: 9 LIQLIKCPSVTPQDG---GAFFILVNTLKLLGFSIEEKDFQTKNTSIVKNLYARFGTEAP 65 L Q++ S + +G +N L+L G + F+ + + + FG +A Sbjct: 2 LQQIMDIFSPSDHEGPIADEIEAALNNLELEGLEV----FRFGDNVLART---DFG-KAS 53 Query: 66 HLMFAGHIDVVPPGDF--NHWTYPPFS--------ATIAEGKIYGRGIVDMKGSIACFI- 114 ++ AGHID VP D W P S A +G ++G G DMK A + Sbjct: 54 RVILAGHIDTVPIADNFPPKWLEPGDSLIREEIAHAHPEDGILWGCGATDMKAGDAVMLH 113 Query: 115 --AAVARFIPKYKNFGSISLLITGDEEGPA-INGTKKMLSWIEKKGEKW---DACIVGEP 168 A + P+ + ++L+ EE A NG I W D I+GEP Sbjct: 114 LAATLDGRAPETELKHDLTLIAYDCEEVAAEKNGLGH----IRDAHPDWLAADFAIIGEP 169 Query: 169 TCNHIIGDTIKIGRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGN 228 T G I+ G G++ ++T HG H A L +N I ++++L Sbjct: 170 T-----GGGIEAGCNGNIRFDVTAHGVAAHSARAWLGDNAIHKAADIINKL--------- 215 Query: 229 TTFSPTNMEITTIDV----------GNPSKNVIPAQVKMSFNIRF 263 + + I +D G + NVIP + +M N RF Sbjct: 216 AAYEAAEVNIDGLDYREGLNATFCEGGKANNVIPDEARMHLNFRF 260 >gnl|CDD|180938 PRK07338, PRK07338, hypothetical protein; Provisional. Length = 402 Score = 65.8 bits (161), Expect = 2e-11 Identities = 80/323 (24%), Positives = 124/323 (38%), Gaps = 41/323 (12%) Query: 63 EAP-HLMFAGHIDVVPPGDFNHWTYPPFSA--TIAEGKIYGRGIVDMKGSIACFIAAVAR 119 EAP ++ GH+D V P D H PF + +G + G G+ DMKG I +AA+ Sbjct: 90 EAPRQVLLTGHMDTVFPAD--H----PFQTLSWLDDGTLNGPGVADMKGGIVVMLAALLA 143 Query: 120 F--IPKYKNFGSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGDT 177 F P G +LI DEE G+ + + A + EP + T Sbjct: 144 FERSPLADKLG-YDVLINPDEE----IGSPASAPLLAELARGKHAALTYEPA---LPDGT 195 Query: 178 IKIGRRGSLSGEITIHGKQGHVAY-PHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNM 236 + R+GS + I + G+ H N I L L + T + Sbjct: 196 LAGARKGSGNFTIVVTGRAAHAGRAFDEGRNAIVAAAELALALHALNGQRDGVTVNVAK- 254 Query: 237 EITTIDVGNPSKNVIPAQVKMSFNIRFNDLWNEKTLKEEIRSRLIKGIQNVPKLSHTVH- 295 ID G P NV+P + FNIR + + E++ +LI + +S +H Sbjct: 255 ----IDGGGPL-NVVPDNAVLRFNIRPPTPEDAAWAEAELK-KLIAQVNQRHGVSLHLHG 308 Query: 296 -FSSPVSPVFLTHDRKLTSLLSKSIYNTTG---NIPL-LSTSGGTSDARFIKDY-CPVIE 349 F P P+ R L + G + + SGG D + PV++ Sbjct: 309 GFGRPPKPIDAAQQR-LFEAVQA-----CGAALGLTIDWKDSGGVCDGNNLAAAGLPVVD 362 Query: 350 -FGLVGRTMHALNENASLQDLED 371 G+ G +H+ +E L L + Sbjct: 363 TLGVRGGNIHSEDEFVILDSLVE 385 >gnl|CDD|181650 PRK09104, PRK09104, hypothetical protein; Validated. Length = 464 Score = 63.8 bits (156), Expect = 8e-11 Identities = 57/220 (25%), Positives = 89/220 (40%), Gaps = 48/220 (21%) Query: 3 PDCLEHLIQLIKCPSVTPQDGGAFF--------ILVNTLKLLGFSIEEKDFQTKNTSIVK 54 LE L L++ PS++ A+ LV L LGF +D T +V Sbjct: 17 DASLERLFALLRIPSISTDP--AYAADCRKAADWLVADLASLGFEASVRD--TPGHPMVV 72 Query: 55 NLYARFGTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAE----GK-IYGRGIVDMKGS 109 + +APH++F GH DV P + W PPF I E K I RG D KG Sbjct: 73 AHHEGPTGDAPHVLFYGHYDVQPVDPLDLWESPPFEPRIKETPDGRKVIVARGASDDKGQ 132 Query: 110 IACFIAAVARF------IPKYKNFGSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDAC 163 + F+ A + +P +++L G+EE +G+ ++ ++E E+ A Sbjct: 133 LMTFVEACRAWKAVTGSLP-----VRVTILFEGEEE----SGSPSLVPFLEANAEELKAD 183 Query: 164 IVGEPTCNHIIGDT---------IKIGRRGSLSGEITIHG 194 + ++ DT I RG + E+TI Sbjct: 184 VA-------LVCDTGMWDRETPAITTSLRGLVGEEVTITA 216 >gnl|CDD|130957 TIGR01902, dapE-lys-deAc, N-acetyl-ornithine/N-acetyl-lysine deacetylase. This clade of mainly archaeal and related bacterial species contains two characterized enzymes, an deacetylase with specificity for both N-acetyl-ornithine and N-acetyl-lysine from Thermus which is found within a lysine biosynthesis operon, and a fusion protein with acetyl-glutamate kinase (an enzyme of ornithine biosynthesis) from Lactobacillus. It is possible that all of the sequences within this clade have dual specificity, or that a mix of specificities have evolved within this clade. Length = 336 Score = 62.6 bits (152), Expect = 2e-10 Identities = 70/285 (24%), Positives = 113/285 (39%), Gaps = 55/285 (19%) Query: 61 GTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARF 120 G ++ AGH+D VP + P I G +YGRG VD KG + I A Sbjct: 47 GDGHKKILLAGHVDTVP----GYI--PV---KIEGGLLYGRGAVDAKGPLIAMIFATWLL 97 Query: 121 IPKYKNFGSISLLITG--DEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGDTI 178 K I ++++G DEE + G ++++ IVGEP+ + I Sbjct: 98 NEK-----GIKVIVSGLVDEESSSK-GAREVIDKNYPF-----YVIVGEPSG----AEGI 142 Query: 179 KIGRRGSLSGEITIHGKQGHVAYPHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNMEI 238 +G +GSL +I G H + N LI ++ + N + ++ Sbjct: 143 TLGYKGSLQLKIMCEGTPFHSSS---AGNAAELLIDYSKKIIEVYKQPEN--YDKPSIVP 197 Query: 239 TTIDVGNPSKNVIPAQVKMSFNIRF------NDLWNEKTLKEEIRSRLIKGIQNVPKLSH 292 T I G S N PA++++ F++R+ + E T K I ++ + P + Sbjct: 198 TIIRFGE-SYNDTPAKLELHFDLRYPPNNKPEEAIKEITDKFPIC---LEIVDETP--PY 251 Query: 293 TVHFSSPVSPVFLTHDRKLTSLLSKSIYNTTGNIPLLSTSGGTSD 337 V ++P+ F+ RK G P L GTSD Sbjct: 252 KVSRNNPLVRAFVRAIRKQ------------GMKPRLKKKTGTSD 284 >gnl|CDD|181164 PRK07907, PRK07907, hypothetical protein; Provisional. Length = 449 Score = 62.2 bits (152), Expect = 2e-10 Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 2/76 (2%) Query: 64 APHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFIPK 123 AP ++ H DV PPGD + W PPF T +G++YGRG D KG IA +AA+ Sbjct: 83 APTVLLYAHHDVQPPGDPDAWDSPPFELTERDGRLYGRGAADDKGGIAMHLAALRALGGD 142 Query: 124 YKNFGSISLLITGDEE 139 +++ + G+EE Sbjct: 143 LPV--GVTVFVEGEEE 156 >gnl|CDD|162584 TIGR01891, amidohydrolases, amidohydrolase. This model represents a subfamily of amidohydrolases which are a subset of those sequences detected by pfam01546. Included within this group are hydrolases of hippurate (N-benzylglycine), indoleacetic acid (IAA) N-conjugates of amino acids, N-acetyl-L-amino acids and aminobenzoylglutamate. These hydrolases are of the carboxypeptidase-type, most likely utilizing a zinc ion in the active site. Length = 363 Score = 60.4 bits (147), Expect = 7e-10 Identities = 59/268 (22%), Positives = 100/268 (37%), Gaps = 37/268 (13%) Query: 28 ILVNTLKLLGFSIEEKDFQTKNTSIVKNLYARFGTEAPHLMFA--GHIDVVPPGDFNHWT 85 ++ L+ LG + + A G P + A +D +P + Sbjct: 24 LIAEALESLGIEVRRG-VGGATGVV-----ATIGGGKPGPVVALRADMDALPIQEQTDL- 76 Query: 86 YPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFIPKYKNF--GSISLLITGDEEGPAI 143 P+ +T G ++ G D+ +I A + K + G++ L+ EEG Sbjct: 77 --PYKSTN-PGVMHACG-HDLHTAIL-LGTAKL--LKKLADLLEGTVRLIFQPAEEGGG- 128 Query: 144 NGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGDTIKIGRRGSLSG----EITIHGKQGHV 199 G KM IE I+G I T+ + ++ E+TIHGK H Sbjct: 129 -GATKM---IEDGVLDDVDAILGLHPDPSIPAGTVGLRPGTIMAAADKFEVTIHGKGAHA 184 Query: 200 AYPHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNMEITTIDVGNPSKNVIPAQVKMSF 259 A PHL + + L+ L I + + + I+ G NVIP + MS Sbjct: 185 ARPHLGRDALDAAAQLVVALQQIVSRNVDPSRPAVV-TVGIIEAGGAP-NVIPDKASMSG 242 Query: 260 NIRFNDLWNEKTLKEEIRSRLIKGIQNV 287 +R +L E+R ++I I+ + Sbjct: 243 TVR--------SLDPEVRDQIIDRIERI 262 >gnl|CDD|130941 TIGR01886, dipeptidase, dipeptidase PepV. This model represents a small clade of dipeptidase enzymes which are members of the larger M25 subfamily of metalloproteases. Two characterized enzymes are included in the seed. One, from Lactococcus lactis has been shown to act on a wide range of dipeptides, but not larger peptides. The enzyme from Lactobacillus delbrueckii was originally characterized as a Xaa-His dipeptidase, specifically a carnosinase (beta-Ala-His) by complementation of an E. coli mutant. Further study, including the crystallization of the enzyme, has shown it to also be a non-specific dipeptidase. This group also includes enzymes from Streptococcus and Enterococcus. Length = 466 Score = 56.8 bits (137), Expect = 1e-08 Identities = 34/81 (41%), Positives = 41/81 (50%), Gaps = 9/81 (11%) Query: 39 SIEEKD-FQTKNTSIVKNLYAR--FGTEAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAE 95 S E+D F TKN N +G L GH+DVVP G+ WT PF I E Sbjct: 53 SFAERDGFTTKN---FDNYAGHVEYGAGDERLGIIGHMDVVPAGE--GWTRDPFEPEIDE 107 Query: 96 GKIYGRGIVDMKG-SIACFIA 115 G+IY RG D KG S+A + A Sbjct: 108 GRIYARGASDDKGPSLAAYYA 128 >gnl|CDD|162577 TIGR01880, Ac-peptdase-euk, N-acyl-L-amino-acid amidohydrolase. This model represents a family of eukaryotic N-acyl-L-amino-acid amidohydrolases active on fatty acid and acetyl amides of L-amino acids. Length = 400 Score = 56.3 bits (136), Expect = 1e-08 Identities = 32/88 (36%), Positives = 43/88 (48%), Gaps = 3/88 (3%) Query: 63 EAPHLMFAGHIDVVPPGDFNHWTYPPFSATI-AEGKIYGRGIVDMKGSIACFIAAVAR-F 120 E P ++ H DVVP HWT+PPFSA +G IY RG DMK ++ AV Sbjct: 70 ELPSILLNSHTDVVPVFR-EHWTHPPFSAFKDEDGNIYARGAQDMKCVGVQYLEAVRNLK 128 Query: 121 IPKYKNFGSISLLITGDEEGPAINGTKK 148 +K +I + DEE +G +K Sbjct: 129 ASGFKFKRTIHISFVPDEEIGGHDGMEK 156 >gnl|CDD|162579 TIGR01883, PepT-like, peptidase T-like protein. This model represents a clade of enzymes closely related to Peptidase T, an aminotripeptidase found in bacteria. This clade consists of gram positive bacteria of which several additionally contain a Peptidase T gene. Length = 361 Score = 50.3 bits (120), Expect = 9e-07 Identities = 77/374 (20%), Positives = 129/374 (34%), Gaps = 29/374 (7%) Query: 6 LEHLIQLIKCPSVTPQDGGAFFILVNTLKLLGFSIEEKDFQTKNTSIVKNLYARF-GT-E 63 ++ ++LI+ S + ++ L + LG + + + S NL AR GT + Sbjct: 3 KKYFLELIQIDSESGKEKAILTYLKKQITKLGIPVSLDEVPAE-VSNDNNLIARLPGTVK 61 Query: 64 APHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFIPK 123 + F GH+D VPPG T G I G D K +A + A+ + Sbjct: 62 FDTIFFCGHMDTVPPGAGPEPVVEDGIFTSLGGTILG---ADDKAGVAAMLEAMDVLSTE 118 Query: 124 YKNFGSISLLITGDEEGPAINGTKKMLSWIE-KKGEKWDACIVGEPTCNHIIGDTIKIGR 182 G+I + T EE I S I G DA GE +G I++ Sbjct: 119 ETPHGTIEFIFTVKEELGLIGMRLFDESKITAAYGYCLDA--PGE------VG-NIQLAA 169 Query: 183 RGSLSGEITIHGKQGHVAY-PHLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNMEITTI 241 + + TI GK H P + I +H + D T I + Sbjct: 170 PTQVKVDATIAGKDAHAGLVPEDGISAISVARMAIHAMRLGRIDEETTA------NIGSF 223 Query: 242 DVGNPSKNVIPAQVKMSFNIRFNDLWNEKTLKEEIRSRLIKGIQNVPKLSHTVHFSSPVS 301 G + V Q+ ++ + + + ++++ + Q K T+ + + Sbjct: 224 SGGVNTNIVQDEQLIVAEARSLSF----RKAEAQVQTMRERFEQAAEKYGATLEEETRLI 279 Query: 302 -PVFLTHDRKLTSLLSKSIYNTTGNIPLLSTSGGTSDARFIKDY-CPVIEFGLVGRTMHA 359 F H + + K G SGG SDA + + P + H Sbjct: 280 YEGFKIHPQHPLMNIFKKAAKKIGLKTSEIFSGGGSDANVLNEKGVPTVNLSAGYVHAHT 339 Query: 360 LNENASLQDLEDLT 373 E S++ L L Sbjct: 340 EKETISIEQLVKLA 353 >gnl|CDD|180826 PRK07079, PRK07079, hypothetical protein; Provisional. Length = 469 Score = 38.4 bits (90), Expect = 0.003 Identities = 19/58 (32%), Positives = 26/58 (44%), Gaps = 3/58 (5%) Query: 63 EAPHLMFAGHIDVVPPGDFNHWTYP--PFSATIAEGKIYGRGIVDMKGSIACFIAAVA 118 P ++ GH DVV G W P++ T + YGRG D KG +AA+ Sbjct: 84 ALPTVLIYGHGDVVR-GYDEQWREGLSPWTLTEEGDRWYGRGTADNKGQHTINLAALE 140 >gnl|CDD|168961 PRK07473, PRK07473, carboxypeptidase; Provisional. Length = 376 Score = 37.8 bits (88), Expect = 0.005 Identities = 35/116 (30%), Positives = 47/116 (40%), Gaps = 14/116 (12%) Query: 58 ARF---GTEAPHLMFAGHIDVVPP-GDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACF 113 ARF P ++ AGH+D V P G T K YG GI+DMKG Sbjct: 66 ARFPHPRQGEPGILIAGHMDTVHPVG-----TLEKLPWRREGNKCYGPGILDMKGGNYLA 120 Query: 114 IAAVARFI-PKYKNFGSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIVGEP 168 + A+ + I++L T DEE GT IE + + +V EP Sbjct: 121 LEAIRQLARAGITTPLPITVLFTPDEE----VGTPSTRDLIEAEAARNKYVLVPEP 172 >gnl|CDD|183818 PRK12893, PRK12893, allantoate amidohydrolase; Reviewed. Length = 412 Score = 32.9 bits (76), Expect = 0.14 Identities = 36/197 (18%), Positives = 69/197 (35%), Gaps = 32/197 (16%) Query: 189 EITIHGKQGHV-AYP-HLTENPIRGLIPLLHQLTNIGFDTGNTTFSPTNMEITTIDVGNP 246 E+T+ G+ H P + + + ++ + I + T + V Sbjct: 218 EVTVEGQAAHAGTTPMAMRRDALVAAARIILAVERIAAALAPDGVA-TVGRL---RVEPN 273 Query: 247 SKNVIPAQVKMSFNIRFNDLWNEKTLKEEIRSRLIKGIQNV---PKLSHTVHFSSPVSPV 303 S+NVIP +V + +IR D + + + L + + TV PV Sbjct: 274 SRNVIPGKVVFTVDIRHPD----DARLDAMEAALRAACAKIAAARGVQVTVETVWDFPPV 329 Query: 304 FLTHDRKLTSLL---SKSIYNTTGNIPLLSTSGGTSDARFIKDYCPVIEFGLV------G 354 D L +L+ ++++ + + SG DA F+ P ++ G Sbjct: 330 --PFDPALVALVEAAAEALGLSHMRMV----SGAGHDAMFLARVAPA---AMIFVPCRGG 380 Query: 355 RTMHALNENASLQDLED 371 + H E+ DL Sbjct: 381 IS-HNEAEDTEPADLAA 396 Score = 27.9 bits (63), Expect = 4.6 Identities = 11/28 (39%), Positives = 16/28 (57%), Gaps = 3/28 (10%) Query: 55 NLYARF-GTE--APHLMFAGHIDVVPPG 79 NL+ R GT+ AP ++ H+D P G Sbjct: 64 NLFGRRAGTDPDAPPVLIGSHLDTQPTG 91 >gnl|CDD|180432 PRK06156, PRK06156, hypothetical protein; Provisional. Length = 520 Score = 31.9 bits (73), Expect = 0.28 Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 4/44 (9%) Query: 72 HIDVVP--PGDF--NHWTYPPFSATIAEGKIYGRGIVDMKGSIA 111 H DVVP P + + PF T+ ++YGRG D KG+I Sbjct: 117 HADVVPANPELWVLDGTRLDPFKVTLVGDRLYGRGTEDDKGAIV 160 >gnl|CDD|178296 PLN02693, PLN02693, IAA-amino acid hydrolase. Length = 437 Score = 31.2 bits (70), Expect = 0.47 Identities = 49/214 (22%), Positives = 89/214 (41%), Gaps = 37/214 (17%) Query: 63 EAPHLMFAGHIDVVPPGDFNHWTYPPFSATIAEGKIYGRGIVDMKGSIACFIAAVARFIP 122 E P + +D +P + W + GK++ G G +A + A A+ + Sbjct: 101 EPPFVALRADMDALPIQEAVEWEHKSKIP----GKMHACG---HDGHVAMLLGA-AKILQ 152 Query: 123 KYKNF--GSISLLITGDEEGPAINGTKKMLSWIEKKGEKWDACIVGEPTCNHIIGDTIKI 180 ++++ G++ L+ EEG ++G KKM E+ K I G + Sbjct: 153 EHRHHLQGTVVLIFQPAEEG--LSGAKKMR---EEGALKNVEAIFGIH-----LSPRTPF 202 Query: 181 GRRGSLSG---------EITIHGKQGHVAYPHLTENPI---RGLIPLLHQLTNIGFDTGN 228 G+ S +G E I GK GH A P T +P+ ++ L QL + D + Sbjct: 203 GKAASRAGSFMAGAGVFEAVITGKGGHAAIPQHTIDPVVAASSIVLSLQQLVSRETDPLD 262 Query: 229 TTFSPTNMEITTIDVGNPSKNVIPAQVKMSFNIR 262 + + ++ ++ GN + NVIP + + +R Sbjct: 263 SKV----VTVSKVNGGN-AFNVIPDSITIGGTLR 291 >gnl|CDD|150098 pfam09319, DUF1976, Domain of unknown function (DUF1976). Members of this family are found in a set of hypothetical Mycoplasmal proteins. Their exact function has not, as yet, been defined. Length = 1114 Score = 29.4 bits (66), Expect = 1.8 Identities = 11/23 (47%), Positives = 15/23 (65%) Query: 252 PAQVKMSFNIRFNDLWNEKTLKE 274 Q +SF I F D+ N+K+LKE Sbjct: 1086 LTQKTVSFKINFEDVTNKKSLKE 1108 >gnl|CDD|181761 PRK09290, PRK09290, allantoate amidohydrolase; Reviewed. Length = 413 Score = 28.6 bits (65), Expect = 2.9 Identities = 11/28 (39%), Positives = 16/28 (57%), Gaps = 3/28 (10%) Query: 55 NLYARF-GTE--APHLMFAGHIDVVPPG 79 NL+ R G + AP ++ H+D VP G Sbjct: 61 NLFGRLEGRDPDAPAVLTGSHLDTVPNG 88 >gnl|CDD|183817 PRK12892, PRK12892, allantoate amidohydrolase; Reviewed. Length = 412 Score = 28.1 bits (63), Expect = 3.8 Identities = 10/23 (43%), Positives = 16/23 (69%), Gaps = 2/23 (8%) Query: 240 TIDVGNPSKNVIPAQVKMSFNIR 262 +D G+PS +IP +V+ SF+ R Sbjct: 270 ALDPGSPS--IIPGRVEFSFDAR 290 >gnl|CDD|181769 PRK09300, PRK09300, tRNA splicing endonuclease; Reviewed. Length = 330 Score = 27.2 bits (61), Expect = 6.9 Identities = 8/19 (42%), Positives = 12/19 (63%) Query: 63 EAPHLMFAGHIDVVPPGDF 81 EA +L+F G I++V F Sbjct: 40 EAAYLLFRGKIEIVDGLGF 58 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.320 0.138 0.422 Gapped Lambda K H 0.267 0.0768 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 6,473,218 Number of extensions: 417122 Number of successful extensions: 814 Number of sequences better than 10.0: 1 Number of HSP's gapped: 724 Number of HSP's successfully gapped: 61 Length of query: 389 Length of database: 5,994,473 Length adjustment: 95 Effective length of query: 294 Effective length of database: 3,941,713 Effective search space: 1158863622 Effective search space used: 1158863622 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 58 (26.0 bits)