RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 021917
(305 letters)
>gnl|CDD|215607 PLN03158, PLN03158, methionine aminopeptidase; Provisional.
Length = 396
Score = 585 bits (1510), Expect = 0.0
Identities = 241/296 (81%), Positives = 266/296 (89%), Gaps = 2/296 (0%)
Query: 4 GSDAAETTSLSCVRCGKPAHLQCPKCMELKLPREGAAFCTQDCFKASWTSHKSVHLKAKL 63
+A T+ L+C RC KPAHLQCPKC+ELKLPREGA+FC+QDCFKA+W+SHKSVH KAKL
Sbjct: 1 MEEALTTSPLACARCSKPAHLQCPKCLELKLPREGASFCSQDCFKAAWSSHKSVHTKAKL 60
Query: 64 SAPGTGENSSLVSEGWRYCIKKGQARTPKLPHFDWTGTLRPYPISSKLTVPAYIELPDWA 123
S+ G+NS +EGW YC+KKGQART KLP FDWTG LRPYPIS + VP +I PDWA
Sbjct: 61 SS--IGQNSDAPAEGWLYCLKKGQARTSKLPDFDWTGPLRPYPISPRRVVPDHIPKPDWA 118
Query: 124 LDGTPKVEPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVH 183
LDGTPK+EPNSDLQH VEIKTP+QI+RMRETCRIAREVLDAAAR I+PGVTTDEIDRVVH
Sbjct: 119 LDGTPKIEPNSDLQHSVEIKTPEQIQRMRETCRIAREVLDAAARAIKPGVTTDEIDRVVH 178
Query: 184 EATITAGGYPSPLNYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGD 243
EATI AGGYPSPLNYHFFPKSCCTSVNEVICHGIPD+RKLEDGDIVN+DVTVYYKG HGD
Sbjct: 179 EATIAAGGYPSPLNYHFFPKSCCTSVNEVICHGIPDARKLEDGDIVNVDVTVYYKGCHGD 238
Query: 244 LNETYFVGNADEASRQLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
LNET+FVGN DEASRQLV+CTYECLEKAI+IVKPGVR+RE+GEVINRHATMSG SV
Sbjct: 239 LNETFFVGNVDEASRQLVKCTYECLEKAIAIVKPGVRYREVGEVINRHATMSGLSV 294
>gnl|CDD|238519 cd01086, MetAP1, Methionine Aminopeptidase 1. E.C. 3.4.11.18. Also
known as methionyl aminopeptidase and Peptidase M.
Catalyzes release of N-terminal amino acids,
preferentially methionine, from peptides and arylamides.
Length = 238
Score = 255 bits (655), Expect = 2e-85
Identities = 84/152 (55%), Positives = 103/152 (67%)
Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
IE MRE RI EVLD A+ I+PGVTT E+D++ HE G YP+PL Y+ FPKS CT
Sbjct: 1 IEGMREAGRIVAEVLDELAKAIKPGVTTKELDQIAHEFIEEHGAYPAPLGYYGFPKSICT 60
Query: 208 SVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTYEC 267
SVNEV+CHGIPD R L+DGDIVNIDV V G HGD T+ VG E +++LV+ T E
Sbjct: 61 SVNEVVCHGIPDDRVLKDGDIVNIDVGVELDGYHGDSARTFIVGEVSEEAKKLVEVTEEA 120
Query: 268 LEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
L K I VKPG R +IG I ++A +G+SV
Sbjct: 121 LYKGIEAVKPGNRIGDIGHAIEKYAEKNGYSV 152
>gnl|CDD|235576 PRK05716, PRK05716, methionine aminopeptidase; Validated.
Length = 252
Score = 241 bits (619), Expect = 8e-80
Identities = 81/161 (50%), Positives = 104/161 (64%)
Query: 139 VVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNY 198
+ IKTP++IE+MR R+A EVLD ++PGVTT E+DR+ E G P+PL Y
Sbjct: 2 AITIKTPEEIEKMRVAGRLAAEVLDEIEPHVKPGVTTKELDRIAEEYIRDQGAIPAPLGY 61
Query: 199 HFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASR 258
H FPKS CTSVNEV+CHGIP + L++GDIVNIDVTV G HGD + T+ VG +
Sbjct: 62 HGFPKSICTSVNEVVCHGIPSDKVLKEGDIVNIDVTVIKDGYHGDTSRTFGVGEISPEDK 121
Query: 259 QLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
+L + T E L I+ VKPG R +IG I ++A GFSV
Sbjct: 122 RLCEVTKEALYLGIAAVKPGARLGDIGHAIQKYAEAEGFSV 162
>gnl|CDD|223103 COG0024, Map, Methionine aminopeptidase [Translation, ribosomal
structure and biogenesis].
Length = 255
Score = 220 bits (563), Expect = 2e-71
Identities = 74/169 (43%), Positives = 103/169 (60%), Gaps = 3/169 (1%)
Query: 139 VVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNY 198
++ IKTP++IE+MRE +IA + L A +++PGVTT E+D + E G YP+ L Y
Sbjct: 2 MISIKTPEEIEKMREAGKIAAKALKEVASLVKPGVTTLELDEIAEEFIREKGAYPAFLGY 61
Query: 199 HFFPKSCCTSVNEVICHGIP-DSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGN-ADEA 256
FP C SVNEV+ HGIP D + L++GDIV IDV + G GD T+ VG +DE
Sbjct: 62 KGFPFPTCISVNEVVAHGIPGDKKVLKEGDIVKIDVGAHIDGYIGDTAITFVVGEVSDED 121
Query: 257 SRQLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSVEIKQLV 305
+++L++ T E L I VKPG R +IG I +A GFSV ++ L
Sbjct: 122 AKRLLEATKEALYAGIEAVKPGARLGDIGRAIQEYAESRGFSV-VRNLT 169
>gnl|CDD|237252 PRK12896, PRK12896, methionine aminopeptidase; Reviewed.
Length = 255
Score = 207 bits (529), Expect = 4e-66
Identities = 66/160 (41%), Positives = 91/160 (56%)
Query: 140 VEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYH 199
+EIK+P ++E+MR+ RI L + + PG+TT E+DR+ + G PSP Y+
Sbjct: 8 MEIKSPRELEKMRKIGRIVATALKEMGKAVEPGMTTKELDRIAEKRLEEHGAIPSPEGYY 67
Query: 200 FFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQ 259
FP S C SVNE + HGIP R ++DGD+VNIDV+ Y G HGD T+ VG E + +
Sbjct: 68 GFPGSTCISVNEEVAHGIPGPRVIKDGDLVNIDVSAYLDGYHGDTGITFAVGPVSEEAEK 127
Query: 260 LVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
L + E L I VK G +IG I A +G+SV
Sbjct: 128 LCRVAEEALWAGIKQVKAGRPLNDIGRAIEDFAKKNGYSV 167
>gnl|CDD|129591 TIGR00500, met_pdase_I, methionine aminopeptidase, type I.
Methionine aminopeptidase is a cobalt-binding enzyme.
Bacterial and organellar examples (type I) differ from
eukaroytic and archaeal (type II) examples in lacking a
region of approximately 60 amino acids between the 4th
and 5th cobalt-binding ligands. This model describes
type I. The role of this protein in general is to
produce the mature form of cytosolic proteins by
removing the N-terminal methionine [Protein fate,
Protein modification and repair].
Length = 247
Score = 205 bits (523), Expect = 2e-65
Identities = 77/160 (48%), Positives = 105/160 (65%)
Query: 140 VEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYH 199
+ +K+PD+IE++R+ R+A EVL+ R ++PGV+T E+DR+ + G P+ L Y+
Sbjct: 1 ISLKSPDEIEKIRKAGRLAAEVLEELEREVKPGVSTKELDRIAKDFIEKHGAKPAFLGYY 60
Query: 200 FFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQ 259
FP S C SVNEV+ HGIPD + L+DGDIVNIDV V Y G HGD +T+ VG + +
Sbjct: 61 GFPGSVCISVNEVVIHGIPDKKVLKDGDIVNIDVGVIYDGYHGDTAKTFLVGKISPEAEK 120
Query: 260 LVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
L++CT E L KAI KPG R EIG I ++A GFSV
Sbjct: 121 LLECTEESLYKAIEEAKPGNRIGEIGAAIQKYAEAKGFSV 160
>gnl|CDD|183434 PRK12318, PRK12318, methionine aminopeptidase; Provisional.
Length = 291
Score = 166 bits (421), Expect = 1e-49
Identities = 76/171 (44%), Positives = 108/171 (63%), Gaps = 3/171 (1%)
Query: 137 QHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPL 196
Q+ + IKTP+QIE++R+ C++ +LDA + GVTT+E+D + E P+PL
Sbjct: 38 QYDIIIKTPEQIEKIRKACQVTARILDALCEAAKEGVTTNELDELSRELHKEYNAIPAPL 97
Query: 197 NYHF--FPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNAD 254
NY FPK+ CTS+NEVICHGIP+ L++GDI+NIDV+ G +GD + +G
Sbjct: 98 NYGSPPFPKTICTSLNEVICHGIPNDIPLKNGDIMNIDVSCIVDGYYGDCSRMVMIGEVS 157
Query: 255 EASRQLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSVEIKQLV 305
E +++ Q + ECL AI+I+KPG+ EIGEVI A GFSV + Q V
Sbjct: 158 EIKKKVCQASLECLNAAIAILKPGIPLYEIGEVIENCADKYGFSV-VDQFV 207
>gnl|CDD|215993 pfam00557, Peptidase_M24, Metallopeptidase family M24. This family
contains metallopeptidases. It also contains
non-peptidase homologues such as the N terminal domain
of Spt16 which is a histone H3-H4 binding module.
Length = 208
Score = 155 bits (394), Expect = 1e-46
Identities = 53/150 (35%), Positives = 70/150 (46%), Gaps = 4/150 (2%)
Query: 149 ERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCTS 208
E MR+ RIA L+AA IRPGVT E+ + A + GG FP +
Sbjct: 1 ELMRKAARIAAAALEAALAAIRPGVTERELAAELEAAFLARGGA----RGPAFPPIVASG 56
Query: 209 VNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTYECL 268
N + H IP R L+DGD+V IDV Y G H D+ T+ VG R+L + E
Sbjct: 57 PNAAVPHYIPSDRVLKDGDLVLIDVGAEYDGYHSDITRTFVVGKPTPEQRELYEAVLEAQ 116
Query: 269 EKAISIVKPGVRFREIGEVINRHATMSGFS 298
E AI+ VKPGV ++ G+
Sbjct: 117 EAAIAAVKPGVTGGDVDAAAREVLEEGGYG 146
>gnl|CDD|238514 cd01066, APP_MetAP, A family including aminopeptidase P,
aminopeptidase M, and prolidase. Also known as
metallopeptidase family M24. This family of enzymes is
able to cleave amido-, imido- and amidino-containing
bonds. Members exibit relatively narrow substrate
specificity compared to other metallo-aminopeptidases,
suggesting they play roles in regulation of biological
processes rather than general protein degradation.
Length = 207
Score = 127 bits (321), Expect = 1e-35
Identities = 46/145 (31%), Positives = 70/145 (48%), Gaps = 10/145 (6%)
Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
I R+R+ IA + AAA IRPGVT E+ + +A AGGYP+
Sbjct: 1 IARLRKAAEIAEAAMAAAAEAIRPGVTEAEVAAAIEQALRAAGGYPAG--------PTIV 52
Query: 208 SVNEV--ICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTY 265
+ H PD R+L++GD+V +D+ Y G H DL T+ +G + R+L +
Sbjct: 53 GSGARTALPHYRPDDRRLQEGDLVLVDLGGVYDGYHADLTRTFVIGEPSDEQRELYEAVR 112
Query: 266 ECLEKAISIVKPGVRFREIGEVINR 290
E E A++ ++PGV E+
Sbjct: 113 EAQEAALAALRPGVTAEEVDAAARE 137
>gnl|CDD|223085 COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and
metabolism].
Length = 384
Score = 114 bits (288), Expect = 3e-29
Identities = 48/144 (33%), Positives = 71/144 (49%), Gaps = 7/144 (4%)
Query: 142 IKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGG-YPSPLNYHF 200
IK+P +I ++R+ IA L+AA IRPG+T EI + A G PS F
Sbjct: 154 IKSPAEIAKIRKAAEIADAALEAALEAIRPGMTEAEIAAELEYALRKGGAEGPS-----F 208
Query: 201 FPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQL 260
+ N + H P RKL DGD+V ID+ Y G D+ T+ +G + R++
Sbjct: 209 DTI-VASGENAALPHYTPSDRKLRDGDLVLIDLGGVYNGYCSDITRTFPIGKPSDEQREI 267
Query: 261 VQCTYECLEKAISIVKPGVRFREI 284
+ E E AI+ ++PGV E+
Sbjct: 268 YEAVLEAQEAAIAAIRPGVTGGEV 291
Score = 40.9 bits (96), Expect = 6e-04
Identities = 31/117 (26%), Positives = 43/117 (36%), Gaps = 15/117 (12%)
Query: 133 NSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGY 192
SD+ I P + RE E +AA IRPGVT E+D + + GY
Sbjct: 248 CSDITRTFPIGKPS--DEQREIYEAVLEAQEAAIAAIRPGVTGGEVDAAARQ-VLEKAGY 304
Query: 193 PSPLNYHFFPKSCCTSVNEVIC-HGIP------DSRKLEDGDIVNIDVTVYYKGVHG 242
+F V V+ H P LE G + +I+ +Y G G
Sbjct: 305 G-----LYFLHGTGHGVGFVLDVHEHPQYLSPGSDTTLEPGMVFSIEPGIYIPGGGG 356
>gnl|CDD|171806 PRK12897, PRK12897, methionine aminopeptidase; Reviewed.
Length = 248
Score = 96.6 bits (240), Expect = 1e-23
Identities = 51/161 (31%), Positives = 78/161 (48%)
Query: 139 VVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNY 198
++ IKT ++I+ M E+ ++ A++++PG+TT EI+ V G Y
Sbjct: 1 MITIKTKNEIDLMHESGKLLASCHREIAKIMKPGITTKEINTFVEAYLEKHGATSEQKGY 60
Query: 199 HFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASR 258
+ +P + C SVN+ +CH P L +GDIV ID+ V G D TY VG + +
Sbjct: 61 NGYPYAICASVNDEMCHAFPADVPLTEGDIVTIDMVVNLNGGLSDSAWTYRVGKVSDEAE 120
Query: 259 QLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
+L+ L K I G R +IG I + GFSV
Sbjct: 121 KLLLVAENALYKGIDQAVIGNRVGDIGYAIESYVANEGFSV 161
>gnl|CDD|181534 PRK08671, PRK08671, methionine aminopeptidase; Provisional.
Length = 291
Score = 96.4 bits (241), Expect = 3e-23
Identities = 51/153 (33%), Positives = 73/153 (47%), Gaps = 14/153 (9%)
Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
+E+ E +IA +V + AA++I+PG ++ V G P+ FP C
Sbjct: 2 LEKYLEAGKIASKVREEAAKLIKPGAKLLDVAEFVENRIRELGAKPA------FP--CNI 53
Query: 208 SVNEVICHGIP---DSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCT 264
S+NEV H P D R +GD+V +D+ + G D T +G E LV+ +
Sbjct: 54 SINEVAAHYTPSPGDERVFPEGDVVKLDLGAHVDGYIADTAVTVDLGGKYE---DLVEAS 110
Query: 265 YECLEKAISIVKPGVRFREIGEVINRHATMSGF 297
E LE AI +V+PGV EIG VI GF
Sbjct: 111 EEALEAAIEVVRPGVSVGEIGRVIEETIRSYGF 143
>gnl|CDD|238525 cd01092, APP-like, Similar to Prolidase and Aminopeptidase P. The
members of this subfamily presumably catalyse hydrolysis
of Xaa-Pro dipeptides and/or release of any N-terminal
amino acid, including proline, that is linked with
proline.
Length = 208
Score = 92.2 bits (230), Expect = 2e-22
Identities = 42/138 (30%), Positives = 65/138 (47%), Gaps = 7/138 (5%)
Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGY-PSPLNYHFFPKSCC 206
IE +R+ RIA + + I+PG+T E+ + G PS F
Sbjct: 1 IELLRKAARIADKAFEELLEFIKPGMTEREVAAELEYFMRKLGAEGPS------FDTIVA 54
Query: 207 TSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTYE 266
+ N + HG+P RK+E+GD+V ID Y G D+ T VG + +++ + E
Sbjct: 55 SGPNSALPHGVPSDRKIEEGDLVLIDFGAIYDGYCSDITRTVAVGEPSDELKEIYEIVLE 114
Query: 267 CLEKAISIVKPGVRFREI 284
+ AI VKPGV +E+
Sbjct: 115 AQQAAIKAVKPGVTAKEV 132
Score = 35.2 bits (82), Expect = 0.024
Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 3/59 (5%)
Query: 134 SDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGY 192
SD+ V + P + ++E I E AA + ++PGVT E+D+ + I GY
Sbjct: 90 SDITRTVAVGEPS--DELKEIYEIVLEAQQAAIKAVKPGVTAKEVDKAARD-VIEEAGY 145
>gnl|CDD|238521 cd01088, MetAP2, Methionine Aminopeptidase 2. E.C. 3.4.11.18. Also
known as methionyl aminopeptidase and peptidase M.
Catalyzes release of N-terminal amino acids,
preferentially methionine, from peptides and arylamides.
Length = 291
Score = 79.2 bits (196), Expect = 4e-17
Identities = 47/148 (31%), Positives = 66/148 (44%), Gaps = 18/148 (12%)
Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
+E+ RE I R+V A +I+PG+T EI V G P+ FP +
Sbjct: 1 LEKYREAGEIHRQVRKYAQSLIKPGMTLLEIAEFVENRIRELGAGPA------FPVNL-- 52
Query: 208 SVNEVICHGIP---DSRKLEDGDIVNIDVTVYYKGVH--GDLNETYFVGNADEASRQLVQ 262
S+NE H P D L++GD+V +D G H G + ++ F + D L++
Sbjct: 53 SINECAAHYTPNAGDDTVLKEGDVVKLDF-----GAHVDGYIADSAFTVDFDPKYDDLLE 107
Query: 263 CTYECLEKAISIVKPGVRFREIGEVINR 290
E L AI P VR EIGE I
Sbjct: 108 AAKEALNAAIKEAGPDVRLGEIGEAIEE 135
Score = 31.5 bits (72), Expect = 0.46
Identities = 24/85 (28%), Positives = 35/85 (41%), Gaps = 21/85 (24%)
Query: 158 AREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSP---LNYHFFPKSCCTSVNEVIC 214
A+E L+AA + P V EI + E I + G+ P L H S+
Sbjct: 109 AKEALNAAIKEAGPDVRLGEIGEAIEE-VIESYGF-KPIRNLTGH--------SIERYRL 158
Query: 215 HG---IP-----DSRKLEDGDIVNI 231
H IP + +LE+GD+ I
Sbjct: 159 HAGKSIPNVKGGEGTRLEEGDVYAI 183
>gnl|CDD|129592 TIGR00501, met_pdase_II, methionine aminopeptidase, type II.
Methionine aminopeptidase (map) is a cobalt-binding
enzyme. Bacterial and organellar examples (type I)
differ from eukaroytic and archaeal (type II) examples
in lacking a region of approximately 60 amino acids
between the 4th and 5th cobalt-binding ligands. The role
of this protein in general is to produce the mature
amino end of cytosolic proteins by removing the
N-terminal methionine. This model describes type II,
among which the eukaryotic members typically have an
N-terminal extension not present in archaeal members. It
can act cotranslationally. The enzyme from rat has been
shown to associate with translation initiation factor 2
(IF-2) and may have a role in translational regulation
[Protein fate, Protein modification and repair].
Length = 295
Score = 77.5 bits (191), Expect = 2e-16
Identities = 44/153 (28%), Positives = 64/153 (41%), Gaps = 14/153 (9%)
Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
E+ E +I +V AA I PGV E+ V G P+ FP C
Sbjct: 5 AEKWIEAGKIHSKVRREAADRIVPGVKLLEVAEFVENRIRELGAEPA------FP--CNI 56
Query: 208 SVNEVICHGIP---DSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCT 264
S+NE H P D +DGD+V +D+ + V G + +T + + LV+
Sbjct: 57 SINECAAHFTPKAGDKTVFKDGDVVKLDLGAH---VDGYIADTAITVDLGDQYDNLVKAA 113
Query: 265 YECLEKAISIVKPGVRFREIGEVINRHATMSGF 297
+ L AI ++ GVR EIG+ I G
Sbjct: 114 KDALYTAIKEIRAGVRVGEIGKAIQEVIESYGV 146
Score = 32.4 bits (74), Expect = 0.26
Identities = 26/83 (31%), Positives = 37/83 (44%), Gaps = 15/83 (18%)
Query: 158 AREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCTSVNEVICHG- 216
A++ L A + IR GV EI + + E I + G N S+ HG
Sbjct: 113 AKDALYTAIKEIRAGVRVGEIGKAIQE-VIESYGVKPISNL------TGHSMAPYRLHGG 165
Query: 217 --IP-----DSRKLEDGDIVNID 232
IP D+ KLE+GD+V I+
Sbjct: 166 KSIPNVKERDTTKLEEGDVVAIE 188
>gnl|CDD|182080 PRK09795, PRK09795, aminopeptidase; Provisional.
Length = 361
Score = 62.3 bits (151), Expect = 6e-11
Identities = 41/175 (23%), Positives = 72/175 (41%), Gaps = 34/175 (19%)
Query: 141 EIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEI---------DRVVHEA---TIT 188
+IKTP+++E++R C IA + R I+ G++ EI + +A TI
Sbjct: 126 QIKTPEEVEKIRLACGIADRGAEHIRRFIQAGMSEREIAAELEWFMRQQGAEKASFDTIV 185
Query: 189 AGGYPSPLNYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETY 248
A G+ L HG + + G+ V +D Y+G D+ T
Sbjct: 186 ASGWRGALP-----------------HGKASDKIVAAGEFVTLDFGALYQGYCSDMTRTL 228
Query: 249 FVGNADEASRQ-----LVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFS 298
V ++ + Q + AIS ++PGVR +++ + R T +G+
Sbjct: 229 LVNGEGVSAESHPLFNVYQIVLQAQLAAISAIRPGVRCQQVDDAARRVITEAGYG 283
>gnl|CDD|182804 PRK10879, PRK10879, proline aminopeptidase P II; Provisional.
Length = 438
Score = 60.9 bits (148), Expect = 2e-10
Identities = 43/151 (28%), Positives = 68/151 (45%), Gaps = 8/151 (5%)
Query: 142 IKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEID-RVVHEATITAGGYPSPLNYHF 200
K+P++I +R I+ A RPG+ +++ + HE YPS Y+
Sbjct: 173 FKSPEEIAVLRRAGEISALAHTRAMEKCRPGMFEYQLEGEIHHEFNRHGARYPS---YNT 229
Query: 201 FPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFV-GNADEASRQ 259
S N I H + ++ DGD+V ID YKG GD+ T+ V G A R+
Sbjct: 230 IVGS---GENGCILHYTENESEMRDGDLVLIDAGCEYKGYAGDITRTFPVNGKFTPAQRE 286
Query: 260 LVQCTYECLEKAISIVKPGVRFREIGEVINR 290
+ E LE ++ + +PG RE+ + R
Sbjct: 287 IYDIVLESLETSLRLYRPGTSIREVTGEVVR 317
>gnl|CDD|173039 PRK14575, PRK14575, putative peptidase; Provisional.
Length = 406
Score = 59.7 bits (144), Expect = 5e-10
Identities = 40/155 (25%), Positives = 72/155 (46%), Gaps = 6/155 (3%)
Query: 130 VEPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITA 189
V+ +S + IK+P +I+R+R++ I + A+++IR G T+ E+ A +
Sbjct: 166 VDSSSIFNELRVIKSPWEIKRLRKSAEITEYGITEASKLIRVGCTSAELTAAYKAAVM-- 223
Query: 190 GGYPSPLNYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYF 249
S HF + + IP + K GD++ D V G D+ T+
Sbjct: 224 ----SKSETHFSRFHLISVGADFSPKLIPSNTKACSGDLIKFDCGVDVDGYGADIARTFV 279
Query: 250 VGNADEASRQLVQCTYECLEKAISIVKPGVRFREI 284
VG E +R++ Q E +S+V PGV+ +++
Sbjct: 280 VGEPPEITRKIYQTIRTGHEHMLSMVAPGVKMKDV 314
>gnl|CDD|185095 PRK15173, PRK15173, peptidase; Provisional.
Length = 323
Score = 59.0 bits (142), Expect = 5e-10
Identities = 40/155 (25%), Positives = 72/155 (46%), Gaps = 6/155 (3%)
Query: 130 VEPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITA 189
V+ +S + IK+P +I+R+R++ I + A+++IR G T+ E+ A +
Sbjct: 83 VDSSSIFNELRVIKSPWEIKRLRKSAEITEYGITEASKLIRVGCTSAELTAAYKAAVM-- 140
Query: 190 GGYPSPLNYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYF 249
S HF + + IP + K GD++ D V G D+ T+
Sbjct: 141 ----SKSETHFSRFHLISVGADFSPKLIPSNTKACSGDLIKFDCGVDVDGYGADIARTFV 196
Query: 250 VGNADEASRQLVQCTYECLEKAISIVKPGVRFREI 284
VG E +R++ Q E +S+V PGV+ +++
Sbjct: 197 VGEPPEITRKIYQTIRTGHEHMLSMVAPGVKMKDV 231
>gnl|CDD|173040 PRK14576, PRK14576, putative endopeptidase; Provisional.
Length = 405
Score = 58.1 bits (140), Expect = 2e-09
Identities = 40/166 (24%), Positives = 74/166 (44%), Gaps = 20/166 (12%)
Query: 126 GTPKVEPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEA 185
G V+ + + IK+P +IE +R++ I + +AA+ IR G T E+ A
Sbjct: 161 GLKLVDSTALFNEIRMIKSPWEIEHLRKSAEITEYGIASAAKKIRVGCTAAELTAAFKAA 220
Query: 186 TITAGGYPSPLNYHFFPKSCCTSVNEVICHG-------IPDSRKLEDGDIVNIDVTVYYK 238
++ FP++ + N +I G I D+ + GD++ D +
Sbjct: 221 VMS------------FPETNFSRFN-LISVGDNFSPKIIADTTPAKVGDLIKFDCGIDVA 267
Query: 239 GVHGDLNETYFVGNADEASRQLVQCTYECLEKAISIVKPGVRFREI 284
G DL T+ +G D+ ++Q+ E +S+V PGV+ + +
Sbjct: 268 GYGADLARTFVLGEPDKLTQQIYDTIRTGHEHMLSMVAPGVKLKAV 313
>gnl|CDD|180918 PRK07281, PRK07281, methionine aminopeptidase; Reviewed.
Length = 286
Score = 56.8 bits (137), Expect = 3e-09
Identities = 41/198 (20%), Positives = 74/198 (37%), Gaps = 32/198 (16%)
Query: 139 VVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPL-- 196
++ +K+ +IE M + +I+PGV E++ V P +
Sbjct: 1 MITLKSAREIEAMDRAGDFLASIHIGLRDLIKPGVDMWEVEEYVRRRCKEENVLPLQIGV 60
Query: 197 --NYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDV--------------------- 233
+P + C +N+ + H P L++GD++ +D+
Sbjct: 61 DGAMMDYPYATCCGLNDEVAHAFPRHYILKEGDLLKVDMVLSEPLDKSIVDVSKLNFDNV 120
Query: 234 ------TVYYKGVHGDLNETYFVGNADEASRQLVQCTYECLEKAISIVKPGVRFREIGEV 287
T Y+G D Y VG + + L+ T E + + I G R +IG
Sbjct: 121 EQMKKYTESYRGGLADSCWAYAVGTPSDEVKNLMDVTKEAMYRGIEQAVVGNRIGDIGAA 180
Query: 288 INRHATMSGFSVEIKQLV 305
I +A G+ V ++ LV
Sbjct: 181 IQEYAESRGYGV-VRDLV 197
>gnl|CDD|238520 cd01087, Prolidase, Prolidase. E.C. 3.4.13.9. Also known as Xaa-Pro
dipeptidase, X-Pro dipeptidase, proline dipeptidase.,
imidodipeptidase, peptidase D, gamma-peptidase.
Catalyses hydrolysis of Xaa-Pro dipeptides; also acts on
aminoacyl-hydroxyproline analogs. No action on Pro-Pro.
Length = 243
Score = 55.3 bits (134), Expect = 6e-09
Identities = 41/138 (29%), Positives = 64/138 (46%), Gaps = 7/138 (5%)
Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
IE MR+ C I+ E AA + RPG++ E++ E + G +Y
Sbjct: 1 IELMRKACDISAEAHRAAMKASRPGMSEYELEAEF-EYEFRSRGARLAYSYI-----VAA 54
Query: 208 SVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFV-GNADEASRQLVQCTYE 266
N I H + + + L+DGD+V ID Y G D+ T+ V G + R+L +
Sbjct: 55 GSNAAILHYVHNDQPLKDGDLVLIDAGAEYGGYASDITRTFPVNGKFTDEQRELYEAVLA 114
Query: 267 CLEKAISIVKPGVRFREI 284
+ AI+ KPGV + +I
Sbjct: 115 AQKAAIAACKPGVSYEDI 132
>gnl|CDD|129586 TIGR00495, crvDNA_42K, 42K curved DNA binding protein. Proteins
identified by This model have been identified in a
number of species as a nuclear (but not nucleolar)
protein with a cell cycle dependence. Various names
given to members of this family have included cell cycle
protein p38-2G4, DNA-binding protein GBP16, and
proliferation-associated protein 1. This protein is
closely related to methionine aminopeptidase, a
cobolt-binding protein [Unknown function, General].
Length = 389
Score = 50.7 bits (121), Expect = 4e-07
Identities = 31/105 (29%), Positives = 47/105 (44%), Gaps = 11/105 (10%)
Query: 201 FPKSCCTSVNEVICHGIP----DSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEA 256
FP C SVN + H P L++GD+V ID+ + G + T+ VG A E
Sbjct: 77 FPT--CISVNNCVGHFSPLKSDQDYILKEGDVVKIDLGCHIDGFIALVAHTFVVGVAQEE 134
Query: 257 S-----RQLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSG 296
++ + E A+ +VKPG ++ E IN+ A G
Sbjct: 135 PVTGRKADVIAAAHLAAEAALRLVKPGNTNTQVTEAINKVAHSYG 179
Score = 27.9 bits (62), Expect = 7.5
Identities = 11/30 (36%), Positives = 18/30 (60%), Gaps = 4/30 (13%)
Query: 158 AREVLDAAARMIRPGVT----TDEIDRVVH 183
A +AA R+++PG T T+ I++V H
Sbjct: 147 AHLAAEAALRLVKPGNTNTQVTEAINKVAH 176
>gnl|CDD|178689 PLN03144, PLN03144, Carbon catabolite repressor protein 4 homolog;
Provisional.
Length = 606
Score = 49.0 bits (117), Expect = 1e-06
Identities = 20/59 (33%), Positives = 30/59 (50%), Gaps = 3/59 (5%)
Query: 15 CVRCGKPAHLQCPKCMELKLPREGAAFCTQDCFKASWTSHKSVHLKAKLSAPGTGENSS 73
V +PA LQC C++ KLP + C+ CF +W H+ +H +A A EN +
Sbjct: 66 SVHPSEPATLQCVGCVKAKLPVSKSYHCSPKCFSDAWRHHRVLHERA---ASAVRENGN 121
>gnl|CDD|238522 cd01089, PA2G4-like, Related to aminopepdidase M, this family
contains proliferation-associated protein 2G4. Family
members have been implicated in cell cycle control.
Length = 228
Score = 46.2 bits (110), Expect = 5e-06
Identities = 40/166 (24%), Positives = 67/166 (40%), Gaps = 30/166 (18%)
Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEI----DRVVHEATITAGGYPSPLNYH---F 200
+ + + +IA +VL + PG ++ D+++ E Y
Sbjct: 1 VTKYKTAGQIANKVLKQVISLCVPGAKVVDLCEKGDKLILEEL--GKVYKKEKKLEKGIA 58
Query: 201 FPKSCCTSVNEVICHGIP----DSRKLEDGDIVNIDVTVYYKGVHGD-----LNETYFVG 251
FP C SVN +CH P + L+DGD+V ID+ G H D + T VG
Sbjct: 59 FPT--CISVNNCVCHFSPLKSDATYTLKDGDVVKIDL-----GCHIDGYIAVVAHTIVVG 111
Query: 252 NADEASR-----QLVQCTYECLEKAISIVKPGVRFREIGEVINRHA 292
E ++ + LE A+ +++PG + +I E I +
Sbjct: 112 AEAETPVTGKKADVIAAAHYALEAALRLLRPGNQNSDITEAIQKVI 157
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 41.2 bits (97), Expect = 5e-04
Identities = 41/165 (24%), Positives = 63/165 (38%), Gaps = 18/165 (10%)
Query: 146 DQIERMRETCRIAREVLDAAARMIRPGVT----TDEIDRVVHEATITAGGYPSPLNYHFF 201
+Q + +R + R+V A +I+PGV + I+ E I A G F
Sbjct: 156 EQYQDLRRAAEVHRQVRRYAQSVIKPGVKLIDICERIESKSRE-LIEADGLKCGWA---F 211
Query: 202 PKSCCTSVNEVICHGIP---DSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASR 258
P C S+N H P D L D+ +D + V+G + + F +
Sbjct: 212 PTGC--SLNHCAAHYTPNTGDKTVLTYDDVCKLDFGTH---VNGRIIDCAFTVAFNPKYD 266
Query: 259 QLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSVEIKQ 303
L+Q T + I VR +IG I + + VEIK
Sbjct: 267 PLLQATKDATNTGIKEAGIDVRLSDIGAAIQE--VIESYEVEIKG 309
>gnl|CDD|238523 cd01090, Creatinase, Creatine amidinohydrolase. E.C.3.5.3.3.
Hydrolyzes creatine to sarcosine and urea.
Length = 228
Score = 39.4 bits (92), Expect = 0.001
Identities = 18/75 (24%), Positives = 38/75 (50%)
Query: 215 HGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTYECLEKAISI 274
H +RK++ GDI++++ G + L T F+ +A ++ + E+ + +
Sbjct: 69 HNPVTNRKVQRGDILSLNCFPMIAGYYTALERTLFLDEVSDAHLKIWEANVAVHERGLEL 128
Query: 275 VKPGVRFREIGEVIN 289
+KPG R ++I +N
Sbjct: 129 IKPGARCKDIAAELN 143
>gnl|CDD|201954 pfam01753, zf-MYND, MYND finger.
Length = 39
Score = 33.5 bits (77), Expect = 0.007
Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 9/46 (19%)
Query: 15 CVRCGKPAH--LQCPKCMELKLPREGAAFCTQDCFKASWTSHKSVH 58
C CGK A L+C +C +C+++C KA W HK
Sbjct: 1 CAVCGKEALKLLRCSRCK-------SVYYCSKECQKADWPYHKKEC 39
>gnl|CDD|235000 PRK02106, PRK02106, choline dehydrogenase; Validated.
Length = 560
Score = 36.0 bits (84), Expect = 0.023
Identities = 25/73 (34%), Positives = 29/73 (39%), Gaps = 21/73 (28%)
Query: 144 TPDQIERMRETCRIAREVLDAAA------RMIRPG--VTTD-EIDRVVHEATITAGGYPS 194
T R+ R+ RE++ A R I PG V TD EID V E TA
Sbjct: 417 TEQDWREFRDAIRLTREIMAQPALDPYRGREISPGADVQTDEEIDAFVREHAETA----- 471
Query: 195 PLNYHFFPKSCCT 207
YH P CT
Sbjct: 472 ---YH--P--SCT 477
>gnl|CDD|211774 TIGR02993, ectoine_eutD, ectoine utilization protein EutD. Members
of this family are putative peptidases or hydrolases
similar to Xaa-Pro aminopeptidase (pfam00557). They
belong to ectoine utilization operons, as found in
Sinorhizobium meliloti 1021 (where it is known to be
induced by ectoine), Mesorhizobium loti, Silicibacter
pomeroyi, Agrobacterium tumefaciens, and Pseudomonas
putida. The exact function is unknown.
Length = 391
Score = 34.9 bits (80), Expect = 0.040
Identities = 34/144 (23%), Positives = 59/144 (40%), Gaps = 15/144 (10%)
Query: 142 IKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATIT-----AGGYPS-- 194
+K+P +I MR RI + I PG+ ++ +++A I G YP+
Sbjct: 158 VKSPTEIAYMRIAARIVENMHQRIFERIEPGMRKCDLVAEIYDAGIRGADGFGGDYPAIV 217
Query: 195 PLNYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNAD 254
PL P + + H D ++ G+ ++ YK H L+ T F+G
Sbjct: 218 PL----LP----SGADASAPHLTWDDSPMKKGEGTFFEIAGCYKRYHCPLSRTVFLGKPT 269
Query: 255 EASRQLVQCTYECLEKAISIVKPG 278
+A + E +E ++ KPG
Sbjct: 270 QAFLDAEKAVLEGMEAGLAAAKPG 293
>gnl|CDD|224842 COG1931, COG1931, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 140
Score = 31.2 bits (71), Expect = 0.33
Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 7/64 (10%)
Query: 139 VVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAG------GY 192
V E +T +ER+RE R + +LD A ++ G+T +E+ +++ G
Sbjct: 42 VGEGRTLKSLERLRELLR-KQRILDTARMVLEKGLTGNEVTFYLNKQAAYVGRVNFVEED 100
Query: 193 PSPL 196
SPL
Sbjct: 101 ESPL 104
>gnl|CDD|227587 COG5262, HTA1, Histone H2A [Chromatin structure and dynamics].
Length = 132
Score = 30.2 bits (68), Expect = 0.61
Identities = 9/29 (31%), Positives = 17/29 (58%)
Query: 176 DEIDRVVHEATITAGGYPSPLNYHFFPKS 204
+E+++++ + TI GG +N PKS
Sbjct: 93 EELNKLLGDVTIAQGGVLPNINPGLLPKS 121
>gnl|CDD|224978 COG2067, FadL, Long-chain fatty acid transport protein [Lipid
metabolism].
Length = 440
Score = 30.8 bits (70), Expect = 1.0
Identities = 21/107 (19%), Positives = 32/107 (29%), Gaps = 23/107 (21%)
Query: 38 GAAFCTQDCFKASWTSHKSVHLKAKLSAPGTGENSSLVSEGWRYCIKKGQARTPKLPHFD 97
G A+ D + + V + K G S ++ G A
Sbjct: 225 GVAYEINDNNRIGLSYRSKVKMDFK------GGYESSLNLGMAGLPIGPLA--------- 269
Query: 98 WTGTLRPYPISSKLTVPAYIEL-------PDWALDGTPKVEPNSDLQ 137
G + P L++PA EL WA+ G+ K S
Sbjct: 270 -LGGVTPGSGKLTLSLPASAELSGQHKVADQWAIHGSVKWTDWSSFD 315
>gnl|CDD|217823 pfam03972, MmgE_PrpD, MmgE/PrpD family. This family includes
2-methylcitrate dehydratase EC:4.2.1.79 (PrpD) that is
required for propionate catabolism. It catalyzes the
third step of the 2-methylcitric acid cycle.
Length = 445
Score = 30.4 bits (69), Expect = 1.2
Identities = 13/47 (27%), Positives = 16/47 (34%), Gaps = 4/47 (8%)
Query: 155 CRIAREVLDAAARMIRP-GVTTDEIDRVV---HEATITAGGYPSPLN 197
R A DAA + R G DEI+ + E I P
Sbjct: 267 ERHAHSAADAALALRREEGDRADEIESIEIETFEVAIRIINKRGPPA 313
>gnl|CDD|234035 TIGR02860, spore_IV_B, stage IV sporulation protein B. SpoIVB, the
stage IV sporulation protein B of endospore-forming
bacteria such as Bacillus subtilis, is a serine
proteinase, expressed in the spore (rather than mother
cell) compartment, that participates in a proteolytic
activation cascade for Sigma-K. It appears to be
universal among endospore-forming bacteria and occurs
nowhere else [Cellular processes, Sporulation and
germination].
Length = 402
Score = 30.4 bits (69), Expect = 1.3
Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 5/33 (15%)
Query: 215 HGI--PDSRK---LEDGDIVNIDVTVYYKGVHG 242
H I D+ K +E+G IVN +T KG G
Sbjct: 220 HVITDIDTGKPISVENGQIVNSSITSIEKGRRG 252
>gnl|CDD|223407 COG0330, HflC, Membrane protease subunits, stomatin/prohibitin
homologs [Posttranslational modification, protein
turnover, chaperones].
Length = 291
Score = 30.1 bits (68), Expect = 1.3
Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 3/39 (7%)
Query: 225 DGDIVNIDVTVYYKGVHGDLNETYF-VGNADEASRQLVQ 262
D IV++D V Y+ D + + V NA+ A RQLVQ
Sbjct: 83 DNVIVSVDAVVQYRVT--DPQKAVYNVENAEAALRQLVQ 119
>gnl|CDD|234956 PRK01490, tig, trigger factor; Provisional.
Length = 435
Score = 30.1 bits (69), Expect = 1.5
Identities = 9/24 (37%), Positives = 11/24 (45%)
Query: 217 IPDSRKLEDGDIVNIDVTVYYKGV 240
+P R E+GD V ID G
Sbjct: 152 VPVERPAENGDRVTIDFVGSIDGE 175
>gnl|CDD|233451 TIGR01531, glyc_debranch, glycogen debranching enzymye. glycogen
debranching enzyme possesses two different catalytic
activities; oligo-1,4-->1,4-glucantransferase (EC
2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33). Site
directed mutagenesis studies in S. cerevisiae indicate
that the transferase and glucosidase activities are
independent and located in different regions of the
polypeptide chain. Proteins in this model belong to the
larger alpha-amylase family. The model covers eukaryotic
proteins with a seed composed of human, nematode and
yeast sequences. Yeast seed sequence is well
characterized. The model is quite rigorous; either query
sequence yields large bit score or it fails to hit the
model altogether. There doesn't appear to be any middle
ground [Energy metabolism, Biosynthesis and degradation
of polysaccharides].
Length = 1464
Score = 30.2 bits (68), Expect = 1.8
Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 6/60 (10%)
Query: 244 LNETYFVGNADEASRQLVQCTYECLEKAISIVKPGVRFRE--IGEVINRHATMSGFSVEI 301
++ + A + L + YE L+K G++FRE G I+R T GF+V I
Sbjct: 1123 PDDDSIPVDDGRADQYLFEVIYEALQKHFQ----GIQFRERNAGPQIDRVMTDEGFNVTI 1178
>gnl|CDD|239876 cd04413, NDPk_I, Nucleoside diphosphate kinase Group I
(NDPk_I)-like: NDP kinase domains are present in a large
family of structurally and functionally conserved
proteins from bacteria to humans that generally catalyze
the transfer of gamma-phosphates of a nucleoside
triphosphate (NTP) donor onto a nucleoside diphosphate
(NDP) acceptor through a phosphohistidine intermediate.
The mammalian nm23/NDP kinase gene family can be divided
into two distinct groups. The group I genes encode
proteins that generally have highly homologous
counterparts in other organisms and possess the classic
enzymatic activity of a kinase. This group includes
vertebrate NDP kinases A-D (Nm23- H1 to -H4), and its
counterparts in bacteria, archea and other eukaryotes.
NDP kinases exist in two different quaternary
structures; all known eukaryotic enzymes are hexamers,
while some bacterial enzymes are tetramers, as in
Myxococcus. They possess the NDP kinase active site
motif (NXXH[G/A]SD) and the nine residues that are most
essential for catalysis.
Length = 130
Score = 28.6 bits (65), Expect = 2.2
Identities = 11/23 (47%), Positives = 17/23 (73%), Gaps = 1/23 (4%)
Query: 269 EKAISIVKP-GVRFREIGEVINR 290
E+ + I+KP GV+ IGE+I+R
Sbjct: 1 ERTLVIIKPDGVQRGLIGEIISR 23
>gnl|CDD|227872 COG5585, COG5585, NAD+--asparagine ADP-ribosyltransferase [Signal
transduction mechanisms].
Length = 417
Score = 29.4 bits (66), Expect = 2.2
Identities = 26/135 (19%), Positives = 45/135 (33%), Gaps = 13/135 (9%)
Query: 156 RIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCTSVNEVICH 215
+ A +VL A G I R +H T+ + C V +
Sbjct: 177 KYANQVLKA------IGAKLSGITRQLH-GTVAGVKEAEVGEFPPVHPGCRNGFGGVKEN 229
Query: 216 GIPDSRKLED-----GDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTYECLEK 270
+ D + L+ D+ D+ Y K D NE + ++L + L+
Sbjct: 230 FVKDPKSLDQNPNIHDDLGKKDLKNYTKDPRADENEKLRQTQDRDLGKKLRKTGMPALKN 289
Query: 271 AISIVKPG-VRFREI 284
++P VR R+
Sbjct: 290 GEGQIRPKEVRIRKK 304
>gnl|CDD|227919 COG5632, COG5632, N-acetylmuramoyl-L-alanine amidase [Cell envelope
biogenesis, outer membrane].
Length = 302
Score = 29.0 bits (65), Expect = 2.6
Identities = 8/23 (34%), Positives = 9/23 (39%)
Query: 94 PHFDWTGTLRPYPISSKLTVPAY 116
H DW+G L P K A
Sbjct: 139 KHKDWSGKLCPARYLDKGRWDAV 161
>gnl|CDD|218493 pfam05199, GMC_oxred_C, GMC oxidoreductase. This domain found
associated with pfam00732.
Length = 139
Score = 28.1 bits (63), Expect = 3.0
Identities = 14/60 (23%), Positives = 21/60 (35%), Gaps = 7/60 (11%)
Query: 125 DGTPKVEPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHE 184
G P ++PN P + +R R+AR +L AA + PG
Sbjct: 15 LGLPVIDPNYL-------SDPADLAALRAALRLARRILAAALVELTPGPVEVVPGAAEAS 67
>gnl|CDD|206093 pfam13922, PHD_3, PHD domain of transcriptional enhancer, Asx.
This is the DNA-binding domain on the additional sex
combs-like 1 proteins. The Asx protein acts as an
enhancer of trithorax and polycomb in displaying
bidirectional homoeotic phenotypes in Drosophila,
suggesting that it is required for maintenance of both
activation and silencing of Hox genes. Asx is required
for normal adult haematopoiesis and its function
depends on its cellular context.
Length = 64
Score = 26.7 bits (59), Expect = 3.1
Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 10/50 (20%)
Query: 1 MAGGSDAAETTSLSCVRCGKPAHLQCPKCMELKLPREGAAFCTQDCFKAS 50
G + AE T+ C C A + C +C AFC DC S
Sbjct: 17 NFSGENTAEATANKCA-CRLKAMVICQQC---------GAFCHDDCIGPS 56
>gnl|CDD|203013 pfam04438, zf-HIT, HIT zinc finger. This presumed zinc finger
contains up to 6 cysteine residues that could
coordinate zinc. The domain is named after the HIT
protein. This domain is also found in the Thyroid
receptor interacting protein 3 (TRIP-3) that
specifically interacts with the ligand binding domain
of the thyroid receptor.
Length = 30
Score = 25.7 bits (57), Expect = 3.5
Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 8/34 (23%)
Query: 15 CVRCGKPAHLQCPKCMELKLPREGAAFCTQDCFK 48
C CG P+ +CP+C G +C+ +C+K
Sbjct: 5 CSVCGNPSKYRCPRC--------GVRYCSLECYK 30
>gnl|CDD|206170 pfam14000, Packaging_FI, DNA packaging protein FI. This family
includes the lambda phage DNA-packaging protein FI.
Proteins in this family are typically between 124 and
140 amino acids in length. There is a conserved EEE
sequence motif.
Length = 125
Score = 27.8 bits (62), Expect = 3.6
Identities = 9/26 (34%), Positives = 15/26 (57%)
Query: 280 RFREIGEVINRHATMSGFSVEIKQLV 305
R RE+ ++ R +SG + E+ Q V
Sbjct: 8 RLRELAAILGREPDVSGSAAELAQRV 33
>gnl|CDD|183506 PRK12406, PRK12406, long-chain-fatty-acid--CoA ligase; Provisional.
Length = 509
Score = 28.9 bits (65), Expect = 3.8
Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 10/64 (15%)
Query: 149 ERMRETCRIAREVLDAAARM----IRPGVTTDEIDR------VVHEATITAGGYPSPLNY 198
+R R +A+ AA + +RPG + R A + G Y P+N+
Sbjct: 9 DRRRSFDELAQRAARAAGGLAALGVRPGDCVALLMRNDFAFFEAAYAAMRLGAYAVPVNW 68
Query: 199 HFFP 202
HF P
Sbjct: 69 HFKP 72
>gnl|CDD|222530 pfam14082, DUF4263, Domain of unknown function (DUF4263). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria, archaea,
eukaryotes and viruses. Proteins in this family are
typically between 244 and 403 amino acids in length.
Length = 162
Score = 28.0 bits (63), Expect = 4.3
Identities = 10/44 (22%), Positives = 21/44 (47%), Gaps = 3/44 (6%)
Query: 130 VEPNSDLQHVVEIKTPDQ---IERMRETCRIAREVLDAAARMIR 170
+ +SD +VEIK P+ + R + E+ A ++++
Sbjct: 52 ADKDSDNLVLVEIKRPNTNLFTKTPRGNGAPSAELSGAISQILD 95
>gnl|CDD|129632 TIGR00541, hisDCase_pyru, histidine decarboxylase, pyruvoyl type.
This enzyme converts histadine to histamine in a single
step by catalyzing the release of CO2. This type is
synthesized as an inactive single chain precursor, then
cleaved into two chains. The Ser at the new N-terminus
at the cleavage site is converted to a pyruvoyl group
essential for activity. This type of histidine
decarboxylase appears is known so far only in some
Gram-positive bacteria, where it may play a role in
amino acid catabolism. There is also a pyridoxal
phosphate type histidine decarboxylase, as found in
human, where histamine is a biologically active amine
[Energy metabolism, Amino acids and amines].
Length = 310
Score = 28.4 bits (63), Expect = 5.0
Identities = 12/50 (24%), Positives = 24/50 (48%)
Query: 102 LRPYPISSKLTVPAYIELPDWALDGTPKVEPNSDLQHVVEIKTPDQIERM 151
+ P I + +T Y+ L A+ G + P+ D + + I P+ ++ M
Sbjct: 250 MEPGEIGNAITCAPYVSLAIDAIPGGSILTPDEDFEILENISLPEWLDDM 299
>gnl|CDD|188804 cd09420, LIM3_Prickle, The third LIM domain of Prickle. The
third LIM domain of Prickle: Prickle contains three
C-terminal LIM domains and a N-terminal PET domain.
Prickles have been implicated in roles of regulating
tissue polarity or planar cell polarity (PCP). PCP
establishment requires the conserved
Frizzled/Dishevelled PCP pathway. Prickle interacts
with Dishevelled, thereby modulating
Frizzled/Dishevelled activity and PCP signaling. Two
forms of prickles have been identified; namely prickle
1 and prickle 2. Prickle 1 and prickle 2 are
differentially expressed. While prickle 1 is expressed
in fetal heart and hematological malignancies, prickle
2 is found in fetal brain, adult cartilage, pancreatic
islet, and some types of timorous cells. LIM domains
are 50-60 amino acids in size and share two
characteristic zinc finger motifs. The two zinc fingers
contain eight conserved residues, mostly cysteines and
histidines, which coordinately bond to two zinc atoms.
LIM domains function as adaptors or scaffolds to
support the assembly of multimeric protein complexes.
Length = 59
Score = 25.9 bits (57), Expect = 5.5
Identities = 11/39 (28%), Positives = 17/39 (43%), Gaps = 6/39 (15%)
Query: 8 AETTSLSCVRCGKPAHLQCPKCMELKLPREGAAFCTQDC 46
A C +C K L P LP++G +C++ C
Sbjct: 26 ATEKCFCCAQCKKSL-LGRPF-----LPKQGQIYCSRAC 58
>gnl|CDD|233843 TIGR02390, RNA_pol_rpoA1, DNA-directed RNA polymerase subunit A'.
This family consists of the archaeal A' subunit of the
DNA-directed RNA polymerase. The example from
Methanocaldococcus jannaschii contains an intein.
Length = 867
Score = 28.5 bits (64), Expect = 5.7
Identities = 15/68 (22%), Positives = 22/68 (32%), Gaps = 26/68 (38%)
Query: 104 PYPISSKLTVPAYIELPDWALDGTPKVEPNSDLQHVVEIKTPDQIERMRETCRIAREVLD 163
P I+ +LTVP + TP I+ +RE +
Sbjct: 338 PEQIAKELTVPERV--------------------------TPWNIDELREYVLNGPDSWP 371
Query: 164 AAARMIRP 171
A +IRP
Sbjct: 372 GANYVIRP 379
>gnl|CDD|232831 TIGR00115, tig, trigger factor. Trigger factor is a
ribosome-associated molecular chaperone and is the first
chaperone to interact with nascent polypeptide. Trigger
factor can bind at the same time as the signal
recognition particle (SRP), but is excluded by the SRP
receptor (FtsY). The central domain of trigger factor
has peptidyl-prolyl cis/trans isomerase activity. This
protein is found in a single copy in virtually every
bacterial genome [Protein fate, Protein folding and
stabilization].
Length = 408
Score = 28.3 bits (64), Expect = 6.1
Identities = 8/22 (36%), Positives = 10/22 (45%)
Query: 219 DSRKLEDGDIVNIDVTVYYKGV 240
+ R E GD V ID + G
Sbjct: 143 ERRAAEKGDRVTIDFEGFIDGE 164
>gnl|CDD|233581 TIGR01810, betA, choline dehydrogenase. Choline dehydrogenase
catalyzes the conversion of exogenously supplied choline
into the intermediate glycine betaine aldehyde, as part
of a two-step oxidative reaction leading to the
formation of osmoprotectant betaine. This enzymatic
system can be found in both gram-positive and
gram-negative bacteria. As in Escherichia coli ,
Staphylococcus xylosus , and Sinorhizobium meliloti,
this enzyme is found associated in a transciptionally
co-induced gene cluster with betaine aldehyde
dehydrogenase, the second catalytic enzyme in this
reaction. Other gram-positive organisms have been shown
to employ a different enzymatic system, utlizing a
soluable choline oxidase or type III alcohol
dehydrogenase instead of choline dehydrogenase. This
enzyme is a member of the GMC oxidoreductase family
(pfam00732 and pfam05199), sharing a common
evoluntionary origin and enzymatic reaction with alcohol
dehydrogenase. Outgrouping from this model, Caulobacter
crescentus shares sequence homology with choline
dehydrogenase, yet other genes participating in this
enzymatic reaction have not currently been identified
[Cellular processes, Adaptations to atypical
conditions].
Length = 532
Score = 27.9 bits (62), Expect = 7.4
Identities = 17/57 (29%), Positives = 22/57 (38%), Gaps = 9/57 (15%)
Query: 142 IKTPDQIERMRETCRIAREVLDAAA------RMIRPGV---TTDEIDRVVHEATITA 189
+ + RE R+ RE+L A I PG T +EID V TA
Sbjct: 408 MSHEEDWREFREAIRVTREILKQKALDPYRGGEISPGPEVQTDEEIDEFVRRHGETA 464
>gnl|CDD|225971 COG3437, COG3437, Response regulator containing a CheY-like
receiver domain and an HD-GYP domain [Transcription /
Signal transduction mechanisms].
Length = 360
Score = 27.7 bits (62), Expect = 7.8
Identities = 14/54 (25%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 131 EPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHE 184
P+S L T ++ E M+ + E+L ++ R+++ EI R HE
Sbjct: 228 IPDSIL-LKPGKLTSEEFEIMKGHPILGAEILKSSERLMQVAA---EIARHHHE 277
>gnl|CDD|143439 cd07121, ALDH_EutE, Ethanolamine utilization protein EutE-like.
Coenzyme A acylating aldehyde dehydrogenase (ACDH), an
NAD+ and CoA-dependent acetaldehyde dehydrogenase,
acetylating (EC=1.2.1.10), converts acetaldehyde into
acetyl-CoA. This CD is limited to such monofunctional
enzymes as the Ethanolamine utilization protein, EutE,
in Salmonella typhimurium. Mutations in eutE abolish
the ability to utilize ethanolamine as a carbon source.
Length = 429
Score = 27.6 bits (62), Expect = 9.0
Identities = 13/50 (26%), Positives = 22/50 (44%), Gaps = 4/50 (8%)
Query: 239 GVHGDLNETYFVGNADEASRQLVQCTYECLEKAISIVKPGV--RFREIGE 286
GV +++ V A A +Q +CT EK I ++ + E+ E
Sbjct: 1 GVFATVDDA--VAAAKAAQKQYRKCTLADREKIIEAIREALLSNAEELAE 48
>gnl|CDD|187539 cd05228, AR_FR_like_1_SDR_e, uncharacterized subgroup of aldehyde
reductase and flavonoid reductase related proteins,
extended (e) SDRs. This subgroup contains proteins of
unknown function related to aldehyde reductase and
flavonoid reductase of the extended SDR-type. Aldehyde
reductase I (aka carbonyl reductase) is an NADP-binding
SDR; it has an NADP-binding motif consensus that is
slightly different from the canonical SDR form and lacks
the Asn of the extended SDR active site tetrad. Aldehyde
reductase I catalyzes the NADP-dependent reduction of
ethyl 4-chloro-3-oxobutanoate to ethyl
(R)-4-chloro-3-hydroxybutanoate. The related flavonoid
reductases act in the NADP-dependent reduction of
flavonoids, ketone-containing plant secondary
metabolites. Extended SDRs are distinct from classical
SDRs. In addition to the Rossmann fold (alpha/beta
folding pattern with a central beta-sheet) core region
typical of all SDRs, extended SDRs have a less conserved
C-terminal extension of approximately 100 amino acids.
Extended SDRs are a diverse collection of proteins, and
include isomerases, epimerases, oxidoreductases, and
lyases; they typically have a TGXXGXXG cofactor binding
motif. SDRs are a functionally diverse family of
oxidoreductases that have a single domain with a
structurally conserved Rossmann fold, an
NAD(P)(H)-binding region, and a structurally diverse
C-terminal region. Sequence identity between different
SDR enzymes is typically in the 15-30% range; they
catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. Atypical SDRs
generally lack the catalytic residues characteristic of
the SDRs, and their glycine-rich NAD(P)-binding motif is
often different from the forms normally seen in
classical or extended SDRs. Complex (multidomain) SDRs
such as ketoreductase domains of fatty acid synthase
have a GGXGXXG NAD(P)-binding motif and an altered
active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 318
Score = 27.6 bits (62), Expect = 9.3
Identities = 16/38 (42%), Positives = 18/38 (47%), Gaps = 8/38 (21%)
Query: 158 AREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSP 195
R VLDAA GV RVVH ++I A G P
Sbjct: 91 TRNVLDAALEA---GVR-----RVVHTSSIAALGGPPD 120
>gnl|CDD|224365 COG1448, TyrB, Aspartate/tyrosine/aromatic aminotransferase [Amino
acid transport and metabolism].
Length = 396
Score = 27.6 bits (62), Expect = 9.9
Identities = 14/40 (35%), Positives = 22/40 (55%), Gaps = 3/40 (7%)
Query: 144 TPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVH 183
+P+Q++R+RE I L A+ R+ G+ T ID V
Sbjct: 354 SPEQVDRLREEFGI---YLVASGRINVAGLNTSNIDYVAK 390
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.134 0.419
Gapped
Lambda K H
0.267 0.0731 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,372,177
Number of extensions: 1438605
Number of successful extensions: 1639
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1589
Number of HSP's successfully gapped: 76
Length of query: 305
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 208
Effective length of database: 6,635,264
Effective search space: 1380134912
Effective search space used: 1380134912
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.5 bits)