RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= 021917
         (305 letters)



>gnl|CDD|215607 PLN03158, PLN03158, methionine aminopeptidase; Provisional.
          Length = 396

 Score =  585 bits (1510), Expect = 0.0
 Identities = 241/296 (81%), Positives = 266/296 (89%), Gaps = 2/296 (0%)

Query: 4   GSDAAETTSLSCVRCGKPAHLQCPKCMELKLPREGAAFCTQDCFKASWTSHKSVHLKAKL 63
             +A  T+ L+C RC KPAHLQCPKC+ELKLPREGA+FC+QDCFKA+W+SHKSVH KAKL
Sbjct: 1   MEEALTTSPLACARCSKPAHLQCPKCLELKLPREGASFCSQDCFKAAWSSHKSVHTKAKL 60

Query: 64  SAPGTGENSSLVSEGWRYCIKKGQARTPKLPHFDWTGTLRPYPISSKLTVPAYIELPDWA 123
           S+   G+NS   +EGW YC+KKGQART KLP FDWTG LRPYPIS +  VP +I  PDWA
Sbjct: 61  SS--IGQNSDAPAEGWLYCLKKGQARTSKLPDFDWTGPLRPYPISPRRVVPDHIPKPDWA 118

Query: 124 LDGTPKVEPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVH 183
           LDGTPK+EPNSDLQH VEIKTP+QI+RMRETCRIAREVLDAAAR I+PGVTTDEIDRVVH
Sbjct: 119 LDGTPKIEPNSDLQHSVEIKTPEQIQRMRETCRIAREVLDAAARAIKPGVTTDEIDRVVH 178

Query: 184 EATITAGGYPSPLNYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGD 243
           EATI AGGYPSPLNYHFFPKSCCTSVNEVICHGIPD+RKLEDGDIVN+DVTVYYKG HGD
Sbjct: 179 EATIAAGGYPSPLNYHFFPKSCCTSVNEVICHGIPDARKLEDGDIVNVDVTVYYKGCHGD 238

Query: 244 LNETYFVGNADEASRQLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
           LNET+FVGN DEASRQLV+CTYECLEKAI+IVKPGVR+RE+GEVINRHATMSG SV
Sbjct: 239 LNETFFVGNVDEASRQLVKCTYECLEKAIAIVKPGVRYREVGEVINRHATMSGLSV 294


>gnl|CDD|238519 cd01086, MetAP1, Methionine Aminopeptidase 1. E.C. 3.4.11.18. Also
           known as methionyl aminopeptidase and Peptidase M.
           Catalyzes release of N-terminal amino acids,
           preferentially methionine, from peptides and arylamides.
          Length = 238

 Score =  255 bits (655), Expect = 2e-85
 Identities = 84/152 (55%), Positives = 103/152 (67%)

Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
           IE MRE  RI  EVLD  A+ I+PGVTT E+D++ HE     G YP+PL Y+ FPKS CT
Sbjct: 1   IEGMREAGRIVAEVLDELAKAIKPGVTTKELDQIAHEFIEEHGAYPAPLGYYGFPKSICT 60

Query: 208 SVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTYEC 267
           SVNEV+CHGIPD R L+DGDIVNIDV V   G HGD   T+ VG   E +++LV+ T E 
Sbjct: 61  SVNEVVCHGIPDDRVLKDGDIVNIDVGVELDGYHGDSARTFIVGEVSEEAKKLVEVTEEA 120

Query: 268 LEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
           L K I  VKPG R  +IG  I ++A  +G+SV
Sbjct: 121 LYKGIEAVKPGNRIGDIGHAIEKYAEKNGYSV 152


>gnl|CDD|235576 PRK05716, PRK05716, methionine aminopeptidase; Validated.
          Length = 252

 Score =  241 bits (619), Expect = 8e-80
 Identities = 81/161 (50%), Positives = 104/161 (64%)

Query: 139 VVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNY 198
            + IKTP++IE+MR   R+A EVLD     ++PGVTT E+DR+  E     G  P+PL Y
Sbjct: 2   AITIKTPEEIEKMRVAGRLAAEVLDEIEPHVKPGVTTKELDRIAEEYIRDQGAIPAPLGY 61

Query: 199 HFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASR 258
           H FPKS CTSVNEV+CHGIP  + L++GDIVNIDVTV   G HGD + T+ VG      +
Sbjct: 62  HGFPKSICTSVNEVVCHGIPSDKVLKEGDIVNIDVTVIKDGYHGDTSRTFGVGEISPEDK 121

Query: 259 QLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
           +L + T E L   I+ VKPG R  +IG  I ++A   GFSV
Sbjct: 122 RLCEVTKEALYLGIAAVKPGARLGDIGHAIQKYAEAEGFSV 162


>gnl|CDD|223103 COG0024, Map, Methionine aminopeptidase [Translation, ribosomal
           structure and biogenesis].
          Length = 255

 Score =  220 bits (563), Expect = 2e-71
 Identities = 74/169 (43%), Positives = 103/169 (60%), Gaps = 3/169 (1%)

Query: 139 VVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNY 198
           ++ IKTP++IE+MRE  +IA + L   A +++PGVTT E+D +  E     G YP+ L Y
Sbjct: 2   MISIKTPEEIEKMREAGKIAAKALKEVASLVKPGVTTLELDEIAEEFIREKGAYPAFLGY 61

Query: 199 HFFPKSCCTSVNEVICHGIP-DSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGN-ADEA 256
             FP   C SVNEV+ HGIP D + L++GDIV IDV  +  G  GD   T+ VG  +DE 
Sbjct: 62  KGFPFPTCISVNEVVAHGIPGDKKVLKEGDIVKIDVGAHIDGYIGDTAITFVVGEVSDED 121

Query: 257 SRQLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSVEIKQLV 305
           +++L++ T E L   I  VKPG R  +IG  I  +A   GFSV ++ L 
Sbjct: 122 AKRLLEATKEALYAGIEAVKPGARLGDIGRAIQEYAESRGFSV-VRNLT 169


>gnl|CDD|237252 PRK12896, PRK12896, methionine aminopeptidase; Reviewed.
          Length = 255

 Score =  207 bits (529), Expect = 4e-66
 Identities = 66/160 (41%), Positives = 91/160 (56%)

Query: 140 VEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYH 199
           +EIK+P ++E+MR+  RI    L    + + PG+TT E+DR+  +     G  PSP  Y+
Sbjct: 8   MEIKSPRELEKMRKIGRIVATALKEMGKAVEPGMTTKELDRIAEKRLEEHGAIPSPEGYY 67

Query: 200 FFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQ 259
            FP S C SVNE + HGIP  R ++DGD+VNIDV+ Y  G HGD   T+ VG   E + +
Sbjct: 68  GFPGSTCISVNEEVAHGIPGPRVIKDGDLVNIDVSAYLDGYHGDTGITFAVGPVSEEAEK 127

Query: 260 LVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
           L +   E L   I  VK G    +IG  I   A  +G+SV
Sbjct: 128 LCRVAEEALWAGIKQVKAGRPLNDIGRAIEDFAKKNGYSV 167


>gnl|CDD|129591 TIGR00500, met_pdase_I, methionine aminopeptidase, type I.
           Methionine aminopeptidase is a cobalt-binding enzyme.
           Bacterial and organellar examples (type I) differ from
           eukaroytic and archaeal (type II) examples in lacking a
           region of approximately 60 amino acids between the 4th
           and 5th cobalt-binding ligands. This model describes
           type I. The role of this protein in general is to
           produce the mature form of cytosolic proteins by
           removing the N-terminal methionine [Protein fate,
           Protein modification and repair].
          Length = 247

 Score =  205 bits (523), Expect = 2e-65
 Identities = 77/160 (48%), Positives = 105/160 (65%)

Query: 140 VEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYH 199
           + +K+PD+IE++R+  R+A EVL+   R ++PGV+T E+DR+  +     G  P+ L Y+
Sbjct: 1   ISLKSPDEIEKIRKAGRLAAEVLEELEREVKPGVSTKELDRIAKDFIEKHGAKPAFLGYY 60

Query: 200 FFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQ 259
            FP S C SVNEV+ HGIPD + L+DGDIVNIDV V Y G HGD  +T+ VG     + +
Sbjct: 61  GFPGSVCISVNEVVIHGIPDKKVLKDGDIVNIDVGVIYDGYHGDTAKTFLVGKISPEAEK 120

Query: 260 LVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
           L++CT E L KAI   KPG R  EIG  I ++A   GFSV
Sbjct: 121 LLECTEESLYKAIEEAKPGNRIGEIGAAIQKYAEAKGFSV 160


>gnl|CDD|183434 PRK12318, PRK12318, methionine aminopeptidase; Provisional.
          Length = 291

 Score =  166 bits (421), Expect = 1e-49
 Identities = 76/171 (44%), Positives = 108/171 (63%), Gaps = 3/171 (1%)

Query: 137 QHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPL 196
           Q+ + IKTP+QIE++R+ C++   +LDA     + GVTT+E+D +  E        P+PL
Sbjct: 38  QYDIIIKTPEQIEKIRKACQVTARILDALCEAAKEGVTTNELDELSRELHKEYNAIPAPL 97

Query: 197 NYHF--FPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNAD 254
           NY    FPK+ CTS+NEVICHGIP+   L++GDI+NIDV+    G +GD +    +G   
Sbjct: 98  NYGSPPFPKTICTSLNEVICHGIPNDIPLKNGDIMNIDVSCIVDGYYGDCSRMVMIGEVS 157

Query: 255 EASRQLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSVEIKQLV 305
           E  +++ Q + ECL  AI+I+KPG+   EIGEVI   A   GFSV + Q V
Sbjct: 158 EIKKKVCQASLECLNAAIAILKPGIPLYEIGEVIENCADKYGFSV-VDQFV 207


>gnl|CDD|215993 pfam00557, Peptidase_M24, Metallopeptidase family M24.  This family
           contains metallopeptidases. It also contains
           non-peptidase homologues such as the N terminal domain
           of Spt16 which is a histone H3-H4 binding module.
          Length = 208

 Score =  155 bits (394), Expect = 1e-46
 Identities = 53/150 (35%), Positives = 70/150 (46%), Gaps = 4/150 (2%)

Query: 149 ERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCTS 208
           E MR+  RIA   L+AA   IRPGVT  E+   +  A +  GG         FP    + 
Sbjct: 1   ELMRKAARIAAAALEAALAAIRPGVTERELAAELEAAFLARGGA----RGPAFPPIVASG 56

Query: 209 VNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTYECL 268
            N  + H IP  R L+DGD+V IDV   Y G H D+  T+ VG      R+L +   E  
Sbjct: 57  PNAAVPHYIPSDRVLKDGDLVLIDVGAEYDGYHSDITRTFVVGKPTPEQRELYEAVLEAQ 116

Query: 269 EKAISIVKPGVRFREIGEVINRHATMSGFS 298
           E AI+ VKPGV   ++           G+ 
Sbjct: 117 EAAIAAVKPGVTGGDVDAAAREVLEEGGYG 146


>gnl|CDD|238514 cd01066, APP_MetAP, A family including aminopeptidase P,
           aminopeptidase M, and prolidase. Also known as
           metallopeptidase family M24. This family of enzymes is
           able to cleave amido-, imido- and amidino-containing
           bonds. Members exibit relatively narrow substrate
           specificity compared to other metallo-aminopeptidases,
           suggesting they play roles in regulation of biological
           processes rather than general protein degradation.
          Length = 207

 Score =  127 bits (321), Expect = 1e-35
 Identities = 46/145 (31%), Positives = 70/145 (48%), Gaps = 10/145 (6%)

Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
           I R+R+   IA   + AAA  IRPGVT  E+   + +A   AGGYP+             
Sbjct: 1   IARLRKAAEIAEAAMAAAAEAIRPGVTEAEVAAAIEQALRAAGGYPAG--------PTIV 52

Query: 208 SVNEV--ICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTY 265
                  + H  PD R+L++GD+V +D+   Y G H DL  T+ +G   +  R+L +   
Sbjct: 53  GSGARTALPHYRPDDRRLQEGDLVLVDLGGVYDGYHADLTRTFVIGEPSDEQRELYEAVR 112

Query: 266 ECLEKAISIVKPGVRFREIGEVINR 290
           E  E A++ ++PGV   E+      
Sbjct: 113 EAQEAALAALRPGVTAEEVDAAARE 137


>gnl|CDD|223085 COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and
           metabolism].
          Length = 384

 Score =  114 bits (288), Expect = 3e-29
 Identities = 48/144 (33%), Positives = 71/144 (49%), Gaps = 7/144 (4%)

Query: 142 IKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGG-YPSPLNYHF 200
           IK+P +I ++R+   IA   L+AA   IRPG+T  EI   +  A    G   PS     F
Sbjct: 154 IKSPAEIAKIRKAAEIADAALEAALEAIRPGMTEAEIAAELEYALRKGGAEGPS-----F 208

Query: 201 FPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQL 260
                 +  N  + H  P  RKL DGD+V ID+   Y G   D+  T+ +G   +  R++
Sbjct: 209 DTI-VASGENAALPHYTPSDRKLRDGDLVLIDLGGVYNGYCSDITRTFPIGKPSDEQREI 267

Query: 261 VQCTYECLEKAISIVKPGVRFREI 284
            +   E  E AI+ ++PGV   E+
Sbjct: 268 YEAVLEAQEAAIAAIRPGVTGGEV 291



 Score = 40.9 bits (96), Expect = 6e-04
 Identities = 31/117 (26%), Positives = 43/117 (36%), Gaps = 15/117 (12%)

Query: 133 NSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGY 192
            SD+     I  P   +  RE      E  +AA   IRPGVT  E+D    +  +   GY
Sbjct: 248 CSDITRTFPIGKPS--DEQREIYEAVLEAQEAAIAAIRPGVTGGEVDAAARQ-VLEKAGY 304

Query: 193 PSPLNYHFFPKSCCTSVNEVIC-HGIP------DSRKLEDGDIVNIDVTVYYKGVHG 242
                  +F       V  V+  H  P          LE G + +I+  +Y  G  G
Sbjct: 305 G-----LYFLHGTGHGVGFVLDVHEHPQYLSPGSDTTLEPGMVFSIEPGIYIPGGGG 356


>gnl|CDD|171806 PRK12897, PRK12897, methionine aminopeptidase; Reviewed.
          Length = 248

 Score = 96.6 bits (240), Expect = 1e-23
 Identities = 51/161 (31%), Positives = 78/161 (48%)

Query: 139 VVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNY 198
           ++ IKT ++I+ M E+ ++        A++++PG+TT EI+  V       G       Y
Sbjct: 1   MITIKTKNEIDLMHESGKLLASCHREIAKIMKPGITTKEINTFVEAYLEKHGATSEQKGY 60

Query: 199 HFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASR 258
           + +P + C SVN+ +CH  P    L +GDIV ID+ V   G   D   TY VG   + + 
Sbjct: 61  NGYPYAICASVNDEMCHAFPADVPLTEGDIVTIDMVVNLNGGLSDSAWTYRVGKVSDEAE 120

Query: 259 QLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSV 299
           +L+      L K I     G R  +IG  I  +    GFSV
Sbjct: 121 KLLLVAENALYKGIDQAVIGNRVGDIGYAIESYVANEGFSV 161


>gnl|CDD|181534 PRK08671, PRK08671, methionine aminopeptidase; Provisional.
          Length = 291

 Score = 96.4 bits (241), Expect = 3e-23
 Identities = 51/153 (33%), Positives = 73/153 (47%), Gaps = 14/153 (9%)

Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
           +E+  E  +IA +V + AA++I+PG    ++   V       G  P+      FP  C  
Sbjct: 2   LEKYLEAGKIASKVREEAAKLIKPGAKLLDVAEFVENRIRELGAKPA------FP--CNI 53

Query: 208 SVNEVICHGIP---DSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCT 264
           S+NEV  H  P   D R   +GD+V +D+  +  G   D   T  +G   E    LV+ +
Sbjct: 54  SINEVAAHYTPSPGDERVFPEGDVVKLDLGAHVDGYIADTAVTVDLGGKYE---DLVEAS 110

Query: 265 YECLEKAISIVKPGVRFREIGEVINRHATMSGF 297
            E LE AI +V+PGV   EIG VI       GF
Sbjct: 111 EEALEAAIEVVRPGVSVGEIGRVIEETIRSYGF 143


>gnl|CDD|238525 cd01092, APP-like, Similar to Prolidase and Aminopeptidase P. The
           members of this subfamily presumably catalyse hydrolysis
           of Xaa-Pro dipeptides and/or release of any N-terminal
           amino acid, including proline, that is linked with
           proline.
          Length = 208

 Score = 92.2 bits (230), Expect = 2e-22
 Identities = 42/138 (30%), Positives = 65/138 (47%), Gaps = 7/138 (5%)

Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGY-PSPLNYHFFPKSCC 206
           IE +R+  RIA +  +     I+PG+T  E+   +       G   PS      F     
Sbjct: 1   IELLRKAARIADKAFEELLEFIKPGMTEREVAAELEYFMRKLGAEGPS------FDTIVA 54

Query: 207 TSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTYE 266
           +  N  + HG+P  RK+E+GD+V ID    Y G   D+  T  VG   +  +++ +   E
Sbjct: 55  SGPNSALPHGVPSDRKIEEGDLVLIDFGAIYDGYCSDITRTVAVGEPSDELKEIYEIVLE 114

Query: 267 CLEKAISIVKPGVRFREI 284
             + AI  VKPGV  +E+
Sbjct: 115 AQQAAIKAVKPGVTAKEV 132



 Score = 35.2 bits (82), Expect = 0.024
 Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 3/59 (5%)

Query: 134 SDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGY 192
           SD+   V +  P   + ++E   I  E   AA + ++PGVT  E+D+   +  I   GY
Sbjct: 90  SDITRTVAVGEPS--DELKEIYEIVLEAQQAAIKAVKPGVTAKEVDKAARD-VIEEAGY 145


>gnl|CDD|238521 cd01088, MetAP2, Methionine Aminopeptidase 2. E.C. 3.4.11.18. Also
           known as methionyl aminopeptidase and peptidase M.
           Catalyzes release of N-terminal amino acids,
           preferentially methionine, from peptides and arylamides.
          Length = 291

 Score = 79.2 bits (196), Expect = 4e-17
 Identities = 47/148 (31%), Positives = 66/148 (44%), Gaps = 18/148 (12%)

Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
           +E+ RE   I R+V   A  +I+PG+T  EI   V       G  P+      FP +   
Sbjct: 1   LEKYREAGEIHRQVRKYAQSLIKPGMTLLEIAEFVENRIRELGAGPA------FPVNL-- 52

Query: 208 SVNEVICHGIP---DSRKLEDGDIVNIDVTVYYKGVH--GDLNETYFVGNADEASRQLVQ 262
           S+NE   H  P   D   L++GD+V +D      G H  G + ++ F  + D     L++
Sbjct: 53  SINECAAHYTPNAGDDTVLKEGDVVKLDF-----GAHVDGYIADSAFTVDFDPKYDDLLE 107

Query: 263 CTYECLEKAISIVKPGVRFREIGEVINR 290
              E L  AI    P VR  EIGE I  
Sbjct: 108 AAKEALNAAIKEAGPDVRLGEIGEAIEE 135



 Score = 31.5 bits (72), Expect = 0.46
 Identities = 24/85 (28%), Positives = 35/85 (41%), Gaps = 21/85 (24%)

Query: 158 AREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSP---LNYHFFPKSCCTSVNEVIC 214
           A+E L+AA +   P V   EI   + E  I + G+  P   L  H        S+     
Sbjct: 109 AKEALNAAIKEAGPDVRLGEIGEAIEE-VIESYGF-KPIRNLTGH--------SIERYRL 158

Query: 215 HG---IP-----DSRKLEDGDIVNI 231
           H    IP     +  +LE+GD+  I
Sbjct: 159 HAGKSIPNVKGGEGTRLEEGDVYAI 183


>gnl|CDD|129592 TIGR00501, met_pdase_II, methionine aminopeptidase, type II.
           Methionine aminopeptidase (map) is a cobalt-binding
           enzyme. Bacterial and organellar examples (type I)
           differ from eukaroytic and archaeal (type II) examples
           in lacking a region of approximately 60 amino acids
           between the 4th and 5th cobalt-binding ligands. The role
           of this protein in general is to produce the mature
           amino end of cytosolic proteins by removing the
           N-terminal methionine. This model describes type II,
           among which the eukaryotic members typically have an
           N-terminal extension not present in archaeal members. It
           can act cotranslationally. The enzyme from rat has been
           shown to associate with translation initiation factor 2
           (IF-2) and may have a role in translational regulation
           [Protein fate, Protein modification and repair].
          Length = 295

 Score = 77.5 bits (191), Expect = 2e-16
 Identities = 44/153 (28%), Positives = 64/153 (41%), Gaps = 14/153 (9%)

Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
            E+  E  +I  +V   AA  I PGV   E+   V       G  P+      FP  C  
Sbjct: 5   AEKWIEAGKIHSKVRREAADRIVPGVKLLEVAEFVENRIRELGAEPA------FP--CNI 56

Query: 208 SVNEVICHGIP---DSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCT 264
           S+NE   H  P   D    +DGD+V +D+  +   V G + +T    +  +    LV+  
Sbjct: 57  SINECAAHFTPKAGDKTVFKDGDVVKLDLGAH---VDGYIADTAITVDLGDQYDNLVKAA 113

Query: 265 YECLEKAISIVKPGVRFREIGEVINRHATMSGF 297
            + L  AI  ++ GVR  EIG+ I       G 
Sbjct: 114 KDALYTAIKEIRAGVRVGEIGKAIQEVIESYGV 146



 Score = 32.4 bits (74), Expect = 0.26
 Identities = 26/83 (31%), Positives = 37/83 (44%), Gaps = 15/83 (18%)

Query: 158 AREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCTSVNEVICHG- 216
           A++ L  A + IR GV   EI + + E  I + G     N          S+     HG 
Sbjct: 113 AKDALYTAIKEIRAGVRVGEIGKAIQE-VIESYGVKPISNL------TGHSMAPYRLHGG 165

Query: 217 --IP-----DSRKLEDGDIVNID 232
             IP     D+ KLE+GD+V I+
Sbjct: 166 KSIPNVKERDTTKLEEGDVVAIE 188


>gnl|CDD|182080 PRK09795, PRK09795, aminopeptidase; Provisional.
          Length = 361

 Score = 62.3 bits (151), Expect = 6e-11
 Identities = 41/175 (23%), Positives = 72/175 (41%), Gaps = 34/175 (19%)

Query: 141 EIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEI---------DRVVHEA---TIT 188
           +IKTP+++E++R  C IA    +   R I+ G++  EI          +   +A   TI 
Sbjct: 126 QIKTPEEVEKIRLACGIADRGAEHIRRFIQAGMSEREIAAELEWFMRQQGAEKASFDTIV 185

Query: 189 AGGYPSPLNYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETY 248
           A G+   L                  HG    + +  G+ V +D    Y+G   D+  T 
Sbjct: 186 ASGWRGALP-----------------HGKASDKIVAAGEFVTLDFGALYQGYCSDMTRTL 228

Query: 249 FVGNADEASRQ-----LVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFS 298
            V     ++       + Q   +    AIS ++PGVR +++ +   R  T +G+ 
Sbjct: 229 LVNGEGVSAESHPLFNVYQIVLQAQLAAISAIRPGVRCQQVDDAARRVITEAGYG 283


>gnl|CDD|182804 PRK10879, PRK10879, proline aminopeptidase P II; Provisional.
          Length = 438

 Score = 60.9 bits (148), Expect = 2e-10
 Identities = 43/151 (28%), Positives = 68/151 (45%), Gaps = 8/151 (5%)

Query: 142 IKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEID-RVVHEATITAGGYPSPLNYHF 200
            K+P++I  +R    I+      A    RPG+   +++  + HE       YPS   Y+ 
Sbjct: 173 FKSPEEIAVLRRAGEISALAHTRAMEKCRPGMFEYQLEGEIHHEFNRHGARYPS---YNT 229

Query: 201 FPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFV-GNADEASRQ 259
              S     N  I H   +  ++ DGD+V ID    YKG  GD+  T+ V G    A R+
Sbjct: 230 IVGS---GENGCILHYTENESEMRDGDLVLIDAGCEYKGYAGDITRTFPVNGKFTPAQRE 286

Query: 260 LVQCTYECLEKAISIVKPGVRFREIGEVINR 290
           +     E LE ++ + +PG   RE+   + R
Sbjct: 287 IYDIVLESLETSLRLYRPGTSIREVTGEVVR 317


>gnl|CDD|173039 PRK14575, PRK14575, putative peptidase; Provisional.
          Length = 406

 Score = 59.7 bits (144), Expect = 5e-10
 Identities = 40/155 (25%), Positives = 72/155 (46%), Gaps = 6/155 (3%)

Query: 130 VEPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITA 189
           V+ +S    +  IK+P +I+R+R++  I    +  A+++IR G T+ E+      A +  
Sbjct: 166 VDSSSIFNELRVIKSPWEIKRLRKSAEITEYGITEASKLIRVGCTSAELTAAYKAAVM-- 223

Query: 190 GGYPSPLNYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYF 249
               S    HF      +   +     IP + K   GD++  D  V   G   D+  T+ 
Sbjct: 224 ----SKSETHFSRFHLISVGADFSPKLIPSNTKACSGDLIKFDCGVDVDGYGADIARTFV 279

Query: 250 VGNADEASRQLVQCTYECLEKAISIVKPGVRFREI 284
           VG   E +R++ Q      E  +S+V PGV+ +++
Sbjct: 280 VGEPPEITRKIYQTIRTGHEHMLSMVAPGVKMKDV 314


>gnl|CDD|185095 PRK15173, PRK15173, peptidase; Provisional.
          Length = 323

 Score = 59.0 bits (142), Expect = 5e-10
 Identities = 40/155 (25%), Positives = 72/155 (46%), Gaps = 6/155 (3%)

Query: 130 VEPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITA 189
           V+ +S    +  IK+P +I+R+R++  I    +  A+++IR G T+ E+      A +  
Sbjct: 83  VDSSSIFNELRVIKSPWEIKRLRKSAEITEYGITEASKLIRVGCTSAELTAAYKAAVM-- 140

Query: 190 GGYPSPLNYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYF 249
               S    HF      +   +     IP + K   GD++  D  V   G   D+  T+ 
Sbjct: 141 ----SKSETHFSRFHLISVGADFSPKLIPSNTKACSGDLIKFDCGVDVDGYGADIARTFV 196

Query: 250 VGNADEASRQLVQCTYECLEKAISIVKPGVRFREI 284
           VG   E +R++ Q      E  +S+V PGV+ +++
Sbjct: 197 VGEPPEITRKIYQTIRTGHEHMLSMVAPGVKMKDV 231


>gnl|CDD|173040 PRK14576, PRK14576, putative endopeptidase; Provisional.
          Length = 405

 Score = 58.1 bits (140), Expect = 2e-09
 Identities = 40/166 (24%), Positives = 74/166 (44%), Gaps = 20/166 (12%)

Query: 126 GTPKVEPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEA 185
           G   V+  +    +  IK+P +IE +R++  I    + +AA+ IR G T  E+      A
Sbjct: 161 GLKLVDSTALFNEIRMIKSPWEIEHLRKSAEITEYGIASAAKKIRVGCTAAELTAAFKAA 220

Query: 186 TITAGGYPSPLNYHFFPKSCCTSVNEVICHG-------IPDSRKLEDGDIVNIDVTVYYK 238
            ++            FP++  +  N +I  G       I D+   + GD++  D  +   
Sbjct: 221 VMS------------FPETNFSRFN-LISVGDNFSPKIIADTTPAKVGDLIKFDCGIDVA 267

Query: 239 GVHGDLNETYFVGNADEASRQLVQCTYECLEKAISIVKPGVRFREI 284
           G   DL  T+ +G  D+ ++Q+        E  +S+V PGV+ + +
Sbjct: 268 GYGADLARTFVLGEPDKLTQQIYDTIRTGHEHMLSMVAPGVKLKAV 313


>gnl|CDD|180918 PRK07281, PRK07281, methionine aminopeptidase; Reviewed.
          Length = 286

 Score = 56.8 bits (137), Expect = 3e-09
 Identities = 41/198 (20%), Positives = 74/198 (37%), Gaps = 32/198 (16%)

Query: 139 VVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPL-- 196
           ++ +K+  +IE M         +      +I+PGV   E++  V          P  +  
Sbjct: 1   MITLKSAREIEAMDRAGDFLASIHIGLRDLIKPGVDMWEVEEYVRRRCKEENVLPLQIGV 60

Query: 197 --NYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDV--------------------- 233
                 +P + C  +N+ + H  P    L++GD++ +D+                     
Sbjct: 61  DGAMMDYPYATCCGLNDEVAHAFPRHYILKEGDLLKVDMVLSEPLDKSIVDVSKLNFDNV 120

Query: 234 ------TVYYKGVHGDLNETYFVGNADEASRQLVQCTYECLEKAISIVKPGVRFREIGEV 287
                 T  Y+G   D    Y VG   +  + L+  T E + + I     G R  +IG  
Sbjct: 121 EQMKKYTESYRGGLADSCWAYAVGTPSDEVKNLMDVTKEAMYRGIEQAVVGNRIGDIGAA 180

Query: 288 INRHATMSGFSVEIKQLV 305
           I  +A   G+ V ++ LV
Sbjct: 181 IQEYAESRGYGV-VRDLV 197


>gnl|CDD|238520 cd01087, Prolidase, Prolidase. E.C. 3.4.13.9. Also known as Xaa-Pro
           dipeptidase, X-Pro dipeptidase, proline dipeptidase.,
           imidodipeptidase, peptidase D, gamma-peptidase.
           Catalyses hydrolysis of Xaa-Pro dipeptides; also acts on
           aminoacyl-hydroxyproline analogs. No action on Pro-Pro.
          Length = 243

 Score = 55.3 bits (134), Expect = 6e-09
 Identities = 41/138 (29%), Positives = 64/138 (46%), Gaps = 7/138 (5%)

Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCT 207
           IE MR+ C I+ E   AA +  RPG++  E++    E    + G     +Y         
Sbjct: 1   IELMRKACDISAEAHRAAMKASRPGMSEYELEAEF-EYEFRSRGARLAYSYI-----VAA 54

Query: 208 SVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFV-GNADEASRQLVQCTYE 266
             N  I H + + + L+DGD+V ID    Y G   D+  T+ V G   +  R+L +    
Sbjct: 55  GSNAAILHYVHNDQPLKDGDLVLIDAGAEYGGYASDITRTFPVNGKFTDEQRELYEAVLA 114

Query: 267 CLEKAISIVKPGVRFREI 284
             + AI+  KPGV + +I
Sbjct: 115 AQKAAIAACKPGVSYEDI 132


>gnl|CDD|129586 TIGR00495, crvDNA_42K, 42K curved DNA binding protein.  Proteins
           identified by This model have been identified in a
           number of species as a nuclear (but not nucleolar)
           protein with a cell cycle dependence. Various names
           given to members of this family have included cell cycle
           protein p38-2G4, DNA-binding protein GBP16, and
           proliferation-associated protein 1. This protein is
           closely related to methionine aminopeptidase, a
           cobolt-binding protein [Unknown function, General].
          Length = 389

 Score = 50.7 bits (121), Expect = 4e-07
 Identities = 31/105 (29%), Positives = 47/105 (44%), Gaps = 11/105 (10%)

Query: 201 FPKSCCTSVNEVICHGIP----DSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEA 256
           FP   C SVN  + H  P        L++GD+V ID+  +  G    +  T+ VG A E 
Sbjct: 77  FPT--CISVNNCVGHFSPLKSDQDYILKEGDVVKIDLGCHIDGFIALVAHTFVVGVAQEE 134

Query: 257 S-----RQLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSG 296
                   ++   +   E A+ +VKPG    ++ E IN+ A   G
Sbjct: 135 PVTGRKADVIAAAHLAAEAALRLVKPGNTNTQVTEAINKVAHSYG 179



 Score = 27.9 bits (62), Expect = 7.5
 Identities = 11/30 (36%), Positives = 18/30 (60%), Gaps = 4/30 (13%)

Query: 158 AREVLDAAARMIRPGVT----TDEIDRVVH 183
           A    +AA R+++PG T    T+ I++V H
Sbjct: 147 AHLAAEAALRLVKPGNTNTQVTEAINKVAH 176


>gnl|CDD|178689 PLN03144, PLN03144, Carbon catabolite repressor protein 4 homolog;
           Provisional.
          Length = 606

 Score = 49.0 bits (117), Expect = 1e-06
 Identities = 20/59 (33%), Positives = 30/59 (50%), Gaps = 3/59 (5%)

Query: 15  CVRCGKPAHLQCPKCMELKLPREGAAFCTQDCFKASWTSHKSVHLKAKLSAPGTGENSS 73
            V   +PA LQC  C++ KLP   +  C+  CF  +W  H+ +H +A   A    EN +
Sbjct: 66  SVHPSEPATLQCVGCVKAKLPVSKSYHCSPKCFSDAWRHHRVLHERA---ASAVRENGN 121


>gnl|CDD|238522 cd01089, PA2G4-like, Related to aminopepdidase M, this family
           contains proliferation-associated protein 2G4. Family
           members have been implicated in cell cycle control.
          Length = 228

 Score = 46.2 bits (110), Expect = 5e-06
 Identities = 40/166 (24%), Positives = 67/166 (40%), Gaps = 30/166 (18%)

Query: 148 IERMRETCRIAREVLDAAARMIRPGVTTDEI----DRVVHEATITAGGYPSPLNYH---F 200
           + + +   +IA +VL     +  PG    ++    D+++ E       Y           
Sbjct: 1   VTKYKTAGQIANKVLKQVISLCVPGAKVVDLCEKGDKLILEEL--GKVYKKEKKLEKGIA 58

Query: 201 FPKSCCTSVNEVICHGIP----DSRKLEDGDIVNIDVTVYYKGVHGD-----LNETYFVG 251
           FP   C SVN  +CH  P     +  L+DGD+V ID+     G H D     +  T  VG
Sbjct: 59  FPT--CISVNNCVCHFSPLKSDATYTLKDGDVVKIDL-----GCHIDGYIAVVAHTIVVG 111

Query: 252 NADEASR-----QLVQCTYECLEKAISIVKPGVRFREIGEVINRHA 292
              E         ++   +  LE A+ +++PG +  +I E I +  
Sbjct: 112 AEAETPVTGKKADVIAAAHYALEAALRLLRPGNQNSDITEAIQKVI 157


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 41.2 bits (97), Expect = 5e-04
 Identities = 41/165 (24%), Positives = 63/165 (38%), Gaps = 18/165 (10%)

Query: 146 DQIERMRETCRIAREVLDAAARMIRPGVT----TDEIDRVVHEATITAGGYPSPLNYHFF 201
           +Q + +R    + R+V   A  +I+PGV      + I+    E  I A G         F
Sbjct: 156 EQYQDLRRAAEVHRQVRRYAQSVIKPGVKLIDICERIESKSRE-LIEADGLKCGWA---F 211

Query: 202 PKSCCTSVNEVICHGIP---DSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASR 258
           P  C  S+N    H  P   D   L   D+  +D   +   V+G + +  F    +    
Sbjct: 212 PTGC--SLNHCAAHYTPNTGDKTVLTYDDVCKLDFGTH---VNGRIIDCAFTVAFNPKYD 266

Query: 259 QLVQCTYECLEKAISIVKPGVRFREIGEVINRHATMSGFSVEIKQ 303
            L+Q T +     I      VR  +IG  I     +  + VEIK 
Sbjct: 267 PLLQATKDATNTGIKEAGIDVRLSDIGAAIQE--VIESYEVEIKG 309


>gnl|CDD|238523 cd01090, Creatinase, Creatine amidinohydrolase. E.C.3.5.3.3.
           Hydrolyzes creatine to sarcosine and urea.
          Length = 228

 Score = 39.4 bits (92), Expect = 0.001
 Identities = 18/75 (24%), Positives = 38/75 (50%)

Query: 215 HGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTYECLEKAISI 274
           H    +RK++ GDI++++      G +  L  T F+    +A  ++ +      E+ + +
Sbjct: 69  HNPVTNRKVQRGDILSLNCFPMIAGYYTALERTLFLDEVSDAHLKIWEANVAVHERGLEL 128

Query: 275 VKPGVRFREIGEVIN 289
           +KPG R ++I   +N
Sbjct: 129 IKPGARCKDIAAELN 143


>gnl|CDD|201954 pfam01753, zf-MYND, MYND finger. 
          Length = 39

 Score = 33.5 bits (77), Expect = 0.007
 Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 9/46 (19%)

Query: 15 CVRCGKPAH--LQCPKCMELKLPREGAAFCTQDCFKASWTSHKSVH 58
          C  CGK A   L+C +C           +C+++C KA W  HK   
Sbjct: 1  CAVCGKEALKLLRCSRCK-------SVYYCSKECQKADWPYHKKEC 39


>gnl|CDD|235000 PRK02106, PRK02106, choline dehydrogenase; Validated.
          Length = 560

 Score = 36.0 bits (84), Expect = 0.023
 Identities = 25/73 (34%), Positives = 29/73 (39%), Gaps = 21/73 (28%)

Query: 144 TPDQIERMRETCRIAREVLDAAA------RMIRPG--VTTD-EIDRVVHEATITAGGYPS 194
           T       R+  R+ RE++   A      R I PG  V TD EID  V E   TA     
Sbjct: 417 TEQDWREFRDAIRLTREIMAQPALDPYRGREISPGADVQTDEEIDAFVREHAETA----- 471

Query: 195 PLNYHFFPKSCCT 207
              YH  P   CT
Sbjct: 472 ---YH--P--SCT 477


>gnl|CDD|211774 TIGR02993, ectoine_eutD, ectoine utilization protein EutD.  Members
           of this family are putative peptidases or hydrolases
           similar to Xaa-Pro aminopeptidase (pfam00557). They
           belong to ectoine utilization operons, as found in
           Sinorhizobium meliloti 1021 (where it is known to be
           induced by ectoine), Mesorhizobium loti, Silicibacter
           pomeroyi, Agrobacterium tumefaciens, and Pseudomonas
           putida. The exact function is unknown.
          Length = 391

 Score = 34.9 bits (80), Expect = 0.040
 Identities = 34/144 (23%), Positives = 59/144 (40%), Gaps = 15/144 (10%)

Query: 142 IKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATIT-----AGGYPS-- 194
           +K+P +I  MR   RI   +       I PG+   ++   +++A I       G YP+  
Sbjct: 158 VKSPTEIAYMRIAARIVENMHQRIFERIEPGMRKCDLVAEIYDAGIRGADGFGGDYPAIV 217

Query: 195 PLNYHFFPKSCCTSVNEVICHGIPDSRKLEDGDIVNIDVTVYYKGVHGDLNETYFVGNAD 254
           PL     P    +  +    H   D   ++ G+    ++   YK  H  L+ T F+G   
Sbjct: 218 PL----LP----SGADASAPHLTWDDSPMKKGEGTFFEIAGCYKRYHCPLSRTVFLGKPT 269

Query: 255 EASRQLVQCTYECLEKAISIVKPG 278
           +A     +   E +E  ++  KPG
Sbjct: 270 QAFLDAEKAVLEGMEAGLAAAKPG 293


>gnl|CDD|224842 COG1931, COG1931, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 140

 Score = 31.2 bits (71), Expect = 0.33
 Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 7/64 (10%)

Query: 139 VVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHEATITAG------GY 192
           V E +T   +ER+RE  R  + +LD A  ++  G+T +E+   +++     G        
Sbjct: 42  VGEGRTLKSLERLRELLR-KQRILDTARMVLEKGLTGNEVTFYLNKQAAYVGRVNFVEED 100

Query: 193 PSPL 196
            SPL
Sbjct: 101 ESPL 104


>gnl|CDD|227587 COG5262, HTA1, Histone H2A [Chromatin structure and dynamics].
          Length = 132

 Score = 30.2 bits (68), Expect = 0.61
 Identities = 9/29 (31%), Positives = 17/29 (58%)

Query: 176 DEIDRVVHEATITAGGYPSPLNYHFFPKS 204
           +E+++++ + TI  GG    +N    PKS
Sbjct: 93  EELNKLLGDVTIAQGGVLPNINPGLLPKS 121


>gnl|CDD|224978 COG2067, FadL, Long-chain fatty acid transport protein [Lipid
           metabolism].
          Length = 440

 Score = 30.8 bits (70), Expect = 1.0
 Identities = 21/107 (19%), Positives = 32/107 (29%), Gaps = 23/107 (21%)

Query: 38  GAAFCTQDCFKASWTSHKSVHLKAKLSAPGTGENSSLVSEGWRYCIKKGQARTPKLPHFD 97
           G A+   D  +   +    V +  K      G   S ++ G         A         
Sbjct: 225 GVAYEINDNNRIGLSYRSKVKMDFK------GGYESSLNLGMAGLPIGPLA--------- 269

Query: 98  WTGTLRPYPISSKLTVPAYIEL-------PDWALDGTPKVEPNSDLQ 137
             G + P      L++PA  EL         WA+ G+ K    S   
Sbjct: 270 -LGGVTPGSGKLTLSLPASAELSGQHKVADQWAIHGSVKWTDWSSFD 315


>gnl|CDD|217823 pfam03972, MmgE_PrpD, MmgE/PrpD family.  This family includes
           2-methylcitrate dehydratase EC:4.2.1.79 (PrpD) that is
           required for propionate catabolism. It catalyzes the
           third step of the 2-methylcitric acid cycle.
          Length = 445

 Score = 30.4 bits (69), Expect = 1.2
 Identities = 13/47 (27%), Positives = 16/47 (34%), Gaps = 4/47 (8%)

Query: 155 CRIAREVLDAAARMIRP-GVTTDEIDRVV---HEATITAGGYPSPLN 197
            R A    DAA  + R  G   DEI+ +     E  I       P  
Sbjct: 267 ERHAHSAADAALALRREEGDRADEIESIEIETFEVAIRIINKRGPPA 313


>gnl|CDD|234035 TIGR02860, spore_IV_B, stage IV sporulation protein B.  SpoIVB, the
           stage IV sporulation protein B of endospore-forming
           bacteria such as Bacillus subtilis, is a serine
           proteinase, expressed in the spore (rather than mother
           cell) compartment, that participates in a proteolytic
           activation cascade for Sigma-K. It appears to be
           universal among endospore-forming bacteria and occurs
           nowhere else [Cellular processes, Sporulation and
           germination].
          Length = 402

 Score = 30.4 bits (69), Expect = 1.3
 Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 5/33 (15%)

Query: 215 HGI--PDSRK---LEDGDIVNIDVTVYYKGVHG 242
           H I   D+ K   +E+G IVN  +T   KG  G
Sbjct: 220 HVITDIDTGKPISVENGQIVNSSITSIEKGRRG 252


>gnl|CDD|223407 COG0330, HflC, Membrane protease subunits, stomatin/prohibitin
           homologs [Posttranslational modification, protein
           turnover, chaperones].
          Length = 291

 Score = 30.1 bits (68), Expect = 1.3
 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 3/39 (7%)

Query: 225 DGDIVNIDVTVYYKGVHGDLNETYF-VGNADEASRQLVQ 262
           D  IV++D  V Y+    D  +  + V NA+ A RQLVQ
Sbjct: 83  DNVIVSVDAVVQYRVT--DPQKAVYNVENAEAALRQLVQ 119


>gnl|CDD|234956 PRK01490, tig, trigger factor; Provisional.
          Length = 435

 Score = 30.1 bits (69), Expect = 1.5
 Identities = 9/24 (37%), Positives = 11/24 (45%)

Query: 217 IPDSRKLEDGDIVNIDVTVYYKGV 240
           +P  R  E+GD V ID      G 
Sbjct: 152 VPVERPAENGDRVTIDFVGSIDGE 175


>gnl|CDD|233451 TIGR01531, glyc_debranch, glycogen debranching enzymye.  glycogen
            debranching enzyme possesses two different catalytic
            activities; oligo-1,4-->1,4-glucantransferase (EC
            2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33). Site
            directed mutagenesis studies in S. cerevisiae indicate
            that the transferase and glucosidase activities are
            independent and located in different regions of the
            polypeptide chain. Proteins in this model belong to the
            larger alpha-amylase family. The model covers eukaryotic
            proteins with a seed composed of human, nematode and
            yeast sequences. Yeast seed sequence is well
            characterized. The model is quite rigorous; either query
            sequence yields large bit score or it fails to hit the
            model altogether. There doesn't appear to be any middle
            ground [Energy metabolism, Biosynthesis and degradation
            of polysaccharides].
          Length = 1464

 Score = 30.2 bits (68), Expect = 1.8
 Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 6/60 (10%)

Query: 244  LNETYFVGNADEASRQLVQCTYECLEKAISIVKPGVRFRE--IGEVINRHATMSGFSVEI 301
             ++     +   A + L +  YE L+K       G++FRE   G  I+R  T  GF+V I
Sbjct: 1123 PDDDSIPVDDGRADQYLFEVIYEALQKHFQ----GIQFRERNAGPQIDRVMTDEGFNVTI 1178


>gnl|CDD|239876 cd04413, NDPk_I, Nucleoside diphosphate kinase Group I
           (NDPk_I)-like: NDP kinase domains are present in a large
           family of structurally and functionally conserved
           proteins from bacteria to humans that generally catalyze
           the transfer of gamma-phosphates of a nucleoside
           triphosphate (NTP) donor onto a nucleoside diphosphate
           (NDP) acceptor through a phosphohistidine intermediate.
           The mammalian nm23/NDP kinase gene family can be divided
           into two distinct groups. The group I genes encode
           proteins that generally have highly homologous
           counterparts in other organisms and possess the classic
           enzymatic activity of a kinase. This group includes
           vertebrate NDP kinases A-D (Nm23- H1 to -H4),  and its
           counterparts in bacteria, archea and other eukaryotes.
           NDP kinases exist in two different quaternary
           structures; all known eukaryotic enzymes are hexamers,
           while some bacterial enzymes are tetramers, as in
           Myxococcus. They possess the NDP kinase active site
           motif (NXXH[G/A]SD) and the nine residues that are most
           essential for catalysis.
          Length = 130

 Score = 28.6 bits (65), Expect = 2.2
 Identities = 11/23 (47%), Positives = 17/23 (73%), Gaps = 1/23 (4%)

Query: 269 EKAISIVKP-GVRFREIGEVINR 290
           E+ + I+KP GV+   IGE+I+R
Sbjct: 1   ERTLVIIKPDGVQRGLIGEIISR 23


>gnl|CDD|227872 COG5585, COG5585, NAD+--asparagine ADP-ribosyltransferase [Signal
           transduction mechanisms].
          Length = 417

 Score = 29.4 bits (66), Expect = 2.2
 Identities = 26/135 (19%), Positives = 45/135 (33%), Gaps = 13/135 (9%)

Query: 156 RIAREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSPLNYHFFPKSCCTSVNEVICH 215
           + A +VL A       G     I R +H  T+          +      C      V  +
Sbjct: 177 KYANQVLKA------IGAKLSGITRQLH-GTVAGVKEAEVGEFPPVHPGCRNGFGGVKEN 229

Query: 216 GIPDSRKLED-----GDIVNIDVTVYYKGVHGDLNETYFVGNADEASRQLVQCTYECLEK 270
            + D + L+       D+   D+  Y K    D NE        +  ++L +     L+ 
Sbjct: 230 FVKDPKSLDQNPNIHDDLGKKDLKNYTKDPRADENEKLRQTQDRDLGKKLRKTGMPALKN 289

Query: 271 AISIVKPG-VRFREI 284
               ++P  VR R+ 
Sbjct: 290 GEGQIRPKEVRIRKK 304


>gnl|CDD|227919 COG5632, COG5632, N-acetylmuramoyl-L-alanine amidase [Cell envelope
           biogenesis, outer membrane].
          Length = 302

 Score = 29.0 bits (65), Expect = 2.6
 Identities = 8/23 (34%), Positives = 9/23 (39%)

Query: 94  PHFDWTGTLRPYPISSKLTVPAY 116
            H DW+G L P     K    A 
Sbjct: 139 KHKDWSGKLCPARYLDKGRWDAV 161


>gnl|CDD|218493 pfam05199, GMC_oxred_C, GMC oxidoreductase.  This domain found
           associated with pfam00732.
          Length = 139

 Score = 28.1 bits (63), Expect = 3.0
 Identities = 14/60 (23%), Positives = 21/60 (35%), Gaps = 7/60 (11%)

Query: 125 DGTPKVEPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHE 184
            G P ++PN           P  +  +R   R+AR +L AA   + PG            
Sbjct: 15  LGLPVIDPNYL-------SDPADLAALRAALRLARRILAAALVELTPGPVEVVPGAAEAS 67


>gnl|CDD|206093 pfam13922, PHD_3, PHD domain of transcriptional enhancer, Asx.
          This is the DNA-binding domain on the additional sex
          combs-like 1 proteins. The Asx protein acts as an
          enhancer of trithorax and polycomb in displaying
          bidirectional homoeotic phenotypes in Drosophila,
          suggesting that it is required for maintenance of both
          activation and silencing of Hox genes. Asx is required
          for normal adult haematopoiesis and its function
          depends on its cellular context.
          Length = 64

 Score = 26.7 bits (59), Expect = 3.1
 Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 10/50 (20%)

Query: 1  MAGGSDAAETTSLSCVRCGKPAHLQCPKCMELKLPREGAAFCTQDCFKAS 50
             G + AE T+  C  C   A + C +C          AFC  DC   S
Sbjct: 17 NFSGENTAEATANKCA-CRLKAMVICQQC---------GAFCHDDCIGPS 56


>gnl|CDD|203013 pfam04438, zf-HIT, HIT zinc finger.  This presumed zinc finger
          contains up to 6 cysteine residues that could
          coordinate zinc. The domain is named after the HIT
          protein. This domain is also found in the Thyroid
          receptor interacting protein 3 (TRIP-3) that
          specifically interacts with the ligand binding domain
          of the thyroid receptor.
          Length = 30

 Score = 25.7 bits (57), Expect = 3.5
 Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 8/34 (23%)

Query: 15 CVRCGKPAHLQCPKCMELKLPREGAAFCTQDCFK 48
          C  CG P+  +CP+C        G  +C+ +C+K
Sbjct: 5  CSVCGNPSKYRCPRC--------GVRYCSLECYK 30


>gnl|CDD|206170 pfam14000, Packaging_FI, DNA packaging protein FI.  This family
           includes the lambda phage DNA-packaging protein FI.
           Proteins in this family are typically between 124 and
           140 amino acids in length. There is a conserved EEE
           sequence motif.
          Length = 125

 Score = 27.8 bits (62), Expect = 3.6
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query: 280 RFREIGEVINRHATMSGFSVEIKQLV 305
           R RE+  ++ R   +SG + E+ Q V
Sbjct: 8   RLRELAAILGREPDVSGSAAELAQRV 33


>gnl|CDD|183506 PRK12406, PRK12406, long-chain-fatty-acid--CoA ligase; Provisional.
          Length = 509

 Score = 28.9 bits (65), Expect = 3.8
 Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 10/64 (15%)

Query: 149 ERMRETCRIAREVLDAAARM----IRPGVTTDEIDR------VVHEATITAGGYPSPLNY 198
           +R R    +A+    AA  +    +RPG     + R          A +  G Y  P+N+
Sbjct: 9   DRRRSFDELAQRAARAAGGLAALGVRPGDCVALLMRNDFAFFEAAYAAMRLGAYAVPVNW 68

Query: 199 HFFP 202
           HF P
Sbjct: 69  HFKP 72


>gnl|CDD|222530 pfam14082, DUF4263, Domain of unknown function (DUF4263).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria, archaea,
           eukaryotes and viruses. Proteins in this family are
           typically between 244 and 403 amino acids in length.
          Length = 162

 Score = 28.0 bits (63), Expect = 4.3
 Identities = 10/44 (22%), Positives = 21/44 (47%), Gaps = 3/44 (6%)

Query: 130 VEPNSDLQHVVEIKTPDQ---IERMRETCRIAREVLDAAARMIR 170
            + +SD   +VEIK P+     +  R     + E+  A ++++ 
Sbjct: 52  ADKDSDNLVLVEIKRPNTNLFTKTPRGNGAPSAELSGAISQILD 95


>gnl|CDD|129632 TIGR00541, hisDCase_pyru, histidine decarboxylase, pyruvoyl type.
           This enzyme converts histadine to histamine in a single
           step by catalyzing the release of CO2. This type is
           synthesized as an inactive single chain precursor, then
           cleaved into two chains. The Ser at the new N-terminus
           at the cleavage site is converted to a pyruvoyl group
           essential for activity. This type of histidine
           decarboxylase appears is known so far only in some
           Gram-positive bacteria, where it may play a role in
           amino acid catabolism. There is also a pyridoxal
           phosphate type histidine decarboxylase, as found in
           human, where histamine is a biologically active amine
           [Energy metabolism, Amino acids and amines].
          Length = 310

 Score = 28.4 bits (63), Expect = 5.0
 Identities = 12/50 (24%), Positives = 24/50 (48%)

Query: 102 LRPYPISSKLTVPAYIELPDWALDGTPKVEPNSDLQHVVEIKTPDQIERM 151
           + P  I + +T   Y+ L   A+ G   + P+ D + +  I  P+ ++ M
Sbjct: 250 MEPGEIGNAITCAPYVSLAIDAIPGGSILTPDEDFEILENISLPEWLDDM 299


>gnl|CDD|188804 cd09420, LIM3_Prickle, The third LIM domain of Prickle.  The
          third LIM domain of Prickle: Prickle contains three
          C-terminal LIM domains and a N-terminal PET domain.
          Prickles have been implicated in roles of regulating
          tissue polarity or planar cell polarity (PCP).  PCP
          establishment requires the conserved
          Frizzled/Dishevelled PCP pathway. Prickle interacts
          with Dishevelled, thereby modulating
          Frizzled/Dishevelled activity and PCP signaling. Two
          forms of prickles have been identified; namely prickle
          1 and prickle 2. Prickle 1 and prickle 2 are
          differentially expressed. While prickle 1 is expressed
          in fetal heart and hematological malignancies, prickle
          2 is found in fetal brain, adult cartilage, pancreatic
          islet, and some types of timorous cells. LIM domains
          are 50-60 amino acids in size and share two
          characteristic zinc finger motifs. The two zinc fingers
          contain eight conserved residues, mostly cysteines and
          histidines, which coordinately bond to two zinc atoms.
          LIM domains function as adaptors or scaffolds to
          support the assembly of multimeric protein complexes.
          Length = 59

 Score = 25.9 bits (57), Expect = 5.5
 Identities = 11/39 (28%), Positives = 17/39 (43%), Gaps = 6/39 (15%)

Query: 8  AETTSLSCVRCGKPAHLQCPKCMELKLPREGAAFCTQDC 46
          A      C +C K   L  P      LP++G  +C++ C
Sbjct: 26 ATEKCFCCAQCKKSL-LGRPF-----LPKQGQIYCSRAC 58


>gnl|CDD|233843 TIGR02390, RNA_pol_rpoA1, DNA-directed RNA polymerase subunit A'.
           This family consists of the archaeal A' subunit of the
           DNA-directed RNA polymerase. The example from
           Methanocaldococcus jannaschii contains an intein.
          Length = 867

 Score = 28.5 bits (64), Expect = 5.7
 Identities = 15/68 (22%), Positives = 22/68 (32%), Gaps = 26/68 (38%)

Query: 104 PYPISSKLTVPAYIELPDWALDGTPKVEPNSDLQHVVEIKTPDQIERMRETCRIAREVLD 163
           P  I+ +LTVP  +                          TP  I+ +RE      +   
Sbjct: 338 PEQIAKELTVPERV--------------------------TPWNIDELREYVLNGPDSWP 371

Query: 164 AAARMIRP 171
            A  +IRP
Sbjct: 372 GANYVIRP 379


>gnl|CDD|232831 TIGR00115, tig, trigger factor.  Trigger factor is a
           ribosome-associated molecular chaperone and is the first
           chaperone to interact with nascent polypeptide. Trigger
           factor can bind at the same time as the signal
           recognition particle (SRP), but is excluded by the SRP
           receptor (FtsY). The central domain of trigger factor
           has peptidyl-prolyl cis/trans isomerase activity. This
           protein is found in a single copy in virtually every
           bacterial genome [Protein fate, Protein folding and
           stabilization].
          Length = 408

 Score = 28.3 bits (64), Expect = 6.1
 Identities = 8/22 (36%), Positives = 10/22 (45%)

Query: 219 DSRKLEDGDIVNIDVTVYYKGV 240
           + R  E GD V ID   +  G 
Sbjct: 143 ERRAAEKGDRVTIDFEGFIDGE 164


>gnl|CDD|233581 TIGR01810, betA, choline dehydrogenase.  Choline dehydrogenase
           catalyzes the conversion of exogenously supplied choline
           into the intermediate glycine betaine aldehyde, as part
           of a two-step oxidative reaction leading to the
           formation of osmoprotectant betaine. This enzymatic
           system can be found in both gram-positive and
           gram-negative bacteria. As in Escherichia coli ,
           Staphylococcus xylosus , and Sinorhizobium meliloti,
           this enzyme is found associated in a transciptionally
           co-induced gene cluster with betaine aldehyde
           dehydrogenase, the second catalytic enzyme in this
           reaction. Other gram-positive organisms have been shown
           to employ a different enzymatic system, utlizing a
           soluable choline oxidase or type III alcohol
           dehydrogenase instead of choline dehydrogenase. This
           enzyme is a member of the GMC oxidoreductase family
           (pfam00732 and pfam05199), sharing a common
           evoluntionary origin and enzymatic reaction with alcohol
           dehydrogenase. Outgrouping from this model, Caulobacter
           crescentus shares sequence homology with choline
           dehydrogenase, yet other genes participating in this
           enzymatic reaction have not currently been identified
           [Cellular processes, Adaptations to atypical
           conditions].
          Length = 532

 Score = 27.9 bits (62), Expect = 7.4
 Identities = 17/57 (29%), Positives = 22/57 (38%), Gaps = 9/57 (15%)

Query: 142 IKTPDQIERMRETCRIAREVLDAAA------RMIRPGV---TTDEIDRVVHEATITA 189
           +   +     RE  R+ RE+L   A        I PG    T +EID  V     TA
Sbjct: 408 MSHEEDWREFREAIRVTREILKQKALDPYRGGEISPGPEVQTDEEIDEFVRRHGETA 464


>gnl|CDD|225971 COG3437, COG3437, Response regulator containing a CheY-like
           receiver domain and an HD-GYP domain [Transcription /
           Signal transduction mechanisms].
          Length = 360

 Score = 27.7 bits (62), Expect = 7.8
 Identities = 14/54 (25%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 131 EPNSDLQHVVEIKTPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVHE 184
            P+S L       T ++ E M+    +  E+L ++ R+++      EI R  HE
Sbjct: 228 IPDSIL-LKPGKLTSEEFEIMKGHPILGAEILKSSERLMQVAA---EIARHHHE 277


>gnl|CDD|143439 cd07121, ALDH_EutE, Ethanolamine utilization protein EutE-like.
           Coenzyme A acylating aldehyde dehydrogenase (ACDH), an
           NAD+ and CoA-dependent acetaldehyde dehydrogenase,
           acetylating (EC=1.2.1.10), converts acetaldehyde into
           acetyl-CoA.  This CD is limited to such monofunctional
           enzymes as the Ethanolamine utilization protein, EutE,
           in Salmonella typhimurium.  Mutations in eutE abolish
           the ability to utilize ethanolamine as a carbon source.
          Length = 429

 Score = 27.6 bits (62), Expect = 9.0
 Identities = 13/50 (26%), Positives = 22/50 (44%), Gaps = 4/50 (8%)

Query: 239 GVHGDLNETYFVGNADEASRQLVQCTYECLEKAISIVKPGV--RFREIGE 286
           GV   +++   V  A  A +Q  +CT    EK I  ++  +     E+ E
Sbjct: 1   GVFATVDDA--VAAAKAAQKQYRKCTLADREKIIEAIREALLSNAEELAE 48


>gnl|CDD|187539 cd05228, AR_FR_like_1_SDR_e, uncharacterized subgroup of aldehyde
           reductase and flavonoid reductase related proteins,
           extended (e) SDRs.  This subgroup contains proteins of
           unknown function related to aldehyde reductase and
           flavonoid reductase of the extended SDR-type. Aldehyde
           reductase I (aka carbonyl reductase) is an NADP-binding
           SDR; it has an NADP-binding motif consensus that is
           slightly different from the canonical SDR form and lacks
           the Asn of the extended SDR active site tetrad. Aldehyde
           reductase I catalyzes the NADP-dependent  reduction of
           ethyl 4-chloro-3-oxobutanoate to ethyl
           (R)-4-chloro-3-hydroxybutanoate. The related flavonoid
           reductases act in the NADP-dependent reduction of
           flavonoids, ketone-containing plant secondary
           metabolites. Extended SDRs are distinct from classical
           SDRs. In addition to the Rossmann fold (alpha/beta
           folding pattern with a central beta-sheet) core region
           typical of all SDRs, extended SDRs have a less conserved
           C-terminal extension of approximately 100 amino acids.
           Extended SDRs are a diverse collection of proteins, and
           include isomerases, epimerases, oxidoreductases, and
           lyases; they typically have a TGXXGXXG cofactor binding
           motif. SDRs are a functionally diverse family of
           oxidoreductases that have a single domain with a
           structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 318

 Score = 27.6 bits (62), Expect = 9.3
 Identities = 16/38 (42%), Positives = 18/38 (47%), Gaps = 8/38 (21%)

Query: 158 AREVLDAAARMIRPGVTTDEIDRVVHEATITAGGYPSP 195
            R VLDAA      GV      RVVH ++I A G P  
Sbjct: 91  TRNVLDAALEA---GVR-----RVVHTSSIAALGGPPD 120


>gnl|CDD|224365 COG1448, TyrB, Aspartate/tyrosine/aromatic aminotransferase [Amino
           acid transport and metabolism].
          Length = 396

 Score = 27.6 bits (62), Expect = 9.9
 Identities = 14/40 (35%), Positives = 22/40 (55%), Gaps = 3/40 (7%)

Query: 144 TPDQIERMRETCRIAREVLDAAARMIRPGVTTDEIDRVVH 183
           +P+Q++R+RE   I    L A+ R+   G+ T  ID V  
Sbjct: 354 SPEQVDRLREEFGI---YLVASGRINVAGLNTSNIDYVAK 390


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.419 

Gapped
Lambda     K      H
   0.267   0.0731    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,372,177
Number of extensions: 1438605
Number of successful extensions: 1639
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1589
Number of HSP's successfully gapped: 76
Length of query: 305
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 208
Effective length of database: 6,635,264
Effective search space: 1380134912
Effective search space used: 1380134912
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.5 bits)