RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15498
         (635 letters)



>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score =  647 bits (1670), Expect = 0.0
 Identities = 243/418 (58%), Positives = 299/418 (71%), Gaps = 30/418 (7%)

Query: 243 KKKKKKKNKNKAAAATEEAPKNGIP-------------------------QQTTPPSIPV 277
           KKKKKKK K K            +P                         +QT PP+IPV
Sbjct: 58  KKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQTQPPTIPV 117

Query: 278 SQLYPDGNFPVGQEMEYPTLKDGRTGKDRFTNEEKKAIDRLHNDMYNEARQAAEAHRQVR 337
           S+ + DG +PVG+  EYP          R ++EEK+ +++L  + Y + R+AAE HRQVR
Sbjct: 118 SKQFKDGEYPVGEIQEYPGEN-----SSRTSSEEKRELEKLSEEQYQDLRRAAEVHRQVR 172

Query: 338 KHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCAAHYTPNNGDKT 397
           ++    IKPG+ +I+I E +E  SR LI  DGLK G AFPTGCS NHCAAHYTPN GDKT
Sbjct: 173 RYAQSVIKPGVKLIDICERIESKSRELIEADGLKCGWAFPTGCSLNHCAAHYTPNTGDKT 232

Query: 398 VLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDKLIEAVRDATNTGIKAAGIDVPLCEV 457
           VL +DD+ K+DFGTH+NGRIIDCAFT+A+N KYD L++A +DATNTGIK AGIDV L ++
Sbjct: 233 VLTYDDVCKLDFGTHVNGRIIDCAFTVAFNPKYDPLLQATKDATNTGIKEAGIDVRLSDI 292

Query: 458 GAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTVPIVRGGEGTLMEENEF 517
           GAAIQEV+ESYEVEI GKTY +KSIRNLNGHSI  Y IH GK+VPIV+GGE T MEE E 
Sbjct: 293 GAAIQEVIESYEVEIKGKTYPIKSIRNLNGHSIGPYIIHGGKSVPIVKGGENTRMEEGEL 352

Query: 518 YAIETFGSTGKGMVHDDMDVSHYMKNFEASYIPLRLASSKSLLNTINKNFGTLAFCKRWL 577
           +AIETF STG+G V++D++ SHYMK+  A ++PLRL  +K LL  IN NFGTLAFC+RWL
Sbjct: 353 FAIETFASTGRGYVNEDLECSHYMKDPGAEFVPLRLPKAKQLLKHINTNFGTLAFCRRWL 412

Query: 578 DRAGCTKYQMALKDLCDKGIVEAYPPLCDIKGSYTAQFEHTIVLRPTCKEVISRGDDY 635
           DR G  ++ +ALK L D GIV  YPPLCD++GSYT+Q EHTI+LRPTCKEV+SRGDDY
Sbjct: 413 DRLGQDRHLLALKQLVDAGIVNPYPPLCDVRGSYTSQMEHTILLRPTCKEVLSRGDDY 470



 Score =  209 bits (534), Expect = 1e-60
 Identities = 98/267 (36%), Positives = 133/267 (49%), Gaps = 51/267 (19%)

Query: 5   KGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDE-----IEDGD 59
             E    + K++K +NK     ++  K+              +   DDD      I +  
Sbjct: 2   MNENGENEVKQQKQQNKQKGTKKKNKKS-------------KKDVDDDDAFLAELISENQ 48

Query: 60  VEDGAAEASKKKKRKKKKKKAGAGEAAADV-----------PIDGNGIA-----EQLEQC 103
             +      KKKK+KKKKKK   GEA                 D + I       + E  
Sbjct: 49  EAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWK 108

Query: 104 GL------------NGNFPVGQEMEYPTLKDGRTGKDRFTNEEKKAIDRLHNDMYNEARQ 151
                         +G +PVG+  EYP          R ++EEK+ +++L  + Y + R+
Sbjct: 109 QTQPPTIPVSKQFKDGEYPVGEIQEYPGEN-----SSRTSSEEKRELEKLSEEQYQDLRR 163

Query: 152 AAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCAAH 211
           AAE HRQVR++    IKPG+ +I+I E +E  SR LI  DGLK G AFPTGCS NHCAAH
Sbjct: 164 AAEVHRQVRRYAQSVIKPGVKLIDICERIESKSRELIEADGLKCGWAFPTGCSLNHCAAH 223

Query: 212 YTPNNGDKTVLEFDDITKIDFGTHING 238
           YTPN GDKTVL +DD+ K+DFGTH+NG
Sbjct: 224 YTPNTGDKTVLTYDDVCKLDFGTHVNG 250


>gnl|CDD|238521 cd01088, MetAP2, Methionine Aminopeptidase 2. E.C. 3.4.11.18. Also
           known as methionyl aminopeptidase and peptidase M.
           Catalyzes release of N-terminal amino acids,
           preferentially methionine, from peptides and arylamides.
          Length = 291

 Score =  478 bits (1232), Expect = e-167
 Identities = 165/309 (53%), Positives = 205/309 (66%), Gaps = 18/309 (5%)

Query: 323 YNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSR 382
             + R+A E HRQVRK+    IKPGMT++EI E +E   R L       AG AFP   S 
Sbjct: 1   LEKYREAGEIHRQVRKYAQSLIKPGMTLLEIAEFVENRIRELG------AGPAFPVNLSI 54

Query: 383 NHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDKLIEAVRDATN 442
           N CAAHYTPN GD TVL+  D+ K+DFG H++G I D AFT+ ++ KYD L+EA ++A N
Sbjct: 55  NECAAHYTPNAGDDTVLKEGDVVKLDFGAHVDGYIADSAFTVDFDPKYDDLLEAAKEALN 114

Query: 443 TGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTVP 502
             IK AG DV L E+G AI+EV+ES         Y  K IRNL GHSI+ YR+HAGK++P
Sbjct: 115 AAIKEAGPDVRLGEIGEAIEEVIES---------YGFKPIRNLTGHSIERYRLHAGKSIP 165

Query: 503 IVRGGEGTLMEENEFYAIETFGSTGKGMVHDDMDVSHYMKNFEASYIPLRLASSKSLLNT 562
            V+GGEGT +EE + YAIE F +TGKG VHD  + S YM N +    PLRL  ++ LL+ 
Sbjct: 166 NVKGGEGTRLEEGDVYAIEPFATTGKGYVHDGPECSIYMLNRDK---PLRLPRARKLLDV 222

Query: 563 INKNFGTLAFCKRWLDRAGCTKYQMALKDLCDKGIVEAYPPLCDIKGSYTAQFEHTIVLR 622
           I +NFGTL F +RWLDR G TK  MALK+LC  GIV  YP L +I G Y AQFEHTI++R
Sbjct: 223 IYENFGTLPFARRWLDRLGETKLLMALKNLCKAGIVYPYPVLKEISGGYVAQFEHTIIVR 282

Query: 623 PTCKEVISR 631
              KEV +R
Sbjct: 283 EDGKEVTTR 291



 Score =  143 bits (363), Expect = 2e-38
 Identities = 48/93 (51%), Positives = 59/93 (63%), Gaps = 6/93 (6%)

Query: 146 YNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSR 205
             + R+A E HRQVRK+    IKPGMT++EI E +E   R L       AG AFP   S 
Sbjct: 1   LEKYREAGEIHRQVRKYAQSLIKPGMTLLEIAEFVENRIRELG------AGPAFPVNLSI 54

Query: 206 NHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
           N CAAHYTPN GD TVL+  D+ K+DFG H++G
Sbjct: 55  NECAAHYTPNAGDDTVLKEGDVVKLDFGAHVDG 87


>gnl|CDD|129592 TIGR00501, met_pdase_II, methionine aminopeptidase, type II.
           Methionine aminopeptidase (map) is a cobalt-binding
           enzyme. Bacterial and organellar examples (type I)
           differ from eukaroytic and archaeal (type II) examples
           in lacking a region of approximately 60 amino acids
           between the 4th and 5th cobalt-binding ligands. The role
           of this protein in general is to produce the mature
           amino end of cytosolic proteins by removing the
           N-terminal methionine. This model describes type II,
           among which the eukaryotic members typically have an
           N-terminal extension not present in archaeal members. It
           can act cotranslationally. The enzyme from rat has been
           shown to associate with translation initiation factor 2
           (IF-2) and may have a role in translational regulation
           [Protein fate, Protein modification and repair].
          Length = 295

 Score =  291 bits (747), Expect = 4e-94
 Identities = 128/311 (41%), Positives = 182/311 (58%), Gaps = 18/311 (5%)

Query: 321 DMYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGC 380
           +   +  +A + H +VR+     I PG+ ++E+ E +E   R L  E       AFP   
Sbjct: 3   ERAEKWIEAGKIHSKVRREAADRIVPGVKLLEVAEFVENRIRELGAEP------AFPCNI 56

Query: 381 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDKLIEAVRDA 440
           S N CAAH+TP  GDKTV +  D+ K+D G H++G I D A T+   ++YD L++A +DA
Sbjct: 57  SINECAAHFTPKAGDKTVFKDGDVVKLDLGAHVDGYIADTAITVDLGDQYDNLVKAAKDA 116

Query: 441 TNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKT 500
             T IK     V + E+G AIQEV+ES         Y VK I NL GHS+  YR+H GK+
Sbjct: 117 LYTAIKEIRAGVRVGEIGKAIQEVIES---------YGVKPISNLTGHSMAPYRLHGGKS 167

Query: 501 VPIVRGGEGTLMEENEFYAIETFGSTGKGMVHDDMDVSHYMKNFEASYIPLRLASSKSLL 560
           +P V+  + T +EE +  AIE F + G G V D  +VS Y    E    P+RL S+++LL
Sbjct: 168 IPNVKERDTTKLEEGDVVAIEPFATDGVGYVTDGGEVSIYAFLAER---PVRLDSARNLL 224

Query: 561 NTINKNFGTLAFCKRWLDRAGCTKYQMALKDLCDKGIVEAYPPLCDIKGSYTAQFEHTIV 620
            TI++N+GTL F +RWLD+ G  KY  AL +L   G++  YP L +I G Y AQ+EHTI+
Sbjct: 225 KTIDENYGTLPFARRWLDKLGDEKYLFALNNLIRHGLIYDYPVLNEISGGYVAQWEHTIL 284

Query: 621 LRPTCKEVISR 631
           +    KEV ++
Sbjct: 285 VEEHGKEVTTK 295



 Score = 86.8 bits (215), Expect = 9e-19
 Identities = 35/95 (36%), Positives = 52/95 (54%), Gaps = 6/95 (6%)

Query: 144 DMYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGC 203
           +   +  +A + H +VR+     I PG+ ++E+ E +E   R L  E       AFP   
Sbjct: 3   ERAEKWIEAGKIHSKVRREAADRIVPGVKLLEVAEFVENRIRELGAEP------AFPCNI 56

Query: 204 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
           S N CAAH+TP  GDKTV +  D+ K+D G H++G
Sbjct: 57  SINECAAHFTPKAGDKTVFKDGDVVKLDLGAHVDG 91


>gnl|CDD|181534 PRK08671, PRK08671, methionine aminopeptidase; Provisional.
          Length = 291

 Score =  258 bits (661), Expect = 2e-81
 Identities = 102/299 (34%), Positives = 157/299 (52%), Gaps = 19/299 (6%)

Query: 322 MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCS 381
              +  +A +   +VR+     IKPG  ++++ E +E      I E G  A  AFP   S
Sbjct: 1   ELEKYLEAGKIASKVREEAAKLIKPGAKLLDVAEFVENR----IRELG--AKPAFPCNIS 54

Query: 382 RNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDKLIEAVRDAT 441
            N  AAHYTP+ GD+ V    D+ K+D G H++G I D A T+    KY+ L+EA  +A 
Sbjct: 55  INEVAAHYTPSPGDERVFPEGDVVKLDLGAHVDGYIADTAVTVDLGGKYEDLVEASEEAL 114

Query: 442 NTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTV 501
              I+     V + E+G  I+E + SY     G     K IRNL GH ++ Y +HAG ++
Sbjct: 115 EAAIEVVRPGVSVGEIGRVIEETIRSY-----G----FKPIRNLTGHGLERYELHAGPSI 165

Query: 502 PIVRGGEGTLMEENEFYAIETFGSTGKGMVHDDMDVSHYMKNFEASYIPLRLASSKSLLN 561
           P    G G  +EE + YAIE F + G+G V +  +V  Y      +  P+RL +++ LL 
Sbjct: 166 PNYDEGGGVKLEEGDVYAIEPFATDGEGKVVEGPEVEIYS--LLRNR-PVRLPAARKLLE 222

Query: 562 TINKNFGTLAFCKRWLDRAGCTKYQMALKDLCDKGIVEAYPPLCDIKGSYTAQFEHTIV 620
            I + + TL F +RWL+     + ++ L+ L   G +  YP L ++KG   +Q EHT++
Sbjct: 223 EIEEEYNTLPFAERWLEGLF-GEDKLELRRLLKAGALYGYPVLKEVKGGLVSQAEHTVI 280



 Score = 86.4 bits (215), Expect = 1e-18
 Identities = 33/94 (35%), Positives = 49/94 (52%), Gaps = 6/94 (6%)

Query: 145 MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCS 204
              +  +A +   +VR+     IKPG  ++++ E +E      I E G  A  AFP   S
Sbjct: 1   ELEKYLEAGKIASKVREEAAKLIKPGAKLLDVAEFVENR----IRELG--AKPAFPCNIS 54

Query: 205 RNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
            N  AAHYTP+ GD+ V    D+ K+D G H++G
Sbjct: 55  INEVAAHYTPSPGDERVFPEGDVVKLDLGAHVDG 88


>gnl|CDD|223103 COG0024, Map, Methionine aminopeptidase [Translation, ribosomal
           structure and biogenesis].
          Length = 255

 Score =  173 bits (441), Expect = 7e-50
 Identities = 70/226 (30%), Positives = 103/226 (45%), Gaps = 25/226 (11%)

Query: 321 DMYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKA-----GLA 375
           +   + R+A +   +  K +   +KPG+T +E+ E  E++   +  +    A     G  
Sbjct: 9   EEIEKMREAGKIAAKALKEVASLVKPGVTTLELDEIAEEF---IREKGAYPAFLGYKGFP 65

Query: 376 FPTGCSRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNK----YD 431
           FPT  S N   AH  P   DK VL+  DI KID G HI+G I D A T            
Sbjct: 66  FPTCISVNEVVAHGIPG--DKKVLKEGDIVKIDVGAHIDGYIGDTAITFVVGEVSDEDAK 123

Query: 432 KLIEAVRDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSID 491
           +L+EA ++A   GI+A      L ++G AIQE  ES              +RNL GH I 
Sbjct: 124 RLLEATKEALYAGIEAVKPGARLGDIGRAIQEYAES---------RGFSVVRNLTGHGIG 174

Query: 492 SYRIHAGKTVPIV-RGGEGTLMEENEFYAIETFGSTGKGMVHDDMD 536
              +H   ++P   + G G  ++E   +AIE   +TG G V +   
Sbjct: 175 R-ELHEEPSIPNYGKDGTGVRLKEGMVFAIEPMINTGSGEVVEGPS 219



 Score = 86.5 bits (215), Expect = 7e-19
 Identities = 39/134 (29%), Positives = 60/134 (44%), Gaps = 16/134 (11%)

Query: 144 DMYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKA-----GLA 198
           +   + R+A +   +  K +   +KPG+T +E+ E  E++   +  +    A     G  
Sbjct: 9   EEIEKMREAGKIAAKALKEVASLVKPGVTTLELDEIAEEF---IREKGAYPAFLGYKGFP 65

Query: 199 FPTGCSRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING------EEVAKKKKKKKNKN 252
           FPT  S N   AH  P   DK VL+  DI KID G HI+G            +   ++  
Sbjct: 66  FPTCISVNEVVAHGIPG--DKKVLKEGDIVKIDVGAHIDGYIGDTAITFVVGEVSDEDAK 123

Query: 253 KAAAATEEAPKNGI 266
           +   AT+EA   GI
Sbjct: 124 RLLEATKEALYAGI 137



 Score = 38.0 bits (89), Expect = 0.009
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 604 LCDIKGSYTAQFEHTIVLRPTCKEVISRGDDY 635
           L    GS +AQFEHT+++     E+++   + 
Sbjct: 224 LVTKDGSLSAQFEHTVIVTEDGCEILTLRPEE 255


>gnl|CDD|215993 pfam00557, Peptidase_M24, Metallopeptidase family M24.  This family
           contains metallopeptidases. It also contains
           non-peptidase homologues such as the N terminal domain
           of Spt16 which is a histone H3-H4 binding module.
          Length = 208

 Score =  122 bits (307), Expect = 1e-31
 Identities = 57/213 (26%), Positives = 80/213 (37%), Gaps = 24/213 (11%)

Query: 327 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPT--GCSRNH 384
           R+AA       +  +  I+PG+T  E+  ELE          G   G AFP       N 
Sbjct: 4   RKAARIAAAALEAALAAIRPGVTERELAAELEAA----FLARGGARGPAFPPIVASGPNA 59

Query: 385 CAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAY---NNKYDKLIEAVRDAT 441
              HY P++    VL+  D+  ID G   +G   D   T        +  +L EAV +A 
Sbjct: 60  AVPHYIPSD---RVLKDGDLVLIDVGAEYDGYHSDITRTFVVGKPTPEQRELYEAVLEAQ 116

Query: 442 NTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTV 501
              I A    V   +V AA +EV+E                 +  GH I    +H     
Sbjct: 117 EAAIAAVKPGVTGGDVDAAAREVLEEGGYGEY--------FPHGLGHGIG-LDVHDEGPY 167

Query: 502 PIVRGGEGTLMEENEFYAIE--TFGSTGKGMVH 532
            I RGG   ++E    + IE   +   G G V 
Sbjct: 168 -ISRGGNDRVLEPGMVFTIEPGIYFIPGWGGVR 199



 Score = 76.9 bits (190), Expect = 5e-16
 Identities = 27/91 (29%), Positives = 38/91 (41%), Gaps = 9/91 (9%)

Query: 150 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPT--GCSRNH 207
           R+AA       +  +  I+PG+T  E+  ELE          G   G AFP       N 
Sbjct: 4   RKAARIAAAALEAALAAIRPGVTERELAAELEAA----FLARGGARGPAFPPIVASGPNA 59

Query: 208 CAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
              HY P++    VL+  D+  ID G   +G
Sbjct: 60  AVPHYIPSD---RVLKDGDLVLIDVGAEYDG 87


>gnl|CDD|238514 cd01066, APP_MetAP, A family including aminopeptidase P,
           aminopeptidase M, and prolidase. Also known as
           metallopeptidase family M24. This family of enzymes is
           able to cleave amido-, imido- and amidino-containing
           bonds. Members exibit relatively narrow substrate
           specificity compared to other metallo-aminopeptidases,
           suggesting they play roles in regulation of biological
           processes rather than general protein degradation.
          Length = 207

 Score = 97.5 bits (243), Expect = 3e-23
 Identities = 53/216 (24%), Positives = 85/216 (39%), Gaps = 24/216 (11%)

Query: 327 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCA 386
           R+AAE            I+PG+T  E+   +E+  R      G         G       
Sbjct: 5   RKAAEIAEAAMAAAAEAIRPGVTEAEVAAAIEQALR----AAGGYPAGPTIVGSGARTAL 60

Query: 387 AHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYD---KLIEAVRDATNT 443
            HY P++     L+  D+  +D G   +G   D   T       D   +L EAVR+A   
Sbjct: 61  PHYRPDD---RRLQEGDLVLVDLGGVYDGYHADLTRTFVIGEPSDEQRELYEAVREAQEA 117

Query: 444 GIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTVPI 503
            + A    V   EV AA +EV+E + +          +  +  GH I    IH     P+
Sbjct: 118 ALAALRPGVTAEEVDAAAREVLEEHGLG--------PNFGHRTGHGIG-LEIHEP---PV 165

Query: 504 VRGGEGTLMEENEFYAIETF--GSTGKGMVHDDMDV 537
           ++ G+ T++E    +A+E       G G+  +D  +
Sbjct: 166 LKAGDDTVLEPGMVFAVEPGLYLPGGGGVRIEDTVL 201



 Score = 55.5 bits (134), Expect = 1e-08
 Identities = 21/89 (23%), Positives = 33/89 (37%), Gaps = 7/89 (7%)

Query: 150 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCA 209
           R+AAE            I+PG+T  E+   +E+  R      G         G       
Sbjct: 5   RKAAEIAEAAMAAAAEAIRPGVTEAEVAAAIEQALR----AAGGYPAGPTIVGSGARTAL 60

Query: 210 AHYTPNNGDKTVLEFDDITKIDFGTHING 238
            HY P++     L+  D+  +D G   +G
Sbjct: 61  PHYRPDD---RRLQEGDLVLVDLGGVYDG 86


>gnl|CDD|129586 TIGR00495, crvDNA_42K, 42K curved DNA binding protein.  Proteins
           identified by This model have been identified in a
           number of species as a nuclear (but not nucleolar)
           protein with a cell cycle dependence. Various names
           given to members of this family have included cell cycle
           protein p38-2G4, DNA-binding protein GBP16, and
           proliferation-associated protein 1. This protein is
           closely related to methionine aminopeptidase, a
           cobolt-binding protein [Unknown function, General].
          Length = 389

 Score = 98.8 bits (246), Expect = 3e-22
 Identities = 82/341 (24%), Positives = 153/341 (44%), Gaps = 26/341 (7%)

Query: 310 EEKKAIDRLHND-MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEE-----LEKYSRL 363
           E+++    L N  +  + + A E    V K ++    PG  +++I E+     +E+ +++
Sbjct: 5   EQQEQAYSLSNPEVVTKYKMAGEIANNVLKSVVEACSPGAKVVDICEKGDAFIMEETAKI 64

Query: 364 LIGEDGLKAGLAFPTGCSRNHCAAHYTPNNGDKT-VLEFDDITKIDFGTHINGRIIDCAF 422
              E  ++ G+AFPT  S N+C  H++P   D+  +L+  D+ KID G HI+G I   A 
Sbjct: 65  FKKEKEMEKGIAFPTCISVNNCVGHFSPLKSDQDYILKEGDVVKIDLGCHIDGFIALVAH 124

Query: 423 TLAYNNKYDKLIEA----VRDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQ 478
           T       ++ +      V  A +   +AA   + L + G    +V E+        +Y 
Sbjct: 125 TFVVGVAQEEPVTGRKADVIAAAHLAAEAA---LRLVKPGNTNTQVTEAINKVAH--SYG 179

Query: 479 VKSIRNLNGHSIDSYRIHAGKTVPIVRGGEGTL------MEENEFYAIETFGSTGKGMVH 532
              +  +  H +  + I   K +                 EENE YA++   STG+G   
Sbjct: 180 CTPVEGMLSHQLKQHVIDGEKVIISNPSDSQKKDHDTAEFEENEVYAVDILVSTGEGKAK 239

Query: 533 D-DMDVSHYMKNFEASYIPLRLASSKSLLNTINKNFGTLAFCKRWLDRAGCTKYQMALKD 591
           D D   + Y ++   +Y  L++ +S++  + I + F  + F  R  +     +  M L +
Sbjct: 240 DADQRTTIYKRDPSKTYG-LKMKASRAFFSEIERRFDAMPFTLRNFEDEKRAR--MGLVE 296

Query: 592 LCDKGIVEAYPPLCDIKGSYTAQFEHTIVLRPTCKEVISRG 632
                +++ YP L + +G + AQF+ T++L P     I+ G
Sbjct: 297 CVKHELLQPYPVLYEKEGEFVAQFKFTVLLMPNGPMRITSG 337



 Score = 54.9 bits (132), Expect = 7e-08
 Identities = 33/113 (29%), Positives = 61/113 (53%), Gaps = 7/113 (6%)

Query: 133 EEKKAIDRLHND-MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEE-----LEKYSRL 186
           E+++    L N  +  + + A E    V K ++    PG  +++I E+     +E+ +++
Sbjct: 5   EQQEQAYSLSNPEVVTKYKMAGEIANNVLKSVVEACSPGAKVVDICEKGDAFIMEETAKI 64

Query: 187 LIGEDGLKAGLAFPTGCSRNHCAAHYTPNNGDKT-VLEFDDITKIDFGTHING 238
              E  ++ G+AFPT  S N+C  H++P   D+  +L+  D+ KID G HI+G
Sbjct: 65  FKKEKEMEKGIAFPTCISVNNCVGHFSPLKSDQDYILKEGDVVKIDLGCHIDG 117



 Score = 30.6 bits (69), Expect = 2.5
 Identities = 16/47 (34%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 48  LYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAGEAAA--DVPID 92
           LY  + E++D +++   A   K+KK+KKK KKA     AA     ++
Sbjct: 343 LYKSEMEVQDPEIKALLASPIKRKKQKKKAKKASKTGEAATEGETME 389


>gnl|CDD|238519 cd01086, MetAP1, Methionine Aminopeptidase 1. E.C. 3.4.11.18. Also
           known as methionyl aminopeptidase and Peptidase M.
           Catalyzes release of N-terminal amino acids,
           preferentially methionine, from peptides and arylamides.
          Length = 238

 Score = 83.7 bits (208), Expect = 4e-18
 Identities = 58/217 (26%), Positives = 84/217 (38%), Gaps = 31/217 (14%)

Query: 322 MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAG----LAFP 377
           M    R  AE   ++ K     IKPG+T  E+ +   ++    I E G          FP
Sbjct: 4   MREAGRIVAEVLDELAKA----IKPGVTTKELDQIAHEF----IEEHGAYPAPLGYYGFP 55

Query: 378 -TGC-SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYD---K 432
            + C S N    H  P   D  VL+  DI  ID G  ++G   D A T       +   K
Sbjct: 56  KSICTSVNEVVCHGIP---DDRVLKDGDIVNIDVGVELDGYHGDSARTFIVGEVSEEAKK 112

Query: 433 LIEAVRDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDS 492
           L+E   +A   GI+A      + ++G AI++  E          Y V  +R   GH I  
Sbjct: 113 LVEVTEEALYKGIEAVKPGNRIGDIGHAIEKYAEKN-------GYSV--VREFGGHGIGR 163

Query: 493 YRIHAGKTVPIV-RGGEGTLMEENEFYAIETFGSTGK 528
              H    +P   R G G  ++    + IE   + G 
Sbjct: 164 K-FHEEPQIPNYGRPGTGPKLKPGMVFTIEPMINLGT 199



 Score = 37.5 bits (88), Expect = 0.011
 Identities = 30/100 (30%), Positives = 41/100 (41%), Gaps = 17/100 (17%)

Query: 145 MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAG----LAFP 200
           M    R  AE   ++ K     IKPG+T  E+ +   ++    I E G          FP
Sbjct: 4   MREAGRIVAEVLDELAKA----IKPGVTTKELDQIAHEF----IEEHGAYPAPLGYYGFP 55

Query: 201 -TGC-SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
            + C S N    H  P   D  VL+  DI  ID G  ++G
Sbjct: 56  KSICTSVNEVVCHGIP---DDRVLKDGDIVNIDVGVELDG 92



 Score = 30.2 bits (69), Expect = 3.1
 Identities = 8/13 (61%), Positives = 11/13 (84%)

Query: 608 KGSYTAQFEHTIV 620
            GS +AQFEHT++
Sbjct: 215 DGSLSAQFEHTVL 227


>gnl|CDD|237252 PRK12896, PRK12896, methionine aminopeptidase; Reviewed.
          Length = 255

 Score = 68.3 bits (168), Expect = 8e-13
 Identities = 50/204 (24%), Positives = 82/204 (40%), Gaps = 28/204 (13%)

Query: 335 QVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDG----LKAGLAFPTGC--SRNHCAAH 388
              K M   ++PGMT     +EL++ +   + E G     +    FP     S N   AH
Sbjct: 28  TALKEMGKAVEPGMT----TKELDRIAEKRLEEHGAIPSPEGYYGFPGSTCISVNEEVAH 83

Query: 389 YTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNN---KYDKLIEAVRDATNTGI 445
             P      V++  D+  ID   +++G   D   T A      + +KL     +A   GI
Sbjct: 84  GIP---GPRVIKDGDLVNIDVSAYLDGYHGDTGITFAVGPVSEEAEKLCRVAEEALWAGI 140

Query: 446 KAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTVPIVR 505
           K      PL ++G AI++       + +G  Y V  +R+L GH +    +H   +V +  
Sbjct: 141 KQVKAGRPLNDIGRAIED-----FAKKNG--YSV--VRDLTGHGVGR-SLHEEPSVILTY 190

Query: 506 G--GEGTLMEENEFYAIETFGSTG 527
                  L+      A+E F + G
Sbjct: 191 TDPLPNRLLRPGMTLAVEPFLNLG 214



 Score = 41.0 bits (97), Expect = 0.001
 Identities = 22/87 (25%), Positives = 36/87 (41%), Gaps = 13/87 (14%)

Query: 158 QVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDG----LKAGLAFPTGC--SRNHCAAH 211
              K M   ++PGMT     +EL++ +   + E G     +    FP     S N   AH
Sbjct: 28  TALKEMGKAVEPGMT----TKELDRIAEKRLEEHGAIPSPEGYYGFPGSTCISVNEEVAH 83

Query: 212 YTPNNGDKTVLEFDDITKIDFGTHING 238
             P      V++  D+  ID   +++G
Sbjct: 84  GIP---GPRVIKDGDLVNIDVSAYLDG 107



 Score = 30.2 bits (69), Expect = 2.9
 Identities = 8/12 (66%), Positives = 10/12 (83%)

Query: 609 GSYTAQFEHTIV 620
            S +AQFEHT+V
Sbjct: 232 KSLSAQFEHTVV 243


>gnl|CDD|235576 PRK05716, PRK05716, methionine aminopeptidase; Validated.
          Length = 252

 Score = 67.5 bits (166), Expect = 2e-12
 Identities = 57/226 (25%), Positives = 86/226 (38%), Gaps = 47/226 (20%)

Query: 329 AAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLL------IGEDGLKAGLAFP--TGC 380
           AAE   ++  H    +KPG+T  E+    E+Y R        +G  G      FP     
Sbjct: 21  AAEVLDEIEPH----VKPGVTTKELDRIAEEYIRDQGAIPAPLGYHG------FPKSICT 70

Query: 381 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAY---NNKYDKLIEAV 437
           S N    H  P      VL+  DI  ID     +G   D + T      + +  +L E  
Sbjct: 71  SVNEVVCHGIP---SDKVLKEGDIVNIDVTVIKDGYHGDTSRTFGVGEISPEDKRLCEVT 127

Query: 438 RDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHA 497
           ++A   GI A      L ++G AIQ+  E+         + V  +R   GH I       
Sbjct: 128 KEALYLGIAAVKPGARLGDIGHAIQKYAEAE-------GFSV--VREYCGHGI------- 171

Query: 498 GKTV---PIV----RGGEGTLMEENEFYAIETFGSTGKGMVHDDMD 536
           G+     P +      G+G +++E   + IE   + GK  V    D
Sbjct: 172 GRKFHEEPQIPHYGAPGDGPVLKEGMVFTIEPMINAGKREVKTLKD 217



 Score = 34.7 bits (81), Expect = 0.11
 Identities = 27/95 (28%), Positives = 36/95 (37%), Gaps = 21/95 (22%)

Query: 152 AAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLL------IGEDGLKAGLAFP--TGC 203
           AAE   ++  H    +KPG+T  E+    E+Y R        +G  G      FP     
Sbjct: 21  AAEVLDEIEPH----VKPGVTTKELDRIAEEYIRDQGAIPAPLGYHG------FPKSICT 70

Query: 204 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
           S N    H  P      VL+  DI  ID     +G
Sbjct: 71  SVNEVVCHGIP---SDKVLKEGDIVNIDVTVIKDG 102


>gnl|CDD|238525 cd01092, APP-like, Similar to Prolidase and Aminopeptidase P. The
           members of this subfamily presumably catalyse hydrolysis
           of Xaa-Pro dipeptides and/or release of any N-terminal
           amino acid, including proline, that is linked with
           proline.
          Length = 208

 Score = 65.6 bits (161), Expect = 4e-12
 Identities = 46/147 (31%), Positives = 66/147 (44%), Gaps = 13/147 (8%)

Query: 327 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCA 386
           R+AA    +  + ++ +IKPGMT  E+  ELE + R L  E     G +F T  +    +
Sbjct: 5   RKAARIADKAFEELLEFIKPGMTEREVAAELEYFMRKLGAE-----GPSFDTIVASGPNS 59

Query: 387 A--HYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDKLIE---AVRDAT 441
           A  H  P       +E  D+  IDFG   +G   D   T+A     D+L E    V +A 
Sbjct: 60  ALPHGVP---SDRKIEEGDLVLIDFGAIYDGYCSDITRTVAVGEPSDELKEIYEIVLEAQ 116

Query: 442 NTGIKAAGIDVPLCEVGAAIQEVMESY 468
              IKA    V   EV  A ++V+E  
Sbjct: 117 QAAIKAVKPGVTAKEVDKAARDVIEEA 143



 Score = 46.7 bits (112), Expect = 1e-05
 Identities = 29/91 (31%), Positives = 43/91 (47%), Gaps = 10/91 (10%)

Query: 150 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCA 209
           R+AA    +  + ++ +IKPGMT  E+  ELE + R L  E     G +F T  +    +
Sbjct: 5   RKAARIADKAFEELLEFIKPGMTEREVAAELEYFMRKLGAE-----GPSFDTIVASGPNS 59

Query: 210 A--HYTPNNGDKTVLEFDDITKIDFGTHING 238
           A  H  P       +E  D+  IDFG   +G
Sbjct: 60  ALPHGVP---SDRKIEEGDLVLIDFGAIYDG 87


>gnl|CDD|223085 COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and
           metabolism].
          Length = 384

 Score = 59.7 bits (145), Expect = 2e-09
 Identities = 54/203 (26%), Positives = 77/203 (37%), Gaps = 28/203 (13%)

Query: 326 ARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCS--RN 383
           A  A  A        +  I+PGMT  EI  ELE   R    E     G +F T  +   N
Sbjct: 167 AEIADAALEAA----LEAIRPGMTEAEIAAELEYALRKGGAE-----GPSFDTIVASGEN 217

Query: 384 HCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYN---NKYDKLIEAVRDA 440
               HYTP++     L   D+  ID G   NG   D   T       ++  ++ EAV +A
Sbjct: 218 AALPHYTPSD---RKLRDGDLVLIDLGGVYNGYCSDITRTFPIGKPSDEQREIYEAVLEA 274

Query: 441 TNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSI-DSYRIHAGK 499
               I A    V   EV AA ++V+E     +           +  GH +     +H   
Sbjct: 275 QEAAIAAIRPGVTGGEVDAAARQVLEKAGYGLY--------FLHGTGHGVGFVLDVHEH- 325

Query: 500 TVPIVRGGEGTLMEENEFYAIET 522
              +  G + TL E    ++IE 
Sbjct: 326 PQYLSPGSDTTL-EPGMVFSIEP 347



 Score = 48.2 bits (115), Expect = 8e-06
 Identities = 30/92 (32%), Positives = 37/92 (40%), Gaps = 14/92 (15%)

Query: 149 ARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCS--RN 206
           A  A  A        +  I+PGMT  EI  ELE   R    E     G +F T  +   N
Sbjct: 167 AEIADAALEAA----LEAIRPGMTEAEIAAELEYALRKGGAE-----GPSFDTIVASGEN 217

Query: 207 HCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
               HYTP++     L   D+  ID G   NG
Sbjct: 218 AALPHYTPSD---RKLRDGDLVLIDLGGVYNG 246


>gnl|CDD|238522 cd01089, PA2G4-like, Related to aminopepdidase M, this family
           contains proliferation-associated protein 2G4. Family
           members have been implicated in cell cycle control.
          Length = 228

 Score = 57.7 bits (140), Expect = 2e-09
 Identities = 43/136 (31%), Positives = 60/136 (44%), Gaps = 20/136 (14%)

Query: 323 YNEARQAAEAHRQVRKHMMGWIKPGMTMIE--------IVEELEKYSRLLIGEDGLKAGL 374
           Y  A Q A     V K ++    PG  +++        I+EEL K  +    E  L+ G+
Sbjct: 4   YKTAGQIANK---VLKQVISLCVPGAKVVDLCEKGDKLILEELGKVYK---KEKKLEKGI 57

Query: 375 AFPTGCSRNHCAAHYTPNNGDKTV-LEFDDITKIDFGTHINGRIIDCAFTL-AYNNKYDK 432
           AFPT  S N+C  H++P   D T  L+  D+ KID G HI+G I   A T+         
Sbjct: 58  AFPTCISVNNCVCHFSPLKSDATYTLKDGDVVKIDLGCHIDGYIAVVAHTIVVGAEAETP 117

Query: 433 LIEAVRDATNTGIKAA 448
           +     D     I AA
Sbjct: 118 VTGKKADV----IAAA 129



 Score = 56.2 bits (136), Expect = 9e-09
 Identities = 36/102 (35%), Positives = 51/102 (50%), Gaps = 15/102 (14%)

Query: 146 YNEARQAAEAHRQVRKHMMGWIKPGMTMIE--------IVEELEKYSRLLIGEDGLKAGL 197
           Y  A Q A     V K ++    PG  +++        I+EEL K  +    E  L+ G+
Sbjct: 4   YKTAGQIANK---VLKQVISLCVPGAKVVDLCEKGDKLILEELGKVYK---KEKKLEKGI 57

Query: 198 AFPTGCSRNHCAAHYTPNNGDKTV-LEFDDITKIDFGTHING 238
           AFPT  S N+C  H++P   D T  L+  D+ KID G HI+G
Sbjct: 58  AFPTCISVNNCVCHFSPLKSDATYTLKDGDVVKIDLGCHIDG 99



 Score = 31.5 bits (72), Expect = 0.91
 Identities = 14/43 (32%), Positives = 22/43 (51%), Gaps = 4/43 (9%)

Query: 581 GCTKYQMALKDLCDKGIVEAYPPLCDIKGSYTAQFEHTIVLRP 623
           G  K    L +    G++  YP L + +G   AQF+ T++L P
Sbjct: 182 GKAK----LVECVKHGLLFPYPVLYEKEGEVVAQFKLTVLLTP 220


>gnl|CDD|129591 TIGR00500, met_pdase_I, methionine aminopeptidase, type I.
           Methionine aminopeptidase is a cobalt-binding enzyme.
           Bacterial and organellar examples (type I) differ from
           eukaroytic and archaeal (type II) examples in lacking a
           region of approximately 60 amino acids between the 4th
           and 5th cobalt-binding ligands. This model describes
           type I. The role of this protein in general is to
           produce the mature form of cytosolic proteins by
           removing the N-terminal methionine [Protein fate,
           Protein modification and repair].
          Length = 247

 Score = 53.5 bits (129), Expect = 7e-08
 Identities = 53/213 (24%), Positives = 86/213 (40%), Gaps = 27/213 (12%)

Query: 327 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAG----LAFP-TGC- 380
           R+A     +V + +   +KPG++  E+    + +    I + G K        FP + C 
Sbjct: 13  RKAGRLAAEVLEELEREVKPGVSTKELDRIAKDF----IEKHGAKPAFLGYYGFPGSVCI 68

Query: 381 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNN---KYDKLIEAV 437
           S N    H  P   DK VL+  DI  ID G   +G   D A T        + +KL+E  
Sbjct: 69  SVNEVVIHGIP---DKKVLKDGDIVNIDVGVIYDGYHGDTAKTFLVGKISPEAEKLLECT 125

Query: 438 RDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHA 497
            ++    I+ A     + E+GAAIQ+  E+    +         +R   GH I   + H 
Sbjct: 126 EESLYKAIEEAKPGNRIGEIGAAIQKYAEAKGFSV---------VREYCGHGI-GRKFHE 175

Query: 498 GKTVP-IVRGGEGTLMEENEFYAIETFGSTGKG 529
              +P   +      ++E   + IE   +TG  
Sbjct: 176 EPQIPNYGKKFTNVRLKEGMVFTIEPMVNTGTE 208



 Score = 29.2 bits (66), Expect = 6.3
 Identities = 27/95 (28%), Positives = 41/95 (43%), Gaps = 13/95 (13%)

Query: 150 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAG----LAFP-TGC- 203
           R+A     +V + +   +KPG++  E+    + +    I + G K        FP + C 
Sbjct: 13  RKAGRLAAEVLEELEREVKPGVSTKELDRIAKDF----IEKHGAKPAFLGYYGFPGSVCI 68

Query: 204 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
           S N    H  P   DK VL+  DI  ID G   +G
Sbjct: 69  SVNEVVIHGIP---DKKVLKDGDIVNIDVGVIYDG 100


>gnl|CDD|180918 PRK07281, PRK07281, methionine aminopeptidase; Reviewed.
          Length = 286

 Score = 48.7 bits (116), Expect = 4e-06
 Identities = 61/240 (25%), Positives = 94/240 (39%), Gaps = 48/240 (20%)

Query: 344 IKPGMTMIEIVEELEKYSR------LLIGEDGLKAGLAFPTGCSRNHCAAH-----YTPN 392
           IKPG+ M E+ E + +  +      L IG DG      + T C  N   AH     Y   
Sbjct: 31  IKPGVDMWEVEEYVRRRCKEENVLPLQIGVDGAMMDYPYATCCGLNDEVAHAFPRHYILK 90

Query: 393 NGDKTVLEFD------------DITKIDFGT---------HINGRIIDCAFTLAYNNKYD 431
            GD  +L+ D            D++K++F              G + D  +  A     D
Sbjct: 91  EGD--LLKVDMVLSEPLDKSIVDVSKLNFDNVEQMKKYTESYRGGLADSCWAYAVGTPSD 148

Query: 432 K---LIEAVRDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGH 488
           +   L++  ++A   GI+ A +   + ++GAAIQE  ES       + Y V  +R+L GH
Sbjct: 149 EVKNLMDVTKEAMYRGIEQAVVGNRIGDIGAAIQEYAES-------RGYGV--VRDLVGH 199

Query: 489 SIDSYRIHAGKTVPIV-RGGEGTLMEENEFYAIETFGSTGKGMVHDDMDVSHYMKNFEAS 547
            +    +H    VP     G G  + E     IE   +TG   +  DM      K  +  
Sbjct: 200 GVGP-TMHEEPMVPNYGTAGRGLRLREGMVLTIEPMINTGTWEIDTDMKTGWAHKTLDGG 258



 Score = 31.0 bits (70), Expect = 1.7
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 9/71 (12%)

Query: 167 IKPGMTMIEIVEELEKYSR------LLIGEDGLKAGLAFPTGCSRNHCAAHYTPNNGDKT 220
           IKPG+ M E+ E + +  +      L IG DG      + T C  N   AH  P +    
Sbjct: 31  IKPGVDMWEVEEYVRRRCKEENVLPLQIGVDGAMMDYPYATCCGLNDEVAHAFPRH---Y 87

Query: 221 VLEFDDITKID 231
           +L+  D+ K+D
Sbjct: 88  ILKEGDLLKVD 98


>gnl|CDD|238520 cd01087, Prolidase, Prolidase. E.C. 3.4.13.9. Also known as Xaa-Pro
           dipeptidase, X-Pro dipeptidase, proline dipeptidase.,
           imidodipeptidase, peptidase D, gamma-peptidase.
           Catalyses hydrolysis of Xaa-Pro dipeptides; also acts on
           aminoacyl-hydroxyproline analogs. No action on Pro-Pro.
          Length = 243

 Score = 43.7 bits (104), Expect = 1e-04
 Identities = 35/124 (28%), Positives = 46/124 (37%), Gaps = 15/124 (12%)

Query: 329 AAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCAAH 388
           +AEAHR   K      +PGM+  E+  E E   R      G +   ++      N    H
Sbjct: 11  SAEAHRAAMKA----SRPGMSEYELEAEFEYEFR----SRGARLAYSYIVAAGSNAAILH 62

Query: 389 YTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDK----LIEAVRDATNTG 444
           Y  N+     L+  D+  ID G    G   D   T   N K+      L EAV  A    
Sbjct: 63  YVHNDQ---PLKDGDLVLIDAGAEYGGYASDITRTFPVNGKFTDEQRELYEAVLAAQKAA 119

Query: 445 IKAA 448
           I A 
Sbjct: 120 IAAC 123



 Score = 37.9 bits (89), Expect = 0.010
 Identities = 18/67 (26%), Positives = 26/67 (38%), Gaps = 8/67 (11%)

Query: 152 AAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCAAH 211
           +AEAHR   K      +PGM+  E+  E E   R      G +   ++      N    H
Sbjct: 11  SAEAHRAAMKA----SRPGMSEYELEAEFEYEFR----SRGARLAYSYIVAAGSNAAILH 62

Query: 212 YTPNNGD 218
           Y  N+  
Sbjct: 63  YVHNDQP 69


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 39.5 bits (92), Expect = 0.006
 Identities = 24/101 (23%), Positives = 44/101 (43%), Gaps = 5/101 (4%)

Query: 5    KGEEVAKKKKKKKNKNKAAA---ATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVE 61
            K EE  +K ++   + ++ A    + +APK   P++TT  +   S+   +       + E
Sbjct: 1174 KAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASE-SETTEETYGSSAMETE 1232

Query: 62   DGAAEASKKKKRKKKKKKAGAGEAAADVPIDGNGIAEQLEQ 102
            +  AE  K K R   KKKA A     +   +   + ++L  
Sbjct: 1233 N-VAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAA 1272



 Score = 32.9 bits (75), Expect = 0.65
 Identities = 20/70 (28%), Positives = 26/70 (37%), Gaps = 5/70 (7%)

Query: 23   AAATEEAPKNGVPQQTTPPS--IPVSQLYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKA 80
            A   E        +          VS +   DD+ +D  VE   AE  KKK  +K    A
Sbjct: 1283 AKMEETVKAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKP---A 1339

Query: 81   GAGEAAADVP 90
             A + AA  P
Sbjct: 1340 AANKKAAKPP 1349



 Score = 32.5 bits (74), Expect = 0.81
 Identities = 20/102 (19%), Positives = 34/102 (33%), Gaps = 19/102 (18%)

Query: 7    EEVAKKKKKKKNKNKAAAATEE------APKNGVPQQTTPPSIPVSQLYPDDDEIEDGDV 60
            E    ++ KKK   K AAA ++      A K   P         ++++       E   +
Sbjct: 1323 EVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEML---KPAEAIGI 1379

Query: 61   EDGAAEASKKKKRKKKK----KKAGAGEAAADVPIDGNGIAE 98
                    +KK RK +     KK+G+    A    +      
Sbjct: 1380 SP------EKKVRKMRASPFNKKSGSVLGRAATNKETESSEN 1415



 Score = 31.0 bits (70), Expect = 2.8
 Identities = 15/49 (30%), Positives = 29/49 (59%), Gaps = 1/49 (2%)

Query: 36   QQTTPPSIPVSQLYPDDDEIEDGDVEDGAAEASKKK-KRKKKKKKAGAG 83
            ++TTP S+ +  L   + E++  D ED  AE +++K +R   + ++GA 
Sbjct: 1146 KKTTPKSLWLKDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAA 1194


>gnl|CDD|183434 PRK12318, PRK12318, methionine aminopeptidase; Provisional.
          Length = 291

 Score = 36.7 bits (85), Expect = 0.027
 Identities = 48/195 (24%), Positives = 74/195 (37%), Gaps = 28/195 (14%)

Query: 345 KPGMTMIEIVEELEKYSRLLIGE-DGLKAGL-----AFP-TGC-SRNHCAAHYTPNNGDK 396
           K G+T      EL++ SR L  E + + A L      FP T C S N    H  PN+   
Sbjct: 71  KEGVT----TNELDELSRELHKEYNAIPAPLNYGSPPFPKTICTSLNEVICHGIPND--- 123

Query: 397 TVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYD---KLIEAVRDATNTGIKAAGIDVP 453
             L+  DI  ID    ++G   DC+  +      +   K+ +A  +  N  I      +P
Sbjct: 124 IPLKNGDIMNIDVSCIVDGYYGDCSRMVMIGEVSEIKKKVCQASLECLNAAIAILKPGIP 183

Query: 454 LCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTVPIVRGGEGTLME 513
           L E+G  I+   + Y   +         +    GH +   + H    VP  R      + 
Sbjct: 184 LYEIGEVIENCADKYGFSV---------VDQFVGHGV-GIKFHENPYVPHHRNSSKIPLA 233

Query: 514 ENEFYAIETFGSTGK 528
               + IE   + GK
Sbjct: 234 PGMIFTIEPMINVGK 248


>gnl|CDD|171806 PRK12897, PRK12897, methionine aminopeptidase; Reviewed.
          Length = 248

 Score = 36.6 bits (84), Expect = 0.028
 Identities = 59/213 (27%), Positives = 82/213 (38%), Gaps = 41/213 (19%)

Query: 322 MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKA--GLAFPTG 379
           M+   +  A  HR++ K M    KPG+T  EI   +E Y          K   G  +   
Sbjct: 13  MHESGKLLASCHREIAKIM----KPGITTKEINTFVEAYLEKHGATSEQKGYNGYPYAIC 68

Query: 380 CSRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYD---KLIEA 436
            S N    H  P +     L   DI  ID   ++NG + D A+T       D   KL+  
Sbjct: 69  ASVNDEMCHAFPAD---VPLTEGDIVTIDMVVNLNGGLSDSAWTYRVGKVSDEAEKLLLV 125

Query: 437 VRDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIH 496
             +A   GI  A I   + ++G AI    ESY   +  + + V   R+  GH I      
Sbjct: 126 AENALYKGIDQAVIGNRVGDIGYAI----ESY---VANEGFSVA--RDFTGHGI------ 170

Query: 497 AGKTVPIVRGGEGTLMEENEFYAIETFGSTGKG 529
            GK             E +E  AI  FG  G+G
Sbjct: 171 -GK-------------EIHEEPAIFHFGKQGQG 189


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 36.5 bits (85), Expect = 0.038
 Identities = 17/85 (20%), Positives = 30/85 (35%)

Query: 4   QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDG 63
           +K +++A K K K    K         K   P+Q     I +S +  D D+  + D+   
Sbjct: 18  KKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKK 77

Query: 64  AAEASKKKKRKKKKKKAGAGEAAAD 88
             + + K    K   K    +    
Sbjct: 78  KTKTAAKAAAAKAPAKKKLKDELDS 102



 Score = 29.6 bits (67), Expect = 5.6
 Identities = 21/87 (24%), Positives = 30/87 (34%), Gaps = 3/87 (3%)

Query: 5   KGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGA 64
           K E  + KK +KKN               V  Q             DDD+I+D D ++  
Sbjct: 97  KDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDD 156

Query: 65  AEASKKKKRKK---KKKKAGAGEAAAD 88
            E            +KK+A   E  +D
Sbjct: 157 DEDDDDDDVDDEDEEKKEAKELEKLSD 183


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 34.6 bits (80), Expect = 0.19
 Identities = 21/89 (23%), Positives = 31/89 (34%)

Query: 4   QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDG 63
           +   E    K  K+++       EE      P   T  S+  S    D  +     ++  
Sbjct: 464 EAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKA 523

Query: 64  AAEASKKKKRKKKKKKAGAGEAAADVPID 92
           A + SK   + KKKKK        D  ID
Sbjct: 524 ANKISKAAVKVKKKKKKEKSIDLDDDLID 552


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
          (DUF1754).  This is a eukaryotic protein family of
          unknown function.
          Length = 90

 Score = 31.6 bits (72), Expect = 0.19
 Identities = 18/75 (24%), Positives = 37/75 (49%), Gaps = 1/75 (1%)

Query: 5  KGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVS-QLYPDDDEIEDGDVEDG 63
          KG+++  KKKKKK K K  +  E   +    ++++  S     +   D+++IE  +    
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMN 72

Query: 64 AAEASKKKKRKKKKK 78
            EA +  +  ++K+
Sbjct: 73 LTEAERAFEEAQRKR 87


>gnl|CDD|238295 cd00530, PTE, Phosphotriesterase (PTE) catalyzes the hydrolysis of
           organophosphate nerve agents, including the chemical
           warfare agents VX, soman, and sarin as well as the
           insecticide paraoxon. PTE exists as a homodimer with one
           active site per monomer. The active site is located next
           to a binuclear metal center, at the C-terminal end of a
           TIM alpha- beta barrel motif.  The native enzyme
           contains two zinc ions at the active site however these
           can be replaced with other metals such as cobalt,
           cadmium, nickel or manganese and the enzyme remains
           active.
          Length = 293

 Score = 32.6 bits (75), Expect = 0.50
 Identities = 13/60 (21%), Positives = 22/60 (36%), Gaps = 15/60 (25%)

Query: 125 TGKDRFTNEEKKAIDRLHNDMYNEARQAAEAHRQ----VRKHMMGWIKPGMTMIEIVEEL 180
            G    T  E+K +           R AA A ++    +  H    +  G+  + I+EE 
Sbjct: 125 GGSPAITPLEEKVL-----------RAAARAQKETGVPISTHTQAGLTMGLEQLRILEEE 173



 Score = 32.6 bits (75), Expect = 0.50
 Identities = 13/60 (21%), Positives = 22/60 (36%), Gaps = 15/60 (25%)

Query: 302 TGKDRFTNEEKKAIDRLHNDMYNEARQAAEAHRQ----VRKHMMGWIKPGMTMIEIVEEL 357
            G    T  E+K +           R AA A ++    +  H    +  G+  + I+EE 
Sbjct: 125 GGSPAITPLEEKVL-----------RAAARAQKETGVPISTHTQAGLTMGLEQLRILEEE 173


>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
           transduction mechanisms].
          Length = 208

 Score = 31.4 bits (71), Expect = 0.99
 Identities = 21/98 (21%), Positives = 33/98 (33%), Gaps = 11/98 (11%)

Query: 5   KGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGA 64
           K    A++ +++  K + A A  E P    P     P                 +V    
Sbjct: 107 KARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKA----------PREEVHTPV 156

Query: 65  AEASKKKKRKKKKKKAGAGEAAADV-PIDGNGIAEQLE 101
           ++ S     +  K KAG     A V  I  +G+  QL 
Sbjct: 157 SDISALTVGQALKVKAGQNAMDATVLEITKDGVRVQLN 194


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 31.4 bits (72), Expect = 1.1
 Identities = 8/45 (17%), Positives = 18/45 (40%)

Query: 4   QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQL 48
           +   E  K  ++++     A   +  P+   P+    P   +S+L
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISEL 167



 Score = 29.9 bits (68), Expect = 3.3
 Identities = 8/43 (18%), Positives = 17/43 (39%)

Query: 238 GEEVAKKKKKKKNKNKAAAATEEAPKNGIPQQTTPPSIPVSQL 280
             E  K  ++++     A   +  P+   P+    P   +S+L
Sbjct: 125 AGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISEL 167



 Score = 29.1 bits (66), Expect = 5.5
 Identities = 20/104 (19%), Positives = 35/104 (33%), Gaps = 23/104 (22%)

Query: 4   QKGEEVAKKK-----KKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDG 58
           Q+ E+ AKK+     K+K  + +     +   K   P+   P                  
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP------------ 160

Query: 59  DVEDGAAEASKKKKRKKKKKKAGAGEAAADV-PIDGNGIAEQLE 101
                 ++ S+    +  K KAG     A V  I  + +  QL+
Sbjct: 161 -----VSDISELTVGQAVKVKAGKSAMDATVLEITKDDVRVQLD 199


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 30.6 bits (69), Expect = 1.2
 Identities = 16/72 (22%), Positives = 28/72 (38%), Gaps = 1/72 (1%)

Query: 10  AKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPD-DDEIEDGDVEDGAAEAS 68
           AK ++K+  K +A        K  +                D DDE ++ D    +A   
Sbjct: 39  AKAQEKEHEKERAEREEAREKKAKISPNALRGGATAGHGEEDTDDEEDEEDFATPSAVPQ 98

Query: 69  KKKKRKKKKKKA 80
             KK +K+++K 
Sbjct: 99  WGKKARKRQRKV 110


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 29.5 bits (67), Expect = 1.8
 Identities = 14/44 (31%), Positives = 23/44 (52%)

Query: 226 DITKIDFGTHINGEEVAKKKKKKKNKNKAAAATEEAPKNGIPQQ 269
           DI+ +     IN E   ++KKKKK K++     E+  K  + +Q
Sbjct: 22  DISGVKVPEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQ 65


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 31.4 bits (72), Expect = 1.9
 Identities = 21/135 (15%), Positives = 44/135 (32%), Gaps = 1/135 (0%)

Query: 7   EEVAKK-KKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGAA 65
           EE A+K + K++ + +A A  EEA      ++    +   +      +     +     A
Sbjct: 40  EEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPA 99

Query: 66  EASKKKKRKKKKKKAGAGEAAADVPIDGNGIAEQLEQCGLNGNFPVGQEMEYPTLKDGRT 125
           E    +  +   ++  A +AA           +   +    G    G +         R 
Sbjct: 100 EDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRR 159

Query: 126 GKDRFTNEEKKAIDR 140
            K +   +  + I R
Sbjct: 160 RKKKKKQKPTEKIPR 174


>gnl|CDD|152339 pfam11903, DUF3423, Protein of unknown function (DUF3423).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria. Proteins in this
           family are typically between 73 to 118 amino acids in
           length. This protein appears to be related to
           ribbon-helix-helix DNA-binding domains, suggesting these
           proteins may also bind DNA.
          Length = 73

 Score = 28.4 bits (64), Expect = 1.9
 Identities = 9/37 (24%), Positives = 22/37 (59%), Gaps = 2/37 (5%)

Query: 140 RLHNDMYNEARQAAEA-HRQVRKHMMGWIKPGMTMIE 175
           ++ +D++ +AR+A++A  R +   +  W + G  + E
Sbjct: 5   KISDDLHEQARRASKAMSRSINAQIEHWARIGR-LAE 40



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 9/37 (24%), Positives = 22/37 (59%), Gaps = 2/37 (5%)

Query: 317 RLHNDMYNEARQAAEA-HRQVRKHMMGWIKPGMTMIE 352
           ++ +D++ +AR+A++A  R +   +  W + G  + E
Sbjct: 5   KISDDLHEQARRASKAMSRSINAQIEHWARIGR-LAE 40


>gnl|CDD|238017 cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic
           and prokaryotic proteins. Putative nuclear signalling
           domain. FHA domains may bind phosphothreonine,
           phosphoserine and sometimes phosphotyrosine. In
           eukaryotes, many FHA domain-containing proteins localize
           to the nucleus, where they participate in establishing
           or maintaining cell cycle checkpoints, DNA repair, or
           transcriptional regulation. Members of the FHA family
           include: Dun1, Rad53,  Cds1, Mek1,
           KAPP(kinase-associated protein phosphatase),and Ki-67 (a
           human nuclear protein related to cell proliferation).
          Length = 102

 Score = 29.3 bits (66), Expect = 2.1
 Identities = 16/69 (23%), Positives = 25/69 (36%), Gaps = 6/69 (8%)

Query: 179 ELEKYSRLLIGEDGLKAGLAFP-TGCSRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHIN 237
            L+      IG D     +       SR H    Y  + G   V+  D  +    GT +N
Sbjct: 17  YLDPGGTYTIGRDSDNCDIVLDDPSVSRRHAVIRYDGDGG---VVLIDLGST--NGTFVN 71

Query: 238 GEEVAKKKK 246
           G+ V+  + 
Sbjct: 72  GQRVSPGEP 80


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 30.9 bits (70), Expect = 2.5
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 51  DDDEIEDGDVEDGAAEASKKKKRKKKKKKAGA 82
           DD E +D +  DG  + +KKK +KK+++K  +
Sbjct: 165 DDGEDDDEEGSDGEEDGAKKKAKKKRQRKPES 196


>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 709

 Score = 30.8 bits (69), Expect = 2.6
 Identities = 23/81 (28%), Positives = 30/81 (37%), Gaps = 23/81 (28%)

Query: 3   IQKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQ----QTTPPSIPVSQL---------- 48
           IQ   E     + + +KNKAA    +AP + VP     Q TP    V             
Sbjct: 457 IQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHEAPAEP 516

Query: 49  -----YPDDDEIEDGDVEDGA 64
                +PD+D       EDGA
Sbjct: 517 FYGYGFPDNDC----PPEDGA 533


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 30.7 bits (70), Expect = 2.9
 Identities = 11/39 (28%), Positives = 20/39 (51%)

Query: 49  YPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAGEAAA 87
            P   E E+ + E+G    +++KK +KK++KA       
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKE 428


>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
           Provisional.
          Length = 291

 Score = 30.1 bits (67), Expect = 2.9
 Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 1/72 (1%)

Query: 11  KKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPD-DDEIEDGDVEDGAAEASK 69
           K   + K    ++  T++        +   P +P  +   D DD+ E+GD +DG AE  +
Sbjct: 40  KSPPESKGLETSSQGTQDLKGGAAGAKENSPPLPTEEDDEDVDDDSEEGDDDDGGAEDEE 99

Query: 70  KKKRKKKKKKAG 81
           ++K + +  + G
Sbjct: 100 EEKVRGQSGQEG 111


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 30.6 bits (69), Expect = 3.2
 Identities = 19/84 (22%), Positives = 32/84 (38%), Gaps = 7/84 (8%)

Query: 2   LIQKGEEVAKKKKKKKNKNKAAAATEE--APKNGVPQQTTPPSIPVSQLYPDDDEIEDGD 59
           ++    +   K+KK+K +   +   EE               S   + L  D+DE E   
Sbjct: 217 IMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKP 276

Query: 60  V-----EDGAAEASKKKKRKKKKK 78
                  +   E  +K+KRK+ KK
Sbjct: 277 SGERSDSEEETEEKEKEKRKRLKK 300


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 30.1 bits (68), Expect = 3.3
 Identities = 28/145 (19%), Positives = 49/145 (33%), Gaps = 24/145 (16%)

Query: 1   MLIQKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDV 60
           M+ +K +E   K+K++K   ++A + E +P +  P  + P +  +        E      
Sbjct: 127 MIERKPKEPGLKRKQRKKAQESATSPESSPSST-PNSSRPSTPHLL----KAKEGPSRRA 181

Query: 61  EDGA---------AEASKK-----KKRKKKKKKAGAGEAAADVPI-DGNGIAEQLEQCGL 105
           +  A          E S K     KK  KK +K        D  + D +      E    
Sbjct: 182 KKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRKWDLDGDEDDDAVLDYSAPDANDEN--- 238

Query: 106 NGNFPVGQEMEYPTLKDGRTGKDRF 130
                  +E++        T K  F
Sbjct: 239 ADAPEDVEEVD-QESWGRGTMKGDF 262


>gnl|CDD|232910 TIGR00298, TIGR00298, 2-phosphosulfolactate phosphatase.
           2-phosphosulfolactate phosphatase catalyzes the
           sulfonation of phosphoenolpyruvate to form
           2-phospho-3-sulfolactate, the second step in coenzyme M
           biosynthesis. Coenzyme M is the terminal methyl carrier
           in methanogenesis [Biosynthesis of cofactors, prosthetic
           groups, and carriers, Other, Energy metabolism,
           Methanogenesis].
          Length = 216

 Score = 29.7 bits (67), Expect = 3.4
 Identities = 25/80 (31%), Positives = 31/80 (38%), Gaps = 11/80 (13%)

Query: 174 IEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCAAHYTPNNGDKT---------VLEF 224
           IE   EL+K + +  GE   K    F  G S      +   NNGD           VLE 
Sbjct: 30  IEEAMELKKENAIKAGERNGKKIEGFDVGNSPLEVLNY--KNNGDNLVLTTTNGTRVLES 87

Query: 225 DDITKIDFGTHINGEEVAKK 244
                I  G+  N + VAKK
Sbjct: 88  IKSDNILVGSINNAKAVAKK 107


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 30.4 bits (68), Expect = 3.6
 Identities = 18/94 (19%), Positives = 30/94 (31%), Gaps = 18/94 (19%)

Query: 2   LIQKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIP---VSQLYPDDDEIEDG 58
           L    E        KK   ++        K    +     S P    S+L  ++D   D 
Sbjct: 56  LAVDEEPDENGAVSKKKPTRS-------VKRATKKTVVEISEPLEEGSELVVNEDAALD- 107

Query: 59  DVEDGAAEASKKKKRKKKKKKAGAGEAAADVPID 92
                  + SKK  R+ ++K A A     +   +
Sbjct: 108 -------KESKKTPRRTRRKAAAASSDVEEEKTE 134


>gnl|CDD|226291 COG3768, COG3768, Predicted membrane protein [Function unknown].
          Length = 350

 Score = 30.1 bits (68), Expect = 3.7
 Identities = 14/58 (24%), Positives = 23/58 (39%)

Query: 26 TEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAG 83
           E APK   P++    +   +    D ++ ED    +   EA  + +    K   GAG
Sbjct: 14 VEAAPKFRAPREFDEDAENFTPAELDVEDPEDEGAAEAPVEAPLRPRSSFWKIMLGAG 71


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 29.6 bits (67), Expect = 4.2
 Identities = 19/72 (26%), Positives = 25/72 (34%), Gaps = 6/72 (8%)

Query: 7   EEVAKKKKKKKNKN--KAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGA 64
           EE AKKKKKKK K   K  A    A         T  S    Q+       +   +    
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAP-- 235

Query: 65  AEASKKKKRKKK 76
              ++K +  K 
Sbjct: 236 --DNEKSEVYKS 245


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 29.0 bits (65), Expect = 4.8
 Identities = 12/40 (30%), Positives = 19/40 (47%)

Query: 47  QLYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAGEAA 86
            L+P++ E E  +V     E    K +K+KK+K     A 
Sbjct: 89  NLFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAE 128


>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor.  This family is secreted by
           gram-negative Gammaproteobacteria such as Pseudomonas
           syringae of tomato and the fire blight plant pathogen
           Erwinia amylovora, amongst others. It is an essential
           pathogenicity factor of approximately 198 kDa. Its
           injection into the host-plant is dependent upon the
           bacterial type III or Hrp secretion system. The family
           is long and carries a number of predicted functional
           regions, including an ERMS or endoplasmic reticulum
           membrane retention signal at both the C- and the
           N-termini, a leucine-zipper motif from residues 539-560,
           and a nuclear localisation signal at 1358-1361. this
           conserved AvrE-family of effectors is among the few that
           are required for full virulence of many phytopathogenic
           pseudomonads, erwinias and pantoeas.
          Length = 1771

 Score = 30.1 bits (68), Expect = 4.8
 Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 7/120 (5%)

Query: 2   LIQKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVE 61
           L  +G++  KK  K   K+ A    + AP   +      P         +   + + D E
Sbjct: 43  LASEGKKNRKKMPKVFQKSSAPRQIQAAPPQALNPTAAAPQSSRGPTLRELLALPEDDGE 102

Query: 62  DGAAEASKKKKR-------KKKKKKAGAGEAAADVPIDGNGIAEQLEQCGLNGNFPVGQE 114
             A E+S   +R        + + +  AG        D     + L +   +   PV +E
Sbjct: 103 TQAPESSPSARRLTRSEGVARHEMEDLAGRPVVKPDADRQLRQDILNKSSSSRRPPVSKE 162


>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 694

 Score = 29.9 bits (67), Expect = 5.3
 Identities = 13/57 (22%), Positives = 17/57 (29%)

Query: 35  PQQTTPPSIPVSQLYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAGEAAADVPI 91
           PQ T PP I V  L   +    D          +           A  G +   VP+
Sbjct: 303 PQSTIPPRIVVPWLPDTEGREGDDLAPTAVCTDAGLLPDTPLLPDAPEGSSDPVVPV 359


>gnl|CDD|182186 PRK09990, PRK09990, DNA-binding transcriptional regulator GlcC;
           Provisional.
          Length = 251

 Score = 29.3 bits (66), Expect = 5.4
 Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 5/54 (9%)

Query: 134 EKKAIDRLHNDMYNE--ARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSR 185
           +KK IDR H  +YN    R   +A R  R H+         + EI +E ++  R
Sbjct: 194 QKKQIDRQHARLYNAVLQRLPEQAQRAARDHIRTVRD---NLREIEQEEQRLVR 244



 Score = 29.3 bits (66), Expect = 5.4
 Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 5/54 (9%)

Query: 311 EKKAIDRLHNDMYNE--ARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSR 362
           +KK IDR H  +YN    R   +A R  R H+         + EI +E ++  R
Sbjct: 194 QKKQIDRQHARLYNAVLQRLPEQAQRAARDHIRTVRD---NLREIEQEEQRLVR 244


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
          protein SURF-6 is shown to be a component of the
          nucleolar matrix and has a strong binding capacity for
          nucleic acids.
          Length = 206

 Score = 28.8 bits (65), Expect = 6.3
 Identities = 19/85 (22%), Positives = 32/85 (37%), Gaps = 13/85 (15%)

Query: 4  QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDV--- 60
          ++ +  A+KK+K+K   K   A +         +        S+      E  +G++   
Sbjct: 14 KREQRKARKKQKRKEAKKKEDAQKSE------AEEVKNEENKSKKKAAPIENAEGNIVFS 67

Query: 61 ----EDGAAEASKKKKRKKKKKKAG 81
               DG       K +KKKKKK  
Sbjct: 68 KVEFADGEQAKKDLKLKKKKKKKKT 92


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 29.2 bits (65), Expect = 7.3
 Identities = 22/81 (27%), Positives = 33/81 (40%), Gaps = 20/81 (24%)

Query: 6   GEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVE---- 61
            EE +KKKKKK  KNK     ++  K G                 D+ + +DGD E    
Sbjct: 237 DEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDA------------DEYDSDDGDDEGREE 284

Query: 62  ----DGAAEASKKKKRKKKKK 78
               D +A  +  ++R+ K  
Sbjct: 285 DYISDSSASGNDPEEREDKLS 305


>gnl|CDD|222793 PHA00437, PHA00437, tail assembly protein.
          Length = 94

 Score = 27.6 bits (61), Expect = 7.3
 Identities = 21/81 (25%), Positives = 32/81 (39%), Gaps = 4/81 (4%)

Query: 2  LIQKGEEVAKKKKKKKNKN--KAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGD 59
          L +K ++  KK  K  +K    A A      K+  P        P   +   + +  D +
Sbjct: 3  LFKKIKKAVKKVTKSVSKPVVGAVAGALGGGKDAPPVVVAAAPPPAQIVEVPEKD--DVE 60

Query: 60 VEDGAAEASKKKKRKKKKKKA 80
           ED A   S KKK +   KK+
Sbjct: 61 TEDEAQTESGKKKARAGGKKS 81


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 29.2 bits (65), Expect = 7.4
 Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 1/84 (1%)

Query: 4   QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDG 63
           ++G E   +++K   K KAAAA +        Q+       V++   + D+ +       
Sbjct: 113 REGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTE-EVTEEEEETDKEKAKAKAAA 171

Query: 64  AAEASKKKKRKKKKKKAGAGEAAA 87
           AA+A      K+K  +AG G    
Sbjct: 172 AAKAKAAALAKQKAAEAGEGTEEV 195


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 28.1 bits (63), Expect = 7.7
 Identities = 10/30 (33%), Positives = 15/30 (50%)

Query: 58  GDVEDGAAEASKKKKRKKKKKKAGAGEAAA 87
            + + G    +   K+KK   +A A EAAA
Sbjct: 111 AEADGGPTAEATTPKKKKAAAEAAAAEAAA 140



 Score = 28.1 bits (63), Expect = 9.7
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 64  AAEASKKKKRKKKKKKAGAGEAAADVP 90
            AEA+  KK+K   + A A  AA    
Sbjct: 118 TAEATTPKKKKAAAEAAAAEAAAPAAE 144


>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 264

 Score = 29.0 bits (65), Expect = 7.8
 Identities = 12/76 (15%), Positives = 29/76 (38%), Gaps = 2/76 (2%)

Query: 4   QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDG 63
           Q  +   +++ + +++   + AT    +   P+       PV+   P  + +E    +  
Sbjct: 117 QPEQARIEEQPRTQSQKAQSQATTVQTQPVKPKPRPEKPQPVAPA-PAPEPVEK-APKAE 174

Query: 64  AAEASKKKKRKKKKKK 79
           AA   K K     + +
Sbjct: 175 AAPPPKPKAEDAAETR 190


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
          are designated YL1. These proteins have been shown to
          be DNA-binding and may be a transcription factor.
          Length = 238

 Score = 28.9 bits (65), Expect = 8.2
 Identities = 12/30 (40%), Positives = 19/30 (63%)

Query: 51 DDDEIEDGDVEDGAAEASKKKKRKKKKKKA 80
          +DDE E  D E+G  E  ++++ KKKK+  
Sbjct: 62 EDDEPESDDEEEGEKELQREERLKKKKRVK 91


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 29.0 bits (65), Expect = 8.5
 Identities = 24/118 (20%), Positives = 36/118 (30%), Gaps = 34/118 (28%)

Query: 6   GEEVAKKKKKKKNKNK----------AAAATEEAPKNGVPQQTT-----------PPSIP 44
            E V  KK KKK K K                E   +                   P   
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKR 329

Query: 45  VSQLYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAGEAAADVPIDGNGIAEQLEQ 102
             +   DD   +D   +D  A  +K+++  +KK+K    E           IA Q+ +
Sbjct: 330 KEEQEDDDFVEDD---DDLQASLAKQRRLAQKKRKKLRPED----------IARQIAE 374


>gnl|CDD|176005 cd04040, C2D_Tricalbin-like, C2 domain fourth repeat present in
           Tricalbin-like proteins.  5 to 6 copies of the C2 domain
           are present in Tricalbin, a yeast homolog of
           Synaptotagmin, which is involved in membrane trafficking
           and sorting.  C2 domains fold into an 8-standed
           beta-sandwich that can adopt 2 structural arrangements:
           Type I and Type II, distinguished by a circular
           permutation involving their N- and C-terminal beta
           strands. Many C2 domains are Ca2+-dependent
           membrane-targeting modules that bind a wide variety of
           substances including bind phospholipids, inositol
           polyphosphates, and intracellular proteins.  Most C2
           domain proteins are either signal transduction enzymes
           that contain a single C2 domain, such as protein kinase
           C, or membrane trafficking proteins which contain at
           least two C2 domains, such as synaptotagmin 1.  However,
           there are a few exceptions to this including RIM
           isoforms and some splice variants of piccolo/aczonin and
           intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions. This cd contains the fifth C2
           repeat, C2E, and has a type-II topology.
          Length = 115

 Score = 27.5 bits (62), Expect = 8.7
 Identities = 9/18 (50%), Positives = 12/18 (66%)

Query: 235 HINGEEVAKKKKKKKNKN 252
           ++NGE+V K K  KK  N
Sbjct: 27  YLNGEKVFKTKTIKKTLN 44


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 29.0 bits (65), Expect = 9.0
 Identities = 15/84 (17%), Positives = 26/84 (30%), Gaps = 18/84 (21%)

Query: 5   KGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGA 64
           K E  A  K   + K KA A  ++                      +  +    + +  A
Sbjct: 186 KAEAEAAAKAAAEAKKKAEAEAKKKAAA------------------EAKKKAAAEAKAAA 227

Query: 65  AEASKKKKRKKKKKKAGAGEAAAD 88
           A+A+ + K   +K  A      A 
Sbjct: 228 AKAAAEAKAAAEKAAAAKAAEKAA 251


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 29.2 bits (65), Expect = 9.3
 Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 2/42 (4%)

Query: 243 KKKKKKKNKNKAAAATEEAPKNGIPQQTTP--PSIPVSQLYP 282
           KK KKKK+  K  A   + P     ++        P+S++ P
Sbjct: 124 KKNKKKKSAKKKEAHKAQIPPEQQMEEKPSADAYEPLSRVIP 165


>gnl|CDD|215527 PLN02976, PLN02976, amine oxidase.
          Length = 1713

 Score = 29.1 bits (65), Expect = 9.6
 Identities = 19/75 (25%), Positives = 35/75 (46%), Gaps = 7/75 (9%)

Query: 19  KNKAAAATEEAPKNGVPQQTTPPSI--PVSQLYPDDD-EIEDGDVEDGAAEASKKKKRKK 75
           ++  +     +  + V    +P SI    ++ Y +D   + D +++DG     ++  RK 
Sbjct: 333 EDSPSHDIVGS--HPVCNGASPVSITPEENESYVEDAVSLPDSEIKDGKLSKLQRVTRKA 390

Query: 76  KKKKAG--AGEAAAD 88
           KK+K G  A E  AD
Sbjct: 391 KKRKLGDMAYEGDAD 405


>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
          Length = 763

 Score = 29.1 bits (65), Expect = 9.7
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 58  GDVEDGAAEASKKKKRKKKKKKAGAGEAA 86
            + E+ A ++ KK K+ KKKK  G G +A
Sbjct: 238 AEAENEAGKSDKKDKKSKKKKVLGKGTSA 266


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 28.6 bits (64), Expect = 9.8
 Identities = 26/83 (31%), Positives = 32/83 (38%), Gaps = 17/83 (20%)

Query: 4   QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDG 63
           QK  E AK K+  + K KA A  E+  K    +Q                  E+      
Sbjct: 118 QKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQA-----------------EEEAKAKA 160

Query: 64  AAEASKKKKRKKKKKKAGAGEAA 86
           AAEA KK    KKK +A A   A
Sbjct: 161 AAEAKKKAAEAKKKAEAEAKAKA 183


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.133    0.388 

Gapped
Lambda     K      H
   0.267   0.0803    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 33,199,041
Number of extensions: 3329531
Number of successful extensions: 5350
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4931
Number of HSP's successfully gapped: 181
Length of query: 635
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 532
Effective length of database: 6,369,140
Effective search space: 3388382480
Effective search space used: 3388382480
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 62 (27.5 bits)