RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15498
(635 letters)
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 647 bits (1670), Expect = 0.0
Identities = 243/418 (58%), Positives = 299/418 (71%), Gaps = 30/418 (7%)
Query: 243 KKKKKKKNKNKAAAATEEAPKNGIP-------------------------QQTTPPSIPV 277
KKKKKKK K K +P +QT PP+IPV
Sbjct: 58 KKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQTQPPTIPV 117
Query: 278 SQLYPDGNFPVGQEMEYPTLKDGRTGKDRFTNEEKKAIDRLHNDMYNEARQAAEAHRQVR 337
S+ + DG +PVG+ EYP R ++EEK+ +++L + Y + R+AAE HRQVR
Sbjct: 118 SKQFKDGEYPVGEIQEYPGEN-----SSRTSSEEKRELEKLSEEQYQDLRRAAEVHRQVR 172
Query: 338 KHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCAAHYTPNNGDKT 397
++ IKPG+ +I+I E +E SR LI DGLK G AFPTGCS NHCAAHYTPN GDKT
Sbjct: 173 RYAQSVIKPGVKLIDICERIESKSRELIEADGLKCGWAFPTGCSLNHCAAHYTPNTGDKT 232
Query: 398 VLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDKLIEAVRDATNTGIKAAGIDVPLCEV 457
VL +DD+ K+DFGTH+NGRIIDCAFT+A+N KYD L++A +DATNTGIK AGIDV L ++
Sbjct: 233 VLTYDDVCKLDFGTHVNGRIIDCAFTVAFNPKYDPLLQATKDATNTGIKEAGIDVRLSDI 292
Query: 458 GAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTVPIVRGGEGTLMEENEF 517
GAAIQEV+ESYEVEI GKTY +KSIRNLNGHSI Y IH GK+VPIV+GGE T MEE E
Sbjct: 293 GAAIQEVIESYEVEIKGKTYPIKSIRNLNGHSIGPYIIHGGKSVPIVKGGENTRMEEGEL 352
Query: 518 YAIETFGSTGKGMVHDDMDVSHYMKNFEASYIPLRLASSKSLLNTINKNFGTLAFCKRWL 577
+AIETF STG+G V++D++ SHYMK+ A ++PLRL +K LL IN NFGTLAFC+RWL
Sbjct: 353 FAIETFASTGRGYVNEDLECSHYMKDPGAEFVPLRLPKAKQLLKHINTNFGTLAFCRRWL 412
Query: 578 DRAGCTKYQMALKDLCDKGIVEAYPPLCDIKGSYTAQFEHTIVLRPTCKEVISRGDDY 635
DR G ++ +ALK L D GIV YPPLCD++GSYT+Q EHTI+LRPTCKEV+SRGDDY
Sbjct: 413 DRLGQDRHLLALKQLVDAGIVNPYPPLCDVRGSYTSQMEHTILLRPTCKEVLSRGDDY 470
Score = 209 bits (534), Expect = 1e-60
Identities = 98/267 (36%), Positives = 133/267 (49%), Gaps = 51/267 (19%)
Query: 5 KGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDE-----IEDGD 59
E + K++K +NK ++ K+ + DDD I +
Sbjct: 2 MNENGENEVKQQKQQNKQKGTKKKNKKS-------------KKDVDDDDAFLAELISENQ 48
Query: 60 VEDGAAEASKKKKRKKKKKKAGAGEAAADV-----------PIDGNGIA-----EQLEQC 103
+ KKKK+KKKKKK GEA D + I + E
Sbjct: 49 EAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWK 108
Query: 104 GL------------NGNFPVGQEMEYPTLKDGRTGKDRFTNEEKKAIDRLHNDMYNEARQ 151
+G +PVG+ EYP R ++EEK+ +++L + Y + R+
Sbjct: 109 QTQPPTIPVSKQFKDGEYPVGEIQEYPGEN-----SSRTSSEEKRELEKLSEEQYQDLRR 163
Query: 152 AAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCAAH 211
AAE HRQVR++ IKPG+ +I+I E +E SR LI DGLK G AFPTGCS NHCAAH
Sbjct: 164 AAEVHRQVRRYAQSVIKPGVKLIDICERIESKSRELIEADGLKCGWAFPTGCSLNHCAAH 223
Query: 212 YTPNNGDKTVLEFDDITKIDFGTHING 238
YTPN GDKTVL +DD+ K+DFGTH+NG
Sbjct: 224 YTPNTGDKTVLTYDDVCKLDFGTHVNG 250
>gnl|CDD|238521 cd01088, MetAP2, Methionine Aminopeptidase 2. E.C. 3.4.11.18. Also
known as methionyl aminopeptidase and peptidase M.
Catalyzes release of N-terminal amino acids,
preferentially methionine, from peptides and arylamides.
Length = 291
Score = 478 bits (1232), Expect = e-167
Identities = 165/309 (53%), Positives = 205/309 (66%), Gaps = 18/309 (5%)
Query: 323 YNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSR 382
+ R+A E HRQVRK+ IKPGMT++EI E +E R L AG AFP S
Sbjct: 1 LEKYREAGEIHRQVRKYAQSLIKPGMTLLEIAEFVENRIRELG------AGPAFPVNLSI 54
Query: 383 NHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDKLIEAVRDATN 442
N CAAHYTPN GD TVL+ D+ K+DFG H++G I D AFT+ ++ KYD L+EA ++A N
Sbjct: 55 NECAAHYTPNAGDDTVLKEGDVVKLDFGAHVDGYIADSAFTVDFDPKYDDLLEAAKEALN 114
Query: 443 TGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTVP 502
IK AG DV L E+G AI+EV+ES Y K IRNL GHSI+ YR+HAGK++P
Sbjct: 115 AAIKEAGPDVRLGEIGEAIEEVIES---------YGFKPIRNLTGHSIERYRLHAGKSIP 165
Query: 503 IVRGGEGTLMEENEFYAIETFGSTGKGMVHDDMDVSHYMKNFEASYIPLRLASSKSLLNT 562
V+GGEGT +EE + YAIE F +TGKG VHD + S YM N + PLRL ++ LL+
Sbjct: 166 NVKGGEGTRLEEGDVYAIEPFATTGKGYVHDGPECSIYMLNRDK---PLRLPRARKLLDV 222
Query: 563 INKNFGTLAFCKRWLDRAGCTKYQMALKDLCDKGIVEAYPPLCDIKGSYTAQFEHTIVLR 622
I +NFGTL F +RWLDR G TK MALK+LC GIV YP L +I G Y AQFEHTI++R
Sbjct: 223 IYENFGTLPFARRWLDRLGETKLLMALKNLCKAGIVYPYPVLKEISGGYVAQFEHTIIVR 282
Query: 623 PTCKEVISR 631
KEV +R
Sbjct: 283 EDGKEVTTR 291
Score = 143 bits (363), Expect = 2e-38
Identities = 48/93 (51%), Positives = 59/93 (63%), Gaps = 6/93 (6%)
Query: 146 YNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSR 205
+ R+A E HRQVRK+ IKPGMT++EI E +E R L AG AFP S
Sbjct: 1 LEKYREAGEIHRQVRKYAQSLIKPGMTLLEIAEFVENRIRELG------AGPAFPVNLSI 54
Query: 206 NHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
N CAAHYTPN GD TVL+ D+ K+DFG H++G
Sbjct: 55 NECAAHYTPNAGDDTVLKEGDVVKLDFGAHVDG 87
>gnl|CDD|129592 TIGR00501, met_pdase_II, methionine aminopeptidase, type II.
Methionine aminopeptidase (map) is a cobalt-binding
enzyme. Bacterial and organellar examples (type I)
differ from eukaroytic and archaeal (type II) examples
in lacking a region of approximately 60 amino acids
between the 4th and 5th cobalt-binding ligands. The role
of this protein in general is to produce the mature
amino end of cytosolic proteins by removing the
N-terminal methionine. This model describes type II,
among which the eukaryotic members typically have an
N-terminal extension not present in archaeal members. It
can act cotranslationally. The enzyme from rat has been
shown to associate with translation initiation factor 2
(IF-2) and may have a role in translational regulation
[Protein fate, Protein modification and repair].
Length = 295
Score = 291 bits (747), Expect = 4e-94
Identities = 128/311 (41%), Positives = 182/311 (58%), Gaps = 18/311 (5%)
Query: 321 DMYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGC 380
+ + +A + H +VR+ I PG+ ++E+ E +E R L E AFP
Sbjct: 3 ERAEKWIEAGKIHSKVRREAADRIVPGVKLLEVAEFVENRIRELGAEP------AFPCNI 56
Query: 381 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDKLIEAVRDA 440
S N CAAH+TP GDKTV + D+ K+D G H++G I D A T+ ++YD L++A +DA
Sbjct: 57 SINECAAHFTPKAGDKTVFKDGDVVKLDLGAHVDGYIADTAITVDLGDQYDNLVKAAKDA 116
Query: 441 TNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKT 500
T IK V + E+G AIQEV+ES Y VK I NL GHS+ YR+H GK+
Sbjct: 117 LYTAIKEIRAGVRVGEIGKAIQEVIES---------YGVKPISNLTGHSMAPYRLHGGKS 167
Query: 501 VPIVRGGEGTLMEENEFYAIETFGSTGKGMVHDDMDVSHYMKNFEASYIPLRLASSKSLL 560
+P V+ + T +EE + AIE F + G G V D +VS Y E P+RL S+++LL
Sbjct: 168 IPNVKERDTTKLEEGDVVAIEPFATDGVGYVTDGGEVSIYAFLAER---PVRLDSARNLL 224
Query: 561 NTINKNFGTLAFCKRWLDRAGCTKYQMALKDLCDKGIVEAYPPLCDIKGSYTAQFEHTIV 620
TI++N+GTL F +RWLD+ G KY AL +L G++ YP L +I G Y AQ+EHTI+
Sbjct: 225 KTIDENYGTLPFARRWLDKLGDEKYLFALNNLIRHGLIYDYPVLNEISGGYVAQWEHTIL 284
Query: 621 LRPTCKEVISR 631
+ KEV ++
Sbjct: 285 VEEHGKEVTTK 295
Score = 86.8 bits (215), Expect = 9e-19
Identities = 35/95 (36%), Positives = 52/95 (54%), Gaps = 6/95 (6%)
Query: 144 DMYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGC 203
+ + +A + H +VR+ I PG+ ++E+ E +E R L E AFP
Sbjct: 3 ERAEKWIEAGKIHSKVRREAADRIVPGVKLLEVAEFVENRIRELGAEP------AFPCNI 56
Query: 204 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
S N CAAH+TP GDKTV + D+ K+D G H++G
Sbjct: 57 SINECAAHFTPKAGDKTVFKDGDVVKLDLGAHVDG 91
>gnl|CDD|181534 PRK08671, PRK08671, methionine aminopeptidase; Provisional.
Length = 291
Score = 258 bits (661), Expect = 2e-81
Identities = 102/299 (34%), Positives = 157/299 (52%), Gaps = 19/299 (6%)
Query: 322 MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCS 381
+ +A + +VR+ IKPG ++++ E +E I E G A AFP S
Sbjct: 1 ELEKYLEAGKIASKVREEAAKLIKPGAKLLDVAEFVENR----IRELG--AKPAFPCNIS 54
Query: 382 RNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDKLIEAVRDAT 441
N AAHYTP+ GD+ V D+ K+D G H++G I D A T+ KY+ L+EA +A
Sbjct: 55 INEVAAHYTPSPGDERVFPEGDVVKLDLGAHVDGYIADTAVTVDLGGKYEDLVEASEEAL 114
Query: 442 NTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTV 501
I+ V + E+G I+E + SY G K IRNL GH ++ Y +HAG ++
Sbjct: 115 EAAIEVVRPGVSVGEIGRVIEETIRSY-----G----FKPIRNLTGHGLERYELHAGPSI 165
Query: 502 PIVRGGEGTLMEENEFYAIETFGSTGKGMVHDDMDVSHYMKNFEASYIPLRLASSKSLLN 561
P G G +EE + YAIE F + G+G V + +V Y + P+RL +++ LL
Sbjct: 166 PNYDEGGGVKLEEGDVYAIEPFATDGEGKVVEGPEVEIYS--LLRNR-PVRLPAARKLLE 222
Query: 562 TINKNFGTLAFCKRWLDRAGCTKYQMALKDLCDKGIVEAYPPLCDIKGSYTAQFEHTIV 620
I + + TL F +RWL+ + ++ L+ L G + YP L ++KG +Q EHT++
Sbjct: 223 EIEEEYNTLPFAERWLEGLF-GEDKLELRRLLKAGALYGYPVLKEVKGGLVSQAEHTVI 280
Score = 86.4 bits (215), Expect = 1e-18
Identities = 33/94 (35%), Positives = 49/94 (52%), Gaps = 6/94 (6%)
Query: 145 MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCS 204
+ +A + +VR+ IKPG ++++ E +E I E G A AFP S
Sbjct: 1 ELEKYLEAGKIASKVREEAAKLIKPGAKLLDVAEFVENR----IRELG--AKPAFPCNIS 54
Query: 205 RNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
N AAHYTP+ GD+ V D+ K+D G H++G
Sbjct: 55 INEVAAHYTPSPGDERVFPEGDVVKLDLGAHVDG 88
>gnl|CDD|223103 COG0024, Map, Methionine aminopeptidase [Translation, ribosomal
structure and biogenesis].
Length = 255
Score = 173 bits (441), Expect = 7e-50
Identities = 70/226 (30%), Positives = 103/226 (45%), Gaps = 25/226 (11%)
Query: 321 DMYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKA-----GLA 375
+ + R+A + + K + +KPG+T +E+ E E++ + + A G
Sbjct: 9 EEIEKMREAGKIAAKALKEVASLVKPGVTTLELDEIAEEF---IREKGAYPAFLGYKGFP 65
Query: 376 FPTGCSRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNK----YD 431
FPT S N AH P DK VL+ DI KID G HI+G I D A T
Sbjct: 66 FPTCISVNEVVAHGIPG--DKKVLKEGDIVKIDVGAHIDGYIGDTAITFVVGEVSDEDAK 123
Query: 432 KLIEAVRDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSID 491
+L+EA ++A GI+A L ++G AIQE ES +RNL GH I
Sbjct: 124 RLLEATKEALYAGIEAVKPGARLGDIGRAIQEYAES---------RGFSVVRNLTGHGIG 174
Query: 492 SYRIHAGKTVPIV-RGGEGTLMEENEFYAIETFGSTGKGMVHDDMD 536
+H ++P + G G ++E +AIE +TG G V +
Sbjct: 175 R-ELHEEPSIPNYGKDGTGVRLKEGMVFAIEPMINTGSGEVVEGPS 219
Score = 86.5 bits (215), Expect = 7e-19
Identities = 39/134 (29%), Positives = 60/134 (44%), Gaps = 16/134 (11%)
Query: 144 DMYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKA-----GLA 198
+ + R+A + + K + +KPG+T +E+ E E++ + + A G
Sbjct: 9 EEIEKMREAGKIAAKALKEVASLVKPGVTTLELDEIAEEF---IREKGAYPAFLGYKGFP 65
Query: 199 FPTGCSRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING------EEVAKKKKKKKNKN 252
FPT S N AH P DK VL+ DI KID G HI+G + ++
Sbjct: 66 FPTCISVNEVVAHGIPG--DKKVLKEGDIVKIDVGAHIDGYIGDTAITFVVGEVSDEDAK 123
Query: 253 KAAAATEEAPKNGI 266
+ AT+EA GI
Sbjct: 124 RLLEATKEALYAGI 137
Score = 38.0 bits (89), Expect = 0.009
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 604 LCDIKGSYTAQFEHTIVLRPTCKEVISRGDDY 635
L GS +AQFEHT+++ E+++ +
Sbjct: 224 LVTKDGSLSAQFEHTVIVTEDGCEILTLRPEE 255
>gnl|CDD|215993 pfam00557, Peptidase_M24, Metallopeptidase family M24. This family
contains metallopeptidases. It also contains
non-peptidase homologues such as the N terminal domain
of Spt16 which is a histone H3-H4 binding module.
Length = 208
Score = 122 bits (307), Expect = 1e-31
Identities = 57/213 (26%), Positives = 80/213 (37%), Gaps = 24/213 (11%)
Query: 327 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPT--GCSRNH 384
R+AA + + I+PG+T E+ ELE G G AFP N
Sbjct: 4 RKAARIAAAALEAALAAIRPGVTERELAAELEAA----FLARGGARGPAFPPIVASGPNA 59
Query: 385 CAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAY---NNKYDKLIEAVRDAT 441
HY P++ VL+ D+ ID G +G D T + +L EAV +A
Sbjct: 60 AVPHYIPSD---RVLKDGDLVLIDVGAEYDGYHSDITRTFVVGKPTPEQRELYEAVLEAQ 116
Query: 442 NTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTV 501
I A V +V AA +EV+E + GH I +H
Sbjct: 117 EAAIAAVKPGVTGGDVDAAAREVLEEGGYGEY--------FPHGLGHGIG-LDVHDEGPY 167
Query: 502 PIVRGGEGTLMEENEFYAIE--TFGSTGKGMVH 532
I RGG ++E + IE + G G V
Sbjct: 168 -ISRGGNDRVLEPGMVFTIEPGIYFIPGWGGVR 199
Score = 76.9 bits (190), Expect = 5e-16
Identities = 27/91 (29%), Positives = 38/91 (41%), Gaps = 9/91 (9%)
Query: 150 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPT--GCSRNH 207
R+AA + + I+PG+T E+ ELE G G AFP N
Sbjct: 4 RKAARIAAAALEAALAAIRPGVTERELAAELEAA----FLARGGARGPAFPPIVASGPNA 59
Query: 208 CAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
HY P++ VL+ D+ ID G +G
Sbjct: 60 AVPHYIPSD---RVLKDGDLVLIDVGAEYDG 87
>gnl|CDD|238514 cd01066, APP_MetAP, A family including aminopeptidase P,
aminopeptidase M, and prolidase. Also known as
metallopeptidase family M24. This family of enzymes is
able to cleave amido-, imido- and amidino-containing
bonds. Members exibit relatively narrow substrate
specificity compared to other metallo-aminopeptidases,
suggesting they play roles in regulation of biological
processes rather than general protein degradation.
Length = 207
Score = 97.5 bits (243), Expect = 3e-23
Identities = 53/216 (24%), Positives = 85/216 (39%), Gaps = 24/216 (11%)
Query: 327 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCA 386
R+AAE I+PG+T E+ +E+ R G G
Sbjct: 5 RKAAEIAEAAMAAAAEAIRPGVTEAEVAAAIEQALR----AAGGYPAGPTIVGSGARTAL 60
Query: 387 AHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYD---KLIEAVRDATNT 443
HY P++ L+ D+ +D G +G D T D +L EAVR+A
Sbjct: 61 PHYRPDD---RRLQEGDLVLVDLGGVYDGYHADLTRTFVIGEPSDEQRELYEAVREAQEA 117
Query: 444 GIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTVPI 503
+ A V EV AA +EV+E + + + + GH I IH P+
Sbjct: 118 ALAALRPGVTAEEVDAAAREVLEEHGLG--------PNFGHRTGHGIG-LEIHEP---PV 165
Query: 504 VRGGEGTLMEENEFYAIETF--GSTGKGMVHDDMDV 537
++ G+ T++E +A+E G G+ +D +
Sbjct: 166 LKAGDDTVLEPGMVFAVEPGLYLPGGGGVRIEDTVL 201
Score = 55.5 bits (134), Expect = 1e-08
Identities = 21/89 (23%), Positives = 33/89 (37%), Gaps = 7/89 (7%)
Query: 150 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCA 209
R+AAE I+PG+T E+ +E+ R G G
Sbjct: 5 RKAAEIAEAAMAAAAEAIRPGVTEAEVAAAIEQALR----AAGGYPAGPTIVGSGARTAL 60
Query: 210 AHYTPNNGDKTVLEFDDITKIDFGTHING 238
HY P++ L+ D+ +D G +G
Sbjct: 61 PHYRPDD---RRLQEGDLVLVDLGGVYDG 86
>gnl|CDD|129586 TIGR00495, crvDNA_42K, 42K curved DNA binding protein. Proteins
identified by This model have been identified in a
number of species as a nuclear (but not nucleolar)
protein with a cell cycle dependence. Various names
given to members of this family have included cell cycle
protein p38-2G4, DNA-binding protein GBP16, and
proliferation-associated protein 1. This protein is
closely related to methionine aminopeptidase, a
cobolt-binding protein [Unknown function, General].
Length = 389
Score = 98.8 bits (246), Expect = 3e-22
Identities = 82/341 (24%), Positives = 153/341 (44%), Gaps = 26/341 (7%)
Query: 310 EEKKAIDRLHND-MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEE-----LEKYSRL 363
E+++ L N + + + A E V K ++ PG +++I E+ +E+ +++
Sbjct: 5 EQQEQAYSLSNPEVVTKYKMAGEIANNVLKSVVEACSPGAKVVDICEKGDAFIMEETAKI 64
Query: 364 LIGEDGLKAGLAFPTGCSRNHCAAHYTPNNGDKT-VLEFDDITKIDFGTHINGRIIDCAF 422
E ++ G+AFPT S N+C H++P D+ +L+ D+ KID G HI+G I A
Sbjct: 65 FKKEKEMEKGIAFPTCISVNNCVGHFSPLKSDQDYILKEGDVVKIDLGCHIDGFIALVAH 124
Query: 423 TLAYNNKYDKLIEA----VRDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQ 478
T ++ + V A + +AA + L + G +V E+ +Y
Sbjct: 125 TFVVGVAQEEPVTGRKADVIAAAHLAAEAA---LRLVKPGNTNTQVTEAINKVAH--SYG 179
Query: 479 VKSIRNLNGHSIDSYRIHAGKTVPIVRGGEGTL------MEENEFYAIETFGSTGKGMVH 532
+ + H + + I K + EENE YA++ STG+G
Sbjct: 180 CTPVEGMLSHQLKQHVIDGEKVIISNPSDSQKKDHDTAEFEENEVYAVDILVSTGEGKAK 239
Query: 533 D-DMDVSHYMKNFEASYIPLRLASSKSLLNTINKNFGTLAFCKRWLDRAGCTKYQMALKD 591
D D + Y ++ +Y L++ +S++ + I + F + F R + + M L +
Sbjct: 240 DADQRTTIYKRDPSKTYG-LKMKASRAFFSEIERRFDAMPFTLRNFEDEKRAR--MGLVE 296
Query: 592 LCDKGIVEAYPPLCDIKGSYTAQFEHTIVLRPTCKEVISRG 632
+++ YP L + +G + AQF+ T++L P I+ G
Sbjct: 297 CVKHELLQPYPVLYEKEGEFVAQFKFTVLLMPNGPMRITSG 337
Score = 54.9 bits (132), Expect = 7e-08
Identities = 33/113 (29%), Positives = 61/113 (53%), Gaps = 7/113 (6%)
Query: 133 EEKKAIDRLHND-MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEE-----LEKYSRL 186
E+++ L N + + + A E V K ++ PG +++I E+ +E+ +++
Sbjct: 5 EQQEQAYSLSNPEVVTKYKMAGEIANNVLKSVVEACSPGAKVVDICEKGDAFIMEETAKI 64
Query: 187 LIGEDGLKAGLAFPTGCSRNHCAAHYTPNNGDKT-VLEFDDITKIDFGTHING 238
E ++ G+AFPT S N+C H++P D+ +L+ D+ KID G HI+G
Sbjct: 65 FKKEKEMEKGIAFPTCISVNNCVGHFSPLKSDQDYILKEGDVVKIDLGCHIDG 117
Score = 30.6 bits (69), Expect = 2.5
Identities = 16/47 (34%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 48 LYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAGEAAA--DVPID 92
LY + E++D +++ A K+KK+KKK KKA AA ++
Sbjct: 343 LYKSEMEVQDPEIKALLASPIKRKKQKKKAKKASKTGEAATEGETME 389
>gnl|CDD|238519 cd01086, MetAP1, Methionine Aminopeptidase 1. E.C. 3.4.11.18. Also
known as methionyl aminopeptidase and Peptidase M.
Catalyzes release of N-terminal amino acids,
preferentially methionine, from peptides and arylamides.
Length = 238
Score = 83.7 bits (208), Expect = 4e-18
Identities = 58/217 (26%), Positives = 84/217 (38%), Gaps = 31/217 (14%)
Query: 322 MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAG----LAFP 377
M R AE ++ K IKPG+T E+ + ++ I E G FP
Sbjct: 4 MREAGRIVAEVLDELAKA----IKPGVTTKELDQIAHEF----IEEHGAYPAPLGYYGFP 55
Query: 378 -TGC-SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYD---K 432
+ C S N H P D VL+ DI ID G ++G D A T + K
Sbjct: 56 KSICTSVNEVVCHGIP---DDRVLKDGDIVNIDVGVELDGYHGDSARTFIVGEVSEEAKK 112
Query: 433 LIEAVRDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDS 492
L+E +A GI+A + ++G AI++ E Y V +R GH I
Sbjct: 113 LVEVTEEALYKGIEAVKPGNRIGDIGHAIEKYAEKN-------GYSV--VREFGGHGIGR 163
Query: 493 YRIHAGKTVPIV-RGGEGTLMEENEFYAIETFGSTGK 528
H +P R G G ++ + IE + G
Sbjct: 164 K-FHEEPQIPNYGRPGTGPKLKPGMVFTIEPMINLGT 199
Score = 37.5 bits (88), Expect = 0.011
Identities = 30/100 (30%), Positives = 41/100 (41%), Gaps = 17/100 (17%)
Query: 145 MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAG----LAFP 200
M R AE ++ K IKPG+T E+ + ++ I E G FP
Sbjct: 4 MREAGRIVAEVLDELAKA----IKPGVTTKELDQIAHEF----IEEHGAYPAPLGYYGFP 55
Query: 201 -TGC-SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
+ C S N H P D VL+ DI ID G ++G
Sbjct: 56 KSICTSVNEVVCHGIP---DDRVLKDGDIVNIDVGVELDG 92
Score = 30.2 bits (69), Expect = 3.1
Identities = 8/13 (61%), Positives = 11/13 (84%)
Query: 608 KGSYTAQFEHTIV 620
GS +AQFEHT++
Sbjct: 215 DGSLSAQFEHTVL 227
>gnl|CDD|237252 PRK12896, PRK12896, methionine aminopeptidase; Reviewed.
Length = 255
Score = 68.3 bits (168), Expect = 8e-13
Identities = 50/204 (24%), Positives = 82/204 (40%), Gaps = 28/204 (13%)
Query: 335 QVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDG----LKAGLAFPTGC--SRNHCAAH 388
K M ++PGMT +EL++ + + E G + FP S N AH
Sbjct: 28 TALKEMGKAVEPGMT----TKELDRIAEKRLEEHGAIPSPEGYYGFPGSTCISVNEEVAH 83
Query: 389 YTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNN---KYDKLIEAVRDATNTGI 445
P V++ D+ ID +++G D T A + +KL +A GI
Sbjct: 84 GIP---GPRVIKDGDLVNIDVSAYLDGYHGDTGITFAVGPVSEEAEKLCRVAEEALWAGI 140
Query: 446 KAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTVPIVR 505
K PL ++G AI++ + +G Y V +R+L GH + +H +V +
Sbjct: 141 KQVKAGRPLNDIGRAIED-----FAKKNG--YSV--VRDLTGHGVGR-SLHEEPSVILTY 190
Query: 506 G--GEGTLMEENEFYAIETFGSTG 527
L+ A+E F + G
Sbjct: 191 TDPLPNRLLRPGMTLAVEPFLNLG 214
Score = 41.0 bits (97), Expect = 0.001
Identities = 22/87 (25%), Positives = 36/87 (41%), Gaps = 13/87 (14%)
Query: 158 QVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDG----LKAGLAFPTGC--SRNHCAAH 211
K M ++PGMT +EL++ + + E G + FP S N AH
Sbjct: 28 TALKEMGKAVEPGMT----TKELDRIAEKRLEEHGAIPSPEGYYGFPGSTCISVNEEVAH 83
Query: 212 YTPNNGDKTVLEFDDITKIDFGTHING 238
P V++ D+ ID +++G
Sbjct: 84 GIP---GPRVIKDGDLVNIDVSAYLDG 107
Score = 30.2 bits (69), Expect = 2.9
Identities = 8/12 (66%), Positives = 10/12 (83%)
Query: 609 GSYTAQFEHTIV 620
S +AQFEHT+V
Sbjct: 232 KSLSAQFEHTVV 243
>gnl|CDD|235576 PRK05716, PRK05716, methionine aminopeptidase; Validated.
Length = 252
Score = 67.5 bits (166), Expect = 2e-12
Identities = 57/226 (25%), Positives = 86/226 (38%), Gaps = 47/226 (20%)
Query: 329 AAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLL------IGEDGLKAGLAFP--TGC 380
AAE ++ H +KPG+T E+ E+Y R +G G FP
Sbjct: 21 AAEVLDEIEPH----VKPGVTTKELDRIAEEYIRDQGAIPAPLGYHG------FPKSICT 70
Query: 381 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAY---NNKYDKLIEAV 437
S N H P VL+ DI ID +G D + T + + +L E
Sbjct: 71 SVNEVVCHGIP---SDKVLKEGDIVNIDVTVIKDGYHGDTSRTFGVGEISPEDKRLCEVT 127
Query: 438 RDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHA 497
++A GI A L ++G AIQ+ E+ + V +R GH I
Sbjct: 128 KEALYLGIAAVKPGARLGDIGHAIQKYAEAE-------GFSV--VREYCGHGI------- 171
Query: 498 GKTV---PIV----RGGEGTLMEENEFYAIETFGSTGKGMVHDDMD 536
G+ P + G+G +++E + IE + GK V D
Sbjct: 172 GRKFHEEPQIPHYGAPGDGPVLKEGMVFTIEPMINAGKREVKTLKD 217
Score = 34.7 bits (81), Expect = 0.11
Identities = 27/95 (28%), Positives = 36/95 (37%), Gaps = 21/95 (22%)
Query: 152 AAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLL------IGEDGLKAGLAFP--TGC 203
AAE ++ H +KPG+T E+ E+Y R +G G FP
Sbjct: 21 AAEVLDEIEPH----VKPGVTTKELDRIAEEYIRDQGAIPAPLGYHG------FPKSICT 70
Query: 204 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
S N H P VL+ DI ID +G
Sbjct: 71 SVNEVVCHGIP---SDKVLKEGDIVNIDVTVIKDG 102
>gnl|CDD|238525 cd01092, APP-like, Similar to Prolidase and Aminopeptidase P. The
members of this subfamily presumably catalyse hydrolysis
of Xaa-Pro dipeptides and/or release of any N-terminal
amino acid, including proline, that is linked with
proline.
Length = 208
Score = 65.6 bits (161), Expect = 4e-12
Identities = 46/147 (31%), Positives = 66/147 (44%), Gaps = 13/147 (8%)
Query: 327 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCA 386
R+AA + + ++ +IKPGMT E+ ELE + R L E G +F T + +
Sbjct: 5 RKAARIADKAFEELLEFIKPGMTEREVAAELEYFMRKLGAE-----GPSFDTIVASGPNS 59
Query: 387 A--HYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDKLIE---AVRDAT 441
A H P +E D+ IDFG +G D T+A D+L E V +A
Sbjct: 60 ALPHGVP---SDRKIEEGDLVLIDFGAIYDGYCSDITRTVAVGEPSDELKEIYEIVLEAQ 116
Query: 442 NTGIKAAGIDVPLCEVGAAIQEVMESY 468
IKA V EV A ++V+E
Sbjct: 117 QAAIKAVKPGVTAKEVDKAARDVIEEA 143
Score = 46.7 bits (112), Expect = 1e-05
Identities = 29/91 (31%), Positives = 43/91 (47%), Gaps = 10/91 (10%)
Query: 150 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCA 209
R+AA + + ++ +IKPGMT E+ ELE + R L E G +F T + +
Sbjct: 5 RKAARIADKAFEELLEFIKPGMTEREVAAELEYFMRKLGAE-----GPSFDTIVASGPNS 59
Query: 210 A--HYTPNNGDKTVLEFDDITKIDFGTHING 238
A H P +E D+ IDFG +G
Sbjct: 60 ALPHGVP---SDRKIEEGDLVLIDFGAIYDG 87
>gnl|CDD|223085 COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and
metabolism].
Length = 384
Score = 59.7 bits (145), Expect = 2e-09
Identities = 54/203 (26%), Positives = 77/203 (37%), Gaps = 28/203 (13%)
Query: 326 ARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCS--RN 383
A A A + I+PGMT EI ELE R E G +F T + N
Sbjct: 167 AEIADAALEAA----LEAIRPGMTEAEIAAELEYALRKGGAE-----GPSFDTIVASGEN 217
Query: 384 HCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYN---NKYDKLIEAVRDA 440
HYTP++ L D+ ID G NG D T ++ ++ EAV +A
Sbjct: 218 AALPHYTPSD---RKLRDGDLVLIDLGGVYNGYCSDITRTFPIGKPSDEQREIYEAVLEA 274
Query: 441 TNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSI-DSYRIHAGK 499
I A V EV AA ++V+E + + GH + +H
Sbjct: 275 QEAAIAAIRPGVTGGEVDAAARQVLEKAGYGLY--------FLHGTGHGVGFVLDVHEH- 325
Query: 500 TVPIVRGGEGTLMEENEFYAIET 522
+ G + TL E ++IE
Sbjct: 326 PQYLSPGSDTTL-EPGMVFSIEP 347
Score = 48.2 bits (115), Expect = 8e-06
Identities = 30/92 (32%), Positives = 37/92 (40%), Gaps = 14/92 (15%)
Query: 149 ARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCS--RN 206
A A A + I+PGMT EI ELE R E G +F T + N
Sbjct: 167 AEIADAALEAA----LEAIRPGMTEAEIAAELEYALRKGGAE-----GPSFDTIVASGEN 217
Query: 207 HCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
HYTP++ L D+ ID G NG
Sbjct: 218 AALPHYTPSD---RKLRDGDLVLIDLGGVYNG 246
>gnl|CDD|238522 cd01089, PA2G4-like, Related to aminopepdidase M, this family
contains proliferation-associated protein 2G4. Family
members have been implicated in cell cycle control.
Length = 228
Score = 57.7 bits (140), Expect = 2e-09
Identities = 43/136 (31%), Positives = 60/136 (44%), Gaps = 20/136 (14%)
Query: 323 YNEARQAAEAHRQVRKHMMGWIKPGMTMIE--------IVEELEKYSRLLIGEDGLKAGL 374
Y A Q A V K ++ PG +++ I+EEL K + E L+ G+
Sbjct: 4 YKTAGQIANK---VLKQVISLCVPGAKVVDLCEKGDKLILEELGKVYK---KEKKLEKGI 57
Query: 375 AFPTGCSRNHCAAHYTPNNGDKTV-LEFDDITKIDFGTHINGRIIDCAFTL-AYNNKYDK 432
AFPT S N+C H++P D T L+ D+ KID G HI+G I A T+
Sbjct: 58 AFPTCISVNNCVCHFSPLKSDATYTLKDGDVVKIDLGCHIDGYIAVVAHTIVVGAEAETP 117
Query: 433 LIEAVRDATNTGIKAA 448
+ D I AA
Sbjct: 118 VTGKKADV----IAAA 129
Score = 56.2 bits (136), Expect = 9e-09
Identities = 36/102 (35%), Positives = 51/102 (50%), Gaps = 15/102 (14%)
Query: 146 YNEARQAAEAHRQVRKHMMGWIKPGMTMIE--------IVEELEKYSRLLIGEDGLKAGL 197
Y A Q A V K ++ PG +++ I+EEL K + E L+ G+
Sbjct: 4 YKTAGQIANK---VLKQVISLCVPGAKVVDLCEKGDKLILEELGKVYK---KEKKLEKGI 57
Query: 198 AFPTGCSRNHCAAHYTPNNGDKTV-LEFDDITKIDFGTHING 238
AFPT S N+C H++P D T L+ D+ KID G HI+G
Sbjct: 58 AFPTCISVNNCVCHFSPLKSDATYTLKDGDVVKIDLGCHIDG 99
Score = 31.5 bits (72), Expect = 0.91
Identities = 14/43 (32%), Positives = 22/43 (51%), Gaps = 4/43 (9%)
Query: 581 GCTKYQMALKDLCDKGIVEAYPPLCDIKGSYTAQFEHTIVLRP 623
G K L + G++ YP L + +G AQF+ T++L P
Sbjct: 182 GKAK----LVECVKHGLLFPYPVLYEKEGEVVAQFKLTVLLTP 220
>gnl|CDD|129591 TIGR00500, met_pdase_I, methionine aminopeptidase, type I.
Methionine aminopeptidase is a cobalt-binding enzyme.
Bacterial and organellar examples (type I) differ from
eukaroytic and archaeal (type II) examples in lacking a
region of approximately 60 amino acids between the 4th
and 5th cobalt-binding ligands. This model describes
type I. The role of this protein in general is to
produce the mature form of cytosolic proteins by
removing the N-terminal methionine [Protein fate,
Protein modification and repair].
Length = 247
Score = 53.5 bits (129), Expect = 7e-08
Identities = 53/213 (24%), Positives = 86/213 (40%), Gaps = 27/213 (12%)
Query: 327 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAG----LAFP-TGC- 380
R+A +V + + +KPG++ E+ + + I + G K FP + C
Sbjct: 13 RKAGRLAAEVLEELEREVKPGVSTKELDRIAKDF----IEKHGAKPAFLGYYGFPGSVCI 68
Query: 381 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNN---KYDKLIEAV 437
S N H P DK VL+ DI ID G +G D A T + +KL+E
Sbjct: 69 SVNEVVIHGIP---DKKVLKDGDIVNIDVGVIYDGYHGDTAKTFLVGKISPEAEKLLECT 125
Query: 438 RDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHA 497
++ I+ A + E+GAAIQ+ E+ + +R GH I + H
Sbjct: 126 EESLYKAIEEAKPGNRIGEIGAAIQKYAEAKGFSV---------VREYCGHGI-GRKFHE 175
Query: 498 GKTVP-IVRGGEGTLMEENEFYAIETFGSTGKG 529
+P + ++E + IE +TG
Sbjct: 176 EPQIPNYGKKFTNVRLKEGMVFTIEPMVNTGTE 208
Score = 29.2 bits (66), Expect = 6.3
Identities = 27/95 (28%), Positives = 41/95 (43%), Gaps = 13/95 (13%)
Query: 150 RQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAG----LAFP-TGC- 203
R+A +V + + +KPG++ E+ + + I + G K FP + C
Sbjct: 13 RKAGRLAAEVLEELEREVKPGVSTKELDRIAKDF----IEKHGAKPAFLGYYGFPGSVCI 68
Query: 204 SRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHING 238
S N H P DK VL+ DI ID G +G
Sbjct: 69 SVNEVVIHGIP---DKKVLKDGDIVNIDVGVIYDG 100
>gnl|CDD|180918 PRK07281, PRK07281, methionine aminopeptidase; Reviewed.
Length = 286
Score = 48.7 bits (116), Expect = 4e-06
Identities = 61/240 (25%), Positives = 94/240 (39%), Gaps = 48/240 (20%)
Query: 344 IKPGMTMIEIVEELEKYSR------LLIGEDGLKAGLAFPTGCSRNHCAAH-----YTPN 392
IKPG+ M E+ E + + + L IG DG + T C N AH Y
Sbjct: 31 IKPGVDMWEVEEYVRRRCKEENVLPLQIGVDGAMMDYPYATCCGLNDEVAHAFPRHYILK 90
Query: 393 NGDKTVLEFD------------DITKIDFGT---------HINGRIIDCAFTLAYNNKYD 431
GD +L+ D D++K++F G + D + A D
Sbjct: 91 EGD--LLKVDMVLSEPLDKSIVDVSKLNFDNVEQMKKYTESYRGGLADSCWAYAVGTPSD 148
Query: 432 K---LIEAVRDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGH 488
+ L++ ++A GI+ A + + ++GAAIQE ES + Y V +R+L GH
Sbjct: 149 EVKNLMDVTKEAMYRGIEQAVVGNRIGDIGAAIQEYAES-------RGYGV--VRDLVGH 199
Query: 489 SIDSYRIHAGKTVPIV-RGGEGTLMEENEFYAIETFGSTGKGMVHDDMDVSHYMKNFEAS 547
+ +H VP G G + E IE +TG + DM K +
Sbjct: 200 GVGP-TMHEEPMVPNYGTAGRGLRLREGMVLTIEPMINTGTWEIDTDMKTGWAHKTLDGG 258
Score = 31.0 bits (70), Expect = 1.7
Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 9/71 (12%)
Query: 167 IKPGMTMIEIVEELEKYSR------LLIGEDGLKAGLAFPTGCSRNHCAAHYTPNNGDKT 220
IKPG+ M E+ E + + + L IG DG + T C N AH P +
Sbjct: 31 IKPGVDMWEVEEYVRRRCKEENVLPLQIGVDGAMMDYPYATCCGLNDEVAHAFPRH---Y 87
Query: 221 VLEFDDITKID 231
+L+ D+ K+D
Sbjct: 88 ILKEGDLLKVD 98
>gnl|CDD|238520 cd01087, Prolidase, Prolidase. E.C. 3.4.13.9. Also known as Xaa-Pro
dipeptidase, X-Pro dipeptidase, proline dipeptidase.,
imidodipeptidase, peptidase D, gamma-peptidase.
Catalyses hydrolysis of Xaa-Pro dipeptides; also acts on
aminoacyl-hydroxyproline analogs. No action on Pro-Pro.
Length = 243
Score = 43.7 bits (104), Expect = 1e-04
Identities = 35/124 (28%), Positives = 46/124 (37%), Gaps = 15/124 (12%)
Query: 329 AAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCAAH 388
+AEAHR K +PGM+ E+ E E R G + ++ N H
Sbjct: 11 SAEAHRAAMKA----SRPGMSEYELEAEFEYEFR----SRGARLAYSYIVAAGSNAAILH 62
Query: 389 YTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYDK----LIEAVRDATNTG 444
Y N+ L+ D+ ID G G D T N K+ L EAV A
Sbjct: 63 YVHNDQ---PLKDGDLVLIDAGAEYGGYASDITRTFPVNGKFTDEQRELYEAVLAAQKAA 119
Query: 445 IKAA 448
I A
Sbjct: 120 IAAC 123
Score = 37.9 bits (89), Expect = 0.010
Identities = 18/67 (26%), Positives = 26/67 (38%), Gaps = 8/67 (11%)
Query: 152 AAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCAAH 211
+AEAHR K +PGM+ E+ E E R G + ++ N H
Sbjct: 11 SAEAHRAAMKA----SRPGMSEYELEAEFEYEFR----SRGARLAYSYIVAAGSNAAILH 62
Query: 212 YTPNNGD 218
Y N+
Sbjct: 63 YVHNDQP 69
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 39.5 bits (92), Expect = 0.006
Identities = 24/101 (23%), Positives = 44/101 (43%), Gaps = 5/101 (4%)
Query: 5 KGEEVAKKKKKKKNKNKAAA---ATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVE 61
K EE +K ++ + ++ A + +APK P++TT + S+ + + E
Sbjct: 1174 KAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASE-SETTEETYGSSAMETE 1232
Query: 62 DGAAEASKKKKRKKKKKKAGAGEAAADVPIDGNGIAEQLEQ 102
+ AE K K R KKKA A + + + ++L
Sbjct: 1233 N-VAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAA 1272
Score = 32.9 bits (75), Expect = 0.65
Identities = 20/70 (28%), Positives = 26/70 (37%), Gaps = 5/70 (7%)
Query: 23 AAATEEAPKNGVPQQTTPPS--IPVSQLYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKA 80
A E + VS + DD+ +D VE AE KKK +K A
Sbjct: 1283 AKMEETVKAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKP---A 1339
Query: 81 GAGEAAADVP 90
A + AA P
Sbjct: 1340 AANKKAAKPP 1349
Score = 32.5 bits (74), Expect = 0.81
Identities = 20/102 (19%), Positives = 34/102 (33%), Gaps = 19/102 (18%)
Query: 7 EEVAKKKKKKKNKNKAAAATEE------APKNGVPQQTTPPSIPVSQLYPDDDEIEDGDV 60
E ++ KKK K AAA ++ A K P ++++ E +
Sbjct: 1323 EVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEML---KPAEAIGI 1379
Query: 61 EDGAAEASKKKKRKKKK----KKAGAGEAAADVPIDGNGIAE 98
+KK RK + KK+G+ A +
Sbjct: 1380 SP------EKKVRKMRASPFNKKSGSVLGRAATNKETESSEN 1415
Score = 31.0 bits (70), Expect = 2.8
Identities = 15/49 (30%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 36 QQTTPPSIPVSQLYPDDDEIEDGDVEDGAAEASKKK-KRKKKKKKAGAG 83
++TTP S+ + L + E++ D ED AE +++K +R + ++GA
Sbjct: 1146 KKTTPKSLWLKDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAA 1194
>gnl|CDD|183434 PRK12318, PRK12318, methionine aminopeptidase; Provisional.
Length = 291
Score = 36.7 bits (85), Expect = 0.027
Identities = 48/195 (24%), Positives = 74/195 (37%), Gaps = 28/195 (14%)
Query: 345 KPGMTMIEIVEELEKYSRLLIGE-DGLKAGL-----AFP-TGC-SRNHCAAHYTPNNGDK 396
K G+T EL++ SR L E + + A L FP T C S N H PN+
Sbjct: 71 KEGVT----TNELDELSRELHKEYNAIPAPLNYGSPPFPKTICTSLNEVICHGIPND--- 123
Query: 397 TVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYD---KLIEAVRDATNTGIKAAGIDVP 453
L+ DI ID ++G DC+ + + K+ +A + N I +P
Sbjct: 124 IPLKNGDIMNIDVSCIVDGYYGDCSRMVMIGEVSEIKKKVCQASLECLNAAIAILKPGIP 183
Query: 454 LCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIHAGKTVPIVRGGEGTLME 513
L E+G I+ + Y + + GH + + H VP R +
Sbjct: 184 LYEIGEVIENCADKYGFSV---------VDQFVGHGV-GIKFHENPYVPHHRNSSKIPLA 233
Query: 514 ENEFYAIETFGSTGK 528
+ IE + GK
Sbjct: 234 PGMIFTIEPMINVGK 248
>gnl|CDD|171806 PRK12897, PRK12897, methionine aminopeptidase; Reviewed.
Length = 248
Score = 36.6 bits (84), Expect = 0.028
Identities = 59/213 (27%), Positives = 82/213 (38%), Gaps = 41/213 (19%)
Query: 322 MYNEARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSRLLIGEDGLKA--GLAFPTG 379
M+ + A HR++ K M KPG+T EI +E Y K G +
Sbjct: 13 MHESGKLLASCHREIAKIM----KPGITTKEINTFVEAYLEKHGATSEQKGYNGYPYAIC 68
Query: 380 CSRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHINGRIIDCAFTLAYNNKYD---KLIEA 436
S N H P + L DI ID ++NG + D A+T D KL+
Sbjct: 69 ASVNDEMCHAFPAD---VPLTEGDIVTIDMVVNLNGGLSDSAWTYRVGKVSDEAEKLLLV 125
Query: 437 VRDATNTGIKAAGIDVPLCEVGAAIQEVMESYEVEIDGKTYQVKSIRNLNGHSIDSYRIH 496
+A GI A I + ++G AI ESY + + + V R+ GH I
Sbjct: 126 AENALYKGIDQAVIGNRVGDIGYAI----ESY---VANEGFSVA--RDFTGHGI------ 170
Query: 497 AGKTVPIVRGGEGTLMEENEFYAIETFGSTGKG 529
GK E +E AI FG G+G
Sbjct: 171 -GK-------------EIHEEPAIFHFGKQGQG 189
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 36.5 bits (85), Expect = 0.038
Identities = 17/85 (20%), Positives = 30/85 (35%)
Query: 4 QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDG 63
+K +++A K K K K K P+Q I +S + D D+ + D+
Sbjct: 18 KKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKK 77
Query: 64 AAEASKKKKRKKKKKKAGAGEAAAD 88
+ + K K K +
Sbjct: 78 KTKTAAKAAAAKAPAKKKLKDELDS 102
Score = 29.6 bits (67), Expect = 5.6
Identities = 21/87 (24%), Positives = 30/87 (34%), Gaps = 3/87 (3%)
Query: 5 KGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGA 64
K E + KK +KKN V Q DDD+I+D D ++
Sbjct: 97 KDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDD 156
Query: 65 AEASKKKKRKK---KKKKAGAGEAAAD 88
E +KK+A E +D
Sbjct: 157 DEDDDDDDVDDEDEEKKEAKELEKLSD 183
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 34.6 bits (80), Expect = 0.19
Identities = 21/89 (23%), Positives = 31/89 (34%)
Query: 4 QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDG 63
+ E K K+++ EE P T S+ S D + ++
Sbjct: 464 EAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKA 523
Query: 64 AAEASKKKKRKKKKKKAGAGEAAADVPID 92
A + SK + KKKKK D ID
Sbjct: 524 ANKISKAAVKVKKKKKKEKSIDLDDDLID 552
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 31.6 bits (72), Expect = 0.19
Identities = 18/75 (24%), Positives = 37/75 (49%), Gaps = 1/75 (1%)
Query: 5 KGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVS-QLYPDDDEIEDGDVEDG 63
KG+++ KKKKKK K K + E + ++++ S + D+++IE +
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMN 72
Query: 64 AAEASKKKKRKKKKK 78
EA + + ++K+
Sbjct: 73 LTEAERAFEEAQRKR 87
>gnl|CDD|238295 cd00530, PTE, Phosphotriesterase (PTE) catalyzes the hydrolysis of
organophosphate nerve agents, including the chemical
warfare agents VX, soman, and sarin as well as the
insecticide paraoxon. PTE exists as a homodimer with one
active site per monomer. The active site is located next
to a binuclear metal center, at the C-terminal end of a
TIM alpha- beta barrel motif. The native enzyme
contains two zinc ions at the active site however these
can be replaced with other metals such as cobalt,
cadmium, nickel or manganese and the enzyme remains
active.
Length = 293
Score = 32.6 bits (75), Expect = 0.50
Identities = 13/60 (21%), Positives = 22/60 (36%), Gaps = 15/60 (25%)
Query: 125 TGKDRFTNEEKKAIDRLHNDMYNEARQAAEAHRQ----VRKHMMGWIKPGMTMIEIVEEL 180
G T E+K + R AA A ++ + H + G+ + I+EE
Sbjct: 125 GGSPAITPLEEKVL-----------RAAARAQKETGVPISTHTQAGLTMGLEQLRILEEE 173
Score = 32.6 bits (75), Expect = 0.50
Identities = 13/60 (21%), Positives = 22/60 (36%), Gaps = 15/60 (25%)
Query: 302 TGKDRFTNEEKKAIDRLHNDMYNEARQAAEAHRQ----VRKHMMGWIKPGMTMIEIVEEL 357
G T E+K + R AA A ++ + H + G+ + I+EE
Sbjct: 125 GGSPAITPLEEKVL-----------RAAARAQKETGVPISTHTQAGLTMGLEQLRILEEE 173
>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
transduction mechanisms].
Length = 208
Score = 31.4 bits (71), Expect = 0.99
Identities = 21/98 (21%), Positives = 33/98 (33%), Gaps = 11/98 (11%)
Query: 5 KGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGA 64
K A++ +++ K + A A E P P P +V
Sbjct: 107 KARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKA----------PREEVHTPV 156
Query: 65 AEASKKKKRKKKKKKAGAGEAAADV-PIDGNGIAEQLE 101
++ S + K KAG A V I +G+ QL
Sbjct: 157 SDISALTVGQALKVKAGQNAMDATVLEITKDGVRVQLN 194
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 31.4 bits (72), Expect = 1.1
Identities = 8/45 (17%), Positives = 18/45 (40%)
Query: 4 QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQL 48
+ E K ++++ A + P+ P+ P +S+L
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISEL 167
Score = 29.9 bits (68), Expect = 3.3
Identities = 8/43 (18%), Positives = 17/43 (39%)
Query: 238 GEEVAKKKKKKKNKNKAAAATEEAPKNGIPQQTTPPSIPVSQL 280
E K ++++ A + P+ P+ P +S+L
Sbjct: 125 AGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISEL 167
Score = 29.1 bits (66), Expect = 5.5
Identities = 20/104 (19%), Positives = 35/104 (33%), Gaps = 23/104 (22%)
Query: 4 QKGEEVAKKK-----KKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDG 58
Q+ E+ AKK+ K+K + + + K P+ P
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP------------ 160
Query: 59 DVEDGAAEASKKKKRKKKKKKAGAGEAAADV-PIDGNGIAEQLE 101
++ S+ + K KAG A V I + + QL+
Sbjct: 161 -----VSDISELTVGQAVKVKAGKSAMDATVLEITKDDVRVQLD 199
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 30.6 bits (69), Expect = 1.2
Identities = 16/72 (22%), Positives = 28/72 (38%), Gaps = 1/72 (1%)
Query: 10 AKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPD-DDEIEDGDVEDGAAEAS 68
AK ++K+ K +A K + D DDE ++ D +A
Sbjct: 39 AKAQEKEHEKERAEREEAREKKAKISPNALRGGATAGHGEEDTDDEEDEEDFATPSAVPQ 98
Query: 69 KKKKRKKKKKKA 80
KK +K+++K
Sbjct: 99 WGKKARKRQRKV 110
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 29.5 bits (67), Expect = 1.8
Identities = 14/44 (31%), Positives = 23/44 (52%)
Query: 226 DITKIDFGTHINGEEVAKKKKKKKNKNKAAAATEEAPKNGIPQQ 269
DI+ + IN E ++KKKKK K++ E+ K + +Q
Sbjct: 22 DISGVKVPEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQ 65
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 31.4 bits (72), Expect = 1.9
Identities = 21/135 (15%), Positives = 44/135 (32%), Gaps = 1/135 (0%)
Query: 7 EEVAKK-KKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGAA 65
EE A+K + K++ + +A A EEA ++ + + + + A
Sbjct: 40 EEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPA 99
Query: 66 EASKKKKRKKKKKKAGAGEAAADVPIDGNGIAEQLEQCGLNGNFPVGQEMEYPTLKDGRT 125
E + + ++ A +AA + + G G + R
Sbjct: 100 EDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRR 159
Query: 126 GKDRFTNEEKKAIDR 140
K + + + I R
Sbjct: 160 RKKKKKQKPTEKIPR 174
>gnl|CDD|152339 pfam11903, DUF3423, Protein of unknown function (DUF3423). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria. Proteins in this
family are typically between 73 to 118 amino acids in
length. This protein appears to be related to
ribbon-helix-helix DNA-binding domains, suggesting these
proteins may also bind DNA.
Length = 73
Score = 28.4 bits (64), Expect = 1.9
Identities = 9/37 (24%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 140 RLHNDMYNEARQAAEA-HRQVRKHMMGWIKPGMTMIE 175
++ +D++ +AR+A++A R + + W + G + E
Sbjct: 5 KISDDLHEQARRASKAMSRSINAQIEHWARIGR-LAE 40
Score = 28.4 bits (64), Expect = 1.9
Identities = 9/37 (24%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 317 RLHNDMYNEARQAAEA-HRQVRKHMMGWIKPGMTMIE 352
++ +D++ +AR+A++A R + + W + G + E
Sbjct: 5 KISDDLHEQARRASKAMSRSINAQIEHWARIGR-LAE 40
>gnl|CDD|238017 cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic
and prokaryotic proteins. Putative nuclear signalling
domain. FHA domains may bind phosphothreonine,
phosphoserine and sometimes phosphotyrosine. In
eukaryotes, many FHA domain-containing proteins localize
to the nucleus, where they participate in establishing
or maintaining cell cycle checkpoints, DNA repair, or
transcriptional regulation. Members of the FHA family
include: Dun1, Rad53, Cds1, Mek1,
KAPP(kinase-associated protein phosphatase),and Ki-67 (a
human nuclear protein related to cell proliferation).
Length = 102
Score = 29.3 bits (66), Expect = 2.1
Identities = 16/69 (23%), Positives = 25/69 (36%), Gaps = 6/69 (8%)
Query: 179 ELEKYSRLLIGEDGLKAGLAFP-TGCSRNHCAAHYTPNNGDKTVLEFDDITKIDFGTHIN 237
L+ IG D + SR H Y + G V+ D + GT +N
Sbjct: 17 YLDPGGTYTIGRDSDNCDIVLDDPSVSRRHAVIRYDGDGG---VVLIDLGST--NGTFVN 71
Query: 238 GEEVAKKKK 246
G+ V+ +
Sbjct: 72 GQRVSPGEP 80
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 30.9 bits (70), Expect = 2.5
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 51 DDDEIEDGDVEDGAAEASKKKKRKKKKKKAGA 82
DD E +D + DG + +KKK +KK+++K +
Sbjct: 165 DDGEDDDEEGSDGEEDGAKKKAKKKRQRKPES 196
>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
Validated.
Length = 709
Score = 30.8 bits (69), Expect = 2.6
Identities = 23/81 (28%), Positives = 30/81 (37%), Gaps = 23/81 (28%)
Query: 3 IQKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQ----QTTPPSIPVSQL---------- 48
IQ E + + +KNKAA +AP + VP Q TP V
Sbjct: 457 IQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHEAPAEP 516
Query: 49 -----YPDDDEIEDGDVEDGA 64
+PD+D EDGA
Sbjct: 517 FYGYGFPDNDC----PPEDGA 533
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 30.7 bits (70), Expect = 2.9
Identities = 11/39 (28%), Positives = 20/39 (51%)
Query: 49 YPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAGEAAA 87
P E E+ + E+G +++KK +KK++KA
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKE 428
>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
Provisional.
Length = 291
Score = 30.1 bits (67), Expect = 2.9
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 1/72 (1%)
Query: 11 KKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPD-DDEIEDGDVEDGAAEASK 69
K + K ++ T++ + P +P + D DD+ E+GD +DG AE +
Sbjct: 40 KSPPESKGLETSSQGTQDLKGGAAGAKENSPPLPTEEDDEDVDDDSEEGDDDDGGAEDEE 99
Query: 70 KKKRKKKKKKAG 81
++K + + + G
Sbjct: 100 EEKVRGQSGQEG 111
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 30.6 bits (69), Expect = 3.2
Identities = 19/84 (22%), Positives = 32/84 (38%), Gaps = 7/84 (8%)
Query: 2 LIQKGEEVAKKKKKKKNKNKAAAATEE--APKNGVPQQTTPPSIPVSQLYPDDDEIEDGD 59
++ + K+KK+K + + EE S + L D+DE E
Sbjct: 217 IMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKP 276
Query: 60 V-----EDGAAEASKKKKRKKKKK 78
+ E +K+KRK+ KK
Sbjct: 277 SGERSDSEEETEEKEKEKRKRLKK 300
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 30.1 bits (68), Expect = 3.3
Identities = 28/145 (19%), Positives = 49/145 (33%), Gaps = 24/145 (16%)
Query: 1 MLIQKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDV 60
M+ +K +E K+K++K ++A + E +P + P + P + + E
Sbjct: 127 MIERKPKEPGLKRKQRKKAQESATSPESSPSST-PNSSRPSTPHLL----KAKEGPSRRA 181
Query: 61 EDGA---------AEASKK-----KKRKKKKKKAGAGEAAADVPI-DGNGIAEQLEQCGL 105
+ A E S K KK KK +K D + D + E
Sbjct: 182 KKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRKWDLDGDEDDDAVLDYSAPDANDEN--- 238
Query: 106 NGNFPVGQEMEYPTLKDGRTGKDRF 130
+E++ T K F
Sbjct: 239 ADAPEDVEEVD-QESWGRGTMKGDF 262
>gnl|CDD|232910 TIGR00298, TIGR00298, 2-phosphosulfolactate phosphatase.
2-phosphosulfolactate phosphatase catalyzes the
sulfonation of phosphoenolpyruvate to form
2-phospho-3-sulfolactate, the second step in coenzyme M
biosynthesis. Coenzyme M is the terminal methyl carrier
in methanogenesis [Biosynthesis of cofactors, prosthetic
groups, and carriers, Other, Energy metabolism,
Methanogenesis].
Length = 216
Score = 29.7 bits (67), Expect = 3.4
Identities = 25/80 (31%), Positives = 31/80 (38%), Gaps = 11/80 (13%)
Query: 174 IEIVEELEKYSRLLIGEDGLKAGLAFPTGCSRNHCAAHYTPNNGDKT---------VLEF 224
IE EL+K + + GE K F G S + NNGD VLE
Sbjct: 30 IEEAMELKKENAIKAGERNGKKIEGFDVGNSPLEVLNY--KNNGDNLVLTTTNGTRVLES 87
Query: 225 DDITKIDFGTHINGEEVAKK 244
I G+ N + VAKK
Sbjct: 88 IKSDNILVGSINNAKAVAKK 107
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 30.4 bits (68), Expect = 3.6
Identities = 18/94 (19%), Positives = 30/94 (31%), Gaps = 18/94 (19%)
Query: 2 LIQKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIP---VSQLYPDDDEIEDG 58
L E KK ++ K + S P S+L ++D D
Sbjct: 56 LAVDEEPDENGAVSKKKPTRS-------VKRATKKTVVEISEPLEEGSELVVNEDAALD- 107
Query: 59 DVEDGAAEASKKKKRKKKKKKAGAGEAAADVPID 92
+ SKK R+ ++K A A + +
Sbjct: 108 -------KESKKTPRRTRRKAAAASSDVEEEKTE 134
>gnl|CDD|226291 COG3768, COG3768, Predicted membrane protein [Function unknown].
Length = 350
Score = 30.1 bits (68), Expect = 3.7
Identities = 14/58 (24%), Positives = 23/58 (39%)
Query: 26 TEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAG 83
E APK P++ + + D ++ ED + EA + + K GAG
Sbjct: 14 VEAAPKFRAPREFDEDAENFTPAELDVEDPEDEGAAEAPVEAPLRPRSSFWKIMLGAG 71
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 29.6 bits (67), Expect = 4.2
Identities = 19/72 (26%), Positives = 25/72 (34%), Gaps = 6/72 (8%)
Query: 7 EEVAKKKKKKKNKN--KAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGA 64
EE AKKKKKKK K K A A T S Q+ + +
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAP-- 235
Query: 65 AEASKKKKRKKK 76
++K + K
Sbjct: 236 --DNEKSEVYKS 245
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 29.0 bits (65), Expect = 4.8
Identities = 12/40 (30%), Positives = 19/40 (47%)
Query: 47 QLYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAGEAA 86
L+P++ E E +V E K +K+KK+K A
Sbjct: 89 NLFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAE 128
>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor. This family is secreted by
gram-negative Gammaproteobacteria such as Pseudomonas
syringae of tomato and the fire blight plant pathogen
Erwinia amylovora, amongst others. It is an essential
pathogenicity factor of approximately 198 kDa. Its
injection into the host-plant is dependent upon the
bacterial type III or Hrp secretion system. The family
is long and carries a number of predicted functional
regions, including an ERMS or endoplasmic reticulum
membrane retention signal at both the C- and the
N-termini, a leucine-zipper motif from residues 539-560,
and a nuclear localisation signal at 1358-1361. this
conserved AvrE-family of effectors is among the few that
are required for full virulence of many phytopathogenic
pseudomonads, erwinias and pantoeas.
Length = 1771
Score = 30.1 bits (68), Expect = 4.8
Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 7/120 (5%)
Query: 2 LIQKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVE 61
L +G++ KK K K+ A + AP + P + + + D E
Sbjct: 43 LASEGKKNRKKMPKVFQKSSAPRQIQAAPPQALNPTAAAPQSSRGPTLRELLALPEDDGE 102
Query: 62 DGAAEASKKKKR-------KKKKKKAGAGEAAADVPIDGNGIAEQLEQCGLNGNFPVGQE 114
A E+S +R + + + AG D + L + + PV +E
Sbjct: 103 TQAPESSPSARRLTRSEGVARHEMEDLAGRPVVKPDADRQLRQDILNKSSSSRRPPVSKE 162
>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
Length = 694
Score = 29.9 bits (67), Expect = 5.3
Identities = 13/57 (22%), Positives = 17/57 (29%)
Query: 35 PQQTTPPSIPVSQLYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAGEAAADVPI 91
PQ T PP I V L + D + A G + VP+
Sbjct: 303 PQSTIPPRIVVPWLPDTEGREGDDLAPTAVCTDAGLLPDTPLLPDAPEGSSDPVVPV 359
>gnl|CDD|182186 PRK09990, PRK09990, DNA-binding transcriptional regulator GlcC;
Provisional.
Length = 251
Score = 29.3 bits (66), Expect = 5.4
Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 5/54 (9%)
Query: 134 EKKAIDRLHNDMYNE--ARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSR 185
+KK IDR H +YN R +A R R H+ + EI +E ++ R
Sbjct: 194 QKKQIDRQHARLYNAVLQRLPEQAQRAARDHIRTVRD---NLREIEQEEQRLVR 244
Score = 29.3 bits (66), Expect = 5.4
Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 5/54 (9%)
Query: 311 EKKAIDRLHNDMYNE--ARQAAEAHRQVRKHMMGWIKPGMTMIEIVEELEKYSR 362
+KK IDR H +YN R +A R R H+ + EI +E ++ R
Sbjct: 194 QKKQIDRQHARLYNAVLQRLPEQAQRAARDHIRTVRD---NLREIEQEEQRLVR 244
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 28.8 bits (65), Expect = 6.3
Identities = 19/85 (22%), Positives = 32/85 (37%), Gaps = 13/85 (15%)
Query: 4 QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDV--- 60
++ + A+KK+K+K K A + + S+ E +G++
Sbjct: 14 KREQRKARKKQKRKEAKKKEDAQKSE------AEEVKNEENKSKKKAAPIENAEGNIVFS 67
Query: 61 ----EDGAAEASKKKKRKKKKKKAG 81
DG K +KKKKKK
Sbjct: 68 KVEFADGEQAKKDLKLKKKKKKKKT 92
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 29.2 bits (65), Expect = 7.3
Identities = 22/81 (27%), Positives = 33/81 (40%), Gaps = 20/81 (24%)
Query: 6 GEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVE---- 61
EE +KKKKKK KNK ++ K G D+ + +DGD E
Sbjct: 237 DEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDA------------DEYDSDDGDDEGREE 284
Query: 62 ----DGAAEASKKKKRKKKKK 78
D +A + ++R+ K
Sbjct: 285 DYISDSSASGNDPEEREDKLS 305
>gnl|CDD|222793 PHA00437, PHA00437, tail assembly protein.
Length = 94
Score = 27.6 bits (61), Expect = 7.3
Identities = 21/81 (25%), Positives = 32/81 (39%), Gaps = 4/81 (4%)
Query: 2 LIQKGEEVAKKKKKKKNKN--KAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGD 59
L +K ++ KK K +K A A K+ P P + + + D +
Sbjct: 3 LFKKIKKAVKKVTKSVSKPVVGAVAGALGGGKDAPPVVVAAAPPPAQIVEVPEKD--DVE 60
Query: 60 VEDGAAEASKKKKRKKKKKKA 80
ED A S KKK + KK+
Sbjct: 61 TEDEAQTESGKKKARAGGKKS 81
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 29.2 bits (65), Expect = 7.4
Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 1/84 (1%)
Query: 4 QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDG 63
++G E +++K K KAAAA + Q+ V++ + D+ +
Sbjct: 113 REGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTE-EVTEEEEETDKEKAKAKAAA 171
Query: 64 AAEASKKKKRKKKKKKAGAGEAAA 87
AA+A K+K +AG G
Sbjct: 172 AAKAKAAALAKQKAAEAGEGTEEV 195
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 28.1 bits (63), Expect = 7.7
Identities = 10/30 (33%), Positives = 15/30 (50%)
Query: 58 GDVEDGAAEASKKKKRKKKKKKAGAGEAAA 87
+ + G + K+KK +A A EAAA
Sbjct: 111 AEADGGPTAEATTPKKKKAAAEAAAAEAAA 140
Score = 28.1 bits (63), Expect = 9.7
Identities = 10/27 (37%), Positives = 13/27 (48%)
Query: 64 AAEASKKKKRKKKKKKAGAGEAAADVP 90
AEA+ KK+K + A A AA
Sbjct: 118 TAEATTPKKKKAAAEAAAAEAAAPAAE 144
>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
partitioning].
Length = 264
Score = 29.0 bits (65), Expect = 7.8
Identities = 12/76 (15%), Positives = 29/76 (38%), Gaps = 2/76 (2%)
Query: 4 QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDG 63
Q + +++ + +++ + AT + P+ PV+ P + +E +
Sbjct: 117 QPEQARIEEQPRTQSQKAQSQATTVQTQPVKPKPRPEKPQPVAPA-PAPEPVEK-APKAE 174
Query: 64 AAEASKKKKRKKKKKK 79
AA K K + +
Sbjct: 175 AAPPPKPKAEDAAETR 190
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to
be DNA-binding and may be a transcription factor.
Length = 238
Score = 28.9 bits (65), Expect = 8.2
Identities = 12/30 (40%), Positives = 19/30 (63%)
Query: 51 DDDEIEDGDVEDGAAEASKKKKRKKKKKKA 80
+DDE E D E+G E ++++ KKKK+
Sbjct: 62 EDDEPESDDEEEGEKELQREERLKKKKRVK 91
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 29.0 bits (65), Expect = 8.5
Identities = 24/118 (20%), Positives = 36/118 (30%), Gaps = 34/118 (28%)
Query: 6 GEEVAKKKKKKKNKNK----------AAAATEEAPKNGVPQQTT-----------PPSIP 44
E V KK KKK K K E + P
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKR 329
Query: 45 VSQLYPDDDEIEDGDVEDGAAEASKKKKRKKKKKKAGAGEAAADVPIDGNGIAEQLEQ 102
+ DD +D +D A +K+++ +KK+K E IA Q+ +
Sbjct: 330 KEEQEDDDFVEDD---DDLQASLAKQRRLAQKKRKKLRPED----------IARQIAE 374
>gnl|CDD|176005 cd04040, C2D_Tricalbin-like, C2 domain fourth repeat present in
Tricalbin-like proteins. 5 to 6 copies of the C2 domain
are present in Tricalbin, a yeast homolog of
Synaptotagmin, which is involved in membrane trafficking
and sorting. C2 domains fold into an 8-standed
beta-sandwich that can adopt 2 structural arrangements:
Type I and Type II, distinguished by a circular
permutation involving their N- and C-terminal beta
strands. Many C2 domains are Ca2+-dependent
membrane-targeting modules that bind a wide variety of
substances including bind phospholipids, inositol
polyphosphates, and intracellular proteins. Most C2
domain proteins are either signal transduction enzymes
that contain a single C2 domain, such as protein kinase
C, or membrane trafficking proteins which contain at
least two C2 domains, such as synaptotagmin 1. However,
there are a few exceptions to this including RIM
isoforms and some splice variants of piccolo/aczonin and
intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. This cd contains the fifth C2
repeat, C2E, and has a type-II topology.
Length = 115
Score = 27.5 bits (62), Expect = 8.7
Identities = 9/18 (50%), Positives = 12/18 (66%)
Query: 235 HINGEEVAKKKKKKKNKN 252
++NGE+V K K KK N
Sbjct: 27 YLNGEKVFKTKTIKKTLN 44
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 29.0 bits (65), Expect = 9.0
Identities = 15/84 (17%), Positives = 26/84 (30%), Gaps = 18/84 (21%)
Query: 5 KGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDGA 64
K E A K + K KA A ++ + + + + A
Sbjct: 186 KAEAEAAAKAAAEAKKKAEAEAKKKAAA------------------EAKKKAAAEAKAAA 227
Query: 65 AEASKKKKRKKKKKKAGAGEAAAD 88
A+A+ + K +K A A
Sbjct: 228 AKAAAEAKAAAEKAAAAKAAEKAA 251
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 29.2 bits (65), Expect = 9.3
Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 2/42 (4%)
Query: 243 KKKKKKKNKNKAAAATEEAPKNGIPQQTTP--PSIPVSQLYP 282
KK KKKK+ K A + P ++ P+S++ P
Sbjct: 124 KKNKKKKSAKKKEAHKAQIPPEQQMEEKPSADAYEPLSRVIP 165
>gnl|CDD|215527 PLN02976, PLN02976, amine oxidase.
Length = 1713
Score = 29.1 bits (65), Expect = 9.6
Identities = 19/75 (25%), Positives = 35/75 (46%), Gaps = 7/75 (9%)
Query: 19 KNKAAAATEEAPKNGVPQQTTPPSI--PVSQLYPDDD-EIEDGDVEDGAAEASKKKKRKK 75
++ + + + V +P SI ++ Y +D + D +++DG ++ RK
Sbjct: 333 EDSPSHDIVGS--HPVCNGASPVSITPEENESYVEDAVSLPDSEIKDGKLSKLQRVTRKA 390
Query: 76 KKKKAG--AGEAAAD 88
KK+K G A E AD
Sbjct: 391 KKRKLGDMAYEGDAD 405
>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
Length = 763
Score = 29.1 bits (65), Expect = 9.7
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 58 GDVEDGAAEASKKKKRKKKKKKAGAGEAA 86
+ E+ A ++ KK K+ KKKK G G +A
Sbjct: 238 AEAENEAGKSDKKDKKSKKKKVLGKGTSA 266
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 28.6 bits (64), Expect = 9.8
Identities = 26/83 (31%), Positives = 32/83 (38%), Gaps = 17/83 (20%)
Query: 4 QKGEEVAKKKKKKKNKNKAAAATEEAPKNGVPQQTTPPSIPVSQLYPDDDEIEDGDVEDG 63
QK E AK K+ + K KA A E+ K +Q E+
Sbjct: 118 QKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQA-----------------EEEAKAKA 160
Query: 64 AAEASKKKKRKKKKKKAGAGEAA 86
AAEA KK KKK +A A A
Sbjct: 161 AAEAKKKAAEAKKKAEAEAKAKA 183
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.133 0.388
Gapped
Lambda K H
0.267 0.0803 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 33,199,041
Number of extensions: 3329531
Number of successful extensions: 5350
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4931
Number of HSP's successfully gapped: 181
Length of query: 635
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 532
Effective length of database: 6,369,140
Effective search space: 3388382480
Effective search space used: 3388382480
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 62 (27.5 bits)