RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5250
(1102 letters)
>gnl|CDD|189008 cd09601, M1_APN_2, Peptidase M1 Aminopeptidase N family incudes
tricorn interacting factor F3, Endoplasmic reticulum
aminopeptidase 1 (ERAP1), Aminopeptidase Q (APQ). This
M1 peptidase family includes eukaryotic and bacterial
members: aminopeptidase N (APN), aminopeptidase Q (APQ,
laeverin), endoplasmic reticulum aminopeptidase 1
(ERAP1) as well as tricorn interacting factor F3.
Aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease,
consists of a small N-terminal cytoplasmic domain, a
single transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and is present in a variety
of human tissues and cell types (leukocyte, fibroblast,
endothelial and epithelial cells). APN expression is
dysregulated in inflammatory diseases such as chronic
pain, rheumatoid arthritis, multiple sclerosis, systemic
sclerosis, systemic lupus erythematosus,
polymyositis/dermatomyosytis and pulmonary sarcoidosis,
and is enhanced in tumor cells such as melanoma, renal,
prostate, pancreas, colon, gastric and thyroid cancers.
It is considered a marker of differentiation since it is
predominantly expressed on stem cells and on cells of
the granulocytic and monocytic lineages at distinct
stages of differentiation. Thus, APN inhibition may lead
to the development of anti-cancer and anti-inflammatory
drugs. ERAP1 also known as endoplasmic reticulum
aminopeptidase associated with antigen processing
(ERAAP), adipocyte derived leucine aminopeptidase
(A-LAP) or aminopeptidase regulating tumor necrosis
factor receptor I (THFRI) shedding (ARTS-1), associates
with the closely related ER aminopeptidase ERAP2, for
the final trimming of peptides within the ER for
presentation by MHC class I molecules. ERAP1 is
associated with ankylosing spondylitis (AS), an
inflammatory arthritis that predominantly affects the
spine. ERAP1 also aids in the shedding of membrane-bound
cytokine receptors. The tricorn interacting factor F3,
together with factors F1 and F2, degrades the tricorn
protease products, producing free amino acids, thus
completing the proteasomal degradation pathway. F3 is
homologous to F2, but not F1, and shows a strong
preference for glutamate in the P1' position. APQ, also
known as laeverin, is specifically expressed in human
embryo-derived extravillous trophoblasts (EVTs) that
invade the uterus during early placentation. It cleaves
the N-terminal amino acid of various peptides such as
angiotensin III, endokinin C, and kisspeptin-10, all
expressed in the placenta in large quantities. APN is a
receptor for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs are also putative Cry toxin
receptors. Cry1 proteins are pore-forming toxins that
bind to the midgut epithelial cell membrane of
susceptible insect larvae, causing extensive damage.
Several different toxins, including Cry1Aa, Cry1Ab,
Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to
bind to APNs; however, a direct role of APN in
cytotoxicity has been yet to be firmly established.
Length = 446
Score = 243 bits (622), Expect = 8e-71
Identities = 89/142 (62%), Positives = 107/142 (75%)
Query: 150 DDEDRYMAVTQFELTDARRCFPCWDEPAVKAKFSISLSVPNNKVALSNMPVKSESPQPDG 209
E RY+A TQFE TDARR FPC+DEPA KA F+I+++ P ALSNMPV+SE DG
Sbjct: 110 GGETRYLAATQFEPTDARRAFPCFDEPAFKATFTITITHPAGYTALSNMPVESEEVLGDG 169
Query: 210 HRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASKVL 269
+ +FET+P MSTYLVA VVG+FDYVE T+ +GV VRVY GK EQG +AL VA K+L
Sbjct: 170 WKTTEFETTPPMSTYLVAFVVGDFDYVEGTTKNGVPVRVYARPGKIEQGDYALEVAPKIL 229
Query: 270 PFYKDYFNIAYPLPKIDLVAIP 291
F++DYF I YPLPK+DLVAIP
Sbjct: 230 EFFEDYFGIPYPLPKLDLVAIP 251
Score = 205 bits (525), Expect = 1e-57
Identities = 64/101 (63%), Positives = 80/101 (79%)
Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
MENWGL+TYRE LL D + +SA +Q +A VV HELAHQWFGNLVTM+WW LWLNEG+
Sbjct: 258 MENWGLITYRETALLYDPKTSSASNKQRVATVVAHELAHQWFGNLVTMKWWDDLWLNEGF 317
Query: 756 ASFVEFLCVHHLFPEYDIWTQFVTDNLVRALELDALKSSHP 796
A+++E+L V HL PE+++W QFV D+L AL LD+L SSHP
Sbjct: 318 ATYMEYLGVDHLEPEWNMWDQFVLDDLQSALALDSLASSHP 358
Score = 158 bits (403), Expect = 1e-41
Identities = 62/157 (39%), Positives = 84/157 (53%), Gaps = 5/157 (3%)
Query: 11 FKIVSETKELKLHVIDLDFKKVQLELADGKV-LTPETKISTEDETITLTFSETLPV--GE 67
++ T E+ LH DL + + G V + S E+E + +T E L + G
Sbjct: 24 LEVTEPTNEIVLHAKDLTITSATVTVGGGNVDILVTITYSEENEFLVITLDEPLTLAGGN 83
Query: 68 VKLLFEYVGELNDKMKGFYRSKYRSVLHDEDRYMAVTQFCSTDARRCFPCWDEPAVKAKF 127
L E+ GELND ++GFYRS Y E RY+A TQF TDARR FPC+DEPA KA F
Sbjct: 84 YTLTIEFTGELNDDLRGFYRSSYT-DNGGETRYLAATQFEPTDARRAFPCFDEPAFKATF 142
Query: 128 AISLSVPSSKVALSNMVRIAILDDEDRYMAVTQFELT 164
I+++ P+ ALSNM + D + T+FE T
Sbjct: 143 TITITHPAGYTALSNMPVESEEVLGDGWK-TTEFETT 178
Score = 152 bits (386), Expect = 2e-39
Identities = 64/130 (49%), Positives = 79/130 (60%), Gaps = 24/130 (18%)
Query: 481 SMTHEDNYTVWITICNCLQKIDLLLSNTEYHHLFYQFGPVKSESPQPDGHRLLQFETSPI 540
++TH YT LSN PV+SE DG + +FET+P
Sbjct: 145 TITHPAGYTA--------------LSNM----------PVESEEVLGDGWKTTEFETTPP 180
Query: 541 MSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAY 600
MSTYLVA VVG+FDYVE T+ +GV VRVY GK EQG +AL VA K+L F++DYF I Y
Sbjct: 181 MSTYLVAFVVGDFDYVEGTTKNGVPVRVYARPGKIEQGDYALEVAPKILEFFEDYFGIPY 240
Query: 601 PLPKIDLVAL 610
PLPK+DLVA+
Sbjct: 241 PLPKLDLVAI 250
Score = 68.7 bits (169), Expect = 6e-12
Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 15/58 (25%)
Query: 337 PVGHPSEVEEIFDDISYNKGASIIRMLQKYIGDGV---------------NSSSDSLW 379
PV P E+ EIFD ISY+KGAS++RML+ ++G+ V N+++D LW
Sbjct: 362 PVETPDEISEIFDAISYSKGASVLRMLEHFLGEEVFRKGLRNYLKKHAYGNATTDDLW 419
Score = 68.7 bits (169), Expect = 7e-12
Identities = 19/33 (57%), Positives = 27/33 (81%)
Query: 291 PVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 323
PV P E+ EIFD ISY+KGAS++RML+ ++G+
Sbjct: 362 PVETPDEISEIFDAISYSKGASVLRMLEHFLGE 394
>gnl|CDD|216501 pfam01433, Peptidase_M1, Peptidase family M1. Members of this
family are aminopeptidases. The members differ widely in
specificity, hydrolysing acidic, basic or neutral
N-terminal residues. This family includes leukotriene-A4
hydrolase, this enzyme also has an aminopeptidase
activity.
Length = 390
Score = 216 bits (551), Expect = 6e-62
Identities = 103/285 (36%), Positives = 138/285 (48%), Gaps = 58/285 (20%)
Query: 10 YFKIVSETKELKLHVIDLDFKKVQLELADGKVLTPETKISTEDET--ITLTFSETLPVG- 66
+ + T E+ LH DL+ V L V + +DE + + + TL G
Sbjct: 31 TLQAKAATNEIVLHSKDLEITSVTLRGEPVSVNNLISVFQLDDEDEFLVINLASTLQAGQ 90
Query: 67 EVKLLFEYVGELNDKMKGFYRSKYRSVLHDEDRYMAVTQFCSTDARRCFPCWDEPAVKAK 126
L EY G+LND M+GF YRS +Y+ T
Sbjct: 91 PYTLEIEYEGKLNDDMRGF----YRS------QYLDQT---------------------- 118
Query: 127 FAISLSVPSSKVALSNMVRIAILDDEDRYMAVTQFELTDARRCFPCWDEPAVKAKFSISL 186
+ E + MA TQFE TDARR FPC+DEP+VKA F I++
Sbjct: 119 -----------------------NGEKKPMATTQFEPTDARRAFPCFDEPSVKATFDITI 155
Query: 187 SVPNNKVALSNMPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLV 246
+ P + ALSNMP P DG + +FET+P MSTYL+A VG+ +Y+E + GV V
Sbjct: 156 NHPADYTALSNMPEIESEPLDDGRVITEFETTPKMSTYLLAFAVGDLEYLETKTKSGVPV 215
Query: 247 RVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAIP 291
RVY G GQ+AL V K+L F++DYF YPLPK+D VA+P
Sbjct: 216 RVYARPGAINAGQYALEVTQKLLEFFEDYFGFPYPLPKLDQVALP 260
Score = 185 bits (471), Expect = 4e-51
Identities = 62/123 (50%), Positives = 83/123 (67%), Gaps = 3/123 (2%)
Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
MENWGL+TYRE LL D N+++ +Q +A V+ HELAHQWFGNLVTM+WW LWLNEG+
Sbjct: 267 MENWGLITYREPALLYDPGNSTSSDKQRVAEVIAHELAHQWFGNLVTMDWWDDLWLNEGF 326
Query: 756 ASFVEFLCVHHLFPEYDIWTQFVTDNLVRALELDALKSSHPTQFVTDNLVRALELDALKS 815
A+++E+L L PE+ I QFV + AL LD+L SSHP +T N+ E+D +
Sbjct: 327 ATYMEYLITDALEPEWRIEEQFVLREVQSALALDSLDSSHP---ITVNVNTPSEIDDIFD 383
Query: 816 SHP 818
+
Sbjct: 384 AIS 386
Score = 120 bits (303), Expect = 5e-29
Identities = 48/109 (44%), Positives = 62/109 (56%), Gaps = 9/109 (8%)
Query: 518 GPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQ 577
P P DG + +FET+P MSTYL+A VG+ +Y+E + GV VRVY G
Sbjct: 167 MPEIESEPLDDGRVITEFETTPKMSTYLLAFAVGDLEYLETKTKSGVPVRVYARPGAINA 226
Query: 578 GQFALHVASKVLPFYKDYFNIAYPLPKIDLVAL---------NPGTVGY 617
GQ+AL V K+L F++DYF YPLPK+D VAL N G + Y
Sbjct: 227 GQYALEVTQKLLEFFEDYFGFPYPLPKLDQVALPDFSAGAMENWGLITY 275
Score = 36.5 bits (85), Expect = 0.076
Identities = 15/29 (51%), Positives = 18/29 (62%), Gaps = 2/29 (6%)
Query: 330 LRPSFQI--PVGHPSEVEEIFDDISYNKG 356
L S I V PSE+++IFD ISY KG
Sbjct: 362 LDSSHPITVNVNTPSEIDDIFDAISYEKG 390
Score = 34.2 bits (79), Expect = 0.33
Identities = 12/20 (60%), Positives = 15/20 (75%)
Query: 291 PVGHPSEVEEIFDDISYNKG 310
V PSE+++IFD ISY KG
Sbjct: 371 NVNTPSEIDDIFDAISYEKG 390
>gnl|CDD|223385 COG0308, PepN, Aminopeptidase N [Amino acid transport and
metabolism].
Length = 859
Score = 143 bits (363), Expect = 8e-35
Identities = 53/102 (51%), Positives = 64/102 (62%), Gaps = 1/102 (0%)
Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
MENWGLVT+RE LL D + + +N+ V+ HELAHQWFGNLVTM+WW LWLNEG+
Sbjct: 279 MENWGLVTFREKYLLADPETATDSDYENVEEVIAHELAHQWFGNLVTMKWWDDLWLNEGF 338
Query: 756 ASFVEFLCVHHLFP-EYDIWTQFVTDNLVRALELDALKSSHP 796
A+F E L L + W F T AL D+L SSHP
Sbjct: 339 ATFREVLWSEDLGGRAWKRWEDFRTLRTSIALAEDSLPSSHP 380
Score = 139 bits (352), Expect = 1e-33
Identities = 67/239 (28%), Positives = 101/239 (42%), Gaps = 29/239 (12%)
Query: 89 KYRSVLHDEDRYMAVTQFCSTDA-----RRCFPCWDEPAVKAKFAISLSVPSSKVALSNM 143
+ RSV D A + DA P E + P S L +
Sbjct: 67 EIRSVKIDGKALTAWYRL-DGDALTITVAPPIPERSERPFTLAITYEFTGPVSNDTLEGL 125
Query: 144 VRIAILDDEDRYMAVTQFELTDARRCFPCWDEPAVKAKFSISLSVPNNKVALSNMPVKSE 203
R + +TQ E ARR FPC DEP VKA F++++ +SN +
Sbjct: 126 YRSG---YGGKPYLITQCEAEGARRIFPCIDEPDVKATFTLTIRADKGPKLISNGNLIDG 182
Query: 204 SPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEE---TSSDGVLVRVYTPVGKREQGQF 260
DG ++++FE +P M TYL A+V G+ + + T S V + +Y P G ++ ++
Sbjct: 183 GTLVDGRKIVKFEDTPPMPTYLFALVAGDLEVFRDKFDTRSRDVPLEIYVPPGVLDRAKY 242
Query: 261 ALHVASKVLPFYKDYFNIAYPLPKIDLVAIPVGHPSEVEEIFDDISY----NKGASIIR 315
AL + + FY++YF + Y LP D+VA+P D S N G R
Sbjct: 243 ALDETKRSIEFYEEYFGLPYALPI-DIVAVP------------DFSAGAMENWGLVTFR 288
Score = 84.5 bits (209), Expect = 2e-16
Identities = 46/161 (28%), Positives = 69/161 (42%), Gaps = 19/161 (11%)
Query: 3 EKKPFERYFKIV------SETKELKLHVIDLDFKKVQLELADGKVLTPETKISTEDETIT 56
EK FE I S L L + L+ + V++ DGK LT ++ + TIT
Sbjct: 36 EKTTFEGSVTIRLDAGWRSGADPLVLDAVGLEIRSVKI---DGKALTAWYRLDGDALTIT 92
Query: 57 LTFSETLPV---GEVKLLFEYVGEL-NDKMKGFYRSKYRSVLHDEDRYMAVTQFCSTDAR 112
+ + + +E+ G + ND ++G YRS Y + +TQ + AR
Sbjct: 93 VAPPIPERSERPFTLAITYEFTGPVSNDTLEGLYRSGY------GGKPYLITQCEAEGAR 146
Query: 113 RCFPCWDEPAVKAKFAISLSVPSSKVALSNMVRIAILDDED 153
R FPC DEP VKA F +++ +SN I D
Sbjct: 147 RIFPCIDEPDVKATFTLTIRADKGPKLISNGNLIDGGTLVD 187
Score = 83.7 bits (207), Expect = 3e-16
Identities = 34/113 (30%), Positives = 58/113 (51%), Gaps = 13/113 (11%)
Query: 518 GPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEE---TSSDGVLVRVYTPVGK 574
G + DG ++++FE +P M TYL A+V G+ + + T S V + +Y P G
Sbjct: 177 GNLIDGGTLVDGRKIVKFEDTPPMPTYLFALVAGDLEVFRDKFDTRSRDVPLEIYVPPGV 236
Query: 575 REQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAL---------NPGTVGYY 618
++ ++AL + + FY++YF + Y LP D+VA+ N G V +
Sbjct: 237 LDRAKYALDETKRSIEFYEEYFGLPYALPI-DIVAVPDFSAGAMENWGLVTFR 288
Score = 63.7 bits (155), Expect = 4e-10
Identities = 43/215 (20%), Positives = 82/215 (38%), Gaps = 30/215 (13%)
Query: 288 VAIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGDGKYPILLRPSFQIPVGHPSEVEEI 347
+ + V P E+ + FD I Y KGAS++RML+ +G+ + L F+ G + ++
Sbjct: 381 IRVDVYDPKEINDFFDAIVYEKGASVLRMLETLLGEEAFRKGLSLYFKRHAGGNATTMDL 440
Query: 348 FDDISYNKGASIIRMLQKYI-----------GDGVNS-------------SSDSLWYVPL 383
+ + G + + ++ + W +PL
Sbjct: 441 WKALEDASGKDLSAFFESWLSQAGYPVLTVSVRYDDFFKLTQKQFTPPGQEEKRPWPIPL 500
Query: 384 SFCTQANPSEEVFSAEMSTRVTQVTIPDVSPGHWI--KLNPGTVGYYRVKYPRETLAQFI 441
+ +V ++ VT V + K+N +YRV Y ++L++ +
Sbjct: 501 AIKLLDGGGVKV--LLLTEGEQTVTFELVGIPPFPSLKVNDSAPVFYRVDYSDQSLSKLL 558
Query: 442 PSVEDKSIPPLDRLSLLDDLFALAQVGEVSLVEVL 476
D + RL+L+ D AL G+ S + L
Sbjct: 559 QH--DPRLEAAQRLALVADRRALTAAGKGSAEDKL 591
Score = 53.3 bits (128), Expect = 7e-07
Identities = 15/47 (31%), Positives = 26/47 (55%)
Query: 325 KYPILLRPSFQIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGDGV 371
+ + ++ V P E+ + FD I Y KGAS++RML+ +G+
Sbjct: 372 EDSLPSSHPIRVDVYDPKEINDFFDAIVYEKGASVLRMLETLLGEEA 418
Score = 46.7 bits (111), Expect = 7e-05
Identities = 31/145 (21%), Positives = 51/145 (35%), Gaps = 8/145 (5%)
Query: 819 IELYQNSDMQEEKDRISRSFSALKDPELLRKVL-DFSMSDLVRAQDSVFVIISAAQTKTG 877
+E YQ+ E + +F L S D V QD +I +A +
Sbjct: 716 LEAYQSPTRAEALRDFADAFGRFPLVMDKWFALQAISPGDTVLEQDIGLMIHAAFEAPNP 775
Query: 878 RELAWEFLK---NNYATFTERYKGG--LLGRLVKHTTENFASESHAQEVTEFFTKNPTSW 932
E W + N A G LG +V F + +++ EF
Sbjct: 776 NEARWLYGTFAFENPALLHALDGSGYRFLGVIVL-ELNIFNPQLALRKIEEFLELKKLKA 834
Query: 933 IERTVQQSVETIRLNSECLKRDGEA 957
+ER +++++ETI L +D
Sbjct: 835 LERLIREALETIAARER-LSKDLAE 858
Score = 43.6 bits (103), Expect = 7e-04
Identities = 30/145 (20%), Positives = 49/145 (33%), Gaps = 8/145 (5%)
Query: 957 ALYQNSDMQEEKDRISRSFSALKDPELLRKVL-DFSMSDLVRAQDSVFVIISAAQTKTGR 1015
YQ+ E + +F L S D V QD +I +A +
Sbjct: 717 EAYQSPTRAEALRDFADAFGRFPLVMDKWFALQAISPGDTVLEQDIGLMIHAAFEAPNPN 776
Query: 1016 ELAWDFLK---NNYATFTERYKGG--LLGRLVKHTTENFASESHAQEVTEFFTKNPTSWI 1070
E W + N A G LG +V F + +++ EF +
Sbjct: 777 EARWLYGTFAFENPALLHALDGSGYRFLGVIVL-ELNIFNPQLALRKIEEFLELKKLKAL 835
Query: 1071 ERTVQQSVETIRLNSECLKRDGEAV 1095
ER +++++ETI L +D
Sbjct: 836 ERLIREALETIAARER-LSKDLAEK 859
>gnl|CDD|189009 cd09602, M1_APN_3, Peptidase M1 family containing Aminopeptidase N.
This family contains bacterial and eukaryotic
aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease
belonging to the M1 gluzincin family. APN consists of a
small N-terminal cytoplasmic domain, a single
transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and, in higher eukaryotes,
is present in a variety of human tissues and cell types
(leukocyte, fibroblast, endothelial and epithelial
cells). APN expression is dysregulated in inflammatory
diseases such as chronic pain, rheumatoid arthritis,
multiple sclerosis, systemic sclerosis, systemic lupus
erythematosus, polymyositis/dermatomyosytis and
pulmonary sarcoidosis, and is enhanced in tumor cells
such as melanoma, renal, prostate, pancreas, colon,
gastric and thyroid cancers. It is predominantly
expressed on stem cells and on cells of the granulocytic
and monocytic lineages at distinct stages of
differentiation, thus considered a marker of
differentiation. Thus, APN inhibition may lead to the
development of anti-cancer and anti-inflammatory drugs.
APNs are also present in many pathogenic bacteria and
represent potential drug targets, Some APNs have been
used commercially, such as one from Lactococcus lactis
used in the food industry. APN also serves as a receptor
for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs have also been extensively studied
as putative Cry toxin receptors. Cry1 proteins are
pore-forming toxins that bind to the midgut epithelial
cell membrane of susceptible insect larvae, causing
extensive damage. Several different toxins, including
Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
been shown to bind to APNs; however, a direct role of
APN in cytotoxicity has been yet to be firmly
established.
Length = 438
Score = 112 bits (283), Expect = 4e-26
Identities = 40/101 (39%), Positives = 55/101 (54%), Gaps = 2/101 (1%)
Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
MEN G VT+RE + + T+A R A + HE+AH WFG+LVTM+WW LWLNE +
Sbjct: 256 MENPGCVTFREN-YVFREEVTTAQ-RLRRANTIAHEMAHMWFGDLVTMKWWNDLWLNESF 313
Query: 756 ASFVEFLCVHHLFPEYDIWTQFVTDNLVRALELDALKSSHP 796
A ++ + + P D W F D A D ++HP
Sbjct: 314 AEYMAYKALSEATPFTDAWLTFFLDRKQWAYRADQTPTTHP 354
Score = 102 bits (256), Expect = 1e-22
Identities = 48/154 (31%), Positives = 78/154 (50%), Gaps = 11/154 (7%)
Query: 145 RIAILDDEDRYMAVTQFELTDARRCFPCWDEPAVKAKFSISLSVPNNKVALSNMPVKSES 204
R D Y+ TQFE DARR FPC+D+P +KA F+++++ P + +SN +S
Sbjct: 100 RFVDPADGKTYL-YTQFEPDDARRVFPCFDQPDLKAPFTLTVTAPKDWTVISNTAATEQS 158
Query: 205 PQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHV 264
+G+ +F +P +STYL A V G + + + DG+ + +Y + Q
Sbjct: 159 TIRNGYVRWEFPETPPLSTYLFAFVAGPYHVISDKEHDGIPLGLYC---RESLAQALDRD 215
Query: 265 ASKVLP-------FYKDYFNIAYPLPKIDLVAIP 291
A ++ F+++YF I YP K D V +P
Sbjct: 216 ADEIFEITKQGLDFFEEYFGIPYPFGKYDQVFVP 249
Score = 52.2 bits (126), Expect = 9e-07
Identities = 28/115 (24%), Positives = 48/115 (41%), Gaps = 20/115 (17%)
Query: 519 PVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQG 578
+ + + R +F +P +STYL A V G + + + DG+ + +Y +
Sbjct: 154 ATEQSTIRNGYVRW-EFPETPPLSTYLFAFVAGPYHVISDKEHDGIPLGLYC---RESLA 209
Query: 579 QFALHVASKVLP-------FYKDYFNIAYPLPKIDLVAL---------NPGTVGY 617
Q A ++ F+++YF I YP K D V + NPG V +
Sbjct: 210 QALDRDADEIFEITKQGLDFFEEYFGIPYPFGKYDQVFVPEFNFGAMENPGCVTF 264
Score = 49.9 bits (120), Expect = 6e-06
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 288 VAIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 323
+A V + + FD I+Y KGAS+++ L +G+
Sbjct: 355 IAGDVENTEDALNNFDGITYAKGASVLKQLVALLGE 390
Score = 47.2 bits (113), Expect = 4e-05
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 337 PVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 369
V + + FD I+Y KGAS+++ L +G+
Sbjct: 358 DVENTEDALNNFDGITYAKGASVLKQLVALLGE 390
>gnl|CDD|233857 TIGR02412, pepN_strep_liv, aminopeptidase N, Streptomyces lividans
type. This family is a subset of the members of the
zinc metallopeptidase family M1 (pfam01433), with a
single member characterized in Streptomyces lividans 66
and designated aminopeptidase N. The spectrum of
activity may differ somewhat from the aminopeptidase N
clade of E. coli and most other Proteobacteria, well
separated phylogenetically within the M1 family. The M1
family also includes leukotriene A-4
hydrolase/aminopeptidase (with a bifunctional active
site).
Length = 831
Score = 100 bits (251), Expect = 2e-21
Identities = 45/102 (44%), Positives = 58/102 (56%), Gaps = 4/102 (3%)
Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
MEN G VT+ E L ++ T A ++N A V+ HE+AH WFG+LVTM WW LWLNE +
Sbjct: 261 MENAGCVTFAEN-FLHRAEATRA-EKENRAGVILHEMAHMWFGDLVTMRWWNDLWLNESF 318
Query: 756 ASFVEFLCVHHLFPEY-DIWTQFVTDNLVRALELDALKSSHP 796
A ++ L EY D WT F A E D L ++HP
Sbjct: 319 AEYMGTLASAEA-TEYTDAWTTFAAQGKQWAYEADQLPTTHP 359
Score = 78.3 bits (193), Expect = 1e-14
Identities = 40/135 (29%), Positives = 62/135 (45%), Gaps = 4/135 (2%)
Query: 159 TQFELTDARRCFPCWDEPAVKAKFSISLSVPNNKVALSNMPVKSESPQPDGHRLLQFETS 218
TQFE DARR F +D+P +KA F S+ P + +SN +P+P R +F +
Sbjct: 122 TQFEPADARRVFAVFDQPDLKANFKFSVKAPEDWTVISNSRETDVTPEPAD-RRWEFPET 180
Query: 219 PIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQ--GQFALHVASKVLPFYKDYF 276
P +STYL AV G + + + S + +Y + + + L F+ F
Sbjct: 181 PKLSTYLTAVAAGPY-HSVQDESRSYPLGIYARRSLAQYLDADAIFTITRQGLAFFHRKF 239
Query: 277 NIAYPLPKIDLVAIP 291
YP K D + +P
Sbjct: 240 GYPYPFKKYDQIFVP 254
Score = 39.8 bits (93), Expect = 0.009
Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 13/104 (12%)
Query: 523 ESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQ--GQF 580
+P+P R +F +P +STYL AV G + + + S + +Y +
Sbjct: 166 VTPEPAD-RRWEFPETPKLSTYLTAVAAGPY-HSVQDESRSYPLGIYARRSLAQYLDADA 223
Query: 581 ALHVASKVLPFYKDYFNIAYPLPKIDLVAL---------NPGTV 615
+ + L F+ F YP K D + + N G V
Sbjct: 224 IFTITRQGLAFFHRKFGYPYPFKKYDQIFVPEFNAGAMENAGCV 267
Score = 39.0 bits (91), Expect = 0.018
Identities = 14/43 (32%), Positives = 24/43 (55%), Gaps = 8/43 (18%)
Query: 335 QIPVGHP--------SEVEEIFDDISYNKGASIIRMLQKYIGD 369
Q+P HP ++ FD I+Y KGAS+++ L ++G+
Sbjct: 353 QLPTTHPIVADVADLADALSNFDGITYAKGASVLKQLVAWVGE 395
Score = 38.6 bits (90), Expect = 0.022
Identities = 13/45 (28%), Positives = 24/45 (53%)
Query: 282 LPKIDLVAIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGDGKY 326
LP + V ++ FD I+Y KGAS+++ L ++G+ +
Sbjct: 354 LPTTHPIVADVADLADALSNFDGITYAKGASVLKQLVAWVGEEAF 398
>gnl|CDD|221257 pfam11838, DUF3358, Domain of unknown function (DUF3358). This
domain is functionally uncharacterized. This domain is
found in eukaryotes and bacteria. This domain is found to
the C-terminus of an aminopeptidase domain.
Length = 323
Score = 92.4 bits (230), Expect = 5e-20
Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 6/132 (4%)
Query: 957 ALYQNSDMQEEKDRISRSFSALKDPELLRKVLDFSMSD-LVRAQDSVFVIISAAQT-KTG 1014
A Y+N + K+ R+ +A+ DPEL K L+F + D V QD V+ ++ + G
Sbjct: 192 AEYKNDPTADGKEAALRALAAVPDPELKAKALNFLLDDDSVSNQDIRAVMAGLRRSNQAG 251
Query: 1015 RELAWDFLKNNYATFTERY--KGGLLGRLVKHTTENFASESHAQEVTEFFTKNP--TSWI 1070
REL W +++ N+ ++ LGR+V F+S EV F + P T +
Sbjct: 252 RELLWPWVERNFDALAKKLPRGSSALGRVVGLYPSGFSSAELLDEVEAFLARKPKDTPGL 311
Query: 1071 ERTVQQSVETIR 1082
R++ Q+++TIR
Sbjct: 312 RRSLAQALDTIR 323
Score = 92.0 bits (229), Expect = 6e-20
Identities = 39/132 (29%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 820 ELYQNSDMQEEKDRISRSFSALKDPELLRKVLDFSMSD-LVRAQDSVFVIISAAQT-KTG 877
Y+N + K+ R+ +A+ DPEL K L+F + D V QD V+ ++ + G
Sbjct: 192 AEYKNDPTADGKEAALRALAAVPDPELKAKALNFLLDDDSVSNQDIRAVMAGLRRSNQAG 251
Query: 878 RELAWEFLKNNYATFTERY--KGGLLGRLVKHTTENFASESHAQEVTEFFTKNP--TSWI 933
REL W +++ N+ ++ LGR+V F+S EV F + P T +
Sbjct: 252 RELLWPWVERNFDALAKKLPRGSSALGRVVGLYPSGFSSAELLDEVEAFLARKPKDTPGL 311
Query: 934 ERTVQQSVETIR 945
R++ Q+++TIR
Sbjct: 312 RRSLAQALDTIR 323
Score = 77.7 bits (192), Expect = 3e-15
Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 1/90 (1%)
Query: 417 WIKLNPGTVGYYRVKYPRETLAQFIPSVEDKSIPPLDRLSLLDDLFALAQVGEVSLVEVL 476
+ LN +GYYRV+Y E+LA ++ + PLDR L D +AL + GE+ + L
Sbjct: 1 LVLLNDDDLGYYRVRYDPESLATLGEALARLKLDPLDRAGLWADAWALVRDGELPTRDFL 60
Query: 477 KMIQSMT-HEDNYTVWITICNCLQKIDLLL 505
++ + +E +Y V I L + L
Sbjct: 61 DLVLAFLPNETDYVVLSEILAQLGTLKSAL 90
Score = 47.7 bits (114), Expect = 2e-05
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 607 LVALNPGTVGYYRVKYPRETLAQFIPSVEDKSIPPLDRLSLLDDLFA 653
LV LN +GYYRV+Y E+LA ++ + PLDR L D +A
Sbjct: 1 LVLLNDDDLGYYRVRYDPESLATLGEALARLKLDPLDRAGLWADAWA 47
>gnl|CDD|189002 cd09595, M1, Peptidase M1 family contains aminopeptidase N and
leukotriene A4 hydrolase. M1 Peptidase family includes
aminopeptidase N (APN) and leukotriene A4 hydrolase
(LTA4H). All peptidases in this family bind a single
catalytic zinc ion which is tetrahedrally co-ordinated
by three amino acid ligands and a water molecule that
forms the nucleophile on activation during catalysis.
APN consists of a small N-terminal cytoplasmic domain, a
single transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and is present in a variety
of human tissues and cell types. APN expression is
dysregulated in many inflammatory diseases and is
enhanced in numerous tumor cells, making it a lead
target in the development of anti-cancer and
anti-inflammatory drugs. LTA4H is a bifunctional enzyme,
possessing an aminopeptidase as well as an epoxide
hydrolase activity. The two activities occupy different,
but overlapping sites. The activity and physiological
relevance of the aminopeptidase in LTA4H is as yet
unknown while the epoxide hydrolase converts leukotriene
A4 (LTA4) into leukotriene B4 (LTB4), a potent
chemotaxin that is fundamental to the inflammatory
response of mammals.
Length = 407
Score = 89.4 bits (222), Expect = 1e-18
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 7/104 (6%)
Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
MEN GL+ + + LL + + V+ HELAHQWFGNLVT W LWLNEG+
Sbjct: 244 MENPGLIFFSQSLLLAMIDAGD----ELLENVIAHELAHQWFGNLVTGARWNDLWLNEGF 299
Query: 756 ASFVEFLCVHHLFPEY---DIWTQFVTDNLVRALELDALKSSHP 796
A+++E L + F + D L + P
Sbjct: 300 ATYLEGLWMEATFGTSAREAYRLEGTRGLRRWRKLQDELPPAPP 343
Score = 65.9 bits (161), Expect = 5e-11
Identities = 42/172 (24%), Positives = 71/172 (41%), Gaps = 15/172 (8%)
Query: 122 AVKAKFAISLSVPSSKVALSNMVRIAILDDEDRYMAVTQFELTDARRCFPCWDEPAVKAK 181
++K + S P + E ++ +Q E AR FPC D P A
Sbjct: 79 SLKVEVEFSGKPPPMGLGWQTA------GAEGAFLF-SQGEAIGARSWFPCQDRPDSVAT 131
Query: 182 FSISLSVPNNK-VALSNMPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDY-VEET 239
+ +++VP+ VA SN K + ++F ++ YL+A+VVG+ +Y V E
Sbjct: 132 YYTTVTVPDKTLVAASNGN-KDDGEVRRK--WVEFSPPIPIAPYLIALVVGDLEYPVREQ 188
Query: 240 SSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAIP 291
+ G+ + +Y L A FY+ YP + D+V +P
Sbjct: 189 TVQGIPLHLYFLTPLAVDTVLRLQSAFL---FYETDLGGPYPYSEYDVVEVP 237
Score = 42.0 bits (99), Expect = 0.002
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 332 PSFQIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 369
P+ + HP + FD + Y KGA ++R L+K +GD
Sbjct: 340 PAPPVMGKHPDDPSVEFDGVFYEKGALVLRYLEKRLGD 377
Score = 41.6 bits (98), Expect = 0.002
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 290 IPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 323
+ HP + FD + Y KGA ++R L+K +GD
Sbjct: 344 VMGKHPDDPSVEFDGVFYEKGALVLRYLEKRLGD 377
Score = 40.1 bits (94), Expect = 0.006
Identities = 28/137 (20%), Positives = 45/137 (32%), Gaps = 10/137 (7%)
Query: 9 RYFKIVSETKELKLHVIDLDFKKVQLELADGKVLTPETKISTEDETITL-TFSETLPVGE 67
EL L + L V + A E E +T+
Sbjct: 21 TVDPTEPNLGELVLDLKALAISSVDVNGAAAAFAKNERP-VVEGSRLTIPGPIALDKGQS 79
Query: 68 VKLLFEYVGELNDKMKGFYRSKYRSVLHDEDRYMAVTQFCSTDARRCFPCWDEPAVKAKF 127
+K+ E+ G+ G+ + + +Q + AR FPC D P A +
Sbjct: 80 LKVEVEFSGKPPPMGLGWQTA-------GAEGAFLFSQGEAIGARSWFPCQDRPDSVATY 132
Query: 128 AISLSVPSSK-VALSNM 143
+++VP VA SN
Sbjct: 133 YTTVTVPDKTLVAASNG 149
Score = 39.7 bits (93), Expect = 0.007
Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 4/79 (5%)
Query: 535 FETSPIMSTYLVAVVVGEFDY-VEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYK 593
F ++ YL+A+VVG+ +Y V E + G+ + +Y L A FY+
Sbjct: 163 FSPPIPIAPYLIALVVGDLEYPVREQTVQGIPLHLYFLTPLAVDTVLRLQSAFL---FYE 219
Query: 594 DYFNIAYPLPKIDLVALNP 612
YP + D+V +
Sbjct: 220 TDLGGPYPYSEYDVVEVPE 238
>gnl|CDD|189010 cd09603, M1_APN_4, Peptidase M1 family Aminopeptidase N. This
family contains mostly bacterial and some archaeal
aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease
belonging to the M1 gluzincin family. APN consists of a
small N-terminal cytoplasmic domain, a single
transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and, in higher eukaryotes,
is present in a variety of human tissues and cell types
(leukocyte, fibroblast, endothelial and epithelial
cells). APN expression is dysregulated in inflammatory
diseases such as chronic pain, rheumatoid arthritis,
multiple sclerosis, systemic sclerosis, systemic lupus
erythematosus, polymyositis/dermatomyosytis and
pulmonary sarcoidosis, and is enhanced in tumor cells
such as melanoma, renal, prostate, pancreas, colon,
gastric and thyroid cancers. It is predominantly
expressed on stem cells and on cells of the granulocytic
and monocytic lineages at distinct stages of
differentiation, thus considered a marker of
differentiation. Thus, APN inhibition may lead to the
development of anti-cancer and anti-inflammatory drugs.
APNs are also present in many pathogenic bacteria and
represent potential drug targets, Some APNs have been
used commercially, such as one from Lactococcus lactis
used in the food industry. APN also serves as a receptor
for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs have also been extensively studied
as putative Cry toxin receptors. Cry1 proteins are
pore-forming toxins that bind to the midgut epithelial
cell membrane of susceptible insect larvae, causing
extensive damage. Several different toxins, including
Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
been shown to bind to APNs; however, a direct role of
APN in cytotoxicity has been yet to be firmly
established.
Length = 415
Score = 80.7 bits (200), Expect = 9e-16
Identities = 41/136 (30%), Positives = 64/136 (47%), Gaps = 2/136 (1%)
Query: 156 MAVTQFELTDARRCFPCWDEPAVKAKFSISLSVPNNKVALSNMPVKSESPQPDGHRLLQF 215
T + A FPC D P+ KA F IS++VP +SN + S + G +
Sbjct: 106 GVWTAGQPEGASTWFPCNDHPSDKATFDISVTVPAGYTVVSNGRLVSRTDL-GGRTTWHW 164
Query: 216 ETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDY 275
M+TYLV + VG ++ +E+T++ GV V Y P G + A ++L F+++
Sbjct: 165 RMDEPMATYLVTLAVGRYEVLEDTTAGGVPVVYYVPPGLAADARRAFARTPEMLDFFEEL 224
Query: 276 FNIAYPLPKIDLVAIP 291
F YP K V +
Sbjct: 225 F-GPYPFEKYGQVVVD 239
Score = 77.7 bits (192), Expect = 9e-15
Identities = 26/67 (38%), Positives = 36/67 (53%), Gaps = 9/67 (13%)
Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
MEN Y L D + ++ HELAHQWFG+ VT W+ +WLNEG+
Sbjct: 246 MENQTRTVYGAGFLDGDRSSER---------LIAHELAHQWFGDSVTCADWSDIWLNEGF 296
Query: 756 ASFVEFL 762
A++ E+L
Sbjct: 297 ATYAEWL 303
Score = 53.0 bits (128), Expect = 5e-07
Identities = 31/133 (23%), Positives = 47/133 (35%), Gaps = 12/133 (9%)
Query: 11 FKIVSETKELKLHVIDLDFKKVQLELADGKVLTPETKISTEDETITLTFSETLPVG-EVK 69
+ + L L + L V + DG+ + + +T+T LP G
Sbjct: 26 ARATQDLSSLNLDLAGLTVSSVTV---DGRPAAF----RHDGDELTITPPRPLPKGQTFT 78
Query: 70 LLFEYVGELNDKMKGFYRSKYRSVLHDEDRYMAVTQFCSTDARRCFPCWDEPAVKAKFAI 129
+ Y G G Y + T A FPC D P+ KA F I
Sbjct: 79 VTVRYSGV---PRPGLYPWGGDGGWEEGPD-GVWTAGQPEGASTWFPCNDHPSDKATFDI 134
Query: 130 SLSVPSSKVALSN 142
S++VP+ +SN
Sbjct: 135 SVTVPAGYTVVSN 147
Score = 49.9 bits (120), Expect = 5e-06
Identities = 24/81 (29%), Positives = 39/81 (48%), Gaps = 1/81 (1%)
Query: 528 DGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASK 587
G + M+TYLV + VG ++ +E+T++ GV V Y P G + A +
Sbjct: 157 GGRTTWHWRMDEPMATYLVTLAVGRYEVLEDTTAGGVPVVYYVPPGLAADARRAFARTPE 216
Query: 588 VLPFYKDYFNIAYPLPKIDLV 608
+L F+++ F YP K V
Sbjct: 217 MLDFFEELF-GPYPFEKYGQV 236
Score = 33.4 bits (77), Expect = 0.67
Identities = 12/41 (29%), Positives = 19/41 (46%)
Query: 329 LLRPSFQIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 369
+ P E+FDD Y KGA ++ L++ +GD
Sbjct: 325 YANEADLGPRPPVPGPAELFDDHVYEKGALVLHALRRLLGD 365
Score = 32.2 bits (74), Expect = 1.6
Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 2/53 (3%)
Query: 271 FYKDYFNIAYPLPKIDLVAIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 323
Y Y + DL P E+FDD Y KGA ++ L++ +GD
Sbjct: 315 QYALYAYARLYANEADLGPRPPVPGPA--ELFDDHVYEKGALVLHALRRLLGD 365
>gnl|CDD|222169 pfam13485, Peptidase_MA_2, Peptidase MA superfamily.
Length = 128
Score = 60.2 bits (146), Expect = 9e-11
Identities = 26/99 (26%), Positives = 33/99 (33%), Gaps = 13/99 (13%)
Query: 698 NWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGYAS 757
G V Y L+V + + V+ HELAH GNLV WL EG A
Sbjct: 1 WTGGVYYPGSRLIVP--RVAPGDPDELRGVLAHELAHVVLGNLVGGNGNLPRWLTEGLAE 58
Query: 758 FVEFLCVHHLFPEYDIWTQFVTDNLVRALELDALKSSHP 796
+V + PE L A+ L
Sbjct: 59 YVA----GRIDPERA-------AELRDAVRSGRLPPLDE 86
>gnl|CDD|189006 cd09599, M1_LTA4H, Peptidase M1 family contains leukotriene A4
hydrolase. This family includes leukotriene A4
hydrolase (LTA4H; E.C. 3.3.2.6) and the close homolog
cold-active aminopeptidase (Colwellia
psychrerythraea-type peptidase; ColAP), both members of
the aminopeptidase M1 family. LTA4H, is a bifunctional
enzyme possessing an aminopeptidase as well as an
epoxide hydrolase activity. The two activities occupy
different, but overlapping sites. The activity and
physiological relevance of the aminopeptidase is as yet
unknown while the epoxide hydrolase converts leukotriene
A4 (LTA4) into leukotriene B4 (LTB4), a potent
chemotaxin that is fundamental to the inflammatory
response of mammals. It accepts a variety of substrates,
including some opioid, di- and tripeptides, as well as
chromogenic aminoacyl-p-nitroanilide derivatives. The
aminopeptidase activity of LTA4H is possibly involved in
the processing of peptides related to inflammation and
host defense. Kinetic analysis shows that LTA4H
hydrolyzes arginyl tripeptides with high efficiency and
specificity, indicating its function as an arginyl
aminopeptidase. LTA4H is overexpressed in certain human
cancers, and has been identified as a functionally
important target for mediating anticancer properties of
resveratrol, a well known red wine polyphenolic compound
with cancer chemopreventive activity.
Length = 442
Score = 64.9 bits (159), Expect = 1e-10
Identities = 20/34 (58%), Positives = 24/34 (70%)
Query: 727 VVGHELAHQWFGNLVTMEWWTHLWLNEGYASFVE 760
VV HE+AH W GNLVT W H WLNEG+ ++E
Sbjct: 285 VVAHEIAHSWSGNLVTNATWEHFWLNEGFTVYLE 318
Score = 49.1 bits (118), Expect = 1e-05
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 5/76 (6%)
Query: 159 TQFELTDARRCFPCWDEPAVKAKFSISLSVPNN-KVALSNMPVKSESPQPDGHRLLQFET 217
+Q + AR FPC D P+VK+ +S +++VP +S +P + +P FE
Sbjct: 129 SQCQAIHARSLFPCQDTPSVKSTYSATVTVPKELTALMSAIPPVEDGEEPRK--TYHFEQ 186
Query: 218 S-PIMSTYLVAVVVGE 232
PI +YL+A+ VG+
Sbjct: 187 PVPI-PSYLIAIAVGD 201
Score = 37.5 bits (88), Expect = 0.042
Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 2/59 (3%)
Query: 104 TQFCSTDARRCFPCWDEPAVKAKFAISLSVPSSKVALSNMVRIAILDDEDRYMAVTQFE 162
+Q + AR FPC D P+VK+ ++ +++VP AL M I ++D + FE
Sbjct: 129 SQCQAIHARSLFPCQDTPSVKSTYSATVTVPKELTAL--MSAIPPVEDGEEPRKTYHFE 185
>gnl|CDD|233856 TIGR02411, leuko_A4_hydro, leukotriene A-4
hydrolase/aminopeptidase. Members of this family
represent a distinctive subset within the zinc
metallopeptidase family M1 (pfam01433). The majority of
the members of pfam01433 are aminopeptidases, but the
sequences in this family for which the function is known
are leukotriene A-4 hydrolase. A dual epoxide hydrolase
and aminopeptidase activity at the same active site is
indicated. The physiological substrate for
aminopeptidase activity is not known.
Length = 602
Score = 59.4 bits (144), Expect = 7e-09
Identities = 22/40 (55%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 721 RQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGYASFVE 760
R N+ V+ HELAH W GNLVT W H WLNEG+ ++E
Sbjct: 277 RSNVD-VIAHELAHSWSGNLVTNCSWEHFWLNEGWTVYLE 315
Score = 40.1 bits (94), Expect = 0.007
Identities = 27/106 (25%), Positives = 44/106 (41%), Gaps = 5/106 (4%)
Query: 129 ISLSVPSSKVALSNMVRIAILDDEDRYMAVTQFELTDARRCFPCWDEPAVKAKFSISLSV 188
IS S AL + + Y+ +Q + AR FPC D P+VK+ ++ +
Sbjct: 99 ISFSTTPKCTALQWLNPEQTSGKKHPYL-FSQCQAIHARSLFPCQDTPSVKSTYTAEVES 157
Query: 189 PNNKVALSNMPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFD 234
P + S + S P + Q PI + YL+A+ G+
Sbjct: 158 PLPVLM-SGIRDGETSNDPGKYLFKQ--KVPIPA-YLIAIASGDLA 199
Score = 31.3 bits (71), Expect = 3.6
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 111 ARRCFPCWDEPAVKAKFAISLSVP 134
AR FPC D P+VK+ + + P
Sbjct: 135 ARSLFPCQDTPSVKSTYTAEVESP 158
>gnl|CDD|233858 TIGR02414, pepN_proteo, aminopeptidase N, Escherichia coli type.
The M1 family of zinc metallopeptidases contains a
number of distinct, well-separated clades of proteins
with aminopeptidase activity. Several are designated
aminopeptidase N, EC 3.4.11.2, after the Escherichia
coli enzyme, suggesting a similar activity profile (see
SP|P04825 for a description of catalytic activity). This
family consists of all aminopeptidases closely related
to E. coli PepN and presumed to have similar (not
identical) function. Nearly all are found in
Proteobacteria, but members are found also in
Cyanobacteria, plants, and apicomplexan parasites. This
family differs greatly in sequence from the family of
aminopeptidases typified by Streptomyces lividans PepN
(TIGR02412), from the membrane bound aminopeptidase N
family in animals, etc [Protein fate, Degradation of
proteins, peptides, and glycopeptides].
Length = 863
Score = 50.0 bits (120), Expect = 7e-06
Identities = 40/143 (27%), Positives = 68/143 (47%), Gaps = 10/143 (6%)
Query: 156 MAVTQFELTDARRC--FPCWDEPAVKAKFSISLSVPNNK--VALSNMPVKSESPQPDGHR 211
TQ E RR FP D P V ++++++++ K V LSN + PDG
Sbjct: 107 NFCTQCEAEGFRRITYFP--DRPDVMSRYTVTITADKKKYPVLLSNGNKIASGELPDGRH 164
Query: 212 LLQFETSPIMSTYLVAVVVGEFDYVEET----SSDGVLVRVYTPVGKREQGQFALHVASK 267
++E +YL A+V G+ D +E+T S V +RVY G +++ A+ K
Sbjct: 165 WAEWEDPFPKPSYLFALVAGDLDVLEDTFTTKSGREVALRVYVEEGNKDKCDHAMESLKK 224
Query: 268 VLPFYKDYFNIAYPLPKIDLVAI 290
+ + ++ F + Y L +VA+
Sbjct: 225 AMKWDEEVFGLEYDLDIFMIVAV 247
Score = 48.9 bits (117), Expect = 1e-05
Identities = 23/63 (36%), Positives = 29/63 (46%)
Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
MEN GL + +L D + + + I V+ HE H W GN VT W L L EG
Sbjct: 255 MENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGL 314
Query: 756 ASF 758
F
Sbjct: 315 TVF 317
Score = 41.2 bits (97), Expect = 0.004
Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 4/72 (5%)
Query: 544 YLVAVVVGEFDYVEET----SSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIA 599
YL A+V G+ D +E+T S V +RVY G +++ A+ K + + ++ F +
Sbjct: 177 YLFALVAGDLDVLEDTFTTKSGREVALRVYVEEGNKDKCDHAMESLKKAMKWDEEVFGLE 236
Query: 600 YPLPKIDLVALN 611
Y L +VA++
Sbjct: 237 YDLDIFMIVAVD 248
Score = 30.0 bits (68), Expect = 9.3
Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 6/44 (13%)
Query: 289 AIPVGHP---SEVEEI---FDDISYNKGASIIRMLQKYIGDGKY 326
A P+ HP EI + Y KGA +IRML +G+ +
Sbjct: 349 AGPMAHPVRPESYVEINNFYTATVYEKGAEVIRMLHTLLGEEGF 392
>gnl|CDD|189007 cd09600, M1_APN_1, Peptidase M1 family containing Aminopeptidase N.
This family contains aminopeptidase N (APN; CD13;
Alanyl aminopeptidase; EC 3.4.11.2), a Type II integral
membrane protease belonging to the M1 gluzincin family.
It includes bacterial-type alanyl aminopeptidases as
well as PfA-M1 aminopeptidase (Plasmodium
falciparum-type). APN consists of a small N-terminal
cytoplasmic domain, a single transmembrane domain and a
large extracellular ectodomain that contains the active
site. It preferentially cleaves neutral amino acids from
the N-terminus of oligopeptides and, in higher
eukaryotes, is present in a variety of human tissues and
cell types (leukocyte, fibroblast, endothelial and
epithelial cells). APN expression is dysregulated in
inflammatory diseases such as chronic pain, rheumatoid
arthritis, multiple sclerosis, systemic sclerosis,
systemic lupus erythematosus,
polymyositis/dermatomyosytis and pulmonary sarcoidosis,
and is enhanced in tumor cells such as melanoma, renal,
prostate, pancreas, colon, gastric and thyroid cancers.
It is predominantly expressed on stem cells and on cells
of the granulocytic and monocytic lineages at distinct
stages of differentiation, thus considered a marker of
differentiation. Thus, APN inhibition may lead to the
development of anti-cancer and anti-inflammatory drugs.
APNs are also present in many pathogenic bacteria and
represent potential drug targets, Some APNs have been
used commercially, such as one from Lactococcus lactis
used in the food industry. APN also serves as a receptor
for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs have also been extensively studied
as putative Cry toxin receptors. Cry1 proteins are
pore-forming toxins that bind to the midgut epithelial
cell membrane of susceptible insect larvae, causing
extensive damage. Several different toxins, including
Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
been shown to bind to APNs; however, a direct role of
APN in cytotoxicity has been yet to be firmly
established.
Length = 861
Score = 49.2 bits (118), Expect = 1e-05
Identities = 23/59 (38%), Positives = 29/59 (49%)
Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEG 754
MEN GL + +L D + + + I V+GHE H W GN VT W L L EG
Sbjct: 255 MENKGLNIFNSKLVLADPETATDADYERIESVIGHEYFHNWTGNRVTCRDWFQLSLKEG 313
Score = 40.3 bits (95), Expect = 0.006
Identities = 37/140 (26%), Positives = 58/140 (41%), Gaps = 6/140 (4%)
Query: 158 VTQFELTDARRCFPCWDEPAVKAKFSISLSVPNNK--VALSNMPVKSESPQPDGHRLLQF 215
TQ E RR D P V +KF++ + K V LSN + E DG +
Sbjct: 109 CTQCEAEGFRRITYYLDRPDVMSKFTVRIEADKTKYPVLLSNGNLVEEGELEDGRHFAVW 168
Query: 216 ETSPIMSTYLVAVVVGEFDYVEET----SSDGVLVRVYTPVGKREQGQFALHVASKVLPF 271
E YL A+V G+ +E+ S V + +Y G + A+ K + +
Sbjct: 169 EDPFPKPCYLFALVAGDLGVLEDKFTTKSGRKVALEIYVEPGDESKCAHAMESLKKSMKW 228
Query: 272 YKDYFNIAYPLPKIDLVAIP 291
+D F + Y L ++VA+
Sbjct: 229 DEDRFGLEYDLDLFNIVAVD 248
Score = 31.8 bits (73), Expect = 2.2
Identities = 27/111 (24%), Positives = 44/111 (39%), Gaps = 14/111 (12%)
Query: 503 LLLSNTEYHHLFYQFGPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEET--- 559
+LLSN G + E DG +E YL A+V G+ +E+
Sbjct: 146 VLLSN----------GNLVEEGELEDGRHFAVWEDPFPKPCYLFALVAGDLGVLEDKFTT 195
Query: 560 -SSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVA 609
S V + +Y G + A+ K + + +D F + Y L ++VA
Sbjct: 196 KSGRKVALEIYVEPGDESKCAHAMESLKKSMKWDEDRFGLEYDLDLFNIVA 246
>gnl|CDD|189001 cd09594, GluZincin, Peptidase Gluzincin family (thermolysin-like
proteinases, TLPs) includes peptidases M1, M2, M3, M4,
M13, M32 and M36 (fungalysins). Gluzincin family
(thermolysin-like peptidases or TLPs) includes several
zinc-dependent metallopeptidases such as the M1, M2, M3,
M4, M13, M32, M36 peptidases (MEROPS classification),
and contain HEXXH and EXXXD motifs as part of their
active site. All peptidases in this family bind a single
catalytic zinc ion which is tetrahedrally co-ordinated
by three amino acid ligands and a water molecule that
forms the nucleophile on activation during catalysis. M1
family includes aminopeptidase N (APN) and leukotriene
A4 hydrolase (LTA4H). APN preferentially cleaves
neutral amino acids from the N-terminus of oligopeptides
and is present in a variety of human tissues and cell
types. LTA4H is a bifunctional enzyme, possessing an
aminopeptidase as well as an epoxide hydrolase activity
such that the two activities occupy different, but
overlapping sites. The peptidase M3 or neurolysin-like
family, includes M3, M2 and M32 metallopeptidases. The
M3 peptidases have two subfamilies: M3A, includes thimet
oligopeptidase (TOP; endopeptidase 3.4.24.15),
neurolysin (3.4.24.16), and the mitochondrial
intermediate peptidase; M3B contains oligopeptidase F.
M2 peptidase angiotensin converting enzyme (ACE, EC
3.4.15.1) catalyzes the conversion of decapeptide
angiotensin I to the potent vasopressor octapeptide
angiotensin II. ACE is a key part of the
renin-angiotensin system that regulates blood pressure,
thus ACE inhibitors are important for the treatment of
hypertension. M32 family includes two eukaryotic enzymes
from protozoa Trypanosoma cruzi, a causative agent of
Chagas' disease, and Leishmania major, a parasite that
causes leishmaniasis, making them attractive targets for
drug development. The M4 family includes secreted
protease thermolysin (EC 3.4.24.27), pseudolysin,
aureolysin, neutral protease as well as fungalysin and
bacillolysin (EC 3.4.24.28) that degrade extracellular
proteins and peptides for bacterial nutrition,
especially prior to sporulation. Thermolysin is widely
used as a nonspecific protease to obtain fragments for
peptide sequencing as well as in production of the
artificial sweetener aspartame. M13 family includes
neprilysin (EC 3.4.24.11) and endothelin-converting
enzyme I (ECE-1, EC 3.4.24.71), which fulfill a broad
range of physiological roles due to the greater
variation in the S2' subsite allowing substrate
specificity and are prime therapeutic targets for
selective inhibition. Peptidase M36 (fungamysin) family
includes endopeptidases from pathogenic fungi.
Fungalysin hydrolyzes extracellular matrix proteins such
as elastin and keratin. Aspergillus fumigatus causes the
pulmonary disease aspergillosis by invading the lungs of
immuno-compromised animals and secreting fungalysin that
possibly breaks down proteinaceous structural barriers.
Length = 125
Score = 42.1 bits (99), Expect = 2e-04
Identities = 16/47 (34%), Positives = 19/47 (40%), Gaps = 5/47 (10%)
Query: 724 IALVVGHELAHQWFGNLVTMEW-----WTHLWLNEGYASFVEFLCVH 765
VVGHEL H LV + LNEG + F E L +
Sbjct: 69 DFGVVGHELTHGVTDQLVGNDPDLLYTNGSGGLNEGPSDFFELLVAY 115
>gnl|CDD|189011 cd09604, M1_APN_5, Peptidase M1 family containing bacterial
Aminopeptidase N. This family contains bacterial
aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease
belonging to the M1 gluzincin family. APN consists of a
small N-terminal cytoplasmic domain, a single
transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and, in higher eukaryotes,
is present in a variety of human tissues and cell types
(leukocyte, fibroblast, endothelial and epithelial
cells). APN expression is dysregulated in inflammatory
diseases such as chronic pain, rheumatoid arthritis,
multiple sclerosis, systemic sclerosis, systemic lupus
erythematosus, polymyositis/dermatomyosytis and
pulmonary sarcoidosis, and is enhanced in tumor cells
such as melanoma, renal, prostate, pancreas, colon,
gastric and thyroid cancers. It is predominantly
expressed on stem cells and on cells of the granulocytic
and monocytic lineages at distinct stages of
differentiation, thus considered a marker of
differentiation. Thus, APN inhibition may lead to the
development of anti-cancer and anti-inflammatory drugs.
APNs are also present in many pathogenic bacteria and
represent potential drug targets, Some APNs have been
used commercially, such as one from Lactococcus lactis
used in the food industry. APN also serves as a receptor
for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs have also been extensively studied
as putative Cry toxin receptors. Cry1 proteins are
pore-forming toxins that bind to the midgut epithelial
cell membrane of susceptible insect larvae, causing
extensive damage. Several different toxins, including
Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
been shown to bind to APNs; however, a direct role of
APN in cytotoxicity has been yet to be firmly
established.
Length = 435
Score = 43.5 bits (103), Expect = 5e-04
Identities = 12/37 (32%), Positives = 21/37 (56%)
Query: 727 VVGHELAHQWFGNLVTMEWWTHLWLNEGYASFVEFLC 763
V+ HE+AHQW+ +V + WL+EG +++
Sbjct: 291 VLAHEIAHQWWYGIVGNDERNEPWLDEGLTTYLTDYY 327
Score = 40.4 bits (95), Expect = 0.004
Identities = 29/117 (24%), Positives = 50/117 (42%), Gaps = 18/117 (15%)
Query: 182 FSISLSVPNNKVALSNMPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDY------ 235
+ ++L+VP + ++ ++ E+ + G + +FE A V +F
Sbjct: 159 YDVTLTVPAGYIVVATGRLQEETTEDGGTKTWEFE----------AENVRDFALAASPKF 208
Query: 236 -VEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAIP 291
V+E + DG+ VR Y E + L A K L FY + F YP + +V P
Sbjct: 209 IVDEATVDGIKVRAYYFPEDAELAKRYLDAAKKALEFYSELF-GPYPYKEFSVVENP 264
Score = 39.3 bits (92), Expect = 0.010
Identities = 24/77 (31%), Positives = 33/77 (42%), Gaps = 5/77 (6%)
Query: 547 AVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKID 606
A+ V+E + DG+ VR Y E + L A K L FY + F YP +
Sbjct: 201 ALAASPKFIVDEATVDGIKVRAYYFPEDAELAKRYLDAAKKALEFYSELF-GPYPYKEFS 259
Query: 607 LVALNPGTVGYYRVKYP 623
+V NP G +YP
Sbjct: 260 VVE-NPFPGGM---EYP 272
>gnl|CDD|189018 cd09839, M1_TAF2, TATA binding protein (TBP) associated factor 2.
This family includes TATA binding protein (TBP)
associated factor 2 (TAF2, TBP-associated factor
TAFII150, transcription initiation factor TFIID subunit
2, RNA polymerase II TBP-associated factor subunit B),
and has homology to the aminopeptidase N (APN)
subfamily, belonging to the M1 gluzincin family. TAF2 is
part of the TFIID multidomain subunit complex essential
for transcription of most protein-encoded genes by RNA
polymerase II. TAF2 is known to interact with the
initiator element (Inr) found at the transcription start
site of many genes, thus possibly playing a key role in
promoter binding as well as start-site selection. Image
analysis has shown TAF2 to form a complex with TAF1 and
TBP, inferring its role in promoter recognition.
Peptidases in the M1 family bind a single catalytic zinc
ion which is tetrahedrally co-ordinated by three amino
acid ligands and a water molecule that forms the
nucleophile on activation during catalysis. TAF2,
however, does not seem to contain any of the active site
residues.
Length = 507
Score = 40.8 bits (96), Expect = 0.004
Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 2/56 (3%)
Query: 732 LAHQWFGNLVTMEWWTHLWLNEGYASFVEFLCVHHLF--PEYDIWTQFVTDNLVRA 785
LA QWFG ++ E W+ WL +G A ++ L V LF EY + D +
Sbjct: 322 LASQWFGCFISPEAWSDEWLLKGIAGYITGLFVKKLFGNNEYRFRIKKELDRVCEY 377
Score = 32.3 bits (74), Expect = 1.7
Identities = 31/162 (19%), Positives = 65/162 (40%), Gaps = 24/162 (14%)
Query: 166 ARRCFPCWDEPAVKAKFSISLSVPNNKVALSN--MPVKSESPQPDGHRLLQFE-TSPIMS 222
AR FPC D+P+ + + +VP N VA+S+ + + + + + T P +
Sbjct: 157 ARCWFPCVDDPSQLCTWELEFTVPANMVAVSSGDLLEQVYDTEDMRKKTYHYALTVPTAA 216
Query: 223 TYLVAVVVGEFDYVEETSSDGV----LVRVYTPVGKREQGQFALHVASKVLPFYKDYFNI 278
+ + VG F+ + + + + L + + + LH + F+++Y +
Sbjct: 217 QN-IGLAVGPFEILVDPHAHEITHFCLPGLLPEL---KNTTSYLH---EAFEFFEEYLSC 269
Query: 279 AYPLPKIDLVAIPVGHPSEVEEIFDDISYNKGASII--RMLQ 318
+P V V+E +D++ SI +L
Sbjct: 270 RFPFSSYKQVF--------VDEAAEDVTSYASLSIFSSNLLY 303
>gnl|CDD|70841 pfam07390, P30, Mycoplasma P30 protein. This family consists of
several P30 proteins which seem to be specific to
Mycoplasma agalactiae. P30 is a 30-kDa immunodominant
antigen and is known to be a transmembrane protein.
Length = 266
Score = 32.5 bits (73), Expect = 0.99
Identities = 21/83 (25%), Positives = 46/83 (55%), Gaps = 6/83 (7%)
Query: 908 TTENFASESHAQE---VTEFFTKNPTSWI---ERTVQQSVETIRLNSECLKRDGEALYQN 961
T +F +E++A+E + + +++NP + I E+ +++S E + + + EA+Y++
Sbjct: 75 TQNHFGAETNAKESPALNDLYSENPATPIFKNEKGIKESSEGSKNEGDKVIAGKEAIYKD 134
Query: 962 SDMQEEKDRISRSFSALKDPELL 984
D K +I+ LKD +L+
Sbjct: 135 IDFDISKVKITIDKKDLKDEDLI 157
>gnl|CDD|204379 pfam10023, DUF2265, Predicted aminopeptidase (DUF2265). Members of
this family of bacterial proteins comprise various
hypothetical proteins and putative aminopeptidases.
Their exact function, has not, as yet, been defined.
Length = 338
Score = 32.6 bits (75), Expect = 1.2
Identities = 31/127 (24%), Positives = 42/127 (33%), Gaps = 37/127 (29%)
Query: 730 HELAHQWF---GNLVTMEWWTHLWLNEGYASFVEFLCVHHLFPEYDIWTQFVTDNLVRAL 786
HELAHQ G+ T NE +A+ VE V W D RA
Sbjct: 172 HELAHQVVYIKGD--TA-------FNESFATAVEREGVRR-------WLAARGDPAARA- 214
Query: 787 ELDALKSSHPTQFVTDNLVRALELDALKSSHPIELYQN----SDMQEEK----DRISRSF 838
A QF L+ A D L+ LY + + K ++ +
Sbjct: 215 -RYAAFDERRQQFRA--LLLA-TRDRLE-----ALYASPLADERKRAAKAAIFQQLRADY 265
Query: 839 SALKDPE 845
+ALK
Sbjct: 266 AALKARW 272
>gnl|CDD|227705 COG5418, COG5418, Predicted secreted protein [Function unknown].
Length = 164
Score = 31.4 bits (71), Expect = 1.3
Identities = 20/83 (24%), Positives = 35/83 (42%), Gaps = 8/83 (9%)
Query: 247 RVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAIPVGHPSEVEEIFDDIS 306
VY ++++ A V + KD+ I P P+ + + P P +E+FD
Sbjct: 18 YVYRGP-EKKEYDTAKEVRKALPSNPKDWNIIQLPCPEFEYLGWP--RPPMTKEVFDHPG 74
Query: 307 YNK-----GASIIRMLQKYIGDG 324
Y + I R+L++ DG
Sbjct: 75 YRRVCRKIADPIGRVLEEEKPDG 97
>gnl|CDD|237585 PRK14015, pepN, aminopeptidase N; Provisional.
Length = 875
Score = 32.4 bits (75), Expect = 1.6
Identities = 9/20 (45%), Positives = 13/20 (65%)
Query: 307 YNKGASIIRMLQKYIGDGKY 326
Y KGA +IRML +G+ +
Sbjct: 386 YEKGAEVIRMLHTLLGEEGF 405
Score = 32.0 bits (74), Expect = 2.2
Identities = 9/17 (52%), Positives = 12/17 (70%)
Query: 353 YNKGASIIRMLQKYIGD 369
Y KGA +IRML +G+
Sbjct: 386 YEKGAEVIRMLHTLLGE 402
Score = 31.6 bits (73), Expect = 3.1
Identities = 22/59 (37%), Positives = 28/59 (47%)
Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEG 754
MEN GL + +L D + + + I V+ HE H W GN VT W L L EG
Sbjct: 268 MENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEG 326
>gnl|CDD|240973 cd12529, RRM2_MEI2_like, RNA recognition motif 2 in plant Mei2-like
proteins. This subgroup corresponds to the RRM2 of
Mei2-like proteins that represent an ancient eukaryotic
RNA-binding proteins family. Their corresponding
Mei2-like genes appear to have arisen early in eukaryote
evolution, been lost from some lineages such as
Saccharomyces cerevisiae and metazoans, and diversified
in the plant lineage. The plant Mei2-like genes may
function in cell fate specification during development,
rather than as stimulators of meiosis. Members in this
family contain three RNA recognition motifs (RRMs), also
termed RBDs (RNA binding domains) or RNPs
(ribonucleoprotein domains). The C-terminal RRM (RRM3)
is unique to Mei2-like proteins and is highly conserved
between plants and fungi. To date, the intracellular
localization, RNA target(s), cellular interactions and
phosphorylation states of Mei2-like proteins in plants
remain unclear. .
Length = 71
Score = 29.4 bits (66), Expect = 1.8
Identities = 11/37 (29%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 501 IDLLLSNTEYHHLFYQFGPVKS--ESPQPDGHRLLQF 535
+D +SN + H +F +G +K E+P H+ ++F
Sbjct: 9 LDPSVSNDDLHQIFGAYGEIKEIRETPNKRHHKFIEF 45
>gnl|CDD|206213 pfam14043, WVELL, WVELL protein. This family includes the B.
subtilis YfjH protein, which is functionally
uncharacterized. This is not a homologue of E. coli
YfjH, a synonym for IscX, which belongs to pfam04384.
This family of proteins is found in bacteria. Proteins
in this family are approximately 90 amino acids in
length and contain a highly conserved WVELL motif.
Length = 75
Score = 28.0 bits (63), Expect = 4.8
Identities = 22/64 (34%), Positives = 28/64 (43%), Gaps = 10/64 (15%)
Query: 869 ISAAQTKTGRELAWEFLKNNYATFTERYKGGLLGRLVKHTTENFAS---ESHAQEVTEFF 925
+S AQ +T EL WE + YA YKG + TE E + + EF
Sbjct: 18 LSYAQARTWVELLWEDFETTYAKAGRPYKG-------EEMTERIVRQWIEQYGSRLHEFQ 70
Query: 926 TKNP 929
TKNP
Sbjct: 71 TKNP 74
>gnl|CDD|235046 PRK02507, PRK02507, proton extrusion protein PcxA; Provisional.
Length = 422
Score = 29.9 bits (68), Expect = 7.7
Identities = 8/34 (23%), Positives = 19/34 (55%)
Query: 797 TQFVTDNLVRALELDALKSSHPIELYQNSDMQEE 830
TQ ++ + + +D + P E++ N +++EE
Sbjct: 215 TQQLSKTFIVSPLVDHFFPALPFEIFINPELEEE 248
>gnl|CDD|232987 TIGR00467, lysS_arch, lysyl-tRNA synthetase, archaeal and
spirochete. This model represents the lysyl-tRNA
synthetases that are class I amino-acyl tRNA
synthetases. It includes archaeal and spirochete
examples of the enzyme. All other known examples are
class IIc amino-acyl tRNA synthetases and seem to form a
separate orthologous set [Protein synthesis, tRNA
aminoacylation].
Length = 515
Score = 29.9 bits (67), Expect = 8.2
Identities = 20/97 (20%), Positives = 37/97 (38%), Gaps = 5/97 (5%)
Query: 273 KDYFNIAYPLPKIDLVAIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGDGKYPILLRP 332
K F+I++ L I L D K + R+ + + R
Sbjct: 305 KPEFHISFDLDVIKLYED-YDKFERFYYGVKDKDEEKKRAFKRIYEL----SQPMPSERI 359
Query: 333 SFQIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 369
+Q+P H S + +IF++ K I++ +Q + D
Sbjct: 360 PYQVPFRHLSVISQIFENNDIEKILEILKRVQYTVDD 396
>gnl|CDD|218033 pfam04339, DUF482, Protein of unknown function, DUF482. This
family contains several proteins of uncharacterized
function.
Length = 370
Score = 29.8 bits (68), Expect = 9.3
Identities = 11/36 (30%), Positives = 13/36 (36%), Gaps = 14/36 (38%)
Query: 741 VTMEWWTHL--------------WLNEGYASFVEFL 762
V + L W N GYASF +FL
Sbjct: 142 VDPDEAPLLEEAGFLQRLGQQFHWFNRGYASFDDFL 177
>gnl|CDD|237495 PRK13765, PRK13765, ATP-dependent protease Lon; Provisional.
Length = 637
Score = 30.0 bits (68), Expect = 9.6
Identities = 31/122 (25%), Positives = 48/122 (39%), Gaps = 41/122 (33%)
Query: 796 PTQFVTDNLVRALELDALKSSHP----------IELYQNSDMQEEKDRISRSFSALKDPE 845
P F+ +V A LDAL++ HP E+Y M++ PE
Sbjct: 275 PCDFI---MVAAGNLDALENMHPALRSRIKGYGYEVYMRDTMED-------------TPE 318
Query: 846 LLRKVLDFSMSDLVR-------AQDSVFVIISAAQTKTGRELAWEFLKNNYATFTERYKG 898
RK++ F ++ R +D+V II A+ + GR+ + T R G
Sbjct: 319 NRRKLVRFVAQEVKRDGKIPHFDRDAVEEIIREAKRRAGRK--------GHLTLKLRDLG 370
Query: 899 GL 900
GL
Sbjct: 371 GL 372
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.134 0.396
Gapped
Lambda K H
0.267 0.0731 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 56,227,010
Number of extensions: 5606179
Number of successful extensions: 4734
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4695
Number of HSP's successfully gapped: 85
Length of query: 1102
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 995
Effective length of database: 6,191,724
Effective search space: 6160765380
Effective search space used: 6160765380
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.4 bits)