RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1463
(947 letters)
>gnl|CDD|189008 cd09601, M1_APN_2, Peptidase M1 Aminopeptidase N family incudes
tricorn interacting factor F3, Endoplasmic reticulum
aminopeptidase 1 (ERAP1), Aminopeptidase Q (APQ). This
M1 peptidase family includes eukaryotic and bacterial
members: aminopeptidase N (APN), aminopeptidase Q (APQ,
laeverin), endoplasmic reticulum aminopeptidase 1
(ERAP1) as well as tricorn interacting factor F3.
Aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease,
consists of a small N-terminal cytoplasmic domain, a
single transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and is present in a variety
of human tissues and cell types (leukocyte, fibroblast,
endothelial and epithelial cells). APN expression is
dysregulated in inflammatory diseases such as chronic
pain, rheumatoid arthritis, multiple sclerosis, systemic
sclerosis, systemic lupus erythematosus,
polymyositis/dermatomyosytis and pulmonary sarcoidosis,
and is enhanced in tumor cells such as melanoma, renal,
prostate, pancreas, colon, gastric and thyroid cancers.
It is considered a marker of differentiation since it is
predominantly expressed on stem cells and on cells of
the granulocytic and monocytic lineages at distinct
stages of differentiation. Thus, APN inhibition may lead
to the development of anti-cancer and anti-inflammatory
drugs. ERAP1 also known as endoplasmic reticulum
aminopeptidase associated with antigen processing
(ERAAP), adipocyte derived leucine aminopeptidase
(A-LAP) or aminopeptidase regulating tumor necrosis
factor receptor I (THFRI) shedding (ARTS-1), associates
with the closely related ER aminopeptidase ERAP2, for
the final trimming of peptides within the ER for
presentation by MHC class I molecules. ERAP1 is
associated with ankylosing spondylitis (AS), an
inflammatory arthritis that predominantly affects the
spine. ERAP1 also aids in the shedding of membrane-bound
cytokine receptors. The tricorn interacting factor F3,
together with factors F1 and F2, degrades the tricorn
protease products, producing free amino acids, thus
completing the proteasomal degradation pathway. F3 is
homologous to F2, but not F1, and shows a strong
preference for glutamate in the P1' position. APQ, also
known as laeverin, is specifically expressed in human
embryo-derived extravillous trophoblasts (EVTs) that
invade the uterus during early placentation. It cleaves
the N-terminal amino acid of various peptides such as
angiotensin III, endokinin C, and kisspeptin-10, all
expressed in the placenta in large quantities. APN is a
receptor for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs are also putative Cry toxin
receptors. Cry1 proteins are pore-forming toxins that
bind to the midgut epithelial cell membrane of
susceptible insect larvae, causing extensive damage.
Several different toxins, including Cry1Aa, Cry1Ab,
Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to
bind to APNs; however, a direct role of APN in
cytotoxicity has been yet to be firmly established.
Length = 446
Score = 273 bits (700), Expect = 2e-82
Identities = 105/227 (46%), Positives = 137/227 (60%), Gaps = 48/227 (21%)
Query: 420 ENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFGVD 479
E ++LYD + S+ +K+R+AT+VAHELAHQWFGNLVT+ WW+DLWLNEGFA+Y+EY GVD
Sbjct: 268 ETALLYDPKTSSASNKQRVATVVAHELAHQWFGNLVTMKWWDDLWLNEGFATYMEYLGVD 327
Query: 480 SVEHTWKIKDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLLRM 539
+E W + D FV+D+LQ+ LD+L SSHP+ V V PDEI+EIFD ISYSKG+S+LRM
Sbjct: 328 HLEPEWNMWDQFVLDDLQSALALDSLASSHPISVPVETPDEISEIFDAISYSKGASVLRM 387
Query: 540 AEHFLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLTNAGHEMRTLPEKMDVLKLGLQKYI 599
EHFL EV + GL+ Y+KK A G++T +LW L+ A
Sbjct: 388 LEHFLGEEVFRKGLRNYLKKHAYGNATTDDLWEALSEA---------------------- 425
Query: 600 KKKAMGSSTQAELWAFLTNAGHEMRTLPENMDVETIMNTWTLQTGFP 646
DV+ IM+TWTLQ G+P
Sbjct: 426 --------------------------SKLGKDVKEIMDTWTLQPGYP 446
Score = 190 bits (484), Expect = 2e-52
Identities = 69/171 (40%), Positives = 96/171 (56%), Gaps = 13/171 (7%)
Query: 243 YKIKILPFLIENNFTFLGEVWIHVEVSQTTNNITLHMNDLTILERSIKQVDNRSANWESD 302
Y + + P L +NFTF G V I +EV++ TN I LH DLTI ++ +
Sbjct: 3 YDLTLTPDL--DNFTFSGSVTITLEVTEPTNEIVLHAKDLTITSATVTVGGGNVDILVT- 59
Query: 303 EGTSLTIGQVRNDTINQFMVFELEDEQFWATKRYVLYIKYVGKLNDQMRGLYRSSYEVNN 362
+ N+F+V L++ A Y L I++ G+LND +RG YRSSY N
Sbjct: 60 ---------ITYSEENEFLVITLDEPLTLAGGNYTLTIEFTGELNDDLRGFYRSSYTDNG 110
Query: 363 -TKRWIMASQFQATDARRAFPCFDEPSLKAKFAISIGRLPNMTAISNMPLK 412
R++ A+QF+ TDARRAFPCFDEP+ KA F I+I TA+SNMP++
Sbjct: 111 GETRYLAATQFEPTDARRAFPCFDEPAFKATFTITITHPAGYTALSNMPVE 161
Score = 153 bits (390), Expect = 5e-40
Identities = 52/118 (44%), Positives = 73/118 (61%), Gaps = 4/118 (3%)
Query: 816 ISN-P--SKIERTGPYVWDHFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSREEY 872
+SN P S+ + F+ T MSTYLVA V DF + + RV++R
Sbjct: 155 LSNMPVESEEVLGDGWKTTEFETTPPMSTYLVAFVVGDFDYVEGTTKNGVPVRVYARPGK 214
Query: 873 INQTAYSLDIGPRLLKYFEKYFDYHYPLEKTDMIALPDFGAGAMENFGLITFR-TAIL 929
I Q Y+L++ P++L++FE YF YPL K D++A+PDF AGAMEN+GLIT+R TA+L
Sbjct: 215 IEQGDYALEVAPKILEFFEDYFGIPYPLPKLDLVAIPDFAAGAMENWGLITYRETALL 272
>gnl|CDD|216501 pfam01433, Peptidase_M1, Peptidase family M1. Members of this
family are aminopeptidases. The members differ widely in
specificity, hydrolysing acidic, basic or neutral
N-terminal residues. This family includes leukotriene-A4
hydrolase, this enzyme also has an aminopeptidase
activity.
Length = 390
Score = 185 bits (471), Expect = 3e-51
Identities = 65/115 (56%), Positives = 88/115 (76%)
Query: 419 PENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFGV 478
E ++LYD ST+ K+R+A ++AHELAHQWFGNLVT+ WW+DLWLNEGFA+Y+EY
Sbjct: 276 REPALLYDPGNSTSSDKQRVAEVIAHELAHQWFGNLVTMDWWDDLWLNEGFATYMEYLIT 335
Query: 479 DSVEHTWKIKDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKG 533
D++E W+I++ FV+ E+Q+ LD+L SSHP+ V V+ P EI +IFD ISY KG
Sbjct: 336 DALEPEWRIEEQFVLREVQSALALDSLDSSHPITVNVNTPSEIDDIFDAISYEKG 390
Score = 156 bits (397), Expect = 2e-41
Identities = 76/193 (39%), Positives = 98/193 (50%), Gaps = 34/193 (17%)
Query: 233 RLPTGVKPLAYKIKILPFLIENNFTFLGEVWIHVEVSQTTNNITLHMNDLTILERSIKQV 292
RLPT V P+ Y + + P TF G V I ++ TN I LH DL I
Sbjct: 1 RLPTTVVPIHYDLTLTPDF--EKPTFSGSVTITLQAKAATNEIVLHSKDLEI-------- 50
Query: 293 DNRSANWESDEGTSLTIGQVRNDTINQFMVFELEDEQFW----------ATKRYVLYIKY 342
TS+T+ N VF+L+DE + A + Y L I+Y
Sbjct: 51 ------------TSVTLRGEPVSVNNLISVFQLDDEDEFLVINLASTLQAGQPYTLEIEY 98
Query: 343 VGKLNDQMRGLYRSSY--EVNNTKRWIMASQFQATDARRAFPCFDEPSLKAKFAISIGRL 400
GKLND MRG YRS Y + N K+ + +QF+ TDARRAFPCFDEPS+KA F I+I
Sbjct: 99 EGKLNDDMRGFYRSQYLDQTNGEKKPMATTQFEPTDARRAFPCFDEPSVKATFDITINHP 158
Query: 401 PNMTAISNMPLKD 413
+ TA+SNMP +
Sbjct: 159 ADYTALSNMPEIE 171
Score = 130 bits (329), Expect = 2e-32
Identities = 51/117 (43%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 816 ISNPSKIERTGP---YVWDHFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSREEY 872
+SN +IE V F+ T MSTYL+A AV D + RV++R
Sbjct: 164 LSNMPEIESEPLDDGRVITEFETTPKMSTYLLAFAVGDLEYLETKTKSGVPVRVYARPGA 223
Query: 873 INQTAYSLDIGPRLLKYFEKYFDYHYPLEKTDMIALPDFGAGAMENFGLITFRTAIL 929
IN Y+L++ +LL++FE YF + YPL K D +ALPDF AGAMEN+GLIT+R L
Sbjct: 224 INAGQYALEVTQKLLEFFEDYFGFPYPLPKLDQVALPDFSAGAMENWGLITYREPAL 280
>gnl|CDD|223385 COG0308, PepN, Aminopeptidase N [Amino acid transport and
metabolism].
Length = 859
Score = 168 bits (427), Expect = 7e-43
Identities = 79/239 (33%), Positives = 110/239 (46%), Gaps = 51/239 (21%)
Query: 419 PENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFGV 478
E +L D + +T+ E + ++AHELAHQWFGNLVT+ WW+DLWLNEGFA++ E
Sbjct: 288 REKYLLADPETATDSDYENVEEVIAHELAHQWFGNLVTMKWWDDLWLNEGFATFREVLWS 347
Query: 479 DSV-EHTWKIKDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLL 537
+ + WK + F D+L SSHP+ V+V P EI + FD I Y KG+S+L
Sbjct: 348 EDLGGRAWKRWEDFRTLRTSIALAEDSLPSSHPIRVDVYDPKEINDFFDAIVYEKGASVL 407
Query: 538 RMAEHFLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLTNAGHEMRTLPEKMDVLKLGLQK 597
RM E L E + GL Y K+ A G++T +LW L
Sbjct: 408 RMLETLLGEEAFRKGLSLYFKRHAGGNATTMDLWKAL----------------------- 444
Query: 598 YIKKKAMGSSTQAELWAFLTNAGHEMRTLPENMDVETIMNTWTLQTGFPVIRVARDYDA 656
+ A G D+ +W Q G+PV+ V+ YD
Sbjct: 445 ---EDASGK------------------------DLSAFFESWLSQAGYPVLTVSVRYDD 476
Score = 89.1 bits (221), Expect = 6e-18
Identities = 40/117 (34%), Positives = 60/117 (51%), Gaps = 7/117 (5%)
Query: 816 ISNPSKI---ERTGPYVWDHFQETVFMSTYLVAMAVTDFS-HRVHNDDHSGSF--RVWSR 869
ISN + I F++T M TYL A+ D R D S ++
Sbjct: 174 ISNGNLIDGGTLVDGRKIVKFEDTPPMPTYLFALVAGDLEVFRDKFDTRSRDVPLEIYVP 233
Query: 870 EEYINQTAYSLDIGPRLLKYFEKYFDYHYPLEKTDMIALPDFGAGAMENFGLITFRT 926
+++ Y+LD R ++++E+YF Y L D++A+PDF AGAMEN+GL+TFR
Sbjct: 234 PGVLDRAKYALDETKRSIEFYEEYFGLPYALPI-DIVAVPDFSAGAMENWGLVTFRE 289
Score = 82.5 bits (204), Expect = 5e-16
Identities = 50/184 (27%), Positives = 73/184 (39%), Gaps = 24/184 (13%)
Query: 238 VKPLAYKIKILPFLIENNFTFLGEVWIHVEVSQTTNN--ITLHMNDLTILERSIKQVDNR 295
+ + + + P TF G V I ++ + + L L I RS+K
Sbjct: 24 IYDIDLDLDLDP----EKTTFEGSVTIRLDAGWRSGADPLVLDAVGLEI--RSVKIDGKA 77
Query: 296 SANWESDEGTSLTIGQVRNDTINQFMVFELEDEQFWATKRYVLYIKYVGKL-NDQMRGLY 354
W +G +LTI I E + F + ++ G + ND + GLY
Sbjct: 78 LTAWYRLDGDALTI--TVAPPI-----PERSERPF----TLAITYEFTGPVSNDTLEGLY 126
Query: 355 RSSYEVNNTKRWIMASQFQATDARRAFPCFDEPSLKAKFAISIGRLPNMTAISNMPLKDG 414
RS Y + +Q +A ARR FPC DEP +KA F ++I ISN L DG
Sbjct: 127 RSGYGGKP----YLITQCEAEGARRIFPCIDEPDVKATFTLTIRADKGPKLISNGNLIDG 182
Query: 415 NQSD 418
Sbjct: 183 GTLV 186
>gnl|CDD|189009 cd09602, M1_APN_3, Peptidase M1 family containing Aminopeptidase N.
This family contains bacterial and eukaryotic
aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease
belonging to the M1 gluzincin family. APN consists of a
small N-terminal cytoplasmic domain, a single
transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and, in higher eukaryotes,
is present in a variety of human tissues and cell types
(leukocyte, fibroblast, endothelial and epithelial
cells). APN expression is dysregulated in inflammatory
diseases such as chronic pain, rheumatoid arthritis,
multiple sclerosis, systemic sclerosis, systemic lupus
erythematosus, polymyositis/dermatomyosytis and
pulmonary sarcoidosis, and is enhanced in tumor cells
such as melanoma, renal, prostate, pancreas, colon,
gastric and thyroid cancers. It is predominantly
expressed on stem cells and on cells of the granulocytic
and monocytic lineages at distinct stages of
differentiation, thus considered a marker of
differentiation. Thus, APN inhibition may lead to the
development of anti-cancer and anti-inflammatory drugs.
APNs are also present in many pathogenic bacteria and
represent potential drug targets, Some APNs have been
used commercially, such as one from Lactococcus lactis
used in the food industry. APN also serves as a receptor
for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs have also been extensively studied
as putative Cry toxin receptors. Cry1 proteins are
pore-forming toxins that bind to the midgut epithelial
cell membrane of susceptible insect larvae, causing
extensive damage. Several different toxins, including
Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
been shown to bind to APNs; however, a direct role of
APN in cytotoxicity has been yet to be firmly
established.
Length = 438
Score = 136 bits (346), Expect = 2e-34
Identities = 58/162 (35%), Positives = 87/162 (53%), Gaps = 8/162 (4%)
Query: 420 ENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFGVD 479
EN + +E + R A +AHE+AH WFG+LVT+ WWNDLWLNE FA Y+ Y +
Sbjct: 266 ENYVFREEVTTAQR--LRRANTIAHEMAHMWFGDLVTMKWWNDLWLNESFAEYMAYKALS 323
Query: 480 SV---EHTWKIKDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSL 536
W F +D Q + D ++HP+ +V + ++ FD I+Y+KG+S+
Sbjct: 324 EATPFTDAWL---TFFLDRKQWAYRADQTPTTHPIAGDVENTEDALNNFDGITYAKGASV 380
Query: 537 LRMAEHFLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLTNAG 578
L+ L E + GL++Y KK A G++T A+ L A
Sbjct: 381 LKQLVALLGEEKFREGLREYFKKHAFGNATLADFLGALDEAS 422
Score = 81.9 bits (203), Expect = 3e-16
Identities = 43/131 (32%), Positives = 58/131 (44%), Gaps = 27/131 (20%)
Query: 813 QNW--ISN----PSKIERTGPYVWDHFQETVFMSTYLVAMAVTDFSHRVHNDDHSG-SFR 865
++W ISN R G W F ET +STYL A + H + + +H G
Sbjct: 144 KDWTVISNTAATEQSTIRNGYVRW-EFPETPPLSTYLFAFVAGPY-HVISDKEHDGIPLG 201
Query: 866 VWSRE-----------EYINQTAYSLDIGPRLLKYFEKYFDYHYPLEKTDMIALPDFGAG 914
++ RE E T LD +FE+YF YP K D + +P+F G
Sbjct: 202 LYCRESLAQALDRDADEIFEITKQGLD-------FFEEYFGIPYPFGKYDQVFVPEFNFG 254
Query: 915 AMENFGLITFR 925
AMEN G +TFR
Sbjct: 255 AMENPGCVTFR 265
Score = 54.9 bits (133), Expect = 1e-07
Identities = 35/183 (19%), Positives = 73/183 (39%), Gaps = 24/183 (13%)
Query: 238 VKPLAYKIKILPFLIENNFTFLGEVWIHVEVSQTTNNITLHMNDLTILERSIKQVDNRSA 297
+ ++Y++ + L + TF G I ++ + ++ + +L S ++
Sbjct: 5 ISVVSYELDLD--LTKGKETFRGTTTITFDLRKNGGDLFIDFKGGQVLSVS---LNGNPV 59
Query: 298 NWESDEGTSLTIGQVRNDTINQFMVFELEDEQFWATKRYVLYIKYVGKLNDQMRGLYRSS 357
+ ++ E + + R + I + ++ GL+R
Sbjct: 60 DPDTFEDGRIPL-----------------PGLELKNGRNEVVIDFTNPYSNDGEGLHRFV 102
Query: 358 YEVNNTKRWIMASQFQATDARRAFPCFDEPSLKAKFAISIGRLPNMTAISNMPLKDGNQS 417
+ K ++ +QF+ DARR FPCFD+P LKA F +++ + T ISN + +
Sbjct: 103 DPADG-KTYLY-TQFEPDDARRVFPCFDQPDLKAPFTLTVTAPKDWTVISNTAATEQSTI 160
Query: 418 DPE 420
Sbjct: 161 RNG 163
>gnl|CDD|233857 TIGR02412, pepN_strep_liv, aminopeptidase N, Streptomyces lividans
type. This family is a subset of the members of the
zinc metallopeptidase family M1 (pfam01433), with a
single member characterized in Streptomyces lividans 66
and designated aminopeptidase N. The spectrum of
activity may differ somewhat from the aminopeptidase N
clade of E. coli and most other Proteobacteria, well
separated phylogenetically within the M1 family. The M1
family also includes leukotriene A-4
hydrolase/aminopeptidase (with a bifunctional active
site).
Length = 831
Score = 126 bits (318), Expect = 1e-29
Identities = 52/143 (36%), Positives = 80/143 (55%), Gaps = 6/143 (4%)
Query: 431 TNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFG-VDSVEHT--WKI 487
T KE A ++ HE+AH WFG+LVT+ WWNDLWLNE FA Y+ ++ E+T W
Sbjct: 280 TRAEKENRAGVILHEMAHMWFGDLVTMRWWNDLWLNESFAEYMGTLASAEATEYTDAWT- 338
Query: 488 KDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLLRMAEHFLTTE 547
F Q + D L ++HP+ +V+ + FD I+Y+KG+S+L+ ++ E
Sbjct: 339 --TFAAQGKQWAYEADQLPTTHPIVADVADLADALSNFDGITYAKGASVLKQLVAWVGEE 396
Query: 548 VLKLGLQKYIKKKAMGSSTQAEL 570
G+ Y K+ A G++T +L
Sbjct: 397 AFFAGVNAYFKRHAFGNATLDDL 419
Score = 84.8 bits (210), Expect = 1e-16
Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 6/106 (5%)
Query: 823 ERTGPYVWDHFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSRE---EYINQTAYS 879
W+ F ET +STYL A+A + H V ++ S +++R +Y++ A
Sbjct: 168 PEPADRRWE-FPETPKLSTYLTAVAAGPY-HSVQDESRSYPLGIYARRSLAQYLDADAI- 224
Query: 880 LDIGPRLLKYFEKYFDYHYPLEKTDMIALPDFGAGAMENFGLITFR 925
I + L +F + F Y YP +K D I +P+F AGAMEN G +TF
Sbjct: 225 FTITRQGLAFFHRKFGYPYPFKKYDQIFVPEFNAGAMENAGCVTFA 270
Score = 45.6 bits (108), Expect = 1e-04
Identities = 19/39 (48%), Positives = 24/39 (61%)
Query: 370 SQFQATDARRAFPCFDEPSLKAKFAISIGRLPNMTAISN 408
+QF+ DARR F FD+P LKA F S+ + T ISN
Sbjct: 122 TQFEPADARRVFAVFDQPDLKANFKFSVKAPEDWTVISN 160
>gnl|CDD|189002 cd09595, M1, Peptidase M1 family contains aminopeptidase N and
leukotriene A4 hydrolase. M1 Peptidase family includes
aminopeptidase N (APN) and leukotriene A4 hydrolase
(LTA4H). All peptidases in this family bind a single
catalytic zinc ion which is tetrahedrally co-ordinated
by three amino acid ligands and a water molecule that
forms the nucleophile on activation during catalysis.
APN consists of a small N-terminal cytoplasmic domain, a
single transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and is present in a variety
of human tissues and cell types. APN expression is
dysregulated in many inflammatory diseases and is
enhanced in numerous tumor cells, making it a lead
target in the development of anti-cancer and
anti-inflammatory drugs. LTA4H is a bifunctional enzyme,
possessing an aminopeptidase as well as an epoxide
hydrolase activity. The two activities occupy different,
but overlapping sites. The activity and physiological
relevance of the aminopeptidase in LTA4H is as yet
unknown while the epoxide hydrolase converts leukotriene
A4 (LTA4) into leukotriene B4 (LTB4), a potent
chemotaxin that is fundamental to the inflammatory
response of mammals.
Length = 407
Score = 117 bits (294), Expect = 6e-28
Identities = 60/161 (37%), Positives = 79/161 (49%), Gaps = 19/161 (11%)
Query: 424 LYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIE------YFG 477
L I + ++AHELAHQWFGNLVT A WNDLWLNEGFA+Y+E FG
Sbjct: 256 LLLAMIDAGDE--LLENVIAHELAHQWFGNLVTGARWNDLWLNEGFATYLEGLWMEATFG 313
Query: 478 VDSVEHTWKIKDIFVVDE---LQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGS 534
+ E + + D L + P V HPD+ + FD + Y KG+
Sbjct: 314 TSAREA------YRLEGTRGLRRWRKLQDELPPAPP--VMGKHPDDPSVEFDGVFYEKGA 365
Query: 535 SLLRMAEHFLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLT 575
+LR E L E GL+KY++K A S+T +L L
Sbjct: 366 LVLRYLEKRLGDEAFFKGLRKYVEKHAGQSATTDDLLQALE 406
Score = 60.1 bits (146), Expect = 2e-09
Identities = 31/117 (26%), Positives = 49/117 (41%), Gaps = 8/117 (6%)
Query: 816 ISNPSKIERTGPYVWDHFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSREEYINQ 875
SN +K + W F + ++ YL+A+ V D + V G
Sbjct: 146 ASNGNKDDGEVRRKWVEFSPPIPIAPYLIALVVGDLEYPVREQTVQGIP--LHLYFLTPL 203
Query: 876 TAYSLDIGPRLLKYFEKYFDYH---YPLEKTDMIALPDFGAGAMENFGLITFRTAIL 929
++ RL F Y YP + D++ +P+F +GAMEN GLI F ++L
Sbjct: 204 AVDTVL---RLQSAFLFYETDLGGPYPYSEYDVVEVPEFPSGAMENPGLIFFSQSLL 257
Score = 39.3 bits (92), Expect = 0.008
Identities = 17/50 (34%), Positives = 25/50 (50%), Gaps = 3/50 (6%)
Query: 367 IMASQFQATDARRAFPCFDEPSLKAKFAISIGRLPN--MTAISNMPLKDG 414
+ SQ +A AR FPC D P A + ++ +P+ + A SN DG
Sbjct: 106 FLFSQGEAIGARSWFPCQDRPDSVATYYTTV-TVPDKTLVAASNGNKDDG 154
>gnl|CDD|189010 cd09603, M1_APN_4, Peptidase M1 family Aminopeptidase N. This
family contains mostly bacterial and some archaeal
aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease
belonging to the M1 gluzincin family. APN consists of a
small N-terminal cytoplasmic domain, a single
transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and, in higher eukaryotes,
is present in a variety of human tissues and cell types
(leukocyte, fibroblast, endothelial and epithelial
cells). APN expression is dysregulated in inflammatory
diseases such as chronic pain, rheumatoid arthritis,
multiple sclerosis, systemic sclerosis, systemic lupus
erythematosus, polymyositis/dermatomyosytis and
pulmonary sarcoidosis, and is enhanced in tumor cells
such as melanoma, renal, prostate, pancreas, colon,
gastric and thyroid cancers. It is predominantly
expressed on stem cells and on cells of the granulocytic
and monocytic lineages at distinct stages of
differentiation, thus considered a marker of
differentiation. Thus, APN inhibition may lead to the
development of anti-cancer and anti-inflammatory drugs.
APNs are also present in many pathogenic bacteria and
represent potential drug targets, Some APNs have been
used commercially, such as one from Lactococcus lactis
used in the food industry. APN also serves as a receptor
for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs have also been extensively studied
as putative Cry toxin receptors. Cry1 proteins are
pore-forming toxins that bind to the midgut epithelial
cell membrane of susceptible insect larvae, causing
extensive damage. Several different toxins, including
Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
been shown to bind to APNs; however, a direct role of
APN in cytotoxicity has been yet to be firmly
established.
Length = 415
Score = 87.3 bits (217), Expect = 5e-18
Identities = 42/135 (31%), Positives = 59/135 (43%), Gaps = 16/135 (11%)
Query: 440 TIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIE--YFGVDSVEH--TWKIKDIFVVDE 495
++AHELAHQWFG+ VT A W+D+WLNEGFA+Y E + EH +
Sbjct: 267 RLIAHELAHQWFGDSVTCADWSDIWLNEGFATYAEWLWS-----EHSGGRDAAQYALYAY 321
Query: 496 LQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLLRMAEHFLTTEVLKLGLQK 555
+ L PV E+FD Y KG+ +L L E L+
Sbjct: 322 ARLYANEADLGPRPPVPGP-------AELFDDHVYEKGALVLHALRRLLGDEAFFRLLRT 374
Query: 556 YIKKKAMGSSTQAEL 570
Y+ + G+ T +
Sbjct: 375 YLAEHRGGNVTTEDF 389
Score = 60.7 bits (148), Expect = 2e-09
Identities = 23/93 (24%), Positives = 36/93 (38%), Gaps = 1/93 (1%)
Query: 832 HFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSREEYINQTAYSLDIGPRLLKYFE 891
H++ M+TYLV +AV + + + P +L +FE
Sbjct: 163 HWRMDEPMATYLVTLAVGRYEVLEDTTAGGVPVVYYVPPGLAADARRAFARTPEMLDFFE 222
Query: 892 KYFDYHYPLEKTDMIALPDFGAGAMENFGLITF 924
+ F YP EK + + DF G MEN +
Sbjct: 223 ELF-GPYPFEKYGQVVVDDFLGGGMENQTRTVY 254
Score = 45.7 bits (109), Expect = 8e-05
Identities = 35/150 (23%), Positives = 54/150 (36%), Gaps = 23/150 (15%)
Query: 260 GEVWIHVEVSQTTNNITLHMNDLTILERSIKQVDNRSANWESDEGTSLTIGQVRNDTINQ 319
G I +Q +++ L + LT+ S+ VD R A + D G LTI
Sbjct: 20 GTATITARATQDLSSLNLDLAGLTV--SSV-TVDGRPAAFRHD-GDELTIT--------- 66
Query: 320 FMVFELEDEQFWATKRYVLYIKYVGKLNDQMRGLYRSSYEVNNTKRWIMASQFQATDARR 379
+ + + ++Y G + + + Q A
Sbjct: 67 ------PPRPLPKGQTFTVTVRYSGV-PRPGLYPWGGDGGWEEGPDGV-WTAGQPEGAST 118
Query: 380 AFPCFDEPSLKAKFAISIGRLPN-MTAISN 408
FPC D PS KA F IS+ +P T +SN
Sbjct: 119 WFPCNDHPSDKATFDISV-TVPAGYTVVSN 147
>gnl|CDD|221257 pfam11838, DUF3358, Domain of unknown function (DUF3358). This
domain is functionally uncharacterized. This domain is
found in eukaryotes and bacteria. This domain is found
to the C-terminus of an aminopeptidase domain.
Length = 323
Score = 84.3 bits (209), Expect = 2e-17
Identities = 47/168 (27%), Positives = 74/168 (44%), Gaps = 15/168 (8%)
Query: 666 GYYRVLYDEKNWYLIIATLRNSTTYNTIHLLNRAQLIDDAMNLARAGLLDYKIALD-VTA 724
GYYRV YD ++ ATL + + L+RA L DA L R G L + LD V A
Sbjct: 10 GYYRVRYDPESL----ATLGEALARLKLDPLDRAGLWADAWALVRDGELPTRDFLDLVLA 65
Query: 725 YLQYETELVPWRSAMQALGYIEGQLY-----RRAYFDKYKKYLLHIIRPMYESIGFDGSP 779
+L ET+ V + LG ++ LY R+ KK++ + P+ E +G++ P
Sbjct: 66 FLPNETDYVVLSEILAQLGTLKSALYFEPEERKEAL---KKFVRKLAAPLLEKLGWEFGP 122
Query: 780 KDDQMTVYKRVDVLNRACILGLKDCVQKALSKYQNWISNPSKIERTGP 827
D R +L+ A + V AL +++ + + P
Sbjct: 123 GSDHQLQLLRALLLSLAGGDE--EVVAAALDRFRALLDGDKSLAAIDP 168
>gnl|CDD|189006 cd09599, M1_LTA4H, Peptidase M1 family contains leukotriene A4
hydrolase. This family includes leukotriene A4
hydrolase (LTA4H; E.C. 3.3.2.6) and the close homolog
cold-active aminopeptidase (Colwellia
psychrerythraea-type peptidase; ColAP), both members of
the aminopeptidase M1 family. LTA4H, is a bifunctional
enzyme possessing an aminopeptidase as well as an
epoxide hydrolase activity. The two activities occupy
different, but overlapping sites. The activity and
physiological relevance of the aminopeptidase is as yet
unknown while the epoxide hydrolase converts leukotriene
A4 (LTA4) into leukotriene B4 (LTB4), a potent
chemotaxin that is fundamental to the inflammatory
response of mammals. It accepts a variety of substrates,
including some opioid, di- and tripeptides, as well as
chromogenic aminoacyl-p-nitroanilide derivatives. The
aminopeptidase activity of LTA4H is possibly involved in
the processing of peptides related to inflammation and
host defense. Kinetic analysis shows that LTA4H
hydrolyzes arginyl tripeptides with high efficiency and
specificity, indicating its function as an arginyl
aminopeptidase. LTA4H is overexpressed in certain human
cancers, and has been identified as a functionally
important target for mediating anticancer properties of
resveratrol, a well known red wine polyphenolic compound
with cancer chemopreventive activity.
Length = 442
Score = 83.3 bits (207), Expect = 1e-16
Identities = 44/145 (30%), Positives = 57/145 (39%), Gaps = 20/145 (13%)
Query: 441 IVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIE------YFGVDSVEHTWKIKDIFVVD 494
+VAHE+AH W GNLVT A W WLNEGF Y+E +G E + + I
Sbjct: 285 VVAHEIAHSWSGNLVTNATWEHFWLNEGFTVYLERRILERLYG----EDYRQFEAIIGWK 340
Query: 495 ELQNVFFLDALKSSHP-----VHVEVSHPDEITEIFDKISYSKGSSLLRMAEHFLTTEVL 549
ELQ + ++ PD + F + Y KG L E EV
Sbjct: 341 ELQES--IKTFGEDPEYTKLVPDLKGVDPD---DAFSSVPYEKGFQFLYYLEQLGGREVF 395
Query: 550 KLGLQKYIKKKAMGSSTQAELWAFL 574
L+ Y K S + FL
Sbjct: 396 DPFLRSYFDKFKFKSIDTEQFKDFL 420
Score = 36.0 bits (84), Expect = 0.083
Identities = 30/96 (31%), Positives = 44/96 (45%), Gaps = 7/96 (7%)
Query: 832 HFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSREEYINQTAYSLDIGPRLLKYFE 891
HF++ V + +YL+A+AV D R S RVW+ ++ A + LK E
Sbjct: 183 HFEQPVPIPSYLIAIAVGDLESR-PIGPRS---RVWAEPSLLDAAAEEFADTEKFLKAAE 238
Query: 892 KYFDYHYPLEKTDMIALPD-FGAGAMENFGLITFRT 926
Y + D++ LP F G MEN +TF T
Sbjct: 239 DLI-GPYVWGRYDLLVLPPSFPYGGMEN-PCLTFAT 272
Score = 34.4 bits (80), Expect = 0.26
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 368 MASQFQATDARRAFPCFDEPSLKAKFAISI 397
+ SQ QA AR FPC D PS+K+ ++ ++
Sbjct: 127 LFSQCQAIHARSLFPCQDTPSVKSTYSATV 156
>gnl|CDD|222169 pfam13485, Peptidase_MA_2, Peptidase MA superfamily.
Length = 128
Score = 61.4 bits (149), Expect = 3e-11
Identities = 28/126 (22%), Positives = 45/126 (35%), Gaps = 15/126 (11%)
Query: 431 TNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFGVDSVEHTWKIKDI 490
+ + ++AHELAH GNLV WL EG A Y+ D
Sbjct: 18 APGDPDELRGVLAHELAHVVLGNLVGGNGNLPRWLTEGLAEYVAG-----------RIDP 66
Query: 491 FVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLLRMAEHFLTTEVLK 550
EL++ L + + S E + Y++G++ +R E L+
Sbjct: 67 ERAAELRDAVRSGRLPPLDELSADFSANSEDGSLA----YAQGAAFVRYLAERYGEEKLR 122
Query: 551 LGLQKY 556
L+
Sbjct: 123 ALLRAL 128
>gnl|CDD|189007 cd09600, M1_APN_1, Peptidase M1 family containing Aminopeptidase N.
This family contains aminopeptidase N (APN; CD13;
Alanyl aminopeptidase; EC 3.4.11.2), a Type II integral
membrane protease belonging to the M1 gluzincin family.
It includes bacterial-type alanyl aminopeptidases as
well as PfA-M1 aminopeptidase (Plasmodium
falciparum-type). APN consists of a small N-terminal
cytoplasmic domain, a single transmembrane domain and a
large extracellular ectodomain that contains the active
site. It preferentially cleaves neutral amino acids from
the N-terminus of oligopeptides and, in higher
eukaryotes, is present in a variety of human tissues and
cell types (leukocyte, fibroblast, endothelial and
epithelial cells). APN expression is dysregulated in
inflammatory diseases such as chronic pain, rheumatoid
arthritis, multiple sclerosis, systemic sclerosis,
systemic lupus erythematosus,
polymyositis/dermatomyosytis and pulmonary sarcoidosis,
and is enhanced in tumor cells such as melanoma, renal,
prostate, pancreas, colon, gastric and thyroid cancers.
It is predominantly expressed on stem cells and on cells
of the granulocytic and monocytic lineages at distinct
stages of differentiation, thus considered a marker of
differentiation. Thus, APN inhibition may lead to the
development of anti-cancer and anti-inflammatory drugs.
APNs are also present in many pathogenic bacteria and
represent potential drug targets, Some APNs have been
used commercially, such as one from Lactococcus lactis
used in the food industry. APN also serves as a receptor
for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs have also been extensively studied
as putative Cry toxin receptors. Cry1 proteins are
pore-forming toxins that bind to the midgut epithelial
cell membrane of susceptible insect larvae, causing
extensive damage. Several different toxins, including
Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
been shown to bind to APNs; however, a direct role of
APN in cytotoxicity has been yet to be firmly
established.
Length = 861
Score = 63.8 bits (156), Expect = 4e-10
Identities = 48/163 (29%), Positives = 72/163 (44%), Gaps = 16/163 (9%)
Query: 402 NMTAISNMPLKDGNQSDPENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWN 461
NM A+ N L N +L D + +T+ ERI +++ HE H W GN VT W
Sbjct: 251 NMGAMENKGLNIFNSK----LVLADPETATDADYERIESVIGHEYFHNWTGNRVTCRDWF 306
Query: 462 DLWLNEGFASYIEY-FGVDSVEHTWK-IKDIFVVDELQNVFFLDALKSSHPVHVEVSHPD 519
L L EG + + F D K I+D+ + Q F DA +HP+ PD
Sbjct: 307 QLSLKEGLTVFRDQEFSADMGSRAVKRIEDVRFLRAHQ--FPEDAGPMAHPI-----RPD 359
Query: 520 EITEI---FDKISYSKGSSLLRMAEHFLTTEVLKLGLQKYIKK 559
E+ + Y KG+ ++RM L E + G+ Y ++
Sbjct: 360 SYIEMNNFYTATVYEKGAEVIRMYHTLLGEEGFRKGMDLYFQR 402
Score = 42.6 bits (101), Expect = 0.001
Identities = 24/95 (25%), Positives = 46/95 (48%), Gaps = 10/95 (10%)
Query: 842 YLVAMAVTDFSHRVHNDDH----SG---SFRVWSREEYINQTAYSLDIGPRLLKYFEKYF 894
YL A+ D +D SG + ++ ++ A++++ + +K+ E F
Sbjct: 177 YLFALVAGDLGVL---EDKFTTKSGRKVALEIYVEPGDESKCAHAMESLKKSMKWDEDRF 233
Query: 895 DYHYPLEKTDMIALPDFGAGAMENFGLITFRTAIL 929
Y L+ +++A+ DF GAMEN GL F + ++
Sbjct: 234 GLEYDLDLFNIVAVDDFNMGAMENKGLNIFNSKLV 268
>gnl|CDD|233856 TIGR02411, leuko_A4_hydro, leukotriene A-4
hydrolase/aminopeptidase. Members of this family
represent a distinctive subset within the zinc
metallopeptidase family M1 (pfam01433). The majority of
the members of pfam01433 are aminopeptidases, but the
sequences in this family for which the function is known
are leukotriene A-4 hydrolase. A dual epoxide hydrolase
and aminopeptidase activity at the same active site is
indicated. The physiological substrate for
aminopeptidase activity is not known.
Length = 602
Score = 62.1 bits (151), Expect = 1e-09
Identities = 42/134 (31%), Positives = 57/134 (42%), Gaps = 13/134 (9%)
Query: 439 ATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIE--YFGVDSVEHTWKIKDIFVVDEL 496
++AHELAH W GNLVT W WLNEG+ Y+E G E T + +L
Sbjct: 280 VDVIAHELAHSWSGNLVTNCSWEHFWLNEGWTVYLERRIIGRLYGEKTRHFSALIGWGDL 339
Query: 497 QNVFFLDALKSSHP-----VHVEVSHPDEITEIFDKISYSKGSSLLRMAEHFL-TTEVLK 550
Q + L + V ++ + PD + F + Y KG + L E L
Sbjct: 340 QES--VKTLGETPEFTKLVVDLKDNDPD---DAFSSVPYEKGFNFLFYLEQLLGGPAEFD 394
Query: 551 LGLQKYIKKKAMGS 564
L+ Y KK A S
Sbjct: 395 PFLRHYFKKFAYKS 408
Score = 38.6 bits (90), Expect = 0.014
Identities = 19/62 (30%), Positives = 31/62 (50%), Gaps = 3/62 (4%)
Query: 368 MASQFQATDARRAFPCFDEPSLKAKFAISIGRLPNMTAISNMPLKDGNQSDPENSMLYDE 427
+ SQ QA AR FPC D PS+K+ + + P +S + DG S+ L+ +
Sbjct: 126 LFSQCQAIHARSLFPCQDTPSVKSTYTAEV-ESPLPVLMSGIR--DGETSNDPGKYLFKQ 182
Query: 428 QI 429
++
Sbjct: 183 KV 184
>gnl|CDD|233858 TIGR02414, pepN_proteo, aminopeptidase N, Escherichia coli type.
The M1 family of zinc metallopeptidases contains a
number of distinct, well-separated clades of proteins
with aminopeptidase activity. Several are designated
aminopeptidase N, EC 3.4.11.2, after the Escherichia
coli enzyme, suggesting a similar activity profile (see
SP|P04825 for a description of catalytic activity). This
family consists of all aminopeptidases closely related
to E. coli PepN and presumed to have similar (not
identical) function. Nearly all are found in
Proteobacteria, but members are found also in
Cyanobacteria, plants, and apicomplexan parasites. This
family differs greatly in sequence from the family of
aminopeptidases typified by Streptomyces lividans PepN
(TIGR02412), from the membrane bound aminopeptidase N
family in animals, etc [Protein fate, Degradation of
proteins, peptides, and glycopeptides].
Length = 863
Score = 62.0 bits (151), Expect = 1e-09
Identities = 50/163 (30%), Positives = 72/163 (44%), Gaps = 16/163 (9%)
Query: 402 NMTAISNMPLKDGNQSDPENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWN 461
NM A+ N L N +L D + +T+ ERI +++AHE H W GN VT W
Sbjct: 251 NMGAMENKGLNIFNSK----YVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWF 306
Query: 462 DLWLNEGFASYIEY-FGVDSVEHTWKIKDIFVVDELQNVFFL-DALKSSHPVHVEVSHPD 519
L L EG + + F D + +K I V L+ F DA +HPV P+
Sbjct: 307 QLSLKEGLTVFRDQEFSADM--TSRAVKRIEDVRLLRAHQFPEDAGPMAHPV-----RPE 359
Query: 520 EITEI---FDKISYSKGSSLLRMAEHFLTTEVLKLGLQKYIKK 559
EI + Y KG+ ++RM L E + G+ Y +
Sbjct: 360 SYVEINNFYTATVYEKGAEVIRMLHTLLGEEGFRKGMDLYFSR 402
Score = 42.7 bits (101), Expect = 0.001
Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 10/95 (10%)
Query: 842 YLVAMAVTDFSHRVHNDDH----SG---SFRVWSREEYINQTAYSLDIGPRLLKYFEKYF 894
YL A+ D +D SG + RV+ E ++ ++++ + +K+ E+ F
Sbjct: 177 YLFALVAGDLDVL---EDTFTTKSGREVALRVYVEEGNKDKCDHAMESLKKAMKWDEEVF 233
Query: 895 DYHYPLEKTDMIALPDFGAGAMENFGLITFRTAIL 929
Y L+ ++A+ DF GAMEN GL F + +
Sbjct: 234 GLEYDLDIFMIVAVDDFNMGAMENKGLNIFNSKYV 268
>gnl|CDD|189011 cd09604, M1_APN_5, Peptidase M1 family containing bacterial
Aminopeptidase N. This family contains bacterial
aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease
belonging to the M1 gluzincin family. APN consists of a
small N-terminal cytoplasmic domain, a single
transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and, in higher eukaryotes,
is present in a variety of human tissues and cell types
(leukocyte, fibroblast, endothelial and epithelial
cells). APN expression is dysregulated in inflammatory
diseases such as chronic pain, rheumatoid arthritis,
multiple sclerosis, systemic sclerosis, systemic lupus
erythematosus, polymyositis/dermatomyosytis and
pulmonary sarcoidosis, and is enhanced in tumor cells
such as melanoma, renal, prostate, pancreas, colon,
gastric and thyroid cancers. It is predominantly
expressed on stem cells and on cells of the granulocytic
and monocytic lineages at distinct stages of
differentiation, thus considered a marker of
differentiation. Thus, APN inhibition may lead to the
development of anti-cancer and anti-inflammatory drugs.
APNs are also present in many pathogenic bacteria and
represent potential drug targets, Some APNs have been
used commercially, such as one from Lactococcus lactis
used in the food industry. APN also serves as a receptor
for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs have also been extensively studied
as putative Cry toxin receptors. Cry1 proteins are
pore-forming toxins that bind to the midgut epithelial
cell membrane of susceptible insect larvae, causing
extensive damage. Several different toxins, including
Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
been shown to bind to APNs; however, a direct role of
APN in cytotoxicity has been yet to be firmly
established.
Length = 435
Score = 58.1 bits (141), Expect = 1e-08
Identities = 31/158 (19%), Positives = 58/158 (36%), Gaps = 29/158 (18%)
Query: 435 KERIATIVAHELAHQWFGNLVTLAWWNDL----WLNEGFASYIEYF--------GVDSVE 482
+ T++AHE+AHQW+ +V ND WL+EG +Y+ + +
Sbjct: 285 PFILETVLAHEIAHQWWYGIVG----NDERNEPWLDEGLTTYLTDYYLEERYGKEAARLY 340
Query: 483 HTWKIKDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLLRMAEH 542
++ D + +N L ++ + I+Y KG+ +L M
Sbjct: 341 RLRRLTDYAALVNSRNDPPLAFFFRNNGAYSA-------------IAYGKGAMVLHMLRK 387
Query: 543 FLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLTNAGHE 580
+ E L+ Y ++ A +T + A
Sbjct: 388 EIGDEAFDKALRTYYREYAFKHATPEDFRRAFEKASGR 425
>gnl|CDD|189018 cd09839, M1_TAF2, TATA binding protein (TBP) associated factor 2.
This family includes TATA binding protein (TBP)
associated factor 2 (TAF2, TBP-associated factor
TAFII150, transcription initiation factor TFIID subunit
2, RNA polymerase II TBP-associated factor subunit B),
and has homology to the aminopeptidase N (APN)
subfamily, belonging to the M1 gluzincin family. TAF2 is
part of the TFIID multidomain subunit complex essential
for transcription of most protein-encoded genes by RNA
polymerase II. TAF2 is known to interact with the
initiator element (Inr) found at the transcription start
site of many genes, thus possibly playing a key role in
promoter binding as well as start-site selection. Image
analysis has shown TAF2 to form a complex with TAF1 and
TBP, inferring its role in promoter recognition.
Peptidases in the M1 family bind a single catalytic zinc
ion which is tetrahedrally co-ordinated by three amino
acid ligands and a water molecule that forms the
nucleophile on activation during catalysis. TAF2,
however, does not seem to contain any of the active site
residues.
Length = 507
Score = 53.1 bits (128), Expect = 5e-07
Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 18/165 (10%)
Query: 421 NSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYI------E 474
+++LY E I + R A LA QWFG ++ W+D WL +G A YI +
Sbjct: 299 SNLLYPEDIIDQTYDTRRKLAYA--LASQWFGCFISPEAWSDEWLLKGIAGYITGLFVKK 356
Query: 475 YFGVDSVEHTWKIK----DIFVVDELQNVFFLDA--LKSSHPVHVEVSHPDEITEIFDKI 528
FG + E+ ++IK + D + LD L P++ + HP + ++ +
Sbjct: 357 LFGNN--EYRFRIKKELDRVCEYDIKRPGILLDPIILPGKLPLNPDPKHPHTCSPLYGEF 414
Query: 529 SYSKGSSLLRMAEHFLTTEVLKLGLQKYIKK--KAMGSSTQAELW 571
K +LRM E + E L K + + +A TQ+ W
Sbjct: 415 MALKAHLVLRMLERRIGKESFLQVLNKILSRALQASQQKTQSGDW 459
>gnl|CDD|189001 cd09594, GluZincin, Peptidase Gluzincin family (thermolysin-like
proteinases, TLPs) includes peptidases M1, M2, M3, M4,
M13, M32 and M36 (fungalysins). Gluzincin family
(thermolysin-like peptidases or TLPs) includes several
zinc-dependent metallopeptidases such as the M1, M2, M3,
M4, M13, M32, M36 peptidases (MEROPS classification),
and contain HEXXH and EXXXD motifs as part of their
active site. All peptidases in this family bind a single
catalytic zinc ion which is tetrahedrally co-ordinated
by three amino acid ligands and a water molecule that
forms the nucleophile on activation during catalysis. M1
family includes aminopeptidase N (APN) and leukotriene
A4 hydrolase (LTA4H). APN preferentially cleaves
neutral amino acids from the N-terminus of oligopeptides
and is present in a variety of human tissues and cell
types. LTA4H is a bifunctional enzyme, possessing an
aminopeptidase as well as an epoxide hydrolase activity
such that the two activities occupy different, but
overlapping sites. The peptidase M3 or neurolysin-like
family, includes M3, M2 and M32 metallopeptidases. The
M3 peptidases have two subfamilies: M3A, includes thimet
oligopeptidase (TOP; endopeptidase 3.4.24.15),
neurolysin (3.4.24.16), and the mitochondrial
intermediate peptidase; M3B contains oligopeptidase F.
M2 peptidase angiotensin converting enzyme (ACE, EC
3.4.15.1) catalyzes the conversion of decapeptide
angiotensin I to the potent vasopressor octapeptide
angiotensin II. ACE is a key part of the
renin-angiotensin system that regulates blood pressure,
thus ACE inhibitors are important for the treatment of
hypertension. M32 family includes two eukaryotic enzymes
from protozoa Trypanosoma cruzi, a causative agent of
Chagas' disease, and Leishmania major, a parasite that
causes leishmaniasis, making them attractive targets for
drug development. The M4 family includes secreted
protease thermolysin (EC 3.4.24.27), pseudolysin,
aureolysin, neutral protease as well as fungalysin and
bacillolysin (EC 3.4.24.28) that degrade extracellular
proteins and peptides for bacterial nutrition,
especially prior to sporulation. Thermolysin is widely
used as a nonspecific protease to obtain fragments for
peptide sequencing as well as in production of the
artificial sweetener aspartame. M13 family includes
neprilysin (EC 3.4.24.11) and endothelin-converting
enzyme I (ECE-1, EC 3.4.24.71), which fulfill a broad
range of physiological roles due to the greater
variation in the S2' subsite allowing substrate
specificity and are prime therapeutic targets for
selective inhibition. Peptidase M36 (fungamysin) family
includes endopeptidases from pathogenic fungi.
Fungalysin hydrolyzes extracellular matrix proteins such
as elastin and keratin. Aspergillus fumigatus causes the
pulmonary disease aspergillosis by invading the lungs of
immuno-compromised animals and secreting fungalysin that
possibly breaks down proteinaceous structural barriers.
Length = 125
Score = 42.4 bits (100), Expect = 1e-04
Identities = 14/69 (20%), Positives = 22/69 (31%), Gaps = 6/69 (8%)
Query: 414 GNQSDPENSMLY-DEQISTNYHKERIATIVAHELAHQWFGNLVT-----LAWWNDLWLNE 467
+ + ++ D + +V HEL H LV L LNE
Sbjct: 44 AMCNGLDARIVMNDGILVAFLLDSDDFGVVGHELTHGVTDQLVGNDPDLLYTNGSGGLNE 103
Query: 468 GFASYIEYF 476
G + + E
Sbjct: 104 GPSDFFELL 112
>gnl|CDD|226774 COG4324, COG4324, Predicted aminopeptidase [General function
prediction only].
Length = 376
Score = 36.5 bits (84), Expect = 0.060
Identities = 20/63 (31%), Positives = 28/63 (44%), Gaps = 14/63 (22%)
Query: 417 SDP-ENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEY 475
DP ++ML +A+++ HELAHQ ND NE FA+ +E
Sbjct: 182 DDPLLSTML-------RQDDTYLASLIFHELAHQKI------YVNNDTAFNESFATAVET 228
Query: 476 FGV 478
GV
Sbjct: 229 SGV 231
>gnl|CDD|204379 pfam10023, DUF2265, Predicted aminopeptidase (DUF2265). Members of
this family of bacterial proteins comprise various
hypothetical proteins and putative aminopeptidases.
Their exact function, has not, as yet, been defined.
Length = 338
Score = 35.3 bits (82), Expect = 0.13
Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 14/63 (22%)
Query: 417 SDP-ENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEY 475
+DP ++ + Y +A ++ HELAHQ + D NE FA+ +E
Sbjct: 151 NDPLLSTFIR-------YPDGELARLIFHELAHQ------VVYIKGDTAFNESFATAVER 197
Query: 476 FGV 478
GV
Sbjct: 198 EGV 200
>gnl|CDD|237585 PRK14015, pepN, aminopeptidase N; Provisional.
Length = 875
Score = 33.6 bits (78), Expect = 0.55
Identities = 43/136 (31%), Positives = 57/136 (41%), Gaps = 28/136 (20%)
Query: 436 ERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGF-----ASYIEYFGVDSVEHTWK-IKD 489
ERI +++AHE H W GN VT W L L EG F D K I+D
Sbjct: 294 ERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQE----FSADLGSRAVKRIED 349
Query: 490 IFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKIS-------YSKGSSLLRMAEH 542
+ V+ Q F DA +HPV PD E I+ Y KG+ ++RM
Sbjct: 350 VRVLRAAQ--FAEDAGPMAHPV-----RPDSYIE----INNFYTATVYEKGAEVIRMLHT 398
Query: 543 FLTTEVLKLGLQKYIK 558
L E + G+ Y +
Sbjct: 399 LLGEEGFRKGMDLYFE 414
>gnl|CDD|198344 cd10424, GST_C_9, C-terminal, alpha helical domain of an unknown
subfamily 9 of Glutathione S-transferases. Glutathione
S-transferase (GST) C-terminal domain family, unknown
subfamily 9; composed of uncharacterized proteins with
similarity to GSTs. GSTs are cytosolic dimeric proteins
involved in cellular detoxification by catalyzing the
conjugation of glutathione (GSH) with a wide range of
endogenous and xenobiotic alkylating agents, including
carcinogens, therapeutic drugs, environmental toxins,
and products of oxidative stress. GSTs also show GSH
peroxidase activity and are involved in the synthesis of
prostaglandins and leukotrienes. The GST fold contains
an N-terminal thioredoxin-fold domain and a C-terminal
alpha helical domain, with an active site located in a
cleft between the two domains. GSH binds to the
N-terminal domain while the hydrophobic substrate
occupies a pocket in the C-terminal domain.
Length = 103
Score = 30.8 bits (70), Expect = 0.91
Identities = 13/54 (24%), Positives = 20/54 (37%), Gaps = 2/54 (3%)
Query: 167 PTRNLIAELFKV--EEKTTKAKFNGRLPTDVKPLAYRIKILPFLIENNFTIAEL 218
P R L E F K + L + LA + P++ FT+A+
Sbjct: 15 PARRLYPEAFFGGKVSPEIKEEVRKDLLRGIAALARLARFAPYVAGETFTLADC 68
>gnl|CDD|227631 COG5321, COG5321, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 164
Score = 31.0 bits (70), Expect = 1.4
Identities = 25/86 (29%), Positives = 37/86 (43%), Gaps = 6/86 (6%)
Query: 400 LPNMTAISNMPLKDGNQSDPENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAW 459
+P ++ I PL DG QS E +ML + A + LA +L+ L+
Sbjct: 1 MPIISPIQTNPLADGRQS--ERAMLVRRGVQRLLMAMGHAVLPELTLASGRRADLIALSP 58
Query: 460 WNDLWLNEGFASYIEYFGVDSVEHTW 485
++W+ E S IE F VD W
Sbjct: 59 KGEIWIIE-IKSSIEDFRVD---RKW 80
>gnl|CDD|192017 pfam08372, PRT_C, Plant phosphoribosyltransferase C-terminal. This
domain is found at the C-terminus of
phosphoribosyltransferases and
phosphoribosyltransferase-like proteins. It contains
putative transmembrane regions. It often appears
together with calcium-ion dependent C2 domains
(pfam00168).
Length = 156
Score = 30.7 bits (70), Expect = 2.0
Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 7/38 (18%)
Query: 509 HPVHV-------EVSHPDEITEIFDKISYSKGSSLLRM 539
HP H+ + +HPDE+ E FD S+ ++RM
Sbjct: 22 HPPHMDTRLSHADAAHPDELDEEFDTFPTSRPPDVVRM 59
>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
Length = 697
Score = 31.6 bits (73), Expect = 2.2
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 643 TGFPVIRVARDYDAGSAVVKQVRGYYRVLY 672
G V RV+ D D G +V Q+ + LY
Sbjct: 263 AGIIVTRVSSDGDVGEQIVGQLFANPKALY 292
>gnl|CDD|238134 cd00217, INT_Flp, Flp recombinase, C-terminal catalytic domain.
Yeast Flp-like recombinases mediate the amplification of
the 2 micron circular plasmid copy number by catalyzing
the intra-molecular recombination between two inverted
repeats during replication. They belong to the DNA
breaking-rejoining enzyme superfamily, which also
includes prokaryotic tyrosine recombinases and type IB
topoisomerases. These enzymes share the same fold in
their catalytic domain containing six conserved active
site residues and the overall reaction mechanism.
Flp-like recombinases are almost exclusively found in
yeast and are highly diverged in sequence from the
prokaryotic tyrosine recombinases. They cleave their
target DNA in trans with a composite active site in
which the catalytic tyrosine is provided by a promoter
bound to a site other than the one being cleaved. Thus
each active site within Flp complexes is assembled by
domain swapping and contains catalytic residues from two
different monomers. Two DNA segments are synapsed by the
tetrameric enzyme, carrying the nucleophilic tyrosine in
each active site with only two of the four monomers
active at a given time. The catalytic domain is linked
through a flexible loop to the N-terminal domain, which
is largely responsible for non-specific DNA binding and
isomerization. Its overall fold is similar to the SAM
domain fold also found in the N-terminal domains of
lambda integrase and XerD recombinase.
Length = 339
Score = 30.5 bits (69), Expect = 4.2
Identities = 21/126 (16%), Positives = 42/126 (33%), Gaps = 19/126 (15%)
Query: 521 ITEIFDKISYSKGSSLLRMAEHFLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLTNAGHE 580
+ I +S+ K ++ H +L L K + F
Sbjct: 55 KSCISKTLSFDKDDKIVSFKYHLKKPTILGKTLSKAFSP-----------YKF------V 97
Query: 581 MRTLPEKMDVLKLGLQKYIKKKAMGSSTQAELWAFLTNAGHEMRTLPENMDVETIMNTWT 640
+++ K + L+ ++K+ + + T ++ VE I+N+
Sbjct: 98 IKSRSLKEVDIFSKLELKLQKREEKDKGGSLIDQEFTKLLRTDESI--WKIVEKILNSIE 155
Query: 641 LQTGFP 646
LQT FP
Sbjct: 156 LQTRFP 161
>gnl|CDD|238511 cd01060, Membrane-FADS-like, The membrane fatty acid desaturase
(Membrane_FADS)-like CD includes membrane FADSs, alkane
hydroxylases, beta carotene ketolases (CrtW-like),
hydroxylases (CrtR-like), and other related proteins.
They are present in all groups of organisms with the
exception of archaea. Membrane FADSs are non-heme,
iron-containing, oxygen-dependent enzymes involved in
regioselective introduction of double bonds in fatty
acyl aliphatic chains. They play an important role in
the maintenance of the proper structure and functioning
of biological membranes. Alkane hydroxylases are
bacterial, integral-membrane di-iron enzymes that share
a requirement for iron and oxygen for activity similar
to that of membrane FADSs, and are involved in the
initial oxidation of inactivated alkanes. Beta-carotene
ketolase and beta-carotene hydroxylase are carotenoid
biosynthetic enzymes for astaxanthin and zeaxanthin,
respectively. This superfamily domain has extensive
hydrophobic regions that would be capable of spanning
the membrane bilayer at least twice. Comparison of these
sequences also reveals three regions of conserved
histidine cluster motifs that contain eight histidine
residues: HXXX(X)H, HXX(X)HH, and HXXHH (an additional
conserved histidine residue is seen between clusters 2
and 3). Spectroscopic and genetic evidence point to a
nitrogen-rich coordination environment located in the
cytoplasm with as many as eight histidines coordinating
the two iron ions and a carboxylate residue bridging the
two metals in the Pseudomonas oleovorans alkane
hydroxylase (AlkB). In addition, the eight histidine
residues are reported to be catalytically essential and
proposed to be the ligands for the iron atoms contained
within the rat stearoyl CoA delta-9 desaturase.
Length = 122
Score = 28.2 bits (63), Expect = 9.1
Identities = 9/29 (31%), Positives = 12/29 (41%), Gaps = 3/29 (10%)
Query: 437 RIATIVAHELAHQWFGNLVTLAWWNDLWL 465
T++AHEL H + W N L
Sbjct: 13 LGLTVLAHELGH---RSFFRSRWLNRLLG 38
>gnl|CDD|219486 pfam07607, DUF1570, Protein of unknown function (DUF1570). A
family of hypothetical proteins in Rhodopirellula
baltica. This family carries a highly conserved HExxH
sequence motif characteristic of members of the
Peptidase clan MA.
Length = 128
Score = 28.1 bits (63), Expect = 9.4
Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 3/39 (7%)
Query: 438 IATIVAHELAHQWFGN--LVTLAWWNDLWLNEGFASYIE 474
IAT+ HE HQ N L + N +W++EG A+Y E
Sbjct: 2 IATVA-HEATHQLAFNCGLQSRLADNPMWVSEGLATYFE 39
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.134 0.402
Gapped
Lambda K H
0.267 0.0623 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 48,800,139
Number of extensions: 4845157
Number of successful extensions: 3866
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3829
Number of HSP's successfully gapped: 61
Length of query: 947
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 841
Effective length of database: 6,236,078
Effective search space: 5244541598
Effective search space used: 5244541598
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (28.7 bits)