RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9963
(800 letters)
>gnl|CDD|189000 cd08662, M13, Peptidase family M13 includes neprilysin,
endothelin-converting enzyme I. M13 family of
metallopeptidases includes neprilysin (neutral
endopeptidase, NEP, enkephalinase, CD10, CALLA, EC
3.4.24.11), endothelin-converting enzyme I (ECE-1, EC
3.4.24.71), erythrocyte surface antigen KELL (ECE-3),
phosphate-regulating gene on the X chromosome (PHEX),
soluble secreted endopeptidase (SEP), and damage-induced
neuronal endopeptidase (DINE)/X-converting enzyme (XCE).
These proteins consist of a short N-terminal cytoplasmic
domain, a single transmembrane helix, and a larger
C-terminal extracellular domain containing the active
site. Proteins in this family fulfill a broad range of
physiological roles due to the greater variation in the
S2' subsite allowing substrate specificity. NEP is
expressed in a variety of tissues including kidney and
brain, and is involved in many physiological and
pathological processes, including blood pressure and
inflammatory response. It degrades a wide array of
substrates such as substance P, enkephalins,
cholecystokinin, neurotensin and somatostatin. It is an
important enzyme in the regulation of amyloid-beta
(Abeta) protein that forms amyloid plaques that are
associated with Alzeimers disease (AD). ECE-1 catalyzes
the final rate-limiting step in the biosynthesis of
endothelins via post-translational conversion of the
biologically inactive big endothelins. Like NEP, it also
hydrolyses bradykinin, substance P, neurotensin and
Abeta. Endothelin-1 overproduction has been implicated
in various diseases, including stroke, asthma,
hypertension, and cardiac and renal failure. Kell is a
homolog of NEP and constitutes a major antigen on human
erythrocytes; it preferentially cleaves big endothelin-3
to produce bioactive endothelin-3, but is also known to
cleave substance P and neurokinin A. PHEX forms a
complex interaction with fibroblast growth factor 23
(FGF23) and matrix extracellular phosphoglycoprotein,
causing bone mineralization. A loss-of-function mutation
in PHEX disrupts this interaction leading to
hypophosphatemic rickets; X-linked hypophosphatemic
(XLH) rickets is the most common form of metabolic
rickets. ECEL1 is a brain metalloprotease involved in
the critical role in the nervous regulation of the
respiratory system, while DINE (damage induced neuronal
endopeptidase) is abundantly expressed in the
hypothalamus and its expression responds to nerve injury
as well. Thus, majority of these M13 proteases are prime
therapeutic targets for selective inhibition.
Length = 611
Score = 621 bits (1603), Expect = 0.0
Identities = 244/691 (35%), Positives = 355/691 (51%), Gaps = 84/691 (12%)
Query: 5 VQPCDNFYKFACGNFLKTSVIPPDRASVSTFSKIEDRLSSQIKTVLEEPQSASGP--HIF 62
V PCD+FY++ACGN+LK IP D++S +FS++ +++ ++K +LEE +
Sbjct: 1 VDPCDDFYQYACGNWLKNHPIPADKSSYGSFSELREKVEERLKEILEEAAAEKASDSSAE 60
Query: 63 NLMKKFYATCMDEDTLDKLGVQPMKVIIEQLGGLPVVDGDRWDSSRFSWINTTYKLRDLG 122
+K FY +CMD + ++ LG++P+
Sbjct: 61 QKIKDFYRSCMDTEAIEALGLKPLL----------------------------------- 85
Query: 123 FNHNLLINIVFCKQTQHPINTTQIPGAGPDFKNSTRRRLLIDQVDLK-PLEDFLREGWRN 181
LL + G PD KNS+R L +DQ L P D+ +
Sbjct: 86 ---PLLFGL----------------GVSPDLKNSSRNILYLDQPGLGLPDRDYYLDEKSK 126
Query: 182 KNVRAYHSYMVKTAVLFGAPVSRAFRELKDSIDFEITMTKFIVPDQELRNLSYLYNLMSV 241
K AY +Y+ K VL G A ++ + FE + K ++E R+ YN M++
Sbjct: 127 KIRAAYKAYLAKLLVLAGEDEEDAEALAEEVLAFETELAKISWSEEERRDPEKTYNPMTL 186
Query: 242 HTLNKYYPSIPCFYLEIDGPSLNKKLEFLNRIIGPKISIDSEELVMVTQPKYMSSIERLV 301
L K P I + +L + P +E V+VTQP Y+ + +L+
Sbjct: 187 AELQKLAPGID-W------------KAYLEALGLP----SEDEKVIVTQPDYLKKLNKLL 229
Query: 302 QITPKRILANYMLMQTVLSSSAYLCKSARRPYHDFKYVLTGVNSTRIRSQRCVDEVAEKF 361
TP R L NY++ + + S + YL + R + L+G R R +RCV V
Sbjct: 230 ASTPLRTLKNYLIWRLLDSLAPYLSEEFRDANFFYGKTLSGTKEQRPRWKRCVSLVNGLL 289
Query: 362 SLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTKKNALEKSKVLKVH 421
A+G LYVR +F +K ELV NL++ L+N+ WMD +TKK ALEK + V
Sbjct: 290 GEALGRLYVRKYFPPEAKARVEELVENLKKAFRERLENLDWMDEETKKKALEKLDAMTVK 349
Query: 422 IGYPDELLDDDKIDLHYKDLRIVPGDLLTSVLNLYKKSVDYAYTQLHEPINKTSWKEFGS 481
IGYPD+ D K+D+ Y DL +VL L + ++ +L +P+++T W +
Sbjct: 350 IGYPDKWRDYSKLDIDYDDL----DSYFGNVLRLRRFELERNLAKLGKPVDRTEWGM--T 403
Query: 482 VATVNAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTFDDEGSQYD 541
TVNA+ + +N I FPAGILQ P+F D P +NYG IG VIGHEITH FDD+G Q+D
Sbjct: 404 PQTVNAYYNPTQNEIVFPAGILQPPFFDPDAPDAVNYGGIGAVIGHEITHGFDDQGRQFD 463
Query: 542 KFGNVMNWWDDESKMRYNEKASCIIKQYSNYVIEDIGFRINGITTQGENIADNGGVKESY 601
+ GN+ NWW E + + E+ C++ QYSNY + G +NG T GENIAD GG++ +Y
Sbjct: 464 EDGNLRNWWTPEDRKAFEERTQCLVDQYSNYEV-PPGLHVNGKLTLGENIADLGGLRLAY 522
Query: 602 RAYLECAKQKNAEYKLPGLENFSDKQMFWISFANVWCTKYRKEAMELLFKNDDHPPAEIR 661
AY + K K AE LP + F+ Q+F++SFA VWC+KYR EA+ L D H P + R
Sbjct: 523 DAYKKWLKGKGAE--LPPGDGFTPDQLFFLSFAQVWCSKYRPEALRQLLLTDPHSPGKFR 580
Query: 662 VNGPLSNSYDFARDFQCYSGSK-MNPKHKCE 691
VNG LSNS +FA F C G MNP+ +C
Sbjct: 581 VNGVLSNSPEFAEAFNCPPGDPYMNPEKRCR 611
Score = 115 bits (290), Expect = 1e-26
Identities = 42/93 (45%), Positives = 55/93 (59%)
Query: 703 RSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTK 762
R +RCV V A+G LYVR +F +K ELV NL++ L+N+ WMD +TK
Sbjct: 277 RWKRCVSLVNGLLGEALGRLYVRKYFPPEAKARVEELVENLKKAFRERLENLDWMDEETK 336
Query: 763 KNALEKSKVLKVHIGYPDELLDDDKIDLHYKDL 795
K ALEK + V IGYPD+ D K+D+ Y DL
Sbjct: 337 KKALEKLDAMTVKIGYPDKWRDYSKLDIDYDDL 369
>gnl|CDD|226118 COG3590, PepO, Predicted metalloendopeptidase [Posttranslational
modification, protein turnover, chaperones].
Length = 654
Score = 356 bits (916), Expect = e-112
Identities = 190/705 (26%), Positives = 303/705 (42%), Gaps = 76/705 (10%)
Query: 1 MDPGVQPCDNFYKFACGNFLKTSVIPPDRASVSTFSKIEDRLSSQIKTVLEE---PQSAS 57
MD +P D+ Y + G +LKT+ IP DR+ F K++DR + ++ ++E + A
Sbjct: 14 MDAMTRPQDDLYGYVNGEWLKTAEIPADRSRDGAFDKLDDRAEALVRDIIEAAAANEQAP 73
Query: 58 GPHIFNLMKKFYATCMDEDTLDKLGVQPMKVIIEQLGGLPVVDGDRWDSSRFSWINTTYK 117
I + K Y + MDE +K GV P+K + ++ L S F+ +
Sbjct: 74 EDAILQRIGKLYRSFMDEAKREKAGVDPLKPELAEIDSLA-------SFSDFAAALGQLE 126
Query: 118 LRDLGFNHNLLINIVFCKQTQHPINTTQIPGAGPDFKNSTRRRLLIDQVDLK-PLEDFLR 176
G PDFK+STR L Q L P + R
Sbjct: 127 RAGQG----NPFGF----------------SVSPDFKDSTRYVLYFSQSGLGLPDTTYYR 166
Query: 177 EGWRNKNVR-AYHSYMVKTAVLFGAPVSRAFRELKDS----IDFEITMTKFIVPDQELRN 231
+ ++ + AY ++ + LFG S + + E + + R+
Sbjct: 167 DE-QHAELLAAYKEHVARMLGLFG--FSEEEEDAAKHALRVVALETKLANASWEVVKYRD 223
Query: 232 LSYLYNLMSVHTLNKYYPSIPCFYLEIDGPSLNKKLEFLNRIIGPKISIDSEELVMVTQP 291
L + YN + L P L +G +++++V P
Sbjct: 224 LYHTYNPATFAELQPELPG--------------DDWSLLFSALGQL----PDKVIVVENP 265
Query: 292 KYMSSIERLVQITPKRILANYMLMQTVLSSSAYLCKSARRPYHDF-KYVLTGVNSTRIRS 350
Y+ L+ ++ + + +++ YL + + DF L+G R R
Sbjct: 266 FYLKEFASLLAEENWADWKAWLRWKLIRAAAPYLTEDLVDEHFDFYGRTLSGQPEARDRW 325
Query: 351 QRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTKKN 410
+R V+ F A+G LYV+ +F +K + ELV NL + + + WM P+T++
Sbjct: 326 KRAVNLAERLFGEAIGLLYVKRYFPPEAKADMEELVANLIKAYKARISKLDWMTPETREK 385
Query: 411 ALEKSKVLKVHIGYPDELLDDDKIDLHYKDLRIVPGDLLTSVLNLYKKSVDYAYTQLHEP 470
ALEK IGYPD Y L I L +VL + + +++ +P
Sbjct: 386 ALEKLNKFTAKIGYPDPWRY-------YSKLEIKRDSLYGNVLRASAFNHAHELSKIGKP 438
Query: 471 INKTSWKEFGSVATVNAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEIT 530
+++ W TVNA+ + KN I FPA ILQ P+F + NYG IG VIGHEI
Sbjct: 439 VDRDEW--EMPPQTVNAYYNPQKNEIVFPAAILQAPFFDPEADSAANYGGIGAVIGHEIG 496
Query: 531 HTFDDEGSQYDKFGNVMNWWDDESKMRYNEKASCIIKQYSNYVIEDIGFRINGITTQGEN 590
H FDD+G+++D GN+ +WW DE + E+ ++ Q+ Y E G T GEN
Sbjct: 497 HGFDDQGAKFDGDGNLNDWWTDEDAAAFKERTKKLVAQFDGYEPEG-GKDNGNALTVGEN 555
Query: 591 IADNGGVKESYRAYLECAKQKNAEYKLPGLENFSDKQMFWISFANVWCTKYRKEAMELLF 650
IAD GG+ + AY K + P ++ F+ Q F++++A +W K R E ++
Sbjct: 556 IADLGGLAIALDAY------KLSLDPAPVIDGFTGLQRFFLNWAQIWRMKARPEEQQMRL 609
Query: 651 KNDDHPPAEIRVNGPLSNSYDFARDFQCYSGSKM--NPKHKCEVW 693
D H PAE RVNGP+ N +F F G M P+ + +W
Sbjct: 610 SVDPHSPAEFRVNGPVRNMDEFYEAFDVKEGDAMYRAPEERVVIW 654
Score = 79.4 bits (196), Expect = 3e-15
Identities = 31/88 (35%), Positives = 45/88 (51%)
Query: 701 RIRSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQ 760
R R +R V+ F A+G LYV+ +F +K + ELV NL + + + WM P+
Sbjct: 322 RDRWKRAVNLAERLFGEAIGLLYVKRYFPPEAKADMEELVANLIKAYKARISKLDWMTPE 381
Query: 761 TKKNALEKSKVLKVHIGYPDELLDDDKI 788
T++ ALEK IGYPD K+
Sbjct: 382 TREKALEKLNKFTAKIGYPDPWRYYSKL 409
>gnl|CDD|218676 pfam05649, Peptidase_M13_N, Peptidase family M13. M13 peptidases
are well-studied proteases found in a wide range of
organisms including mammals and bacteria. In mammals
they participate in processes such as cardiovascular
development, blood-pressure regulation, nervous control
of respiration, and regulation of the function of
neuropeptides in the central nervous system. In bacteria
they may be used for digestion of milk.
Length = 380
Score = 296 bits (760), Expect = 3e-93
Identities = 128/423 (30%), Positives = 205/423 (48%), Gaps = 48/423 (11%)
Query: 8 CDNFYKFACGNFLKTSVIPPDRASVSTFSKIEDRLSSQIKTVLEEPQSASGPH-IFNLMK 66
CD+FY++ACG +LK + IP DR+S TFS++ D++ Q++ +LEE S K
Sbjct: 1 CDDFYQYACGGWLKNTPIPADRSSYGTFSELRDKIEKQLRAILEEAASGEADSSNERKAK 60
Query: 67 KFYATCMDEDTLDKLGVQPMKVIIEQLGGLPVVDGDRWDSSRFSWINTTYKLRDLGFNHN 126
Y +CMD D ++KLG++P+K +++++G + S+F + KL G ++
Sbjct: 61 DLYRSCMDTDAIEKLGLKPLKPLLKKIGAI---------KSKFDLADALAKLERGG-SNG 110
Query: 127 LLINIVFCKQTQHPINTTQIPGAGPDFKNSTRRRLLIDQVDLK-PLEDFLREG--WRNKN 183
L G PDFKNS+R L + Q L P D+ + K
Sbjct: 111 PLFGF----------------GVSPDFKNSSRNILYLSQGGLGLPDRDYYLKDDAKSEKI 154
Query: 184 VRAYHSYMVKTAVLFGAPVSRAFRELKDSIDFEITMTKFIVPDQELRNLSYLYNLMSVHT 243
AY Y+ K L G A + +D + FE + K +ELR+ + LYN M++
Sbjct: 155 REAYKKYIAKLLKLAGLDEDEAEKAAEDVVAFETKLAKASWSREELRDPAKLYNPMTLAE 214
Query: 244 LNKYYPSIPCFYLEIDGPSLNKKLEFLNRIIGPKISIDSEELVMVTQPKYMSSIERLVQI 303
L P + +L+ + PK D +E++ V+QP Y+ ++ +L+
Sbjct: 215 LQALAPGLDW-------------KRYLSALGLPK---DPDEVI-VSQPDYLKALNKLLAE 257
Query: 304 TPKRILANYMLMQTVLSSSAYLCKSARRPYHDFKYV-LTGVNSTRIRSQRCVDEVAEKFS 362
TP L NY++ + + S + YL + R +F L+G R R +RCV V
Sbjct: 258 TPLETLKNYLIWKLIDSFAPYLSEEFRDANFEFYGKTLSGTKEQRPRWKRCVSLVNGLLG 317
Query: 363 LAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTKKNALEKSKVLKVHI 422
A+G LYV +F +K + E+V N+++ L+ + WM +TKK ALEK + V I
Sbjct: 318 EALGRLYVEKYFPPEAKADVEEMVKNIKKAFRERLEELDWMSEETKKKALEKLDAMTVKI 377
Query: 423 GYP 425
GYP
Sbjct: 378 GYP 380
Score = 94.7 bits (236), Expect = 9e-21
Identities = 30/77 (38%), Positives = 43/77 (55%)
Query: 703 RSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTK 762
R +RCV V A+G LYV +F +K + E+V N+++ L+ + WM +TK
Sbjct: 304 RWKRCVSLVNGLLGEALGRLYVEKYFPPEAKADVEEMVKNIKKAFRERLEELDWMSEETK 363
Query: 763 KNALEKSKVLKVHIGYP 779
K ALEK + V IGYP
Sbjct: 364 KKALEKLDAMTVKIGYP 380
>gnl|CDD|216499 pfam01431, Peptidase_M13, Peptidase family M13. Mammalian enzymes
are typically type-II membrane anchored enzymes which
are known, or believed to activate or inactivate
oligopeptide (pro)-hormones such as opioid peptides. The
family also contains a bacterial member believed to be
involved with milk protein cleavage.
Length = 206
Score = 204 bits (520), Expect = 6e-61
Identities = 89/210 (42%), Positives = 119/210 (56%), Gaps = 7/210 (3%)
Query: 486 NAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTFDDEGSQYDKFGN 545
NA+ + N + FPAGIL P+F P +NYG IG V+GHEI H FDD+G Q+DK GN
Sbjct: 1 NAYYNPGFNYLTFPAGILVPPFFNDLYPPAVNYGGIGFVLGHEIMHGFDDQGIQFDKDGN 60
Query: 546 VMNWWDDESKMRYNEKASCIIKQYSNYVIEDIGFR--INGITTQGENIADNGGVKESYRA 603
+ WW DE + + + A C+I QY Y D NG TT GENIAD GG++ + +A
Sbjct: 61 LCGWWTDEDEAEFKDAAQCLIDQYDEYGDPDAVKPDCWNGATTFGENIADLGGLRIALKA 120
Query: 604 YLECAKQKNA-EYKLPGLENFSDKQMFWISFANVWCTKYRKEAMELLFKNDDHPPAEIRV 662
Y +K+A E +LPGLEN + Q+F+ ++A WC K R + D H P E+RV
Sbjct: 121 Y----AKKSANEQRLPGLENLTPDQLFFETYAQSWCRKSRPKNSLTQLLVDPHSPLELRV 176
Query: 663 NGPLSNSYDFARDFQCYSGSKMNPKHKCEV 692
NG + N F F C G +M PK +
Sbjct: 177 NGAVRNMPAFYSAFNCKPGDRMFPKPEKRC 206
>gnl|CDD|189001 cd09594, GluZincin, Peptidase Gluzincin family (thermolysin-like
proteinases, TLPs) includes peptidases M1, M2, M3, M4,
M13, M32 and M36 (fungalysins). Gluzincin family
(thermolysin-like peptidases or TLPs) includes several
zinc-dependent metallopeptidases such as the M1, M2, M3,
M4, M13, M32, M36 peptidases (MEROPS classification),
and contain HEXXH and EXXXD motifs as part of their
active site. All peptidases in this family bind a single
catalytic zinc ion which is tetrahedrally co-ordinated
by three amino acid ligands and a water molecule that
forms the nucleophile on activation during catalysis. M1
family includes aminopeptidase N (APN) and leukotriene
A4 hydrolase (LTA4H). APN preferentially cleaves
neutral amino acids from the N-terminus of oligopeptides
and is present in a variety of human tissues and cell
types. LTA4H is a bifunctional enzyme, possessing an
aminopeptidase as well as an epoxide hydrolase activity
such that the two activities occupy different, but
overlapping sites. The peptidase M3 or neurolysin-like
family, includes M3, M2 and M32 metallopeptidases. The
M3 peptidases have two subfamilies: M3A, includes thimet
oligopeptidase (TOP; endopeptidase 3.4.24.15),
neurolysin (3.4.24.16), and the mitochondrial
intermediate peptidase; M3B contains oligopeptidase F.
M2 peptidase angiotensin converting enzyme (ACE, EC
3.4.15.1) catalyzes the conversion of decapeptide
angiotensin I to the potent vasopressor octapeptide
angiotensin II. ACE is a key part of the
renin-angiotensin system that regulates blood pressure,
thus ACE inhibitors are important for the treatment of
hypertension. M32 family includes two eukaryotic enzymes
from protozoa Trypanosoma cruzi, a causative agent of
Chagas' disease, and Leishmania major, a parasite that
causes leishmaniasis, making them attractive targets for
drug development. The M4 family includes secreted
protease thermolysin (EC 3.4.24.27), pseudolysin,
aureolysin, neutral protease as well as fungalysin and
bacillolysin (EC 3.4.24.28) that degrade extracellular
proteins and peptides for bacterial nutrition,
especially prior to sporulation. Thermolysin is widely
used as a nonspecific protease to obtain fragments for
peptide sequencing as well as in production of the
artificial sweetener aspartame. M13 family includes
neprilysin (EC 3.4.24.11) and endothelin-converting
enzyme I (ECE-1, EC 3.4.24.71), which fulfill a broad
range of physiological roles due to the greater
variation in the S2' subsite allowing substrate
specificity and are prime therapeutic targets for
selective inhibition. Peptidase M36 (fungamysin) family
includes endopeptidases from pathogenic fungi.
Fungalysin hydrolyzes extracellular matrix proteins such
as elastin and keratin. Aspergillus fumigatus causes the
pulmonary disease aspergillosis by invading the lungs of
immuno-compromised animals and secreting fungalysin that
possibly breaks down proteinaceous structural barriers.
Length = 125
Score = 35.5 bits (82), Expect = 0.026
Identities = 16/60 (26%), Positives = 20/60 (33%), Gaps = 9/60 (15%)
Query: 485 VNAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTFDDEGSQYDKFG 544
NA + I GIL FL V+GHE+TH D+ D
Sbjct: 42 NNAMCNGLDARIVMNDGILVA---------FLLDSDDFGVVGHELTHGVTDQLVGNDPDL 92
>gnl|CDD|189008 cd09601, M1_APN_2, Peptidase M1 Aminopeptidase N family incudes
tricorn interacting factor F3, Endoplasmic reticulum
aminopeptidase 1 (ERAP1), Aminopeptidase Q (APQ). This
M1 peptidase family includes eukaryotic and bacterial
members: aminopeptidase N (APN), aminopeptidase Q (APQ,
laeverin), endoplasmic reticulum aminopeptidase 1
(ERAP1) as well as tricorn interacting factor F3.
Aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease,
consists of a small N-terminal cytoplasmic domain, a
single transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and is present in a variety
of human tissues and cell types (leukocyte, fibroblast,
endothelial and epithelial cells). APN expression is
dysregulated in inflammatory diseases such as chronic
pain, rheumatoid arthritis, multiple sclerosis, systemic
sclerosis, systemic lupus erythematosus,
polymyositis/dermatomyosytis and pulmonary sarcoidosis,
and is enhanced in tumor cells such as melanoma, renal,
prostate, pancreas, colon, gastric and thyroid cancers.
It is considered a marker of differentiation since it is
predominantly expressed on stem cells and on cells of
the granulocytic and monocytic lineages at distinct
stages of differentiation. Thus, APN inhibition may lead
to the development of anti-cancer and anti-inflammatory
drugs. ERAP1 also known as endoplasmic reticulum
aminopeptidase associated with antigen processing
(ERAAP), adipocyte derived leucine aminopeptidase
(A-LAP) or aminopeptidase regulating tumor necrosis
factor receptor I (THFRI) shedding (ARTS-1), associates
with the closely related ER aminopeptidase ERAP2, for
the final trimming of peptides within the ER for
presentation by MHC class I molecules. ERAP1 is
associated with ankylosing spondylitis (AS), an
inflammatory arthritis that predominantly affects the
spine. ERAP1 also aids in the shedding of membrane-bound
cytokine receptors. The tricorn interacting factor F3,
together with factors F1 and F2, degrades the tricorn
protease products, producing free amino acids, thus
completing the proteasomal degradation pathway. F3 is
homologous to F2, but not F1, and shows a strong
preference for glutamate in the P1' position. APQ, also
known as laeverin, is specifically expressed in human
embryo-derived extravillous trophoblasts (EVTs) that
invade the uterus during early placentation. It cleaves
the N-terminal amino acid of various peptides such as
angiotensin III, endokinin C, and kisspeptin-10, all
expressed in the placenta in large quantities. APN is a
receptor for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs are also putative Cry toxin
receptors. Cry1 proteins are pore-forming toxins that
bind to the midgut epithelial cell membrane of
susceptible insect larvae, causing extensive damage.
Several different toxins, including Cry1Aa, Cry1Ab,
Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to
bind to APNs; however, a direct role of APN in
cytotoxicity has been yet to be firmly established.
Length = 446
Score = 35.2 bits (82), Expect = 0.12
Identities = 14/34 (41%), Positives = 19/34 (55%), Gaps = 11/34 (32%)
Query: 521 IGTVIGHEITHTFDDEGSQYDKFGNV--MNWWDD 552
+ TV+ HE+ H Q+ FGN+ M WWDD
Sbjct: 286 VATVVAHELAH-------QW--FGNLVTMKWWDD 310
>gnl|CDD|216501 pfam01433, Peptidase_M1, Peptidase family M1. Members of this
family are aminopeptidases. The members differ widely in
specificity, hydrolysing acidic, basic or neutral
N-terminal residues. This family includes leukotriene-A4
hydrolase, this enzyme also has an aminopeptidase
activity.
Length = 390
Score = 35.4 bits (82), Expect = 0.13
Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 22/79 (27%)
Query: 487 AFNHIDKNAI-EFPAG--------ILQEPYFAADR--PCFLNYGAIGTVIGHEITHTFDD 535
+D+ A+ +F AG +EP D + + VI HE+ H
Sbjct: 250 PLPKLDQVALPDFSAGAMENWGLITYREPALLYDPGNSTSSDKQRVAEVIAHELAH---- 305
Query: 536 EGSQYDKFGNV--MNWWDD 552
Q+ FGN+ M+WWDD
Sbjct: 306 ---QW--FGNLVTMDWWDD 319
>gnl|CDD|223382 COG0305, DnaB, Replicative DNA helicase [DNA replication,
recombination, and repair].
Length = 435
Score = 33.0 bits (76), Expect = 0.60
Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 1/53 (1%)
Query: 340 LTGVNSTRIRSQRCVDEVAEKFSLAVGSLYVRH-FFDRTSKENALELVNNLRQ 391
+G+ S+++R+ R D+ E+ A L F D T E+ + R+
Sbjct: 248 ESGIESSKLRTGRLSDDEWERLIKAASELSEAPIFIDDTPGLTITEIRSKARR 300
Score = 30.7 bits (70), Expect = 3.0
Identities = 12/53 (22%), Positives = 23/53 (43%), Gaps = 1/53 (1%)
Query: 694 KESVNSTRIRSQRCVDEVAEKFSLAVGSLYVRH-FFDRTSKENALELVNNLRQ 745
+ + S+++R+ R D+ E+ A L F D T E+ + R+
Sbjct: 248 ESGIESSKLRTGRLSDDEWERLIKAASELSEAPIFIDDTPGLTITEIRSKARR 300
>gnl|CDD|182802 PRK10877, PRK10877, protein disulfide isomerase II DsbC;
Provisional.
Length = 232
Score = 32.0 bits (73), Expect = 0.92
Identities = 14/52 (26%), Positives = 26/52 (50%), Gaps = 7/52 (13%)
Query: 619 GLENFSDKQMFWISFANVWCTKYRKEAMELLFKNDDHPPAEIRVNGPLSNSY 670
GL++ ++K M ++WC R +A + K D PA V+ +++ Y
Sbjct: 148 GLDSQAEKDM-----KSIWCAADRNKAFDDAMKGKDVSPASCDVD--IADHY 192
>gnl|CDD|215083 PLN00162, PLN00162, transport protein sec23; Provisional.
Length = 761
Score = 32.2 bits (74), Expect = 1.1
Identities = 15/32 (46%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
Query: 78 LDKLGVQPMKVIIEQLGGLPVVDGDRWDSSRF 109
LD++GV MKV +E+ GGL VV + + S F
Sbjct: 352 LDQVGVAEMKVAVERTGGL-VVLAESFGHSVF 382
>gnl|CDD|148461 pfam06857, ACP, Malonate decarboxylase delta subunit (MdcD).
This family consists of several bacterial malonate
decarboxylase delta subunit (MdcD) proteins. Malonate
decarboxylase of Klebsiella pneumoniae consists of four
different subunits and catalyzes the conversion of
malonate plus H+ to acetate and CO2. The catalysis
proceeds via acetyl and malonyl thioester residues with
the phosphribosyl-dephospho-CoA prosthetic group of the
acyl carrier protein (ACP) subunit. MdcC is the (apo)
ACP subunit. The family also contains the CitD family
of citrate lyase acyl carrier proteins.
Length = 87
Score = 29.1 bits (66), Expect = 2.3
Identities = 12/25 (48%), Positives = 14/25 (56%)
Query: 75 EDTLDKLGVQPMKVIIEQLGGLPVV 99
DTL +LGV +KV I G L V
Sbjct: 49 RDTLARLGVTNVKVSINDKGALDCV 73
>gnl|CDD|117515 pfam08948, DUF1859, Domain of unknown function (DUF1859). This
domain has no known function. It is predominantly found
in the N-terminus of bacteriophage spike proteins.
Length = 126
Score = 29.7 bits (66), Expect = 2.4
Identities = 9/65 (13%), Positives = 18/65 (27%)
Query: 438 YKDLRIVPGDLLTSVLNLYKKSVDYAYTQLHEPINKTSWKEFGSVATVNAFNHIDKNAIE 497
Y D G V ++ A Q + P+ +F + + + +
Sbjct: 59 YIDNAENDGAFEIDVEETGQRIKCPAGKQGYFPLLVPGRAKFVARHLGSGKKSVPLFFLN 118
Query: 498 FPAGI 502
F
Sbjct: 119 FTIAQ 123
>gnl|CDD|233524 TIGR01677, pln_FAD_oxido, plant-specific FAD-dependent
oxidoreductase. This model represents an
uncharacterized plant-specific family of FAD-dependent
oxidoreductases. At least seven distinct members are
found in Arabidopsis thaliana. The family shows
considerable sequence similarity to three different
enzymes of ascorbic acid biosynthesis:
L-galactono-1,4-lactone dehydrogenase (EC 1.3.2.3) from
higher plants, D-arabinono-1,4-lactone oxidase (EC
1.1.3.37 from Saccharomyces cerevisiae, and
L-gulonolactone oxidase (EC 1.1.3.8) from mouse, as well
as to a bacterial sorbitol oxidase. The class of
compound acted on by members of this family is unknown.
Length = 557
Score = 31.0 bits (70), Expect = 3.3
Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 293 YMSSIERLVQITPKRI----LANYMLMQTVLSSSAYLCKSARRPYHDFKY 338
++ ++RL + PK + L N +L++ V +S AYL K DF Y
Sbjct: 384 FVLDVKRLRDMEPKSLCGVELYNGILIRYVKASPAYLGKEEDAVDFDFTY 433
>gnl|CDD|201611 pfam01126, Heme_oxygenase, Heme oxygenase.
Length = 204
Score = 30.0 bits (68), Expect = 3.3
Identities = 19/86 (22%), Positives = 28/86 (32%), Gaps = 13/86 (15%)
Query: 644 EAMEL-LFKNDDHPPAEIRVNGPLSNSYDFARDFQCYSGSKMNPKHKCEVWKESVN---S 699
A+E L +N D P A L+ RD G+ W+ + +
Sbjct: 49 SALEEELERNRDSPVAAPIYFPELNRKAALERDLAYLYGAD---------WRADIQDSPA 99
Query: 700 TRIRSQRCVDEVAEKFSLAVGSLYVR 725
T+ R + E L V Y R
Sbjct: 100 TQEYVPRIREIGNESPELLVAHAYTR 125
>gnl|CDD|189002 cd09595, M1, Peptidase M1 family contains aminopeptidase N and
leukotriene A4 hydrolase. M1 Peptidase family includes
aminopeptidase N (APN) and leukotriene A4 hydrolase
(LTA4H). All peptidases in this family bind a single
catalytic zinc ion which is tetrahedrally co-ordinated
by three amino acid ligands and a water molecule that
forms the nucleophile on activation during catalysis.
APN consists of a small N-terminal cytoplasmic domain, a
single transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and is present in a variety
of human tissues and cell types. APN expression is
dysregulated in many inflammatory diseases and is
enhanced in numerous tumor cells, making it a lead
target in the development of anti-cancer and
anti-inflammatory drugs. LTA4H is a bifunctional enzyme,
possessing an aminopeptidase as well as an epoxide
hydrolase activity. The two activities occupy different,
but overlapping sites. The activity and physiological
relevance of the aminopeptidase in LTA4H is as yet
unknown while the epoxide hydrolase converts leukotriene
A4 (LTA4) into leukotriene B4 (LTB4), a potent
chemotaxin that is fundamental to the inflammatory
response of mammals.
Length = 407
Score = 30.1 bits (68), Expect = 4.9
Identities = 10/36 (27%), Positives = 14/36 (38%), Gaps = 11/36 (30%)
Query: 521 IGTVIGHEITHTFDDEGSQYDKFGN--VMNWWDDES 554
+ VI HE+ H + FGN W+D
Sbjct: 268 LENVIAHELAHQW---------FGNLVTGARWNDLW 294
>gnl|CDD|205445 pfam13265, DUF4056, Protein of unknown function (DUF4056). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 355 and 380 amino
acids in length.
Length = 271
Score = 29.5 bits (67), Expect = 5.9
Identities = 13/46 (28%), Positives = 18/46 (39%), Gaps = 6/46 (13%)
Query: 66 KKFYATCMD---EDTLDKLGVQPMKVIIEQLGGLPVVDGDRWDSSR 108
+ ++ MD L +LG Q +DG WDSSR
Sbjct: 202 LEQFSQAMDQALPQALKQLGAVSKSETKAQFD---QLDGIWWDSSR 244
>gnl|CDD|239588 cd03511, Rhizopine-oxygenase-like, This CD includes the putative
hydrocarbon oxygenase, MocD, a bacterial rhizopine
(3-O-methyl-scyllo-inosamine, 3-O-MSI) oxygenase, and
other related proteins. It has been proposed that MocD,
MocE (Rieske-like ferredoxin), and MocF (ferredoxin
reductase) under the regulation of MocR, act in concert
to form a ferredoxin oxygenase system that demethylates
3-O-MSI to form scyllo-inosamine. This domain family
appears to be structurally related to the membrane fatty
acid desaturases and the alkane hydroxylases. They all
share in common extensive hydrophobic regions that would
be capable of spanning the membrane bilayer at least
twice. Comparison of sequences also reveals the
existence of three regions of conserved histidine
cluster motifs that contain eight histidine residues:
HXXXH, HXXHH, and HXXHH. These histidine residues are
reported to be catalytically essential and proposed to
be the ligands for the iron atoms contained within
homologs, stearoyl CoA desaturase and alkane
hydroxylase.
Length = 285
Score = 29.6 bits (67), Expect = 5.9
Identities = 21/78 (26%), Positives = 31/78 (39%), Gaps = 15/78 (19%)
Query: 197 LFGAPVSRAFRELK------DSIDFEITMTKFIVPDQELRNLSYLYNLMSVHTLNKYYPS 250
L G P+ R F + D+ D T T L +LY M H + YPS
Sbjct: 202 LLGQPILRLFLLAEHGGCPEDANDLRNTRTTLT-----NPPLRFLYWNMPYHAEHHMYPS 256
Query: 251 IPCFYLEIDGPSLNKKLE 268
+P L P L++ ++
Sbjct: 257 VPFHAL----PKLHELIK 270
>gnl|CDD|234286 TIGR03626, L3_arch, archaeal ribosomal protein L3. This model
describes exclusively the archaeal class of ribosomal
protein L3. A separate model (TIGR03625) describes the
bacterial/organelle form, and both belong to Pfam family
pfam00297. Eukaryotic proteins are excluded from this
model [Protein synthesis, Ribosomal proteins: synthesis
and modification].
Length = 330
Score = 29.6 bits (67), Expect = 6.3
Identities = 13/45 (28%), Positives = 22/45 (48%), Gaps = 5/45 (11%)
Query: 241 VHTLNKYYPSIP-----CFYLEIDGPSLNKKLEFLNRIIGPKISI 280
VHT K +P + I G S+ ++ E+ I+G +IS+
Sbjct: 140 VHTQPKLVTGVPKKKPEIMEIRIGGGSVEERFEYALSILGKEISV 184
>gnl|CDD|188993 cd06258, M3_like, Peptidase M3-like family, a zincin
metallopeptidase, includes M3 and M32 families. The
peptidase M3-like family, also called neurolysin-like
family, is part of the "zincins" metallopeptidases, and
includes M3 and M32 families of metallopeptidases. The
M3 family is subdivided into two subfamilies: the
widespread M3A, which comprises a number of
high-molecular mass endo- and exopeptidases from
bacteria, archaea, protozoa, fungi, plants and animals,
and the small M3B, whose members are enzymes primarily
from bacteria. Well-known mammalian/eukaryotic M3A
endopeptidases are the thimet oligopeptidase (TOP;
endopeptidase 3.4.24.15), neurolysin (alias
endopeptidase 3.4.24.16), and the mitochondrial
intermediate peptidase. The first two are intracellular
oligopeptidases, which act only on relatively short
substrates of less than 20 amino acid residues, while
the latter cleaves N-terminal octapeptides from proteins
during their import into the mitochondria. The M3A
subfamily also contains several bacterial
endopeptidases, collectively called oligopeptidases A,
as well as a large number of bacterial
carboxypeptidases, called dipeptidyl peptidases (Dcp;
Dcp II; peptidyl dipeptidase; EC 3.4.15.5). The
peptidases in the M3 family contain the HEXXH motif that
forms the active site in conjunction with a
C-terminally-located Glutamic acid (Glu) residue. A
single zinc ion is ligated by the side-chains of the two
Histidine (His) residues, and the more C-terminal Glu.
Most of the peptidases are synthesized without signal
peptides or propeptides, and function intracellularly.
There are similarities to the thermostable
carboxypeptidases from Pyrococcus furiosus
carboxypeptidase (PfuCP), and Thermus aquaticus (TaqCP),
belonging to peptidase family M32. Little is known about
function of this family, including carboxypeptidases Taq
and Pfu.
Length = 400
Score = 29.7 bits (67), Expect = 6.4
Identities = 14/104 (13%), Positives = 32/104 (30%), Gaps = 18/104 (17%)
Query: 292 KYMSSIERLVQITPKRILANYMLMQT--VLSSSAYLCKSARRPYHDFKYVLTGVNSTRIR 349
K + + P + SS Y YH Y+L + + ++R
Sbjct: 301 KLARELLGVRPEGPSP----PPWAAKIHIASSPVY--------YHG--YLLAEMLAAQLR 346
Query: 350 SQRCVDEVAEKF-SLAVGSLYVRHFFDRTSKENALELVNNLRQE 392
++ + + G+ + + + EL+ + E
Sbjct: 347 -AHLKEKFGDLVDNPEAGAWLKEKLWRPGNSLDWEELLKSATGE 389
>gnl|CDD|239819 cd04327, ZnMc_MMP_like_3, Zinc-dependent metalloprotease; MMP_like
sub-family 3. A group of bacterial and fungal
metalloproteinase domains similar to matrix
metalloproteinases and astacin.
Length = 198
Score = 28.9 bits (65), Expect = 7.5
Identities = 12/43 (27%), Positives = 16/43 (37%), Gaps = 8/43 (18%)
Query: 631 ISFANVWCTKYRKEAMELLFKNDDHPPAEIRVN---GPLSNSY 670
+ A W Y L FK A+IR++ G SY
Sbjct: 26 RAAAREW-LPYAN----LKFKFVTDADADIRISFTPGDGYWSY 63
>gnl|CDD|189009 cd09602, M1_APN_3, Peptidase M1 family containing Aminopeptidase N.
This family contains bacterial and eukaryotic
aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease
belonging to the M1 gluzincin family. APN consists of a
small N-terminal cytoplasmic domain, a single
transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and, in higher eukaryotes,
is present in a variety of human tissues and cell types
(leukocyte, fibroblast, endothelial and epithelial
cells). APN expression is dysregulated in inflammatory
diseases such as chronic pain, rheumatoid arthritis,
multiple sclerosis, systemic sclerosis, systemic lupus
erythematosus, polymyositis/dermatomyosytis and
pulmonary sarcoidosis, and is enhanced in tumor cells
such as melanoma, renal, prostate, pancreas, colon,
gastric and thyroid cancers. It is predominantly
expressed on stem cells and on cells of the granulocytic
and monocytic lineages at distinct stages of
differentiation, thus considered a marker of
differentiation. Thus, APN inhibition may lead to the
development of anti-cancer and anti-inflammatory drugs.
APNs are also present in many pathogenic bacteria and
represent potential drug targets, Some APNs have been
used commercially, such as one from Lactococcus lactis
used in the food industry. APN also serves as a receptor
for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs have also been extensively studied
as putative Cry toxin receptors. Cry1 proteins are
pore-forming toxins that bind to the midgut epithelial
cell membrane of susceptible insect larvae, causing
extensive damage. Several different toxins, including
Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
been shown to bind to APNs; however, a direct role of
APN in cytotoxicity has been yet to be firmly
established.
Length = 438
Score = 29.5 bits (67), Expect = 7.6
Identities = 10/31 (32%), Positives = 15/31 (48%), Gaps = 11/31 (35%)
Query: 524 VIGHEITHTFDDEGSQYDKFGNV--MNWWDD 552
I HE+ H + FG++ M WW+D
Sbjct: 285 TIAHEMAHMW---------FGDLVTMKWWND 306
>gnl|CDD|224076 COG1154, Dxs, Deoxyxylulose-5-phosphate synthase [Coenzyme
metabolism / Lipid metabolism].
Length = 627
Score = 29.5 bits (67), Expect = 7.9
Identities = 10/52 (19%), Positives = 20/52 (38%), Gaps = 2/52 (3%)
Query: 410 NALEKSKVLK--VHIGYPDELLDDDKIDLHYKDLRIVPGDLLTSVLNLYKKS 459
L +L +++G PDE +D + +L + + +L K
Sbjct: 574 EFLAAHGILVPVLNLGLPDEFIDHGSPEELLAELGLDAEGIARRILEWLKAR 625
>gnl|CDD|238755 cd01478, Sec23-like, Sec23-like: Protein and membrane traffic in
eukaryotes is mediated by at least in part by the
budding and fusion of intracellular transport vesicles
that selectively carry cargo proteins and lipids from
donor to acceptor organelles. The two main classes of
vesicular carriers within the endocytic and the
biosynthetic pathways are COP- and clathrin-coated
vesicles. Formation of COPII vesicles requires the
ordered assembly of the coat built from several
cytosolic components GTPase Sar1, complexes of
Sec23-Sec24 and Sec13-Sec31. The process is initiated by
the conversion of GDP to GTP by the GTPase Sar1 which
then recruits the heterodimeric complex of Sec23 and
Sec24. This heterodimeric complex generates the
pre-budding complex. The final step leading to membrane
deformation and budding of COPII-coated vesicles is
carried by the heterodimeric complex Sec13-Sec31. The
members of this CD belong to the Sec23-like family. Sec
23 is very similar to Sec24. The Sec23 and Sec24
polypeptides fold into five distinct domains: a
beta-barrel, a zinc finger, a vWA or trunk, an all
helical region and a carboxy Gelsolin domain. The
members of this subgroup lack the consensus MIDAS motif
but have the overall Para-Rossmann type fold that is
characteristic of this superfamily.
Length = 267
Score = 29.3 bits (66), Expect = 8.6
Identities = 15/41 (36%), Positives = 23/41 (56%), Gaps = 6/41 (14%)
Query: 69 YATCMDEDTLDKLGVQPMKVIIEQLGGLPVVDGDRWDSSRF 109
+A C LD++G+ MKV++ GG VV D + +S F
Sbjct: 227 FAGC-----LDQVGLLEMKVLVNSTGGH-VVLSDSFTTSIF 261
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.136 0.408
Gapped
Lambda K H
0.267 0.0764 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 41,049,602
Number of extensions: 4061272
Number of successful extensions: 3282
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3257
Number of HSP's successfully gapped: 38
Length of query: 800
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 695
Effective length of database: 6,280,432
Effective search space: 4364900240
Effective search space used: 4364900240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.0 bits)