RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1463
         (947 letters)



>gnl|CDD|189008 cd09601, M1_APN_2, Peptidase M1 Aminopeptidase N family incudes
           tricorn interacting factor F3, Endoplasmic reticulum
           aminopeptidase 1 (ERAP1), Aminopeptidase Q (APQ).  This
           M1 peptidase family includes eukaryotic and bacterial
           members: aminopeptidase N (APN), aminopeptidase Q (APQ,
           laeverin), endoplasmic reticulum aminopeptidase 1
           (ERAP1) as well as tricorn interacting factor F3.
           Aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease,
           consists of a small N-terminal cytoplasmic domain, a
           single transmembrane domain and a large extracellular
           ectodomain that contains the active site. It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and is present in a variety
           of human tissues and cell types (leukocyte, fibroblast,
           endothelial and epithelial cells). APN expression is
           dysregulated in inflammatory diseases such as chronic
           pain, rheumatoid arthritis, multiple sclerosis, systemic
           sclerosis, systemic lupus erythematosus,
           polymyositis/dermatomyosytis and pulmonary sarcoidosis,
           and is enhanced in tumor cells such as melanoma, renal,
           prostate, pancreas, colon, gastric and thyroid cancers.
           It is considered a marker of differentiation since it is
           predominantly expressed on stem cells and on cells of
           the granulocytic and monocytic lineages at distinct
           stages of differentiation. Thus, APN inhibition may lead
           to the development of anti-cancer and anti-inflammatory
           drugs. ERAP1 also known as endoplasmic reticulum
           aminopeptidase associated with antigen processing
           (ERAAP), adipocyte derived leucine aminopeptidase
           (A-LAP) or aminopeptidase regulating tumor necrosis
           factor receptor I (THFRI) shedding (ARTS-1), associates
           with the closely related ER aminopeptidase ERAP2, for
           the final trimming of peptides within the ER for
           presentation by MHC class I molecules. ERAP1 is
           associated with ankylosing spondylitis (AS), an
           inflammatory arthritis that predominantly affects the
           spine. ERAP1 also aids in the shedding of membrane-bound
           cytokine receptors. The tricorn interacting factor F3,
           together with factors F1 and F2, degrades the tricorn
           protease products, producing free amino acids, thus
           completing the proteasomal degradation pathway. F3 is
           homologous to F2, but not F1, and shows a strong
           preference for glutamate in the P1' position. APQ, also
           known as laeverin, is specifically expressed in human
           embryo-derived extravillous trophoblasts (EVTs) that
           invade the uterus during early placentation. It cleaves
           the N-terminal amino acid of various peptides such as
           angiotensin III, endokinin C, and kisspeptin-10, all
           expressed in the placenta in large quantities. APN is a
           receptor for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs are also putative Cry toxin
           receptors. Cry1 proteins are pore-forming toxins that
           bind to the midgut epithelial cell membrane of
           susceptible insect larvae, causing extensive damage.
           Several different toxins, including Cry1Aa, Cry1Ab,
           Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to
           bind to APNs; however, a direct role of APN in
           cytotoxicity has been yet to be firmly established.
          Length = 446

 Score =  273 bits (700), Expect = 2e-82
 Identities = 105/227 (46%), Positives = 137/227 (60%), Gaps = 48/227 (21%)

Query: 420 ENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFGVD 479
           E ++LYD + S+  +K+R+AT+VAHELAHQWFGNLVT+ WW+DLWLNEGFA+Y+EY GVD
Sbjct: 268 ETALLYDPKTSSASNKQRVATVVAHELAHQWFGNLVTMKWWDDLWLNEGFATYMEYLGVD 327

Query: 480 SVEHTWKIKDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLLRM 539
            +E  W + D FV+D+LQ+   LD+L SSHP+ V V  PDEI+EIFD ISYSKG+S+LRM
Sbjct: 328 HLEPEWNMWDQFVLDDLQSALALDSLASSHPISVPVETPDEISEIFDAISYSKGASVLRM 387

Query: 540 AEHFLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLTNAGHEMRTLPEKMDVLKLGLQKYI 599
            EHFL  EV + GL+ Y+KK A G++T  +LW  L+ A                      
Sbjct: 388 LEHFLGEEVFRKGLRNYLKKHAYGNATTDDLWEALSEA---------------------- 425

Query: 600 KKKAMGSSTQAELWAFLTNAGHEMRTLPENMDVETIMNTWTLQTGFP 646
                                          DV+ IM+TWTLQ G+P
Sbjct: 426 --------------------------SKLGKDVKEIMDTWTLQPGYP 446



 Score =  190 bits (484), Expect = 2e-52
 Identities = 69/171 (40%), Positives = 96/171 (56%), Gaps = 13/171 (7%)

Query: 243 YKIKILPFLIENNFTFLGEVWIHVEVSQTTNNITLHMNDLTILERSIKQVDNRSANWESD 302
           Y + + P L  +NFTF G V I +EV++ TN I LH  DLTI   ++           + 
Sbjct: 3   YDLTLTPDL--DNFTFSGSVTITLEVTEPTNEIVLHAKDLTITSATVTVGGGNVDILVT- 59

Query: 303 EGTSLTIGQVRNDTINQFMVFELEDEQFWATKRYVLYIKYVGKLNDQMRGLYRSSYEVNN 362
                    +     N+F+V  L++    A   Y L I++ G+LND +RG YRSSY  N 
Sbjct: 60  ---------ITYSEENEFLVITLDEPLTLAGGNYTLTIEFTGELNDDLRGFYRSSYTDNG 110

Query: 363 -TKRWIMASQFQATDARRAFPCFDEPSLKAKFAISIGRLPNMTAISNMPLK 412
              R++ A+QF+ TDARRAFPCFDEP+ KA F I+I      TA+SNMP++
Sbjct: 111 GETRYLAATQFEPTDARRAFPCFDEPAFKATFTITITHPAGYTALSNMPVE 161



 Score =  153 bits (390), Expect = 5e-40
 Identities = 52/118 (44%), Positives = 73/118 (61%), Gaps = 4/118 (3%)

Query: 816 ISN-P--SKIERTGPYVWDHFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSREEY 872
           +SN P  S+      +    F+ T  MSTYLVA  V DF +      +    RV++R   
Sbjct: 155 LSNMPVESEEVLGDGWKTTEFETTPPMSTYLVAFVVGDFDYVEGTTKNGVPVRVYARPGK 214

Query: 873 INQTAYSLDIGPRLLKYFEKYFDYHYPLEKTDMIALPDFGAGAMENFGLITFR-TAIL 929
           I Q  Y+L++ P++L++FE YF   YPL K D++A+PDF AGAMEN+GLIT+R TA+L
Sbjct: 215 IEQGDYALEVAPKILEFFEDYFGIPYPLPKLDLVAIPDFAAGAMENWGLITYRETALL 272


>gnl|CDD|216501 pfam01433, Peptidase_M1, Peptidase family M1.  Members of this
           family are aminopeptidases. The members differ widely in
           specificity, hydrolysing acidic, basic or neutral
           N-terminal residues. This family includes leukotriene-A4
           hydrolase, this enzyme also has an aminopeptidase
           activity.
          Length = 390

 Score =  185 bits (471), Expect = 3e-51
 Identities = 65/115 (56%), Positives = 88/115 (76%)

Query: 419 PENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFGV 478
            E ++LYD   ST+  K+R+A ++AHELAHQWFGNLVT+ WW+DLWLNEGFA+Y+EY   
Sbjct: 276 REPALLYDPGNSTSSDKQRVAEVIAHELAHQWFGNLVTMDWWDDLWLNEGFATYMEYLIT 335

Query: 479 DSVEHTWKIKDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKG 533
           D++E  W+I++ FV+ E+Q+   LD+L SSHP+ V V+ P EI +IFD ISY KG
Sbjct: 336 DALEPEWRIEEQFVLREVQSALALDSLDSSHPITVNVNTPSEIDDIFDAISYEKG 390



 Score =  156 bits (397), Expect = 2e-41
 Identities = 76/193 (39%), Positives = 98/193 (50%), Gaps = 34/193 (17%)

Query: 233 RLPTGVKPLAYKIKILPFLIENNFTFLGEVWIHVEVSQTTNNITLHMNDLTILERSIKQV 292
           RLPT V P+ Y + + P       TF G V I ++    TN I LH  DL I        
Sbjct: 1   RLPTTVVPIHYDLTLTPDF--EKPTFSGSVTITLQAKAATNEIVLHSKDLEI-------- 50

Query: 293 DNRSANWESDEGTSLTIGQVRNDTINQFMVFELEDEQFW----------ATKRYVLYIKY 342
                       TS+T+        N   VF+L+DE  +          A + Y L I+Y
Sbjct: 51  ------------TSVTLRGEPVSVNNLISVFQLDDEDEFLVINLASTLQAGQPYTLEIEY 98

Query: 343 VGKLNDQMRGLYRSSY--EVNNTKRWIMASQFQATDARRAFPCFDEPSLKAKFAISIGRL 400
            GKLND MRG YRS Y  + N  K+ +  +QF+ TDARRAFPCFDEPS+KA F I+I   
Sbjct: 99  EGKLNDDMRGFYRSQYLDQTNGEKKPMATTQFEPTDARRAFPCFDEPSVKATFDITINHP 158

Query: 401 PNMTAISNMPLKD 413
            + TA+SNMP  +
Sbjct: 159 ADYTALSNMPEIE 171



 Score =  130 bits (329), Expect = 2e-32
 Identities = 51/117 (43%), Positives = 68/117 (58%), Gaps = 3/117 (2%)

Query: 816 ISNPSKIERTGP---YVWDHFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSREEY 872
           +SN  +IE        V   F+ T  MSTYL+A AV D  +           RV++R   
Sbjct: 164 LSNMPEIESEPLDDGRVITEFETTPKMSTYLLAFAVGDLEYLETKTKSGVPVRVYARPGA 223

Query: 873 INQTAYSLDIGPRLLKYFEKYFDYHYPLEKTDMIALPDFGAGAMENFGLITFRTAIL 929
           IN   Y+L++  +LL++FE YF + YPL K D +ALPDF AGAMEN+GLIT+R   L
Sbjct: 224 INAGQYALEVTQKLLEFFEDYFGFPYPLPKLDQVALPDFSAGAMENWGLITYREPAL 280


>gnl|CDD|223385 COG0308, PepN, Aminopeptidase N [Amino acid transport and
           metabolism].
          Length = 859

 Score =  168 bits (427), Expect = 7e-43
 Identities = 79/239 (33%), Positives = 110/239 (46%), Gaps = 51/239 (21%)

Query: 419 PENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFGV 478
            E  +L D + +T+   E +  ++AHELAHQWFGNLVT+ WW+DLWLNEGFA++ E    
Sbjct: 288 REKYLLADPETATDSDYENVEEVIAHELAHQWFGNLVTMKWWDDLWLNEGFATFREVLWS 347

Query: 479 DSV-EHTWKIKDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLL 537
           + +    WK  + F           D+L SSHP+ V+V  P EI + FD I Y KG+S+L
Sbjct: 348 EDLGGRAWKRWEDFRTLRTSIALAEDSLPSSHPIRVDVYDPKEINDFFDAIVYEKGASVL 407

Query: 538 RMAEHFLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLTNAGHEMRTLPEKMDVLKLGLQK 597
           RM E  L  E  + GL  Y K+ A G++T  +LW  L                       
Sbjct: 408 RMLETLLGEEAFRKGLSLYFKRHAGGNATTMDLWKAL----------------------- 444

Query: 598 YIKKKAMGSSTQAELWAFLTNAGHEMRTLPENMDVETIMNTWTLQTGFPVIRVARDYDA 656
              + A G                         D+     +W  Q G+PV+ V+  YD 
Sbjct: 445 ---EDASGK------------------------DLSAFFESWLSQAGYPVLTVSVRYDD 476



 Score = 89.1 bits (221), Expect = 6e-18
 Identities = 40/117 (34%), Positives = 60/117 (51%), Gaps = 7/117 (5%)

Query: 816 ISNPSKI---ERTGPYVWDHFQETVFMSTYLVAMAVTDFS-HRVHNDDHSGSF--RVWSR 869
           ISN + I             F++T  M TYL A+   D    R   D  S      ++  
Sbjct: 174 ISNGNLIDGGTLVDGRKIVKFEDTPPMPTYLFALVAGDLEVFRDKFDTRSRDVPLEIYVP 233

Query: 870 EEYINQTAYSLDIGPRLLKYFEKYFDYHYPLEKTDMIALPDFGAGAMENFGLITFRT 926
              +++  Y+LD   R ++++E+YF   Y L   D++A+PDF AGAMEN+GL+TFR 
Sbjct: 234 PGVLDRAKYALDETKRSIEFYEEYFGLPYALPI-DIVAVPDFSAGAMENWGLVTFRE 289



 Score = 82.5 bits (204), Expect = 5e-16
 Identities = 50/184 (27%), Positives = 73/184 (39%), Gaps = 24/184 (13%)

Query: 238 VKPLAYKIKILPFLIENNFTFLGEVWIHVEVSQTTNN--ITLHMNDLTILERSIKQVDNR 295
           +  +   + + P       TF G V I ++    +    + L    L I  RS+K     
Sbjct: 24  IYDIDLDLDLDP----EKTTFEGSVTIRLDAGWRSGADPLVLDAVGLEI--RSVKIDGKA 77

Query: 296 SANWESDEGTSLTIGQVRNDTINQFMVFELEDEQFWATKRYVLYIKYVGKL-NDQMRGLY 354
              W   +G +LTI       I      E  +  F       +  ++ G + ND + GLY
Sbjct: 78  LTAWYRLDGDALTI--TVAPPI-----PERSERPF----TLAITYEFTGPVSNDTLEGLY 126

Query: 355 RSSYEVNNTKRWIMASQFQATDARRAFPCFDEPSLKAKFAISIGRLPNMTAISNMPLKDG 414
           RS Y         + +Q +A  ARR FPC DEP +KA F ++I        ISN  L DG
Sbjct: 127 RSGYGGKP----YLITQCEAEGARRIFPCIDEPDVKATFTLTIRADKGPKLISNGNLIDG 182

Query: 415 NQSD 418
               
Sbjct: 183 GTLV 186


>gnl|CDD|189009 cd09602, M1_APN_3, Peptidase M1 family containing Aminopeptidase N.
            This family contains bacterial and eukaryotic
           aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease
           belonging to the M1 gluzincin family. APN consists of a
           small N-terminal cytoplasmic domain, a single
           transmembrane domain and a large extracellular
           ectodomain that contains the active site.  It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and, in higher eukaryotes,
           is present in a variety of human tissues and cell types
           (leukocyte, fibroblast, endothelial and epithelial
           cells). APN expression is dysregulated in inflammatory
           diseases such as chronic pain, rheumatoid arthritis,
           multiple sclerosis, systemic sclerosis, systemic lupus
           erythematosus, polymyositis/dermatomyosytis and
           pulmonary sarcoidosis, and is enhanced in tumor cells
           such as melanoma, renal, prostate, pancreas, colon,
           gastric and thyroid cancers. It is predominantly
           expressed on stem cells and on cells of the granulocytic
           and monocytic lineages at distinct stages of
           differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 438

 Score =  136 bits (346), Expect = 2e-34
 Identities = 58/162 (35%), Positives = 87/162 (53%), Gaps = 8/162 (4%)

Query: 420 ENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFGVD 479
           EN +  +E  +      R A  +AHE+AH WFG+LVT+ WWNDLWLNE FA Y+ Y  + 
Sbjct: 266 ENYVFREEVTTAQR--LRRANTIAHEMAHMWFGDLVTMKWWNDLWLNESFAEYMAYKALS 323

Query: 480 SV---EHTWKIKDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSL 536
                   W     F +D  Q  +  D   ++HP+  +V + ++    FD I+Y+KG+S+
Sbjct: 324 EATPFTDAWL---TFFLDRKQWAYRADQTPTTHPIAGDVENTEDALNNFDGITYAKGASV 380

Query: 537 LRMAEHFLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLTNAG 578
           L+     L  E  + GL++Y KK A G++T A+    L  A 
Sbjct: 381 LKQLVALLGEEKFREGLREYFKKHAFGNATLADFLGALDEAS 422



 Score = 81.9 bits (203), Expect = 3e-16
 Identities = 43/131 (32%), Positives = 58/131 (44%), Gaps = 27/131 (20%)

Query: 813 QNW--ISN----PSKIERTGPYVWDHFQETVFMSTYLVAMAVTDFSHRVHNDDHSG-SFR 865
           ++W  ISN         R G   W  F ET  +STYL A     + H + + +H G    
Sbjct: 144 KDWTVISNTAATEQSTIRNGYVRW-EFPETPPLSTYLFAFVAGPY-HVISDKEHDGIPLG 201

Query: 866 VWSRE-----------EYINQTAYSLDIGPRLLKYFEKYFDYHYPLEKTDMIALPDFGAG 914
           ++ RE           E    T   LD       +FE+YF   YP  K D + +P+F  G
Sbjct: 202 LYCRESLAQALDRDADEIFEITKQGLD-------FFEEYFGIPYPFGKYDQVFVPEFNFG 254

Query: 915 AMENFGLITFR 925
           AMEN G +TFR
Sbjct: 255 AMENPGCVTFR 265



 Score = 54.9 bits (133), Expect = 1e-07
 Identities = 35/183 (19%), Positives = 73/183 (39%), Gaps = 24/183 (13%)

Query: 238 VKPLAYKIKILPFLIENNFTFLGEVWIHVEVSQTTNNITLHMNDLTILERSIKQVDNRSA 297
           +  ++Y++ +   L +   TF G   I  ++ +   ++ +      +L  S   ++    
Sbjct: 5   ISVVSYELDLD--LTKGKETFRGTTTITFDLRKNGGDLFIDFKGGQVLSVS---LNGNPV 59

Query: 298 NWESDEGTSLTIGQVRNDTINQFMVFELEDEQFWATKRYVLYIKYVGKLNDQMRGLYRSS 357
           + ++ E   + +                         R  + I +    ++   GL+R  
Sbjct: 60  DPDTFEDGRIPL-----------------PGLELKNGRNEVVIDFTNPYSNDGEGLHRFV 102

Query: 358 YEVNNTKRWIMASQFQATDARRAFPCFDEPSLKAKFAISIGRLPNMTAISNMPLKDGNQS 417
              +  K ++  +QF+  DARR FPCFD+P LKA F +++    + T ISN    + +  
Sbjct: 103 DPADG-KTYLY-TQFEPDDARRVFPCFDQPDLKAPFTLTVTAPKDWTVISNTAATEQSTI 160

Query: 418 DPE 420
              
Sbjct: 161 RNG 163


>gnl|CDD|233857 TIGR02412, pepN_strep_liv, aminopeptidase N, Streptomyces lividans
           type.  This family is a subset of the members of the
           zinc metallopeptidase family M1 (pfam01433), with a
           single member characterized in Streptomyces lividans 66
           and designated aminopeptidase N. The spectrum of
           activity may differ somewhat from the aminopeptidase N
           clade of E. coli and most other Proteobacteria, well
           separated phylogenetically within the M1 family. The M1
           family also includes leukotriene A-4
           hydrolase/aminopeptidase (with a bifunctional active
           site).
          Length = 831

 Score =  126 bits (318), Expect = 1e-29
 Identities = 52/143 (36%), Positives = 80/143 (55%), Gaps = 6/143 (4%)

Query: 431 TNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFG-VDSVEHT--WKI 487
           T   KE  A ++ HE+AH WFG+LVT+ WWNDLWLNE FA Y+      ++ E+T  W  
Sbjct: 280 TRAEKENRAGVILHEMAHMWFGDLVTMRWWNDLWLNESFAEYMGTLASAEATEYTDAWT- 338

Query: 488 KDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLLRMAEHFLTTE 547
              F     Q  +  D L ++HP+  +V+   +    FD I+Y+KG+S+L+    ++  E
Sbjct: 339 --TFAAQGKQWAYEADQLPTTHPIVADVADLADALSNFDGITYAKGASVLKQLVAWVGEE 396

Query: 548 VLKLGLQKYIKKKAMGSSTQAEL 570
               G+  Y K+ A G++T  +L
Sbjct: 397 AFFAGVNAYFKRHAFGNATLDDL 419



 Score = 84.8 bits (210), Expect = 1e-16
 Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 6/106 (5%)

Query: 823 ERTGPYVWDHFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSRE---EYINQTAYS 879
                  W+ F ET  +STYL A+A   + H V ++  S    +++R    +Y++  A  
Sbjct: 168 PEPADRRWE-FPETPKLSTYLTAVAAGPY-HSVQDESRSYPLGIYARRSLAQYLDADAI- 224

Query: 880 LDIGPRLLKYFEKYFDYHYPLEKTDMIALPDFGAGAMENFGLITFR 925
             I  + L +F + F Y YP +K D I +P+F AGAMEN G +TF 
Sbjct: 225 FTITRQGLAFFHRKFGYPYPFKKYDQIFVPEFNAGAMENAGCVTFA 270



 Score = 45.6 bits (108), Expect = 1e-04
 Identities = 19/39 (48%), Positives = 24/39 (61%)

Query: 370 SQFQATDARRAFPCFDEPSLKAKFAISIGRLPNMTAISN 408
           +QF+  DARR F  FD+P LKA F  S+    + T ISN
Sbjct: 122 TQFEPADARRVFAVFDQPDLKANFKFSVKAPEDWTVISN 160


>gnl|CDD|189002 cd09595, M1, Peptidase M1 family contains aminopeptidase N and
           leukotriene A4 hydrolase.  M1 Peptidase family includes
           aminopeptidase N (APN) and leukotriene A4 hydrolase
           (LTA4H).  All peptidases in this family bind a single
           catalytic zinc ion which is tetrahedrally co-ordinated
           by three amino acid ligands and a water molecule that
           forms the nucleophile on activation during catalysis.
           APN consists of a small N-terminal cytoplasmic domain, a
           single transmembrane domain and a large extracellular
           ectodomain that contains the active site. It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and is present in a variety
           of human tissues and cell types. APN expression is
           dysregulated in many inflammatory diseases and is
           enhanced in numerous tumor cells, making it a lead
           target in the development of anti-cancer and
           anti-inflammatory drugs. LTA4H is a bifunctional enzyme,
           possessing an aminopeptidase as well as an epoxide
           hydrolase activity. The two activities occupy different,
           but overlapping sites. The activity and physiological
           relevance of the aminopeptidase in LTA4H is as yet
           unknown while the epoxide hydrolase converts leukotriene
           A4 (LTA4) into leukotriene B4 (LTB4), a potent
           chemotaxin that is fundamental to the inflammatory
           response of mammals.
          Length = 407

 Score =  117 bits (294), Expect = 6e-28
 Identities = 60/161 (37%), Positives = 79/161 (49%), Gaps = 19/161 (11%)

Query: 424 LYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIE------YFG 477
           L    I        +  ++AHELAHQWFGNLVT A WNDLWLNEGFA+Y+E       FG
Sbjct: 256 LLLAMIDAGDE--LLENVIAHELAHQWFGNLVTGARWNDLWLNEGFATYLEGLWMEATFG 313

Query: 478 VDSVEHTWKIKDIFVVDE---LQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGS 534
             + E         +       +     D L  + P  V   HPD+ +  FD + Y KG+
Sbjct: 314 TSAREA------YRLEGTRGLRRWRKLQDELPPAPP--VMGKHPDDPSVEFDGVFYEKGA 365

Query: 535 SLLRMAEHFLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLT 575
            +LR  E  L  E    GL+KY++K A  S+T  +L   L 
Sbjct: 366 LVLRYLEKRLGDEAFFKGLRKYVEKHAGQSATTDDLLQALE 406



 Score = 60.1 bits (146), Expect = 2e-09
 Identities = 31/117 (26%), Positives = 49/117 (41%), Gaps = 8/117 (6%)

Query: 816 ISNPSKIERTGPYVWDHFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSREEYINQ 875
            SN +K +      W  F   + ++ YL+A+ V D  + V      G             
Sbjct: 146 ASNGNKDDGEVRRKWVEFSPPIPIAPYLIALVVGDLEYPVREQTVQGIP--LHLYFLTPL 203

Query: 876 TAYSLDIGPRLLKYFEKYFDYH---YPLEKTDMIALPDFGAGAMENFGLITFRTAIL 929
              ++    RL   F  Y       YP  + D++ +P+F +GAMEN GLI F  ++L
Sbjct: 204 AVDTVL---RLQSAFLFYETDLGGPYPYSEYDVVEVPEFPSGAMENPGLIFFSQSLL 257



 Score = 39.3 bits (92), Expect = 0.008
 Identities = 17/50 (34%), Positives = 25/50 (50%), Gaps = 3/50 (6%)

Query: 367 IMASQFQATDARRAFPCFDEPSLKAKFAISIGRLPN--MTAISNMPLKDG 414
            + SQ +A  AR  FPC D P   A +  ++  +P+  + A SN    DG
Sbjct: 106 FLFSQGEAIGARSWFPCQDRPDSVATYYTTV-TVPDKTLVAASNGNKDDG 154


>gnl|CDD|189010 cd09603, M1_APN_4, Peptidase M1 family Aminopeptidase N.  This
           family contains mostly bacterial and some archaeal
           aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease
           belonging to the M1 gluzincin family. APN consists of a
           small N-terminal cytoplasmic domain, a single
           transmembrane domain and a large extracellular
           ectodomain that contains the active site. It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and, in higher eukaryotes,
           is present in a variety of human tissues and cell types
           (leukocyte, fibroblast, endothelial and epithelial
           cells). APN expression is dysregulated in inflammatory
           diseases such as chronic pain, rheumatoid arthritis,
           multiple sclerosis, systemic sclerosis, systemic lupus
           erythematosus, polymyositis/dermatomyosytis and
           pulmonary sarcoidosis, and is enhanced in tumor cells
           such as melanoma, renal, prostate, pancreas, colon,
           gastric and thyroid cancers. It is predominantly
           expressed on stem cells and on cells of the granulocytic
           and monocytic lineages at distinct stages of
           differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 415

 Score = 87.3 bits (217), Expect = 5e-18
 Identities = 42/135 (31%), Positives = 59/135 (43%), Gaps = 16/135 (11%)

Query: 440 TIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIE--YFGVDSVEH--TWKIKDIFVVDE 495
            ++AHELAHQWFG+ VT A W+D+WLNEGFA+Y E  +      EH          +   
Sbjct: 267 RLIAHELAHQWFGDSVTCADWSDIWLNEGFATYAEWLWS-----EHSGGRDAAQYALYAY 321

Query: 496 LQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLLRMAEHFLTTEVLKLGLQK 555
            +       L    PV           E+FD   Y KG+ +L      L  E     L+ 
Sbjct: 322 ARLYANEADLGPRPPVPGP-------AELFDDHVYEKGALVLHALRRLLGDEAFFRLLRT 374

Query: 556 YIKKKAMGSSTQAEL 570
           Y+ +   G+ T  + 
Sbjct: 375 YLAEHRGGNVTTEDF 389



 Score = 60.7 bits (148), Expect = 2e-09
 Identities = 23/93 (24%), Positives = 36/93 (38%), Gaps = 1/93 (1%)

Query: 832 HFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSREEYINQTAYSLDIGPRLLKYFE 891
           H++    M+TYLV +AV  +               +           +    P +L +FE
Sbjct: 163 HWRMDEPMATYLVTLAVGRYEVLEDTTAGGVPVVYYVPPGLAADARRAFARTPEMLDFFE 222

Query: 892 KYFDYHYPLEKTDMIALPDFGAGAMENFGLITF 924
           + F   YP EK   + + DF  G MEN     +
Sbjct: 223 ELF-GPYPFEKYGQVVVDDFLGGGMENQTRTVY 254



 Score = 45.7 bits (109), Expect = 8e-05
 Identities = 35/150 (23%), Positives = 54/150 (36%), Gaps = 23/150 (15%)

Query: 260 GEVWIHVEVSQTTNNITLHMNDLTILERSIKQVDNRSANWESDEGTSLTIGQVRNDTINQ 319
           G   I    +Q  +++ L +  LT+   S+  VD R A +  D G  LTI          
Sbjct: 20  GTATITARATQDLSSLNLDLAGLTV--SSV-TVDGRPAAFRHD-GDELTIT--------- 66

Query: 320 FMVFELEDEQFWATKRYVLYIKYVGKLNDQMRGLYRSSYEVNNTKRWIMASQFQATDARR 379
                         + + + ++Y G         +            +  +  Q   A  
Sbjct: 67  ------PPRPLPKGQTFTVTVRYSGV-PRPGLYPWGGDGGWEEGPDGV-WTAGQPEGAST 118

Query: 380 AFPCFDEPSLKAKFAISIGRLPN-MTAISN 408
            FPC D PS KA F IS+  +P   T +SN
Sbjct: 119 WFPCNDHPSDKATFDISV-TVPAGYTVVSN 147


>gnl|CDD|221257 pfam11838, DUF3358, Domain of unknown function (DUF3358).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes and bacteria. This domain is found
           to the C-terminus of an aminopeptidase domain.
          Length = 323

 Score = 84.3 bits (209), Expect = 2e-17
 Identities = 47/168 (27%), Positives = 74/168 (44%), Gaps = 15/168 (8%)

Query: 666 GYYRVLYDEKNWYLIIATLRNSTTYNTIHLLNRAQLIDDAMNLARAGLLDYKIALD-VTA 724
           GYYRV YD ++     ATL  +     +  L+RA L  DA  L R G L  +  LD V A
Sbjct: 10  GYYRVRYDPESL----ATLGEALARLKLDPLDRAGLWADAWALVRDGELPTRDFLDLVLA 65

Query: 725 YLQYETELVPWRSAMQALGYIEGQLY-----RRAYFDKYKKYLLHIIRPMYESIGFDGSP 779
           +L  ET+ V     +  LG ++  LY     R+      KK++  +  P+ E +G++  P
Sbjct: 66  FLPNETDYVVLSEILAQLGTLKSALYFEPEERKEAL---KKFVRKLAAPLLEKLGWEFGP 122

Query: 780 KDDQMTVYKRVDVLNRACILGLKDCVQKALSKYQNWISNPSKIERTGP 827
             D      R  +L+ A      + V  AL +++  +     +    P
Sbjct: 123 GSDHQLQLLRALLLSLAGGDE--EVVAAALDRFRALLDGDKSLAAIDP 168


>gnl|CDD|189006 cd09599, M1_LTA4H, Peptidase M1 family contains leukotriene A4
           hydrolase.  This family includes leukotriene A4
           hydrolase (LTA4H; E.C. 3.3.2.6) and the close homolog
           cold-active aminopeptidase (Colwellia
           psychrerythraea-type peptidase; ColAP), both members of
           the aminopeptidase M1 family. LTA4H, is a bifunctional
           enzyme possessing an aminopeptidase as well as an
           epoxide hydrolase activity.  The two activities occupy
           different, but overlapping sites. The activity and
           physiological relevance of the aminopeptidase is as yet
           unknown while the epoxide hydrolase converts leukotriene
           A4 (LTA4) into leukotriene B4 (LTB4), a potent
           chemotaxin that is fundamental to the inflammatory
           response of mammals. It accepts a variety of substrates,
           including some opioid, di- and tripeptides, as well as
           chromogenic aminoacyl-p-nitroanilide derivatives. The
           aminopeptidase activity of LTA4H is possibly involved in
           the processing of peptides related to inflammation and
           host defense. Kinetic analysis shows that LTA4H
           hydrolyzes arginyl tripeptides with high efficiency and
           specificity, indicating its function as an arginyl
           aminopeptidase. LTA4H is overexpressed in certain human
           cancers, and has been identified as a functionally
           important target for mediating anticancer properties of
           resveratrol, a well known red wine polyphenolic compound
           with cancer chemopreventive activity.
          Length = 442

 Score = 83.3 bits (207), Expect = 1e-16
 Identities = 44/145 (30%), Positives = 57/145 (39%), Gaps = 20/145 (13%)

Query: 441 IVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIE------YFGVDSVEHTWKIKDIFVVD 494
           +VAHE+AH W GNLVT A W   WLNEGF  Y+E       +G    E   + + I    
Sbjct: 285 VVAHEIAHSWSGNLVTNATWEHFWLNEGFTVYLERRILERLYG----EDYRQFEAIIGWK 340

Query: 495 ELQNVFFLDALKSSHP-----VHVEVSHPDEITEIFDKISYSKGSSLLRMAEHFLTTEVL 549
           ELQ    +               ++   PD   + F  + Y KG   L   E     EV 
Sbjct: 341 ELQES--IKTFGEDPEYTKLVPDLKGVDPD---DAFSSVPYEKGFQFLYYLEQLGGREVF 395

Query: 550 KLGLQKYIKKKAMGSSTQAELWAFL 574
              L+ Y  K    S    +   FL
Sbjct: 396 DPFLRSYFDKFKFKSIDTEQFKDFL 420



 Score = 36.0 bits (84), Expect = 0.083
 Identities = 30/96 (31%), Positives = 44/96 (45%), Gaps = 7/96 (7%)

Query: 832 HFQETVFMSTYLVAMAVTDFSHRVHNDDHSGSFRVWSREEYINQTAYSLDIGPRLLKYFE 891
           HF++ V + +YL+A+AV D   R      S   RVW+    ++  A       + LK  E
Sbjct: 183 HFEQPVPIPSYLIAIAVGDLESR-PIGPRS---RVWAEPSLLDAAAEEFADTEKFLKAAE 238

Query: 892 KYFDYHYPLEKTDMIALPD-FGAGAMENFGLITFRT 926
                 Y   + D++ LP  F  G MEN   +TF T
Sbjct: 239 DLI-GPYVWGRYDLLVLPPSFPYGGMEN-PCLTFAT 272



 Score = 34.4 bits (80), Expect = 0.26
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 368 MASQFQATDARRAFPCFDEPSLKAKFAISI 397
           + SQ QA  AR  FPC D PS+K+ ++ ++
Sbjct: 127 LFSQCQAIHARSLFPCQDTPSVKSTYSATV 156


>gnl|CDD|222169 pfam13485, Peptidase_MA_2, Peptidase MA superfamily. 
          Length = 128

 Score = 61.4 bits (149), Expect = 3e-11
 Identities = 28/126 (22%), Positives = 45/126 (35%), Gaps = 15/126 (11%)

Query: 431 TNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEYFGVDSVEHTWKIKDI 490
                + +  ++AHELAH   GNLV        WL EG A Y+               D 
Sbjct: 18  APGDPDELRGVLAHELAHVVLGNLVGGNGNLPRWLTEGLAEYVAG-----------RIDP 66

Query: 491 FVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLLRMAEHFLTTEVLK 550
               EL++      L     +  + S   E   +     Y++G++ +R        E L+
Sbjct: 67  ERAAELRDAVRSGRLPPLDELSADFSANSEDGSLA----YAQGAAFVRYLAERYGEEKLR 122

Query: 551 LGLQKY 556
             L+  
Sbjct: 123 ALLRAL 128


>gnl|CDD|189007 cd09600, M1_APN_1, Peptidase M1 family containing Aminopeptidase N.
            This family contains aminopeptidase N (APN; CD13;
           Alanyl aminopeptidase; EC 3.4.11.2), a Type II integral
           membrane protease belonging to the M1 gluzincin family.
           It includes bacterial-type alanyl aminopeptidases as
           well as PfA-M1 aminopeptidase (Plasmodium
           falciparum-type). APN consists of a small N-terminal
           cytoplasmic domain, a single transmembrane domain and a
           large extracellular ectodomain that contains the active
           site. It preferentially cleaves neutral amino acids from
           the N-terminus of oligopeptides and, in higher
           eukaryotes, is present in a variety of human tissues and
           cell types (leukocyte, fibroblast, endothelial and
           epithelial cells). APN expression is dysregulated in
           inflammatory diseases such as chronic pain, rheumatoid
           arthritis, multiple sclerosis, systemic sclerosis,
           systemic lupus erythematosus,
           polymyositis/dermatomyosytis and pulmonary sarcoidosis,
           and is enhanced in tumor cells such as melanoma, renal,
           prostate, pancreas, colon, gastric and thyroid cancers.
           It is predominantly expressed on stem cells and on cells
           of the granulocytic and monocytic lineages at distinct
           stages of differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 861

 Score = 63.8 bits (156), Expect = 4e-10
 Identities = 48/163 (29%), Positives = 72/163 (44%), Gaps = 16/163 (9%)

Query: 402 NMTAISNMPLKDGNQSDPENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWN 461
           NM A+ N  L   N       +L D + +T+   ERI +++ HE  H W GN VT   W 
Sbjct: 251 NMGAMENKGLNIFNSK----LVLADPETATDADYERIESVIGHEYFHNWTGNRVTCRDWF 306

Query: 462 DLWLNEGFASYIEY-FGVDSVEHTWK-IKDIFVVDELQNVFFLDALKSSHPVHVEVSHPD 519
            L L EG   + +  F  D      K I+D+  +   Q  F  DA   +HP+      PD
Sbjct: 307 QLSLKEGLTVFRDQEFSADMGSRAVKRIEDVRFLRAHQ--FPEDAGPMAHPI-----RPD 359

Query: 520 EITEI---FDKISYSKGSSLLRMAEHFLTTEVLKLGLQKYIKK 559
              E+   +    Y KG+ ++RM    L  E  + G+  Y ++
Sbjct: 360 SYIEMNNFYTATVYEKGAEVIRMYHTLLGEEGFRKGMDLYFQR 402



 Score = 42.6 bits (101), Expect = 0.001
 Identities = 24/95 (25%), Positives = 46/95 (48%), Gaps = 10/95 (10%)

Query: 842 YLVAMAVTDFSHRVHNDDH----SG---SFRVWSREEYINQTAYSLDIGPRLLKYFEKYF 894
           YL A+   D       +D     SG   +  ++      ++ A++++   + +K+ E  F
Sbjct: 177 YLFALVAGDLGVL---EDKFTTKSGRKVALEIYVEPGDESKCAHAMESLKKSMKWDEDRF 233

Query: 895 DYHYPLEKTDMIALPDFGAGAMENFGLITFRTAIL 929
              Y L+  +++A+ DF  GAMEN GL  F + ++
Sbjct: 234 GLEYDLDLFNIVAVDDFNMGAMENKGLNIFNSKLV 268


>gnl|CDD|233856 TIGR02411, leuko_A4_hydro, leukotriene A-4
           hydrolase/aminopeptidase.  Members of this family
           represent a distinctive subset within the zinc
           metallopeptidase family M1 (pfam01433). The majority of
           the members of pfam01433 are aminopeptidases, but the
           sequences in this family for which the function is known
           are leukotriene A-4 hydrolase. A dual epoxide hydrolase
           and aminopeptidase activity at the same active site is
           indicated. The physiological substrate for
           aminopeptidase activity is not known.
          Length = 602

 Score = 62.1 bits (151), Expect = 1e-09
 Identities = 42/134 (31%), Positives = 57/134 (42%), Gaps = 13/134 (9%)

Query: 439 ATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIE--YFGVDSVEHTWKIKDIFVVDEL 496
             ++AHELAH W GNLVT   W   WLNEG+  Y+E    G    E T     +    +L
Sbjct: 280 VDVIAHELAHSWSGNLVTNCSWEHFWLNEGWTVYLERRIIGRLYGEKTRHFSALIGWGDL 339

Query: 497 QNVFFLDALKSSHP-----VHVEVSHPDEITEIFDKISYSKGSSLLRMAEHFL-TTEVLK 550
           Q    +  L  +       V ++ + PD   + F  + Y KG + L   E  L       
Sbjct: 340 QES--VKTLGETPEFTKLVVDLKDNDPD---DAFSSVPYEKGFNFLFYLEQLLGGPAEFD 394

Query: 551 LGLQKYIKKKAMGS 564
             L+ Y KK A  S
Sbjct: 395 PFLRHYFKKFAYKS 408



 Score = 38.6 bits (90), Expect = 0.014
 Identities = 19/62 (30%), Positives = 31/62 (50%), Gaps = 3/62 (4%)

Query: 368 MASQFQATDARRAFPCFDEPSLKAKFAISIGRLPNMTAISNMPLKDGNQSDPENSMLYDE 427
           + SQ QA  AR  FPC D PS+K+ +   +   P    +S +   DG  S+     L+ +
Sbjct: 126 LFSQCQAIHARSLFPCQDTPSVKSTYTAEV-ESPLPVLMSGIR--DGETSNDPGKYLFKQ 182

Query: 428 QI 429
           ++
Sbjct: 183 KV 184


>gnl|CDD|233858 TIGR02414, pepN_proteo, aminopeptidase N, Escherichia coli type.
           The M1 family of zinc metallopeptidases contains a
           number of distinct, well-separated clades of proteins
           with aminopeptidase activity. Several are designated
           aminopeptidase N, EC 3.4.11.2, after the Escherichia
           coli enzyme, suggesting a similar activity profile (see
           SP|P04825 for a description of catalytic activity). This
           family consists of all aminopeptidases closely related
           to E. coli PepN and presumed to have similar (not
           identical) function. Nearly all are found in
           Proteobacteria, but members are found also in
           Cyanobacteria, plants, and apicomplexan parasites. This
           family differs greatly in sequence from the family of
           aminopeptidases typified by Streptomyces lividans PepN
           (TIGR02412), from the membrane bound aminopeptidase N
           family in animals, etc [Protein fate, Degradation of
           proteins, peptides, and glycopeptides].
          Length = 863

 Score = 62.0 bits (151), Expect = 1e-09
 Identities = 50/163 (30%), Positives = 72/163 (44%), Gaps = 16/163 (9%)

Query: 402 NMTAISNMPLKDGNQSDPENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWN 461
           NM A+ N  L   N       +L D + +T+   ERI +++AHE  H W GN VT   W 
Sbjct: 251 NMGAMENKGLNIFNSK----YVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWF 306

Query: 462 DLWLNEGFASYIEY-FGVDSVEHTWKIKDIFVVDELQNVFFL-DALKSSHPVHVEVSHPD 519
            L L EG   + +  F  D    +  +K I  V  L+   F  DA   +HPV      P+
Sbjct: 307 QLSLKEGLTVFRDQEFSADM--TSRAVKRIEDVRLLRAHQFPEDAGPMAHPV-----RPE 359

Query: 520 EITEI---FDKISYSKGSSLLRMAEHFLTTEVLKLGLQKYIKK 559
              EI   +    Y KG+ ++RM    L  E  + G+  Y  +
Sbjct: 360 SYVEINNFYTATVYEKGAEVIRMLHTLLGEEGFRKGMDLYFSR 402



 Score = 42.7 bits (101), Expect = 0.001
 Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 10/95 (10%)

Query: 842 YLVAMAVTDFSHRVHNDDH----SG---SFRVWSREEYINQTAYSLDIGPRLLKYFEKYF 894
           YL A+   D       +D     SG   + RV+  E   ++  ++++   + +K+ E+ F
Sbjct: 177 YLFALVAGDLDVL---EDTFTTKSGREVALRVYVEEGNKDKCDHAMESLKKAMKWDEEVF 233

Query: 895 DYHYPLEKTDMIALPDFGAGAMENFGLITFRTAIL 929
              Y L+   ++A+ DF  GAMEN GL  F +  +
Sbjct: 234 GLEYDLDIFMIVAVDDFNMGAMENKGLNIFNSKYV 268


>gnl|CDD|189011 cd09604, M1_APN_5, Peptidase M1 family containing bacterial
           Aminopeptidase N.  This family contains bacterial
           aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease
           belonging to the M1 gluzincin family. APN consists of a
           small N-terminal cytoplasmic domain, a single
           transmembrane domain and a large extracellular
           ectodomain that contains the active site.  It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and, in higher eukaryotes,
           is present in a variety of human tissues and cell types
           (leukocyte, fibroblast, endothelial and epithelial
           cells). APN expression is dysregulated in inflammatory
           diseases such as chronic pain, rheumatoid arthritis,
           multiple sclerosis, systemic sclerosis, systemic lupus
           erythematosus, polymyositis/dermatomyosytis and
           pulmonary sarcoidosis, and is enhanced in tumor cells
           such as melanoma, renal, prostate, pancreas, colon,
           gastric and thyroid cancers. It is predominantly
           expressed on stem cells and on cells of the granulocytic
           and monocytic lineages at distinct stages of
           differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 435

 Score = 58.1 bits (141), Expect = 1e-08
 Identities = 31/158 (19%), Positives = 58/158 (36%), Gaps = 29/158 (18%)

Query: 435 KERIATIVAHELAHQWFGNLVTLAWWNDL----WLNEGFASYIEYF--------GVDSVE 482
              + T++AHE+AHQW+  +V     ND     WL+EG  +Y+  +            + 
Sbjct: 285 PFILETVLAHEIAHQWWYGIVG----NDERNEPWLDEGLTTYLTDYYLEERYGKEAARLY 340

Query: 483 HTWKIKDIFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKISYSKGSSLLRMAEH 542
              ++ D   +   +N   L     ++  +               I+Y KG+ +L M   
Sbjct: 341 RLRRLTDYAALVNSRNDPPLAFFFRNNGAYSA-------------IAYGKGAMVLHMLRK 387

Query: 543 FLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLTNAGHE 580
            +  E     L+ Y ++ A   +T  +       A   
Sbjct: 388 EIGDEAFDKALRTYYREYAFKHATPEDFRRAFEKASGR 425


>gnl|CDD|189018 cd09839, M1_TAF2, TATA binding protein (TBP) associated factor 2.
           This family includes TATA binding protein (TBP)
           associated factor 2 (TAF2, TBP-associated factor
           TAFII150, transcription initiation factor TFIID subunit
           2, RNA polymerase II TBP-associated factor subunit B),
           and has homology to the aminopeptidase N (APN)
           subfamily, belonging to the M1 gluzincin family. TAF2 is
           part of the TFIID multidomain subunit complex essential
           for transcription of most protein-encoded genes by RNA
           polymerase II. TAF2 is known to interact with the
           initiator element (Inr) found at the transcription start
           site of many genes, thus possibly playing a key role in
           promoter binding as well as start-site selection. Image
           analysis has shown TAF2 to form a complex with TAF1 and
           TBP, inferring its role in promoter recognition.
           Peptidases in the M1 family bind a single catalytic zinc
           ion which is tetrahedrally co-ordinated by three amino
           acid ligands and a water molecule that forms the
           nucleophile on activation during catalysis. TAF2,
           however, does not seem to contain any of the active site
           residues.
          Length = 507

 Score = 53.1 bits (128), Expect = 5e-07
 Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 18/165 (10%)

Query: 421 NSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYI------E 474
           +++LY E I    +  R     A  LA QWFG  ++   W+D WL +G A YI      +
Sbjct: 299 SNLLYPEDIIDQTYDTRRKLAYA--LASQWFGCFISPEAWSDEWLLKGIAGYITGLFVKK 356

Query: 475 YFGVDSVEHTWKIK----DIFVVDELQNVFFLDA--LKSSHPVHVEVSHPDEITEIFDKI 528
            FG +  E+ ++IK     +   D  +    LD   L    P++ +  HP   + ++ + 
Sbjct: 357 LFGNN--EYRFRIKKELDRVCEYDIKRPGILLDPIILPGKLPLNPDPKHPHTCSPLYGEF 414

Query: 529 SYSKGSSLLRMAEHFLTTEVLKLGLQKYIKK--KAMGSSTQAELW 571
              K   +LRM E  +  E     L K + +  +A    TQ+  W
Sbjct: 415 MALKAHLVLRMLERRIGKESFLQVLNKILSRALQASQQKTQSGDW 459


>gnl|CDD|189001 cd09594, GluZincin, Peptidase Gluzincin family (thermolysin-like
           proteinases, TLPs) includes peptidases M1, M2, M3, M4,
           M13, M32 and M36 (fungalysins).  Gluzincin family
           (thermolysin-like peptidases or TLPs) includes several
           zinc-dependent metallopeptidases such as the M1, M2, M3,
           M4, M13, M32, M36 peptidases (MEROPS classification),
           and contain HEXXH and EXXXD motifs as part of their
           active site. All peptidases in this family bind a single
           catalytic zinc ion which is tetrahedrally co-ordinated
           by three amino acid ligands and a water molecule that
           forms the nucleophile on activation during catalysis. M1
           family includes aminopeptidase N (APN) and leukotriene
           A4 hydrolase (LTA4H).  APN preferentially cleaves
           neutral amino acids from the N-terminus of oligopeptides
           and is present in a variety of human tissues and cell
           types. LTA4H is a bifunctional enzyme, possessing an
           aminopeptidase as well as an epoxide hydrolase activity
           such that the two activities occupy different, but
           overlapping sites. The peptidase M3 or neurolysin-like
           family, includes M3, M2 and M32 metallopeptidases.  The
           M3 peptidases have two subfamilies: M3A, includes thimet
           oligopeptidase (TOP; endopeptidase 3.4.24.15),
           neurolysin (3.4.24.16), and the mitochondrial
           intermediate peptidase; M3B contains oligopeptidase F.
           M2 peptidase angiotensin converting enzyme (ACE, EC
           3.4.15.1) catalyzes the conversion of decapeptide
           angiotensin I to the potent vasopressor octapeptide
           angiotensin II. ACE is a key part of the
           renin-angiotensin system that regulates blood pressure,
           thus ACE inhibitors are important for the treatment of
           hypertension. M32 family includes two eukaryotic enzymes
           from protozoa Trypanosoma cruzi, a causative agent of
           Chagas' disease, and Leishmania major, a parasite that
           causes leishmaniasis, making them attractive targets for
           drug development. The M4 family includes secreted
           protease  thermolysin (EC 3.4.24.27), pseudolysin,
           aureolysin, neutral protease as well as fungalysin and
           bacillolysin (EC 3.4.24.28) that degrade extracellular
           proteins and peptides for bacterial nutrition,
           especially prior to sporulation. Thermolysin is widely
           used as a nonspecific protease to obtain fragments for
           peptide sequencing as well as in production of the
           artificial sweetener aspartame. M13 family includes
           neprilysin (EC 3.4.24.11) and endothelin-converting
           enzyme I (ECE-1, EC 3.4.24.71), which fulfill a broad
           range of physiological roles due to the greater
           variation in the S2' subsite allowing substrate
           specificity and are prime therapeutic targets for
           selective inhibition. Peptidase M36 (fungamysin) family
           includes endopeptidases from pathogenic fungi.
           Fungalysin hydrolyzes extracellular matrix proteins such
           as elastin and keratin. Aspergillus fumigatus causes the
           pulmonary disease aspergillosis by invading the lungs of
           immuno-compromised animals and secreting fungalysin that
           possibly breaks down proteinaceous structural barriers.
          Length = 125

 Score = 42.4 bits (100), Expect = 1e-04
 Identities = 14/69 (20%), Positives = 22/69 (31%), Gaps = 6/69 (8%)

Query: 414 GNQSDPENSMLY-DEQISTNYHKERIATIVAHELAHQWFGNLVT-----LAWWNDLWLNE 467
              +  +  ++  D  +           +V HEL H     LV      L       LNE
Sbjct: 44  AMCNGLDARIVMNDGILVAFLLDSDDFGVVGHELTHGVTDQLVGNDPDLLYTNGSGGLNE 103

Query: 468 GFASYIEYF 476
           G + + E  
Sbjct: 104 GPSDFFELL 112


>gnl|CDD|226774 COG4324, COG4324, Predicted aminopeptidase [General function
           prediction only].
          Length = 376

 Score = 36.5 bits (84), Expect = 0.060
 Identities = 20/63 (31%), Positives = 28/63 (44%), Gaps = 14/63 (22%)

Query: 417 SDP-ENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEY 475
            DP  ++ML             +A+++ HELAHQ           ND   NE FA+ +E 
Sbjct: 182 DDPLLSTML-------RQDDTYLASLIFHELAHQKI------YVNNDTAFNESFATAVET 228

Query: 476 FGV 478
            GV
Sbjct: 229 SGV 231


>gnl|CDD|204379 pfam10023, DUF2265, Predicted aminopeptidase (DUF2265).  Members of
           this family of bacterial proteins comprise various
           hypothetical proteins and putative aminopeptidases.
           Their exact function, has not, as yet, been defined.
          Length = 338

 Score = 35.3 bits (82), Expect = 0.13
 Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 14/63 (22%)

Query: 417 SDP-ENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGFASYIEY 475
           +DP  ++ +        Y    +A ++ HELAHQ       +    D   NE FA+ +E 
Sbjct: 151 NDPLLSTFIR-------YPDGELARLIFHELAHQ------VVYIKGDTAFNESFATAVER 197

Query: 476 FGV 478
            GV
Sbjct: 198 EGV 200


>gnl|CDD|237585 PRK14015, pepN, aminopeptidase N; Provisional.
          Length = 875

 Score = 33.6 bits (78), Expect = 0.55
 Identities = 43/136 (31%), Positives = 57/136 (41%), Gaps = 28/136 (20%)

Query: 436 ERIATIVAHELAHQWFGNLVTLAWWNDLWLNEGF-----ASYIEYFGVDSVEHTWK-IKD 489
           ERI +++AHE  H W GN VT   W  L L EG            F  D      K I+D
Sbjct: 294 ERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQE----FSADLGSRAVKRIED 349

Query: 490 IFVVDELQNVFFLDALKSSHPVHVEVSHPDEITEIFDKIS-------YSKGSSLLRMAEH 542
           + V+   Q  F  DA   +HPV      PD   E    I+       Y KG+ ++RM   
Sbjct: 350 VRVLRAAQ--FAEDAGPMAHPV-----RPDSYIE----INNFYTATVYEKGAEVIRMLHT 398

Query: 543 FLTTEVLKLGLQKYIK 558
            L  E  + G+  Y +
Sbjct: 399 LLGEEGFRKGMDLYFE 414


>gnl|CDD|198344 cd10424, GST_C_9, C-terminal, alpha helical domain of an unknown
           subfamily 9 of Glutathione S-transferases.  Glutathione
           S-transferase (GST) C-terminal domain family, unknown
           subfamily 9; composed of uncharacterized proteins with
           similarity to GSTs. GSTs are cytosolic dimeric proteins
           involved in cellular detoxification by catalyzing the
           conjugation of glutathione (GSH) with a wide range of
           endogenous and xenobiotic alkylating agents, including
           carcinogens, therapeutic drugs, environmental toxins,
           and products of oxidative stress. GSTs also show GSH
           peroxidase activity and are involved in the synthesis of
           prostaglandins and leukotrienes. The GST fold contains
           an N-terminal thioredoxin-fold domain and a C-terminal
           alpha helical domain, with an active site located in a
           cleft between the two domains. GSH binds to the
           N-terminal domain while the hydrophobic substrate
           occupies a pocket in the C-terminal domain.
          Length = 103

 Score = 30.8 bits (70), Expect = 0.91
 Identities = 13/54 (24%), Positives = 20/54 (37%), Gaps = 2/54 (3%)

Query: 167 PTRNLIAELFKV--EEKTTKAKFNGRLPTDVKPLAYRIKILPFLIENNFTIAEL 218
           P R L  E F         K +    L   +  LA   +  P++    FT+A+ 
Sbjct: 15  PARRLYPEAFFGGKVSPEIKEEVRKDLLRGIAALARLARFAPYVAGETFTLADC 68


>gnl|CDD|227631 COG5321, COG5321, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 164

 Score = 31.0 bits (70), Expect = 1.4
 Identities = 25/86 (29%), Positives = 37/86 (43%), Gaps = 6/86 (6%)

Query: 400 LPNMTAISNMPLKDGNQSDPENSMLYDEQISTNYHKERIATIVAHELAHQWFGNLVTLAW 459
           +P ++ I   PL DG QS  E +ML    +         A +    LA     +L+ L+ 
Sbjct: 1   MPIISPIQTNPLADGRQS--ERAMLVRRGVQRLLMAMGHAVLPELTLASGRRADLIALSP 58

Query: 460 WNDLWLNEGFASYIEYFGVDSVEHTW 485
             ++W+ E   S IE F VD     W
Sbjct: 59  KGEIWIIE-IKSSIEDFRVD---RKW 80


>gnl|CDD|192017 pfam08372, PRT_C, Plant phosphoribosyltransferase C-terminal.  This
           domain is found at the C-terminus of
           phosphoribosyltransferases and
           phosphoribosyltransferase-like proteins. It contains
           putative transmembrane regions. It often appears
           together with calcium-ion dependent C2 domains
           (pfam00168).
          Length = 156

 Score = 30.7 bits (70), Expect = 2.0
 Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 7/38 (18%)

Query: 509 HPVHV-------EVSHPDEITEIFDKISYSKGSSLLRM 539
           HP H+       + +HPDE+ E FD    S+   ++RM
Sbjct: 22  HPPHMDTRLSHADAAHPDELDEEFDTFPTSRPPDVVRM 59


>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
          Length = 697

 Score = 31.6 bits (73), Expect = 2.2
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 643 TGFPVIRVARDYDAGSAVVKQVRGYYRVLY 672
            G  V RV+ D D G  +V Q+    + LY
Sbjct: 263 AGIIVTRVSSDGDVGEQIVGQLFANPKALY 292


>gnl|CDD|238134 cd00217, INT_Flp, Flp recombinase, C-terminal catalytic domain.
           Yeast Flp-like recombinases mediate the amplification of
           the 2 micron circular plasmid copy number by catalyzing
           the intra-molecular recombination between two inverted
           repeats during replication.  They belong to the DNA
           breaking-rejoining enzyme superfamily, which also
           includes prokaryotic tyrosine recombinases and type IB
           topoisomerases. These enzymes share the same fold in
           their catalytic domain containing six conserved active
           site residues and the overall reaction mechanism.
           Flp-like recombinases are almost exclusively found in
           yeast and are highly diverged in sequence from the
           prokaryotic tyrosine recombinases.  They cleave their
           target DNA in trans with a composite active site in
           which the catalytic tyrosine is provided by a promoter
           bound to a site other than the one being cleaved.  Thus
           each active site within Flp complexes is assembled by
           domain swapping and contains catalytic residues from two
           different monomers. Two DNA segments are synapsed by the
           tetrameric enzyme, carrying the nucleophilic tyrosine in
           each active site with only two of the four monomers
           active at a given time.  The catalytic domain is linked
           through a flexible loop to the N-terminal domain, which
           is largely responsible for non-specific DNA binding and
           isomerization. Its overall fold is similar to the SAM
           domain fold also found in the N-terminal domains of
           lambda integrase and XerD recombinase.
          Length = 339

 Score = 30.5 bits (69), Expect = 4.2
 Identities = 21/126 (16%), Positives = 42/126 (33%), Gaps = 19/126 (15%)

Query: 521 ITEIFDKISYSKGSSLLRMAEHFLTTEVLKLGLQKYIKKKAMGSSTQAELWAFLTNAGHE 580
            + I   +S+ K   ++    H     +L   L K               + F       
Sbjct: 55  KSCISKTLSFDKDDKIVSFKYHLKKPTILGKTLSKAFSP-----------YKF------V 97

Query: 581 MRTLPEKMDVLKLGLQKYIKKKAMGSSTQAELWAFLTNAGHEMRTLPENMDVETIMNTWT 640
           +++   K   +   L+  ++K+       + +    T       ++     VE I+N+  
Sbjct: 98  IKSRSLKEVDIFSKLELKLQKREEKDKGGSLIDQEFTKLLRTDESI--WKIVEKILNSIE 155

Query: 641 LQTGFP 646
           LQT FP
Sbjct: 156 LQTRFP 161


>gnl|CDD|238511 cd01060, Membrane-FADS-like, The membrane fatty acid desaturase
           (Membrane_FADS)-like CD includes membrane FADSs, alkane
           hydroxylases, beta carotene ketolases (CrtW-like),
           hydroxylases (CrtR-like), and other related proteins.
           They are present in all groups of organisms with the
           exception of archaea. Membrane FADSs are non-heme,
           iron-containing, oxygen-dependent enzymes involved in
           regioselective introduction of double bonds in fatty
           acyl aliphatic chains. They play an important role in
           the maintenance of the proper structure and functioning
           of biological membranes. Alkane hydroxylases are
           bacterial, integral-membrane di-iron enzymes that share
           a requirement for iron and oxygen for activity similar
           to that of membrane FADSs, and are involved in the
           initial oxidation of inactivated alkanes. Beta-carotene
           ketolase and beta-carotene hydroxylase are carotenoid
           biosynthetic enzymes for astaxanthin and zeaxanthin,
           respectively. This superfamily domain has extensive
           hydrophobic regions that would be capable of spanning
           the membrane bilayer at least twice. Comparison of these
           sequences also reveals three regions of conserved
           histidine cluster motifs that contain eight histidine
           residues: HXXX(X)H, HXX(X)HH, and HXXHH (an additional
           conserved histidine residue is seen between clusters 2
           and 3). Spectroscopic and genetic evidence point to a
           nitrogen-rich coordination environment located in the
           cytoplasm with as many as eight histidines coordinating
           the two iron ions and a carboxylate residue bridging the
           two metals in the Pseudomonas oleovorans alkane
           hydroxylase (AlkB). In addition, the eight histidine
           residues are reported to be catalytically essential and
           proposed to be the ligands for the iron atoms contained
           within the rat stearoyl CoA delta-9 desaturase.
          Length = 122

 Score = 28.2 bits (63), Expect = 9.1
 Identities = 9/29 (31%), Positives = 12/29 (41%), Gaps = 3/29 (10%)

Query: 437 RIATIVAHELAHQWFGNLVTLAWWNDLWL 465
              T++AHEL H    +     W N L  
Sbjct: 13  LGLTVLAHELGH---RSFFRSRWLNRLLG 38


>gnl|CDD|219486 pfam07607, DUF1570, Protein of unknown function (DUF1570).  A
           family of hypothetical proteins in Rhodopirellula
           baltica. This family carries a highly conserved HExxH
           sequence motif characteristic of members of the
           Peptidase clan MA.
          Length = 128

 Score = 28.1 bits (63), Expect = 9.4
 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 3/39 (7%)

Query: 438 IATIVAHELAHQWFGN--LVTLAWWNDLWLNEGFASYIE 474
           IAT+  HE  HQ   N  L +    N +W++EG A+Y E
Sbjct: 2   IATVA-HEATHQLAFNCGLQSRLADNPMWVSEGLATYFE 39


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.402 

Gapped
Lambda     K      H
   0.267   0.0623    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 48,800,139
Number of extensions: 4845157
Number of successful extensions: 3866
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3829
Number of HSP's successfully gapped: 61
Length of query: 947
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 841
Effective length of database: 6,236,078
Effective search space: 5244541598
Effective search space used: 5244541598
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (28.7 bits)