RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5250
         (1102 letters)



>gnl|CDD|189008 cd09601, M1_APN_2, Peptidase M1 Aminopeptidase N family incudes
           tricorn interacting factor F3, Endoplasmic reticulum
           aminopeptidase 1 (ERAP1), Aminopeptidase Q (APQ).  This
           M1 peptidase family includes eukaryotic and bacterial
           members: aminopeptidase N (APN), aminopeptidase Q (APQ,
           laeverin), endoplasmic reticulum aminopeptidase 1
           (ERAP1) as well as tricorn interacting factor F3.
           Aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease,
           consists of a small N-terminal cytoplasmic domain, a
           single transmembrane domain and a large extracellular
           ectodomain that contains the active site. It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and is present in a variety
           of human tissues and cell types (leukocyte, fibroblast,
           endothelial and epithelial cells). APN expression is
           dysregulated in inflammatory diseases such as chronic
           pain, rheumatoid arthritis, multiple sclerosis, systemic
           sclerosis, systemic lupus erythematosus,
           polymyositis/dermatomyosytis and pulmonary sarcoidosis,
           and is enhanced in tumor cells such as melanoma, renal,
           prostate, pancreas, colon, gastric and thyroid cancers.
           It is considered a marker of differentiation since it is
           predominantly expressed on stem cells and on cells of
           the granulocytic and monocytic lineages at distinct
           stages of differentiation. Thus, APN inhibition may lead
           to the development of anti-cancer and anti-inflammatory
           drugs. ERAP1 also known as endoplasmic reticulum
           aminopeptidase associated with antigen processing
           (ERAAP), adipocyte derived leucine aminopeptidase
           (A-LAP) or aminopeptidase regulating tumor necrosis
           factor receptor I (THFRI) shedding (ARTS-1), associates
           with the closely related ER aminopeptidase ERAP2, for
           the final trimming of peptides within the ER for
           presentation by MHC class I molecules. ERAP1 is
           associated with ankylosing spondylitis (AS), an
           inflammatory arthritis that predominantly affects the
           spine. ERAP1 also aids in the shedding of membrane-bound
           cytokine receptors. The tricorn interacting factor F3,
           together with factors F1 and F2, degrades the tricorn
           protease products, producing free amino acids, thus
           completing the proteasomal degradation pathway. F3 is
           homologous to F2, but not F1, and shows a strong
           preference for glutamate in the P1' position. APQ, also
           known as laeverin, is specifically expressed in human
           embryo-derived extravillous trophoblasts (EVTs) that
           invade the uterus during early placentation. It cleaves
           the N-terminal amino acid of various peptides such as
           angiotensin III, endokinin C, and kisspeptin-10, all
           expressed in the placenta in large quantities. APN is a
           receptor for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs are also putative Cry toxin
           receptors. Cry1 proteins are pore-forming toxins that
           bind to the midgut epithelial cell membrane of
           susceptible insect larvae, causing extensive damage.
           Several different toxins, including Cry1Aa, Cry1Ab,
           Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to
           bind to APNs; however, a direct role of APN in
           cytotoxicity has been yet to be firmly established.
          Length = 446

 Score =  243 bits (622), Expect = 8e-71
 Identities = 89/142 (62%), Positives = 107/142 (75%)

Query: 150 DDEDRYMAVTQFELTDARRCFPCWDEPAVKAKFSISLSVPNNKVALSNMPVKSESPQPDG 209
             E RY+A TQFE TDARR FPC+DEPA KA F+I+++ P    ALSNMPV+SE    DG
Sbjct: 110 GGETRYLAATQFEPTDARRAFPCFDEPAFKATFTITITHPAGYTALSNMPVESEEVLGDG 169

Query: 210 HRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASKVL 269
            +  +FET+P MSTYLVA VVG+FDYVE T+ +GV VRVY   GK EQG +AL VA K+L
Sbjct: 170 WKTTEFETTPPMSTYLVAFVVGDFDYVEGTTKNGVPVRVYARPGKIEQGDYALEVAPKIL 229

Query: 270 PFYKDYFNIAYPLPKIDLVAIP 291
            F++DYF I YPLPK+DLVAIP
Sbjct: 230 EFFEDYFGIPYPLPKLDLVAIP 251



 Score =  205 bits (525), Expect = 1e-57
 Identities = 64/101 (63%), Positives = 80/101 (79%)

Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
           MENWGL+TYRE  LL D + +SA  +Q +A VV HELAHQWFGNLVTM+WW  LWLNEG+
Sbjct: 258 MENWGLITYRETALLYDPKTSSASNKQRVATVVAHELAHQWFGNLVTMKWWDDLWLNEGF 317

Query: 756 ASFVEFLCVHHLFPEYDIWTQFVTDNLVRALELDALKSSHP 796
           A+++E+L V HL PE+++W QFV D+L  AL LD+L SSHP
Sbjct: 318 ATYMEYLGVDHLEPEWNMWDQFVLDDLQSALALDSLASSHP 358



 Score =  158 bits (403), Expect = 1e-41
 Identities = 62/157 (39%), Positives = 84/157 (53%), Gaps = 5/157 (3%)

Query: 11  FKIVSETKELKLHVIDLDFKKVQLELADGKV-LTPETKISTEDETITLTFSETLPV--GE 67
            ++   T E+ LH  DL      + +  G V +      S E+E + +T  E L +  G 
Sbjct: 24  LEVTEPTNEIVLHAKDLTITSATVTVGGGNVDILVTITYSEENEFLVITLDEPLTLAGGN 83

Query: 68  VKLLFEYVGELNDKMKGFYRSKYRSVLHDEDRYMAVTQFCSTDARRCFPCWDEPAVKAKF 127
             L  E+ GELND ++GFYRS Y      E RY+A TQF  TDARR FPC+DEPA KA F
Sbjct: 84  YTLTIEFTGELNDDLRGFYRSSYT-DNGGETRYLAATQFEPTDARRAFPCFDEPAFKATF 142

Query: 128 AISLSVPSSKVALSNMVRIAILDDEDRYMAVTQFELT 164
            I+++ P+   ALSNM   +     D +   T+FE T
Sbjct: 143 TITITHPAGYTALSNMPVESEEVLGDGWK-TTEFETT 178



 Score =  152 bits (386), Expect = 2e-39
 Identities = 64/130 (49%), Positives = 79/130 (60%), Gaps = 24/130 (18%)

Query: 481 SMTHEDNYTVWITICNCLQKIDLLLSNTEYHHLFYQFGPVKSESPQPDGHRLLQFETSPI 540
           ++TH   YT               LSN           PV+SE    DG +  +FET+P 
Sbjct: 145 TITHPAGYTA--------------LSNM----------PVESEEVLGDGWKTTEFETTPP 180

Query: 541 MSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAY 600
           MSTYLVA VVG+FDYVE T+ +GV VRVY   GK EQG +AL VA K+L F++DYF I Y
Sbjct: 181 MSTYLVAFVVGDFDYVEGTTKNGVPVRVYARPGKIEQGDYALEVAPKILEFFEDYFGIPY 240

Query: 601 PLPKIDLVAL 610
           PLPK+DLVA+
Sbjct: 241 PLPKLDLVAI 250



 Score = 68.7 bits (169), Expect = 6e-12
 Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 15/58 (25%)

Query: 337 PVGHPSEVEEIFDDISYNKGASIIRMLQKYIGDGV---------------NSSSDSLW 379
           PV  P E+ EIFD ISY+KGAS++RML+ ++G+ V               N+++D LW
Sbjct: 362 PVETPDEISEIFDAISYSKGASVLRMLEHFLGEEVFRKGLRNYLKKHAYGNATTDDLW 419



 Score = 68.7 bits (169), Expect = 7e-12
 Identities = 19/33 (57%), Positives = 27/33 (81%)

Query: 291 PVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 323
           PV  P E+ EIFD ISY+KGAS++RML+ ++G+
Sbjct: 362 PVETPDEISEIFDAISYSKGASVLRMLEHFLGE 394


>gnl|CDD|216501 pfam01433, Peptidase_M1, Peptidase family M1.  Members of this
           family are aminopeptidases. The members differ widely in
           specificity, hydrolysing acidic, basic or neutral
           N-terminal residues. This family includes leukotriene-A4
           hydrolase, this enzyme also has an aminopeptidase
           activity.
          Length = 390

 Score =  216 bits (551), Expect = 6e-62
 Identities = 103/285 (36%), Positives = 138/285 (48%), Gaps = 58/285 (20%)

Query: 10  YFKIVSETKELKLHVIDLDFKKVQLELADGKVLTPETKISTEDET--ITLTFSETLPVG- 66
             +  + T E+ LH  DL+   V L      V    +    +DE   + +  + TL  G 
Sbjct: 31  TLQAKAATNEIVLHSKDLEITSVTLRGEPVSVNNLISVFQLDDEDEFLVINLASTLQAGQ 90

Query: 67  EVKLLFEYVGELNDKMKGFYRSKYRSVLHDEDRYMAVTQFCSTDARRCFPCWDEPAVKAK 126
              L  EY G+LND M+GF    YRS      +Y+  T                      
Sbjct: 91  PYTLEIEYEGKLNDDMRGF----YRS------QYLDQT---------------------- 118

Query: 127 FAISLSVPSSKVALSNMVRIAILDDEDRYMAVTQFELTDARRCFPCWDEPAVKAKFSISL 186
                                  + E + MA TQFE TDARR FPC+DEP+VKA F I++
Sbjct: 119 -----------------------NGEKKPMATTQFEPTDARRAFPCFDEPSVKATFDITI 155

Query: 187 SVPNNKVALSNMPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLV 246
           + P +  ALSNMP     P  DG  + +FET+P MSTYL+A  VG+ +Y+E  +  GV V
Sbjct: 156 NHPADYTALSNMPEIESEPLDDGRVITEFETTPKMSTYLLAFAVGDLEYLETKTKSGVPV 215

Query: 247 RVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAIP 291
           RVY   G    GQ+AL V  K+L F++DYF   YPLPK+D VA+P
Sbjct: 216 RVYARPGAINAGQYALEVTQKLLEFFEDYFGFPYPLPKLDQVALP 260



 Score =  185 bits (471), Expect = 4e-51
 Identities = 62/123 (50%), Positives = 83/123 (67%), Gaps = 3/123 (2%)

Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
           MENWGL+TYRE  LL D  N+++  +Q +A V+ HELAHQWFGNLVTM+WW  LWLNEG+
Sbjct: 267 MENWGLITYREPALLYDPGNSTSSDKQRVAEVIAHELAHQWFGNLVTMDWWDDLWLNEGF 326

Query: 756 ASFVEFLCVHHLFPEYDIWTQFVTDNLVRALELDALKSSHPTQFVTDNLVRALELDALKS 815
           A+++E+L    L PE+ I  QFV   +  AL LD+L SSHP   +T N+    E+D +  
Sbjct: 327 ATYMEYLITDALEPEWRIEEQFVLREVQSALALDSLDSSHP---ITVNVNTPSEIDDIFD 383

Query: 816 SHP 818
           +  
Sbjct: 384 AIS 386



 Score =  120 bits (303), Expect = 5e-29
 Identities = 48/109 (44%), Positives = 62/109 (56%), Gaps = 9/109 (8%)

Query: 518 GPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQ 577
            P     P  DG  + +FET+P MSTYL+A  VG+ +Y+E  +  GV VRVY   G    
Sbjct: 167 MPEIESEPLDDGRVITEFETTPKMSTYLLAFAVGDLEYLETKTKSGVPVRVYARPGAINA 226

Query: 578 GQFALHVASKVLPFYKDYFNIAYPLPKIDLVAL---------NPGTVGY 617
           GQ+AL V  K+L F++DYF   YPLPK+D VAL         N G + Y
Sbjct: 227 GQYALEVTQKLLEFFEDYFGFPYPLPKLDQVALPDFSAGAMENWGLITY 275



 Score = 36.5 bits (85), Expect = 0.076
 Identities = 15/29 (51%), Positives = 18/29 (62%), Gaps = 2/29 (6%)

Query: 330 LRPSFQI--PVGHPSEVEEIFDDISYNKG 356
           L  S  I   V  PSE+++IFD ISY KG
Sbjct: 362 LDSSHPITVNVNTPSEIDDIFDAISYEKG 390



 Score = 34.2 bits (79), Expect = 0.33
 Identities = 12/20 (60%), Positives = 15/20 (75%)

Query: 291 PVGHPSEVEEIFDDISYNKG 310
            V  PSE+++IFD ISY KG
Sbjct: 371 NVNTPSEIDDIFDAISYEKG 390


>gnl|CDD|223385 COG0308, PepN, Aminopeptidase N [Amino acid transport and
           metabolism].
          Length = 859

 Score =  143 bits (363), Expect = 8e-35
 Identities = 53/102 (51%), Positives = 64/102 (62%), Gaps = 1/102 (0%)

Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
           MENWGLVT+RE  LL D +  +    +N+  V+ HELAHQWFGNLVTM+WW  LWLNEG+
Sbjct: 279 MENWGLVTFREKYLLADPETATDSDYENVEEVIAHELAHQWFGNLVTMKWWDDLWLNEGF 338

Query: 756 ASFVEFLCVHHLFP-EYDIWTQFVTDNLVRALELDALKSSHP 796
           A+F E L    L    +  W  F T     AL  D+L SSHP
Sbjct: 339 ATFREVLWSEDLGGRAWKRWEDFRTLRTSIALAEDSLPSSHP 380



 Score =  139 bits (352), Expect = 1e-33
 Identities = 67/239 (28%), Positives = 101/239 (42%), Gaps = 29/239 (12%)

Query: 89  KYRSVLHDEDRYMAVTQFCSTDA-----RRCFPCWDEPAVKAKFAISLSVPSSKVALSNM 143
           + RSV  D     A  +    DA         P   E           + P S   L  +
Sbjct: 67  EIRSVKIDGKALTAWYRL-DGDALTITVAPPIPERSERPFTLAITYEFTGPVSNDTLEGL 125

Query: 144 VRIAILDDEDRYMAVTQFELTDARRCFPCWDEPAVKAKFSISLSVPNNKVALSNMPVKSE 203
            R        +   +TQ E   ARR FPC DEP VKA F++++        +SN  +   
Sbjct: 126 YRSG---YGGKPYLITQCEAEGARRIFPCIDEPDVKATFTLTIRADKGPKLISNGNLIDG 182

Query: 204 SPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEE---TSSDGVLVRVYTPVGKREQGQF 260
               DG ++++FE +P M TYL A+V G+ +   +   T S  V + +Y P G  ++ ++
Sbjct: 183 GTLVDGRKIVKFEDTPPMPTYLFALVAGDLEVFRDKFDTRSRDVPLEIYVPPGVLDRAKY 242

Query: 261 ALHVASKVLPFYKDYFNIAYPLPKIDLVAIPVGHPSEVEEIFDDISY----NKGASIIR 315
           AL    + + FY++YF + Y LP  D+VA+P            D S     N G    R
Sbjct: 243 ALDETKRSIEFYEEYFGLPYALPI-DIVAVP------------DFSAGAMENWGLVTFR 288



 Score = 84.5 bits (209), Expect = 2e-16
 Identities = 46/161 (28%), Positives = 69/161 (42%), Gaps = 19/161 (11%)

Query: 3   EKKPFERYFKIV------SETKELKLHVIDLDFKKVQLELADGKVLTPETKISTEDETIT 56
           EK  FE    I       S    L L  + L+ + V++   DGK LT   ++  +  TIT
Sbjct: 36  EKTTFEGSVTIRLDAGWRSGADPLVLDAVGLEIRSVKI---DGKALTAWYRLDGDALTIT 92

Query: 57  LTFSETLPV---GEVKLLFEYVGEL-NDKMKGFYRSKYRSVLHDEDRYMAVTQFCSTDAR 112
           +             + + +E+ G + ND ++G YRS Y        +   +TQ  +  AR
Sbjct: 93  VAPPIPERSERPFTLAITYEFTGPVSNDTLEGLYRSGY------GGKPYLITQCEAEGAR 146

Query: 113 RCFPCWDEPAVKAKFAISLSVPSSKVALSNMVRIAILDDED 153
           R FPC DEP VKA F +++        +SN   I      D
Sbjct: 147 RIFPCIDEPDVKATFTLTIRADKGPKLISNGNLIDGGTLVD 187



 Score = 83.7 bits (207), Expect = 3e-16
 Identities = 34/113 (30%), Positives = 58/113 (51%), Gaps = 13/113 (11%)

Query: 518 GPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEE---TSSDGVLVRVYTPVGK 574
           G +       DG ++++FE +P M TYL A+V G+ +   +   T S  V + +Y P G 
Sbjct: 177 GNLIDGGTLVDGRKIVKFEDTPPMPTYLFALVAGDLEVFRDKFDTRSRDVPLEIYVPPGV 236

Query: 575 REQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAL---------NPGTVGYY 618
            ++ ++AL    + + FY++YF + Y LP  D+VA+         N G V + 
Sbjct: 237 LDRAKYALDETKRSIEFYEEYFGLPYALPI-DIVAVPDFSAGAMENWGLVTFR 288



 Score = 63.7 bits (155), Expect = 4e-10
 Identities = 43/215 (20%), Positives = 82/215 (38%), Gaps = 30/215 (13%)

Query: 288 VAIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGDGKYPILLRPSFQIPVGHPSEVEEI 347
           + + V  P E+ + FD I Y KGAS++RML+  +G+  +   L   F+   G  +   ++
Sbjct: 381 IRVDVYDPKEINDFFDAIVYEKGASVLRMLETLLGEEAFRKGLSLYFKRHAGGNATTMDL 440

Query: 348 FDDISYNKGASIIRMLQKYI-----------GDGVNS-------------SSDSLWYVPL 383
           +  +    G  +    + ++               +                   W +PL
Sbjct: 441 WKALEDASGKDLSAFFESWLSQAGYPVLTVSVRYDDFFKLTQKQFTPPGQEEKRPWPIPL 500

Query: 384 SFCTQANPSEEVFSAEMSTRVTQVTIPDVSPGHWI--KLNPGTVGYYRVKYPRETLAQFI 441
           +         +V    ++     VT   V    +   K+N     +YRV Y  ++L++ +
Sbjct: 501 AIKLLDGGGVKV--LLLTEGEQTVTFELVGIPPFPSLKVNDSAPVFYRVDYSDQSLSKLL 558

Query: 442 PSVEDKSIPPLDRLSLLDDLFALAQVGEVSLVEVL 476
               D  +    RL+L+ D  AL   G+ S  + L
Sbjct: 559 QH--DPRLEAAQRLALVADRRALTAAGKGSAEDKL 591



 Score = 53.3 bits (128), Expect = 7e-07
 Identities = 15/47 (31%), Positives = 26/47 (55%)

Query: 325 KYPILLRPSFQIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGDGV 371
           +  +      ++ V  P E+ + FD I Y KGAS++RML+  +G+  
Sbjct: 372 EDSLPSSHPIRVDVYDPKEINDFFDAIVYEKGASVLRMLETLLGEEA 418



 Score = 46.7 bits (111), Expect = 7e-05
 Identities = 31/145 (21%), Positives = 51/145 (35%), Gaps = 8/145 (5%)

Query: 819 IELYQNSDMQEEKDRISRSFSALKDPELLRKVL-DFSMSDLVRAQDSVFVIISAAQTKTG 877
           +E YQ+    E     + +F            L   S  D V  QD   +I +A +    
Sbjct: 716 LEAYQSPTRAEALRDFADAFGRFPLVMDKWFALQAISPGDTVLEQDIGLMIHAAFEAPNP 775

Query: 878 RELAWEFLK---NNYATFTERYKGG--LLGRLVKHTTENFASESHAQEVTEFFTKNPTSW 932
            E  W +      N A        G   LG +V      F  +   +++ EF        
Sbjct: 776 NEARWLYGTFAFENPALLHALDGSGYRFLGVIVL-ELNIFNPQLALRKIEEFLELKKLKA 834

Query: 933 IERTVQQSVETIRLNSECLKRDGEA 957
           +ER +++++ETI      L +D   
Sbjct: 835 LERLIREALETIAARER-LSKDLAE 858



 Score = 43.6 bits (103), Expect = 7e-04
 Identities = 30/145 (20%), Positives = 49/145 (33%), Gaps = 8/145 (5%)

Query: 957  ALYQNSDMQEEKDRISRSFSALKDPELLRKVL-DFSMSDLVRAQDSVFVIISAAQTKTGR 1015
              YQ+    E     + +F            L   S  D V  QD   +I +A +     
Sbjct: 717  EAYQSPTRAEALRDFADAFGRFPLVMDKWFALQAISPGDTVLEQDIGLMIHAAFEAPNPN 776

Query: 1016 ELAWDFLK---NNYATFTERYKGG--LLGRLVKHTTENFASESHAQEVTEFFTKNPTSWI 1070
            E  W +      N A        G   LG +V      F  +   +++ EF        +
Sbjct: 777  EARWLYGTFAFENPALLHALDGSGYRFLGVIVL-ELNIFNPQLALRKIEEFLELKKLKAL 835

Query: 1071 ERTVQQSVETIRLNSECLKRDGEAV 1095
            ER +++++ETI      L +D    
Sbjct: 836  ERLIREALETIAARER-LSKDLAEK 859


>gnl|CDD|189009 cd09602, M1_APN_3, Peptidase M1 family containing Aminopeptidase N.
            This family contains bacterial and eukaryotic
           aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease
           belonging to the M1 gluzincin family. APN consists of a
           small N-terminal cytoplasmic domain, a single
           transmembrane domain and a large extracellular
           ectodomain that contains the active site.  It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and, in higher eukaryotes,
           is present in a variety of human tissues and cell types
           (leukocyte, fibroblast, endothelial and epithelial
           cells). APN expression is dysregulated in inflammatory
           diseases such as chronic pain, rheumatoid arthritis,
           multiple sclerosis, systemic sclerosis, systemic lupus
           erythematosus, polymyositis/dermatomyosytis and
           pulmonary sarcoidosis, and is enhanced in tumor cells
           such as melanoma, renal, prostate, pancreas, colon,
           gastric and thyroid cancers. It is predominantly
           expressed on stem cells and on cells of the granulocytic
           and monocytic lineages at distinct stages of
           differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 438

 Score =  112 bits (283), Expect = 4e-26
 Identities = 40/101 (39%), Positives = 55/101 (54%), Gaps = 2/101 (1%)

Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
           MEN G VT+RE   +   + T+A  R   A  + HE+AH WFG+LVTM+WW  LWLNE +
Sbjct: 256 MENPGCVTFREN-YVFREEVTTAQ-RLRRANTIAHEMAHMWFGDLVTMKWWNDLWLNESF 313

Query: 756 ASFVEFLCVHHLFPEYDIWTQFVTDNLVRALELDALKSSHP 796
           A ++ +  +    P  D W  F  D    A   D   ++HP
Sbjct: 314 AEYMAYKALSEATPFTDAWLTFFLDRKQWAYRADQTPTTHP 354



 Score =  102 bits (256), Expect = 1e-22
 Identities = 48/154 (31%), Positives = 78/154 (50%), Gaps = 11/154 (7%)

Query: 145 RIAILDDEDRYMAVTQFELTDARRCFPCWDEPAVKAKFSISLSVPNNKVALSNMPVKSES 204
           R     D   Y+  TQFE  DARR FPC+D+P +KA F+++++ P +   +SN     +S
Sbjct: 100 RFVDPADGKTYL-YTQFEPDDARRVFPCFDQPDLKAPFTLTVTAPKDWTVISNTAATEQS 158

Query: 205 PQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHV 264
              +G+   +F  +P +STYL A V G +  + +   DG+ + +Y    +    Q     
Sbjct: 159 TIRNGYVRWEFPETPPLSTYLFAFVAGPYHVISDKEHDGIPLGLYC---RESLAQALDRD 215

Query: 265 ASKVLP-------FYKDYFNIAYPLPKIDLVAIP 291
           A ++         F+++YF I YP  K D V +P
Sbjct: 216 ADEIFEITKQGLDFFEEYFGIPYPFGKYDQVFVP 249



 Score = 52.2 bits (126), Expect = 9e-07
 Identities = 28/115 (24%), Positives = 48/115 (41%), Gaps = 20/115 (17%)

Query: 519 PVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQG 578
             +  + +    R  +F  +P +STYL A V G +  + +   DG+ + +Y    +    
Sbjct: 154 ATEQSTIRNGYVRW-EFPETPPLSTYLFAFVAGPYHVISDKEHDGIPLGLYC---RESLA 209

Query: 579 QFALHVASKVLP-------FYKDYFNIAYPLPKIDLVAL---------NPGTVGY 617
           Q     A ++         F+++YF I YP  K D V +         NPG V +
Sbjct: 210 QALDRDADEIFEITKQGLDFFEEYFGIPYPFGKYDQVFVPEFNFGAMENPGCVTF 264



 Score = 49.9 bits (120), Expect = 6e-06
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 288 VAIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 323
           +A  V +  +    FD I+Y KGAS+++ L   +G+
Sbjct: 355 IAGDVENTEDALNNFDGITYAKGASVLKQLVALLGE 390



 Score = 47.2 bits (113), Expect = 4e-05
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 337 PVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 369
            V +  +    FD I+Y KGAS+++ L   +G+
Sbjct: 358 DVENTEDALNNFDGITYAKGASVLKQLVALLGE 390


>gnl|CDD|233857 TIGR02412, pepN_strep_liv, aminopeptidase N, Streptomyces lividans
           type.  This family is a subset of the members of the
           zinc metallopeptidase family M1 (pfam01433), with a
           single member characterized in Streptomyces lividans 66
           and designated aminopeptidase N. The spectrum of
           activity may differ somewhat from the aminopeptidase N
           clade of E. coli and most other Proteobacteria, well
           separated phylogenetically within the M1 family. The M1
           family also includes leukotriene A-4
           hydrolase/aminopeptidase (with a bifunctional active
           site).
          Length = 831

 Score =  100 bits (251), Expect = 2e-21
 Identities = 45/102 (44%), Positives = 58/102 (56%), Gaps = 4/102 (3%)

Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
           MEN G VT+ E   L  ++ T A  ++N A V+ HE+AH WFG+LVTM WW  LWLNE +
Sbjct: 261 MENAGCVTFAEN-FLHRAEATRA-EKENRAGVILHEMAHMWFGDLVTMRWWNDLWLNESF 318

Query: 756 ASFVEFLCVHHLFPEY-DIWTQFVTDNLVRALELDALKSSHP 796
           A ++  L       EY D WT F       A E D L ++HP
Sbjct: 319 AEYMGTLASAEA-TEYTDAWTTFAAQGKQWAYEADQLPTTHP 359



 Score = 78.3 bits (193), Expect = 1e-14
 Identities = 40/135 (29%), Positives = 62/135 (45%), Gaps = 4/135 (2%)

Query: 159 TQFELTDARRCFPCWDEPAVKAKFSISLSVPNNKVALSNMPVKSESPQPDGHRLLQFETS 218
           TQFE  DARR F  +D+P +KA F  S+  P +   +SN      +P+P   R  +F  +
Sbjct: 122 TQFEPADARRVFAVFDQPDLKANFKFSVKAPEDWTVISNSRETDVTPEPAD-RRWEFPET 180

Query: 219 PIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQ--GQFALHVASKVLPFYKDYF 276
           P +STYL AV  G + +  +  S    + +Y      +         +  + L F+   F
Sbjct: 181 PKLSTYLTAVAAGPY-HSVQDESRSYPLGIYARRSLAQYLDADAIFTITRQGLAFFHRKF 239

Query: 277 NIAYPLPKIDLVAIP 291
              YP  K D + +P
Sbjct: 240 GYPYPFKKYDQIFVP 254



 Score = 39.8 bits (93), Expect = 0.009
 Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 13/104 (12%)

Query: 523 ESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQ--GQF 580
            +P+P   R  +F  +P +STYL AV  G + +  +  S    + +Y      +      
Sbjct: 166 VTPEPAD-RRWEFPETPKLSTYLTAVAAGPY-HSVQDESRSYPLGIYARRSLAQYLDADA 223

Query: 581 ALHVASKVLPFYKDYFNIAYPLPKIDLVAL---------NPGTV 615
              +  + L F+   F   YP  K D + +         N G V
Sbjct: 224 IFTITRQGLAFFHRKFGYPYPFKKYDQIFVPEFNAGAMENAGCV 267



 Score = 39.0 bits (91), Expect = 0.018
 Identities = 14/43 (32%), Positives = 24/43 (55%), Gaps = 8/43 (18%)

Query: 335 QIPVGHP--------SEVEEIFDDISYNKGASIIRMLQKYIGD 369
           Q+P  HP        ++    FD I+Y KGAS+++ L  ++G+
Sbjct: 353 QLPTTHPIVADVADLADALSNFDGITYAKGASVLKQLVAWVGE 395



 Score = 38.6 bits (90), Expect = 0.022
 Identities = 13/45 (28%), Positives = 24/45 (53%)

Query: 282 LPKIDLVAIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGDGKY 326
           LP    +   V   ++    FD I+Y KGAS+++ L  ++G+  +
Sbjct: 354 LPTTHPIVADVADLADALSNFDGITYAKGASVLKQLVAWVGEEAF 398


>gnl|CDD|221257 pfam11838, DUF3358, Domain of unknown function (DUF3358).  This
            domain is functionally uncharacterized. This domain is
            found in eukaryotes and bacteria. This domain is found to
            the C-terminus of an aminopeptidase domain.
          Length = 323

 Score = 92.4 bits (230), Expect = 5e-20
 Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 6/132 (4%)

Query: 957  ALYQNSDMQEEKDRISRSFSALKDPELLRKVLDFSMSD-LVRAQDSVFVIISAAQT-KTG 1014
            A Y+N    + K+   R+ +A+ DPEL  K L+F + D  V  QD   V+    ++ + G
Sbjct: 192  AEYKNDPTADGKEAALRALAAVPDPELKAKALNFLLDDDSVSNQDIRAVMAGLRRSNQAG 251

Query: 1015 RELAWDFLKNNYATFTERY--KGGLLGRLVKHTTENFASESHAQEVTEFFTKNP--TSWI 1070
            REL W +++ N+    ++       LGR+V      F+S     EV  F  + P  T  +
Sbjct: 252  RELLWPWVERNFDALAKKLPRGSSALGRVVGLYPSGFSSAELLDEVEAFLARKPKDTPGL 311

Query: 1071 ERTVQQSVETIR 1082
             R++ Q+++TIR
Sbjct: 312  RRSLAQALDTIR 323



 Score = 92.0 bits (229), Expect = 6e-20
 Identities = 39/132 (29%), Positives = 66/132 (50%), Gaps = 6/132 (4%)

Query: 820 ELYQNSDMQEEKDRISRSFSALKDPELLRKVLDFSMSD-LVRAQDSVFVIISAAQT-KTG 877
             Y+N    + K+   R+ +A+ DPEL  K L+F + D  V  QD   V+    ++ + G
Sbjct: 192 AEYKNDPTADGKEAALRALAAVPDPELKAKALNFLLDDDSVSNQDIRAVMAGLRRSNQAG 251

Query: 878 RELAWEFLKNNYATFTERY--KGGLLGRLVKHTTENFASESHAQEVTEFFTKNP--TSWI 933
           REL W +++ N+    ++       LGR+V      F+S     EV  F  + P  T  +
Sbjct: 252 RELLWPWVERNFDALAKKLPRGSSALGRVVGLYPSGFSSAELLDEVEAFLARKPKDTPGL 311

Query: 934 ERTVQQSVETIR 945
            R++ Q+++TIR
Sbjct: 312 RRSLAQALDTIR 323



 Score = 77.7 bits (192), Expect = 3e-15
 Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 1/90 (1%)

Query: 417 WIKLNPGTVGYYRVKYPRETLAQFIPSVEDKSIPPLDRLSLLDDLFALAQVGEVSLVEVL 476
            + LN   +GYYRV+Y  E+LA    ++    + PLDR  L  D +AL + GE+   + L
Sbjct: 1   LVLLNDDDLGYYRVRYDPESLATLGEALARLKLDPLDRAGLWADAWALVRDGELPTRDFL 60

Query: 477 KMIQSMT-HEDNYTVWITICNCLQKIDLLL 505
            ++ +   +E +Y V   I   L  +   L
Sbjct: 61  DLVLAFLPNETDYVVLSEILAQLGTLKSAL 90



 Score = 47.7 bits (114), Expect = 2e-05
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 607 LVALNPGTVGYYRVKYPRETLAQFIPSVEDKSIPPLDRLSLLDDLFA 653
           LV LN   +GYYRV+Y  E+LA    ++    + PLDR  L  D +A
Sbjct: 1   LVLLNDDDLGYYRVRYDPESLATLGEALARLKLDPLDRAGLWADAWA 47


>gnl|CDD|189002 cd09595, M1, Peptidase M1 family contains aminopeptidase N and
           leukotriene A4 hydrolase.  M1 Peptidase family includes
           aminopeptidase N (APN) and leukotriene A4 hydrolase
           (LTA4H).  All peptidases in this family bind a single
           catalytic zinc ion which is tetrahedrally co-ordinated
           by three amino acid ligands and a water molecule that
           forms the nucleophile on activation during catalysis.
           APN consists of a small N-terminal cytoplasmic domain, a
           single transmembrane domain and a large extracellular
           ectodomain that contains the active site. It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and is present in a variety
           of human tissues and cell types. APN expression is
           dysregulated in many inflammatory diseases and is
           enhanced in numerous tumor cells, making it a lead
           target in the development of anti-cancer and
           anti-inflammatory drugs. LTA4H is a bifunctional enzyme,
           possessing an aminopeptidase as well as an epoxide
           hydrolase activity. The two activities occupy different,
           but overlapping sites. The activity and physiological
           relevance of the aminopeptidase in LTA4H is as yet
           unknown while the epoxide hydrolase converts leukotriene
           A4 (LTA4) into leukotriene B4 (LTB4), a potent
           chemotaxin that is fundamental to the inflammatory
           response of mammals.
          Length = 407

 Score = 89.4 bits (222), Expect = 1e-18
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 7/104 (6%)

Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
           MEN GL+ + +  LL           + +  V+ HELAHQWFGNLVT   W  LWLNEG+
Sbjct: 244 MENPGLIFFSQSLLLAMIDAGD----ELLENVIAHELAHQWFGNLVTGARWNDLWLNEGF 299

Query: 756 ASFVEFLCVHHLFPEY---DIWTQFVTDNLVRALELDALKSSHP 796
           A+++E L +   F          +            D L  + P
Sbjct: 300 ATYLEGLWMEATFGTSAREAYRLEGTRGLRRWRKLQDELPPAPP 343



 Score = 65.9 bits (161), Expect = 5e-11
 Identities = 42/172 (24%), Positives = 71/172 (41%), Gaps = 15/172 (8%)

Query: 122 AVKAKFAISLSVPSSKVALSNMVRIAILDDEDRYMAVTQFELTDARRCFPCWDEPAVKAK 181
           ++K +   S   P   +             E  ++  +Q E   AR  FPC D P   A 
Sbjct: 79  SLKVEVEFSGKPPPMGLGWQTA------GAEGAFLF-SQGEAIGARSWFPCQDRPDSVAT 131

Query: 182 FSISLSVPNNK-VALSNMPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDY-VEET 239
           +  +++VP+   VA SN   K +         ++F     ++ YL+A+VVG+ +Y V E 
Sbjct: 132 YYTTVTVPDKTLVAASNGN-KDDGEVRRK--WVEFSPPIPIAPYLIALVVGDLEYPVREQ 188

Query: 240 SSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAIP 291
           +  G+ + +Y            L  A     FY+      YP  + D+V +P
Sbjct: 189 TVQGIPLHLYFLTPLAVDTVLRLQSAFL---FYETDLGGPYPYSEYDVVEVP 237



 Score = 42.0 bits (99), Expect = 0.002
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 332 PSFQIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 369
           P+  +   HP +    FD + Y KGA ++R L+K +GD
Sbjct: 340 PAPPVMGKHPDDPSVEFDGVFYEKGALVLRYLEKRLGD 377



 Score = 41.6 bits (98), Expect = 0.002
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 290 IPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 323
           +   HP +    FD + Y KGA ++R L+K +GD
Sbjct: 344 VMGKHPDDPSVEFDGVFYEKGALVLRYLEKRLGD 377



 Score = 40.1 bits (94), Expect = 0.006
 Identities = 28/137 (20%), Positives = 45/137 (32%), Gaps = 10/137 (7%)

Query: 9   RYFKIVSETKELKLHVIDLDFKKVQLELADGKVLTPETKISTEDETITL-TFSETLPVGE 67
                     EL L +  L    V +  A       E     E   +T+           
Sbjct: 21  TVDPTEPNLGELVLDLKALAISSVDVNGAAAAFAKNERP-VVEGSRLTIPGPIALDKGQS 79

Query: 68  VKLLFEYVGELNDKMKGFYRSKYRSVLHDEDRYMAVTQFCSTDARRCFPCWDEPAVKAKF 127
           +K+  E+ G+      G+  +         +     +Q  +  AR  FPC D P   A +
Sbjct: 80  LKVEVEFSGKPPPMGLGWQTA-------GAEGAFLFSQGEAIGARSWFPCQDRPDSVATY 132

Query: 128 AISLSVPSSK-VALSNM 143
             +++VP    VA SN 
Sbjct: 133 YTTVTVPDKTLVAASNG 149



 Score = 39.7 bits (93), Expect = 0.007
 Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 4/79 (5%)

Query: 535 FETSPIMSTYLVAVVVGEFDY-VEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYK 593
           F     ++ YL+A+VVG+ +Y V E +  G+ + +Y            L  A     FY+
Sbjct: 163 FSPPIPIAPYLIALVVGDLEYPVREQTVQGIPLHLYFLTPLAVDTVLRLQSAFL---FYE 219

Query: 594 DYFNIAYPLPKIDLVALNP 612
                 YP  + D+V +  
Sbjct: 220 TDLGGPYPYSEYDVVEVPE 238


>gnl|CDD|189010 cd09603, M1_APN_4, Peptidase M1 family Aminopeptidase N.  This
           family contains mostly bacterial and some archaeal
           aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease
           belonging to the M1 gluzincin family. APN consists of a
           small N-terminal cytoplasmic domain, a single
           transmembrane domain and a large extracellular
           ectodomain that contains the active site. It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and, in higher eukaryotes,
           is present in a variety of human tissues and cell types
           (leukocyte, fibroblast, endothelial and epithelial
           cells). APN expression is dysregulated in inflammatory
           diseases such as chronic pain, rheumatoid arthritis,
           multiple sclerosis, systemic sclerosis, systemic lupus
           erythematosus, polymyositis/dermatomyosytis and
           pulmonary sarcoidosis, and is enhanced in tumor cells
           such as melanoma, renal, prostate, pancreas, colon,
           gastric and thyroid cancers. It is predominantly
           expressed on stem cells and on cells of the granulocytic
           and monocytic lineages at distinct stages of
           differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 415

 Score = 80.7 bits (200), Expect = 9e-16
 Identities = 41/136 (30%), Positives = 64/136 (47%), Gaps = 2/136 (1%)

Query: 156 MAVTQFELTDARRCFPCWDEPAVKAKFSISLSVPNNKVALSNMPVKSESPQPDGHRLLQF 215
              T  +   A   FPC D P+ KA F IS++VP     +SN  + S +    G     +
Sbjct: 106 GVWTAGQPEGASTWFPCNDHPSDKATFDISVTVPAGYTVVSNGRLVSRTDL-GGRTTWHW 164

Query: 216 ETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDY 275
                M+TYLV + VG ++ +E+T++ GV V  Y P G     + A     ++L F+++ 
Sbjct: 165 RMDEPMATYLVTLAVGRYEVLEDTTAGGVPVVYYVPPGLAADARRAFARTPEMLDFFEEL 224

Query: 276 FNIAYPLPKIDLVAIP 291
           F   YP  K   V + 
Sbjct: 225 F-GPYPFEKYGQVVVD 239



 Score = 77.7 bits (192), Expect = 9e-15
 Identities = 26/67 (38%), Positives = 36/67 (53%), Gaps = 9/67 (13%)

Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
           MEN     Y    L  D  +           ++ HELAHQWFG+ VT   W+ +WLNEG+
Sbjct: 246 MENQTRTVYGAGFLDGDRSSER---------LIAHELAHQWFGDSVTCADWSDIWLNEGF 296

Query: 756 ASFVEFL 762
           A++ E+L
Sbjct: 297 ATYAEWL 303



 Score = 53.0 bits (128), Expect = 5e-07
 Identities = 31/133 (23%), Positives = 47/133 (35%), Gaps = 12/133 (9%)

Query: 11  FKIVSETKELKLHVIDLDFKKVQLELADGKVLTPETKISTEDETITLTFSETLPVG-EVK 69
            +   +   L L +  L    V +   DG+          + + +T+T    LP G    
Sbjct: 26  ARATQDLSSLNLDLAGLTVSSVTV---DGRPAAF----RHDGDELTITPPRPLPKGQTFT 78

Query: 70  LLFEYVGELNDKMKGFYRSKYRSVLHDEDRYMAVTQFCSTDARRCFPCWDEPAVKAKFAI 129
           +   Y G       G Y         +       T      A   FPC D P+ KA F I
Sbjct: 79  VTVRYSGV---PRPGLYPWGGDGGWEEGPD-GVWTAGQPEGASTWFPCNDHPSDKATFDI 134

Query: 130 SLSVPSSKVALSN 142
           S++VP+    +SN
Sbjct: 135 SVTVPAGYTVVSN 147



 Score = 49.9 bits (120), Expect = 5e-06
 Identities = 24/81 (29%), Positives = 39/81 (48%), Gaps = 1/81 (1%)

Query: 528 DGHRLLQFETSPIMSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASK 587
            G     +     M+TYLV + VG ++ +E+T++ GV V  Y P G     + A     +
Sbjct: 157 GGRTTWHWRMDEPMATYLVTLAVGRYEVLEDTTAGGVPVVYYVPPGLAADARRAFARTPE 216

Query: 588 VLPFYKDYFNIAYPLPKIDLV 608
           +L F+++ F   YP  K   V
Sbjct: 217 MLDFFEELF-GPYPFEKYGQV 236



 Score = 33.4 bits (77), Expect = 0.67
 Identities = 12/41 (29%), Positives = 19/41 (46%)

Query: 329 LLRPSFQIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 369
               +   P        E+FDD  Y KGA ++  L++ +GD
Sbjct: 325 YANEADLGPRPPVPGPAELFDDHVYEKGALVLHALRRLLGD 365



 Score = 32.2 bits (74), Expect = 1.6
 Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 2/53 (3%)

Query: 271 FYKDYFNIAYPLPKIDLVAIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 323
            Y  Y        + DL   P        E+FDD  Y KGA ++  L++ +GD
Sbjct: 315 QYALYAYARLYANEADLGPRPPVPGPA--ELFDDHVYEKGALVLHALRRLLGD 365


>gnl|CDD|222169 pfam13485, Peptidase_MA_2, Peptidase MA superfamily. 
          Length = 128

 Score = 60.2 bits (146), Expect = 9e-11
 Identities = 26/99 (26%), Positives = 33/99 (33%), Gaps = 13/99 (13%)

Query: 698 NWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGYAS 757
             G V Y    L+V     +      +  V+ HELAH   GNLV        WL EG A 
Sbjct: 1   WTGGVYYPGSRLIVP--RVAPGDPDELRGVLAHELAHVVLGNLVGGNGNLPRWLTEGLAE 58

Query: 758 FVEFLCVHHLFPEYDIWTQFVTDNLVRALELDALKSSHP 796
           +V       + PE           L  A+    L     
Sbjct: 59  YVA----GRIDPERA-------AELRDAVRSGRLPPLDE 86


>gnl|CDD|189006 cd09599, M1_LTA4H, Peptidase M1 family contains leukotriene A4
           hydrolase.  This family includes leukotriene A4
           hydrolase (LTA4H; E.C. 3.3.2.6) and the close homolog
           cold-active aminopeptidase (Colwellia
           psychrerythraea-type peptidase; ColAP), both members of
           the aminopeptidase M1 family. LTA4H, is a bifunctional
           enzyme possessing an aminopeptidase as well as an
           epoxide hydrolase activity.  The two activities occupy
           different, but overlapping sites. The activity and
           physiological relevance of the aminopeptidase is as yet
           unknown while the epoxide hydrolase converts leukotriene
           A4 (LTA4) into leukotriene B4 (LTB4), a potent
           chemotaxin that is fundamental to the inflammatory
           response of mammals. It accepts a variety of substrates,
           including some opioid, di- and tripeptides, as well as
           chromogenic aminoacyl-p-nitroanilide derivatives. The
           aminopeptidase activity of LTA4H is possibly involved in
           the processing of peptides related to inflammation and
           host defense. Kinetic analysis shows that LTA4H
           hydrolyzes arginyl tripeptides with high efficiency and
           specificity, indicating its function as an arginyl
           aminopeptidase. LTA4H is overexpressed in certain human
           cancers, and has been identified as a functionally
           important target for mediating anticancer properties of
           resveratrol, a well known red wine polyphenolic compound
           with cancer chemopreventive activity.
          Length = 442

 Score = 64.9 bits (159), Expect = 1e-10
 Identities = 20/34 (58%), Positives = 24/34 (70%)

Query: 727 VVGHELAHQWFGNLVTMEWWTHLWLNEGYASFVE 760
           VV HE+AH W GNLVT   W H WLNEG+  ++E
Sbjct: 285 VVAHEIAHSWSGNLVTNATWEHFWLNEGFTVYLE 318



 Score = 49.1 bits (118), Expect = 1e-05
 Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 5/76 (6%)

Query: 159 TQFELTDARRCFPCWDEPAVKAKFSISLSVPNN-KVALSNMPVKSESPQPDGHRLLQFET 217
           +Q +   AR  FPC D P+VK+ +S +++VP      +S +P   +  +P       FE 
Sbjct: 129 SQCQAIHARSLFPCQDTPSVKSTYSATVTVPKELTALMSAIPPVEDGEEPRK--TYHFEQ 186

Query: 218 S-PIMSTYLVAVVVGE 232
             PI  +YL+A+ VG+
Sbjct: 187 PVPI-PSYLIAIAVGD 201



 Score = 37.5 bits (88), Expect = 0.042
 Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 2/59 (3%)

Query: 104 TQFCSTDARRCFPCWDEPAVKAKFAISLSVPSSKVALSNMVRIAILDDEDRYMAVTQFE 162
           +Q  +  AR  FPC D P+VK+ ++ +++VP    AL  M  I  ++D +       FE
Sbjct: 129 SQCQAIHARSLFPCQDTPSVKSTYSATVTVPKELTAL--MSAIPPVEDGEEPRKTYHFE 185


>gnl|CDD|233856 TIGR02411, leuko_A4_hydro, leukotriene A-4
           hydrolase/aminopeptidase.  Members of this family
           represent a distinctive subset within the zinc
           metallopeptidase family M1 (pfam01433). The majority of
           the members of pfam01433 are aminopeptidases, but the
           sequences in this family for which the function is known
           are leukotriene A-4 hydrolase. A dual epoxide hydrolase
           and aminopeptidase activity at the same active site is
           indicated. The physiological substrate for
           aminopeptidase activity is not known.
          Length = 602

 Score = 59.4 bits (144), Expect = 7e-09
 Identities = 22/40 (55%), Positives = 27/40 (67%), Gaps = 1/40 (2%)

Query: 721 RQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGYASFVE 760
           R N+  V+ HELAH W GNLVT   W H WLNEG+  ++E
Sbjct: 277 RSNVD-VIAHELAHSWSGNLVTNCSWEHFWLNEGWTVYLE 315



 Score = 40.1 bits (94), Expect = 0.007
 Identities = 27/106 (25%), Positives = 44/106 (41%), Gaps = 5/106 (4%)

Query: 129 ISLSVPSSKVALSNMVRIAILDDEDRYMAVTQFELTDARRCFPCWDEPAVKAKFSISLSV 188
           IS S      AL  +        +  Y+  +Q +   AR  FPC D P+VK+ ++  +  
Sbjct: 99  ISFSTTPKCTALQWLNPEQTSGKKHPYL-FSQCQAIHARSLFPCQDTPSVKSTYTAEVES 157

Query: 189 PNNKVALSNMPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFD 234
           P   +  S +     S  P  +   Q    PI + YL+A+  G+  
Sbjct: 158 PLPVLM-SGIRDGETSNDPGKYLFKQ--KVPIPA-YLIAIASGDLA 199



 Score = 31.3 bits (71), Expect = 3.6
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 111 ARRCFPCWDEPAVKAKFAISLSVP 134
           AR  FPC D P+VK+ +   +  P
Sbjct: 135 ARSLFPCQDTPSVKSTYTAEVESP 158


>gnl|CDD|233858 TIGR02414, pepN_proteo, aminopeptidase N, Escherichia coli type.
           The M1 family of zinc metallopeptidases contains a
           number of distinct, well-separated clades of proteins
           with aminopeptidase activity. Several are designated
           aminopeptidase N, EC 3.4.11.2, after the Escherichia
           coli enzyme, suggesting a similar activity profile (see
           SP|P04825 for a description of catalytic activity). This
           family consists of all aminopeptidases closely related
           to E. coli PepN and presumed to have similar (not
           identical) function. Nearly all are found in
           Proteobacteria, but members are found also in
           Cyanobacteria, plants, and apicomplexan parasites. This
           family differs greatly in sequence from the family of
           aminopeptidases typified by Streptomyces lividans PepN
           (TIGR02412), from the membrane bound aminopeptidase N
           family in animals, etc [Protein fate, Degradation of
           proteins, peptides, and glycopeptides].
          Length = 863

 Score = 50.0 bits (120), Expect = 7e-06
 Identities = 40/143 (27%), Positives = 68/143 (47%), Gaps = 10/143 (6%)

Query: 156 MAVTQFELTDARRC--FPCWDEPAVKAKFSISLSVPNNK--VALSNMPVKSESPQPDGHR 211
              TQ E    RR   FP  D P V ++++++++    K  V LSN    +    PDG  
Sbjct: 107 NFCTQCEAEGFRRITYFP--DRPDVMSRYTVTITADKKKYPVLLSNGNKIASGELPDGRH 164

Query: 212 LLQFETSPIMSTYLVAVVVGEFDYVEET----SSDGVLVRVYTPVGKREQGQFALHVASK 267
             ++E      +YL A+V G+ D +E+T    S   V +RVY   G +++   A+    K
Sbjct: 165 WAEWEDPFPKPSYLFALVAGDLDVLEDTFTTKSGREVALRVYVEEGNKDKCDHAMESLKK 224

Query: 268 VLPFYKDYFNIAYPLPKIDLVAI 290
            + + ++ F + Y L    +VA+
Sbjct: 225 AMKWDEEVFGLEYDLDIFMIVAV 247



 Score = 48.9 bits (117), Expect = 1e-05
 Identities = 23/63 (36%), Positives = 29/63 (46%)

Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEGY 755
           MEN GL  +    +L D +  +    + I  V+ HE  H W GN VT   W  L L EG 
Sbjct: 255 MENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGL 314

Query: 756 ASF 758
             F
Sbjct: 315 TVF 317



 Score = 41.2 bits (97), Expect = 0.004
 Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 4/72 (5%)

Query: 544 YLVAVVVGEFDYVEET----SSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIA 599
           YL A+V G+ D +E+T    S   V +RVY   G +++   A+    K + + ++ F + 
Sbjct: 177 YLFALVAGDLDVLEDTFTTKSGREVALRVYVEEGNKDKCDHAMESLKKAMKWDEEVFGLE 236

Query: 600 YPLPKIDLVALN 611
           Y L    +VA++
Sbjct: 237 YDLDIFMIVAVD 248



 Score = 30.0 bits (68), Expect = 9.3
 Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 6/44 (13%)

Query: 289 AIPVGHP---SEVEEI---FDDISYNKGASIIRMLQKYIGDGKY 326
           A P+ HP       EI   +    Y KGA +IRML   +G+  +
Sbjct: 349 AGPMAHPVRPESYVEINNFYTATVYEKGAEVIRMLHTLLGEEGF 392


>gnl|CDD|189007 cd09600, M1_APN_1, Peptidase M1 family containing Aminopeptidase N.
            This family contains aminopeptidase N (APN; CD13;
           Alanyl aminopeptidase; EC 3.4.11.2), a Type II integral
           membrane protease belonging to the M1 gluzincin family.
           It includes bacterial-type alanyl aminopeptidases as
           well as PfA-M1 aminopeptidase (Plasmodium
           falciparum-type). APN consists of a small N-terminal
           cytoplasmic domain, a single transmembrane domain and a
           large extracellular ectodomain that contains the active
           site. It preferentially cleaves neutral amino acids from
           the N-terminus of oligopeptides and, in higher
           eukaryotes, is present in a variety of human tissues and
           cell types (leukocyte, fibroblast, endothelial and
           epithelial cells). APN expression is dysregulated in
           inflammatory diseases such as chronic pain, rheumatoid
           arthritis, multiple sclerosis, systemic sclerosis,
           systemic lupus erythematosus,
           polymyositis/dermatomyosytis and pulmonary sarcoidosis,
           and is enhanced in tumor cells such as melanoma, renal,
           prostate, pancreas, colon, gastric and thyroid cancers.
           It is predominantly expressed on stem cells and on cells
           of the granulocytic and monocytic lineages at distinct
           stages of differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 861

 Score = 49.2 bits (118), Expect = 1e-05
 Identities = 23/59 (38%), Positives = 29/59 (49%)

Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEG 754
           MEN GL  +    +L D +  +    + I  V+GHE  H W GN VT   W  L L EG
Sbjct: 255 MENKGLNIFNSKLVLADPETATDADYERIESVIGHEYFHNWTGNRVTCRDWFQLSLKEG 313



 Score = 40.3 bits (95), Expect = 0.006
 Identities = 37/140 (26%), Positives = 58/140 (41%), Gaps = 6/140 (4%)

Query: 158 VTQFELTDARRCFPCWDEPAVKAKFSISLSVPNNK--VALSNMPVKSESPQPDGHRLLQF 215
            TQ E    RR     D P V +KF++ +     K  V LSN  +  E    DG     +
Sbjct: 109 CTQCEAEGFRRITYYLDRPDVMSKFTVRIEADKTKYPVLLSNGNLVEEGELEDGRHFAVW 168

Query: 216 ETSPIMSTYLVAVVVGEFDYVEET----SSDGVLVRVYTPVGKREQGQFALHVASKVLPF 271
           E       YL A+V G+   +E+     S   V + +Y   G   +   A+    K + +
Sbjct: 169 EDPFPKPCYLFALVAGDLGVLEDKFTTKSGRKVALEIYVEPGDESKCAHAMESLKKSMKW 228

Query: 272 YKDYFNIAYPLPKIDLVAIP 291
            +D F + Y L   ++VA+ 
Sbjct: 229 DEDRFGLEYDLDLFNIVAVD 248



 Score = 31.8 bits (73), Expect = 2.2
 Identities = 27/111 (24%), Positives = 44/111 (39%), Gaps = 14/111 (12%)

Query: 503 LLLSNTEYHHLFYQFGPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDYVEET--- 559
           +LLSN          G +  E    DG     +E       YL A+V G+   +E+    
Sbjct: 146 VLLSN----------GNLVEEGELEDGRHFAVWEDPFPKPCYLFALVAGDLGVLEDKFTT 195

Query: 560 -SSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVA 609
            S   V + +Y   G   +   A+    K + + +D F + Y L   ++VA
Sbjct: 196 KSGRKVALEIYVEPGDESKCAHAMESLKKSMKWDEDRFGLEYDLDLFNIVA 246


>gnl|CDD|189001 cd09594, GluZincin, Peptidase Gluzincin family (thermolysin-like
           proteinases, TLPs) includes peptidases M1, M2, M3, M4,
           M13, M32 and M36 (fungalysins).  Gluzincin family
           (thermolysin-like peptidases or TLPs) includes several
           zinc-dependent metallopeptidases such as the M1, M2, M3,
           M4, M13, M32, M36 peptidases (MEROPS classification),
           and contain HEXXH and EXXXD motifs as part of their
           active site. All peptidases in this family bind a single
           catalytic zinc ion which is tetrahedrally co-ordinated
           by three amino acid ligands and a water molecule that
           forms the nucleophile on activation during catalysis. M1
           family includes aminopeptidase N (APN) and leukotriene
           A4 hydrolase (LTA4H).  APN preferentially cleaves
           neutral amino acids from the N-terminus of oligopeptides
           and is present in a variety of human tissues and cell
           types. LTA4H is a bifunctional enzyme, possessing an
           aminopeptidase as well as an epoxide hydrolase activity
           such that the two activities occupy different, but
           overlapping sites. The peptidase M3 or neurolysin-like
           family, includes M3, M2 and M32 metallopeptidases.  The
           M3 peptidases have two subfamilies: M3A, includes thimet
           oligopeptidase (TOP; endopeptidase 3.4.24.15),
           neurolysin (3.4.24.16), and the mitochondrial
           intermediate peptidase; M3B contains oligopeptidase F.
           M2 peptidase angiotensin converting enzyme (ACE, EC
           3.4.15.1) catalyzes the conversion of decapeptide
           angiotensin I to the potent vasopressor octapeptide
           angiotensin II. ACE is a key part of the
           renin-angiotensin system that regulates blood pressure,
           thus ACE inhibitors are important for the treatment of
           hypertension. M32 family includes two eukaryotic enzymes
           from protozoa Trypanosoma cruzi, a causative agent of
           Chagas' disease, and Leishmania major, a parasite that
           causes leishmaniasis, making them attractive targets for
           drug development. The M4 family includes secreted
           protease  thermolysin (EC 3.4.24.27), pseudolysin,
           aureolysin, neutral protease as well as fungalysin and
           bacillolysin (EC 3.4.24.28) that degrade extracellular
           proteins and peptides for bacterial nutrition,
           especially prior to sporulation. Thermolysin is widely
           used as a nonspecific protease to obtain fragments for
           peptide sequencing as well as in production of the
           artificial sweetener aspartame. M13 family includes
           neprilysin (EC 3.4.24.11) and endothelin-converting
           enzyme I (ECE-1, EC 3.4.24.71), which fulfill a broad
           range of physiological roles due to the greater
           variation in the S2' subsite allowing substrate
           specificity and are prime therapeutic targets for
           selective inhibition. Peptidase M36 (fungamysin) family
           includes endopeptidases from pathogenic fungi.
           Fungalysin hydrolyzes extracellular matrix proteins such
           as elastin and keratin. Aspergillus fumigatus causes the
           pulmonary disease aspergillosis by invading the lungs of
           immuno-compromised animals and secreting fungalysin that
           possibly breaks down proteinaceous structural barriers.
          Length = 125

 Score = 42.1 bits (99), Expect = 2e-04
 Identities = 16/47 (34%), Positives = 19/47 (40%), Gaps = 5/47 (10%)

Query: 724 IALVVGHELAHQWFGNLVTMEW-----WTHLWLNEGYASFVEFLCVH 765
              VVGHEL H     LV  +           LNEG + F E L  +
Sbjct: 69  DFGVVGHELTHGVTDQLVGNDPDLLYTNGSGGLNEGPSDFFELLVAY 115


>gnl|CDD|189011 cd09604, M1_APN_5, Peptidase M1 family containing bacterial
           Aminopeptidase N.  This family contains bacterial
           aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease
           belonging to the M1 gluzincin family. APN consists of a
           small N-terminal cytoplasmic domain, a single
           transmembrane domain and a large extracellular
           ectodomain that contains the active site.  It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and, in higher eukaryotes,
           is present in a variety of human tissues and cell types
           (leukocyte, fibroblast, endothelial and epithelial
           cells). APN expression is dysregulated in inflammatory
           diseases such as chronic pain, rheumatoid arthritis,
           multiple sclerosis, systemic sclerosis, systemic lupus
           erythematosus, polymyositis/dermatomyosytis and
           pulmonary sarcoidosis, and is enhanced in tumor cells
           such as melanoma, renal, prostate, pancreas, colon,
           gastric and thyroid cancers. It is predominantly
           expressed on stem cells and on cells of the granulocytic
           and monocytic lineages at distinct stages of
           differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 435

 Score = 43.5 bits (103), Expect = 5e-04
 Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 727 VVGHELAHQWFGNLVTMEWWTHLWLNEGYASFVEFLC 763
           V+ HE+AHQW+  +V  +     WL+EG  +++    
Sbjct: 291 VLAHEIAHQWWYGIVGNDERNEPWLDEGLTTYLTDYY 327



 Score = 40.4 bits (95), Expect = 0.004
 Identities = 29/117 (24%), Positives = 50/117 (42%), Gaps = 18/117 (15%)

Query: 182 FSISLSVPNNKVALSNMPVKSESPQPDGHRLLQFETSPIMSTYLVAVVVGEFDY------ 235
           + ++L+VP   + ++   ++ E+ +  G +  +FE          A  V +F        
Sbjct: 159 YDVTLTVPAGYIVVATGRLQEETTEDGGTKTWEFE----------AENVRDFALAASPKF 208

Query: 236 -VEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAIP 291
            V+E + DG+ VR Y      E  +  L  A K L FY + F   YP  +  +V  P
Sbjct: 209 IVDEATVDGIKVRAYYFPEDAELAKRYLDAAKKALEFYSELF-GPYPYKEFSVVENP 264



 Score = 39.3 bits (92), Expect = 0.010
 Identities = 24/77 (31%), Positives = 33/77 (42%), Gaps = 5/77 (6%)

Query: 547 AVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKID 606
           A+       V+E + DG+ VR Y      E  +  L  A K L FY + F   YP  +  
Sbjct: 201 ALAASPKFIVDEATVDGIKVRAYYFPEDAELAKRYLDAAKKALEFYSELF-GPYPYKEFS 259

Query: 607 LVALNPGTVGYYRVKYP 623
           +V  NP   G    +YP
Sbjct: 260 VVE-NPFPGGM---EYP 272


>gnl|CDD|189018 cd09839, M1_TAF2, TATA binding protein (TBP) associated factor 2.
           This family includes TATA binding protein (TBP)
           associated factor 2 (TAF2, TBP-associated factor
           TAFII150, transcription initiation factor TFIID subunit
           2, RNA polymerase II TBP-associated factor subunit B),
           and has homology to the aminopeptidase N (APN)
           subfamily, belonging to the M1 gluzincin family. TAF2 is
           part of the TFIID multidomain subunit complex essential
           for transcription of most protein-encoded genes by RNA
           polymerase II. TAF2 is known to interact with the
           initiator element (Inr) found at the transcription start
           site of many genes, thus possibly playing a key role in
           promoter binding as well as start-site selection. Image
           analysis has shown TAF2 to form a complex with TAF1 and
           TBP, inferring its role in promoter recognition.
           Peptidases in the M1 family bind a single catalytic zinc
           ion which is tetrahedrally co-ordinated by three amino
           acid ligands and a water molecule that forms the
           nucleophile on activation during catalysis. TAF2,
           however, does not seem to contain any of the active site
           residues.
          Length = 507

 Score = 40.8 bits (96), Expect = 0.004
 Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 2/56 (3%)

Query: 732 LAHQWFGNLVTMEWWTHLWLNEGYASFVEFLCVHHLF--PEYDIWTQFVTDNLVRA 785
           LA QWFG  ++ E W+  WL +G A ++  L V  LF   EY    +   D +   
Sbjct: 322 LASQWFGCFISPEAWSDEWLLKGIAGYITGLFVKKLFGNNEYRFRIKKELDRVCEY 377



 Score = 32.3 bits (74), Expect = 1.7
 Identities = 31/162 (19%), Positives = 65/162 (40%), Gaps = 24/162 (14%)

Query: 166 ARRCFPCWDEPAVKAKFSISLSVPNNKVALSN--MPVKSESPQPDGHRLLQFE-TSPIMS 222
           AR  FPC D+P+    + +  +VP N VA+S+  +  +    +    +   +  T P  +
Sbjct: 157 ARCWFPCVDDPSQLCTWELEFTVPANMVAVSSGDLLEQVYDTEDMRKKTYHYALTVPTAA 216

Query: 223 TYLVAVVVGEFDYVEETSSDGV----LVRVYTPVGKREQGQFALHVASKVLPFYKDYFNI 278
              + + VG F+ + +  +  +    L  +   +   +     LH   +   F+++Y + 
Sbjct: 217 QN-IGLAVGPFEILVDPHAHEITHFCLPGLLPEL---KNTTSYLH---EAFEFFEEYLSC 269

Query: 279 AYPLPKIDLVAIPVGHPSEVEEIFDDISYNKGASII--RMLQ 318
            +P      V         V+E  +D++     SI    +L 
Sbjct: 270 RFPFSSYKQVF--------VDEAAEDVTSYASLSIFSSNLLY 303


>gnl|CDD|70841 pfam07390, P30, Mycoplasma P30 protein.  This family consists of
           several P30 proteins which seem to be specific to
           Mycoplasma agalactiae. P30 is a 30-kDa immunodominant
           antigen and is known to be a transmembrane protein.
          Length = 266

 Score = 32.5 bits (73), Expect = 0.99
 Identities = 21/83 (25%), Positives = 46/83 (55%), Gaps = 6/83 (7%)

Query: 908 TTENFASESHAQE---VTEFFTKNPTSWI---ERTVQQSVETIRLNSECLKRDGEALYQN 961
           T  +F +E++A+E   + + +++NP + I   E+ +++S E  +   + +    EA+Y++
Sbjct: 75  TQNHFGAETNAKESPALNDLYSENPATPIFKNEKGIKESSEGSKNEGDKVIAGKEAIYKD 134

Query: 962 SDMQEEKDRISRSFSALKDPELL 984
            D    K +I+     LKD +L+
Sbjct: 135 IDFDISKVKITIDKKDLKDEDLI 157


>gnl|CDD|204379 pfam10023, DUF2265, Predicted aminopeptidase (DUF2265).  Members of
           this family of bacterial proteins comprise various
           hypothetical proteins and putative aminopeptidases.
           Their exact function, has not, as yet, been defined.
          Length = 338

 Score = 32.6 bits (75), Expect = 1.2
 Identities = 31/127 (24%), Positives = 42/127 (33%), Gaps = 37/127 (29%)

Query: 730 HELAHQWF---GNLVTMEWWTHLWLNEGYASFVEFLCVHHLFPEYDIWTQFVTDNLVRAL 786
           HELAHQ     G+  T         NE +A+ VE   V         W     D   RA 
Sbjct: 172 HELAHQVVYIKGD--TA-------FNESFATAVEREGVRR-------WLAARGDPAARA- 214

Query: 787 ELDALKSSHPTQFVTDNLVRALELDALKSSHPIELYQN----SDMQEEK----DRISRSF 838
              A       QF    L+ A   D L+      LY +       +  K     ++   +
Sbjct: 215 -RYAAFDERRQQFRA--LLLA-TRDRLE-----ALYASPLADERKRAAKAAIFQQLRADY 265

Query: 839 SALKDPE 845
           +ALK   
Sbjct: 266 AALKARW 272


>gnl|CDD|227705 COG5418, COG5418, Predicted secreted protein [Function unknown].
          Length = 164

 Score = 31.4 bits (71), Expect = 1.3
 Identities = 20/83 (24%), Positives = 35/83 (42%), Gaps = 8/83 (9%)

Query: 247 RVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAIPVGHPSEVEEIFDDIS 306
            VY    ++++   A  V   +    KD+  I  P P+ + +  P   P   +E+FD   
Sbjct: 18  YVYRGP-EKKEYDTAKEVRKALPSNPKDWNIIQLPCPEFEYLGWP--RPPMTKEVFDHPG 74

Query: 307 YNK-----GASIIRMLQKYIGDG 324
           Y +        I R+L++   DG
Sbjct: 75  YRRVCRKIADPIGRVLEEEKPDG 97


>gnl|CDD|237585 PRK14015, pepN, aminopeptidase N; Provisional.
          Length = 875

 Score = 32.4 bits (75), Expect = 1.6
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query: 307 YNKGASIIRMLQKYIGDGKY 326
           Y KGA +IRML   +G+  +
Sbjct: 386 YEKGAEVIRMLHTLLGEEGF 405



 Score = 32.0 bits (74), Expect = 2.2
 Identities = 9/17 (52%), Positives = 12/17 (70%)

Query: 353 YNKGASIIRMLQKYIGD 369
           Y KGA +IRML   +G+
Sbjct: 386 YEKGAEVIRMLHTLLGE 402



 Score = 31.6 bits (73), Expect = 3.1
 Identities = 22/59 (37%), Positives = 28/59 (47%)

Query: 696 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHLWLNEG 754
           MEN GL  +    +L D +  +    + I  V+ HE  H W GN VT   W  L L EG
Sbjct: 268 MENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEG 326


>gnl|CDD|240973 cd12529, RRM2_MEI2_like, RNA recognition motif 2 in plant Mei2-like
           proteins.  This subgroup corresponds to the RRM2 of
           Mei2-like proteins that represent an ancient eukaryotic
           RNA-binding proteins family. Their corresponding
           Mei2-like genes appear to have arisen early in eukaryote
           evolution, been lost from some lineages such as
           Saccharomyces cerevisiae and metazoans, and diversified
           in the plant lineage. The plant Mei2-like genes may
           function in cell fate specification during development,
           rather than as stimulators of meiosis. Members in this
           family contain three RNA recognition motifs (RRMs), also
           termed RBDs (RNA binding domains) or RNPs
           (ribonucleoprotein domains). The C-terminal RRM (RRM3)
           is unique to Mei2-like proteins and is highly conserved
           between plants and fungi. To date, the intracellular
           localization, RNA target(s), cellular interactions and
           phosphorylation states of Mei2-like proteins in plants
           remain unclear. .
          Length = 71

 Score = 29.4 bits (66), Expect = 1.8
 Identities = 11/37 (29%), Positives = 21/37 (56%), Gaps = 2/37 (5%)

Query: 501 IDLLLSNTEYHHLFYQFGPVKS--ESPQPDGHRLLQF 535
           +D  +SN + H +F  +G +K   E+P    H+ ++F
Sbjct: 9   LDPSVSNDDLHQIFGAYGEIKEIRETPNKRHHKFIEF 45


>gnl|CDD|206213 pfam14043, WVELL, WVELL protein.  This family includes the B.
           subtilis YfjH protein, which is functionally
           uncharacterized. This is not a homologue of E. coli
           YfjH, a synonym for IscX, which belongs to pfam04384.
           This family of proteins is found in bacteria. Proteins
           in this family are approximately 90 amino acids in
           length and contain a highly conserved WVELL motif.
          Length = 75

 Score = 28.0 bits (63), Expect = 4.8
 Identities = 22/64 (34%), Positives = 28/64 (43%), Gaps = 10/64 (15%)

Query: 869 ISAAQTKTGRELAWEFLKNNYATFTERYKGGLLGRLVKHTTENFAS---ESHAQEVTEFF 925
           +S AQ +T  EL WE  +  YA     YKG       +  TE       E +   + EF 
Sbjct: 18  LSYAQARTWVELLWEDFETTYAKAGRPYKG-------EEMTERIVRQWIEQYGSRLHEFQ 70

Query: 926 TKNP 929
           TKNP
Sbjct: 71  TKNP 74


>gnl|CDD|235046 PRK02507, PRK02507, proton extrusion protein PcxA; Provisional.
          Length = 422

 Score = 29.9 bits (68), Expect = 7.7
 Identities = 8/34 (23%), Positives = 19/34 (55%)

Query: 797 TQFVTDNLVRALELDALKSSHPIELYQNSDMQEE 830
           TQ ++   + +  +D    + P E++ N +++EE
Sbjct: 215 TQQLSKTFIVSPLVDHFFPALPFEIFINPELEEE 248


>gnl|CDD|232987 TIGR00467, lysS_arch, lysyl-tRNA synthetase, archaeal and
           spirochete.  This model represents the lysyl-tRNA
           synthetases that are class I amino-acyl tRNA
           synthetases. It includes archaeal and spirochete
           examples of the enzyme. All other known examples are
           class IIc amino-acyl tRNA synthetases and seem to form a
           separate orthologous set [Protein synthesis, tRNA
           aminoacylation].
          Length = 515

 Score = 29.9 bits (67), Expect = 8.2
 Identities = 20/97 (20%), Positives = 37/97 (38%), Gaps = 5/97 (5%)

Query: 273 KDYFNIAYPLPKIDLVAIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGDGKYPILLRP 332
           K  F+I++ L  I L                D    K  +  R+ +      +     R 
Sbjct: 305 KPEFHISFDLDVIKLYED-YDKFERFYYGVKDKDEEKKRAFKRIYEL----SQPMPSERI 359

Query: 333 SFQIPVGHPSEVEEIFDDISYNKGASIIRMLQKYIGD 369
            +Q+P  H S + +IF++    K   I++ +Q  + D
Sbjct: 360 PYQVPFRHLSVISQIFENNDIEKILEILKRVQYTVDD 396


>gnl|CDD|218033 pfam04339, DUF482, Protein of unknown function, DUF482.  This
           family contains several proteins of uncharacterized
           function.
          Length = 370

 Score = 29.8 bits (68), Expect = 9.3
 Identities = 11/36 (30%), Positives = 13/36 (36%), Gaps = 14/36 (38%)

Query: 741 VTMEWWTHL--------------WLNEGYASFVEFL 762
           V  +    L              W N GYASF +FL
Sbjct: 142 VDPDEAPLLEEAGFLQRLGQQFHWFNRGYASFDDFL 177


>gnl|CDD|237495 PRK13765, PRK13765, ATP-dependent protease Lon; Provisional.
          Length = 637

 Score = 30.0 bits (68), Expect = 9.6
 Identities = 31/122 (25%), Positives = 48/122 (39%), Gaps = 41/122 (33%)

Query: 796 PTQFVTDNLVRALELDALKSSHP----------IELYQNSDMQEEKDRISRSFSALKDPE 845
           P  F+   +V A  LDAL++ HP           E+Y    M++              PE
Sbjct: 275 PCDFI---MVAAGNLDALENMHPALRSRIKGYGYEVYMRDTMED-------------TPE 318

Query: 846 LLRKVLDFSMSDLVR-------AQDSVFVIISAAQTKTGRELAWEFLKNNYATFTERYKG 898
             RK++ F   ++ R        +D+V  II  A+ + GR+         + T   R  G
Sbjct: 319 NRRKLVRFVAQEVKRDGKIPHFDRDAVEEIIREAKRRAGRK--------GHLTLKLRDLG 370

Query: 899 GL 900
           GL
Sbjct: 371 GL 372


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.396 

Gapped
Lambda     K      H
   0.267   0.0731    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 56,227,010
Number of extensions: 5606179
Number of successful extensions: 4734
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4695
Number of HSP's successfully gapped: 85
Length of query: 1102
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 995
Effective length of database: 6,191,724
Effective search space: 6160765380
Effective search space used: 6160765380
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.4 bits)