RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9963
         (800 letters)



>gnl|CDD|189000 cd08662, M13, Peptidase family M13 includes neprilysin,
           endothelin-converting enzyme I.  M13 family of
           metallopeptidases includes neprilysin (neutral
           endopeptidase, NEP, enkephalinase, CD10, CALLA, EC
           3.4.24.11), endothelin-converting enzyme I (ECE-1, EC
           3.4.24.71), erythrocyte surface antigen KELL (ECE-3),
           phosphate-regulating gene on the X chromosome (PHEX),
           soluble secreted endopeptidase (SEP), and damage-induced
           neuronal endopeptidase (DINE)/X-converting enzyme (XCE).
           These proteins consist of a short N-terminal cytoplasmic
           domain, a single transmembrane helix, and a larger
           C-terminal extracellular domain containing the active
           site. Proteins in this family fulfill a broad range of
           physiological roles due to the greater variation in the
           S2' subsite allowing substrate specificity. NEP is
           expressed in a variety of tissues including kidney and
           brain, and is involved in many physiological and
           pathological processes, including blood pressure and
           inflammatory response. It degrades a wide array of
           substrates such as substance P, enkephalins,
           cholecystokinin, neurotensin and somatostatin.  It is an
           important enzyme in the regulation of amyloid-beta
           (Abeta) protein that forms amyloid plaques that are
           associated with Alzeimers disease (AD). ECE-1 catalyzes
           the final rate-limiting step in the biosynthesis of
           endothelins via post-translational conversion of the
           biologically inactive big endothelins. Like NEP, it also
           hydrolyses bradykinin, substance P, neurotensin and
           Abeta.  Endothelin-1 overproduction has been implicated
           in various diseases, including stroke, asthma,
           hypertension, and cardiac and renal failure. Kell is a
           homolog of NEP and constitutes a major antigen on human
           erythrocytes; it preferentially cleaves big endothelin-3
           to produce bioactive endothelin-3, but is also known to
           cleave substance P and neurokinin A. PHEX forms a
           complex interaction with fibroblast growth factor 23
           (FGF23) and matrix extracellular phosphoglycoprotein,
           causing bone mineralization. A loss-of-function mutation
           in PHEX disrupts this interaction leading to
           hypophosphatemic rickets; X-linked hypophosphatemic
           (XLH) rickets is the most common form of metabolic
           rickets. ECEL1 is a brain metalloprotease involved in
           the critical role in the nervous regulation of the
           respiratory system, while DINE (damage induced neuronal
           endopeptidase) is abundantly expressed in the
           hypothalamus and its expression responds to nerve injury
           as well. Thus, majority of these M13 proteases are prime
           therapeutic targets for selective inhibition.
          Length = 611

 Score =  621 bits (1603), Expect = 0.0
 Identities = 244/691 (35%), Positives = 355/691 (51%), Gaps = 84/691 (12%)

Query: 5   VQPCDNFYKFACGNFLKTSVIPPDRASVSTFSKIEDRLSSQIKTVLEEPQSASGP--HIF 62
           V PCD+FY++ACGN+LK   IP D++S  +FS++ +++  ++K +LEE  +         
Sbjct: 1   VDPCDDFYQYACGNWLKNHPIPADKSSYGSFSELREKVEERLKEILEEAAAEKASDSSAE 60

Query: 63  NLMKKFYATCMDEDTLDKLGVQPMKVIIEQLGGLPVVDGDRWDSSRFSWINTTYKLRDLG 122
             +K FY +CMD + ++ LG++P+                                    
Sbjct: 61  QKIKDFYRSCMDTEAIEALGLKPLL----------------------------------- 85

Query: 123 FNHNLLINIVFCKQTQHPINTTQIPGAGPDFKNSTRRRLLIDQVDLK-PLEDFLREGWRN 181
               LL  +                G  PD KNS+R  L +DQ  L  P  D+  +    
Sbjct: 86  ---PLLFGL----------------GVSPDLKNSSRNILYLDQPGLGLPDRDYYLDEKSK 126

Query: 182 KNVRAYHSYMVKTAVLFGAPVSRAFRELKDSIDFEITMTKFIVPDQELRNLSYLYNLMSV 241
           K   AY +Y+ K  VL G     A    ++ + FE  + K    ++E R+    YN M++
Sbjct: 127 KIRAAYKAYLAKLLVLAGEDEEDAEALAEEVLAFETELAKISWSEEERRDPEKTYNPMTL 186

Query: 242 HTLNKYYPSIPCFYLEIDGPSLNKKLEFLNRIIGPKISIDSEELVMVTQPKYMSSIERLV 301
             L K  P I  +              +L  +  P      +E V+VTQP Y+  + +L+
Sbjct: 187 AELQKLAPGID-W------------KAYLEALGLP----SEDEKVIVTQPDYLKKLNKLL 229

Query: 302 QITPKRILANYMLMQTVLSSSAYLCKSARRPYHDFKYVLTGVNSTRIRSQRCVDEVAEKF 361
             TP R L NY++ + + S + YL +  R     +   L+G    R R +RCV  V    
Sbjct: 230 ASTPLRTLKNYLIWRLLDSLAPYLSEEFRDANFFYGKTLSGTKEQRPRWKRCVSLVNGLL 289

Query: 362 SLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTKKNALEKSKVLKVH 421
             A+G LYVR +F   +K    ELV NL++     L+N+ WMD +TKK ALEK   + V 
Sbjct: 290 GEALGRLYVRKYFPPEAKARVEELVENLKKAFRERLENLDWMDEETKKKALEKLDAMTVK 349

Query: 422 IGYPDELLDDDKIDLHYKDLRIVPGDLLTSVLNLYKKSVDYAYTQLHEPINKTSWKEFGS 481
           IGYPD+  D  K+D+ Y DL         +VL L +  ++    +L +P+++T W    +
Sbjct: 350 IGYPDKWRDYSKLDIDYDDL----DSYFGNVLRLRRFELERNLAKLGKPVDRTEWGM--T 403

Query: 482 VATVNAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTFDDEGSQYD 541
             TVNA+ +  +N I FPAGILQ P+F  D P  +NYG IG VIGHEITH FDD+G Q+D
Sbjct: 404 PQTVNAYYNPTQNEIVFPAGILQPPFFDPDAPDAVNYGGIGAVIGHEITHGFDDQGRQFD 463

Query: 542 KFGNVMNWWDDESKMRYNEKASCIIKQYSNYVIEDIGFRINGITTQGENIADNGGVKESY 601
           + GN+ NWW  E +  + E+  C++ QYSNY +   G  +NG  T GENIAD GG++ +Y
Sbjct: 464 EDGNLRNWWTPEDRKAFEERTQCLVDQYSNYEV-PPGLHVNGKLTLGENIADLGGLRLAY 522

Query: 602 RAYLECAKQKNAEYKLPGLENFSDKQMFWISFANVWCTKYRKEAMELLFKNDDHPPAEIR 661
            AY +  K K AE  LP  + F+  Q+F++SFA VWC+KYR EA+  L   D H P + R
Sbjct: 523 DAYKKWLKGKGAE--LPPGDGFTPDQLFFLSFAQVWCSKYRPEALRQLLLTDPHSPGKFR 580

Query: 662 VNGPLSNSYDFARDFQCYSGSK-MNPKHKCE 691
           VNG LSNS +FA  F C  G   MNP+ +C 
Sbjct: 581 VNGVLSNSPEFAEAFNCPPGDPYMNPEKRCR 611



 Score =  115 bits (290), Expect = 1e-26
 Identities = 42/93 (45%), Positives = 55/93 (59%)

Query: 703 RSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTK 762
           R +RCV  V      A+G LYVR +F   +K    ELV NL++     L+N+ WMD +TK
Sbjct: 277 RWKRCVSLVNGLLGEALGRLYVRKYFPPEAKARVEELVENLKKAFRERLENLDWMDEETK 336

Query: 763 KNALEKSKVLKVHIGYPDELLDDDKIDLHYKDL 795
           K ALEK   + V IGYPD+  D  K+D+ Y DL
Sbjct: 337 KKALEKLDAMTVKIGYPDKWRDYSKLDIDYDDL 369


>gnl|CDD|226118 COG3590, PepO, Predicted metalloendopeptidase [Posttranslational
           modification, protein turnover, chaperones].
          Length = 654

 Score =  356 bits (916), Expect = e-112
 Identities = 190/705 (26%), Positives = 303/705 (42%), Gaps = 76/705 (10%)

Query: 1   MDPGVQPCDNFYKFACGNFLKTSVIPPDRASVSTFSKIEDRLSSQIKTVLEE---PQSAS 57
           MD   +P D+ Y +  G +LKT+ IP DR+    F K++DR  + ++ ++E     + A 
Sbjct: 14  MDAMTRPQDDLYGYVNGEWLKTAEIPADRSRDGAFDKLDDRAEALVRDIIEAAAANEQAP 73

Query: 58  GPHIFNLMKKFYATCMDEDTLDKLGVQPMKVIIEQLGGLPVVDGDRWDSSRFSWINTTYK 117
              I   + K Y + MDE   +K GV P+K  + ++  L          S F+      +
Sbjct: 74  EDAILQRIGKLYRSFMDEAKREKAGVDPLKPELAEIDSLA-------SFSDFAAALGQLE 126

Query: 118 LRDLGFNHNLLINIVFCKQTQHPINTTQIPGAGPDFKNSTRRRLLIDQVDLK-PLEDFLR 176
               G                            PDFK+STR  L   Q  L  P   + R
Sbjct: 127 RAGQG----NPFGF----------------SVSPDFKDSTRYVLYFSQSGLGLPDTTYYR 166

Query: 177 EGWRNKNVR-AYHSYMVKTAVLFGAPVSRAFRELKDS----IDFEITMTKFIVPDQELRN 231
           +  ++  +  AY  ++ +   LFG   S    +        +  E  +        + R+
Sbjct: 167 DE-QHAELLAAYKEHVARMLGLFG--FSEEEEDAAKHALRVVALETKLANASWEVVKYRD 223

Query: 232 LSYLYNLMSVHTLNKYYPSIPCFYLEIDGPSLNKKLEFLNRIIGPKISIDSEELVMVTQP 291
           L + YN  +   L    P                    L   +G       +++++V  P
Sbjct: 224 LYHTYNPATFAELQPELPG--------------DDWSLLFSALGQL----PDKVIVVENP 265

Query: 292 KYMSSIERLVQITPKRILANYMLMQTVLSSSAYLCKSARRPYHDF-KYVLTGVNSTRIRS 350
            Y+     L+          ++  + + +++ YL +     + DF    L+G    R R 
Sbjct: 266 FYLKEFASLLAEENWADWKAWLRWKLIRAAAPYLTEDLVDEHFDFYGRTLSGQPEARDRW 325

Query: 351 QRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTKKN 410
           +R V+     F  A+G LYV+ +F   +K +  ELV NL +     +  + WM P+T++ 
Sbjct: 326 KRAVNLAERLFGEAIGLLYVKRYFPPEAKADMEELVANLIKAYKARISKLDWMTPETREK 385

Query: 411 ALEKSKVLKVHIGYPDELLDDDKIDLHYKDLRIVPGDLLTSVLNLYKKSVDYAYTQLHEP 470
           ALEK       IGYPD           Y  L I    L  +VL     +  +  +++ +P
Sbjct: 386 ALEKLNKFTAKIGYPDPWRY-------YSKLEIKRDSLYGNVLRASAFNHAHELSKIGKP 438

Query: 471 INKTSWKEFGSVATVNAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEIT 530
           +++  W       TVNA+ +  KN I FPA ILQ P+F  +     NYG IG VIGHEI 
Sbjct: 439 VDRDEW--EMPPQTVNAYYNPQKNEIVFPAAILQAPFFDPEADSAANYGGIGAVIGHEIG 496

Query: 531 HTFDDEGSQYDKFGNVMNWWDDESKMRYNEKASCIIKQYSNYVIEDIGFRINGITTQGEN 590
           H FDD+G+++D  GN+ +WW DE    + E+   ++ Q+  Y  E  G       T GEN
Sbjct: 497 HGFDDQGAKFDGDGNLNDWWTDEDAAAFKERTKKLVAQFDGYEPEG-GKDNGNALTVGEN 555

Query: 591 IADNGGVKESYRAYLECAKQKNAEYKLPGLENFSDKQMFWISFANVWCTKYRKEAMELLF 650
           IAD GG+  +  AY      K +    P ++ F+  Q F++++A +W  K R E  ++  
Sbjct: 556 IADLGGLAIALDAY------KLSLDPAPVIDGFTGLQRFFLNWAQIWRMKARPEEQQMRL 609

Query: 651 KNDDHPPAEIRVNGPLSNSYDFARDFQCYSGSKM--NPKHKCEVW 693
             D H PAE RVNGP+ N  +F   F    G  M   P+ +  +W
Sbjct: 610 SVDPHSPAEFRVNGPVRNMDEFYEAFDVKEGDAMYRAPEERVVIW 654



 Score = 79.4 bits (196), Expect = 3e-15
 Identities = 31/88 (35%), Positives = 45/88 (51%)

Query: 701 RIRSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQ 760
           R R +R V+     F  A+G LYV+ +F   +K +  ELV NL +     +  + WM P+
Sbjct: 322 RDRWKRAVNLAERLFGEAIGLLYVKRYFPPEAKADMEELVANLIKAYKARISKLDWMTPE 381

Query: 761 TKKNALEKSKVLKVHIGYPDELLDDDKI 788
           T++ ALEK       IGYPD      K+
Sbjct: 382 TREKALEKLNKFTAKIGYPDPWRYYSKL 409


>gnl|CDD|218676 pfam05649, Peptidase_M13_N, Peptidase family M13.  M13 peptidases
           are well-studied proteases found in a wide range of
           organisms including mammals and bacteria. In mammals
           they participate in processes such as cardiovascular
           development, blood-pressure regulation, nervous control
           of respiration, and regulation of the function of
           neuropeptides in the central nervous system. In bacteria
           they may be used for digestion of milk.
          Length = 380

 Score =  296 bits (760), Expect = 3e-93
 Identities = 128/423 (30%), Positives = 205/423 (48%), Gaps = 48/423 (11%)

Query: 8   CDNFYKFACGNFLKTSVIPPDRASVSTFSKIEDRLSSQIKTVLEEPQSASGPH-IFNLMK 66
           CD+FY++ACG +LK + IP DR+S  TFS++ D++  Q++ +LEE  S           K
Sbjct: 1   CDDFYQYACGGWLKNTPIPADRSSYGTFSELRDKIEKQLRAILEEAASGEADSSNERKAK 60

Query: 67  KFYATCMDEDTLDKLGVQPMKVIIEQLGGLPVVDGDRWDSSRFSWINTTYKLRDLGFNHN 126
             Y +CMD D ++KLG++P+K +++++G +          S+F   +   KL   G ++ 
Sbjct: 61  DLYRSCMDTDAIEKLGLKPLKPLLKKIGAI---------KSKFDLADALAKLERGG-SNG 110

Query: 127 LLINIVFCKQTQHPINTTQIPGAGPDFKNSTRRRLLIDQVDLK-PLEDFLREG--WRNKN 183
            L                   G  PDFKNS+R  L + Q  L  P  D+  +      K 
Sbjct: 111 PLFGF----------------GVSPDFKNSSRNILYLSQGGLGLPDRDYYLKDDAKSEKI 154

Query: 184 VRAYHSYMVKTAVLFGAPVSRAFRELKDSIDFEITMTKFIVPDQELRNLSYLYNLMSVHT 243
             AY  Y+ K   L G     A +  +D + FE  + K     +ELR+ + LYN M++  
Sbjct: 155 REAYKKYIAKLLKLAGLDEDEAEKAAEDVVAFETKLAKASWSREELRDPAKLYNPMTLAE 214

Query: 244 LNKYYPSIPCFYLEIDGPSLNKKLEFLNRIIGPKISIDSEELVMVTQPKYMSSIERLVQI 303
           L    P +                 +L+ +  PK   D +E++ V+QP Y+ ++ +L+  
Sbjct: 215 LQALAPGLDW-------------KRYLSALGLPK---DPDEVI-VSQPDYLKALNKLLAE 257

Query: 304 TPKRILANYMLMQTVLSSSAYLCKSARRPYHDFKYV-LTGVNSTRIRSQRCVDEVAEKFS 362
           TP   L NY++ + + S + YL +  R    +F    L+G    R R +RCV  V     
Sbjct: 258 TPLETLKNYLIWKLIDSFAPYLSEEFRDANFEFYGKTLSGTKEQRPRWKRCVSLVNGLLG 317

Query: 363 LAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTKKNALEKSKVLKVHI 422
            A+G LYV  +F   +K +  E+V N+++     L+ + WM  +TKK ALEK   + V I
Sbjct: 318 EALGRLYVEKYFPPEAKADVEEMVKNIKKAFRERLEELDWMSEETKKKALEKLDAMTVKI 377

Query: 423 GYP 425
           GYP
Sbjct: 378 GYP 380



 Score = 94.7 bits (236), Expect = 9e-21
 Identities = 30/77 (38%), Positives = 43/77 (55%)

Query: 703 RSQRCVDEVAEKFSLAVGSLYVRHFFDRTSKENALELVNNLRQEMDYILQNVSWMDPQTK 762
           R +RCV  V      A+G LYV  +F   +K +  E+V N+++     L+ + WM  +TK
Sbjct: 304 RWKRCVSLVNGLLGEALGRLYVEKYFPPEAKADVEEMVKNIKKAFRERLEELDWMSEETK 363

Query: 763 KNALEKSKVLKVHIGYP 779
           K ALEK   + V IGYP
Sbjct: 364 KKALEKLDAMTVKIGYP 380


>gnl|CDD|216499 pfam01431, Peptidase_M13, Peptidase family M13.  Mammalian enzymes
           are typically type-II membrane anchored enzymes which
           are known, or believed to activate or inactivate
           oligopeptide (pro)-hormones such as opioid peptides. The
           family also contains a bacterial member believed to be
           involved with milk protein cleavage.
          Length = 206

 Score =  204 bits (520), Expect = 6e-61
 Identities = 89/210 (42%), Positives = 119/210 (56%), Gaps = 7/210 (3%)

Query: 486 NAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTFDDEGSQYDKFGN 545
           NA+ +   N + FPAGIL  P+F    P  +NYG IG V+GHEI H FDD+G Q+DK GN
Sbjct: 1   NAYYNPGFNYLTFPAGILVPPFFNDLYPPAVNYGGIGFVLGHEIMHGFDDQGIQFDKDGN 60

Query: 546 VMNWWDDESKMRYNEKASCIIKQYSNYVIEDIGFR--INGITTQGENIADNGGVKESYRA 603
           +  WW DE +  + + A C+I QY  Y   D       NG TT GENIAD GG++ + +A
Sbjct: 61  LCGWWTDEDEAEFKDAAQCLIDQYDEYGDPDAVKPDCWNGATTFGENIADLGGLRIALKA 120

Query: 604 YLECAKQKNA-EYKLPGLENFSDKQMFWISFANVWCTKYRKEAMELLFKNDDHPPAEIRV 662
           Y     +K+A E +LPGLEN +  Q+F+ ++A  WC K R +        D H P E+RV
Sbjct: 121 Y----AKKSANEQRLPGLENLTPDQLFFETYAQSWCRKSRPKNSLTQLLVDPHSPLELRV 176

Query: 663 NGPLSNSYDFARDFQCYSGSKMNPKHKCEV 692
           NG + N   F   F C  G +M PK +   
Sbjct: 177 NGAVRNMPAFYSAFNCKPGDRMFPKPEKRC 206


>gnl|CDD|189001 cd09594, GluZincin, Peptidase Gluzincin family (thermolysin-like
           proteinases, TLPs) includes peptidases M1, M2, M3, M4,
           M13, M32 and M36 (fungalysins).  Gluzincin family
           (thermolysin-like peptidases or TLPs) includes several
           zinc-dependent metallopeptidases such as the M1, M2, M3,
           M4, M13, M32, M36 peptidases (MEROPS classification),
           and contain HEXXH and EXXXD motifs as part of their
           active site. All peptidases in this family bind a single
           catalytic zinc ion which is tetrahedrally co-ordinated
           by three amino acid ligands and a water molecule that
           forms the nucleophile on activation during catalysis. M1
           family includes aminopeptidase N (APN) and leukotriene
           A4 hydrolase (LTA4H).  APN preferentially cleaves
           neutral amino acids from the N-terminus of oligopeptides
           and is present in a variety of human tissues and cell
           types. LTA4H is a bifunctional enzyme, possessing an
           aminopeptidase as well as an epoxide hydrolase activity
           such that the two activities occupy different, but
           overlapping sites. The peptidase M3 or neurolysin-like
           family, includes M3, M2 and M32 metallopeptidases.  The
           M3 peptidases have two subfamilies: M3A, includes thimet
           oligopeptidase (TOP; endopeptidase 3.4.24.15),
           neurolysin (3.4.24.16), and the mitochondrial
           intermediate peptidase; M3B contains oligopeptidase F.
           M2 peptidase angiotensin converting enzyme (ACE, EC
           3.4.15.1) catalyzes the conversion of decapeptide
           angiotensin I to the potent vasopressor octapeptide
           angiotensin II. ACE is a key part of the
           renin-angiotensin system that regulates blood pressure,
           thus ACE inhibitors are important for the treatment of
           hypertension. M32 family includes two eukaryotic enzymes
           from protozoa Trypanosoma cruzi, a causative agent of
           Chagas' disease, and Leishmania major, a parasite that
           causes leishmaniasis, making them attractive targets for
           drug development. The M4 family includes secreted
           protease  thermolysin (EC 3.4.24.27), pseudolysin,
           aureolysin, neutral protease as well as fungalysin and
           bacillolysin (EC 3.4.24.28) that degrade extracellular
           proteins and peptides for bacterial nutrition,
           especially prior to sporulation. Thermolysin is widely
           used as a nonspecific protease to obtain fragments for
           peptide sequencing as well as in production of the
           artificial sweetener aspartame. M13 family includes
           neprilysin (EC 3.4.24.11) and endothelin-converting
           enzyme I (ECE-1, EC 3.4.24.71), which fulfill a broad
           range of physiological roles due to the greater
           variation in the S2' subsite allowing substrate
           specificity and are prime therapeutic targets for
           selective inhibition. Peptidase M36 (fungamysin) family
           includes endopeptidases from pathogenic fungi.
           Fungalysin hydrolyzes extracellular matrix proteins such
           as elastin and keratin. Aspergillus fumigatus causes the
           pulmonary disease aspergillosis by invading the lungs of
           immuno-compromised animals and secreting fungalysin that
           possibly breaks down proteinaceous structural barriers.
          Length = 125

 Score = 35.5 bits (82), Expect = 0.026
 Identities = 16/60 (26%), Positives = 20/60 (33%), Gaps = 9/60 (15%)

Query: 485 VNAFNHIDKNAIEFPAGILQEPYFAADRPCFLNYGAIGTVIGHEITHTFDDEGSQYDKFG 544
            NA  +     I    GIL           FL       V+GHE+TH   D+    D   
Sbjct: 42  NNAMCNGLDARIVMNDGILVA---------FLLDSDDFGVVGHELTHGVTDQLVGNDPDL 92


>gnl|CDD|189008 cd09601, M1_APN_2, Peptidase M1 Aminopeptidase N family incudes
           tricorn interacting factor F3, Endoplasmic reticulum
           aminopeptidase 1 (ERAP1), Aminopeptidase Q (APQ).  This
           M1 peptidase family includes eukaryotic and bacterial
           members: aminopeptidase N (APN), aminopeptidase Q (APQ,
           laeverin), endoplasmic reticulum aminopeptidase 1
           (ERAP1) as well as tricorn interacting factor F3.
           Aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease,
           consists of a small N-terminal cytoplasmic domain, a
           single transmembrane domain and a large extracellular
           ectodomain that contains the active site. It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and is present in a variety
           of human tissues and cell types (leukocyte, fibroblast,
           endothelial and epithelial cells). APN expression is
           dysregulated in inflammatory diseases such as chronic
           pain, rheumatoid arthritis, multiple sclerosis, systemic
           sclerosis, systemic lupus erythematosus,
           polymyositis/dermatomyosytis and pulmonary sarcoidosis,
           and is enhanced in tumor cells such as melanoma, renal,
           prostate, pancreas, colon, gastric and thyroid cancers.
           It is considered a marker of differentiation since it is
           predominantly expressed on stem cells and on cells of
           the granulocytic and monocytic lineages at distinct
           stages of differentiation. Thus, APN inhibition may lead
           to the development of anti-cancer and anti-inflammatory
           drugs. ERAP1 also known as endoplasmic reticulum
           aminopeptidase associated with antigen processing
           (ERAAP), adipocyte derived leucine aminopeptidase
           (A-LAP) or aminopeptidase regulating tumor necrosis
           factor receptor I (THFRI) shedding (ARTS-1), associates
           with the closely related ER aminopeptidase ERAP2, for
           the final trimming of peptides within the ER for
           presentation by MHC class I molecules. ERAP1 is
           associated with ankylosing spondylitis (AS), an
           inflammatory arthritis that predominantly affects the
           spine. ERAP1 also aids in the shedding of membrane-bound
           cytokine receptors. The tricorn interacting factor F3,
           together with factors F1 and F2, degrades the tricorn
           protease products, producing free amino acids, thus
           completing the proteasomal degradation pathway. F3 is
           homologous to F2, but not F1, and shows a strong
           preference for glutamate in the P1' position. APQ, also
           known as laeverin, is specifically expressed in human
           embryo-derived extravillous trophoblasts (EVTs) that
           invade the uterus during early placentation. It cleaves
           the N-terminal amino acid of various peptides such as
           angiotensin III, endokinin C, and kisspeptin-10, all
           expressed in the placenta in large quantities. APN is a
           receptor for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs are also putative Cry toxin
           receptors. Cry1 proteins are pore-forming toxins that
           bind to the midgut epithelial cell membrane of
           susceptible insect larvae, causing extensive damage.
           Several different toxins, including Cry1Aa, Cry1Ab,
           Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to
           bind to APNs; however, a direct role of APN in
           cytotoxicity has been yet to be firmly established.
          Length = 446

 Score = 35.2 bits (82), Expect = 0.12
 Identities = 14/34 (41%), Positives = 19/34 (55%), Gaps = 11/34 (32%)

Query: 521 IGTVIGHEITHTFDDEGSQYDKFGNV--MNWWDD 552
           + TV+ HE+ H       Q+  FGN+  M WWDD
Sbjct: 286 VATVVAHELAH-------QW--FGNLVTMKWWDD 310


>gnl|CDD|216501 pfam01433, Peptidase_M1, Peptidase family M1.  Members of this
           family are aminopeptidases. The members differ widely in
           specificity, hydrolysing acidic, basic or neutral
           N-terminal residues. This family includes leukotriene-A4
           hydrolase, this enzyme also has an aminopeptidase
           activity.
          Length = 390

 Score = 35.4 bits (82), Expect = 0.13
 Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 22/79 (27%)

Query: 487 AFNHIDKNAI-EFPAG--------ILQEPYFAADR--PCFLNYGAIGTVIGHEITHTFDD 535
               +D+ A+ +F AG          +EP    D       +   +  VI HE+ H    
Sbjct: 250 PLPKLDQVALPDFSAGAMENWGLITYREPALLYDPGNSTSSDKQRVAEVIAHELAH---- 305

Query: 536 EGSQYDKFGNV--MNWWDD 552
              Q+  FGN+  M+WWDD
Sbjct: 306 ---QW--FGNLVTMDWWDD 319


>gnl|CDD|223382 COG0305, DnaB, Replicative DNA helicase [DNA replication,
           recombination, and repair].
          Length = 435

 Score = 33.0 bits (76), Expect = 0.60
 Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 1/53 (1%)

Query: 340 LTGVNSTRIRSQRCVDEVAEKFSLAVGSLYVRH-FFDRTSKENALELVNNLRQ 391
            +G+ S+++R+ R  D+  E+   A   L     F D T      E+ +  R+
Sbjct: 248 ESGIESSKLRTGRLSDDEWERLIKAASELSEAPIFIDDTPGLTITEIRSKARR 300



 Score = 30.7 bits (70), Expect = 3.0
 Identities = 12/53 (22%), Positives = 23/53 (43%), Gaps = 1/53 (1%)

Query: 694 KESVNSTRIRSQRCVDEVAEKFSLAVGSLYVRH-FFDRTSKENALELVNNLRQ 745
           +  + S+++R+ R  D+  E+   A   L     F D T      E+ +  R+
Sbjct: 248 ESGIESSKLRTGRLSDDEWERLIKAASELSEAPIFIDDTPGLTITEIRSKARR 300


>gnl|CDD|182802 PRK10877, PRK10877, protein disulfide isomerase II DsbC;
           Provisional.
          Length = 232

 Score = 32.0 bits (73), Expect = 0.92
 Identities = 14/52 (26%), Positives = 26/52 (50%), Gaps = 7/52 (13%)

Query: 619 GLENFSDKQMFWISFANVWCTKYRKEAMELLFKNDDHPPAEIRVNGPLSNSY 670
           GL++ ++K M      ++WC   R +A +   K  D  PA   V+  +++ Y
Sbjct: 148 GLDSQAEKDM-----KSIWCAADRNKAFDDAMKGKDVSPASCDVD--IADHY 192


>gnl|CDD|215083 PLN00162, PLN00162, transport protein sec23; Provisional.
          Length = 761

 Score = 32.2 bits (74), Expect = 1.1
 Identities = 15/32 (46%), Positives = 21/32 (65%), Gaps = 1/32 (3%)

Query: 78  LDKLGVQPMKVIIEQLGGLPVVDGDRWDSSRF 109
           LD++GV  MKV +E+ GGL VV  + +  S F
Sbjct: 352 LDQVGVAEMKVAVERTGGL-VVLAESFGHSVF 382


>gnl|CDD|148461 pfam06857, ACP, Malonate decarboxylase delta subunit (MdcD).
          This family consists of several bacterial malonate
          decarboxylase delta subunit (MdcD) proteins. Malonate
          decarboxylase of Klebsiella pneumoniae consists of four
          different subunits and catalyzes the conversion of
          malonate plus H+ to acetate and CO2. The catalysis
          proceeds via acetyl and malonyl thioester residues with
          the phosphribosyl-dephospho-CoA prosthetic group of the
          acyl carrier protein (ACP) subunit. MdcC is the (apo)
          ACP subunit. The family also contains the CitD family
          of citrate lyase acyl carrier proteins.
          Length = 87

 Score = 29.1 bits (66), Expect = 2.3
 Identities = 12/25 (48%), Positives = 14/25 (56%)

Query: 75 EDTLDKLGVQPMKVIIEQLGGLPVV 99
           DTL +LGV  +KV I   G L  V
Sbjct: 49 RDTLARLGVTNVKVSINDKGALDCV 73


>gnl|CDD|117515 pfam08948, DUF1859, Domain of unknown function (DUF1859).  This
           domain has no known function. It is predominantly found
           in the N-terminus of bacteriophage spike proteins.
          Length = 126

 Score = 29.7 bits (66), Expect = 2.4
 Identities = 9/65 (13%), Positives = 18/65 (27%)

Query: 438 YKDLRIVPGDLLTSVLNLYKKSVDYAYTQLHEPINKTSWKEFGSVATVNAFNHIDKNAIE 497
           Y D     G     V    ++    A  Q + P+      +F +    +    +    + 
Sbjct: 59  YIDNAENDGAFEIDVEETGQRIKCPAGKQGYFPLLVPGRAKFVARHLGSGKKSVPLFFLN 118

Query: 498 FPAGI 502
           F    
Sbjct: 119 FTIAQ 123


>gnl|CDD|233524 TIGR01677, pln_FAD_oxido, plant-specific FAD-dependent
           oxidoreductase.  This model represents an
           uncharacterized plant-specific family of FAD-dependent
           oxidoreductases. At least seven distinct members are
           found in Arabidopsis thaliana. The family shows
           considerable sequence similarity to three different
           enzymes of ascorbic acid biosynthesis:
           L-galactono-1,4-lactone dehydrogenase (EC 1.3.2.3) from
           higher plants, D-arabinono-1,4-lactone oxidase (EC
           1.1.3.37 from Saccharomyces cerevisiae, and
           L-gulonolactone oxidase (EC 1.1.3.8) from mouse, as well
           as to a bacterial sorbitol oxidase. The class of
           compound acted on by members of this family is unknown.
          Length = 557

 Score = 31.0 bits (70), Expect = 3.3
 Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 4/50 (8%)

Query: 293 YMSSIERLVQITPKRI----LANYMLMQTVLSSSAYLCKSARRPYHDFKY 338
           ++  ++RL  + PK +    L N +L++ V +S AYL K       DF Y
Sbjct: 384 FVLDVKRLRDMEPKSLCGVELYNGILIRYVKASPAYLGKEEDAVDFDFTY 433


>gnl|CDD|201611 pfam01126, Heme_oxygenase, Heme oxygenase. 
          Length = 204

 Score = 30.0 bits (68), Expect = 3.3
 Identities = 19/86 (22%), Positives = 28/86 (32%), Gaps = 13/86 (15%)

Query: 644 EAMEL-LFKNDDHPPAEIRVNGPLSNSYDFARDFQCYSGSKMNPKHKCEVWKESVN---S 699
            A+E  L +N D P A       L+      RD     G+          W+  +    +
Sbjct: 49  SALEEELERNRDSPVAAPIYFPELNRKAALERDLAYLYGAD---------WRADIQDSPA 99

Query: 700 TRIRSQRCVDEVAEKFSLAVGSLYVR 725
           T+    R  +   E   L V   Y R
Sbjct: 100 TQEYVPRIREIGNESPELLVAHAYTR 125


>gnl|CDD|189002 cd09595, M1, Peptidase M1 family contains aminopeptidase N and
           leukotriene A4 hydrolase.  M1 Peptidase family includes
           aminopeptidase N (APN) and leukotriene A4 hydrolase
           (LTA4H).  All peptidases in this family bind a single
           catalytic zinc ion which is tetrahedrally co-ordinated
           by three amino acid ligands and a water molecule that
           forms the nucleophile on activation during catalysis.
           APN consists of a small N-terminal cytoplasmic domain, a
           single transmembrane domain and a large extracellular
           ectodomain that contains the active site. It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and is present in a variety
           of human tissues and cell types. APN expression is
           dysregulated in many inflammatory diseases and is
           enhanced in numerous tumor cells, making it a lead
           target in the development of anti-cancer and
           anti-inflammatory drugs. LTA4H is a bifunctional enzyme,
           possessing an aminopeptidase as well as an epoxide
           hydrolase activity. The two activities occupy different,
           but overlapping sites. The activity and physiological
           relevance of the aminopeptidase in LTA4H is as yet
           unknown while the epoxide hydrolase converts leukotriene
           A4 (LTA4) into leukotriene B4 (LTB4), a potent
           chemotaxin that is fundamental to the inflammatory
           response of mammals.
          Length = 407

 Score = 30.1 bits (68), Expect = 4.9
 Identities = 10/36 (27%), Positives = 14/36 (38%), Gaps = 11/36 (30%)

Query: 521 IGTVIGHEITHTFDDEGSQYDKFGN--VMNWWDDES 554
           +  VI HE+ H +         FGN      W+D  
Sbjct: 268 LENVIAHELAHQW---------FGNLVTGARWNDLW 294


>gnl|CDD|205445 pfam13265, DUF4056, Protein of unknown function (DUF4056).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 355 and 380 amino
           acids in length.
          Length = 271

 Score = 29.5 bits (67), Expect = 5.9
 Identities = 13/46 (28%), Positives = 18/46 (39%), Gaps = 6/46 (13%)

Query: 66  KKFYATCMD---EDTLDKLGVQPMKVIIEQLGGLPVVDGDRWDSSR 108
            + ++  MD      L +LG         Q      +DG  WDSSR
Sbjct: 202 LEQFSQAMDQALPQALKQLGAVSKSETKAQFD---QLDGIWWDSSR 244


>gnl|CDD|239588 cd03511, Rhizopine-oxygenase-like, This CD includes the putative
           hydrocarbon oxygenase, MocD, a bacterial rhizopine
           (3-O-methyl-scyllo-inosamine, 3-O-MSI) oxygenase, and
           other related proteins. It has been proposed that MocD,
           MocE (Rieske-like ferredoxin), and MocF (ferredoxin
           reductase) under the regulation of MocR, act in concert
           to form a ferredoxin oxygenase system that demethylates
           3-O-MSI to form scyllo-inosamine.  This domain family
           appears to be structurally related to the membrane fatty
           acid desaturases and the alkane hydroxylases. They all
           share in common extensive hydrophobic regions that would
           be capable of spanning the membrane bilayer at least
           twice. Comparison of sequences also reveals the
           existence of three regions of conserved histidine
           cluster motifs that contain eight histidine residues:
           HXXXH, HXXHH, and HXXHH. These histidine residues are
           reported to be catalytically essential and proposed to
           be the ligands for the iron atoms contained within
           homologs, stearoyl CoA desaturase and alkane
           hydroxylase.
          Length = 285

 Score = 29.6 bits (67), Expect = 5.9
 Identities = 21/78 (26%), Positives = 31/78 (39%), Gaps = 15/78 (19%)

Query: 197 LFGAPVSRAFRELK------DSIDFEITMTKFIVPDQELRNLSYLYNLMSVHTLNKYYPS 250
           L G P+ R F   +      D+ D   T T           L +LY  M  H  +  YPS
Sbjct: 202 LLGQPILRLFLLAEHGGCPEDANDLRNTRTTLT-----NPPLRFLYWNMPYHAEHHMYPS 256

Query: 251 IPCFYLEIDGPSLNKKLE 268
           +P   L    P L++ ++
Sbjct: 257 VPFHAL----PKLHELIK 270


>gnl|CDD|234286 TIGR03626, L3_arch, archaeal ribosomal protein L3.  This model
           describes exclusively the archaeal class of ribosomal
           protein L3. A separate model (TIGR03625) describes the
           bacterial/organelle form, and both belong to Pfam family
           pfam00297. Eukaryotic proteins are excluded from this
           model [Protein synthesis, Ribosomal proteins: synthesis
           and modification].
          Length = 330

 Score = 29.6 bits (67), Expect = 6.3
 Identities = 13/45 (28%), Positives = 22/45 (48%), Gaps = 5/45 (11%)

Query: 241 VHTLNKYYPSIP-----CFYLEIDGPSLNKKLEFLNRIIGPKISI 280
           VHT  K    +P        + I G S+ ++ E+   I+G +IS+
Sbjct: 140 VHTQPKLVTGVPKKKPEIMEIRIGGGSVEERFEYALSILGKEISV 184


>gnl|CDD|188993 cd06258, M3_like, Peptidase M3-like family, a zincin
           metallopeptidase, includes M3 and M32 families.  The
           peptidase M3-like family, also called neurolysin-like
           family, is part of the "zincins" metallopeptidases, and
           includes M3 and M32 families of metallopeptidases.  The
           M3 family is subdivided into two subfamilies: the
           widespread M3A, which comprises a number of
           high-molecular mass endo- and exopeptidases from
           bacteria, archaea, protozoa, fungi, plants and animals,
           and the small M3B, whose members are enzymes primarily
           from bacteria. Well-known mammalian/eukaryotic M3A
           endopeptidases are the thimet oligopeptidase (TOP;
           endopeptidase 3.4.24.15), neurolysin (alias
           endopeptidase 3.4.24.16), and the mitochondrial
           intermediate peptidase. The first two are intracellular
           oligopeptidases, which act only on relatively short
           substrates of less than 20 amino acid residues, while
           the latter cleaves N-terminal octapeptides from proteins
           during their import into the mitochondria. The M3A
           subfamily also contains several bacterial
           endopeptidases, collectively called oligopeptidases A,
           as well as a large number of bacterial
           carboxypeptidases, called dipeptidyl peptidases (Dcp;
           Dcp II; peptidyl dipeptidase; EC 3.4.15.5). The
           peptidases in the M3 family contain the HEXXH motif that
           forms the active site in conjunction with a
           C-terminally-located Glutamic acid (Glu) residue. A
           single zinc ion is ligated by the side-chains of the two
           Histidine (His) residues, and the more C-terminal Glu.
           Most of the peptidases are synthesized without signal
           peptides or propeptides, and function intracellularly.
           There are similarities to the thermostable
           carboxypeptidases  from Pyrococcus furiosus
           carboxypeptidase (PfuCP), and Thermus aquaticus (TaqCP),
           belonging to peptidase family M32. Little is known about
           function of this family, including carboxypeptidases Taq
           and Pfu.
          Length = 400

 Score = 29.7 bits (67), Expect = 6.4
 Identities = 14/104 (13%), Positives = 32/104 (30%), Gaps = 18/104 (17%)

Query: 292 KYMSSIERLVQITPKRILANYMLMQT--VLSSSAYLCKSARRPYHDFKYVLTGVNSTRIR 349
           K    +  +    P              + SS  Y        YH   Y+L  + + ++R
Sbjct: 301 KLARELLGVRPEGPSP----PPWAAKIHIASSPVY--------YHG--YLLAEMLAAQLR 346

Query: 350 SQRCVDEVAEKF-SLAVGSLYVRHFFDRTSKENALELVNNLRQE 392
                ++  +   +   G+      +   +  +  EL+ +   E
Sbjct: 347 -AHLKEKFGDLVDNPEAGAWLKEKLWRPGNSLDWEELLKSATGE 389


>gnl|CDD|239819 cd04327, ZnMc_MMP_like_3, Zinc-dependent metalloprotease; MMP_like
           sub-family 3. A group of bacterial and fungal
           metalloproteinase domains similar to matrix
           metalloproteinases and astacin.
          Length = 198

 Score = 28.9 bits (65), Expect = 7.5
 Identities = 12/43 (27%), Positives = 16/43 (37%), Gaps = 8/43 (18%)

Query: 631 ISFANVWCTKYRKEAMELLFKNDDHPPAEIRVN---GPLSNSY 670
            + A  W   Y      L FK      A+IR++   G    SY
Sbjct: 26  RAAAREW-LPYAN----LKFKFVTDADADIRISFTPGDGYWSY 63


>gnl|CDD|189009 cd09602, M1_APN_3, Peptidase M1 family containing Aminopeptidase N.
            This family contains bacterial and eukaryotic
           aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease
           belonging to the M1 gluzincin family. APN consists of a
           small N-terminal cytoplasmic domain, a single
           transmembrane domain and a large extracellular
           ectodomain that contains the active site.  It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and, in higher eukaryotes,
           is present in a variety of human tissues and cell types
           (leukocyte, fibroblast, endothelial and epithelial
           cells). APN expression is dysregulated in inflammatory
           diseases such as chronic pain, rheumatoid arthritis,
           multiple sclerosis, systemic sclerosis, systemic lupus
           erythematosus, polymyositis/dermatomyosytis and
           pulmonary sarcoidosis, and is enhanced in tumor cells
           such as melanoma, renal, prostate, pancreas, colon,
           gastric and thyroid cancers. It is predominantly
           expressed on stem cells and on cells of the granulocytic
           and monocytic lineages at distinct stages of
           differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 438

 Score = 29.5 bits (67), Expect = 7.6
 Identities = 10/31 (32%), Positives = 15/31 (48%), Gaps = 11/31 (35%)

Query: 524 VIGHEITHTFDDEGSQYDKFGNV--MNWWDD 552
            I HE+ H +         FG++  M WW+D
Sbjct: 285 TIAHEMAHMW---------FGDLVTMKWWND 306


>gnl|CDD|224076 COG1154, Dxs, Deoxyxylulose-5-phosphate synthase [Coenzyme
           metabolism / Lipid metabolism].
          Length = 627

 Score = 29.5 bits (67), Expect = 7.9
 Identities = 10/52 (19%), Positives = 20/52 (38%), Gaps = 2/52 (3%)

Query: 410 NALEKSKVLK--VHIGYPDELLDDDKIDLHYKDLRIVPGDLLTSVLNLYKKS 459
             L    +L   +++G PDE +D    +    +L +    +   +L   K  
Sbjct: 574 EFLAAHGILVPVLNLGLPDEFIDHGSPEELLAELGLDAEGIARRILEWLKAR 625


>gnl|CDD|238755 cd01478, Sec23-like, Sec23-like: Protein and membrane traffic in
           eukaryotes is mediated by at least in part by the
           budding and fusion of intracellular transport vesicles
           that selectively carry cargo proteins and lipids from
           donor to acceptor organelles. The two main classes of
           vesicular carriers within the endocytic and the
           biosynthetic pathways are COP- and clathrin-coated
           vesicles. Formation of COPII vesicles requires the
           ordered assembly of the coat built from several
           cytosolic components GTPase Sar1, complexes of
           Sec23-Sec24 and Sec13-Sec31. The process is initiated by
           the conversion of GDP to GTP by the GTPase Sar1 which
           then recruits the heterodimeric complex of Sec23 and
           Sec24. This heterodimeric complex generates the
           pre-budding complex. The final step leading to membrane
           deformation and budding of COPII-coated vesicles is
           carried by the heterodimeric complex Sec13-Sec31. The
           members of this CD belong to the Sec23-like family. Sec
           23 is very similar to Sec24. The Sec23 and Sec24
           polypeptides fold into five distinct domains: a
           beta-barrel, a zinc finger, a vWA or trunk, an all
           helical region and a carboxy Gelsolin domain. The
           members of this subgroup lack the consensus MIDAS motif
           but have the overall Para-Rossmann type fold that is
           characteristic of this superfamily.
          Length = 267

 Score = 29.3 bits (66), Expect = 8.6
 Identities = 15/41 (36%), Positives = 23/41 (56%), Gaps = 6/41 (14%)

Query: 69  YATCMDEDTLDKLGVQPMKVIIEQLGGLPVVDGDRWDSSRF 109
           +A C     LD++G+  MKV++   GG  VV  D + +S F
Sbjct: 227 FAGC-----LDQVGLLEMKVLVNSTGGH-VVLSDSFTTSIF 261


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.136    0.408 

Gapped
Lambda     K      H
   0.267   0.0764    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 41,049,602
Number of extensions: 4061272
Number of successful extensions: 3282
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3257
Number of HSP's successfully gapped: 38
Length of query: 800
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 695
Effective length of database: 6,280,432
Effective search space: 4364900240
Effective search space used: 4364900240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.0 bits)