RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1606
         (285 letters)



>gnl|CDD|239797 cd04269, ZnMc_adamalysin_II_like, Zinc-dependent metalloprotease;
           adamalysin_II_like subfamily. Adamalysin II is a snake
           venom zinc endopeptidase. This subfamily contains other
           snake venom metalloproteinases, as well as
           membrane-anchored metalloproteases belonging to the ADAM
           family. ADAMs (A Disintegrin And Metalloprotease) are
           glycoproteins, which play roles in cell signaling, cell
           fusion, and cell-cell interactions.
          Length = 194

 Score =  226 bits (578), Expect = 9e-75
 Identities = 84/195 (43%), Positives = 119/195 (61%), Gaps = 1/195 (0%)

Query: 85  RYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEKLNIFIALVGVVVWTEYDEI 144
           +Y+ELV+VVDN LY  +  N   V +   +I N+++++Y  LNI + LVG+ +WT+ D+I
Sbjct: 1   KYVELVVVVDNSLYKKYGSNLSKVRQRVIEIVNIVDSIYRPLNIRVVLVGLEIWTDKDKI 60

Query: 145 TLNVNGDITLTNFLSYRKDRLVLSHPNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGV 204
           +++ +   TL  FL +++  L+   P+DNAQLLTG  F    VG A  G +C+ ++SGGV
Sbjct: 61  SVSGDAGETLNRFLDWKRSNLLPRKPHDNAQLLTGRDFDGNTVGLAYVGGMCSPKYSGGV 120

Query: 205 NVDHKNVVGLVATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSSSVSPTEWSSCSLEY 264
             DH   + L A T+AHE+GHNLGMEHD   CTC    CIMAPS SS     +S+CS E 
Sbjct: 121 VQDHSRNLLLFAVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPSS-LTDAFSNCSYED 179

Query: 265 LALSFDHGMDYCMRN 279
                  G   C+ N
Sbjct: 180 YQKFLSRGGGQCLLN 194


>gnl|CDD|216491 pfam01421, Reprolysin, Reprolysin (M12B) family zinc
           metalloprotease.  The members of this family are enzymes
           that cleave peptides. These proteases require zinc for
           catalysis. Members of this family are also known as
           adamalysins. Most members of this family are snake venom
           endopeptidases, but there are also some mammalian
           proteins such as human ADAM8 and fertilin. Fertilin and
           closely related proteins appear to not have some active
           site residues and may not be active enzymes.
          Length = 198

 Score =  203 bits (518), Expect = 1e-65
 Identities = 78/198 (39%), Positives = 118/198 (59%), Gaps = 1/198 (0%)

Query: 85  RYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEKLNIFIALVGVVVWTEYDEI 144
           +Y+EL IVVD+ ++  +  +   + +    I N++N +Y  LNI + LVG+ +W++ D+I
Sbjct: 1   KYIELFIVVDHGMFTKYGSDLNKIRQRVHQIVNLVNEIYRPLNIRVVLVGLEIWSDGDKI 60

Query: 145 TLNVNGDITLTNFLSYRKDRLVLSHPNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGV 204
           T+  + + TL  FL +R+  L+    +DNAQLLTG+ F    +G A  G +C+ + S GV
Sbjct: 61  TVQGDANDTLHRFLEWRETDLLKRKSHDNAQLLTGIDFDGNTIGAAYVGGMCSPKRSVGV 120

Query: 205 NVDHKNVVGLVATTVAHEMGHNLGMEHDTT-ECTCPSDRCIMAPSSSSVSPTEWSSCSLE 263
             DH  +V LVA T+AHE+GHNLGM HD    CTC    CIM P +SS    ++S+CS++
Sbjct: 121 VQDHSPIVLLVAVTMAHELGHNLGMTHDDIDGCTCGGGGCIMNPVASSSPGKKFSNCSMD 180

Query: 264 YLALSFDHGMDYCMRNKP 281
                   G   C+ NKP
Sbjct: 181 DYQQFLTKGKPQCLLNKP 198


>gnl|CDD|239795 cd04267, ZnMc_ADAM_like, Zinc-dependent metalloprotease, ADAM_like
           or reprolysin_like subgroup. The adamalysin_like or ADAM
           family of metalloproteases contains proteolytic domains
           from snake venoms, proteases from the mammalian
           reproductive tract, and the tumor necrosis factor alpha
           convertase, TACE. ADAMs (A Disintegrin And
           Metalloprotease) are glycoproteins, which play roles in
           cell signaling, cell fusion, and cell-cell interactions.
          Length = 192

 Score =  109 bits (273), Expect = 5e-29
 Identities = 57/189 (30%), Positives = 90/189 (47%), Gaps = 12/189 (6%)

Query: 85  RYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEKLNIF----IALVGVVVWTE 140
           R +ELV+V D+R+ + FN +  ++  +  ++ N+ N++Y   N+     I+L G+ +   
Sbjct: 1   REIELVVVADHRMVSYFNSDENILQAYITELINIANSIYRSTNLRLGIRISLEGLQILKG 60

Query: 141 YDEITLNVNGDITLTNFLSYRKDRLVLSHPNDNAQLLTGMTFSDG-VVGKALKGPICTYE 199
                   +      N  S+ +    + H  DNA LLT   F +G ++G A  G +C   
Sbjct: 61  EQFAPPIDSDASNTLNSFSFWRAEGPIRH--DNAVLLTAQDFIEGDILGLAYVGSMCNPY 118

Query: 200 FSGGVNVDHKNVVGLVATTVAHEMGHNLGMEHDTTECTCPS----DRCIMAPSSSSVSPT 255
            S GV  D      L A T+AHE+GHNLG EHD  +            IMAP  S ++  
Sbjct: 119 SSVGVVEDT-GFTLLTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSGLNSY 177

Query: 256 EWSSCSLEY 264
            +S CS+  
Sbjct: 178 RFSQCSIGS 186


>gnl|CDD|239801 cd04273, ZnMc_ADAMTS_like, Zinc-dependent metalloprotease,
           ADAMTS_like subgroup. ADAMs (A Disintegrin And
           Metalloprotease) are glycoproteins, which play roles in
           cell signaling, cell fusion, and cell-cell interactions.
           This particular subfamily represents domain
           architectures that combine ADAM-like metalloproteinases
           with thrombospondin type-1 repeats. ADAMTS (a
           disintegrin and metalloproteinase with thrombospondin
           motifs) proteinases are inhibited by TIMPs (tissue
           inhibitors of metalloproteinases), and they play roles
           in coagulation, angiogenesis, development and
           progression of arthritis. They hydrolyze the von
           Willebrand factor precursor and various components of
           the extracellular matrix.
          Length = 207

 Score =  100 bits (250), Expect = 2e-25
 Identities = 61/214 (28%), Positives = 102/214 (47%), Gaps = 27/214 (12%)

Query: 85  RYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALY--EKL--NIFIALVGVVVWTE 140
           RY+E ++V D+++    +     +  +   + N++ +LY    L  +I I +V ++V  +
Sbjct: 1   RYVETLVVADSKMVEFHHGED--LEHYILTLMNIVASLYKDPSLGNSINIVVVRLIVLED 58

Query: 141 YDEITLNVNGDI--TLTNFLSYRKDRLVL--SHPN--DNAQLLTGMTF-----SDGVVGK 189
            +E  L ++G+   +L +F  ++K       S P   D+A LLT         +   +G 
Sbjct: 59  -EESGLLISGNAQKSLKSFCRWQKKLNPPNDSDPEHHDHAILLTRQDICRSNGNCDTLGL 117

Query: 190 ALKGPICTYEFSGGVNVDHKNVVGL-VATTVAHEMGHNLGMEHDTTECTCPSDR---CIM 245
           A  G +C+   S  +N D     GL  A T+AHE+GH LGM HD    +C  +     IM
Sbjct: 118 APVGGMCSPSRSCSINEDT----GLSSAFTIAHELGHVLGMPHDGDGNSCGPEGKDGHIM 173

Query: 246 APSSSSVS-PTEWSSCSLEYLALSFDHGMDYCMR 278
           +P+  + + P  WS CS  YL    D G   C+ 
Sbjct: 174 SPTLGANTGPFTWSKCSRRYLTSFLDTGDGNCLL 207


>gnl|CDD|222320 pfam13688, Peptidase_M84, Metallo-peptidase family M84. 
          Length = 193

 Score = 69.7 bits (171), Expect = 2e-14
 Identities = 51/196 (26%), Positives = 74/196 (37%), Gaps = 35/196 (17%)

Query: 89  LVIVVDNRLYNLFNKNSKLVHRHCKDISNVINA---LYEK-LNIFIALVGVVVWTEYDEI 144
           L++  D      F            +I N++N    +YE+  NI + LV + +       
Sbjct: 7   LLVAADCSYVAAFGGTDAAQ----ANIINLVNTASNVYEREFNISLGLVNLTISDSTCPY 62

Query: 145 T----LNVNGDITLTNFLSYRKDRLVLSHPNDNAQLLTGMTFSDGVVGKALKGPICTYEF 200
           T     + N    L+ F ++   R   +   D A        S G  G A  G +C    
Sbjct: 63  TPPASSSGNASDLLSRFQAFSAWRGRQND--DLAYWTLMTNCSTG--GLAWLGQLCNSGS 118

Query: 201 SGGVNVDHKNVVGLVATT------VAHEMGHNLGMEHD------TTEC-----TCPSD-R 242
           +G V+        +V  T       AHE+GHN G  HD      +  C     TCP+  R
Sbjct: 119 AGSVSNQVSGSANVVVGTATEWQVFAHEIGHNFGAVHDCDSSTESQCCPLSSSTCPAGGR 178

Query: 243 CIMAPSSSSVSPTEWS 258
            IM PSSS    T +S
Sbjct: 179 YIMNPSSSPNI-TYFS 193


>gnl|CDD|238124 cd00203, ZnMc, Zinc-dependent metalloprotease. This super-family of
           metalloproteases contains two major branches, the
           astacin-like proteases and the
           adamalysin/reprolysin-like proteases. Both branches have
           wide phylogenetic distribution, and contain
           sub-families, which are involved in vertebrate
           development and disease.
          Length = 167

 Score = 67.9 bits (166), Expect = 6e-14
 Identities = 42/198 (21%), Positives = 66/198 (33%), Gaps = 52/198 (26%)

Query: 85  RYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEKLNIFIALVGVVVWTEYDEI 144
           + +  V+V D+R      +   L  +    I   +    + LNI   LVGV +       
Sbjct: 1   KVIPYVVVADDRDV----EEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEI------- 49

Query: 145 TLNVNGDITLTNFLSYRKDRLVLSHPNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGV 204
                                      D A L+T   F  G  G A  G +C      GV
Sbjct: 50  ------------------------DKADIAILVTRQDFDGGTGGWAYLGRVCDSLRGVGV 85

Query: 205 NVDHKNVVGLVATTVAHEMGHNLGMEHDTTECTCPSD--------------RCIMAPSSS 250
             D+++     A T+AHE+GH LG  HD                         +M+ +  
Sbjct: 86  LQDNQSGTKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKG 145

Query: 251 SVSPT---EWSSCSLEYL 265
           S S     ++S C ++ +
Sbjct: 146 SFSDGQRKDFSQCDIDQI 163


>gnl|CDD|222240 pfam13582, Reprolysin_3, Metallo-peptidase family M12B
           Reprolysin-like.  This zinc-binding metallo-peptidase
           has the characteristic binding motif HExxGHxxGxxH of
           Reprolysin-like peptidases of family M12B.
          Length = 123

 Score = 63.9 bits (156), Expect = 5e-13
 Identities = 31/117 (26%), Positives = 49/117 (41%), Gaps = 3/117 (2%)

Query: 117 NVINALYEK-LNIFIALVGVVVWTEYDEITLNVNGDITLTNFLSYRKDRLVLSHPNDNAQ 175
           N +N +YE+ L I + LV +++     +   + + + TL N  +   D  + +   D   
Sbjct: 9   NRVNEVYERDLGIRLELVNIIILDSATDPYSSSDANETLDNLQTVF-DARIGTAGYDIGH 67

Query: 176 LLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEHD 232
           L +G     G  G A  G +C      GV+             VAHE+GHN G  H 
Sbjct: 68  LFSG-YDGGGGCGLAYVGGVCNSGKKAGVSASSSPTGDFGIDVVAHEIGHNFGANHT 123


>gnl|CDD|239800 cd04272, ZnMc_salivary_gland_MPs, Zinc-dependent metalloprotease,
           salivary_gland_MPs. Metalloproteases secreted by the
           salivary glands of arthropods.
          Length = 220

 Score = 57.7 bits (140), Expect = 5e-10
 Identities = 53/211 (25%), Positives = 80/211 (37%), Gaps = 42/211 (19%)

Query: 86  YLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEKLN---IFIALVGVVVWTEYD 142
           Y EL +VVD    + F  N +L+      + N  N  Y  L    I + LVG+ +  + D
Sbjct: 2   YPELFVVVDYDHQSEFFSNEQLIRYLAV-MVNAANLRYRDLKSPRIRLLLVGITISKDPD 60

Query: 143 EITLNVNGDI-------TLTNFLSYRKDRLVLSHPNDNAQLLTGM---TFSDGVVGKALK 192
                   +        TL NF  Y K +    +P D   L+TG+   T+S G +     
Sbjct: 61  FEPYIHPINYGYIDAAETLENFNEYVKKKRDYFNP-DVVFLVTGLDMSTYSGGSLQTGTG 119

Query: 193 GPICTYEFSGGVNVDHKNVVGLV---------ATTVAHEMGHNLGMEHDTTECT------ 237
           G    Y + GG   +++  V +            T+ HE+ H LG  HD +         
Sbjct: 120 G----YAYVGGACTENR--VAMGEDTPGSYYGVYTMTHELAHLLGAPHDGSPPPSWVKGH 173

Query: 238 -----CP-SDRCIMAPSSSSVSPTEWSSCSL 262
                CP  D  IM+   +      +S CS 
Sbjct: 174 PGSLDCPWDDGYIMSYVVNGERQYRFSQCSQ 204


>gnl|CDD|222233 pfam13574, Reprolysin_2, Metallo-peptidase family M12B
           Reprolysin-like.  This zinc-binding metallo-peptidase
           has the characteristic binding motif HExxGHxxGxxH of
           Reprolysin-like peptidases of family M12B.
          Length = 173

 Score = 53.4 bits (129), Expect = 1e-08
 Identities = 30/108 (27%), Positives = 41/108 (37%), Gaps = 15/108 (13%)

Query: 163 DRLVLSHPNDNAQLLTGMTFSDGVVGKALKGPICT--YEFSGGVNVDHKNVVGLVATTVA 220
             L+     D   L +  TF  G +G A  G IC   Y+ +G                VA
Sbjct: 63  SNLIGEANYDIGHLFS--TFGGGGLGLAWLGGICQKGYKGTGSTTPSGDPFE---IDVVA 117

Query: 221 HEMGHNLGMEHDTTEC------TCP-SDRCIMAPSSSSVSPTEWSSCS 261
           HE+GH  G  H  +        T P S   IM+ +    + T +S CS
Sbjct: 118 HEIGHQFGANHTFSGGCEGSSATEPGSGSTIMSYAGIC-NNTLFSPCS 164


>gnl|CDD|239799 cd04271, ZnMc_ADAM_fungal, Zinc-dependent metalloprotease,
           ADAM_fungal subgroup. The adamalysin_like or ADAM (A
           Disintegrin And Metalloprotease) family of
           metalloproteases are integral membrane proteases acting
           on a variety of extracellular targets. They are involved
           in shedding soluble peptides or proteins from the cell
           surface. This subfamily contains fungal ADAMs, whose
           precise function has yet to be determined.
          Length = 228

 Score = 51.3 bits (123), Expect = 1e-07
 Identities = 54/193 (27%), Positives = 71/193 (36%), Gaps = 45/193 (23%)

Query: 108 VHRHCKDISNVINALYEK-LNIFIALVGVVVWTEYDEITL----------NVNGDIT--L 154
             R+  +  N  + LYE   NI + L  + +       T           N   DI   L
Sbjct: 23  ARRNILNNVNSASQLYESSFNISLGLRNLTISDASCPSTAVDSAPWNLPCNSRIDIDDRL 82

Query: 155 TNFLSYRKDRLVLSHPNDNAQLLTGMTF--SDGVVGKALKGPICTYEFSGGVNVDHKNVV 212
           + F  +R  +     P+D     T MT   S   VG A  G +C    S   N       
Sbjct: 83  SIFSQWRGQQ-----PDDGNAFWTLMTACPSGSEVGVAWLGQLCRTGASDQGNETVAGTN 137

Query: 213 GLVATT-----VAHEMGHNLGMEHDTTECTCP-----SDRC--------------IMAPS 248
            +V T+      AHE+GH  G  HD T  TC      S +C              IM PS
Sbjct: 138 VVVRTSNEWQVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPS 197

Query: 249 SSSVSPTEWSSCS 261
           SSS   TE+S C+
Sbjct: 198 SSS-GITEFSPCT 209


>gnl|CDD|239798 cd04270, ZnMc_TACE_like,  Zinc-dependent metalloprotease; TACE_like
           subfamily. TACE, the tumor-necrosis factor-alpha
           converting enzyme, releases soluble TNF-alpha from
           transmembrane pro-TNF-alpha.
          Length = 244

 Score = 47.4 bits (113), Expect = 2e-06
 Identities = 35/123 (28%), Positives = 46/123 (37%), Gaps = 36/123 (29%)

Query: 174 AQLLTGMTFSDGVVGKALKGP--------ICT--YEFSGGVNVDHKNVVGLVAT------ 217
           A L T   F  G +G A  G         IC   Y +S G    + N  GL  T      
Sbjct: 104 AHLFTYRDFDMGTLGLAYVGSPRDNSAGGICEKAYYYSNGKKK-YLNT-GLTTTVNYGKR 161

Query: 218 --------TVAHEMGHNLGMEHD--TTECTCPSD----RCIMAPSSSS---VSPTEWSSC 260
                     AHE+GHN G  HD    EC  P +      IM   ++S    +  ++S C
Sbjct: 162 VPTKESDLVTAHELGHNFGSPHDPDIAECA-PGESQGGNYIMYARATSGDKENNKKFSPC 220

Query: 261 SLE 263
           S +
Sbjct: 221 SKK 223


>gnl|CDD|222241 pfam13583, Reprolysin_4, Metallo-peptidase family M12B
           Reprolysin-like.  This zinc-binding metallo-peptidase
           has the characteristic binding motif HExxGHxxGxxH of
           Reprolysin-like peptidases of family M12B.
          Length = 195

 Score = 46.6 bits (111), Expect = 3e-06
 Identities = 45/201 (22%), Positives = 74/201 (36%), Gaps = 26/201 (12%)

Query: 89  LVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEK-LNIFIALVG--VVVWTEYDEIT 145
           L +V D   Y++F  +   V     ++   +N +Y + + I + L+G   +++T      
Sbjct: 7   LAVVADYSYYSIFGGSVDKVKAFINNVVARLNEVYGRNVGISLTLIGDERLIYTTSSADD 66

Query: 146 LNVNGDI--TLTNFLSYRKDRLVLSHPNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGG 203
            N N D+        +  +        N +   L  M  S+   G A  G +C     GG
Sbjct: 67  FNDNRDVLNKRLATFNSWRGSK-----NYDLGHLFTMYTSN--CGLAWLGALCQNAKGGG 119

Query: 204 VNVDHKNVVGLVATTVAHEMGHNLGMEHD-TTECTCPSDRC-------IMAPSSSSVSPT 255
           V    K          AHE+GH  G  HD T+     S          IM+  ++  S T
Sbjct: 120 VARPTKEF-----DIFAHEIGHLFGAAHDCTSSGETASSATEPDSGNTIMSY-ANDPSGT 173

Query: 256 EWSSCSLEYLALSFDHGMDYC 276
            +S  S+ Y+         Y 
Sbjct: 174 YFSPPSIYYIRGVLICSDAYY 194


>gnl|CDD|213029 cd11375, Peptidase_M54, Peptidase family M54, also called
           archaemetzincins or archaelysins.  Peptidase M54
           (archaemetzincin or archaelysin) is a zinc-dependent
           aminopeptidase that contains the consensus zinc-binding
           sequence HEXXHXXGXXH/D and a conserved Met residue at
           the active site, and is thus classified as a metzincin.
           Archaemetzincins, first identified in archaea, are also
           found in bacteria and eukaryotes, including two human
           members, archaemetzincin-1 and -2 (AMZ1 and AMZ2). AMZ1
           is mainly found in the liver and heart while AMZ2 is
           primarily expressed in testis and heart; both have been
           reported to degrade synthetic substrates and peptides.
           The Peptidase M54 family contains an extended metzincin
           concensus sequence of HEXXHXXGX3CX4CXMX17CXXC such that
           a second zinc ion is bound to four cysteines, thus
           resembling a zinc finger. Phylogenetic analysis of this
           family reveals a complex evolutionary process involving
           a series of lateral gene transfer, gene loss and genetic
           duplication events.
          Length = 173

 Score = 42.3 bits (100), Expect = 6e-05
 Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 6/36 (16%)

Query: 215 VATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSS 250
           +     HE+GH  G++H      CP   C+M  S+S
Sbjct: 123 LLKEAVHELGHLFGLDH------CPYYACVMNFSNS 152


>gnl|CDD|239803 cd04276, ZnMc_MMP_like_2, Zinc-dependent metalloprotease; MMP_like
           sub-family 2. A group of bacterial metalloproteinase
           domains similar to matrix metalloproteinases and
           astacin.
          Length = 197

 Score = 40.0 bits (94), Expect = 4e-04
 Identities = 18/66 (27%), Positives = 28/66 (42%), Gaps = 6/66 (9%)

Query: 170 PNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVGL----VATTVAHEMGH 225
            + N     G +  D   G+ LK  +    +SG +  D      L    +   +AHE+GH
Sbjct: 69  HSPNGGWAYGPSVVDPRTGEILKADV--ILYSGFLRQDQLWYEDLLAASLRYLLAHEVGH 126

Query: 226 NLGMEH 231
            LG+ H
Sbjct: 127 TLGLRH 132


>gnl|CDD|224825 COG1913, COG1913, Predicted Zn-dependent proteases [General
           function prediction only].
          Length = 181

 Score = 39.7 bits (93), Expect = 5e-04
 Identities = 15/32 (46%), Positives = 20/32 (62%), Gaps = 6/32 (18%)

Query: 219 VAHEMGHNLGMEHDTTECTCPSDRCIMAPSSS 250
           V HE+GH LG+ H      CP+ RC+M  S+S
Sbjct: 128 VLHELGHLLGLSH------CPNPRCVMNFSNS 153


>gnl|CDD|214743 smart00608, ACR, ADAM Cysteine-Rich Domain. 
          Length = 137

 Score = 37.3 bits (87), Expect = 0.002
 Identities = 19/62 (30%), Positives = 25/62 (40%), Gaps = 7/62 (11%)

Query: 1   MLHCQHLSEKLEYGIETVAILSHSFLNPGGFIIPCRSAIVDLGLNQVDPGLSPDGARCGD 60
            L C ++SE    G     I S+         + C S    LG    D G+  DG +CG 
Sbjct: 73  KLQCTNVSELPLLGEHATVIYSNIG------GLVCWSLDYHLG-TDPDIGMVKDGTKCGP 125

Query: 61  GK 62
           GK
Sbjct: 126 GK 127


>gnl|CDD|239796 cd04268, ZnMc_MMP_like, Zinc-dependent metalloprotease, MMP_like
           subfamily. This group contains matrix metalloproteinases
           (MMPs), serralysins, and the astacin_like family of
           proteases.
          Length = 165

 Score = 35.9 bits (83), Expect = 0.008
 Identities = 15/53 (28%), Positives = 22/53 (41%), Gaps = 6/53 (11%)

Query: 179 GMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEH 231
           G +  D + G+ L   +  Y  S  V      +      T  HE+GH LG+ H
Sbjct: 64  GPSQVDPLTGEILLARVYLY--SSFVEYSGARLRN----TAEHELGHALGLRH 110


>gnl|CDD|237325 PRK13267, PRK13267, archaemetzincin-like protein; Reviewed.
          Length = 179

 Score = 35.4 bits (82), Expect = 0.014
 Identities = 15/32 (46%), Positives = 20/32 (62%), Gaps = 6/32 (18%)

Query: 219 VAHEMGHNLGMEHDTTECTCPSDRCIMAPSSS 250
           V HE+GH LG+EH      C + RC+M  S+S
Sbjct: 129 VTHELGHTLGLEH------CDNPRCVMNFSNS 154


>gnl|CDD|191923 pfam07998, Peptidase_M54, Peptidase family M54.  This is a family
           of metallopeptidases. Two human proteins have been
           reported to degrade synthetic substrates and peptides.
          Length = 176

 Score = 34.8 bits (80), Expect = 0.022
 Identities = 13/36 (36%), Positives = 18/36 (50%), Gaps = 6/36 (16%)

Query: 215 VATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSS 250
           V   V HE+GH  G+ H      C +  C+M  S+S
Sbjct: 126 VVKEVTHELGHTYGLSH------CNNTDCVMNFSNS 155


>gnl|CDD|215908 pfam00413, Peptidase_M10, Matrixin.  The members of this family are
           enzymes that cleave peptides. These proteases require
           zinc for catalysis.
          Length = 159

 Score = 32.6 bits (75), Expect = 0.11
 Identities = 11/31 (35%), Positives = 14/31 (45%), Gaps = 6/31 (19%)

Query: 217 TTVAHEMGHNLGMEHDTTECTCPSDRCIMAP 247
              AHE+GH LG+ H +          IM P
Sbjct: 110 LVAAHEIGHALGLGHSSDP------DAIMYP 134


>gnl|CDD|239802 cd04275, ZnMc_pappalysin_like, Zinc-dependent metalloprotease,
           pappalysin_like subfamily. The pregnancy-associated
           plasma protein A (PAPP-A or pappalysin-1) cleaves
           insulin-like growth factor-binding proteins 4 and 5,
           thereby promoting cell growth by releasing bound growth
           factor. This model includes pappalysins and related
           metalloprotease domains from all three kingdoms of life.
           The three-dimensional structure of an archaeal
           representative, ulilysin, has been solved.
          Length = 225

 Score = 32.7 bits (75), Expect = 0.15
 Identities = 28/109 (25%), Positives = 40/109 (36%), Gaps = 16/109 (14%)

Query: 181 TFSDGVVGKA--LKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEH---DTTE 235
           TF D +V  A    G +       G +    N+      T  HE+GH LG+ H     + 
Sbjct: 105 TFPDSLVSLAFITDGVVINPSSLPGGSAAPYNL----GDTATHEVGHWLGLYHTFQGGSP 160

Query: 236 CTCPSDRCIMAPSSSSVS---PTEWSSCSLEYLALSFDHGMDY----CM 277
           C    D     P+ +S S   P    +C  +       + MDY    CM
Sbjct: 161 CCTTGDYVADTPAEASPSYGCPAGRDTCPGQPGLDPIHNYMDYSDDSCM 209


>gnl|CDD|239805 cd04278, ZnMc_MMP, Zinc-dependent metalloprotease, matrix
           metalloproteinase (MMP) sub-family. MMPs are responsible
           for a great deal of pericellular proteolysis of
           extracellular matrix and cell surface molecules, playing
           crucial roles in morphogenesis, cell fate specification,
           cell migration, tissue repair, tumorigenesis, gain or
           loss of tissue-specific functions, and apoptosis. In
           many instances, they are anchored to cell membranes via
           trans-membrane domains, and their activity is controlled
           via TIMPs (tissue inhibitors of metalloproteinases).
          Length = 157

 Score = 31.8 bits (73), Expect = 0.20
 Identities = 11/32 (34%), Positives = 15/32 (46%), Gaps = 6/32 (18%)

Query: 217 TTVAHEMGHNLGMEHDTTECTCPSDRCIMAPS 248
           +  AHE+GH LG+ H +          IM P 
Sbjct: 109 SVAAHEIGHALGLGHSSDP------DSIMYPY 134


>gnl|CDD|239806 cd04279, ZnMc_MMP_like_1, Zinc-dependent metalloprotease; MMP_like
           sub-family 1. A group of bacterial, archaeal, and fungal
           metalloproteinase domains similar to matrix
           metalloproteinases and astacin.
          Length = 156

 Score = 31.3 bits (71), Expect = 0.29
 Identities = 12/60 (20%), Positives = 18/60 (30%), Gaps = 5/60 (8%)

Query: 199 EFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSSSVSPTEWS 258
                +          +     HE+GH LG+ H +     P D   M PS         +
Sbjct: 88  RTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDR---PED--AMYPSQGQGPDGNPT 142


>gnl|CDD|239819 cd04327, ZnMc_MMP_like_3, Zinc-dependent metalloprotease; MMP_like
           sub-family 3. A group of bacterial and fungal
           metalloproteinase domains similar to matrix
           metalloproteinases and astacin.
          Length = 198

 Score = 31.2 bits (71), Expect = 0.45
 Identities = 8/16 (50%), Positives = 9/16 (56%)

Query: 216 ATTVAHEMGHNLGMEH 231
           +  V HE GH LG  H
Sbjct: 93  SRVVLHEFGHALGFIH 108


>gnl|CDD|215458 PLN02853, PLN02853, Probable phenylalanyl-tRNA synthetase alpha
           chain.
          Length = 492

 Score = 31.2 bits (71), Expect = 0.69
 Identities = 18/92 (19%), Positives = 32/92 (34%), Gaps = 8/92 (8%)

Query: 197 TYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSSS---VS 253
           + +F+    +DH  VVG++ +           ++ +T   T    +     S       +
Sbjct: 21  SGQFAASHGLDHNEVVGVIKSLHGFRYVDAQDIKRETWVLTEEGKKYAAEGSPEVQLFAA 80

Query: 254 PTEWSSCSLEYL-----ALSFDHGMDYCMRNK 280
                S S + L        FD G    M+NK
Sbjct: 81  VPAEGSISKDELQKKLDPAVFDIGFKQAMKNK 112


>gnl|CDD|214576 smart00235, ZnMc, Zinc-dependent metalloprotease.  Neutral zinc
           metallopeptidases. This alignment represents a subset of
           known subfamilies. Highest similarity occurs in the
           HExxH zinc-binding site/ active site.
          Length = 139

 Score = 29.6 bits (67), Expect = 0.91
 Identities = 11/25 (44%), Positives = 14/25 (56%), Gaps = 4/25 (16%)

Query: 218 TVAHEMGHNLGMEHDTTECTCPSDR 242
             AHE+GH LG+ H+       SDR
Sbjct: 87  VAAHELGHALGLYHE----QSRSDR 107


>gnl|CDD|218964 pfam06262, DUF1025, Possibl zinc metallo-peptidase.  This is
           possibly a family of bacterial zinc metallo-peptidases.
           Although they carry the HExxHxxGxxD motif, they are
           missing a final methionine which would class them as
           Met-zincins.
          Length = 96

 Score = 28.6 bits (65), Expect = 1.2
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 206 VDHKNVVGLVATTVAHEMGHNLGM 229
            D + +  LV   V HE+GH+ G+
Sbjct: 63  RDREELGELVRHVVIHEIGHHFGL 86


>gnl|CDD|188450 TIGR03935, fragilysin, fragilysin.  Members of this family are
           fragilysin, the Bacteroides fragilis enterotoxin. This
           enzyme is a Zn metalloprotease. Three distinct subtypes
           included in this family all are produced by
           enterotoxigenic (by definition) strains of Bacteroides
           fragilis [Cellular processes, Pathogenesis].
          Length = 386

 Score = 29.9 bits (67), Expect = 1.4
 Identities = 26/127 (20%), Positives = 50/127 (39%), Gaps = 15/127 (11%)

Query: 114 DISNVINALYEKLNIFIALVGVVV--WTEYDEITLNVNGDITLTNFLSY-RKDRLVLSHP 170
           +++  +      L   +    V+V   TEY   +   +    L  F +  + +     + 
Sbjct: 234 EVTAQMQDAANSLKRLVNNHFVLVEYTTEYSCPSGGADESKGLDGFTASLKSNPKAKGYD 293

Query: 171 NDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVGLVAT------TVAHEMG 224
                L+   T+ + ++G    G + +Y  +   N +     G+  T      T+AHE+G
Sbjct: 294 KQIYILIRWGTWDNNILGI---GWLNSYNVNTASNFE---ASGMSTTQLMYPGTLAHELG 347

Query: 225 HNLGMEH 231
           H LG EH
Sbjct: 348 HILGAEH 354


>gnl|CDD|150526 pfam09865, DUF2092, Predicted periplasmic protein (DUF2092).  This
           domain, found in various hypothetical prokaryotic
           proteins, has no known function.
          Length = 215

 Score = 29.2 bits (66), Expect = 1.8
 Identities = 22/124 (17%), Positives = 52/124 (41%), Gaps = 15/124 (12%)

Query: 51  LSPDGARCGDGKTKRDTVSKHDEIRGPYNANIKSRYLELVIVVDNRLYNLFNKNSKLVHR 110
           +  DG +     +   TV + D +R       +    ++ +  D + + L+  N+ +  +
Sbjct: 30  VLEDGQKLQFAASGDLTVRRPDRLR----VTRRGDGADVELYFDGKTFTLYGPNANVYAQ 85

Query: 111 HCK--DISNVINALYEKLNIFIALVGVVVWTEYDEITLNV-------NGDI--TLTNFLS 159
                 I  +++ L ++L I + L  +++   YDE+   V        G +     + L+
Sbjct: 86  APAPGTIDALVDRLRDRLGIELPLADLLLSDPYDELKDGVTSAKYVGQGVVGGVECDHLA 145

Query: 160 YRKD 163
           +R+D
Sbjct: 146 FRQD 149


>gnl|CDD|153070 cd03600, CLECT_thrombomodulin_like, C-type lectin-like domain
           (CTLD) of the type found in human thrombomodulin(TM),
           Endosialin, C14orf27, and C1qR.
           CLECT_thrombomodulin_like: C-type lectin-like domain
           (CTLD) of the type found in human thrombomodulin(TM),
           Endosialin, C14orf27, and C1qR.  CTLD refers to a domain
           homologous to the carbohydrate-recognition domains
           (CRDs) of the C-type lectins.  In these
           thrombomodulin-like proteins the residues involved in
           coordinating Ca2+ in the classical MBP-A CTLD are not
           conserved.  TM exerts anti-fibrinolytic and
           anti-inflammatory activity.  TM also regulates blood
           coagulation in the anticoagulant protein C pathway.  In
           this pathway, the procoagulant properties of thrombin
           (T) are lost when it binds TM.  TM also plays a key role
           in tumor biology.  It is expressed on endothelial cells
           and on several type of tumor cell including squamous
           cell carcinoma.  Loss of TM expression correlates with
           advanced stage and poor prognosis.  Loss of function of
           TM function may be associated with arterial or venous
           thrombosis and with late fetal loss.  Soluble molecules
           of TM retaining the CTLD are detected in human plasma
           and urine where higher levels indicate injury and/or
           enhanced turnover of the endothelium.  C1qR is expressed
           on endothelial cells and stem cells.  It is also
           expressed on monocots and neutrophils, where it is
           subject to ectodomain shedding.  Soluble forms of C1qR
           retaining the CTLD is detected in human plasma.  C1qR
           modulates the phagocytosis of apoptotic cells in vivo.
           C1qR-deficient mice are defective in clearance of
           apoptotic cells in vivo.  The cytoplasmic tail of C1qR,
           C-terminal to the CTLD of CD93, contains a PDZ binding
           domain which interacts with the PDZ domain-containing
           adaptor protein, GIPC.  The juxtamembrane region of this
           tail interacts with the ezrin/radixin/moesin family.
           Endosialin functions in the growth and progression of
           abdominal tumors and is expressed in the stroma of
           several tumors.
          Length = 141

 Score = 28.6 bits (64), Expect = 2.1
 Identities = 9/29 (31%), Positives = 15/29 (51%), Gaps = 3/29 (10%)

Query: 237 TCPSDRCI-MAPSSSSVSPTEWS--SCSL 262
           TC S RC+ ++ + S+    +W    CS 
Sbjct: 102 TCTSPRCVALSAAGSTPDNLKWKDGPCSA 130


>gnl|CDD|239807 cd04280, ZnMc_astacin_like, Zinc-dependent metalloprotease,
           astacin_like subfamily or peptidase family M12A, a group
           of zinc-dependent proteolytic enzymes with a HExxH
           zinc-binding site/active site. Members of this family
           may have an amino terminal propeptide, which is cleaved
           to yield the active protease domain, which is
           consequently always found at the N-terminus in
           multi-domain architectures. This family includes:
           astacin, a digestive enzyme from Crayfish; meprin, a
           multiple domain membrane component that is constructed
           from a homologous alpha and beta chain, proteins
           involved in (bone) morphogenesis, tolloid from
           drosophila, and the sea urchin SPAN protein, which may
           also play a role in development.
          Length = 180

 Score = 28.7 bits (65), Expect = 2.2
 Identities = 7/14 (50%), Positives = 9/14 (64%)

Query: 218 TVAHEMGHNLGMEH 231
           T+ HE+ H LG  H
Sbjct: 77  TIVHELMHALGFYH 90


>gnl|CDD|201253 pfam00480, ROK, ROK family. 
          Length = 181

 Score = 28.4 bits (64), Expect = 3.3
 Identities = 13/65 (20%), Positives = 19/65 (29%), Gaps = 19/65 (29%)

Query: 198 YEFSGGVNVDHKNVVGLV------------------ATTVAHEMGHNLGMEHDTTECTCP 239
            E   G   D  NV+ +                   A   A E+GH    +     C C 
Sbjct: 109 GEKVFGAGKDVSNVIYVTIGTGIGAGVIINGKLFRGAHGEAGEIGH-PLDDPHGFVCGCG 167

Query: 240 SDRCI 244
           +  C+
Sbjct: 168 NHGCL 172


>gnl|CDD|239804 cd04277, ZnMc_serralysin_like, Zinc-dependent metalloprotease,
           serralysin_like subfamily. Serralysins and related
           proteases are important virulence factors in pathogenic
           bacteria. They may be secreted into the medium via a
           mechanism found in gram-negative bacteria, that does not
           require n-terminal signal sequences which are cleaved
           after the transmembrane translocation. A calcium-binding
           domain c-terminal to the metalloprotease domain, which
           contains multiple tandem repeats of a nine-residue motif
           including the pattern GGxGxD, and which forms a parallel
           beta roll may be involved in the translocation mechanism
           and/or substrate binding. Serralysin family members may
           have a broad spectrum of substrates each, including host
           immunoglobulins, complement proteins, cell matrix and
           cytoskeletal proteins, as well as antimicrobial
           peptides.
          Length = 186

 Score = 28.2 bits (63), Expect = 3.9
 Identities = 9/14 (64%), Positives = 12/14 (85%)

Query: 218 TVAHEMGHNLGMEH 231
           T+ HE+GH LG+EH
Sbjct: 116 TIIHEIGHALGLEH 129


>gnl|CDD|239323 cd03025, DsbA_FrnE_like, DsbA family, FrnE-like subfamily; composed
           of uncharacterized proteins containing a CXXC motif with
           similarity to DsbA and FrnE. FrnE is presumed to be a
           thiol oxidoreductase involved in polyketide
           biosynthesis, specifically in the production of the
           aromatic antibiotics frenolicin and nanaomycins.
          Length = 193

 Score = 28.1 bits (63), Expect = 4.0
 Identities = 10/48 (20%), Positives = 17/48 (35%)

Query: 139 TEYDEITLNVNGDITLTNFLSYRKDRLVLSHPNDNAQLLTGMTFSDGV 186
               E+ L++ G +   N     K   +  H +     LTG  F +  
Sbjct: 29  GGGIEVELHLGGLLPGNNARQITKQWRIYVHWHKARIALTGQPFGEDY 76


>gnl|CDD|240572 cd12952, MMP_ACEL2062, Minimal MMP-like domain found in
           Acidothermus cellulolyticus hypothetical protein
           ACEL2062 and similar protein.  The subfamily includes an
           uncharacterized protein from Acidothermus cellulolyticus
           (ACEL2062) and its homologs from bacteria. Although its
           biological role remains unclear, ACEL2062 contains a
           minimal metalloprotease (MMP)-like domain consisting of
           3-stranded mixed 2-beta sheets and a HExxHxxGxxD/S (x
           could be any amino acid) motif. It may belong to a
           superfamily of bacterial zinc metallo-peptidases, which
           is characterized by a conserved HExxHxxGxxD motif.
          Length = 117

 Score = 27.5 bits (62), Expect = 4.2
 Identities = 9/19 (47%), Positives = 12/19 (63%)

Query: 214 LVATTVAHEMGHNLGMEHD 232
            V  TV HE+GH+ G+  D
Sbjct: 95  EVRHTVLHEIGHHFGLSDD 113


>gnl|CDD|218117 pfam04504, DUF573, Protein of unknown function, DUF573. 
          Length = 98

 Score = 26.9 bits (60), Expect = 4.2
 Identities = 8/38 (21%), Positives = 15/38 (39%), Gaps = 7/38 (18%)

Query: 137 VWTEYDEITLNVNGDITLTNFLSYRKDRLVLSHPNDNA 174
           +W+E DEI L       L   + ++         + +A
Sbjct: 6   LWSEEDEIVL-------LQGMIDFKAKTGKSPSDDTDA 36


>gnl|CDD|234413 TIGR03952, metzin_BF0631, zinc-dependent metalloproteinase
           lipoprotein, BF0631 family.  Members of this protein
           family are zinc-dependent metalloproteinases, related to
           ulilysin and other members of the pappalysin family.
           Members occur as predicted lipoproteins and occur mostly
           in the genera Bacteriodes and Prevotella [Protein fate,
           Degradation of proteins, peptides, and glycopeptides].
          Length = 351

 Score = 28.2 bits (63), Expect = 5.4
 Identities = 9/17 (52%), Positives = 12/17 (70%)

Query: 215 VATTVAHEMGHNLGMEH 231
              T+AHE+GH LG+ H
Sbjct: 232 FNVTLAHELGHYLGLFH 248


>gnl|CDD|216306 pfam01117, Aerolysin, Aerolysin toxin.  This family represents the
           pore forming lobe of aerolysin.
          Length = 359

 Score = 28.2 bits (63), Expect = 6.0
 Identities = 12/48 (25%), Positives = 21/48 (43%), Gaps = 2/48 (4%)

Query: 125 KLNIFIALVGVVVWTEYDEITLNVNGDITLTNFLSYRKDRLVLSHPND 172
           KL I + L    +   Y E   +++ D+ L  FL +  +     HP +
Sbjct: 194 KLPIRVELFKSTIDYPY-EFKADMSYDVELDGFLRWGGNAW-YDHPTN 239


>gnl|CDD|227264 COG4927, COG4927, Predicted choloylglycine hydrolase [General
           function prediction only].
          Length = 336

 Score = 28.0 bits (62), Expect = 6.4
 Identities = 15/67 (22%), Positives = 26/67 (38%), Gaps = 5/67 (7%)

Query: 139 TEYDEITLNVNGDITLTNFLSYRKDRLVLSHPNDNAQLLTGMT-FSDGVVGKALK---GP 194
           T       N     T ++F SY + + +LS   D +  L  +T F+  +     K   G 
Sbjct: 231 TRLRNACTNHFRAATPSSF-SYARQQFILSALEDPSMSLEKLTDFNLSLRSPLYKGKVGA 289

Query: 195 ICTYEFS 201
           I    ++
Sbjct: 290 IHPTLYT 296


>gnl|CDD|239589 cd03512, Alkane-hydroxylase, Alkane hydroxylase is a bacterial,
           integral-membrane di-iron enzyme that shares a
           requirement for iron and oxygen for activity similar to
           that of the non-heme integral-membrane acyl coenzyme A
           (CoA) desaturases and acyl lipid desaturases. The alk
           genes in Pseudomonas oleovorans encode conversion of
           alkanes to acyl CoA. The alkane omega-hydroxylase (AlkB)
           system is responsible for the initial oxidation of
           inactivated alkanes. It is a three-component system
           comprising a soluble NADH-rubredoxin reductase (AlkT), a
           soluble rubredoxin (AlkG), and the integral membrane
           oxygenase (AlkB). AlkB utilizes the oxygen rebound
           mechanism to hydroxylate alkanes. This mechanism
           involves homolytic cleavage of the C-H bond by an
           electrophilic metal-oxo intermediate to generate a
           substrate-based radical. As with other members of this
           superfamily, this domain family has extensive
           hydrophobic regions that would be capable of spanning
           the membrane bilayer at least twice. The active site
           structure of AlkB is not known, however, spectroscopic
           and genetic evidence points to a nitrogen-rich
           coordination environment located in the cytoplasm with
           as many as eight histidines coordinating the two iron
           ions and a carboxylate residue bridging the two metals.
           Like all other members of this superfamily, there are
           eight conserved histidines seen in the histidine cluster
           motifs: HXXXH, HXXXHH, and HXXHH. These histidine
           residues are reported to be catalytically essential and
           proposed to be the ligands for the iron atoms contained
           within the homolog, stearoyl CoA desaturase. Also
           included in this CD are terminal alkane hydroxylases
           (AlkM), xylene monooxygenase hydroxylases (XylM),
           p-cymene monooxygenase hydroxylases (CymAa), and other
           related proteins.
          Length = 314

 Score = 28.0 bits (63), Expect = 6.5
 Identities = 5/15 (33%), Positives = 9/15 (60%)

Query: 211 VVGLVATTVAHEMGH 225
           + G++    AHE+ H
Sbjct: 82  LSGVIGINTAHELIH 96


>gnl|CDD|218790 pfam05876, Terminase_GpA, Phage terminase large subunit (GpA).
           This family consists of several phage terminase large
           subunit proteins as well as related sequences from
           several bacterial species. The DNA packaging enzyme of
           bacteriophage lambda, terminase, is a heteromultimer
           composed of a small subunit, gpNu1, and a large subunit,
           gpA, products of the Nu1 and A genes, respectively.
           Terminase is involved in the site-specific binding and
           cutting of the DNA in the initial stages of packaging.
           It is now known that gpA is actively involved in late
           stages of packaging, including DNA translocation, and
           that this enzyme contains separate functional domains
           for its early and late packaging activities.
          Length = 552

 Score = 28.0 bits (63), Expect = 7.3
 Identities = 15/46 (32%), Positives = 20/46 (43%), Gaps = 9/46 (19%)

Query: 133 VGVVVWTEYDEITLNVNGD---ITL-----TNFLSYRKDRLVLSHP 170
           V  V+  E D    +V+G+   I+L       F S RK  L  S P
Sbjct: 135 VRYVILDEVDAYPEDVDGEGDPISLAEKRTETFGSRRK-ILAGSTP 179


>gnl|CDD|222105 pfam13402, M60-like, Peptidase M60-like family.  Members of this
           family are related to the Enhancin peptidase family.
           Therefore these proteins may act as peptidases.
          Length = 303

 Score = 27.2 bits (61), Expect = 8.7
 Identities = 9/24 (37%), Positives = 10/24 (41%), Gaps = 4/24 (16%)

Query: 219 VAHEMGHNLGME----HDTTECTC 238
             HE+GHN          TTE T 
Sbjct: 219 PWHELGHNHQQGPWTWDGTTEVTN 242


>gnl|CDD|189008 cd09601, M1_APN_2, Peptidase M1 Aminopeptidase N family incudes
           tricorn interacting factor F3, Endoplasmic reticulum
           aminopeptidase 1 (ERAP1), Aminopeptidase Q (APQ).  This
           M1 peptidase family includes eukaryotic and bacterial
           members: aminopeptidase N (APN), aminopeptidase Q (APQ,
           laeverin), endoplasmic reticulum aminopeptidase 1
           (ERAP1) as well as tricorn interacting factor F3.
           Aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease,
           consists of a small N-terminal cytoplasmic domain, a
           single transmembrane domain and a large extracellular
           ectodomain that contains the active site. It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and is present in a variety
           of human tissues and cell types (leukocyte, fibroblast,
           endothelial and epithelial cells). APN expression is
           dysregulated in inflammatory diseases such as chronic
           pain, rheumatoid arthritis, multiple sclerosis, systemic
           sclerosis, systemic lupus erythematosus,
           polymyositis/dermatomyosytis and pulmonary sarcoidosis,
           and is enhanced in tumor cells such as melanoma, renal,
           prostate, pancreas, colon, gastric and thyroid cancers.
           It is considered a marker of differentiation since it is
           predominantly expressed on stem cells and on cells of
           the granulocytic and monocytic lineages at distinct
           stages of differentiation. Thus, APN inhibition may lead
           to the development of anti-cancer and anti-inflammatory
           drugs. ERAP1 also known as endoplasmic reticulum
           aminopeptidase associated with antigen processing
           (ERAAP), adipocyte derived leucine aminopeptidase
           (A-LAP) or aminopeptidase regulating tumor necrosis
           factor receptor I (THFRI) shedding (ARTS-1), associates
           with the closely related ER aminopeptidase ERAP2, for
           the final trimming of peptides within the ER for
           presentation by MHC class I molecules. ERAP1 is
           associated with ankylosing spondylitis (AS), an
           inflammatory arthritis that predominantly affects the
           spine. ERAP1 also aids in the shedding of membrane-bound
           cytokine receptors. The tricorn interacting factor F3,
           together with factors F1 and F2, degrades the tricorn
           protease products, producing free amino acids, thus
           completing the proteasomal degradation pathway. F3 is
           homologous to F2, but not F1, and shows a strong
           preference for glutamate in the P1' position. APQ, also
           known as laeverin, is specifically expressed in human
           embryo-derived extravillous trophoblasts (EVTs) that
           invade the uterus during early placentation. It cleaves
           the N-terminal amino acid of various peptides such as
           angiotensin III, endokinin C, and kisspeptin-10, all
           expressed in the placenta in large quantities. APN is a
           receptor for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs are also putative Cry toxin
           receptors. Cry1 proteins are pore-forming toxins that
           bind to the midgut epithelial cell membrane of
           susceptible insect larvae, causing extensive damage.
           Several different toxins, including Cry1Aa, Cry1Ab,
           Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to
           bind to APNs; however, a direct role of APN in
           cytotoxicity has been yet to be firmly established.
          Length = 446

 Score = 27.5 bits (62), Expect = 8.9
 Identities = 10/17 (58%), Positives = 11/17 (64%), Gaps = 4/17 (23%)

Query: 215 VATTVAHEMGH----NL 227
           VAT VAHE+ H    NL
Sbjct: 286 VATVVAHELAHQWFGNL 302


>gnl|CDD|100081 cd06160, S2P-M50_like_2, Uncharacterized homologs of Site-2
           protease (S2P), zinc metalloproteases (MEROPS family
           M50) which cleave transmembrane domains of substrate
           proteins, regulating intramembrane proteolysis (RIP) of
           diverse signal transduction mechanisms. Members of the
           S2P/M50 family of RIP proteases use proteolytic activity
           within the membrane to transfer information across
           membranes to integrate gene expression with physiologic
           stresses occurring in another cellular compartment. In
           eukaryotic cells they regulate such processes as sterol
           and lipid metabolism, and endoplasmic reticulum stress
           responses. In prokaryotes they regulate such processes
           as sporulation, cell division, stress response, and cell
           differentiation. This group includes bacterial,
           eukaryotic, and Archaeal S2P/M50s homologs with
           additional putative N- and C-terminal transmembrane
           spanning regions, relative to the core protein, and no
           PDZ domains.
          Length = 183

 Score = 26.8 bits (60), Expect = 9.2
 Identities = 8/18 (44%), Positives = 10/18 (55%)

Query: 212 VGLVATTVAHEMGHNLGM 229
           + L+A    HEMGH L  
Sbjct: 38  LALLAILGIHEMGHYLAA 55


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.136    0.417 

Gapped
Lambda     K      H
   0.267   0.0705    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,279,758
Number of extensions: 1332859
Number of successful extensions: 1138
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1117
Number of HSP's successfully gapped: 53
Length of query: 285
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 189
Effective length of database: 6,679,618
Effective search space: 1262447802
Effective search space used: 1262447802
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (26.2 bits)