BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 013429
         (443 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|A2Q1V6|ATG4_MEDTR Cysteine protease ATG4 OS=Medicago truncatula GN=ATG4 PE=3 SV=1
          Length = 487

 Score =  556 bits (1432), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 282/412 (68%), Positives = 324/412 (78%), Gaps = 5/412 (1%)

Query: 34  GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
           G +  V+++V+ GSMRR  ERVLG  RT +SSS  DIWLLGVCHKI+Q E+ GD    N 
Sbjct: 79  GWAAAVRKVVSGGSMRRFQERVLGSCRTDVSSSDGDIWLLGVCHKISQHESTGDVDIRNV 138

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
            A F QDF SRILI+YRKGFD I DSK TSDV WGCMLRSSQMLVAQALLFH+LGR WRK
Sbjct: 139 FAAFEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRK 198

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
            + KP D+EY++IL LFGDSE + FSIHNLLQAGK YGLA GSWVGPYAMCR+WE LAR 
Sbjct: 199 TVDKPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLARN 258

Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
           QR +   G Q LPMAIYVVSGDEDGERGGAPVVCI+DA + C  FS+G   WTP+LLLVP
Sbjct: 259 QREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCIEDACKRCLEFSRGLVPWTPLLLLVP 318

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           LVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ + A YLDPH+V+PV+N
Sbjct: 319 LVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQNDKAFYLDPHEVKPVVN 378

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
           I  D  E +TS+YH ++ RH+ LDSIDPSLAIGFYCRDKDDFDDFC+RA+KLAEESNGAP
Sbjct: 379 ITGDTQEPNTSSYHCNISRHMPLDSIDPSLAIGFYCRDKDDFDDFCSRATKLAEESNGAP 438

Query: 394 LFTVTQTHKKP--VNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
           LFTV Q+   P  V  + V G+     EDDSL +  +NDA    +EDDWQ L
Sbjct: 439 LFTVAQSRSLPMQVTSNSVSGDDTRFEEDDSLSMNLVNDA---GNEDDWQFL 487


>sp|Q8S929|ATG4A_ARATH Cysteine protease ATG4a OS=Arabidopsis thaliana GN=ATG4A PE=2 SV=1
          Length = 467

 Score =  520 bits (1340), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 244/377 (64%), Positives = 306/377 (81%), Gaps = 3/377 (0%)

Query: 34  GSSETVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
           G +  VKR+ + +G++RR  ERVLGP+RTG+ S+TSD+WLLGVC+KI+ DE  G+     
Sbjct: 74  GWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGETDTGT 133

Query: 93  GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
            LA    DFSS+IL++YRKGF+P  D+  TSDV WGCM+RSSQML AQALLFHRLGR W 
Sbjct: 134 VLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWT 193

Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
           K  + P ++EY+E L  FGDSE S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA 
Sbjct: 194 KKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLAC 252

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
            +R +T    Q+LPMA+++VSG EDGERGGAP++CI+DA++ C  FSKGQ++WTPI+LLV
Sbjct: 253 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPIILLV 312

Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
           PLVLGL+ VNPRYIP+L  TFTFPQS+GI+GGKPGASTYIVGVQE+   YLDPH+VQ V+
Sbjct: 313 PLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 372

Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
            + K+  + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFDDFC RA KLAEESNGA
Sbjct: 373 TVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFCLRALKLAEESNGA 432

Query: 393 PLFTVTQTHKKPVNHSD 409
           PLFTVTQTH   +N S+
Sbjct: 433 PLFTVTQTHTA-INQSN 448


>sp|Q9M1Y0|ATG4B_ARATH Cysteine protease ATG4b OS=Arabidopsis thaliana GN=ATG4B PE=1 SV=1
          Length = 477

 Score =  498 bits (1282), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 247/401 (61%), Positives = 312/401 (77%), Gaps = 10/401 (2%)

Query: 43  VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
           + +G++RR  +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+  +A     LA F QDFS
Sbjct: 87  MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 146

Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
           S IL++YR+GF+PIGD+  TSDV WGCMLRS QML AQALLF RLGR WRK   +P D +
Sbjct: 147 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 206

Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
           Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR  + ET    
Sbjct: 207 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 266

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
           +S  MA+++VSG EDGERGGAP++CI+D ++ C  FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 267 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 326

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
           PRYIP+L  TFTFPQSLGI+GGKPGASTYIVGVQE+   YLDPHDVQ V+ + K++ + D
Sbjct: 327 PRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVD 386

Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 402
           TS+YH + +R++ L+S+DPSLA+GFYC+ KDDFDDFC RA+KLA +SNGAPLFTVTQ+H+
Sbjct: 387 TSSYHCNTLRYVPLESLDPSLALGFYCQHKDDFDDFCIRATKLAGDSNGAPLFTVTQSHR 446

Query: 403 KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
           +  N   +   +        +         G  HEDDWQLL
Sbjct: 447 R--NDCGIAETSSSTETSTEIS--------GEEHEDDWQLL 477


>sp|Q7XPW8|ATG4B_ORYSJ Cysteine protease ATG4B OS=Oryza sativa subsp. japonica GN=ATG4B
           PE=2 SV=1
          Length = 478

 Score =  492 bits (1266), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 256/435 (58%), Positives = 324/435 (74%), Gaps = 17/435 (3%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + N+S  S       S  ++R+V +GSM R     LG S+   SS   D+W L
Sbjct: 56  FEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF----LGTSKVLTSS---DVWFL 108

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE   FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228

Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           I GVQ++ A+YLDPH+VQ  ++I  D++EADTS+YH   +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSM 428
           KDDFDDFC+RA++L +++NGAPLFTV Q+    K+  N  DVLG +G    D ++ V  +
Sbjct: 409 KDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG----DGNINVEDL 464

Query: 429 NDAVGNAHEDDWQLL 443
            DA G   E++WQ+L
Sbjct: 465 -DASGETGEEEWQIL 478


>sp|Q2XPP4|ATG4B_ORYSI Cysteine protease ATG4B OS=Oryza sativa subsp. indica GN=ATG4B PE=1
           SV=2
          Length = 478

 Score =  487 bits (1253), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 254/435 (58%), Positives = 322/435 (74%), Gaps = 17/435 (3%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + N+S  S          ++R+V +GSM R     LG S+   SS   D+W L
Sbjct: 56  FEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF----LGTSKVLTSS---DVWFL 108

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE   FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228

Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           I GVQ++ A+YLDPH+VQ  ++I  D++EADTS+YH   +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSM 428
           KDDFDDFC+RA++L +++NGAPLFTV Q+    K+  N  DVLG +G    D ++ V  +
Sbjct: 409 KDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG----DGNINVEDL 464

Query: 429 NDAVGNAHEDDWQLL 443
            DA G   E++WQ+L
Sbjct: 465 -DASGETGEEEWQIL 478


>sp|A2XHJ5|ATG4A_ORYSI Cysteine protease ATG4A OS=Oryza sativa subsp. indica GN=ATG4A PE=3
           SV=1
          Length = 473

 Score =  471 bits (1213), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 247/434 (56%), Positives = 312/434 (71%), Gaps = 16/434 (3%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + +R L         S  ++R+   GSM R     LG S+   + ++SD+W L
Sbjct: 52  FEAHQDSSAHRPLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 104

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E    +   +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 105 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 164

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WRKP QKP+  EY+ ILH+FGDSE   FSIHNLLQAGK+YGLA
Sbjct: 165 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLA 224

Query: 194 AGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L R  R   E   G  + PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 225 AGSWVGPYAMCRAWQTLVRTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVA 284

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 285 AQLCCDFNKGQSTWSPILLLVPLVLGLDKLNPRYIPLLKETFTFPQSLGILGGKPGTSTY 344

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           + GVQ++  +YLDPH+VQ  ++I  D+LEADTS+YH   +R + LD IDPSLAIGFYCRD
Sbjct: 345 VAGVQDDRVLYLDPHEVQLAVDIAADNLEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 404

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMN-- 429
           KDDFDDFC+RAS+L +++NGAPLFTV Q+ +      +    +G     D + ++++   
Sbjct: 405 KDDFDDFCSRASELVDKANGAPLFTVMQSVQPSKQMYNEESSSG-----DGMDIINVEGL 459

Query: 430 DAVGNAHEDDWQLL 443
           D  G   E++WQ+L
Sbjct: 460 DGSGETGEEEWQIL 473


>sp|Q75KP8|ATG4A_ORYSJ Cysteine protease ATG4A OS=Oryza sativa subsp. japonica GN=ATG4A
           PE=3 SV=1
          Length = 474

 Score =  469 bits (1206), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 252/435 (57%), Positives = 312/435 (71%), Gaps = 18/435 (4%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + NRSL         S  ++R+   GSM R     LG S+   + ++SD+W L
Sbjct: 53  FEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 105

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E    +   +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 106 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 165

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WRKP QKP+  EY+ ILH+FGDSE   FSIHNLLQAGK+YGLA
Sbjct: 166 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLA 225

Query: 194 AGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L    R   E   G  + PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 226 AGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVA 285

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T TFPQSLGI+GGKPG STY
Sbjct: 286 AQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETLTFPQSLGILGGKPGTSTY 345

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           I GVQ++ A+YLDPH+VQ  ++I  D+LEA TS+YH   +R + LD IDPSLAIGFYCRD
Sbjct: 346 IAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSLAIGFYCRD 405

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSM 428
           KDDFDDFC+RAS+L +++NGAPLFTV Q+    K+  N     G+  G+   DS+ V  +
Sbjct: 406 KDDFDDFCSRASELVDKANGAPLFTVVQSVQPSKQMYNEESSSGD--GM---DSINVEGL 460

Query: 429 NDAVGNAHEDDWQLL 443
            D  G   E++WQ+L
Sbjct: 461 -DGSGETGEEEWQIL 474


>sp|Q8BGE6|ATG4B_MOUSE Cysteine protease ATG4B OS=Mus musculus GN=Atg4b PE=1 SV=2
          Length = 393

 Score =  207 bits (527), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
               A + C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ ++DF+D+C +  KL++     P+F + +     + 
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CQDVLN 366


>sp|Q6DG88|ATG4B_DANRE Cysteine protease ATG4B OS=Danio rerio GN=atg4b PE=2 SV=2
          Length = 394

 Score =  206 bits (524), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 119/338 (35%), Positives = 176/338 (52%), Gaps = 18/338 (5%)

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
           D +SR+  +YRK F PIG +  TSD GWGCMLR  QM++ +AL+   LGR W+    +  
Sbjct: 45  DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104

Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
             EYV IL+ F D + S +SIH + Q G   G + G W GP          A+  SW  L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 265
           A     +  +  + +     +     D +RG       P     D    C++  +  A W
Sbjct: 165 AVHVAMDNTVVIEEIKR---LCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETALW 221

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
            P++LL+PL LGL  +N  YI  L+  F  PQSLG++GGKP ++ Y +G   +  IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281

Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           H  QP ++  +D    D S +       +H+  +DPS+A GF+C+ +DDFDD+CA+  K+
Sbjct: 282 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 341

Query: 386 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
           +    G P+F +  +    +  +DVL  T    + D L
Sbjct: 342 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 378


>sp|Q9Y4P1|ATG4B_HUMAN Cysteine protease ATG4B OS=Homo sapiens GN=ATG4B PE=1 SV=2
          Length = 393

 Score =  204 bits (519), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CPDVLN 366


>sp|Q8C9S8|ATG4A_MOUSE Cysteine protease ATG4A OS=Mus musculus GN=Atg4a PE=2 SV=2
          Length = 396

 Score =  201 bits (510), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 112/353 (31%), Positives = 176/353 (49%), Gaps = 50/353 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
           D  + C V   G AD                     W P+LL+VPL LG+ ++NP Y+  
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
            +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++I +  L  D + +  
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
              + + + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352


>sp|Q6PZ03|ATG4B_BOVIN Cysteine protease ATG4B OS=Bos taurus GN=ATG4B PE=2 SV=1
          Length = 393

 Score =  200 bits (508), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 122/359 (33%), Positives = 175/359 (48%), Gaps = 26/359 (7%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L  F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
            Q G   G + G W GP  + +  + LA      + L          V++      R   
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187

Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
           P    +    D+ RHC+ F          A W P++LL+PL LGL  VN  Y  TL+  F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           + +  +DPS+A+GF+C  +DDF+D+C + SKL+      P+F + +     +   DVL 
Sbjct: 308 MSIAELDPSIAVGFFCETEDDFNDWCQQVSKLSLLGGALPMFELVEQQPSHLACPDVLN 366


>sp|Q640G7|ATG4B_XENLA Cysteine protease ATG4B OS=Xenopus laevis GN=atg4b PE=2 SV=1
          Length = 384

 Score =  199 bits (505), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 115/343 (33%), Positives = 169/343 (49%), Gaps = 36/343 (10%)

Query: 99  QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
            D +SR+  +YR+ F  IG +  TSD GWGCMLR  QM+ AQAL+   +GR WR   QKP
Sbjct: 44  NDITSRLWFTYRRNFQAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHVGRDWRWDKQKP 103

Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 218
              EY+ IL  F D + S +SIH + Q G   G   G W GP  + +    LA   +  +
Sbjct: 104 -KGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKYIGQWYGPNTVAQVLRKLAVFDQWSS 162

Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-------------- 264
                   +A+++   +          V +D+  R C   S   +D              
Sbjct: 163 --------IAVHIAMDN---------TVVVDEIRRLCRAGSGESSDAGALSNGYTGDSDP 205

Query: 265 ----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
               W P++LL+PL LGL ++N  YI TL+  F  PQSLG++GG+P ++ Y +G   +  
Sbjct: 206 SCAQWKPLVLLIPLRLGLSEINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDEL 265

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
           IYLDPH  Q  +         D S +       +H+  IDPS+A+GF+C  ++DF+D+C 
Sbjct: 266 IYLDPHTTQLSVEPSDCSFIEDESFHCQHPPCRMHVSEIDPSIAVGFFCSSQEDFEDWCQ 325

Query: 381 RASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
              KL+      P+F V       +++ DVL  T    + D L
Sbjct: 326 HIKKLSLSGGALPMFEVVDQLPLHLSNPDVLNLTPDSSDADRL 368


>sp|Q5R699|ATG4A_PONAB Cysteine protease ATG4A OS=Pongo abelii GN=ATG4A PE=2 SV=1
          Length = 398

 Score =  196 bits (498), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 178/356 (50%), Gaps = 53/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 285
           D  + C V                     SKG +     W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRVLPLGADTAGDRPPDSLTASNLSKGTSAYCSAWKPLLLIVPLRLGINQINPVY 240

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           +   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++ G++    D + 
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTGENGTVNDQTF 300

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +     + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>sp|Q8WYN0|ATG4A_HUMAN Cysteine protease ATG4A OS=Homo sapiens GN=ATG4A PE=1 SV=1
          Length = 398

 Score =  194 bits (494), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 197

Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
           +R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +  F  
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
           PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 311 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>sp|Q6PZ02|ATG4B_CHICK Cysteine protease ATG4B OS=Gallus gallus GN=ATG4B PE=2 SV=1
          Length = 393

 Score =  192 bits (488), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 119/383 (31%), Positives = 177/383 (46%), Gaps = 59/383 (15%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD GWGC
Sbjct: 25  VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR    K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 74  MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVG 133

Query: 190 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 224
            G + G W GP  + +         +W +LA                 CQ   +  G  +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193

Query: 225 LPMA----IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
            P      +Y    +E G R    +                   W P++LL+PL LGL +
Sbjct: 194 CPAVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234

Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
           +N  YI TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +        
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCL 294

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
            D S +       + +  +DPS+A+GF+C  ++DF+D+C +  KL+      P+F + + 
Sbjct: 295 PDESFHCQHPPCRMSIAELDPSIAVGFFCHTEEDFNDWCHQIKKLSLVRGALPMFELVER 354

Query: 401 HKKPVNHSDVLGETGGVPEDDSL 423
                ++ DVL  T    + D L
Sbjct: 355 QPSHFSNPDVLNLTPDSSDADRL 377


>sp|Q6PZ05|ATG4A_BOVIN Cysteine protease ATG4A OS=Bos taurus GN=ATG4A PE=2 SV=1
          Length = 398

 Score =  192 bits (487), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 111/344 (32%), Positives = 175/344 (50%), Gaps = 29/344 (8%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 238
            G + G W GP          A+   W +LA     +  +  + +      +S   D   
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
           ER    +     AS      S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 357
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  +++  AD  T+H     + +++ 
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFHCLQPPQRMNIL 312

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 313 NLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>sp|Q8BGV9|ATG4D_MOUSE Cysteine protease ATG4D OS=Mus musculus GN=Atg4d PE=1 SV=1
          Length = 474

 Score =  178 bits (452), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 174/382 (45%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S + L G C+            G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
             +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193

Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R      C  +   +  VS D          V   D +R  S +    A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ +     +  ++H    R +    +DPS  +GFY  ++ +F+  C+   +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436


>sp|Q684M2|ATG4D_PIG Cysteine protease ATG4D OS=Sus scrofa GN=ATG4D PE=3 SV=1
          Length = 469

 Score =  177 bits (449), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 110/378 (29%), Positives = 178/378 (47%), Gaps = 62/378 (16%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------KPLQKPF----------- 159
             +TSD GWGCMLRS QM++AQ LL H L R W          P   P            
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSQGVGLGPPESSPNRYRGPAHWMPP 192

Query: 160 -----------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
                      +R + +I+  F D   +PF +H L++ G++ G  AG W GP        
Sbjct: 193 HWVQAAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------S 245

Query: 209 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 268
            +A   R       +   + +YV       +   A +V   D +          A+W  +
Sbjct: 246 LVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKAV 295

Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           ++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDPH  
Sbjct: 296 VILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYC 355

Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +++   
Sbjct: 356 QPTVDVSQADFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTRVLSS 413

Query: 389 SNGA---PLFTVTQTHKK 403
           S+     P+FT+ + H +
Sbjct: 414 SSATERYPMFTLVEGHAQ 431


>sp|A6SDQ3|ATG4_BOTFB Probable cysteine protease atg4 OS=Botryotinia fuckeliana (strain
           B05.10) GN=atg4 PE=3 SV=1
          Length = 439

 Score =  176 bits (446), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 106/310 (34%), Positives = 159/310 (51%), Gaps = 51/310 (16%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF ++I ++YR  F  I  S+                        TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
            Q L+A ALL  R+GR WR+ +    +R+   IL LF D   +P+SIH  ++ G  A G 
Sbjct: 163 GQSLLANALLTLRMGREWRRGVSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  +AL+  Q            + +Y+ +GD      G+ V       
Sbjct: 220 HPGEWFGPSATARCIQALSNSQAKSE--------LRVYI-TGD------GSDVY----ED 260

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
           +  S+     +D+TP L+LV   LGL+K+ P Y   L+ +   PQS+GI GG+P +S Y 
Sbjct: 261 KFMSIAKPNHSDFTPTLILVGTRLGLDKITPVYWEALKYSLQMPQSVGIAGGRPSSSHYF 320

Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLE----ADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
           +GVQE    YLDPH  +P +   KD++E     D  + H+  +R +H+  +DPS+ I F 
Sbjct: 321 IGVQESDFFYLDPHQTRPALPY-KDNVEDYTTEDIDSCHTRRLRRLHIKEMDPSMLIAFL 379

Query: 369 CRDKDDFDDF 378
            RD++D++++
Sbjct: 380 IRDENDWNEW 389


>sp|Q6GPU1|ATG4A_XENLA Cysteine protease ATG4A OS=Xenopus laevis GN=atg4a PE=2 SV=1
          Length = 397

 Score =  176 bits (445), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 107/320 (33%), Positives = 163/320 (50%), Gaps = 21/320 (6%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
            +   D  SR+  +YRK F PIG +  +SD GWGCMLR  QM++AQAL+   LGR WR  
Sbjct: 45  CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALVCQHLGRDWRWE 104

Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
             K    EY +IL  F D +   +SIH + Q G   G + G W GP  + +  + LA   
Sbjct: 105 KHKNHPEEYQQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 164

Query: 215 RAETGLGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSKGQ----- 262
              +        +A+Y      VV  D        P  C +  A+ H S +S+ +     
Sbjct: 165 EWNS--------LAVYVSMDNTVVVEDIKTMCKYQPQSCSMAQAASHQSTWSRCRDTSGH 216

Query: 263 -ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
            + W P+LL+VPL LG+  +NP Y+   +  F  PQSLG +GGKP  + Y +G   +  I
Sbjct: 217 CSGWRPLLLVVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEII 276

Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
           YLDPH  Q  ++  +     D + +       + + ++DPS+A+GF+C+D++DF+++C  
Sbjct: 277 YLDPHTTQTFVDTEEAGTVQDQTYHCQKGPNSMKVLNLDPSVALGFFCKDENDFNNWCEV 336

Query: 382 ASKLAEESNGAPLFTVTQTH 401
             K   +     +F +T  H
Sbjct: 337 IEKEILKHQSLRMFELTPKH 356


>sp|Q5ZIW7|ATG4A_CHICK Cysteine protease ATG4A OS=Gallus gallus GN=ATG4A PE=2 SV=1
          Length = 380

 Score =  175 bits (444), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 171/356 (48%), Gaps = 52/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H + +D++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 12  VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 60

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W+    K    EY  ILH F D +   +SIH + Q G  
Sbjct: 61  MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 120

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 121 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 163

Query: 250 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
           D  + C                  S   + +        W P+LL++PL LG+  +NP Y
Sbjct: 164 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 223

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 224 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 283

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +       + + ++DPS+A+GF+C+++ DFD++C+   K   +     +F + Q H
Sbjct: 284 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 339


>sp|Q86TL0|ATG4D_HUMAN Cysteine protease ATG4D OS=Homo sapiens GN=ATG4D PE=2 SV=1
          Length = 474

 Score =  172 bits (436), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R           + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436


>sp|A2QY50|ATG4_ASPNC Probable cysteine protease atg4 OS=Aspergillus niger (strain CBS
           513.88 / FGSC A1513) GN=atg4 PE=3 SV=1
          Length = 404

 Score =  171 bits (433), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 122/397 (30%), Positives = 181/397 (45%), Gaps = 72/397 (18%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 96
           +RI + +  P         S IW LG+ +   +D    +    N   E            
Sbjct: 11  KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKDANTRETPDKNNTRENVMGTTNYRKPS 70

Query: 97  -------FNQDFSSRILISYRKGFDPI----GDSK-------------------ITSDVG 126
                  F  DF SRI ++YR  F PI    GD K                    TSD G
Sbjct: 71  EHAWPESFLLDFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTG 130

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+RS Q L+A AL    LGR WR+  +  F+ E  ++L LF D+ T+PFS+H  ++ 
Sbjct: 131 WGCMIRSGQSLLANALSMLVLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKH 187

Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G ++ G   G W GP A  +  EAL+          C +  + +YV +   +  +     
Sbjct: 188 GAESCGKYPGEWFGPSATAKCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK--- 236

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
               D +R+ S        + P L+L+   LG++ + P Y   L+    FPQS+GI GG+
Sbjct: 237 --FMDIARNTS------GAFQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGR 288

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           P AS Y VG Q     YLDPH  +P +     G+   + +  TYH+  +R IH+  +DPS
Sbjct: 289 PSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVDTYHTRRLRRIHVRDMDPS 348

Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
           + IGF  R+++D+ D+  R     E   G P+  V +
Sbjct: 349 MLIGFLIRNQEDWADWLKR----IEAVKGRPIIHVLK 381


>sp|A7F045|ATG4_SCLS1 Probable cysteine protease atg4 OS=Sclerotinia sclerotiorum (strain
           ATCC 18683 / 1980 / Ss-1) GN=atg4 PE=3 SV=2
          Length = 439

 Score =  169 bits (428), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 104/309 (33%), Positives = 150/309 (48%), Gaps = 49/309 (15%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF ++I ++YR  F  I  S+                        TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
            Q L+A ALL  R+GR WR+      +R+   IL LF D   +P+SIH  ++ G  A G 
Sbjct: 163 GQSLLANALLTLRMGREWRRGSSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP        A ARC +A T    +S  + +Y+     D           +D  
Sbjct: 220 HPGEWFGP-------SAAARCIQALTNSQVES-ELRVYITGDGSD---------VYEDT- 261

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
              S+       +TP L+LV   LGL+K+ P Y   L+ +   PQS+GI GG+P +S Y 
Sbjct: 262 -FMSIAKPNSTKFTPTLILVGTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYF 320

Query: 313 VGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           +GVQE    YLDPH  +P +      +D    D  + H+  +R +H+  +DPS+ I F  
Sbjct: 321 IGVQESDFFYLDPHQTRPALPFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDPSMLIAFLI 380

Query: 370 RDKDDFDDF 378
           RD++D+ D+
Sbjct: 381 RDENDWKDW 389


>sp|Q811C2|ATG4C_MOUSE Cysteine protease ATG4C OS=Mus musculus GN=Atg4c PE=2 SV=2
          Length = 458

 Score =  166 bits (420), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 117/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    ++           A+ D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155

Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
            F          DRE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H K  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429


>sp|A1CJ08|ATG4_ASPCL Probable cysteine protease atg4 OS=Aspergillus clavatus (strain
           ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 / NRRL 1)
           GN=atg4 PE=3 SV=1
          Length = 400

 Score =  166 bits (420), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 106/313 (33%), Positives = 150/313 (47%), Gaps = 49/313 (15%)

Query: 96  EFNQDFSSRILISYRKGFDPIG----------------------DSK-ITSDVGWGCMLR 132
           EF  D  SRI I+YR  F PI                       DS+  TSD GWGCM+R
Sbjct: 75  EFLDDVESRIWITYRSNFTPIPKPPNQEANPAMTLTVHLRSQLMDSQGFTSDTGWGCMIR 134

Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 191
           S Q L+A A+L   LGR WR+  +   +    ++LH F D   +PFSIH  +Q G  +  
Sbjct: 135 SGQSLLANAMLILLLGRDWRRGTEAGKE---AQLLHQFADHPEAPFSIHRFVQHGAEFCN 191

Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
              G W GP A  R  +AL     A+ G    S  + +Y+     D        +  D  
Sbjct: 192 KYPGEWFGPSATARCIQALV----AQQG----SSELRVYITDDTAD--------IYEDKF 235

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           +R   +      D+ P L+LV   LG++ V P Y   L+     PQS+GI GG+P AS Y
Sbjct: 236 AR---IAQAEHGDFIPTLILVGTRLGIDHVTPAYWDALKEALQLPQSVGIAGGRPSASHY 292

Query: 312 IVGVQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
            +GV  +   YLDPH  +P     ++       + +TYH+  +R IH+  +DPS+ IGF 
Sbjct: 293 FIGVHGQYLFYLDPHHTRPASLHQDVNDTLTHEEVNTYHTRRLRRIHIKDMDPSMLIGFI 352

Query: 369 CRDKDDFDDFCAR 381
            R ++D+ D+  R
Sbjct: 353 IRSREDWTDWKTR 365


>sp|Q2U5B0|ATG4_ASPOR Probable cysteine protease atg4 OS=Aspergillus oryzae (strain ATCC
           42149 / RIB 40) GN=atg4 PE=3 SV=2
          Length = 407

 Score =  166 bits (419), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 120/386 (31%), Positives = 169/386 (43%), Gaps = 71/386 (18%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA-----------QDE------ALG 86
           +RI + +  P         + IW LGV +     KI            QDE       + 
Sbjct: 11  KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPGKLGNYQDELEAGTSKID 70

Query: 87  DAAGNNGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITS 123
           D   +     F  DF S+I ++YR  F PI                            TS
Sbjct: 71  DVTAHGWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTS 130

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCM+RS Q L+A A+L   LGR WR+  +     E   +L LF D   +P SIH  
Sbjct: 131 DTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRF 187

Query: 184 LQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
           ++ G ++ G   G W GP A  R  EAL+          C ++   +YV +   D     
Sbjct: 188 VKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD----- 234

Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
              V  D   R   V   G     P L+L+   LG++ V P Y   L+     PQS+GI 
Sbjct: 235 ---VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIA 288

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSI 359
           GG+P AS Y +G Q     YLDPH  +P +    D     + + STYH+  +R IH+  +
Sbjct: 289 GGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDM 348

Query: 360 DPSLAIGFYCRDKDDFDDFCARASKL 385
           DPS+ IGF  R++DD++D+  R   +
Sbjct: 349 DPSMLIGFLVRNEDDWEDWKGRVGSV 374


>sp|Q96DT6|ATG4C_HUMAN Cysteine protease ATG4C OS=Homo sapiens GN=ATG4C PE=2 SV=1
          Length = 458

 Score =  163 bits (412), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
            F          +RE+                           +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>sp|Q2HH40|ATG4_CHAGB Probable cysteine protease ATG4 OS=Chaetomium globosum (strain ATCC
           6205 / CBS 148.51 / DSM 1962 / NBRC 6347 / NRRL 1970)
           GN=ATG4 PE=3 SV=2
          Length = 448

 Score =  163 bits (412), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 107/310 (34%), Positives = 154/310 (49%), Gaps = 56/310 (18%)

Query: 97  FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
           F  DF SRI ++YR GF+PI                      GD +  +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 192
            Q L+A ALL  +LGR WR+      +R    I+ LF D   +P+S+ N ++ G  A G 
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAERN---IVALFADDARAPYSLQNFVKHGAIACGK 229

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  +ALA    +          + IY          G  P V  D   
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269

Query: 253 RHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
              S  +  + D   + P L+LV   LG++K+NP Y   L  T    QS+GI GG+P +S
Sbjct: 270 ---SFLATARPDGETFHPTLILVCTRLGIDKINPVYEEALISTLQMEQSIGIAGGRPSSS 326

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIG 366
            Y VGVQ +   YLDPH  +P +   ++ L     +  + H+  +R++H++ +DPS+ IG
Sbjct: 327 HYFVGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSCHTRRLRYLHVEDMDPSMLIG 386

Query: 367 FYCRDKDDFD 376
           F  +D+DD+D
Sbjct: 387 FLIQDEDDWD 396


>sp|A7KAI3|ATG4_PICAN Probable cysteine protease ATG4 OS=Pichia angusta GN=ATG4 PE=3 SV=1
          Length = 509

 Score =  163 bits (412), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 126/404 (31%), Positives = 187/404 (46%), Gaps = 80/404 (19%)

Query: 72  LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 120
           L  + HK   D+A    A  +   EF +D  SRI ++YR GF  I  ++           
Sbjct: 51  LRTLFHKFKPDQAADTEA--SWPREFLRDVHSRIWLTYRSGFPLIKRAEDGPSPLSFGSL 108

Query: 121 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
                         T+D GWGCM+R+SQ L+A +LL  RLGR WR    +   + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANSLLQLRLGRGWRYDQTRECAK-HAEIV 167

Query: 168 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
             F D  T+PFSIHN ++ G    G   G W GP A  RS + L      +TGL      
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKTGLKV---- 223

Query: 227 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 282
              +  SGD  ED                   +F   Q  A+  P+L+L  + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQQGAELRPVLILAGIRLGVKNVN 263

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----- 337
           P Y   L+ T  +PQS+GI GG+P +S Y  G Q +   YLDPH  Q  + I  +     
Sbjct: 264 PLYWDFLKKTLGWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHES 323

Query: 338 -------DLEA--DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
                  ++E+  D  + H++ IR +HLD +DPS+ +G    ++  +D   A    +   
Sbjct: 324 PDPNHYVEVESGLDLDSVHTNKIRKLHLDQMDPSMLVGLLVENRASYD---ALKHSINSH 380

Query: 389 SNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD--SLGVMSMND 430
             G+    V  +  +PV  +     +GG+ E +   LGV+SMN+
Sbjct: 381 DQGSRFLNVYDS--RPVLAAK---SSGGLEESEFVDLGVLSMNE 419


>sp|Q5XH30|ATG4C_XENLA Cysteine protease ATG4C OS=Xenopus laevis GN=atg4c PE=2 SV=1
          Length = 450

 Score =  161 bits (408), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 112/403 (27%), Positives = 172/403 (42%), Gaps = 95/403 (23%)

Query: 67  TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 114
            S ++LLG C+    +++    D   N+G          + EF +DF SRI ++YRK F 
Sbjct: 38  NSPVFLLGKCYHFKYEDSGVTADDCSNSGSDSKEDLSGNVDEFRKDFISRIWLTYRKEFP 97

Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 151
            I  S  T+D GWGC LR+ QML+AQ LL H LGR W                       
Sbjct: 98  QIESSSWTTDCGWGCTLRTGQMLLAQGLLVHFLGRDWTWTEALDIFCSESDFWTANTARK 157

Query: 152 -------------------RKPLQKPFDREYVEILH-----LFGDSETSPFSIHNLLQAG 187
                              ++PLQ    + Y E LH      F D   + F +H L++ G
Sbjct: 158 LDPSLEKSSPENEEYVSLGKQPLQNSEKKRYSEDLHRKIISWFADYPLAYFGLHQLVKLG 217

Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
           K  G  AG W GP  +      L R    E+                  D E  G  +  
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256

Query: 248 IDDASRHCSVFSK-------GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
             D    C++++         + +   +++LVP+ LG E+ N  Y   ++   +    +G
Sbjct: 257 AQD----CTIYNADVYDLQCNKGNEKAVVILVPVRLGGERTNMEYFEYVKGILSLEFCIG 312

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           I+GGKP  S Y VG Q++S IY+DPH  Q  +++   +   +  ++H    + +    +D
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSIKNFPLE--SFHCPSPKKMSFKKMD 370

Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTH 401
           PS  +GFYCR+  +F+      +K+ + S     PLFT    H
Sbjct: 371 PSCTVGFYCRNAREFEKAAEELTKVLKSSTKQNYPLFTFVNGH 413


>sp|Q7S3X7|ATG4_NEUCR Probable cysteine protease atg-4 OS=Neurospora crassa (strain ATCC
           24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
           GN=atg-4 PE=3 SV=1
          Length = 506

 Score =  160 bits (406), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 131/405 (32%), Positives = 182/405 (44%), Gaps = 87/405 (21%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSR 60
             G R  A A+ C S ++      S A  GS+LGS +TV   VT+G     ++  L    
Sbjct: 112 FNGVRTTATAT-CLSDTS-----MSAAPTGSQLGSFDTVPDSVTSG-----YDSALAYEE 160

Query: 61  TGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF------- 113
            G                  QD     A        F  DF SRI ++YR  F       
Sbjct: 161 PG------------------QDGGWPPA--------FLDDFESRIWMTYRTDFALIPRSS 194

Query: 114 DPIGDSKIT----------------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 157
           DP   S ++                SD GWGCM+RS Q L+A A+L  RLGR WR+    
Sbjct: 195 DPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQSLLANAILIARLGREWRRGTD- 253

Query: 158 PFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRA 216
             D E  +I+ LF D   +P+S+HN ++ G  A G   G W GP A  R  +ALA     
Sbjct: 254 -LDAE-KDIIALFADDPRAPYSLHNFVKYGATACGKYPGEWFGPSATARCIQALA--DEK 309

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
           ++GL   S                G  P V  D      +V +     + P L+LV   L
Sbjct: 310 QSGLRVYST---------------GDLPDVYEDS---FMAVANPDGRGFQPTLILVCTRL 351

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G++K+N  Y   L  T   PQS+GI GG+P +S Y VGVQ +   YLDPH  +P +   +
Sbjct: 352 GIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYFVGVQGQRLFYLDPHHPRPALPYRE 411

Query: 337 DD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           D       +  T H+  +R +H+  +DPS+ IGF  +D+DD+D +
Sbjct: 412 DPRGYTAEELDTCHTRRLRQLHIGDMDPSMLIGFLIKDEDDWDTW 456


>sp|A7KAL5|ATG4_PENCW Probable cysteine protease atg4 OS=Penicillium chrysogenum (strain
           ATCC 28089 / DSM 1075 / Wisconsin 54-1255) GN=atg4 PE=3
           SV=1
          Length = 401

 Score =  160 bits (404), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 116/380 (30%), Positives = 171/380 (45%), Gaps = 75/380 (19%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAE-----------------FNQDFSSRILISYRKG 112
           IW LG   + A  +   D A NN  +                  F  DF SRI I+YR  
Sbjct: 29  IWCLG--REYAPSQPPSDPASNNPRSPSRQPNASTLNDTTWPKAFLSDFGSRIWITYRSN 86

Query: 113 FDPIGDSK-----------------------ITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
           F PI  +K                        TSD GWGCM+RS Q L+A       LGR
Sbjct: 87  FTPIPRTKTPEATSSMTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQSLLANTFSVLLLGR 146

Query: 150 PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWE 208
            WR+  +     E  +++ +F D   +PFSIH  +  G ++ G   G W GP        
Sbjct: 147 DWRRGEKV---EEESKLISMFADHPEAPFSIHRFVNRGAESCGKYPGEWFGP-------S 196

Query: 209 ALARCQRAETGLGCQS-LP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
           A A+C +    L  QS +P + +Y+ +   D           +D   H +    G+    
Sbjct: 197 ATAKCIQL---LSTQSEVPQLRVYLTNDTSD---------VYEDKFAHVAHDESGRIQ-- 242

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
           P L+L+   LG++ V P Y   LR   T+PQS+GI GG+P AS Y VG Q+    +LDPH
Sbjct: 243 PTLILIGTRLGIDNVTPAYWDGLRAALTYPQSVGIAGGRPSASHYFVGAQDCHLFFLDPH 302

Query: 327 DVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
             +P      D L  + +  +Y++  +R IH+  +DPS+ IGF  +D+DD+ D+     K
Sbjct: 303 TTRPATLYRPDGLYTQEELDSYYTSRLRRIHIKDMDPSMLIGFLVKDEDDWADW----KK 358

Query: 385 LAEESNGAPLFTVTQTHKKP 404
               + G P+  +  +  +P
Sbjct: 359 RIRSTPGQPIVHIFPSQHQP 378


>sp|Q0U199|ATG4_PHANO Probable cysteine protease ATG4 OS=Phaeosphaeria nodorum (strain
           SN15 / ATCC MYA-4574 / FGSC 10173) GN=ATG4 PE=3 SV=1
          Length = 467

 Score =  159 bits (402), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 116/354 (32%), Positives = 158/354 (44%), Gaps = 87/354 (24%)

Query: 92  NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
           N  + F  DF SR+ ++YR GF PI  S+                      TSD G+GCM
Sbjct: 91  NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150

Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
           +RS Q ++A AL   RLGR WR   +   D+++ EIL LF D   +PFSIH  ++ G A 
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209

Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G   G W GP A  R  + LA   R E GL        +YV SGD      GA V   +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVYV-SGD------GADVY--E 252

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  +  +V   G   W P L+LV   LG++K+ P Y   L+ +   PQS+GI GG+P AS
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKASLQIPQSIGIAGGRPSAS 310

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEA---------------------------- 341
            Y VGVQ  +  YLDPH  +P++      L A                            
Sbjct: 311 HYFVGVQGNNFYYLDPHSTRPLLPFHPPSLAAATSDTPNLTASTTSVSSTTSSTTIVPPA 370

Query: 342 -----------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
                            D ST H+  IR + +  +DPS+ + F    + D+ D+
Sbjct: 371 DSIPAPSDPRQSLYPPSDLSTCHTRRIRRLQIREMDPSMLLAFLVTSEADYQDW 424


>sp|Q1E5M9|ATG4_COCIM Probable cysteine protease ATG4 OS=Coccidioides immitis (strain RS)
           GN=ATG4 PE=3 SV=1
          Length = 432

 Score =  157 bits (398), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 100/297 (33%), Positives = 138/297 (46%), Gaps = 50/297 (16%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF S+   +YR  F  I  S+                        T+D GWGCM+RS
Sbjct: 105 FLDDFESKFWFTYRSNFPAIPKSRDPDTPLALTLSVRLRSQFLDTHGFTADTGWGCMIRS 164

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
            Q L+A AL    LGR WR+  +    +E  E+L LF D+  +PFSIH  +  G  A G 
Sbjct: 165 GQSLLANALSILNLGRDWRRGSKI---KEECELLSLFADNPQAPFSIHRFVDYGASACGK 221

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  EAL+          C+   + +YV+S   D        +   D  
Sbjct: 222 HPGEWFGPSATARCIEALSN--------ECKHTDLNVYVMSDGSDVHEDQFRQIAGPDGI 273

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
           R             P L+L+ + LG+E V P Y   LR    +PQS+GI GG+P +S Y 
Sbjct: 274 R-------------PTLILLGVRLGIESVTPVYWEALRAIIRYPQSVGIAGGRPSSSLYF 320

Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGF 367
           +GVQ     YLDPH  +P ++   D      +  TYH+  +R +H+  +DPS+ IGF
Sbjct: 321 IGVQGPYFFYLDPHHTRPAVSWNPDSTLSPENLDTYHTRRLRRLHIREMDPSMLIGF 377


>sp|Q86ZL5|ATG4_PODAS Probable cysteine protease ATG4 OS=Podospora anserina GN=ATG4 PE=3
           SV=1
          Length = 500

 Score =  157 bits (398), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 64/310 (20%)

Query: 97  FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
           F  DF SRI ++YR GF+ I                      GD +  +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
            Q L+A A+L  R GR WR+      +RE   I+ LF D   +P+SI N +  G A  G 
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIERE---IVCLFADDPRAPYSIQNFVNHGAAACGK 289

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI---YVVSGDEDGERGGAPVVCID 249
             G W GP        A ARC  +      + LP      ++ + + DG           
Sbjct: 290 YPGEWFGP-------SATARCIHSLRVYLTRDLPEVYEDNFMSTANPDGNH--------- 333

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
                          + P L+LV   LG++K+NP Y   L  T   PQ++GI GG+P +S
Sbjct: 334 ---------------FHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSS 378

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGK---DDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
            Y +G Q +   YLDPH  +P +   +   D    +  + H+  +RH+H++ +DPS+ IG
Sbjct: 379 HYFIGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSCHTRRLRHLHVEDMDPSMLIG 438

Query: 367 FYCRDKDDFD 376
           F  +D+DD+D
Sbjct: 439 FLIKDEDDWD 448


>sp|Q4U3V5|ATG4_CRYPA Probable cysteine protease ATG4 OS=Cryphonectria parasitica GN=ATG4
           PE=2 SV=1
          Length = 459

 Score =  156 bits (395), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 109/332 (32%), Positives = 152/332 (45%), Gaps = 60/332 (18%)

Query: 79  IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 120
           +A DE L DA        F  DF SR+ ++YR  F+PI  S                   
Sbjct: 109 LAYDELLEDAGWP---IAFLDDFESRVWMTYRSEFEPISKSNDPRASAALSFAMRLRTLA 165

Query: 121 ----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
                +SD GWGCM+RS Q L+A  L+  +LGR WR+       R+  EIL  F D   +
Sbjct: 166 DQGGFSSDTGWGCMIRSGQSLLANTLVICQLGRDWRRGKAA---RQEREILARFADDPRA 222

Query: 177 PFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
           P+S+HN ++ G  A G   G W GP A  R  +ALA    +          + +Y     
Sbjct: 223 PYSLHNFVRHGAVACGKFPGEWFGPSATARCIQALANSNESS---------LRVYST--- 270

Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
                G  P V  D      +V       + P L+LV   LG++K+N  Y   L  T   
Sbjct: 271 -----GDLPDVYEDS---FMAVAKPDGETFHPTLILVGTRLGIDKINQVYWEALTATLQM 322

Query: 296 PQSLGIVGGKPGASTYIVGVQEES--------AIYLDPHDVQPVINIGKD---DLEADTS 344
           PQS+GI GG+P AS Y +G Q             YLDPH  +P +   +D       D +
Sbjct: 323 PQSVGIAGGRPSASHYFIGAQRSGDAYEPGSYLFYLDPHCTRPALPFHEDVDQYTSDDIN 382

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 376
           T H+  +R +H+  +DPS+ IGF  +D+DD+D
Sbjct: 383 TCHTRRLRRLHVRDMDPSMLIGFLIKDEDDWD 414


>sp|Q6CH28|ATG4_YARLI Probable cysteine protease ATG4 OS=Yarrowia lipolytica (strain CLIB
           122 / E 150) GN=ATG4 PE=3 SV=1
          Length = 545

 Score =  156 bits (394), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 124/393 (31%), Positives = 167/393 (42%), Gaps = 98/393 (24%)

Query: 96  EFNQDFSSRILISYRKGF--------------------------DPIGDSKITSDVGWGC 129
           +F  D  SRI +SYR GF                          DP G    TSDVGWGC
Sbjct: 64  DFLADVQSRIWLSYRTGFPLIPKSDGSGTIHLGKLKNMIRGGGFDPRG---YTSDVGWGC 120

Query: 130 MLRSSQMLVAQALLFHRLGRPWR----------------------------KPLQKPFDR 161
           M+R+SQ L+A ALLF  LGR WR                            K  +     
Sbjct: 121 MIRTSQSLLANALLFRHLGRGWRWNKGDDFVYLSEGNTESRGGESRNGGANKEQETAVSE 180

Query: 162 EYV----EILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRA 216
           E       I+  F DS  SPFSIH  ++ G KA    AG W GP A   S  AL      
Sbjct: 181 ETAVSEETIISWFLDSPDSPFSIHKFVRHGEKACSTPAGDWFGPSAAGSSIYAL------ 234

Query: 217 ETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
                C   P   + +Y      +G  GG   V  D+      +   G     P+L+L  
Sbjct: 235 -----CNEFPDSGLKVYY-----NGNGGGD--VYEDE------LLETG----FPLLVLCG 272

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           L LG++ VNP Y  +LR   + PQS+GI GG+P  S Y  G Q E   YLDPH  +P + 
Sbjct: 273 LRLGIDNVNPIYWDSLRQMLSLPQSVGIAGGRPFTSHYFFGFQGEQLFYLDPHQPKPAVK 332

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
                 + DT+++HS  I  +HL  +DPS+ +GFY   + D++ F    +   E+++   
Sbjct: 333 T----TDKDTTSFHSSRIWKLHLKEMDPSMLVGFYITSEADWETFKGSLTASKEKTSSQI 388

Query: 394 LFTVTQTHKKP-VNHSDVLGETGGVPEDDSLGV 425
           +      H  P  +  D     GG  +DD + V
Sbjct: 389 VHIHPSRHNIPSFDEEDEYVSIGGASDDDFVDV 421


>sp|Q5B7L0|ATG4_EMENI Cysteine protease atg4 OS=Emericella nidulans (strain FGSC A4 /
           ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=atg4 PE=3
           SV=2
          Length = 402

 Score =  154 bits (389), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 120/390 (30%), Positives = 178/390 (45%), Gaps = 68/390 (17%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGV-----CHKIAQDEALGDAAGNN--------GLA 95
           +RI + +  P         S IW LG      C +   DE+     G          G  
Sbjct: 11  KRIIQYIWDPEPKNDEEPGSPIWCLGTRYPPQCVEETADESRNPDHGQQQNTNTSAPGWP 70

Query: 96  E-FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 131
           E F  DF S+I ++YR  F PI                            TSD GWGCM+
Sbjct: 71  EAFLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMI 130

Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 190
           RS Q L+A ++    LGR WR+  +     E  ++L LF DS  +PFSIH+ ++ G  + 
Sbjct: 131 RSGQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFC 187

Query: 191 GLAAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
           G   G W GP A  R  + LA R  ++          + +Y+   + D  +     V  D
Sbjct: 188 GKHPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRD 238

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           +         KG     P L+L+ L LG++++   Y   L+     PQS+GI GG+P AS
Sbjct: 239 E---------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSAS 287

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
            Y V VQ     YLDPH+ +P +   +     E + +TYH+  +R +++  +DPS+ IGF
Sbjct: 288 HYFVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDPSMLIGF 347

Query: 368 YCRDKDDFDDFCARASKLAEESNGAPLFTV 397
             RD+DD++D+ AR   L     G P+ T+
Sbjct: 348 LIRDEDDWEDWKARIMSL----EGKPIITI 373


>sp|Q68EP9|ATG4C_XENTR Cysteine protease ATG4C OS=Xenopus tropicalis GN=atg4c PE=2 SV=1
          Length = 450

 Score =  154 bits (388), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 111/403 (27%), Positives = 172/403 (42%), Gaps = 95/403 (23%)

Query: 67  TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 114
            S ++LLG C+    +++    D   N+G          + EF +DF SRI ++YR+ F 
Sbjct: 38  NSPVFLLGKCYHFKYEDSSVTSDGGSNSGSESKEDLSGNVDEFRKDFISRIWLTYREEFP 97

Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 151
            I  S  T+D GWGC LR+ QML+AQ L+ H LGR W                       
Sbjct: 98  QIETSSWTTDCGWGCTLRTGQMLLAQGLIVHFLGRDWTWTEALDIFSSESEFWTANTARK 157

Query: 152 -------------------RKPLQ---KPFDRE--YVEILHLFGDSETSPFSIHNLLQAG 187
                              ++PL    K  + E  + +I+  F D   + F +H L++ G
Sbjct: 158 LTPSLETSFSENNECVSSNKQPLHNCDKKSNSEDFHQKIISWFADYPLAYFGLHQLVKLG 217

Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
           K  G  AG W GP  +      L R    E+                  D E  G  +  
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256

Query: 248 IDDASRHCSVFSKGQADW-------TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
             D    C+++S    D          +++LVP+ LG E+ N  Y   ++   +    +G
Sbjct: 257 AQD----CTIYSADVYDLQCNKGTEKAVVILVPVRLGGERTNMEYFEFVKGILSLEFCIG 312

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           I+GGKP  S Y VG Q++S IY+DPH  Q  +++   +   +  ++H    + +    +D
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSVKNFPLE--SFHCPSPKKMSFKKMD 370

Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTH 401
           PS  IGFYCR+  +F+      +K+ + S     PLFT    H
Sbjct: 371 PSCTIGFYCRNAREFEKAAEELTKVLKSSTKQNYPLFTFVNGH 413


>sp|Q523C3|ATG4_MAGO7 Cysteine protease ATG4 OS=Magnaporthe oryzae (strain 70-15 / ATCC
           MYA-4617 / FGSC 8958) GN=ATG4 PE=3 SV=2
          Length = 491

 Score =  152 bits (384), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF SRI ++YR GF+PI  S                         T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
            Q L+A +LL  RLGR WR+  Q P   E  ++L LF D   +P+SIHN +  G A  G 
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R   ALA                 +Y          G  P V  D   
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
               V       + P L+L+   LG++K+N  Y  +L  T   PQS+GI GG+P +S Y 
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367

Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
           VG Q           YLDPH  +P +   +D      +D  + H+  +R +H+  +DPS+
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 427

Query: 364 AIGFYCRDKDDF 375
            IGF   D++++
Sbjct: 428 LIGFLILDEENW 439


>sp|Q68FJ9|ATG4D_XENLA Cysteine protease ATG4D OS=Xenopus laevis GN=atg4d PE=2 SV=1
          Length = 469

 Score =  143 bits (360), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 103/356 (28%), Positives = 161/356 (45%), Gaps = 63/356 (17%)

Query: 91  NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
           ++ +  F +DF SR+ ++YR+ F  +  + +T+D GWGCM+RS QML+AQ LL H L R 
Sbjct: 93  DDEIERFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSRE 152

Query: 151 W--RKPLQKPF----------------------------------------DREYVEILH 168
           W   + L + F                                        D+ +  I+ 
Sbjct: 153 WTWSEALYRHFVEMEPIRSSSPPSMPLSSLATGHSAGDYQPHTQCSGAPHGDQVHRNIMR 212

Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
            F D   SPF +H L+  G  +G  AG W GP         +A   +       +   ++
Sbjct: 213 WFSDHPGSPFGLHQLVTLGSIFGKKAGDWYGP-------SIVAHIIKKAIETSSEVPELS 265

Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
           +YV S D    +     +   D     +    G+A    +++LVP+ LG E  NP Y   
Sbjct: 266 VYV-SQDCTVYKADIEQLFAGDVPHAETSRGAGKA----VIILVPVRLGGETFNPVYKHC 320

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           L+     P  LGI+GGKP  S Y +G Q+   +YLDPH  QP I+  K+D   +  ++H 
Sbjct: 321 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQPYIDTSKNDFPLE--SFHC 378

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL-----AEESNGAPLFTVTQ 399
           +  R I +  +DPS    FY ++ +DF   C    K+     AEE    P+F++++
Sbjct: 379 NSPRKISITRMDPSCTFAFYAKNSEDFGKLCDHLMKVLHSPRAEEK--YPIFSISE 432


>sp|Q6BYP8|ATG4_DEBHA Probable cysteine protease ATG4 OS=Debaryomyces hansenii (strain
           ATCC 36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968)
           GN=ATG4 PE=3 SV=2
          Length = 492

 Score =  136 bits (343), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 99/343 (28%), Positives = 156/343 (45%), Gaps = 79/343 (23%)

Query: 87  DAAGNNGLAEFNQDFSSRILISYRKGFDPIG----------------------------- 117
           D + ++G+ E  QD  S+I ++YR GF+PI                              
Sbjct: 77  DISVDDGVIE--QDIYSKIWLTYRTGFEPIAKCLDGPQPLSFVQSMVFNRNPISSTFNNF 134

Query: 118 -----DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW-----RKPLQKPFDREYVEIL 167
                +   T+DVGWGCM+R+SQ L+A       LGR +     R P        + EI+
Sbjct: 135 HGLLDNDNFTTDVGWGCMIRTSQALLANTYQLLFLGRGFSYGRDRSP-------RHDEII 187

Query: 168 HLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
            +F D   +PFS+HN ++      L    G W GP A   S + L           C + 
Sbjct: 188 DMFMDEPRAPFSLHNFIKVASESPLKVKPGQWFGPNAASLSIKRL-----------CDN- 235

Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP----ILLLVPLVLGLEKV 281
              +Y  +G      G   VV  + ++ +  + ++      P    IL+L+P+ LG++KV
Sbjct: 236 ---VYESNG-----TGRVKVVISESSNLYDDIITQMFTTLNPVPDAILVLLPVRLGIDKV 287

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
           NP Y  ++       QS+GI GGKP +S Y  G +    +YLDPH  Q V N       +
Sbjct: 288 NPLYHASVLELLALRQSVGIAGGKPSSSFYFFGYKGNDLLYLDPHYPQFVRN-----KTS 342

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
              TYH++  + + +D +DPS+ IG   +D +D++DF +  +K
Sbjct: 343 VYDTYHTNSYQKLSVDDMDPSMMIGILIKDINDYEDFKSSCTK 385


>sp|Q9P373|ATG4_SCHPO Probable cysteine protease atg4 OS=Schizosaccharomyces pombe
           (strain 972 / ATCC 24843) GN=atg4 PE=3 SV=1
          Length = 320

 Score =  136 bits (343), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 99/341 (29%), Positives = 144/341 (42%), Gaps = 53/341 (15%)

Query: 48  MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 107
           M R  ER L  + T      + IW LG  +KI   +            +F  D  S I I
Sbjct: 4   MARFLERYLHFAPTNTEPPGTLIWFLGHSYKIEDSQ---------WPEKFLYDSFSLITI 54

Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
           +YR G +  G   +TSD GWGCM+RS+Q L+A  L   R+  P         +++  EIL
Sbjct: 55  TYRSGIE--GLENMTSDTGWGCMIRSTQTLLANCL---RICYP---------EKQLKEIL 100

Query: 168 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
            LF D  ++PFSIH  +  GK    +  G W GP   C     +AR            +P
Sbjct: 101 ALFADEPSAPFSIHQFVTMGKTLCDINPGQWFGPTTSC---SCVARLSDQNP-----DVP 152

Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 286
           + +YV        R     V                    P+LLL+P  LG++ +N  Y 
Sbjct: 153 LHVYVARNGNAIYRDQLSKVSF------------------PVLLLIPTRLGIDSINESYY 194

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
             L   F     +GI GG+P ++ Y    Q +   YLDPH         +    A   T+
Sbjct: 195 DQLLQVFEIRSFVGITGGRPRSAHYFYARQNQYFFYLDPHCTHFAHTTTQ---PASEETF 251

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
           HS  +R + +  +DP +  GF  RD++++  F A     A+
Sbjct: 252 HSATLRRVAIQDLDPCMIFGFLIRDEEEWHSFEANQKYFAD 292


>sp|Q6CQ60|ATG4_KLULA Probable cysteine protease ATG4 OS=Kluyveromyces lactis (strain
           ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL
           Y-1140 / WM37) GN=ATG4 PE=3 SV=1
          Length = 450

 Score =  126 bits (316), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 110/383 (28%), Positives = 164/383 (42%), Gaps = 67/383 (17%)

Query: 26  LASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEAL 85
           L+ +   LG  E V R  T   + + +  +   SRT + +  S           A +  +
Sbjct: 4   LSRISQHLGIVEDVDRDGTVFILGKEYAPLNNKSRTDVETDDS-----------ALESLI 52

Query: 86  GDAAGNNGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT------------------ 122
              + N GL     D  SR+  +YR  F PI     G S I                   
Sbjct: 53  NIVSLNPGLL---SDVHSRVFFTYRTQFTPIRRNENGPSPINFTLFFRDNPINTLENALT 109

Query: 123 ------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
                 SD+GWGCM+R+ Q L+A A+   +L R +R    +  D E + ++  F D    
Sbjct: 110 DPDSFYSDIGWGCMIRTGQALLANAIQRVKLAREFRINASRIDDNE-LNLIRWFQDDVKY 168

Query: 177 PFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
           P S+HN ++A  K  G+  G W GP A  RS + L      E    C      I   S D
Sbjct: 169 PLSLHNFVKAEEKISGMKPGQWFGPSATARSIKTLI-----EGFPLCGIKNCIISTQSAD 223

Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
                     +  D+ +R   +F K +     +LLL  + LG++K+N  Y   +    + 
Sbjct: 224 ----------IYEDEVTR---IFHKDRD--ANLLLLFAVRLGVDKINSLYWKDIFKILSS 268

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
           P S+GI GGKP +S Y  G Q E+  YLDPH+ Q   ++  DDLE   S  H      +H
Sbjct: 269 PYSVGIAGGKPSSSLYFFGYQNENLFYLDPHNTQQS-SLMMDDLEFYRSC-HGHKFNKLH 326

Query: 356 LDSIDPSLAIGFYCRDKDDFDDF 378
           +   DPS+ +G     K+++D F
Sbjct: 327 ISETDPSMLLGMLISGKNEWDQF 349


>sp|A3LQU0|ATG4_PICST Probable cysteine protease ATG4 OS=Scheffersomyces stipitis (strain
           ATCC 58785 / CBS 6054 / NBRC 10063 / NRRL Y-11545)
           GN=ATG4 PE=3 SV=2
          Length = 514

 Score =  122 bits (307), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 95/323 (29%), Positives = 153/323 (47%), Gaps = 43/323 (13%)

Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
           FS  +L + +   + I     T+DVGWGCM+R+SQ L+A    F RL       L K  D
Sbjct: 138 FSKSLLYNLQNFNNFIEKENFTTDVGWGCMIRTSQSLLANT--FVRL-------LDKQSD 188

Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 218
                I+ LF D+  +PFS+HN ++   +  L    G W GP A   S + L  C     
Sbjct: 189 -----IIALFNDTYLAPFSLHNFIRVASSSPLKVKPGEWFGPNAASLSIKRL--CDGYYD 241

Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
               +++   I V+  +            ++ ++      +KG      +L+L+P+ LG+
Sbjct: 242 NSTSETILPRINVLISESTDLYDSQIAQLLEPSTE-----TKG------LLVLLPVRLGI 290

Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
           + +N  Y  +L    +  QS+GI GGKP +S Y  G Q+ S IY+DPH  Q    I   D
Sbjct: 291 DSINSYYFSSLLHLLSLEQSVGIAGGKPSSSFYFFGYQDNSLIYMDPHSAQ----IFSSD 346

Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF---CARASKLAEESNGAPLF 395
           +  D STY++   + + +  +DPS+ IG + RD   +++F   C  A+      +     
Sbjct: 347 I--DMSTYYATRYQRVDIGKLDPSMLIGVFIRDLTSYENFKKSCLDAANKIVHFHATERS 404

Query: 396 TVTQTHKK-----PVNHSDVLGE 413
           TV ++ +K      +N SD+  E
Sbjct: 405 TVPESRRKNSEFVNINRSDLKDE 427


>sp|A7TQN1|ATG4_VANPO Probable cysteine protease ATG4 OS=Vanderwaltozyma polyspora
           (strain ATCC 22028 / DSM 70294) GN=ATG4 PE=3 SV=1
          Length = 411

 Score =  121 bits (303), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 89/309 (28%), Positives = 142/309 (45%), Gaps = 57/309 (18%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 127
           F  D  SRI  +YR  F PI  S                                +D+GW
Sbjct: 74  FLSDVISRIHFTYRTKFIPIARSDDGPSPLRINFLIGDNPFNAIENAIYNPNCFNTDIGW 133

Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
           GCM+R+ Q L+A A+    LGR +R       + +  +I+  F D+   PFS+HN ++ G
Sbjct: 134 GCMIRTGQSLLANAIQIAILGREFRVN-DGDVNEQERKIISWFMDTPDEPFSLHNFVKKG 192

Query: 188 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
            +      G W GP A  RS ++L   Q  + G+    + ++   +  DE          
Sbjct: 193 CELSSKKPGEWFGPAATSRSIQSLVE-QFPDCGIDRCIVSVSSADIFKDE---------- 241

Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
            I+D      +F   +  ++ ILLL+ + LG++KVN  Y+  +R       S+GI GG+P
Sbjct: 242 -IND------IFKNKR--YSNILLLMGVKLGVDKVNEYYLKDIRKILESRYSVGISGGRP 292

Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
            +S Y  G Q+++ +Y DPH  QP        +E+   T H+D    I++  +DPS+ IG
Sbjct: 293 SSSLYFFGYQDDTLLYFDPHKPQPST------IESLLETCHTDNFDKINISDMDPSMLIG 346

Query: 367 FYCRDKDDF 375
              + +DD+
Sbjct: 347 VLLQGEDDW 355


>sp|Q59UG3|ATG4_CANAL Cysteine protease ATG4 OS=Candida albicans (strain SC5314 / ATCC
           MYA-2876) GN=ATG4 PE=3 SV=1
          Length = 446

 Score =  120 bits (302), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 92/317 (29%), Positives = 136/317 (42%), Gaps = 70/317 (22%)

Query: 98  NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 123
           N    S++ +SYR GF+PI  S                                    TS
Sbjct: 80  NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCM+R+SQ L+A  LL             K + +   EI+ LF D  +SPFSIHN 
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDDTSSPFSIHNF 186

Query: 184 LQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
           ++      L    G W GP A   S + LA     +  +    +P     +S + D    
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLASELLQDQEIDGIKIPRVF--ISENSD---- 240

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
                  DD  R   VF+K +     +L+L P+ LG++KVN  Y  ++        S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
            GGKP +S Y +G ++   IY DPH  Q V      +   +  +YH+     +++  +DP
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYHTTNYNRLNISLLDP 345

Query: 362 SLAIGFYCRDKDDFDDF 378
           S+ IG    + D++ DF
Sbjct: 346 SMMIGILVTNIDEYIDF 362


>sp|P0CQ10|ATG4_CRYNJ Cysteine protease ATG4 OS=Cryptococcus neoformans var. neoformans
           serotype D (strain JEC21 / ATCC MYA-565) GN=ATG4 PE=3
           SV=1
          Length = 1193

 Score =  117 bits (294), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 82/240 (34%), Positives = 110/240 (45%), Gaps = 28/240 (11%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 165
           +TSD GWGCMLR+ Q L+  AL+   LGR WR P       E               Y +
Sbjct: 562 LTSDAGWGCMLRTGQSLLVNALIHIHLGRDWRVPSTPASFSEATTTQEIAALKDYAKYAQ 621

Query: 166 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
           +L  F D  S   PFS+H +   GK  G   G W GP     + + LA    A  G+   
Sbjct: 622 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 680

Query: 224 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPL 274
           +   +I      Y  S    D     +P        R     +K +  W    +L+LV +
Sbjct: 681 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRRGDNEAK-EEKWGKRAVLILVGV 739

Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
            LGL+ VNP Y  +++  FTFPQS+GI GG+P +S Y VG Q     YLDPH  +P I +
Sbjct: 740 RLGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 799



 Score = 53.9 bits (128), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 5/53 (9%)

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
           TYH + I+ + L  +DPS+ +GF C+D+DDF+DF  R ++L ++     +FTV
Sbjct: 952 TYHCEKIKKMPLSGLDPSMLLGFVCKDEDDFEDFVERVAQLPKK-----IFTV 999


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.318    0.136    0.415 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 175,521,858
Number of Sequences: 539616
Number of extensions: 7693184
Number of successful extensions: 15316
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 62
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 15053
Number of HSP's gapped (non-prelim): 112
length of query: 443
length of database: 191,569,459
effective HSP length: 121
effective length of query: 322
effective length of database: 126,275,923
effective search space: 40660847206
effective search space used: 40660847206
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)