BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 013409
         (443 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255576671|ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
 gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B, putative [Ricinus communis]
          Length = 489

 Score =  630 bits (1624), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 332/491 (67%), Positives = 379/491 (77%), Gaps = 50/491 (10%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGS------------------------- 35
           MKGFRE+  AS+C SK   DTPNRSL S   E GS                         
Sbjct: 1   MKGFRERV-ASRCSSKCPVDTPNRSLTSDCLESGSNFSTKGSLWSSFFASAFSVFETYRE 59

Query: 36  -----------------SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHK 78
                            +  VK++V+ GSMRRIHERVLGPSRTGISS+TSDIWLLGVC+K
Sbjct: 60  SPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGVCYK 119

Query: 79  IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLV 138
           I++DE+ G+A   N LAEF  D+SSRIL++YR+GFD IGDSK  SDVGWGCMLRSSQMLV
Sbjct: 120 ISEDES-GNADTGNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQMLV 178

Query: 139 AQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
           AQALLFH+LGR W KP QKP D+ YVEILHLFGDSE +PFSIHNL+QAGKAY LAAGSWV
Sbjct: 179 AQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGSWV 238

Query: 199 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
           GPYAMCRSWE+LAR +R E  L  QSLPMA+YVVSGDEDGERGGAPVV I+DASRHC  F
Sbjct: 239 GPYAMCRSWESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCLEF 298

Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
           S+GQADWTPILLLVPLVLGL+KVNPRYIP+L+ TFTF QSLGI+GGKPGASTYIVGVQ++
Sbjct: 299 SRGQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGVQDD 358

Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           +A YLDPH+VQ V+NIG+DD+EADTS+YHSD++RHI L SIDPSLAIGFYCRDKDDFD+F
Sbjct: 359 NAFYLDPHEVQSVVNIGRDDIEADTSSYHSDIVRHIPLHSIDPSLAIGFYCRDKDDFDEF 418

Query: 379 CARASKLAEESNGAPLFTVTQTHK--KPVNHSDVL-GETGGVPEDDSLGV-MSMNDAV-- 432
           C  ASKLA++S GAPLFTV   HK  KPV+H D+L  E   V EDDS+ V M +ND    
Sbjct: 419 CLLASKLADDSQGAPLFTVAHCHKLPKPVSHGDMLNNEDDEVQEDDSVNVMMPVNDDAEG 478

Query: 433 GNAHEDDWQLL 443
           G A ED+WQLL
Sbjct: 479 GGAQEDEWQLL 489


>gi|359495820|ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
 gi|296086874|emb|CBI33041.3| unnamed protein product [Vitis vinifera]
          Length = 486

 Score =  619 bits (1596), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 308/433 (71%), Positives = 357/433 (82%), Gaps = 4/433 (0%)

Query: 15  SKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 74
           S+S+P    +     G   G +  V+++VT  SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54  SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113

Query: 75  VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 134
           +C+KI+Q+E+   A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173

Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
           QMLVAQALL HR+GR WRK   KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233

Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 254
           GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
           C  FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L  TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353

Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
           VQ+E A YLDPH+ Q V++I +++LEADTS+YH ++IRHI LDSIDPSLAIGFYCRDKDD
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDD 413

Query: 375 FDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVLGETGGVPEDDSLGVMSMNDAV 432
           FDDFC RASKLA++SNGAPLFTV   H   KP++ SD + +  G  EDDS  V+S   A 
Sbjct: 414 FDDFCIRASKLADKSNGAPLFTVAHIHSLPKPISCSDGMDDCSGFREDDSFDVVSNKGAE 473

Query: 433 G--NAHEDDWQLL 443
           G  + HEDDWQLL
Sbjct: 474 GYEHEHEDDWQLL 486


>gi|224117658|ref|XP_002331599.1| predicted protein [Populus trichocarpa]
 gi|222873995|gb|EEF11126.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  618 bits (1593), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 323/488 (66%), Positives = 373/488 (76%), Gaps = 51/488 (10%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSET---------------------- 38
           MKGFRE+   +   S ST ++PNRS  S  SELGS++T                      
Sbjct: 1   MKGFRERGFVASSKSSSTAESPNRSFTSDSSELGSADTKFSKPSLWSTFFASAFSVFDTH 60

Query: 39  -----------------------VKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGV 75
                                  VK++V  GSMRRI E VLG S+TGIS++T DIWLLG 
Sbjct: 61  CDSSSTSEKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGA 120

Query: 76  CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 135
           C+KI+QD + GDAA  N LA FN DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQ
Sbjct: 121 CYKISQDNSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQ 180

Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
           MLVAQALLFHRLGR WRKPL KP DREYVEILHLFGDSE+S FSIHNLL+AGKAYGLAAG
Sbjct: 181 MLVAQALLFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAG 240

Query: 196 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 255
           SWVGPYA+C SWE+L R +R ET L  QSL MA+YVVSG EDGERGGAPV+CI++A+RHC
Sbjct: 241 SWVGPYAVCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHC 300

Query: 256 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
           S FSKGQ DWTPILLLVPLVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGV
Sbjct: 301 SEFSKGQEDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGV 360

Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
           Q+E+A YLDPH+VQPV+N+ +DD+EA+TS+YH +V+RH+ LD IDPSLAIGFYCRDKDDF
Sbjct: 361 QDENAFYLDPHEVQPVVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAIGFYCRDKDDF 420

Query: 376 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNA 435
           DDFC  ASKL +ESNGAPLFTV  + +K + H     ++G V  DDSLGVM+MND  G  
Sbjct: 421 DDFCTLASKLTDESNGAPLFTVAHS-RKLLKH-----DSGEVRSDDSLGVMTMNDVEGCV 474

Query: 436 HEDDWQLL 443
           HEDDWQLL
Sbjct: 475 HEDDWQLL 482


>gi|147862867|emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  617 bits (1591), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 310/436 (71%), Positives = 357/436 (81%), Gaps = 7/436 (1%)

Query: 15  SKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 74
           S+S+P    +     G   G +  V+++VT  SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54  SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113

Query: 75  VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 134
           +C+KI+Q+E+   A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173

Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
           QMLVAQALL HR+GR WRK   KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233

Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 254
           GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
           C  FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L  TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353

Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYH---SDVIRHIHLDSIDPSLAIGFYCRD 371
           VQ+E A YLDPH+ Q V++I +++LEADTS+YH   S +IRHI LDSIDPSLAIGFYCRD
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRD 413

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVLGETGGVPEDDSLGVMSMN 429
           KDDFDDFC RASKLA+ESNGAPLFTV   H   KP++ SD + +  G  EDDS  V+S  
Sbjct: 414 KDDFDDFCIRASKLADESNGAPLFTVAHIHSLPKPISCSDGMDDCSGFREDDSFDVVSNK 473

Query: 430 DAVG--NAHEDDWQLL 443
            A G  + HEDDWQLL
Sbjct: 474 GAEGYEHEHEDDWQLL 489


>gi|224092798|ref|XP_002309707.1| predicted protein [Populus trichocarpa]
 gi|222852610|gb|EEE90157.1| predicted protein [Populus trichocarpa]
          Length = 481

 Score =  615 bits (1587), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 302/410 (73%), Positives = 353/410 (86%), Gaps = 6/410 (1%)

Query: 34  GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
           G + +VK++V  G+MRRI ERVLG S+TGIS++TSDIWLLG  +KI+QD++ G+A   N 
Sbjct: 78  GWTSSVKKIVAGGTMRRIQERVLGTSKTGISNTTSDIWLLGARYKISQDDSSGNADATNA 137

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
           LA F++DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQMLVAQALLFHRLGR WRK
Sbjct: 138 LAAFHRDFSSRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRK 197

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
           P+ KP DR+YVEILHLFGDSE S FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE+LAR 
Sbjct: 198 PVDKPLDRDYVEILHLFGDSEASAFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWESLARS 257

Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
           +R ET L  Q+LPMA+YVVSG EDGERGGAPV+ I+DA+RHCS FSKG+ DWTPILLLVP
Sbjct: 258 KREETNLEYQTLPMAVYVVSGCEDGERGGAPVLSIEDAARHCSEFSKGREDWTPILLLVP 317

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           LVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGVQ+E+A YLDPH+VQPV+N
Sbjct: 318 LVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQPVVN 377

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
             +DD+EA+TS+YH DV+RHI LD IDPSLAIGFYCRDKDDFDDFC+ ASKLA+ESNGAP
Sbjct: 378 FSRDDVEANTSSYHCDVVRHIPLDLIDPSLAIGFYCRDKDDFDDFCSLASKLADESNGAP 437

Query: 394 LFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
           LFTV  ++K   +      ++  V +DD LGVM+MNDA G  +EDDWQLL
Sbjct: 438 LFTVANSYKSSKH------DSSEVRDDDPLGVMTMNDAEGCLNEDDWQLL 481


>gi|449442361|ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
 gi|449512710|ref|XP_004164121.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
          Length = 483

 Score =  597 bits (1538), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 301/407 (73%), Positives = 339/407 (83%), Gaps = 2/407 (0%)

Query: 38  TVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEF 97
           TV++++T+GSMRRI ER+LG  R+G+ SS  DIWLLGVCHKI+QD    DAA + G+A +
Sbjct: 78  TVRKVMTSGSMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKISQDHPPDDAASSPGVAGY 137

Query: 98  NQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 157
            QDFSSRIL++YRKGF  I DSK TSDV WGCMLRSSQMLVAQALLFHRLGR WRKP QK
Sbjct: 138 EQDFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPSQK 197

Query: 158 PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
           P D+EYVEILHLFGDSETS FSIHNLLQAG+AY LAAGSWVGPYAMCRSWE L R +R  
Sbjct: 198 PLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGPYAMCRSWETLVRSKRET 257

Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 277
             L  Q LPMAIY+VSGDEDGERGGAPV+ IDDASRHC  FSKGQ DW+PILLLVPLVLG
Sbjct: 258 PILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSKGQHDWSPILLLVPLVLG 317

Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
           LEK+NPRYIP+LR TFTFPQSLGI+GGKPGASTYIVGVQ+E+A YLDPH+VQ V+NI KD
Sbjct: 318 LEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQQVVNIDKD 377

Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
           DLEADTS+YH +VIRHI L+SIDPSLAIGFYCRDKDDFD+FC RASKLAEES+GAPLFTV
Sbjct: 378 DLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCHRASKLAEESDGAPLFTV 437

Query: 398 TQTHK-KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
            +TH   P   S  L +   + EDD  GV+ M +    +HEDDWQ L
Sbjct: 438 AETHSTNPGRQSSALNDHSRLVEDDGDGVVHMPNE-EESHEDDWQFL 483


>gi|356568569|ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 485

 Score =  569 bits (1466), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 295/413 (71%), Positives = 333/413 (80%), Gaps = 9/413 (2%)

Query: 34  GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
           G +  V+++VT GSMRR  ERVLG SRT ISSS  DIWLLGVCHKI+Q E+ G    +NG
Sbjct: 79  GWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQESTGGVDTSNG 138

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
           LA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQMLVAQALLFH+LGR WRK
Sbjct: 139 LASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRK 198

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
           P+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSWVGPYAMCR+WE LA  
Sbjct: 199 PIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLA-- 256

Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
            R +  LG   LPMAIYVVSGDEDGERGGAPVVCI+DAS+ CS FS G A WTP+LLLVP
Sbjct: 257 -RKKNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCSEFSSGLAVWTPLLLLVP 315

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           LVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+GVQ E A YLDPHDVQ V+N
Sbjct: 316 LVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQNEKAFYLDPHDVQQVVN 375

Query: 334 IGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
           I  D  E   TS+YH +V+RHI LDSIDPSLAIGFYCRDKDDFDDFC++ASKLAEESNGA
Sbjct: 376 ISGDTQEPTGTSSYHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKLAEESNGA 435

Query: 393 PLFTV--TQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
           PLFTV  +++  K V++ DV G+  G  EDD  G+   ND V N  EDDWQLL
Sbjct: 436 PLFTVAKSRSFSKQVSN-DVSGDNTGFQEDDFPGMDCGNDTVTN--EDDWQLL 485


>gi|356531828|ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 486

 Score =  565 bits (1457), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 292/413 (70%), Positives = 329/413 (79%), Gaps = 8/413 (1%)

Query: 34  GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
           G +  V+++VT GSMRR  ERVLG SRT ISSS  DIWLLGVCHKI+Q E+ G    +NG
Sbjct: 79  GWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQESSGGVDNSNG 138

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
           LA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQMLVAQALLFH+LGR WRK
Sbjct: 139 LASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQMLVAQALLFHKLGRSWRK 198

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
           P+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSWVGPYAMCR+WE LA  
Sbjct: 199 PIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLA-- 256

Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
            R +  LG   LPMAIYVVSGDEDGERGGAPVVCI+DAS+ C  FS G A WTP+LLLVP
Sbjct: 257 -RKKNDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCFEFSSGLAAWTPLLLLVP 315

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           LVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+G Q E A YLDPHDVQ V+N
Sbjct: 316 LVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGAQNEKAFYLDPHDVQQVVN 375

Query: 334 IGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
           I  D  E   TS+YH +++RHI LDSIDPSLAIGFYCRDKDDFDDFC++ASKLAEESNGA
Sbjct: 376 ISGDTQEPTSTSSYHCNIMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKLAEESNGA 435

Query: 393 PLFTVTQTH--KKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
           PLFTVTQ+    K V  +DV G+  G  E+D  G+   ND   N  EDDWQLL
Sbjct: 436 PLFTVTQSRSFSKQVTSNDVSGDNTGFQEEDFPGMDRGNDTGTN--EDDWQLL 486


>gi|357507987|ref|XP_003624282.1| Cysteine protease ATG4 [Medicago truncatula]
 gi|147742964|sp|A2Q1V6.1|ATG4_MEDTR RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|124359485|gb|ABN05923.1| Peptidase C54 [Medicago truncatula]
 gi|355499297|gb|AES80500.1| Cysteine protease ATG4 [Medicago truncatula]
          Length = 487

 Score =  556 bits (1432), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 282/412 (68%), Positives = 324/412 (78%), Gaps = 5/412 (1%)

Query: 34  GSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNG 93
           G +  V+++V+ GSMRR  ERVLG  RT +SSS  DIWLLGVCHKI+Q E+ GD    N 
Sbjct: 79  GWAAAVRKVVSGGSMRRFQERVLGSCRTDVSSSDGDIWLLGVCHKISQHESTGDVDIRNV 138

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
            A F QDF SRILI+YRKGFD I DSK TSDV WGCMLRSSQMLVAQALLFH+LGR WRK
Sbjct: 139 FAAFEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRK 198

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
            + KP D+EY++IL LFGDSE + FSIHNLLQAGK YGLA GSWVGPYAMCR+WE LAR 
Sbjct: 199 TVDKPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLARN 258

Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
           QR +   G Q LPMAIYVVSGDEDGERGGAPVVCI+DA + C  FS+G   WTP+LLLVP
Sbjct: 259 QREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCIEDACKRCLEFSRGLVPWTPLLLLVP 318

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           LVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ + A YLDPH+V+PV+N
Sbjct: 319 LVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQNDKAFYLDPHEVKPVVN 378

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
           I  D  E +TS+YH ++ RH+ LDSIDPSLAIGFYCRDKDDFDDFC+RA+KLAEESNGAP
Sbjct: 379 ITGDTQEPNTSSYHCNISRHMPLDSIDPSLAIGFYCRDKDDFDDFCSRATKLAEESNGAP 438

Query: 394 LFTVTQTHKKP--VNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
           LFTV Q+   P  V  + V G+     EDDSL +  +NDA    +EDDWQ L
Sbjct: 439 LFTVAQSRSLPMQVTSNSVSGDDTRFEEDDSLSMNLVNDA---GNEDDWQFL 487


>gi|388514549|gb|AFK45336.1| unknown [Lotus japonicus]
          Length = 489

 Score =  522 bits (1345), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 288/489 (58%), Positives = 335/489 (68%), Gaps = 48/489 (9%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSETV-------KRLVTAG-SMRRIH 52
           +K F ++  A+KC SKS+ +T + S     S+ GSS++            T+G S+   +
Sbjct: 3   LKAFCDRIVAAKCSSKSSTETVDNSQVPACSKAGSSDSKFPKASLWSSFFTSGFSVIETY 62

Query: 53  ERVLGPSRTGISSSTSD----------IWL--------------------------LGVC 76
            +     +  + S  S            WL                          LGVC
Sbjct: 63  SKSPASEKKAVHSQNSGWGCCCEESCYCWLNEEIPRACTLGQAELTFQALMVIYGFLGVC 122

Query: 77  HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 136
           HK +Q E+ GD   +   A F QDFSS+IL++YRKGFD IGDSK TSDV WGCMLRSSQM
Sbjct: 123 HKFSQQESTGDVDNSTVFAAFEQDFSSKILLTYRKGFDAIGDSKYTSDVNWGCMLRSSQM 182

Query: 137 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 196
           LVAQALLFH+LGR WRK   KP D+EY++IL  FGDSE S FSIHNLLQAGK YGLA GS
Sbjct: 183 LVAQALLFHKLGRMWRKTTDKPLDKEYLDILQHFGDSEASSFSIHNLLQAGKGYGLAVGS 242

Query: 197 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 256
           WVGPYAMCRSWE LAR QR     G Q LPMA+YVVSGDEDGERGGAPVVCI+DASR CS
Sbjct: 243 WVGPYAMCRSWEVLARNQRETNDHGEQPLPMALYVVSGDEDGERGGAPVVCIEDASRRCS 302

Query: 257 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 316
            FS+G A WTP+LLLVPLVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ
Sbjct: 303 EFSRGLAAWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQ 362

Query: 317 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 376
            E A YLDPHDVQPV++I  D  + +TS+YH +++R + LDSIDPSLAIGFYCRDKDDFD
Sbjct: 363 NEKAFYLDPHDVQPVVHINGDAQDPNTSSYHCNIVRQMPLDSIDPSLAIGFYCRDKDDFD 422

Query: 377 DFCARASKLAEESNGAPLFTVTQTHKKPVNHS--DVLGETGGVPEDDSLGVMSMNDAVGN 434
           DFC+RASKLAEESNGAPLFTV Q    P   +  DV G+  G  EDDS GV  +NDA  N
Sbjct: 423 DFCSRASKLAEESNGAPLFTVAQFRSFPFQDAGYDVSGDNTGFQEDDSHGVDLLNDAGTN 482

Query: 435 AHEDDWQLL 443
             EDDWQLL
Sbjct: 483 --EDDWQLL 489


>gi|30689628|ref|NP_850412.1| cysteine protease ATG4a [Arabidopsis thaliana]
 gi|75160546|sp|Q8S929.1|ATG4A_ARATH RecName: Full=Cysteine protease ATG4a; AltName:
           Full=Autophagy-related protein 4 homolog a;
           Short=AtAPG4a; Short=Protein autophagy 4a
 gi|19912143|dbj|BAB88383.1| autophagy 4a [Arabidopsis thaliana]
 gi|110742303|dbj|BAE99076.1| hypothetical protein [Arabidopsis thaliana]
 gi|330255286|gb|AEC10380.1| cysteine protease ATG4a [Arabidopsis thaliana]
          Length = 467

 Score =  520 bits (1340), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 244/377 (64%), Positives = 306/377 (81%), Gaps = 3/377 (0%)

Query: 34  GSSETVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
           G +  VKR+ + +G++RR  ERVLGP+RTG+ S+TSD+WLLGVC+KI+ DE  G+     
Sbjct: 74  GWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGETDTGT 133

Query: 93  GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
            LA    DFSS+IL++YRKGF+P  D+  TSDV WGCM+RSSQML AQALLFHRLGR W 
Sbjct: 134 VLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWT 193

Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
           K  + P ++EY+E L  FGDSE S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA 
Sbjct: 194 KKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLAC 252

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
            +R +T    Q+LPMA+++VSG EDGERGGAP++CI+DA++ C  FSKGQ++WTPI+LLV
Sbjct: 253 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPIILLV 312

Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
           PLVLGL+ VNPRYIP+L  TFTFPQS+GI+GGKPGASTYIVGVQE+   YLDPH+VQ V+
Sbjct: 313 PLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 372

Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
            + K+  + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFDDFC RA KLAEESNGA
Sbjct: 373 TVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFCLRALKLAEESNGA 432

Query: 393 PLFTVTQTHKKPVNHSD 409
           PLFTVTQTH   +N S+
Sbjct: 433 PLFTVTQTHTA-INQSN 448


>gi|42571227|ref|NP_973687.1| cysteine protease ATG4a [Arabidopsis thaliana]
 gi|330255287|gb|AEC10381.1| cysteine protease ATG4a [Arabidopsis thaliana]
          Length = 422

 Score =  520 bits (1338), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 244/377 (64%), Positives = 306/377 (81%), Gaps = 3/377 (0%)

Query: 34  GSSETVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
           G +  VKR+ + +G++RR  ERVLGP+RTG+ S+TSD+WLLGVC+KI+ DE  G+     
Sbjct: 29  GWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGETDTGT 88

Query: 93  GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
            LA    DFSS+IL++YRKGF+P  D+  TSDV WGCM+RSSQML AQALLFHRLGR W 
Sbjct: 89  VLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWT 148

Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
           K  + P ++EY+E L  FGDSE S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA 
Sbjct: 149 KKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLAC 207

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
            +R +T    Q+LPMA+++VSG EDGERGGAP++CI+DA++ C  FSKGQ++WTPI+LLV
Sbjct: 208 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPIILLV 267

Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
           PLVLGL+ VNPRYIP+L  TFTFPQS+GI+GGKPGASTYIVGVQE+   YLDPH+VQ V+
Sbjct: 268 PLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 327

Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
            + K+  + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFDDFC RA KLAEESNGA
Sbjct: 328 TVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFCLRALKLAEESNGA 387

Query: 393 PLFTVTQTHKKPVNHSD 409
           PLFTVTQTH   +N S+
Sbjct: 388 PLFTVTQTHTA-INQSN 403


>gi|297820846|ref|XP_002878306.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
 gi|297324144|gb|EFH54565.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
          Length = 476

 Score =  509 bits (1312), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 252/401 (62%), Positives = 316/401 (78%), Gaps = 10/401 (2%)

Query: 43  VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
           + +G++RR  +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+  +A     LA F QDFS
Sbjct: 86  MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEAESFEEADAGRVLAAFRQDFS 145

Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
           S IL++YR+GF+PIGD+  TSDV WGCMLRS QML AQALLF RLGR WRK   +P + +
Sbjct: 146 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPPNEK 205

Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
           Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR  + ET +  
Sbjct: 206 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDVKH 265

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
           +S  MA+++VSG EDGERGGAP++CI+D ++ C  FS+G  +W PILLLVPLVLGL+KVN
Sbjct: 266 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGDTEWPPILLLVPLVLGLDKVN 325

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
           PRYIP+L  TFTFPQSLGI+GGKPGASTYIVGVQE+   YLDPHDVQ V+ + K++ + D
Sbjct: 326 PRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVD 385

Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 402
           TS+YH + +R++ L+S+DPSLA+GFYC+DKDDFDDFC RA+KLA +SNGAPLFTVTQ+H+
Sbjct: 386 TSSYHCNTLRYVPLESLDPSLALGFYCQDKDDFDDFCIRATKLAGDSNGAPLFTVTQSHR 445

Query: 403 KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
                        G+ E  S  V+S  +  G  HEDDWQLL
Sbjct: 446 T---------NDCGIAETSSSTVIS-TEISGEEHEDDWQLL 476


>gi|297828133|ref|XP_002881949.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
 gi|297327788|gb|EFH58208.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
          Length = 467

 Score =  507 bits (1306), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 246/369 (66%), Positives = 307/369 (83%), Gaps = 2/369 (0%)

Query: 34  GSSETVKRLVTA-GSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
           G +  VKR+  A G++RR  ERVLGP+RTG+ S+TSD+WLLGVC+KI++DEA G+     
Sbjct: 74  GWTAFVKRVSMATGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISEDEASGETNTGC 133

Query: 93  GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
            LA F QDFSS+IL++YR+GF+P  D+  TSDV WGCM+RSSQML AQALLFHRLGR W 
Sbjct: 134 VLAAFQQDFSSKILMTYRRGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRSWT 193

Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
           K  + P ++EY+E L  FGDSE+S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA 
Sbjct: 194 KKSELP-EQEYLETLEPFGDSESSAFSIHNLIIAGSSYGLAAGSWVGPYAICRAWESLAC 252

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
            +R +T    Q+LPMA+++VSG EDGERGGAP++CI+DA++ C  FSKGQ++WTPILLLV
Sbjct: 253 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPILLLV 312

Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
           PLVLGL+ VNPRYIP+L  TFTFPQS+GI+GGKPGASTYIVGVQE+   YLDPH+VQ V+
Sbjct: 313 PLVLGLDSVNPRYIPSLIATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 372

Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
            + K+  + DTS+YH +VIR++ L+S+DPSLA+GFYCRDKDDFDDFC RASKLAE+SNGA
Sbjct: 373 TVNKETPDVDTSSYHCNVIRYVPLESLDPSLALGFYCRDKDDFDDFCLRASKLAEDSNGA 432

Query: 393 PLFTVTQTH 401
           PLFT+TQTH
Sbjct: 433 PLFTITQTH 441


>gi|15232213|ref|NP_191554.1| cysteine protease ATG4b [Arabidopsis thaliana]
 gi|75182325|sp|Q9M1Y0.1|ATG4B_ARATH RecName: Full=Cysteine protease ATG4b; AltName:
           Full=Autophagy-related protein 4 homolog b;
           Short=AtAPG4b; Short=Protein autophagy 4b
 gi|7019689|emb|CAB75814.1| putative protein [Arabidopsis thaliana]
 gi|19912145|dbj|BAB88384.1| autophagy 4b [Arabidopsis thaliana]
 gi|110742150|dbj|BAE99003.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646468|gb|AEE79989.1| cysteine protease ATG4b [Arabidopsis thaliana]
          Length = 477

 Score =  498 bits (1282), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 247/401 (61%), Positives = 312/401 (77%), Gaps = 10/401 (2%)

Query: 43  VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
           + +G++RR  +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+  +A     LA F QDFS
Sbjct: 87  MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 146

Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
           S IL++YR+GF+PIGD+  TSDV WGCMLRS QML AQALLF RLGR WRK   +P D +
Sbjct: 147 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 206

Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
           Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR  + ET    
Sbjct: 207 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 266

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
           +S  MA+++VSG EDGERGGAP++CI+D ++ C  FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 267 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 326

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
           PRYIP+L  TFTFPQSLGI+GGKPGASTYIVGVQE+   YLDPHDVQ V+ + K++ + D
Sbjct: 327 PRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVD 386

Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 402
           TS+YH + +R++ L+S+DPSLA+GFYC+ KDDFDDFC RA+KLA +SNGAPLFTVTQ+H+
Sbjct: 387 TSSYHCNTLRYVPLESLDPSLALGFYCQHKDDFDDFCIRATKLAGDSNGAPLFTVTQSHR 446

Query: 403 KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
           +  N   +   +        +         G  HEDDWQLL
Sbjct: 447 R--NDCGIAETSSSTETSTEIS--------GEEHEDDWQLL 477


>gi|115461386|ref|NP_001054293.1| Os04g0682000 [Oryza sativa Japonica Group]
 gi|75143803|sp|Q7XPW8.1|ATG4B_ORYSJ RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B
 gi|32488637|emb|CAE03430.1| OSJNBa0032F06.13 [Oryza sativa Japonica Group]
 gi|82470053|gb|ABB77259.1| autophagy 4 [Oryza sativa Indica Group]
 gi|113565864|dbj|BAF16207.1| Os04g0682000 [Oryza sativa Japonica Group]
 gi|215697216|dbj|BAG91210.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 478

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 256/435 (58%), Positives = 324/435 (74%), Gaps = 17/435 (3%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + N+S  S       S  ++R+V +GSM R     LG S+   SS   D+W L
Sbjct: 56  FEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF----LGTSKVLTSS---DVWFL 108

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE   FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228

Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           I GVQ++ A+YLDPH+VQ  ++I  D++EADTS+YH   +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSM 428
           KDDFDDFC+RA++L +++NGAPLFTV Q+    K+  N  DVLG +G    D ++ V  +
Sbjct: 409 KDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG----DGNINVEDL 464

Query: 429 NDAVGNAHEDDWQLL 443
            DA G   E++WQ+L
Sbjct: 465 -DASGETGEEEWQIL 478


>gi|3212867|gb|AAC23418.1| unknown protein [Arabidopsis thaliana]
          Length = 451

 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 232/377 (61%), Positives = 293/377 (77%), Gaps = 19/377 (5%)

Query: 34  GSSETVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN 92
           G +  VKR+ + +G++RR  ERVLGP+RTG+ S+TSD+WLLGVC+KI+ DE  G+     
Sbjct: 74  GWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGETDTGT 133

Query: 93  GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
            LA    DFSS+IL++YRKGF+P  D+  TSDV WGCM+RSSQML AQ            
Sbjct: 134 VLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQLP---------- 183

Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
                  ++EY+E L  FGDSE S FSIHNL+ AG +YGLAAGSWVGPYA+CR+WE+LA 
Sbjct: 184 -------EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLAC 236

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
            +R +T    Q+LPMA+++VSG EDGERGGAP++CI+DA++ C  FSKGQ++WTPI+LLV
Sbjct: 237 KKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPIILLV 296

Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
           PLVLGL+ VNPRYIP+L  TFTFPQS+GI+GGKPGASTYIVGVQE+   YLDPH+VQ V+
Sbjct: 297 PLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEVQQVV 356

Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
            + K+  + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFDDFC RA KLAEESNGA
Sbjct: 357 TVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFCLRALKLAEESNGA 416

Query: 393 PLFTVTQTHKKPVNHSD 409
           PLFTVTQTH   +N S+
Sbjct: 417 PLFTVTQTHTA-INQSN 432


>gi|147742963|sp|Q2XPP4.2|ATG4B_ORYSI RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B;
           Short=Protein autophagy 4; AltName: Full=OsAtg4
          Length = 478

 Score =  487 bits (1253), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 254/435 (58%), Positives = 322/435 (74%), Gaps = 17/435 (3%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + N+S  S          ++R+V +GSM R     LG S+   SS   D+W L
Sbjct: 56  FEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF----LGTSKVLTSS---DVWFL 108

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE   FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228

Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           I GVQ++ A+YLDPH+VQ  ++I  D++EADTS+YH   +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSM 428
           KDDFDDFC+RA++L +++NGAPLFTV Q+    K+  N  DVLG +G    D ++ V  +
Sbjct: 409 KDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG----DGNINVEDL 464

Query: 429 NDAVGNAHEDDWQLL 443
            DA G   E++WQ+L
Sbjct: 465 -DASGETGEEEWQIL 478


>gi|222629790|gb|EEE61922.1| hypothetical protein OsJ_16662 [Oryza sativa Japonica Group]
          Length = 892

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 246/407 (60%), Positives = 310/407 (76%), Gaps = 12/407 (2%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + N+S  S       S  ++R+V +GSM R     LG S+     ++SD+W L
Sbjct: 56  FEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF----LGTSKV---LTSSDVWFL 108

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE   FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228

Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           I GVQ++ A+YLDPH+VQ  ++I  D++EADTS+YH   +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETG 415
           KDDFDDFC+RA++L +++NGAPLFTV Q+    K+  N  DVLG +G
Sbjct: 409 KDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG 455


>gi|224994902|gb|ACN76570.1| cysteine proteinase [Triticum aestivum]
          Length = 484

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 258/430 (60%), Positives = 313/430 (72%), Gaps = 16/430 (3%)

Query: 20  DTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI 79
           D   RS          S  ++R V  GSM R     LG    G + +  D+W LG C+K+
Sbjct: 65  DQSGRSGGHASGSYAWSRVLRRFVGGGSMWRF----LG---CGKALTAGDVWFLGKCYKL 117

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           + +E+  D+    G A F +DFSSR+ I+YRKGFD I DSK+TSDV WGCM+RSSQMLVA
Sbjct: 118 SSEESSSDSDSEGGHAAFLEDFSSRVWITYRKGFDVISDSKLTSDVNWGCMVRSSQMLVA 177

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVG 199
           QAL+FH LGR WRKP Q P D E+  ILHLFGDSE   FSIHNLLQAGK+YGLAAGSWVG
Sbjct: 178 QALIFHHLGRSWRKPAQNPSDPEHTRILHLFGDSEVCAFSIHNLLQAGKSYGLAAGSWVG 237

Query: 200 PYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 257
           PYAMCR+W+ L R  R +  +    +S PM +YVVSGDEDGERGGAPVVCID A++ C  
Sbjct: 238 PYAMCRAWQTLIRTNREQPEVINRNESFPMVLYVVSGDEDGERGGAPVVCIDVAAQLCYD 297

Query: 258 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 317
           F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPGASTYI GVQ+
Sbjct: 298 FNKGQSAWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGASTYIAGVQD 357

Query: 318 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 377
           + A+YLDPH+VQ  +NI  D+LEADTS+YH   +R + LD IDPSLAIGFYCRDKDDFDD
Sbjct: 358 DRALYLDPHEVQLAVNIASDNLEADTSSYHCSTVRDMPLDLIDPSLAIGFYCRDKDDFDD 417

Query: 378 FCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETG-GVPEDDSLGVMSMNDAVG 433
           FC+RAS+LAE++NGAPLFTV Q+    K+  N  D  G +G GV   D++    + D  G
Sbjct: 418 FCSRASELAEQANGAPLFTVVQSVQPSKQMYNQDDGSGCSGYGV--SDNIDTEDL-DGSG 474

Query: 434 NAHEDDWQLL 443
              ED+WQ+L
Sbjct: 475 ETGEDEWQIL 484


>gi|218195841|gb|EEC78268.1| hypothetical protein OsI_17962 [Oryza sativa Indica Group]
          Length = 912

 Score =  476 bits (1225), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 244/407 (59%), Positives = 308/407 (75%), Gaps = 12/407 (2%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + N+S  S          ++R+V +GSM R     LG S+     ++SD+W L
Sbjct: 56  FEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF----LGTSKV---LTSSDVWFL 108

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 109 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 168

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE   FSIHNLLQAG +YGLA
Sbjct: 169 SQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 228

Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 229 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 288

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 289 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 348

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           I GVQ++ A+YLDPH+VQ  ++I  D++EADTS+YH   +R + LD IDPSLAIGFYCRD
Sbjct: 349 IAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 408

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETG 415
           KDDFDDFC+RA++L +++NGAPLFTV Q+    K+  N  DVLG +G
Sbjct: 409 KDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG 455


>gi|221137004|ref|NP_001137488.1| autophagy-related 4 [Zea mays]
 gi|195620628|gb|ACG32144.1| cysteine protease ATG4B [Zea mays]
 gi|216963236|gb|ACJ73912.1| autophagy-related 4 variant 1 [Zea mays]
 gi|219886349|gb|ACL53549.1| unknown [Zea mays]
 gi|414584729|tpg|DAA35300.1| TPA: autophagy 4a variant 2 isoform 1 [Zea mays]
 gi|414584730|tpg|DAA35301.1| TPA: autophagy 4a variant 2 isoform 2 [Zea mays]
          Length = 492

 Score =  475 bits (1223), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 252/433 (58%), Positives = 319/433 (73%), Gaps = 20/433 (4%)

Query: 17  STPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 76
           S+P    RS  S     G S  ++R V +GSM R+    LG  R     ++SD+W LG C
Sbjct: 74  SSPACDARSTKSSSGSYGLSRILRRFVGSGSMWRL----LGCGRV---LTSSDVWFLGKC 126

Query: 77  HKIAQDEALGDAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 135
           +K++ +E     + ++   A F +DFSSRI I+YRKGFD I DSK+TSDV WGCM+RSSQ
Sbjct: 127 YKVSPEEEESGDSESDSGHAAFLEDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQ 186

Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
           MLVAQAL+FH LGR WRKP +KP++ +Y+ +LHLFGDSE   FSIHNLLQAG+ YGLAAG
Sbjct: 187 MLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLHLFGDSEACAFSIHNLLQAGRNYGLAAG 246

Query: 196 SWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
           SW+GPYAMCR+W+ L R  R  A+   G ++ PMA+YVVSGDEDGERGGAPVVCID A++
Sbjct: 247 SWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGERGGAPVVCIDVAAQ 306

Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
            CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI 
Sbjct: 307 LCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLGILGGKPGTSTYIA 366

Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
           GVQ++ A+YLDPH+VQ  ++I  D+LEADTS+YH  V+R + L+ IDPSLAIGFYCRDKD
Sbjct: 367 GVQDDRALYLDPHEVQMTVDIALDNLEADTSSYHCSVVRALALEQIDPSLAIGFYCRDKD 426

Query: 374 DFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSMND 430
           DFDDFC+RAS+LAE++NGAPLFTV Q+    K+     D LG +G    +D       +D
Sbjct: 427 DFDDFCSRASELAEKANGAPLFTVVQSIEPSKQMYKQDDGLGCSGSSMAND-------DD 479

Query: 431 AVGNAHEDDWQLL 443
             G+   ++WQ+L
Sbjct: 480 LDGSGEAEEWQIL 492


>gi|224994904|gb|ACN76571.1| cysteine proteinase [Triticum aestivum]
          Length = 486

 Score =  474 bits (1219), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 256/434 (58%), Positives = 311/434 (71%), Gaps = 24/434 (5%)

Query: 20  DTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI 79
           D   RS          S  ++R V  GSM R     LG    G + + +D+  LG C+K+
Sbjct: 67  DQSGRSGGHASGSYAWSRVLRRFVGGGSMWRF----LG---CGKALTAADVQFLGKCYKL 119

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           + +E+  D+    G A F +DFSSRI I+YRKGFD I DSK+TSDV WGCM+RSSQMLVA
Sbjct: 120 SSEESSSDSDSEGGHAAFLEDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVA 179

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVG 199
           QAL+FH LGR WRKP Q P + EY+ ILHLFGDSE   FSIHNLLQAGK+YGLAAGSWVG
Sbjct: 180 QALIFHHLGRSWRKPAQNPSNPEYIRILHLFGDSEACAFSIHNLLQAGKSYGLAAGSWVG 239

Query: 200 PYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 257
           PYAMCR+W+ L R  R +  +    +S PMA+YVVSGDEDGERGGAPVVCID A++ C  
Sbjct: 240 PYAMCRAWQTLIRTNREQPEVINRNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYD 299

Query: 258 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 317
           F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPGASTYI GVQ+
Sbjct: 300 FNKDQSAWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGASTYIAGVQD 359

Query: 318 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 377
           + A+YLDPH+VQ  +NI  D+LEADTS+YH   +R + LD IDPSLAIGFYCRDKDDFDD
Sbjct: 360 DRALYLDPHEVQLAVNIASDNLEADTSSYHCSTVRDMPLDLIDPSLAIGFYCRDKDDFDD 419

Query: 378 FCARASKLAEESNGAPLFTVTQT---HKKPVNHSD-----VLGETGGVPEDDSLGVMSMN 429
           FC+RAS+LAE++NGAPLFTV Q+    K+  N  D       G +G +  +D        
Sbjct: 420 FCSRASELAEQANGAPLFTVVQSVQPSKQMYNRDDGSGCSGYGVSGNIDAEDL------- 472

Query: 430 DAVGNAHEDDWQLL 443
           D  G   ED+WQ+L
Sbjct: 473 DGSGETGEDEWQIL 486


>gi|357166768|ref|XP_003580841.1| PREDICTED: cysteine protease ATG4B-like [Brachypodium distachyon]
          Length = 493

 Score =  473 bits (1217), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 247/412 (59%), Positives = 308/412 (74%), Gaps = 12/412 (2%)

Query: 36  SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 95
           S  ++R V  GSM R     LG ++     +  D+W LG C+K + +E+  D   ++G A
Sbjct: 90  SRALRRFVGGGSMWRF----LGCAKV---LTNGDVWFLGKCYKFSSEESSSDLDTDSGHA 142

Query: 96  EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
            F +DFSSRI ++YRKGFD I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP 
Sbjct: 143 AFLEDFSSRIWVTYRKGFDAISDSKFTSDVNWGCMVRSSQMLVAQALMFHHLGRSWRKPS 202

Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 215
           QKP + EY+ ILHLFGDSE   FS+HNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R  R
Sbjct: 203 QKPCNPEYIRILHLFGDSEVCAFSVHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNR 262

Query: 216 A--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
              E   G +S PMA+YVVSGDEDGERGGAPVVCID A++ C  F+K Q+ W+PILLLVP
Sbjct: 263 EQPEVSNGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKDQSTWSPILLLVP 322

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           LVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STYI G+Q++ A+YLDPHDVQ  +N
Sbjct: 323 LVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTYIAGIQDDRALYLDPHDVQMAVN 382

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
           I  D+L+ADTS+YH   +R + LD +DPSLAIGFYCRDKDDFDDFC+RAS+L  ++NGAP
Sbjct: 383 IASDNLDADTSSYHCSTVRDMALDLLDPSLAIGFYCRDKDDFDDFCSRASELVVKANGAP 442

Query: 394 LFTVTQTHK--KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
           LFTV Q+ +  K + + D    + G    D++ +  + D  G A E++WQ+L
Sbjct: 443 LFTVVQSIQPSKQMYNQDDGSGSSGDGMADNINMEDL-DGSGEAGEEEWQIL 493


>gi|147742949|sp|A2XHJ5.1|ATG4A_ORYSI RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|125544166|gb|EAY90305.1| hypothetical protein OsI_11880 [Oryza sativa Indica Group]
          Length = 473

 Score =  471 bits (1213), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 247/434 (56%), Positives = 312/434 (71%), Gaps = 16/434 (3%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + +R L         S  ++R+   GSM R     LG S+   + ++SD+W L
Sbjct: 52  FEAHQDSSAHRPLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 104

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E    +   +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 105 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 164

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WRKP QKP+  EY+ ILH+FGDSE   FSIHNLLQAGK+YGLA
Sbjct: 165 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLA 224

Query: 194 AGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L R  R   E   G  + PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 225 AGSWVGPYAMCRAWQTLVRTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVA 284

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 285 AQLCCDFNKGQSTWSPILLLVPLVLGLDKLNPRYIPLLKETFTFPQSLGILGGKPGTSTY 344

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           + GVQ++  +YLDPH+VQ  ++I  D+LEADTS+YH   +R + LD IDPSLAIGFYCRD
Sbjct: 345 VAGVQDDRVLYLDPHEVQLAVDIAADNLEADTSSYHCSTVRDLALDLIDPSLAIGFYCRD 404

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMN-- 429
           KDDFDDFC+RAS+L +++NGAPLFTV Q+ +      +    +G     D + ++++   
Sbjct: 405 KDDFDDFCSRASELVDKANGAPLFTVMQSVQPSKQMYNEESSSG-----DGMDIINVEGL 459

Query: 430 DAVGNAHEDDWQLL 443
           D  G   E++WQ+L
Sbjct: 460 DGSGETGEEEWQIL 473


>gi|221137006|ref|NP_001137489.1| autophagy-related 4b [Zea mays]
 gi|194701156|gb|ACF84662.1| unknown [Zea mays]
 gi|195657359|gb|ACG48147.1| cysteine protease ATG4B [Zea mays]
 gi|216963250|gb|ACJ73914.1| autophagy-related 4b variant 1 [Zea mays]
 gi|413920007|gb|AFW59939.1| autophagy 4b variant 1Cysteine protease ATG4B [Zea mays]
          Length = 492

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 252/417 (60%), Positives = 312/417 (74%), Gaps = 23/417 (5%)

Query: 36  SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
           S  ++R V +GSM R+    LG +R     ++ D+W LG C++++ ++E  G +  ++G 
Sbjct: 90  SRILRRFVGSGSMWRL----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGH 142

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
           A F +DFSSRI I+YRKGFD I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202

Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
            +KP+D +Y+ +LHLFGDSE   FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R  
Sbjct: 203 SEKPYDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTN 262

Query: 215 R--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
           R  A+   G ++ PMA+YVVSGDEDGERGGAPV CID A++ CS F+KGQ  W+PILLL+
Sbjct: 263 REQADAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLI 322

Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
           PLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ  +
Sbjct: 323 PLVLGLDKINPRYIPLLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAV 382

Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
           +I  D+LEADTS+YH  V+R + L+ IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGA
Sbjct: 383 DIAPDNLEADTSSYHCSVVRDLALEQIDPSLAIGFYCRDKDDFDDFCSRASELAEKANGA 442

Query: 393 PLFTVTQT---HKKPVNHSDVLGETGG---VPEDDSLGVMSMNDAVGNAHEDDWQLL 443
           PLFTV Q+    K+     D L    G     ED  L      DA G A E +WQ+L
Sbjct: 443 PLFTVMQSVQPSKQMYKQDDGLCCCSGSSMANEDYDL------DASGEAGE-EWQIL 492


>gi|75138024|sp|Q75KP8.1|ATG4A_ORYSJ RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|40539014|gb|AAR87271.1| putative autophagy protein (with alternative splicing) [Oryza
           sativa Japonica Group]
 gi|108708571|gb|ABF96366.1| Peptidase family C54 containing protein, expressed [Oryza sativa
           Japonica Group]
 gi|125586519|gb|EAZ27183.1| hypothetical protein OsJ_11120 [Oryza sativa Japonica Group]
 gi|215769128|dbj|BAH01357.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 474

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 252/435 (57%), Positives = 312/435 (71%), Gaps = 18/435 (4%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + NRSL         S  ++R+   GSM R     LG S+   + ++SD+W L
Sbjct: 53  FEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 105

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E    +   +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 106 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 165

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WRKP QKP+  EY+ ILH+FGDSE   FSIHNLLQAGK+YGLA
Sbjct: 166 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLA 225

Query: 194 AGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L    R   E   G  + PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 226 AGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVA 285

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T TFPQSLGI+GGKPG STY
Sbjct: 286 AQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETLTFPQSLGILGGKPGTSTY 345

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           I GVQ++ A+YLDPH+VQ  ++I  D+LEA TS+YH   +R + LD IDPSLAIGFYCRD
Sbjct: 346 IAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSLAIGFYCRD 405

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSM 428
           KDDFDDFC+RAS+L +++NGAPLFTV Q+    K+  N     G+  G+   DS+ V  +
Sbjct: 406 KDDFDDFCSRASELVDKANGAPLFTVVQSVQPSKQMYNEESSSGD--GM---DSINVEGL 460

Query: 429 NDAVGNAHEDDWQLL 443
            D  G   E++WQ+L
Sbjct: 461 -DGSGETGEEEWQIL 474


>gi|90399070|emb|CAJ86292.1| H0124B04.9 [Oryza sativa Indica Group]
          Length = 1216

 Score =  461 bits (1185), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 244/440 (55%), Positives = 308/440 (70%), Gaps = 45/440 (10%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + N+S  S          ++R+V +GSM R     LG S+     ++SD+W L
Sbjct: 327 FEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF----LGTSKV---LTSSDVWFL 379

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD I DSK TSDV WGCM+RS
Sbjct: 380 GKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRS 439

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE   FSIHNLLQAG +YGLA
Sbjct: 440 SQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLA 499

Query: 194 AGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 500 AGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVA 559

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TFTFPQSLGI+GGKPG STY
Sbjct: 560 AQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTY 619

Query: 312 IVGVQEESAIYLDPHDVQ---------------------------------PVINIGKDD 338
           I GVQ++ A+YLDPH+VQ                                   ++I  D+
Sbjct: 620 IAGVQDDRALYLDPHEVQMSATVIIWLFLQYPFYAWNPFCYGSYSGVFSTSQAVDIAADN 679

Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 398
           +EADTS+YH   +R + LD IDPSLAIGFYCRDKDDFDDFC+RA++L +++NGAPLFTV 
Sbjct: 680 IEADTSSYHCSTVRDLALDLIDPSLAIGFYCRDKDDFDDFCSRATELVDKANGAPLFTVV 739

Query: 399 QT---HKKPVNHSDVLGETG 415
           Q+    K+  N  DVLG +G
Sbjct: 740 QSVQPSKQMYNQDDVLGISG 759


>gi|40539015|gb|AAR87272.1| putative autophagy protein (with alternative splicing) [Oryza
           sativa Japonica Group]
 gi|108708572|gb|ABF96367.1| Peptidase family C54 containing protein, expressed [Oryza sativa
           Japonica Group]
          Length = 505

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 252/466 (54%), Positives = 312/466 (66%), Gaps = 49/466 (10%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + NRSL         S  ++R+   GSM R     LG S+   + ++SD+W L
Sbjct: 53  FEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 105

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E    +   +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 106 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 165

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA 193
           SQMLVAQAL+FH LGR WRKP QKP+  EY+ ILH+FGDSE   FSIHNLLQAGK+YGLA
Sbjct: 166 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLA 225

Query: 194 AGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
           AGSWVGPYAMCR+W+ L    R   E   G  + PMA+YVVSGDEDGERGGAPVVCID A
Sbjct: 226 AGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVA 285

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           ++ C  F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T TFPQSLGI+GGKPG STY
Sbjct: 286 AQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETLTFPQSLGILGGKPGTSTY 345

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           I GVQ++ A+YLDPH+VQ  ++I  D+LEA TS+YH   +R + LD IDPSLAIGFYCRD
Sbjct: 346 IAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSLAIGFYCRD 405

Query: 372 K-------------------------------DDFDDFCARASKLAEESNGAPLFTVTQT 400
           K                               DDFDDFC+RAS+L +++NGAPLFTV Q+
Sbjct: 406 KGELLLPDKMLGHHLSSLQSWFSYLLCLSAYVDDFDDFCSRASELVDKANGAPLFTVVQS 465

Query: 401 ---HKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
               K+  N     G+  G+   DS+ V  + D  G   E++WQ+L
Sbjct: 466 VQPSKQMYNEESSSGD--GM---DSINVEGL-DGSGETGEEEWQIL 505


>gi|194696780|gb|ACF82474.1| unknown [Zea mays]
 gi|413920008|gb|AFW59940.1| autophagy 4b variant 3 [Zea mays]
          Length = 462

 Score =  449 bits (1154), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 237/371 (63%), Positives = 285/371 (76%), Gaps = 15/371 (4%)

Query: 81  QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQ 140
           ++E  G +  ++G A F +DFSSRI I+YRKGFD I  SK+TSDV WGCM+RSSQMLVAQ
Sbjct: 99  EEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQ 158

Query: 141 ALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
           AL+FH LGR WRKP +KP+D +Y+ +LHLFGDSE   FSIHNLLQAG+ YGLAAGSWVGP
Sbjct: 159 ALIFHHLGRSWRKPSEKPYDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGP 218

Query: 201 YAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
           YAMCR+W+ L R  R  A+   G ++ PMA+YVVSGDEDGERGGAPV CID A++ CS F
Sbjct: 219 YAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNF 278

Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
           +KGQ  W+PILLL+PLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI GVQE+
Sbjct: 279 NKGQCTWSPILLLIPLVLGLDKINPRYIPLLKETFKFPQSLGILGGKPGTSTYIAGVQED 338

Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
            A+YLDPHDVQ  ++I  D+LEADTS+YH  V+R + L+ IDPSLAIGFYCRDKDDFDDF
Sbjct: 339 RALYLDPHDVQMAVDIAPDNLEADTSSYHCSVVRDLALEQIDPSLAIGFYCRDKDDFDDF 398

Query: 379 CARASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGG---VPEDDSLGVMSMNDAV 432
           C+RAS+LAE++NGAPLFTV Q+    K+     D L    G     ED  L      DA 
Sbjct: 399 CSRASELAEKANGAPLFTVMQSVQPSKQMYKQDDGLCCCSGSSMANEDYDL------DAS 452

Query: 433 GNAHEDDWQLL 443
           G A E +WQ+L
Sbjct: 453 GEAGE-EWQIL 462


>gi|168010849|ref|XP_001758116.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690572|gb|EDQ76938.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 356

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 211/357 (59%), Positives = 281/357 (78%), Gaps = 4/357 (1%)

Query: 46  GSMRRIHERVLGPSRTGISSST-SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR 104
           GSMRR+ E +LGP  T  ++S+ S+IW+LG+C+K++ D    +        EF  DF+SR
Sbjct: 1   GSMRRLQELLLGPRFTAANASSGSEIWVLGLCYKVSADPN-NETLSVQAFEEFISDFTSR 59

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
           I I+YRKGF+ +G SK+TSDVGWGCMLRS QML+AQAL+ H LGR WR+   +P  + Y+
Sbjct: 60  IWITYRKGFECVGQSKLTSDVGWGCMLRSGQMLLAQALVCHYLGRSWRREPGQPCSQAYL 119

Query: 165 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GC 222
           +IL  FGDSE+ PFSIHNLL+AG  +GLAAGSW+GPYA+CR+ EALAR  R ++    G 
Sbjct: 120 QILQTFGDSESCPFSIHNLLEAGHPFGLAAGSWLGPYALCRTLEALARADREQSQKKGGK 179

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
           ++LP A+YVVSG+ +GERGGAPV+C++D +  CS + +   +WTP+L+LVPLVLGL+KVN
Sbjct: 180 RALPFAVYVVSGEAEGERGGAPVLCVEDVATLCSKWREPTEEWTPLLVLVPLVLGLDKVN 239

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
           PRY+P+LR TFTFPQSLGI GGKPGASTY++GVQ+E A+YLDPH+ Q V+ +  ++LE D
Sbjct: 240 PRYLPSLRATFTFPQSLGIAGGKPGASTYLIGVQDEQAMYLDPHENQQVVPVTPENLELD 299

Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
           TS+YH   +R + LD+IDPSLAIGFYCRD+ +FDD CAR+S+LA++SNGAP+FTV +
Sbjct: 300 TSSYHCSTVRRLPLDTIDPSLAIGFYCRDRAEFDDLCARSSELAKQSNGAPMFTVAE 356


>gi|315259988|gb|ADT92194.1| autophagy-related 4b [Zea mays]
          Length = 595

 Score =  422 bits (1086), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 216/342 (63%), Positives = 268/342 (78%), Gaps = 10/342 (2%)

Query: 36  SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
           S  ++R V +GSM R+    LG +R     ++ D+W LG C++++ ++E  G +  ++G 
Sbjct: 90  SRILRRFVGSGSMWRL----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGH 142

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
           A F +DFSSRI I+YRKGFD I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202

Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
            +KP+D +Y+ +LHLFGDSE   FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R  
Sbjct: 203 SEKPYDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTN 262

Query: 215 R--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
           R  A+   G ++ PMA+YVVSGDEDGERGGAPV CID A++ CS F+KGQ  W+PILLL+
Sbjct: 263 REQADAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLI 322

Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
           PLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ  +
Sbjct: 323 PLVLGLDKINPRYIPLLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAV 382

Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
           +I  D+LEADTS+YH  V+R + L+ IDPSLAIGFYCRDK D
Sbjct: 383 DIAPDNLEADTSSYHCSVVRDLALEQIDPSLAIGFYCRDKGD 424


>gi|216963242|gb|ACJ73913.1| autophagy-related 4a variant 2 [Zea mays]
          Length = 429

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 219/359 (61%), Positives = 273/359 (76%), Gaps = 10/359 (2%)

Query: 17  STPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 76
           S+P    RS  S     G S  ++R V +GSM R+    LG  R     ++SD+W LG C
Sbjct: 74  SSPACDARSTKSSSGSYGLSRILRRFVGSGSMWRL----LGCGRV---LTSSDVWFLGKC 126

Query: 77  HKIAQDEALGDAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 135
           +K++ +E     + ++   A F +DFSSRI I+YRKGFD I DSK+TSDV WGCM+RSSQ
Sbjct: 127 YKVSPEEEESGDSESDSGHAAFLEDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQ 186

Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
           MLVAQAL+FH LGR WRKP +KP++ +Y+ +LHLFGDSE   FSIHNLLQAG+ YGLAAG
Sbjct: 187 MLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLHLFGDSEACAFSIHNLLQAGRNYGLAAG 246

Query: 196 SWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
           SW+GPYAMCR+W+ L R  R  A+   G ++ PMA+YVVSGDEDGERGGAPVVCID A++
Sbjct: 247 SWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGERGGAPVVCIDVAAQ 306

Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
            CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLGI+GGKPG STYI 
Sbjct: 307 LCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLGILGGKPGTSTYIA 366

Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 372
           GVQ++ A+YLDPH+VQ  ++I  D+LEADTS+YH  V+R + L+ IDPSLAIGFYCRDK
Sbjct: 367 GVQDDRALYLDPHEVQMTVDIALDNLEADTSSYHCSVVRALALEQIDPSLAIGFYCRDK 425


>gi|168036750|ref|XP_001770869.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677928|gb|EDQ64393.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 346

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 197/339 (58%), Positives = 267/339 (78%), Gaps = 5/339 (1%)

Query: 64  SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           SSS  +IW+LG+C+K++ D A  +A   +   EF  DFSSRI I+YRKGF+ +G+SK+TS
Sbjct: 4   SSSGGEIWVLGICYKVSAD-ANDEAVSAHAFEEFLNDFSSRIWITYRKGFESLGESKLTS 62

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           DVGWGCMLRS Q+L+AQAL+ H LGR WR+   +   +EY++IL  FGDSE+  FSIHNL
Sbjct: 63  DVGWGCMLRSGQILLAQALVCHYLGRTWRRNACQECLQEYLQILQSFGDSESCSFSIHNL 122

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARC---QRAETGLGCQSLPMAIYVVSGDEDGER 240
           L+AG+ +GLAAGSW+GPYA+CR+ EALA+    Q A+ G G ++LP A+YVVSG+ +G+R
Sbjct: 123 LEAGRPFGLAAGSWLGPYALCRTLEALAKADEDQNAKKG-GKRALPFAVYVVSGETEGDR 181

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
           GGAPV C++DA+  CS + +   +W+P+++LVPLVLGL+K+NPRY+P+LR TFT PQSLG
Sbjct: 182 GGAPVRCVEDAAVLCSKWGEATEEWSPLVVLVPLVLGLDKLNPRYLPSLRATFTLPQSLG 241

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           + GGKPGAST+++GVQ + A+YLDPH+ Q V  +  ++LE DTS YH  V+R + LDSID
Sbjct: 242 VAGGKPGASTHLIGVQGDQAMYLDPHENQQVFAVTPENLELDTSFYHCSVVRRLPLDSID 301

Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
           PSLAIGFYCRD+ +FDD CAR+S+L ++ NGAP+FTV +
Sbjct: 302 PSLAIGFYCRDRAEFDDLCARSSELVKQYNGAPIFTVAE 340


>gi|302787965|ref|XP_002975752.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
 gi|300156753|gb|EFJ23381.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
          Length = 358

 Score =  394 bits (1011), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 202/375 (53%), Positives = 268/375 (71%), Gaps = 29/375 (7%)

Query: 36  SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDAA 89
           +  V+R V  G +RRI E ++G       SS S IWLLG C+++      + DE   ++ 
Sbjct: 2   TAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKEST 59

Query: 90  GNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
            ++   +A+F  DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HRL
Sbjct: 60  SSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHRL 119

Query: 148 GRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
           GR WR+  ++P+ REY+EILH F DS +   PFSIHN ++AG  YGLAAGSW+GPYA+C 
Sbjct: 120 GRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALCH 178

Query: 206 SWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
           + EALAR      G G +    +A+YVVSGD  GERGGAPV+   D +  C         
Sbjct: 179 AIEALAR----NDGRGREGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC--------- 225

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
             P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YLD
Sbjct: 226 --PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYLD 283

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH+VQ V+++  + LE D+++YH  V+R + LD+IDPSLA+GFYCR+++D DD CARAS+
Sbjct: 284 PHEVQKVVSVSGESLEFDSASYHCSVVRKMLLDAIDPSLALGFYCRNREDLDDLCARASE 343

Query: 385 LAEESNGAPLFTVTQ 399
           LA +SNGAP+FTV +
Sbjct: 344 LASQSNGAPMFTVAE 358


>gi|302783857|ref|XP_002973701.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
 gi|300158739|gb|EFJ25361.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
          Length = 358

 Score =  394 bits (1011), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 202/375 (53%), Positives = 268/375 (71%), Gaps = 29/375 (7%)

Query: 36  SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDAA 89
           +  V+R V  G +RRI E ++G       SS S IWLLG C+++      + DE   ++ 
Sbjct: 2   TAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKEST 59

Query: 90  GNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
            ++   +A+F  DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HRL
Sbjct: 60  SSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHRL 119

Query: 148 GRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
           GR WR+  ++P+ REY+EILH F DS +   PFSIHN ++AG  YGLAAGSW+GPYA+C 
Sbjct: 120 GRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALCH 178

Query: 206 SWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
           + EALAR      G G Q    +A+YVVSGD  GERGGAPV+   D +  C         
Sbjct: 179 AIEALAR----NDGRGRQGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC--------- 225

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
             P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YLD
Sbjct: 226 --PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYLD 283

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH+VQ V+++  + LE D+++YH  V+R + LD+IDPSLA+GFYCR++++ DD CARAS+
Sbjct: 284 PHEVQKVVSVSGESLEFDSASYHCSVVRKMPLDAIDPSLALGFYCRNREELDDLCARASE 343

Query: 385 LAEESNGAPLFTVTQ 399
           LA +SNGAP+FTV +
Sbjct: 344 LASQSNGAPMFTVAE 358


>gi|186511209|ref|NP_001118859.1| cysteine protease ATG4b [Arabidopsis thaliana]
 gi|62318602|dbj|BAD95023.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646469|gb|AEE79990.1| cysteine protease ATG4b [Arabidopsis thaliana]
          Length = 267

 Score =  314 bits (804), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 156/244 (63%), Positives = 198/244 (81%)

Query: 43  VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
           + +G++RR  +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+  +A     LA F QDFS
Sbjct: 1   MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 60

Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
           S IL++YR+GF+PIGD+  TSDV WGCMLRS QML AQALLF RLGR WRK   +P D +
Sbjct: 61  SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 120

Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
           Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR  + ET    
Sbjct: 121 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 180

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
           +S  MA+++VSG EDGERGGAP++CI+D ++ C  FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 181 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 240

Query: 283 PRYI 286
           PR++
Sbjct: 241 PRFV 244


>gi|79597805|ref|NP_850722.3| cysteine protease ATG4b [Arabidopsis thaliana]
 gi|332646467|gb|AEE79988.1| cysteine protease ATG4b [Arabidopsis thaliana]
          Length = 360

 Score =  308 bits (790), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 155/244 (63%), Positives = 196/244 (80%)

Query: 43  VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
           + +G++RR  +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+  +A     LA F QDFS
Sbjct: 87  MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 146

Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
           S IL++YR+GF+PIGD+  TSDV WGCMLRS QML AQALLF RLGR WRK   +P D +
Sbjct: 147 SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 206

Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
           Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR  + ET    
Sbjct: 207 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 266

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
           +S  MA+++VSG EDGERGGAP++CI+D ++ C  FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 267 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 326

Query: 283 PRYI 286
           P + 
Sbjct: 327 PSHF 330


>gi|457866467|dbj|BAM93578.1| autophagy related protein 4 [Vigna unguiculata]
          Length = 219

 Score =  287 bits (734), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 158/220 (71%), Positives = 178/220 (80%), Gaps = 4/220 (1%)

Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 286
           MAIYVVSGDEDGERGGAPVVCI+DA +HCS FS+GQA WTP+LLLVPLVLGL+KVNPRYI
Sbjct: 1   MAIYVVSGDEDGERGGAPVVCIEDAFKHCSEFSRGQAAWTPLLLLVPLVLGLDKVNPRYI 60

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TST 345
           P L  TF FPQSLGI+GGKPGASTYI+GVQ E A YLDPHDVQ V+NI  D  E + TS+
Sbjct: 61  PLLHSTFKFPQSLGIMGGKPGASTYIIGVQSEKAFYLDPHDVQTVVNISGDTQEPNSTSS 120

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH--KK 403
           YH +V+RHI LDSIDPSLAIGFYCRDKDDFDDFC++ASKLAEESNGAPLFTV Q+    K
Sbjct: 121 YHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKLAEESNGAPLFTVAQSRSFSK 180

Query: 404 PVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
            V+ +DV G+  G  ED  LG    +D     +EDDWQLL
Sbjct: 181 QVSGNDVSGDNTGFEEDAFLGT-DHDDNDAGTNEDDWQLL 219


>gi|413917967|gb|AFW57899.1| hypothetical protein ZEAMMB73_419246 [Zea mays]
          Length = 290

 Score =  278 bits (710), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 143/209 (68%), Positives = 172/209 (82%), Gaps = 2/209 (0%)

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           S  G GCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +L LFGDSE   FSIHN
Sbjct: 14  SLTGKGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLRLFGDSEACAFSIHN 73

Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGER 240
           LLQA + YGLAAGSW+GPYAMCR+W+ L R  R  A+   G ++ PMA+YVVSGDEDGER
Sbjct: 74  LLQARRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGER 133

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
           GGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLG
Sbjct: 134 GGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLG 193

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
           I+GGKPG STYI GVQ++ A+YLDPH+VQ
Sbjct: 194 ILGGKPGTSTYIAGVQDDRALYLDPHEVQ 222


>gi|414869447|tpg|DAA48004.1| TPA: hypothetical protein ZEAMMB73_510335 [Zea mays]
 gi|414869466|tpg|DAA48023.1| TPA: hypothetical protein ZEAMMB73_786179 [Zea mays]
          Length = 472

 Score =  276 bits (705), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 142/205 (69%), Positives = 168/205 (81%), Gaps = 2/205 (0%)

Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 172
           FD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR  RKP +KP++ +Y+ +LHLFGD
Sbjct: 34  FDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSCRKPPEKPYNPDYIGVLHLFGD 93

Query: 173 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIY 230
           SE   FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L    R  A+   G ++ PMA+Y
Sbjct: 94  SEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIHTNREQADAVDGKENFPMALY 153

Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
           VVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+
Sbjct: 154 VVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLK 213

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGV 315
            TF FPQSL I+GGKPG STYI GV
Sbjct: 214 ETFMFPQSLCILGGKPGTSTYIAGV 238


>gi|353441084|gb|AEQ94126.1| putative cysteine protease [Elaeis guineensis]
          Length = 169

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 113/165 (68%), Positives = 130/165 (78%)

Query: 53  ERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKG 112
           + +LG S T   SSTSDIWLLG C+K++ +E+ G     NG A F +DFSSRI I+YRKG
Sbjct: 2   QELLGTSSTDALSSTSDIWLLGKCYKLSPEESSGGTDHGNGSAAFLEDFSSRIWITYRKG 61

Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 172
           FD IGDSK TSDV WGCM+RSSQMLVAQALLFH LGR WRKP QKP D +Y+EILHLFGD
Sbjct: 62  FDAIGDSKFTSDVRWGCMIRSSQMLVAQALLFHHLGRSWRKPSQKPHDSKYIEILHLFGD 121

Query: 173 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
           SE   FSIHNLL+AGKAYGLAA  WVGPYAMCR+WE + R +R +
Sbjct: 122 SEACAFSIHNLLEAGKAYGLAAREWVGPYAMCRTWETITRAKREQ 166


>gi|384253649|gb|EIE27123.1| peptidase C54 [Coccomyxa subellipsoidea C-169]
          Length = 362

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 139/364 (38%), Positives = 194/364 (53%), Gaps = 44/364 (12%)

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
           D  SRI ++YR+GF PI  S ITSDVGWGC LRS QML+AQAL++H +GR WR+ L+  +
Sbjct: 23  DLMSRIWMTYRRGFPPICGSGITSDVGWGCTLRSGQMLLAQALVYHLVGRQWRRKLEAAY 82

Query: 160 DREYVEILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
             E  ++L  FGD   E  PFSIHN+   G+ +G+ AG W+GP  +C +   +   +   
Sbjct: 83  PEEVAQVLQWFGDQACEQRPFSIHNMCTTGQTHGVKAGDWLGPSGLCHTLADMVN-KVQP 141

Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG------QADWTPILLL 271
            GL C+     +    G      GGAPV+C    SR  + F  G      +   +     
Sbjct: 142 GGLQCR-----VVATFG------GGAPVLC---TSRLATAFEGGADRSGGEVGSSGSEES 187

Query: 272 VPLVLGLE-----------KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
            P   GL            K+NPRY   L+   T+PQS+GIVGG+P +S Y +G+Q++  
Sbjct: 188 GPAGQGLLLLIPLMLGLNGKINPRYCAQLQQLLTWPQSVGIVGGRPSSSLYFIGLQDQHV 247

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
           +YLDPH+VQ V +       AD  TY    +R + L +IDPSLAIGFYC    DF+D C 
Sbjct: 248 LYLDPHEVQEVASEA-----ADLDTYFCSSLRLMPLANIDPSLAIGFYCSSLSDFEDLCG 302

Query: 381 RASKLAEESNGAPLFT-VTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 439
           R   L  E+  APL   V +   +P   ++ +    G+P D      S     G A+ D+
Sbjct: 303 RLRTLEAEAGCAPLVCMVDEDAGEPSWPAEEVLSDEGIPSDAD----SPAPPAGGANRDN 358

Query: 440 WQLL 443
           W++L
Sbjct: 359 WEML 362


>gi|413941968|gb|AFW74617.1| hypothetical protein ZEAMMB73_836919 [Zea mays]
          Length = 416

 Score =  221 bits (563), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 134/263 (50%), Positives = 166/263 (63%), Gaps = 56/263 (21%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
           L  F +DFSSRI I+YRKGFD I D K+TSDV WGCM+RSSQMLVAQAL+FH LGR WRK
Sbjct: 29  LQVFLEDFSSRIWITYRKGFDAISDFKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRK 88

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
           P +K                         L++  +                         
Sbjct: 89  PPEK------------------------TLIRTNR------------------------- 99

Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
           ++A+   G ++ PM +YVVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVP
Sbjct: 100 EQADAVDGKENFPMELYVVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVP 159

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI- 332
           LVLGL+K+NPRYIP L+ TF FPQSLGI+G KPG STYI GVQ++ A+YLDPH+VQ V+ 
Sbjct: 160 LVLGLDKINPRYIPLLKETFMFPQSLGILGVKPGTSTYIAGVQDDRALYLDPHEVQMVLA 219

Query: 333 NIGKDDLEADTSTYHSDVIRHIH 355
           NI   +      T  +D I +IH
Sbjct: 220 NIKWPE------TLETDFIYNIH 236


>gi|145345840|ref|XP_001417407.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577634|gb|ABO95700.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 348

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 125/329 (37%), Positives = 182/329 (55%), Gaps = 19/329 (5%)

Query: 72  LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 131
           +LGV +    DE   +   ++    + +D+ SR  ++YR+GF+ +G +K  +D GWGC L
Sbjct: 1   MLGVTYWSKDDECNAEKY-DDARRAWERDWGSRCWMTYRRGFEALGRTKWRTDAGWGCTL 59

Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAY 190
           RS+QM+VA AL  H  GR WR+ ++   D E V+ +L +F D  ++PFSIH++ +   A+
Sbjct: 60  RSAQMMVANALSIHTRGRHWRRQVKAKEDDESVDHVLSMFIDDASAPFSIHSVCETTTAW 119

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG-DEDGERGGAPVVCID 249
           G   G W  P  MCR++ AL          G     +A++VV G +ED   GG P   ID
Sbjct: 120 GAPPGRWFEPSVMCRAFSALIEAN------GDLRNQIAVHVVGGQNEDDSAGGVPT--ID 171

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
           D          G+A    +LL VPLVLG+   +N RYI  LR    F QS+G++GG+P A
Sbjct: 172 DGELRAKSADVGKA----LLLFVPLVLGVGRNINTRYISQLRSIIAFKQSIGVIGGRPNA 227

Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
           S Y+VG  ++   YLDPH VQP  +  +     D  +Y+      +  + +DP+LA+GFY
Sbjct: 228 SLYLVGHSDDVFFYLDPHTVQPANSFAE---AVDFDSYYCSTPLQMRGELLDPTLALGFY 284

Query: 369 CRDKDDFDDFCARASKLAEESNGAPLFTV 397
           CRD DD D   A    LAE +  AP+  V
Sbjct: 285 CRDGDDLDSLFASVKALAEANATAPVLDV 313


>gi|281340990|gb|EFB16574.1| hypothetical protein PANDA_012287 [Ailuropoda melanoleuca]
          Length = 369

 Score =  217 bits (553), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 128/370 (34%), Positives = 187/370 (50%), Gaps = 40/370 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F PIG +  TS
Sbjct: 19  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 65

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 66  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 125

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 239
            Q G   G + G W GP  + +  + LA               +A+++   +    ED  
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 177

Query: 240 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
           R   G  P         D+SRHC+ F  G       A W P++LL+PL LGL  +N  Y+
Sbjct: 178 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 237

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +      AD S +
Sbjct: 238 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 297

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDFDD+C +  +L+      P+F + +     + 
Sbjct: 298 CRHPPSRMSIGELDPSIAVGFFCKTEDDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLA 357

Query: 407 HSDVLGETGG 416
             DVL  + G
Sbjct: 358 CPDVLNVSLG 367


>gi|432853687|ref|XP_004067831.1| PREDICTED: cysteine protease ATG4B-like [Oryzias latipes]
          Length = 390

 Score =  217 bits (553), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 120/333 (36%), Positives = 173/333 (51%), Gaps = 12/333 (3%)

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
           D +SR+  +YRK F PIG +  TSD GWGCMLR  QM++A+AL+   LGR WR    +  
Sbjct: 45  DVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILAEALMCRHLGRDWRWARGRRQ 104

Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
             EYV IL+ F D + S +SIH + Q G   G   G W GP          A+  +W  L
Sbjct: 105 REEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRL 164

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
           A     +  +  + +           D E  G    C++ A   C++  +  A W P++L
Sbjct: 165 AVHVAMDNTVIIEEIKRLCMPWLDIGDREEAGELNGCLEGA---CALVEEETALWKPLVL 221

Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
           L+PL LGL  +N  YI TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP
Sbjct: 222 LIPLRLGLSDINEAYIDTLKQCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQP 281

Query: 331 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 390
            +   +D    D + +       +H+  +DPS+A GF+CR +D+FDD+C R   L+ +  
Sbjct: 282 AVEPSEDGQVPDETYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDWCMRIRGLSCKRG 341

Query: 391 GAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
           G P+F +  +    +   D L  T    + D L
Sbjct: 342 GLPMFELVDSQPTHMVSVDALNLTPDFSDSDRL 374


>gi|301775535|ref|XP_002923195.1| PREDICTED: cysteine protease ATG4B-like [Ailuropoda melanoleuca]
          Length = 405

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 127/366 (34%), Positives = 185/366 (50%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F PIG +  TS
Sbjct: 34  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 80

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 81  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 140

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 239
            Q G   G + G W GP  + +  + LA               +A+++   +    ED  
Sbjct: 141 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 192

Query: 240 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
           R   G  P         D+SRHC+ F  G       A W P++LL+PL LGL  +N  Y+
Sbjct: 193 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 252

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +      AD S +
Sbjct: 253 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 312

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDFDD+C +  +L+      P+F + +     + 
Sbjct: 313 CRHPPSRMSIGELDPSIAVGFFCKTEDDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLA 372

Query: 407 HSDVLG 412
             DVL 
Sbjct: 373 CPDVLN 378


>gi|156396522|ref|XP_001637442.1| predicted protein [Nematostella vectensis]
 gi|156224554|gb|EDO45379.1| predicted protein [Nematostella vectensis]
          Length = 342

 Score =  215 bits (547), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 122/343 (35%), Positives = 178/343 (51%), Gaps = 32/343 (9%)

Query: 58  PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN----GLAEFNQDFSSRILISYRKGF 113
           P +T  +   S IWLLG C+     E   + +        L EF++ F+S I ++YR+ F
Sbjct: 12  PLKTNFNED-SPIWLLGRCYHAKNYEYTSEQSKQQCQILSLEEFHRHFTSLIWLTYRRSF 70

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---KPLQKPFDREYVEILHLF 170
             +  S +TSD GWGCMLRS QM++A  L+FH L + WR   +   +  +  Y  IL  F
Sbjct: 71  VQLNGSNLTSDCGWGCMLRSGQMMLASGLIFHFLKKDWRISGRCHSREQEHYYRVILQFF 130

Query: 171 GDS---ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
           GD    E SPFS+H L+  G+  G  AG W GP ++    E              +++  
Sbjct: 131 GDQDDEERSPFSLHRLVTLGQHTGKQAGDWYGPASVAHILE--------------KAMIS 176

Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVN 282
           A + +  D +        V ID+  R C+     Q D     W P+++LVP+ LG E +N
Sbjct: 177 ATHPLLHDINIYVAQDCTVYIDEVKRVCTHCRTHQRDCSSGKWRPVIILVPMRLGGEALN 236

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
           P YIP ++  FT  Q +GI+GG+P  S Y VG Q+E  I+LDPH  QPV++  ++     
Sbjct: 237 PIYIPCVKSLFTLDQCIGIIGGRPKHSLYFVGFQDEKMIHLDPHYCQPVVDTTQEKFP-- 294

Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           T ++H    R      +DPS  IGFYC   +DF+ FC  AS++
Sbjct: 295 TESFHCPNPRKTSFKKMDPSCTIGFYCSSHEDFESFCQHASEV 337


>gi|355669955|gb|AER94692.1| ATG4 autophagy related 4-like protein B [Mustela putorius furo]
          Length = 390

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 125/359 (34%), Positives = 179/359 (49%), Gaps = 26/359 (7%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 19  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 65

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 66  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQSDSYFNVLNAFIDRKDSYYSIHQI 125

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
            Q G   G + G W GP  + +  + LA      + L          V+       RG  
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 184

Query: 244 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
           P         D+SRHC+ F  G       A W P++LL+PL LGL  +N  Y+ TL+  F
Sbjct: 185 PCAGATALPTDSSRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 244

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +       
Sbjct: 245 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCRHPPSR 304

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           + +  +DPS+A+GF+C+ +DDFDD+C R  +L+      P+F + +     +   DVL 
Sbjct: 305 MGISELDPSIAVGFFCKTEDDFDDWCQRVRQLSLLGGALPMFELVEQQPSHLACPDVLN 363


>gi|410969807|ref|XP_003991383.1| PREDICTED: cysteine protease ATG4B [Felis catus]
          Length = 445

 Score =  211 bits (538), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 124/358 (34%), Positives = 177/358 (49%), Gaps = 26/358 (7%)

Query: 66  STSDIWLLGVCHKIA--QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I+  +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 74  TSEPVWILGRKYSISTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 120

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 121 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 180

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
            Q G   G + G W GP  + +  + LA      + L          V+       R G 
Sbjct: 181 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMEDIRRLCRAGL 239

Query: 244 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
           P         D  RHC+ F  G       A W P++LL+PL LGL  +N  Y+ TL+  F
Sbjct: 240 PCAGAAALPADPGRHCNGFPAGAEVSNRLAPWRPLVLLIPLRLGLTDINEAYVETLKHCF 299

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 300 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFADSCFIPDESFHCQHPPSR 359

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
           + +  +DPS+A+GF+C+ ++DFDD+C R  KL+      P+F + +     +   DVL
Sbjct: 360 MGVRELDPSIAVGFFCQTEEDFDDWCQRVRKLSLLGGALPMFELVEQQPSHLACPDVL 417


>gi|443684303|gb|ELT88258.1| hypothetical protein CAPTEDRAFT_225251 [Capitella teleta]
          Length = 410

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 121/352 (34%), Positives = 183/352 (51%), Gaps = 34/352 (9%)

Query: 66  STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
           + S +W+LG  + +  D           LAE  +D  SR+ ++YRKGFDPIG S  TSD 
Sbjct: 30  TESPVWILGKQYSVLYD-----------LAELKKDVKSRLWLTYRKGFDPIGGSGPTSDQ 78

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
           GWGCMLR  QM++AQ+L+   LGR WR    K +D +Y EIL +F D  ++ +S+  +  
Sbjct: 79  GWGCMLRCGQMMLAQSLICRHLGRDWRWTKDK-YDPKYFEILRMFQDKRSAKYSLQVIAS 137

Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD------EDGE 239
            G + G A G W GP  + +    L  C   E       + +   V+  D         +
Sbjct: 138 MGTSEGKAIGEWFGPNTISQVLRKL--CVSDEWSNLVVHVALDNTVIIDDVFCLCKSSKK 195

Query: 240 RGGAPVVCIDDASRHCSVFS-----------KGQAD-WTPILLLVPLVLGLEKVNPRYIP 287
               P+  +  A     +F+            G+ D W P+LL+VPL LGL ++NP YIP
Sbjct: 196 ESNEPIPGVHAACASALLFNGHDPTAEGHDPSGEDDSWRPLLLIVPLRLGLSEINPVYIP 255

Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 347
            L+   TF QS+GI+GGKP  + + +G  E+  +Y+DPH  QP +++ +   E+D S YH
Sbjct: 256 FLKTCLTFKQSVGIIGGKPNHAHWFIGFLEDELVYMDPHTTQPFVDVTQPG-ESDAS-YH 313

Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
                 + +  +DPS+A+GF+C+ + DF+D C    K        P+F + Q
Sbjct: 314 CSYSCRMPVSYLDPSVAVGFFCQTEADFEDLCQCIRKYILHGQKTPMFELHQ 365


>gi|27883848|ref|NP_777363.1| cysteine protease ATG4B [Mus musculus]
 gi|26324650|dbj|BAC26079.1| unnamed protein product [Mus musculus]
 gi|26327423|dbj|BAC27455.1| unnamed protein product [Mus musculus]
 gi|26344632|dbj|BAC35965.1| unnamed protein product [Mus musculus]
 gi|27763983|emb|CAD43220.1| autophagin-1 [Mus musculus]
          Length = 393

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 185/366 (50%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
               A + C+       D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRANLPCVGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ ++DF+D+C +  KL++     P+F + +     + 
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CQDVLN 366


>gi|73994337|ref|XP_851977.1| PREDICTED: cysteine protease ATG4B [Canis lupus familiaris]
          Length = 394

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 123/359 (34%), Positives = 177/359 (49%), Gaps = 26/359 (7%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 23  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 70  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 129

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
            Q G   G + G W GP  + +  + LA      + L          V+       RG  
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 188

Query: 244 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
           P         D+SRHC+ F  G       A W P++LL+PL LGL  +N  Y+ TL+  F
Sbjct: 189 PCAGAAALPADSSRHCNGFPAGAEVTNRLAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 248

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 249 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFIPDESFHCQHPPSR 308

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           + +  +DPS+A+GF+C+ + DFDD+C +  +L+      P+F + +     +   DVL 
Sbjct: 309 MSIGELDPSIAVGFFCKTEGDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLACPDVLN 367


>gi|440798079|gb|ELR19150.1| cysteine protease, putative [Acanthamoeba castellanii str. Neff]
          Length = 434

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 127/333 (38%), Positives = 175/333 (52%), Gaps = 33/333 (9%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
           A F   F S +  +YR  F  +G    TSD+GWGCMLR+ QM++AQ L  H LG  WR+ 
Sbjct: 108 ASFLTHFRSVVWCTYRAAFPRLGSDSYTSDMGWGCMLRTGQMVLAQTLTRHLLGTEWRRQ 167

Query: 155 LQK--PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
             +  P    Y +++  F D    PFS+H +  AG  YG   G W GP  M +  E L +
Sbjct: 168 SDRSSPL---YAKMVQWFADDPKQPFSLHRIAHAGLKYGKNVGEWFGPSTMAQVLEELLK 224

Query: 213 CQRAETGLG---CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-WTPI 268
            + + +GL    CQ     +Y+            P+   DD         +GQ   W P+
Sbjct: 225 -EFSPSGLRAYVCQD--GCLYLDQLRRTATAAHWPLDEDDD---------EGQGKSWAPM 272

Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           L+++PL LGL+++N  Y P L+ TF  PQS+GI GGKP AS Y VG Q++   YLDPH V
Sbjct: 273 LIMLPLRLGLDQLNEDYAPVLKETFRIPQSVGISGGKPRASLYFVGNQDDYVFYLDPHTV 332

Query: 329 QPVINIGK-DDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           QP     +  D+ A      T+H      + +  IDPSL + FYCR+++DFDDFCARA +
Sbjct: 333 QPAPRFPEVGDVPASEDVYDTFHCSAPLRLPIRDIDPSLCLAFYCRNREDFDDFCARAIQ 392

Query: 385 LAEESNGAPLFTVTQ------THKKPVNHSDVL 411
           L+E     P+FTV +         KP  HS+ L
Sbjct: 393 LSE--GPMPIFTVAERMPDYLVRPKPPKHSEKL 423


>gi|148707985|gb|EDL39932.1| autophagy-related 4B (yeast), isoform CRA_a [Mus musculus]
          Length = 390

 Score =  207 bits (528), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 19  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 65

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 66  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 125

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 177

Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
               A + C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 178 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 237

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 238 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 297

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ ++DF+D+C +  KL++     P+F + +     + 
Sbjct: 298 CQHPPSRMGIGELDPSIAVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 357

Query: 407 HSDVLG 412
             DVL 
Sbjct: 358 CQDVLN 363


>gi|20071131|gb|AAH27184.1| Autophagy-related 4B (yeast) [Mus musculus]
 gi|26353914|dbj|BAC40587.1| unnamed protein product [Mus musculus]
 gi|74188242|dbj|BAE25791.1| unnamed protein product [Mus musculus]
          Length = 393

 Score =  207 bits (528), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
               A + C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ ++DF+D+C +  KL++     P+F + +     + 
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CQDVLN 366


>gi|348513452|ref|XP_003444256.1| PREDICTED: cysteine protease ATG4B-like [Oreochromis niloticus]
          Length = 391

 Score =  207 bits (527), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 118/345 (34%), Positives = 178/345 (51%), Gaps = 16/345 (4%)

Query: 92  NGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
           N L E ++   D +SR+  +YRK F PIG +  TSD GWGCMLR  QM++ +AL+   LG
Sbjct: 34  NALTEKDEILSDVTSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVCRHLG 93

Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
           R WR    +    EY+ +L+ F D + S +SIH + Q G   G   G W GP  + +  +
Sbjct: 94  RDWRWAKGQKQRDEYISLLNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLK 153

Query: 209 ALARCQRAETGLGCQSLPMAIYVVS--------GDEDGERGGAPVV--CIDDASRHCSVF 258
            LA        +   ++   + +           D  GE  G   +  C++ A   C++ 
Sbjct: 154 KLAVFDTWSKVVVHVAMDNTVVIEEIKRLCMPWLDACGELEGVGELNGCLEGA---CAMA 210

Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
            +  A W P++LL+PL LGL  +N  YI TL+  F  PQSLG++GGKP ++ Y +G   E
Sbjct: 211 EEETALWRPLVLLIPLRLGLSDINDAYIETLKQCFMLPQSLGVIGGKPNSAHYFIGYVGE 270

Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
             IYLDPH  QP +   +D    D + +       +H+  +DPS+A GF+CR +D+FDD+
Sbjct: 271 ELIYLDPHTTQPAVEPSEDSQVPDETYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDW 330

Query: 379 CARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
           C R  +L+      P+F +  +    +   D L  T    + D L
Sbjct: 331 CMRIRRLSCNRGTLPMFELVDSQPSHMVSVDTLNLTPDFSDSDRL 375


>gi|61211813|sp|Q8BGE6.2|ATG4B_MOUSE RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
           cysteine endopeptidase; AltName: Full=Autophagin-1;
           AltName: Full=Autophagy-related cysteine endopeptidase
           1; AltName: Full=Autophagy-related protein 4 homolog B
          Length = 393

 Score =  207 bits (527), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
               A + C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ ++DF+D+C +  KL++     P+F + +     + 
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CQDVLN 366


>gi|149711769|ref|XP_001497815.1| PREDICTED: cysteine protease ATG4B [Equus caballus]
          Length = 393

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 126/372 (33%), Positives = 181/372 (48%), Gaps = 52/372 (13%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 226
            Q G   G + G W GP          A+  +W ALA     +  +  +        SLP
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 188

Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEK 280
            A            G A      D+ RHC+ F  G       A W P++LL+PL LGL  
Sbjct: 189 CA------------GAAAFPA--DSDRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTD 234

Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
           +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +        
Sbjct: 235 INEAYVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFI 294

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
            D S +       + +  +DPS+A+GF+C+ +DDF+D+C + + L+      P+F + + 
Sbjct: 295 PDESFHCQHPPSRMSIGELDPSIAVGFFCKTEDDFNDWCQQVTMLSLLGGALPMFELVEQ 354

Query: 401 HKKPVNHSDVLG 412
               +   DVL 
Sbjct: 355 QPSHLACPDVLN 366


>gi|427787309|gb|JAA59106.1| Putative peptidase family c54 [Rhipicephalus pulchellus]
          Length = 517

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 119/313 (38%), Positives = 172/313 (54%), Gaps = 25/313 (7%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---K 153
           F +DFSSR+  +YR+ F PI  + ITSD GWGCMLRSSQM++AQA++ H LGR WR    
Sbjct: 181 FLEDFSSRLWFTYRREFPPIPGTDITSDCGWGCMLRSSQMMLAQAVVTHVLGRQWRYRRN 240

Query: 154 PLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EAL 210
              +  D  + +++ LFGD  +  SPFS+H L+Q G   G  AG W GP +      EAL
Sbjct: 241 NQTEASDYVHRQVVRLFGDRTASASPFSLHKLVQMGHESGKQAGDWYGPSSAAYILKEAL 300

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS-VFSKGQADWTPIL 269
               + E  L    L + IYV              + ++D    C    S G   W  ++
Sbjct: 301 EGACQTEQLL----LDLRIYVAQD---------CTIYLEDVRALCRGTRSNGAPLWRSVI 347

Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
           +LVP+ LG E++NP YIP ++   + P  +G++GG+P  S Y +G Q E  IYLDPH VQ
Sbjct: 348 ILVPVRLGGEQLNPTYIPCVKGMLSHPNCIGVIGGRPRHSLYFLGWQGEKVIYLDPHYVQ 407

Query: 330 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA--- 386
             +++G  D   D  +YH    R +    +DPS  +GFYC+ +D+F+ F     +LA   
Sbjct: 408 EAVDVGPQDFPLD--SYHCSWPRKMSFYKMDPSCTMGFYCKTEDEFEHFVKDVKQLAVPT 465

Query: 387 EESNGAPLFTVTQ 399
           E  +  P+F V++
Sbjct: 466 ESRHEYPVFLVSE 478


>gi|61211768|sp|Q6DG88.2|ATG4B_DANRE RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B
          Length = 394

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 119/338 (35%), Positives = 176/338 (52%), Gaps = 18/338 (5%)

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
           D +SR+  +YRK F PIG +  TSD GWGCMLR  QM++ +AL+   LGR W+    +  
Sbjct: 45  DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104

Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
             EYV IL+ F D + S +SIH + Q G   G + G W GP          A+  SW  L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 265
           A     +  +  + +     +     D +RG       P     D    C++  +  A W
Sbjct: 165 AVHVAMDNTVVIEEIKR---LCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETALW 221

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
            P++LL+PL LGL  +N  YI  L+  F  PQSLG++GGKP ++ Y +G   +  IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281

Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           H  QP ++  +D    D S +       +H+  +DPS+A GF+C+ +DDFDD+CA+  K+
Sbjct: 282 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 341

Query: 386 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
           +    G P+F +  +    +  +DVL  T    + D L
Sbjct: 342 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 378


>gi|50369556|gb|AAH76463.1| Atg4b protein, partial [Danio rerio]
          Length = 393

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 119/338 (35%), Positives = 176/338 (52%), Gaps = 18/338 (5%)

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
           D +SR+  +YRK F PIG +  TSD GWGCMLR  QM++ +AL+   LGR W+    +  
Sbjct: 44  DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 103

Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
             EYV IL+ F D + S +SIH + Q G   G + G W GP          A+  SW  L
Sbjct: 104 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 163

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 265
           A     +  +  + +     +     D +RG       P     D    C++  +  A W
Sbjct: 164 AVHVAMDNTVVIEEIKR---LCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETALW 220

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
            P++LL+PL LGL  +N  YI  L+  F  PQSLG++GGKP ++ Y +G   +  IYLDP
Sbjct: 221 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 280

Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           H  QP ++  +D    D S +       +H+  +DPS+A GF+C+ +DDFDD+CA+  K+
Sbjct: 281 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 340

Query: 386 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
           +    G P+F +  +    +  +DVL  T    + D L
Sbjct: 341 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 377


>gi|148237097|ref|NP_001082821.1| cysteine protease ATG4B [Danio rerio]
 gi|141795460|gb|AAI34887.1| Atg4b protein [Danio rerio]
          Length = 394

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 119/338 (35%), Positives = 176/338 (52%), Gaps = 18/338 (5%)

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
           D +SR+  +YRK F PIG +  TSD GWGCMLR  QM++ +AL+   LGR W+    +  
Sbjct: 45  DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104

Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 210
             EYV IL+ F D + S +SIH + Q G   G + G W GP          A+  SW  L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 265
           A     +  +  + +     +     D +RG       P     D    C++  +  A W
Sbjct: 165 AVHVAMDNTVVIEEIKR---LCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETALW 221

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
            P++LL+PL LGL  +N  YI  L+  F  PQSLG++GGKP ++ Y +G   +  IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281

Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           H  QP ++  +D    D S +       +H+  +DPS+A GF+C+ +DDFDD+CA+  K+
Sbjct: 282 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 341

Query: 386 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
           +    G P+F +  +    +  +DVL  T    + D L
Sbjct: 342 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 378


>gi|397483835|ref|XP_003813096.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pan paniscus]
          Length = 405

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 127/385 (32%), Positives = 190/385 (49%), Gaps = 42/385 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 407 HSDVLGETGGVPEDDSLGVMSMNDA 431
             DVL  + G  E   + V S+ D+
Sbjct: 361 CPDVLNLSLG--ESCQVQVGSLGDS 383


>gi|119591684|gb|EAW71278.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
          Length = 415

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 407 HSDVLGETGG 416
             DVL  + G
Sbjct: 361 CPDVLNLSLG 370


>gi|298231123|ref|NP_001177212.1| cysteine protease ATG4B [Sus scrofa]
 gi|296874484|gb|ADH81747.1| autophagy related 4-like protein B [Sus scrofa]
          Length = 393

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 122/364 (33%), Positives = 176/364 (48%), Gaps = 36/364 (9%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQALL   LGR WR    +     Y  +LH F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALLCRHLGRGWRWTQWERQPDSYFSVLHAFMDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYVVSG 234
            Q G   G + G W GP  + +         +W ALA        +    +   I  +  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSALA----VHVAMDNTVVMEEIRRLCR 184

Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPT 288
                 G A      D+ RHC+ F            W P++LL+PL LGL  +N  Y  T
Sbjct: 185 SSLPRAGAAAFPA--DSDRHCNGFPAEAEVGPRPVPWRPLVLLIPLRLGLTDINAAYTET 242

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           L+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +    L  D S +  
Sbjct: 243 LKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVQVTDSCLIPDESFHCQ 302

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS 408
                + +  +DPS+A+GF+C+ ++DF+D+C +  KL+      P+F + +     +   
Sbjct: 303 HPPHRMSIAELDPSIAVGFFCQTEEDFNDWCQQVRKLSLLGGALPMFELVEQQPSHLACP 362

Query: 409 DVLG 412
           DVL 
Sbjct: 363 DVLN 366


>gi|410036442|ref|XP_003950065.1| PREDICTED: cysteine protease ATG4B [Pan troglodytes]
          Length = 521

 Score =  204 bits (520), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 127/385 (32%), Positives = 189/385 (49%), Gaps = 42/385 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     + 
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476

Query: 407 HSDVLGETGGVPEDDSLGVMSMNDA 431
             DVL  + G  E   + V S+ D+
Sbjct: 477 CPDVLNLSLG--ESCQVQVGSLGDS 499


>gi|194381088|dbj|BAG64112.1| unnamed protein product [Homo sapiens]
          Length = 510

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 139 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 185

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 186 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 245

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 246 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 297

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 298 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 357

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 358 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 417

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 418 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 477

Query: 407 HSDVL 411
             DVL
Sbjct: 478 CPDVL 482


>gi|34531319|dbj|BAC86110.1| unnamed protein product [Homo sapiens]
          Length = 468

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 328

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 448

Query: 407 HSDVLGETGG 416
             DVL  + G
Sbjct: 449 CPDVLNLSLG 458


>gi|397483833|ref|XP_003813095.1| PREDICTED: cysteine protease ATG4B isoform 2 [Pan paniscus]
          Length = 468

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 448

Query: 407 HSDVLGETGG 416
             DVL  + G
Sbjct: 449 CPDVLNLSLG 458


>gi|397483831|ref|XP_003813094.1| PREDICTED: cysteine protease ATG4B isoform 1 [Pan paniscus]
          Length = 481

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 448

Query: 407 HSDVLG 412
             DVL 
Sbjct: 449 CPDVLN 454


>gi|355565356|gb|EHH21845.1| hypothetical protein EGK_04999, partial [Macaca mulatta]
          Length = 393

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CPDVLN 366


>gi|90077212|dbj|BAE88286.1| unnamed protein product [Macaca fascicularis]
          Length = 393

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CPDVLN 366


>gi|30410798|ref|NP_847896.1| cysteine protease ATG4B isoform b [Homo sapiens]
          Length = 380

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360

Query: 407 HSDVLGETGG 416
             DVL  + G
Sbjct: 361 CPDVLNLSLG 370


>gi|78101773|pdb|2CY7|A Chain A, The Crystal Structure Of Human Atg4b
          Length = 396

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 25  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 72  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 363

Query: 407 HSDVLG 412
             DVL 
Sbjct: 364 CPDVLN 369


>gi|380808290|gb|AFE76020.1| cysteine protease ATG4B isoform a [Macaca mulatta]
 gi|383416899|gb|AFH31663.1| cysteine protease ATG4B isoform a [Macaca mulatta]
 gi|384941198|gb|AFI34204.1| cysteine protease ATG4B isoform a [Macaca mulatta]
          Length = 393

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CPDVLN 366


>gi|71891691|dbj|BAA76787.2| KIAA0943 protein [Homo sapiens]
          Length = 396

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 25  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 72  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 363

Query: 407 HSDVLG 412
             DVL 
Sbjct: 364 CPDVLN 369


>gi|402889930|ref|XP_003908250.1| PREDICTED: cysteine protease ATG4B [Papio anubis]
          Length = 508

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 137 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 183

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 184 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 243

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 244 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 295

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 296 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 355

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 356 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 415

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 416 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 475

Query: 407 HSDVL 411
             DVL
Sbjct: 476 CPDVL 480


>gi|5262636|emb|CAB45756.1| hypothetical protein [Homo sapiens]
 gi|12653857|gb|AAH00719.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Homo sapiens]
 gi|27763981|emb|CAD43219.1| autophagin-1 [Homo sapiens]
 gi|117646318|emb|CAL38626.1| hypothetical protein [synthetic construct]
 gi|119591687|gb|EAW71281.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_d
           [Homo sapiens]
 gi|123981932|gb|ABM82795.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [synthetic
           construct]
 gi|168273130|dbj|BAG10404.1| ATG4 autophagy related 4 homolog B [synthetic construct]
          Length = 393

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CPDVLN 366


>gi|88192732|pdb|2D1I|A Chain A, Structure Of Human Atg4b
 gi|88192733|pdb|2D1I|B Chain B, Structure Of Human Atg4b
          Length = 398

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 27  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 73

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 74  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 133

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 134 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 185

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 186 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 245

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 246 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 305

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 306 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 365

Query: 407 HSDVLG 412
             DVL 
Sbjct: 366 CPDVLN 371


>gi|66773074|ref|NP_001019605.1| cysteine protease ATG4A [Danio rerio]
 gi|66267494|gb|AAH95617.1| Zgc:111958 [Danio rerio]
          Length = 375

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 109/337 (32%), Positives = 171/337 (50%), Gaps = 33/337 (9%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG C+ +   ++           E   D  SR+  +YRK F PIG +  +SD GWGC
Sbjct: 26  VWILGACYNVKTKKS-----------ELLSDVRSRLWFTYRKKFSPIGGTGPSSDAGWGC 74

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR WR   +K   +EY  IL  F D + S +SIH + Q G  
Sbjct: 75  MLRCGQMILAQALICSHLGRDWRWDPEKHQPKEYQRILDCFLDKKDSCYSIHQMAQMGVG 134

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +++YV   +          V I+
Sbjct: 135 EGKSVGEWYGPNTVAQVLKKLALFDDWNS--------LSVYVSMDN---------TVVIE 177

Query: 250 DASRHC-----SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
           D  + C      + S+   DW P+LL++PL +G+  +NP YI  L+  F  PQS G++GG
Sbjct: 178 DIKKLCVRADLQLQSQQPLDWRPLLLVIPLRMGINSINPVYIQALKECFKMPQSCGVLGG 237

Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
           KP  + Y +G  ++  IYLDPH  Q  ++        D S +       + + S+DPS+A
Sbjct: 238 KPNLAYYFIGFIDDELIYLDPHTTQQAVDTESGSAVDDQSFHCQRTPHRMKITSLDPSVA 297

Query: 365 IGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +GF+C+ ++DFD +C    +   +     +F + + H
Sbjct: 298 LGFFCKSEEDFDSWCDLVQQELLKKRNLRMFELVEKH 334


>gi|47132611|ref|NP_037457.3| cysteine protease ATG4B isoform a [Homo sapiens]
 gi|296434400|sp|Q9Y4P1.2|ATG4B_HUMAN RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
           cysteine endopeptidase; AltName: Full=Autophagin-1;
           AltName: Full=Autophagy-related cysteine endopeptidase
           1; AltName: Full=Autophagy-related protein 4 homolog B;
           Short=hAPG4B
 gi|62822370|gb|AAY14919.1| unknown [Homo sapiens]
          Length = 393

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CPDVLN 366


>gi|354474222|ref|XP_003499330.1| PREDICTED: cysteine protease ATG4B-like [Cricetulus griseus]
          Length = 479

 Score =  204 bits (518), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 120/365 (32%), Positives = 181/365 (49%), Gaps = 40/365 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 108 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 154

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 155 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 214

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 215 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 266

Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYI 286
               A + C        D+ RHC+ F  G         W P++LL+PL LGL  +N  Y+
Sbjct: 267 RLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAYV 326

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 327 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 386

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C  ++DF+D+C +  KL+      P+F + +     + 
Sbjct: 387 CQHPPCRMGIGELDPSIAVGFFCETEEDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLV 446

Query: 407 HSDVL 411
             DVL
Sbjct: 447 CQDVL 451


>gi|432107261|gb|ELK32675.1| Cysteine protease ATG4B [Myotis davidii]
          Length = 394

 Score =  204 bits (518), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 123/367 (33%), Positives = 179/367 (48%), Gaps = 42/367 (11%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 23  TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQALL   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 70  DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129

Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 226
            Q G   G + G W GP          A+  +W ALA     +  +  +        SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAIFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 189

Query: 227 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
            A       D +G   G P           +  +   + W P++LL+PL LGL  +N  Y
Sbjct: 190 CAEATAFPADSEGHCNGLPA---------GAEVTNRPSLWRPLVLLIPLRLGLTDINEAY 240

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           + TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +      L  D S 
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSFLIPDESF 300

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 405
           +       + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +
Sbjct: 301 HCQHPPSRMSIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHL 360

Query: 406 NHSDVLG 412
              DVL 
Sbjct: 361 ACPDVLN 367


>gi|332815902|ref|XP_001162556.2| PREDICTED: cysteine protease ATG4B isoform 1 [Pan troglodytes]
          Length = 496

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 182/370 (49%), Gaps = 40/370 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     + 
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476

Query: 407 HSDVLGETGG 416
             DVL  + G
Sbjct: 477 CPDVLNLSLG 486


>gi|410036440|ref|XP_003309622.2| PREDICTED: cysteine protease ATG4B isoform 5 [Pan troglodytes]
          Length = 509

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 180/365 (49%), Gaps = 40/365 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     + 
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476

Query: 407 HSDVL 411
             DVL
Sbjct: 477 CPDVL 481


>gi|328707620|ref|XP_001947296.2| PREDICTED: cysteine protease ATG4B-like isoform 1 [Acyrthosiphon
           pisum]
 gi|328707622|ref|XP_003243448.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Acyrthosiphon
           pisum]
 gi|328707624|ref|XP_003243449.1| PREDICTED: cysteine protease ATG4B-like isoform 3 [Acyrthosiphon
           pisum]
 gi|328707626|ref|XP_003243450.1| PREDICTED: cysteine protease ATG4B-like isoform 4 [Acyrthosiphon
           pisum]
          Length = 402

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 110/344 (31%), Positives = 178/344 (51%), Gaps = 38/344 (11%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I  +   +W+LG  +    D           L +   D  SR+  +YRKGF  IG++  T
Sbjct: 40  IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           SD GWGCMLR  QM++ QAL+F  LGR WR    K  D +Y++IL +F D  ++P+SIH 
Sbjct: 89  SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147

Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
           +   G ++G   G W GP  + +  + LA             L   ++ V+ D       
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLA---------TMDELSSLVFHVALDN------ 192

Query: 243 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
              + I++  + C+V  +  +    W P++L++PL LG+  +NP Y+  +++ FTFPQSL
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKMCFTFPQSL 250

Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVIRHIHL 356
           G++GG+P  + Y +G      I+LDPH  Q +  +   D+E +     +YH   I  + +
Sbjct: 251 GVIGGRPNHALYFIGFVGNDVIFLDPHTTQQIGMLPNKDIETEHKIDHSYHCQQINRLPI 310

Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARAS---KLAEESNGAPLFTV 397
            ++DPSLA  F C+ ++DF+  C         +++S   PL T+
Sbjct: 311 LNMDPSLAACFMCQTENDFNALCHELKVHLVQSDQSPSQPLITI 354


>gi|410206608|gb|JAA00523.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
 gi|410247746|gb|JAA11840.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
 gi|410295834|gb|JAA26517.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
 gi|410352839|gb|JAA43023.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
          Length = 393

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 180/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CPDVLN 366


>gi|14042685|dbj|BAB55353.1| unnamed protein product [Homo sapiens]
          Length = 380

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 122/370 (32%), Positives = 183/370 (49%), Gaps = 40/370 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  +  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCYMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360

Query: 407 HSDVLGETGG 416
             DVL  + G
Sbjct: 361 CPDVLNLSLG 370


>gi|344239232|gb|EGV95335.1| Cysteine protease ATG4B [Cricetulus griseus]
          Length = 394

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 23  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 69

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 70  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 129

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 181

Query: 239 ERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYI 286
               A + C        D+ RHC+ F  G         W P++LL+PL LGL  +N  Y+
Sbjct: 182 RLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAYV 241

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 242 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 301

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C  ++DF+D+C +  KL+      P+F + +     + 
Sbjct: 302 CQHPPCRMGIGELDPSIAVGFFCETEEDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLV 361

Query: 407 HSDVLG 412
             DVL 
Sbjct: 362 CQDVLN 367


>gi|343961553|dbj|BAK62366.1| cysteine protease ATG4B [Pan troglodytes]
          Length = 393

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 180/366 (49%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGCALPMFELVEQQPSHLA 360

Query: 407 HSDVLG 412
             DVL 
Sbjct: 361 CPDVLN 366


>gi|291415044|ref|XP_002723769.1| PREDICTED: APG4 autophagy 4 homolog B [Oryctolagus cuniculus]
          Length = 473

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 116/355 (32%), Positives = 175/355 (49%), Gaps = 21/355 (5%)

Query: 66  STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
           ++  +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD 
Sbjct: 103 TSEPVWILGRKYSLLTEKN-----------EILSDVASRLWFTYRKNFPAIGGTGPTSDT 151

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
           GWGCMLR  QM+ AQAL+   LGR WR   QK     Y+ +LH F D + S +SIH + Q
Sbjct: 152 GWGCMLRCGQMIFAQALVCRHLGRDWRWTQQKRQPDSYLSVLHAFMDRKDSYYSIHQIAQ 211

Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
            G   G + G W GP  + +  + LA      + L          V+       R   P 
Sbjct: 212 MGVGEGKSVGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRSSHPC 270

Query: 246 VCIDDASR----HCSVFS-----KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
                       HC+ F        ++ W P++LL+PL LGL  +N  Y+ TL+L F  P
Sbjct: 271 AGAATPPAGADWHCNGFPASTEVTNRSPWRPLVLLIPLRLGLTDINEAYVETLKLCFRMP 330

Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHL 356
           QSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +       + +
Sbjct: 331 QSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDLCFIPDESFHCQHPPCRMSI 390

Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
             +DPS+A+GF+C+ ++DF+D+C +  KL+      P+F + +     +   DVL
Sbjct: 391 GELDPSIAVGFFCKTEEDFNDWCQQVRKLSLLGGALPMFELVEQQPPHLACPDVL 445


>gi|344299096|ref|XP_003421224.1| PREDICTED: cysteine protease ATG4B [Loxodonta africana]
          Length = 420

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 119/366 (32%), Positives = 183/366 (50%), Gaps = 40/366 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 49  TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 95

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQALL   LGR WR   ++     Y  +LH F D + S +SIH +
Sbjct: 96  DTGWGCMLRCGQMIFAQALLCRHLGRDWRWAQRRRQPDSYFSVLHAFIDRKDSHYSIHQI 155

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 239
            Q G   G + G W GP  + +  + LA      +        +A+++   +    E+  
Sbjct: 156 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 207

Query: 240 R-------GGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
           R             C  D S+HC+    G       + W P++LL+PL LGL  +N  Y+
Sbjct: 208 RLCKSSTPCAGAAACPADPSQHCNGLPAGAEAAGRPSTWRPLVLLIPLRLGLTDINEAYV 267

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D + +
Sbjct: 268 ETLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELAGGFSIPDETFH 327

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  +++  +DPS+A+GF+C+ ++DF+D+C +  KL+  S   P+F + +     + 
Sbjct: 328 CQHPPCRMNIAELDPSIAVGFFCKTEEDFNDWCQQVKKLSLLSGALPMFELVEQQPSHLA 387

Query: 407 HSDVLG 412
             DVL 
Sbjct: 388 CPDVLN 393


>gi|307174864|gb|EFN65142.1| Cysteine protease ATG4D [Camponotus floridanus]
          Length = 477

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 119/373 (31%), Positives = 175/373 (46%), Gaps = 46/373 (12%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
           S  S +WLLG C+    ++ L  A+                      N + EF +DF SR
Sbjct: 86  SKESPVWLLGQCYLKKSEDPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFVSR 145

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF----- 159
           I ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR WR   ++P      
Sbjct: 146 IWLTYRREFQILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQPIETLQQ 205

Query: 160 ---DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
              DR +  I+  FGD   SPFSIH L+  G + G  AG W GP ++         C   
Sbjct: 206 RLDDRNHRMIIKWFGDQSESPFSIHRLVLLGASAGKRAGDWYGPSSVAHLLSQAVECASK 265

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
           ++      L  A+YV              V + D    C         W  ++LLVPL L
Sbjct: 266 QSNSNFDHL--AVYVAQD---------CAVYLQDVENICRT---PDGKWKALVLLVPLRL 311

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G +K+NP Y P L    T    +G++GG+P  S Y +G Q++  I+LDPH  Q  +++ K
Sbjct: 312 GADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDVWK 371

Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA--EESNGAPL 394
           +D     +++H    R + L  +DPS  +GFY  +K+   DF     +     +    P+
Sbjct: 372 NDFSL--TSFHCTSPRKMLLSKMDPSCCVGFYFPNKEALTDFMETIQRFVIPNQKTNYPM 429

Query: 395 FTVTQTHKKPVNH 407
           F   +   K + H
Sbjct: 430 FLFCEGSGKDLQH 442


>gi|348577273|ref|XP_003474409.1| PREDICTED: cysteine protease ATG4B [Cavia porcellus]
          Length = 412

 Score =  201 bits (512), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 119/361 (32%), Positives = 178/361 (49%), Gaps = 28/361 (7%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +D+ L D A             SR+  +YR+ F  IG +  TS
Sbjct: 39  TSEPVWILGRKYSIFTEKDDILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 85

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 86  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQI 145

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
            Q G   G + G W GP  + +  + LA      + L          V+       R G 
Sbjct: 146 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRTGL 204

Query: 244 P----VVCIDDASRHCSVF--------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
           P         DA RHC+ F         +  + W P++LL+PL LGL  +N  Y+ TL+ 
Sbjct: 205 PCAGAAALPTDADRHCNGFPTQTEVTNRQSPSLWRPLVLLIPLRLGLTDINEAYVETLKH 264

Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
            F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D + +     
Sbjct: 265 CFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDGCFIPDETFHCQHPP 324

Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
             + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL
Sbjct: 325 CRMGIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVL 384

Query: 412 G 412
            
Sbjct: 385 N 385


>gi|291226947|ref|XP_002733451.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
           kowalevskii]
          Length = 356

 Score =  201 bits (512), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 114/311 (36%), Positives = 163/311 (52%), Gaps = 13/311 (4%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  + + +D +           E   D  SRI I+YRK F  IG +  TSD GWGC
Sbjct: 26  VWILGKAYHLIRDRS-----------ELLADIKSRIWITYRKNFSAIGGTGPTSDNGWGC 74

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQALL   LGR WR   ++  +  Y +IL LF D + S +SIH + Q G  
Sbjct: 75  MLRCGQMILAQALLCKHLGREWRWESREHQNETYCKILKLFLDRKDSCYSIHQIAQMGVG 134

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +    L       +     S+   I VV       R      C  
Sbjct: 135 EGKSIGQWFGPNTVAQVLRKLTLFDDWSSIAVHISMDNTI-VVEDIRKLCRTPLFTECAS 193

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
             +   S+ + G   W P++L +PL LGL ++NP Y+  L+  FT  QSLG++GGKP  +
Sbjct: 194 PKAASASLENGGTTYWKPLVLFIPLRLGLTEINPLYLDVLKKCFTLKQSLGMIGGKPNHA 253

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
            Y +G   ++ +YLDPH  QPV++I K     D  TYH      +++  +DPS+A+GF+C
Sbjct: 254 HYFIGFYGKTLVYLDPHTTQPVVDINKWASIPD-DTYHCKHPSRMNIMHLDPSIALGFFC 312

Query: 370 RDKDDFDDFCA 380
             + DFDD C 
Sbjct: 313 HCESDFDDLCT 323


>gi|410920724|ref|XP_003973833.1| PREDICTED: cysteine protease ATG4B-like [Takifugu rubripes]
          Length = 394

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 118/372 (31%), Positives = 185/372 (49%), Gaps = 27/372 (7%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
            +T  +W+LG            + +      E   D +SR+  +YRK F PIG +  TSD
Sbjct: 21  ETTEPVWILG-----------NEYSALTEKEEILSDVTSRLWFTYRKSFPPIGGTGPTSD 69

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLL 184
            GWGCMLR  QM++ QAL+   LGR WR    +   +EY+ IL+ F D + S +SIH + 
Sbjct: 70  TGWGCMLRCGQMILGQALMCRHLGRDWRWVRGQKQRQEYISILNAFIDKKDSYYSIHQIA 129

Query: 185 QAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 234
           Q G   G   G W GP          A+  +W  L      +  +  + +  + +  +  
Sbjct: 130 QMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRLVVHVAMDNTVVIEEIKRLCMPWLDK 189

Query: 235 DE---DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
            E   + ER G    C++ A   C++  +  A W P++LL+PL LGL  +N  YI TL+ 
Sbjct: 190 AEVFGEPERVGELNGCLEGA---CALSEEEVALWKPLVLLIPLRLGLSDINGAYIETLKK 246

Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
            F  PQSLG++GGKP ++ Y +G      IYLDPH  Q  +   +     D + +     
Sbjct: 247 CFMLPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQTAVEPCEHGQFPDDTYHCQHPP 306

Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
             +H+  +DPS+A+GF+CR +D+FDD+C R  +L+   +  P+F +  +    +   D +
Sbjct: 307 CRMHICELDPSIAVGFFCRTEDEFDDWCMRIRRLSCNKDNLPMFELVDSQPSHLVGVDAI 366

Query: 412 GETGGVPEDDSL 423
             T    + + L
Sbjct: 367 NLTPDFSDSERL 378


>gi|395851538|ref|XP_003798310.1| PREDICTED: cysteine protease ATG4B [Otolemur garnettii]
          Length = 393

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 122/375 (32%), Positives = 180/375 (48%), Gaps = 58/375 (15%)

Query: 66  STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
           ++  +W+LG  + I  ++            E   D +SR+  +YRK F  IG +  TSD 
Sbjct: 22  TSEPVWILGRKYSIFTEKE-----------ELLSDVASRLWFTYRKNFPAIGGTGPTSDT 70

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
           GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q
Sbjct: 71  GWGCMLRCGQMIFAQALVCQHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQ 130

Query: 186 AGKAYGLAAGSWVGP---------YAMCRSWEALA------------RCQR-AETGLGCQ 223
            G   G + G W GP          A+  +W +LA              +R   T L C 
Sbjct: 131 MGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSLPCG 190

Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLG 277
           + P +              AP        +HC+ F  G       + W P++LL+PL LG
Sbjct: 191 TAPAS------------SAAP-------DQHCNGFPAGAEVTTRLSPWRPLVLLIPLRLG 231

Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
           L  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +     
Sbjct: 232 LTDINAAYVETLKRCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEATDS 291

Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
            L  D S +       + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F +
Sbjct: 292 CLVPDESFHCQHPPCRMSIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFEL 351

Query: 398 TQTHKKPVNHSDVLG 412
            +     +   DVL 
Sbjct: 352 VEQPPSHLACPDVLN 366


>gi|74136555|ref|NP_777364.3| cysteine protease ATG4A [Mus musculus]
 gi|61211821|sp|Q8C9S8.2|ATG4A_MOUSE RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
           cysteine endopeptidase; AltName: Full=Autophagin-2;
           AltName: Full=Autophagy-related cysteine endopeptidase
           2; AltName: Full=Autophagy-related protein 4 homolog A
 gi|59809037|gb|AAH89500.1| Atg4a protein [Mus musculus]
 gi|74193939|dbj|BAE36898.1| unnamed protein product [Mus musculus]
          Length = 396

 Score =  201 bits (510), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 112/353 (31%), Positives = 176/353 (49%), Gaps = 50/353 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
           D  + C V   G AD                     W P+LL+VPL LG+ ++NP Y+  
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
            +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++I +  L  D + +  
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
              + + + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352


>gi|296488734|tpg|DAA30847.1| TPA: cysteine protease ATG4B [Bos taurus]
          Length = 390

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 122/359 (33%), Positives = 175/359 (48%), Gaps = 26/359 (7%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L  F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
            Q G   G + G W GP  + +  + LA      + L          V++      R   
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187

Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
           P    +    D+ RHC+ F          A W P++LL+PL LGL  VN  Y  TL+  F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           + +  +DPS+A+GF+C  +DDF+D+C + SKL+      P+F + +     +   DVL 
Sbjct: 308 MSIAELDPSIAVGFFCETEDDFNDWCQQVSKLSLLGGALPMFELVEQQPSHLACPDVLN 366


>gi|47564102|ref|NP_001001170.1| cysteine protease ATG4B [Bos taurus]
 gi|61211780|sp|Q6PZ03.1|ATG4B_BOVIN RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related cysteine endopeptidase 2B;
           Short=Autophagin-2B; AltName: Full=Autophagy-related
           protein 4 homolog B; AltName: Full=bAut2B
 gi|45861660|gb|AAS78583.1| Aut2b2 [Bos taurus]
          Length = 393

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 122/359 (33%), Positives = 175/359 (48%), Gaps = 26/359 (7%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L  F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
            Q G   G + G W GP  + +  + LA      + L          V++      R   
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187

Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
           P    +    D+ RHC+ F          A W P++LL+PL LGL  VN  Y  TL+  F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           + +  +DPS+A+GF+C  +DDF+D+C + SKL+      P+F + +     +   DVL 
Sbjct: 308 MSIAELDPSIAVGFFCETEDDFNDWCQQVSKLSLLGGALPMFELVEQQPSHLACPDVLN 366


>gi|417410350|gb|JAA51650.1| Putative cysteine protease required for autophagy, partial
           [Desmodus rotundus]
          Length = 394

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 122/367 (33%), Positives = 176/367 (47%), Gaps = 42/367 (11%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 23  TSEPVWILGRRYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQALL   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 70  DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129

Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 226
            Q G   G + G W GP          A+  +W ALA     +  +  +        SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHVAMDNTVVMEDIRRLCRSSLP 189

Query: 227 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
            A       D +G   G P           +  +   + W P++LL+PL LGL  +N  Y
Sbjct: 190 CAGASAFPADSEGHCNGFPAR---------AEVTNRPSPWRPLVLLIPLRLGLTDINEAY 240

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           + TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S 
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCSIPDESF 300

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 405
           +       + +  +DPS+A+GF+C  +DDF D+C +  KL+      P+F + +     +
Sbjct: 301 HCQHPPSRMSIGELDPSIAVGFFCETEDDFGDWCQQVKKLSLLGGALPMFELVEQQPSHL 360

Query: 406 NHSDVLG 412
              DVL 
Sbjct: 361 ACPDVLN 367


>gi|308802424|ref|XP_003078525.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
 gi|116056978|emb|CAL51405.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
          Length = 424

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 126/364 (34%), Positives = 182/364 (50%), Gaps = 57/364 (15%)

Query: 72  LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 131
           + GV H   ++ + G+ +   G  E+ +D+ SR  ++YR+GF+ +G +K  +D GWGC L
Sbjct: 42  MFGVTH-WDRETSSGERSNEVGRREWERDWRSRCWMTYRRGFEALGRTKWCTDAGWGCTL 100

Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDRE----------------------------- 162
           RS+QM++A AL  H  GR WR+ +Q     E                             
Sbjct: 101 RSAQMMLANALSIHSRGRHWRREVQLVAVHENETADDGSKSPAVSFLSGVVNKLKIPQSE 160

Query: 163 --------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
                     +IL LF D   +PFSIH + +    +G   G W  P  MCR++EAL    
Sbjct: 161 RTRAGSDAQEDILRLFADEVGAPFSIHRVCEKTTEWGAPPGRWFEPSVMCRAFEALV--- 217

Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 274
            AE  LG +   + ++VVSG E GE GG P V  D+A         G+A    +LL VP+
Sbjct: 218 -AEHDLGSE---LTVHVVSGRE-GEDGGVPTV--DEAEVRAKSADVGKA----LLLFVPV 266

Query: 275 VLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           VLG+ + +N RY+  LR    F QS+GIVGG+P +S Y+VG  ++   YLDPH VQ   +
Sbjct: 267 VLGVGRTINARYLSQLRSMMAFKQSVGIVGGRPNSSLYLVGHSDDVFFYLDPHTVQVASS 326

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
           +   D E    +Y+     H+    +DP+LA+GFYCRD DD          LA  +  AP
Sbjct: 327 MVTMDFE----SYYCPTPLHVCGGDLDPTLALGFYCRDGDDVASLLVDIEALARVNATAP 382

Query: 394 LFTV 397
              +
Sbjct: 383 ALAI 386


>gi|148233205|ref|NP_001088025.1| cysteine protease ATG4B [Xenopus laevis]
 gi|61211762|sp|Q640G7.1|ATG4B_XENLA RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B
 gi|52221191|gb|AAH82660.1| LOC494717 protein [Xenopus laevis]
          Length = 384

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 115/343 (33%), Positives = 169/343 (49%), Gaps = 36/343 (10%)

Query: 99  QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
            D +SR+  +YR+ F  IG +  TSD GWGCMLR  QM+ AQAL+   +GR WR   QKP
Sbjct: 44  NDITSRLWFTYRRNFQAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHVGRDWRWDKQKP 103

Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 218
              EY+ IL  F D + S +SIH + Q G   G   G W GP  + +    LA   +  +
Sbjct: 104 -KGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKYIGQWYGPNTVAQVLRKLAVFDQWSS 162

Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-------------- 264
                   +A+++   +          V +D+  R C   S   +D              
Sbjct: 163 --------IAVHIAMDN---------TVVVDEIRRLCRAGSGESSDAGALSNGYTGDSDP 205

Query: 265 ----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
               W P++LL+PL LGL ++N  YI TL+  F  PQSLG++GG+P ++ Y +G   +  
Sbjct: 206 SCAQWKPLVLLIPLRLGLSEINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDEL 265

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
           IYLDPH  Q  +         D S +       +H+  IDPS+A+GF+C  ++DF+D+C 
Sbjct: 266 IYLDPHTTQLSVEPSDCSFIEDESFHCQHPPCRMHVSEIDPSIAVGFFCSSQEDFEDWCQ 325

Query: 381 RASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
              KL+      P+F V       +++ DVL  T    + D L
Sbjct: 326 HIKKLSLSGGALPMFEVVDQLPLHLSNPDVLNLTPDSSDADRL 368


>gi|440901286|gb|ELR52261.1| Cysteine protease ATG4B, partial [Bos grunniens mutus]
          Length = 393

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 121/359 (33%), Positives = 174/359 (48%), Gaps = 26/359 (7%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L  F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
            Q G   G + G W GP  + +  + LA      + L          V++      R   
Sbjct: 129 AQMGVGEGKSVGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187

Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
           P    +    D+ RHC+ F          A W P++LL+PL LGL  VN  Y  TL+  F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           + +  +DPS+A+GF+C  +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 308 MSIAELDPSIAVGFFCETEDDFNDWCQQVRKLSLLGGALPMFELVEQQPSHLACPDVLN 366


>gi|328874598|gb|EGG22963.1| hypothetical protein DFA_05093 [Dictyostelium fasciculatum]
          Length = 432

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 109/313 (34%), Positives = 171/313 (54%), Gaps = 11/313 (3%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR-LGRPWR 152
           + EF +DFS+++  SYR+GF+ IGDS   +D GWGCMLRS QML+A  LL +  +G+ W+
Sbjct: 88  IEEFLEDFSNKLWCSYRQGFECIGDSLFENDCGWGCMLRSGQMLLANVLLLNSPIGKDWK 147

Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALA 211
           KP    +  ++ +++ LF D  ++PFSIHN+   G+ + G + G W  P  +  +  AL 
Sbjct: 148 KPQNGEYPEDFYKVVRLFLDRPSAPFSIHNIALHGRNHLGKSIGEWFAPSNISNAIRALV 207

Query: 212 -RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC----SVFSKGQADWT 266
            +      G   +            +   +    V   DD S +      +  +    W 
Sbjct: 208 YKYDNHLNGTSEEDSSDEEKEGKKKKGDNQCNLSVYVSDDGSLYIDQLLEIALRSDGSWM 267

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
           P+L+L+P  LG++ +N  Y   L   +TFPQ+LGIVGGKP AS Y +  Q+++  YLDPH
Sbjct: 268 PLLILIPTKLGIDTINEIYYRPLLDIYTFPQNLGIVGGKPRASLYFIASQDDNLFYLDPH 327

Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
            VQ  I   + D +   S+Y  ++ +  ++  +DPSL I F+C  K+ F DF  R+ KL 
Sbjct: 328 TVQNSI---ESDSDFSLSSYFCNIPKKANISEVDPSLVIPFFCSTKESFLDFLERSKKL- 383

Query: 387 EESNGAPLFTVTQ 399
           E S+  PL+ + +
Sbjct: 384 ESSSEFPLYNIQE 396


>gi|27763985|emb|CAD43221.1| autophagin-2 [Mus musculus]
 gi|148675648|gb|EDL07595.1| mCG64870 [Mus musculus]
          Length = 396

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 111/353 (31%), Positives = 174/353 (49%), Gaps = 50/353 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H    +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHPFKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + L       +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLTLFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
           D  + C V   G AD                     W P+LL+VPL LG+ ++NP Y+  
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTVSNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
            +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++I +  L  D + +  
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
              + + + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352


>gi|260795879|ref|XP_002592932.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
 gi|229278156|gb|EEN48943.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
          Length = 380

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 116/349 (33%), Positives = 179/349 (51%), Gaps = 41/349 (11%)

Query: 71  WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
           W+LGV +   +D             E   D SSR+  +YRK F PIG +   SD GWGCM
Sbjct: 32  WILGVGYNTVKDRQ-----------ELQNDISSRLWFTYRKNFTPIGGTGPMSDQGWGCM 80

Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
           LR  QM++ QAL+   LGR WR      +D +Y +IL LF D + S +SIH + Q G + 
Sbjct: 81  LRCGQMMLGQALICRHLGRDWRWK-SAVYDNDYTKILQLFLDKKDSCYSIHQIAQMGVSE 139

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE----------DGER 240
           G + G W GP  + +  + LA  +   +        +AI+V   +              R
Sbjct: 140 GKSVGQWFGPNTVAQVLKKLALFEDWSS--------LAIHVAMDNTVIIDDIKKLCRSAR 191

Query: 241 GGAP------VVCIDDASRHCSVFSKGQA-DWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
              P       +C   ++   S  S+  A  W P++L++PL LGL ++NP Y   L+  F
Sbjct: 192 QPTPSQVTNSFLCNGVSAEQTSARSRSPALPWQPLMLIIPLRLGLSELNPVYTDCLKACF 251

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
           T  QSLG++GGKP  + Y +G    S +YLDPH  QP + + + ++    S++H      
Sbjct: 252 TLRQSLGMIGGKPNHAHYFIGYVGNSLVYLDPHTTQPAVEL-EGNVPIPDSSFHCTHPSR 310

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKL--AEESNGAPLFTVTQT 400
           +++  +DPS+A+GF+C+D+ DF D C    +L   +++  A +F V Q+
Sbjct: 311 MNIQDLDPSIALGFFCQDEADFADLCENMRRLIIGQKTQNA-MFEVVQS 358


>gi|18181958|dbj|BAB83890.1| Apg4B [Homo sapiens]
          Length = 392

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 121/366 (33%), Positives = 181/366 (49%), Gaps = 41/366 (11%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G     + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEK-SIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 179

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 180 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 239

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 240 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 299

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 406
                  + + ++DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 300 CQHPPCRMSIANLDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 359

Query: 407 HSDVLG 412
             DVL 
Sbjct: 360 CPDVLN 365


>gi|417410362|gb|JAA51656.1| Putative cysteine protease required for autophagy, partial
           [Desmodus rotundus]
          Length = 396

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 109/343 (31%), Positives = 175/343 (51%), Gaps = 27/343 (7%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     V+  S D   
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADMPS 195

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
           E    P+    +A+ H    S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 196 ESSHDPL----NATNHNKAISACCPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++ +  D + +     + + + +
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQSPQRMSILN 311

Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 312 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 353


>gi|149244060|pdb|2Z0D|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-120) Complex
 gi|149244062|pdb|2Z0E|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-124) Complex
          Length = 357

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 118/353 (33%), Positives = 176/353 (49%), Gaps = 40/353 (11%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 25  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 72  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDP   QP +         D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPATTQPAVEPTDGCFIPDESFH 303

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVE 356


>gi|224510547|pdb|2ZZP|A Chain A, The Crystal Structure Of Human Atg4b(C74s)- Lc3(1-124)
           Complex
          Length = 357

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 118/353 (33%), Positives = 176/353 (49%), Gaps = 40/353 (11%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 25  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWG MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 72  DTGWGSMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVE 356


>gi|338729393|ref|XP_001490718.3| PREDICTED: cysteine protease ATG4A [Equus caballus]
          Length = 398

 Score =  197 bits (502), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 111/343 (32%), Positives = 179/343 (52%), Gaps = 27/343 (7%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     I  +S D  G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTAG 197

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
           E   +P   ++ ++R  S  S G   W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 E---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 313

Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|26334447|dbj|BAC30924.1| unnamed protein product [Mus musculus]
          Length = 396

 Score =  197 bits (502), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 111/353 (31%), Positives = 175/353 (49%), Gaps = 50/353 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+Y    +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYDSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
           D  + C V   G AD                     W P+LL+VPL LG+ ++NP Y+  
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
            +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++I +  L  D + +  
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
              + + + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352


>gi|189233733|ref|XP_971091.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270015047|gb|EFA11495.1| hypothetical protein TcasGA2_TC014208 [Tribolium castaneum]
          Length = 453

 Score =  197 bits (502), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 172/346 (49%), Gaps = 60/346 (17%)

Query: 65  SSTSDIWLLGVCHK-----------------IAQDEALGDAAGNNGLAEFNQDFSSRILI 107
           S  S +WLLG C++                   Q ++   ++ + G   F +DF SR+ +
Sbjct: 63  SKESPVWLLGKCYRRIESPSSDSTELGTDVAAFQSQSEIASSDDEGFEGFKKDFISRLWL 122

Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDRE-YVE 165
           +YR+ F  +  S  +SD GWGCMLRS QML+AQAL+ H LGR WR +P  +P  RE ++E
Sbjct: 123 TYRREFPILNGSNYSSDCGWGCMLRSGQMLIAQALVCHILGRDWRWQPDHQPTTRESFIE 182

Query: 166 ------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
                 I+  FGD  S  SPFSIH L+  G+A G  AG W GP                 
Sbjct: 183 VVNHRKIIKWFGDKPSRNSPFSIHTLVALGEASGKKAGDWYGP----------------- 225

Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--------GQADWTPIL 269
            G        A    S  ED     +  VC+   ++ C+V+ K            W  ++
Sbjct: 226 -GFVAHLFRQAFKRAS--EDNYEFDSLTVCV---AQDCAVYIKDVMEECTDKNGKWKSLI 279

Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
           LL+P+ LG EK N  Y P L   F+  Q +GI+GG+P  S Y VG Q++  I+LDPH  Q
Sbjct: 280 LLIPVRLGAEKFNSIYAPCLTTLFSLKQCIGIIGGRPKHSLYFVGYQDDKLIHLDPHYCQ 339

Query: 330 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
            V+++   D     +++H    R IHL  +DPS  IGFYC  K+ F
Sbjct: 340 EVVDVWAVDFP--LTSFHCRSPRKIHLSKMDPSCCIGFYCPTKESF 383


>gi|348563665|ref|XP_003467627.1| PREDICTED: cysteine protease ATG4A-like [Cavia porcellus]
          Length = 398

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 108/342 (31%), Positives = 175/342 (51%), Gaps = 25/342 (7%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
            G + G W GP          A+   W +LA     +  +  + +     V+    D   
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPFSADTAD 197

Query: 241 GGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
             +P   I  + S+  S F      W P+LL+VPL LG+ ++NP Y+   +  F  PQSL
Sbjct: 198 KSSPDSFITSNQSKDTSAFCPA---WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSL 254

Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 359
           G +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ ++
Sbjct: 255 GALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQQMNILNL 314

Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 315 DPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|118404310|ref|NP_001072464.1| autophagy related 4B, cysteine peptidase [Xenopus (Silurana)
           tropicalis]
 gi|115291929|gb|AAI21871.1| cysteine endopeptidase AUT-like (1O128) [Xenopus (Silurana)
           tropicalis]
          Length = 384

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 118/335 (35%), Positives = 172/335 (51%), Gaps = 20/335 (5%)

Query: 99  QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
            D +SR+  +YR+ F  IG +  TSD GWGCMLR  QM+ AQALL   +GR WR   QK 
Sbjct: 44  NDITSRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHIGRDWRWDKQKS 103

Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 218
              EY+ IL  F D + S +SIH + Q G   G   G W GP  + +    LA   +  +
Sbjct: 104 -QGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKCIGQWYGPNTVAQVLRKLAVFDQWSS 162

Query: 219 GLGCQSLPMAIYV-----VSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPI 268
                   +A+++     V  DE      A      +A   C+ ++ G +D     W P+
Sbjct: 163 --------IAVHIAMDNTVVMDEIRRLCRAGTNESSEAGALCNGYT-GVSDPSCSLWKPL 213

Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           +LL+PL LGL  +N  YI TL+  F  PQSLG++GG+P ++ Y +G   +  IYLDPH  
Sbjct: 214 VLLIPLRLGLSDINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDELIYLDPHTT 273

Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           Q  +         D S +       +H+  IDPS+A+GF+CR ++DF+D+C +  KL+  
Sbjct: 274 QLAVEPSDCCFVEDESFHCQHPPCRMHVSEIDPSIAVGFFCRSQEDFEDWCQQIKKLSLS 333

Query: 389 SNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
               P+F V       +++ DVL  T    + D L
Sbjct: 334 GGALPMFEVVDQLPLHLSNPDVLNLTPDSSDADRL 368


>gi|449676306|ref|XP_002158689.2| PREDICTED: cysteine protease ATG4C-like [Hydra magnipapillata]
          Length = 442

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 117/344 (34%), Positives = 176/344 (51%), Gaps = 28/344 (8%)

Query: 66  STSDIWLLGVCHKIAQDEALGDAAGNN------GLAEFNQDFSSRILISYRKGFDPIGDS 119
           S S IWLLG C+   Q E     A  N      G+  F +DFSS I +SYRK F  + +S
Sbjct: 63  SDSPIWLLGRCYYAKQAEYDSKNAVQNTQYKIHGIDCFFEDFSSLIYLSYRKHFSQLANS 122

Query: 120 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGD--SET 175
            +TSD GWGCMLR+ QML+A ALL H L   WR   +K  ++ Y+   IL  F D  S+ 
Sbjct: 123 NLTSDSGWGCMLRTGQMLLANALLIHMLKEGWRISERKYTEKNYIYRMILRFFNDENSDN 182

Query: 176 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 234
           SPFS+H L++ G       G W GP ++  +  A          +   S P +  + V  
Sbjct: 183 SPFSLHELVRIGSK---KPGEWYGPTSVAHTLSA---------AVNLTSHPVLDTFRVYV 230

Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
             D       V+      ++C+  +  +  W  +L+LVP+ LG + +NP YIP L+   T
Sbjct: 231 ANDCTVYIKDVISTSTKCKNCTKKTCQEKFWRSMLILVPIRLGSDGLNPIYIPCLKALLT 290

Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
               +GI+GG+P  S Y VG Q +  I LDPH +Q  +++   +   ++   H    + +
Sbjct: 291 LDYCVGIIGGRPKHSLYFVGFQGKKLINLDPHYLQEYVDMTTQEFPVESFRCHYP--KKM 348

Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE---ESNGAPLF 395
               +DPS A+GFYCR ++DF+  C +A ++ +   +    P+F
Sbjct: 349 AFKKMDPSCAVGFYCRTREDFESLCKQAVEMLKPPMQRTEYPMF 392


>gi|197100863|ref|NP_001126588.1| cysteine protease ATG4A [Pongo abelii]
 gi|61211744|sp|Q5R699.1|ATG4A_PONAB RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|55732020|emb|CAH92717.1| hypothetical protein [Pongo abelii]
          Length = 398

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 178/356 (50%), Gaps = 53/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 285
           D  + C V                     SKG +     W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRVLPLGADTAGDRPPDSLTASNLSKGTSAYCSAWKPLLLIVPLRLGINQINPVY 240

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           +   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++ G++    D + 
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTGENGTVNDQTF 300

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +     + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|156395764|ref|XP_001637280.1| predicted protein [Nematostella vectensis]
 gi|156224391|gb|EDO45217.1| predicted protein [Nematostella vectensis]
          Length = 368

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 113/334 (33%), Positives = 174/334 (52%), Gaps = 39/334 (11%)

Query: 66  STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
           +  D+W+LG  + I Q    GD      +   N D  SRI ++YRK F  IG +  T+D 
Sbjct: 26  TEEDVWILGKRYNILQ----GD------MGYLNTDVRSRIWLTYRKNFPKIGGTGPTTDS 75

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
           GWGCMLR  QM++AQAL+   LGR W+   +     EY++IL  F D + S +SIH + Q
Sbjct: 76  GWGCMLRCGQMMLAQALVCRHLGRDWQWDPENNTTPEYMQILEAFLDKKDSLYSIHQIAQ 135

Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
            G + G A GSW GP  + +  + L+      +        + ++V   +          
Sbjct: 136 MGVSEGKAVGSWFGPNTVAQVLKKLSAFDDWSS--------LCLHVAMDN---------T 178

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
           V I+D S           +W P++L +PL LGL ++N  Y   L+  FTF QSLGI+GG+
Sbjct: 179 VIIEDIS-----------NWRPLVLFIPLRLGLTEMNVVYNEPLKACFTFKQSLGIIGGR 227

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P  +TY +G    + +YLDPH  Q  +N  +     D S +H      +++  +DPS+A+
Sbjct: 228 PNHATYFIGYFGNNLVYLDPHTTQQTVNPDELSRIPDGS-FHCVYPCRMNIADVDPSVAL 286

Query: 366 GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
           GF+C+ ++DFDD C +  K   +    P+F + +
Sbjct: 287 GFFCKSEEDFDDLCQQIQKKIIDGKSRPMFEIAK 320


>gi|449666316|ref|XP_002168183.2| PREDICTED: cysteine protease ATG4B-like [Hydra magnipapillata]
          Length = 436

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 115/366 (31%), Positives = 174/366 (47%), Gaps = 44/366 (12%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG   K  +D           + +FN +  ++   +YR+ F PIG +   SD GWGC
Sbjct: 31  VWILGKHFKPDED-----------MEKFNAEILTKFWFTYRRNFHPIGGTGPMSDTGWGC 79

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQALL   LGR W     +  +  Y+ ILH F D + S +SIH + Q G  
Sbjct: 80  MLRCGQMMLAQALLCRHLGRDWDWRSGRKDNEIYMMILHSFLDKKDSLYSIHQIAQMGVG 139

Query: 190 YGLAAGSWVGPYAMCRSWEALAR-------------------------CQRAETGLGCQS 224
            G   G W GP  + +  + L                           C+ +    GC  
Sbjct: 140 EGKQIGQWFGPNTVAQVIKKLVLFDDNADMAVHVAMDNTVVIEDIKKLCKSSINAWGCYG 199

Query: 225 LPMAIYVVSGDEDGERGGAPVVCIDDASRHC-------SVFSKGQADWTPILLLVPLVLG 277
               I+  S     +    P  C  ++S+         S  S+    W P+LL +PL LG
Sbjct: 200 ECSYIHDRSSLTGNQSVSKPPHCSCESSQKLKSNRKLKSFNSEELQSWRPLLLFIPLRLG 259

Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
           L ++N  Y  +L++ FT  QSLG++GGKP  + Y +G   +  +YLDPH  Q  I   + 
Sbjct: 260 LSEINSDYYNSLKIMFTLRQSLGVIGGKPNHAHYFIGFNGDRLLYLDPHTTQQTIEPERF 319

Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
           ++  D S +H      +   S+DPS+A+GFYC  +DDFDD+C   ++L  +    P+F +
Sbjct: 320 NVIPDES-FHCVYPCFMSFQSLDPSVALGFYCHTEDDFDDWCQAVNELVVQREKRPMFEI 378

Query: 398 TQTHKK 403
            QT  +
Sbjct: 379 NQTRPR 384


>gi|146387686|pdb|2P82|A Chain A, Cysteine Protease Atg4a
 gi|146387687|pdb|2P82|B Chain B, Cysteine Protease Atg4a
 gi|146387688|pdb|2P82|C Chain C, Cysteine Protease Atg4a
 gi|146387689|pdb|2P82|D Chain D, Cysteine Protease Atg4a
          Length = 355

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 25  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 73

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 74  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 133

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 134 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 193

Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
           +R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +  F  
Sbjct: 194 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 246

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
           PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + ++
Sbjct: 247 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 306

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 307 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 351


>gi|402911087|ref|XP_003918174.1| PREDICTED: cysteine protease ATG4A isoform 1 [Papio anubis]
          Length = 398

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 179/351 (50%), Gaps = 43/351 (12%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192

Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
            D  G+R    +   + +   S HC         W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 193 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 245

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++ +  D + +    
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVNDQTFHCLQS 305

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|344286328|ref|XP_003414911.1| PREDICTED: cysteine protease ATG4A [Loxodonta africana]
          Length = 411

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 177/356 (49%), Gaps = 53/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  + +           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 42  VWILGKQHLLKTERS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 90

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 91  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 150

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 151 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 193

Query: 250 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 285
           D  + C VF                    SKG +     W P+LL+VPL LG+ ++NP Y
Sbjct: 194 DIKKMCCVFPLSAGAAGESPPAFPSASSQSKGTSACCPAWKPLLLIVPLRLGINQINPVY 253

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           +   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++ +  D + 
Sbjct: 254 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTF 313

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +     + +++ ++DPS+A+GF+C+++ DFD++C    K   + N   +F + Q H
Sbjct: 314 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCCLVQKEILKEN-LRMFELVQKH 368


>gi|403289551|ref|XP_003935915.1| PREDICTED: cysteine protease ATG4A isoform 1 [Saimiri boliviensis
           boliviensis]
          Length = 422

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 111/348 (31%), Positives = 179/348 (51%), Gaps = 37/348 (10%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 53  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 216

Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
            D  G+R    +   ++ SR  S +      W P+LL+VPL LG+ ++NP Y+   +  F
Sbjct: 217 ADTPGDRPPDSLTASNE-SRGTSAYCPA---WKPLLLIVPLRLGINQINPVYVDAFKECF 272

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
             PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + 
Sbjct: 273 KMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQR 332

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 333 MNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 379


>gi|350537069|ref|NP_001233457.1| cysteine protease ATG4A [Pan troglodytes]
 gi|343958112|dbj|BAK62911.1| cysteine protease ATG4A [Pan troglodytes]
 gi|410207960|gb|JAA01199.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
 gi|410248796|gb|JAA12365.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
 gi|410290856|gb|JAA24028.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
 gi|410329967|gb|JAA33930.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
          Length = 398

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 180/351 (51%), Gaps = 43/351 (12%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
            G + G W GP          A+   W +LA     +  +        C+ LP++I    
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSI---- 193

Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
            D  G+R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 194 -DTPGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 245

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|395528686|ref|XP_003766458.1| PREDICTED: cysteine protease ATG4B [Sarcophilus harrisii]
          Length = 393

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 115/364 (31%), Positives = 179/364 (49%), Gaps = 36/364 (9%)

Query: 66  STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
           +T  +W+LG  + I  ++            E   D +SR+  +YRK F  IG +  TSD 
Sbjct: 22  TTEPVWILGRKYTIFTEKE-----------EILSDVTSRLWFTYRKNFPAIGGTGPTSDT 70

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
           GWGCMLR  QM+ AQAL+   LGR WR    +     Y  +L+ F D + S +SIH + Q
Sbjct: 71  GWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQ 130

Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGER 240
            G   G + G W GP  + +  + LA      +        +A+++     V  +E    
Sbjct: 131 MGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRL 182

Query: 241 GGAPVVCIDDAS--RHCSVFSKGQ----------ADWTPILLLVPLVLGLEKVNPRYIPT 288
             A   C D A+      + S G           + W P++LL+PL LGL  +N  Y  T
Sbjct: 183 CKAGFPCADGAAFPTDSELLSNGYPPAAEVTDRASPWRPLVLLIPLRLGLTDINEAYTET 242

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           L+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +   +  +  D + +  
Sbjct: 243 LKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVESTEGGVFPDETFHCQ 302

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS 408
                +++  +DPS+A+GF+C+ ++DF+D+C +  KL+      P+F + +      +  
Sbjct: 303 HPPCRMNIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSRIPGALPMFELVERQPSHFSCP 362

Query: 409 DVLG 412
           DVL 
Sbjct: 363 DVLN 366


>gi|119623100|gb|EAX02695.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_f
           [Homo sapiens]
          Length = 402

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 33  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 82  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 201

Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
           +R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +  F  
Sbjct: 202 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 254

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
           PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + ++
Sbjct: 255 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 314

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 315 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 359


>gi|62860068|ref|NP_001016619.1| autophagy related 4A, cysteine peptidase [Xenopus (Silurana)
           tropicalis]
 gi|89269917|emb|CAJ81691.1| APG4 autophagy 4 homolog A (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
 gi|171846953|gb|AAI61565.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
 gi|213625518|gb|AAI70776.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
 gi|213627145|gb|AAI70802.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
          Length = 395

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 113/319 (35%), Positives = 163/319 (51%), Gaps = 27/319 (8%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
              D  SR+  +YRK F PIG +  +SD GWGCMLR  QM++AQAL+   LGR WR    
Sbjct: 49  LQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWRWEKH 108

Query: 157 KPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
           K    EY +IL  F D +   +SIH + Q G   G + G W GP  + +  + LA     
Sbjct: 109 KEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 168

Query: 217 ETGLGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSK-----GQAD 264
            +        +A+Y      VV  D        P  C +  A+ + S +S+     GQ+ 
Sbjct: 169 NS--------LAVYVSMDNTVVIEDIKTMCKYQPHSCSMAQAASYQSTWSRCRDASGQSS 220

Query: 265 -WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
            W P+LL+VPL LG+  +NP Y+   +  F  PQSLG +GGKP  + Y +G   +  IYL
Sbjct: 221 GWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEIIYL 280

Query: 324 DPHDVQPVINIGKDDLEADTSTYHSDV-IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
           DPH  Q  +     D E    TYH       + + ++DPS+A+GF+C+D++DFD++C   
Sbjct: 281 DPHTTQTFV-----DTEDQDQTYHCQKGPNSMKVLNLDPSVALGFFCKDENDFDNWCEVI 335

Query: 383 SKLAEESNGAPLFTVTQTH 401
            K   +     +F +T  H
Sbjct: 336 EKEILKHQSLRMFELTPKH 354


>gi|47212536|emb|CAF90552.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 366

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 111/324 (34%), Positives = 163/324 (50%), Gaps = 41/324 (12%)

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
           D +SR+  +YRKGF PIG +  TSD GWGCMLR  QM++ QAL+   LGR WR    +  
Sbjct: 68  DVTSRLWFTYRKGFPPIGGTGPTSDTGWGCMLRCGQMILGQALMCRHLGRDWRWVSGEEQ 127

Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
             EYV IL+ F D + S +SIH + +                 +C  W   A    A  G
Sbjct: 128 RHEYVNILNAFIDKKDSYYSIHQIER-----------------LCMPWLDKAEACAASEG 170

Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
           +G             + +G   GA           C+   +  A W P++LL+PL LGL 
Sbjct: 171 VG-------------ELNGYLEGA-----------CAFSEEETALWKPLVLLIPLRLGLT 206

Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
            +N  YI TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  Q  ++  +D  
Sbjct: 207 DINEAYIETLKKCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQTAVDPCEDGT 266

Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
             D S +       +H+  +DPS+A GF+CR +D+FDD+C R  +L+   +  P+F + +
Sbjct: 267 FTDDSYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDWCMRIRRLSCNRDNLPMFELVE 326

Query: 400 THKKPVNHSDVLGETGGVPEDDSL 423
           +    +   D +  T    + + L
Sbjct: 327 SQPSHMVSVDAINLTPDFSDSERL 350


>gi|195113543|ref|XP_002001327.1| GI10728 [Drosophila mojavensis]
 gi|193917921|gb|EDW16788.1| GI10728 [Drosophila mojavensis]
          Length = 682

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 162/314 (51%), Gaps = 17/314 (5%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  L ++    G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 262 AAENQLAESPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 321

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR   +      Y + +H      FGD  S+ SPFSIH L++ G+  G 
Sbjct: 322 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 381

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDE---DGERGGAP 244
             G W GP ++    + AL    R        S+ +A    IY+   +E     E    P
Sbjct: 382 KPGDWYGPASVSYLLKHALEHAARENADFDNISVYVAKDCTIYIQDIEELCSIPEPAPKP 441

Query: 245 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
            V    A R  S   K    W  +++L+PL LG +K+NP Y   L+L  +    LGI+GG
Sbjct: 442 HVPWQQAKRSTSDAPKPDQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEYCLGIIGG 501

Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
           KP  S Y VG QE+  I+LDPH  Q ++++ ++       ++H    R +    +DPS  
Sbjct: 502 KPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETFP--MHSFHCKSPRKLKSSKMDPSCC 559

Query: 365 IGFYCRDKDDFDDF 378
           IGFYC  K DFD F
Sbjct: 560 IGFYCPTKTDFDSF 573


>gi|410989157|ref|XP_004000831.1| PREDICTED: cysteine protease ATG4A isoform 1 [Felis catus]
          Length = 398

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 107/341 (31%), Positives = 176/341 (51%), Gaps = 23/341 (6%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
            G + G W GP          A+   W +LA     +  +  + +     V+    D   
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTVG 197

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
              P   ++ +++    F+   A W P+LL+VPL LG+ ++NP Y+   +  F  PQSLG
Sbjct: 198 ESTPGT-LNASNQSRGTFACCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSLG 255

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
            +GGKP  + Y +G   +  I+LDPH  Q  +N  ++    D + +     + +++ ++D
Sbjct: 256 ALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVNTEENGTVDDQTFHCLQSPQRMNILNLD 315

Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           PS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 316 PSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|397497900|ref|XP_003819741.1| PREDICTED: cysteine protease ATG4A isoform 1 [Pan paniscus]
          Length = 398

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTPG 197

Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
           +R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +  F  
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
           PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 311 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|332226092|ref|XP_003262223.1| PREDICTED: cysteine protease ATG4A isoform 1 [Nomascus leucogenys]
          Length = 398

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTAG 197

Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
           +R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +  F  
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
           PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 311 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|30795252|ref|NP_443168.2| cysteine protease ATG4A isoform a [Homo sapiens]
 gi|426397036|ref|XP_004064734.1| PREDICTED: cysteine protease ATG4A isoform 1 [Gorilla gorilla
           gorilla]
 gi|61211859|sp|Q8WYN0.1|ATG4A_HUMAN RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
           cysteine endopeptidase; AltName: Full=Autophagin-2;
           AltName: Full=Autophagy-related cysteine endopeptidase
           2; AltName: Full=Autophagy-related protein 4 homolog A;
           Short=hAPG4A
 gi|18181956|dbj|BAB83889.1| Apg4A [Homo sapiens]
 gi|27763979|emb|CAD43218.1| autophagin-2 [Homo sapiens]
 gi|38197608|gb|AAH61696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
 gi|119623094|gb|EAX02689.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|189069378|dbj|BAG37044.1| unnamed protein product [Homo sapiens]
 gi|312151352|gb|ADQ32188.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [synthetic
           construct]
          Length = 398

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 197

Query: 239 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
           +R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +  F  
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
           PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 311 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|332026942|gb|EGI67039.1| Cysteine protease ATG4D [Acromyrmex echinatior]
          Length = 392

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 126/402 (31%), Positives = 190/402 (47%), Gaps = 49/402 (12%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
           S  S +WLLG+C+    +  L  A+                      N + EF +DF SR
Sbjct: 6   SKESPVWLLGLCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 65

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREY 163
           + ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR WR +P Q   +  +
Sbjct: 66  LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQSTDESSH 125

Query: 164 VEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRAE 217
             I+  FGD  T  SPFSIH L+  G + G  AG W GP    + +C++ E      RA 
Sbjct: 126 RMIIKWFGDQPTPESPFSIHKLVSLGASTGKRAGDWYGPSSVAHLLCQAME------RAS 179

Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 277
                +   +A+YV        +    V C  D  R              ++LLVPL LG
Sbjct: 180 EDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGGR------------KALILLVPLRLG 227

Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
            +K+NP Y P L    T    +G++GG+P  S Y +G Q++  I+LDPH  Q  +++  +
Sbjct: 228 ADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDVEGN 287

Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA--EESNGAPLF 395
           + +   +++H    R + L  +DPS  +GFY  DK+   DF     +     ++   P+F
Sbjct: 288 E-KFPLTSFHCTSPRKMLLSKMDPSCCVGFYFPDKESLTDFMETIQQFVIPNQNMDYPMF 346

Query: 396 TVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHE 437
              +   K +     + E G +P     G  SM D +    E
Sbjct: 347 LFCEGSGKDLQQGIEVVE-GLLPSSSRFGHESMEDDLFECEE 387


>gi|345307034|ref|XP_001513122.2| PREDICTED: cysteine protease ATG4B-like [Ornithorhynchus anatinus]
          Length = 461

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 115/364 (31%), Positives = 174/364 (47%), Gaps = 37/364 (10%)

Query: 66  STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
           +T  +W+LG  + I  ++            +   D +SR+  +YRK F  IG +  TSD 
Sbjct: 91  TTEPVWILGRKYTIFTEKE-----------DILSDVTSRLWFTYRKNFPAIGGTGPTSDT 139

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
           GWGCMLR  QM+ AQALL   LGR WR    +     Y  +L+ F D + S +SIH + Q
Sbjct: 140 GWGCMLRCGQMIFAQALLCRHLGRDWRWKKGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQ 199

Query: 186 AGKAYGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQ-SLPMA 228
            G   G + G W GP  + +         +W +LA            E    C+ + P  
Sbjct: 200 MGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSSLAVHIAMDNTVVIEEIRRLCKPNFPAG 259

Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
                 D +    G P           +  +     W P++LL+PL LGL ++N  YI T
Sbjct: 260 ASAFPTDSEFLLNGFP---------SGAEVTNRPTQWKPLVLLIPLRLGLTEINEAYIET 310

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           L+  F  PQSLG++GGKP ++ Y +G      IYLDPH  QP + I       D S +  
Sbjct: 311 LKHCFMMPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQPAVEISGSCFIPDESFHCQ 370

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS 408
                +++  +DPS+A+GF+C+ ++DF+D+C +  KL+      P+F + +      +  
Sbjct: 371 HPPCRMNIVELDPSIAVGFFCKTEEDFNDWCQQVKKLSLIRGALPMFELVEHQPSHFSSP 430

Query: 409 DVLG 412
           DVL 
Sbjct: 431 DVLN 434


>gi|355669953|gb|AER94691.1| ATG4 autophagy related 4-like protein A [Mustela putorius furo]
          Length = 408

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 109/343 (31%), Positives = 175/343 (51%), Gaps = 27/343 (7%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 39  VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 87

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 88  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 147

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 148 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTVG 207

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
           E     +    +AS        G+  W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 208 ESPPDTL----NASNQSKGTPAGRPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 263

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 264 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 323

Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 324 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 365


>gi|355705060|gb|EHH30985.1| Cysteine protease ATG4A, partial [Macaca mulatta]
          Length = 396

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 178/351 (50%), Gaps = 43/351 (12%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 190

Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
            D  G+R    +   + +   S HC         W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 191 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 243

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 244 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 303

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 304 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 353


>gi|387762879|ref|NP_001248420.1| cysteine protease ATG4A [Macaca mulatta]
 gi|380809390|gb|AFE76570.1| cysteine protease ATG4A isoform a [Macaca mulatta]
 gi|383413573|gb|AFH30000.1| cysteine protease ATG4A isoform a [Macaca mulatta]
          Length = 398

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 178/351 (50%), Gaps = 43/351 (12%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192

Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
            D  G+R    +   + +   S HC         W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 193 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 245

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|15487240|emb|CAC69076.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
          Length = 398

 Score =  194 bits (492), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 110/348 (31%), Positives = 178/348 (51%), Gaps = 37/348 (10%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192

Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
            D  G+R    +    + S+  S +      W P+LL+VPL LG+ ++NP Y+   +  F
Sbjct: 193 ADTAGDRPPDSLTA-SNQSKGTSAYCTA---WKPLLLIVPLRLGINQINPVYVDAFKECF 248

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
             PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + 
Sbjct: 249 KMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQR 308

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 309 MNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|328769729|gb|EGF79772.1| hypothetical protein BATDEDRAFT_35298 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 441

 Score =  194 bits (492), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 119/312 (38%), Positives = 168/312 (53%), Gaps = 32/312 (10%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
            F  DF SR+ ++YRKGF  I  +  T D GWGCMLRS QMLVA ALLFH LGR WR  L
Sbjct: 137 HFLDDFHSRLWMTYRKGFAAIKPTGYTCDSGWGCMLRSGQMLVANALLFHELGRDWR--L 194

Query: 156 QKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 211
               DR+    Y  IL  F D  TSP+SI  +   G  +    G W GP  + +  + L 
Sbjct: 195 GDSNDRDTWLTYCSILTKFLDVNTSPYSIQRIATLGIRFDKQIGEWFGPSTISQVLKVLV 254

Query: 212 RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILL 270
                      Q + + ++V     DG      +  I  A+R       G+   TP +L+
Sbjct: 255 NDD--------QRISLKVHV---SNDGVVYKNEINTILSATR-----DDGK---TPAVLI 295

Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
           ++PL LG+E +NP Y P ++  F     +GI GG+P +S + +GV  +  IYLDPH ++P
Sbjct: 296 MIPLRLGVETMNPVYYPGVKHCFAMSHCVGIAGGRPNSSLFFLGVDGDHLIYLDPHHLRP 355

Query: 331 VI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
            +   +I    +E D  +YH + +R + + S+DPSL IGFYC    DFD  CA+ ++LA 
Sbjct: 356 SVDSRDITSYKME-DLLSYHCEKVRLLPIASMDPSLVIGFYCHSLKDFDVLCAKMTELAT 414

Query: 388 ESNGAPLFTVTQ 399
            S  APLF++ +
Sbjct: 415 GS--APLFSIEE 424


>gi|405953478|gb|EKC21133.1| Leucine-rich repeat-containing protein 6 [Crassostrea gigas]
          Length = 1114

 Score =  194 bits (492), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 182/364 (50%), Gaps = 32/364 (8%)

Query: 68  SDIWLLGVCHKIAQDEALGDAAGNN-------GLAEFNQDFSSRILISYRKGFDPIGDSK 120
           S +WLLG  + I   + + D             + +F QDFSS +  +YR+ F  I  +K
Sbjct: 226 SPVWLLGKFYHIKPSDLIDDDIQRGKRTRVVPNIEKFKQDFSSLLWFTYRQDFPAIPGTK 285

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV--EILHLFGD--SETS 176
           +TSD GWGCMLRS QM++A+AL  H LG  W     +  ++E    +I+  FGD   + S
Sbjct: 286 LTSDCGWGCMLRSGQMMLAKALTLHYLGPEWNVFSDQTREQETYRKQIIRWFGDYLCDES 345

Query: 177 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGD 235
           PFS+H L++ GK  G   G W GP ++     E + + Q+ +T L      + +YV    
Sbjct: 346 PFSMHRLVEVGKNLGKQPGEWFGPASVAHILKETMVKGQKTQTVLS----DLCVYVSQDC 401

Query: 236 EDGERGGAPVVCI----------DDASRHCSVFSKGQADWT-PILLLVPLVLGLEKVNPR 284
              ++    + C              S H S       DW   +++L+P+ LG E++NP 
Sbjct: 402 TVYKQDIYELCCTRPRADTKFTNSTESEHESSQDASSMDWKRAVVILIPVRLGGEQLNPV 461

Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
           YIP ++   +    +GI+GGKP  S Y VG QE+  IYLDPH  Q V++  +        
Sbjct: 462 YIPCVKGLLSQDSCIGIIGGKPKHSLYFVGWQEDKLIYLDPHYCQDVVDTRERHFP--IQ 519

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA---EESNGAPLFTVTQTH 401
           +YH    R + +D IDPS  IGFYCR++ +F+ F  +  ++    ++    P+F  +  H
Sbjct: 520 SYHCMSPRKVSIDKIDPSCTIGFYCRNQKEFEKFVQQTEEMVAPPKQRLSYPMFVFSDGH 579

Query: 402 KKPV 405
              V
Sbjct: 580 SNEV 583


>gi|395854618|ref|XP_003799779.1| PREDICTED: cysteine protease ATG4A isoform 1 [Otolemur garnettii]
          Length = 398

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 107/343 (31%), Positives = 175/343 (51%), Gaps = 27/343 (7%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTAG 197

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
           E     +  ++ +       S  +  W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 ESPPGSLTALNQSKGT----SACRPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
           LG +GGKP  + Y +G      I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 313

Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|449266947|gb|EMC77925.1| Cysteine protease ATG4B, partial [Columba livia]
          Length = 393

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 121/373 (32%), Positives = 180/373 (48%), Gaps = 39/373 (10%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD GWGC
Sbjct: 25  VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR    K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 74  MLRCGQMIFAQALVCRHLGRDWRWIKGKRQVDNYFSVLNAFVDRKDSYYSIHQIAQMGVG 133

Query: 190 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 230
            G + G W GP  + +         +W +LA            E    CQS      A  
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNAPCAGAAA 193

Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
             + + DG   G P    ++A          ++ W P++LL+PL LGL ++N  YI TL+
Sbjct: 194 CPAVESDGLYNGCP----EEAG-----VRDRRSLWKPLVLLIPLRLGLTEINEAYIETLK 244

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
             F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +    
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEHNDSGCLPDESFHCQHP 304

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDV 410
              + +  +DPS+A+GF+C  ++DF+D+C +  KL+      P+F + +      ++ DV
Sbjct: 305 PCRMSIAELDPSIAVGFFCNTEEDFNDWCQQIKKLSLVRAALPMFELVERQPSHFSNPDV 364

Query: 411 LGETGGVPEDDSL 423
           L  T    + D L
Sbjct: 365 LNLTPDSSDADRL 377


>gi|126338580|ref|XP_001366892.1| PREDICTED: cysteine protease ATG4B-like [Monodelphis domestica]
          Length = 396

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 116/375 (30%), Positives = 177/375 (47%), Gaps = 58/375 (15%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           +T  +W+LG  + I   +DE L D              +SR+  +YRK F  IG +  TS
Sbjct: 25  TTDPVWILGRKYTIFTEKDEILSDV-------------TSRLWFTYRKNFPAIGGTGPTS 71

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR    +     Y  +L+ F D + S +SIH +
Sbjct: 72  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQI 131

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
            Q G   G + G W GP  + +  + LA      +        +A+++   +        
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDN-------- 175

Query: 244 PVVCIDDASRHCSV-FSKGQA-------------------------DWTPILLLVPLVLG 277
             V ++D  R C   FS   A                          W P++LL+PL LG
Sbjct: 176 -TVVMEDIRRLCKANFSHTDAAALPPDSDLLSNGYPPGAEVTDRLSQWRPLVLLIPLRLG 234

Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
           L  +N  Y  TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  Q  + +   
Sbjct: 235 LTDINEAYTETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQAAVELSNG 294

Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
            +  D S +       +++  +DPS+A+GF+C+ ++DF+D+C +  KL+      P+F +
Sbjct: 295 GVIPDESFHCQHPPCRMNIGELDPSIAVGFFCKSEEDFNDWCQQVKKLSRIPGALPMFEL 354

Query: 398 TQTHKKPVNHSDVLG 412
            +      +  DVL 
Sbjct: 355 VEHQPSHFSCPDVLN 369


>gi|291407754|ref|XP_002720229.1| PREDICTED: autophagy-related cysteine endopeptidase 2 [Oryctolagus
           cuniculus]
          Length = 405

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 109/343 (31%), Positives = 174/343 (50%), Gaps = 27/343 (7%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 36  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 84

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 85  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 144

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     V+  S +  G
Sbjct: 145 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSANTPG 204

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
           ER    +     AS      S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 205 ERLHDSLT----ASNQSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 260

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
           LG +GGKP  + Y +G      I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 261 LGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 320

Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 321 LDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 362


>gi|326925776|ref|XP_003209085.1| PREDICTED: cysteine protease ATG4B-like [Meleagris gallopavo]
          Length = 393

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 119/383 (31%), Positives = 177/383 (46%), Gaps = 59/383 (15%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD GWGC
Sbjct: 25  VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR    K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 74  MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133

Query: 190 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 224
            G + G W GP  + +         +W +LA                 CQ   +  G  +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193

Query: 225 LPMA----IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
            P      +Y    +E G R    +                   W P++LL+PL LGL +
Sbjct: 194 CPTVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234

Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
           +N  YI TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +        
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCL 294

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
            D S +       + +  +DPS+A+GF+C  ++DF+D+C +  KL+      P+F + + 
Sbjct: 295 PDESFHCQHPPCRMSIAELDPSIAVGFFCHTEEDFNDWCHQIKKLSLVRGALPMFELVER 354

Query: 401 HKKPVNHSDVLGETGGVPEDDSL 423
                ++ DVL  T    + D L
Sbjct: 355 QPSHFSNPDVLNLTPDSSDADRL 377


>gi|224059752|ref|XP_002193231.1| PREDICTED: cysteine protease ATG4B [Taeniopygia guttata]
          Length = 393

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 122/373 (32%), Positives = 178/373 (47%), Gaps = 39/373 (10%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD GWGC
Sbjct: 25  VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR    K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 74  MLRCGQMIFAQALVCRHLGRDWRWIKGKRQMDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133

Query: 190 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 230
            G + G W GP  + +         +W +LA            E    CQS      A  
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSHVPCAGAAA 193

Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
             + + D    G P    +D         +  A W P++LL+PL LGL ++N  YI TL+
Sbjct: 194 CPALESDVLYNGCP----EDVG-----LRERLALWKPLVLLIPLRLGLTEINEAYIETLK 244

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
             F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +  G      D S +    
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPGDSGCLPDESFHCQHP 304

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDV 410
              + +  +DPS+A+GF+C  + DF+D+C +  KL+      P+F + +      ++ DV
Sbjct: 305 PCRMSIAELDPSIAVGFFCNTEADFNDWCQQIKKLSLVRGALPMFELVERQPSHFSNPDV 364

Query: 411 LGETGGVPEDDSL 423
           L  T    + D L
Sbjct: 365 LNLTPDSSDADRL 377


>gi|47087191|ref|NP_998738.1| cysteine protease ATG4B [Gallus gallus]
 gi|61211779|sp|Q6PZ02.1|ATG4B_CHICK RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related cysteine endopeptidase 2B;
           Short=Autophagin-2B; Short=cAut2B; AltName:
           Full=Autophagy-related protein 4 homolog B
 gi|45861662|gb|AAS78584.1| AUT2B [Gallus gallus]
          Length = 393

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 119/383 (31%), Positives = 177/383 (46%), Gaps = 59/383 (15%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD GWGC
Sbjct: 25  VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR    K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 74  MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVG 133

Query: 190 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 224
            G + G W GP  + +         +W +LA                 CQ   +  G  +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193

Query: 225 LPMA----IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
            P      +Y    +E G R    +                   W P++LL+PL LGL +
Sbjct: 194 CPAVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234

Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
           +N  YI TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +        
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCL 294

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
            D S +       + +  +DPS+A+GF+C  ++DF+D+C +  KL+      P+F + + 
Sbjct: 295 PDESFHCQHPPCRMSIAELDPSIAVGFFCHTEEDFNDWCHQIKKLSLVRGALPMFELVER 354

Query: 401 HKKPVNHSDVLGETGGVPEDDSL 423
                ++ DVL  T    + D L
Sbjct: 355 QPSHFSNPDVLNLTPDSSDADRL 377


>gi|391340875|ref|XP_003744760.1| PREDICTED: cysteine protease ATG4D-like [Metaseiulus occidentalis]
          Length = 488

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 115/353 (32%), Positives = 173/353 (49%), Gaps = 35/353 (9%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           I+LLG  +    + A            F  DFS+R+  +YR+ F P+  +  TSD GWGC
Sbjct: 129 IYLLGHVYHNKNNSA--------SFKNFFADFSTRLWFTYRQDFQPMQSTGHTSDSGWGC 180

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV--EILHLFG---DSETSPFSIHNLL 184
           MLRS+QM++A+A +FH LGR WR   Q+      V  +I+  F    D+  +PFS+HN++
Sbjct: 181 MLRSAQMMLAEAFIFHLLGRQWRWCPQQQQQEHGVHRKIIKWFSDDPDTTEAPFSVHNMV 240

Query: 185 QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS---LPMAIYVVSGDEDGERG 241
           +A    G  AG W GP         L RC     G+         MAIYV          
Sbjct: 241 RAAAHCGKKAGDWFGPSTAAY---LLKRCLEEAAGVADSKEIFEQMAIYVAQD------- 290

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
               +   D    C+  S    +W  ++LL+P+ LG E+VN  YI  ++    +   LGI
Sbjct: 291 --CTIYTQDVLDLCT--SDPNIEWKSVVLLIPVRLGGERVNVNYIHCIKEILAYQNCLGI 346

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y VG Q +  +YLDPH +Q   +  +  L    +++H    R +    +DP
Sbjct: 347 IGGKPRHSLYFVGFQGKKLVYLDPHYLQKTTDTSR--LNFSVNSFHCTTARKVSFSKLDP 404

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAE---ESNGAPLFTVTQTHKKPVNHSDVL 411
           S  IGFYC+ + DF+ F +    + E   ++ G P+F +++     VN  + L
Sbjct: 405 SATIGFYCKTRRDFESFQSIMQSVTESCPQNQGYPVFIISEGSSALVNQLNPL 457


>gi|296470926|tpg|DAA13041.1| TPA: cysteine protease ATG4A [Bos taurus]
          Length = 396

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 111/344 (32%), Positives = 175/344 (50%), Gaps = 29/344 (8%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 238
            G + G W GP          A+   W +LA     +  +  + +      +S   D   
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
           ER    +     AS      S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 357
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  +++  AD  T+H     + +++ 
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFHCLQPPQRMNIL 312

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 313 NLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|349605276|gb|AEQ00569.1| Cysteine protease ATG4A-like protein, partial [Equus caballus]
          Length = 369

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 113/345 (32%), Positives = 182/345 (52%), Gaps = 33/345 (9%)

Query: 71  WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
           W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGCM
Sbjct: 1   WILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 49

Query: 131 LRSSQMLVAQALLFHRLGR--PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
           LR  QM++AQAL+   LGR   W K  ++P  +EY  IL  F D +   +SIH + Q G 
Sbjct: 50  LRCGQMMLAQALICRHLGRDLNWEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGV 107

Query: 189 AYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDED 237
             G + G W GP          A+   W +LA     +  +  + +     I  +S D  
Sbjct: 108 GEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTA 167

Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
           GE   +P   ++ ++R  S  S G   W P+LL+VPL LG+ ++NP Y+   +  F  PQ
Sbjct: 168 GE---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQ 223

Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHL 356
           SLG +GGKP  + Y +G   +  I+LDPH  Q  ++  +++   D  T+H     + +++
Sbjct: 224 SLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSPQRMNI 282

Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
            ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 283 LNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 326


>gi|354500801|ref|XP_003512485.1| PREDICTED: cysteine protease ATG4A-like [Cricetulus griseus]
 gi|344251116|gb|EGW07220.1| Cysteine protease ATG4A [Cricetulus griseus]
          Length = 398

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 175/356 (49%), Gaps = 53/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLRTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKG--QAD----------------------WTPILLLVPLVLGLEKVNPRY 285
           D  + C V   G   AD                      W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCCVLPVGAHTADESPPDSLPASSQGKGPSATCPAWKPLLLIVPLRLGINQINPVY 240

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           I   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  +  +  D + 
Sbjct: 241 IEAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEESGIVDDETF 300

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +     + + + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 HCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|345807894|ref|XP_538136.3| PREDICTED: cysteine protease ATG4A [Canis lupus familiaris]
          Length = 398

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 109/343 (31%), Positives = 179/343 (52%), Gaps = 27/343 (7%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D  +R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDIRARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   REY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPREYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAIYVSMDNTVVIEDIKKMCCVLPLSADTIG 197

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
           E   +P+  ++ +++  S  +   A W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 E---SPLNTLNASNQSKSAPASCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 313

Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|47564112|ref|NP_001001171.1| cysteine protease ATG4A [Bos taurus]
 gi|61211781|sp|Q6PZ05.1|ATG4A_BOVIN RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related cysteine endopeptidase 2A;
           Short=Autophagin-2A; AltName: Full=Autophagy-related
           protein 4 homolog A; AltName: Full=bAut2A
 gi|45861656|gb|AAS78581.1| Aut2a [Bos taurus]
          Length = 398

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 111/344 (32%), Positives = 175/344 (50%), Gaps = 29/344 (8%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 238
            G + G W GP          A+   W +LA     +  +  + +      +S   D   
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
           ER    +     AS      S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 357
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  +++  AD  T+H     + +++ 
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFHCLQPPQRMNIL 312

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 313 NLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|351713264|gb|EHB16183.1| Cysteine protease ATG4B [Heterocephalus glaber]
          Length = 475

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 120/368 (32%), Positives = 180/368 (48%), Gaps = 40/368 (10%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 100 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 146

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 147 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQI 206

Query: 184 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
            Q G   G + G W GP          A+  +W +LA        +    +   I  +  
Sbjct: 207 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLA----VHVAMDNTVVMEEIRRLCR 262

Query: 235 DEDGERGGAPVVCIDDASRHCSVF----------SKGQADWTPILLLVPLVLGLEKVNPR 284
                 G A +    DA RHC+ F          S   + W P++LL+PL LGL  +N  
Sbjct: 263 SSLPCSGAAALPA--DADRHCNGFPAPMEVTSRPSPSPSPWRPLVLLIPLRLGLTDINEA 320

Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
           Y+ TL+  F  PQSLG++GGKP ++ Y +G   +  IYLDPH  QP + +       D +
Sbjct: 321 YVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGKELIYLDPHTTQPAVELTDGCFIPDET 380

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
            +       + +  +DPS+A+GF+C+ +DDF D+C +  KL+ +    P+F + +     
Sbjct: 381 FHCQHPPCRMGIGELDPSIAVGFFCKTEDDFRDWCQQVRKLSLQGGALPMFELVEQQPSH 440

Query: 405 VNHSDVLG 412
           +   DVL 
Sbjct: 441 LACPDVLN 448


>gi|440790872|gb|ELR12135.1| autophagy protein 4, putative [Acanthamoeba castellanii str. Neff]
          Length = 510

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 115/346 (33%), Positives = 169/346 (48%), Gaps = 62/346 (17%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
           F  DF SR+ ++YR  F  IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR   +
Sbjct: 115 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 174

Query: 157 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR- 212
           +     Y E+L  F D  S  SP+SIH + + G + +    G W  P  +  +   L   
Sbjct: 175 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLLVTE 233

Query: 213 ---------------CQRAETGLGC---------QSLPMAIYVV---------------S 233
                            R E    C         Q  P+ +                  S
Sbjct: 234 HSPNGLKMYVPKDGIIYRKEVYQLCAVQPADGPAQHSPLRVDDDGGDTDHDGDTDGLESS 293

Query: 234 GDEDGERGGAP-----VVCIDDASRHCSVFSKGQAD------------WTPILLLVPLVL 276
            D      G P     +   D +S H  + S  +++            W P+++LVP+ L
Sbjct: 294 TDSMRHSHGNPGVPSTIEAGDYSSSHAELMSSAESECESLDDNFTELTWHPVIILVPVRL 353

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G++ +NP YIPTL+  F+FPQ LG++GGKP +S Y VG Q+   +Y+DPH VQP + +  
Sbjct: 354 GIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMDPHFVQPTVKMDD 413

Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
           D L     +Y  ++ + +  D IDPSLA+GF C  + +FDDFC  A
Sbjct: 414 DPL-FPIESYRMEIPQAMSFDDIDPSLALGFLCSSQAEFDDFCLNA 458


>gi|332375955|gb|AEE63118.1| unknown [Dendroctonus ponderosae]
          Length = 370

 Score =  191 bits (485), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 123/347 (35%), Positives = 168/347 (48%), Gaps = 44/347 (12%)

Query: 63  ISSSTSDIWLLGV-CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK- 120
           I  ST  +WLLG   H I            N L    QD  S++  +YRK F PIG S  
Sbjct: 26  IPQSTEPVWLLGKKYHAI------------NELNTIRQDIVSKLWFTYRKDFVPIGGSDG 73

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFS 179
            TSD GWGCMLR  QM++ QAL+   LGR W+  P  +  D  Y+ IL  F DS  +PFS
Sbjct: 74  KTSDKGWGCMLRCGQMVLGQALMSIHLGRDWQWNPTTR--DATYLSILKKFEDSRKAPFS 131

Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
           IH +   G + G   G W GP  + +  + L +              +AI+V   +    
Sbjct: 132 IHQIASMGISEGKEVGQWFGPNTVAQVLKKLVKFDEGND--------VAIHVALDN---- 179

Query: 240 RGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
                VV I +    C   SK  AD     W P+LL+VPL LGL ++N  Y+  L+  F 
Sbjct: 180 -----VVIISEIRDLC--LSKETADVSTPHWKPLLLIVPLRLGLTQMNSIYLGGLKQCFQ 232

Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVI 351
           F QSLGI+GGKP ++ Y +G      IY DPH  Q   ++G  D   +     +YH    
Sbjct: 233 FKQSLGIIGGKPNSALYFIGYVGNEVIYFDPHTTQKAGSVGNKDTSEEKDVDLSYHCKHA 292

Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 398
             + +  +DPS+A+ F CR + DF+D C        ++   PLF V+
Sbjct: 293 SRMSMLGMDPSVAVCFLCRSEADFNDLCQNIKDQLIKTESQPLFEVS 339


>gi|91083193|ref|XP_972923.1| PREDICTED: similar to Autophagy-specific protein, putative
           [Tribolium castaneum]
 gi|270006970|gb|EFA03418.1| hypothetical protein TcasGA2_TC013405 [Tribolium castaneum]
          Length = 366

 Score =  191 bits (485), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 117/321 (36%), Positives = 166/321 (51%), Gaps = 26/321 (8%)

Query: 92  NGLAEFN---QDFSSRILISYRKGFDPIG-DSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
           N L E +   QD  S+I  +YRK F PIG D  +T+D GWGCMLR  QM++AQAL+   L
Sbjct: 33  NALQELDTIRQDILSKIWFTYRKNFVPIGGDEGLTTDKGWGCMLRCGQMVLAQALVTLHL 92

Query: 148 GRPW-RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
           GR W  +P  K  D  Y++IL  F D   +PFSIH +   G +     G W GP  + + 
Sbjct: 93  GRDWVWEPETK--DSTYLKILSKFVDKRQAPFSIHQIAMMGVSENKEVGQWFGPNTVAQV 150

Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
            + L +           +L   + +    E         +C+   S  CS       DW 
Sbjct: 151 LKKLVKYDEWSAIEMHIALDNTLIISDIRE---------LCLSQGSDGCS-----SGDWK 196

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
           P+LL+VPL LGL+++NP Y   L+  F F QSLG++GGKP  + Y +G   +  IYLDPH
Sbjct: 197 PLLLIVPLRLGLQEINPIYASGLKKCFQFKQSLGVIGGKPNLALYFIGHVGDEVIYLDPH 256

Query: 327 DVQP---VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 383
             Q    V +   ++     STYH      I++ S+DPS+A+ F+C  + +F+D C    
Sbjct: 257 TTQKSGSVESKETEEEIELDSTYHCKYASRINILSMDPSVAVCFFCNTEGEFNDLCHSIK 316

Query: 384 KLAEESNGAPLFTVTQTHKKP 404
           K   E    PLF +  T++KP
Sbjct: 317 KDLIEPEKQPLFEI--TYEKP 335


>gi|281342750|gb|EFB18334.1| hypothetical protein PANDA_015152 [Ailuropoda melanoleuca]
          Length = 373

 Score =  191 bits (484), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 174/356 (48%), Gaps = 53/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178

Query: 250 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 285
           D  + C V                     SKG       W P+LL+VPL LG+ ++NP Y
Sbjct: 179 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 238

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           +   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + 
Sbjct: 239 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTF 298

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +     + + + ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 299 HCLQSPQRMSILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 353


>gi|301780424|ref|XP_002925628.1| PREDICTED: cysteine protease ATG4A-like [Ailuropoda melanoleuca]
          Length = 429

 Score =  191 bits (484), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 174/356 (48%), Gaps = 53/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 60  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 108

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 109 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 168

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 169 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 211

Query: 250 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 285
           D  + C V                     SKG       W P+LL+VPL LG+ ++NP Y
Sbjct: 212 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 271

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           +   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + 
Sbjct: 272 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTF 331

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +     + + + ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 332 HCLQSPQRMSILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 386


>gi|151554833|gb|AAI47963.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Bos taurus]
          Length = 398

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 108/343 (31%), Positives = 172/343 (50%), Gaps = 27/343 (7%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 238
            G + G W GP          A+   W +LA     +  +  + +      +S   D   
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
           ER    +     AS      S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILN 313

Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|426257739|ref|XP_004022480.1| PREDICTED: cysteine protease ATG4A [Ovis aries]
          Length = 398

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 176/356 (49%), Gaps = 53/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHC--------------------SVFSKGQAD----WTPILLLVPLVLGLEKVNPRY 285
           D  + C                    S  SKG +     W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRTLSLSADTPAERPLESLTASTQSKGPSACCTAWKPLLLIVPLRLGINQINPVY 240

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           +   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + 
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTF 300

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +     + +++ ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 301 HCLQPPQRMNILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|187282046|ref|NP_001119770.1| uncharacterized protein LOC678769 [Rattus norvegicus]
 gi|169642267|gb|AAI60890.1| LOC678769 protein [Rattus norvegicus]
          Length = 406

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 112/364 (30%), Positives = 175/364 (48%), Gaps = 61/364 (16%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 284
           D  + C V   G AD                         W P+LL+VPL LG+ ++NP 
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240

Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
           YI   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  +  L  D +
Sbjct: 241 YIEAFKECFKMPQSLGALGGKPNNAYYFIGSLGDELIFLDPHTTQTFVDTEESGLVDDHT 300

Query: 345 TYHSDVIRHIHLDSIDPSLAI-------GFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
            +     + + + ++DPS+A+       GF+C+++ DFD++C+   K   + N   +F +
Sbjct: 301 FHCLQSPQRMSILNLDPSVALVGQGAFMGFFCKEEKDFDNWCSLVQKEILKEN-LRMFEL 359

Query: 398 TQTH 401
            Q H
Sbjct: 360 VQKH 363


>gi|225709006|gb|ACO10349.1| Cysteine protease ATG4B [Caligus rogercresseyi]
          Length = 381

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 111/345 (32%), Positives = 176/345 (51%), Gaps = 44/345 (12%)

Query: 66  STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
           S S +W+LG            + +  + + E N +  SR L +YRK F  I DS  TSD 
Sbjct: 28  SDSPVWILG-----------NELSARDDVEELNSEVLSRFLFTYRKEFLEIEDSGYTSDS 76

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD----REYVEILHLFGDSETSPFSIH 181
           GWGCMLR  QM++A+AL    LGR W+   Q+  D    ++Y++IL LF DS+ +P+S+H
Sbjct: 77  GWGCMLRCGQMVLAEALQRVSLGREWKWSSQETLDNDQSQKYLQILKLFQDSKAAPYSLH 136

Query: 182 NLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
            +   G++       G+W GP  +    + L +   +ET     + P+ ++V   +    
Sbjct: 137 QIALMGESIQSKKPVGTWFGPNTIA---QVLRKLSVSET-----TNPIRVHVAMDN---- 184

Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
                 V +D+    C  F    +   P+LL +PL LGL ++NP Y   L+  F FPQ L
Sbjct: 185 -----TVIVDEIKESCG-FIGDPSQGKPLLLFIPLRLGLTEINPIYFQDLKECFEFPQIL 238

Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPH-----DVQPVINIGKDDLEADTSTYHSDVIRHI 354
           G++GG+P  + Y +G  +   IYLDPH         V+ +G     ++  TYH+D    +
Sbjct: 239 GVIGGRPNHALYFIGYMDNELIYLDPHVATQTSTPQVVTLGG----SEDKTYHTDRAYRM 294

Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
               +DPSL++ F C+D+ +F+D C R        + +PLF + +
Sbjct: 295 DFKDLDPSLSLCFLCKDESEFEDMCERFLFKLIRGHNSPLFEICR 339


>gi|345329187|ref|XP_003431344.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4A-like
           [Ornithorhynchus anatinus]
          Length = 436

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 172/359 (47%), Gaps = 54/359 (15%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 68  VWILGRQHHLKAEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 116

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W     K    EY +IL  F D +   +SIH + Q G  
Sbjct: 117 MLRCGQMMLAQALICRHLGRDWCWEKHKKQPEEYHKILQCFLDRKDCCYSIHQMAQMGVG 176

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 177 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 219

Query: 250 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 285
           D  + C +  +G                         A W P+LL+VPL LG+  +NP Y
Sbjct: 220 DIKKMCRLLPQGSGMAQDGPPLHLSALGRSKNASGYCAIWKPLLLIVPLRLGINHINPIY 279

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 280 IDAFKECFKTPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQTFVDTEENGQVDDHSF 339

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
           +     + + + ++DPS+A+GF+C+++ DFD++C+   K         +F + Q  K+P
Sbjct: 340 HCQQAPQRMKIMNLDPSVALGFFCKEEKDFDNWCSLVQKEILRQQSLRMFELVQ--KRP 396


>gi|198438023|ref|XP_002129793.1| PREDICTED: similar to CG6194 CG6194-PA [Ciona intestinalis]
          Length = 517

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 113/346 (32%), Positives = 169/346 (48%), Gaps = 39/346 (11%)

Query: 68  SDIWLLGVCHKIAQDEALGDAAGN----------------NGLAEFNQDFSSRILISYRK 111
           S +WLLG C+ + +     D + N                  L  F  DF S++  +YRK
Sbjct: 67  SPLWLLGKCYHLKKPSLSSDTSENAEGSQQSTSESYNMLPKHLKLFLVDFHSKLWFTYRK 126

Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYV--EILH 168
           GF  + D+ +TSD GWGCMLR++QM++AQ+ + H LGR WR  P +   ++  +   I+ 
Sbjct: 127 GFPTLNDTNLTSDTGWGCMLRTAQMMIAQSFIVHLLGRNWRWTPSRLSMEQSDIHRNIIT 186

Query: 169 LFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
            F D +    PFS+H L + G +Y    G+W GP       +    C + +T L    L 
Sbjct: 187 WFLDEQNIRCPFSLHQLTEIGLSYRCKPGNWYGPNTAAYIMQDALECAKGKTEL----LN 242

Query: 227 MAIYVVSGDEDGERGGAPVVC-----IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
             +  ++ D          +C       DA    S  S  ++    +++L+P+ LG   +
Sbjct: 243 NIMVYIAQDSTVYIDDVIEMCEWKNTASDADLKTSTTSSNRS----VIVLIPVRLGEATL 298

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG--KDDL 339
           NP YIP ++   T  QS+GI+GGKP  S Y +G Q+E   YLDPH  Q   +    K+DL
Sbjct: 299 NPIYIPCIQSMLTLDQSVGIMGGKPKHSLYFIGFQDEYLFYLDPHYCQQADHPAAFKNDL 358

Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
                 YH +  R  ++  +DPS  +GFYCRD  DF  F   A+K 
Sbjct: 359 ---LQNYHCNSPRKTNISKMDPSCCLGFYCRDYKDFQSFVCEANKF 401


>gi|195995623|ref|XP_002107680.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
 gi|190588456|gb|EDV28478.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
          Length = 385

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 117/352 (33%), Positives = 167/352 (47%), Gaps = 54/352 (15%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVG 126
           +WLLG C+              N L EF++   D +S+   +YRK + PIG    TSD G
Sbjct: 25  VWLLGCCY--------------NPLEEFDKLIADINSKFWFTYRKNYPPIGGIGPTSDKG 70

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCMLR  QM++ QAL+   LGR WR    K     Y +IL LF DS+ S +SIH + Q 
Sbjct: 71  WGCMLRCGQMILGQALVMRHLGRDWRWFKNKEQLANYWKILKLFLDSKDSLYSIHQIAQM 130

Query: 187 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
           G + G     W GP    +  + L                M +YV   +         +V
Sbjct: 131 GVSEGKKISQWFGPNTAAQVLKKLIMFDEWSQ--------MGVYVAMDN---------IV 173

Query: 247 CIDDASR----HCSVFSKGQA--------------DWTPILLLVPLVLGLEKVNPRYIPT 288
            IDD  +    H +  S+G A               W P+LL +PL LGL  +NP Y   
Sbjct: 174 VIDDIKKICHNHITRTSQGNAANSDAQGSSNEQSNAWKPLLLFIPLRLGLTDLNPIYKDK 233

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           L   F    +LGI+GGKP ++ Y +G+Q +  +YLDPH VQ  + + K +      TYH 
Sbjct: 234 LNKCFRIKNTLGIIGGKPNSAHYFIGIQGDYLLYLDPHTVQETVKV-KPNCPFSDKTYHQ 292

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA-EESNGAPLFTVTQ 399
                +H   +DPS+A+GFY   +++F++ C   + +    S   PLF V +
Sbjct: 293 KGTNRLHFSYMDPSVALGFYSATEEEFNELCRDFTDVCILNSAQPPLFEVVE 344


>gi|357620505|gb|EHJ72670.1| putative Autophagy-specific protein [Danaus plexippus]
          Length = 383

 Score =  187 bits (476), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 116/350 (33%), Positives = 177/350 (50%), Gaps = 34/350 (9%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I  +  ++W+LG  +   QD           L    +D +S I  +YRKGF PIGD  +T
Sbjct: 22  IPETKDNVWVLGKKYSAIQD-----------LERIRRDITSVIWCTYRKGFVPIGDEGLT 70

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 180
           SD GWGCMLR  QM++  AL+   L   W   +  P  R+  Y++I+    + + +P+SI
Sbjct: 71  SDKGWGCMLRCGQMVLGVALIKVHLSADW---VWTPETRDPTYLKIVQRLEERKQAPYSI 127

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           H +   G   G   G W GP  + +  + L    +  +        + I+V   +   + 
Sbjct: 128 HQVALMGACEGKEVGQWFGPNTVAQVLKKLVVYDKWSS--------LVIHVALDNTVVKE 179

Query: 241 GGAPVVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
                  +++    CS    G   +DW P+LL+VPL LGL ++NP Y+  L++ F  PQS
Sbjct: 180 DILQQCIVNNDRGDCSENVDGFVVSDWMPLLLIVPLRLGLSEINPIYMEGLKICFQSPQS 239

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQP---VINIGKDDLEADTSTYHSDVIRHIH 355
           +G++GGKP  + Y++G   +  IYLDPH  Q    V N   D+ +    TYH      I 
Sbjct: 240 IGVIGGKPNQALYLIGCVGDEVIYLDPHTTQKSGLVENKLTDEQKEMDCTYHCKYASRIP 299

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQTHKKP 404
           + S+DPS+A+ F CR + DFD+ C    K L +ES   PLF + +  K+P
Sbjct: 300 ILSMDPSVAVCFLCRTRSDFDELCELIEKRLMQESQ--PLFEICE--KRP 345


>gi|426339171|ref|XP_004033533.1| PREDICTED: cysteine protease ATG4B isoform 3 [Gorilla gorilla
           gorilla]
          Length = 379

 Score =  187 bits (476), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 109/323 (33%), Positives = 163/323 (50%), Gaps = 25/323 (7%)

Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
           L S+R+  +  G +  TSD GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  
Sbjct: 37  LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96

Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
           +L+ F D + S +SIH + Q G   G + G W GP  + +  + LA      +       
Sbjct: 97  VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149

Query: 226 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 268
            +A+++     V  +E        V C        D+ RHC+ F  G       + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208

Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           +LL+PL LGL  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268

Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           QP +         D S +       + +  +DPS+A+GF+C+ +DDF+D+C +  KL+  
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLL 328

Query: 389 SNGAPLFTVTQTHKKPVNHSDVL 411
               P+F + +     +   DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351


>gi|325184648|emb|CCA19140.1| cysteine protease family C54 putative [Albugo laibachii Nc14]
          Length = 459

 Score =  187 bits (476), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 125/370 (33%), Positives = 185/370 (50%), Gaps = 48/370 (12%)

Query: 64  SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           S ++S +WLLG C+   QD    D+  +     ++  F S +  +YR+ F+ +     TS
Sbjct: 68  SQNSSKLWLLGDCYS-PQDFDNFDSMKD----AYHDAFESILWYTYRRDFETMVPYDFTS 122

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----LQKPFDREYVEILHLFGDSETS-- 176
           D GWGCMLRS+QML+++A   + LG  W+ P     L+ P  + YV++L  F DS  +  
Sbjct: 123 DAGWGCMLRSAQMLLSEAFKRNMLGIKWKIPARSEDLELP--KVYVKLLKWFVDSFDTEC 180

Query: 177 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
            +SIHN+ + G  Y    G W GP          A+  R    L  Q  P    V+   +
Sbjct: 181 KYSIHNITRIGMQYDKLPGEWYGP-------TTAAQALRDLVNLHAQESPECNLVMYVPQ 233

Query: 237 DGERGGAPV--VCI---DDASRHCSVFSKGQADWT---------------------PILL 270
           DG      V  +CI   D  +   +V  + Q+D T                      +L+
Sbjct: 234 DGVVYTKDVNELCISHLDQENTFVNVNEETQSDGTFPDPLLHPPTDRDNSEKMWQKSLLI 293

Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
           L+PL LGL+ +NPRY+P ++  F FPQ++GI+GGK G S Y VG  +     LDPHD+ P
Sbjct: 294 LIPLRLGLDSINPRYLPAIQRVFEFPQNVGIIGGKKGHSVYFVGTFDSKLQLLDPHDIHP 353

Query: 331 VINIGKDDLEAD-TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES 389
             ++      A    T HS +   + L SIDPSLA+GFYC D+ D+ DF  R  ++  E 
Sbjct: 354 TADLNTAFPTATHLRTVHSRLPLEMSLGSIDPSLALGFYCSDRKDYLDFVDRVDRVQSEL 413

Query: 390 NGAPLFTVTQ 399
            GA  F++ +
Sbjct: 414 GGALPFSIAK 423


>gi|397483837|ref|XP_003813097.1| PREDICTED: cysteine protease ATG4B isoform 4 [Pan paniscus]
          Length = 379

 Score =  187 bits (476), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 109/323 (33%), Positives = 163/323 (50%), Gaps = 25/323 (7%)

Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
           L S+R+  +  G +  TSD GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  
Sbjct: 37  LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96

Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
           +L+ F D + S +SIH + Q G   G + G W GP  + +  + LA      +       
Sbjct: 97  VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149

Query: 226 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 268
            +A+++     V  +E        V C        D+ RHC+ F  G       + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPL 208

Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           +LL+PL LGL  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268

Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           QP +         D S +       + +  +DPS+A+GF+C+ +DDF+D+C +  KL+  
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLL 328

Query: 389 SNGAPLFTVTQTHKKPVNHSDVL 411
               P+F + +     +   DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351


>gi|339249735|ref|XP_003373855.1| cysteine protease ATG4B [Trichinella spiralis]
 gi|316969943|gb|EFV53966.1| cysteine protease ATG4B [Trichinella spiralis]
          Length = 410

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 123/381 (32%), Positives = 179/381 (46%), Gaps = 64/381 (16%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           +  S  ++W++G   ++ Q +   D           ++  SR+  +YRK F PIG +   
Sbjct: 28  LFKSGGEVWIVG---RVWQTQDFDD---------IKKEIRSRMWFTYRKSFSPIGGTGPI 75

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIH 181
           SD GWGCMLR  QML+AQAL+   LGR W+  P  +  D  YV IL +F D +   +SIH
Sbjct: 76  SDSGWGCMLRCGQMLLAQALICRHLGREWQWSPSCR--DEAYVRILRMFQDKKNELYSIH 133

Query: 182 NLLQAGKAYGLAAGSWVGP---------YAMCRSWEALA----------------RCQR- 215
            + + G++ G   G W GP          A+   W +LA                 C R 
Sbjct: 134 MIAKMGESEGKEIGKWFGPSTIAHVIKKLAIYDDWSSLAVHVAMDNVIVQEDVKKLCSRE 193

Query: 216 ---AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
              A      Q  P  I V    ED  +    V C + +S            W P+LL++
Sbjct: 194 VFDALRKRLLQEEPSEI-VADWFEDARKDNKKVDCANLSS-----------PWKPLLLIL 241

Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
           P+ LGL ++NP YIP L+  F    ++G++GGKP  + Y +G  ++  +YLDPH  Q  +
Sbjct: 242 PMRLGLSELNPCYIPALKEFFACKYNIGMIGGKPNHALYFIGAYKDRLVYLDPHWCQTFV 301

Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN-- 390
           ++       D S+YHS  I  I  + IDPSLAI FY   + +FDDFC  A ++    N  
Sbjct: 302 DLDVSMDLFDDSSYHSAFILDISFNEIDPSLAIAFYINTEAEFDDFCTFAKQVCLVGNFR 361

Query: 391 ------GAPLFTVTQTHKKPV 405
                    LF V Q +  P+
Sbjct: 362 CFSSGSMVQLFQVLQKYPNPL 382


>gi|340369400|ref|XP_003383236.1| PREDICTED: cysteine protease ATG4A-like [Amphimedon queenslandica]
          Length = 394

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 108/312 (34%), Positives = 159/312 (50%), Gaps = 40/312 (12%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           ++LLGV + + +D A            F +D  SR   +YRK F PIGD+  TSD GWGC
Sbjct: 45  VYLLGVKYDLPRDGA-----------SFVEDLQSRFWFTYRKNFRPIGDTGYTSDSGWGC 93

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
            LR  QML+   LL   LGR WR       D +Y +IL +F D   S +SI  +   G  
Sbjct: 94  TLRCGQMLLGHTLLLRHLGRDWRWSPSSSNDYKYQKILRMFLDYRDSEYSIQMIALQGAD 153

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
           +G + G W GP  + ++ + LA        +  Q   +A+YV             +V ID
Sbjct: 154 FGRSVGQWFGPNNVAQAIKRLA--------VHDQWSEVAVYVAMD---------MLVVID 196

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D S           ++ P+L+ +PL LG E+ N  Y   ++  F   QS+GI+GGKP  +
Sbjct: 197 DIS-----------NFRPVLVFIPLRLGQERFNMEYKEAVKACFAVRQSVGIIGGKPRHA 245

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
            +  G  ++  IYLDPH  Q  + +    + +D STYH+  I  +H+  +DPSLA+GF+C
Sbjct: 246 LWFTGYHDDYLIYLDPHKTQSCVTLPDAGIVSD-STYHTTQIERLHISELDPSLALGFFC 304

Query: 370 RDKDDFDDFCAR 381
           + + D DD C +
Sbjct: 305 QTEADLDDLCDK 316


>gi|297669945|ref|XP_002813144.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pongo abelii]
          Length = 378

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 109/323 (33%), Positives = 162/323 (50%), Gaps = 25/323 (7%)

Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
           L S+R+  +  G +  TSD GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  
Sbjct: 36  LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 95

Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
           +L+ F D + S +SIH + Q G   G + G W GP  + +  + LA      +       
Sbjct: 96  VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 148

Query: 226 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 268
            +A+++     V  +E        V C        D+ RHC+ F  G       + W P+
Sbjct: 149 -LAVHIAMDNTVVMEEIRRLCRNSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 207

Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           +LL+PL LGL  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  
Sbjct: 208 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 267

Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           QP +         D S +       + +  +DPS+A+GF+C+ +DDF D+C +  KL+  
Sbjct: 268 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLL 327

Query: 389 SNGAPLFTVTQTHKKPVNHSDVL 411
               P+F + +     +   DVL
Sbjct: 328 GGALPMFELVEQQPSHLACPDVL 350


>gi|241999098|ref|XP_002434192.1| cystein protease, putative [Ixodes scapularis]
 gi|215495951|gb|EEC05592.1| cystein protease, putative [Ixodes scapularis]
          Length = 382

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 110/326 (33%), Positives = 164/326 (50%), Gaps = 41/326 (12%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
           L +   D +S+I ++YRK F  IG +  TSD GWGCMLR  QM++AQAL+   LGR WR 
Sbjct: 35  LDDLRSDVTSKIWLTYRKNFPAIGGTGPTSDSGWGCMLRCGQMVLAQALMRRHLGREWRW 94

Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
           +P  K  +++Y+ IL +F D +   FSIH + Q G + G   G W GP  +      LA 
Sbjct: 95  EPGTK--NKDYLYILRMFQDKKNCTFSIHQIAQMGVSEGKTVGEWFGPNTVAHVLRKLAI 152

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR-HCSVFS------------ 259
             +  +        +AI+V   +          V I++ S+  C +++            
Sbjct: 153 FDKWSS--------LAIHVAMDN---------TVIINEISKFRCHIWAAADGLVRNRTNS 195

Query: 260 ------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
                   +  W P+LL +PL LGL ++N  Y   L+ TF   QSLG++GGKP  + Y +
Sbjct: 196 EPSRPANSEGSWKPLLLFIPLRLGLSEINRIYAFGLKRTFALKQSLGMIGGKPNHALYFI 255

Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
           GV E+  I+LDPH  Q   ++  D    D  +YH      +++  +DPS+A+ FY   + 
Sbjct: 256 GVVEDELIFLDPHTTQLACDLDVD--SPDDQSYHCAHASRMNISELDPSVALCFYMATES 313

Query: 374 DFDDFCARASKLAEESNGAPLFTVTQ 399
           DFD +C    K        PLF +TQ
Sbjct: 314 DFDVWCNLVQKHLISRMQQPLFEITQ 339


>gi|213626921|gb|AAI70397.1| APG4A protein [Xenopus laevis]
          Length = 395

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 160/321 (49%), Gaps = 23/321 (7%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
            +   D  SR+  +YRK F PIG +  +SD GWGCMLR  QM++AQAL+   LGR W+  
Sbjct: 43  CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALICQHLGRDWQWE 102

Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
             K    EY +IL  F D +   +SIH + Q G   G + G W GP  + +  + LA   
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162

Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 262
              +        +A+Y VS D          +C        +  A+ H   +S+ +    
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213

Query: 263 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
             + W P+LL+VPL LG+  +NP Y+   +  F  PQSLG +GGKP  + Y +G   +  
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
           IYLDPH  Q  ++  +     D + +       + +  +DPS+A+GF+C+D+++F+++C 
Sbjct: 274 IYLDPHTTQTFVDTEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 333

Query: 381 RASKLAEESNGAPLFTVTQTH 401
              K   +     +F +   H
Sbjct: 334 VIEKEILKHQSLRMFELIPKH 354


>gi|383860522|ref|XP_003705738.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like
           [Megachile rotundata]
          Length = 518

 Score =  185 bits (470), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 119/387 (30%), Positives = 178/387 (45%), Gaps = 58/387 (14%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
           S  S +WLLG  ++   +E L  A+                      + + EF +DF+SR
Sbjct: 126 SKESPVWLLGKIYRKKPEEFLEKASEAEKTLDTGSEISLAMDAISFEDSIEEFKKDFTSR 185

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
           + ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR WR    +P   E  
Sbjct: 186 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWQPDQPIKTEQQ 245

Query: 165 E--------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
           +        I+  FGD     SPFSIH L+  G  +G  AG W GP        ++A   
Sbjct: 246 KLDESNHRFIIQSFGDLPERISPFSIHTLVSLGALWGKRAGDWYGP-------SSVAHLL 298

Query: 215 RAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
                   + LP    +A+YV              V + D    C +       W  ++L
Sbjct: 299 SQAVEHAAEHLPIFSNLAVYVAQD---------CAVYLQDVESVCQM---PDGKWKSLIL 346

Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
            VPL LG +K+NP Y   L    T    +G++GG+P  S Y +G QE+  I LDPH  Q 
Sbjct: 347 FVPLRLGTDKLNPVYTSCLTHLLTLDTCIGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE 406

Query: 331 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL---AE 387
            +++ KD+     +++H    R + +  +DPS  +GFY  DK+ F +F   A       +
Sbjct: 407 TVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHDKNQFTNFMEIAPSYLVPED 464

Query: 388 ESNGAPLFTVTQTHKKPVNHSDVLGET 414
           E    P+F   +   K ++    + ET
Sbjct: 465 EKVDYPMFLFCEGSGKDLHQQIEIAET 491


>gi|348666332|gb|EGZ06159.1| hypothetical protein PHYSODRAFT_532364 [Phytophthora sojae]
          Length = 398

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 115/309 (37%), Positives = 161/309 (52%), Gaps = 31/309 (10%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 154
           + + F + +  +YR+ F  +     TSD GWGCMLRS+QML+ QAL    LGR WR P  
Sbjct: 41  YKRSFEAILWFTYRRDFPQMTPYDFTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPAL 100

Query: 155 ----LQKPFDREYVEILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
               +      +YV +L  F DS      +SIH++++ G  Y    G W GP    +   
Sbjct: 101 FEAEIDARLPDKYVTLLRWFADSPDIECRYSIHHMVKLGMQYDKLPGEWYGPTTAAQVLR 160

Query: 209 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV-------FSKG 261
            L    R E G       +A+YV    ++G      VV  DD +R C          ++ 
Sbjct: 161 DLVNLHRREFGG-----ELAMYV---PQEG------VVYTDDVTRLCFFDPLLHPPTAED 206

Query: 262 QADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
            +DW T +L+L+PL LGL++VN RY+P L  TF FPQS+GI+GGK G S Y VG Q++  
Sbjct: 207 SSDWSTALLILIPLRLGLDQVNERYVPALEKTFAFPQSVGIIGGKKGHSVYFVGTQQDQL 266

Query: 321 IYLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
             LDPHDV P   +      A    T HS     +++  IDPSLA+GF C ++ D++DF 
Sbjct: 267 HLLDPHDVHPAPELNPAFPTATHLRTVHSSRPLVMNVTGIDPSLALGFLCDNRADYEDFE 326

Query: 380 ARASKLAEE 388
            R   L +E
Sbjct: 327 RRVRILHDE 335


>gi|50417810|gb|AAH78135.1| APG4A protein, partial [Xenopus laevis]
          Length = 392

 Score =  185 bits (469), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 159/321 (49%), Gaps = 23/321 (7%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
            +   D  SR+  +YRK F PIG +  +SD GWGCMLR  QM++AQAL+   LGR W+  
Sbjct: 40  CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 99

Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
             K    EY +IL  F D +   +SIH + Q G   G + G W GP  + +  + LA   
Sbjct: 100 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 159

Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 262
              +        +A+Y VS D          +C        +  A+ H   +S+ +    
Sbjct: 160 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 210

Query: 263 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
             + W P+LL+VPL LG+  +NP Y+   +  F  PQSLG +GGKP  + Y +G   +  
Sbjct: 211 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 270

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
           IYLDPH  Q  +   +     D + +       + +  +DPS+A+GF+C+D+++F+++C 
Sbjct: 271 IYLDPHTTQTFVETEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 330

Query: 381 RASKLAEESNGAPLFTVTQTH 401
              K   +     +F +   H
Sbjct: 331 VIEKEILKHQSLRMFELIPKH 351


>gi|163914473|ref|NP_001106295.1| APG4A protein [Xenopus laevis]
 gi|161611704|gb|AAI55873.1| APG4A protein [Xenopus laevis]
          Length = 395

 Score =  184 bits (468), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 159/321 (49%), Gaps = 23/321 (7%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
            +   D  SR+  +YRK F PIG +  +SD GWGCMLR  QM++AQAL+   LGR W+  
Sbjct: 43  CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 102

Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
             K    EY +IL  F D +   +SIH + Q G   G + G W GP  + +  + LA   
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162

Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 262
              +        +A+Y VS D          +C        +  A+ H   +S+ +    
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213

Query: 263 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
             + W P+LL+VPL LG+  +NP Y+   +  F  PQSLG +GGKP  + Y +G   +  
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
           IYLDPH  Q  +   +     D + +       + +  +DPS+A+GF+C+D+++F+++C 
Sbjct: 274 IYLDPHTTQTFVETEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 333

Query: 381 RASKLAEESNGAPLFTVTQTH 401
              K   +     +F +   H
Sbjct: 334 VIEKEILKHQSLRMFELIPKH 354


>gi|194389756|dbj|BAG60394.1| unnamed protein product [Homo sapiens]
          Length = 379

 Score =  184 bits (467), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 108/323 (33%), Positives = 162/323 (50%), Gaps = 25/323 (7%)

Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
           L S+R+  +  G +  TSD GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  
Sbjct: 37  LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96

Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
           +L+ F D + S +SIH + Q G   G + G W GP  + +  + LA      +       
Sbjct: 97  VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149

Query: 226 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 268
            +A+++     V  +E        V C        D+ RHC+ F  G       + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208

Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           +LL+PL LGL  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268

Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           QP +         D S +       + +  +DPS+A+G +C+ +DDF+D+C +  KL+  
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGSFCKTEDDFNDWCQQVKKLSLL 328

Query: 389 SNGAPLFTVTQTHKKPVNHSDVL 411
               P+F + +     +   DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351


>gi|426218487|ref|XP_004003478.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4B [Ovis
           aries]
          Length = 454

 Score =  184 bits (467), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 117/358 (32%), Positives = 168/358 (46%), Gaps = 42/358 (11%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 69  TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 115

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +    G  E         
Sbjct: 116 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYCRVPPQMGVGE--------- 166

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
                  G + G W GP  + +  + LA    A + L          V++      R G 
Sbjct: 167 -------GKSIGQWYGPNTVAQVLKKLAVFD-AWSALAVHVAMDNTVVMADVRRLCRSGL 218

Query: 244 PVVCID----DASRHCSVFSKG------QADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
           P    +    D+ RHC+ F  G       A W P++LL+PL LGL  VN  Y  TL+  F
Sbjct: 219 PCAGAEAFPADSERHCNGFPAGAEGGECTAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 278

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 279 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 338

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
           + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL
Sbjct: 339 MSITELDPSIAVGFFCKTEDDFNDWCQQVRKLSLLGGALPMFELVEQQPSHLACPDVL 396


>gi|307205961|gb|EFN84087.1| Cysteine protease ATG4D [Harpegnathos saltator]
          Length = 456

 Score =  184 bits (466), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 116/377 (30%), Positives = 177/377 (46%), Gaps = 51/377 (13%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
           S  S +WLLG C+    ++ L +A+                      N + EF +DF+SR
Sbjct: 62  SKESPVWLLGQCYLKKSEDPLENASEALEPEGTGSQVSLAMDATNFENTIEEFKRDFASR 121

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------KPLQ 156
           + ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR W+           Q
Sbjct: 122 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWKWRPEQSIENTQQ 181

Query: 157 KPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
              D  +  I+  F D     SPFSIH L+  G + G  AG W GP ++      L++  
Sbjct: 182 MRDDSNHRMIIKWFADQSKPESPFSIHRLVSLGASTGKRAGDWYGPNSVAH---LLSQAV 238

Query: 215 RAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
                L    L  +A+YV              V + D    C     G   W  ++LLVP
Sbjct: 239 ERTGELPNSKLSRLAVYVAQD---------CAVYMQDVEEVCRTSDGG---WKSLILLVP 286

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           L+LG +K+NP Y P +    T    +G++GG+P  S Y +G Q++  I+LDPH  Q  ++
Sbjct: 287 LMLGTDKLNPVYAPCVTSLLTLDACIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVD 346

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA- 392
           + K++     +++H    R + L  +DPS  +GFY  +++   DF          SN   
Sbjct: 347 VSKENFPL--TSFHCTSPRKMLLSKMDPSCCVGFYFPNRESLTDFMETIHSFVIPSNQKT 404

Query: 393 --PLFTVTQTHKKPVNH 407
             P+F   +  KK +  
Sbjct: 405 DYPMFLFCEGSKKDLQQ 421


>gi|406042044|gb|AFS31124.1| autophagy related protein Atg4-like protein, partial [Spodoptera
           litura]
          Length = 365

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 116/351 (33%), Positives = 176/351 (50%), Gaps = 33/351 (9%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I  +   +W+LG  +   QD           L    +D +S I  +YRKGF PIGD  +T
Sbjct: 5   IPQTKESVWILGKKYSAIQD-----------LDRIRRDITSIIWCTYRKGFIPIGDEGLT 53

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 180
           SD GWGCMLR  QM++  AL+   L   W   +  P  R+  Y++I+  F + + +P+SI
Sbjct: 54  SDKGWGCMLRCGQMVLGVALVRVHLSADW---VWTPETRDPTYLKIIQRFEERKQAPYSI 110

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           H +   G + G   G W GP  + +  + L    +  +        + I+V   +   + 
Sbjct: 111 HQVALMGASEGKQVGQWFGPNTVAQVLKKLTVYDKWSS--------LVIHVALDNTVVKE 162

Query: 241 GGAPVVCIDDASRHCSVFSKGQA-DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
                  +++    CS        DW P+LL+VPL LGL ++NP YI  L++ F  PQS+
Sbjct: 163 DILQQCVVNNDRGDCSAAPDSLVTDWMPLLLIVPLRLGLSEINPIYIDGLKICFQCPQSI 222

Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQP---VINIGKDDLEADTSTYHSDVIRHIHL 356
           G++GGKP  + Y+VG   +  IYLDPH  Q    V     D+ +    +YH      I +
Sbjct: 223 GVIGGKPNQALYLVGCVGDEVIYLDPHTTQRSGLVETKTTDEQKEMDWSYHCKYASRIPM 282

Query: 357 DSIDPSLAIGFYCRDKDDFDDFCAR-ASKLAEESNGAPLFTVTQTHKKPVN 406
            ++DPS+A+ F CR K DF++ CA   +KL  ES   PLF   +  K+P +
Sbjct: 283 LAMDPSVAVCFLCRTKRDFEELCATIETKLMCESQ--PLFETCE--KRPAH 329


>gi|195054945|ref|XP_001994383.1| GH16873 [Drosophila grimshawi]
 gi|193892146|gb|EDV91012.1| GH16873 [Drosophila grimshawi]
          Length = 673

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 113/318 (35%), Positives = 161/318 (50%), Gaps = 21/318 (6%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  + +     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 253 AAENQVTECPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLIA 312

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR   +      Y + +H      FGD  S+ SPFSIH L++ G+  G 
Sbjct: 313 QGLICHFLGRSWRYDPESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 372

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
             G W GP ++    + AL    +        S+ +A    IY+   ++     E     
Sbjct: 373 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAQDCTIYMQDVEQQCSIPEPAPKQ 432

Query: 245 VVCIDDASRHCSVFSK----GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
            V    A +  S   K     Q  W  +++L+PL LG +K+NP Y   L+L  +    LG
Sbjct: 433 HVPWQHAKKSTSDAPKLDQPPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 492

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           I+GGKP  S Y VG QE+  I+LDPH  Q ++++ ++       ++H    R I    +D
Sbjct: 493 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETF--SMHSFHCKSPRKIKSSKMD 550

Query: 361 PSLAIGFYCRDKDDFDDF 378
           PS  IGFYC  K DFD F
Sbjct: 551 PSCCIGFYCATKTDFDSF 568


>gi|452977855|gb|EME77619.1| hypothetical protein MYCFIDRAFT_191078 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 445

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 111/307 (36%), Positives = 159/307 (51%), Gaps = 45/307 (14%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 132
           +EF  DF SR+ I+YR  F PI  S                        TSD GWGCM+R
Sbjct: 109 SEFLDDFESRVWITYRDAFPPIPKSSHPAAASKMSFTTKLRNFTNQAGFTSDTGWGCMIR 168

Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
           S Q L+A  ++ HRLGR WRK  +   +RE+ +IL LF D+  +PFSIH  ++ G +A G
Sbjct: 169 SGQSLLANTIVVHRLGRDWRKGQK---EREHKDILSLFADTPDAPFSIHKFVEHGAQACG 225

Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
              G W GP        A ARC RA T    Q+  + +Y    D D        V ID A
Sbjct: 226 TYPGEWFGP-------NATARCLRALTDKYHQA-GLRVYARPNDSD--------VYID-A 268

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
               +       ++ P L+++ + LG+EKV P Y   L+     PQS+GI GG+P +S Y
Sbjct: 269 LTATATQKDANDEFQPTLIVLGIRLGIEKVTPAYHAALKAALELPQSMGIAGGRPSSSHY 328

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
            VG Q ++  YLDPH  +P+++      + DT   H+  +R + L  +DPS+ +GF  R 
Sbjct: 329 FVGHQGDNFFYLDPHTTRPMLSPQPSAEDVDTC--HTRRVRRLSLAEMDPSMLLGFLVRS 386

Query: 372 KDDFDDF 378
           K+DF+++
Sbjct: 387 KEDFEEW 393


>gi|301104974|ref|XP_002901571.1| cysteine protease family C54, putative [Phytophthora infestans
           T30-4]
 gi|262100575|gb|EEY58627.1| cysteine protease family C54, putative [Phytophthora infestans
           T30-4]
          Length = 392

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 118/338 (34%), Positives = 172/338 (50%), Gaps = 26/338 (7%)

Query: 61  TGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK 120
           T  ++ ++ +WLLG   K   D A  D         + + F S +  +YR+ +  +   +
Sbjct: 14  TPSAALSAPVWLLG---KRYDDVAAVD------FDAYKRSFESILWFTYRRDYPAMTPYE 64

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGDSE 174
            TSD GWGCMLRS+QML+ QAL    LGR WR P      +       YV++L  F DS 
Sbjct: 65  HTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPALFETEIDARLPETYVQLLRWFADSP 124

Query: 175 TSP--FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
                +SIH +++ G  Y    G W GP    +    L    R E G           VV
Sbjct: 125 DVECRYSIHQMVKLGVQYDKLPGEWYGPTTAAQVLRDLVNLHRREFGGELSMYVPQEGVV 184

Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADW-TPILLLVPLVLGLEKVNPRYIPTLRL 291
             D+  +      +C  D   H    ++ ++DW T +L+L+PL LGL++VN RY+P ++ 
Sbjct: 185 YSDDVAK------LCFFDPLLHPPT-TEDKSDWSTALLILIPLRLGLDQVNERYVPAIQK 237

Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA-DTSTYHSDV 350
           +F FPQS+GI+GGK G S Y VG Q++    LDPHDV P   +      A    T HS  
Sbjct: 238 SFAFPQSVGIIGGKKGHSVYFVGTQQDQLHLLDPHDVHPAPELNTAFPTATHLRTVHSSR 297

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
              +++ +IDPSLA+GF C ++ D++DF  R   L +E
Sbjct: 298 PLVMNVTTIDPSLALGFLCENRVDYEDFERRVRILHDE 335


>gi|291202714|dbj|BAI82576.1| autophagy-related 4 [Haemaphysalis longicornis]
          Length = 387

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 110/338 (32%), Positives = 167/338 (49%), Gaps = 41/338 (12%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
           L +   D +S+I ++YR+ F  I  +  TSD GWGCMLR  QM VA+AL+   L R W+ 
Sbjct: 41  LDDLRSDVTSKIWLTYRRNFPAISGTDYTSDTGWGCMLRCGQMAVAEALMRRHLRRGWQW 100

Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
            P  +  D  Y+ +L +F D +   FSIH + Q G + G A G W GP  +      LA 
Sbjct: 101 APGIR--DESYLRVLRMFQDKKNCTFSIHQIAQMGVSEGKAVGQWFGPNTVAHVLRKLAA 158

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA--------- 263
             +  +        +AI+V   +         VV +DD  + C + +  ++         
Sbjct: 159 FDKWSS--------LAIHVAMDN---------VVIMDDIRKVCRLEATAESGVRNRAEPA 201

Query: 264 --------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
                    W P+LL +PL LGL ++NP Y   L+ TF   QSLGI+GGKP  + YI+GV
Sbjct: 202 GLAAAAAESWKPLLLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYIIGV 261

Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
             +  ++LDPH  Q  +++  D    D  +YH      + +  +DPS+A+ FY   + +F
Sbjct: 262 VGDDLVFLDPHTTQLAVDL--DTEFPDDESYHCAHASRMDIGQLDPSIALCFYLPTEAEF 319

Query: 376 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGE 413
           D +C  A K        PLF +T+   +P+   D + E
Sbjct: 320 DSWCNLAHKHLISEMSQPLFEITE--HRPLGWPDFVDE 355


>gi|347971093|ref|XP_554420.4| AGAP004023-PA [Anopheles gambiae str. PEST]
 gi|333469628|gb|EAL39379.4| AGAP004023-PA [Anopheles gambiae str. PEST]
          Length = 606

 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 115/318 (36%), Positives = 161/318 (50%), Gaps = 34/318 (10%)

Query: 93  GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
           G+  F +DF SRI ++YR+ F  + DS  TSD GWGCM+RS QML+AQ L+ H LGR WR
Sbjct: 195 GIDAFRRDFISRIWMTYRREFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLVAHFLGRSWR 254

Query: 153 KPLQKPFDRE---YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 207
             +      E   + +++  FGD  S+TSPFSIH L+  GK  G   G W GP A+    
Sbjct: 255 WDVSMFTAYEESIHRKVIRWFGDTSSKTSPFSIHTLVALGKESGKKPGDWYGPGAVAHLL 314

Query: 208 EALARCQRAET----GLGCQ-SLPMAIYV--------VSGDEDG---ERGGAPVVCIDDA 251
               R    E     G+    +   A+Y+        V     G   +R GAP      +
Sbjct: 315 RQAVRLAAQEITDLDGINVYVAQDCAVYIQDILDECTVPATPAGAPWQRKGAPGGTNSSS 374

Query: 252 SRHCSVFSKG-----------QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
           S   +  S              A W  ++LLVPL LG +K+NP Y   L+   +    +G
Sbjct: 375 STAHTERSGATSCAEGDEDVQSAHWKSLILLVPLRLGTDKLNPIYNECLKAMLSLDYCIG 434

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           I+GG+P  S Y VG QE+  I+LDPH  Q ++++ +D+     +++H    R + L  +D
Sbjct: 435 IIGGRPKHSLYFVGYQEDKLIHLDPHYCQDMVDVNQDNFP--VASFHCKSPRKMKLSKMD 492

Query: 361 PSLAIGFYCRDKDDFDDF 378
           PS  IGFYC  K DF  F
Sbjct: 493 PSCCIGFYCETKKDFYKF 510


>gi|354475125|ref|XP_003499780.1| PREDICTED: cysteine protease ATG4D [Cricetulus griseus]
 gi|344240088|gb|EGV96191.1| Cysteine protease ATG4D [Cricetulus griseus]
          Length = 474

 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 117/382 (30%), Positives = 179/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +    E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----VHLCGRRYHFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQKP------------- 158
             +TSD GWGCMLRS QM++AQ LL H L R WR        P + P             
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLAPPEMPGPASPSRYRGPGR 193

Query: 159 --------------FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                          DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 HVPPRWTQGTLEMEQDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R      C  +P  +  VS D          V   D +R  S +    A+
Sbjct: 250 ---SVVAHILRKAVE-KCSEVPRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  IGFY  ++ +F+  C+   +
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTIGFYAGNRKEFETLCSELMR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436


>gi|225718596|gb|ACO15144.1| Cysteine protease ATG4B [Caligus clemensi]
          Length = 390

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 116/374 (31%), Positives = 184/374 (49%), Gaps = 54/374 (14%)

Query: 66  STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
           S S +W+LG            +    N +AE N +  SR+L +YRK F  I  S  TSD 
Sbjct: 28  SDSPVWILG-----------NELCARNDIAELNSEVLSRLLFTYRKEFSEIDGSGYTSDS 76

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 179
           GWGCMLR  QM++ +AL    LGR W+        + +    +Y++IL+LF DS+ +P+S
Sbjct: 77  GWGCMLRCGQMVLGEALQRISLGRDWKWDHKVDNEVDEDLKGKYLKILNLFQDSKVAPYS 136

Query: 180 IHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
           IH +   G++       G+W GP  + +  + L+  ++        ++P+ ++V   +  
Sbjct: 137 IHQIALMGESIQSKKPVGTWFGPNTVAQVLKKLSFFEK--------TVPIRLHVAMDN-- 186

Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
                   V ID+    C  F  G ++  P+LL +PL LGL ++NP Y   L+  F FPQ
Sbjct: 187 -------TVIIDEIKESCG-FVGGDSE-KPLLLFIPLRLGLTEINPIYFQDLKECFEFPQ 237

Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT------STYHSDVI 351
            LG++GG+P  + Y +G  +   IYLDPH     I+        DT       T+H++  
Sbjct: 238 ILGVIGGRPNHALYFIGYVDNELIYLDPH-----ISTQSASSTVDTFGGPQDQTHHTERA 292

Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHS 408
             +    +DPSL++ F CR++ +F+D C R        + +PLF + +    H  P+  S
Sbjct: 293 YRMDFKDLDPSLSLCFLCRNESEFEDMCERFLFKLIRGHNSPLFEICRQRPEHLMPLPLS 352

Query: 409 DVLGET--GGVPED 420
             L       VPE+
Sbjct: 353 SSLNSDLPNAVPEE 366


>gi|242007959|ref|XP_002424782.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
 gi|212508305|gb|EEB12044.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
          Length = 388

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 128/389 (32%), Positives = 196/389 (50%), Gaps = 39/389 (10%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I  +   +W+LG  +   +D           +     D  S++  +YRKGF PIGDS +T
Sbjct: 21  IPQTREPVWILGRKYDAGRD-----------VTAIRSDIKSKLWFTYRKGFVPIGDSGLT 69

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 180
           SD GWGCMLR  QM++AQAL+   LGR WR  K  ++P   EY+ IL +F D++T+ +SI
Sbjct: 70  SDKGWGCMLRCGQMVLAQALVCLHLGRDWRWKKDSKEP---EYLRILKMFEDTKTATYSI 126

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           H +   G + G   G W GP  + +  + L+   +  + +   +L   I V       +R
Sbjct: 127 HQIALMGVSEGKDVGQWFGPNTVTQVLKKLSVYDKWSSIVIHVALDNTIIVNDIKSLCQR 186

Query: 241 GGAPVVCIDDASRHCS-----VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
               V  ID +++  S     V+      W P+LL+VPL LGL ++NP Y+  L+  FTF
Sbjct: 187 NEQSV--IDSSAQKHSPLNEPVYFNSARKWKPLLLVVPLRLGLSEINPVYLNGLKTCFTF 244

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVIR 352
            QSLG++GGKP  + Y +G   E  IYLDPH  QPV  +   +L  + +   +YH     
Sbjct: 245 RQSLGVIGGKPNHALYFIGCVGEHVIYLDPHTTQPVSIVDGKELSYEKTADLSYHCPRAS 304

Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSD 409
              +  +DPS+A+ F+C  + +FD  C +  +   +S   PLF +T     H  PV +  
Sbjct: 305 RSRILDMDPSVAVCFFCSSEVEFDILCQQIQEKLIKSEKQPLFEITLNKPRHWIPVEN-- 362

Query: 410 VLGETGGVPEDDSLGVMSMNDAVGNAHED 438
                   P + +L +     +  N+ ED
Sbjct: 363 --------PVERTLNLQDYERSFENSDED 383


>gi|350426238|ref|XP_003494376.1| PREDICTED: cysteine protease ATG4D-like [Bombus impatiens]
          Length = 486

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 112/344 (32%), Positives = 167/344 (48%), Gaps = 30/344 (8%)

Query: 84  ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
           A+   +  +G+ EF +DF+SR+ ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191

Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
            H LGR WR  + +P   E  +        I+  FGD    TSPFSIH L+  G  +G  
Sbjct: 192 CHFLGREWRWQVDQPLKTEQQKLDEHNHRLIIKSFGDLPDSTSPFSIHTLVSLGALWGKR 251

Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
           AG W GP ++          Q AE      +L  A+YV              V + D   
Sbjct: 252 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 299

Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
            C +       W  ++L VPL LG +K+NP Y   L    T    +G++GG+P  S Y +
Sbjct: 300 VCQM---PDGKWKSLILFVPLRLGADKLNPVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 356

Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
           G QE+  I LDPH  Q  +++ KD+     +++H    R + +  +DPS  +GFY  +K 
Sbjct: 357 GFQEDKLINLDPHYCQETVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHNKM 414

Query: 374 DFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGET 414
            F +F   A       +E    P+F   +   K +     + E 
Sbjct: 415 QFTNFMEIAPSYLVPEDEKVDYPMFLFCEGSGKDLQQKIEIAEN 458


>gi|357612380|gb|EHJ67950.1| autophagy related protein Atg4-like protein [Danaus plexippus]
          Length = 354

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 113/328 (34%), Positives = 164/328 (50%), Gaps = 49/328 (14%)

Query: 93  GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
           G+  F  DF S+I ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR WR
Sbjct: 8   GIEGFKSDFISKIWMTYRREFPTMSGSSFTTDCGWGCMLRSGQMMLAQALVCHFLGRSWR 67

Query: 153 ---KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 201
              KP+Q    RE+ E      I+  FGD  S  SP SIH ++  G+A G   G W GP 
Sbjct: 68  WSEKPIQN--GREFQEDCLHRMIIKWFGDKSSVNSPLSIHQMVTLGEALGKKPGDWYGP- 124

Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV-------VCIDDASRH 254
                  ++A C +            ++ V +  E+ E     V       + I D   H
Sbjct: 125 ------ASVAHCLK------------SVMVEASKENYEFDKLEVYVAQDSTIYIQDVYTH 166

Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
           C +       W  ++LLVP+ LG E++NP Y P L    T    +GI+GG+P  S Y VG
Sbjct: 167 CRL---PNGCWKSLILLVPVKLGTERLNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVG 223

Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
            Q++  I+LDPH  Q ++++ + +      T+H    R + +  +DPS  IGFY +   D
Sbjct: 224 YQDDRLIHLDPHYCQEMVDVWQPNFSLQ--TFHCRSPRKMPISKMDPSCCIGFYLQTHHD 281

Query: 375 FDDFCARASKL-----AEESNGAPLFTV 397
           F+ F    +          SN  P+FT+
Sbjct: 282 FETFVNVINTFLTPQGVSSSNEYPMFTL 309


>gi|327267215|ref|XP_003218398.1| PREDICTED: cysteine protease ATG4B-like [Anolis carolinensis]
          Length = 393

 Score =  181 bits (458), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 119/401 (29%), Positives = 186/401 (46%), Gaps = 61/401 (15%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD GWGC
Sbjct: 25  VWILGRKYSVLTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR    K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 74  MLRCGQMIFAQALICRHLGRDWRWSKGKKQTDSYYNVLNAFIDKKDSYYSIHQIAQMGVG 133

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +    LA      +        +A+++   +          V ++
Sbjct: 134 EGKSIGQWYGPNTVAQVLRKLASFDTWSS--------LAVHIAMDN---------TVVME 176

Query: 250 DASRHC---------SVFSKGQADW------------------TPILLLVPLVLGLEKVN 282
           +  R C         S F   + D+                   P++LL+PL LGL  +N
Sbjct: 177 EIRRLCKPSCPCPGASAFPAAEPDFLSNGYPEGAECTDRLLLWKPLVLLIPLRLGLTDIN 236

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
             YI TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D
Sbjct: 237 EAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPMDSCYIPD 296

Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 402
            S +       + +  +DPS+A+GF+C  ++DF+D+C R  KL+      P+F + +   
Sbjct: 297 ESFHCQHPPCRMSIAELDPSIAVGFFCNSEEDFNDWCQRIKKLSLIRGALPMFELVEHQP 356

Query: 403 KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
              +  DVL  T    + D L      +   ++ ++D+++L
Sbjct: 357 SHFSSPDVLNLTPDSSDADRL------ERFFDSEDEDFEIL 391


>gi|380015613|ref|XP_003691794.1| PREDICTED: cysteine protease ATG4D-like [Apis florea]
          Length = 486

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 112/348 (32%), Positives = 167/348 (47%), Gaps = 38/348 (10%)

Query: 84  ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
           A+   +  +G+ EF +DF+SR+ ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191

Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
            H LGR WR    +P   E  +        I+  FGD    TSPFSIH L+  G  +G  
Sbjct: 192 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 251

Query: 194 AGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
           AG W GP    + + ++ E  A    A   L       A+YV              V + 
Sbjct: 252 AGDWYGPSSVAHLLSQAVENAAERHPAFNNL-------AVYVAQD---------CAVYLQ 295

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D    C         W  ++L VPL LG +K+NP Y   L    T    +G++GG+P  S
Sbjct: 296 DIENVCQT---PDGKWKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 352

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
            Y +G QE+  I LDPH  Q  +++ KD+     +++H    R + +  +DPS  +GFY 
Sbjct: 353 LYFIGFQEDKLINLDPHYCQETVDVLKDNFSL--TSFHCTSPRKMLISKMDPSCCVGFYF 410

Query: 370 RDKDDFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGET 414
            +K  F +F   A       +E    P+F   +   K ++    + E 
Sbjct: 411 HNKMQFTNFMEIAPSYLVPEDEKIDYPMFLFCEGSGKDLHQKIEIAEN 458


>gi|328786958|ref|XP_393739.4| PREDICTED: cysteine protease ATG4D-like [Apis mellifera]
          Length = 525

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 114/348 (32%), Positives = 171/348 (49%), Gaps = 38/348 (10%)

Query: 84  ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
           A+   +  +G+ EF +DF+SR+ ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+
Sbjct: 171 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 230

Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
            H LGR WR    +P   E  +        I+  FGD    TSPFSIH L+  G  +G  
Sbjct: 231 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 290

Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCID 249
           AG W GP ++     A    Q  E  +  +  P    +A+YV              V + 
Sbjct: 291 AGDWYGPSSV-----AHLLSQAVENAV--ERHPAFNNLAVYVAQD---------CAVYLQ 334

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D    C   S G+  W  ++L VPL LG +K+NP Y   L    T    +G++GG+P  S
Sbjct: 335 DIENVCQT-SDGK--WKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 391

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
            Y +G QE+  I LDPH  Q  +++ KD+     +++H    R + +  +DPS  +GFY 
Sbjct: 392 LYFIGFQEDKLINLDPHYCQETVDVLKDNFSL--TSFHCTSPRKMLISKMDPSCCVGFYF 449

Query: 370 RDKDDFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGET 414
            +K  F +F   A       +E    P+F   +   K ++    + E 
Sbjct: 450 HNKMQFTNFMEIAPSYLVPEDEKIDYPMFLFCEGSGKDLHQKIEIAEN 497


>gi|213513159|ref|NP_001133247.1| cysteine protease ATG4B [Salmo salar]
 gi|209147572|gb|ACI32896.1| Cysteine protease ATG4B [Salmo salar]
 gi|223647372|gb|ACN10444.1| Cysteine protease ATG4B [Salmo salar]
          Length = 397

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 115/312 (36%), Positives = 166/312 (53%), Gaps = 18/312 (5%)

Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
           +SR+  +YRK F PIG +  TSD GWGCMLR  QM++ +AL+   LGR WR    +    
Sbjct: 47  TSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVRRHLGRDWRWVRSQSQRE 106

Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEALAR 212
           +Y+ IL+ F D +   +S+H + Q G   G + G W GP          A+  SW  L  
Sbjct: 107 DYISILNAFLDKKDGYYSLHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRLTV 166

Query: 213 CQRAETGLGCQS-----LPMAIYVVSGDEDGERG-GAPVVCIDDASRHCSVFSKGQADWT 266
               +  +  +      +P   Y  +   D + G   P  C++ A   C++  +  A W 
Sbjct: 167 HVAMDNTVVIEEIKRLCMPWLDYGGAACVDLQGGMPEPNGCLEGA---CALAEEETALWK 223

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
           P+LLL+PL LGL  +N  YI TL+  F  PQSLG++GGKP  + Y +G   E  IYLDPH
Sbjct: 224 PLLLLIPLRLGLSDINEAYIETLKQCFQLPQSLGVIGGKPNHAHYFIGYVGEELIYLDPH 283

Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
             QP +   +D    D + +       +H+  IDPS+A+GF+CR +DDFDD+C R  KL+
Sbjct: 284 TTQPAVEPCEDSQVPDDTYHCQHPPCRMHICEIDPSIAVGFFCRTEDDFDDWCMRFRKLS 343

Query: 387 EESNGAPLFTVT 398
               G P+F + 
Sbjct: 344 HTRAGLPMFELV 355


>gi|427783027|gb|JAA56965.1| Putative cysteine protease required for autophagy [Rhipicephalus
           pulchellus]
          Length = 390

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 164/328 (50%), Gaps = 44/328 (13%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
           L +   + +S+I ++YRK F  I  +  TSD GWGCMLR  QM+VA+A++   LG+ W+ 
Sbjct: 41  LDDLRSNITSKIWLTYRKNFPAISGTDYTSDTGWGCMLRCGQMVVAEAVMRRHLGKDWQW 100

Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
            P  K  D +Y+ +L +F D +   +SIH + Q G + G   G W GP  +      L+ 
Sbjct: 101 SPGTK--DEKYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKEVGQWFGPNTIAHVLRKLST 158

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV--------------- 257
             +  +        +A++V   +         VV +DD  + C V               
Sbjct: 159 FDKWSS--------LAMHVAMDN---------VVVMDDIRKICRVETTTDVEDGIRNRTQ 201

Query: 258 -----FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
                 + G   W P++L +PL LGL ++NP Y   L+ TF   QSLGI+GGKP  + YI
Sbjct: 202 SHGGPAAAGARSWKPLVLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYI 261

Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           +GV  +  ++LDPH  Q  +++   D+E  +  +YH      + +  +DPS+A+ FY   
Sbjct: 262 IGVVGDDLVFLDPHTTQLAVDL---DVECPEDESYHCAHASRMDIGQLDPSIALCFYMAT 318

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQ 399
           + +FD +C  A K        PLF +T+
Sbjct: 319 EAEFDSWCNLAHKHLISQMKQPLFEITE 346


>gi|195401363|ref|XP_002059283.1| GJ16311 [Drosophila virilis]
 gi|194156157|gb|EDW71341.1| GJ16311 [Drosophila virilis]
          Length = 397

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 108/376 (28%), Positives = 177/376 (47%), Gaps = 47/376 (12%)

Query: 48  MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 98
           M  + E  LGP             I    +D+WLLG  +   Q+  L             
Sbjct: 13  MDSVFEAYLGPDSMLAGAVGEPEDIPKRNTDVWLLGKRYNAIQELEL-----------IR 61

Query: 99  QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
           +D  SR+  +YR GF P+G+ ++T+D GWGCMLR  QM++AQAL+   LGR W      P
Sbjct: 62  RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTP 118

Query: 159 --FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
              D  Y++I++ F D+  S +SIH +   G++   A G W+GP  + +  + L R    
Sbjct: 119 DCRDATYLKIVNRFEDTRKSFYSIHQIALTGESQNKAVGEWLGPNTVAQILKILVRFDDW 178

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
            +        + ++V              V +D+    C   S   + W P+LL+VPL L
Sbjct: 179 SS--------LVVHVAMDS---------TVVLDEIYTRCQEVSA--STWKPLLLIVPLRL 219

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G+  +NP YIP L+       S G++GG+P  + Y +G  ++  +YLDPH  Q   ++ +
Sbjct: 220 GISDINPMYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGSVAQ 279

Query: 337 DDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
               A+     +YH      +   ++DPSLA+ F C+ ++ FD+   +  +         
Sbjct: 280 KTTAAEQELDESYHQKYAARLSFGAMDPSLAVCFLCKTRNSFDELLQQLRQEVLSLCTPA 339

Query: 394 LFTVTQTHKKPVNHSD 409
           LF ++Q+     + +D
Sbjct: 340 LFEISQSRAVDWDTAD 355


>gi|346466653|gb|AEO33171.1| hypothetical protein [Amblyomma maculatum]
          Length = 401

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 107/326 (32%), Positives = 167/326 (51%), Gaps = 16/326 (4%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
           L +   D +S+I ++YRK F  I  +  TSD GWGCMLR  QM++A+AL+   LG+ W+ 
Sbjct: 54  LDDLRNDVTSKIWLTYRKNFPAISGTDHTSDTGWGCMLRCGQMVIAEALMRRHLGKGWQW 113

Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
            P  +  D  Y+ +L +F D +   +SIH + Q G + G A G W GP  +      L+ 
Sbjct: 114 APGIR--DENYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKAVGQWFGPNTIAHVLRKLSA 171

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS----VFSKGQADWTPI 268
             +  + L        + V+       R   P V  DD  RH +    +       W P+
Sbjct: 172 FDKW-SSLAVHVAMDNVVVMDDIRKICRVETPAV--DDGVRHRTQSHGLACASAVSWKPL 228

Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           LL +PL LGL ++NP Y   L+ TF   QS+GI+GGKP  + +I+GV  +  ++LDPH  
Sbjct: 229 LLFIPLRLGLNEINPVYYCGLKRTFALKQSVGIIGGKPNHALFIIGVVGDDLVFLDPHTT 288

Query: 329 QPVINIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
           Q  +++   D+E  +  +YH      + +  +DPS+A+ FY   + +FD +C  A K   
Sbjct: 289 QLAVDL---DVEFPEDESYHCAHASRMDIGQLDPSIALCFYLPTECEFDSWCNLAHKHLI 345

Query: 388 ESNGAPLFTVTQTHKKPVNHSDVLGE 413
                PLF +T+  ++P+   D   E
Sbjct: 346 TQMKQPLFEITE--ERPLGWPDFTEE 369


>gi|22658287|gb|AAH30861.1| Autophagy-related 4D (yeast) [Mus musculus]
 gi|74152222|dbj|BAE32395.1| unnamed protein product [Mus musculus]
          Length = 474

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 175/382 (45%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S + L G C+            G   + +F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQQFQRDFVSRLWLTYRRDFPPLAG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
             +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193

Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R      C  +   +  VS D          V   D +R  S +    A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ +     +  ++H    R +    +DPS  +GFY  ++ +F+  C+   +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436


>gi|449268268|gb|EMC79138.1| Cysteine protease ATG4C [Columba livia]
          Length = 459

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 120/405 (29%), Positives = 181/405 (44%), Gaps = 82/405 (20%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 112
           S  S ++LLG C+    DE+ G+ +  G+N           + EF +DF SRI ++YR+ 
Sbjct: 36  SRNSPVFLLGKCYHFKTDES-GELSTDGSNFDKINTEISGNVEEFRKDFISRIWLTYREE 94

Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 151
           F  I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                     
Sbjct: 95  FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIDSSDSESWTAHTV 154

Query: 152 --------------RKP----------LQKPFDRE-------YVEILHLFGDSETSPFSI 180
                         R+P          L++ +D         + +I+  FGDS  + F +
Sbjct: 155 KKLTASFEASLTAEREPKILSNHHRGTLKRNWDESERRNEVYHRKIISWFGDSPLTAFGL 214

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           H L++ GK  G  AG W GP  +           R     G     + IYV         
Sbjct: 215 HQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTIYVAQD------ 263

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
               V   D   R CS    G+AD   +++LVP+ LG E+ N  Y+  ++   +    +G
Sbjct: 264 --CTVYSSDVIDRQCSFMDSGEADTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVG 321

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           I+GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +D
Sbjct: 322 IIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMD 379

Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
           PS  IGFYCR   DF+      +K+ + S+    PLFT  + H +
Sbjct: 380 PSCTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 424


>gi|405972565|gb|EKC37327.1| Cysteine protease ATG4B [Crassostrea gigas]
          Length = 405

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 157/337 (46%), Gaps = 47/337 (13%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--K 153
           E   DF S+I  +YRK F  IG +  T D GWGCMLR  QM++AQAL+   LGR W+  K
Sbjct: 46  ELKGDFLSKIWCTYRKNFPAIGGTGPTCDGGWGCMLRCGQMMLAQALVVRHLGRDWKWNK 105

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
             Q   D+ Y  IL +F D +++ +SI  +   G + G   GSW GP  + +  + LA  
Sbjct: 106 NCQ---DQTYKRILQMFADKKSANYSIQQIASMGVSEGKPVGSWFGPNTVAQVLKKLAVY 162

Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----------- 262
               +          +  ++ D          VC DD    C +    Q           
Sbjct: 163 DEWSS---------IVIHIAMDNTVIENDIKSVCKDDGKSTCDIIGVRQLKHESAATGRS 213

Query: 263 --------------------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
                                 W P+LL++PL LGL ++N  Y+ +L+   +FPQS+GI+
Sbjct: 214 KKSSQDSSKQDKNKQNAVDVKSWKPLLLVIPLRLGLTEINSVYVQSLKACLSFPQSVGII 273

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           GGKP  + + VG   +  IYLDPH  Q   ++  D       +YH      +++  +DPS
Sbjct: 274 GGKPNHAHWFVGYMSDKLIYLDPHTTQLCEDL--DSPNFSDESYHCPYPSTMNVMELDPS 331

Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
           +A+GFYC  + +FDD      K    S+  P+F + +
Sbjct: 332 IALGFYCGTEKEFDDLTQSVQKFVVGSSKTPMFELYK 368


>gi|194213171|ref|XP_001491090.2| PREDICTED: cysteine protease ATG4D [Equus caballus]
          Length = 424

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 179/383 (46%), Gaps = 67/383 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E+ GD      +  F +DF+SR+ ++YR+ F P+  
Sbjct: 33  SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 82

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 83  GCLTSDCGWGCMLRSGQMMLAQGLLLHYLPRDWTWAEGAGLGPPEPVGLSSPNRYRGPAR 142

Query: 153 ---------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 203
                     P     +R + +I+  F D   +PF +H L++ G++ G  AG W GP   
Sbjct: 143 WMAPTLGPGAPPSWSRERRHRQIVSWFADHPRAPFGLHQLVELGQSSGKKAGDWYGP--- 199

Query: 204 CRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 263
                 +A   R       +   + +YV       +   A +V   D +          A
Sbjct: 200 ----SLVAHILRKAVESCAEVTRLVVYVSQDCTVYKADVARLVARPDPT----------A 245

Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
           +W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YL
Sbjct: 246 EWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYL 305

Query: 324 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 383
           DPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +
Sbjct: 306 DPHYCQPTVDVSRADFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELT 363

Query: 384 KLAEESNGA---PLFTVTQTHKK 403
           ++   S+     P+FT+ + H +
Sbjct: 364 RVLSSSSATERYPMFTLAEGHAQ 386


>gi|26349259|dbj|BAC38269.1| unnamed protein product [Mus musculus]
          Length = 474

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 174/382 (45%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S + L G C+            G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
             +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193

Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R      C  +   +  VS D          V   D +R  S +    A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ +     +  ++H    R +    +DPS  +GFY  ++ +F+  C+   +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436


>gi|195118032|ref|XP_002003544.1| GI17971 [Drosophila mojavensis]
 gi|193914119|gb|EDW12986.1| GI17971 [Drosophila mojavensis]
          Length = 382

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 111/388 (28%), Positives = 184/388 (47%), Gaps = 48/388 (12%)

Query: 48  MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 98
           M  + E  LGP             I    +++WLLG  +   Q+           L    
Sbjct: 13  MDSVFEAYLGPDGVLAGAVGEIEDIPKRNTNVWLLGKRYNAIQE-----------LEPIR 61

Query: 99  QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
           +D  SR+  +YR GF P+G+ ++T+D GWGCMLR  QM++AQAL+   LGR W      P
Sbjct: 62  RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTP 118

Query: 159 --FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
              D  Y++I++ F D+  S +SIH +   G++   A G W+GP  + +  + L R    
Sbjct: 119 DCRDATYLKIVNRFEDTRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKILVRFDDW 178

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
            +        +A++V              V +DD    C      ++ W P+LL+VPL L
Sbjct: 179 SS--------LAVHVAMDS---------TVVLDDIYTCCQ--ESSESSWKPLLLIVPLRL 219

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G+  +NP YIP L+       S G++GG+P  + Y +G  ++  +YLDPH  Q    + +
Sbjct: 220 GITDINPIYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGAVAQ 279

Query: 337 DDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
               A+     +YH      +   ++DPSLA+ F C+ +D F++   +  +     +   
Sbjct: 280 KTTAAERELDESYHQKYAARLSFGAMDPSLAVCFLCKTRDSFEELLQQLRQDVLTLSTPA 339

Query: 394 LFTVTQTHKKPVNHSDVLGETGGVPEDD 421
           LF ++Q+     + +D + E   +P+ D
Sbjct: 340 LFEISQSRAVDWDTADDI-EWPAMPDID 366


>gi|29135261|ref|NP_705811.8| cysteine protease ATG4D [Mus musculus]
 gi|61211815|sp|Q8BGV9.1|ATG4D_MOUSE RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
           cysteine endopeptidase; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related cysteine endopeptidase
           4; AltName: Full=Autophagy-related protein 4 homolog D
 gi|26331508|dbj|BAC29484.1| unnamed protein product [Mus musculus]
 gi|26348941|dbj|BAC38110.1| unnamed protein product [Mus musculus]
 gi|27763977|emb|CAC85952.1| APG4-D protein [Mus musculus]
 gi|47125055|gb|AAH69851.1| Autophagy-related 4D (yeast) [Mus musculus]
 gi|148693226|gb|EDL25173.1| autophagy-related 4D (yeast), isoform CRA_b [Mus musculus]
          Length = 474

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 174/382 (45%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S + L G C+            G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
             +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193

Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R      C  +   +  VS D          V   D +R  S +    A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ +     +  ++H    R +    +DPS  +GFY  ++ +F+  C+   +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436


>gi|321472665|gb|EFX83634.1| hypothetical protein DAPPUDRAFT_194862 [Daphnia pulex]
          Length = 389

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 178/362 (49%), Gaps = 34/362 (9%)

Query: 40  KRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQ 99
           KR++ A      +E  +   R G   +   +W+LG            +      L E N 
Sbjct: 23  KRMLEACEAFVTYESGIILERQGFEVNDEPVWILG-----------REYDTKTKLDELNS 71

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--WRKPLQK 157
           D  SR+L++YR+ F PIGDS +TSD GWGCMLR  QM+VAQAL+   LGR   W     +
Sbjct: 72  DVKSRLLLTYRRNFPPIGDSGMTSDRGWGCMLRCGQMVVAQALINQHLGRQPFWPVGDDQ 131

Query: 158 PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
                Y +IL LF D +T+ +SIH L Q G + G   G W GP  + +  + L+      
Sbjct: 132 RTTESYKKILKLFEDKKTAVYSIHQLAQMGVSEGKEIGQWFGPNTVAQVLKKLSEYDEWS 191

Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC--SVFSKGQADWTPILLLVPLV 275
                    + I+V   +          V I++  + C   +     + W+P+LL+VPL 
Sbjct: 192 A--------LKIHVAMDN---------AVVIEEIEQLCHKKITPTETSTWSPLLLVVPLR 234

Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
           LGL  +NP YI +L+     PQS+G++GGKP  + Y +G   +  ++LDPH  Q  I++ 
Sbjct: 235 LGLLNINPIYIDSLKACLQMPQSIGMIGGKPSQALYFIGYVGDDVVFLDPHLTQNAIDLD 294

Query: 336 KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 395
           +D  E D S+YH      I   S+DPSLA+ F C    ++ D   +   ++E      LF
Sbjct: 295 ED--EFDDSSYHPATCARISFQSMDPSLAVCFSCTTHSEWKDLLRQFKDMSEIGKKQNLF 352

Query: 396 TV 397
            V
Sbjct: 353 EV 354


>gi|296804856|ref|XP_002843276.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
 gi|238845878|gb|EEQ35540.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
          Length = 473

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 114/317 (35%), Positives = 161/317 (50%), Gaps = 47/317 (14%)

Query: 97  FNQDFSSRILISYRKGFDPI----GDSK------------------ITSDVGWGCMLRSS 134
           F  DF SR+ I+YR  F PI    G S                    TSD GWGCM+RS 
Sbjct: 138 FLDDFESRLWITYRSHFPPIPKTGGSSSSSMPLGVRLRSQLIDTQGFTSDTGWGCMIRSG 197

Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLA 193
           Q L+A  LLF RLGR WR+  Q   ++E  E+L LF D   +PFSIH  +Q G  A G  
Sbjct: 198 QSLLANTLLFLRLGRGWRRGSQ---EQEESELLSLFADHPRAPFSIHRFVQHGATACGKC 254

Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCIDDAS 252
            G W GP A  +  +ALA         G     + +Y+ S G +  ER    + C     
Sbjct: 255 PGEWFGPAAAAQCIQALAN--------GHPQAGLNVYITSDGSDIYERQFREIACR---- 302

Query: 253 RHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
               +   G+ D   P L+L+ + LG+++V P Y  +L+    FPQS+GI GG+P +S Y
Sbjct: 303 ---GLGEDGEDDSIKPTLILLGVRLGIDRVTPVYWESLKEVIRFPQSVGIAGGRPSSSHY 359

Query: 312 IVGVQEESAIYLDPHDVQPVI---NIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGF 367
            +  Q ++  YLDPH  +P +     G+D     + STYH+  +R +H+  +DPS+ IGF
Sbjct: 360 FIATQGDTFFYLDPHQTRPSLPPRTAGEDVYSPGELSTYHTRRLRRLHIREMDPSMLIGF 419

Query: 368 YCRDKDDFDDFCARASK 384
             RD+ D++D   R  +
Sbjct: 420 LVRDEGDWEDLKGRIRR 436


>gi|440891575|gb|ELR45180.1| Cysteine protease ATG4A, partial [Bos grunniens mutus]
          Length = 408

 Score =  177 bits (450), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 108/355 (30%), Positives = 172/355 (48%), Gaps = 39/355 (10%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 238
            G + G W GP          A+   W +LA     +  +  + +        +S D   
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 195

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
           ER    +     AS      S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 196 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILN 311

Query: 359 IDPSLAI------------GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +DPS+A+            GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 312 LDPSVALVVLSCLLLLPPKGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 365


>gi|178057055|ref|NP_001116551.1| cysteine protease ATG4D [Sus scrofa]
 gi|61211337|sp|Q684M2.1|ATG4D_PIG RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related protein 4 homolog D
 gi|51870495|emb|CAG15153.1| AUT-like 4, cysteine endopeptidase [Sus scrofa]
          Length = 469

 Score =  177 bits (449), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 110/378 (29%), Positives = 178/378 (47%), Gaps = 62/378 (16%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------KPLQKPF----------- 159
             +TSD GWGCMLRS QM++AQ LL H L R W          P   P            
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSQGVGLGPPESSPNRYRGPAHWMPP 192

Query: 160 -----------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
                      +R + +I+  F D   +PF +H L++ G++ G  AG W GP        
Sbjct: 193 HWVQAAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------S 245

Query: 209 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 268
            +A   R       +   + +YV       +   A +V   D +          A+W  +
Sbjct: 246 LVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKAV 295

Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           ++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDPH  
Sbjct: 296 VILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYC 355

Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +++   
Sbjct: 356 QPTVDVSQADFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTRVLSS 413

Query: 389 SNGA---PLFTVTQTHKK 403
           S+     P+FT+ + H +
Sbjct: 414 SSATERYPMFTLVEGHAQ 431


>gi|149642765|ref|NP_001092616.1| cysteine protease ATG4D [Bos taurus]
 gi|148744285|gb|AAI42400.1| ATG4D protein [Bos taurus]
          Length = 472

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 179/381 (46%), Gaps = 65/381 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192

Query: 156 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
                  Q P    +R + +I+  F D   +PF +H L++ G+  G  AG W GP     
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247

Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
               +A   R       +   + +YV       +   A +V   D +          A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
             +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355

Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           H  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +++
Sbjct: 356 HYCQPTVDVSQADFPLE--SFHCTSPRRMAFAKMDPSCTVGFYAGDRKEFETLCSELTRV 413

Query: 386 AEESNGA---PLFTVTQTHKK 403
              S+     P+FT+ + H +
Sbjct: 414 LSSSSATERYPMFTLVEGHAQ 434


>gi|194901010|ref|XP_001980048.1| GG20629 [Drosophila erecta]
 gi|190651751|gb|EDV49006.1| GG20629 [Drosophila erecta]
          Length = 708

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 289 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 348

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 349 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 408

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 409 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 468

Query: 245 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
            V    A R  +   K   Q  W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 469 HVPWQQAKRPQAETPKTEQQQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 528

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 529 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 586

Query: 363 LAIGFYCRDKDDFDDF 378
             IGFYC  K DFD F
Sbjct: 587 CCIGFYCATKSDFDSF 602


>gi|431918972|gb|ELK17839.1| Cysteine protease ATG4D [Pteropus alecto]
          Length = 442

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 182/385 (47%), Gaps = 66/385 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 52  SRTSFSKLSS----VHLCGRRYRFETEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 101

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR----------KP--LQKPF------- 159
             +TSD GWGCMLRS QM++AQ LL H L R W           +P  L  P+       
Sbjct: 102 GYLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWVKGVGLDPPEPSRLASPYWHHGPAC 161

Query: 160 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                          +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 162 WIPPHWTQGSPELEQERRHRQIVSWFADHPKAPFGLHQLVELGQSSGKKAGDWYGP---- 217

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 218 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 264

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + +   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 325 PHYCQPTVDVSQANFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTR 382

Query: 385 LAEESNGA---PLFTVTQTHKKPVN 406
           +   S+     P+FT+ + H +  N
Sbjct: 383 VLSSSSATERYPMFTLAEGHAQDHN 407


>gi|195570668|ref|XP_002103326.1| GD20357 [Drosophila simulans]
 gi|194199253|gb|EDX12829.1| GD20357 [Drosophila simulans]
          Length = 703

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463

Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
            V    A R  +   K   Q  W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 464 HVPWQQAKRPQAETPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 581

Query: 363 LAIGFYCRDKDDFDDF 378
             IGFYC  K DFD+F
Sbjct: 582 CCIGFYCATKSDFDNF 597


>gi|195444549|ref|XP_002069918.1| GK11310 [Drosophila willistoni]
 gi|194166003|gb|EDW80904.1| GK11310 [Drosophila willistoni]
          Length = 676

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 111/328 (33%), Positives = 156/328 (47%), Gaps = 43/328 (13%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  +G+     G+  F +DF SR+ ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 258 AVENQVGETPWEEGIEGFRRDFYSRLWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 317

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L+  G A G 
Sbjct: 318 QGLIVHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVSLGTALGK 377

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP ++      L       T        +++YV              + I D  
Sbjct: 378 KPGDWYGPASVSY---LLKHALEHATQENADFDNISVYVAKD---------CTIYIQDIE 425

Query: 253 RHCSV----------------------FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
             CS+                          Q  W  +++L+PL LG +KVNP Y   L+
Sbjct: 426 DQCSIPEPAPKQTHVPWQQMKRPSLNEHQPDQQHWKSVIILIPLRLGTDKVNPAYAHCLK 485

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
           L  +    LGI+GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H   
Sbjct: 486 LLLSTENCLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQENF--SMQSFHCKS 543

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
            R I    +DPS  IGFYC  K DFD  
Sbjct: 544 PRKIKTSKMDPSCCIGFYCATKSDFDSL 571


>gi|296485832|tpg|DAA27947.1| TPA: APG4 autophagy 4 homolog D [Bos taurus]
          Length = 472

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 179/381 (46%), Gaps = 65/381 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192

Query: 156 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
                  Q P    +R + +I+  F D   +PF +H L++ G+  G  AG W GP     
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247

Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
               +A   R       +   + +YV       +   A +V   D +          A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
             +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355

Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           H  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +++
Sbjct: 356 HYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRV 413

Query: 386 AEESNGA---PLFTVTQTHKK 403
              S+     P+FT+ + H +
Sbjct: 414 LSSSSATERYPMFTLVEGHAQ 434


>gi|24647125|ref|NP_650452.1| CG6194 [Drosophila melanogaster]
 gi|23171357|gb|AAF55180.2| CG6194 [Drosophila melanogaster]
 gi|261490735|gb|ACX83596.1| RE44406p [Drosophila melanogaster]
          Length = 668

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 166/316 (52%), Gaps = 19/316 (6%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML A
Sbjct: 249 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 308

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 309 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 368

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 369 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 428

Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
            V    A R  +  +K   Q  W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 429 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 488

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 489 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 546

Query: 363 LAIGFYCRDKDDFDDF 378
             IGFYC  K DFD+F
Sbjct: 547 CCIGFYCATKSDFDNF 562


>gi|340722130|ref|XP_003399462.1| PREDICTED: cysteine protease ATG4D-like [Bombus terrestris]
          Length = 485

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 106/305 (34%), Positives = 153/305 (50%), Gaps = 27/305 (8%)

Query: 84  ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
           A+   +  + + EF +DF+SR+ ++YR+ F  +  S  TSD GWGCMLRS QM++AQAL+
Sbjct: 131 AMDAISFEDSIEEFKKDFTSRLWLTYRREFPILNGSTFTSDCGWGCMLRSGQMMLAQALV 190

Query: 144 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 193
            H LGR WR  + +P   E  +        I+  FGD    TSPFSIH L+  G   G  
Sbjct: 191 CHFLGREWRWQVDQPLKTEQQKLDEYNHRLIIKSFGDLPDSTSPFSIHTLVSLGALSGKR 250

Query: 194 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
           AG W GP ++          Q AE      +L  A+YV              V + D   
Sbjct: 251 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 298

Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
            C +       W  ++L VPL LG +K+N  Y   L    T    +G++GG+P  S Y +
Sbjct: 299 VCQM---PDGKWKSLILFVPLRLGADKLNLVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 355

Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
           G QE+  I LDPH  Q  +++ KD+     +++H    R + +  +DPS  +GFY  DK 
Sbjct: 356 GFQEDKLINLDPHYCQETVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHDKM 413

Query: 374 DFDDF 378
            F +F
Sbjct: 414 QFTNF 418


>gi|195328749|ref|XP_002031074.1| GM25780 [Drosophila sechellia]
 gi|194120017|gb|EDW42060.1| GM25780 [Drosophila sechellia]
          Length = 703

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHASQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463

Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
            V    A R  +   K   Q  W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 464 HVPWQKAKRPQAENPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 581

Query: 363 LAIGFYCRDKDDFDDF 378
             IGFYC  K DFD+F
Sbjct: 582 CCIGFYCATKSDFDNF 597


>gi|17862242|gb|AAL39598.1| LD17482p [Drosophila melanogaster]
          Length = 653

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 166/316 (52%), Gaps = 19/316 (6%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML A
Sbjct: 234 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 293

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 294 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 353

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 354 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 413

Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
            V    A R  +  +K   Q  W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 414 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 473

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 474 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 531

Query: 363 LAIGFYCRDKDDFDDF 378
             IGFYC  K DFD+F
Sbjct: 532 CCIGFYCATKSDFDNF 547


>gi|320169048|gb|EFW45947.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 918

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 120/351 (34%), Positives = 172/351 (49%), Gaps = 37/351 (10%)

Query: 64  SSSTSDIWLLGVCHKIAQDEALGDAAGNNG-----LAEFNQDFSSRILISYRKGFDPIGD 118
           S S S IW+LG C+   + E  G     +      + +F  DF + +  SYRK F+ I  
Sbjct: 260 SISDSPIWMLGNCYSGKELECNGHTENKHNKRSRHICKFFADFQTLVCFSYRKDFERIPG 319

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQKPFDREYVEILHLFG 171
           SK T+D GWGC LRS+QMLVA+AL+    GR WR        PL    + +   I+ LF 
Sbjct: 320 SKHTTDCGWGCTLRSAQMLVAEALVLQIFGRRWRIEDRSCPAPLSSSKEDQLRLIIRLFQ 379

Query: 172 DSET--SPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
           D     SPFSIHN++Q G + +   AG W GP ++ R +  L     A      ++    
Sbjct: 380 DQLRLDSPFSIHNIVQHGCQLFDKRAGDWFGPASVVRVFADLINQAYAMHQSPFRAYQAI 439

Query: 229 IYVVSGDEDGERGGAPVVCID-DASRHCSVFSKGQADWT-------------------PI 268
            +++  D   E    P    D + S   S       D T                   P+
Sbjct: 440 DHIIYRDLVAELCSGPDAVRDLEFSTPTSTSESVSTDETVTPSASTSQSPPVLPPPFIPL 499

Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           L+L+PL LGL ++N  YIP L+      Q +GI+GG+P  S Y VG QE++ I+ DPH  
Sbjct: 500 LILMPLRLGLNEINRMYIPCLKALLMCAQCVGIIGGRPRHSLYFVGYQEDNVIFADPHGC 559

Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
           +  +++ +      T T+HS V   I    +DPS+AIGF C+++ DFDD C
Sbjct: 560 KRFVDMQQTSFP--TETFHSAVPNKIPFTHMDPSMAIGFLCQNQADFDDLC 608


>gi|195394658|ref|XP_002055959.1| GJ10670 [Drosophila virilis]
 gi|194142668|gb|EDW59071.1| GJ10670 [Drosophila virilis]
          Length = 672

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/318 (35%), Positives = 163/318 (51%), Gaps = 21/318 (6%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  + D+    G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 252 AAENQMADSPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 311

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 312 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGEQLGK 371

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
             G W GP ++    + AL    +        S+ +A    IY+   +E     E    P
Sbjct: 372 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEEQCSIPEPAPKP 431

Query: 245 VVCIDDASRH----CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
            V     S+          + Q  W  +++L+PL LG +K+NP Y   L+L  +    LG
Sbjct: 432 HVPWQMTSKKPASDAPKLDQPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 491

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           I+GGKP  S Y VG QE+  I+LDPH  Q ++++ ++       ++H    R +    +D
Sbjct: 492 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETF--SMQSFHCKSPRKLKSSKMD 549

Query: 361 PSLAIGFYCRDKDDFDDF 378
           PS  IGFYC  K DFD F
Sbjct: 550 PSCCIGFYCATKTDFDSF 567


>gi|194764839|ref|XP_001964535.1| GF23235 [Drosophila ananassae]
 gi|190614807|gb|EDV30331.1| GF23235 [Drosophila ananassae]
          Length = 668

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/316 (35%), Positives = 167/316 (52%), Gaps = 19/316 (6%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 248 AVENQVGEHPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 307

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H +GR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 308 QGLICHFMGRTWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGENLGK 367

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 368 KPGDWYGPASVSYLLKHALEHAAQENADFDNISIYVAKDCTIYLQDIEDQCSVPEPAPKP 427

Query: 245 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
            V    A R  +  SK   Q  W  +++L+PL LG +K+N  Y   L+L  +    LGI+
Sbjct: 428 NVPWQQAKRPQAEVSKTEHQQHWKALIVLIPLRLGSDKLNLAYAHCLKLLLSTEHCLGII 487

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++   +  ++H    R +    +DPS
Sbjct: 488 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENFSLN--SFHCKSPRKLKSSKMDPS 545

Query: 363 LAIGFYCRDKDDFDDF 378
             IGFYC  K DFD+F
Sbjct: 546 CCIGFYCATKSDFDNF 561


>gi|154300262|ref|XP_001550547.1| hypothetical protein BC1G_11320 [Botryotinia fuckeliana B05.10]
 gi|166990615|sp|A6SDQ3.1|ATG4_BOTFB RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|347841273|emb|CCD55845.1| similar to cysteine protease atg4 [Botryotinia fuckeliana]
          Length = 439

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 106/310 (34%), Positives = 159/310 (51%), Gaps = 51/310 (16%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF ++I ++YR  F  I  S+                        TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
            Q L+A ALL  R+GR WR+ +    +R+   IL LF D   +P+SIH  ++ G  A G 
Sbjct: 163 GQSLLANALLTLRMGREWRRGVSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  +AL+  Q            + +Y+ +GD      G+ V       
Sbjct: 220 HPGEWFGPSATARCIQALSNSQAKSE--------LRVYI-TGD------GSDVY----ED 260

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
           +  S+     +D+TP L+LV   LGL+K+ P Y   L+ +   PQS+GI GG+P +S Y 
Sbjct: 261 KFMSIAKPNHSDFTPTLILVGTRLGLDKITPVYWEALKYSLQMPQSVGIAGGRPSSSHYF 320

Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLE----ADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
           +GVQE    YLDPH  +P +   KD++E     D  + H+  +R +H+  +DPS+ I F 
Sbjct: 321 IGVQESDFFYLDPHQTRPALPY-KDNVEDYTTEDIDSCHTRRLRRLHIKEMDPSMLIAFL 379

Query: 369 CRDKDDFDDF 378
            RD++D++++
Sbjct: 380 IRDENDWNEW 389


>gi|321472016|gb|EFX82987.1| hypothetical protein DAPPUDRAFT_302128 [Daphnia pulex]
          Length = 405

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/336 (33%), Positives = 166/336 (49%), Gaps = 39/336 (11%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
           S  S IWLLG  +  +       +   N       DF SRI ++YRK F  +  S  TSD
Sbjct: 18  SKDSPIWLLGRIYHQSHKTDDSSSLPTNNFEALKSDFFSRIWLTYRKEFPVLNGSYYTSD 77

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPFDREYVEILHLFGD--S 173
            GWGCMLRS QML+AQAL+ H LGR WR         + LQ+   R    I+  FGD  S
Sbjct: 78  CGWGCMLRSGQMLLAQALVCHFLGRDWRWNESGAQEQQTLQESLHR---MIVQWFGDKPS 134

Query: 174 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
              P SIH ++  G  + G   G W GP ++  S+      QRA T    +   + +Y+ 
Sbjct: 135 PACPLSIHQMVSQGHISAGKRPGDWYGPSSV--SYIIKQILQRA-TDTYPELDTLRVYIA 191

Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQAD----------WTPILLLVPLVLGLEKVN 282
                        V +DD  + CS     + +          W  ++LL+PL LG E++N
Sbjct: 192 QD---------CTVYLDDVKQSCSKICNYECEETDYELIDDQWKSLILLIPLRLGGERMN 242

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
           P Y   L+   +  Q +GI+GGKP  S Y +G Q++  I+LDPH+ Q ++++   +   +
Sbjct: 243 PTYDSCLKGLLSLEQCIGIIGGKPKHSQYFIGWQDDYLIHLDPHNCQEMVDVLIPNF--N 300

Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
             ++H   +R   L  +DPS  +GFY R + +FD+F
Sbjct: 301 LKSFHCHELRKTALKQVDPSCCVGFYLRSQREFDEF 336


>gi|213390042|gb|ACJ46060.1| autophagy related protein Atg4-like protein [Bombyx mori]
          Length = 355

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 106/297 (35%), Positives = 151/297 (50%), Gaps = 27/297 (9%)

Query: 92  NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
            G+  F  DF S+I ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR W
Sbjct: 15  EGIEGFKSDFVSKIWMTYRREFPTMTGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRSW 74

Query: 152 RKPLQKPFD--REYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 201
           R   +KP    RE+ E      I+  FGD  S  SP SIH ++  G+A G   G W GP 
Sbjct: 75  RWLPEKPIQNAREFQEDCLHRKIIKWFGDKSSVNSPLSIHQMVSLGEALGKKPGDWYGPA 134

Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 261
           ++    ++L      E     +   + +YV              V I D    C +    
Sbjct: 135 SVAHCLKSLIASASKENY---EFDHLEVYVAQDS---------TVYIQDIYSMCQLL--- 179

Query: 262 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
              W  ++LLVP+ LG EK NP Y P L    T    +GI+GG+P  S Y VG Q++  I
Sbjct: 180 HGAWKSLILLVPVKLGTEKFNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVGYQDDKLI 239

Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           +LDPH  Q ++++ + +      ++H    R + L  +DPS  IGFY   + DF+ F
Sbjct: 240 HLDPHYCQEMVDVWQPNFSLQ--SFHCRSPRKMPLAKMDPSCCIGFYLGTQHDFETF 294


>gi|428170513|gb|EKX39437.1| hypothetical protein GUITHDRAFT_143439 [Guillardia theta CCMP2712]
          Length = 332

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 150/285 (52%), Gaps = 40/285 (14%)

Query: 70  IWLLGVCHKIA------------QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG 117
           +WLLGV + +A             ++ + D + N     F  D  SR+  SYR  F PI 
Sbjct: 70  VWLLGVRYTLAPPPMGQRGEGRETEQTVVDESQN-----FKLDMWSRLWFSYRYNFHPIS 124

Query: 118 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR---EYVEILHLFGDSE 174
            +++T+D GWGCM+RS QML+ QAL+ H LGR WR      ++    +Y ++L +F D  
Sbjct: 125 GTELTTDTGWGCMIRSGQMLIGQALVHHHLGRDWRLSHTSKYNELPSDYRKVLEMFLDHP 184

Query: 175 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC-QSLPMAIYVVS 233
            +P SIH+ ++AG+  G  AG+W GP  +C ++  L     A   LG   +L +  Y   
Sbjct: 185 CAPLSIHSFVRAGQQVGKKAGTWFGPNTVCSAFSKL----HAGGALGSDNNLQLLAY--- 237

Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
              DG  G       D+           QA   P+ +L+P  LG+  V+P YIP +   F
Sbjct: 238 ---DGNDG-------DNTIYKSEALELLQAG--PLFILLPTRLGVSSVDPSYIPKISHVF 285

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
           +FPQSLG +GGKP ++ Y +  Q E+  YLDPH  QP+INI + +
Sbjct: 286 SFPQSLGFIGGKPSSAHYFIASQGEAVYYLDPHTPQPLINISEKE 330


>gi|195539710|gb|AAI68141.1| Atg4d protein [Rattus norvegicus]
          Length = 442

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 176/382 (46%), Gaps = 67/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S + L G C+            G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 53  SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 102

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
           S +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 103 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 161

Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 162 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 217

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R      C  +   +  VS D          V   D +R  S +    A+
Sbjct: 218 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 264

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + +   +  ++H    R +    +DPS  +GFY  ++ +F+  C+   +
Sbjct: 325 PHYCQPTVDVNQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 382

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FTV + H +
Sbjct: 383 ILSSSSVTERYPMFTVAEGHAQ 404


>gi|210032083|ref|NP_001094483.2| autophagy-related 4D [Rattus norvegicus]
 gi|149020504|gb|EDL78309.1| rCG31864, isoform CRA_b [Rattus norvegicus]
          Length = 473

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 176/382 (46%), Gaps = 67/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S + L G C+            G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
           S +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192

Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 248

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R      C  +   +  VS D          V   D +R  S +    A+
Sbjct: 249 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 295

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + +   +  ++H    R +    +DPS  +GFY  ++ +F+  C+   +
Sbjct: 356 PHYCQPTVDVNQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 413

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FTV + H +
Sbjct: 414 ILSSSSVTERYPMFTVAEGHAQ 435


>gi|348550913|ref|XP_003461275.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like [Cavia
           porcellus]
          Length = 474

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 112/382 (29%), Positives = 179/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S I+L G  ++           G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSK-LSSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWAEGPGLGSPELPGTASPSPGRSPAR 192

Query: 152 ----RKPLQKP-FDRE--YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R P   P  ++E  + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 193 WVPPRWPRGAPELEQELRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 248

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   +A+YV       +   A +V   D +          A+
Sbjct: 249 ---SLVAHILRKAVESSSEVTRLAVYVSQDCTVYKADVAHLVASRDPT----------AE 295

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  ++ +F+  CA  ++
Sbjct: 356 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCAELTR 413

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 414 ILSCSSATERYPMFTLAEGHAQ 435


>gi|340383455|ref|XP_003390233.1| PREDICTED: cysteine protease ATG4D-like [Amphimedon queenslandica]
          Length = 437

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 167/356 (46%), Gaps = 54/356 (15%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
           S+ S + +LG  +   +D           +  F   F S   ++YR GF PI  S +T+D
Sbjct: 61  SNNSPVLVLGKLYIPERDTKPQSEGIPRHILMFMDHFYSLPWMTYRCGFSPILSSSLTTD 120

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR-----------KPLQKPFDREYVEILHLFGDS 173
            GWGCM+RS QML+A  L  H LGR WR               K ++   V IL  FGDS
Sbjct: 121 CGWGCMVRSGQMLLATVLHLHFLGRDWRLSSSDVTGHKIHRQVKNWNNYVVLILSWFGDS 180

Query: 174 ETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIY 230
           E+   PFSIH L++A   +G   G W GP  +      L R C R           + IY
Sbjct: 181 ESELCPFSIHRLMEAAYYHGNKPGDWFGPSQV----SILIRDCVRRALREHINLQKLNIY 236

Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW------TPILLLVPLVLGLEKVNPR 284
           V                    S  C+V+ K   D         +L+LVP+ LG E +NP 
Sbjct: 237 V--------------------SHDCTVYIKDVQDIFESDLDQSLLVLVPVRLGSESLNPI 276

Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
           YIP ++       ++GI+GG+P  S + +G Q+E+ I+LDPH  Q  +N+ + D   D S
Sbjct: 277 YIPCVKALLALDHTVGIIGGRPKHSVFFIGFQDENLIHLDPHYSQTAVNMTRTDF--DVS 334

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
           +YH    + I +  +DPS  +GFYC   +DF+ F   A K+         FTVT T
Sbjct: 335 SYHCRSPKKIPVTKMDPSCTLGFYCHTLEDFNHFRIEAEKVT--------FTVTPT 382


>gi|452837994|gb|EME39935.1| hypothetical protein DOTSEDRAFT_47435 [Dothistroma septosporum
           NZE10]
          Length = 442

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 113/354 (31%), Positives = 169/354 (47%), Gaps = 56/354 (15%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 132
           +EF +D  S+I ++YR  F PI  S                        TSD GWGCM+R
Sbjct: 111 SEFLEDVESKIWLTYRNNFPPIPKSSEAAATSAMSFTTKLRNFANKDGFTSDTGWGCMIR 170

Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
           S Q L+A A+L HRLGR WR+  +   +REY +IL LF D+  SP SIH  ++ G +A G
Sbjct: 171 SGQSLLANAILIHRLGRDWRRGDK---EREYKDILSLFADTPESPLSIHKFVEHGAQACG 227

Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDEDGERGGAPVVCID 249
              G W GP A  R   AL   +  E GL   S P    +YV                  
Sbjct: 228 TYPGEWFGPNATARCIRALTE-KYHEAGLQVYSRPNDSDVYV------------------ 268

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D+    +        + P L+++ + LG+EKV P Y   L+      QS+GI GG+P +S
Sbjct: 269 DSLMQTAAQKDADDKFQPTLIVLGIRLGIEKVTPAYHAALKAALELSQSVGIAGGRPSSS 328

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
            Y +G Q ++  YLDPH  +P+++     L  D ++ H+  +R + +  +DPS+ +GF  
Sbjct: 329 HYFIGHQGDNFFYLDPHTTRPMLS--PQPLAEDINSCHTRRVRRLGIAEMDPSMLLGFLI 386

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
           R KD+F+ +    S++       P   +   H+    +S      G V E ++L
Sbjct: 387 RSKDEFEQWRKSISEI-------PGKAIIHIHETEPKYSTGTERAGAVDEVETL 433


>gi|148228573|ref|NP_001085611.1| cysteine protease ATG4A [Xenopus laevis]
 gi|61211771|sp|Q6GPU1.1|ATG4A_XENLA RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|49115669|gb|AAH73017.1| MGC82614 protein [Xenopus laevis]
          Length = 397

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 107/320 (33%), Positives = 163/320 (50%), Gaps = 21/320 (6%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
            +   D  SR+  +YRK F PIG +  +SD GWGCMLR  QM++AQAL+   LGR WR  
Sbjct: 45  CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALVCQHLGRDWRWE 104

Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
             K    EY +IL  F D +   +SIH + Q G   G + G W GP  + +  + LA   
Sbjct: 105 KHKNHPEEYQQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 164

Query: 215 RAETGLGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSKGQ----- 262
              +        +A+Y      VV  D        P  C +  A+ H S +S+ +     
Sbjct: 165 EWNS--------LAVYVSMDNTVVVEDIKTMCKYQPQSCSMAQAASHQSTWSRCRDTSGH 216

Query: 263 -ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
            + W P+LL+VPL LG+  +NP Y+   +  F  PQSLG +GGKP  + Y +G   +  I
Sbjct: 217 CSGWRPLLLVVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEII 276

Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
           YLDPH  Q  ++  +     D + +       + + ++DPS+A+GF+C+D++DF+++C  
Sbjct: 277 YLDPHTTQTFVDTEEAGTVQDQTYHCQKGPNSMKVLNLDPSVALGFFCKDENDFNNWCEV 336

Query: 382 ASKLAEESNGAPLFTVTQTH 401
             K   +     +F +T  H
Sbjct: 337 IEKEILKHQSLRMFELTPKH 356


>gi|195501322|ref|XP_002097748.1| GE26385 [Drosophila yakuba]
 gi|194183849|gb|EDW97460.1| GE26385 [Drosophila yakuba]
          Length = 706

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 113/316 (35%), Positives = 165/316 (52%), Gaps = 19/316 (6%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 287 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 346

Query: 140 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 347 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 406

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 407 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 466

Query: 245 VVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
            V    A R  +   K +    W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 467 HVPWQQAKRPQAETPKTEQHQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 526

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 527 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 584

Query: 363 LAIGFYCRDKDDFDDF 378
             IGFYC  K DFD F
Sbjct: 585 CCIGFYCATKSDFDSF 600


>gi|431822417|ref|NP_001258916.1| cysteine protease ATG4A isoform 2 [Gallus gallus]
 gi|61211756|sp|Q5ZIW7.1|ATG4A_CHICK RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|53134379|emb|CAG32326.1| hypothetical protein RCJMB04_23b20 [Gallus gallus]
          Length = 380

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 171/356 (48%), Gaps = 52/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H + +D++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 12  VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 60

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W+    K    EY  ILH F D +   +SIH + Q G  
Sbjct: 61  MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 120

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 121 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 163

Query: 250 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
           D  + C                  S   + +        W P+LL++PL LG+  +NP Y
Sbjct: 164 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 223

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 224 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 283

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +       + + ++DPS+A+GF+C+++ DFD++C+   K   +     +F + Q H
Sbjct: 284 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 339


>gi|431822415|ref|NP_001258915.1| cysteine protease ATG4A isoform 1 [Gallus gallus]
          Length = 397

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 171/356 (48%), Gaps = 52/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H + +D++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W+    K    EY  ILH F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
           D  + C                  S   + +        W P+LL++PL LG+  +NP Y
Sbjct: 181 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 240

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 241 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 300

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +       + + ++DPS+A+GF+C+++ DFD++C+   K   +     +F + Q H
Sbjct: 301 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 356


>gi|291414155|ref|XP_002723329.1| PREDICTED: APG4 autophagy 4 homolog D [Oryctolagus cuniculus]
          Length = 408

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 115/396 (29%), Positives = 184/396 (46%), Gaps = 67/396 (16%)

Query: 46  GSMRRIHE---RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 102
           G  RR  E   R    SRT  S  +S    + VC +  + E  GD      +  F +DF 
Sbjct: 4   GGARRPREHGGRWAVKSRTSFSKISS----VHVCGRRYRFEGEGD------IQRFQRDFV 53

Query: 103 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------- 151
           SR+ ++YR+ F P+    +TSD GWGCMLRS QM++AQ+LL H L R W           
Sbjct: 54  SRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQSLLLHFLPRDWTWAEGLGSAEP 113

Query: 152 ---------RKPL------------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
                    R P             +   +R + +I+  F D   +PF +H L++ G++ 
Sbjct: 114 AGSASPSRYRGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPGAPFGLHRLVELGQSS 173

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
           G  AG W GP         +A   R       +   + +YV       +   A +V   D
Sbjct: 174 GKKAGDWYGP-------SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPD 226

Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
            +          A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S 
Sbjct: 227 PT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRLELCLGIMGGKPRHSL 276

Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
           Y +G Q++  +YLDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  
Sbjct: 277 YFIGYQDDFLLYLDPHYCQPTVDVSQTDFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAG 334

Query: 371 DKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 403
            + +F+  C+  +++   S+     P+FT+ + H +
Sbjct: 335 GRKEFETLCSELTRVLGSSSATERYPMFTLAEGHAQ 370


>gi|324506823|gb|ADY42901.1| Cysteine protease ATG4B [Ascaris suum]
          Length = 433

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 118/392 (30%), Positives = 176/392 (44%), Gaps = 62/392 (15%)

Query: 62  GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI 121
            +  S + ++LLG  HK     A GD    + + E+    +SR+  +YRK F PIG +  
Sbjct: 19  SVFDSNTPVYLLG--HKFP---ARGDM---DSIKEY---VTSRLWFTYRKNFMPIGGTGP 67

Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 181
           TSD GWGCMLR  QML+AQAL+   LG  W        + +Y  IL +F D +  PFS+H
Sbjct: 68  TSDQGWGCMLRCGQMLLAQALIVRHLGTEWMWDRDNK-EEDYKRILRMFQDKKCCPFSLH 126

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALA---RCQRAETGLGCQSLPMAIYVVS----- 233
            + Q G +     G W GP    +  + L       R    +   +L +A  V +     
Sbjct: 127 QIAQMGVSERKQIGEWFGPNTAAQVLKKLVVYDDWSRLAVHVALDNLLIASDVRTMAHTR 186

Query: 234 ---------------GDEDGERGGAPVVCIDDASRHCSVFS-----------KGQADWTP 267
                           +E G   G   +C   + + C + S           + +  W P
Sbjct: 187 PPSRLSSRHTTENEQSEESGNASGGNSLCSFGSVKMCMLQSALMKECDENPVEDEEQWRP 246

Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
           +L++VPL LGL  +N  Y+P +   F  PQ  GI+GG+P  + Y +G+  E  IYLDPH 
Sbjct: 247 LLIIVPLRLGLTSINRCYLPAIEAFFQLPQCTGIIGGRPNHALYFIGIAGEQLIYLDPHV 306

Query: 328 VQPVINIG----------------KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
            Q  I++                 K     D S+YH   + HI  DS DPSLA+ F CR 
Sbjct: 307 CQAAIDLDERCASLQQQDGFVEVVKSTDIFDDSSYHCPFLLHIAYDSADPSLALSFICRT 366

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQTHKK 403
           +++++            ++  PLF + +T  K
Sbjct: 367 EEEYEHLANNLKTKVLPASSPPLFELLETRPK 398


>gi|341885317|gb|EGT41252.1| hypothetical protein CAEBREN_15768 [Caenorhabditis brenneri]
          Length = 457

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 111/368 (30%), Positives = 170/368 (46%), Gaps = 60/368 (16%)

Query: 84  ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 142
           ALG +    +G+    +  SSR   +YRK F PIG +  TSD GWGCMLR +QML+ + L
Sbjct: 34  ALGKEITEEDGIEAMKKYMSSRFWFTYRKDFSPIGGTGPTSDQGWGCMLRCAQMLLGEVL 93

Query: 143 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
           L   +GR +   ++      Y +IL +F D + + +SIH + Q G   G     W GP  
Sbjct: 94  LRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIHQIAQMGVTEGKEISKWFGPNT 152

Query: 203 MCR---------SWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 246
             +          W  +A     +  L  + +L MA    S D      E+G+       
Sbjct: 153 AAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTYPSEDAVKLIMENGQ------- 205

Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
                 +H +  +  + +W P+LL++PL LGL  +N  Y+P ++  F  PQ +GI+GGKP
Sbjct: 206 ----VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCYLPAIQEFFKLPQCVGIIGGKP 261

Query: 307 GASTYIVGVQEESAIYLDPHDVQPV------------------INIGK-DDLE------- 340
             + Y VG+      YLDPH  +P                    N  + +DLE       
Sbjct: 262 NLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTESEQHDTNFSELEDLEPLPSQTS 321

Query: 341 -----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 395
                 D STYH  +++ +  +SIDPSLA+  +C  ++DFD+ C    K    ++  P+F
Sbjct: 322 DVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESREDFDNLCEELQKTTLPASKPPMF 381

Query: 396 TVTQTHKK 403
              +   K
Sbjct: 382 EFLEKRPK 389


>gi|194759168|ref|XP_001961821.1| GF15159 [Drosophila ananassae]
 gi|190615518|gb|EDV31042.1| GF15159 [Drosophila ananassae]
          Length = 402

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 105/355 (29%), Positives = 171/355 (48%), Gaps = 38/355 (10%)

Query: 51  IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYR 110
           + + V G     I    +D+W+LG  +   Q+  L             +D  SR+  +YR
Sbjct: 31  VGQAVGGGESEDIPRRNTDVWVLGKRYNAIQELEL-----------IRRDIQSRLWCTYR 79

Query: 111 KGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
            GF P+G+ ++T+D GWGCMLR  QM++AQAL+   LGR W     +  D  Y++I++ F
Sbjct: 80  CGFAPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-PECRDATYLKIVNRF 138

Query: 171 GDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 230
            D + S +SIH +   G++   A G W+GP  + +  + L R              +A++
Sbjct: 139 EDVKNSCYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVRFD--------DWCSLAVH 190

Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
           V              V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+
Sbjct: 191 VAMDS---------TVVLDDIYSLC----REGDSWKPLLLVIPLRLGITDINPMYVPALK 237

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----DLEADTSTY 346
                  S G++GG+P  + Y +G  ++  +YLDPH  Q    +G+     + E D  TY
Sbjct: 238 RCLELDSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGTVGQKTGVGEQEYD-ETY 296

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           H      ++  ++DPSLA+ F C+  D F+    +  +         LF ++QT 
Sbjct: 297 HQKHAARLNFSAMDPSLAVCFLCKTSDSFESLLTKFRQEVLGLCSPALFEISQTR 351


>gi|170032510|ref|XP_001844124.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167872594|gb|EDS35977.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 628

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 115/345 (33%), Positives = 164/345 (47%), Gaps = 59/345 (17%)

Query: 91  NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
           + G+  F +DF SR+ ++YRK F  + DS  TSD GWGCM+RS QML+AQ L+ H LGR 
Sbjct: 188 DEGIEAFKRDFISRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLITHFLGRG 247

Query: 151 WR-----KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSW 197
           WR     + L+  FD    E      I+  FGD  S TSPFSIH L+  GK  G   G W
Sbjct: 248 WRWDPSQEGLRLNFDSLQYEDGIHRKIIRWFGDTSSRTSPFSIHTLVALGKEAGKKPGDW 307

Query: 198 VGPYAMCRSWEALARCQRAET----GLGCQ-SLPMAIYVVSGDEDGERGGAPVV------ 246
            GP ++        +    E     G+    +   A+Y+    ++      P V      
Sbjct: 308 YGPGSVAHLLRQAVKLAAKEITDLDGINVYVAQDCAVYIQDILDECTVSTTPSVAPWQKK 367

Query: 247 ------CIDDASR------------------HCSVF---------SKGQADWTPILLLVP 273
                 C D  S+                  H + F         S   + W  ++LLVP
Sbjct: 368 MSSAAACTDSPSQATTPRVGATASCSSSSSPHATGFVAPSDTADESAPGSHWKSLILLVP 427

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           L LG EK+NP Y   L+   +    +GI+GG+P  S + VG QE+  I+LDPH  Q +++
Sbjct: 428 LRLGTEKLNPIYNDCLKAMLSLDNCIGIIGGRPKHSLFFVGYQEDKLIHLDPHYCQDMVD 487

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           + +++     S++H    R + L  +DPS  IGFYC  + DF  F
Sbjct: 488 VNQENFPV--SSFHCKSPRKMKLSKMDPSCCIGFYCATRKDFFKF 530


>gi|326925485|ref|XP_003208945.1| PREDICTED: cysteine protease ATG4C-like [Meleagris gallopavo]
          Length = 458

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 117/403 (29%), Positives = 173/403 (42%), Gaps = 79/403 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 113
           S  S ++LLG C+    DE+  L     N           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155

Query: 152 -------------RKPLQKPFDREYV-----------EILH-----LFGDSETSPFSIHN 182
                        R+P      +E +           E+ H      FGDS  + F +H 
Sbjct: 156 KLTASLEASLTAEREPRILSNHQERIRRNCGDGEMRDEVYHRKIISWFGDSPLAAFGLHQ 215

Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
           L++ GK  G  AG W GP  +           R     G     + +YV           
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQ--------D 262

Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
             V   D   R CS    G+ D   +++LVP+ LG E+ N  Y+  ++   +    +GI+
Sbjct: 263 CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGII 322

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DPS
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDPS 380

Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
             IGFYCR   DF+      +K+ + S+    PLFT  + H +
Sbjct: 381 CTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 423


>gi|301772016|ref|XP_002921445.1| PREDICTED: cysteine protease ATG4D-like [Ailuropoda melanoleuca]
          Length = 445

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 179/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 55  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 104

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 105 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSAPSPSEPSGLASPNRYRGPAR 164

Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                      +   +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 165 WMPPRWAQGTPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 220

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 221 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 267

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 268 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 327

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 328 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSELTR 385

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 386 VLSSSSATERYPMFTLAEGHAQ 407


>gi|398389911|ref|XP_003848416.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
 gi|339468291|gb|EGP83392.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
          Length = 440

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 156/307 (50%), Gaps = 45/307 (14%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDS----------------------KITSDVGWGCMLR 132
           ++F  DF SR+ ++YR  F PI  +                        TSD GWGCM+R
Sbjct: 109 SQFLDDFESRVWMTYRNNFPPIQKASDPAATSNMSFATKLRSLANQGNFTSDTGWGCMIR 168

Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
           S Q L+A  ++  RLGR WR+  +   ++++ EIL +F D+  +PFSIH  ++ G  A G
Sbjct: 169 SGQSLLANTVVMLRLGRDWRRGQK---EKQHHEILSMFADTPEAPFSIHKFVEHGASACG 225

Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
              G W GP        A ARC RA T      + + +Y    D D        V ID  
Sbjct: 226 TYPGEWFGP-------SATARCIRALTE-KYHDVGLRVYARPNDSD--------VYIDTL 269

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           +   +  S  +  ++P L+++ + LG+EKV P Y   L+     PQS+GI GG+P +S Y
Sbjct: 270 TATTTQHSASET-FSPTLIVLGVRLGIEKVTPAYHAALKSILELPQSVGIAGGRPSSSHY 328

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
            VG Q +   YLDPH  +P++         D  + H+  IR + +  +DPS+ +GF  RD
Sbjct: 329 FVGHQGDHFFYLDPHTTRPMLTAQP--TAEDVESCHTRRIRRLSIAEMDPSMLLGFLVRD 386

Query: 372 KDDFDDF 378
           K+DF+D+
Sbjct: 387 KEDFEDW 393


>gi|281337397|gb|EFB12981.1| hypothetical protein PANDA_010312 [Ailuropoda melanoleuca]
          Length = 428

 Score =  174 bits (442), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 179/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 38  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 87

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 88  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSAPSPSEPSGLASPNRYRGPAR 147

Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                      +   +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 148 WMPPRWAQGTPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 203

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 204 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 250

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 251 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 310

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 311 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSELTR 368

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 369 VLSSSSATERYPMFTLAEGHAQ 390


>gi|410950450|ref|XP_003981918.1| PREDICTED: cysteine protease ATG4D, partial [Felis catus]
          Length = 423

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 180/382 (47%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E+ GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 33  SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 82

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 83  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSEASGLGPSEPSGLASPNRYRGPAR 142

Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                      +   +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 143 WMPPRWAQGTPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 198

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 199 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 245

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 246 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 305

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 306 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 363

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 364 VLSCSSATERYPMFTLAEGHAQ 385


>gi|57101974|ref|XP_542069.1| PREDICTED: cysteine protease ATG4D isoform 1 [Canis lupus
           familiaris]
          Length = 473

 Score =  174 bits (441), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 111/384 (28%), Positives = 181/384 (47%), Gaps = 70/384 (18%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 154
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGPGLGPSEPAGLASPNRYRGPAR 192

Query: 155 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
                       L++  +R + +I+  F D   +PF +H L++ G++ G  AG W GP  
Sbjct: 193 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 248

Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 262
                  +A   R       +   + +YV       +   A +V   D +          
Sbjct: 249 -----SLVAHILRKAVESCSEITRLVVYVSQDCTVYKADVARLVARPDPT---------- 293

Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 294 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 353

Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 354 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSEL 411

Query: 383 SKLAEESNGA---PLFTVTQTHKK 403
           +++   S+     P+FT+ + H +
Sbjct: 412 TRVLSSSSATERYPMFTLAEGHAQ 435


>gi|322785465|gb|EFZ12136.1| hypothetical protein SINV_15051 [Solenopsis invicta]
          Length = 505

 Score =  174 bits (441), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 119/372 (31%), Positives = 174/372 (46%), Gaps = 66/372 (17%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 104
           S  S +WLLG C+    +  L  A+                      N + EF +DF SR
Sbjct: 80  SKESPVWLLGQCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 139

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR-PWR-KPLQKPFDRE 162
           + ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR  WR +P Q   +  
Sbjct: 140 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRGQWRWRPEQLTDESS 199

Query: 163 YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRA 216
           +  I+  FGD  T  SPFSIH L+  G + G  AG W GP    + +C++ E      RA
Sbjct: 200 HRMIIKWFGDQLTPESPFSIHKLVVLGASTGKRAGDWYGPSSVAHLLCQAME------RA 253

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
                 +   +A+YV        +    V C  D  R              ++LLVPL L
Sbjct: 254 SEDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGRRKA------------LILLVPLRL 301

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ------- 329
           G +K+NP Y P L    T    +G++GG+P  S Y +G Q++  I+LDPH  Q       
Sbjct: 302 GADKLNPVYAPCLTALLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQNEFYFRI 361

Query: 330 --------PVINIGKD-DLEAD----TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 376
                   P + I +  D+E +     +++H    R + L  +DPS  +GFY  DK+   
Sbjct: 362 LLSITDSLPYLFIQETVDVEGNEKFPLTSFHCTSPRKMLLSKMDPSCCVGFYFPDKESLT 421

Query: 377 DFCARASKLAEE 388
           DF     ++  +
Sbjct: 422 DFMETIQRIKNK 433


>gi|395850895|ref|XP_003798008.1| PREDICTED: cysteine protease ATG4D [Otolemur garnettii]
          Length = 471

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 104/349 (29%), Positives = 165/349 (47%), Gaps = 54/349 (15%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
           G   +  F +DF SR+  +YR+ F P+    +TSD GWGCMLRS QM++AQ LL H L R
Sbjct: 104 GEGDIQRFQRDFVSRLWFTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 163

Query: 150 PW--------------------RKPLQKPFDR------------EYVEILHLFGDSETSP 177
            W                    R P +    R            ++ +I+  F D   +P
Sbjct: 164 DWTWAEGRGLGPPELLASPSQYRVPARWMPPRWAQGTPELEQEHQHRQIVSWFADHPQAP 223

Query: 178 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
           FS+H L++ G++ G  AG W GP         +A   R       +   + +YV      
Sbjct: 224 FSLHRLVELGQSLGKKAGDWYGP-------SVVAHILRKAVESCSEVTHLVVYVSQDCTV 276

Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
            +   A +V   D +          A+W  +++LVP+ LG E +NP Y+P ++       
Sbjct: 277 YKADVARLVARPDPT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSEL 326

Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
            LGI+GGKP  S Y +G Q++  +YLDPH  QP ++I + D   +  ++H    R +   
Sbjct: 327 CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDISQADFPLE--SFHCTAPRKMAFT 384

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 403
            +DPS  +GFY   K +F+  C+  +++   S+     P+FT+ + H +
Sbjct: 385 KMDPSCTVGFYAGGKKEFETLCSELTRVLSSSSAMERYPMFTLAEGHAQ 433


>gi|344282757|ref|XP_003413139.1| PREDICTED: cysteine protease ATG4D-like [Loxodonta africana]
          Length = 473

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 112/382 (29%), Positives = 181/382 (47%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S ++L G  ++    E+ GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSK-ISSVYLCGHRYRF---ESEGD------IQRFQRDFMSRLWLTYRRDFPPLAG 132

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPLQ 156
             +TSD GWGCMLRS QML+AQ LL H L R W                      R P +
Sbjct: 133 GCLTSDCGWGCMLRSGQMLLAQGLLLHFLPRDWTWAEGSGLGPPELSGSASPSRYRGPAR 192

Query: 157 K----------PFDREYV--EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
           +            ++E+   +I+  F D   +PF +H L+  G++ G  AG W GP    
Sbjct: 193 RVPPHWAQCTPELEQEHWHRQIVSWFADHPQAPFGLHRLVALGQSSGKKAGDWYGP---- 248

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D           +A+
Sbjct: 249 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDP----------KAE 295

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 356 PHYCQPSVDVSQADFSLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTR 413

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 414 VLSSSSATERYPMFTLAEGHAQ 435


>gi|449508713|ref|XP_002198788.2| PREDICTED: cysteine protease ATG4C [Taeniopygia guttata]
          Length = 456

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 117/402 (29%), Positives = 179/402 (44%), Gaps = 79/402 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 112
           S  S ++LLG C+    +E+ G+ +  G+N           + EF +DF SRI ++YR+ 
Sbjct: 36  SRNSPVFLLGKCYHFKTEES-GELSTDGSNFDKISTEISGNVEEFRKDFISRIWLTYREE 94

Query: 113 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 151
           F  I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                     
Sbjct: 95  FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPEALDMESCDWESWTSSTV 154

Query: 152 ---------------------RKPLQKPFD----REYV---EILHLFGDSETSPFSIHNL 183
                                R P ++ +D    R  V   +I+  FGDS  + F +H L
Sbjct: 155 RKLTASLEASLTAERDPKVLARPPARRDWDGTEKRNEVYHRKIISWFGDSPLAAFGLHQL 214

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
           ++ GK  G  AG W GP  +           R     G     + +YV            
Sbjct: 215 IEYGKKSGKMAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD--------C 261

Query: 244 PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
            V   D   R CS+   G+A    +++L P+ LG E+ N  Y+  ++   +    +GI+G
Sbjct: 262 TVYSSDVIDRQCSLVDSGKAGTKAVIILFPVRLGGERTNTDYLEFVKGILSLEYCVGIIG 321

Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
           GKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DPS 
Sbjct: 322 GKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDPSC 379

Query: 364 AIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
            IGFYCR   DF+      +K+ + S+    PLFT  + H +
Sbjct: 380 TIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 421


>gi|195051960|ref|XP_001993206.1| GH13687 [Drosophila grimshawi]
 gi|193900265|gb|EDV99131.1| GH13687 [Drosophila grimshawi]
          Length = 393

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 106/364 (29%), Positives = 177/364 (48%), Gaps = 39/364 (10%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I    +++WLLG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPKRNANVWLLGKRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFVPLGEVQLT 91

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 180
           +D GWGCMLR  QM++AQAL+   LGR W      P   D  Y++I++ F D+  S +SI
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTPDCRDTTYLKIVNRFEDTRKSFYSI 148

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           H +   G++   A G W+GP  + +  + L R           SL + + + S       
Sbjct: 149 HQIALMGESQNKAVGEWLGPNTVAQILKILVRFD------DWSSLNVHVAMDS------- 195

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
                V +DD    C      ++ W P+LL+VPL LG+  +NP Y+P L+       S G
Sbjct: 196 ----TVVLDDIFTLCQ--EPSESAWKPLLLIVPLRLGISDINPIYVPALKRCLELNSSCG 249

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 357
           ++GG+P  + Y +G  ++  +YLDPH  Q    + +    A+     +YH      +   
Sbjct: 250 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGAVAQKTTAAEQELDESYHQKYAARLSFA 309

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGV 417
           ++DPSLA+ F C+ +D F++   +  +         LF ++Q+     + +D + E   +
Sbjct: 310 AMDPSLAVCFLCKTRDSFNELLQQLRQEVLSLCTPALFEISQSRAVDWDTADDI-EWPAM 368

Query: 418 PEDD 421
           P+ D
Sbjct: 369 PDID 372


>gi|395512609|ref|XP_003760528.1| PREDICTED: cysteine protease ATG4D [Sarcophilus harrisii]
          Length = 453

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 168/361 (46%), Gaps = 56/361 (15%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
           G   +  F +DF SR+ ++YR+ F P+    +TSD GWGCMLRS QML+AQ LL H   R
Sbjct: 84  GEGDIQRFQRDFVSRLWLTYRRDFPPLEGGSLTSDCGWGCMLRSGQMLLAQGLLLHFFSR 143

Query: 150 PW-----------RKP---------------------LQKPFDRE--YVEILHLFGDSET 175
            W           R+P                       + F++E  +  I+  F D   
Sbjct: 144 DWTWSEAVLHPGPREPELLRTMSPSRVGPPGPPAGALSPREFEQEEQHRRIVSWFADQPG 203

Query: 176 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
           +PF +H L++ G++ G  AG W GP         +A   R       +   + +YV    
Sbjct: 204 APFGLHRLVELGRSSGKRAGDWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDC 256

Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
              +   A +V   D S           +W  I++LVP+ LG E +NP Y+P ++     
Sbjct: 257 TVYKADVAQLVAQPDPS----------TEWKSIVILVPVRLGGETLNPVYVPCVKELLRL 306

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
              +GI+GGKP  S Y +G Q++  +YLDPH  QP ++  ++    +  ++H    R + 
Sbjct: 307 ELCIGIIGGKPRHSLYFIGYQDDFLLYLDPHYCQPFVDTSQESFPLE--SFHCTSPRKMA 364

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHSDVLG 412
              +DPS  IGFY  ++ +F+  C   +++   S+     P+FT+++ H +     +V  
Sbjct: 365 FSRMDPSCTIGFYAGNRKEFELLCLELTRVLNSSSATERYPMFTLSEGHAQEYGLEEVCS 424

Query: 413 E 413
           +
Sbjct: 425 Q 425


>gi|410226434|gb|JAA10436.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
 gi|410263516|gb|JAA19724.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
 gi|410294648|gb|JAA25924.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
 gi|410328737|gb|JAA33315.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
          Length = 474

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436


>gi|402904208|ref|XP_003914939.1| PREDICTED: cysteine protease ATG4D isoform 2 [Papio anubis]
          Length = 411

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 21  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 71  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 130

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ 373


>gi|402904206|ref|XP_003914938.1| PREDICTED: cysteine protease ATG4D isoform 1 [Papio anubis]
          Length = 474

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436


>gi|397476492|ref|XP_003809633.1| PREDICTED: cysteine protease ATG4D isoform 2 [Pan paniscus]
          Length = 411

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 21  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 71  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 234 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 352 VLSSSSAMERYPMFTLAEGHAQ 373


>gi|296232881|ref|XP_002761778.1| PREDICTED: cysteine protease ATG4D [Callithrix jacchus]
          Length = 474

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPF---------- 159
             +TSD GWGCMLRS QM++AQ LL H L R W            L  P           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGPASPSWYHGPAR 193

Query: 160 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                          +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPCWAQGAPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D S          A+
Sbjct: 250 ---SLVAHILRKAVESSSEVTRLLVYVSQDCTVYKADVARLVARPDPS----------AE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WNSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + +   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S      P+FT+ + H +
Sbjct: 415 VLSSSAATERYPMFTLAEGHAQ 436


>gi|114675367|ref|XP_512373.2| PREDICTED: cysteine protease ATG4D [Pan troglodytes]
          Length = 411

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 21  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 71  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ 373


>gi|397476490|ref|XP_003809632.1| PREDICTED: cysteine protease ATG4D isoform 1 [Pan paniscus]
          Length = 474

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 415 VLSSSSAMERYPMFTLAEGHAQ 436


>gi|327277326|ref|XP_003223416.1| PREDICTED: cysteine protease ATG4A-like [Anolis carolinensis]
          Length = 385

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 108/347 (31%), Positives = 171/347 (49%), Gaps = 35/347 (10%)

Query: 71  WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
           W+LG  H++  +++           +   D S+R+  +YR+ F PIG +  +SD GWGCM
Sbjct: 17  WILGRQHQLKTEKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 65

Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
           LR  QM++AQAL+   LGR W     K    EY  IL  F D +   +SIH + Q G   
Sbjct: 66  LRCGQMMLAQALICRHLGRDWHWEEHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGE 125

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGER------ 240
           G + G W GP  + +  + LA      +        +A+YV   +    ED ++      
Sbjct: 126 GKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCRLPN 177

Query: 241 GGAPVVCIDDASRHCSVFSKGQAD------WTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
              P V       H S+ S+ ++       W P+LL++PL LG+  +NP Y+   +  F 
Sbjct: 178 QNCPPVAHCSPLSHQSLLSRNRSPGGFCCGWKPLLLIIPLRLGINHINPVYVDAFKECFK 237

Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
            PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S +       +
Sbjct: 238 MPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQLFVDSEENSTVDDRSFHCQQAPHRM 297

Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
            + ++DPS+A+GF+C+++ DFD +C+   K   +     +F + Q H
Sbjct: 298 KIMNLDPSVALGFFCKEEKDFDTWCSLVQKEIHKQQSLRMFELIQKH 344


>gi|417401539|gb|JAA47652.1| Putative cysteine protease required for autophagy [Desmodus
           rotundus]
          Length = 473

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 111/384 (28%), Positives = 180/384 (46%), Gaps = 70/384 (18%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +    E+ GD      +  F +DF SR+ ++YR+ F P   
Sbjct: 83  SRTRFSKISS----VHLCGRRYCFESEGD------IQRFQRDFVSRLWLTYRRDFPPFAG 132

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWARGASLSPPEPSGLASSNRYRGPAH 192

Query: 152 --------RKP-LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
                   R P L++  +R + +I+  F D   +PF +H L++ G++ G  AG W GP  
Sbjct: 193 CMTPCWAQRAPELEQ--ERRHRQIVSWFADHPQAPFGLHQLVELGQSSGKKAGDWYGP-- 248

Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 262
                  +A   R       +   + +YV       +   A +V   D +          
Sbjct: 249 -----SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT---------- 293

Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 294 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 353

Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 354 LDPHYCQPAVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 411

Query: 383 SKLAEESNGA---PLFTVTQTHKK 403
           +++   S+     P+FT+ + H +
Sbjct: 412 TRVLSSSSTTERYPMFTLAEGHAQ 435


>gi|157126425|ref|XP_001660889.1| hypothetical protein AaeL_AAEL010516 [Aedes aegypti]
 gi|108873276|gb|EAT37501.1| AAEL010516-PA [Aedes aegypti]
          Length = 583

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 111/335 (33%), Positives = 155/335 (46%), Gaps = 67/335 (20%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 152
           +  F +DF +R+ ++YRK F  + DS  TSD GWGCM+RS QML+AQ LL H LGR WR 
Sbjct: 167 IEAFKRDFVTRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLLVHFLGRNWRW 226

Query: 153 ----KPLQKPF------DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 200
               + L+  +      D  + +I+  FGD  S TSPFSIH L+  GK  G   G W GP
Sbjct: 227 DATAESLRMNYHSLNYEDNVHRKIIRWFGDTSSRTSPFSIHTLVALGKETGKKPGDWYGP 286

Query: 201 YAMCRSWEALARCQRAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCIDDASRHCS 256
                   ++A   R    L  Q +     + +YV              V I D    C+
Sbjct: 287 -------GSVAHLLRQAVKLAAQEISDLDGVNVYVAQDC---------AVYIQDIIDECT 330

Query: 257 VFS---------------------------------KGQADWTPILLLVPLVLGLEKVNP 283
           V +                                      W  ++LLVPL LG EK+NP
Sbjct: 331 VSAGPTLAPWQKKSPGSSSSSTTSTSNSNPTTSSSTDSTDHWKSLILLVPLRLGAEKLNP 390

Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 343
            Y   L+   +    +GI+GG+P  S Y VG QE+  I+LDPH  Q ++++   +     
Sbjct: 391 IYSDCLKAMLSLDNCIGIIGGRPKHSLYFVGFQEDKLIHLDPHYCQDMVDVVNQE-NFPV 449

Query: 344 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           +++H    R + L  +DPS  IGFYC  + DF  F
Sbjct: 450 ASFHCKSPRKMKLSKMDPSCCIGFYCETRKDFFKF 484


>gi|195158262|ref|XP_002020011.1| GL13755 [Drosophila persimilis]
 gi|194116780|gb|EDW38823.1| GL13755 [Drosophila persimilis]
          Length = 678

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 114/318 (35%), Positives = 166/318 (52%), Gaps = 21/318 (6%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  +G+     G+  F +DF SR+ ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310

Query: 140 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR      L   + D  + +I+  FGD  S++SPFSIH L++ G+  G 
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430

Query: 245 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
            V    A R  +         Q  W  +++L+PL LG +K+NP Y   L+L  +    LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           I+GGKP  S Y VG QE+  I+LDPH  Q +++I ++       ++H    R + +  +D
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQEHF--SLHSFHCKSARKLKVSKMD 548

Query: 361 PSLAIGFYCRDKDDFDDF 378
           PS  IGFYC  K DFD F
Sbjct: 549 PSCCIGFYCATKTDFDSF 566


>gi|326924562|ref|XP_003208495.1| PREDICTED: cysteine protease ATG4A-like, partial [Meleagris
           gallopavo]
          Length = 421

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 170/356 (47%), Gaps = 52/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H + +D++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 53  VWILGRRHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 101

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W+    K    EY  IL  F D +   +SIH + Q G  
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 161

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204

Query: 250 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 285
           D  + C                  S   + +        W P+LL++PL LG+  +NP Y
Sbjct: 205 DIKKMCWSPPQSSSTAHSSAHLHRSALGRNRNTAGLCTGWKPLLLIIPLRLGINHINPVY 264

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 265 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 324

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +       + + ++DPS+A+GF+C+++ DFD++C+   K   +     +F + Q H
Sbjct: 325 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLQMFELVQKH 380


>gi|390177147|ref|XP_001357920.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
 gi|388858923|gb|EAL27056.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
          Length = 676

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 114/318 (35%), Positives = 166/318 (52%), Gaps = 21/318 (6%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 139
           A +  +G+     G+  F +DF SR+ ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310

Query: 140 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 192
           Q L+ H LGR WR      L   + D  + +I+  FGD  S++SPFSIH L++ G+  G 
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370

Query: 193 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 244
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430

Query: 245 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
            V    A R  +         Q  W  +++L+PL LG +K+NP Y   L+L  +    LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           I+GGKP  S Y VG QE+  I+LDPH  Q +++I ++       ++H    R + +  +D
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQEHF--SLHSFHCKSARKLKVSKMD 548

Query: 361 PSLAIGFYCRDKDDFDDF 378
           PS  IGFYC  K DFD F
Sbjct: 549 PSCCIGFYCATKTDFDSF 566


>gi|332266032|ref|XP_003282019.1| PREDICTED: cysteine protease ATG4B [Nomascus leucogenys]
          Length = 518

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 172/367 (46%), Gaps = 42/367 (11%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 145 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 191

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 192 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 251

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 252 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 303

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 304 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 363

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 364 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 423

Query: 347 HSDVIRHIHLDSIDPSLAI--GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
                  + +  +DPS+A+  G +   +    + C    +L+      P+F + +     
Sbjct: 424 CQHPPCRMSIAELDPSIAVVRGGHRSTQAFCAECCLGMKQLSLLGGALPMFELVEQQPSH 483

Query: 405 VNHSDVL 411
           +   DVL
Sbjct: 484 LACPDVL 490


>gi|380796527|gb|AFE70139.1| cysteine protease ATG4D, partial [Macaca mulatta]
          Length = 439

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 49  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 98

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 99  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 158

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 159 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 214

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 215 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 261

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 262 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 321

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 322 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 379

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 380 VLGSSSATERYPMFTLAEGHAQ 401


>gi|109123366|ref|XP_001101860.1| PREDICTED: cysteine protease ATG4D-like isoform 1 [Macaca mulatta]
          Length = 474

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 415 VLGSSSATERYPMFTLAEGHAQ 436


>gi|432099562|gb|ELK28703.1| Cysteine protease ATG4D, partial [Myotis davidii]
          Length = 392

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 114/392 (29%), Positives = 187/392 (47%), Gaps = 69/392 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +    E+ GD      +  F +DF+SR+ ++YR+ F P+  
Sbjct: 5   SRTSFSKISS----VHLCGRRYCFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 54

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 55  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGAGLSPPEPSGLASPNRHHGLAH 114

Query: 153 -KPLQ-----KPFDREYV--EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
            KP +        ++E+   +I+  F D   +PF +H L++ G+++G  AG W GP    
Sbjct: 115 WKPPRWAQGAPELEQEHWHRQIVSWFADHPQAPFGLHQLVELGQSWGKKAGDWYGP---- 170

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 171 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDCT----------AE 217

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++S +YLD
Sbjct: 218 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDSLLYLD 277

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ +     +  ++H    R +    +DPS  +GFY  ++ +F+  C+  ++
Sbjct: 278 PHYCQPTVDVSQAGFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGNRKEFETLCSELTR 335

Query: 385 LAEESNGA---PLFTVTQTHKKPVNHS-DVLG 412
           +   S      P+FT+ + H +  +HS D LG
Sbjct: 336 VLSSSAATQRYPMFTLAEGHAQ--DHSLDNLG 365


>gi|297276108|ref|XP_002801111.1| PREDICTED: cysteine protease ATG4D-like isoform 2 [Macaca mulatta]
          Length = 497

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 107 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 156

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 157 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 216

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 217 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 272

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 273 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 319

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 320 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 379

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 380 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 437

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 438 VLGSSSATERYPMFTLAEGHAQ 459


>gi|351710014|gb|EHB12933.1| Cysteine protease ATG4D [Heterocephalus glaber]
          Length = 607

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 108/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S I+L G  ++           G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 216 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 265

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 154
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 266 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWIEGPGLAHPELPGSASSSQGRGPAR 325

Query: 155 ----------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                     L++  +  + +I+  F D   +P  +H L++ G++ G  AG W GP    
Sbjct: 326 WMPPSCPWGALEREQELRHRQIVSWFADHPRAPLGLHRLVELGQSSGKKAGDWYGP---- 381

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   +A+YV       +   A +V   D +          A+
Sbjct: 382 ---SLVAHILRKAVESSSELTHLAVYVSQDCTVYKADVAHLVASPDPA----------AE 428

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 429 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 488

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  ++ + +  C+  ++
Sbjct: 489 PHYCQPTVDVSQADFSLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKELETLCSELTR 546

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 547 ILSSSSATERYPMFTLVEGHAQ 568



 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 39/93 (41%), Positives = 52/93 (55%), Gaps = 10/93 (10%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S I+L G  ++           G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 130 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 179

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
             +TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 180 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGW 212


>gi|327306465|ref|XP_003237924.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
 gi|326460922|gb|EGD86375.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
          Length = 454

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 111/318 (34%), Positives = 160/318 (50%), Gaps = 58/318 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 131
           +F  DF S++ I+YR  F PI        GDS I                TSD GWGCM+
Sbjct: 119 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSISLGVRLRSQLIDTQGFTSDTGWGCMI 178

Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
           RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +  G  A 
Sbjct: 179 RSGQALLANTLLFIRLGRDWRRGSKL---QEESELVSLFADHPRAPFSIHRFVHHGATAC 235

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 249
           G   G W GP A  +  +AL +    + GL        +Y+ S G +  E+    V C +
Sbjct: 236 GKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVACDE 287

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
                            P L+L+ + LG+++V P Y  +L+    FPQS+GI GG+P +S
Sbjct: 288 SGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGRPSSS 335

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHLDSID 360
            Y +  Q +S  YLDPH  +P +    +   D E+      + STYH+  +R +H+  +D
Sbjct: 336 HYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHIREMD 395

Query: 361 PSLAIGFYCRDKDDFDDF 378
           PS+ IGF  RD+DD++D 
Sbjct: 396 PSMLIGFLVRDEDDWEDL 413


>gi|194378178|dbj|BAG57839.1| unnamed protein product [Homo sapiens]
          Length = 411

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 21  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 71  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R           + +YV       +   A +V   D +          A+
Sbjct: 187 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRRMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ 373


>gi|27903825|ref|NP_116274.3| cysteine protease ATG4D [Homo sapiens]
 gi|61211809|sp|Q86TL0.1|ATG4D_HUMAN RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
           cysteine endopeptidase; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related cysteine endopeptidase
           4; AltName: Full=Autophagy-related protein 4 homolog D
 gi|27763975|emb|CAC85951.1| APG4-D protein [Homo sapiens]
 gi|46362497|gb|AAH68992.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [Homo sapiens]
 gi|119604524|gb|EAW84118.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
 gi|312151144|gb|ADQ32084.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
           construct]
          Length = 474

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R           + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436


>gi|62898327|dbj|BAD97103.1| APG4 autophagy 4 homolog D variant [Homo sapiens]
          Length = 474

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R           + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WMSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436


>gi|449273759|gb|EMC83168.1| Cysteine protease ATG4A, partial [Columba livia]
          Length = 395

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 168/356 (47%), Gaps = 52/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  D++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W+    K    EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 135

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178

Query: 250 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 285
           D  + C    +G                           W P+LL++PL LG+  +NP Y
Sbjct: 179 DIKKMCWSPPQGSGAAHSSAHLHRSALGRTKNAAGFCTGWKPLLLIIPLRLGINHINPVY 238

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 239 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDESF 298

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +       + + ++DPS+A+GF+C+++ DFD++C+   K   +     +F + Q H
Sbjct: 299 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 354


>gi|453080987|gb|EMF09037.1| putative cysteine protease atg4 [Mycosphaerella populorum SO2202]
          Length = 447

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 111/332 (33%), Positives = 158/332 (47%), Gaps = 49/332 (14%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 132
           ++F  DF SRI I+YR GF PI  S                        TSD GWGCM+R
Sbjct: 110 SDFIDDFESRIWITYRDGFPPIAKSTDPAAGSKMSFTTKLRSLTNQQGFTSDTGWGCMIR 169

Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 191
           S Q L+A  +L HRLGR WRK  ++    E+  IL LF D+  +PFSIH  ++ G +A G
Sbjct: 170 SGQSLLANTILLHRLGRDWRKGQKQ---EEHKNILSLFADTPEAPFSIHKFVEHGAQACG 226

Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
              G W GP        A ARC RA T        + +Y    D D            DA
Sbjct: 227 TYPGEWFGP-------NATARCLRALTD-KYHGAGLRVYARPNDSD---------VYADA 269

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
               +        + P L+++ + LG+EKV   Y   L+     PQS+GI GG+P +S Y
Sbjct: 270 LIETATQKDADDKFQPTLIVLGIRLGIEKVTSAYHVALKAALELPQSVGIAGGRPSSSHY 329

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
            +G Q +S  YLDPH  + +++        D  T H+  IR + L  +DPS+ +GF  R 
Sbjct: 330 FLGHQGDSFFYLDPHTTRHMLSPQPS--AEDIETCHTRRIRKLPLSEMDPSMLLGFLVRS 387

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQTHKK 403
           +++F+++     K   E  G  +  + +T  K
Sbjct: 388 QEEFEEW----RKAVLEMPGKAIIHIHETEPK 415


>gi|350631770|gb|EHA20141.1| hypothetical protein ASPNIDRAFT_178675 [Aspergillus niger ATCC
           1015]
          Length = 384

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 121/378 (32%), Positives = 180/378 (47%), Gaps = 54/378 (14%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 108
           +RI + +  P         S IW LG+ +   +D A      +     F  DF SRI ++
Sbjct: 11  KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKD-ANTRETQHAWPESFLLDFESRIWMT 69

Query: 109 YRKGFDPI----GDSK-------------------ITSDVGWGCMLRSSQMLVAQALLFH 145
           YR  F PI    GD K                    TSD GWGCM+RS Q L+A AL   
Sbjct: 70  YRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTGWGCMIRSGQSLLANALSML 129

Query: 146 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMC 204
            LGR WR+  +  F+ E  ++L LF D+ T+PFS+H  ++ G ++ G   G W GP A  
Sbjct: 130 VLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKYPGEWFGPSATA 186

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
           +  EAL+          C +  + +YV +   +  +         D +R+ S        
Sbjct: 187 KCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK-----FMDIARNTS------GA 227

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           + P L+L+   LG++ + P Y   L+    FPQS+GI GG+P AS Y VG Q     YLD
Sbjct: 228 FQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGRPSASHYFVGAQGSHLFYLD 287

Query: 325 PHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
           PH  +P +     G+   + +  TYH+  +R IH+  +DPS+ IGF  R+++D+ D+  R
Sbjct: 288 PHYTRPALPDRQEGELYSKEEVDTYHTRRLRRIHVRDMDPSMLIGFLIRNQEDWADWLKR 347

Query: 382 ASKLAEESNGAPLFTVTQ 399
                E   G P+  V +
Sbjct: 348 ----IEAVKGRPIIHVLK 361


>gi|449498615|ref|XP_002197397.2| PREDICTED: cysteine protease ATG4A [Taeniopygia guttata]
          Length = 412

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 169/356 (47%), Gaps = 52/356 (14%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  D++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W+    K    EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHC------------------SVFSKGQAD------WTPILLLVPLVLGLEKVNPRY 285
           D  + C                  S   + +        W P+LL++PL LG+  +NP Y
Sbjct: 181 DIKKMCWSPAQSSSVAHSSAHVHRSALGQNKNTAGLCPGWKPLLLIIPLRLGINHINPVY 240

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 345
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 241 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDKSF 300

Query: 346 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +       + + ++DPS+A+GF+C+++ DFD++C+   K   +     +F + Q H
Sbjct: 301 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 356


>gi|334326299|ref|XP_001366933.2| PREDICTED: cysteine protease ATG4D [Monodelphis domestica]
          Length = 482

 Score =  171 bits (433), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 105/398 (26%), Positives = 176/398 (44%), Gaps = 77/398 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
           +S S +  + +C +  Q E  GD      +  F +DF+SR+ ++YR+ F P+    +TSD
Sbjct: 79  TSFSKLSTVHLCGRRYQFEGEGD------IQRFQKDFASRLWLTYRRDFPPLDGGSLTSD 132

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------------- 152
            GWGCMLRS QML+AQ LL H   R W                                 
Sbjct: 133 CGWGCMLRSGQMLLAQGLLLHFFSRDWTWAEAVLPPSPRESELFRSMSPSRSGASWQRGS 192

Query: 153 -----------------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 195
                             P Q   + ++  I+  F D   +PF +H L++ G++ G  AG
Sbjct: 193 STASGLGRATWSTGGTLSPRQLEQEEQHRRIVSWFADQPGAPFGLHRLVELGRSSGKRAG 252

Query: 196 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 255
            W GP         +A   R       +   + +YV       +   A ++   D S   
Sbjct: 253 DWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDCTVYKADVAQLMAQPDPS--- 302

Query: 256 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
                   +W  +++LVP+ LG E +NP Y+P ++        +GI+GGKP  S Y +G 
Sbjct: 303 -------TEWKSVIILVPVRLGGETLNPVYVPCVKELLRLDLCIGIIGGKPRHSLYFIGY 355

Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
           Q++  +YLDPH  QP ++  ++    +  ++H    R +    +DPS  IGFY  ++ +F
Sbjct: 356 QDDFLLYLDPHYCQPCVDTSQERFPLE--SFHCTSPRKMAFSRMDPSCTIGFYAGNRKEF 413

Query: 376 DDFCARASKLAEESNGA---PLFTVTQTHKKPVNHSDV 410
           +  C   +++   S+     P+FT+++ H +  +  +V
Sbjct: 414 EMLCLELTRVLNSSSATERYPMFTLSEGHAQEYSLEEV 451


>gi|145245643|ref|XP_001395089.1| cysteine protease atg4 [Aspergillus niger CBS 513.88]
 gi|166990612|sp|A2QY50.1|ATG4_ASPNC RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|134079795|emb|CAK40930.1| unnamed protein product [Aspergillus niger]
          Length = 404

 Score =  171 bits (433), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 122/397 (30%), Positives = 181/397 (45%), Gaps = 72/397 (18%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 96
           +RI + +  P         S IW LG+ +   +D    +    N   E            
Sbjct: 11  KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKDANTRETPDKNNTRENVMGTTNYRKPS 70

Query: 97  -------FNQDFSSRILISYRKGFDPI----GDSK-------------------ITSDVG 126
                  F  DF SRI ++YR  F PI    GD K                    TSD G
Sbjct: 71  EHAWPESFLLDFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTG 130

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+RS Q L+A AL    LGR WR+  +  F+ E  ++L LF D+ T+PFS+H  ++ 
Sbjct: 131 WGCMIRSGQSLLANALSMLVLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKH 187

Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G ++ G   G W GP A  +  EAL+          C +  + +YV +   +  +     
Sbjct: 188 GAESCGKYPGEWFGPSATAKCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK--- 236

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
               D +R+ S        + P L+L+   LG++ + P Y   L+    FPQS+GI GG+
Sbjct: 237 --FMDIARNTS------GAFQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGR 288

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           P AS Y VG Q     YLDPH  +P +     G+   + +  TYH+  +R IH+  +DPS
Sbjct: 289 PSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVDTYHTRRLRRIHVRDMDPS 348

Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
           + IGF  R+++D+ D+  R     E   G P+  V +
Sbjct: 349 MLIGFLIRNQEDWADWLKR----IEAVKGRPIIHVLK 381


>gi|334350077|ref|XP_001376474.2| PREDICTED: cysteine protease ATG4A-like [Monodelphis domestica]
          Length = 417

 Score =  171 bits (432), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 115/373 (30%), Positives = 182/373 (48%), Gaps = 25/373 (6%)

Query: 39  VKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 98
           V   V  G  R I     GP    +  +   +W+LG  + +         A     ++  
Sbjct: 19  VTLCVFPGVKRHITILSDGPEE--LPETDEPVWILGKQYDLQ--------AVITEKSKLL 68

Query: 99  QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
            D S+R+  +YR+ F PIG +  +SD GWGCMLR  QM++AQAL+   LGR W   +Q+ 
Sbjct: 69  SDISARLWFTYRRKFSPIGGTGPSSDSGWGCMLRCGQMMLAQALICKHLGRDWCWEMQQE 128

Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEA 209
              EY  IL  F D +   +SIH + Q G   G + G W GP          A+   W +
Sbjct: 129 QPEEYHRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNS 188

Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 269
           LA     +  +  + +    ++       +   +P   +D  S H    S G   W P+L
Sbjct: 189 LAVYVSMDNTVVIEDIKKLCHMCPSHLTHDSSPSPGNGLDQ-STHLPEPSPG---WKPLL 244

Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
           L++PL LG+ ++NP YI   +  F  PQSLG +GGKP ++ Y +G      IYLDPH  Q
Sbjct: 245 LIIPLRLGINQINPVYIDAFKECFKMPQSLGALGGKPNSAYYFIGFLGNELIYLDPHTTQ 304

Query: 330 PVINIGKDDLEADTSTYHSDVIRH-IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
             ++  ++D   D  ++H     H + + ++DPS+A+GF+ ++++DFD++C    K   +
Sbjct: 305 TFVD-SEEDGTVDDQSFHCQQSPHRMQILNLDPSVALGFFFKEEEDFDNWCRLVQKEILK 363

Query: 389 SNGAPLFTVTQTH 401
                +F + Q H
Sbjct: 364 PQSLQMFELVQKH 376


>gi|395840680|ref|XP_003793181.1| PREDICTED: cysteine protease ATG4C [Otolemur garnettii]
          Length = 457

 Score =  171 bits (432), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 123/422 (29%), Positives = 181/422 (42%), Gaps = 80/422 (18%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      ++E L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENEMLSARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPGALNIENSDSESWTSHTVK 155

Query: 155 ---------------LQKP-------------FDREYVEILH-----LFGDSETSPFSIH 181
                          L+ P             +     EI H      FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETMRKYSDYHETRNEIYHRKIVSWFGDSPLAFFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           +     G     + IYV    +D    
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEAKHPDLQG-----ITIYVA---QDCTVY 267

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
            + V+    ASR     S+G  D   +++LVP+ LG E+ NP Y+  ++   +    +GI
Sbjct: 268 NSDVIDTQSASRT----SEGAED-KAVIILVPVRLGGERTNPDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  QP +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQPFVDVSVKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKKPVNHSDVLGETGGVPE 419
           S  IGFYCR+  DF+      +K+   S     PLFT    H +  + +          E
Sbjct: 381 SCTIGFYCRNVQDFERTSEEITKMLRISAKEKYPLFTFVNGHSRDYDFTSTTTNEDLFSE 440

Query: 420 DD 421
           D+
Sbjct: 441 DE 442


>gi|225685095|gb|EEH23379.1| peptidase family C54 [Paracoccidioides brasiliensis Pb03]
          Length = 508

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 110/345 (31%), Positives = 164/345 (47%), Gaps = 51/345 (14%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVG 126
           G++  A F  DF S+I ++YR GF  I  S                         T+D G
Sbjct: 138 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 197

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+RS Q L+A AL    LGR WR+  +   D+E   +L LF D   +PFSIH  ++ 
Sbjct: 198 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 254

Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G  A G   G W GP A  R  +AL+          C+   + +YV S   D        
Sbjct: 255 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 298

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +D  R  +     +A   P L+L+ + LG+++V P Y   L+    +PQS+GI GG+
Sbjct: 299 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 357

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           P +S Y +G Q     YLDPH  +P +     G+   E + ++YH+  +R +H+  +DPS
Sbjct: 358 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 417

Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNH 407
           + IGF  +D+DD+ D+      +A    G  +  V+     P  H
Sbjct: 418 MLIGFLIKDEDDWADWKRNVGSVA----GKAIVHVSDKENSPFGH 458


>gi|340931831|gb|EGS19364.1| cysteine protease-like protein [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 494

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 120/363 (33%), Positives = 173/363 (47%), Gaps = 65/363 (17%)

Query: 84  ALGDAAGNNG---LAEFNQDFSSRILISYRKGF-------DPIGDSKIT----------- 122
           A GDA G         F  DF SRI ++YR GF       DP   S ++           
Sbjct: 139 AYGDADGTTDGGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSALSFSMRLKTSFGA 198

Query: 123 ------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
                 SD GWGCM+RS Q L+A ALL  RLGR WR+      +RE   IL LF D   +
Sbjct: 199 DQAGFSSDTGWGCMIRSGQSLLANALLISRLGREWRRGQNPKAERE---ILSLFADDPRA 255

Query: 177 PFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
           P+S+HN ++ G +A G   G W GP A  R  +ALA    +E         + +Y     
Sbjct: 256 PYSLHNFVKHGAEACGKFPGEWFGPSATARCIQALANKHESE---------LRVYST--- 303

Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
                G  P V  D      ++ +     + P L+LV   LG++K+N  Y   L  T   
Sbjct: 304 -----GDLPDVYEDS---FMAIANPDGQHFHPTLVLVCTRLGIDKINKVYEQALISTLQM 355

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIR 352
            QS+GI GG+P  S Y +GVQ++   YLDPH  +P++      +D  + +  + H+  +R
Sbjct: 356 EQSIGIAGGRPSQSHYFIGVQDQWLFYLDPHYPRPMLPYRENPEDYTQEEVDSCHTRRLR 415

Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           H+H++ +DPS+ IGF  +D+DD+D + +    +     G  + TV+        H   LG
Sbjct: 416 HLHVEDLDPSMLIGFLIKDEDDWDTWKSAVKHV----QGKAIITVSP-------HDPALG 464

Query: 413 ETG 415
            TG
Sbjct: 465 GTG 467


>gi|320166566|gb|EFW43465.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 336

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/295 (32%), Positives = 149/295 (50%), Gaps = 25/295 (8%)

Query: 107 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 166
           ++YR  F  I DS   +D GWGCMLR  QML+A+A+    LG+ W    +K   +E    
Sbjct: 36  MTYRNHFAQIADSYYNTDAGWGCMLRCGQMLLARAMTVQHLGKNWAPTSRKQRHQEMARF 95

Query: 167 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
           L LF D+  +PFSIH + + G+A G   G W GP  + +  + L   QR+   + C    
Sbjct: 96  LPLFFDTPAAPFSIHRIAERGEALGKTIGQWFGPNTVAQVLKNLVNSQRSSLIVHCA--- 152

Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRY 285
               V++  E   +  A    + D  +H             +L+LVP+ LGL + +NP Y
Sbjct: 153 -MDGVLNRTEASTQLAA---ALSDGKKHS------------LLVLVPIRLGLNQSINPVY 196

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ-PVINIGKDDLEADTS 344
           IP L+ T   PQ LGI+GGKP A+ + VG   E+ +YLDPH VQ   + +  D +E    
Sbjct: 197 IPALKATLELPQCLGIIGGKPNAAHFFVGTVNENVLYLDPHVVQDAAMELTPDTVE---- 252

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
           ++   V+  + +  +DPS+   + C    + +D   R+ ++  +  G  LF V +
Sbjct: 253 SFSVAVLSKMAISDVDPSMCAAYLCSSVAELEDLGKRSKQITSQFRGYGLFDVIE 307


>gi|315047608|ref|XP_003173179.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
 gi|311343565|gb|EFR02768.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
          Length = 471

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 58/324 (17%)

Query: 96  EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 131
           +F  DF SR+ I+YR  F PI         DS +                TSD GWGCM+
Sbjct: 136 QFLDDFESRLWITYRSQFPPIPKMPKTGSSDSSMPLGVRLRSQLIDTQGFTSDTGWGCMI 195

Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
           RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +Q G  A 
Sbjct: 196 RSGQALLANTLLFLRLGRDWRRGSKI---QEESELVSLFADHPRAPFSIHRFVQHGATAC 252

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 249
           G   G W GP A  +  +AL +    + GL        +YV + G +  ER    V C +
Sbjct: 253 GKCPGEWFGPSAAAQCIQALVKSN-PQAGL-------RVYVTNDGSDIYERQFREVACDE 304

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
             S              P L+L+ + LG+++V P Y  +L+    +PQS+GI GG+P +S
Sbjct: 305 SGS------------IKPTLILLGVRLGIDRVTPIYWDSLKALLHYPQSVGIAGGRPSSS 352

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---------ADTSTYHSDVIRHIHLDSID 360
            Y +  Q +S  YLDPH  +P +    +  E          + STYH+  +R +H+  +D
Sbjct: 353 HYFIATQGDSFFYLDPHQTRPCLAPRSEPTEDEESHPYSPEELSTYHTRRLRRLHVREMD 412

Query: 361 PSLAIGFYCRDKDDFDDFCARASK 384
           PS+ IG   RD+ D++D  +R  +
Sbjct: 413 PSMLIGLLVRDEGDWEDLKSRVKE 436


>gi|312073335|ref|XP_003139474.1| hypothetical protein LOAG_03889 [Loa loa]
 gi|307765357|gb|EFO24591.1| hypothetical protein LOAG_03889 [Loa loa]
          Length = 458

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 121/425 (28%), Positives = 180/425 (42%), Gaps = 83/425 (19%)

Query: 50  RIHE---RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR-- 104
           R+HE   R+    +  +S        L     +A++ AL D+  N  +    + F+SR  
Sbjct: 11  RVHEEAKRLFADWKPAVSKMLETYLTLDPSFSVAENYALFDS--NLPIYLLGEKFTSRRD 68

Query: 105 -----------ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
                      +  +YRK F PIG    T+D GWGCMLR  QML+A+ L+   LGR W  
Sbjct: 69  MERIKDIMASLLWFTYRKNFQPIGGIGPTTDQGWGCMLRCGQMLLARVLIVRHLGRNWL- 127

Query: 154 PLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR--- 205
                +DR     EY  IL +F D + S FSIH +   G + G   G W GP    +   
Sbjct: 128 -----WDRDIKLAEYKRILRMFQDKKNSLFSIHQIAHMGVSEGKNIGEWFGPNTTAQVLK 182

Query: 206 ------SWEALA--------------------------RCQRAETGLGCQSLPMAIYVVS 233
                  W  LA                             R ETG        A+    
Sbjct: 183 KLVIYDQWSRLAVHVALDNVLITSDIRTMAFTRPPYRKSGSRRETGSDYNDNHDAVNPAE 242

Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
            +   E   +P       +   S +     +W P+L+++PL LGL  +N  Y P ++  F
Sbjct: 243 AEIFPESTRSPT---RSETSSISSYGGNSEEWRPLLIIIPLRLGLSTINRCYFPAIQAFF 299

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG---------------KDD 338
             PQ +GI+GG+P  + Y  G+ + + +YLDPH  Q  +++                K+D
Sbjct: 300 QLPQCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQDFVDLDETTATRDERDGYVEIKND 359

Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 398
            E   STYH   I    +D +DPSLA+GF C  +DD+++   R       ++  PLF + 
Sbjct: 360 -EFRDSTYHCPFILTTKIDKVDPSLALGFLCHTEDDYNELAQRLRTHLLPASTPPLFEML 418

Query: 399 QTHKK 403
           +T  K
Sbjct: 419 ETRPK 423


>gi|119591686|gb|EAW71280.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
          Length = 354

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 107/319 (33%), Positives = 155/319 (48%), Gaps = 40/319 (12%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 347 HSDVIRHIHLDSIDPSLAI 365
                  + +  +DPS+A+
Sbjct: 301 CQHPPCRMSIAELDPSIAV 319


>gi|348529755|ref|XP_003452378.1| PREDICTED: cysteine protease ATG4C-like [Oreochromis niloticus]
          Length = 478

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 120/416 (28%), Positives = 179/416 (43%), Gaps = 84/416 (20%)

Query: 65  SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLA-----EFNQDFSSRILISYRKGFDPIGD 118
           S  S + LLG C H  A+DE     A    L       F +DF+SR+ ++YR+ F P+  
Sbjct: 36  SRNSPVLLLGKCYHFKAEDEESPTEASVEDLVMGDVDAFRRDFASRVWLTYREEFSPLPG 95

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE------------- 162
           S +TSD GWGCMLR+ QM++AQ L+ H LGR   W + L  +P D E             
Sbjct: 96  STLTSDCGWGCMLRAGQMMLAQGLMLHFLGRDWTWSEALTLQPLDTETWTTTAAKRLVAS 155

Query: 163 ---------------------------------------YVEILHLFGDSETSPFSIHNL 183
                                                  +  ++  FGDS ++P  +H L
Sbjct: 156 LEASLQGVPGPSVRSSSPQAQALSLGSAEEADAHLKEMYHRTLVSWFGDSPSTPLGLHRL 215

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA--IYVVSGD------ 235
           ++ G   G  AG W GP  +    +     +  + GL C +  ++    V S D      
Sbjct: 216 VRLGLTMGKQAGDWYGPAVVAHILKKAVE-EAMDPGLACITAYVSQDCTVYSADVVDCHR 274

Query: 236 ------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 289
                    E   AP +  +D   H S   + +A    +++LVP+ LG EK NP Y    
Sbjct: 275 APRAERTSDETPDAPTLPQNDQPAHASTLPESRA----VIILVPVRLGGEKTNPEYFDFA 330

Query: 290 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSD 349
           +   +    +GI+GGKP  + Y VG Q++S IY+DPH  Q  +++   D      +YH  
Sbjct: 331 KSILSLEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSYHCP 388

Query: 350 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKK 403
             + +    +DPS  +GFY R   D++      SKL + S     P FT  Q H +
Sbjct: 389 SPKKMPFSKMDPSCTVGFYSRSVQDYERISQELSKLLQPSAKEKYPAFTFVQGHGR 444


>gi|390365223|ref|XP_785967.3| PREDICTED: cysteine protease ATG4B-like [Strongylocentrotus
           purpuratus]
          Length = 390

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 105/332 (31%), Positives = 159/332 (47%), Gaps = 50/332 (15%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           IW+LG  + ++Q +            E   D  SR+  +YRKGF  IG +  T+D GWGC
Sbjct: 48  IWILGKKYDLSQHQL-----------EARLDVLSRLWFTYRKGFSNIGGTGPTTDQGWGC 96

Query: 130 MLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
           MLR  QM++AQAL++  LGR WR +P ++  D  Y++IL LF D + S FSIH + Q G 
Sbjct: 97  MLRCGQMMLAQALVYKHLGRDWRWRPQEQ--DETYLKILQLFLDKKDSCFSIHQIAQMGV 154

Query: 189 AYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
             G   G W GP  + +         SW  LA     +  +  + +     V S  E+  
Sbjct: 155 GEGKKVGDWFGPNTVGQVIRKLSPFDSWSDLAVHVALDNTVVIEDIRKLCTVNSTTEETS 214

Query: 240 RGGAPV--------------------------VCIDDASRHCSVFSKGQADWTPILLLVP 273
             G+                            + + +     +  S G   W  + L++P
Sbjct: 215 SEGSKTGSERRKRTSSSENIRHKMQLSPENTNIQLPNGLMEGACVSPGGVSWRSLFLIIP 274

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           L LGL ++N  Y+  L+  FT PQSLG++GGKP  + Y +GV  +  +YLDPH  QP  +
Sbjct: 275 LRLGLNEINTVYMQRLKRCFTLPQSLGVIGGKPNHAHYFIGVLGDEMVYLDPHTTQPAAD 334

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           I K     D S +H +    + + ++DPS+ +
Sbjct: 335 IDKWAFLQDES-FHCEHASRMPIKNLDPSIGL 365


>gi|242814606|ref|XP_002486401.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218714740|gb|EED14163.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 454

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 110/337 (32%), Positives = 155/337 (45%), Gaps = 51/337 (15%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 132
           F  DF  RI ++YR GF PI  S+                         TSD GWGCM+R
Sbjct: 117 FLDDFECRIWMTYRSGFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 176

Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 191
           S Q L+A AL   RLGR WR+        E   +L LF D   +PFSIH  ++ G  Y G
Sbjct: 177 SGQSLLANALAISRLGRDWRRGSNST---EENRLLSLFADDPAAPFSIHKFVRHGALYCG 233

Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
              G W GP A     +AL+  +  + G       M +YV S +          V  + +
Sbjct: 234 KHPGEWFGPSATATCIQALSD-EYKDAG-------MNVYVSSDNTYVYEDKFKAVAYNQS 285

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
            R             P L+L+   LG++++ P Y   L      PQ+LGI GG+P AS Y
Sbjct: 286 DRM-----------RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQALGIAGGRPSASHY 334

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
            +GVQ     YLDPH  +P +     DL   + +  + H+  +R IH+D +DPS+ +GF 
Sbjct: 335 FIGVQNSFFFYLDPHHTRPALPYKTGDLAYTQEEIDSCHTRRLRRIHIDDMDPSMLVGFL 394

Query: 369 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 405
            RD++D+ D+  R +    E NG  +  +  T   P 
Sbjct: 395 IRDENDWMDWKRRITSSRPE-NGKAIIHIVDTKNVPT 430


>gi|443730776|gb|ELU16134.1| hypothetical protein CAPTEDRAFT_228011 [Capitella teleta]
          Length = 450

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 121/370 (32%), Positives = 171/370 (46%), Gaps = 50/370 (13%)

Query: 68  SDIWLLGVCHKIAQDEALGDAAGNNG------LAEFNQDFSSRILISYRKGFDPIGDSKI 121
           S I LLG C+  ++ E        N          F +DFSS+I  +YRK F  +  S +
Sbjct: 82  SPIILLGKCYCCSKSEKEDQRRQPNNSNILTTFDRFKRDFSSKIWFTYRKDFPKLYGSPL 141

Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGD--SETS 176
           TSDVGWGCMLR++QM++AQAL+ H LGR W     +   +E   + +I+ LFGD     S
Sbjct: 142 TSDVGWGCMLRTAQMIIAQALVMHYLGRDWTIHHTQQNRKETMLHRQIIRLFGDFPGNDS 201

Query: 177 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
           PFSI  L++ G  +G   G W GP ++                          YVV    
Sbjct: 202 PFSIQALVRIGVDHGKRPGDWYGPASVA-------------------------YVVRDAI 236

Query: 237 DGERGGAPV---VCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPT 288
           +      P+   VC+  A   C+V+ +   D     W  +++LVP+ LG E +NP Y   
Sbjct: 237 NQVPDFHPLLSQVCVYVAP-DCTVYIQDVIDLCTQHWKAVVILVPVRLGGEALNPIYSQC 295

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           ++        LGI+GG+P  S Y VG QEE  +YLDPH  Q  ++    D    TSTYH 
Sbjct: 296 VQSLLAHELCLGIIGGRPKHSLYFVGWQEEKLLYLDPHFCQDTVDTRFRDFP--TSTYHC 353

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA---EESNGAPLFTVTQTHKKPV 405
              R + L  +DPS  +GFY      F+       KL    ++    PLF         +
Sbjct: 354 LSPRKLALQKMDPSCTLGFYIPTHAAFNRLVKDMQKLVTPPKDQGIYPLFVFQDGRSIDI 413

Query: 406 NHSDVLGETG 415
            HS +  E+ 
Sbjct: 414 EHSHIKPESN 423


>gi|166990662|sp|A7F045.2|ATG4_SCLS1 RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
          Length = 439

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 104/309 (33%), Positives = 150/309 (48%), Gaps = 49/309 (15%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF ++I ++YR  F  I  S+                        TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
            Q L+A ALL  R+GR WR+      +R+   IL LF D   +P+SIH  ++ G  A G 
Sbjct: 163 GQSLLANALLTLRMGREWRRGSSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP        A ARC +A T    +S  + +Y+     D           +D  
Sbjct: 220 HPGEWFGP-------SAAARCIQALTNSQVES-ELRVYITGDGSD---------VYEDT- 261

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
              S+       +TP L+LV   LGL+K+ P Y   L+ +   PQS+GI GG+P +S Y 
Sbjct: 262 -FMSIAKPNSTKFTPTLILVGTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYF 320

Query: 313 VGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           +GVQE    YLDPH  +P +      +D    D  + H+  +R +H+  +DPS+ I F  
Sbjct: 321 IGVQESDFFYLDPHQTRPALPFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDPSMLIAFLI 380

Query: 370 RDKDDFDDF 378
           RD++D+ D+
Sbjct: 381 RDENDWKDW 389


>gi|212545090|ref|XP_002152699.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           marneffei ATCC 18224]
 gi|210065668|gb|EEA19762.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           marneffei ATCC 18224]
          Length = 489

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 112/355 (31%), Positives = 161/355 (45%), Gaps = 53/355 (14%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 132
           F  DF S+I ++YR  F PI  S+                         TSD GWGCM+R
Sbjct: 153 FLDDFESKIWMTYRSNFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 212

Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 191
           S QML+A AL   RLGR WR+        E  ++L LF D   +PFSIH  ++ G  Y G
Sbjct: 213 SGQMLLANALAISRLGRDWRRVSHT---TEENKLLSLFADDPAAPFSIHRFVRHGALYCG 269

Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
              G W GP A     +AL+   +           M +YV S               +D 
Sbjct: 270 KHPGEWFGPSATATCIQALSEEYKVAG--------MNVYVSSDS---------TYVYEDK 312

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
            +  +    G     P L+L+   LG++++ P Y   L      PQSLGI GG+P +S Y
Sbjct: 313 FKAVAYNQPGHM--RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQSLGIAGGRPSSSHY 370

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
            +GVQ     YLDPH  +P +    D    +    + H+  +R IH+D +DPS+ +GF  
Sbjct: 371 FIGVQNSFFFYLDPHHTRPALPHKVDSAYTQEQVDSCHTRRLRRIHIDDMDPSMLVGFLI 430

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP---VNHSDVLGETGGVPEDD 421
           RD++D+ D+  R +  + E NG  +  +  T   P   +     L E   + +DD
Sbjct: 431 RDENDWIDWKRRIAS-SREGNGKAIIHIIDTESVPTPTMEREAALDEVEALDDDD 484


>gi|425778592|gb|EKV16710.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
           digitatum PHI26]
          Length = 401

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 123/421 (29%), Positives = 186/421 (44%), Gaps = 75/421 (17%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 96
           +RI +    P  T    + S IW LG   + A  +   D A NN  +             
Sbjct: 9   KRIVQYFWDPEPTNNVPAAS-IWCLG--KEYAPPQPFSDPATNNPHSSSGQPDASTLNDT 65

Query: 97  -----FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWG 128
                F  DF SRI I+YR  F PI  +K                        TSD GWG
Sbjct: 66  AWPNAFVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGFTSDTGWG 125

Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 187
           CM+RS Q L+A A     LGR WR+  +   + E  +++ +F D   +PFSIH  +  G 
Sbjct: 126 CMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIHKFVNRGA 182

Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
           ++ G   G W GP A  +  + L+    A          + +YV +   D          
Sbjct: 183 ESCGKYPGEWFGPSATAKCIQLLSTQSEAHR--------LRVYVTNDTSD---------V 225

Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
            +D   H S    G     P L+L+   LG+E V P Y   LR   T+PQS+GI GG+P 
Sbjct: 226 YEDKFAHVSHDRSGCIQ--PTLILIGTRLGIENVTPAYWDGLRAALTYPQSVGIAGGRPS 283

Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAI 365
           AS Y +G Q+    +LDPH  +P      D+L  + +  +Y++  +R IH+  +DPS+ I
Sbjct: 284 ASHYFLGAQDCHLFFLDPHTTRPATPYRPDELYTQEELDSYYTSRLRRIHIKDMDPSMLI 343

Query: 366 GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN---HSDVLGETGGVPEDDS 422
           GF  +D++D+ D+     K  + + G P+  +     +P N    ++ L E   + + D 
Sbjct: 344 GFLIKDEEDWADW----KKRVQSTPGQPIVHMLPCQHQPDNGQGRAEALDEVEALDDSDE 399

Query: 423 L 423
           +
Sbjct: 400 I 400


>gi|14042153|dbj|BAB55127.1| unnamed protein product [Homo sapiens]
          Length = 331

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 158/319 (49%), Gaps = 27/319 (8%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
           V+C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VLCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 292

Query: 413 ETGGVPEDDSLGVMSMNDA 431
            + G  E   + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309


>gi|403291503|ref|XP_003936827.1| PREDICTED: cysteine protease ATG4B [Saimiri boliviensis
           boliviensis]
          Length = 319

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 99/300 (33%), Positives = 149/300 (49%), Gaps = 25/300 (8%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 245 VVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
           V C        DA+RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADANRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDSCFIPDESFHCQHPPC 232

Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292


>gi|345564445|gb|EGX47408.1| hypothetical protein AOL_s00083g501 [Arthrobotrys oligospora ATCC
           24927]
          Length = 444

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 110/301 (36%), Positives = 160/301 (53%), Gaps = 45/301 (14%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-------------------ITSDVGWGCMLRSSQML 137
           F  DF ++  ++YR  F PI  S                     TSD GWGCM+RS Q +
Sbjct: 111 FLDDFDAKFWMTYRSAFPPIPLSTTSRNMTLATRIRSLADQEGFTSDTGWGCMIRSGQCV 170

Query: 138 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGS 196
           +A A+   +LGR WR+  + P  +E   IL LF D   +PFS+HN ++ G+A  G+  G 
Sbjct: 171 LANAISLLKLGRDWRRG-KSP--QEEQHILSLFADDPRAPFSLHNFVKYGEASCGVYPGE 227

Query: 197 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 256
           W GP A  R  +ALA    A+   G Q     +Y+ +GD     GG      +DA R  +
Sbjct: 228 WFGPSATARCIQALA----AQHDEGLQ-----VYI-TGD-----GGD---VYEDAFRKIA 269

Query: 257 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 316
           +   G   + P L+LV + LG+E+V P Y   L+ +   PQS+GI GG+P AS Y +GVQ
Sbjct: 270 ISDDGV--FHPTLVLVGIRLGIERVTPVYWEALKSSLMMPQSVGIAGGRPSASHYFIGVQ 327

Query: 317 EESAIYLDPHDVQPVINIGKD-DLEADTSTY-HSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
            +S  YLDPH+ +P++   KD D  A+   + H+  +R +HL  +DPS+ + F  RD  D
Sbjct: 328 GQSLFYLDPHNTRPLLPYRKDSDYTAEEIEFCHTRKLRRLHLREMDPSMLLAFLIRDDRD 387

Query: 375 F 375
           +
Sbjct: 388 W 388


>gi|295657177|ref|XP_002789160.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226284504|gb|EEH40070.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 601

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 116/383 (30%), Positives = 179/383 (46%), Gaps = 55/383 (14%)

Query: 52  HERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRK 111
           + R+    R+G+S +      L   +    +     ++G++  A F  DF S+I ++YR 
Sbjct: 195 YHRLSTSDRSGLSPTRQ----LPFTNNTRPESTSSSSSGHDWPAPFLDDFESKIWLTYRS 250

Query: 112 GF-------DPIGDSKIT----------------SDVGWGCMLRSSQMLVAQALLFHRLG 148
           GF       DP   S +T                +D GWGCM+RS Q L+A AL    LG
Sbjct: 251 GFPFIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLASALSILSLG 310

Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSW 207
           R WR+  +   D+E   +L LF D   +PFSIH  ++ G  A G   G W GP A  R  
Sbjct: 311 RDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEYGASACGKYPGEWFGPSATARCI 367

Query: 208 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 267
           +AL+          C+   + +YV S   D           +D  R  +     +A   P
Sbjct: 368 QALSS--------ECKHAGLNVYVTSDGSD---------VYEDRFRTIASGGATEAGIHP 410

Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
            L+L+ + LG+++V P Y   L+    +PQS+GI GG+P +S Y +G Q     YLDPH 
Sbjct: 411 TLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGRPSSSHYFIGAQGSYFFYLDPHH 470

Query: 328 VQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
            +P +     G+   E + ++YH+  +R +H+  +DPS+ IGF  +D+DD+ D+      
Sbjct: 471 TRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPSMLIGFLIKDEDDWADWKRNVGS 530

Query: 385 LAEESNGAPLFTVTQTHKKPVNH 407
           +A    G  +  V      P  H
Sbjct: 531 VA----GKAIVHVFDKENSPFGH 549


>gi|239614382|gb|EEQ91369.1| cysteine protease atg4 [Ajellomyces dermatitidis ER-3]
 gi|327351393|gb|EGE80250.1| cysteine protease atg4 [Ajellomyces dermatitidis ATCC 18188]
          Length = 494

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 109/342 (31%), Positives = 162/342 (47%), Gaps = 54/342 (15%)

Query: 95  AEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCML 131
           A F  DF S+I ++YR  F       DP   S +T                +D GWGCM+
Sbjct: 117 AAFLDDFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMI 176

Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
           RS Q L+A AL    LGR WR+  +    +E   +L LF D   +PFSIH  ++ G  A 
Sbjct: 177 RSGQSLLANALAILFLGREWRRGTKV---KEESNLLSLFADDPRAPFSIHRFVEHGASAC 233

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
           G   G W GP A  R  +AL+          C+   + +YV S   D           +D
Sbjct: 234 GKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD---------VYED 276

Query: 251 ASRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
             R  ++ S G      D  P L+L+ + LG+++V P Y   L+    +PQ++GI GG+P
Sbjct: 277 RFR--AIASGGGTGTSTDIRPTLILLGIRLGIDRVTPVYWEALKAVLKYPQAVGIAGGRP 334

Query: 307 GASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
            +S Y +G Q     YLDPH  +P     + + +   + + +TYH+  +R +H+  +DPS
Sbjct: 335 SSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHTRRLRRLHIKDMDPS 394

Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
           + IGF  RD+DD+D++       A   NG  +  V      P
Sbjct: 395 MLIGFLIRDEDDWDNWKRNVRGGAVTGNGKAIIHVFDKETSP 436


>gi|355757609|gb|EHH61134.1| Cysteine protease ATG4A, partial [Macaca fascicularis]
          Length = 396

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 106/351 (30%), Positives = 168/351 (47%), Gaps = 43/351 (12%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 190

Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
            D  G+R    +   + +   S HC         W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 191 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 243

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 244 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 303

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
            + +++ ++DPS+A+    R     D  C  + +   + N   +F + Q H
Sbjct: 304 PQRMNILNLDPSVALVGIRRLSGPGDTMCTVSPQEILKEN-LRMFELVQKH 353


>gi|209969827|ref|NP_001123274.2| autophagy-specific gene 4 [Nasonia vitripennis]
          Length = 405

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 122/364 (33%), Positives = 177/364 (48%), Gaps = 31/364 (8%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD--SK 120
           I  + + +W+LG  +   +D           +    +D  SR+  +YRKGF PIG   S 
Sbjct: 46  IPQTENSVWVLGKKYNAKKD-----------IDAIRRDIRSRLWFTYRKGFVPIGGFGST 94

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPF 178
            TSD GWGCMLR  QM++ QAL+   LGR WR     P  R   Y+ IL  F D   +P+
Sbjct: 95  FTSDKGWGCMLRCGQMVLGQALISLHLGRDWR---WTPETRSSTYLNILRRFEDRRAAPY 151

Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
           SIH +   G + G   G W GP  + +  + L       +     +L   + V    +  
Sbjct: 152 SIHQIALMGASEGKDVGQWFGPNTIAQVLKKLVVYDDWSSITIHVALDNTLVVNDVVQQC 211

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
              GA    +D          K  + W P+LLL+PL LGL ++NP YI  L+ +F FPQS
Sbjct: 212 RVEGATTAEVDGEKPL-----KAPSQWKPLLLLIPLRLGLNEINPIYINGLKTSFQFPQS 266

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV--INIGKDDLEADT-STYHSDVIRHIH 355
           LG++GGKP  + Y +G   +  I+LDPH  Q    ++   DD EA+  +TYH  +   I 
Sbjct: 267 LGLIGGKPSHALYFIGYVGDEVIFLDPHTTQRAGSVDQKSDDNEAEVDATYHCKIASRIP 326

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS---DVLG 412
           +  +DPS+A+ F+C  + DF   C         +   PLF + Q  ++P + S   DV  
Sbjct: 327 ITGMDPSVALCFFCATEKDFMSLCRLMQDELIGNEKQPLFELCQ--ERPASWSPAEDVAA 384

Query: 413 ETGG 416
           E  G
Sbjct: 385 EALG 388


>gi|261195783|ref|XP_002624295.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
 gi|239587428|gb|EEQ70071.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
          Length = 494

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 109/342 (31%), Positives = 162/342 (47%), Gaps = 54/342 (15%)

Query: 95  AEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCML 131
           A F  DF S+I ++YR  F       DP   S +T                +D GWGCM+
Sbjct: 117 AAFLDDFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMI 176

Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
           RS Q L+A AL    LGR WR+  +    +E   +L LF D   +PFSIH  ++ G  A 
Sbjct: 177 RSGQSLLANALAILFLGREWRRGTKV---KEESNLLSLFADDPRAPFSIHRFVEHGASAC 233

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
           G   G W GP A  R  +AL+          C+   + +YV S   D           +D
Sbjct: 234 GKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD---------VYED 276

Query: 251 ASRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
             R  ++ S G      D  P L+L+ + LG+++V P Y   L+    +PQ++GI GG+P
Sbjct: 277 RFR--AIASGGGTGTSTDIRPTLILLGIRLGIDRVTPVYWEALKAVLKYPQAVGIAGGRP 334

Query: 307 GASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
            +S Y +G Q     YLDPH  +P     + + +   + + +TYH+  +R +H+  +DPS
Sbjct: 335 SSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHTRRLRRLHIKDMDPS 394

Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
           + IGF  RD+DD+D++       A   NG  +  V      P
Sbjct: 395 MLIGFLIRDEDDWDNWKRNLRGGAVTGNGKAIIHVFDKETSP 436


>gi|226294409|gb|EEH49829.1| cysteine protease atg4 [Paracoccidioides brasiliensis Pb18]
          Length = 513

 Score =  168 bits (425), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 106/324 (32%), Positives = 158/324 (48%), Gaps = 47/324 (14%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVG 126
           G++  A F  DF S+I ++YR GF  I  S                         T+D G
Sbjct: 143 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 202

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+RS Q L+A AL    LGR WR+  +   D+E   +L LF D   +PFSIH  ++ 
Sbjct: 203 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 259

Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G  A G   G W GP A  R  +AL+          C+   + +YV S   D        
Sbjct: 260 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 303

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +D  R  +     +A   P L+L+ + LG+++V P Y   L+    +PQS+GI GG+
Sbjct: 304 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 362

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           P +S Y +G Q     YLDPH  +P +     G+   E + ++YH+  +R +H+  +DPS
Sbjct: 363 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 422

Query: 363 LAIGFYCRDKDDFDDFCARASKLA 386
           + IGF  +D+DD+ D+      +A
Sbjct: 423 MLIGFLIKDEDDWADWKRNVGSVA 446


>gi|296206033|ref|XP_002750034.1| PREDICTED: cysteine protease ATG4B isoform 2 [Callithrix jacchus]
          Length = 319

 Score =  168 bits (425), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 99/300 (33%), Positives = 148/300 (49%), Gaps = 25/300 (8%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 245 VVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
           V C        DA RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADADRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDSCFIPDESFHCQHPPC 232

Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292


>gi|298231125|ref|NP_001177213.1| cysteine protease ATG4C [Sus scrofa]
 gi|296874486|gb|ADH81748.1| autophagy related 4-like protein C [Sus scrofa]
          Length = 458

 Score =  168 bits (425), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKLLPARSGCTIKDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
             +  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQLEGSALTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPDALNIENSDSESWTSNTAK 155

Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
                          L+ P             DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGRYSDDREKQNEIYHRKIISWFGDSPLTLFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   + C+  +    D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCASMAPDNTDDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKSSSKEKYPLFTFVNAHSRDYDFTSTTTNKEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|198417051|ref|XP_002128504.1| PREDICTED: similar to autophagy-related cysteine endopeptidase 2
           [Ciona intestinalis]
          Length = 422

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 168/359 (46%), Gaps = 58/359 (16%)

Query: 69  DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWG 128
           +IW+LG    +  + AL           F +   S +  +YRKG+ PIG +  TSD GWG
Sbjct: 39  NIWVLGSRFHLPHERAL-----------FLEHIKSFLWFTYRKGYTPIGGTGPTSDSGWG 87

Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
           CMLR  QML+A+AL    + + W+    KP    Y  ILH   D  +S +SIH + Q G 
Sbjct: 88  CMLRCGQMLLARALAELTMDKDWKWTEDKPQPPPYKRILHQLSDERSSCYSIHQIAQMGV 147

Query: 189 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
             G   G W GP  + +    L++  +           +AI+V   +          VCI
Sbjct: 148 EEGKEVGQWFGPNTISQVLRRLSQFDQENV--------LAIHVAMDN---------TVCI 190

Query: 249 DDASRHCSVFSKGQAD----------------------------WTPILLLVPLVLGLEK 280
           +D  R CS     Q +                            W P+LLL+PL LGL +
Sbjct: 191 EDIERLCSTTPTTQYEGACSSTCKPDRTKCNGDSPNVSPTSDDFWRPLLLLIPLRLGLSE 250

Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG--KDD 338
           +NP Y   L+    + +S+G++GGKP  + Y +G  E+S I+LDPH  QP + +     +
Sbjct: 251 INPVYFTHLKECLHWKESVGVIGGKPNHAYYFLGCSEDSMIFLDPHTTQPYVKLPDITSN 310

Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
              D +T+H D    + L ++DPSLA+GF C  +  F D C +  ++ +     PLF V
Sbjct: 311 ERYDDTTFHCDTPGRMLLTNLDPSLALGFICTTRGSFCDLCHKVKQMVKTPTSFPLFEV 369


>gi|238506146|ref|XP_002384275.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
           NRRL3357]
 gi|220690389|gb|EED46739.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
           NRRL3357]
          Length = 439

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 120/382 (31%), Positives = 169/382 (44%), Gaps = 66/382 (17%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 91
           +RI + +  P         + IW LGV +     KI       QDE       + D   +
Sbjct: 47  KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 106

Query: 92  NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 128
                F  DF S+I ++YR  F PI                            TSD GWG
Sbjct: 107 GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 166

Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 187
           CM+RS Q L+A A+L   LGR WR+  +     E   +L LF D   +P SIH  ++ G 
Sbjct: 167 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 223

Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
           ++ G   G W GP A  R  EAL+          C ++   +YV +   D        V 
Sbjct: 224 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 267

Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
            D   R   V   G     P L+L+   LG++ V P Y   L+     PQS+GI GG+P 
Sbjct: 268 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 324

Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLA 364
           AS Y +G Q     YLDPH  +P +    D     + + STYH+  +R IH+  +DPS+ 
Sbjct: 325 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDMDPSML 384

Query: 365 IGFYCRDKDDFDDFCARASKLA 386
           IGF  R++DD++D+  R   + 
Sbjct: 385 IGFLVRNEDDWEDWKGRVGSVV 406


>gi|317151014|ref|XP_001824388.2| cysteine protease atg4 [Aspergillus oryzae RIB40]
          Length = 402

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 120/381 (31%), Positives = 169/381 (44%), Gaps = 66/381 (17%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 91
           +RI + +  P         + IW LGV +     KI       QDE       + D   +
Sbjct: 11  KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 70

Query: 92  NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 128
                F  DF S+I ++YR  F PI                            TSD GWG
Sbjct: 71  GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 130

Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 187
           CM+RS Q L+A A+L   LGR WR+  +     E   +L LF D   +P SIH  ++ G 
Sbjct: 131 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 187

Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
           ++ G   G W GP A  R  EAL+          C ++   +YV +   D        V 
Sbjct: 188 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 231

Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
            D   R   V   G     P L+L+   LG++ V P Y   L+     PQS+GI GG+P 
Sbjct: 232 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 288

Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLA 364
           AS Y +G Q     YLDPH  +P +    D     + + STYH+  +R IH+  +DPS+ 
Sbjct: 289 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDMDPSML 348

Query: 365 IGFYCRDKDDFDDFCARASKL 385
           IGF  R++DD++D+  R   +
Sbjct: 349 IGFLVRNEDDWEDWKGRVGSV 369


>gi|119591685|gb|EAW71279.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_b
           [Homo sapiens]
          Length = 331

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 157/319 (49%), Gaps = 27/319 (8%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
           V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292

Query: 413 ETGGVPEDDSLGVMSMNDA 431
            + G  E   + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309


>gi|348586836|ref|XP_003479174.1| PREDICTED: cysteine protease ATG4C-like [Cavia porcellus]
          Length = 435

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 125/437 (28%), Positives = 190/437 (43%), Gaps = 82/437 (18%)

Query: 51  IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQ 99
           +H R +  ++T  S + S + LLG C+    ++           A+ D      + EF +
Sbjct: 1   MHTRWVLKTKTYFSRN-SPVLLLGKCYHFKYEDEHKMLTARSGCAIEDRVIAGNVDEFRK 59

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--------- 150
           DF SRI ++YR+ F PI  S +++D GWGC LR+ QML+AQ L+ H LGR          
Sbjct: 60  DFISRIWLTYREEFPPIEGSALSTDCGWGCTLRTGQMLLAQGLVLHFLGRAWIWPDALNI 119

Query: 151 -------WRKPLQKPFD--------------------REYVE----------------IL 167
                  W     K F                     +E +E                I+
Sbjct: 120 ENLDSESWTSHTVKKFAASFEASLSGERQLGTPALSLKETMEKYPNPHEVRDEVYHRKII 179

Query: 168 HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
             FGDS ++ F +H L++ G+  G  AG W GP  +           R     G     +
Sbjct: 180 SWFGDSPSALFGLHQLIECGRRSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----I 234

Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
            +YV    +D     + V+    ASR       G AD   +++LVP+ LG E+ N  Y+ 
Sbjct: 235 TVYVA---QDCTVYNSDVIDKQSASR-----PAGNADDKAVIILVPVRLGGERTNTDYLE 286

Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 347
            ++   +    +GI+GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H
Sbjct: 287 FVKGVLSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFH 344

Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPV 405
               + +    +DPS  IGFYCR+  DF       +K+ + S+    PLFT    H K  
Sbjct: 345 CPSPKKMSFRKMDPSCTIGFYCRNVQDFQRASEEITKMLKMSSKEKYPLFTFVHGHSKDY 404

Query: 406 NH-SDVLGETGGVPEDD 421
           +  S V  E     +DD
Sbjct: 405 DFTSTVANEEDLFSQDD 421


>gi|149507363|ref|XP_001514370.1| PREDICTED: cysteine protease ATG4C [Ornithorhynchus anatinus]
          Length = 459

 Score =  167 bits (424), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 114/404 (28%), Positives = 173/404 (42%), Gaps = 80/404 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNN-----------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +E  G    +N            + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKSEEDDGIPVRSNWAPEDPAVISGNVDEFRKDFVSRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
            P+G S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PPMGASGLTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPAALDMENSDSESWTSHTVK 155

Query: 155 -LQKPFDREYV--------------------------------EILHLFGDSETSPFSIH 181
            L   F+  +V                                +I+  FGDS  + F +H
Sbjct: 156 KLTASFEASWVGERDPRPPSASRNAPRGSGSVRDEMRNEGFHRKIISWFGDSPRTYFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L + GK  G  AG W GP  +           R     G     + +YV          
Sbjct: 216 QLTEYGKKSGKTAGDWYGPAVVAHILRKAVEEVRHPDLQG-----LTVYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +    G+ D   +L+LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVTDKLRASTDSGKTDDKAVLILVPVRLGGERTNIDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
           S  +GFYCR+  DF+      +K+ + S+    PLFT  + H +
Sbjct: 381 SCTVGFYCRNVQDFERASEEITKVLKASSKEKYPLFTFVKGHSR 424


>gi|431912280|gb|ELK14417.1| Cysteine protease ATG4B [Pteropus alecto]
          Length = 431

 Score =  167 bits (423), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 124/406 (30%), Positives = 185/406 (45%), Gaps = 43/406 (10%)

Query: 48  MRRIHERVLGPSRTGISSSTSDI--WLLGVCHKIAQDEALGDAAGNNGLA--EFNQDFSS 103
           MR    R   P R+ +SS+  +   W      +++    L     +      E   D +S
Sbjct: 1   MRPGPRRSCTPRRSALSSTLGEASDWCTAAAREVSAVSGLSQLQQDESYEKDEILSDVAS 60

Query: 104 RILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY 163
           R+  +YRK F  IG +  TSD GWGCMLR  QM+ AQAL+   LGR WR   +K     Y
Sbjct: 61  RLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSY 120

Query: 164 VEILHLFGDSETSPFSIHNLL------QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
             +L  F D + S +SIH +       +  +       S +GP  +C+S+ A+   +R  
Sbjct: 121 FSVLRAFMDRKDSYYSIHQIAPVHPQSRFWRQSASVRTSVLGP-QLCQSFAAVRLSRRRR 179

Query: 218 TGLGCQSLP--MAIYVVSGDEDGERGGAPVVCIDD--ASRHCSVFSKG--------QADW 265
             L   S P  +A++               V ++D  A RHC+    G           W
Sbjct: 180 WELVTLSSPGKLAVFDTWSALAVHIAMDNTVVMEDISADRHCNGVPAGAEVTHRPPLPPW 239

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLT-------------------FTFPQSLGIVGGKP 306
            P++LL+PL LGL  +N  Y+ TL+L                    F  PQSLG++GGKP
Sbjct: 240 RPLVLLIPLRLGLTDINEAYVGTLKLASTLVGLCSAAASLPLRQHCFMMPQSLGVIGGKP 299

Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
            ++ Y +G   E  IYLDPH  QP + +       D S +       + +  +DPS+A G
Sbjct: 300 NSAHYFIGYVGEELIYLDPHTTQPAVEVADRRSIPDESFHCQHPPSRMRIGELDPSIA-G 358

Query: 367 FYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           F+C+ +DDFDD+C +  KL+      P+F + +     +   DVL 
Sbjct: 359 FFCQTEDDFDDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 404


>gi|326478657|gb|EGE02667.1| cysteine protease atg4 [Trichophyton equinum CBS 127.97]
          Length = 454

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 108/328 (32%), Positives = 159/328 (48%), Gaps = 62/328 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 127
           +F  DF S++ I+YR  F PI  +                              TSD GW
Sbjct: 115 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 174

Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
           GCM+RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +  G
Sbjct: 175 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 231

Query: 188 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 245
             A G   G W GP A  +  +AL +    + GL        +Y+ S G +  E+    V
Sbjct: 232 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 283

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
            C +                 P L+L+ + LG+++V P Y  +L+    FPQS+GI GG+
Sbjct: 284 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 331

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHL 356
           P +S Y +  Q +S  YLDPH  +P +    +   D E+      + STYH+  +R +H+
Sbjct: 332 PSSSHYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHI 391

Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARASK 384
             +DPS+ IGF  RD+DD++D   R  +
Sbjct: 392 REMDPSMLIGFLVRDEDDWEDLKRRVRE 419


>gi|326470473|gb|EGD94482.1| hypothetical protein TESG_01998 [Trichophyton tonsurans CBS 112818]
          Length = 469

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 108/328 (32%), Positives = 159/328 (48%), Gaps = 62/328 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 127
           +F  DF S++ I+YR  F PI  +                              TSD GW
Sbjct: 130 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 189

Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
           GCM+RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +  G
Sbjct: 190 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 246

Query: 188 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 245
             A G   G W GP A  +  +AL +    + GL        +Y+ S G +  E+    V
Sbjct: 247 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 298

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
            C +                 P L+L+ + LG+++V P Y  +L+    FPQS+GI GG+
Sbjct: 299 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 346

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHL 356
           P +S Y +  Q +S  YLDPH  +P +    +   D E+      + STYH+  +R +H+
Sbjct: 347 PSSSHYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHI 406

Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARASK 384
             +DPS+ IGF  RD+DD++D   R  +
Sbjct: 407 REMDPSMLIGFLVRDEDDWEDLKRRVRE 434


>gi|149037474|gb|EDL91905.1| autophagy-related 4B (yeast), isoform CRA_b [Rattus norvegicus]
          Length = 319

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 96/300 (32%), Positives = 150/300 (50%), Gaps = 25/300 (8%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVG 60

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
            G + G W GP  + +  + LA      +        +A+++     V  +E  +   A 
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEISKLCRAS 112

Query: 245 VVCIDDAS------RHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
           + C   A+      RHC+    G         W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 LPCAGAAALSMESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPC 232

Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
            + +  +DPS+A+GF+C+ ++DF+D+C +  KL++     P+F + +     +   DVL 
Sbjct: 233 RMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 292


>gi|332232054|ref|XP_003265216.1| PREDICTED: cysteine protease ATG4C isoform 1 [Nomascus leucogenys]
 gi|332232056|ref|XP_003265217.1| PREDICTED: cysteine protease ATG4C isoform 2 [Nomascus leucogenys]
          Length = 458

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 117/421 (27%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    ++            + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIADHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
                        + L+ P             D E      + +I+  FGDS  +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLAPFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|83773128|dbj|BAE63255.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|325504923|dbj|BAJ83603.1| cysteine protease Atg4 [Aspergillus oryzae]
          Length = 356

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 111/341 (32%), Positives = 159/341 (46%), Gaps = 32/341 (9%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 108
           +RI + +  P         + IW LGV +     +   +   +N  A      + RI   
Sbjct: 11  KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70

Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
                DP G    TSD GWGCM+RS Q L+A A+L   LGR WR+  +     E   +L 
Sbjct: 71  L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121

Query: 169 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
           LF D   +P SIH  ++ G ++ G   G W GP A  R  EAL+          C ++  
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173

Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
            +YV +   D        V  D   R   V   G     P L+L+   LG++ V P Y  
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222

Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 344
            L+     PQS+GI GG+P AS Y +G Q     YLDPH  +P +    D     + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           TYH+  +R IH+  +DPS+ IGF  R++DD++D+  R   +
Sbjct: 283 TYHTRRLRRIHIQDMDPSMLIGFLVRNEDDWEDWKGRVGSV 323


>gi|325091702|gb|EGC45012.1| cysteine protease [Ajellomyces capsulatus H88]
          Length = 508

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 173/382 (45%), Gaps = 57/382 (14%)

Query: 58  PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 113
           P+R+  S++     LL    H+ +    LG     +    F  DF S+I ++YR  F   
Sbjct: 85  PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144

Query: 114 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
               DP                +     T+D GWGCM+RS Q L+A AL    LGR WR+
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRR 204

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 212
             +    +E  ++L LF D   +PFSIH  ++ G  A G   G W GP A  R  +AL+ 
Sbjct: 205 GTKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATARCIQALSS 261

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG------QADWT 266
                    C+   + +YV S   D           +D  R  ++ S G        D  
Sbjct: 262 --------ECEHAGLNVYVTSDGSD---------VYEDRFR--AIASAGGTGAGTSTDVH 302

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
           P L+L+ + LG+++V P Y   L+    +PQS+GI GG+P +S Y +G Q     YLDPH
Sbjct: 303 PTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFYLDPH 362

Query: 327 DVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
             +P +       +     + +TYH+  +R +H+  +DPS+ IGF  RD+DD++ +    
Sbjct: 363 HTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSWKRSV 422

Query: 383 SKLAEESNGAPLFTVTQTHKKP 404
              A    G  +  V    K P
Sbjct: 423 HNRAMIGTGKAIIHVFDKEKSP 444


>gi|391868733|gb|EIT77943.1| cysteine protease required for autophagy - Apg4p/Aut2p [Aspergillus
           oryzae 3.042]
          Length = 357

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 111/342 (32%), Positives = 159/342 (46%), Gaps = 32/342 (9%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 108
           +RI + +  P         + IW LGV +     +   +   +N  A      + RI   
Sbjct: 11  KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70

Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
                DP G    TSD GWGCM+RS Q L+A A+L   LGR WR+  +     E   +L 
Sbjct: 71  L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121

Query: 169 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
           LF D   +P SIH  ++ G ++ G   G W GP A  R  EAL+          C ++  
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173

Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
            +YV +   D        V  D   R   V   G     P L+L+   LG++ V P Y  
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222

Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 344
            L+     PQS+GI GG+P AS Y +G Q     YLDPH  +P +    D     + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPYFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
           TYH+  +R IH+  +DPS+ IGF  R++DD++D+  R   + 
Sbjct: 283 TYHTRRLRRIHIQDMDPSMLIGFLVRNEDDWEDWKGRVGSVV 324


>gi|426215654|ref|XP_004002085.1| PREDICTED: cysteine protease ATG4C isoform 1 [Ovis aries]
 gi|426215656|ref|XP_004002086.1| PREDICTED: cysteine protease ATG4C isoform 2 [Ovis aries]
          Length = 458

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 117/409 (28%), Positives = 175/409 (42%), Gaps = 80/409 (19%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +DE L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
                          L+ P             DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   + C+  +    +   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +  + +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFT 429


>gi|149709514|ref|XP_001500964.1| PREDICTED: cysteine protease ATG4C [Equus caballus]
          Length = 458

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 117/422 (27%), Positives = 179/422 (42%), Gaps = 80/422 (18%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE           + D      + EF +DF+SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHIIAGNVEEFRKDFTSRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDFESWTSNTVK 155

Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
                        + L+ P             D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSEERELKTPTISLKETIGRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   + C+  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCASMASDHADDKAVIILVPVRLGGERTNTDYLDFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHSDVLGETGGVPE 419
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +  + +    +   +  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTAAKEDDLFS 440

Query: 420 DD 421
           +D
Sbjct: 441 ED 442


>gi|340709295|ref|XP_003393246.1| PREDICTED: cysteine protease ATG4B-like isoform 1 [Bombus
           terrestris]
          Length = 383

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 109/297 (36%), Positives = 156/297 (52%), Gaps = 16/297 (5%)

Query: 92  NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 146
           N + E +   +D  S++  +YRK F PIG  +S  TSD GWGCMLR  QM++ QAL+   
Sbjct: 31  NAIRELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILH 90

Query: 147 LGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
           LGR W+   +   +  Y++IL  F D  T+ FSIH +   G + G   G W GP  + + 
Sbjct: 91  LGRDWQWTAETR-NSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQV 149

Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
            + L       +     +L   + V    +     G   V  D A     V  K  + W 
Sbjct: 150 LKKLVVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGA-----VPLKAPSQWK 204

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
           P+LLL+PL LGL ++NP YI  L+ +F  PQSLG++GGKP  + Y +G  E   IYLDPH
Sbjct: 205 PLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLGVIGGKPNLALYFIGCVENEVIYLDPH 264

Query: 327 DVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
             Q   ++GK    +++E D +TYH      I +  IDPS+A+ F+C  + DF   C
Sbjct: 265 TTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPITGIDPSVALCFFCATEKDFKSLC 320


>gi|194384462|dbj|BAG59391.1| unnamed protein product [Homo sapiens]
          Length = 319

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 98/300 (32%), Positives = 148/300 (49%), Gaps = 25/300 (8%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDSTVVMEEIRRLCRTS 112

Query: 245 VVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
           V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 292


>gi|426339167|ref|XP_004033531.1| PREDICTED: cysteine protease ATG4B isoform 1 [Gorilla gorilla
           gorilla]
 gi|426339169|ref|XP_004033532.1| PREDICTED: cysteine protease ATG4B isoform 2 [Gorilla gorilla
           gorilla]
 gi|221045722|dbj|BAH14538.1| unnamed protein product [Homo sapiens]
          Length = 319

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 98/300 (32%), Positives = 148/300 (49%), Gaps = 25/300 (8%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
           V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292


>gi|395733089|ref|XP_002813143.2| PREDICTED: cysteine protease ATG4B isoform 2 [Pongo abelii]
          Length = 331

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 156/319 (48%), Gaps = 27/319 (8%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRNS 112

Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
           V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
            + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292

Query: 413 ETGGVPEDDSLGVMSMNDA 431
            + G  E   + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309


>gi|296208133|ref|XP_002750954.1| PREDICTED: cysteine protease ATG4C [Callithrix jacchus]
          Length = 458

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 118/421 (28%), Positives = 178/421 (42%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
                        + L+ P             D E      + +++  FGDS  +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEIRNEIYHRKVISWFGDSPLAPFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLQFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|340709297|ref|XP_003393247.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Bombus
           terrestris]
          Length = 386

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 109/297 (36%), Positives = 156/297 (52%), Gaps = 16/297 (5%)

Query: 92  NGLAEFN---QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHR 146
           N + E +   +D  S++  +YRK F PIG  +S  TSD GWGCMLR  QM++ QAL+   
Sbjct: 34  NAIRELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILH 93

Query: 147 LGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
           LGR W+   +   +  Y++IL  F D  T+ FSIH +   G + G   G W GP  + + 
Sbjct: 94  LGRDWQWTAETR-NSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQV 152

Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
            + L       +     +L   + V    +     G   V  D A     V  K  + W 
Sbjct: 153 LKKLVVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGA-----VPLKAPSQWK 207

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
           P+LLL+PL LGL ++NP YI  L+ +F  PQSLG++GGKP  + Y +G  E   IYLDPH
Sbjct: 208 PLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLGVIGGKPNLALYFIGCVENEVIYLDPH 267

Query: 327 DVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
             Q   ++GK    +++E D +TYH      I +  IDPS+A+ F+C  + DF   C
Sbjct: 268 TTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPITGIDPSVALCFFCATEKDFKSLC 323


>gi|27763971|emb|CAC85555.1| Apg4-C protein [Mus musculus]
 gi|148698944|gb|EDL30891.1| autophagy-related 4C (yeast), isoform CRA_a [Mus musculus]
          Length = 458

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 117/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    ++           A+ D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155

Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
            F          DRE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H K  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429


>gi|342321655|gb|EGU13587.1| Cysteine protease ATG4 [Rhodotorula glutinis ATCC 204091]
          Length = 1119

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 134/455 (29%), Positives = 185/455 (40%), Gaps = 131/455 (28%)

Query: 91   NNGLAEFNQDFSSRILISYRKGF-----DPIGDSK------------------------- 120
            N   A F  D  SRI ++YR GF     DP   S                          
Sbjct: 644  NGWPAAFYHDSYSRIALTYRSGFPIIPCDPSSSSTGVVQGMLNNLSMSIGRGGHRGPSPT 703

Query: 121  -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-----------KPFDREYV 164
                 ++SD GWGCMLR+ Q L+A AL+   LGR WR+PL             P    Y 
Sbjct: 704  NAEGGLSSDTGWGCMLRTGQSLLANALVKVHLGRDWRRPLPLGDFITSSTSPVPSAATYA 763

Query: 165  EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
             IL LF D  S  SPFS+H   Q GK  G   G W GP     + + L            
Sbjct: 764  RILSLFLDDPSPISPFSVHRFAQQGKVLGKEIGEWFGPSTAAGAIKTLVNAYE------- 816

Query: 223  QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---W-TPILLLVPLVLGL 278
               P  + VVS             C+D       V +    D   W TP+L+L+ + LG+
Sbjct: 817  ---PAGLKVVS-------------CVDGTVYESEVVAASTKDGEKWKTPVLVLINVRLGI 860

Query: 279  EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN--IGK 336
            + VNP Y   ++  F  PQS+GI GG+P +S Y VG Q  S  Y+DPH  +P +   +  
Sbjct: 861  DGVNPIYYEAIKGIFRLPQSVGIAGGRPSSSYYFVGAQANSLFYIDPHHPRPAVPLVLPP 920

Query: 337  DD-------------LEADT----------------------STYHSDVIRHIHLDSIDP 361
            DD               ADT                      +TYH+D +R   L S+DP
Sbjct: 921  DDSLVRAAQHLPLTPSTADTPAKESARQLDDFLLAAYPDAAWATYHTDKVRKCALSSLDP 980

Query: 362  SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS---------DVLG 412
            S+ +GF   D+ D+ DF  R  +L++ S+  P+F +  +    +  S           L 
Sbjct: 981  SMLLGFLVEDERDWQDFRLRVQELSQASS--PIFAIAPSPPSWMRRSTSSAAPATVSALS 1038

Query: 413  ETGGVPEDDSLGVMSMN-----DAVGNAHEDDWQL 442
             T G   DDS   ++       D+ G +  +DW+L
Sbjct: 1039 PTIG---DDSFSEVAGEDVADADSAGFSEPEDWEL 1070


>gi|225543220|ref|NP_778194.3| cysteine protease ATG4C [Mus musculus]
 gi|225543224|ref|NP_001139439.1| cysteine protease ATG4C [Mus musculus]
 gi|341940254|sp|Q811C2.2|ATG4C_MOUSE RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
           cysteine endopeptidase; AltName: Full=Autophagin-3;
           AltName: Full=Autophagy-related cysteine endopeptidase
           3; AltName: Full=Autophagy-related protein 4 homolog C
          Length = 458

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 117/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    ++           A+ D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155

Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
            F          DRE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H K  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429


>gi|350425106|ref|XP_003494013.1| PREDICTED: cysteine protease ATG4B-like [Bombus impatiens]
          Length = 383

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 112/323 (34%), Positives = 163/323 (50%), Gaps = 24/323 (7%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 120
           I  +   +W+LG  +   ++           L    +D  S++  +YRK F PIG  +S 
Sbjct: 16  IPQTDEPVWILGKKYNAIRE-----------LDIIRRDIRSKLWFTYRKNFVPIGGYNST 64

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
            TSD GWGCMLR  QM++ QAL+   LGR W+   +   +  Y++IL  F D  T+ FSI
Sbjct: 65  FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWTAETR-NSTYLKILERFEDKRTAAFSI 123

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           H +   G + G   G W GP  + +  + L       +     +L   + V    +    
Sbjct: 124 HQIASMGASEGKEVGQWFGPNTIAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
            G   V  D A     V  K  + W P+LLL+PL LGL ++NP YI  L+ +F  PQSLG
Sbjct: 184 EGGTTVEADGA-----VPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLG 238

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHL 356
           ++GGKP  + Y +G  E   IYLDPH  Q   ++GK    +++E D +TYH      I +
Sbjct: 239 VIGGKPNLALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPI 297

Query: 357 DSIDPSLAIGFYCRDKDDFDDFC 379
             IDPS+A+ F+C  + DF   C
Sbjct: 298 TGIDPSVALCFFCATEKDFKSLC 320


>gi|301764643|ref|XP_002917740.1| PREDICTED: cysteine protease ATG4C-like [Ailuropoda melanoleuca]
 gi|281350282|gb|EFB25866.1| hypothetical protein PANDA_006093 [Ailuropoda melanoleuca]
          Length = 458

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 119/423 (28%), Positives = 175/423 (41%), Gaps = 81/423 (19%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 155 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 181
                                 QK   R Y +            I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTVSQKETIRRYSDDHEMQNEIYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEETRHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   + C+  +    D   +++L+P+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNH-SDVLGETGGVP 418
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +  +  S    E     
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDFFS 440

Query: 419 EDD 421
           ED+
Sbjct: 441 EDE 443


>gi|121704590|ref|XP_001270558.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
 gi|166990611|sp|A1CJ08.1|ATG4_ASPCL RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|119398704|gb|EAW09132.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
          Length = 400

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 106/313 (33%), Positives = 150/313 (47%), Gaps = 49/313 (15%)

Query: 96  EFNQDFSSRILISYRKGFDPIG----------------------DSK-ITSDVGWGCMLR 132
           EF  D  SRI I+YR  F PI                       DS+  TSD GWGCM+R
Sbjct: 75  EFLDDVESRIWITYRSNFTPIPKPPNQEANPAMTLTVHLRSQLMDSQGFTSDTGWGCMIR 134

Query: 133 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 191
           S Q L+A A+L   LGR WR+  +   +    ++LH F D   +PFSIH  +Q G  +  
Sbjct: 135 SGQSLLANAMLILLLGRDWRRGTEAGKE---AQLLHQFADHPEAPFSIHRFVQHGAEFCN 191

Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
              G W GP A  R  +AL     A+ G    S  + +Y+     D        +  D  
Sbjct: 192 KYPGEWFGPSATARCIQALV----AQQG----SSELRVYITDDTAD--------IYEDKF 235

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           +R   +      D+ P L+LV   LG++ V P Y   L+     PQS+GI GG+P AS Y
Sbjct: 236 AR---IAQAEHGDFIPTLILVGTRLGIDHVTPAYWDALKEALQLPQSVGIAGGRPSASHY 292

Query: 312 IVGVQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
            +GV  +   YLDPH  +P     ++       + +TYH+  +R IH+  +DPS+ IGF 
Sbjct: 293 FIGVHGQYLFYLDPHHTRPASLHQDVNDTLTHEEVNTYHTRRLRRIHIKDMDPSMLIGFI 352

Query: 369 CRDKDDFDDFCAR 381
            R ++D+ D+  R
Sbjct: 353 IRSREDWTDWKTR 365


>gi|148698945|gb|EDL30892.1| autophagy-related 4C (yeast), isoform CRA_b [Mus musculus]
          Length = 466

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 117/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    ++           A+ D      + EF +DF SRI ++YR+ F
Sbjct: 44  SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 103

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 104 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 163

Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
            F          DRE                          + +I+  FGDS  + F +H
Sbjct: 164 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 223

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 224 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 271

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 272 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 330

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 331 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 388

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H K  + +
Sbjct: 389 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 437


>gi|296489147|tpg|DAA31260.1| TPA: APG4 autophagy 4 homolog C [Bos taurus]
          Length = 458

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
                          L+ P             DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKMERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   + C+  +    +   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|166990665|sp|Q2U5B0.2|ATG4_ASPOR RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
          Length = 407

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 120/386 (31%), Positives = 169/386 (43%), Gaps = 71/386 (18%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA-----------QDE------ALG 86
           +RI + +  P         + IW LGV +     KI            QDE       + 
Sbjct: 11  KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPGKLGNYQDELEAGTSKID 70

Query: 87  DAAGNNGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITS 123
           D   +     F  DF S+I ++YR  F PI                            TS
Sbjct: 71  DVTAHGWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTS 130

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCM+RS Q L+A A+L   LGR WR+  +     E   +L LF D   +P SIH  
Sbjct: 131 DTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRF 187

Query: 184 LQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
           ++ G ++ G   G W GP A  R  EAL+          C ++   +YV +   D     
Sbjct: 188 VKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD----- 234

Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
              V  D   R   V   G     P L+L+   LG++ V P Y   L+     PQS+GI 
Sbjct: 235 ---VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIA 288

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSI 359
           GG+P AS Y +G Q     YLDPH  +P +    D     + + STYH+  +R IH+  +
Sbjct: 289 GGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDM 348

Query: 360 DPSLAIGFYCRDKDDFDDFCARASKL 385
           DPS+ IGF  R++DD++D+  R   +
Sbjct: 349 DPSMLIGFLVRNEDDWEDWKGRVGSV 374


>gi|440902657|gb|ELR53425.1| Cysteine protease ATG4C [Bos grunniens mutus]
          Length = 458

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
                          L+ P             DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIHHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   + C+  +    +   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|391335597|ref|XP_003742176.1| PREDICTED: cysteine protease ATG4B-like [Metaseiulus occidentalis]
          Length = 393

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 163/307 (53%), Gaps = 33/307 (10%)

Query: 99  QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW------R 152
           + FSS +  +YRK F  IG    TSD GWGCMLR+ QM++ QAL+   LGR W      R
Sbjct: 79  KSFSSMLWFTYRKNFAAIGGDGPTSDTGWGCMLRAGQMMLGQALIRKHLGRSWMWTSDDR 138

Query: 153 KPLQKPFDRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 211
            P     DRE Y+ IL +F D +++ FSIH +   G + G A G W GP  + ++ + L 
Sbjct: 139 LP-----DRENYLRILRMFQDKKSATFSIHQISLMGLSEGKAVGEWFGPNTVAQALKKLV 193

Query: 212 RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLL 271
           +              M ++V   +         ++ + D    C   +K    W P+LL+
Sbjct: 194 QYDHWS--------EMKLHVAMDN---------IIILSDIKSLCC--AKESNKWRPLLLV 234

Query: 272 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 331
           VPL LGL ++N  Y   +  +F    SLGI+GG+P  + Y +G+Q E  ++LDPH     
Sbjct: 235 VPLRLGLSEINDIYTNAVLNSFKMKHSLGIIGGRPSHALYFIGIQREELVFLDPHTTHNY 294

Query: 332 INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG 391
           +++  D+   + STYH    + + + ++DPS+A+ FY  D+D+ D +  +A +L  +++G
Sbjct: 295 VDL--DEEPYNDSTYHCQRAQRMKISNMDPSIAMCFYIGDEDELDQWRVQAKELLVDNSG 352

Query: 392 APLFTVT 398
             LF +T
Sbjct: 353 HMLFEIT 359


>gi|378731837|gb|EHY58296.1| autophagy-like protein 4 [Exophiala dermatitidis NIH/UT8656]
          Length = 480

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 106/328 (32%), Positives = 153/328 (46%), Gaps = 56/328 (17%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLRSS 134
           F  DF SRI ++YR  F PI  S+                       TSD GWGCM+RS 
Sbjct: 114 FLDDFESRIWMTYRSNFTPIPRSQEPSRASSMSFSVRLRNLTEREGFTSDTGWGCMIRSG 173

Query: 135 QMLVAQALLFHRLGRPWRK-------------PLQKPFDREYVEILHLFGDSETSPFSIH 181
           Q L+A  L+   LGR WR+                    +   EIL LF DS  +PFSIH
Sbjct: 174 QSLLANTLMLLHLGRDWRRDHTHTPTTSDSKPSSSSSSTKREAEILSLFADSPDAPFSIH 233

Query: 182 NLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
             +Q G  A G   G W GP        A A C R E    C +  + +YV     +   
Sbjct: 234 RFVQHGASACGKHPGQWFGP-------SATASCIR-ELSTECAAAGLRVYVTPSASE--- 282

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
                   +D  R  +  S       P L+L  + LGL+++ P Y   L+ + T+PQS+G
Sbjct: 283 ------LYEDRFRSIAAASPSDPTIKPTLILFGIRLGLDRITPVYHEALKSSLTYPQSIG 336

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLD 357
           I GG+P +S Y VG Q +   YLDPH+ +P +       D  E + +T H+  +R + ++
Sbjct: 337 IAGGRPSSSHYFVGCQGDLFFYLDPHETRPALPHHASPADYSEEEIATCHTRRLRGLRIN 396

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKL 385
            +DPS+ IGF  +D+ D++D+  R  ++
Sbjct: 397 EMDPSMLIGFLIKDEADWEDWKRRIKEV 424


>gi|37748391|gb|AAH58981.1| Autophagy-related 4C (yeast) [Mus musculus]
          Length = 458

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 116/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    ++           A+ D      + EF +DF SR+ ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRLWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155

Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
            F          DRE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H K  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429


>gi|417401291|gb|JAA47536.1| Putative cysteine protease required for autophagy [Desmodus
           rotundus]
          Length = 458

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 120/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C H   +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----------------LQ 156
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                 ++
Sbjct: 96  PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 157 K-----------------------------PFDRE------YVEILHLFGDSETSPFSIH 181
           K                             P DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGRYPDDREMQNEVYHRKIISWFGDSPVALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +    D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQRASMTSDNTDGKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFQRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNKEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|397475554|ref|XP_003809200.1| PREDICTED: cysteine protease ATG4C isoform 1 [Pan paniscus]
 gi|397475556|ref|XP_003809201.1| PREDICTED: cysteine protease ATG4C isoform 2 [Pan paniscus]
          Length = 458

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 120/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
            F          +RE+                           +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF       +K+ E S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLEFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|367047453|ref|XP_003654106.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
 gi|347001369|gb|AEO67770.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
          Length = 454

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 110/309 (35%), Positives = 153/309 (49%), Gaps = 50/309 (16%)

Query: 97  FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 133
           F  DF SRI ++YR GF       DP  +S ++                SD GWGCM+RS
Sbjct: 118 FLDDFESRIWMTYRTGFELIPRSTDPRANSALSFAMRLKTSFGDQTGFSSDTGWGCMIRS 177

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
            Q L+A AL   RLGR WR+      +RE   IL LF D   +P+S+HN ++ G A  G 
Sbjct: 178 GQSLLANALQISRLGRDWRRATDPDAERE---ILSLFADDPRAPYSLHNFVKHGAAACGK 234

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  EALA   + E+ L   S                G  P V  D   
Sbjct: 235 YPGEWFGPSATARCIEALA--NQHESSLRVYST---------------GDLPDVYEDS-- 275

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
              +V +     + P L+LV   LG++K+N  Y   L  T    QS+GI GG+P +S Y 
Sbjct: 276 -FMAVANPDGEHFHPTLILVCTRLGIDKINQVYEEALISTLQMEQSIGIAGGRPSSSHYF 334

Query: 313 VGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           VGVQ +   YLDPH  +P +      +D    +  + H+  +RH+H++ +DPS+ IGF  
Sbjct: 335 VGVQGQWLFYLDPHHPRPALPYREAPEDYTSEELGSCHTRRLRHLHVEDMDPSMLIGFLI 394

Query: 370 RDKDDFDDF 378
           +D+DD+D +
Sbjct: 395 KDEDDWDTW 403


>gi|355669960|gb|AER94694.1| ATG4 autophagy related 4-like protein D [Mustela putorius furo]
          Length = 388

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 169/366 (46%), Gaps = 63/366 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 50  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 99

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 155
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 100 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSGLGPSEPSGLASPNRYRGPAR 159

Query: 156 -----------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
                      +   +R + +I+  F D   +PF +H L   G++ G  AG W GP    
Sbjct: 160 WVPPRWAHGTPELEQERRHRQIVSWFADHPRAPFGLHRLGGLGQSSGKKAGDWYGP---- 215

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 216 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 262

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 263 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 322

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 323 PHYCQPTVDVTQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSELTR 380

Query: 385 LAEESN 390
           +   S+
Sbjct: 381 VLSSSS 386


>gi|355669957|gb|AER94693.1| ATG4 autophagy related 4-like protein C [Mustela putorius furo]
          Length = 396

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 112/386 (29%), Positives = 163/386 (42%), Gaps = 78/386 (20%)

Query: 89  AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
           AGN  + EF +DF SRI ++YR+ F  I  S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 11  AGN--VEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 68

Query: 149 RPWRKP----------------------------------------LQKPFDREYVE--- 165
           R W  P                                         QK   R Y +   
Sbjct: 69  RAWTWPDALNIENSDSESWTSNTVKKFTASFEASLSGEGELKTPTVSQKEAIRRYSDDHE 128

Query: 166 ---------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
                    I+  FGDS  + F +H L++ GK  G  AG W GP  +           R 
Sbjct: 129 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 188

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
               G     + IYV             V   D   + C+  +    D   +++L+P+ L
Sbjct: 189 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRL 235

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G E+ N  Y+  ++   +    +GI+GGKP  S Y  G Q++S IY+DPH  Q  +++  
Sbjct: 236 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 295

Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 394
            D   +  T+H    + +    +DPS  IGFYCR+  DF       +K+ + S+    PL
Sbjct: 296 KDFPLE--TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPL 353

Query: 395 FTVTQTHKK-------PVNHSDVLGE 413
           FT    H +         N  D+  E
Sbjct: 354 FTFVNGHSRDYDFTSTTTNEEDLFSE 379


>gi|74147895|dbj|BAE22307.1| unnamed protein product [Mus musculus]
          Length = 458

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 116/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    ++           A+ D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155

Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
            F          DRE                          + +I+  FG+S  + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGNSPVAVFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H K  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKILKISSKEKYPLFTFVNGHSKDFDFT 429


>gi|126723748|ref|NP_001075911.1| cysteine protease ATG4C [Bos taurus]
 gi|126010621|gb|AAI33599.1| ATG4C protein [Bos taurus]
          Length = 458

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 175/421 (41%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 155 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
                          L+ P             DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L+  GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIAYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   + C+  +    +   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|395530478|ref|XP_003767321.1| PREDICTED: cysteine protease ATG4C [Sarcophilus harrisii]
          Length = 458

 Score =  164 bits (415), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 116/421 (27%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 111
           S  S + LLG C+    +E    A       G N        + EF +DF SRI ++YR+
Sbjct: 36  SRNSPVLLLGKCYHFKSEEENDPAPVQPQWVGENEPVVVSGNVEEFRRDFISRIWLTYRE 95

Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR------------------- 152
            F  I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                    
Sbjct: 96  EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDVDNSDSESWTSHT 155

Query: 153 ---------------------KPLQKPFDRE----------YVEILHLFGDSETSPFSIH 181
                                 P+++P  R           + +I+  F DS  + F +H
Sbjct: 156 VKKLTASLEASLTGERAAQDPSPIKEPPRRGSDDGGGEESCHRKIVSWFADSPLACFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEHGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   + CS       +   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYKADVIDKQCSSMDPENTEDKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESN--GAPLFTVTQTHKK-------PVNHSDVLG 412
           S  +GFYCR+  DF+      +K+ + S+    PLFT  + H +       P N  D+  
Sbjct: 381 SCTVGFYCRNIQDFERASEEITKVLKASSREKYPLFTFVKGHARDYDFTCTPTNEDDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|406862068|gb|EKD15120.1| putative cysteine protease atg4 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 441

 Score =  164 bits (415), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 125/372 (33%), Positives = 176/372 (47%), Gaps = 39/372 (10%)

Query: 9   GASKCFSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTS 68
           GA+ C   S PD  + S  S  S   ++       + GS     E V G     +    S
Sbjct: 50  GATACTPSSLPDLKSASAESSRSAQPATPPDSTASSLGSGVHEDEDVGGWPTPFLDDFES 109

Query: 69  DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWG 128
            IWL       +Q  A+  +     L+  +     R  +  + GF        TSD GWG
Sbjct: 110 KIWLT----YRSQFPAIPKSQDPKALSSMSLSVRLRSQLVDQAGF--------TSDTGWG 157

Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
           CM+RS Q L+A AL+  R+GR WR+       +E   I+ LF D+ T+P+SIHN ++ G 
Sbjct: 158 CMIRSGQSLLANALVMLRMGRDWRR--GSSASQEERSIISLFADTPTAPYSIHNFVEHGA 215

Query: 189 AY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGDEDGERGGAPVV 246
           A  G   G W GP A  R  +ALA         G QS  + +YV   G E  E     + 
Sbjct: 216 AACGKHPGEWFGPSATARCIQALAN--------GHQSPELRVYVTGDGLEVYEDSFMKIA 267

Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
             D           GQA + P L+LV   LGL+K+ P Y   L+ +   PQSLGI GG+P
Sbjct: 268 KPD-----------GQA-FIPTLILVGTRLGLDKITPVYWEALKSSLQIPQSLGIAGGQP 315

Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSL 363
            +S Y +GVQ     YLDPH  +P + +    +D  + D  + H+  +R IH+  +DPS+
Sbjct: 316 SSSHYFIGVQGHHFFYLDPHQTRPALPLPDNIEDYSQEDIDSCHTRRLRRIHIKEMDPSM 375

Query: 364 AIGFYCRDKDDF 375
            I F  RD+DD+
Sbjct: 376 LIAFLIRDEDDW 387


>gi|45861658|gb|AAS78582.1| Aut2B1 [Bos taurus]
          Length = 342

 Score =  164 bits (415), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 106/312 (33%), Positives = 149/312 (47%), Gaps = 26/312 (8%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L  F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 243
            Q G   G + G W GP  + +  + LA      + L          V++      R   
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187

Query: 244 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
           P    +    D+ RHC+ F          A W P++LL+PL LGL  VN  Y  TL+  F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307

Query: 354 IHLDSIDPSLAI 365
           + +  +DPS+A+
Sbjct: 308 MSIAELDPSIAV 319


>gi|335774946|gb|AEH58408.1| cysteine protease ATG4C-like protein, partial [Equus caballus]
          Length = 400

 Score =  164 bits (415), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 110/387 (28%), Positives = 169/387 (43%), Gaps = 71/387 (18%)

Query: 89  AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
           AGN  + EF +DF+SRI ++YR+ F  I  S +T+D GWGC +R+ QML+AQ L+ H LG
Sbjct: 15  AGN--VEEFRKDFTSRIWLTYREEFPQIEGSTLTTDCGWGCTVRTGQMLLAQGLILHFLG 72

Query: 149 RPW----------------------------------RKPLQKPF------------DRE 162
           R W                                   + L+ P             D E
Sbjct: 73  RAWTWPDALNIENSDFESWTSNTVKKFTASFEASLSEERELKTPTISLKETIGRYSDDHE 132

Query: 163 ------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
                 + +I+  FGDS  + F +H L++ GK  G  AG W GP  +           R 
Sbjct: 133 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 192

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
               G     + IYV             V   D   + C+  +   AD   +++LVP+ L
Sbjct: 193 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCASMASDHADDKAVIILVPVRL 239

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G E+ N  Y+  ++   +    +GI+GGKP  S Y  G Q++S IY+DPH  Q  +++  
Sbjct: 240 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 299

Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 394
            D   +  T+H    + +    +DPS  IGFYCR+  DF       +K+ + S+    PL
Sbjct: 300 KDFPLE--TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPL 357

Query: 395 FTVTQTHKKPVNHSDVLGETGGVPEDD 421
           FT    H +  + +    +   +  +D
Sbjct: 358 FTFVNGHSRDYDFTSTAAKEDDLFSED 384


>gi|116283594|gb|AAH18678.1| ATG4C protein [Homo sapiens]
          Length = 451

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGSVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
            F          +RE+                           +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGEERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|367032280|ref|XP_003665423.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
           42464]
 gi|347012694|gb|AEO60178.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
           42464]
          Length = 456

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 114/321 (35%), Positives = 163/321 (50%), Gaps = 57/321 (17%)

Query: 87  DAAGNNGLAE-FNQDFSSRILISYRKGF-------DP---------------IGD-SKIT 122
           +++G++G    F  DF SRI ++YR GF       DP               +GD +  T
Sbjct: 111 ESSGDSGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSSFSIAMRLKTTLGDQTGFT 170

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           SD GWGCM+RS Q L+A ALL  RLGR WR+      +R    IL LF D   +P+S+HN
Sbjct: 171 SDTGWGCMIRSGQSLLANALLISRLGRDWRRMTDPDAERP---ILALFADDSRAPYSLHN 227

Query: 183 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            ++ G+ A G   G W GP A  R  +ALA   + E+ L   S                G
Sbjct: 228 FVKHGELACGKYPGEWFGPSATARCIQALA--NKHESSLRVYST---------------G 270

Query: 242 GAPVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
             P V  D      S  +  + D   + P L+LV   LG++K+N  Y+  L  T    QS
Sbjct: 271 DLPDVYED------SFMATAKPDGETFHPTLILVCTRLGIDKINQVYVEALISTLQMEQS 324

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI--GKDDLEADT-STYHSDVIRHIH 355
           +GI GG+P +S Y VGVQ +   YLDPH  +P +      DD  ++   + H+  +R +H
Sbjct: 325 IGIAGGRPASSHYFVGVQGQWLFYLDPHHPRPKLPYRENPDDYTSEELDSCHTRRLRRLH 384

Query: 356 LDSIDPSLAIGFYCRDKDDFD 376
           ++ +DPS+ IGF  +D+DD+D
Sbjct: 385 VEDMDPSMLIGFLIKDEDDWD 405


>gi|194853882|ref|XP_001968241.1| GG24763 [Drosophila erecta]
 gi|190660108|gb|EDV57300.1| GG24763 [Drosophila erecta]
          Length = 411

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 104/342 (30%), Positives = 165/342 (48%), Gaps = 36/342 (10%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           +D GWGCMLR  QM++AQAL+   LGR W        D  Y++I++ F D   S +SIH 
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-ADCRDATYLKIVNRFEDVRNSFYSIHQ 150

Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
           + Q G++   A G W+GP  + +  + L R     +        +AI+V           
Sbjct: 151 IAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194

Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
              V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+       S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCGMI 249

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 359
           GG+P  + Y +G  ++  +YLDPH  Q    +G+    A+     TYH      +   ++
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAVAEQDYDETYHQKHAARLSFSAM 309

Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           DPSLA+ F C+  D F+    +  +         LF ++QT 
Sbjct: 310 DPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351


>gi|19920488|ref|NP_608563.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
 gi|7296129|gb|AAF51423.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
 gi|16198037|gb|AAL13802.1| LD26292p [Drosophila melanogaster]
 gi|220945806|gb|ACL85446.1| Atg4-PA [synthetic construct]
 gi|220955642|gb|ACL90364.1| Atg4-PA [synthetic construct]
          Length = 411

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 180
           +D GWGCMLR  QM++AQAL+   LGR W      P   D  Y++I++ F D   S +SI
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           H + Q G++   A G W+GP  + +  + L R     +        +AI+V         
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
                V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+       S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 357
           ++GG+P  + Y +G  ++  +YLDPH  Q    + +    A+     TYH      ++  
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           ++DPSLA+ F C+  D F+    +  +         LF ++QT 
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351


>gi|66529516|ref|XP_624577.1| PREDICTED: cysteine protease ATG4B [Apis mellifera]
          Length = 382

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 110/332 (33%), Positives = 161/332 (48%), Gaps = 42/332 (12%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 120
           I  +   +W+LG  +   ++           L    +D  S++  +YRK F PIG  +S 
Sbjct: 16  IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
            TSD GWGCMLR  QM++ QAL+   LGR W+  L+   +  Y++IL  F D   +PFSI
Sbjct: 65  FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWSLETR-NSTYLKILERFEDKRNAPFSI 123

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 231
           H +   G + G   G W GP  + +          W ++      +  L    +     V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183

Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
             G      G AP+              K  + W P+LLL+PL LGL ++NP YI  L+ 
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229

Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 347
           +F  PQSLG++GGKP  + Y +G      IYLDPH  Q   ++ K    +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288

Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
                 I +  IDPS+A+ F+C  + DF   C
Sbjct: 289 CKFSGRIPIIEIDPSVALCFFCATEKDFKSLC 320


>gi|442625102|ref|NP_001259852.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
 gi|440213106|gb|AGB92389.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
          Length = 410

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 180
           +D GWGCMLR  QM++AQAL+   LGR W      P   D  Y++I++ F D   S +SI
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           H + Q G++   A G W+GP  + +  + L R     +        +AI+V         
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
                V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+       S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 357
           ++GG+P  + Y +G  ++  +YLDPH  Q    + +    A+     TYH      ++  
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           ++DPSLA+ F C+  D F+    +  +         LF ++QT 
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351


>gi|195470405|ref|XP_002087497.1| GE17286 [Drosophila yakuba]
 gi|194173598|gb|EDW87209.1| GE17286 [Drosophila yakuba]
          Length = 411

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 110/343 (32%), Positives = 170/343 (49%), Gaps = 42/343 (12%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I    +D+W+LG  +   Q+  L             +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTDVWVLGKKYNAIQELEL-----------IRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           +D GWGCMLR  QM++AQAL+   LGR W        D  Y++I++ F D   S +SIH 
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-SDCRDATYLKIVNRFEDVRNSYYSIHQ 150

Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
           + Q G+    A G W+GP  + +  + L R     +        +AI+V           
Sbjct: 151 IAQMGETQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194

Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
              V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+       S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELESSCGMI 249

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 359
           GG+P  + Y +G  ++  +YLDPH  Q    +G+    A+     TYH      +   ++
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAAAEQDYDETYHQKHAARLSFSAM 309

Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEE--SNGAP-LFTVTQ 399
           DPSLA+ F C+  D F+   A  +KL EE  S  +P LF ++Q
Sbjct: 310 DPSLAVCFLCKTSDSFE---ALLTKLKEEVLSLCSPALFEISQ 349


>gi|383861144|ref|XP_003706046.1| PREDICTED: cysteine protease ATG4B-like [Megachile rotundata]
          Length = 384

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 111/329 (33%), Positives = 163/329 (49%), Gaps = 50/329 (15%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 127
           +W+LG  +   ++           L    +D  S++  +YRKGF PIG   S  TSD GW
Sbjct: 23  VWILGKQYNAIKE-----------LDAIRRDIRSKLWFTYRKGFVPIGGYTSTFTSDKGW 71

Query: 128 GCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           GCMLR  QM++ QAL+   LGR W+  P  +  +  Y++IL  F D  T+PFSIH +   
Sbjct: 72  GCMLRCGQMVLGQALIILHLGRDWQWTPETR--NSTYLKILERFEDRRTAPFSIHQIASM 129

Query: 187 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
           G + G   G W GP  + +  + L       +        + I+V   +          +
Sbjct: 130 GASEGKEVGQWFGPNTIAQVLKKLVVYDDWSS--------ITIHVALDN---------TL 172

Query: 247 CIDDASRHCSVFS------------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
            ++D  R C V              K  + W P+LLL+PL LGL ++NP YI  L+ +F 
Sbjct: 173 IVNDILRQCRVEGGTTAEADGNIPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFK 232

Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDV 350
            PQSLG++GGKP  + Y +G      IYLDPH  Q   ++ K    +++E D +TYH   
Sbjct: 233 IPQSLGVIGGKPNLALYFIGCVGNEVIYLDPHTTQRSGSVDKKLEEEEIEMD-ATYHCKF 291

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
              I +  IDPS+A+ F+C  + DF   C
Sbjct: 292 ASRIPITGIDPSVALCFFCATERDFKSLC 320


>gi|30410844|ref|NP_116241.2| cysteine protease ATG4C [Homo sapiens]
 gi|30410846|ref|NP_835739.1| cysteine protease ATG4C [Homo sapiens]
 gi|114556947|ref|XP_001159883.1| PREDICTED: cysteine protease ATG4C isoform 4 [Pan troglodytes]
 gi|114556951|ref|XP_001159976.1| PREDICTED: cysteine protease ATG4C isoform 6 [Pan troglodytes]
 gi|61211867|sp|Q96DT6.1|ATG4C_HUMAN RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
           cysteine endopeptidase; AltName: Full=Autophagin-3;
           AltName: Full=Autophagy-related cysteine endopeptidase
           3; AltName: Full=Autophagy-related protein 4 homolog C
 gi|14625875|emb|CAC43939.1| putative autophagy-related cysteine endopeptidase [Homo sapiens]
 gi|21542522|gb|AAH33024.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [Homo sapiens]
 gi|27763973|emb|CAC85556.1| Apg4-C protein [Homo sapiens]
 gi|119626984|gb|EAX06579.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|119626985|gb|EAX06580.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|123983334|gb|ABM83408.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
           construct]
 gi|123998035|gb|ABM86619.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
           construct]
 gi|410220598|gb|JAA07518.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410220600|gb|JAA07519.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410267918|gb|JAA21925.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410291226|gb|JAA24213.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410291228|gb|JAA24214.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410335203|gb|JAA36548.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410335205|gb|JAA36549.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
          Length = 458

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
            F          +RE+                           +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|195575679|ref|XP_002077704.1| GD23066 [Drosophila simulans]
 gi|194189713|gb|EDX03289.1| GD23066 [Drosophila simulans]
          Length = 411

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 180
           +D GWGCMLR  QM++AQAL+   LGR W      P   D  Y++I++ F D   S +SI
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           H + Q G++   A G W+GP  + +  + L R     +        +AI+V         
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
                V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+       S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 357
           ++GG+P  + Y +G  ++  +YLDPH  Q    + +    A+     TYH      ++  
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           ++DPSLA+ F C+  D F+    +  +         LF ++QT 
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKFKEEVLSLCSPALFEISQTR 351


>gi|380023311|ref|XP_003695467.1| PREDICTED: cysteine protease ATG4B-like [Apis florea]
          Length = 382

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 110/332 (33%), Positives = 161/332 (48%), Gaps = 42/332 (12%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 120
           I  +   +W+LG  +   ++           L    +D  S++  +YRK F PIG  +S 
Sbjct: 16  IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
            TSD GWGCMLR  QM++ QAL+   LGR W+  L+   +  Y++IL  F D   +PFSI
Sbjct: 65  FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWNLETR-NSTYLKILERFEDKRNAPFSI 123

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 231
           H +   G + G   G W GP  + +          W ++      +  L    +     V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183

Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
             G      G AP+              K  + W P+LLL+PL LGL ++NP YI  L+ 
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229

Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 347
           +F  PQSLG++GGKP  + Y +G      IYLDPH  Q   ++ K    +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288

Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
                 I +  IDPS+A+ F+C  + DF   C
Sbjct: 289 CKFSGRIPIIEIDPSVALCFFCATEKDFKSLC 320


>gi|166990663|sp|Q2HH40.2|ATG4_CHAGB RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
          Length = 448

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 107/310 (34%), Positives = 154/310 (49%), Gaps = 56/310 (18%)

Query: 97  FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
           F  DF SRI ++YR GF+PI                      GD +  +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 192
            Q L+A ALL  +LGR WR+      +R    I+ LF D   +P+S+ N ++ G  A G 
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAERN---IVALFADDARAPYSLQNFVKHGAIACGK 229

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  +ALA    +          + IY          G  P V  D   
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269

Query: 253 RHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
              S  +  + D   + P L+LV   LG++K+NP Y   L  T    QS+GI GG+P +S
Sbjct: 270 ---SFLATARPDGETFHPTLILVCTRLGIDKINPVYEEALISTLQMEQSIGIAGGRPSSS 326

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIG 366
            Y VGVQ +   YLDPH  +P +   ++ L     +  + H+  +R++H++ +DPS+ IG
Sbjct: 327 HYFVGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSCHTRRLRYLHVEDMDPSMLIG 386

Query: 367 FYCRDKDDFD 376
           F  +D+DD+D
Sbjct: 387 FLIQDEDDWD 396


>gi|380092671|emb|CCC09424.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 515

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 108/307 (35%), Positives = 150/307 (48%), Gaps = 50/307 (16%)

Query: 97  FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 133
           F  DF SRI ++YR  F       DP                   +  +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
            Q L+A A+L  RLGR WR+  +   D E  +I+ LF D   +PFS+HN ++ G  A G 
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYGATACGK 296

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  +AL      E+GL   S                G  P V  D   
Sbjct: 297 YPGEWFGPLATARCIQALT--DEKESGLRVYST---------------GDLPDVYEDS-- 337

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
              +V +     + P L+LV   LG++K+N  Y   L  T   PQS+GI GG+P +S Y 
Sbjct: 338 -FMAVANPDGRGFQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 396

Query: 313 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           +GVQ +   YLDPH  +P +   +D       +  T H+  +R +H+D +DPS+ IGF  
Sbjct: 397 IGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELDTCHTRRLRQLHIDDMDPSMLIGFLI 456

Query: 370 RDKDDFD 376
           +D+DD+D
Sbjct: 457 KDEDDWD 463


>gi|166990618|sp|A7KAI3.1|ATG4_PICAN RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|129714817|gb|ABO31288.1| Atg4p [Ogataea angusta]
          Length = 509

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 126/404 (31%), Positives = 187/404 (46%), Gaps = 80/404 (19%)

Query: 72  LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 120
           L  + HK   D+A    A  +   EF +D  SRI ++YR GF  I  ++           
Sbjct: 51  LRTLFHKFKPDQAADTEA--SWPREFLRDVHSRIWLTYRSGFPLIKRAEDGPSPLSFGSL 108

Query: 121 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
                         T+D GWGCM+R+SQ L+A +LL  RLGR WR    +   + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANSLLQLRLGRGWRYDQTRECAK-HAEIV 167

Query: 168 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
             F D  T+PFSIHN ++ G    G   G W GP A  RS + L      +TGL      
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKTGLKV---- 223

Query: 227 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 282
              +  SGD  ED                   +F   Q  A+  P+L+L  + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQQGAELRPVLILAGIRLGVKNVN 263

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----- 337
           P Y   L+ T  +PQS+GI GG+P +S Y  G Q +   YLDPH  Q  + I  +     
Sbjct: 264 PLYWDFLKKTLGWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHES 323

Query: 338 -------DLEA--DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
                  ++E+  D  + H++ IR +HLD +DPS+ +G    ++  +D   A    +   
Sbjct: 324 PDPNHYVEVESGLDLDSVHTNKIRKLHLDQMDPSMLVGLLVENRASYD---ALKHSINSH 380

Query: 389 SNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD--SLGVMSMND 430
             G+    V  +  +PV  +     +GG+ E +   LGV+SMN+
Sbjct: 381 DQGSRFLNVYDS--RPVLAAK---SSGGLEESEFVDLGVLSMNE 419


>gi|297664749|ref|XP_002810790.1| PREDICTED: cysteine protease ATG4C [Pongo abelii]
          Length = 458

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 118/421 (28%), Positives = 177/421 (42%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
                        + L+ P             D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|403257906|ref|XP_003921531.1| PREDICTED: cysteine protease ATG4C [Saimiri boliviensis
           boliviensis]
          Length = 458

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 118/421 (28%), Positives = 177/421 (42%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
                        + L+ P             D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEMYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|320581937|gb|EFW96156.1| cysteine protease ATG4, putative [Ogataea parapolymorpha DL-1]
          Length = 509

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 126/404 (31%), Positives = 186/404 (46%), Gaps = 80/404 (19%)

Query: 72  LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 120
           L  + HK  QD+A    A  +   EF  D  SRI ++YR GF  I  ++           
Sbjct: 51  LRTLFHKFKQDQAAETEA--SWPREFLGDVHSRIWLTYRSGFPLIRRAEDGPSPLSFGSL 108

Query: 121 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
                         T+D GWGCM+R+SQ L+A  LL  RLGR WR    +   + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANGLLQLRLGRGWRYDQTRECAK-HAEIV 167

Query: 168 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
             F D  T+PFSIHN ++ G    G   G W GP A  RS + L      + GL      
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKIGLKV---- 223

Query: 227 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 282
              +  SGD  ED                   +F   Q  A+  P+L+L  + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQEGAELRPVLILAGIRLGVKNVN 263

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----- 337
           P Y   L+ T ++PQS+GI GG+P +S Y  G Q +   YLDPH  Q  + I  +     
Sbjct: 264 PLYWDFLKKTLSWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHES 323

Query: 338 -------DLEA--DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
                  ++E+  D  + H++ IR +HLD +DPS+ +G    ++  +D   A    +   
Sbjct: 324 PDPNHYVEVESGLDLDSVHTNKIRKLHLDQMDPSMLVGLLVENRASYD---ALKHNINAH 380

Query: 389 SNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD--SLGVMSMND 430
             G+    V  +  +PV  +     +GG+ E +   LGV+SMN+
Sbjct: 381 DQGSRFLNVYDS--RPVLAAK---SSGGLEESEFVDLGVLSMNE 419


>gi|402854773|ref|XP_003892029.1| PREDICTED: cysteine protease ATG4C isoform 1 [Papio anubis]
 gi|402854775|ref|XP_003892030.1| PREDICTED: cysteine protease ATG4C isoform 2 [Papio anubis]
          Length = 458

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
            F          +RE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKSILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMAFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 ETGGVPEDDSLGVMSMNDAV 432
           E     E   L   SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456


>gi|157115549|ref|XP_001658259.1| Autophagy-specific protein, putative [Aedes aegypti]
 gi|108876876|gb|EAT41101.1| AAEL007228-PA [Aedes aegypti]
          Length = 389

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 114/343 (33%), Positives = 172/343 (50%), Gaps = 42/343 (12%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  +   +D  L             +D  +R+  +YR+GF PIG S++T+D GWGC
Sbjct: 28  VWILGKSYSATEDLDL-----------IRRDVQTRLWCTYRRGFVPIGGSQLTTDKGWGC 76

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL    LGR W    +   +  Y++I++ F DS+ +PFS+H +   G++
Sbjct: 77  MLRCGQMVLAQALTQLHLGRDWSWTPETT-NETYLKIVNRFEDSKAAPFSLHQIALTGES 135

Query: 190 YGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
                 G W GP  + +  + L +              + I+V   +          +  
Sbjct: 136 SEEKRVGEWFGPNTVAQVLKKLVKFD--------DWCSLVIHVALDN---------TLAT 178

Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
           D+    C V       W P+LL++PL LGL ++NP Y+  L+  F    + G+VGG+P  
Sbjct: 179 DEVLELC-VDRSNPDSWKPLLLIIPLRLGLSEINPIYVDGLKKCFELAGNCGMVGGRPNQ 237

Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLA 364
           + Y +G   + A+YLDPH VQ    IG     D+ E D  T+H    R I+   +DPSLA
Sbjct: 238 ALYFIGYVADEALYLDPHTVQRSGTIGSKRDPDERELD-ETFHQKYARRINFKGMDPSLA 296

Query: 365 IGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKP 404
           + F C  + DFDD   R     E+ NG    PLF VT+T + P
Sbjct: 297 LCFLCATRKDFDDLIQR---FKEDLNGGGCQPLFEVTKTRQAP 336


>gi|449303631|gb|EMC99638.1| hypothetical protein BAUCODRAFT_344306 [Baudoinia compniacensis
           UAMH 10762]
          Length = 446

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 109/326 (33%), Positives = 165/326 (50%), Gaps = 65/326 (19%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------- 120
           A++EALG        AEF  D  +RI ++YR  F PI  S                    
Sbjct: 103 AEEEALG------WPAEFMDDMEARIWLTYRNNFPPIAKSSDPSAGSAMSFSTKLRNIGN 156

Query: 121 ---ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 177
               TSD GWGCM+RS Q L+A +L   +LGR WR+  +   + +Y  ++ LF D+  +P
Sbjct: 157 SGGFTSDAGWGCMIRSGQTLLANSLATLKLGRDWRRGQK---EDDYKHLISLFADTPEAP 213

Query: 178 FSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
           FSIH  ++ G +A G   G W GP A  RS +AL    R + GL   + P         +
Sbjct: 214 FSIHKFVEHGAQACGKHPGEWFGPSATARSVQALTEKYR-DVGLRVYARP---------D 263

Query: 237 DGERGGAPVVCIDDASRHCSVF-SKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRL 291
           DG+      V +D      S+F + GQ D    + P L+++ + LG++++ P Y   L+ 
Sbjct: 264 DGD------VYVD------SLFATAGQMDANDEFQPTLIVLGIRLGIDRITPVYHAALKA 311

Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI--NIGKDDLEADTSTYHSD 349
           T   PQS+GI GG+P +S Y VG Q ++  YLDPH  +  I  N   +DL    ++ H+ 
Sbjct: 312 TLEMPQSVGIAGGRPSSSHYFVGHQGDNFFYLDPHTTRQAIPQNPSAEDL----ASCHTR 367

Query: 350 VIRHIHLDSIDPSLAIGFYCRDKDDF 375
            +R + +  +DPS+ +GF    K++F
Sbjct: 368 RLRRLKIAEMDPSMLLGFLIHSKEEF 393


>gi|189091768|ref|XP_001929717.1| hypothetical protein [Podospora anserina S mat+]
 gi|188219237|emb|CAP49217.1| unnamed protein product [Podospora anserina S mat+]
          Length = 508

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 151/307 (49%), Gaps = 50/307 (16%)

Query: 97  FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
           F  DF SRI ++YR GF+ I                      GD +  +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
            Q L+A A+L  R GR WR+      +RE   I+ LF D   +P+SI N +  G A  G 
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIERE---IVCLFADDPRAPYSIQNFVNHGAAACGK 289

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  +ALA+   +          + +Y+            P V  D+  
Sbjct: 290 YPGEWFGPSATARCIQALAKKHDSS---------LRVYLTRD--------LPEVYEDN-- 330

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
              S  +     + P L+LV   LG++K+NP Y   L  T   PQ++GI GG+P +S Y 
Sbjct: 331 -FMSTANPDGNHFHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSSHYF 389

Query: 313 VGVQEESAIYLDPHDVQPVINIGK---DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           +G Q +   YLDPH  +P +   +   D    +  + H+  +RH+H++ +DPS+ IGF  
Sbjct: 390 IGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSCHTRRLRHLHVEDMDPSMLIGFLI 449

Query: 370 RDKDDFD 376
           +D+DD+D
Sbjct: 450 KDEDDWD 456


>gi|442757637|gb|JAA70977.1| Putative cysteine protease required for autophagy [Ixodes ricinus]
          Length = 458

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 115/404 (28%), Positives = 169/404 (41%), Gaps = 80/404 (19%)

Query: 65  SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C H   +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------------------ 155
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPYALSIENSDSESRTSHTVK 155

Query: 156 ----------------------------QKPFDRE------YVEILHLFGDSETSPFSIH 181
                                       + P D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEAPLSGARELKSPTVSLKETIGRYPDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +    D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQRASMASDNTDDKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +
Sbjct: 381 SCTIGFYCRNIQDFKRASEEITKMLKISSKEKYPLFTFVNGHSR 424


>gi|383872484|ref|NP_001244816.1| cysteine protease ATG4C [Macaca mulatta]
 gi|355745338|gb|EHH49963.1| hypothetical protein EGM_00712 [Macaca fascicularis]
 gi|380788509|gb|AFE66130.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
 gi|383413101|gb|AFH29764.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
          Length = 458

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
            F          +RE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 ETGGVPEDDSLGVMSMNDAV 432
           E     E   L   SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456


>gi|344278625|ref|XP_003411094.1| PREDICTED: cysteine protease ATG4C [Loxodonta africana]
          Length = 458

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 115/404 (28%), Positives = 172/404 (42%), Gaps = 80/404 (19%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE          A+ D   +  + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKMLPAISSCAIEDCVISGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 158 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 181
            F                              D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGESELKTPSISLKKTIGKYSDDHEMRNEIYHRKIVSWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKAGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   + C+  +    D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQCASMASDNPDNKAVIILVPVRLGGERTNVDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
           S  IGFYC++  DF+      +K+ + S+    PLFT    H +
Sbjct: 381 SCTIGFYCQNVQDFERASEEITKMLKVSSKEKYPLFTFVNGHSR 424


>gi|380485578|emb|CCF39271.1| cysteine protease atg4 [Colletotrichum higginsianum]
          Length = 454

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 103/309 (33%), Positives = 144/309 (46%), Gaps = 49/309 (15%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF S+  ++YR  F  I  S                         TSD GWGCM+RS
Sbjct: 118 FLDDFESKFWMTYRSEFQAIAKSTDPRASSTLSFSMRIKSQLVDQNGFTSDSGWGCMIRS 177

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 192
            Q L+A A+    LGR WR+  Q P D    ++L  F D   +P+SIH  +Q G  A G 
Sbjct: 178 GQSLLANAMAAINLGRDWRRG-QNPEDER--KLLSWFADDPRAPYSIHQFVQHGAVACGK 234

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  +ALA  Q  +        P+ +Y          G  P V  D   
Sbjct: 235 YPGEWFGPSATARCIQALANAQEQQ--------PLRVYST--------GDGPDVYED--- 275

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
           +   +     + + P L+LV   LG++K+ P Y   L      PQS+GI GG+P +S Y 
Sbjct: 276 KFMEIAKPDGSRFNPTLILVGTRLGIDKITPVYWEALIAALQMPQSVGIAGGRPASSHYF 335

Query: 313 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           +G Q     YLDPH  +P +    D     EAD  T H+  +R +H+  +DPS+ +GF  
Sbjct: 336 IGAQGSYLFYLDPHHTRPALPFHTDPSHYSEADVDTVHTRRLRRLHVRELDPSMLVGFLI 395

Query: 370 RDKDDFDDF 378
           RD+DD+ ++
Sbjct: 396 RDEDDWAEW 404


>gi|147905876|ref|NP_001088249.1| cysteine protease ATG4C [Xenopus laevis]
 gi|61211751|sp|Q5XH30.1|ATG4C_XENLA RecName: Full=Cysteine protease ATG4C; AltName:
           Full=Autophagy-related protein 4 homolog C
 gi|54038152|gb|AAH84245.1| LOC495080 protein [Xenopus laevis]
          Length = 450

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 112/403 (27%), Positives = 172/403 (42%), Gaps = 95/403 (23%)

Query: 67  TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 114
            S ++LLG C+    +++    D   N+G          + EF +DF SRI ++YRK F 
Sbjct: 38  NSPVFLLGKCYHFKYEDSGVTADDCSNSGSDSKEDLSGNVDEFRKDFISRIWLTYRKEFP 97

Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 151
            I  S  T+D GWGC LR+ QML+AQ LL H LGR W                       
Sbjct: 98  QIESSSWTTDCGWGCTLRTGQMLLAQGLLVHFLGRDWTWTEALDIFCSESDFWTANTARK 157

Query: 152 -------------------RKPLQKPFDREYVEILH-----LFGDSETSPFSIHNLLQAG 187
                              ++PLQ    + Y E LH      F D   + F +H L++ G
Sbjct: 158 LDPSLEKSSPENEEYVSLGKQPLQNSEKKRYSEDLHRKIISWFADYPLAYFGLHQLVKLG 217

Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
           K  G  AG W GP  +      L R    E+                  D E  G  +  
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256

Query: 248 IDDASRHCSVFSK-------GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
             D    C++++         + +   +++LVP+ LG E+ N  Y   ++   +    +G
Sbjct: 257 AQD----CTIYNADVYDLQCNKGNEKAVVILVPVRLGGERTNMEYFEYVKGILSLEFCIG 312

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           I+GGKP  S Y VG Q++S IY+DPH  Q  +++   +   +  ++H    + +    +D
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSIKNFPLE--SFHCPSPKKMSFKKMD 370

Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTH 401
           PS  +GFYCR+  +F+      +K+ + S     PLFT    H
Sbjct: 371 PSCTVGFYCRNAREFEKAAEELTKVLKSSTKQNYPLFTFVNGH 413


>gi|291398772|ref|XP_002715996.1| PREDICTED: APG4 autophagy 4 homolog C [Oryctolagus cuniculus]
          Length = 458

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 113/404 (27%), Positives = 170/404 (42%), Gaps = 80/404 (19%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 152 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 181
                        + L+ P             D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTICLKETIGKCSEDHETENEICHRKIISWFGDSPLAAFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + +YV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITVYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +    D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQSASMTSDNTDDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKMSSKEKYPLFTFVNGHSR 424


>gi|355558068|gb|EHH14848.1| hypothetical protein EGK_00836 [Macaca mulatta]
          Length = 458

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
            F          +RE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -FSVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 ETGGVPEDDSLGVMSMNDAV 432
           E     E   L   SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456


>gi|50344862|ref|NP_001002103.1| cysteine protease ATG4C [Danio rerio]
 gi|47938047|gb|AAH71514.1| Autophagy-related 4C (yeast) [Danio rerio]
          Length = 463

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 120/414 (28%), Positives = 184/414 (44%), Gaps = 82/414 (19%)

Query: 59  SRTGISSSTSDIWLLGVCH--KIAQDE--------ALGDAAGNNGLAEFNQDFSSRILIS 108
           S+T  S + S ++LLG C+  K+  DE        AL D      + EF +DF+SR+ ++
Sbjct: 31  SKTAFSRN-SPVFLLGKCYHFKVVDDENPTESTAEALDDDVVTGNVDEFRKDFTSRVWLT 89

Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQ-KPFDRE--- 162
           YR+ F  +  S  TSD GWGC LR+ QM++AQALL H LGR W+  + L  +P D E   
Sbjct: 90  YREEFPALPGSSFTSDCGWGCTLRAGQMILAQALLLHILGRDWKWSEALSLEPLDTETWT 149

Query: 163 --------------------------------------YVE------ILHLFGDSETSPF 178
                                                 Y++      I+  FGD  ++  
Sbjct: 150 SSAARRLVATLEASIQGERAQASQPLCPVQGEAEEADSYLKETYHRTIVSWFGDGPSAQL 209

Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
            I+ L++ G   G  AG W GP         +A   R        ++   I V    +D 
Sbjct: 210 GIYKLVELGMTSGKQAGDWYGP-------AVVAHILRKAVDEAVDAMLKGIRVYVA-QDC 261

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQA-------DWTPILLLVPLVLGLEKVNPRYIPTLRL 291
               A V  ID  S      S  Q        D   +++L+P+ LG EK+NP Y+  ++ 
Sbjct: 262 TVYSADV--IDSHSTRTESHSDPQGLDSGASPDSRAVVILIPVRLGGEKINPEYLNFVKS 319

Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
             +    +GI+GGKP  + Y VG Q++S IY+DPH  Q  +++   D      ++H    
Sbjct: 320 ILSLEYCIGIIGGKPKQAYYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSP 377

Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
           + +    +DPS  IGFY +  + F+      SK+ + S+    P FT+ + H K
Sbjct: 378 KKMSFSKMDPSCTIGFYSKSVEHFEKIANELSKILQPSSKEKYPAFTIMKGHGK 431


>gi|118094640|ref|XP_422520.2| PREDICTED: cysteine protease ATG4C [Gallus gallus]
          Length = 459

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 113/404 (27%), Positives = 168/404 (41%), Gaps = 80/404 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 113
           S  S ++LLG C+    DE+  L     N           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQA-------------------------------- 141
             I  S +T+D GWGC LR+ QML+AQ                                 
Sbjct: 96  PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155

Query: 142 -----------------LLFHRLGRPWRKPLQKPFDREYV---EILHLFGDSETSPFSIH 181
                            +L H   R  R+       R  V   +I+  FGDS  + F +H
Sbjct: 156 KLTASLEASLTAEREPKILSHHQERTLRRDCGDSEMRNEVYHRKIISWFGDSPLAAFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + +YV          
Sbjct: 216 QLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   R CS    G+ D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 403
           S  IGFYCR   DF+      +K+ + S+    PLFT  + H +
Sbjct: 381 SCTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 424


>gi|327264155|ref|XP_003216881.1| PREDICTED: cysteine protease ATG4D-like [Anolis carolinensis]
          Length = 585

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 99/359 (27%), Positives = 157/359 (43%), Gaps = 69/359 (19%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----- 151
           F +DF+SRI ++YR+ F  +  +  T+D GWGCMLRS QML+AQ L+ H LG+ W     
Sbjct: 198 FQKDFASRIWLTYRRDFQQLEGTMWTTDCGWGCMLRSGQMLLAQGLIVHFLGKDWTWPDA 257

Query: 152 ------------------------------------------------RKPLQKPFDREY 163
                                                           R P +   +R +
Sbjct: 258 LHTPGLVEMEPMKATHLPYPSTSSSHQGPSIPTDRSRGPWELRAPRHTRSPDELEKERYH 317

Query: 164 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
            +I+  F D   + F IH L+  G + G  AG W GP            C        C 
Sbjct: 318 RKIISWFADRPQAHFGIHRLVSLGHSSGKKAGDWYGPSVAAHIIRKAVDC--------CS 369

Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 283
                +  VS D    +G   V  + + S   + +  G A W  +++LVP+ LG E  NP
Sbjct: 370 EAGNLVVYVSQDCTVYKGD--VANLANKSEDRTAWDPG-AVWKAVIILVPMRLGGEAFNP 426

Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 343
            Y+  ++        +GI+GGKP  S Y VG Q+++ +YLDPH  QP ++  K++   + 
Sbjct: 427 AYVDCVKELLKLEFCIGIIGGKPRHSLYFVGYQDDALLYLDPHYCQPFVDTTKENFPLE- 485

Query: 344 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQ 399
            ++H +  R      +DPS  IGFY   + +F++ C   +++   S      P+F++ +
Sbjct: 486 -SFHCNSPRKTAFTKVDPSCTIGFYAHHRTEFEELCLHLTQVLNSSTAKEKYPMFSIVE 543


>gi|164660504|ref|XP_001731375.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
 gi|159105275|gb|EDP44161.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
          Length = 651

 Score =  160 bits (406), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 143/475 (30%), Positives = 208/475 (43%), Gaps = 100/475 (21%)

Query: 15  SKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSST------- 67
           +K TP  P++ + S    +   + V  L++      + E VLG S T  +S T       
Sbjct: 215 AKETPLCPSQ-MHSSQQPISDHQPVSTLLS------LVEAVLGSSDTLPTSVTWLAHQLK 267

Query: 68  SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 127
           +  W L   H +         A        +  F   + +++R  F        TSDVGW
Sbjct: 268 ARGWELLASHGVPYTSPTAHTAFPGVWHSVHAVFQHILSLTHRTCF--------TSDVGW 319

Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD--SETSPFSIHNLLQ 185
           GCMLRS Q ++A AL+   LGR WR+  ++    +Y  IL  F D  S   PFSIH L+ 
Sbjct: 320 GCMLRSVQSMLANALIRVHLGRHWRRRAKQKTHPQYARILSWFMDDPSLECPFSIHRLVD 379

Query: 186 AGKAYGLAAGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            G+  G+ AG W GP    +A+C+  +A   C     GLG          V    DG   
Sbjct: 380 EGQRLGVQAGDWFGPSTAAFALCKLIQAYDAC-----GLG----------VVVTNDGMLY 424

Query: 242 GAPVVCIDDASRHCSVFSKGQAD-WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
              VV         + F+ G++D WT P+L+L+   LGL++V P Y P L+ +FT PQS+
Sbjct: 425 KEQVVA--------ASFAPGRSDPWTRPVLILLVQRLGLDQVPPHYRPALKQSFTMPQSV 476

Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI------------GKDDLEADTSTYH 347
           G+VGG+P +S Y VGVQ E  + LDPH V+P +                 DL +  S + 
Sbjct: 477 GVVGGRPRSSLYFVGVQREHLLCLDPHHVRPCVPFRSPPRMTRASVGASTDLASTVSPWF 536

Query: 348 SDVIRHIHLDS-------------IDPSLAIGFYCRDKDDFDDFCAR----ASKLAEESN 390
            +      LDS             +DPS+ +GF C    D  D  AR     ++L + ++
Sbjct: 537 EEAYTAEELDSFHTPHTSLLPISQMDPSMLLGFVCEQASDLIDLQARIESSETRLFDVAD 596

Query: 391 GAPLF----------------TVTQTHKKPVNHSDVLGETGGVPE--DDSLGVMS 427
             P +                   +THK    HSD +    GV +  DDS   M+
Sbjct: 597 NMPSYYRLSMSMGGEGEGDDDDNHRTHKAEDGHSDRVAAHSGVGDNVDDSGWTMA 651


>gi|73956170|ref|XP_852273.1| PREDICTED: cysteine protease ATG4C isoform 2 [Canis lupus
           familiaris]
 gi|73956176|ref|XP_865426.1| PREDICTED: cysteine protease ATG4C isoform 4 [Canis lupus
           familiaris]
          Length = 458

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 117/421 (27%), Positives = 172/421 (40%), Gaps = 87/421 (20%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKFEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
             I  S  T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEGSAFTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSDSWTSNTVK 155

Query: 158 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 181
            F                              D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGESELKTPTVSQKETIRRHSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   + C+  +    D   +++L+P+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 412
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 413 E 413
           E
Sbjct: 441 E 441


>gi|85067704|ref|XP_959438.1| hypothetical protein NCU02433 [Neurospora crassa OR74A]
 gi|62899773|sp|Q7S3X7.1|ATG4_NEUCR RecName: Full=Probable cysteine protease atg-4; AltName:
           Full=Autophagy-related protein 4
 gi|28920860|gb|EAA30202.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 506

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 131/405 (32%), Positives = 182/405 (44%), Gaps = 87/405 (21%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSR 60
             G R  A A+ C S ++      S A  GS+LGS +TV   VT+G     ++  L    
Sbjct: 112 FNGVRTTATAT-CLSDTS-----MSAAPTGSQLGSFDTVPDSVTSG-----YDSALAYEE 160

Query: 61  TGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF------- 113
            G                  QD     A        F  DF SRI ++YR  F       
Sbjct: 161 PG------------------QDGGWPPA--------FLDDFESRIWMTYRTDFALIPRSS 194

Query: 114 DPIGDSKIT----------------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 157
           DP   S ++                SD GWGCM+RS Q L+A A+L  RLGR WR+    
Sbjct: 195 DPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQSLLANAILIARLGREWRRGTD- 253

Query: 158 PFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRA 216
             D E  +I+ LF D   +P+S+HN ++ G  A G   G W GP A  R  +ALA     
Sbjct: 254 -LDAE-KDIIALFADDPRAPYSLHNFVKYGATACGKYPGEWFGPSATARCIQALA--DEK 309

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
           ++GL   S                G  P V  D      +V +     + P L+LV   L
Sbjct: 310 QSGLRVYST---------------GDLPDVYEDS---FMAVANPDGRGFQPTLILVCTRL 351

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G++K+N  Y   L  T   PQS+GI GG+P +S Y VGVQ +   YLDPH  +P +   +
Sbjct: 352 GIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYFVGVQGQRLFYLDPHHPRPALPYRE 411

Query: 337 DD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           D       +  T H+  +R +H+  +DPS+ IGF  +D+DD+D +
Sbjct: 412 DPRGYTAEELDTCHTRRLRQLHIGDMDPSMLIGFLIKDEDDWDTW 456


>gi|168693565|ref|NP_001108301.1| uncharacterized protein LOC100137698 [Xenopus laevis]
 gi|163915830|gb|AAI57741.1| LOC100137698 protein [Xenopus laevis]
          Length = 468

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 99/354 (27%), Positives = 161/354 (45%), Gaps = 59/354 (16%)

Query: 91  NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
           ++ +  F +DF SR+ ++YR+ F  +  + +T+D GWGCM+RS QML+AQ LL H L R 
Sbjct: 92  DDEIDRFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLMHLLSRE 151

Query: 151 W----------------------RKPL-------------------QKPF-DREYVEILH 168
           W                      R PL                   + P  ++ +  I+ 
Sbjct: 152 WTWPEALYTHFVEMEPIRSSSPSRMPLSSLATSHSASDCWPHAHSSRAPHGNQVHRNIIR 211

Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
            F D  ++PF +H ++  G  +G  AG W GP         +A   +      C+   ++
Sbjct: 212 WFSDHPSAPFGLHRMVALGSIFGKKAGDWYGP-------SIVAHIIKKAIETSCEVAELS 264

Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
           +YV S D    +     +   D     +    G+A    +++LVP  LG E  NP Y   
Sbjct: 265 VYV-SQDCTVYKADIEQLFAGDVPHAETSRDAGKA----VIILVPARLGGETFNPVYKHC 319

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           L+     P  LGI+GGKP  S Y +G Q+   +YLDPH  Q  I+  ++D   +  ++H 
Sbjct: 320 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYSQSYIDTSRNDFPLE--SFHC 377

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQ 399
           +  R I +  +DPS    FY +++DDF   C    K+    +     P+F++++
Sbjct: 378 NTPRKISITRMDPSCTFAFYAQNRDDFGKLCDHLMKVLHSPHAEEKYPIFSISE 431


>gi|14042698|dbj|BAB55356.1| unnamed protein product [Homo sapiens]
          Length = 446

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 115/397 (28%), Positives = 170/397 (42%), Gaps = 80/397 (20%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
            F          +RE+                           +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFT 396
           S  IGFYCR+  DF       +K+ + S+    PLFT
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFT 417


>gi|310801857|gb|EFQ36750.1| peptidase family C54 [Glomerella graminicola M1.001]
          Length = 454

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 107/329 (32%), Positives = 152/329 (46%), Gaps = 54/329 (16%)

Query: 79  IAQDEALGDAAGNNG--LAEFNQDFSSRILISYRKGFDPIGDSK---------------- 120
           +A DE   D +G +G     F  DF S+  ++YR  F  I  S                 
Sbjct: 101 LAYDE---DYSGQDGGWPTAFLDDFESKFWMTYRSEFPAIAKSTDPRASSALSFSMRIKS 157

Query: 121 -------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 173
                   +SD GWGCM+RS Q L+A A+    LGR WR+   +  +R+   +L LF D 
Sbjct: 158 QLVDQNGFSSDSGWGCMIRSGQSLLANAMAVINLGRDWRRGQNQEEERK---LLSLFADD 214

Query: 174 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
             +P+SIH  +Q G  A G   G W GP A  R  +ALA  Q  +        P+ +Y  
Sbjct: 215 PRAPYSIHQFVQHGAVACGKYPGEWFGPSATARCIQALANAQMHQ--------PLRVYST 266

Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
               D        +   D SR           + P L+LV   LG++K+ P Y   L   
Sbjct: 267 GDGPDVYEDKFMKIAKPDGSR-----------FHPTLILVGTRLGIDKITPVYWEALIAA 315

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSD 349
              PQS+GI GG+P +S Y +G Q     YLDPH  +P +    +     EAD  T H+ 
Sbjct: 316 LQMPQSVGIAGGRPSSSHYFIGAQGSYLFYLDPHHTRPALPFHMNPSLYSEADVDTVHTR 375

Query: 350 VIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
            +R +H+  +DPS+ IGF   D+DD+D++
Sbjct: 376 RLRRLHVRELDPSMLIGFLILDEDDWDEW 404


>gi|212645205|ref|NP_493375.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
 gi|193247781|emb|CAB54483.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
          Length = 454

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 105/369 (28%), Positives = 168/369 (45%), Gaps = 59/369 (15%)

Query: 84  ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 142
           ALG + +  +G+    +  +SR   +YR+ F PIG +  ++D GWGCMLR +QML+ + L
Sbjct: 39  ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 98

Query: 143 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
           L   +GR +   ++K     Y +IL +F D + + +SIH + Q G   G     W GP  
Sbjct: 99  LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 157

Query: 203 MCR---------SWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 246
             +          W  +A     +  L  + ++ MA    S D      E+G        
Sbjct: 158 AAQVMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENG-------- 209

Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
            + D +R          +W P+LL++PL LGL  +NP Y+  ++  F  PQ +GI+GG+P
Sbjct: 210 -LVDKNRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRP 268

Query: 307 GASTYIVGVQEESAIYLDPHDVQPVI-----------------NIGKDDLE--------- 340
             + Y VG+      YLDPH  +P                   ++G   LE         
Sbjct: 269 NHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDVGFSHLEELVPLPSQT 328

Query: 341 ------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 394
                  D STYH  ++  I  +++DPSLA+  +C  +D+F++ C    K    ++  P+
Sbjct: 329 ADVYTKMDDSTYHCQMMLWIEYENVDPSLALAMFCETRDEFENLCETLQKTTLPASQPPM 388

Query: 395 FTVTQTHKK 403
           F   Q   K
Sbjct: 389 FEFLQRRPK 397


>gi|126305934|ref|XP_001364974.1| PREDICTED: cysteine protease ATG4C [Monodelphis domestica]
          Length = 460

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 122/423 (28%), Positives = 183/423 (43%), Gaps = 89/423 (21%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 111
           S  S + LLG C+    +E    A      AG N        + EF +DF SRI ++YR+
Sbjct: 36  SRNSPVLLLGKCYHFKSEEENDPAPVGSGWAGENEHVVIYGNVEEFRRDFISRIWLTYRE 95

Query: 112 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------- 154
            F  I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                 
Sbjct: 96  EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDIENSDSASWTSHT 155

Query: 155 ------------------------LQKPF-----DRE------YVEILHLFGDSETSPFS 179
                                   L++P      D E      + +I+  FGDS  + F 
Sbjct: 156 VKKLTASFEASLTGERTPKVPPSILKEPRRTGSEDEEGRNELCHRKIISWFGDSPLACFG 215

Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
           +H L++ GK  G  AG W GP  +           R     G     + IYV    +D  
Sbjct: 216 LHQLIEYGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVA---QDCT 267

Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
              A V+     S      ++ +A    I+LLVP+ LG E+ N  Y+  ++   +    +
Sbjct: 268 VYKADVIDKQGISAGLET-TEDKA----IILLVPVRLGGERTNMDYLDFVKGILSLEYCV 322

Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 359
           GI+GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +
Sbjct: 323 GIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKM 380

Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEESN--GAPLFTVTQTHKK-------PVNHSDV 410
           DPS  +GFYCR+  DF+      +++ + S+    PLFT  + H +       P N  D+
Sbjct: 381 DPSCTVGFYCRNAQDFERASEELTQVLKASSREKYPLFTFVKGHARDYDFTSTPTNEDDL 440

Query: 411 LGE 413
             E
Sbjct: 441 FSE 443


>gi|453230621|ref|NP_001263575.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
 gi|412974713|emb|CCO25637.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
          Length = 481

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 105/369 (28%), Positives = 168/369 (45%), Gaps = 59/369 (15%)

Query: 84  ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 142
           ALG + +  +G+    +  +SR   +YR+ F PIG +  ++D GWGCMLR +QML+ + L
Sbjct: 66  ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 125

Query: 143 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
           L   +GR +   ++K     Y +IL +F D + + +SIH + Q G   G     W GP  
Sbjct: 126 LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 184

Query: 203 MCR---------SWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 246
             +          W  +A     +  L  + ++ MA    S D      E+G        
Sbjct: 185 AAQVMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENG-------- 236

Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
            + D +R          +W P+LL++PL LGL  +NP Y+  ++  F  PQ +GI+GG+P
Sbjct: 237 -LVDKNRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRP 295

Query: 307 GASTYIVGVQEESAIYLDPHDVQPVI-----------------NIGKDDLE--------- 340
             + Y VG+      YLDPH  +P                   ++G   LE         
Sbjct: 296 NHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDVGFSHLEELVPLPSQT 355

Query: 341 ------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 394
                  D STYH  ++  I  +++DPSLA+  +C  +D+F++ C    K    ++  P+
Sbjct: 356 ADVYTKMDDSTYHCQMMLWIEYENVDPSLALAMFCETRDEFENLCETLQKTTLPASQPPM 415

Query: 395 FTVTQTHKK 403
           F   Q   K
Sbjct: 416 FEFLQRRPK 424


>gi|255945233|ref|XP_002563384.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|166990617|sp|A7KAL5.1|ATG4_PENCW RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|129561973|gb|ABO31075.1| Atg4p [Penicillium chrysogenum]
 gi|211588119|emb|CAP86190.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 401

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 116/380 (30%), Positives = 171/380 (45%), Gaps = 75/380 (19%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAE-----------------FNQDFSSRILISYRKG 112
           IW LG   + A  +   D A NN  +                  F  DF SRI I+YR  
Sbjct: 29  IWCLG--REYAPSQPPSDPASNNPRSPSRQPNASTLNDTTWPKAFLSDFGSRIWITYRSN 86

Query: 113 FDPIGDSK-----------------------ITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
           F PI  +K                        TSD GWGCM+RS Q L+A       LGR
Sbjct: 87  FTPIPRTKTPEATSSMTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQSLLANTFSVLLLGR 146

Query: 150 PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWE 208
            WR+  +     E  +++ +F D   +PFSIH  +  G ++ G   G W GP        
Sbjct: 147 DWRRGEKV---EEESKLISMFADHPEAPFSIHRFVNRGAESCGKYPGEWFGP-------S 196

Query: 209 ALARCQRAETGLGCQS-LP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
           A A+C +    L  QS +P + +Y+ +   D           +D   H +    G+    
Sbjct: 197 ATAKCIQL---LSTQSEVPQLRVYLTNDTSD---------VYEDKFAHVAHDESGRIQ-- 242

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
           P L+L+   LG++ V P Y   LR   T+PQS+GI GG+P AS Y VG Q+    +LDPH
Sbjct: 243 PTLILIGTRLGIDNVTPAYWDGLRAALTYPQSVGIAGGRPSASHYFVGAQDCHLFFLDPH 302

Query: 327 DVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
             +P      D L  + +  +Y++  +R IH+  +DPS+ IGF  +D+DD+ D+     K
Sbjct: 303 TTRPATLYRPDGLYTQEELDSYYTSRLRRIHIKDMDPSMLIGFLVKDEDDWADW----KK 358

Query: 385 LAEESNGAPLFTVTQTHKKP 404
               + G P+  +  +  +P
Sbjct: 359 RIRSTPGQPIVHIFPSQHQP 378


>gi|387015378|gb|AFJ49808.1| Cysteine protease ATG4C-like [Crotalus adamanteus]
          Length = 457

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 118/431 (27%), Positives = 180/431 (41%), Gaps = 80/431 (18%)

Query: 65  SSTSDIWLLGVCHKIAQDEA-----------LGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S ++LLG C+    DE            + D + +  + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVFLLGKCYHFKSDEPSDQSPNGSCDDMTDESFSRNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 151
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQITGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWANAFVFENPESESWTSQTVK 155

Query: 152 -----------------------RKPLQKPFDREYVE------ILHLFGDSETSPFSIHN 182
                                  + P++     E VE      I+  F DS  + F +H 
Sbjct: 156 KLTASLETSLIGEREFRSQSTHPKSPIRNQETEESVEEQYHRRIISWFADSPFANFGLHR 215

Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
           L++ GK  G  AG W GP  +      L R +  E     +   + IYV       +   
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAH----LLR-KAVEKARDPELQGITIYVAQDCTVYKSDV 270

Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
              +C    S   SV S        I++L+P+ LG E+ N  Y   ++   +    +GI+
Sbjct: 271 IDALCPFTDSEKTSVKS--------IIILIPVRLGGERTNMEYFEFVKGILSLDYCIGII 322

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DPS
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSVKDFPLE--SFHCPSPKKMSFKKMDPS 380

Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESN-GAPLFTVTQTHKKPVNH--SDVLGETGGVPE 419
             IG YC D   F+      +K+ + S    PLFT    H +  +   S V  E     E
Sbjct: 381 CTIGLYCPDMQGFERAAEEITKILKLSKEKYPLFTFVNGHSRDFDFVVSPVQEEKTMFSE 440

Query: 420 DDSLGVMSMND 430
           ++   +   N+
Sbjct: 441 EEHKKLACFNN 451


>gi|268570274|ref|XP_002640735.1| Hypothetical protein CBG19805 [Caenorhabditis briggsae]
          Length = 481

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 122/469 (26%), Positives = 195/469 (41%), Gaps = 104/469 (22%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           IS  T  IW LG            + +  +G+    +  +SR   +YR+ F PIG +  +
Sbjct: 25  ISIDTFPIWALG-----------KEISKEDGIDAMKKYMTSRFWFTYRRNFSPIGGTGPS 73

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           +D  WGCMLR +QML+ + LL   +GR +   ++K  D  Y +IL +F D + + +SIH 
Sbjct: 74  TDQYWGCMLRCAQMLLGEVLLRRHIGRHFEWDIEKTSDV-YEKILQMFFDEKDALYSIHQ 132

Query: 183 LLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQ-SLPMAIYVV 232
           + Q G + G     W GP    +          W  +A     +  L  Q +L MA    
Sbjct: 133 IAQMGVSEGKEVSEWFGPNTAAQVIKKLTIFDDWSNIAVHVALDNILVKQDALTMATTYP 192

Query: 233 SGDE----DGERG-------GAPVVCID-DASRHCSVFSKGQAD-------------WTP 267
           S D      GE G        + ++C++ D  +    F  G  +             W P
Sbjct: 193 SEDAVKLIMGEFGFKSDRISSSHIICMNLDYFKKLLNFENGLVEKHYTSTVPANGTEWRP 252

Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
           +LL++PL LGL  +N  Y+  ++  F  PQ +GI+GGKP  + Y VG+      YLDPH 
Sbjct: 253 LLLMIPLRLGLTSINSCYLSAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHH 312

Query: 328 VQP--------------------------VINIGKDDLE---------------ADTSTY 346
            +P                          + + G  +LE                + STY
Sbjct: 313 CRPKTSKFFVEKEQQQQSSGDSTPEKVEKIDDNGFHELEDLEPLPSQTSDVYTKMNDSTY 372

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV- 405
           H  +++ +  DSIDPSLA+  +C  +++F++ C    K    ++  P+F   +   K + 
Sbjct: 373 HCQMMQWMEYDSIDPSLALALFCETREEFENLCDELQKTTLTASNPPMFEFLEKRPKYLP 432

Query: 406 ---------------NHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 439
                             D+      + ED  +  +S+ DA   A  DD
Sbjct: 433 KFEPYTGVSMKIEMKEFDDIGAANSKIDEDFEVLDVSVEDAETGAEADD 481


>gi|47222154|emb|CAG11580.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 440

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 115/410 (28%), Positives = 174/410 (42%), Gaps = 81/410 (19%)

Query: 65  SSTSDIWLLGVCH--KIAQDEALGDAAGN--------NGLAEFNQDFSSRILISYRKGFD 114
           S  S + LLG C+  K+ +DE + +A             + +F +DF SRI ++YR+ F 
Sbjct: 36  SRNSPVLLLGKCYHFKVEEDEGVAEACCEASDEEDVVGNVEDFRRDFGSRIWLTYREEFP 95

Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDRE--------- 162
           P+  S +TSD GWGCMLR+ QM++AQALL H +GR W   R    +P D E         
Sbjct: 96  PLPGSTLTSDCGWGCMLRAGQMMLAQALLLHFMGRDWTWSRTMSLQPLDTETWTTSAAKR 155

Query: 163 ----------------------------------YVE-------ILHLFGDSETSPFSIH 181
                                             +VE       ++  FGDS ++ F +H
Sbjct: 156 LVASLESSLQGSPGPSDNRGPQNQAAGSAEEAGAHVEGEAFHRTLVSWFGDSPSAQFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSW-----EALARCQRAETGLGCQSLPMAIYVVSGDE 236
            ++  G   G  AG W GP  +         EAL       T    Q   +    V    
Sbjct: 216 RMVHLGLEMGKQAGEWYGPAVVAHILKKAVEEALDPSLAGITAYVSQDCTVYSADVIDGH 275

Query: 237 DGERGGAP-----VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
                 +P     V  +   ++  S     +A    +++LVP+ LG EK NP Y    + 
Sbjct: 276 KASTSASPESSDDVTLLSPNNQAASALPDSRA----VIILVPVRLGGEKTNPDYFNLAKS 331

Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
             +    +GI+GGKP  + Y VG Q++S IY+DPH  Q  +++   D      ++H    
Sbjct: 332 ILSLDYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSP 389

Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQ 399
           + +    +DPS  +GFY R   DF+      +KL + S+    P F   Q
Sbjct: 390 KKMPFTKMDPSCTLGFYSRSAQDFEKIKQELTKLLQPSSKEKYPAFIFVQ 439


>gi|121934653|sp|Q0U199.1|ATG4_PHANO RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
          Length = 467

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 116/354 (32%), Positives = 158/354 (44%), Gaps = 87/354 (24%)

Query: 92  NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
           N  + F  DF SR+ ++YR GF PI  S+                      TSD G+GCM
Sbjct: 91  NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150

Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
           +RS Q ++A AL   RLGR WR   +   D+++ EIL LF D   +PFSIH  ++ G A 
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209

Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G   G W GP A  R  + LA   R E GL        +YV SGD      GA V   +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVYV-SGD------GADVY--E 252

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  +  +V   G   W P L+LV   LG++K+ P Y   L+ +   PQS+GI GG+P AS
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKASLQIPQSIGIAGGRPSAS 310

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEA---------------------------- 341
            Y VGVQ  +  YLDPH  +P++      L A                            
Sbjct: 311 HYFVGVQGNNFYYLDPHSTRPLLPFHPPSLAAATSDTPNLTASTTSVSSTTSSTTIVPPA 370

Query: 342 -----------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
                            D ST H+  IR + +  +DPS+ + F    + D+ D+
Sbjct: 371 DSIPAPSDPRQSLYPPSDLSTCHTRRIRRLQIREMDPSMLLAFLVTSEADYQDW 424


>gi|407917424|gb|EKG10733.1| Peptidase C54 [Macrophomina phaseolina MS6]
          Length = 437

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 107/342 (31%), Positives = 157/342 (45%), Gaps = 54/342 (15%)

Query: 87  DAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSK-----------------------IT 122
           D+  N G  + F  DF +R+ I+YR  F  I  S+                        +
Sbjct: 94  DSDANGGWPSPFLDDFEARVWITYRSNFAAIPKSQDPNATTAMSFSVRFRNQISNQGGFS 153

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           SD GWGCM+RS Q L+A AL   RLGR WR+      +R    IL LF D   +PFSIH 
Sbjct: 154 SDTGWGCMIRSGQSLLANALQVLRLGRAWRRGQDSQGERR---ILSLFADDPKAPFSIHR 210

Query: 183 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            ++ G  A G   G W GP A  R  +AL+         G +   + +Y+     D    
Sbjct: 211 FVEHGAVACGKHPGEWFGPSATARCIQALSN--------GYEDAGLRVYITGDGSD---- 258

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
                  +D+     V       + P L+LV + LG+++V P Y   L+ +    QS+GI
Sbjct: 259 -----VYEDS--FMKVAKDANNTFHPTLVLVGIRLGIDRVTPVYWEALKASLQLSQSIGI 311

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDS 358
            GG+P AS Y VG Q     YLDPH  +P + +     D  + D  + H+  +R +H+  
Sbjct: 312 AGGRPSASHYFVGTQGSYFFYLDPHTTRPFLPLHSDLSDYTQEDIDSCHTRRLRRLHVKE 371

Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
           +DPS+ I F  RD+ D+ ++     K   E +G P+  V  +
Sbjct: 372 MDPSMLIAFLIRDETDWQNW----RKAVAEVHGKPVIHVADS 409


>gi|432855098|ref|XP_004068071.1| PREDICTED: cysteine protease ATG4C-like [Oryzias latipes]
          Length = 482

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 120/424 (28%), Positives = 179/424 (42%), Gaps = 96/424 (22%)

Query: 65  SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAE---------FNQDFSSRILISYRKGFD 114
           S  S + LLG C H  A D+   D A      E         F +DF+SR+ ++YR+ F 
Sbjct: 36  SRNSPVLLLGRCYHFKADDDGSADEASCREPEEGFSMGNVEAFRKDFTSRVWLTYREEFP 95

Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQ----------- 156
           P+  S +T+D GWGC+LR+ QM++AQAL+ H LGR W        +PL            
Sbjct: 96  PLPGSTLTTDCGWGCLLRAGQMMLAQALVLHFLGRDWTWSEALTLQPLDTETWTASAAKR 155

Query: 157 -------------KPFDREYVE-----------------------ILHLFGDSETSPFSI 180
                        K  DR++ E                       I+  FGD+ ++   +
Sbjct: 156 LVASLEASLQGSPKNSDRQHSEPQSSSQGSAEEAEAHLKEMYHRTIISWFGDTSSALLGL 215

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG------------------- 221
           H L++ G   G  AG+W GP  +    +     +  ++GL                    
Sbjct: 216 HRLVRLGLTMGKNAGNWYGPAVVAHILKKAVE-EAMDSGLAGITAYVSQDCTVYSADVAD 274

Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
           C   P A            GG P    +D     S+    QA    +++L+P+ LG EK+
Sbjct: 275 CHKPPSARQASVSPPIA--GGGP--SKEDQPGSASILPDSQA----VIILIPVRLGGEKI 326

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
           NP Y   ++   +    +GI+GGKP  + Y VG Q++S IY+DPH  Q  +++   D   
Sbjct: 327 NPEYFEFVKNILSVEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSNGDFP- 385

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQ 399
              ++H    + I    +DPS  IGFY R   D+D      SKL + S     P FT  Q
Sbjct: 386 -LQSFHCPSPKKIPFTRMDPSCTIGFYSRSLQDYDRIREELSKLLQPSTKEKYPAFTFVQ 444

Query: 400 THKK 403
            H +
Sbjct: 445 GHGR 448


>gi|354470829|ref|XP_003497647.1| PREDICTED: cysteine protease ATG4C [Cricetulus griseus]
          Length = 458

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 115/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENSDSDSWTSNTVK 155

Query: 158 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 181
            F          +RE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELRTTALSLKETIGKYSDDHAVQNEIYHRKIISWFGDSPVAVFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTNSSTSGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKILKISSKEKYPLFTFVNGHSRDFDFT 429


>gi|336467357|gb|EGO55521.1| hypothetical protein NEUTE1DRAFT_85886 [Neurospora tetrasperma FGSC
           2508]
 gi|350288001|gb|EGZ69237.1| hypothetical protein NEUTE2DRAFT_94213 [Neurospora tetrasperma FGSC
           2509]
          Length = 506

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 108/309 (34%), Positives = 151/309 (48%), Gaps = 50/309 (16%)

Query: 97  FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 133
           F  DF SRI ++YR  F       DP   S ++                SD GWGCM+RS
Sbjct: 171 FLDDFESRIWMTYRTDFAFIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 230

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
            Q L+A A+L  RLGR WR+      D E  +I+ LF D   +P+S+HN ++ G  A G 
Sbjct: 231 GQSLLANAILIARLGREWRRGTD--LDAE-KDIIALFADDPRAPYSLHNFVKYGATACGK 287

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  +ALA     ++GL   S                G  P V  D   
Sbjct: 288 YPGEWFGPSATARCIQALA--DEKQSGLRVYST---------------GDLPDVYEDS-- 328

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
              +V +     + P L+LV   LG++K+N  Y   L  T   PQS+GI GG+P +S Y 
Sbjct: 329 -FMAVANPDGRGFQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 387

Query: 313 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           VGVQ +   YLDPH  +P +   +D       +  T H+  +R +H+  +DPS+ IGF  
Sbjct: 388 VGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELDTCHTRRLRQLHIGDMDPSMLIGFLI 447

Query: 370 RDKDDFDDF 378
           +D+DD+D +
Sbjct: 448 KDEDDWDTW 456


>gi|308490628|ref|XP_003107506.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
 gi|308251874|gb|EFO95826.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
          Length = 478

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 174/386 (45%), Gaps = 82/386 (21%)

Query: 92  NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
           +GL    +  +SR+  +YR+ F PIG +  ++D GWGCMLR +QML+ + LL   +GR +
Sbjct: 47  DGLEAMKKYMTSRLWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLRRHIGRHF 106

Query: 152 RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR------ 205
              ++K     Y +IL +F D + + +SIH + Q G   G     W GP    +      
Sbjct: 107 EWDIEKT-SEVYDKILQMFFDEKDALYSIHQIAQMGVTEGKKVSEWFGPNTAAQVIKKLT 165

Query: 206 ---SWEALARCQRAETGLGCQ-SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV---- 257
               W  +A     +  L  + +L MA    S +       + +  + +  ++ ++    
Sbjct: 166 IFDDWSNIAVHVALDNILVKEDALTMATTYPSDN------ASYIFAVHNFLKYFTLNLTF 219

Query: 258 --FSK-GQ-----------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
             F++ GQ            DW P+L+++PL LGL  +NP Y+P ++  F  PQ +GI+G
Sbjct: 220 PNFAENGQIEKPRPSSGCTTDWRPLLVMIPLRLGLTSINPCYLPAIQKFFELPQCVGIIG 279

Query: 304 GKPGASTYIVGVQEESAIYLDPH-----------------------------DVQPVINI 334
           GKP  + Y VG+      YLDPH                             D+Q  I+ 
Sbjct: 280 GKPNLAHYFVGIAGTKLFYLDPHHCRAKTTKRDAGVTTNTMISSITTTDAQLDIQNQIDD 339

Query: 335 GK----DDLE------------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
                 +DLE             D STYH  +++ +  +SIDPSLA+  +C  + DFD  
Sbjct: 340 SDFHKLEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEYESIDPSLALALFCETRQDFDTL 399

Query: 379 CARASKLAEESNGAPLFTVTQTHKKP 404
           C    K    S+  P+F   +  K+P
Sbjct: 400 CEELQKTTLPSSVPPMFEFLE--KRP 423


>gi|53132082|emb|CAG31871.1| hypothetical protein RCJMB04_12m14 [Gallus gallus]
          Length = 343

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 100/303 (33%), Positives = 139/303 (45%), Gaps = 48/303 (15%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
           E   D +SR+  +YRK F  IG +  TSD GWGCMLR  QM+ AQAL+   LGR WR   
Sbjct: 40  EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWIK 99

Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR---------S 206
            K     Y  +L+ F D + S +SIH + Q G   G + G W GP  + +         +
Sbjct: 100 GKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFDT 159

Query: 207 WEALA----------------RCQRAETGLGCQSLPM----AIYVVSGDEDGERGGAPVV 246
           W +LA                 CQ   +  G  + P      +Y    +E G R    + 
Sbjct: 160 WSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPAVEADVLYNGYPEEAGVRDKLSL- 218

Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
                             W P++LL+PL LGL ++N  YI TL+  F  PQSLG++GGKP
Sbjct: 219 ------------------WKPLVLLIPLRLGLTEINEAYIETLKHCFMMPQSLGVIGGKP 260

Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
            ++ Y +G   E  IYLDPH  QP +         D S +       + +  +DPS+A+ 
Sbjct: 261 NSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCLPDESFHCQHPPCRMSIAELDPSIAVV 320

Query: 367 FYC 369
             C
Sbjct: 321 CSC 323


>gi|297265289|ref|XP_002799164.1| PREDICTED: cysteine protease ATG4B-like [Macaca mulatta]
          Length = 358

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 107/340 (31%), Positives = 164/340 (48%), Gaps = 44/340 (12%)

Query: 94  LAEFNQDF---SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
            AE+ +DF   S  + I  RK     G +  TSD GWGCMLR  QM+ AQAL+   LGR 
Sbjct: 13  FAEY-EDFPETSEPVWILGRKYSIFTGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRD 71

Query: 151 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
           WR   +K     Y  +L+ F D + S +SIH + Q G   G + G W GP  + +  + L
Sbjct: 72  WRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKL 131

Query: 211 ARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFS 259
           A      +        +A+++     V  +E        V C        D+ RHC+ F 
Sbjct: 132 AVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFP 183

Query: 260 KGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
            G       + W P++LL+PL LGL  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +
Sbjct: 184 AGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFI 243

Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP--SLAIGFYCRD 371
           G   ES+ +  P  + P+                 + + H   + ++P  S A+GF+C+ 
Sbjct: 244 GYVGESSSHRVPVGLCPLRAF-------------CEQVPHARCNIVEPEGSRALGFFCKT 290

Query: 372 KDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
           +DDF+D+C +  KL+      P+F + +     +   DVL
Sbjct: 291 EDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVL 330


>gi|410921904|ref|XP_003974423.1| PREDICTED: cysteine protease ATG4C-like [Takifugu rubripes]
          Length = 468

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 117/416 (28%), Positives = 176/416 (42%), Gaps = 71/416 (17%)

Query: 65  SSTSDIWLLGVCHKI------AQDEALGDAAGNNGLA----EFNQDFSSRILISYRKGFD 114
           S  S + LLG C+         Q EA  +A+   G+     +F +DF SRI ++YR+ F 
Sbjct: 29  SRNSPVLLLGKCYHFKAEEDEGQTEACREASDEEGVMGNVEDFRRDFGSRIWLTYREEFP 88

Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE--------- 162
           P+  S +TSD GWGCMLR+ QM++AQALL H LGR   W   +  +P D E         
Sbjct: 89  PLPGSSLTSDCGWGCMLRAGQMMLAQALLLHFLGRDWTWSGAMSLQPLDTETWTTSAAKR 148

Query: 163 ----------------------------------------YVEILHLFGDSETSPFSIHN 182
                                                   +  ++  FGDS ++ F +H 
Sbjct: 149 LVASLESSLQASPGPSDPVVSQRQVAGSGEEAGVHTDGGFHRTLVSWFGDSPSAQFGLHR 208

Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS-LPMAIYVVSGDEDGERG 241
           +++ G A G  AG W GP  +    +      R     G  S +     V S D      
Sbjct: 209 MVRLGLAMGKRAGEWYGPAVVAHILKKAVEEARDPCLAGISSYVSQDCTVYSADVIDSHK 268

Query: 242 GAPVVCID----DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
            +     +     +S H S  +    D   +++LVP+ LG EK NP Y    +   +   
Sbjct: 269 ASASAAAERPDVTSSSHNSQPASASPDSRAVIILVPVRLGGEKTNPDYFNLAKSFLSLDY 328

Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
            +GI+GGKP  + Y VG Q++S IY+DPH  Q  +++   D      ++H    + +   
Sbjct: 329 CIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSPKKMPFT 386

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKKPVNHSDVL 411
            +DPS   GFY R   DF+      ++L + S     P F   Q H +  + S  L
Sbjct: 387 KMDPSCTFGFYSRSAQDFERIKHELTELLQPSAKEKYPAFIFVQGHGRDYDLSASL 442


>gi|119195519|ref|XP_001248363.1| cysteine protease atg4 [Coccidioides immitis RS]
 gi|303321428|ref|XP_003070708.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
           SOWgp]
 gi|121769827|sp|Q1E5M9.1|ATG4_COCIM RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|240110405|gb|EER28563.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
           SOWgp]
 gi|320040173|gb|EFW22106.1| cysteine protease atg4 [Coccidioides posadasii str. Silveira]
 gi|392862420|gb|EAS36938.2| cysteine protease atg4 [Coccidioides immitis RS]
          Length = 432

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 100/297 (33%), Positives = 138/297 (46%), Gaps = 50/297 (16%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF S+   +YR  F  I  S+                        T+D GWGCM+RS
Sbjct: 105 FLDDFESKFWFTYRSNFPAIPKSRDPDTPLALTLSVRLRSQFLDTHGFTADTGWGCMIRS 164

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
            Q L+A AL    LGR WR+  +    +E  E+L LF D+  +PFSIH  +  G  A G 
Sbjct: 165 GQSLLANALSILNLGRDWRRGSKI---KEECELLSLFADNPQAPFSIHRFVDYGASACGK 221

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  EAL+          C+   + +YV+S   D        +   D  
Sbjct: 222 HPGEWFGPSATARCIEALSN--------ECKHTDLNVYVMSDGSDVHEDQFRQIAGPDGI 273

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
           R             P L+L+ + LG+E V P Y   LR    +PQS+GI GG+P +S Y 
Sbjct: 274 R-------------PTLILLGVRLGIESVTPVYWEALRAIIRYPQSVGIAGGRPSSSLYF 320

Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGF 367
           +GVQ     YLDPH  +P ++   D      +  TYH+  +R +H+  +DPS+ IGF
Sbjct: 321 IGVQGPYFFYLDPHHTRPAVSWNPDSTLSPENLDTYHTRRLRRLHIREMDPSMLIGF 377


>gi|402080175|gb|EJT75320.1| cysteine protease ATG4 [Gaeumannomyces graminis var. tritici
           R3-111a-1]
          Length = 468

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 105/313 (33%), Positives = 145/313 (46%), Gaps = 56/313 (17%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF SRI +SYR GF PI  S                         T+D GWGCM+R+
Sbjct: 128 FLDDFESRIWVSYRSGFPPIPRSTDPAATSRMSFAMRLKTMTDQQAAFTTDSGWGCMIRT 187

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
            Q L+A  LL HRLGR WR+  +   +R+   +L LF D   +P+SIH  ++ G A  G 
Sbjct: 188 GQSLLANTLLSHRLGRGWRRGEKSDEERK---LLSLFADDPRAPYSIHKFVEHGAAKCGK 244

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  EALA               + +Y          G  P V  D   
Sbjct: 245 YPGEWFGPSATARCIEALANTNEKT---------LRVYST--------GDLPDVYEDS-- 285

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
               V       + P L+LV   LG++K+N  Y  +L  T   PQS+GI GG+P +S Y 
Sbjct: 286 -FMEVARPDGKTFHPTLILVSTRLGIDKINQVYWESLTATLQMPQSVGIAGGRPSSSHYF 344

Query: 313 VGVQE------ESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
           VG Q        +  YLDPH  +P +    D      +D  + H+  +R +H+  +DPS+
Sbjct: 345 VGAQRSDEDQGSNLFYLDPHHTRPALPYFDDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 404

Query: 364 AIGFYCRDKDDFD 376
            IGF   D++D++
Sbjct: 405 LIGFLITDEEDWE 417


>gi|62899783|sp|Q86ZL5.1|ATG4_PODAS RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|27802993|emb|CAD60696.1| unnamed protein product [Podospora anserina]
          Length = 500

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 64/310 (20%)

Query: 97  FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
           F  DF SRI ++YR GF+ I                      GD +  +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
            Q L+A A+L  R GR WR+      +RE   I+ LF D   +P+SI N +  G A  G 
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIERE---IVCLFADDPRAPYSIQNFVNHGAAACGK 289

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI---YVVSGDEDGERGGAPVVCID 249
             G W GP        A ARC  +      + LP      ++ + + DG           
Sbjct: 290 YPGEWFGP-------SATARCIHSLRVYLTRDLPEVYEDNFMSTANPDGNH--------- 333

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
                          + P L+LV   LG++K+NP Y   L  T   PQ++GI GG+P +S
Sbjct: 334 ---------------FHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSS 378

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGK---DDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
            Y +G Q +   YLDPH  +P +   +   D    +  + H+  +RH+H++ +DPS+ IG
Sbjct: 379 HYFIGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSCHTRRLRHLHVEDMDPSMLIG 438

Query: 367 FYCRDKDDFD 376
           F  +D+DD+D
Sbjct: 439 FLIKDEDDWD 448


>gi|30109219|gb|AAH41862.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
 gi|119623096|gb|EAX02691.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
 gi|119623098|gb|EAX02693.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
          Length = 321

 Score =  157 bits (397), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 92/291 (31%), Positives = 147/291 (50%), Gaps = 32/291 (10%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 1   MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60

Query: 190 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 233
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 61  EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 115

Query: 234 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
            D  G+R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 116 ADTAGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 168

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 169 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 228

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 229 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 278


>gi|37991904|gb|AAR06350.1| putative autophagy, 3'-partial [Oryza sativa Japonica Group]
          Length = 207

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 83/161 (51%), Positives = 105/161 (65%), Gaps = 7/161 (4%)

Query: 14  FSKSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLL 73
           F      + NRSL         S  ++R+   GSM R     LG S+   + ++SD+W L
Sbjct: 53  FEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF----LGASK---ALTSSDVWFL 105

Query: 74  GVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRS 133
           G C+K++ +E    +   +G A F +DFSSRI I+YRKGFD I DSK TSDV WGCM+RS
Sbjct: 106 GKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRS 165

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 174
           SQMLVAQAL+FH LGR WRKP QKP+  EY+ ILH+FGDSE
Sbjct: 166 SQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSE 206


>gi|170036509|ref|XP_001846106.1| Autophagy-specific protein [Culex quinquefasciatus]
 gi|167879174|gb|EDS42557.1| Autophagy-specific protein [Culex quinquefasciatus]
          Length = 379

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 115/336 (34%), Positives = 172/336 (51%), Gaps = 30/336 (8%)

Query: 77  HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 136
           H+I     L +A     L +  +D  SR+  +YR+GF PIG S+ TSD GWGCMLR  QM
Sbjct: 13  HRIRCIFGLSNALETLDLDQIRRDVQSRLWCTYRRGFVPIGGSQHTSDKGWGCMLRCGQM 72

Query: 137 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLA-AG 195
           ++AQALL   LGR W    +   D  Y+ I++ F D++ +PFS+H +   G++      G
Sbjct: 73  VLAQALLQLHLGRDWEWTAETR-DETYLRIVNRFEDNKAAPFSLHQIALTGESSEEKRVG 131

Query: 196 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 255
            W GP  + +  + L +              + ++V              +  D+    C
Sbjct: 132 EWFGPNTVAQVLKKLVKFD--------DWCSVVVHVALD---------STLATDEVVELC 174

Query: 256 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
              S     W P+LL++PL LGL ++NP Y+  L+  F    + G++GG+P  + Y +G 
Sbjct: 175 EDKSDAGTSWKPLLLIIPLRLGLSEINPIYVAGLKKCFELAGNCGMIGGRPNQALYFIGY 234

Query: 316 QEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
             + A++LDPH VQ   NIG     D+ E D S +H    R I+  ++DPSLA+ F C  
Sbjct: 235 VGDEALFLDPHTVQRSGNIGDKTGLDEREMDES-FHQRYARRINFKAMDPSLALCFLCAT 293

Query: 372 KDDFDDFCARASKLAEESNGAP---LFTVTQTHKKP 404
           + +FDD  AR    AE+ NG     LF VT+T + P
Sbjct: 294 RTEFDDLLAR---FAEDLNGGSCQGLFEVTKTRQAP 326


>gi|157818033|ref|NP_001101418.1| cysteine protease ATG4C [Rattus norvegicus]
 gi|149044549|gb|EDL97808.1| similar to APG4 autophagy 4 homolog C [Rattus norvegicus]
          Length = 458

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 115/409 (28%), Positives = 172/409 (42%), Gaps = 80/409 (19%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE          A+ D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDESKVLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG------------------------- 148
             I  S +T+D GWGC LR+ QML+AQ L+ H LG                         
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIESSDSDSWTSNTIH 155

Query: 149 -------------RPWRKPL--------QKPFDRE------YVEILHLFGDSETSPFSIH 181
                        R  R P         + P D        + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELRTPAVSLKETSGKHPDDHAVQSEIYHRQIISWFGDSPVAVFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNIDYLEFVKGVLSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 408
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSRDFDFT 429


>gi|342877133|gb|EGU78640.1| hypothetical protein FOXB_10826 [Fusarium oxysporum Fo5176]
          Length = 449

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 119/372 (31%), Positives = 171/372 (45%), Gaps = 59/372 (15%)

Query: 79  IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 116
           +A D+   D    +G   F  DF SRI ++YR  FDPI                      
Sbjct: 99  LAYDDQSNDGGWPSG---FITDFESRIWMTYRSEFDPIPRSTNPQATSSLSLSMRLKSQL 155

Query: 117 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 175
           GD S  +SD GWGCM+RS Q L+A  +   RLGR WR   Q     E   IL  F D   
Sbjct: 156 GDQSPFSSDSGWGCMIRSGQSLLANTIALVRLGRDWR---QGQSLEEECRILKDFADDPR 212

Query: 176 SPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
           +P+SIH+ ++ G  A G   G W GP A  R  +ALA                +I V S 
Sbjct: 213 APYSIHSFVRHGASACGKYPGEWFGPSATARCIQALANSHEP-----------SIRVYST 261

Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
                 G  P V  DD  +  +    G+A + P L+LV   LGL+K+ P Y   L     
Sbjct: 262 ------GDGPDVYEDDFMKIAN--PTGEA-FHPTLVLVGTRLGLDKITPVYWEALIAALQ 312

Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVI 351
            PQS+GI GG+P +S Y +G Q     YLDPH  +P +   ++ ++    +  + H+  +
Sbjct: 313 MPQSVGIAGGRPSSSHYFIGSQGSFLFYLDPHHTRPALPYHENPMDYTSEEIESCHTARL 372

Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 411
           R IH+  +DPS+ IGF  R ++D+ D+     +  +   G  +  V Q  +  V+     
Sbjct: 373 RRIHVREMDPSMLIGFLIRSEEDWQDW----KRSVKHVQGKSIIHVAQ--RNAVHGGSSE 426

Query: 412 GETGGVPEDDSL 423
           G  G + E ++L
Sbjct: 427 GREGAIDEVETL 438


>gi|74665877|sp|Q4U3V5.1|ATG4_CRYPA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|66576169|gb|AAY51673.1| putative cysteine protease Atg4 [Cryphonectria parasitica]
          Length = 459

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 109/332 (32%), Positives = 152/332 (45%), Gaps = 60/332 (18%)

Query: 79  IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 120
           +A DE L DA        F  DF SR+ ++YR  F+PI  S                   
Sbjct: 109 LAYDELLEDAGWP---IAFLDDFESRVWMTYRSEFEPISKSNDPRASAALSFAMRLRTLA 165

Query: 121 ----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
                +SD GWGCM+RS Q L+A  L+  +LGR WR+       R+  EIL  F D   +
Sbjct: 166 DQGGFSSDTGWGCMIRSGQSLLANTLVICQLGRDWRRGKAA---RQEREILARFADDPRA 222

Query: 177 PFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
           P+S+HN ++ G  A G   G W GP A  R  +ALA    +          + +Y     
Sbjct: 223 PYSLHNFVRHGAVACGKFPGEWFGPSATARCIQALANSNESS---------LRVYST--- 270

Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
                G  P V  D      +V       + P L+LV   LG++K+N  Y   L  T   
Sbjct: 271 -----GDLPDVYEDS---FMAVAKPDGETFHPTLILVGTRLGIDKINQVYWEALTATLQM 322

Query: 296 PQSLGIVGGKPGASTYIVGVQEES--------AIYLDPHDVQPVINIGKD---DLEADTS 344
           PQS+GI GG+P AS Y +G Q             YLDPH  +P +   +D       D +
Sbjct: 323 PQSVGIAGGRPSASHYFIGAQRSGDAYEPGSYLFYLDPHCTRPALPFHEDVDQYTSDDIN 382

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 376
           T H+  +R +H+  +DPS+ IGF  +D+DD+D
Sbjct: 383 TCHTRRLRRLHVRDMDPSMLIGFLIKDEDDWD 414


>gi|440638438|gb|ELR08357.1| hypothetical protein GMDG_03152 [Geomyces destructans 20631-21]
          Length = 448

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 101/307 (32%), Positives = 148/307 (48%), Gaps = 49/307 (15%)

Query: 97  FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 133
           F  DF S++  SYR GF       DP   S ++                SD GWGCM+RS
Sbjct: 108 FLDDFESKLRFSYRTGFPVIPRSEDPKASSTMSFSVRLRSQLSDQGGFSSDTGWGCMIRS 167

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
            Q L+A +++  RL R WR+ + +  +RE   I+ LF D   +P+SIH  ++ G +A G 
Sbjct: 168 GQSLLANSMVILRLSRGWRRGVGRDKERE---IVSLFADDPRAPYSIHKFVEHGAEACGK 224

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  + LA+          +S  + +Y+     D  + G   V   D  
Sbjct: 225 YPGQWFGPSATARCIQELAKRH--------ESADVRVYITGDGSDVYKDGFMSVAKPDG- 275

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
                      ++ P L+LV   LG++KV P Y   L+ +   PQS+GI GG+P +S Y 
Sbjct: 276 ----------VNFKPTLILVGTRLGIDKVTPVYWEALKASLQMPQSVGIAGGRPSSSHYF 325

Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           VGVQ     YLDPH     I    D  E   A+  + H+  +R + +  +DPS+ IGF  
Sbjct: 326 VGVQGSHFFYLDPHQTMAAIPFHTDVDEYTPAEIDSCHTRRLRRLDIKEMDPSMLIGFLI 385

Query: 370 RDKDDFD 376
           RD+ D++
Sbjct: 386 RDEKDWE 392


>gi|50543736|ref|XP_500034.1| YALI0A13277p [Yarrowia lipolytica]
 gi|62899740|sp|Q6CH28.1|ATG4_YARLI RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|49645899|emb|CAG83963.1| YALI0A13277p [Yarrowia lipolytica CLIB122]
          Length = 545

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 124/393 (31%), Positives = 167/393 (42%), Gaps = 98/393 (24%)

Query: 96  EFNQDFSSRILISYRKGF--------------------------DPIGDSKITSDVGWGC 129
           +F  D  SRI +SYR GF                          DP G    TSDVGWGC
Sbjct: 64  DFLADVQSRIWLSYRTGFPLIPKSDGSGTIHLGKLKNMIRGGGFDPRG---YTSDVGWGC 120

Query: 130 MLRSSQMLVAQALLFHRLGRPWR----------------------------KPLQKPFDR 161
           M+R+SQ L+A ALLF  LGR WR                            K  +     
Sbjct: 121 MIRTSQSLLANALLFRHLGRGWRWNKGDDFVYLSEGNTESRGGESRNGGANKEQETAVSE 180

Query: 162 EYV----EILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRA 216
           E       I+  F DS  SPFSIH  ++ G KA    AG W GP A   S  AL      
Sbjct: 181 ETAVSEETIISWFLDSPDSPFSIHKFVRHGEKACSTPAGDWFGPSAAGSSIYAL------ 234

Query: 217 ETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
                C   P   + +Y      +G  GG   V  D+      +   G     P+L+L  
Sbjct: 235 -----CNEFPDSGLKVYY-----NGNGGGD--VYEDE------LLETG----FPLLVLCG 272

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           L LG++ VNP Y  +LR   + PQS+GI GG+P  S Y  G Q E   YLDPH  +P + 
Sbjct: 273 LRLGIDNVNPIYWDSLRQMLSLPQSVGIAGGRPFTSHYFFGFQGEQLFYLDPHQPKPAVK 332

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 393
                 + DT+++HS  I  +HL  +DPS+ +GFY   + D++ F    +   E+++   
Sbjct: 333 T----TDKDTTSFHSSRIWKLHLKEMDPSMLVGFYITSEADWETFKGSLTASKEKTSSQI 388

Query: 394 LFTVTQTHKKP-VNHSDVLGETGGVPEDDSLGV 425
           +      H  P  +  D     GG  +DD + V
Sbjct: 389 VHIHPSRHNIPSFDEEDEYVSIGGASDDDFVDV 421


>gi|158296556|ref|XP_316946.4| AGAP008497-PA [Anopheles gambiae str. PEST]
 gi|157014766|gb|EAA12240.4| AGAP008497-PA [Anopheles gambiae str. PEST]
          Length = 389

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 117/346 (33%), Positives = 170/346 (49%), Gaps = 34/346 (9%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I  +   +W+LG  +  + D           L    QD  SR+  +YR+GF PIG++++T
Sbjct: 21  IPKTNDTVWILGKQYNASDD-----------LEAIRQDVQSRLWCTYRRGFVPIGNTQLT 69

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           +D GWGCMLR  QM++AQALL   LGR W    +   D  Y+ I++ F DS+ +PFS+H 
Sbjct: 70  TDKGWGCMLRCGQMVLAQALLQLHLGRDWVWEAETR-DDIYLNIVNRFEDSKQAPFSLHQ 128

Query: 183 L-LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
           + L    +     G W GP  + +  + L +         C+   + I+V   +      
Sbjct: 129 IALMGDSSEEKRIGEWFGPNTVAQVLKKLVKFDD-----WCR---LVIHVALDN------ 174

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
               V  D+    C V  K    W P+LL++PL LGL +VNP YI  L+  F  P S G+
Sbjct: 175 ---TVATDEIVELC-VDKKEPEAWKPLLLIIPLRLGLSEVNPIYIEGLKKCFQLPGSCGM 230

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDS 358
           +GG+P  + Y +G     A+YLDPH VQ V  +G     A+     T+H      I   S
Sbjct: 231 IGGRPNQALYFIGYVGGEALYLDPHTVQRVGTVGSKQDPAEQELDETFHQRYASRISFTS 290

Query: 359 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
           +DPSLA+ F C  +  FD   AR +          LF VT+T + P
Sbjct: 291 MDPSLAVCFLCVSRQQFDQLVARFNDSVNGGTSQALFEVTKTRQAP 336


>gi|332029697|gb|EGI69576.1| Cysteine protease ATG4B [Acromyrmex echinatior]
          Length = 383

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 107/346 (30%), Positives = 166/346 (47%), Gaps = 48/346 (13%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 127
           +W+LG  +   ++           L    +D  S++  +YRKGF PIG  +S  TSD GW
Sbjct: 23  VWILGRKYNAIKE-----------LDAIRRDIRSKLWFTYRKGFVPIGGCNSTFTSDKGW 71

Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
           GCMLR  QM++AQAL+   LG+ W+  + +  +  Y++IL  F D   + FSIH +   G
Sbjct: 72  GCMLRCGQMVLAQALITLHLGKDWQW-MPETKNNTYLKILRRFEDKRAAAFSIHQIALMG 130

Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
            + G   G W GP  + +  + L       +        + I+V   +          + 
Sbjct: 131 ASEGKEVGQWFGPNTIAQVLKKLIVYDEWSS--------LTIHVALDN---------TLI 173

Query: 248 IDDASRHCSVFS------------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
           ++D  R C V              +  + W P+LLL+PL LGL ++NP YI  L+ +F  
Sbjct: 174 VNDILRQCRVEGGVTAEADGEIPLRAPSQWKPLLLLIPLRLGLSEINPVYINGLKTSFKI 233

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHSDVI 351
            QSLG++GGKP  + Y +G   +  IYLDPH  Q        I ++++E D S YH    
Sbjct: 234 SQSLGVIGGKPNLALYFIGCVGDEVIYLDPHTTQKSGSIEDKISEEEIEMDIS-YHCKSA 292

Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
             I +  +DPS+A+ F+C  + +F   C    +        PLF +
Sbjct: 293 SRIPITGMDPSVALCFFCATEKEFKSLCKSMQEELILPEKQPLFEL 338


>gi|313228003|emb|CBY23152.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 102/289 (35%), Positives = 143/289 (49%), Gaps = 29/289 (10%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
           L +   DF SR+  +YR+ F  IG S  TSD GWGCMLR+ QMLVA+ LL  RLGR +  
Sbjct: 39  LEDIQGDFQSRLWFTYRRNFASIGGSGPTSDQGWGCMLRAGQMLVAECLLRQRLGRNYVW 98

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
                 D  Y EIL LF D+ ++  S+  + L    A   A G W GP  M    + L R
Sbjct: 99  SESSIEDERYTEILELFRDTHSAELSLQQIALTGATAEKRAVGEWFGPNTMA---QVLKR 155

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
             ++      +SL   + V             VV ++D S    + + G+   TP++L++
Sbjct: 156 ITKS------RSLGFGVTVAMDS---------VVSVEDVS--AEIINGGKP--TPLVLMI 196

Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA----IYLDPHDV 328
           PL LGL  VN  Y+  L++       +GI+GGKP  + Y VG QE       +YLDPH  
Sbjct: 197 PLRLGLNSVNEIYVNPLKIFLASKYCVGIMGGKPNQAHYFVGYQETVEDTWLLYLDPHTT 256

Query: 329 Q--PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
           Q  PV        E    + H+D +  I    +DPSLA+GF+    ++F
Sbjct: 257 QQSPVSVNNNMPFEQFDKSLHTDKLCWIKALKLDPSLAVGFFFNTVEEF 305


>gi|403356037|gb|EJY77606.1| Cysteine protease family C54 putative [Oxytricha trifallax]
 gi|403376523|gb|EJY88241.1| Cysteine protease family C54 putative [Oxytricha trifallax]
          Length = 480

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 159/312 (50%), Gaps = 41/312 (13%)

Query: 101 FSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFH----RLGRPWRKPL 155
           F S    +YR   + PIG S   SD GWGCM+R+ QML+ QA++ H     L   + + +
Sbjct: 154 FKSVTWFTYRNELELPIGSSTYHSDAGWGCMVRTGQMLLFQAMMRHVFEDNLKYEYIEKI 213

Query: 156 QKPFDREYVEILHLF---GDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
            + +  EY+ +L LF   G+ + SP+SI N+   G       G W GP A+    + L +
Sbjct: 214 TE-YREEYLNLLRLFQDNGEGQFSPYSIQNIAFQGLKIDRKPGDWYGPQAISIVLKRLTK 272

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILLL 271
             +          P+  + +             VC++  + + +V  +   DWT  + ++
Sbjct: 273 IYK----------PVKQFTM------------YVCLE-GNIYLNVIQEKSKDWTQSVFIV 309

Query: 272 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES--AIYLDPHDVQ 329
           +PL LGL  + P Y+ +++  FTFPQ++GI GG+  ++ Y +G+ + S   IYLDPH VQ
Sbjct: 310 IPLRLGLNYIEPEYLSSVKKVFTFPQNVGIAGGRENSALYFIGISDSSNNLIYLDPHLVQ 369

Query: 330 ---PVINIGKDD-LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
              P  N+  ++      S++H    + + L+ +  S+AIGFY RD +DF DF  R   L
Sbjct: 370 KSVPTCNMQTNEQFYQYESSFHCTKFKKMPLNRMCTSVAIGFYIRDYNDFLDFQTRIKSL 429

Query: 386 AEESNGAPLFTV 397
           +   N   +FTV
Sbjct: 430 SSGENS--IFTV 439


>gi|327270876|ref|XP_003220214.1| PREDICTED: cysteine protease ATG4C-like [Anolis carolinensis]
          Length = 459

 Score =  154 bits (390), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 118/437 (27%), Positives = 183/437 (41%), Gaps = 84/437 (19%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAA-GNN----------GLAEFNQDFSSRILISYRKGF 113
           S  S ++LLG C+    DE    +  G+N           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVFLLGKCYHFKTDEPTEQSPNGSNYDVTEEEVSRNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------------------ 155
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIKGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWPDALVNENPESESWTSHTVK 155

Query: 156 ------------QKPFDREYV----------------------EILHLFGDSETSPFSIH 181
                       +K F  + +                      +I+  FGDS  + F +H
Sbjct: 156 KLTASFEASLIGEKEFKNQSIPPRQIRKRDWGKRESRDEHYHRKIVSWFGDSPLANFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ G   G  AG W GP  +      L R +  E     +   + +YV          
Sbjct: 216 RLIEYGNKSGKMAGDWYGPAVVAH----LLR-KAVEEAKDPELQGITVYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D     CS+    +     +++L+P+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYKSDVVEMQCSLKDSEKPGAKSVIILIPVRLGGERTNMEYLEFVKGILSLEYCIGI 322

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           VGG+P  S Y  G Q++S IY+DPH  Q  +++   +   +  ++H    + +    +DP
Sbjct: 323 VGGRPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKNFPLE--SFHCPSPKKMSFKKMDP 380

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNH--SDVLGETGGV 417
           S  IG YC +   F+      +K+ + S+    PLFT    H K  +   S V  E    
Sbjct: 381 SCTIGLYCPNVQGFERASEEITKILKASSKEKYPLFTFVNGHSKDYDFMMSPVQEEKALF 440

Query: 418 PEDDS--LGVMSMNDAV 432
            ED++  L   S  D V
Sbjct: 441 SEDENKKLKRFSTEDFV 457


>gi|355755452|gb|EHH59199.1| Cysteine protease ATG4D, partial [Macaca fascicularis]
          Length = 427

 Score =  154 bits (389), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 104/382 (27%), Positives = 169/382 (44%), Gaps = 66/382 (17%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 37  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 86

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 87  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 146

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 204
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 147 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 202

Query: 205 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 264
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 203 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 249

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G          
Sbjct: 250 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGXXXXXXXXXX 309

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
               QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 310 XXXCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 367

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 368 VLGSSSATERYPMFTLAEGHAQ 389


>gi|357528776|sp|Q5B7L0.2|ATG4_EMENI RecName: Full=Cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|259485537|tpe|CBF82642.1| TPA: Cysteine protease atg4 (EC 3.4.22.-)(Autophagy-related protein
           4) [Source:UniProtKB/Swiss-Prot;Acc:Q5B7L0] [Aspergillus
           nidulans FGSC A4]
          Length = 402

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 120/390 (30%), Positives = 178/390 (45%), Gaps = 68/390 (17%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGV-----CHKIAQDEALGDAAGNN--------GLA 95
           +RI + +  P         S IW LG      C +   DE+     G          G  
Sbjct: 11  KRIIQYIWDPEPKNDEEPGSPIWCLGTRYPPQCVEETADESRNPDHGQQQNTNTSAPGWP 70

Query: 96  E-FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 131
           E F  DF S+I ++YR  F PI                            TSD GWGCM+
Sbjct: 71  EAFLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMI 130

Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 190
           RS Q L+A ++    LGR WR+  +     E  ++L LF DS  +PFSIH+ ++ G  + 
Sbjct: 131 RSGQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFC 187

Query: 191 GLAAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
           G   G W GP A  R  + LA R  ++          + +Y+   + D  +     V  D
Sbjct: 188 GKHPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRD 238

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           +         KG     P L+L+ L LG++++   Y   L+     PQS+GI GG+P AS
Sbjct: 239 E---------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSAS 287

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
            Y V VQ     YLDPH+ +P +   +     E + +TYH+  +R +++  +DPS+ IGF
Sbjct: 288 HYFVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDPSMLIGF 347

Query: 368 YCRDKDDFDDFCARASKLAEESNGAPLFTV 397
             RD+DD++D+ AR   L     G P+ T+
Sbjct: 348 LIRDEDDWEDWKARIMSL----EGKPIITI 373


>gi|346975631|gb|EGY19083.1| peptidase family C54 protein [Verticillium dahliae VdLs.17]
          Length = 449

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 102/327 (31%), Positives = 149/327 (45%), Gaps = 52/327 (15%)

Query: 79  IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 120
           +A DEA+    G    + F  DF S+  ++YR  F+PI  S                   
Sbjct: 98  LAYDEAMNQDGG--WPSAFLDDFESKFWMTYRSDFEPIAKSTDPRAASVLSLSMRIKSQF 155

Query: 121 -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 175
                 +SD GWGCM+RS Q L+A A+    LGR WR+ +    +R+   +L  F D   
Sbjct: 156 MDQAGYSSDSGWGCMIRSGQSLLANAMAVLDLGRDWRRGVAAEKERQ---LLSKFADDPK 212

Query: 176 SPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
           +P+SIH  +Q G  A G   G W GP A  R  +AL                + +Y    
Sbjct: 213 APYSIHRFVQHGAVACGKYPGEWFGPSATARCIQALVNANEPH---------LRVYST-- 261

Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
                 G  P V  D   R   +       + P L+LV   LG++K+ P Y   L     
Sbjct: 262 ------GDGPDVYED---RFFDIAKPSGETFHPTLILVGTRLGIDKITPVYWDALIAALQ 312

Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVI 351
            PQS+GI GG+P +S Y +G Q     YLDPH  +  +   +D     +AD  + H+  +
Sbjct: 313 MPQSIGIAGGRPSSSHYFIGAQGSFLFYLDPHHTRTALPYYQDPTLYAQADVDSVHTRRL 372

Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           R +H+  +DPS+ IGF   D+DD+D++
Sbjct: 373 RRLHVREMDPSMLIGFVIHDEDDWDEW 399


>gi|358336800|dbj|GAA27956.2| autophagy-related protein 4 [Clonorchis sinensis]
          Length = 507

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 105/304 (34%), Positives = 148/304 (48%), Gaps = 52/304 (17%)

Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-----KPLQKPFDREYVEILHLFGD--SE 174
           TSD GWGCM+RS QML+AQ L+ H LGR WR      P++ P D  + +++  F D  S+
Sbjct: 183 TSDSGWGCMIRSGQMLLAQTLMIHLLGRDWRAFRGTSPIKTPEDHLHRQLIRWFHDCWSQ 242

Query: 175 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMC-----------RSWEALARCQ--------- 214
            SPFS+H L+QA    G   GSW GP  +C           R +E LAR           
Sbjct: 243 ESPFSLHRLVQAS---GQLPGSWFGPATLCSALVKVMSDASRRFEELARVHIYWVRDRVI 299

Query: 215 -RAET-----GLGCQSLPMAIYVVSGDEDGERGGA-------PVVCIDD---ASRHCSVF 258
            R E      G   +  P  +      E+ +   +       P   + D   +S   ++F
Sbjct: 300 YREEIMNLARGQPVRRKPGRLNFTDFSENFQHCCSQECSPPIPPTYLQDGIQSSPSTTLF 359

Query: 259 SKGQADWTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 317
                    ++LL+P+ LGL+K ++ RY+P +      P  +GI+GG+P  S YI+G Q 
Sbjct: 360 PSHA-----VILLLPIRLGLDKRIDARYVPMVCRLVRDPCFVGIIGGRPRHSIYILGCQN 414

Query: 318 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 377
              I+LDPH  QPV+    D  E +  T+H  V R I    +DPS A+GFYCR + D  D
Sbjct: 415 TQLIHLDPHFTQPVVRNVVDSEEFNVKTWHCLVPRVIEAAKLDPSCAVGFYCRSRGDLSD 474

Query: 378 FCAR 381
              R
Sbjct: 475 LLER 478


>gi|56118282|ref|NP_001007883.1| cysteine protease ATG4C [Xenopus (Silurana) tropicalis]
 gi|61211764|sp|Q68EP9.1|ATG4C_XENTR RecName: Full=Cysteine protease ATG4C; AltName:
           Full=Autophagy-related protein 4 homolog C
 gi|51258902|gb|AAH80152.1| apg4c protein [Xenopus (Silurana) tropicalis]
 gi|89269108|emb|CAJ81923.1| APG4 autophagy 4 homolog C (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
          Length = 450

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 111/403 (27%), Positives = 172/403 (42%), Gaps = 95/403 (23%)

Query: 67  TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 114
            S ++LLG C+    +++    D   N+G          + EF +DF SRI ++YR+ F 
Sbjct: 38  NSPVFLLGKCYHFKYEDSSVTSDGGSNSGSESKEDLSGNVDEFRKDFISRIWLTYREEFP 97

Query: 115 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 151
            I  S  T+D GWGC LR+ QML+AQ L+ H LGR W                       
Sbjct: 98  QIETSSWTTDCGWGCTLRTGQMLLAQGLIVHFLGRDWTWTEALDIFSSESEFWTANTARK 157

Query: 152 -------------------RKPLQ---KPFDRE--YVEILHLFGDSETSPFSIHNLLQAG 187
                              ++PL    K  + E  + +I+  F D   + F +H L++ G
Sbjct: 158 LTPSLETSFSENNECVSSNKQPLHNCDKKSNSEDFHQKIISWFADYPLAYFGLHQLVKLG 217

Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 247
           K  G  AG W GP  +      L R    E+                  D E  G  +  
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256

Query: 248 IDDASRHCSVFSKGQADW-------TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
             D    C+++S    D          +++LVP+ LG E+ N  Y   ++   +    +G
Sbjct: 257 AQD----CTIYSADVYDLQCNKGTEKAVVILVPVRLGGERTNMEYFEFVKGILSLEFCIG 312

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           I+GGKP  S Y VG Q++S IY+DPH  Q  +++   +   +  ++H    + +    +D
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSVKNFPLE--SFHCPSPKKMSFKKMD 370

Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTH 401
           PS  IGFYCR+  +F+      +K+ + S     PLFT    H
Sbjct: 371 PSCTIGFYCRNAREFEKAAEELTKVLKSSTKQNYPLFTFVNGH 413


>gi|195159572|ref|XP_002020652.1| GL15485 [Drosophila persimilis]
 gi|194117602|gb|EDW39645.1| GL15485 [Drosophila persimilis]
          Length = 409

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 108/345 (31%), Positives = 166/345 (48%), Gaps = 42/345 (12%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           +D GWGCMLR  QM++AQAL+   LGR W    +   D  Y++I++ F D   S +SIH 
Sbjct: 92  TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150

Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
           +   G++   A G W+GP  + +  + L         L      + ++V           
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMDS------- 195

Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
              V +DD    C    +G A W P+LL++PL LG+  +NP YIP L+       S G++
Sbjct: 196 --TVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 359
           GG+P  + Y +G  E+  +YLDPH  Q    +G+     +     TYH      +   ++
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQKTGVVGQKTSSGEQEHDETYHQKHAARLSFSAM 309

Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTH 401
           DPSLA+ F C+  D F        KL +E  G     LF ++QT 
Sbjct: 310 DPSLAVCFLCKTSDSFQQL---LDKLRQEVLGMCSPALFEISQTR 351


>gi|125986465|ref|XP_001356996.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
 gi|54645322|gb|EAL34062.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
          Length = 409

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 104/342 (30%), Positives = 163/342 (47%), Gaps = 36/342 (10%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           +D GWGCMLR  QM++AQAL+   LGR W    +   D  Y++I++ F D   S +SIH 
Sbjct: 92  TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150

Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
           +   G++   A G W+GP  + +  + L         L      + ++V           
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMDS------- 195

Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
              V +DD    C    +G A W P+LL++PL LG+  +NP YIP L+       S G++
Sbjct: 196 --TVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 359
           GG+P  + Y +G  E+  +YLDPH  Q    +G+     +     TYH      +   ++
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQRTGVVGQKTSSGEQEHDETYHQKHAARLSFSAM 309

Query: 360 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           DPSLA+ F C+  D F     +  +         LF ++QT 
Sbjct: 310 DPSLAVCFLCKTSDSFQQLLEKLRQEVLGMCSPALFEISQTR 351


>gi|341903727|gb|EGT59662.1| CBN-ATG-4.1 protein [Caenorhabditis brenneri]
          Length = 433

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 97/329 (29%), Positives = 150/329 (45%), Gaps = 59/329 (17%)

Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 181
           TSD GWGCMLR +QML+ + LL   +GR +   ++      Y +IL +F D + + +SIH
Sbjct: 49  TSDQGWGCMLRCAQMLLGEVLLRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIH 107

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQ-SLPMAIYV 231
            + Q G   G     W GP    +          W  +A     +  L  + +L MA   
Sbjct: 108 QIAQMGVTEGKEISKWFGPNTAAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTY 167

Query: 232 VSGD------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
            S D      E+G+             +H +  +  + +W P+LL++PL LGL  +N  Y
Sbjct: 168 PSEDAVKLIMENGQ-----------VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCY 216

Query: 286 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV-------------- 331
           +P ++  F  PQ +GI+GGKP  + Y VG+      YLDPH  +P               
Sbjct: 217 LPAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTES 276

Query: 332 ----INIGK-DDLE------------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
                N  + +DLE             D STYH  +++ +  +SIDPSLA+  +C  ++D
Sbjct: 277 EQHDTNFSELEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESRED 336

Query: 375 FDDFCARASKLAEESNGAPLFTVTQTHKK 403
           FD+ C    K    ++  P+F   +   K
Sbjct: 337 FDNLCQELQKTTLPASKPPMFEFLEKRPK 365


>gi|296415785|ref|XP_002837566.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295633439|emb|CAZ81757.1| unnamed protein product [Tuber melanosporum]
          Length = 409

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 101/313 (32%), Positives = 150/313 (47%), Gaps = 49/313 (15%)

Query: 97  FNQDFSSRILISYRKGFDPI---------------------GDSKITSDVGWGCMLRSSQ 135
           F +DF S + ++YR  F PI                          TSD GWGCM+RS Q
Sbjct: 86  FLEDFESTLWMTYRSDFKPIPRVADYNDKLTFLTSIRSHLDKAEGFTSDSGWGCMIRSGQ 145

Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAA 194
            ++A AL   RLGR WR+ + KP   E   +L LF D   +PFSIH  ++ G+   G   
Sbjct: 146 AVIANALAHLRLGRGWRRGM-KP--EEEKRLLALFADDPRAPFSIHKFVRHGEVECGKNP 202

Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 254
           G W GP        A A C +A T    +   + +Y  + ++  E     V  ++     
Sbjct: 203 GEWFGP-------SAAAMCIQALTH-AYEPAGLRVYQTNSNDLYEEDFRKVAVVN----- 249

Query: 255 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
             VF        P L+L  + LG+E++   Y   L      PQ++GI GG+P +S Y + 
Sbjct: 250 -GVFK-------PTLVLAGIRLGIERITNIYYEPLAACLRMPQTVGIAGGRPSSSHYFIA 301

Query: 315 VQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
           VQ E+  YLDPH  +P++      +D  E +  T H+  IR +H+  +DPS+ I F  RD
Sbjct: 302 VQGENFFYLDPHTCRPILPFKENPQDYTEEEVDTCHTRRIRRLHIREMDPSMLIAFLIRD 361

Query: 372 KDDFDDFCARASK 384
           + D++D+  R S+
Sbjct: 362 EADWEDWQRRISE 374


>gi|308491308|ref|XP_003107845.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
 gi|308249792|gb|EFO93744.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
          Length = 518

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 155/318 (48%), Gaps = 49/318 (15%)

Query: 87  DAAG-NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
           DA G ++G  +F  D+ SR+ I+YR  F P+ ++  T+D GWGCM+R++QM+VAQA++ +
Sbjct: 159 DANGVSSGFEDFCSDYYSRLWITYRTDFAPLLNTDTTTDCGWGCMIRTTQMMVAQAIMLN 218

Query: 146 RLGRPWRKPLQKP-----------FDREYVE---ILHLFGDSETSPFSIHNLLQ--AGKA 189
           R GR WR   +K            FDRE ++   IL LF D  +SP  IH +++  A + 
Sbjct: 219 RFGREWRFVRRKKSYVTINGEETDFDREKIKEWMILKLFEDKPSSPLGIHRMVEISAKEK 278

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
              A GSW  P       EA+   ++A        L  +I  ++GD       A  + I 
Sbjct: 279 GKKAVGSWYSPS------EAVFIMKKA--------LTESISPLTGD------TAMYLSI- 317

Query: 250 DASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
           D   H         +W   L+LV +V LG  ++NP Y+P L   F+    LG+ GG+P  
Sbjct: 318 DGRVHIRDIEVETKNWMKTLILVIVVRLGAAELNPIYVPHLMRLFSMESCLGVTGGRPDH 377

Query: 309 STYIVGVQEESAIYLDPHDVQPVINI----------GKDDLEADTSTYHSDVIRHIHLDS 358
           S + VG   +  IYLDPH     I I           K   +    +YH  ++  +H   
Sbjct: 378 SCWFVGFYGDQIIYLDPHVAHEYIPIDMNFNVNMTDNKKSKKCPERSYHCRLLSKMHFLD 437

Query: 359 IDPSLAIGFYCRDKDDFD 376
           +DPS A+ F    ++ FD
Sbjct: 438 MDPSCALCFRFESREQFD 455


>gi|17544636|ref|NP_502208.1| Protein ATG-4.2 [Caenorhabditis elegans]
 gi|5824904|emb|CAB54515.1| Protein ATG-4.2 [Caenorhabditis elegans]
          Length = 521

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 106/340 (31%), Positives = 159/340 (46%), Gaps = 56/340 (16%)

Query: 68  SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 127
           +D+  LG  +  + DE+       +G   F  D+ SR+ I+YR  F  + D+  T+D GW
Sbjct: 146 NDVVFLGRRYSTSVDES----GLRSGFENFCSDYYSRLWITYRTDFPALLDTDTTTDCGW 201

Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQK-----------PFDREYVE---ILHLFGDS 173
           GCM+R++QM+VAQA++ +R GR WR   +K            FDRE ++   IL LF D 
Sbjct: 202 GCMIRTTQMMVAQAIMVNRFGRDWRFTRRKRSHVAAHGDEDDFDREKIQEWMILKLFEDK 261

Query: 174 ETSPFSIHNLL---QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 230
            T+P  IH ++     GK    A GSW  P       EA+   ++A   L   S P+   
Sbjct: 262 PTAPLGIHKMVGIAAMGKGKK-AVGSWYSPS------EAVFIMKKA---LTESSSPLT-- 309

Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTL 289
                     G   ++   D   H         +W   L+LV +V LG  ++NP Y+P L
Sbjct: 310 ----------GNTAMLLSIDGRVHIRDIEVETKNWMKKLILVIVVRLGAAELNPIYVPHL 359

Query: 290 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH--------DVQPVINI----GKD 337
              F     LGI GG+P  S++ VG   +  IYLDPH        D+ P  N+     K 
Sbjct: 360 MRLFAMESCLGITGGRPDHSSWFVGYYGDQIIYLDPHVAHEYIPIDINPNTNVVDSDSKK 419

Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 377
             +    +YH  ++  +H   +DPS A+ F    ++ FD+
Sbjct: 420 AKKCPEKSYHCRLLSKMHFFDMDPSCALCFQFESREQFDN 459


>gi|389637385|ref|XP_003716330.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
 gi|148887340|sp|Q523C3.2|ATG4_MAGO7 RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|351642149|gb|EHA50011.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
          Length = 491

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF SRI ++YR GF+PI  S                         T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
            Q L+A +LL  RLGR WR+  Q P   E  ++L LF D   +P+SIHN +  G A  G 
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R   ALA                 +Y          G  P V  D   
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
               V       + P L+L+   LG++K+N  Y  +L  T   PQS+GI GG+P +S Y 
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367

Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
           VG Q           YLDPH  +P +   +D      +D  + H+  +R +H+  +DPS+
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 427

Query: 364 AIGFYCRDKDDF 375
            IGF   D++++
Sbjct: 428 LIGFLILDEENW 439


>gi|393247625|gb|EJD55132.1| hypothetical protein AURDEDRAFT_78065 [Auricularia delicata
           TFB-10046 SS5]
          Length = 989

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 96/275 (34%), Positives = 131/275 (47%), Gaps = 42/275 (15%)

Query: 97  FNQDFSSRILISYRKGFDPI-----------------------------GDSKITSDVGW 127
           F  DF+SR+ ++YR  F PI                             G+   TSD GW
Sbjct: 314 FYADFTSRVWLTYRSQFSPIHDCPLSACKGKDLESLDANPPKRTFWPGSGEKTWTSDAGW 373

Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPL---QKPFDREYVEILHLFGDSET--SPFSIHN 182
           GCMLR+ Q L+A  L+   LGR WR+P      P    YV+IL  F D+ +  +PFS+H 
Sbjct: 374 GCMLRTGQSLLANTLIHLHLGRDWRRPAINSASPEFATYVKILTWFFDAPSVHAPFSVHR 433

Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIYVVSGDEDGERG 241
           +  +GK +G   G W GP     +   L     RA+ G+      +A+  V  + D    
Sbjct: 434 MAMSGKDFGKDVGQWFGPSTAAGAIRTLVHDFPRAQLGVA-----IAVDGVLYETDIYSA 488

Query: 242 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
               +   D +R  S F +    W    +L+LV   LGL+ VNP Y   L+  FTFPQSL
Sbjct: 489 SHYPMSSADGARRASGFKRHPGRWGNRAVLVLVATRLGLDGVNPIYYENLKTIFTFPQSL 548

Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
           GI GG+P +S Y VG Q  S  YLDPH  +P + +
Sbjct: 549 GIAGGRPSSSYYFVGSQGNSLFYLDPHHTRPAVPL 583



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 41/72 (56%), Gaps = 4/72 (5%)

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA-PLFTVTQT 400
           D  T+H D +R + L  +DPS+ +GF CRD+ D+ DF  R   +AE S G   LF++ + 
Sbjct: 699 DLKTFHCDRVRKMPLSGLDPSMLLGFLCRDEQDWKDFRRR---MAEISKGRDTLFSIQEE 755

Query: 401 HKKPVNHSDVLG 412
                + SD +G
Sbjct: 756 PPSWPSDSDDMG 767


>gi|440478911|gb|ELQ59709.1| cysteine protease atg4 [Magnaporthe oryzae P131]
          Length = 572

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF SRI ++YR GF+PI  S                         T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
            Q L+A +LL  RLGR WR+  Q P   E  ++L LF D   +P+SIHN +  G A  G 
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R   ALA                 +Y          G  P V  D   
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
               V       + P L+L+   LG++K+N  Y  +L  T   PQS+GI GG+P +S Y 
Sbjct: 390 -FMEVAKSDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448

Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
           VG Q           YLDPH  +P +   +D      +D  + H+  +R +H+  +DPS+
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 508

Query: 364 AIGFYCRDKDDF 375
            IGF   D++++
Sbjct: 509 LIGFLILDEENW 520


>gi|440467300|gb|ELQ36530.1| cysteine protease atg4 [Magnaporthe oryzae Y34]
          Length = 572

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF SRI ++YR GF+PI  S                         T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
            Q L+A +LL  RLGR WR+  Q P   E  ++L LF D   +P+SIHN +  G A  G 
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R   ALA                 +Y          G  P V  D   
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
               V       + P L+L+   LG++K+N  Y  +L  T   PQS+GI GG+P +S Y 
Sbjct: 390 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448

Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
           VG Q           YLDPH  +P +   +D      +D  + H+  +R +H+  +DPS+
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 508

Query: 364 AIGFYCRDKDDF 375
            IGF   D++++
Sbjct: 509 LIGFLILDEENW 520


>gi|358369016|dbj|GAA85631.1| autophagy cysteine endopeptidase Atg4 [Aspergillus kawachii IFO
           4308]
          Length = 378

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 113/375 (30%), Positives = 172/375 (45%), Gaps = 54/375 (14%)

Query: 49  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQD-----------EALGDAAGNNGLAE- 96
           +RI + +  P        TS IW LG+ +   +D            A  +  G     + 
Sbjct: 11  KRIVQYLWDPEPRNDEDPTSSIWCLGIEYHPEKDVSPRGETPDKNSARDNTTGTTNYRKP 70

Query: 97  --------FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
                   F  DF SRI ++YR  F PI   ++  D     M   S  L+A AL    LG
Sbjct: 71  SEHAWPESFLLDFESRIWMTYRSNFPPI--PRVEGDDKSASMTLGS--LLANALSTLVLG 126

Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSW 207
           R WR+  +  F+ E  ++L LF D+ T+PFS+H  ++ G ++ G   G W GP A  +  
Sbjct: 127 RDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKFPGEWFGPSATAKCI 183

Query: 208 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 267
           EAL+          C S  + +YV +   +  +            R  +V       + P
Sbjct: 184 EALSS--------QCGSPTLKVYVSNDTSEVYQ-----------DRFMNVARNSSGVFQP 224

Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
            L+L+   LG++ + P Y   L+ T   PQS+GI GG+P AS Y VG Q     YLDPH 
Sbjct: 225 TLILLGTRLGIDHITPVYWDGLKATLQLPQSVGIAGGRPSASHYFVGAQGSHLFYLDPHY 284

Query: 328 VQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
            +P +     G+   + +  TYH+  +R IH+  +DPS+ IGF  RD++D+DD+  R   
Sbjct: 285 TRPALPDRQGGELYSKEEVDTYHTRRLRRIHVRDMDPSMLIGFLIRDQEDWDDWLNRIQA 344

Query: 385 LAEESNGAPLFTVTQ 399
           +     G P+  V +
Sbjct: 345 V----KGRPIIHVLK 355


>gi|328351041|emb|CCA37441.1| autophagy-related protein 4 [Komagataella pastoris CBS 7435]
          Length = 758

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 111/357 (31%), Positives = 150/357 (42%), Gaps = 98/357 (27%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 135
           F  D  S+I ++YR GF PI   K                      TSD GWGCM+R+SQ
Sbjct: 65  FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124

Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 194
            L+A ALLF  LGR W    + P + E+  I+  F D    PFSIHN +Q G K      
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184

Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 251
           G W GP A  R+ + L           C+  P   + +Y  S             C D  
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221

Query: 252 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
                  + G +D +TPIL+L+ + LG+EKVNP Y  +LR   +  QS+GI GG+P +S 
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281

Query: 311 YIVGVQEESAIYLDPHDVQPVINIG------------KDDLEA----------------- 341
           Y  G Q +   YLDPH  Q  +  G            K D  A                 
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTKKTDENAAGQYPVSNTDSNNETNH 341

Query: 342 --------------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
                               D  + H+  +  +HL  +DPS+ IGF    +DDF+D+
Sbjct: 342 DDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSHMDPSMLIGFLITSEDDFNDW 398


>gi|410967384|ref|XP_003990200.1| PREDICTED: cysteine protease ATG4C [Felis catus]
          Length = 459

 Score =  150 bits (380), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 117/425 (27%), Positives = 171/425 (40%), Gaps = 94/425 (22%)

Query: 65  SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 154
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 155 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 181
                                 QK   R Y +            I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPAVSQKETIRRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQ-------- 262

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   + C+  +    D   +++L+P+ LG E+ N  Y+  ++      ++L I
Sbjct: 263 DCTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGIL---RALNI 319

Query: 302 VG----GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
           V      KP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +   
Sbjct: 320 VWVLLVAKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFR 377

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHS 408
            +DPS  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  
Sbjct: 378 KMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEE 437

Query: 409 DVLGE 413
           D+  E
Sbjct: 438 DLFSE 442


>gi|148691993|gb|EDL23940.1| mCG3720 [Mus musculus]
          Length = 318

 Score =  150 bits (380), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 87/266 (32%), Positives = 127/266 (47%), Gaps = 49/266 (18%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 77  VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 125

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 126 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 185

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 186 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 228

Query: 250 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 288
           D  + C V   G AD                     W P+LL+VPL LG+ ++NP Y+  
Sbjct: 229 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 288

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVG 314
            +  F  PQSLG +GGKP  + Y +G
Sbjct: 289 FKECFKMPQSLGALGGKPNNAYYFIG 314


>gi|254567087|ref|XP_002490654.1| Conserved cysteine protease required for autophagy [Komagataella
           pastoris GS115]
 gi|238030450|emb|CAY68374.1| Conserved cysteine protease required for autophagy [Komagataella
           pastoris GS115]
          Length = 531

 Score =  150 bits (380), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 111/357 (31%), Positives = 150/357 (42%), Gaps = 98/357 (27%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 135
           F  D  S+I ++YR GF PI   K                      TSD GWGCM+R+SQ
Sbjct: 65  FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124

Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 194
            L+A ALLF  LGR W    + P + E+  I+  F D    PFSIHN +Q G K      
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184

Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 251
           G W GP A  R+ + L           C+  P   + +Y  S             C D  
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221

Query: 252 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
                  + G +D +TPIL+L+ + LG+EKVNP Y  +LR   +  QS+GI GG+P +S 
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281

Query: 311 YIVGVQEESAIYLDPHDVQPVINIG------------KDDLEA----------------- 341
           Y  G Q +   YLDPH  Q  +  G            K D  A                 
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTKKTDENAAGQYPVSNTDSNNETNH 341

Query: 342 --------------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
                               D  + H+  +  +HL  +DPS+ IGF    +DDF+D+
Sbjct: 342 DDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSHMDPSMLIGFLITSEDDFNDW 398


>gi|210063823|gb|ACJ06587.1| ATG4 protein [Magnaporthe oryzae]
          Length = 491

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 144/312 (46%), Gaps = 56/312 (17%)

Query: 97  FNQDFSSRILISYRKGF-------DPIGDSKI----------------TSDVGWGCMLRS 133
           F  DF SRI ++YR GF       DP   S++                T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFESIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
            Q L+A +LL  RLGR WR+  Q P   E  ++L LF D   +P+SIHN +  G A  G 
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R   ALA                 +Y          G  P V  D   
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
               V       + P L+L+   LG++K+N  Y  +L  T   PQS+GI GG+P +S Y 
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367

Query: 313 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 363
           VG Q           YLDPH  +P +   +D      +D  + H+  +R +H+  +DPS+
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 427

Query: 364 AIGFYCRDKDDF 375
            IGF   D++++
Sbjct: 428 LIGFLILDEENW 439


>gi|67526025|ref|XP_661074.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
 gi|40743824|gb|EAA63010.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
          Length = 379

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 106/328 (32%), Positives = 159/328 (48%), Gaps = 54/328 (16%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 133
           F  DF S+I ++YR  F PI                            TSD GWGCM+RS
Sbjct: 50  FLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMIRS 109

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 192
            Q L+A ++    LGR WR+  +     E  ++L LF DS  +PFSIH+ ++ G  + G 
Sbjct: 110 GQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFCGK 166

Query: 193 AAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
             G W GP A  R  + LA R  ++          + +Y+   + D  +     V  D+ 
Sbjct: 167 HPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRDE- 216

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
                   KG     P L+L+ L LG++++   Y   L+     PQS+GI GG+P AS Y
Sbjct: 217 --------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSASHY 266

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
            V VQ     YLDPH+ +P +   +     E + +TYH+  +R +++  +DPS+ IGF  
Sbjct: 267 FVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDPSMLIGFLI 326

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTV 397
           RD+DD++D+ AR   L     G P+ T+
Sbjct: 327 RDEDDWEDWKARIMSL----EGKPIITI 350


>gi|358381369|gb|EHK19044.1| hypothetical protein TRIVIDRAFT_181799 [Trichoderma virens Gv29-8]
          Length = 451

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 98/309 (31%), Positives = 146/309 (47%), Gaps = 50/309 (16%)

Query: 97  FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 133
           F +D +++  ++YR GFDPI  S                         +SD GWGCM+RS
Sbjct: 117 FLEDMAAKFWMTYRSGFDPIAKSVDPRATSALSFAVRIKSTLSDPTGFSSDSGWGCMIRS 176

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
            Q L+A  +   +LGR WR+       +E  +++ +F D   +P+SIHN ++ G  A G 
Sbjct: 177 GQSLLATTIGILQLGRDWRR---GKCQQEERQLISMFADDPRAPYSIHNFVRHGATACGK 233

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP        A A+C +A T      LP+ +Y  +  +D        +   D  
Sbjct: 234 FPGEWFGP-------SATAQCIQALTS--ASGLPLKVYSPNDGQDVYEDSFMKIAKPD-- 282

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
                   GQ D+ P L+L+   LG++K+ P Y   L      PQS+GI GG+P +S Y 
Sbjct: 283 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLLAALQMPQSVGIAGGRPSSSHYF 333

Query: 313 VGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           VG Q     YLDPH  +  I    D     E D  + H+  +R +HL  +DPS+ IGF  
Sbjct: 334 VGSQGSYLFYLDPHHTRKAIPYHADVTKYTEEDIESCHTSRLRRLHLKEMDPSMLIGFLI 393

Query: 370 RDKDDFDDF 378
           R + D+ ++
Sbjct: 394 RTESDWSEW 402


>gi|358390472|gb|EHK39877.1| hypothetical protein TRIATDRAFT_208244 [Trichoderma atroviride IMI
           206040]
          Length = 452

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 102/316 (32%), Positives = 148/316 (46%), Gaps = 50/316 (15%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVG 126
           G    A F +D SS+  ++YR GF+PI  S                         +SD G
Sbjct: 113 GTGWPAGFVEDMSSKFWMTYRSGFEPIPKSVDPKAASALSFSMRIKSTLSDSAGFSSDSG 172

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+RS Q L+A  +   RLGR WR+   +  +R    ++ +F D   +P+SIHN ++ 
Sbjct: 173 WGCMIRSGQSLLATTIGILRLGRDWRRDQSQEEERH---LISMFADDPRAPYSIHNFVRH 229

Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G  A G   G W GP        A A+C +A T      L + IY  +  +D        
Sbjct: 230 GATACGKYPGEWFGP-------SATAQCIQALTS--SSGLSLNIYSPNDGQD-------- 272

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
             + + S      S GQ  + P L+L+   LG++K+ P Y   L      PQS+GI GG+
Sbjct: 273 --VYEDSFMKIAKSDGQT-FNPTLILIRTRLGIDKITPIYWDALIAALHMPQSVGIAGGR 329

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPS 362
           P +S Y VG Q     YLDPH  +  I    D     E D  + H+  +R IH+  +DPS
Sbjct: 330 PASSHYFVGSQGSYLFYLDPHHTRKAIPYHDDVTKYTEEDIESCHTSRLRRIHIKEMDPS 389

Query: 363 LAIGFYCRDKDDFDDF 378
           + IGF  R + D+ ++
Sbjct: 390 MLIGFLIRTESDWTEW 405


>gi|452004375|gb|EMD96831.1| hypothetical protein COCHEDRAFT_1123524 [Cochliobolus
           heterostrophus C5]
          Length = 471

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 160/358 (44%), Gaps = 91/358 (25%)

Query: 92  NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
           N  + F  DF SRI ++YR GF  I  S+                      TSD G+GCM
Sbjct: 91  NWPSAFLDDFESRIWMTYRSGFMAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150

Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
           +RS Q ++A AL   RLGR WR    KP  +E+ EIL LF D   +PFSIH  ++ G A 
Sbjct: 151 IRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAV 209

Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G   G W GP A  R  + LA   R E GL        +YV SGD      GA V   +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYV-SGD------GADVY--E 252

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  +  ++   GQ  W P L+LV   LG++K+ P Y   L+ +    QS+GI GG+P AS
Sbjct: 253 DKLKEVAIDDDGQ--WQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310

Query: 310 TYIVGVQEESAIYLDPHDVQPVI--------------NIGKDDLE--------------- 340
            Y V  Q  +  YLDPH  +P++              N  ++ L                
Sbjct: 311 HYFVATQGNNFFYLDPHSTRPLLPYRPPPSSTENESQNQSQNQLAVPSSLDASATSNSSS 370

Query: 341 ------------ADTSTY--------HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
                       +D +TY        H+  IR + +  +DPS+ I F     DD++++
Sbjct: 371 TTIVPSATPTDGSDRTTYSEEELATCHTRRIRRLQIREMDPSMLIAFLITSADDYENW 428


>gi|343428793|emb|CBQ72338.1| related to ATG4-essential for autophagy [Sporisorium reilianum SRZ2]
          Length = 1505

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 158/348 (45%), Gaps = 80/348 (22%)

Query: 112  GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--------- 162
            GF   G   +T+D GWGCMLR+ Q L+A AL+   LGR W +  + P  R+         
Sbjct: 779  GFSRAG---LTTDSGWGCMLRTGQSLLANALINVHLGRSWMR--EAPPARQLEFLQELAN 833

Query: 163  ------------------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGS 196
                                    Y++IL  F D  S   PF +H + + GK  G   G 
Sbjct: 834  LSLDTSAEKQSLLEWRQKRARHSTYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGE 893

Query: 197  WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE-DGERGGAPVVCIDDASRHC 255
            W GP     + + L   +  + GL  +     ++ +  DE     G +  +    AS   
Sbjct: 894  WFGPSTAAGAIKQLV-SEFPDAGLAVELAHDGVFYL--DEVRAAAGASRQLGKGRASATG 950

Query: 256  SVFSKGQADWT---PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
            +   KG    T   P+L+L+ + LGL+ VNP Y  +++ TF+FP S+GI GG+P +S Y 
Sbjct: 951  TNGRKGDTALTWHKPVLILIGIRLGLDSVNPIYYESVKATFSFPHSVGIAGGRPSSSYYF 1010

Query: 313  VGVQEESAIYLDPHDVQPVINI------------------------GKDD---------L 339
            +G Q  S  YLDPH+V+P + +                          DD          
Sbjct: 1011 MGHQGNSLFYLDPHNVRPAVALRFPPSTFPAAVPRQLDIAHRFAFEEHDDEDEWWSHAYT 1070

Query: 340  EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
            EA TST+H D +R + + S+DPS+ +GF  +D++D  D CAR   L++
Sbjct: 1071 EAQTSTFHCDKVRRMPIKSLDPSMLLGFLVKDEEDLADLCARIKALSK 1118


>gi|330935035|ref|XP_003304808.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
 gi|311318464|gb|EFQ87127.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
          Length = 470

 Score =  148 bits (374), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 109/357 (30%), Positives = 157/357 (43%), Gaps = 90/357 (25%)

Query: 92  NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
           N  + F  DF SRI ++YR GF PI  S+                      TSD G+GCM
Sbjct: 91  NWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150

Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
           +RS Q ++A AL   RLGR WR   ++P  +E+ +I+ +F D   +PFSIH  ++ G A 
Sbjct: 151 IRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDIVAMFADDPRAPFSIHRFVEHGAAV 209

Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G   G W GP A  R  + L   +  E GL        +YV SGD      GA V   +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLVH-KNKEVGL-------KVYV-SGD------GADVY--E 252

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  +  +V   G+  W P L+LV   LG++K+ P Y   L+ +    QS+GI GG+P AS
Sbjct: 253 DKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD--------------------------- 342
            Y V  Q  +  YLDPH  +P++         +                           
Sbjct: 311 HYFVATQANNFFYLDPHSTRPLLPYRPSSWSTEEQASAPSTLEASATSATSTSSSTTIVP 370

Query: 343 -------------TSTY--------HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
                        TS Y        H+  IR + +  +DPS+ + F    +DD++D+
Sbjct: 371 SANEVTAPSDASRTSGYSPEELATCHTRRIRRLQIREMDPSMLLAFLITSEDDYEDW 427


>gi|189194545|ref|XP_001933611.1| peptidase family C54 protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187979175|gb|EDU45801.1| peptidase family C54 protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 470

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 112/357 (31%), Positives = 159/357 (44%), Gaps = 90/357 (25%)

Query: 92  NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
           N  + F  DF SRI ++YR GF PI  S+                      TSD G+GCM
Sbjct: 91  NWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150

Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
           +RS Q ++A AL   RLGR WR   ++P  +E+ +++ +F D   +PFSIH  ++ G A 
Sbjct: 151 IRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDVVAMFADDPRAPFSIHRFVEHGAAV 209

Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G   G W GP A  R  + L    R E GL        +YV SGD      GA V   +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLVHKNR-EAGL-------KVYV-SGD------GADVY--E 252

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  +  +V   G+  W P L+LV   LG++K+ P Y   L+ +    QS+GI GG+P AS
Sbjct: 253 DKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310

Query: 310 TYIVGVQEESAIYLDPHDVQPVINI------------GKDDLE----------------- 340
            Y V  Q  +  YLDPH  +P++                  LE                 
Sbjct: 311 HYFVATQANNFFYLDPHSTRPLLPYRPSSSSTEEQVAAPSTLEASATSVTSTSSSTTIVP 370

Query: 341 -ADTSTYHSDV------------------IRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
            A+  T  SDV                  IR + +  +DPS+ + F    +DD++D+
Sbjct: 371 SANEVTAPSDVSKPSGYSLEELATCHTRRIRRLQIREMDPSMLLAFLITSEDDYEDW 427


>gi|443893810|dbj|GAC71266.1| cysteine protease [Pseudozyma antarctica T-34]
          Length = 1509

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 115/392 (29%), Positives = 172/392 (43%), Gaps = 88/392 (22%)

Query: 121  ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----QKPFDRE-------------- 162
            +T+D GWGCMLR+ Q L+A AL+   LGR W++      Q  F  E              
Sbjct: 776  LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRETAPKSQIEFFEELANASLDASAENQS 835

Query: 163  -------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 207
                         Y+ IL  F D  S   PF +H + + GK  G   G W GP     + 
Sbjct: 836  LASWRERRARHATYIRILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAGAI 895

Query: 208  EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----A 263
            + L      E G+  +     ++ +    D  R  A        SR   + S  +    A
Sbjct: 896  KQLV-FDFPEAGIAVELAHDGVFYL----DEVRAAASAST--GKSRASGMLSGNRRAETA 948

Query: 264  DWT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
             W  P+L+L+ + LGLE VNP Y  +++ TF+FPQS+GI GG+P +S Y +G Q  S  Y
Sbjct: 949  VWRRPVLILIGIRLGLETVNPIYYESVKATFSFPQSVGIAGGRPSSSYYFMGHQGNSLFY 1008

Query: 323  LDPHDVQPVINI------------------------GKDD---------LEADTSTYHSD 349
            LDPH+V+P + +                         +DD          EA TST+H +
Sbjct: 1009 LDPHNVRPAVPLRYPPTTFPAAAPSRFDVSHRYALEDRDDEDEWWSHAYTEAQTSTFHCE 1068

Query: 350  VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSD 409
             +R + + S+DPS+ +GF  +D++   D CAR   L +      +F+  ++  K V+  D
Sbjct: 1069 KVRRMPIKSLDPSMLLGFLVKDEEALVDLCARIKALPKT-----IFSFAESAPKWVDDDD 1123

Query: 410  V--LGETGGVPEDDSLGVMSMNDAVGNAHEDD 439
                 E+   P  D  G    +D VG   + D
Sbjct: 1124 FDPSMESFSEPSADEAG---SDDDVGKGEDQD 1152


>gi|71022117|ref|XP_761289.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
 gi|46097783|gb|EAK83016.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
          Length = 1541

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 104/339 (30%), Positives = 154/339 (45%), Gaps = 81/339 (23%)

Query: 112  GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK--PLQKPFD--------- 160
            GF   G   +T+D GWGCMLR+ Q L+A ALL   LGR W +  P  +  D         
Sbjct: 814  GFSRAG---LTTDSGWGCMLRTGQSLLANALLNVHLGRSWLREAPPMRQMDFLEQLASLS 870

Query: 161  -------------RE-------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWV 198
                         RE       Y++IL  F D  S   PF +H + + GK  G   G W 
Sbjct: 871  LDSSVEMQSLQEWREKRARHAAYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWF 930

Query: 199  GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
            GP     + + L   +  + G+  +     ++ +  DE     GA         R     
Sbjct: 931  GPSTAAGAIKQLV-TEFPDAGIAVELAHDGVFYL--DEVRLAAGARSALQSGKGR----- 982

Query: 259  SKGQADWT---PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 315
             +G A  T   P+++L+ + LGL+ VNP Y  +++ TF+FP S+GI GG+P +S Y +G 
Sbjct: 983  -QGDAAVTWRRPVVILIGIRLGLDSVNPIYYESVKETFSFPHSVGIAGGRPSSSYYFMGH 1041

Query: 316  QEESAIYLDPHDVQPVINI------------------------GKDD---------LEAD 342
            Q  S  YLDPH+V+P + +                         KDD          EA 
Sbjct: 1042 QGNSLFYLDPHNVRPAVALRYPPSTFPTAVPHQLDVAHRFALEDKDDELEWWSHAYTEAQ 1101

Query: 343  TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
            TST+H + +R + + S+DPS+ +GF  +D++D  D C R
Sbjct: 1102 TSTFHCEKVRRMPIKSLDPSMLLGFLVKDEEDLMDLCTR 1140


>gi|451855330|gb|EMD68622.1| hypothetical protein COCSADRAFT_79257 [Cochliobolus sativus ND90Pr]
          Length = 473

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/263 (36%), Positives = 122/263 (46%), Gaps = 42/263 (15%)

Query: 88  AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVG 126
           A   N  + F  DF SRI ++YR GF  I  S+                      TSD G
Sbjct: 87  AQYGNWPSAFLDDFESRIWMTYRSGFTAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           +GCM+RS Q ++A AL   RLGR WR    KP  +E+ EIL LF D   +PFSIH  ++ 
Sbjct: 147 FGCMIRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEH 205

Query: 187 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G A  G   G W GP A  R  + LA   R E GL        +YV     D        
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYVSGDGADVYEDKLKE 257

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
           V IDD             +W P L+LV   LG++K+ P Y   L+ +    QS+GI GG+
Sbjct: 258 VAIDD-----------DGEWQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGR 306

Query: 306 PGASTYIVGVQEESAIYLDPHDV 328
           P AS Y V  Q  +  YLDPH  
Sbjct: 307 PSASHYFVATQGNNFFYLDPHST 329


>gi|256071261|ref|XP_002571959.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
 gi|353229490|emb|CCD75661.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
          Length = 376

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 104/317 (32%), Positives = 155/317 (48%), Gaps = 42/317 (13%)

Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFS 179
           TSD GWGCM R  QML+AQAL+ H LGR WR    +      ++I+  F DS +  SP S
Sbjct: 67  TSDCGWGCMFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLS 126

Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSG 234
           +H L+Q         G W GP ++C    A+ R     + L  +   + +Y     V+  
Sbjct: 127 LHRLVQMSDR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYR 180

Query: 235 DE--DGERG------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLG 277
           +E  D  RG        P +   D   H +++ + Q+D          T ILLL+PL+ G
Sbjct: 181 EEIIDLARGLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFG 236

Query: 278 L-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
              ++NPRYI  +   F+ P  +G++GG+   S+Y VG Q  S IYLDPH  QP  N+  
Sbjct: 237 KGNRINPRYIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNS 296

Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 396
                D  ++H  + + +   +++PS A+GFYCR + +  D   R   L   S+      
Sbjct: 297 PKFSVD--SWHCPIPKTMSAANLNPSCAVGFYCRTRGELSDLIDRLPILMSVSDNLQ--- 351

Query: 397 VTQTHKKPVNHS-DVLG 412
              T  +PV  + +VLG
Sbjct: 352 -ASTRSRPVAFTVEVLG 367


>gi|402219068|gb|EJT99143.1| hypothetical protein DACRYDRAFT_70366 [Dacryopinax sp. DJM-731 SS1]
          Length = 1093

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 153/319 (47%), Gaps = 42/319 (13%)

Query: 117 GDSKITSDVGWGCMLRSSQMLVAQALL-------------FHRLGRPWRKPLQKPFDRE- 162
           G   +TSD GWGCMLR+ QML+A +L+              +    P   P +   DR+ 
Sbjct: 431 GRGDLTSDAGWGCMLRTGQMLLANSLVALHVPPLPPNPVYINNFPAPSLPPSET--DRQR 488

Query: 163 ---YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 217
              YV+IL  F D  +   PFS+H L  AG   G   G W GP     S + L     A 
Sbjct: 489 FEAYVKILVWFLDDPSIWCPFSVHRLALAGADMGREVGQWFGPSIAAGSIKKLVSAFPA- 547

Query: 218 TGLGCQSLP------MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPIL 269
            GLG    P       A++  S         + +    D        ++ + +W    +L
Sbjct: 548 CGLGVVVPPDQIIHETAVFTASHTPTLPSSASSLSNTRDREARERA-NRMKEEWGDRAVL 606

Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
           +L+ L LG+E V P Y  +++  FTFPQ++GI GG+P +S Y VG Q +   YLDPH  +
Sbjct: 607 ILIGLRLGIEGVTPIYYDSVKALFTFPQTVGIAGGRPSSSYYFVGTQGDHLFYLDPHSTR 666

Query: 330 PVINI-----GKDDLE-----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
           P + +     G  D       ++  T+HSD +R +H+  +DPS+  GF  R+ +++ D  
Sbjct: 667 PAVPLRVPTDGPYDATGQFTLSEMKTFHSDKVRKMHISGLDPSMLCGFIVRNVEEWRDLR 726

Query: 380 ARASKLAEESNG-APLFTV 397
           AR   LA+   G AP+FT+
Sbjct: 727 ARVDALAKSKGGKAPIFTI 745


>gi|444525500|gb|ELV14047.1| Cysteine protease ATG4D [Tupaia chinensis]
          Length = 431

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 104/384 (27%), Positives = 171/384 (44%), Gaps = 89/384 (23%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 60  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 109

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 110 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSRSASPSRYHGPAH 169

Query: 152 -RKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 202
            R P        L++  +R + +I+  F D   +PF +H L++ G++ G  AG W GP  
Sbjct: 170 WRPPRWAQGTPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-- 225

Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 262
                  +A   R       +   + +YV    +D     A VV +              
Sbjct: 226 -----SLVAHILRKAVESCSEVTRLVVYV---SQDCTVYKADVVRL-------VARPDPA 270

Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
           A+W  +++LVP+ LG E +NP Y+P ++L  T P                    ++  +Y
Sbjct: 271 AEWKSVVILVPVRLGGETLNPVYVPCVKLMPTPP-------------------TDDFLLY 311

Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 312 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFEMLCSEL 369

Query: 383 SKLAEESNGA---PLFTVTQTHKK 403
           +++   S+     P+FT+ + H +
Sbjct: 370 TRVLSSSSATERYPMFTLAEGHAQ 393


>gi|353227348|emb|CCA77858.1| hypothetical protein PIIN_00505 [Piriformospora indica DSM 11827]
          Length = 1257

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 95/294 (32%), Positives = 137/294 (46%), Gaps = 61/294 (20%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 121
           F  D++SR+ ++YR  F PI D+ +                                   
Sbjct: 317 FYSDYTSRVWLTYRNTFPPIRDTALSCLEPVASRSTHNNSSSTDISQPLPSPSKPRWPWS 376

Query: 122 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE---ILHLFGD- 172
                TSD GWGCMLR+ Q L+A AL+   L R WR+P    +  +YV+   IL  F D 
Sbjct: 377 GEKGWTSDAGWGCMLRTGQSLLANALIHLHLSRSWRRPTHPSYSPDYVQYVRILTWFLDN 436

Query: 173 -SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC--------- 222
            S  +PF IH +  AGK  G   GSW GP     + + L   +  + GL           
Sbjct: 437 PSPLAPFGIHRMALAGKELGKEVGSWFGPSTAAGAIKRLV-GEFEDAGLEVALAVDSVVY 495

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEK 280
           QS   A    S +++G  G +  V    + +      +G   W   P+L+LV + LG++ 
Sbjct: 496 QSDVYAASAASRNQNGVEGDSKTVGTSKSRKKG----QGPPKWGNRPVLILVGIRLGIDG 551

Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
           VNP Y  +++  FTFPQ++GI GG+P +S Y VG Q +S  YLDPH  +P I +
Sbjct: 552 VNPIYYESVKTLFTFPQTVGIAGGRPSSSYYFVGAQGDSLFYLDPHHTRPAIPL 605



 Score = 42.4 bits (98), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 16/53 (30%), Positives = 34/53 (64%), Gaps = 2/53 (3%)

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
           T+H + +R + L ++DPS+ +GF CR+++++ D   R +++A       +F+V
Sbjct: 794 TFHCERVRKMPLSALDPSMLLGFLCRNEEEWKDLRERLAEMARTKKA--IFSV 844


>gi|348511374|ref|XP_003443219.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
          Length = 459

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 98/342 (28%), Positives = 154/342 (45%), Gaps = 59/342 (17%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-------------- 142
           F + F+S +  +YR+GF P+  S +T+D GWGC+LRSSQML+AQ L              
Sbjct: 98  FRRCFASLLWFTYRRGFRPLPGSSLTTDSGWGCVLRSSQMLLAQGLLLHLMSPGWTWSGN 157

Query: 143 ---------LFHRLGR---------------PWRKPLQKPFDREYVEILHLFGDSETSPF 178
                    L H +                  W   L +P +     IL  F D+ T+PF
Sbjct: 158 QRVVKDDMDLIHSVNDGFSSSERESKRSRHLSWGSILDRPTEGTPRRILRWFADNPTAPF 217

Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
            IH L++ GK+ G  AG W GP          A   R         LP  +  V+ D   
Sbjct: 218 GIHRLVELGKSSGKKAGDWYGP-------SIAAHILRKAVEASVVDLPNLVAYVAQD--- 267

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
                  + + D  + C         W  +L+LVP+ LG + +NP YI +++        
Sbjct: 268 -----CTIYLQDVRKLCE--RPLPQHWKSVLILVPVRLGGQDLNPSYITSVKKLLMLECC 320

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
           +GI+GGKP  S + VG Q++  +YLDPH  QP +++ K+       ++H    R +    
Sbjct: 321 IGIIGGKPKHSLFFVGFQDDHLLYLDPHYCQPTVDVTKN---FPLESFHCKNPRKMPFSR 377

Query: 359 IDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQ 399
           +DPS  IGFY + + +F+  C   ++ ++  +   P+F   +
Sbjct: 378 MDPSCTIGFYAKGQMEFESLCTSVNEAVSASAETYPMFIFEE 419


>gi|340518098|gb|EGR48340.1| protease required for autophagy [Trichoderma reesei QM6a]
          Length = 450

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/312 (31%), Positives = 148/312 (47%), Gaps = 50/312 (16%)

Query: 97  FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 133
           F +D +++  ++YR GF+PI  S                         +SD GWGCM+RS
Sbjct: 115 FTEDMAAKFWMTYRSGFEPIPKSVDPRATSALSFSVRIKSTLTDPTGFSSDSGWGCMIRS 174

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 192
            Q L+A  +   +LGR WR+   +   +E   ++ +F D   +PFSIHN ++ G  A G 
Sbjct: 175 GQSLLATTIATLQLGRDWRRGKNQ---QEERRLISMFADDPRAPFSIHNFVRHGATACGK 231

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP        A A+C +A T      L + +Y  +  +D        V   D  
Sbjct: 232 FPGEWFGP-------SATAQCIQALTS--SSDLDLHVYSPNDGQDVYEDSFMKVAKPD-- 280

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
                   GQ D+ P L+L+   LG++K+ P Y   L  T   PQS+GI GG+P +S Y 
Sbjct: 281 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLIATLQMPQSVGIAGGRPSSSHYF 331

Query: 313 VGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           VG Q     YLDPH  +  +   +D     + D  + H+  +R +H+  +DPS+ IGF  
Sbjct: 332 VGSQGSYLFYLDPHHTRKALPYHEDVANYTDEDIDSCHTSRLRRLHVKEMDPSMLIGFLI 391

Query: 370 RDKDDFDDFCAR 381
           R + D+ ++  R
Sbjct: 392 RSESDWAEWRQR 403


>gi|355750993|gb|EHH55320.1| hypothetical protein EGM_04504, partial [Macaca fascicularis]
          Length = 268

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 92/268 (34%), Positives = 133/268 (49%), Gaps = 40/268 (14%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 184 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 238
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 239 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 286
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVG 314
            TL+  F  PQSLG++GGKP ++ Y +G
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIG 268


>gi|388856806|emb|CCF49593.1| related to ATG4-essential for autophagy [Ustilago hordei]
          Length = 1572

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 167/387 (43%), Gaps = 114/387 (29%)

Query: 112  GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK---PL-QKPFDRE----- 162
            GF   G   +T+D GWGCMLR+ Q L+A AL+   LGR W++   PL Q+ F  E     
Sbjct: 824  GFSRAG---LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRDAPPLRQQQFLEELAGLS 880

Query: 163  ----------------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWV 198
                                  Y++IL  F D  S   PF +H + + GK  G   G W 
Sbjct: 881  IADAAEKESLQEWRQKRARHATYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWF 940

Query: 199  GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD-------A 251
            GP     + + L               P A   V    DG      V  +D+       +
Sbjct: 941  GPSTASGAIKQL-----------VSEFPQAGIAVELARDG------VFYLDEVRAAASAS 983

Query: 252  SRHCSVFSKGQAD---------------WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
            +   SV S G+A                W  P+L+L+ + LGLE VNP Y  +++ TF+F
Sbjct: 984  ASAASVQSGGKARSSGAASGSRKGEGLIWRRPVLILIGIRLGLESVNPIYYESVKATFSF 1043

Query: 296  PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI--------------------- 334
            P S+GI GG+P +S Y +G Q  S  YLDPH+V+P + +                     
Sbjct: 1044 PHSVGIAGGRPSSSYYFMGHQGNSLFYLDPHNVRPAVPLRYPPSTFPDAVPRHLGIAHRF 1103

Query: 335  ---GKDD---------LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
                KDD          E  TST+H + +R + + S+DPS+ +GF  +D++   D CAR 
Sbjct: 1104 VLEDKDDEDEWWSHAYSEVQTSTFHCEKVRRMPIKSLDPSMLLGFLVKDEESLQDLCARI 1163

Query: 383  SKLAEESNGAPLFTVTQTHKKPVNHSD 409
              L +      +F+  ++  K V+  D
Sbjct: 1164 KALPKT-----IFSFAESAPKWVDDDD 1185


>gi|432871194|ref|XP_004071879.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
          Length = 452

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 96/371 (25%), Positives = 167/371 (45%), Gaps = 65/371 (17%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
           S +S + LLG  +++ +DEA  +         F + F+S + ++YR+GF  +  S +T+D
Sbjct: 70  SKSSPLILLGKSYEL-KDEANKE--------RFRRSFASLLWLTYRRGFPQLAGSSLTTD 120

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----------------------------- 155
            GWGC+LR+ QML+A+ LL H +   W   +                             
Sbjct: 121 SGWGCVLRTGQMLLARGLLTHLMPPGWMWSVWYRAVKDDLDLPHHADCTDCKSNMRCRYQ 180

Query: 156 ------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
                  +P +  + +++  F D   +PF IH L++ G + G  AG W GP  +      
Sbjct: 181 SLGSLYDRPLEAMHRKVVSWFADHPKAPFGIHRLVELGASSGKKAGDWYGPSIVA---HI 237

Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 269
           L +   A        LP  +  V+ D          + + D    C         W  ++
Sbjct: 238 LQKAVAASV-----DLPNLVVYVAQD--------CTIYLQDVRGLCE--RPPPHSWKSVI 282

Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
           +LVP+ LG + +NP YI  ++        +GI+GG+P  S + VG Q++  +YLDPH  Q
Sbjct: 283 ILVPVRLGGQDLNPSYISCVKKLLELQCCIGIIGGRPKHSLFFVGFQDDQLLYLDPHYCQ 342

Query: 330 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES 389
             +N+ K++   +  ++H    R +    +DPS  IGFY   + + +  C   +++   S
Sbjct: 343 LTVNVTKENFPLE--SFHCKYPRKMPFSRMDPSCTIGFYASGQQELELLCTNVNEVVSTS 400

Query: 390 -NGAPLFTVTQ 399
             G P+F  ++
Sbjct: 401 AEGYPMFIFSE 411


>gi|448112117|ref|XP_004202013.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
 gi|359465002|emb|CCE88707.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
          Length = 480

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 98/330 (29%), Positives = 154/330 (46%), Gaps = 60/330 (18%)

Query: 85  LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 116
           LG   G+    E ++D  SRI  +YR GF+PI                            
Sbjct: 69  LGRRYGSGSKEEMDKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128

Query: 117 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
                  +   T+DVGWGCM+R+SQML+A A+    LGR +        ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAIQLLLLGRGFT--YADSSEKKHSDIIDMF 186

Query: 171 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
            D   +PFS+HN ++A     L    G W GP A   S + L + Q  E+     S P  
Sbjct: 187 TDDPKAPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQFDES-----SSPRF 241

Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
             ++S   D           DD  +   +  + +     IL+L+P+ LGL KV+P Y  +
Sbjct: 242 RVIISESCD---------IYDD--KIGKLLQENEDAEGAILILLPVRLGLNKVSPYYHNS 290

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           L   F+ PQ +GI GGKP +S Y  G    + +YLDPH  Q V         +   T+H+
Sbjct: 291 LSSLFSSPQLVGIAGGKPSSSYYFFGSHNGNLLYLDPHYPQSV------KASSIYDTFHT 344

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
             ++ + ++ +DPS+ IG   + K+D++ F
Sbjct: 345 HNVQSLKIEDMDPSMLIGILIKSKEDYESF 374


>gi|195437827|ref|XP_002066841.1| GK24338 [Drosophila willistoni]
 gi|194162926|gb|EDW77827.1| GK24338 [Drosophila willistoni]
          Length = 400

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 94/316 (29%), Positives = 157/316 (49%), Gaps = 28/316 (8%)

Query: 92  NGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
           N + E +   +D  SR+  +YR  F P+G+ ++T+D GWGCMLR  QM++AQAL+   LG
Sbjct: 52  NAIQELDLIRRDIQSRLWCTYRHSFVPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLG 111

Query: 149 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 208
           R W     +  D  Y++I++ F D+  S +S+H +   G++     G W+GP  + +  +
Sbjct: 112 REWYWT-SECRDATYLKIVNRFEDARKSYYSLHQIALMGESQNKMVGEWLGPNTVAQILK 170

Query: 209 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 268
            L  C      L        I+V              V +DD        S+    W P+
Sbjct: 171 KLV-CFDDWCSL-------VIHVAMDS---------TVVLDDIYS----LSQDGESWKPL 209

Query: 269 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           LL++PL LG+  +NP Y+P L+  F    S G++GG+P  + Y VG  ++  +YLDPH  
Sbjct: 210 LLIIPLRLGITDINPIYVPALKRCFELESSCGMIGGRPNQALYFVGYVDDEVLYLDPHTT 269

Query: 329 QPVINIGKDDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           Q    +G+    A+     TYH      ++  ++DPSLA+ F C+ +  F+    +  + 
Sbjct: 270 QRTGAVGQKTTTAEQELDETYHQKYAARLNFSAMDPSLAVCFICKTQSSFELLLKQLREE 329

Query: 386 AEESNGAPLFTVTQTH 401
               +   LF ++++ 
Sbjct: 330 VLTLSSPALFEISKSR 345


>gi|358056752|dbj|GAA97415.1| hypothetical protein E5Q_04093 [Mixia osmundae IAM 14324]
          Length = 1202

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 112/390 (28%), Positives = 159/390 (40%), Gaps = 110/390 (28%)

Query: 97  FNQDFSSRILISYRKGFDPI---------------------------GDSKITSDVGWGC 129
           F +DF+SRI ++YR GF PI                            +  +++D GWGC
Sbjct: 545 FYEDFTSRIQLTYRAGFPPIPTTVSNGPATTAFNAVLSSLTGRSPLQANDGLSTDAGWGC 604

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFD--------------REYVEILHLFGD--S 173
           MLR+ Q L+A AL F  LGR WR+      +                Y  +L  F D  S
Sbjct: 605 MLRTGQSLLANALAFVHLGRDWRRTCSSSDESPDIPEESRSLEHFETYARLLTWFLDDPS 664

Query: 174 ETSPFSIHNLLQAGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV- 231
              PFS+H     GK  G    G W GP     + + LA      +     +L +A+ V 
Sbjct: 665 PLCPFSVHRFAVVGKEQGGKEIGEWFGPSTAAGAIKHLA------SNFAPANLGVAVSVD 718

Query: 232 --VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 289
             V   +       P      A R     S   +   P+L+L+   LGL+KVNP Y  ++
Sbjct: 719 GTVYRSDVQAAANPPFSEPATAGRQDPAPSVRTSWQRPVLILINARLGLDKVNPLYYESI 778

Query: 290 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH----------------------- 326
           +   +FPQS+GI GG+P +S Y VGVQ+ S  Y+DPH                       
Sbjct: 779 KAALSFPQSVGISGGRPSSSYYFVGVQQNSVYYIDPHHTKPAIPFRQPPPDIAALAAELP 838

Query: 327 -DVQPVINIGKDDL----------------EADTST-----------------YHSDVIR 352
            D+   +N  +  L                E D +T                 +H D +R
Sbjct: 839 LDIHSPLNAWQRSLGDSLPPTPGAEPPAPDECDDATRLRAWFANEYDETCFGSFHCDRVR 898

Query: 353 HIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
            + L  +DPS+ IGF CRD+ D+DD  +RA
Sbjct: 899 KMPLSGLDPSMLIGFLCRDEADWDDLQSRA 928


>gi|389750681|gb|EIM91754.1| hypothetical protein STEHIDRAFT_88418 [Stereum hirsutum FP-91666
           SS1]
          Length = 1286

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 102/312 (32%), Positives = 145/312 (46%), Gaps = 57/312 (18%)

Query: 91  NNGLAEFNQDFSSRILISYRKGFDPIGDSKIT---------------------------- 122
           NN    F  DF+SR+ ++YR  F PI DS +T                            
Sbjct: 333 NNWPPVFYSDFTSRVWLTYRSHFQPIRDSTLTALESEQANMAHAGPVIMASSPPTKKWGW 392

Query: 123 ---------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLF 170
                    SD GWGCMLR+ Q L+A AL+   LGR WR+P    +  +Y   V++L  F
Sbjct: 393 PGSGEKGWTSDAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQMLTWF 452

Query: 171 GDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
            DS T   PFS+H +  AGK  G   G W GP     + + L      E GLG     +A
Sbjct: 453 FDSPTPHCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVS---IA 508

Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
                   D      P +    + +  +    G+A    +L+L+ + LGL+ VNP Y  T
Sbjct: 509 SDSQIFQSDVFAASHPPMDSPSSKKKLASTWGGRA----VLVLIGIRLGLDGVNPIYYET 564

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           ++  +TFPQS+GI GG+P +S Y VG Q ++  YLDPH  +P +      L    ST  +
Sbjct: 565 IKALYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAV-----PLRPPPST--N 617

Query: 349 DVIRHIHLDSID 360
           D++  I  +SI+
Sbjct: 618 DIVLDISRESIE 629



 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 51/91 (56%), Gaps = 15/91 (16%)

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
           A+  T+H + +R + L  +DPS+ +GF CRD+ D++DF AR + L++        T+   
Sbjct: 836 AELKTFHCERVRKMPLSGLDPSMLVGFLCRDEGDWEDFKARVADLSKTHK-----TIFSI 890

Query: 401 HKKPVNH-SDVLGETGGVPEDDSLGVMSMND 430
           H +P ++ SD          +D LG+ SM++
Sbjct: 891 HDEPPSYPSD---------SEDHLGLESMSE 912


>gi|426230580|ref|XP_004009345.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Ovis
           aries]
          Length = 438

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 105/347 (30%), Positives = 163/347 (46%), Gaps = 31/347 (8%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
           +S S I  + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+    +TSD
Sbjct: 85  TSFSKISSVHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAGGTLTSD 138

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLL 184
            GWGCMLRS QM++AQ LL H L R W    Q                    P       
Sbjct: 139 CGWGCMLRSGQMMLAQGLLLHLLPRDWTWS-QGAGLGPAEPPGLGSPSPGPGPXXXXXXX 197

Query: 185 QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 244
             G+A G  AG W GP         +A   R      C  +   +  VS D         
Sbjct: 198 SWGRAPGKKAGDWYGP-------SLVAHILRKAVE-SCSEVTRLVVYVSQDC-------- 241

Query: 245 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
            V   D +R  +  S   A+W  +++LVP+ LG E +NP Y+P ++        LGI+GG
Sbjct: 242 TVYKADVARLVAR-SDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGG 300

Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
            P  S Y +G Q++  +YLDPH  QP +++ + D   +  ++H    R +    +DPS  
Sbjct: 301 TPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCT 358

Query: 365 IGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHS 408
           +GFY  D+ +F+  C+  +++   S+     P+FT+ + H +  +HS
Sbjct: 359 VGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLVEGHAQ--DHS 403


>gi|113931596|ref|NP_001039246.1| autophagy related 4D, cysteine peptidase [Xenopus (Silurana)
           tropicalis]
 gi|89273389|emb|CAJ82151.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
 gi|114108226|gb|AAI22932.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
          Length = 470

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 106/385 (27%), Positives = 172/385 (44%), Gaps = 79/385 (20%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
           S ++ ++LLG  +    D+ +           F +DF SR+ ++YR+ F  +  + +T+D
Sbjct: 76  SRSAPVYLLGERYYFRLDDEID---------RFQKDFVSRVWLTYRRDFPALEGTALTTD 126

Query: 125 VGWGCMLRSSQMLV---------------AQALLFH------------------------ 145
            GWGCM+RS QML+               ++AL  H                        
Sbjct: 127 CGWGCMIRSGQMLLAQGLLLHLLSREWTWSEALYTHFVEMEPIRSSSPSSMPLSLATDHS 186

Query: 146 -RLGRPWRKPLQKPFDRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 203
            R  +P     + P+  E +  I+  F D  ++PF +H ++  G  +G  AG W GP   
Sbjct: 187 GRHSQPQTHCSRAPYGGEVHQNIVSWFSDHASAPFGLHRMVALGSIFGKRAGDWYGP--- 243

Query: 204 CRSWEALARCQRAETGLGCQSLPMAIYVVSG----DEDGERGGAPVVCIDDASRHCSVFS 259
                 +A   +       +   +++YV         D E+  A  V   D SR      
Sbjct: 244 ----SIVAHIIKKAIESSSEVPDLSVYVSQDCTVYKADIEQLFAGEVPHTDTSR-----G 294

Query: 260 KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES 319
            G+A    +++LVP  LG E  NP Y   L+     P  LGI+GGKP  S Y +G Q+  
Sbjct: 295 AGKA----VIILVPARLGGETFNPVYKHCLKEFLRMPSCLGIIGGKPKHSLYFIGYQDNY 350

Query: 320 AIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
            +YLDPH  QP I+  +D+   +  ++H +  R + +  +DPS    FY +++DDF   C
Sbjct: 351 LLYLDPHYCQPYIDTSRDNFPLE--SFHCNAPRKLSITRMDPSCTFAFYAKNRDDFGKLC 408

Query: 380 ARASKL-----AEESNGAPLFTVTQ 399
              SK+     AEE    P+F++++
Sbjct: 409 EHLSKVLHSPQAEEK--YPIFSISE 431


>gi|322795203|gb|EFZ18025.1| hypothetical protein SINV_08608 [Solenopsis invicta]
          Length = 403

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 110/371 (29%), Positives = 166/371 (44%), Gaps = 64/371 (17%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 120
           I  +   +W+LG  +   ++           L    +D  S++  +YRKGF PIG  +S 
Sbjct: 16  IPQTDEPVWILGRKYNAIKE-----------LDAIRRDIRSKLWFTYRKGFIPIGGCNST 64

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFS 179
            TSD GWGCMLR  QM++AQAL+   LG+ W+  P  K  +  Y++IL  F D   + FS
Sbjct: 65  FTSDKGWGCMLRCGQMVLAQALITLHLGKDWQWMPETK--NNTYLKILSRFEDKRAAAFS 122

Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIY 230
           IH +   G + G   G W GP  + +          W +L      +  L    +     
Sbjct: 123 IHQIALTGASEGKEVGQWFGPNTIAQVLKKLIVYDEWSSLTIHVALDNTLIVNDILKQCR 182

Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
           +  G+     G  P+              K  + W P+LLL+PL LGL ++NP YI  L+
Sbjct: 183 IEGGETAEADGEVPL--------------KAPSQWKPLLLLIPLRLGLSEINPVYINGLK 228

Query: 291 L--------------------TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
           +                    +F   QSLG++GGKP  + Y +G   +  IYLDPH  Q 
Sbjct: 229 VKFKILCMQKKKYICIQFFQTSFKISQSLGVIGGKPNLALYFIGCVGDEVIYLDPHTTQR 288

Query: 331 V----INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
                  I ++++E D  TYH      I +  +DPS+A+ F+C  + +F   C    +  
Sbjct: 289 SGSVEDKISEEEIEMDI-TYHCKSASRIPITGMDPSVALCFFCATEKEFMSLCKSMQEEL 347

Query: 387 EESNGAPLFTV 397
                 PLF +
Sbjct: 348 ILPEKQPLFEL 358


>gi|148226916|ref|NP_001087417.1| cysteine protease ATG4D [Xenopus laevis]
 gi|61211765|sp|Q68FJ9.1|ATG4D_XENLA RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related protein 4 homolog D
 gi|51260960|gb|AAH79754.1| MGC84754 protein [Xenopus laevis]
          Length = 469

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 103/356 (28%), Positives = 161/356 (45%), Gaps = 63/356 (17%)

Query: 91  NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
           ++ +  F +DF SR+ ++YR+ F  +  + +T+D GWGCM+RS QML+AQ LL H L R 
Sbjct: 93  DDEIERFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSRE 152

Query: 151 W--RKPLQKPF----------------------------------------DREYVEILH 168
           W   + L + F                                        D+ +  I+ 
Sbjct: 153 WTWSEALYRHFVEMEPIRSSSPPSMPLSSLATGHSAGDYQPHTQCSGAPHGDQVHRNIMR 212

Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
            F D   SPF +H L+  G  +G  AG W GP         +A   +       +   ++
Sbjct: 213 WFSDHPGSPFGLHQLVTLGSIFGKKAGDWYGP-------SIVAHIIKKAIETSSEVPELS 265

Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
           +YV S D    +     +   D     +    G+A    +++LVP+ LG E  NP Y   
Sbjct: 266 VYV-SQDCTVYKADIEQLFAGDVPHAETSRGAGKA----VIILVPVRLGGETFNPVYKHC 320

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           L+     P  LGI+GGKP  S Y +G Q+   +YLDPH  QP I+  K+D   +  ++H 
Sbjct: 321 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQPYIDTSKNDFPLE--SFHC 378

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL-----AEESNGAPLFTVTQ 399
           +  R I +  +DPS    FY ++ +DF   C    K+     AEE    P+F++++
Sbjct: 379 NSPRKISITRMDPSCTFAFYAKNSEDFGKLCDHLMKVLHSPRAEEK--YPIFSISE 432


>gi|339252578|ref|XP_003371512.1| cysteine protease ATG4B [Trichinella spiralis]
 gi|316968242|gb|EFV52545.1| cysteine protease ATG4B [Trichinella spiralis]
          Length = 414

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 115/378 (30%), Positives = 177/378 (46%), Gaps = 63/378 (16%)

Query: 66  STSDIWLLGVCHKIAQDE-------ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           S S IWLLG   + A++E          + +    L++F +DF +RI  +YR GF  I  
Sbjct: 45  SHSPIWLLG--KQYAKNEPRPNLRRGFDENSAVGKLSDFLEDFRTRIWFTYRHGFPCIPG 102

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDREYVEILHLFGDSET 175
           +K  +D GWGC +RS QML+A+ +L H LGR W   +  L +     + +++ LF D+ T
Sbjct: 103 TKFDNDCGWGCTIRSGQMLLAETMLRHYLGRDWLLGQSGLPEDEALMHRKVIGLFCDNLT 162

Query: 176 SPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
           SPFS+HNL+Q G+  +G  AGSW GP ++ +  + +A     E GL      +A++V+  
Sbjct: 163 SPFSLHNLVQVGQQLFGKQAGSWYGPVSVLQILQ-VAMNNAIERGL---VEGLAVHVIGD 218

Query: 235 DE----DGERGG-----APV----------------VCIDDASRHCSV------------ 257
            E    D ER G     APV                    D  R  SV            
Sbjct: 219 GELIIDDVERLGCGLTLAPVPRRGPENDLADRQPKSSSYLDLRRLTSVSNGDLLPSHDGE 278

Query: 258 ------FSKGQADWTP-ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
                 F      W+  +L+L+PL LG+EK N  Y   L+   +    +G++GG+     
Sbjct: 279 SIGSTEFVDETRSWSRGVLVLLPLRLGVEKFNQLYSDHLKRVLSTKFCVGVIGGRHHKCY 338

Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
           Y  G   +  I LDPH  QP ++  +  +     ++H    +   +  IDP  +IGFY R
Sbjct: 339 YFCGWHTDYLIRLDPHYSQPAVDATQPGVS--LHSFHCKYPKKTLIADIDPWCSIGFYIR 396

Query: 371 DKDDFDDFCARASKLAEE 388
           ++ +   F A  S++  E
Sbjct: 397 NRLELQSFLADISEVGFE 414


>gi|396482697|ref|XP_003841525.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
 gi|312218100|emb|CBX98046.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
          Length = 462

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 109/353 (30%), Positives = 152/353 (43%), Gaps = 82/353 (23%)

Query: 88  AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVG 126
           A   N  + F  DF SRI ++YR GF  I  S+                      TSD G
Sbjct: 87  AQYGNWPSAFLDDFESRIWMTYRSGFPVIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           +GCM+RS Q ++A AL   RLGR WR     P  +E+  IL LF D   +PFSIH  ++ 
Sbjct: 147 FGCMIRSGQCILANALQTLRLGRDWRY-QDDPTAQEHCNILSLFADDPQAPFSIHRFVEH 205

Query: 187 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G A  G   G W GP A  R  + L   +  E GL        +YV SGD      GA V
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLVH-KYKEAGL-------RVYV-SGD------GADV 250

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +D  +  +V   G+  W P L+LV   LG++K+ P Y   L+ +    QS+GI GG+
Sbjct: 251 Y--EDKLKQVAVEEDGE--WIPTLILVGTRLGIDKITPVYWEALKASLQMKQSMGIAGGR 306

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVI----NIGKDDLEA-------------------- 341
           P AS Y V  Q     YLDPH  +P +        D+                       
Sbjct: 307 PSASHYFVATQANHFFYLDPHSTRPHLPYRPPTSSDETTTQLASSITSTSSSTTIVPSAS 366

Query: 342 ----------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
                           D S+ H+  IR + +  +DPS+ + F    ++D++ +
Sbjct: 367 SLPPRSPPEPSTYTLDDISSCHTRRIRRLQIREMDPSMLLAFLVTSQEDYEKW 419


>gi|410918329|ref|XP_003972638.1| PREDICTED: cysteine protease ATG4D-like [Takifugu rubripes]
          Length = 499

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 115/387 (29%), Positives = 175/387 (45%), Gaps = 76/387 (19%)

Query: 77  HKIAQDEALGDAAGNNG---LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC---- 129
           +KI+    LGD+   N    +  F   F SRI ++YRK F  +  S  T+D GWGC    
Sbjct: 83  NKISPVTILGDSYLLNSEDEVERFRLAFVSRIWLTYRKEFPQLEGSTWTTDCGWGCMLRS 142

Query: 130 --MLRSSQMLV-----------AQAL------LFH-----RLG----------------- 148
             ML +  +LV           AQ L      +F      R G                 
Sbjct: 143 GQMLLAQGLLVHLMPRGWTWPDAQPLTDVDLEVFRPRSPARAGGVPIPSFASPRGPSTPE 202

Query: 149 RPW----------RKPLQKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
           RP           +K L+   DR+    + +++  FGD  T+PF IH L++ GK+ G  A
Sbjct: 203 RPLLSEQATKCSRKKRLESVQDRQAEPTHQKLVFWFGDQPTAPFGIHQLVEIGKSAGKKA 262

Query: 195 GSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 253
           G W GP  +     +A+AR     +        + +YV   D    +     +C    S+
Sbjct: 263 GDWYGPAIVAHILRKAVARASAVHS--------LVVYVAQ-DCTVYKEDVMHLCDPTPSQ 313

Query: 254 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 313
             S     QA W  +++LVP+ LG E +NP YI  ++        +GI+GGKP  S Y V
Sbjct: 314 TPSDPLSHQA-WKSVIILVPVRLGGECLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFV 372

Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
           G Q+E  +YLDPH  QPV+++ +  + +   ++H +  + +  + +DPS  IGFY + K 
Sbjct: 373 GFQDEQLLYLDPHYCQPVVDVSQ--VNSSLESFHCNAPKKMPFNRMDPSCTIGFYAKSKK 430

Query: 374 DFDDFC-ARASKLAEESNGAPLFTVTQ 399
           DF+  C A  + L+      PLFT  +
Sbjct: 431 DFESLCSAVGTALSSSKERYPLFTFIE 457


>gi|393219109|gb|EJD04597.1| hypothetical protein FOMMEDRAFT_133827 [Fomitiporia mediterranea
           MF3/22]
          Length = 1147

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 99/298 (33%), Positives = 135/298 (45%), Gaps = 62/298 (20%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 121
           G N    F  DFSSR+ ++YR  + PI D  +                            
Sbjct: 335 GANWPPGFYSDFSSRVWLTYRSHYPPIRDQTLAQLEAEASGQIPLQPVSASPRKWHILGS 394

Query: 122 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDS 173
                TSD GWGCMLR+ Q L+A AL+   LGR WR+P Q  +  +   YV+IL  F DS
Sbjct: 395 GEKGWTSDSGWGCMLRTGQSLLANALIHLHLGRDWRRPPQPVYTVDYATYVKILTWFFDS 454

Query: 174 ET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV 231
                PFS+H +  AGK  G   G W GP     + + +     AE GLG  S+     V
Sbjct: 455 TDIHCPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTVVHA-FAEAGLGV-SVATDGVV 512

Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---------------W--TPILLLVPL 274
              D        P +      RH  + +   +                W   P+L+LV +
Sbjct: 513 YETDVLAASNAGPYMY-----RHSRMATSSPSTRRRRSAQQQQSMMSIWGQRPVLVLVGI 567

Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
            LG++ VNP Y   ++  FTFPQS+GI GG+P +S Y VGVQ ++  YLDPH  +P +
Sbjct: 568 RLGIDCVNPVYYDAVKALFTFPQSVGIAGGRPSSSYYFVGVQTDNLFYLDPHHSRPSV 625



 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 18/42 (42%), Positives = 29/42 (69%)

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
           T+H D +R + L S+DPS+ IGF CRD+ D+ D   R ++++
Sbjct: 728 TFHCDRVRKMPLSSLDPSMLIGFLCRDERDWKDLRERVTEMS 769


>gi|403296347|ref|XP_003939073.1| PREDICTED: cysteine protease ATG4D [Saimiri boliviensis
           boliviensis]
          Length = 463

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 161/376 (42%), Gaps = 96/376 (25%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
           +S S I  + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+    +TSD
Sbjct: 109 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 162

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPF---------------- 159
            GWGCMLRS QM++AQ LL H L R W            L  P                 
Sbjct: 163 CGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGPASPSRYHGPARWMPPCW 222

Query: 160 ---------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
                    +R + +I+  F D   +PF +H L++ G++ G  AG W GP         +
Sbjct: 223 AQGAPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-------SLV 275

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
           A   R       +   + +YV                    S+ C+    G+   TP L 
Sbjct: 276 AHILRKAVESSSEVTRLVVYV--------------------SQDCT----GKGTCTPSLQ 311

Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
            +                LR        LGI+GGKP  S Y +G Q++  +YLDPH  QP
Sbjct: 312 EL----------------LRCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP 351

Query: 331 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 390
            +++ + +   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +++   S+
Sbjct: 352 TVDVSQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSS 409

Query: 391 GA---PLFTVTQTHKK 403
                P+FT+ + H +
Sbjct: 410 ATERYPMFTLAEGHAQ 425


>gi|355703136|gb|EHH29627.1| Cysteine protease ATG4D [Macaca mulatta]
          Length = 511

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 100/382 (26%), Positives = 162/382 (42%), Gaps = 80/382 (20%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
           +S S I  + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+    +TSD
Sbjct: 129 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 182

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPW-------------------------------RK 153
            GWGCMLRS QM++AQ LL H L R W                               R 
Sbjct: 183 CGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRW 242

Query: 154 PLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
               P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP         +
Sbjct: 243 AQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLV 295

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS------KGQAD 264
           A   R      C  +   +  VS D       +PV     +     +        + +  
Sbjct: 296 AHILRKAVE-SCSEVTRLVVYVSQDCTAAEASSPVSDTPASGPLHLLPLLLGVLFQQRCR 354

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +  L+   L                      LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 355 WLFVCELLRCEL---------------------CLGIMGGKPRHSLYFIGYQDDFLLYLD 393

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 394 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 451

Query: 385 LAEESNGA---PLFTVTQTHKK 403
           +   S+     P+FT+ + H +
Sbjct: 452 VLGSSSATERYPMFTLAEGHAQ 473


>gi|426329870|ref|XP_004025954.1| PREDICTED: cysteine protease ATG4C [Gorilla gorilla gorilla]
          Length = 491

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 116/452 (25%), Positives = 176/452 (38%), Gaps = 116/452 (25%)

Query: 65  SSTSDIWLLGVCHKIA---QDEALGDAAG--------NNGLAEFNQDFSSRILISYRKGF 113
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPTESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 157
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 158 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 181
            F          +RE+                           +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNYDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 302 VGGKPGASTYIVGVQE----------------ESAIYLDPHDVQPVINIGKDDLEADT-- 343
           +GGKP  S Y  G QE                ++ + L+  + +P +  G +D   +   
Sbjct: 323 IGGKPKQSYYFAGFQENEVQRSSMNSLKQKSSKNNLKLEGSEKRPQMGFGSEDEFKNILL 382

Query: 344 -------------STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 390
                         T+H    + +    +DPS  IGFYCR+  DF+      +K+ + S+
Sbjct: 383 DHVQAFGPPSYPRLTFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFERASEEITKMLKFSS 442

Query: 391 GA--PLFTVTQTHKK-------PVNHSDVLGE 413
               PLFT    H +         N  D+  E
Sbjct: 443 KEKYPLFTFVNGHSRDYDFTSTTTNEEDLFSE 474


>gi|392572178|gb|EIW65350.1| hypothetical protein TRAVEDRAFT_33890 [Trametes versicolor
           FP-101664 SS1]
          Length = 997

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 91/282 (32%), Positives = 133/282 (47%), Gaps = 58/282 (20%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 123
           F  DF+SRI ++YR  F PI D+ +                                 T+
Sbjct: 298 FYADFTSRIWLTYRSQFFPIRDTTLAALDAELMDNPTGVPSSPPTKKWNWPLGGEKGWTT 357

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 178
           D GWGCMLR+ Q L+A AL+   LGR WR+P    +  +Y   V+I+  F D+ +   PF
Sbjct: 358 DAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQIVTWFLDNPSPLCPF 417

Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
           S+H +   GK  G   G W GP     + + L             + P A   V+   DG
Sbjct: 418 SVHRMALVGKDLGKDVGQWFGPSTAAGAIKTL-----------VHAFPEATLGVANAVDG 466

Query: 239 ERGGAPVVCIDDASRHCSVFSK----GQADW--TPILLLVPLVLGLEKVNPRYIPTLRLT 292
               + V     ASR     ++     + DW    +L+L+ + LG+E VNP Y  T++  
Sbjct: 467 TLYESDVYA---ASRSVMYSTRRHGHARMDWGDRAVLVLIGIRLGIEGVNPLYYNTIKTL 523

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
           +TFPQS+GI GG+P +S Y VG Q ++  YLDPH  +P + +
Sbjct: 524 YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPL 565



 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 2/73 (2%)

Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
           + +  T+H D +R + L  +DPS+ +GF C+D+ ++ D   R ++L    N   +F++  
Sbjct: 696 QTELKTFHCDRVRKMPLSGLDPSMLLGFLCKDEAEWLDLKDRIAELFR--NNKSIFSLAN 753

Query: 400 THKKPVNHSDVLG 412
              +  + SD +G
Sbjct: 754 EPPQYPSDSDDMG 766


>gi|426191859|gb|EKV41798.1| hypothetical protein AGABI2DRAFT_123279 [Agaricus bisporus var.
           bisporus H97]
          Length = 1261

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 101/292 (34%), Positives = 134/292 (45%), Gaps = 65/292 (22%)

Query: 97  FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 122
           F  DF SRI ++YR  F  PI DS +T                                 
Sbjct: 247 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 306

Query: 123 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSE 174
                SD GWGCMLR+ Q L+A AL+   LGR WRKP    +  +Y   V+IL  F D+ 
Sbjct: 307 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 366

Query: 175 T--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
           +  +PFS+H +  AGK +G   G W GP     + + L               P +   V
Sbjct: 367 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 415

Query: 233 SGDEDGERGGAPVVCIDDA-------SRHCSVFSKGQA-DW--TPILLLVPLVLGLEKVN 282
           S  +DG      V     A       +   S  S  QA  W   P+L+LV L LG++ VN
Sbjct: 416 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 475

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
           P Y  T++  FT PQS+GI GG+PG+S Y VG Q ++  YLDPH  +P I +
Sbjct: 476 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 527


>gi|409077121|gb|EKM77488.1| hypothetical protein AGABI1DRAFT_108018 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 1355

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 97/292 (33%), Positives = 133/292 (45%), Gaps = 65/292 (22%)

Query: 97  FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 122
           F  DF SRI ++YR  F  PI DS +T                                 
Sbjct: 334 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 393

Query: 123 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSE 174
                SD GWGCMLR+ Q L+A AL+   LGR WRKP    +  +Y   V+IL  F D+ 
Sbjct: 394 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 453

Query: 175 T--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
           +  +PFS+H +  AGK +G   G W GP     + + L               P +   V
Sbjct: 454 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 502

Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQAD--------W--TPILLLVPLVLGLEKVN 282
           S  +DG      V     A    +  +  ++         W   P+L+LV L LG++ VN
Sbjct: 503 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 562

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
           P Y  T++  FT PQS+GI GG+PG+S Y VG Q ++  YLDPH  +P I +
Sbjct: 563 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 614


>gi|302674653|ref|XP_003027011.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
 gi|300100696|gb|EFI92108.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
          Length = 858

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 94/275 (34%), Positives = 132/275 (48%), Gaps = 58/275 (21%)

Query: 88  AAGNNGLAEFNQDFSSRILISYRKGFDPI------------------------GDSKITS 123
           AA +    EF  DF+SR+ ++YR GF PI                        G   +TS
Sbjct: 148 AAASGWPQEFFSDFASRLWLTYRSGFAPIRDMALEELEPVRGGALSTLTSALTGRRGLTS 207

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIH 181
           D GWGCMLR+ Q L+A AL+   +GR             Y+ ++ LF DS +  +PFS+H
Sbjct: 208 DAGWGCMLRTGQSLLANALVVAWMGRGALA--------LYIHLISLFLDSPSPSAPFSVH 259

Query: 182 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            +  AG+A G   G W GP     + +AL      + GLG          V+  EDG   
Sbjct: 260 RMALAGRALGKDVGQWFGPSTAAGAIKALVNAY-PDAGLG----------VAIAEDG--- 305

Query: 242 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
               V      R      + + +W   P+L+L+ + LGL+ VNP Y  T++  +TFPQSL
Sbjct: 306 ----VVYQTQRRQ----KEREREWGDQPVLVLLGIRLGLDGVNPIYYDTIKQLYTFPQSL 357

Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
           GI GG+P +S Y VG Q     YLDPH  +P + +
Sbjct: 358 GIAGGRPSSSYYFVGAQAGDLFYLDPHHARPTVPL 392



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 33/47 (70%)

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
           A+T T+H + +R + +  +DPS+ IGF C+D+ D++D+  R SKL +
Sbjct: 537 AETRTFHCERVRKMPMSGLDPSMLIGFLCKDRADWEDWRTRVSKLPK 583


>gi|410989159|ref|XP_004000832.1| PREDICTED: cysteine protease ATG4A isoform 2 [Felis catus]
          Length = 336

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 92/333 (27%), Positives = 147/333 (44%), Gaps = 69/333 (20%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG-GKPGA 308
           D  + C V                                      P S   VG   PG 
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTVGESTPG- 202

Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
            T     Q +  I+LDPH  Q  +N  ++    D + +     + +++ ++DPS+A+GF+
Sbjct: 203 -TLNASNQSDELIFLDPHTTQTFVNTEENGTVDDQTFHCLQSPQRMNILNLDPSVALGFF 261

Query: 369 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 262 CKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|350595874|ref|XP_003484197.1| PREDICTED: cysteine protease ATG4A [Sus scrofa]
          Length = 393

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 93/347 (26%), Positives = 158/347 (45%), Gaps = 68/347 (19%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PI          W  
Sbjct: 57  VWILGKQHLLKTEKS-----------KLLADISARLWFTYRRKFSPID---------WN- 95

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
                                W K  ++P  +EY  IL  F D +   +SIH + Q G  
Sbjct: 96  ---------------------WEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGVG 132

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGERGGAPV 245
            G + G W GP  + +  + LA      +        +A+YV   +    ED ++     
Sbjct: 133 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCCAS 184

Query: 246 VCIDDA-------SRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
               DA       S + S  SKG    +  W P+LL+VPL LG+ ++NP Y+   +  F 
Sbjct: 185 ALSADAAVESRRDSLNASTQSKGPSACRPAWKPLLLIVPLRLGINQINPVYVDAFKECFK 244

Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
            PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++ +  D + +     + +
Sbjct: 245 MPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQPPQRM 304

Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           ++ ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 305 NILNLDPSVALGFFCQEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 350


>gi|116179672|ref|XP_001219685.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
 gi|88184761|gb|EAQ92229.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
          Length = 425

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 99/307 (32%), Positives = 142/307 (46%), Gaps = 73/307 (23%)

Query: 97  FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 133
           F  DF SRI ++YR GF+PI                      GD +  +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 192
            Q L+A ALL  +LGR WR+      +R    I+ LF D   +P+S+ N ++ G  A G 
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAER---NIVALFADDARAPYSLQNFVKHGAIACGK 229

Query: 193 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 252
             G W GP A  R  +ALA    +          + IY          G  P V  D   
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269

Query: 253 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
              S  +  + D                    + PTL L     QS+GI GG+P +S Y 
Sbjct: 270 ---SFLATARPD-----------------GETFHPTLIL---MEQSIGIAGGRPSSSHYF 306

Query: 313 VGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           VGVQ +   YLDPH  +P +   ++ L     +  + H+  +R++H++ +DPS+ IGF  
Sbjct: 307 VGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSCHTRRLRYLHVEDMDPSMLIGFLI 366

Query: 370 RDKDDFD 376
           +D+DD+D
Sbjct: 367 QDEDDWD 373


>gi|302684483|ref|XP_003031922.1| hypothetical protein SCHCODRAFT_109321 [Schizophyllum commune H4-8]
 gi|300105615|gb|EFI97019.1| hypothetical protein SCHCODRAFT_109321, partial [Schizophyllum
           commune H4-8]
          Length = 602

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 102/310 (32%), Positives = 144/310 (46%), Gaps = 82/310 (26%)

Query: 69  DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------- 121
           +IWL+GVCH               G  +F  DF++RI ++YR GF+ I D ++       
Sbjct: 114 EIWLMGVCHA-------------PGAPDFYADFATRIWLTYRSGFELIRDRQLIDLPPPV 160

Query: 122 ------------------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 157
                                   +SD GWGCMLR+ Q L+A ALL    GR WR+  + 
Sbjct: 161 ASLDGHLQGEWATDEAEPPGAYGFSSDSGWGCMLRTGQSLLANALLTAWFGRDWRRISEV 220

Query: 158 PFDRE--YVEILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
              +   YV +L LF D+   T+PFSIH +  AGK  G   G W GP     + + L   
Sbjct: 221 ETHQHSLYVHLLSLFLDTPHPTAPFSIHRMALAGKQLGKDIGQWFGPSTAAGAIKNL--- 277

Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT------- 266
                     + P+A            G   VV +D A     VF+   ++W+       
Sbjct: 278 --------VSAYPLA------------GIGVVVGMDGALSKSEVFTASHSEWSDEEAALD 317

Query: 267 ----PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
               P+L+L+ L LGL++VNP Y  T++  FTFPQS+GI GG+P +S + VG Q    IY
Sbjct: 318 WGDRPVLILLNLRLGLDRVNPIYHDTIKALFTFPQSVGIAGGRPCSSYHFVGAQGSDLIY 377

Query: 323 LDPHDVQPVI 332
           LDPH  +  +
Sbjct: 378 LDPHHTRNTV 387



 Score = 41.6 bits (96), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 30/49 (61%)

Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
            AD +T+H    + + + + DPS+  GF C+D  D+DD+ AR S+L  +
Sbjct: 524 HADLATFHCTNPKMMPISAQDPSMLAGFLCKDIADWDDWRARMSRLPNQ 572


>gi|449551395|gb|EMD42359.1| ATG4-like protein [Ceriporiopsis subvermispora B]
          Length = 988

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 94/272 (34%), Positives = 126/272 (46%), Gaps = 57/272 (20%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 124
           F  DF+SRI ++YR  F PI D+ +                                TSD
Sbjct: 305 FYSDFTSRIWVTYRSQFQPIRDTTLSALELELGESTAVATSPQPKKWNWPLGGEKGWTSD 364

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD------REYVEILHLFGDSETS-- 176
            GWGCMLR+ Q L+A  LL   LGR WR+P   P+         YV+IL  F D+ +   
Sbjct: 365 AGWGCMLRTGQSLLANTLLHLHLGRDWRRP---PYPICTADYATYVQILTWFFDNPSPLC 421

Query: 177 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
           PFS+H +   GK  G   G W GP     + + L      E GLG  S+     +   D 
Sbjct: 422 PFSVHRMALVGKELGKEVGQWFGPSTAAGAIKTLVHA-FPEAGLGV-SVATDSVIYQSD- 478

Query: 237 DGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
                    V     S   S    G++ W    +L+LV + LGL+ VNP Y  T++  +T
Sbjct: 479 ---------VYTASRSNLGSPRRNGRSGWGDRAVLVLVGIRLGLDGVNPIYYDTIKALYT 529

Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
           FPQS+GI GG+P +S Y VG Q ++  YLDPH
Sbjct: 530 FPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 561



 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 49/96 (51%), Gaps = 15/96 (15%)

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
           T+H + IR + L  +DPS+ IGF C+D++D+ D   R + L+         T+     +P
Sbjct: 693 TFHCERIRKMPLSGLDPSMLIGFLCKDEEDWLDLRKRITDLSRTHK-----TIFSIQDEP 747

Query: 405 VN-HSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 439
            N  SD          DD++G+ S+++   +  ED+
Sbjct: 748 PNWPSD---------SDDNMGLESISEPDIDMPEDE 774


>gi|431905146|gb|ELK10197.1| Cysteine protease ATG4A [Pteropus alecto]
          Length = 342

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 93/333 (27%), Positives = 150/333 (45%), Gaps = 69/333 (20%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS-SRILISYRKGFDPIGDSKITSDVGWG 128
           +W+LG  H +  D           L E    F+ +  L ++  G  P      +SD GWG
Sbjct: 35  VWILGKQHLLKTD----------SLPEIISHFTETSELTAHDGGTGP------SSDAGWG 78

Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 188
           CMLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + +   
Sbjct: 79  CMLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKM-- 136

Query: 189 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
                                            C  LP++  + + +  G          
Sbjct: 137 ---------------------------------CCILPLSADIATENPSGS--------- 154

Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
            +AS H    S     W P+LL+VPL LG+ ++NP Y+   +       SLG +GGKP  
Sbjct: 155 PNASNHSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFK-------SLGALGGKPNN 207

Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
           + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+
Sbjct: 208 AYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILNLDPSVALGFF 267

Query: 369 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 268 CKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 299


>gi|294654609|ref|XP_456671.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
 gi|218511938|sp|Q6BYP8.2|ATG4_DEBHA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|199429011|emb|CAG84627.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
          Length = 492

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 99/343 (28%), Positives = 156/343 (45%), Gaps = 79/343 (23%)

Query: 87  DAAGNNGLAEFNQDFSSRILISYRKGFDPIG----------------------------- 117
           D + ++G+ E  QD  S+I ++YR GF+PI                              
Sbjct: 77  DISVDDGVIE--QDIYSKIWLTYRTGFEPIAKCLDGPQPLSFVQSMVFNRNPISSTFNNF 134

Query: 118 -----DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW-----RKPLQKPFDREYVEIL 167
                +   T+DVGWGCM+R+SQ L+A       LGR +     R P        + EI+
Sbjct: 135 HGLLDNDNFTTDVGWGCMIRTSQALLANTYQLLFLGRGFSYGRDRSP-------RHDEII 187

Query: 168 HLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
            +F D   +PFS+HN ++      L    G W GP A   S + L           C + 
Sbjct: 188 DMFMDEPRAPFSLHNFIKVASESPLKVKPGQWFGPNAASLSIKRL-----------CDN- 235

Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP----ILLLVPLVLGLEKV 281
              +Y  +G      G   VV  + ++ +  + ++      P    IL+L+P+ LG++KV
Sbjct: 236 ---VYESNG-----TGRVKVVISESSNLYDDIITQMFTTLNPVPDAILVLLPVRLGIDKV 287

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
           NP Y  ++       QS+GI GGKP +S Y  G +    +YLDPH  Q V N       +
Sbjct: 288 NPLYHASVLELLALRQSVGIAGGKPSSSFYFFGYKGNDLLYLDPHYPQFVRN-----KTS 342

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
              TYH++  + + +D +DPS+ IG   +D +D++DF +  +K
Sbjct: 343 VYDTYHTNSYQKLSVDDMDPSMMIGILIKDINDYEDFKSSCTK 385


>gi|19115683|ref|NP_594771.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe
           972h-]
 gi|62899818|sp|Q9P373.1|ATG4_SCHPO RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|9588465|emb|CAC00556.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe]
          Length = 320

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 99/341 (29%), Positives = 144/341 (42%), Gaps = 53/341 (15%)

Query: 48  MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 107
           M R  ER L  + T      + IW LG  +KI   +            +F  D  S I I
Sbjct: 4   MARFLERYLHFAPTNTEPPGTLIWFLGHSYKIEDSQ---------WPEKFLYDSFSLITI 54

Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
           +YR G +  G   +TSD GWGCM+RS+Q L+A  L   R+  P         +++  EIL
Sbjct: 55  TYRSGIE--GLENMTSDTGWGCMIRSTQTLLANCL---RICYP---------EKQLKEIL 100

Query: 168 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
            LF D  ++PFSIH  +  GK    +  G W GP   C     +AR            +P
Sbjct: 101 ALFADEPSAPFSIHQFVTMGKTLCDINPGQWFGPTTSC---SCVARLSDQNP-----DVP 152

Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 286
           + +YV        R     V                    P+LLL+P  LG++ +N  Y 
Sbjct: 153 LHVYVARNGNAIYRDQLSKVSF------------------PVLLLIPTRLGIDSINESYY 194

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 346
             L   F     +GI GG+P ++ Y    Q +   YLDPH         +    A   T+
Sbjct: 195 DQLLQVFEIRSFVGITGGRPRSAHYFYARQNQYFFYLDPHCTHFAHTTTQ---PASEETF 251

Query: 347 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
           HS  +R + +  +DP +  GF  RD++++  F A     A+
Sbjct: 252 HSATLRRVAIQDLDPCMIFGFLIRDEEEWHSFEANQKYFAD 292


>gi|444518589|gb|ELV12252.1| Cysteine protease ATG4B, partial [Tupaia chinensis]
          Length = 324

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 87/300 (29%), Positives = 130/300 (43%), Gaps = 56/300 (18%)

Query: 66  STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
           ++  +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD 
Sbjct: 26  TSEPVWILGRKYSVLTEKE-----------EILSDVASRLWFTYRKNFPAIGGTGPTSDT 74

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
           GWGCMLR  QM+ AQAL+   LGR WR          Y  +L+ F D + S +SIH + Q
Sbjct: 75  GWGCMLRCGQMIFAQALVCRHLGRDWRWAQWTQQPDSYFNVLNAFIDRKDSYYSIHQIAQ 134

Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
            G   G + G W GP  + +  + LA                                  
Sbjct: 135 MGVGEGKSIGQWYGPNTVAQVLKKLA---------------------------------- 160

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
                      VF    +    I +   +V G   +N  Y+ TL+  F  PQSLG++GGK
Sbjct: 161 -----------VFDTWSSLAVHIAMDNTVVTGEININEAYVETLKHCFMMPQSLGVIGGK 209

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P ++ Y +G   +  IYLDPH  QP + +    L  D S +       + +  +DPS+A+
Sbjct: 210 PNSAHYFIGYVGDELIYLDPHTTQPAVELTDSCLVPDESFHCQHPPSRMSIRELDPSIAV 269


>gi|395854620|ref|XP_003799780.1| PREDICTED: cysteine protease ATG4A isoform 2 [Otolemur garnettii]
          Length = 336

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 145/332 (43%), Gaps = 67/332 (20%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  + C V                                      P S    G  P  S
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTAGESPPGS 203

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
              +  Q    I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 204 LTALN-QSNELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILNLDPSVALGFFC 262

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|402911089|ref|XP_003918175.1| PREDICTED: cysteine protease ATG4A isoform 2 [Papio anubis]
          Length = 336

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 88/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  + C V                                      P S    G +P   
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRP-LD 202

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
                 Q +  I+LDPH  Q  ++  ++ +  D + +     + +++ ++DPS+A+GF+C
Sbjct: 203 YLTASNQSDELIFLDPHTTQTFVDTEENGMVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|448114689|ref|XP_004202639.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
 gi|359383507|emb|CCE79423.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
          Length = 480

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 96/330 (29%), Positives = 151/330 (45%), Gaps = 60/330 (18%)

Query: 85  LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 116
           LG   G++   E  +D  SRI  +YR GF+PI                            
Sbjct: 69  LGRRYGSSSKEEMEKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128

Query: 117 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
                  +   T+DVGWGCM+R+SQML+A A     LGR +        ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAFQLLLLGRDF--AYVDGSEKKHSDIIDMF 186

Query: 171 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
            D   +PFS+HN ++A     L    G W GP A   S + L  C+    G    S  + 
Sbjct: 187 TDEPKTPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRL--CKSQFDGSVSPSFRVI 244

Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
           I   S D   ++ G  +  I+++                IL+L+P+ LGL KV+P Y  +
Sbjct: 245 I-SESCDIYDDKIGKLLQEIENSE-------------DAILILLPVRLGLNKVSPYYHDS 290

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           L   F   Q +GI GGKP +S Y  G      +YLDPH  Q +         +   T+H+
Sbjct: 291 LSSLFCSSQLVGIAGGKPSSSYYFFGSHNGHLLYLDPHYPQSM------KASSIYDTFHT 344

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           + ++ + ++ +DPS+ IG   + K+D++ F
Sbjct: 345 NKVQSLKIEDMDPSMLIGILIKSKEDYESF 374


>gi|119623101|gb|EAX02696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_g
           [Homo sapiens]
          Length = 340

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 33  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 82  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 184

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  + C V                                      P S    G +P  S
Sbjct: 185 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 207

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
                 Q +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 208 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 266

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 267 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 297


>gi|332226094|ref|XP_003262224.1| PREDICTED: cysteine protease ATG4A isoform 2 [Nomascus leucogenys]
          Length = 336

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  + C V                                      P S    G +P  S
Sbjct: 181 DIKKMCCV-------------------------------------LPLSADTAGDRPPDS 203

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
                 Q +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|336381646|gb|EGO22797.1| cysteine protease required for autophagy [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 992

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 98/284 (34%), Positives = 134/284 (47%), Gaps = 49/284 (17%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 121
           G+N    F  DF+SRI ++YR  F PI DS +                            
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350

Query: 122 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGD 172
                 TSD GWGCMLR+ Q L+A ALL   LGR WR+P       +Y   V+I+  F D
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPVHTTDYATYVQIITWFFD 410

Query: 173 --SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 230
             S  SPFS+H +  AGK  G   G W GP     + + L      E GLG       + 
Sbjct: 411 TPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGVI 469

Query: 231 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
             S     +   A    I    RH  V   G+A    +++L+ + LGL+ VNP Y  T++
Sbjct: 470 FQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTIK 520

Query: 291 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
             +TFPQS+GI GG+P +S Y +G Q ++  YLDPH  +P + +
Sbjct: 521 ALYTFPQSVGIAGGRPSSSYYFMGSQADNLFYLDPHHARPAVPL 564



 Score = 44.7 bits (104), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 41/84 (48%), Gaps = 6/84 (7%)

Query: 314 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
           G  E +   LDP     V     D L     T+H D +R + +  +DPS+ +GF C+D++
Sbjct: 681 GDSEGAGEALDPMAEHYVNAYSPDQLR----TFHCDRVRKMPMSGLDPSMLLGFLCKDEN 736

Query: 374 DFDDFCARASKLAEESNGAPLFTV 397
           D+ DF  R + L        +FTV
Sbjct: 737 DWFDFRRRVNDLMHRHKT--IFTV 758


>gi|30795248|ref|NP_840054.1| cysteine protease ATG4A isoform b [Homo sapiens]
 gi|426397038|ref|XP_004064735.1| PREDICTED: cysteine protease ATG4A isoform 2 [Gorilla gorilla
           gorilla]
 gi|15487242|emb|CAC69077.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
 gi|119623095|gb|EAX02690.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_b
           [Homo sapiens]
          Length = 336

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  + C V                                      P S    G +P  S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 203

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
                 Q +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|409050837|gb|EKM60313.1| hypothetical protein PHACADRAFT_179659 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 1009

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 96/288 (33%), Positives = 131/288 (45%), Gaps = 57/288 (19%)

Query: 97  FNQDFSSRILISYRKGFDPI-------------------------------GDSKITSDV 125
           F  DF+SRI ++YR  F PI                               GD   +SD 
Sbjct: 308 FYADFTSRIWLTYRSQFLPIRDMSLEELNAAPESAALSTGSQAKKWSWSLSGDKCWSSDA 367

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFSI 180
           GWGCMLR+ Q L+A AL+   LGR WRKP       +Y   ++I+  F D  +   PFS+
Sbjct: 368 GWGCMLRTGQSLLANALIHVHLGRDWRKPPHPVPTSDYATYIQIITWFFDDPSLLCPFSV 427

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMC------RSWEALARCQRAETGLGCQSLPMA---IYV 231
           H +   GK  G+  G W GP           +  ++   Q A   L   + P A   IYV
Sbjct: 428 HRMALVGKQLGVKVGQWFGPSTAAGAIKYVSAHSSMVPNQPARRTL-VHAFPEAGLGIYV 486

Query: 232 VSGD---EDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYI 286
            +      D E   A    I    RH          W   P+L+L+   LG++ VNP Y 
Sbjct: 487 AADGGTIYDSEVFAASHSGIGSPRRHTRRV------WGDRPVLILIGHRLGIDGVNPIYY 540

Query: 287 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
            TL+  +T+PQS+GI GG+P +S Y VG Q ++  YLDPH  +P I +
Sbjct: 541 DTLKTLYTWPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPTIPL 588



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 36/51 (70%), Gaps = 1/51 (1%)

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 395
           T+H D +R + L S+DPS+ IGF C+D+ ++ D  +R ++L+ +S  +P+F
Sbjct: 728 TFHCDRVRKMPLSSLDPSMLIGFLCKDESEWQDLKSRINELSRKSK-SPVF 777


>gi|397497902|ref|XP_003819742.1| PREDICTED: cysteine protease ATG4A isoform 2 [Pan paniscus]
          Length = 336

 Score =  134 bits (337), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  + C V                                      P S    G +P  S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 203

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
                 Q +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|348520913|ref|XP_003447971.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
          Length = 500

 Score =  134 bits (337), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 120/242 (49%), Gaps = 13/242 (5%)

Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
           +  ++  FGD   +PF +H L+  GK  G  AG W GP         +A   R       
Sbjct: 232 HSRLVTWFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGP-------SVVAHILRKAVDKTS 284

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
               +A+YV    +D       VV + D S + +       DW  +++LVP+ LG E +N
Sbjct: 285 VVTNLAVYVA---QDCTVYKEDVVRLCDRSLNQTSSDPSSQDWKSVIILVPVRLGGEALN 341

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
           P YI  ++        +GI+GGKP  S Y +G Q+E  +YLDPH  QPV+++ + +   +
Sbjct: 342 PSYIDCVKNFLKLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDVSQINFSLE 401

Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQTH 401
             ++H    + +  + +DPS  IGFY ++K DF+  C+  S+ L+      P+FT  + H
Sbjct: 402 --SFHCSSPKKMPFNRMDPSCTIGFYAKNKKDFESLCSAVSEALSSSKEKYPVFTFVEGH 459

Query: 402 KK 403
            +
Sbjct: 460 SQ 461



 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 30/61 (49%), Positives = 38/61 (62%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
           +  F   F SRI ++YR+ F  +  S  T+D GWGCMLRS QML+AQ LL H + R W  
Sbjct: 104 VERFRLAFVSRIWLTYRREFPQLEGSTWTTDCGWGCMLRSGQMLLAQGLLVHLMPRDWVW 163

Query: 154 P 154
           P
Sbjct: 164 P 164


>gi|260949671|ref|XP_002619132.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
 gi|238846704|gb|EEQ36168.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
          Length = 340

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 97/318 (30%), Positives = 147/318 (46%), Gaps = 63/318 (19%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIG------------------------------DSKITS 123
           L E     +SR+  +YR GF+PI                               +   ++
Sbjct: 52  LEEIYPVINSRLWFTYRAGFEPIQKAEDGPSPLAFLKSMIFNVRPSMALGGLFDNQNYST 111

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHN 182
           DVGWGCM+R+SQ L+A AL    LGR  + P       E VE I+ LFGD  T PFS+HN
Sbjct: 112 DVGWGCMIRTSQSLLANALQMLILGRDHQSPQAIQSAPEKVEKIIQLFGDDYTCPFSLHN 171

Query: 183 LLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
            ++   A  L    G W GP A   S + L  C + E+     ++ ++I       D E 
Sbjct: 172 FIKVASASPLKVKPGEWFGPSAASLSIKRL--CAKFESN-EIPNINVSICESCNLYDEEI 228

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
            G              +F + +   +P+L+L PL LG++K+N  Y P+L       QS+G
Sbjct: 229 RG--------------IFEESE---SPLLILFPLRLGIDKINSIYYPSLLQLLALKQSVG 271

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           I GGKP +S Y  G Q  + +YLDPH++Q           +D  TYH+   + + + ++D
Sbjct: 272 IAGGKPSSSYYFFGFQGSNLLYLDPHNLQAA--------SSDPGTYHTSKFQTLSISNLD 323

Query: 361 PSLAIGFYCRDKDDFDDF 378
           P  A   +  ++  +DD+
Sbjct: 324 PLNAC--WSVNQMTYDDY 339


>gi|403289553|ref|XP_003935916.1| PREDICTED: cysteine protease ATG4A isoform 2 [Saimiri boliviensis
           boliviensis]
          Length = 360

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 88/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 53  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  + C V                                      P S    G +P  S
Sbjct: 205 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 227

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
                 + +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 228 -LTASNESDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 286

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 287 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 317


>gi|170109871|ref|XP_001886142.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
 gi|164639072|gb|EDR03346.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
          Length = 1039

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 97/278 (34%), Positives = 138/278 (49%), Gaps = 51/278 (18%)

Query: 97  FNQDFSSRILISYRKGFD-PIGDSKI-------------------------------TSD 124
           F  DF+SRI ++YR  F  PI D+++                               +SD
Sbjct: 336 FYIDFTSRIWLTYRSHFPTPIKDTRLADLCGDAAPEIANSPTTVKTRPWNWGGEKTWSSD 395

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDREYVEILHLFGDSET--SPFS 179
            GWGCMLR+ Q L+A AL+   LGR WR+P   +Q      YV+I+  F D+    +PFS
Sbjct: 396 TGWGCMLRTGQSLLANALVHMHLGRDWRRPPYPVQTADYATYVQIVTWFLDTPAPEAPFS 455

Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
           +H +  AGK +G   G W GP     + + L      E+GLG          VS   DG 
Sbjct: 456 VHRMALAGKEFGTDVGQWFGPSVAAGAIKTLVNS-FPESGLG----------VSVATDGT 504

Query: 240 RGGAPVVCI---DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
              + V  +   + +SR             P+LLL+ + LG+E VNP Y  T++L +TFP
Sbjct: 505 LFQSDVFAVSHGEMSSRSPRRIKTTTWGHRPVLLLLGIRLGIEGVNPIYYETIKLLYTFP 564

Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
           QS+GI GG+P +S Y VG Q ++  YLDPH+ +P I +
Sbjct: 565 QSVGIAGGRPSSSYYFVGSQADNLFYLDPHNTRPAIPL 602



 Score = 43.1 bits (100), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 17/43 (39%), Positives = 29/43 (67%)

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 387
           T+H + +R + L  +DPS+ IGF CRD+ ++ DF  R ++L +
Sbjct: 739 TFHCERVRKMPLSGLDPSMLIGFLCRDEAEWWDFKKRVAELPK 781


>gi|296236154|ref|XP_002763201.1| PREDICTED: uncharacterized protein LOC100409486 [Callithrix
           jacchus]
          Length = 360

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 53  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  + C V                                      P S    G +P  S
Sbjct: 205 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 227

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
                  +E  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 228 LTASNRSDE-LIFLDPHTTQTFVDAEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 286

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 287 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 317


>gi|14041938|dbj|BAB55042.1| unnamed protein product [Homo sapiens]
          Length = 280

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 84/254 (33%), Positives = 123/254 (48%), Gaps = 25/254 (9%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 244
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 245 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
           V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 352
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEGLIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 353 HIHLDSIDPSLAIG 366
            + +  +DPS+A+G
Sbjct: 233 RMSIAELDPSIAVG 246


>gi|432845798|ref|XP_004065858.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
          Length = 497

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 78/233 (33%), Positives = 120/233 (51%), Gaps = 13/233 (5%)

Query: 165 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 224
           +++ LFGD   +PF +H L+  GK  G  AG W GP  +      + R   A+T +G QS
Sbjct: 231 KLVTLFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGPSVVAH----ILRKAVAKTSVG-QS 285

Query: 225 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 284
           L  A+YV    +D       V+ + D S    V       W  +++LVP+ LG E +NP 
Sbjct: 286 L--AVYVA---QDCTVYKEDVLQLCDPSLSQRVADPSSQAWKSVIILVPVRLGGEALNPS 340

Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
           YI  ++   +    +GI+GGKP  S Y +G Q+E  +YLDPH  QPV++  + +   +  
Sbjct: 341 YIECVKNILSLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDFTQANFSLE-- 398

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFT 396
           ++H    + +    +DPS  IGFY R K+DF+  C+     L+      P+FT
Sbjct: 399 SFHCSSPKKMPFSRMDPSCTIGFYARTKEDFESMCSVVGMVLSSSKEKYPIFT 451



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 11/73 (15%)

Query: 65  SSTSDIWLLGVCHKI-AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           + TS I++LG  + + ++DE          +  F  DF SRI ++YR+ F  +  S +T+
Sbjct: 87  NKTSPIFVLGHAYLLNSEDE----------VERFRLDFVSRIWLTYRREFPQLEGSTLTT 136

Query: 124 DVGWGCMLRSSQM 136
           D GWGCMLRS QM
Sbjct: 137 DCGWGCMLRSGQM 149


>gi|302498547|ref|XP_003011271.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
           benhamiae CBS 112371]
 gi|291174820|gb|EFE30631.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
           benhamiae CBS 112371]
          Length = 437

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/316 (30%), Positives = 140/316 (44%), Gaps = 85/316 (26%)

Query: 96  EFNQDFSSRILISYRKGFDPI--------GDSK-----------------ITSDVGWGCM 130
           +F  DF S++ I+YR  F PI        GDS                   TSD GWGCM
Sbjct: 145 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSISLGVRLRSQLIDTQGFTSDTGWGCM 204

Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KA 189
           +RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +  G  A
Sbjct: 205 IRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGATA 261

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCI 248
            G   G W GP A  +  +AL +    + GL        +Y+ S G +  E+    V C 
Sbjct: 262 CGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVACD 313

Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
           +                 P L+L+ + LG+++V P Y  +L+    FPQS+GI G +   
Sbjct: 314 ESG-----------GGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGPE--- 359

Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
                                            + STYH+  +R +H+  +DPS+ IGF 
Sbjct: 360 ---------------------------------ELSTYHTRRLRRLHVREMDPSMLIGFL 386

Query: 369 CRDKDDFDDFCARASK 384
            RD+DD++D   R  +
Sbjct: 387 VRDEDDWEDLKQRVRE 402


>gi|395545675|ref|XP_003774724.1| PREDICTED: cysteine protease ATG4A [Sarcophilus harrisii]
          Length = 431

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 74/260 (28%), Positives = 126/260 (48%), Gaps = 15/260 (5%)

Query: 151 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------Y 201
           W K  ++P   EY  IL  F D +   +SIH + Q G   G + G W GP          
Sbjct: 137 WEKHQEQP--EEYQRILKCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 194

Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 261
           A+   W +LA     +  +  + +    ++   D   +   +    +D  +  C   + G
Sbjct: 195 ALFDEWNSLAVYVSMDNTVVIEDIKKMCHMCPSDLTHDSSSSSYNGLD-WNTDCPGQTSG 253

Query: 262 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
              W P+LL+VPL LG+ ++NP Y    +  F  PQSLG +GGKP ++ Y +G   +  I
Sbjct: 254 ---WKPLLLIVPLRLGINQINPIYADAFKECFKMPQSLGALGGKPNSAYYFIGFLGDELI 310

Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
           YLDPH  Q  ++  ++    D S +       + + ++DPS+A+GF+C++++DFD++C  
Sbjct: 311 YLDPHTTQTFVDTEENGTVNDQSFHCQQSPPRMKILNLDPSVALGFFCKEEEDFDNWCGL 370

Query: 382 ASKLAEESNGAPLFTVTQTH 401
             K   +     +F + + H
Sbjct: 371 VQKEILKPQSLQMFELVEKH 390


>gi|388581514|gb|EIM21822.1| hypothetical protein WALSEDRAFT_68740 [Wallemia sebi CBS 633.66]
          Length = 603

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 96/310 (30%), Positives = 137/310 (44%), Gaps = 63/310 (20%)

Query: 85  LGDAAGNNGLAEFNQDFSSRILISYRKGF------DPIGDS------------------- 119
           LG+   NN  ++   DF SRI  +YR  F      DP+ D                    
Sbjct: 55  LGNLYDNN--SDLLDDFQSRIWCTYRSNFCQISLNDPMMDDLGLAKMQTLSSKPSHWLLR 112

Query: 120 --KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF--------DREYV---EI 166
                +D GWGCMLR+SQ L+A  L    LGR WR+    PF         +EYV   ++
Sbjct: 113 ERTFNTDQGWGCMLRTSQSLLANTLQIMLLGRQWRR---NPFVDLTDYAKRKEYVNLIKL 169

Query: 167 LHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 224
           L+LF D  S  SPFS+H +   GK+ G   G W GP     + + L   Q  +  L   S
Sbjct: 170 LNLFMDNPSTLSPFSVHRMAVVGKSLGKEVGEWFGPSTAALAIKHLVNNQ-TDINLSV-S 227

Query: 225 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVN 282
           +     +   D     GG                    ++W   P+L+LV + LGL+ ++
Sbjct: 228 VASDSVIYKSDVYQASGGTSTT--------------ADSEWGNKPVLILVGVRLGLDGIH 273

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
           PRY  TL+        +GI GG+P +S Y  G Q +S  Y+DPH ++P INI     E +
Sbjct: 274 PRYYETLKAFLRMQSCVGIAGGRPSSSYYFFGYQSDSLFYVDPHIMKPTINIKTPPTEGE 333

Query: 343 TSTYHSDVIR 352
             T   +++R
Sbjct: 334 LKTEIENLLR 343



 Score = 42.0 bits (97), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 37/62 (59%), Gaps = 5/62 (8%)

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
           A  STY  D  R +++  +DPS+ IGF  +D+++F +F  +  +L ++     +F+V  +
Sbjct: 470 ASISTYFCDKPRKMNISQMDPSMLIGFLVKDENEFFEFVNQIKELPQQ-----VFSVADS 524

Query: 401 HK 402
           H+
Sbjct: 525 HR 526


>gi|241729578|ref|XP_002404604.1| cysteine protease, putative [Ixodes scapularis]
 gi|215505492|gb|EEC14986.1| cysteine protease, putative [Ixodes scapularis]
          Length = 433

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 117/410 (28%), Positives = 170/410 (41%), Gaps = 94/410 (22%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVG 126
           IWLLGV +     +  G +A  +  A F+   +DFSSR+  +YR+ F  I  + I +D G
Sbjct: 36  IWLLGVIYHRKMTQFYGASAVVDDGASFDAFLEDFSSRLWFTYRREFPAIPGTDIRTDCG 95

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWR------------------KPLQKPF-----DREY 163
           WGCMLRSSQM++AQA + H LGR WR                   PL++ F     D   
Sbjct: 96  WGCMLRSSQMILAQAFVMHLLGRQWRWQQVHTEAGEVRLPRHALWPLREGFRCTGGDGTA 155

Query: 164 VEIL----------HLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EAL 210
           V +             FGD    ++PFS+HNL+Q G+  G  AG W GP ++     +AL
Sbjct: 156 VLVRCSPKPVNDPPRWFGDKADASTPFSLHNLVQRGRESGKKAGDWYGPSSVAYILKDAL 215

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 270
                 +  L      + IYV              + +DD +  CS  S           
Sbjct: 216 EDAAHRDQRLA----QLCIYVAQD---------CTIYMDDVTALCSAGSTEGV------- 255

Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQS--------------LGIVGGKPGASTYI-VGV 315
                    +  PR +   R  F+  Q+                +   K G S  + +  
Sbjct: 256 -------THRRLPRTVFARREMFSGGQTQRMCIHSSWLHLFVFFVCFLKYGISFLLQLSA 308

Query: 316 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
            EE  IYLDPH  Q ++++   D   D  ++H    R +    IDPS  IGFYC+ K D 
Sbjct: 309 AEEKVIYLDPHYCQEMVDVNSQDFPLD--SFHCSWPRKMSFSRIDPSCTIGFYCKTKHDL 366

Query: 376 DDFCARASKLA---EESNGAPLFTV--------TQTHKKPVNHSDVLGET 414
           +DF     +L    +  +  P+F +        T T K+P     VL + 
Sbjct: 367 EDFTKNIRELTVPKQMRHEYPVFLISEGSCSDHTDTEKRPEEIVHVLQDV 416


>gi|156042330|ref|XP_001587722.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980]
 gi|154695349|gb|EDN95087.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 414

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 93/289 (32%), Positives = 134/289 (46%), Gaps = 34/289 (11%)

Query: 97  FNQDFSSRILISYRKGFDPIG---DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
           F  DF ++I ++YR  F  I    D K  S +     LRS   LV Q       G  W  
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRS--QLVDQGGFTSDTG--WGC 158

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 212
                   E  +IL LF D   +P+SIH  ++ G  A G   G W GP        A AR
Sbjct: 159 SSSN----EERKILSLFADDPRAPYSIHKFVEHGASACGKHPGEWFGP-------SAAAR 207

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
           C +A T    +S  + +Y+     D           +D     S+       +TP L+LV
Sbjct: 208 CIQALTNSQVES-ELRVYITGDGSD---------VYEDT--FMSIAKPNSTKFTPTLILV 255

Query: 273 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
              LGL+K+ P Y   L+ +   PQS+GI GG+P +S Y +GVQE    YLDPH  +P +
Sbjct: 256 GTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYFIGVQESDFFYLDPHQTRPAL 315

Query: 333 NIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
                 +D    D  + H+  +R +H+  +DPS+ I F  RD++D+ D+
Sbjct: 316 PFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDPSMLIAFLIRDENDWKDW 364


>gi|37360148|dbj|BAC98052.1| mKIAA0943 protein [Mus musculus]
 gi|148707989|gb|EDL39936.1| autophagy-related 4B (yeast), isoform CRA_d [Mus musculus]
          Length = 266

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 60/169 (35%), Positives = 95/169 (56%), Gaps = 6/169 (3%)

Query: 250 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
           D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  F  PQSLG++G
Sbjct: 71  DSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 130

Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
           GKP ++ Y +G   E  IYLDPH  QP + +       D S +       + +  +DPS+
Sbjct: 131 GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPSRMGIGELDPSI 190

Query: 364 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           A+GF+C+ ++DF+D+C +  KL++     P+F + +     +   DVL 
Sbjct: 191 AVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 239


>gi|189515077|ref|XP_001333093.2| PREDICTED: cysteine protease ATG4D-like [Danio rerio]
          Length = 485

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 79/255 (30%), Positives = 123/255 (48%), Gaps = 27/255 (10%)

Query: 150 PWRKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
           P R P   P    D  + +++  FGD  ++PF +H L++ GK  G  AG W GP  +   
Sbjct: 210 PARCPSASPDPQVDALHRKVVSCFGDHPSAPFGVHQLVELGKESGKRAGDWYGPSVVAHM 269

Query: 207 W-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
             +A+AR    E         +A+YV              V  +D    C     G   W
Sbjct: 270 LRKAVARAAEFED--------LAVYVAQDC---------TVYKEDVMSLCESSGVG---W 309

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
             +++LVP+ LG E +NP YI  ++        +GI+GGKP  S + VG Q+E  +YLDP
Sbjct: 310 KSVVILVPVRLGGESLNPSYIECVKNILKLKCCIGIIGGKPKHSLFFVGFQDEQLLYLDP 369

Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK- 384
           H  QPV+++ + +   +  ++H +  R ++   +DPS  IG Y R K DF+  C   S+ 
Sbjct: 370 HYCQPVVDVTQANFSLE--SFHCNSPRKMNFSRMDPSCTIGLYARSKTDFESLCTAVSEA 427

Query: 385 LAEESNGAPLFTVTQ 399
           L+      P+FT  +
Sbjct: 428 LSSSKEKYPIFTFVE 442



 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 30/63 (47%), Positives = 40/63 (63%), Gaps = 1/63 (1%)

Query: 91  NNGLAE-FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
           N G  E F Q F S + ++YR+ F  +  S +T+D GWGCMLRS QM++AQ LL H +  
Sbjct: 92  NEGEVERFRQTFVSCVWLTYRREFPQLDGSSLTTDCGWGCMLRSGQMMLAQGLLLHLMPT 151

Query: 150 PWR 152
            WR
Sbjct: 152 DWR 154


>gi|322707969|gb|EFY99546.1| ATG4 protein [Metarhizium anisopliae ARSEF 23]
          Length = 430

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 105/355 (29%), Positives = 153/355 (43%), Gaps = 73/355 (20%)

Query: 95  AEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVGWGCML 131
           A F  DF+SR  ++YR  F       DP                +  S  TSD GWGCM+
Sbjct: 121 AAFLDDFASRFWMTYRSNFEIIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSGWGCMI 180

Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 190
           RS Q L+A A+    LGR WR+ +    DRE   +L LF D   +P+SIHN ++ G+ Y 
Sbjct: 181 RSGQSLLANAMAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSIHNFVRHGEKYC 237

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
               G W GP A  R  + L   ++ E         + IY          G  P +  D+
Sbjct: 238 SKYPGEWFGPSATARCIQDLVNSRKQE---------LRIYST--------GDGPDIYEDN 280

Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
             +   +       + P L+LV   LG++K+ P Y   L  +    QS+GI GG+P +S 
Sbjct: 281 FMK---IAKPDGEVFHPTLVLVGTRLGIDKITPVYWEALIASVQMSQSVGIAGGRPSSSH 337

Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEA---DTSTYHSDVIRHIHLDSIDPSLAIGF 367
           Y VG Q     YLDPH  +  +    D       D  + H+  +R IH+  +DP+     
Sbjct: 338 YFVGSQGHFLFYLDPHHTRKALPYYSDVARYTIDDMDSCHTSRLRRIHVREMDPN----- 392

Query: 368 YCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDS 422
                      C  A+++ + +  + +  V          SD  GE GG+P D S
Sbjct: 393 -----------CHPANEIRDATGRSVIDEVELL-------SDEDGEDGGIPHDKS 429


>gi|441628985|ref|XP_004093160.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Nomascus
           leucogenys]
          Length = 441

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/328 (26%), Positives = 150/328 (45%), Gaps = 36/328 (10%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW--GCMLRSSQML-VAQALLFHR 146
           G      F  DF SR+ ++YR     +    I  D  W  G  L   ++   A    +H 
Sbjct: 98  GEGEHTAFPADFVSRLWLTYRXXXHCLTMCSIPPDWTWAEGTGLGPPELSGSASPSRYHG 157

Query: 147 LGRPWRKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
             R W  P        L++  +R + +I+  F D   +PF +H L++ G++ G  AG W 
Sbjct: 158 PAR-WMPPRWAQGAPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWY 214

Query: 199 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 258
           GP         +A   R       +   + +YV       +   A +V   D +      
Sbjct: 215 GP-------SLVAHILRKAVESCSEVTRLVVYVSQTCSMYKADVARLVARPDPT------ 261

Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
               A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++
Sbjct: 262 ----AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCQLCLGIMGGKPRHSLYFIGYQDD 317

Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
             +YLDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  
Sbjct: 318 FLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 375

Query: 379 CARASKLAEESNGA---PLFTVTQTHKK 403
           C+  +++   S+     P+FT+ + H +
Sbjct: 376 CSELTRVLSSSSAMERYPMFTLAEGHAQ 403


>gi|256071263|ref|XP_002571960.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
 gi|353229491|emb|CCD75662.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
          Length = 302

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 97/309 (31%), Positives = 148/309 (47%), Gaps = 42/309 (13%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAG 187
           M R  QML+AQAL+ H LGR WR    +      ++I+  F DS +  SP S+H L+Q  
Sbjct: 1   MFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLSLHRLVQMS 60

Query: 188 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGDE--DGER 240
                  G W GP ++C    A+ R     + L  +   + +Y     V+  +E  D  R
Sbjct: 61  DR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYREEIIDLAR 114

Query: 241 G------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLGL-EKVNPR 284
           G        P +   D   H +++ + Q+D          T ILLL+PL+ G   ++NPR
Sbjct: 115 GLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFGKGNRINPR 170

Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
           YI  +   F+ P  +G++GG+   S+Y VG Q  S IYLDPH  QP  N+       D  
Sbjct: 171 YIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNSPKFSVD-- 228

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
           ++H  + + +   +++PS A+GFYCR + +  D   R   L   S+         T  +P
Sbjct: 229 SWHCPIPKTMSAANLNPSCAVGFYCRTRGELSDLIDRLPILMSVSDNLQ----ASTRSRP 284

Query: 405 VNHS-DVLG 412
           V  + +VLG
Sbjct: 285 VAFTVEVLG 293


>gi|328722655|ref|XP_003247627.1| PREDICTED: cysteine protease ATG4B-like [Acyrthosiphon pisum]
          Length = 252

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 72/233 (30%), Positives = 118/233 (50%), Gaps = 32/233 (13%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I  +   +W+LG  +    D           L +   D  SR+  +YRKGF  IG++  T
Sbjct: 40  IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           SD GWGCMLR  QM++ QAL+F  LGR WR    K  D +Y++IL +F D  ++P+SIH 
Sbjct: 89  SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147

Query: 183 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 242
           +   G ++G   G W GP  + +  + LA             L   ++ V+ D       
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLA---------TMDELSSLVFHVALDN------ 192

Query: 243 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLT 292
              + I++  + C+V  +  +    W P++L++PL LG+  +NP Y+  ++++
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKVS 243


>gi|16551551|dbj|BAB71121.1| unnamed protein product [Homo sapiens]
          Length = 330

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 138/313 (44%), Gaps = 60/313 (19%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWR------------------------------------K 153
           MLRS QM++AQ LL H L R W                                      
Sbjct: 1   MLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
            L++  +R + +I+  F D   +PF +H L++ G++ G  AG W GP         +A  
Sbjct: 61  ELER--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHI 111

Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
            R           + +YV       +   A +V   D +          A+W  +++LVP
Sbjct: 112 LRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVP 161

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           + LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDPH  QP ++
Sbjct: 162 VRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVD 221

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA- 392
           + + D   +  ++H    R +     DPS  +GFY  D+ +F   C+  +++   S+   
Sbjct: 222 VSQADFPLE--SFHCTSPRKMAFAKTDPSCTVGFYAGDRKEFGTLCSELTRVLSSSSATE 279

Query: 393 --PLFTVTQTHKK 403
             P+FT+ + H +
Sbjct: 280 RYPMFTLAEGHAQ 292


>gi|392586633|gb|EIW75969.1| hypothetical protein CONPUDRAFT_111807 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 1038

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 139/318 (43%), Gaps = 74/318 (23%)

Query: 80  AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------------------ 121
           +Q  A     G +   EF  DF+SRI ++YR  F PI DS +                  
Sbjct: 271 SQSPASEKHPGQDWAPEFYADFTSRIWLTYRNQFAPIRDSTLSTLESDQTREPCTEMSSP 330

Query: 122 --------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YV 164
                         T+D GWGCMLR+ Q L+A ALL   LGR WR+P    +  +   YV
Sbjct: 331 SPKSRRWFGGEKGWTTDTGWGCMLRTGQTLLANALLHLHLGRDWRRPPYPLYTEDYATYV 390

Query: 165 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
           +I+  F DS    +PFS+H +  AGK  G   G W GP     + + L +    + GLG 
Sbjct: 391 QIITWFLDSPLPQAPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKRLVQA-FPDAGLGV 449

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------W--TPIL 269
                                  V  D A     V+S    D           W    +L
Sbjct: 450 ----------------------AVASDGALYQTDVYSASYVDVGSPRNVRKLRWGGRAVL 487

Query: 270 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
           +L  + LG+  VNP Y  T++  F  PQS+GI GG+P +S Y +GVQ ++ IYLDPH  +
Sbjct: 488 VLFGIRLGINGVNPIYYDTIKGLFEIPQSVGIAGGRPSSSYYFMGVQGDNLIYLDPHHAR 547

Query: 330 PVINIGKDDLEADTSTYH 347
           P I + +   EAD    H
Sbjct: 548 PAIPL-RPLPEADEGNQH 564



 Score = 45.1 bits (105), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 38/69 (55%), Gaps = 5/69 (7%)

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
           A+  T+H D +R + L  +DPS+ +GF C+D++D+ DF  R + L   +      T+   
Sbjct: 716 AELKTFHCDRVRKMPLSGLDPSMLLGFLCQDEEDWIDFRHRITDLMHRNK-----TIFAI 770

Query: 401 HKKPVNHSD 409
             +P N S+
Sbjct: 771 QDEPPNWSE 779


>gi|90080692|dbj|BAE89827.1| unnamed protein product [Macaca fascicularis]
          Length = 263

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 62/169 (36%), Positives = 93/169 (55%), Gaps = 6/169 (3%)

Query: 250 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
           D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  F  PQSLG++G
Sbjct: 68  DSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 127

Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
           GKP ++ Y VG   E  IYLDPH  QP +         D S +       + +  +DPS+
Sbjct: 128 GKPNSAHYFVGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFHCQHPPCRMSIAELDPSI 187

Query: 364 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 188 AVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 236


>gi|354544955|emb|CCE41680.1| hypothetical protein CPAR2_802300 [Candida parapsilosis]
          Length = 423

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 90/274 (32%), Positives = 136/274 (49%), Gaps = 44/274 (16%)

Query: 118 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 177
           +   TSD GWGCM+R+SQ L+A ALL  +L     +  Q       ++IL LF D  TSP
Sbjct: 138 NDNFTSDAGWGCMIRTSQNLLAIALL--KLSEEHNESAQ-------LDILKLFQDDPTSP 188

Query: 178 FSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSG 234
           FS+HN ++   +  L    G W GP A   S + L    ++ ET       P  I  V  
Sbjct: 189 FSLHNFIRVASSSPLLVKPGQWFGPNAASLSIKKLTIEAKKLET-------PGEIPYVYI 241

Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
            E+ +         DD      +F++ Q    P+LLL P+ LG+++VN  Y  ++    +
Sbjct: 242 SENAD-------LFDDEIE--DLFNEEQK---PLLLLFPVRLGIDQVNKYYYKSILQLLS 289

Query: 295 FPQSLGIVGGKPGASTYIVGVQEES-AIYLDPHDVQPV---INIGKDDLEADTSTYHSDV 350
            P S+GI GGKP +S Y +G + E+  +Y DPH  Q V   INI         +TYH+  
Sbjct: 290 LPYSVGIAGGKPSSSFYFIGYENENHLLYFDPHLPQVVEAPINI---------TTYHTAN 340

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
              + ++ +DPS+ IG   +  D++ +F    S+
Sbjct: 341 YNKLDIEMVDPSMMIGVLLKSMDEYKEFKQDCSE 374


>gi|406606786|emb|CCH41822.1| putative cysteine protease atg4 [Wickerhamomyces ciferrii]
          Length = 592

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 107/347 (30%), Positives = 155/347 (44%), Gaps = 62/347 (17%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG------- 117
           S   DIW     H  A+D    D   N    EF  D  +RI ++YR  F PI        
Sbjct: 75  SGLKDIWQTLRFH-TAEDNEKDDL--NKWPQEFIDDVYTRIWLTYRTKFSPIDRDPEGPS 131

Query: 118 ----------------DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
                           +   T+D GWGCM+R+SQ L+A ALL   +GR WR       + 
Sbjct: 132 PLSLNFFLRGQNYDLDNEHFTTDCGWGCMIRTSQSLLANALLNLHIGRDWR--YTGELNE 189

Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGL 220
            + EI+  F D  + PFSIH ++  GK       G W GP A  RS ++L          
Sbjct: 190 MHNEIVSWFIDCPSHPFSIHKIVDKGKLLSNKKPGEWFGPSAAARSIQSL---------- 239

Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
            C      + V  G + G+     V  +  A     VF        PIL+L+ L LG++ 
Sbjct: 240 -CNEFDSGVKVYIGSDSGDIYENDVFKV--AKDENGVFK-------PILILLGLRLGIDN 289

Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
           +NP Y  +L+      +S+GI GG+P  S Y  G Q +   YLDPH  QP + +  D L+
Sbjct: 290 INPVYWDSLKAILNSKESIGIAGGRPSTSHYFFGFQGDHLFYLDPHLPQPAL-LHDDQLD 348

Query: 341 A------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
                        D ++ H+  +R IHL  +DPS+ +GF  +D++++
Sbjct: 349 TSVSESTEIVSSLDVNSVHTKKLRKIHLSEVDPSMLLGFLIKDENEW 395


>gi|67967551|dbj|BAE00258.1| unnamed protein product [Macaca fascicularis]
          Length = 330

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/311 (26%), Positives = 139/311 (44%), Gaps = 56/311 (18%)

Query: 130 MLRSSQMLVAQALLFHRLGRPW----------------------------------RKPL 155
           MLRS QM++AQ LL H L R W                                  +   
Sbjct: 1   MLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60

Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 215
           +   +R + +I+  F D   +PF +H L++ G++ G  AG W GP         +A   R
Sbjct: 61  ELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILR 113

Query: 216 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 275
                  +   + +YV       +   A +V   D +          A+W  +++LVP+ 
Sbjct: 114 KAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVPVR 163

Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
           LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDPH  QP +++ 
Sbjct: 164 LGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVS 223

Query: 336 KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--- 392
           + D   +  ++H    R +    +DPS  +G Y  D+ +F+  C+  +++   S+     
Sbjct: 224 QADFPLE--SFHCTSPRKMAFAKMDPSCTVGSYAGDRKEFETLCSELTRVLGSSSATERY 281

Query: 393 PLFTVTQTHKK 403
           P+FT+ + H +
Sbjct: 282 PMFTLAEGHAQ 292


>gi|395323681|gb|EJF56143.1| hypothetical protein DICSQDRAFT_113447 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 999

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 91/278 (32%), Positives = 133/278 (47%), Gaps = 50/278 (17%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 123
           F  DF+SRI ++YR  F PI D+ +                                 TS
Sbjct: 303 FYADFTSRIWLTYRSQFFPIRDTTLAALEQEVHDSPTGLPSSPPSKRWNWPIGGEKGWTS 362

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 178
           D GWGCMLR+ Q L+A ALL   LGR WR+P    +  +Y   V+I+  F D+ +   PF
Sbjct: 363 DAGWGCMLRTGQSLLANALLHLHLGRDWRRPPHPVYTADYAMYVQIVTWFLDTPSPLCPF 422

Query: 179 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
           S+H +   GK  G   G W GP     + + L      + GLG     +A+   S   + 
Sbjct: 423 SVHRMALVGKDLGKEVGQWFGPSTAAGAIKTLVHS-FPDAGLG-----VAVASDSTLYES 476

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
           +   A    +    RH       + +W    +L+L+ + LG+E VNP Y  T++  +TFP
Sbjct: 477 DVYAASRSSVYSTRRH----GHPRMEWGDRAVLILIGIRLGIEGVNPLYYNTIKTLYTFP 532

Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
           Q++GI GG+P +S Y VG Q ++  YLDPH  +P I +
Sbjct: 533 QTVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAIPL 570



 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 15/46 (32%), Positives = 29/46 (63%)

Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           + +  T+H D +R + L  +DPS+ +GF C+D+ ++ D   R ++L
Sbjct: 699 QTELKTFHCDRVRKMPLSGLDPSMLLGFLCKDEAEWLDLKERITEL 744


>gi|216963264|gb|ACJ73916.1| autophagy-related 4b variant 4 [Zea mays]
          Length = 208

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 66/123 (53%), Positives = 87/123 (70%), Gaps = 8/123 (6%)

Query: 36  SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
           S  ++R V +GSM R+    LG +R   S    D+W LG C++++ ++E  G +  ++G 
Sbjct: 90  SRILRRFVGSGSMWRL----LGCARVLTSG---DVWFLGKCYRVSPEEEESGGSDSDSGH 142

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
           A F +DFSSRI I+YRKGFD I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202

Query: 155 LQK 157
            +K
Sbjct: 203 SEK 205


>gi|444321667|ref|XP_004181489.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
 gi|387514534|emb|CCH61970.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
          Length = 577

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 145/320 (45%), Gaps = 64/320 (20%)

Query: 95  AEFNQDFSSRILISYRKGFDPI-----GDSKI------------------------TSDV 125
            EF +D  SR++ +YR  F PI     G S I                        T+D+
Sbjct: 127 VEFLEDCKSRLIFTYRTNFSPIERAPDGPSPINVSVLFRDTLFNTVNHVLNNPNSFTTDI 186

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 179
           GWGCM+R+ Q L+  AL    LGR +R       P  K    E  +I+  F D+   PFS
Sbjct: 187 GWGCMIRTGQSLLGNALQIINLGRNFRINNQSNNPNTKNIKEE--DIIEWFYDNPNKPFS 244

Query: 180 IHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 238
           IH  +  G +      G W GP   C + ++L   +  E G+        + V SGD   
Sbjct: 245 IHKFVDKGMRISDKKPGEWFGPSTTCTAIQSLIY-EFPECGID----ECILSVSSGD--- 296

Query: 239 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
                  +  D+ + H   F K +   T IL+L+ + LG++K+N  Y   ++       S
Sbjct: 297 -------IYEDEINEH---FQKNEN--TIILILLGVKLGIDKINQCYFNDIKDILNSRYS 344

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
            GI GG+P +S Y  G   E   Y DPH  +P + + +D   +  ST +S ++    +  
Sbjct: 345 CGISGGRPSSSLYFFGHMNEYLYYFDPH--KPQLQLNEDFKNSCHSTDYSKIL----ISE 398

Query: 359 IDPSLAIGFYCRDKDDFDDF 378
           IDPS+ IGFY + K D+D+F
Sbjct: 399 IDPSMLIGFYLKGKKDWDNF 418


>gi|216963270|gb|ACJ73917.1| autophagy-related 4b variant 5 [Zea mays]
          Length = 292

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 65/123 (52%), Positives = 88/123 (71%), Gaps = 8/123 (6%)

Query: 36  SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
           S  ++R V +GSM R+    LG +R     ++ D+W LG C++++ ++E  G +  ++G 
Sbjct: 90  SRILRRFVGSGSMWRL----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGH 142

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
           A F +DFSSRI I+YRKGFD I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202

Query: 155 LQK 157
            +K
Sbjct: 203 SEK 205


>gi|216963276|gb|ACJ73918.1| autophagy-related 4b variant 6 [Zea mays]
          Length = 271

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 65/123 (52%), Positives = 88/123 (71%), Gaps = 8/123 (6%)

Query: 36  SETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGL 94
           S  ++R V +GSM R+    LG +R     ++ D+W LG C++++ ++E  G +  ++G 
Sbjct: 90  SRILRRFVGSGSMWRL----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGH 142

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
           A F +DFSSRI I+YRKGFD I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP
Sbjct: 143 AAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKP 202

Query: 155 LQK 157
            +K
Sbjct: 203 SEK 205


>gi|50307871|ref|XP_453929.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|62899744|sp|Q6CQ60.1|ATG4_KLULA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|49643063|emb|CAH01025.1| KLLA0D19536p [Kluyveromyces lactis]
          Length = 450

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 110/383 (28%), Positives = 164/383 (42%), Gaps = 67/383 (17%)

Query: 26  LASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEAL 85
           L+ +   LG  E V R  T   + + +  +   SRT + +  S           A +  +
Sbjct: 4   LSRISQHLGIVEDVDRDGTVFILGKEYAPLNNKSRTDVETDDS-----------ALESLI 52

Query: 86  GDAAGNNGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT------------------ 122
              + N GL     D  SR+  +YR  F PI     G S I                   
Sbjct: 53  NIVSLNPGLL---SDVHSRVFFTYRTQFTPIRRNENGPSPINFTLFFRDNPINTLENALT 109

Query: 123 ------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
                 SD+GWGCM+R+ Q L+A A+   +L R +R    +  D E + ++  F D    
Sbjct: 110 DPDSFYSDIGWGCMIRTGQALLANAIQRVKLAREFRINASRIDDNE-LNLIRWFQDDVKY 168

Query: 177 PFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 235
           P S+HN ++A  K  G+  G W GP A  RS + L      E    C      I   S D
Sbjct: 169 PLSLHNFVKAEEKISGMKPGQWFGPSATARSIKTLI-----EGFPLCGIKNCIISTQSAD 223

Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
                     +  D+ +R   +F K +     +LLL  + LG++K+N  Y   +    + 
Sbjct: 224 ----------IYEDEVTR---IFHKDRD--ANLLLLFAVRLGVDKINSLYWKDIFKILSS 268

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
           P S+GI GGKP +S Y  G Q E+  YLDPH+ Q   ++  DDLE   S  H      +H
Sbjct: 269 PYSVGIAGGKPSSSLYFFGYQNENLFYLDPHNTQQS-SLMMDDLEFYRSC-HGHKFNKLH 326

Query: 356 LDSIDPSLAIGFYCRDKDDFDDF 378
           +   DPS+ +G     K+++D F
Sbjct: 327 ISETDPSMLLGMLISGKNEWDQF 349


>gi|71043632|ref|NP_001020882.1| cysteine protease ATG4B [Rattus norvegicus]
 gi|68533688|gb|AAH98833.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Rattus
           norvegicus]
          Length = 224

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 58/169 (34%), Positives = 93/169 (55%), Gaps = 6/169 (3%)

Query: 250 DASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
           ++ RHC+    G         W P++LL+PL LGL  +N  Y+ TL+  F  PQSLG++G
Sbjct: 29  ESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 88

Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
           GKP ++ Y +G   E  IYLDPH  QP + +       D S +       + +  +DPS+
Sbjct: 89  GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPCRMGIGELDPSI 148

Query: 364 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           A+GF+C+ ++DF+D+C +  KL++     P+F + +     +   DVL 
Sbjct: 149 AVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 197


>gi|440789707|gb|ELR11008.1| cysteine protease atg4a, putative, partial [Acanthamoeba
           castellanii str. Neff]
          Length = 180

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 55/118 (46%), Positives = 80/118 (67%), Gaps = 1/118 (0%)

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W P+++LVP+ LG++ +NP YIPTL+  F+FPQ LG++GGKP +S Y VG Q+   +Y+D
Sbjct: 11  WHPVIILVPVRLGIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMD 70

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
           PH VQP + +  D L     +Y  ++ + +  D IDPSLA+GF C  + +FDDFC  A
Sbjct: 71  PHFVQPTVKMDDDPLFP-IESYRMEIPQAMSFDDIDPSLALGFLCSSQAEFDDFCLNA 127


>gi|344304092|gb|EGW34341.1| hypothetical protein SPAPADRAFT_59751, partial [Spathaspora
           passalidarum NRRL Y-27907]
          Length = 363

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 84/280 (30%), Positives = 133/280 (47%), Gaps = 43/280 (15%)

Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
           F+ R+  + R  FD        SDVGWGCM+R+SQ L+A AL+           LQ   +
Sbjct: 104 FNKRLFTTVRSLFD---SENFNSDVGWGCMIRTSQSLLANALM----------KLQPSAE 150

Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 218
            E   +++LF D+  S FS+HN ++      L    G W GP A   S + L    + +T
Sbjct: 151 HE---VINLFQDNIASAFSLHNFIRVASESPLEVKPGQWFGPNAASLSTKKLLDGMKGKT 207

Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
             G +   + I   S   D E        I++     SV           L+L P+ LG+
Sbjct: 208 IQGVKYPHVFISENSDLYDEE--------IEELLVESSV-----------LILFPVRLGI 248

Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
           + VN  Y  ++      P ++GI GGKP +S Y +G Q++  +Y DPH  Q   N     
Sbjct: 249 DNVNSYYYDSIFQLLACPFTVGISGGKPSSSFYFLGYQDQDLLYFDPHSPQLYEN----- 303

Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
              + +TYH++  + +H+  +DPS+ +G   +DK ++ +F
Sbjct: 304 -PINYTTYHTNNYQRLHIHMLDPSMMVGILVKDKSEYKEF 342


>gi|299738612|ref|XP_001834660.2| cysteine protease [Coprinopsis cinerea okayama7#130]
 gi|298403389|gb|EAU87108.2| cysteine protease [Coprinopsis cinerea okayama7#130]
          Length = 1034

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/272 (34%), Positives = 129/272 (47%), Gaps = 50/272 (18%)

Query: 97  FNQDFSSRILISYRKGF-DPIGDSKI-------------------------------TSD 124
           F  DF+SRI ++YR  F  PI D ++                               +SD
Sbjct: 302 FYIDFTSRIWLTYRSHFPQPIKDGRLADLCGGPQPEPVASPVTKKSPWHWVGGEKSWSSD 361

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFS 179
            GWGCMLR+ Q L+A AL+   LGR WRKP       +Y   V IL  F D+    +PFS
Sbjct: 362 SGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVMTADYATYVHILTWFLDTPAPEAPFS 421

Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
           +H +  AGK  G   G W GP     + +AL      E G+G     +A+ V     DG 
Sbjct: 422 VHRMALAGKELGTDVGQWFGPSVAAGAIKALVNS-FPEAGIG-----VAVAV-----DGV 470

Query: 240 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
                V              + +  W   P+LLL+ + LG+E VNP Y  T+++ +TFPQ
Sbjct: 471 LYQTDVHAASHGDHFGRTPRRHKRSWGDRPVLLLLGIRLGIEGVNPIYYDTIKMLYTFPQ 530

Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 329
           S+GI GG+P +S Y VG Q ++  YLDPH  +
Sbjct: 531 SVGIAGGRPSSSYYFVGSQADNLFYLDPHHAR 562



 Score = 39.7 bits (91), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 16/45 (35%), Positives = 28/45 (62%)

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           A+  T+H + +R + L  +DPS+ +GF CRD+ ++ D   R + L
Sbjct: 711 AELKTFHCERVRKMPLSGLDPSMLLGFLCRDEAEWVDLRKRVAGL 755


>gi|149020505|gb|EDL78310.1| rCG31864, isoform CRA_c [Rattus norvegicus]
          Length = 337

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 121/247 (48%), Gaps = 22/247 (8%)

Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
           DR +  I+  F D   +PF +H L++ G++ G  AG W GP         +A   R    
Sbjct: 72  DRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-------SVVAHILRKAVE 124

Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
             C  +   +  VS D    +         D +R  S +    A+W  +++LVP+ LG E
Sbjct: 125 -SCSEVTRLVVYVSQDCTVYKA--------DVARLVS-WPDPTAEWKSVVILVPVRLGGE 174

Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
            +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDPH  QP +++ + + 
Sbjct: 175 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVNQANF 234

Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFT 396
             +  ++H    R +    +DPS  +GFY  ++ +F+  C+   ++   S+     P+FT
Sbjct: 235 PLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMRILSSSSVTERYPMFT 292

Query: 397 VTQTHKK 403
           V + H +
Sbjct: 293 VAEGHAQ 299


>gi|149022064|gb|EDL78958.1| rCG26842 [Rattus norvegicus]
          Length = 246

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 76/246 (30%), Positives = 112/246 (45%), Gaps = 53/246 (21%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 250 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 284
           D  + C V   G AD                         W P+LL+VPL LG+ ++NP 
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240

Query: 285 YIPTLR 290
           YI   +
Sbjct: 241 YIEAFK 246


>gi|403413274|emb|CCL99974.1| predicted protein [Fibroporia radiculosa]
          Length = 994

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/277 (32%), Positives = 129/277 (46%), Gaps = 49/277 (17%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 124
           F  DF+SRI ++YR  F+PI D+ +                                TSD
Sbjct: 309 FYSDFTSRIWLTYRSQFEPIRDTSLSALNYDMDERAAPTSSPQPKRWNWGLGGEKGWTSD 368

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGD--SETSPFS 179
            GWGCMLR+ Q L+A ALL   LGR WR+P    +  +   YV+I+  F D  S   PFS
Sbjct: 369 SGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPIYTADFATYVQIISWFLDDPSPLCPFS 428

Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
           +H +   GK  G   G W GP     + + L      E GLG     +A+  V    D  
Sbjct: 429 VHRMALVGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVS---VAVDGVIYQSDVY 484

Query: 240 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
                 + +    +H      G+  W    +L+L+ + LG++ VNP Y   ++  +T PQ
Sbjct: 485 AVSRSTMGLGSPRKH------GRPSWGDRAVLVLIGIRLGIDGVNPIYYDLIKALYTLPQ 538

Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
           +LGI GG+P +S Y VG Q  +  YLDPH  +P I +
Sbjct: 539 TLGIAGGRPSSSYYFVGSQANNLFYLDPHHARPTIPL 575



 Score = 43.9 bits (102), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 52/100 (52%), Gaps = 16/100 (16%)

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
           T+H D +R + L  +DPS+ IGF C+D++D+ D   R ++L              THK+ 
Sbjct: 711 TFHCDRVRKMPLSGLDPSMLIGFLCKDENDWIDLRRRLTELF------------NTHKRH 758

Query: 405 VNHSDVLGETGGVPED--DSLGVMSMNDAVGNAHEDDWQL 442
           +    +  E    P D  D++G+ S+++   +  E+D +L
Sbjct: 759 I--FSIQDEPPNWPSDSEDNIGLESISEPDIDLPEEDDEL 796


>gi|150864470|ref|XP_001383296.2| hypothetical protein PICST_30446 [Scheffersomyces stipitis CBS
           6054]
 gi|166990661|sp|A3LQU0.2|ATG4_PICST RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|149385726|gb|ABN65267.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 514

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 95/323 (29%), Positives = 153/323 (47%), Gaps = 43/323 (13%)

Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
           FS  +L + +   + I     T+DVGWGCM+R+SQ L+A    F RL       L K  D
Sbjct: 138 FSKSLLYNLQNFNNFIEKENFTTDVGWGCMIRTSQSLLANT--FVRL-------LDKQSD 188

Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 218
                I+ LF D+  +PFS+HN ++   +  L    G W GP A   S + L  C     
Sbjct: 189 -----IIALFNDTYLAPFSLHNFIRVASSSPLKVKPGEWFGPNAASLSIKRL--CDGYYD 241

Query: 219 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
               +++   I V+  +            ++ ++      +KG      +L+L+P+ LG+
Sbjct: 242 NSTSETILPRINVLISESTDLYDSQIAQLLEPSTE-----TKG------LLVLLPVRLGI 290

Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
           + +N  Y  +L    +  QS+GI GGKP +S Y  G Q+ S IY+DPH  Q    I   D
Sbjct: 291 DSINSYYFSSLLHLLSLEQSVGIAGGKPSSSFYFFGYQDNSLIYMDPHSAQ----IFSSD 346

Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF---CARASKLAEESNGAPLF 395
           +  D STY++   + + +  +DPS+ IG + RD   +++F   C  A+      +     
Sbjct: 347 I--DMSTYYATRYQRVDIGKLDPSMLIGVFIRDLTSYENFKKSCLDAANKIVHFHATERS 404

Query: 396 TVTQTHKK-----PVNHSDVLGE 413
           TV ++ +K      +N SD+  E
Sbjct: 405 TVPESRRKNSEFVNINRSDLKDE 427


>gi|219129924|ref|XP_002185127.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403306|gb|EEC43259.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 557

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/346 (28%), Positives = 141/346 (40%), Gaps = 48/346 (13%)

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL---- 155
           D  S    +YR  F  I    ITSD GWGCMLRS+QM++ QAL  H   R WR P     
Sbjct: 171 DERSLFWFTYRCDFPEIAPYNITSDAGWGCMLRSAQMMLGQALRLHFKSRDWRPPQLLAR 230

Query: 156 --QKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 212
             Q  F R  +     +  S  S +S+HN++ AG   Y    G W GP   C     L  
Sbjct: 231 RRQDSFIRSVLTWFADYPSSSESVYSLHNMVAAGLSKYDKLPGEWYGPGTACYVMRDLVH 290

Query: 213 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 272
               +  LG   L   I+ V     G      +           +  K +          
Sbjct: 291 IHEKQQALGKTRLDRRIFRVYVAPQGTVYRDTIHAFMTTEARVRIEEKKKVKEQTQPQAH 350

Query: 273 PLVLGLEK---------------------------VNPRYIPTLRLTFTFPQSLGIVGGK 305
           PL L  E+                           +N  Y+ +L  TF+ PQS+G++GG+
Sbjct: 351 PLDLEWEEELMESANTVEWDTALLLLVPLRLGLTSLNEEYVQSLAHTFSLPQSVGVLGGR 410

Query: 306 PGASTYIVGVQEE-SAIY-LDPHDVQ--PVINIGKDDLEADTSTYHS-DVIRHIHLD--- 357
           P  + +  G Q++ S I+ LDPH VQ  P     + + +A +    S D +R  H     
Sbjct: 411 PRGARWFYGAQKDGSKIFGLDPHTVQTAPGRQTARVNGQASSVVELSDDYLRSCHTTCPE 470

Query: 358 -----SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP-LFTV 397
                 +DPS+A+GFYCR + D +          +E +  P LF+V
Sbjct: 471 MFPFCKMDPSIALGFYCRTRADLNHVLNSMGAWQKEHSSIPELFSV 516


>gi|241958330|ref|XP_002421884.1| cysteine protease, putative [Candida dubliniensis CD36]
 gi|223645229|emb|CAX39828.1| cysteine protease, putative [Candida dubliniensis CD36]
          Length = 443

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 94/330 (28%), Positives = 143/330 (43%), Gaps = 73/330 (22%)

Query: 85  LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------- 119
           LG    N+  A  N    S++ +SYR GF+PI  S                         
Sbjct: 69  LGQIFDNSNAA--NNYIESKLWLSYRCGFEPIPKSIDGPQPIHFFPSIIFNRTTIYSNFA 126

Query: 120 ---------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 170
                      TSD GWGCM+R+SQ L+A  LL             K + +   EI+ LF
Sbjct: 127 NLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEQEIVKLF 173

Query: 171 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
            D   SPFSIHN ++   +  L    G W GP A   S + L    + +   G    P  
Sbjct: 174 QDDTKSPFSIHNFIRVASSSPLHVKPGEWFGPNAASLSIKRLTNELQDQEINGIN--PPR 231

Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
           +++    +            DD  R   VF+K +++   +++L P+ LG++KVN  Y  +
Sbjct: 232 VFISENSD----------LFDDEIR--DVFAKEKSN--SVIILFPIRLGIDKVNSYYYNS 277

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           +    +   S GI GGKP +S Y +G ++   IY DPH  Q V      +   +  +YHS
Sbjct: 278 IFHLLSSKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQIV------ETPFNMDSYHS 331

Query: 349 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
                +++  +DPS+ IG    + D++ DF
Sbjct: 332 TNYNTLNISLLDPSMMIGILVTNIDEYIDF 361


>gi|148693225|gb|EDL25172.1| autophagy-related 4D (yeast), isoform CRA_a [Mus musculus]
          Length = 296

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 120/247 (48%), Gaps = 22/247 (8%)

Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
           DR +  I+  F D   +PF +H L++ G++ G  AG W GP         +A   R    
Sbjct: 31  DRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP-------SVVAHILRKAVE 83

Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
             C  +   +  VS D    +         D +R  S +    A+W  +++LVP+ LG E
Sbjct: 84  -SCSEVSRLVVYVSQDCTVYKA--------DVARLLS-WPDPTAEWKSVVILVPVRLGGE 133

Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
            +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDPH  QP +++ +   
Sbjct: 134 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQPSF 193

Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFT 396
             +  ++H    R +    +DPS  +GFY  ++ +F+  C+   ++   S+     P+FT
Sbjct: 194 PLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMRILSSSSVTERYPMFT 251

Query: 397 VTQTHKK 403
           V + H +
Sbjct: 252 VAEGHAQ 258


>gi|68485607|ref|XP_713286.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
 gi|46434768|gb|EAK94169.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
          Length = 446

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 98/353 (27%), Positives = 148/353 (41%), Gaps = 76/353 (21%)

Query: 98  NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 123
           N    S++ +SYR GF+PI  S                                    TS
Sbjct: 80  NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCM+R+SQ L+A  LL             K + +   EI+ LF D  +SPFSIHN 
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDGTSSPFSIHNF 186

Query: 184 LQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
           ++      L    G W GP A   S + L      +  L    +P     +S + D    
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLTNELLQDQELDGIRIPRVF--ISENSD---- 240

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
                  DD  R   VF+K ++    +L+L P+ LG++KVN  Y  ++        S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKS--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
            GGKP +S Y +G ++   IY DPH  Q V      +   +  +YH+     +++  +DP
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYHTTNYNRLNISLLDP 345

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGET 414
           S+ IG    + D++ DF    S   + +N    F     H  PV    ++ ++
Sbjct: 346 SMMIGILVTNIDEYIDF---KSSCIDNNNKIVHF---HPHTLPVQQDSIINQS 392


>gi|392574855|gb|EIW67990.1| hypothetical protein TREMEDRAFT_63874 [Tremella mesenterica DSM
           1558]
          Length = 1159

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 82/248 (33%), Positives = 112/248 (45%), Gaps = 51/248 (20%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------------REYVEIL 167
           +T+D GWGCMLR+ Q L+A AL+   LGR WR P Q                   YV IL
Sbjct: 580 LTTDAGWGCMLRTGQSLLANALIHLHLGRDWRVPSQPQVPPTSAAHLAELEAYSSYVRIL 639

Query: 168 HLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
             F D  S   PFS+H +   GK  G   G W GP     + + L             S 
Sbjct: 640 SWFLDDPSPLCPFSVHRIALIGKELGKEVGEWFGPSTAAGALKTL-----------VNSF 688

Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------------W--T 266
           P +   V+   D       +V   D     ++ S G +D                 W   
Sbjct: 689 PPSGMAVATAVDS------IVYKSDVYSASNLQSTGWSDESAPPRRQSSSSRSSTSWGNR 742

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
            +L+L+ + LGL+ VNP Y  +++  FTFPQS+GI GG+P +S Y VG Q  S +YLDPH
Sbjct: 743 AVLVLIGIRLGLDGVNPLYYESIKALFTFPQSVGIAGGRPSSSYYFVGTQANSLVYLDPH 802

Query: 327 DVQPVINI 334
             +P + +
Sbjct: 803 FTRPAVPL 810



 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 40/60 (66%), Gaps = 5/60 (8%)

Query: 340  EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
            +A   T+H D +R I L  +DPS+ +GF C+D+ DF+DFC+R ++L ++     +FT+ +
Sbjct: 962  KAQLGTFHCDKVRKIPLSGLDPSMLLGFVCKDEADFEDFCSRVAQLPQK-----IFTIQE 1016


>gi|156839152|ref|XP_001643270.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
           70294]
 gi|166990653|sp|A7TQN1.1|ATG4_VANPO RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|156113873|gb|EDO15412.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 411

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 89/309 (28%), Positives = 142/309 (45%), Gaps = 57/309 (18%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 127
           F  D  SRI  +YR  F PI  S                                +D+GW
Sbjct: 74  FLSDVISRIHFTYRTKFIPIARSDDGPSPLRINFLIGDNPFNAIENAIYNPNCFNTDIGW 133

Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
           GCM+R+ Q L+A A+    LGR +R       + +  +I+  F D+   PFS+HN ++ G
Sbjct: 134 GCMIRTGQSLLANAIQIAILGREFRVN-DGDVNEQERKIISWFMDTPDEPFSLHNFVKKG 192

Query: 188 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
            +      G W GP A  RS ++L   Q  + G+    + ++   +  DE          
Sbjct: 193 CELSSKKPGEWFGPAATSRSIQSLVE-QFPDCGIDRCIVSVSSADIFKDE---------- 241

Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
            I+D      +F   +  ++ ILLL+ + LG++KVN  Y+  +R       S+GI GG+P
Sbjct: 242 -IND------IFKNKR--YSNILLLMGVKLGVDKVNEYYLKDIRKILESRYSVGISGGRP 292

Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
            +S Y  G Q+++ +Y DPH  QP        +E+   T H+D    I++  +DPS+ IG
Sbjct: 293 SSSLYFFGYQDDTLLYFDPHKPQPST------IESLLETCHTDNFDKINISDMDPSMLIG 346

Query: 367 FYCRDKDDF 375
              + +DD+
Sbjct: 347 VLLQGEDDW 355


>gi|68485712|ref|XP_713234.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
 gi|71152285|sp|Q59UG3.1|ATG4_CANAL RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|46434715|gb|EAK94117.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
          Length = 446

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 92/317 (29%), Positives = 136/317 (42%), Gaps = 70/317 (22%)

Query: 98  NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 123
           N    S++ +SYR GF+PI  S                                    TS
Sbjct: 80  NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCM+R+SQ L+A  LL             K + +   EI+ LF D  +SPFSIHN 
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDDTSSPFSIHNF 186

Query: 184 LQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
           ++      L    G W GP A   S + LA     +  +    +P     +S + D    
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLASELLQDQEIDGIKIPRVF--ISENSD---- 240

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
                  DD  R   VF+K +     +L+L P+ LG++KVN  Y  ++        S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
            GGKP +S Y +G ++   IY DPH  Q V      +   +  +YH+     +++  +DP
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYHTTNYNRLNISLLDP 345

Query: 362 SLAIGFYCRDKDDFDDF 378
           S+ IG    + D++ DF
Sbjct: 346 SMMIGILVTNIDEYIDF 362


>gi|145481079|ref|XP_001426562.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124393637|emb|CAK59164.1| unnamed protein product [Paramecium tetraurelia]
          Length = 391

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 89/326 (27%), Positives = 148/326 (45%), Gaps = 42/326 (12%)

Query: 99  QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
           Q +S  I  +YRK F  I +S+ TSD GWGCMLRS QM+ AQ L  H      R+  Q  
Sbjct: 51  QIYSRTIWFTYRKNFPQILNSQQTSDAGWGCMLRSGQMIWAQILRVH-----IRQKKQHS 105

Query: 159 FDREYVEILHLFGDSE---------------TSPFSIHNLLQAGK-AYGLAAGSWVGPYA 202
            D +Y ++L  F D +                SP+SI  +    +  + +    W  P  
Sbjct: 106 KDYQY-KLLCAFSDDDDDEHKKMFTDNFKLCLSPYSIQKIEAISQIKFSMKPCQWYRPDQ 164

Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIY--VVSGDEDGERGGAPVVC-----------ID 249
           +  +   L + ++ E   G + L + I   ++      E  G  + C             
Sbjct: 165 ILNALSLLHQQKQLE---GSEDLEITISDSLLYDRLYSEMYGLKMDCEHIVNEIKQDKNK 221

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           + S+ C++  K       I  +  +  GL+++N  Y+P L      PQ  GI+GG+   +
Sbjct: 222 EISKICNICQKKDPKALAIFFITRI--GLDEINKEYLPFLNDLIDLPQFQGIIGGRDDKA 279

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
            YI+G   +  IYLDPH +Q  IN G   +  D  T+    +++I+ + + PS+A+GFYC
Sbjct: 280 YYILGRVNKRLIYLDPHYIQEHINRGNVVMLKD--TFFCKDVKYINEEQMSPSIALGFYC 337

Query: 370 RDKDDFDDFCARASKLAEESNGAPLF 395
           +++ + D F     ++ +  +    F
Sbjct: 338 QNQSELDKFFNSIEQIKKNYDNEKTF 363


>gi|238879782|gb|EEQ43420.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 446

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 95/331 (28%), Positives = 140/331 (42%), Gaps = 72/331 (21%)

Query: 84  ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------ 119
            LG    N   A  N    S++ +SYR GF+PI  S                        
Sbjct: 68  VLGQTFDNFDTA--NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNF 125

Query: 120 ----------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 169
                       TSD GWGCM+R+SQ L+A  LL             K + +   EI+ L
Sbjct: 126 ANLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKL 172

Query: 170 FGDSETSPFSIHNLLQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
           F D  +SPFSIHN ++      L   +G W GP A   S + L      +  +    +P 
Sbjct: 173 FQDGTSSPFSIHNFIRVASLSPLHVKSGEWFGPNAASLSIKRLTSELLQDQEIDGIKIPR 232

Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
               +S + D           DD  R   VF+K +     +L+L P+ LG++KVN  Y  
Sbjct: 233 VF--ISENSD---------LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYN 277

Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 347
           ++        S GI GGKP +S Y +G ++   IY DPH  Q V      +   +  +YH
Sbjct: 278 SIFHLLASKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYH 331

Query: 348 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           +     +++  +DPS+ IG    + D++ DF
Sbjct: 332 TTNYNRLNISLLDPSMMIGILVTNIDEYIDF 362


>gi|268536436|ref|XP_002633353.1| Hypothetical protein CBG06097 [Caenorhabditis briggsae]
          Length = 411

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 88/285 (30%), Positives = 135/285 (47%), Gaps = 54/285 (18%)

Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP-----------FDREYVE---IL 167
           T+D GWGCM+R++QM+VAQA++ +R GR WR   +K            FD E ++   IL
Sbjct: 88  TTDCGWGCMIRTTQMMVAQAIMINRFGRNWRFVRRKKSHVTVNGEETEFDTEKMKEWMIL 147

Query: 168 HLFGDSETSPFSIHNLLQ-AGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
            LF D  ++P  IH +++ A +  G  A G W  P       EA+   ++A T       
Sbjct: 148 KLFEDKPSAPLGIHKMIEIAAREKGKRAVGCWYSPS------EAVFIMKKAITESASPLT 201

Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPR 284
              +  +S D     G   +  ++  ++H          WT  L+LV +V LG  ++N  
Sbjct: 202 GDTVMYLSID-----GRVHIRDLEVETKH----------WTKTLMLVIVVRLGAAELNRI 246

Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
           Y+P L   F+    LGI GG+P  S + VG   +  IYLDPH     I I   D++ +TS
Sbjct: 247 YVPHLMRLFSMDSCLGITGGRPDHSCWFVGYYGDQVIYLDPHVAHEYIPI---DMDFNTS 303

Query: 345 -------------TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 376
                        +YH  ++  +H   +DPS A+ F    ++ FD
Sbjct: 304 QEDPKKPKKCPERSYHCRLLSKMHFLDMDPSCALCFRFESREQFD 348


>gi|344229797|gb|EGV61682.1| hypothetical protein CANTEDRAFT_115142 [Candida tenuis ATCC 10573]
          Length = 408

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 79/268 (29%), Positives = 128/268 (47%), Gaps = 37/268 (13%)

Query: 116 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 175
           I +   T+DVGWGCM+R+SQ L+A           +++ + +   +E +++L  F DSE 
Sbjct: 123 IDNENFTTDVGWGCMIRTSQSLLANT---------YKRMISEDAQQE-IQLLDQFKDSEA 172

Query: 176 SPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS 233
           +PFS+HN ++      L    G W GP A   S + L     ++   G   LP    ++S
Sbjct: 173 APFSLHNFIRVANESPLQVKPGQWFGPNAASLSIQRLCNLVNSKENFG---LPGLSVLIS 229

Query: 234 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
            + D           DD  +   +  K Q+    +L+L+P+ LG++K N  Y  ++    
Sbjct: 230 ENSD---------LYDDKVQEF-LDKKKQS----LLILLPIRLGIDKTNEFYYSSILQLL 275

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
              QS+GI GGKP +S Y  G   +  +YLDPH  Q           A  ++YH+   + 
Sbjct: 276 NCKQSVGIAGGKPSSSFYFFGYDNDELLYLDPHYPQ--------GTNAGYNSYHTPRYQR 327

Query: 354 IHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
           + +  +DPS+ IG    D  D++ F A 
Sbjct: 328 LTISQLDPSMMIGILVDDLQDYNTFKAE 355


>gi|429850312|gb|ELA25600.1| cysteine protease atg4 [Colletotrichum gloeosporioides Nara gc5]
          Length = 411

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 87/310 (28%), Positives = 123/310 (39%), Gaps = 83/310 (26%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 131
           A F  DF S+  ++YR  F+ I  S                         +SD GWGCM+
Sbjct: 109 AAFLDDFESKFWMTYRSEFELIAKSTDPRASSALSLSMRIKSQLVDQSGFSSDSGWGCMI 168

Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYG 191
           RS QML+A A+    LGR                                       A G
Sbjct: 169 RSGQMLLANAMAITNLGR--------------------------------------VACG 190

Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 251
              G W GP A  R  ++L   Q   +        + +Y          G  P V  D  
Sbjct: 191 KYPGEWFGPSATARCIQSLTNAQEQPS--------LRVYST--------GDGPDVYED-- 232

Query: 252 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
            +   +       + P L+LV   LG++K+ P Y   L      PQS+GI GG+P AS Y
Sbjct: 233 -KFMKIAKPDGTRFHPTLILVGTRLGIDKITPVYWDALIAALQMPQSVGIAGGRPSASHY 291

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
            +G Q     YLDPH  +P +    D     +AD  T H+  +R +H+  +DPS+ IGF 
Sbjct: 292 FIGAQGSFLFYLDPHHTRPALPYHSDPSRYTDADIDTAHTRRLRRLHVREMDPSMLIGFL 351

Query: 369 CRDKDDFDDF 378
            +D DD+ ++
Sbjct: 352 IKDDDDWSEW 361


>gi|66810578|ref|XP_638996.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
 gi|60467622|gb|EAL65643.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
          Length = 551

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 59/161 (36%), Positives = 93/161 (57%), Gaps = 6/161 (3%)

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W P+L+L+P+ LGL+ +N  Y  +L   F FPQ+LG+VGGKP AS Y +  Q+++  YLD
Sbjct: 383 WEPLLILIPMRLGLDGLNSIYHSSLLEIFKFPQNLGVVGGKPRASLYFIAAQDDNLFYLD 442

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH VQ  I + ++  +   +T+     +  H+  +DPSL + F+C+ KDDF+DF  R+ K
Sbjct: 443 PHTVQNHIEV-ENGSKFPLNTFFCSTTKRTHVSEVDPSLVVAFFCKTKDDFNDFVERSKK 501

Query: 385 LAEESNGAPLFTVTQTHKKPVNHSDV----LGETGGVPEDD 421
           +  +    P+F++        +  D     + ETGG   DD
Sbjct: 502 MTSQMEN-PIFSIFDNEPDYDSSRDYEYEEIDETGGETSDD 541



 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 69/119 (57%), Gaps = 5/119 (4%)

Query: 94  LAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
           + EF  DF++R+L  +YR+GF  I D+   +D GWGCMLRS QML++  LL + LG  W+
Sbjct: 140 IKEFLNDFTTRVLWFTYRQGFPCIDDTMYDNDCGWGCMLRSGQMLLSNVLLHNILGDEWK 199

Query: 153 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 211
           +         + +I+ +F D  ++PFSIHN+   G+  G   G W  P  + ++ + L 
Sbjct: 200 RSSSAT----HPDIISMFLDKPSAPFSIHNIAMEGQNLGKNIGEWFAPSIISQTIKILV 254


>gi|167393590|ref|XP_001740639.1| cysteine protease atg4 [Entamoeba dispar SAW760]
 gi|165895180|gb|EDR22930.1| cysteine protease atg4, putative [Entamoeba dispar SAW760]
          Length = 332

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 91/308 (29%), Positives = 139/308 (45%), Gaps = 38/308 (12%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 161
           I I+YRK    I +   T+D GWGCM+RS QM++AQ  L   LG  W+     +    + 
Sbjct: 39  IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96

Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 220
            +++ I++LFGDS  S FSIH L+      G+  G W GP        + A    AE   
Sbjct: 97  FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP--------SFASDIAAEHIN 148

Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
             +      YV    + G   G  +             SK +  + P ++ VPL LG E 
Sbjct: 149 EMRVFRTRGYVA---KLGSIVGPKI----------EELSKDEVGFNPCIIFVPLRLGPES 195

Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
               + P L+  F  PQ +G++GGKPG + Y       +  +LDPH  Q  I     D++
Sbjct: 196 PENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAI-----DMK 250

Query: 341 ADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 398
            D S  +Y     + ++   IDPS+++ F  +  +D++ F     K  E    + LF   
Sbjct: 251 GDWSYQSYFCKDNKSMNYSKIDPSISLVFLVKHVNDYEHF----KKSFENKTFSKLFIFK 306

Query: 399 QTHKKPVN 406
              +K +N
Sbjct: 307 NEIEKKLN 314


>gi|366995231|ref|XP_003677379.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
 gi|342303248|emb|CCC71026.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
          Length = 495

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 90/321 (28%), Positives = 145/321 (45%), Gaps = 74/321 (23%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 127
           F +D  +R+  +YR  F PI  S                                +D+GW
Sbjct: 75  FLKDVVTRLHFTYRTRFKPIMKSPEGPSPLNFSLVIRENPIDVIENAITNPDCFNTDIGW 134

Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGDSETSPFSIHNLLQ 185
           GCM+R+ Q L+   L   RLGR +R     P +++  E  I+  F D+   PFS+H  + 
Sbjct: 135 GCMIRTGQSLLGNTLQIVRLGRDFR---YDPENKDISENRIIEWFIDAPEKPFSLHQFIT 191

Query: 186 AG-KAYGLAAGSWVGPYAMCRSWEALAR----CQRAETGLGCQSLPMAIYVVSGDEDGER 240
            G +  G   G W GP A  RS ++L R    C  AE           + V SGD     
Sbjct: 192 EGMELSGKNPGEWFGPAATARSIQSLIRKFPDCGIAEC---------LVSVSSGD----- 237

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
                +  D+  +   VF+  + +   +L+L+ + LGL  VN  Y  ++R   +   S+G
Sbjct: 238 -----IYSDEVKQ---VFADNKKN---LLILLGVKLGLNAVNECYWDSIRHILSSKYSVG 286

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY---HSDVIRHIHLD 357
           I GG+P +S Y  G + +  +Y DPH  QP        LE +  +Y   H++    + ++
Sbjct: 287 ISGGRPSSSLYFFGYEGDELLYFDPHSPQP-------SLEENNVSYKSCHTNKYGKLLMN 339

Query: 358 SIDPSLAIGFYCRDKDDFDDF 378
            +DPS+ +GF  R ++D+++F
Sbjct: 340 DMDPSMLLGFLIRGQEDWENF 360


>gi|58260832|ref|XP_567826.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|134117209|ref|XP_772831.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|338817600|sp|P0CQ11.1|ATG4_CRYNB RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|338817601|sp|P0CQ10.1|ATG4_CRYNJ RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|50255449|gb|EAL18184.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57229907|gb|AAW46309.1| conserved hypothetical protein [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 1193

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 82/240 (34%), Positives = 110/240 (45%), Gaps = 28/240 (11%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 165
           +TSD GWGCMLR+ Q L+  AL+   LGR WR P       E               Y +
Sbjct: 562 LTSDAGWGCMLRTGQSLLVNALIHIHLGRDWRVPSTPASFSEATTTQEIAALKDYAKYAQ 621

Query: 166 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
           +L  F D  S   PFS+H +   GK  G   G W GP     + + LA    A  G+   
Sbjct: 622 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 680

Query: 224 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPL 274
           +   +I      Y  S    D     +P        R     +K +  W    +L+LV +
Sbjct: 681 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRRGDNEAK-EEKWGKRAVLILVGV 739

Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
            LGL+ VNP Y  +++  FTFPQS+GI GG+P +S Y VG Q     YLDPH  +P I +
Sbjct: 740 RLGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 799



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 5/53 (9%)

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
           TYH + I+ + L  +DPS+ +GF C+D+DDF+DF  R ++L ++     +FTV
Sbjct: 952 TYHCEKIKKMPLSGLDPSMLLGFVCKDEDDFEDFVERVAQLPKK-----IFTV 999


>gi|390594065|gb|EIN03481.1| hypothetical protein PUNSTDRAFT_56214 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 1093

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 93/274 (33%), Positives = 127/274 (46%), Gaps = 55/274 (20%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 121
           F  DF+SR+ ++YR  F PI D+ +                                   
Sbjct: 369 FYADFTSRVWVTYRSHFQPIRDTTLSALESDFGEQAQSANTSGNSVVSGSPSSGRRWWGG 428

Query: 122 ----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-LQKPFDR--EYVEILHLFGDSE 174
               TSD GWGCMLR+ Q L+A ALL   LGR WR+P   +P      YV++L  F DS 
Sbjct: 429 EKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPSYPQPTAAYASYVQLLTWFFDSP 488

Query: 175 TS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 232
           +   PFS+H +  AGK  G   G W GP     + + L     A  G G         VV
Sbjct: 489 SPLCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVH---AFPGGGLGVAVAVDGVV 545

Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
              +      +P     D+ RH    + G      +L+L+ + LGL+ VNP Y  T++  
Sbjct: 546 YETDVFSASHSP-----DSRRHHRTSTWGDRG---VLILIGIRLGLDGVNPIYYDTIKEL 597

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
           +T+PQS+GI GG+P +S Y VG Q +S  YLDPH
Sbjct: 598 YTWPQSVGIAGGRPSSSYYFVGSQADSLFYLDPH 631



 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 37/57 (64%), Gaps = 2/57 (3%)

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
           A+  T+H + +R + L  +DPS+ IGF CRD++++ D  AR + +A++    P+F V
Sbjct: 779 AELRTFHCERVRKMPLSGLDPSMLIGFLCRDEEEWRDLRARIANMAKKFK--PIFAV 833


>gi|281210274|gb|EFA84441.1| autophagy protein 4 [Polysphondylium pallidum PN500]
          Length = 734

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 62/163 (38%), Positives = 91/163 (55%), Gaps = 13/163 (7%)

Query: 238 GERGGA---PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
           GE  G+   P+ C D  S  C         W  I++LVP+ LGL+K+N  Y   ++    
Sbjct: 515 GENSGSFKDPLTCSDFFSSSCI-----PQRWKSIIILVPIKLGLDKLNEVYFREIKSMLE 569

Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 354
            PQS+G++GGKP  S Y VG Q+E  IYLDPH V   ++    +    + +YH  V + +
Sbjct: 570 LPQSIGLIGGKPKQSFYFVGYQDEHIIYLDPHFVHDTVSPNDINF---SDSYHHCVPQKM 626

Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
            +  +DPS+AIGFYC  + DF+DFC R  ++  E  G P+ +V
Sbjct: 627 LISQLDPSMAIGFYCHTQSDFEDFCVRIKEI--EKRGFPVVSV 667



 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 31/47 (65%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 136
            N  +  F  DF + +  SYRK F PI ++ IT+D+GWGCM+R+ QM
Sbjct: 269 ANQEIDRFIADFKNILWFSYRKDFAPIENTNITTDIGWGCMVRTGQM 315


>gi|45185039|ref|NP_982756.1| ABL191Wp [Ashbya gossypii ATCC 10895]
 gi|62899767|sp|Q75E61.1|ATG4_ASHGO RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|44980675|gb|AAS50580.1| ABL191Wp [Ashbya gossypii ATCC 10895]
 gi|374105958|gb|AEY94868.1| FABL191Wp [Ashbya gossypii FDAG1]
          Length = 521

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 96/310 (30%), Positives = 137/310 (44%), Gaps = 52/310 (16%)

Query: 96  EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 126
           EF  D  +R+  +YR  F PI     G S ++                        +D+G
Sbjct: 115 EFLADVHTRLHFTYRTRFVPIPRHPNGPSPMSISVMLRDNPLNVIENVLNNPDCFQTDIG 174

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+A AL    LGR +R       + E + I+  F D    PFS+H  +Q 
Sbjct: 175 WGCMIRTGQSLLANALQRACLGRDFRIDDNAANEHE-LRIIKWFEDDPKYPFSLHKFVQE 233

Query: 187 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G +  G   G W GP A  RS +AL     A     C      I   SGD          
Sbjct: 234 GFSLSGKKPGEWFGPSATSRSIQALVAKFPA-----CGIAHCVISTDSGD---------- 278

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
           V +D+      +F    +    +LLL+ + LG++ VN  Y   +R   +   S+GI GG+
Sbjct: 279 VYMDEVE---PLFRADPS--AAVLLLLCVRLGVDVVNEVYWEHIRHILSSEHSVGIAGGR 333

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q+E   YLDPH  Q  +   + DL+   S  H+     +H+  IDPS+ I
Sbjct: 334 PSSSLYFFGYQDEHLFYLDPHKPQLNLASYQQDLDLFRSV-HTQRFNKVHMSDIDPSMLI 392

Query: 366 GFYCRDKDDF 375
           G     KDD+
Sbjct: 393 GILLNGKDDW 402


>gi|213403524|ref|XP_002172534.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
 gi|212000581|gb|EEB06241.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
          Length = 314

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 99/341 (29%), Positives = 140/341 (41%), Gaps = 57/341 (16%)

Query: 48  MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 107
           M  I ER L    T    S + IW LG  H  A +      A       F QD    + +
Sbjct: 4   MSHILERYLRMFPTNHEPSGTFIWSLG--HSYATETGKWPEA-------FVQDTYDLLSL 54

Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 167
           +YRK     G    +SD GWGCM+RS Q ++A  L   R  +P   P+ K        IL
Sbjct: 55  TYRKCI--AGMECFSSDAGWGCMIRSMQTMLANCL---RRVQP-SLPVHK--------IL 100

Query: 168 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
           H F D   +  S+H  + AG     +  G+W GP  +      L           C + P
Sbjct: 101 HYFADEANAYLSLHQFVDAGHTLCNITPGNWFGPATVSHCAAHL-----------CSTHP 149

Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI--LLLVPLVLGLEKVNPR 284
                V    DG                 ++  + Q   TP   LLL  L LG++ ++  
Sbjct: 150 QVGLNVCVSHDG-----------------AIMYRDQLRNTPYPRLLLFTLRLGIDTIHTS 192

Query: 285 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
           Y   L    T PQ++GIVGG+P A+ Y    Q +   YLDPH  Q        D  A  S
Sbjct: 193 YYEQLCHVLTIPQAIGIVGGRPRAAHYFYACQSQWFFYLDPHTTQTAHTF---DNPAPNS 249

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           ++H   +R + ++ +DP + +GF    ++   DF  R  KL
Sbjct: 250 SFHVTTLRRLRINELDPCMVLGFAITSEECQTDFEQRIVKL 290


>gi|328868883|gb|EGG17261.1| autophagy protein 4 [Dictyostelium fasciculatum]
          Length = 616

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 53/140 (37%), Positives = 84/140 (60%), Gaps = 5/140 (3%)

Query: 261 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
            Q++W  +++LVP+ LGL+K+N  Y   ++     P S+G++GGKP  S Y VG Q+E  
Sbjct: 426 NQSNWKSLIILVPVKLGLDKLNEIYFSGIKAMLQMPSSIGLIGGKPKQSFYFVGFQDEHI 485

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
           IYLDPH V   I+    +     ++YH  + + +H   IDPS+A GFYC    DF+ FC 
Sbjct: 486 IYLDPHFVHDTIHPFDSNF---LNSYHDCIPQKMHFSQIDPSMAFGFYCHTYKDFEQFCI 542

Query: 381 RASKLAEESNGAPLFTVTQT 400
           R  ++  E++G P+ ++ +T
Sbjct: 543 RIKEI--EASGFPILSIGET 560



 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 61/100 (61%), Gaps = 7/100 (7%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR---P 150
           +  F +DF S +  SYRK F  I ++ IT+D+GWGCMLR+ QM++A+ALL H       P
Sbjct: 194 VERFLEDFKSILWFSYRKDFPSIENTSITTDIGWGCMLRTGQMILARALLKHFYNNENIP 253

Query: 151 WRKPLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGK 188
           + + ++   + +Y +I+  F D  S+ + +SIH ++   K
Sbjct: 254 YGEKIKT--NSKYKKIMSWFCDYPSKENFYSIHQIVHKNK 291


>gi|330840629|ref|XP_003292315.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
 gi|325077457|gb|EGC31168.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
          Length = 465

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 52/134 (38%), Positives = 86/134 (64%), Gaps = 5/134 (3%)

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++++PL LG++++N  YI  L+   + PQSLG +GGKP  S Y +G Q++  IYLD
Sbjct: 217 WKSLIIMIPLKLGVDRINTSYIRKLKSILSIPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 276

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH VQ  ++   ++    + T+   + + +   +IDPSL++GFYC+DK  FDD C R SK
Sbjct: 277 PHFVQDTVDPSSNNY---SETFCGCIPQKMSFSNIDPSLSVGFYCKDKSSFDDLCDRLSK 333

Query: 385 LAEESNGAPLFTVT 398
           L  E++  P+ +++
Sbjct: 334 L--ENDEFPIISIS 345


>gi|405119256|gb|AFR94029.1| peptidase family C54 protein [Cryptococcus neoformans var. grubii
           H99]
          Length = 1185

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 81/239 (33%), Positives = 110/239 (46%), Gaps = 26/239 (10%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 165
           +TSD GWGCMLR+ Q L+  AL+   LGR WR P       E               Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLVNALIHVHLGRDWRVPSTPASFSEATTNQETAALKDYAKYAQ 619

Query: 166 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
           +L  F D  S   PFS+H +   GK  G   G W GP     + + LA    A  G+   
Sbjct: 620 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 678

Query: 224 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSK-GQADWTPILLLVPLV 275
           +   +I      Y  S    D     +P        R     +K G+     +L+LV + 
Sbjct: 679 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRGGDNKAKEGKWGKRAVLILVGIR 738

Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
           LGL+ VNP Y  +++  FTFPQS+GI GG+P +S Y +G Q     YLDPH  +P I +
Sbjct: 739 LGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFIGSQANHLFYLDPHLTRPAIPL 797



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 5/53 (9%)

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
           TYH + I+ + L  +DPS+ +GF C+D+DDF+DF  R ++L ++     +FTV
Sbjct: 946 TYHCEKIKKMPLSGLDPSMLLGFVCKDEDDFEDFVERVAQLPKK-----IFTV 993


>gi|321263995|ref|XP_003196715.1| hypothetical protein CGB_K2500C [Cryptococcus gattii WM276]
 gi|317463192|gb|ADV24928.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 1188

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 79/239 (33%), Positives = 109/239 (45%), Gaps = 26/239 (10%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 165
           +TSD GWGCMLR+ Q L+  AL+   LGR WR P       E               Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLINALIHVHLGRDWRLPSTPATFSEATTSQEIAALKDYAKYAQ 619

Query: 166 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
           ++  F D  S   PFS+H +   GK  G   G W GP     + + LA    A  G+   
Sbjct: 620 MVSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGTLKTLANS-FAPCGIAVA 678

Query: 224 SLPMAI------YVVSG--DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 275
           +   +I      Y  S    +D  R            RH +   +G+     +L+LV + 
Sbjct: 679 TATDSIIYRSDVYAASNLPSDDWNRISPTFNPSRKKKRHNAEAKEGKWGERAVLILVGIR 738

Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
           LGL+ VNP Y  +++  FTFPQ+ G  GG+P +S Y VG Q     YLDPH  +P I +
Sbjct: 739 LGLDGVNPIYYDSIKALFTFPQAGGSAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 797



 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 34/53 (64%), Gaps = 5/53 (9%)

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
           TYH + I+ + L  +DPS+ +GF C+ +DDF++F  R + L ++     +FTV
Sbjct: 947 TYHCEKIKKMPLSGLDPSMLLGFVCKSEDDFENFVERVALLPKK-----IFTV 994


>gi|254584596|ref|XP_002497866.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
 gi|238940759|emb|CAR28933.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
          Length = 489

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 94/321 (29%), Positives = 145/321 (45%), Gaps = 53/321 (16%)

Query: 88  AAGNNGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT-------------------- 122
           +  +N   +F  D  SR+  +YR  F PI     G S ++                    
Sbjct: 69  SKNSNENPDFLSDVRSRLHFTYRTRFMPIPAVPGGPSPLSFHFLIRENPINAIENAINNP 128

Query: 123 ----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
               +DVGWGCM+R+ Q L+  AL   RLGR +R  +      E + I+  F D   +PF
Sbjct: 129 ACFNTDVGWGCMIRTGQSLLGNALQIARLGRGYR--IGSELKPEEISIIDWFVDIPDAPF 186

Query: 179 SIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
           SIHN +  G +      G W GP A  RS ++L R  +      CQ     I V SGD  
Sbjct: 187 SIHNFVSKGMELSSKRPGEWFGPAATSRSIQSLIRGFKQCGIDDCQ-----ISVSSGD-- 239

Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
                   V  +D  +   VF++ +   + ILLL+ + LG+  VN  Y   ++       
Sbjct: 240 --------VYEEDVMK---VFNESKD--SRILLLLGVKLGINAVNEFYWNDIKRLLGSKF 286

Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
           S+GI GG+P +S Y +G Q    +YLDPH  QP ++    +  +   + HS     + + 
Sbjct: 287 SVGIAGGRPSSSLYFIGYQGNELLYLDPHTAQPFLSPSHQE-RSFYDSCHSSNYGKLAIQ 345

Query: 358 SIDPSLAIGFYCRDKDDFDDF 378
            +DPS+ IG     +++F ++
Sbjct: 346 DLDPSMLIGILISGEEEFKEW 366


>gi|365988214|ref|XP_003670938.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
 gi|343769709|emb|CCD25695.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
          Length = 427

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 84/253 (33%), Positives = 119/253 (47%), Gaps = 30/253 (11%)

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 180
           +D+GWGCM+R+ Q L+  AL    LGR WR           +  EI   F D+   PFS+
Sbjct: 55  TDIGWGCMIRTGQSLLGNALQLRNLGRDWRFDDNTDLKMTEKSNEIASWFMDTPEKPFSL 114

Query: 181 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD---- 235
           H  +  G +  G   G W GP A  RS ++L   +  E G+        I V SGD    
Sbjct: 115 HRFISKGMQLSGKKPGEWFGPAATARSIQSLVH-EFPECGID----KCLISVSSGDIYKT 169

Query: 236 --EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 293
             ED    G           H      GQ D T IL+L+ + LG+E +N  Y  ++R   
Sbjct: 170 EVEDVFNEG-----------HTGEARNGQKDKT-ILILLGVKLGIETINRCYWDSIRRIL 217

Query: 294 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 353
           +   S+GI GG+P +S Y  G Q +  +Y DPH  QP  +  K+DL  +T   H+     
Sbjct: 218 SSEYSIGIAGGRPSSSLYFFGYQGDELLYFDPHSPQPSYD--KNDLFYETC--HTTNFGK 273

Query: 354 IHLDSIDPSLAIG 366
           + L  +DPS+ +G
Sbjct: 274 LSLADMDPSMLLG 286


>gi|159465677|ref|XP_001691049.1| autophagy protein [Chlamydomonas reinhardtii]
 gi|158279735|gb|EDP05495.1| autophagy protein [Chlamydomonas reinhardtii]
          Length = 484

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 62/167 (37%), Positives = 86/167 (51%), Gaps = 21/167 (12%)

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G++K+NP YIP L+   ++PQS+GIVGG+P AS Y+ GVQ+ S IYLDPH+ Q  +    
Sbjct: 339 GMDKINPVYIPQLQQVLSWPQSVGIVGGRPSASLYVCGVQDASFIYLDPHEAQLALG--- 395

Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 396
                   TY  DV+R +    +DPSLAIGF C    + +D  AR   LA + + APL T
Sbjct: 396 --------TYFCDVVRVLPSAQLDPSLAIGFVCTSSAELEDLFARLQALATQHSSAPLMT 447

Query: 397 VTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 443
           +T      V          G   D       +    G    D+W+L+
Sbjct: 448 LTTGSGAAV----------GCGSDADFTDDVLEGGTGQQQLDEWELV 484



 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 48/117 (41%), Positives = 65/117 (55%), Gaps = 6/117 (5%)

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
           DF SR+  +YRK F  +G S +TSDVGWGC LRS QML+A+     R G   R  L + +
Sbjct: 49  DFRSRMWCTYRKDFPALGPSLLTSDVGWGCTLRSGQMLLAEVRHGWRAGAMMRVALGRDW 108

Query: 160 DR-----EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 210
            R     E V  ++    D   +P SIH +  AG   G+  G W+GP+ +C+  EAL
Sbjct: 109 QRCSDNLEAVRPVVAALLDCAEAPLSIHRICDAGGPAGIVPGRWLGPWMLCKGLEAL 165


>gi|62899792|sp|Q8NJJ3.1|ATG4_PICPA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4; AltName:
           Full=Pexophagy zeocin-resistant mutant protein 8
 gi|21585563|gb|AAL25849.1| Paz8 [Komagataella pastoris]
          Length = 533

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 87/267 (32%), Positives = 117/267 (43%), Gaps = 50/267 (18%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 135
           F  D  S+I ++YR GF PI   K                      TSD GWGCM+R+SQ
Sbjct: 65  FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124

Query: 136 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 194
            L+A ALLF  LGR W    + P + E+  I+  F D    PFSIHN +Q G K      
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184

Query: 195 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 251
           G W GP A  R+ + L           C+  P   + +Y  S             C D  
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221

Query: 252 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
                  + G +D +TPIL+L+ + LG+EKVN      LR   +  QS+GI G K     
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNLYIGDLLRECLSLKQSVGISGRKTSFLA 281

Query: 311 YI-VGVQEESAIYLDPHDVQPVINIGK 336
            + +G Q +   YL P   +  +  GK
Sbjct: 282 LLSIGFQGDYLFYLIPTFPKKALTFGK 308


>gi|183230788|ref|XP_001913481.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|169802747|gb|EDS89733.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449704540|gb|EMD44766.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 330

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 90/309 (29%), Positives = 136/309 (44%), Gaps = 40/309 (12%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 161
           I I+YRK    I +   T+D GWGCM+RS QM +AQ  L   LG  W+     +    + 
Sbjct: 39  IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCINTERNI 96

Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARCQRAETG 219
            +++ I++LFGDS  S FSIH L+      G+  G W GP +A   + E +   +   T 
Sbjct: 97  FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEMRVFRTR 156

Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
                L   I           G      I D              + P ++ VPL LG E
Sbjct: 157 GYVAKLGSII-----------GSKIEELIKDG-----------GGFNPCIIFVPLRLGPE 194

Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
                + P L+  F  PQ +G++GGKPG + Y       +  +LDPH  Q  I     D+
Sbjct: 195 SPENEFKPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAI-----DM 249

Query: 340 EADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
           + D S  +Y     + +    +DPS+++ F  +  +D++ F     K  E    + LFT 
Sbjct: 250 KGDWSYQSYFCKDNKSMLYSKMDPSISLVFLVKHANDYEHF----KKSFENKTFSKLFTF 305

Query: 398 TQTHKKPVN 406
               +K +N
Sbjct: 306 KDETEKELN 314


>gi|407043540|gb|EKE42005.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 330

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/315 (28%), Positives = 141/315 (44%), Gaps = 41/315 (13%)

Query: 100 DFSSR-ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---L 155
           DF+   I I+YRK    I +   T+D GWGCM+RS QM +AQ  L   LG  W+     +
Sbjct: 33  DFARHTIWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCI 90

Query: 156 QKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARC 213
               +  +++ I++LFGDS  S FSIH L+      G+  G W GP +A   + E +   
Sbjct: 91  NTERNIFHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEM 150

Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 273
           +   T                     RG    +     S+   +   G   + P ++ VP
Sbjct: 151 RVFRT---------------------RGYVAKLGSIIGSKIEELIKDG-GGFNPCIIFVP 188

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 333
           L LG E     + P L+  F  PQ +G++GGKPG + Y       +  +LDPH  Q  I 
Sbjct: 189 LRLGPESPENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGINLYFLDPHTTQNAI- 247

Query: 334 IGKDDLEADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG 391
               D++ D S  +Y     + +    +DPS+++ F  +  +D++ F     K  E    
Sbjct: 248 ----DMKGDWSYQSYFCKDNKSMLYSKMDPSISLVFLVKHANDYEHF----KKSFENKTF 299

Query: 392 APLFTVTQTHKKPVN 406
           + LFT     +K +N
Sbjct: 300 SKLFTFKDETEKELN 314


>gi|385305819|gb|EIF49766.1| cysteine protease atg4 [Dekkera bruxellensis AWRI1499]
          Length = 476

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/323 (28%), Positives = 150/323 (46%), Gaps = 56/323 (17%)

Query: 95  AEFNQDFSSRILISYRKGF-----DPIGDSKI-------------------TSDVGWGCM 130
           ++F  D ++R+  +YR GF     DP G S +                   T+D GWGCM
Sbjct: 91  SDFISDVATRLWFTYRSGFPVIKRDPDGPSPLSLGSLFRGTLDVKNASIGFTTDSGWGCM 150

Query: 131 LRSSQMLVAQALLFHRLGRPWRK-PLQKP---------FDREYVEILHLFGDSETSPFSI 180
           +R+SQ L+A ALL   +GR WR  P + P         +++++ +I+  F D   +PFSI
Sbjct: 151 IRTSQSLLANALLNLHVGRKWRYIPAENPNGETEYAKKYEKQW-QIITWFADFPWAPFSI 209

Query: 181 HNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
             +++ G  +     G W GP A  RS   L +    ++   C+   +  Y+  G+ D  
Sbjct: 210 QQIVRYGSEHCNKKPGEWFGPSAASRSIVYLCK----QSYKACK---LNTYLTEGNGD-- 260

Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
                    +D     S     +  + P L+L  + LG+  VNP Y   L+   +  QS+
Sbjct: 261 -------IYEDELLXVSCPEGTENGFRPTLILSGVRLGVXXVNPVYWAFLKKLLSIHQSV 313

Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEAD-TSTYHSDVIRHIH 355
           GI GG+P +S Y  G Q ++  Y+DPH  Q  +   ++   D   +  ++ H+  IR + 
Sbjct: 314 GIAGGRPSSSHYFFGYQGDNLFYMDPHTPQTALLADHVDDADYRXEYVASVHTKRIRKLG 373

Query: 356 LDSIDPSLAIGFYCRDKDDFDDF 378
           L  +DPS+ IG      +D+ + 
Sbjct: 374 LCEMDPSMLIGLLVTSLEDYKEL 396


>gi|340508502|gb|EGR34192.1| hypothetical protein IMG5_021070 [Ichthyophthirius multifiliis]
          Length = 285

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 85/291 (29%), Positives = 127/291 (43%), Gaps = 44/291 (15%)

Query: 101 FSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
           F S I I+YR+ F P+     +  SD GWGCM+R  QM +A+ L              K 
Sbjct: 2   FESIIWITYRRKFPPLKAPQYEYISDTGWGCMIRVGQMALAEGL--------------KR 47

Query: 159 FDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAE 217
           F  +  EI+ LF D + S FSI N+ +AGK  + L AG W  P  +C   + L   +   
Sbjct: 48  FQIKEDEIIDLFQDKKDSLFSIQNICEAGKEEFKLEAGDWFNPIRICYILQILNEKK--- 104

Query: 218 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 277
              G + L   I  +S D         ++  +D     S    G      ++L +   LG
Sbjct: 105 ---GFKDLK--IRTISSDR--------ILIFEDLEMEFSSEKNG------LILFLVCKLG 145

Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 337
           LEK    Y+      F +  S+G++GGKP  + + VG  E+  IYLDPH VQ       +
Sbjct: 146 LEKTEENYLKIALKIFDYKNSIGMIGGKPKKALFFVGRIEDQLIYLDPHYVQDF-----N 200

Query: 338 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
               D ++Y       +    ID S+    +  +K++   F     +L EE
Sbjct: 201 QNNVDQNSYFCKNYAVLDQKKIDSSIGNVLFFENKEELKMFFQFLDQLKEE 251


>gi|145549650|ref|XP_001460504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428334|emb|CAK93107.1| unnamed protein product [Paramecium tetraurelia]
          Length = 402

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/327 (26%), Positives = 139/327 (42%), Gaps = 45/327 (13%)

Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPWRKP--LQKP 158
           SS I  SYRK       S +TSD GWGCM+R +QM +AQ +  +H   +P +    ++  
Sbjct: 71  SSIIWFSYRKKIPQFQISSLTSDTGWGCMIRVAQMALAQVIRHYHSFTQPEQLIVLIRHF 130

Query: 159 FDREYVEILHLFGDSETS-------PFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEAL 210
            D +  E+++     + +       PFSI  ++   K  +    G W  P  +  +   L
Sbjct: 131 LDDDDDELINFIKQDQKNQVQYYHAPFSIQKIVYHAKVEFKKEPGDWYKPNEILETLNYL 190

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP--- 267
            +  +        SL M IY+                + DA +    + KG  +W     
Sbjct: 191 FKYSQY-------SLNMQIYI---------NYQCAFILQDAIKQMFNYDKGNQEWLKECI 234

Query: 268 -------------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 314
                        I + +P  +GL++VN  Y+  L +  T P   GI+GG    + YIVG
Sbjct: 235 KNNNQFISQHDKGIAIFLPARIGLQRVNQDYLEVLNILMTLPYFQGIIGGVTNRAFYIVG 294

Query: 315 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
             ++  IYLDPH VQ   N   +DL    ++Y    I+ IH  SIDPS+ +    R+  +
Sbjct: 295 RIQDYLIYLDPHFVQNAQNF--EDLSKTQASYTCQNIQLIHNKSIDPSIVVCLCVRNGLE 352

Query: 375 FDDFCARASKLAEESNGAPLFTVTQTH 401
             D     + + +E       ++  T+
Sbjct: 353 LLDLWHSLNHMKQEFQEFFFISILDTN 379


>gi|149246610|ref|XP_001527730.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|166990616|sp|A5DSB4.1|ATG4_LODEL RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|146447684|gb|EDK42072.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 523

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 85/289 (29%), Positives = 135/289 (46%), Gaps = 43/289 (14%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALL--FHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
            TSD GWGCM+R+SQ L+A ALL  FH  G    +P      +   +++ LF D+ ++PF
Sbjct: 179 FTSDAGWGCMIRTSQNLLANALLRLFHTTGG---QPQNFAVTKTEADVIELFQDTLSAPF 235

Query: 179 SIHNLLQAGKAYGL--AAGSWVGPYA-------MCRSWEALARCQRAETGLGCQS---LP 226
           S+HN ++A  +  L    G W GP A       +   +  + + +R+E   G  S   +P
Sbjct: 236 SLHNFIKAANSLSLNIKPGQWFGPSAASLSIKKLVNDYNLIQQERRSERDSGRDSGHKVP 295

Query: 227 M-----------AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG-----QADWTPILL 270
                       +      D   +R   P V +   S +C ++        + +  PIL 
Sbjct: 296 TPNLKLHSKSADSDSDSDSDAISKRNSIPYVYV---SENCDLYDDEINAIFELEQRPILF 352

Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPHDVQ 329
           L P+ LG+E+VN  Y  ++        S+GI GGKP +S Y +G + E+  IY DPH  Q
Sbjct: 353 LFPIRLGIEQVNKYYYSSILQILASKFSVGIAGGKPSSSFYFIGYEGEDDLIYFDPHLPQ 412

Query: 330 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
            V          +  +YH+     + +D +DPS+ IG      D++ +F
Sbjct: 413 IV------QTPVNLESYHTSEYSKLKIDQLDPSMMIGILIETIDEYQEF 455


>gi|326430141|gb|EGD75711.1| pyruvate water dikinase [Salpingoeca sp. ATCC 50818]
          Length = 1055

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 142/308 (46%), Gaps = 46/308 (14%)

Query: 105 ILISYRKGFDPI-GDSKITSDVGWGCMLRSSQMLVAQALLFHR--LGRPWRKPLQKPFDR 161
           + ++YRKG+DPI GD+++TSD GWGC  RS QML+AQAL+ +     R  R    +P   
Sbjct: 603 VWLTYRKGYDPIHGDAQLTSDTGWGCTYRSGQMLLAQALMSNAEPSARMQRLEGVRPSTW 662

Query: 162 EYVE----ILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 215
           ++ E    +L +F DS    + FSI ++ +         G W+ P               
Sbjct: 663 QHEETKRAVLSMFQDSHDPAAFFSIQHMAETSFVVRKKPGQWLSP--------------- 707

Query: 216 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 275
                    + + I  ++  E G R    V  ++D          G+  W P LL++PL 
Sbjct: 708 -------SEVALIIRRLNPPETGMR----VRIVNDTLLSTRRILAGEP-WMPTLLMIPLR 755

Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE--SAIYLDPHDVQPVIN 333
            GL+ + P  +P     F +P  +G +GGKPG++ Y VG+  +    +YLDPH  +  ++
Sbjct: 756 AGLDTLQPESVPAFVAFFDWPWCVGAIGGKPGSAYYYVGIDHDRRRVLYLDPHTTRSRLD 815

Query: 334 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG-- 391
           +     +A   T   D ++ + +     S+ +G +  +  D  +   R  +  E+ +G  
Sbjct: 816 LSN---QAAEKTCVPDKLKSMDMSKSCSSICVGLFLPELRDLTELVQRYKR--EQLSGMW 870

Query: 392 -APLFTVT 398
             PLF V 
Sbjct: 871 STPLFHVV 878


>gi|440297742|gb|ELP90383.1| cysteine protease atg4, putative [Entamoeba invadens IP1]
          Length = 330

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 80/278 (28%), Positives = 127/278 (45%), Gaps = 29/278 (10%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDR 161
           I ++YRK    +   + TSD GWGCM+RS QM +AQ+ +   +G  W   +   Q   ++
Sbjct: 38  IWVTYRKNMKELPGGR-TSDSGWGCMIRSMQMALAQSFVSLVMGNSWKFTKTGFQVERNK 96

Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 220
            ++  I++LFGD   S FSIHNL+      G+  G W GP     S+ +        T  
Sbjct: 97  FHLRCIINLFGDGPGSLFSIHNLISRSTTRGVGDGKWWGP-----SFASEIAADHLNT-- 149

Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
                   I+V        R G  V             S+   +  P ++ VPL LG   
Sbjct: 150 --------IHVFRTRGYVARLGRIV------KPDILDISEDNGNILPTIIFVPLRLGPVN 195

Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
               + P L+  F  PQ +G+VGGKP  + +          YLDPH  Q  +++   D  
Sbjct: 196 AEEDFRPILKKVFDIPQCVGMVGGKPNLAFFFHTFDGNLLYYLDPHTTQNAVSM---DGG 252

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
               +Y  + ++ +   ++DPS+++ F  ++KDDF+ F
Sbjct: 253 WSAESYFCNDVKSMKYKNLDPSVSLLFLIKNKDDFNKF 290


>gi|443917360|gb|ELU38094.1| peptidase family c54 domain-containing protein [Rhizoctonia solani
           AG-1 IA]
          Length = 808

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 92/286 (32%), Positives = 126/286 (44%), Gaps = 69/286 (24%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 121
           F +DF+S I ++YR  + PI D+ +                                   
Sbjct: 142 FYEDFTSLIWLTYRSHYTPIRDTSLESLAPLGPCDMEMAPAHLVPASPRRWNWPGSADKS 201

Query: 122 -TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDSET-- 175
            TSD GWGCMLR+ Q L+A AL+   LGR WR+P    F  E   YV+IL  F D+ +  
Sbjct: 202 WTSDAGWGCMLRTGQSLLANALIHLHLGRNWRRPHYPMFAEEHAVYVKILTWFFDTPSPL 261

Query: 176 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ---SLPMAIYVV 232
           +PF +H +  AGKA G   G+W GP     S + LA          CQ   SL +   V 
Sbjct: 262 APFGVHRMALAGKALGKDVGTWFGPSTAAGSIKTLAHAFPE-----CQLSVSLAVDGTVF 316

Query: 233 SGDEDGERGGAPVVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLR 290
           + D         V     +       SK  G+A    +L+LV + LGL+ VNP Y   L+
Sbjct: 317 ASDVYAASHMGMVTTSGRSISSRRSASKWGGRA----VLILVNIRLGLDNVNPIYYDALK 372

Query: 291 LTFTFPQSLGIVGGKP--GASTYIVGVQEESAIYLDPHDVQPVINI 334
           +            G+P  G+S Y VG Q +S  YLDPH  +P I +
Sbjct: 373 V------------GRPRQGSSYYFVGSQADSLFYLDPHHTRPYIPL 406



 Score = 45.1 bits (105), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 17/46 (36%), Positives = 30/46 (65%)

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
           A+  T+H D +R + + ++DPS+ +GF CRD  D+ DF  R + ++
Sbjct: 519 AELRTFHCDRVRKMPMSALDPSMLLGFLCRDDADWKDFRTRVADVS 564


>gi|302833489|ref|XP_002948308.1| autophagy protein [Volvox carteri f. nagariensis]
 gi|300266528|gb|EFJ50715.1| autophagy protein [Volvox carteri f. nagariensis]
          Length = 391

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 85/155 (54%), Gaps = 18/155 (11%)

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G++K+NP Y+P L+   T+PQS+GIVGG+P AS Y+ GVQ+ S ++LDPH+ QP +  G 
Sbjct: 216 GMDKINPVYLPQLQRILTWPQSVGIVGGRPSASLYLCGVQDSSFLFLDPHEAQPTVRWGI 275

Query: 337 DDLEADT-----------------STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
                 T                 +TY  D +R +   ++DPS+AIGF C    D +D  
Sbjct: 276 AGDAGHTKEAGNGGSAVVLPASSLATYFCDTVRLMPATALDPSMAIGFLCMGAADLEDLF 335

Query: 380 ARASKLAEESNGAPLFTVTQ-THKKPVNHSDVLGE 413
            R   LA+E + APL T+T  T +  V   D  GE
Sbjct: 336 TRLDALAKEHSLAPLMTLTSGTAQAGVGLEDDFGE 370


>gi|167526339|ref|XP_001747503.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773949|gb|EDQ87583.1| predicted protein [Monosiga brevicollis MX1]
          Length = 355

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 142/323 (43%), Gaps = 32/323 (9%)

Query: 102 SSRILISYRKGFDPIGDS-KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
           S+ +  +YR     IGDS +  +D GWGC LR  QM+V +AL      R + K L  P +
Sbjct: 52  SAFLWFTYRNSEYAIGDSPRHKTDRGWGCTLRVGQMIVGEALQRCHCPRDYDK-LSYPSE 110

Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 220
              + IL  F D      S+H +    K  G  AG W  P  +          Q A   +
Sbjct: 111 AARMSILKEFEDRPDRVLSVHAMAMQSKFVGKRAGQWHTPTDVAHVLRLAVNEQEA---M 167

Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
           G Q     ++V             +V +DD  +   +F   +A     LL VPL LG++ 
Sbjct: 168 GLQ-----VHVAMD---------SMVVLDDLRK---LFRADRA----TLLFVPLRLGIDI 206

Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
           V    IP ++  F  P +LGI+GG+PGA+ Y +G  + + + LDPH  Q  +  G  D  
Sbjct: 207 VQAEMIPAVKRFFHSPSALGIMGGRPGAAHYFIGYMDHNLLLLDPHTTQDPLRAGSQDAL 266

Query: 341 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV--T 398
                    +   + LD +DP++ + F   D++    F    +   EE+ G  LF++  T
Sbjct: 267 VSCRCSRPML---LDLDKVDPTMCLAFLLTDEESLQRFADDYNASVEET-GVRLFSMLDT 322

Query: 399 QTHKKPVNHSDVLGETGGVPEDD 421
           ++    V  +  L E     +DD
Sbjct: 323 KSFASSVAVASSLAEEEEFSDDD 345


>gi|255082892|ref|XP_002504432.1| predicted protein [Micromonas sp. RCC299]
 gi|226519700|gb|ACO65690.1| predicted protein [Micromonas sp. RCC299]
          Length = 196

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 54/122 (44%), Positives = 75/122 (61%), Gaps = 11/122 (9%)

Query: 265 WTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
           W P+++LVPLVLGL++ VNPRY+P +      PQS+GI+GGKP AS Y VG Q+E   YL
Sbjct: 75  WAPLVILVPLVLGLDRCVNPRYVPGIVRMLGLPQSVGILGGKPCASLYFVGAQDEELFYL 134

Query: 324 DPHDVQPVINIGK----------DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 373
           DPH VQ  + + +          +     T TYH   + H++   +DPS+ +GFYCR + 
Sbjct: 135 DPHTVQLAVPLEQIWGCAQTGSPESGPFPTETYHCRSVLHMNARELDPSMVLGFYCRTRA 194

Query: 374 DF 375
           DF
Sbjct: 195 DF 196



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 26/45 (57%), Positives = 31/45 (68%)

Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
           F SR+ I+YR+GF  IG    T+D GWGC LRS QML+A AL  H
Sbjct: 1   FHSRVWITYRRGFPQIGGGTYTTDAGWGCTLRSGQMLLANALQSH 45


>gi|223590151|sp|A5DEF7.2|ATG4_PICGU RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|190345638|gb|EDK37561.2| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 402

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 94/328 (28%), Positives = 139/328 (42%), Gaps = 89/328 (27%)

Query: 93  GLAEFNQDFSSRILISYRKGFDPIG---------------------------------DS 119
           G +E  +    R  +SYR GF+PI                                  + 
Sbjct: 75  GDSEVQKQVKKRYWMSYRSGFEPIKKHEDGPSPLSFVQSMIFNKNVGNTFANIHSLVDND 134

Query: 120 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 178
             T+DVGWGCM+R+SQ ++A A+                 DR   E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177

Query: 179 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 235
           S+HN ++      L    G W GP A   S + L   + + T     ++P+++ V  SGD
Sbjct: 178 SLHNFVKVASDSPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232

Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
                        DD           Q    P+LLL+PL LG++ VN  Y  +L      
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
           PQS GI GGKP +S Y  G Q  S +YLDPH  Q V         A   +YHS   + + 
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV--------SAGVGSYHSSSYQKLD 322

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARAS 383
           +  +DPS+  G   ++ +D+ D   R +
Sbjct: 323 ISDMDPSMMAGIVLKNNEDYTDLKRRTT 350


>gi|342186623|emb|CCC96110.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 388

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 127/279 (45%), Gaps = 39/279 (13%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
           +  SYR+ F+P+ +   TSDVGWGC +R+ QM++A A + +R G           D   V
Sbjct: 94  LYFSYRRQFEPLRNGA-TSDVGWGCTIRACQMMLAWAFMRYRNGG------SVTMDDNVV 146

Query: 165 EIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
           + L      LF D  T+PF IH +   G  +G+  G W GP  M +   AL    R+  G
Sbjct: 147 DSLKEFTQRLFYDVPTAPFGIHAMTNEGVRHGVTCGMWFGPTPMAKVIGALNEAYRSSGG 206

Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
            G + L  +        D + G   VV     S+H             ++LL+P+ LG +
Sbjct: 207 EGPEVLVAS--------DRQIGVQDVVVRLQRSQH-------------VVLLIPVKLGPQ 245

Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 339
            V+  Y   L+  F    S+G VGG+  ++ +  G Q +  I+LDPH VQ  +       
Sbjct: 246 TVSVTYANALKRFFEMGSSIGAVGGEKNSAYFFFGYQGDKIIHLDPHYVQCALT------ 299

Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
             +++   +   R + +   + S  +GFY    D+ D F
Sbjct: 300 SPNSNGTLAGTWRSLPVMQCNTSALLGFYVSSCDELDQF 338


>gi|406698456|gb|EKD01693.1| hypothetical protein A1Q2_04064 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 1295

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 116/278 (41%), Gaps = 43/278 (15%)

Query: 82  DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 141
           D   G  A N GL+       SR       G+   G+  +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559

Query: 142 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 185
           L+   LGR WR P QKP                  YV +L  F D  S   PFS+H    
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619

Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
            GK  G   G W GP     + + LA            S P     V    DG    + V
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLA-----------NSFPPCGLSVVSAADGSVFRSEV 668

Query: 246 VCIDD-------ASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
               +        ++     S  +  W    +L+++P  LGL+ VNP Y   ++      
Sbjct: 669 YQASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK------ 722

Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
            S+GI GG+P +S Y V  Q  S  YLDPH  +P + +
Sbjct: 723 -SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 31/49 (63%)

Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           E    T+H D ++ + L  +DPS+ +GF C ++ +F+DFC R S+L  +
Sbjct: 931 ETALKTFHCDRVKKLPLSGLDPSMLLGFLCTNEAEFEDFCERVSRLPHK 979


>gi|146420060|ref|XP_001485988.1| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 402

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 94/328 (28%), Positives = 139/328 (42%), Gaps = 89/328 (27%)

Query: 93  GLAEFNQDFSSRILISYRKGFDPIG---------------------------------DS 119
           G  E  +    R  +SYR GF+PI                                  + 
Sbjct: 75  GDLEVQKQVKKRYWMSYRLGFEPIKKHEDGPLPLSFVQSMIFNKNVGNTFANIHSLVDND 134

Query: 120 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 178
             T+DVGWGCM+R+SQ ++A A+                 DR   E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177

Query: 179 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 235
           S+HN ++      L    G W GP A   S + L   + + T     ++P+++ V  SGD
Sbjct: 178 SLHNFVKVASDLPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232

Query: 236 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 295
                        DD           Q    P+LLL+PL LG++ VN  Y  +L      
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
           PQS GI GGKP +S Y  G Q  S +YLDPH  Q V         A   +YHS + + + 
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV--------SAGVGSYHSSLYQKLD 322

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARAS 383
           +  +DPS+  G   ++ +D+ D   R +
Sbjct: 323 ISDMDPSMMAGIVLKNNEDYTDLKRRTT 350


>gi|291238482|ref|XP_002739158.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
           kowalevskii]
          Length = 338

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 51/143 (35%), Positives = 83/143 (58%), Gaps = 5/143 (3%)

Query: 259 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
           S+    W  +++L+P+ LG E++NP YI  ++  FT    +GI+GGKP  S Y +G QE+
Sbjct: 156 SRSSQLWCSVIILIPVRLGGEELNPVYISCIKSLFTLKHCIGIIGGKPKHSLYFIGFQED 215

Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
             I+LDPH  Q V+++   D      ++H    R + L  +DPS  IGFYC+ +DDF +F
Sbjct: 216 KLIHLDPHLCQDVVDMRSRDFPL--QSFHCMSPRKMSLMKMDPSCTIGFYCKTQDDFKEF 273

Query: 379 CARASKLAEESNGA---PLFTVT 398
           C+ A ++ + +      P+F  +
Sbjct: 274 CSYAQEVLDSTKHVGDYPMFIFS 296



 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 40/100 (40%), Positives = 62/100 (62%), Gaps = 6/100 (6%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDE--ALGDAAGNNGLA---EFNQDFSSRILISYRKGF 113
           S+T  S  T  IWLLG C+    D+      +A ++ L     F +DF+SR+ ++YR+ F
Sbjct: 42  SQTNFSYHTP-IWLLGECYHHRPDDPNETEQSAEDDCLTPMERFKRDFTSRLWLTYRREF 100

Query: 114 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
             +  + +T+D GWGCMLRS QM++AQ+ L H LGR +++
Sbjct: 101 QQLAGTSLTTDCGWGCMLRSGQMMLAQSFLTHFLGRVYKQ 140


>gi|401886473|gb|EJT50506.1| hypothetical protein A1Q1_00204 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 1295

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 116/278 (41%), Gaps = 43/278 (15%)

Query: 82  DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 141
           D   G  A N GL+       SR       G+   G+  +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559

Query: 142 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 185
           L+   LGR WR P QKP                  YV +L  F D  S   PFS+H    
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619

Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
            GK  G   G W GP     + + LA            S P     V    DG    + V
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLA-----------NSFPPCGLSVVSAADGSVFRSEV 668

Query: 246 VCIDD-------ASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
               +        ++     S  +  W    +L+++P  LGL+ VNP Y   ++      
Sbjct: 669 YQASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK------ 722

Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
            S+GI GG+P +S Y V  Q  S  YLDPH  +P + +
Sbjct: 723 -SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 31/49 (63%)

Query: 340 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           E    T+H D ++ + L  +DPS+ +GF C ++ +F+DFC R S+L  +
Sbjct: 931 ETALKTFHCDRVKKLPLSGLDPSMLLGFLCTNEAEFEDFCERVSRLPHK 979


>gi|169622773|ref|XP_001804795.1| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
 gi|160704853|gb|EAT78153.2| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
          Length = 357

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 110/230 (47%), Gaps = 42/230 (18%)

Query: 92  NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 130
           N  + F  DF SR+ ++YR GF PI  S+                      TSD G+GCM
Sbjct: 91  NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150

Query: 131 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 190
           +RS Q ++A AL   RLGR WR   +   D+++ EIL LF D   +PFSIH  ++ G A 
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209

Query: 191 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G   G W GP A  R  + LA   R E GL        +Y VSGD      GA V   +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVY-VSGD------GADV--YE 252

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
           D  +  +V   G   W P L+LV   LG++K+ P Y   L++    P  L
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKIREMDPSML 300


>gi|336368847|gb|EGN97189.1| cysteine protease required for autophagy [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 873

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 108/353 (30%), Positives = 149/353 (42%), Gaps = 76/353 (21%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 121
           G+N    F  DF+SRI ++YR  F PI DS +                            
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350

Query: 122 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRP-WRKPLQKPFDRE---YVEILHLFG 171
                 TSD GWGCMLR+ Q L+A ALL   LGR  WR+P       +   YV+I+  F 
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRADWRRPPYPVHTTDYATYVQIITWFF 410

Query: 172 D--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI 229
           D  S  SPFS+H +  AGK  G   G W GP     + + L      E GLG       +
Sbjct: 411 DTPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGV 469

Query: 230 YVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 289
              S     +   A    I    RH  V   G+A    +++L+ + LGL+ VNP Y  T+
Sbjct: 470 IFQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTI 520

Query: 290 RLT-----------FTFPQSLGIVGGKPGASTYIV----------GVQEESAIYLDPHDV 328
           +++            T P + G     P AS  I           G  E +   LDP   
Sbjct: 521 KVSIRTLRPYRWILMTVPYTSGFNASLP-ASPEISSDMDVRELGWGDSEGAGEALDPMAE 579

Query: 329 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
             V     D L     T+H D +R + +  +DPS+ +GF C+D++D+ DF  R
Sbjct: 580 HYVNAYSPDQLR----TFHCDRVRKMPMSGLDPSMLLGFLCKDENDWFDFRRR 628


>gi|258566559|ref|XP_002584024.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237907725|gb|EEP82126.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 377

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 97/353 (27%), Positives = 136/353 (38%), Gaps = 107/353 (30%)

Query: 95  AEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 131
           A F  DF SRI I+YR  F  I  SK                        T+D GWGCM+
Sbjct: 90  AAFLDDFESRIWITYRSNFPAIPKSKDPNAQQALTFSVRLRSQLLDTRGFTTDTGWGCMI 149

Query: 132 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 190
           RS Q L+A ALL  +LGR WR+  +     + + +L LF D   +PFSIH  ++ G A  
Sbjct: 150 RSGQSLLANALLIQKLGRDWRRGSET---GKEIALLSLFADRPQAPFSIHRFVEHGAAAC 206

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
           G   G W GP        A ARC        C+   + +YV S   D           +D
Sbjct: 207 GKHPGEWFGP-------SATARCIDE-----CEHAGLNVYVTSDGSD---------VHED 245

Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 310
             R  +    G  D  P L+L+ + LG++ + P Y   L+    +PQS+GI G       
Sbjct: 246 KFRQIA----GLDDIKPTLILLGVRLGIDSITPVYWDALKAIIQYPQSVGIAG------- 294

Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
                                                      +H+  +DPS+ IGF  +
Sbjct: 295 ------------------------------------------RLHIKEMDPSMLIGFLIK 312

Query: 371 DKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 423
           + DD+ D+  R       + G P+  V      P N        G V E ++L
Sbjct: 313 NNDDWHDWKHR----VRSAPGKPIIHVFDG--GPPNFGRHFEREGAVDEVEAL 359


>gi|349580723|dbj|GAA25882.1| K7_Atg4p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 494

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 366 GFYCRDKDDF 375
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|323307493|gb|EGA60764.1| Atg4p [Saccharomyces cerevisiae FostersO]
          Length = 494

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 366 GFYCRDKDDF 375
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|323335883|gb|EGA77161.1| Atg4p [Saccharomyces cerevisiae Vin13]
          Length = 494

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 366 GFYCRDKDDF 375
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|37362688|ref|NP_014176.2| Atg4p [Saccharomyces cerevisiae S288c]
 gi|61252248|sp|P53867.2|ATG4_YEAST RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|166990654|sp|A6ZRL7.1|ATG4_YEAS7 RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|1173491|gb|AAA86498.1| ORF494 [Saccharomyces cerevisiae]
 gi|151944321|gb|EDN62599.1| cysteine protease [Saccharomyces cerevisiae YJM789]
 gi|190409197|gb|EDV12462.1| anchor protein [Saccharomyces cerevisiae RM11-1a]
 gi|285814439|tpg|DAA10333.1| TPA: Atg4p [Saccharomyces cerevisiae S288c]
 gi|323352870|gb|EGA85172.1| Atg4p [Saccharomyces cerevisiae VL3]
 gi|392297128|gb|EIW08229.1| Atg4p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 494

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 366 GFYCRDKDDF 375
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|365763488|gb|EHN05016.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 494

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 366 GFYCRDKDDF 375
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|323346814|gb|EGA81093.1| Atg4p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 494

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 366 GFYCRDKDDF 375
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|1183991|emb|CAA93375.1| N1274 [Saccharomyces cerevisiae]
 gi|1302243|emb|CAA96126.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 506

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 97  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 369

Query: 366 GFYCRDKDDF 375
           G   + + D+
Sbjct: 370 GILIKGEKDW 379


>gi|256272398|gb|EEU07381.1| Atg4p [Saccharomyces cerevisiae JAY291]
          Length = 494

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 366 GFYCRDKDDF 375
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|323303340|gb|EGA57136.1| Atg4p [Saccharomyces cerevisiae FostersB]
          Length = 494

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGBIYENEVEKV 257

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 366 GFYCRDKDDF 375
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|259149141|emb|CAY82383.1| Atg4p [Saccharomyces cerevisiae EC1118]
          Length = 506

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 97  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 369

Query: 366 GFYCRDKDDF 375
           G   + + D+
Sbjct: 370 GILIKGEKDW 379


>gi|145526665|ref|XP_001449138.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416715|emb|CAK81741.1| unnamed protein product [Paramecium tetraurelia]
          Length = 406

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 158/373 (42%), Gaps = 60/373 (16%)

Query: 92  NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
           N + +  QD    I I+YR+ F P+  S   SD GWGCMLR  QM +AQ L  H      
Sbjct: 57  NKIKQLVQD---TIWITYRRNFPPLYQSNYISDTGWGCMLRVGQMAMAQMLKKHLKNHGD 113

Query: 152 RKPLQKPFDREYVEILHLFGDSETS----------------------PFSIHNL-LQAGK 188
           ++      D +Y  IL  F D+++                       PFSI  +   A K
Sbjct: 114 KR------DEDYDNILLAFADNDSQECKEFIEFQNKKEKQKVHNFICPFSIQKIAYLAKK 167

Query: 189 AYGLAAGSWVGPYAM------------CRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 236
            + L  G W  P  +             R+ E L      ++ L    L   ++ +  + 
Sbjct: 168 EFNLDPGEWYKPNYILFLLEELHNTIPIRASENLKLSVFNDSCLFLDQLMNRMFDIKFET 227

Query: 237 DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 296
           D +        +++      + SK       + + V   +GL++ N +Y+  L      P
Sbjct: 228 DKD--------LEEQLEKTQLKSKN-----SLAIFVLTRIGLDEPNQKYLKVLDELMELP 274

Query: 297 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK--DDLEADTSTYHSDVIRHI 354
              GIVGG P  + YI+G   +  IYLDPH VQ   N G+  ++   + ++Y    I  +
Sbjct: 275 YFQGIVGGTPKRAFYILGRINDHYIYLDPHYVQEAENKGQIIENKMFNRTSYSCKYIHLL 334

Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGET 414
           +   +D S+ + +Y R+K +   F     K+ ++S+   +F ++ T  + V++S+ L E+
Sbjct: 335 NQKHVDTSMGLSYYIRNKSELLQFWRDMKKIKQKSDDFFIF-LSDTTPEYVDYSNQLEES 393

Query: 415 GGVPEDDSLGVMS 427
                DD +  + 
Sbjct: 394 SNKLNDDDVVFLQ 406


>gi|401624007|gb|EJS42084.1| atg4p [Saccharomyces arboricola H-6]
          Length = 494

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 86/310 (27%), Positives = 132/310 (42%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 126
           EF  D  SR+  +YR  F PI     G S ++                        +D+G
Sbjct: 85  EFLLDVRSRVNFTYRTRFIPIPRAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 144

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R   +K   RE  +I+  F D+  +PFSIHN +  
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVDNEKSLKRES-KIVTWFNDTPEAPFSIHNFVST 203

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L           C      + V SG  D  +     
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIYGFPE-----CGITDCVVSVSSG--DIYQNEVEK 256

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
           + +++               + IL L+ + LG+  VN  Y  ++       +S+GI GG+
Sbjct: 257 IYVENPD-------------SIILFLLGVKLGINAVNESYRESICGILNSARSVGIAGGR 303

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    +Y DPH  QP +       E+   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNQFLYFDPHIPQPAVE------ESFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 366 GFYCRDKDDF 375
           G   + ++D+
Sbjct: 358 GVLIKGEEDW 367


>gi|167521501|ref|XP_001745089.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776703|gb|EDQ90322.1| predicted protein [Monosiga brevicollis MX1]
          Length = 392

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 86/310 (27%), Positives = 142/310 (45%), Gaps = 49/310 (15%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 155
           +   D ++RI  +YRK F P+  S+ T+DVGWGCMLR  QM++A  L+           +
Sbjct: 119 QLEDDVATRIWFTYRKDFPPLPSSRRTTDVGWGCMLRCGQMILATTLM----------AV 168

Query: 156 QKPFDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 214
            +P       + HL        +++ N  L+AG+  G ++   VG   + +   ALA+  
Sbjct: 169 LQP------RVHHLLK------YTMENHHLKAGRFQGPSS---VGSALLHQVPSALAQLN 213

Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 274
           +       + + +  Y  S            + I D  R      +GQA++ PI+L++PL
Sbjct: 214 QFRD----EEVKLRTYFASD----------TLVILDQLRP----EEGQAEFEPIMLVLPL 255

Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
            LG+EK+ P+Y   L+L    P  +G +GG    + YI G Q      LDPH     +  
Sbjct: 256 RLGIEKIGPQYHARLQLLLRQPWCMGFIGGHDKRAMYIFGYQGHQYFGLDPHRCSAAVAQ 315

Query: 335 GKDDLE----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEES 389
              +L         ++H+  +  I  D +DPSLA+    R  ++ DD  +   +  +E+ 
Sbjct: 316 STAELRDRWVEVRDSFHTSKLSGIERDDLDPSLAVFLLARTAEELDDMLSVIGQPTSEDR 375

Query: 390 NGAPLFTVTQ 399
            G  L +V Q
Sbjct: 376 PGPALVSVVQ 385


>gi|260823874|ref|XP_002606893.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
 gi|229292238|gb|EEN62903.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
          Length = 384

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 51/139 (36%), Positives = 80/139 (57%), Gaps = 6/139 (4%)

Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
           +W  +++L+P+ LG E +NP Y P ++  FT    LG++GG+P  S Y VG QE+  I+L
Sbjct: 203 NWCSVIILIPVRLGGESLNPIYEPCIKGLFTMDHCLGVIGGRPKHSLYFVGFQEDKLIHL 262

Query: 324 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 383
           DPH  Q V+++   D   +  ++H    R + +  +DPS  IGFYCR +DDF+ FC   +
Sbjct: 263 DPHFCQEVVDMTPRDFPLE--SFHCMNPRKMSIARMDPSCTIGFYCRTRDDFNKFCTTVT 320

Query: 384 KLAEESNGA----PLFTVT 398
           +      G     P+F V+
Sbjct: 321 EEMLRQPGPKADYPMFIVS 339



 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 7/91 (7%)

Query: 70  IWLLGVCHKIAQDE------ALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKIT 122
           IWL GVC+    +E       L D+       E F +DF+S++ ++YR+ F  +  S  T
Sbjct: 88  IWLQGVCYHRRNEELTKELEPLTDSDRRLYTMELFKRDFASKVWLTYRREFPQLAGSMFT 147

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
           +D GWGCMLRS QML+A  L+ H LGR +++
Sbjct: 148 TDCGWGCMLRSGQMLLAGGLVMHFLGRVYKQ 178


>gi|402593880|gb|EJW87807.1| hypothetical protein WUBG_01286, partial [Wuchereria bancrofti]
          Length = 216

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 52/154 (33%), Positives = 84/154 (54%), Gaps = 14/154 (9%)

Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
           +W P+L+++PL LGL  +N  Y P ++  F  PQ +GI+GG+P  + Y  G+ + + +YL
Sbjct: 28  EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 87

Query: 324 DPHDVQPVINIG--------KDDL------EADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
           DPH  Q  +++         +DD       E   STYH   I    +D +DPSLA+GF+C
Sbjct: 88  DPHFCQNFVDLDETTTTRDERDDYVEIKNDEFKDSTYHCPFILSTKIDKVDPSLALGFFC 147

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 403
             +DD+++   R       ++  PLF + +T  K
Sbjct: 148 HTEDDYNELAKRLRTHLLPASTPPLFEMLETRPK 181


>gi|145553267|ref|XP_001462308.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124430147|emb|CAK94935.1| unnamed protein product [Paramecium tetraurelia]
          Length = 389

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 93/340 (27%), Positives = 137/340 (40%), Gaps = 45/340 (13%)

Query: 87  DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-LFH 145
           D A +  + +    F   I  SYR     +  S +TSD GWGCMLR  QM + Q +  F+
Sbjct: 47  DLAVDQKMEKLKSLFEGTIWFSYRSKILQLQYSTLTSDTGWGCMLRVGQMAMCQQIKYFY 106

Query: 146 RLGRPWRKPLQKPFDREYVEILHLFGDSE-------------------TSPFSIHNLL-Q 185
            L             +E  E++  F D++                    SPFSI  ++ Q
Sbjct: 107 NLSSS----------QELTELIQQFADNDEEELSKFMDRNDGDQTIQYKSPFSIQKIVVQ 156

Query: 186 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED---GERGG 242
                  + G W  P  +    + L R  + +  L         +++S        + GG
Sbjct: 157 TKLELQKSPGEWYKPNDILFVLKYLFRYSKYQKNLRMHINHENAFILSDVISLMFNKNGG 216

Query: 243 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 302
                  D         KGQ D   + + +   +GL+  N  Y+  L    T+PQ  GI+
Sbjct: 217 -------DEEWLKEQIEKGQNDEFGVSIFILTRIGLDTCNQEYLKVLNDIMTYPQFQGIL 269

Query: 303 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 362
           GG P  + YI+G      IYLDPH VQ   N    ++E D S+Y    I+ I  + +DPS
Sbjct: 270 GGFPNKALYILGRVGNYYIYLDPHYVQNAQNY--QEMENDRSSYTCQSIQLIDSNQLDPS 327

Query: 363 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT-VTQTH 401
           +AI F C           R  K  +  NG   F  +T+TH
Sbjct: 328 MAISF-CVKNALDLLDLWRRLKQTKSENGESFFMALTETH 366


>gi|358339268|dbj|GAA47364.1| autophagy-related protein 4 [Clonorchis sinensis]
          Length = 700

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 59/149 (39%), Positives = 84/149 (56%), Gaps = 10/149 (6%)

Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAI 321
           A W P+LL +PL LGL + NP Y   ++     P S+GI+GG+P  + +IVG   +E  +
Sbjct: 259 ATWRPLLLFIPLRLGLHQPNPCYFNAIKAILQIPHSIGIMGGRPSHAVWIVGTAGDEDLL 318

Query: 322 YLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
            LDPH  QP     +DDL A D  T+H D    + L+ +DPS+ IGF C  +D+FD  CA
Sbjct: 319 CLDPHTTQPA---SQDDLTAEDDVTHHCDCPVRLPLERLDPSMVIGFVCTTEDEFDQLCA 375

Query: 381 RASK---LAEESNGAPLFTVTQTHKKPVN 406
              +     E + G PLF V ++  +P N
Sbjct: 376 HLERDVLSVETTCGHPLFEVHKS--RPSN 402



 Score = 41.2 bits (95), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 35/66 (53%), Gaps = 3/66 (4%)

Query: 136 MLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
           M++A+A+    LG+ WR  P  +  D  Y  +  +F D ++S +SI N+   G A     
Sbjct: 1   MMLAEAITRIHLGKDWRWTPGCQ--DEAYCRLRRMFQDHKSSLYSIQNITMLGMALDKPI 58

Query: 195 GSWVGP 200
           GSW GP
Sbjct: 59  GSWFGP 64


>gi|390344344|ref|XP_786847.3| PREDICTED: uncharacterized protein LOC581768 [Strongylocentrotus
           purpuratus]
          Length = 1018

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 57/145 (39%), Positives = 81/145 (55%), Gaps = 10/145 (6%)

Query: 70  IWLLGVC-HKIAQDEALGDAAGNNGLAE-----FNQDFSSRILISYRKGFDPIGDSKITS 123
           IW LG C H+  +D       G + +       F QDFSSR+ ++YR+ F  +  S  TS
Sbjct: 346 IWFLGKCYHQRPEDPDPERPPGMDSVRSMVIEMFKQDFSSRLWMTYRREFPTLAGSNFTS 405

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDS--ETSPFS 179
           D GWGCMLRS QM++A +L+ H LGR W   KP  +   + + +I+  FGD   + SPFS
Sbjct: 406 DCGWGCMLRSGQMMLAHSLILHFLGREWNIYKPQTQEMLQFHRQIVRWFGDQPLDMSPFS 465

Query: 180 IHNLLQAGKAYGLAAGSWVGPYAMC 204
           +H L+  G+  G   G W GP ++ 
Sbjct: 466 VHRLVGIGQNNGKKVGDWYGPSSVA 490



 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 54/154 (35%), Positives = 83/154 (53%), Gaps = 6/154 (3%)

Query: 248 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 307
           ID +    S  ++G   W  +++++P+ LG ++VNP YI  ++  FT    LGI+GGKP 
Sbjct: 819 IDPSRSRTSTSTEGGKPWCAVVIMIPVRLGGDEVNPVYIRPIQSLFTLESCLGIIGGKPK 878

Query: 308 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
            S + VG QEE  I+LDPH  Q V+++   D      ++H    R + +  +DPS  IGF
Sbjct: 879 HSLFFVGFQEEKLIHLDPHYCQQVVDMKTRDFPL--WSFHCMSPRKMSISKMDPSCTIGF 936

Query: 368 YCRDKDDFDDFCAR----ASKLAEESNGAPLFTV 397
           Y R ++ F+  C       S L   S+  P+F V
Sbjct: 937 YIRTEEQFEQLCKELPTVVSPLGSHSSDYPMFIV 970


>gi|403216261|emb|CCK70758.1| hypothetical protein KNAG_0F00890 [Kazachstania naganishii CBS
           8797]
          Length = 448

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 95/363 (26%), Positives = 154/363 (42%), Gaps = 68/363 (18%)

Query: 92  NGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------------IT 122
           N   +F +D  +R+  +YR  F PI  S                                
Sbjct: 38  NEKMQFYRDVCTRLNFTYRTKFVPISRSPDGPSPISFQLMIRDGPLSVIENALLHPDCFN 97

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 182
           +D+GWGCM+R+ Q L+  AL   R GR +R       D    +I+  F D+  +PFS+HN
Sbjct: 98  TDIGWGCMIRTGQSLLGNALQRLRHGREFRVTESTHDD----DIIQWFKDTPDAPFSLHN 153

Query: 183 LLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 241
            ++ G +   +  G W GP A  RS ++L  C   + G+        I  VS  +  ++ 
Sbjct: 154 FVKKGVELADMKPGQWFGPAATSRSIQSLI-CNFPQCGID-----HCIVSVSSADIYKQD 207

Query: 242 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 301
              +   D  S               +L+L  + LG+  VN  Y   +R       S+GI
Sbjct: 208 VEDMFDADPDSN--------------LLILFGVKLGVSAVNASYWEDIRRLLNSKFSVGI 253

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
            GG+P +S Y  G Q +  +Y DPH  QP +    DD  A  +T HS     + L  +DP
Sbjct: 254 AGGRPSSSLYFFGYQNQELLYFDPHTPQPSL---IDD--AAFNTCHSIEFGKLELRDMDP 308

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD 421
           S+ IG     + D++++    ++  E S    +F + +   +   + DV  +  G   D+
Sbjct: 309 SMLIGIMIEGERDWENW----ARFTETSK---IFNILEERSEDCINVDV--DIDGDENDE 359

Query: 422 SLG 424
           ++G
Sbjct: 360 NIG 362


>gi|407408842|gb|EKF32115.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
          Length = 357

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 74/277 (26%), Positives = 125/277 (45%), Gaps = 35/277 (12%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 161
           +  SYR    P+ +   T+D+ WGCM+R+ QM++A A + +   G P    + LQ+   R
Sbjct: 74  LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGNERLQELRAR 132

Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
                  LF D  ++PF IH +   G  +G+  G W GP  + ++  AL           
Sbjct: 133 TQT----LFCDVPSAPFGIHAITSEGTKHGVKCGEWFGPTPIAKTLNAL----------- 177

Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
                MA Y+ +G E     G  V+   +         +       ++LL+P++LG+  +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEQVKELLRQSMHVVLLIPVMLGIRVI 227

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
           + +Y   ++       S+GI+GGK  ++ ++ G Q++   +LDPH VQP      +  E 
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHRVQPAFTSSGNSGEL 287

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
             +       R +   S D S+ +GFY    D F  F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSFAVF 318


>gi|149422017|ref|XP_001518728.1| PREDICTED: cysteine protease ATG4D-like [Ornithorhynchus anatinus]
          Length = 286

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 50/141 (35%), Positives = 79/141 (56%), Gaps = 3/141 (2%)

Query: 262 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 321
           +A+W  I++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +
Sbjct: 109 EAEWKSIIILVPVRLGGETLNPAYMPCIKELLRMEPCLGIIGGKPKHSLYFIGYQDDFLL 168

Query: 322 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
           YLDPH  QP ++  KD    +  ++H    R +    +DPS  +GFY   + DF+  C++
Sbjct: 169 YLDPHYCQPCVDTMKDSFPLE--SFHCTAPRKLPFAKMDPSCTVGFYAGTRKDFEALCSQ 226

Query: 382 -ASKLAEESNGAPLFTVTQTH 401
               L   +   P+FTV + H
Sbjct: 227 LLQALNSTATRYPMFTVAEGH 247


>gi|340059839|emb|CCC54236.1| putative peptidase [Trypanosoma vivax Y486]
          Length = 354

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 133/299 (44%), Gaps = 32/299 (10%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA-LLFHRLGRPWRKPLQKPFDREY 163
           +  SYR GF P+ +   T+DV WGC++R++QML+AQA + F   G  +         RE 
Sbjct: 69  LYFSYRCGFTPLSNGS-TTDVAWGCVVRAAQMLLAQAHMRFFNSGHAFVDGSALQILREK 127

Query: 164 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
           V+   LF D  ++PF IH +    + YG+A G W G     ++  +L +      G G  
Sbjct: 128 VQ--PLFLDDPSAPFGIHAMTSEAEKYGVACGQWFGMTPAAKTIASLCQQHSLRGGNG-- 183

Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 283
             P  +  V    D E     V  +   SR              ++LL+P VLGL++++ 
Sbjct: 184 --PAVLVFV----DREVSALKVRDLLSHSRQ-------------VVLLIPAVLGLDRISV 224

Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 343
           +Y   L         +G++GG+  ++ Y VG Q  + IYLDPH  Q          E  T
Sbjct: 225 KYSKMLIRCLEMESCIGVIGGRKSSALYFVGHQSNNIIYLDPHRAQRAFTEVASPGEL-T 283

Query: 344 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 402
             +H      + + +   S+  GFY    + F  F A   + A  +   PL +V  + +
Sbjct: 284 GAWHL-----LPVTACSTSILFGFYIDSLESFKQFEADMLE-ANSALAFPLISVATSER 336


>gi|170572866|ref|XP_001892265.1| Peptidase family C54 containing protein [Brugia malayi]
 gi|158602497|gb|EDP38912.1| Peptidase family C54 containing protein [Brugia malayi]
          Length = 440

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 52/155 (33%), Positives = 83/155 (53%), Gaps = 16/155 (10%)

Query: 264 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 323
           +W P+L+++PL LGL  +N  Y P ++  F  PQ +GI+GG+P  + Y  G+ + + +YL
Sbjct: 252 EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 311

Query: 324 DPHDVQPVINIG---------------KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
           DPH  Q  +++                K+D E   STYH   I    +D +DPSLA+GF+
Sbjct: 312 DPHFCQNFVDLDEATTTKDERGDYVEIKND-EFRDSTYHCPFILSTKIDKVDPSLALGFF 370

Query: 369 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 403
           C  +DD+ +   R       ++  PLF + +T  K
Sbjct: 371 CHTEDDYSELANRLRTHLLPASTPPLFEMLETRPK 405



 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 44/122 (36%), Positives = 57/122 (46%), Gaps = 28/122 (22%)

Query: 85  LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
           LG+   + G +A   +  +S +  +YRK F PIG +  T+D GWGCMLR  QML+A+ L+
Sbjct: 59  LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 118

Query: 144 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
              LGR W       +DR     EY  IL   G SE                G   G W 
Sbjct: 119 VRHLGRNWL------WDRDVMLTEYKRILPNMGVSE----------------GKEIGEWF 156

Query: 199 GP 200
           GP
Sbjct: 157 GP 158


>gi|216963257|gb|ACJ73915.1| autophagy-related 4b variant 3 [Zea mays]
          Length = 178

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/77 (64%), Positives = 61/77 (79%)

Query: 81  QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQ 140
           ++E  G +  ++G A F +DFSSRI I+YRKGFD I  SK+TSDV WGCM+RSSQMLVAQ
Sbjct: 99  EEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQ 158

Query: 141 ALLFHRLGRPWRKPLQK 157
           AL+FH LGR WRKP +K
Sbjct: 159 ALIFHHLGRSWRKPSEK 175


>gi|123407417|ref|XP_001303004.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121884346|gb|EAX90074.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 298

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 81/304 (26%), Positives = 135/304 (44%), Gaps = 46/304 (15%)

Query: 108 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVE 165
           +Y K F P+     T+D  WGC +RS+Q L+ Q +  L+  LG   R     P + +Y  
Sbjct: 28  TYHKNFAPL-QGGFTTDKNWGCCIRSAQGLIMQFITKLYKHLGDDIRNIF--PTNSKY-- 82

Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 225
              LF D   SPF + ++    ++YG+  G WV P  +    + +    R          
Sbjct: 83  --ELFYDLPHSPFGLPHICAELQSYGVMPGEWVKPSLLAPVIKEIMNFFRI--------- 131

Query: 226 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 285
                             PVV  +       V ++  +   P+LLL  L+LG E    +Y
Sbjct: 132 ------------------PVVIAEHGCLSREVLNEALSHNIPVLLLFTLMLGYENFELKY 173

Query: 286 IPTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 344
           +P L+LT +   QS+G+VGG+ G + +IVG Q+E  +Y DPHDV    +I K D     +
Sbjct: 174 LPFLKLTLSLIYQSVGVVGGQQGKAYFIVGHQKEKLLYFDPHDVNE--SITKID---QIN 228

Query: 345 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 404
                 ++ +  D++  S+ +GF+  +  D ++       L  +S   P+  V +  +  
Sbjct: 229 QLFKPPLKVMPADTLSSSMLVGFFITNLQDAEEL----PMLLNQSGECPIHIVDKIEEAK 284

Query: 405 VNHS 408
             H+
Sbjct: 285 ETHT 288


>gi|119604525|gb|EAW84119.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_d
           [Homo sapiens]
          Length = 360

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 86/149 (57%), Gaps = 7/149 (4%)

Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 181 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 240

Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 241 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 298

Query: 383 SKLAEESNGA---PLFTVTQTHKKPVNHS 408
           +++   S+     P+FT+ + H +  +HS
Sbjct: 299 TRVLSSSSATERYPMFTLAEGHAQ--DHS 325



 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
           +S S I  + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+    +TSD
Sbjct: 86  TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 139

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR 152
            GWGCMLRS QM++AQ LL H L R ++
Sbjct: 140 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 167


>gi|426387285|ref|XP_004060104.1| PREDICTED: cysteine protease ATG4D [Gorilla gorilla gorilla]
          Length = 362

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 86/149 (57%), Gaps = 7/149 (4%)

Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 183 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 242

Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 243 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 300

Query: 383 SKLAEESNGA---PLFTVTQTHKKPVNHS 408
           +++   S+     P+FT+ + H +  +HS
Sbjct: 301 TRVLSSSSATERYPMFTLAEGHAQ--DHS 327



 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
           +S S I  + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+    +TSD
Sbjct: 88  TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 141

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR 152
            GWGCMLRS QM++AQ LL H L R ++
Sbjct: 142 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 169


>gi|151556001|gb|AAI49850.1| ATG4D protein [Bos taurus]
          Length = 359

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 47/144 (32%), Positives = 83/144 (57%), Gaps = 5/144 (3%)

Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 180 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 239

Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 382
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 240 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 297

Query: 383 SKLAEESNGA---PLFTVTQTHKK 403
           +++   S+     P+FT+ + H +
Sbjct: 298 TRVLSSSSATERYPMFTLVEGHAQ 321



 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 124
           +S S I  + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+    +TSD
Sbjct: 85  TSFSKISSVHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAGGSLTSD 138

Query: 125 VGWGCMLRSSQMLVAQALLFHRLGRPWR 152
            GWGCMLRS QM++AQ LL H L R ++
Sbjct: 139 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 166


>gi|28395487|gb|AAO39081.1| autophagy protein 4 [Dictyostelium discoideum]
          Length = 745

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 81/135 (60%), Gaps = 5/135 (3%)

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++++PL LG +K+N  YI  L+L    PQSLG +GGKP  S Y +G Q++  IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH VQ  +N    D    ++TY   + + +    +DPSL+IGFYCRD+  F+D C R S 
Sbjct: 563 PHFVQESVNPNSFDY---SNTYSGCIPQKMPFTQLDPSLSIGFYCRDQASFEDLCDRLSV 619

Query: 385 LAEESNGAPLFTVTQ 399
           +   +   P+ +V Q
Sbjct: 620 I--NNCEFPIISVCQ 632



 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 154
           F  D +S I  SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H        P  
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289

Query: 155 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 189
             +KP    Y ++L  F D  S+   + IH ++   +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326


>gi|66822477|ref|XP_644593.1| autophagy protein 4 [Dictyostelium discoideum AX4]
 gi|66822607|ref|XP_644658.1| autophagy protein 4 [Dictyostelium discoideum AX4]
 gi|74857708|sp|Q557H7.1|ATG4_DICDI RecName: Full=Cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|60472726|gb|EAL70676.1| autophagy protein 4 [Dictyostelium discoideum AX4]
 gi|60472781|gb|EAL70731.1| autophagy protein 4 [Dictyostelium discoideum AX4]
          Length = 745

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 81/135 (60%), Gaps = 5/135 (3%)

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           W  +++++PL LG +K+N  YI  L+L    PQSLG +GGKP  S Y +G Q++  IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           PH VQ  +N    D    ++TY   + + +    +DPSL+IGFYCRD+  F+D C R S 
Sbjct: 563 PHFVQESVNPNSFDY---SNTYSGCIPQKMPFTQLDPSLSIGFYCRDQASFEDLCDRLSV 619

Query: 385 LAEESNGAPLFTVTQ 399
           +   +   P+ +V Q
Sbjct: 620 I--NNCEFPIISVCQ 632



 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 154
           F  D +S I  SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H        P  
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289

Query: 155 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 189
             +KP    Y ++L  F D  S+   + IH ++   +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326


>gi|363754893|ref|XP_003647662.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356891299|gb|AET40845.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 469

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 96/332 (28%), Positives = 145/332 (43%), Gaps = 54/332 (16%)

Query: 96  EFNQDFSSRILISYRKGFDPI-----GDSKI------------------------TSDVG 126
           EF +D +SR+  +YR  F PI     G S +                         +D+G
Sbjct: 62  EFLKDVNSRLHFTYRTRFAPIPRHIDGPSPMRISILLRDNPLNVIENVLNNLDCFQTDIG 121

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 185
           WGCM+R+ Q L+A AL    LGR +R        +   ++I+  F D+   PFS+H  +Q
Sbjct: 122 WGCMIRTGQSLLANALQLANLGRDFRISGSDSDINEVEMKIIRWFEDNPKHPFSLHKFVQ 181

Query: 186 AG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 244
            G K  G   G W GP A+ RS  +L           C        ++S D       + 
Sbjct: 182 EGYKLSGKKPGEWFGPSAISRSIRSLVMKFPGSGIDHC--------IISTD-------SA 226

Query: 245 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
            V +D+         K        LLL+ + LG++  N  Y   ++   +  QS+GI GG
Sbjct: 227 DVYLDEIDPLFRANPKANV-----LLLLGVRLGVDFTNEYYWDDIKNILSSSQSVGISGG 281

Query: 305 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 364
           +P +S Y  G Q +   YLDPH VQ  + + + D E    + H      IHL +IDPS+ 
Sbjct: 282 RPSSSLYFFGYQGDYLFYLDPHKVQLNLALYESD-EERFHSVHPQTFNKIHLSAIDPSML 340

Query: 365 IGFYCRDKDDFDDF--CARASKLAEESNGAPL 394
           +GF    +DD+  +      SK+   S+  P+
Sbjct: 341 LGFLLTGEDDWLSWKTTVLGSKIIHLSDSKPV 372


>gi|365758760|gb|EHN00587.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 485

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 128/310 (41%), Gaps = 57/310 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 76  EFLLDVRSRVNFTYRTRFVPIARAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 135

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R      F RE   I++ F D+  +PFS+HN +  
Sbjct: 136 WGCMIRTGQSLLGNALQILHLGRDFRVDEDDDFRRE-SRIVNWFNDTPEAPFSLHNFVST 194

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS + L      E G+        + V SG  D        
Sbjct: 195 GTELSDKRPGEWFGPAATARSIQYLIY-GFPECGINA----CIVSVSSG--DIYENEVEE 247

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
           V +D+ +             + IL L+ + LG+  VN  Y  ++        S+GI GG+
Sbjct: 248 VFVDNPN-------------SSILFLLGVKLGINAVNESYRESICGILNSAWSVGIAGGR 294

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++  ++ H+     + L  +DPS+ I
Sbjct: 295 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVNSCHTSKFGRLQLSEMDPSMLI 348

Query: 366 GFYCRDKDDF 375
           G   + + D+
Sbjct: 349 GVLIKGEKDW 358


>gi|255722127|ref|XP_002545998.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240136487|gb|EER36040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 444

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 91/312 (29%), Positives = 136/312 (43%), Gaps = 71/312 (22%)

Query: 103 SRILISYRKGFDPIGDSK----------------------------------ITSDVGWG 128
           SR+ +SYR GFDPI  ++                                   TSD GWG
Sbjct: 84  SRLWLSYRCGFDPIPKAEDGPQPIQFFPSIIFNKTTIYSNFANLKSLFDKENFTSDAGWG 143

Query: 129 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AG 187
           CM+R+SQ L+A  LL              P D +  +++ LF D+++SPFSIHN ++ AG
Sbjct: 144 CMIRTSQNLLANTLL-----------QLLPPDSKQ-DVIGLFQDNQSSPFSIHNFIKVAG 191

Query: 188 KA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
           ++   +  G W GP A   S + L    + +   G +   + I   S   DGE       
Sbjct: 192 ESPLQVKPGQWFGPNAASLSIKRLTDTLQDKEIKGVKYPKVFISENSDLYDGEINE---- 247

Query: 247 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 306
            + +  R              +L+L P+ LG++KVN  Y  ++        S GI GGKP
Sbjct: 248 ILSEEGRS-------------VLVLFPIRLGIDKVNSYYYDSIFQVLKSKFSCGISGGKP 294

Query: 307 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 366
            +S Y +G      IY DPH  Q V N        +  +YH+     +++  +DPS+ IG
Sbjct: 295 SSSFYFLGYDNSDLIYFDPHLPQLVEN------PINIESYHTRNYNRLNISLLDPSMMIG 348

Query: 367 FYCRDKDDFDDF 378
              R  DD+ +F
Sbjct: 349 ILLRSMDDYLEF 360


>gi|410075557|ref|XP_003955361.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
 gi|372461943|emb|CCF56226.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
          Length = 463

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 90/312 (28%), Positives = 129/312 (41%), Gaps = 67/312 (21%)

Query: 92  NGLAEFNQDF----SSRILISYRKGFDPIGDSK--------------------------- 120
           N  +  NQDF    +SR+  +YR  F PI  S                            
Sbjct: 52  NRNSNLNQDFLSDVNSRLAFTYRTKFQPILRSSEGPSPLNFRMIFRDNPINTLENVINNP 111

Query: 121 --ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
               +D+GWGCM+R+ Q L+  AL   +LGR +R  L      +  EI+  F D+   PF
Sbjct: 112 DCFNTDIGWGCMIRTGQSLLGNALQLAKLGRHFR--LDNKMGIKDDEIISWFRDTTQEPF 169

Query: 179 SIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
           SIH  ++ G K      G W GP A   S ++L   +  E G+        + V SGD  
Sbjct: 170 SIHKFVEKGNKLANKKPGEWFGPAATSISIQSLIE-EFPECGID----KCLVSVSSGD-- 222

Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
                      +D  R   +F +     + IL L+ + LGL+ VN  Y   +        
Sbjct: 223 ---------IFEDDVRE--IFEENMD--SKILFLMGVKLGLDAVNSFYWEDILNILDSKF 269

Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY---HSDVIRHI 354
           S+GI GG+P +S Y  G Q    +Y DPH  QP +         D S Y   H+     +
Sbjct: 270 SVGIAGGRPSSSLYFFGHQGNELLYFDPHRPQPSL--------VDPSVYETCHTTNFGKL 321

Query: 355 HLDSIDPSLAIG 366
            +  +DPS+ IG
Sbjct: 322 DIKDMDPSMLIG 333


>gi|145510316|ref|XP_001441091.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408330|emb|CAK73694.1| unnamed protein product [Paramecium tetraurelia]
          Length = 392

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 143/340 (42%), Gaps = 41/340 (12%)

Query: 86  GDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
            DA     + +  Q  S  I  SYRK       S +TSD GWGCM+R +QM +AQ +   
Sbjct: 46  NDADIEQRIEKVKQTCSKIIWFSYRKNIPKFQVSSLTSDTGWGCMIRVAQMALAQII--- 102

Query: 146 RLGRPWRKPLQ-----KPF----DREYVEILHLFGDSET----SPFSIHNLLQAGKA-YG 191
           R    ++KP Q     + F    D E  + +  F  ++     +PFSI  ++   K    
Sbjct: 103 RYYNYFKKPEQLIVLIRHFIDDDDNELTDFIQQFHKNQNQYYHAPFSIQKIVHYAKVELK 162

Query: 192 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----------VSGDEDGER 240
              G W     + ++ + L +  +        SL M IY+           +    + + 
Sbjct: 163 KEPGDWYKSDEILQTLDYLFKYSQY-------SLNMEIYINYDCAFILQDAIQQMFNQQE 215

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
           G    + + + +++ + F     D   I + +P  +GL+ +N  Y+  L      P   G
Sbjct: 216 GNE--IWLKERAKNNNQFDL--QDHKGICIFLPTRIGLQNINKDYLEVLNQIIALPYFQG 271

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           ++GG    + Y VG  ++  IYLDPH VQ   N   DDL  + ++Y    I+ IH   ID
Sbjct: 272 MIGGVSKRALYFVGRIQDYLIYLDPHFVQNAQNF--DDLSKNQASYTCQNIQLIHNSLID 329

Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
           PS+ +    R+  +  D         +E      F++ +T
Sbjct: 330 PSIVVCLCIRNALELLDLWQIFQHFKQEYQDLFFFSLLET 369


>gi|118390095|ref|XP_001028038.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89309808|gb|EAS07796.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 1216

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 87/362 (24%), Positives = 154/362 (42%), Gaps = 79/362 (21%)

Query: 99  QDFSSRILISYRKGFDPIGDSKI-------TSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
           Q + + IL +YRK F P+   KI       TSD GWGCM+R+ QM+ AQ +  H     +
Sbjct: 257 QIYQNTILFTYRKNFYPLLKDKINDPQKNQTSDAGWGCMIRAGQMIFAQTIKRHLKKTDY 316

Query: 152 RKP----------LQKPFDRE----YVEILHLFGDSETSPFSIHNLL-QAGKAYGLAAGS 196
            +           L++   +E    Y+     +      P+SIH +  +A   Y +  G 
Sbjct: 317 IEQHQLINIIIGFLEEEEVQEGGKGYIFNQQSYIQDRIRPYSIHQITNRAFCKYKIQPGQ 376

Query: 197 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED-----------GERGGAPV 245
           W  P  +    + L +  + +   G ++L + ++  S D+            G +G   +
Sbjct: 377 WYTPNQIAIILKELHKKNKIK---GTENLKIDVH--SSDKPIIFEKILQTLLGRQGKINL 431

Query: 246 VC--------------IDDA------------SRHCSVFSKGQADWT------------- 266
            C               DD+                S + + + D T             
Sbjct: 432 NCNHENQQSRNSINQDQDDSFEKIMPPNQQEIEEFSSQYEESKEDQTDNLCCKDCFKTDN 491

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
            + LL+P  LGL++++P +I  L+   +  QS+G++GGKP  + Y +G   +  +YLDPH
Sbjct: 492 KLFLLLPCRLGLDEISPIHIEILKKLLSLKQSVGMIGGKPNKAHYFLGFVGDDLLYLDPH 551

Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
            ++  +   K+DL  + S+Y  + +  + ++ I  SL  GFY    D+ + F     +L 
Sbjct: 552 YIKECVR--KEDLMENISSYFEEDVFKMPINKISTSLVFGFYFSGVDELNKFYKFLRQLE 609

Query: 387 EE 388
           +E
Sbjct: 610 KE 611


>gi|407848120|gb|EKG03593.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
           Clan CA, family C54, putative [Trypanosoma cruzi]
          Length = 357

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 73/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG---RPWRKPLQKPFDR 161
           +  SYR    P+ +   T+D+ WGCM+R+ QM++A A + +  G   R   + LQ+   R
Sbjct: 74  LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPRIGSERLQELRAR 132

Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
                  LF D  ++PF IH +   G  +G+  G W GP  + ++  AL           
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177

Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
                MA Y+ +G E     G  ++   +         +     T ++LL+P++LG+  +
Sbjct: 178 -----MASYLATGGE-----GPVILAFPERQIFLEEVKELLRQSTHVVLLIPVMLGICVI 227

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
           + +Y   ++       S+GI+GGK  ++ ++ G Q++   +LDPH VQP         E 
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 287

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
             +       R +   S D S+ +GFY    D    F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSLSVF 318


>gi|71415152|ref|XP_809652.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
           Brener]
 gi|70874068|gb|EAN87801.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 357

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 161
           +  SYR    P+ +   T+D+ WGCM+R+ QM++A A + +   G P    + LQ+   R
Sbjct: 74  LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 132

Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
                  LF D  ++PF IH +   G  +G+  G W GP  + ++  AL           
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177

Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
                MA Y+ +G E     G  V+   +         +     T ++LL+P++LG+  +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 227

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
           + +Y   ++       S+GI+GGK  ++ ++ G Q++   +LDPH VQP         E 
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 287

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
             +       R +   S D S+ +GFY    D    F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSLALF 318


>gi|302657364|ref|XP_003020406.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
           verrucosum HKI 0517]
 gi|291184236|gb|EFE39788.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
           verrucosum HKI 0517]
          Length = 398

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/235 (31%), Positives = 104/235 (44%), Gaps = 48/235 (20%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK--------------------------ITSDVGWGC 129
           +F  DF S++ I+YR  F PI  +                            TSD GWGC
Sbjct: 185 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSSISLGVRLRSQLIDTQGFTSDTGWGC 244

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-K 188
           M+RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +  G  
Sbjct: 245 MIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGAT 301

Query: 189 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
           A G   G W GP A  +  +AL +    + GL             G +  E+    V C 
Sbjct: 302 ACGKCPGEWFGPSAASQCIQALVKSN-PQVGL------RVCITSDGSDIYEKQFKEVACD 354

Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
           +                 P L+L+ + LG+++V P Y  +L+    FPQS+GI G
Sbjct: 355 ESG-----------GGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAG 398


>gi|111154179|gb|ABH07411.1| autophagin-2 [Trypanosoma cruzi]
          Length = 351

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 161
           +  SYR    P+ +   T+D+ WGCM+R+ QM++A A + +   G P    + LQ+   R
Sbjct: 68  LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 126

Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
                  LF D  ++PF IH +   G  +G+  G W GP  + ++  AL           
Sbjct: 127 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 171

Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
                MA Y+ +G E     G  V+   +         +     T ++LL+P++LG+  +
Sbjct: 172 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 221

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
           + +Y   ++       S+GI+GGK  ++ ++ G Q++   +LDPH VQP         E 
Sbjct: 222 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 281

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
             +       R +   S D S+ +GFY    D    F
Sbjct: 282 TCAR------RVLPTTSYDTSMTLGFYISSLDSLALF 312


>gi|207341865|gb|EDZ69806.1| YNL223Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 371

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 81/302 (26%), Positives = 124/302 (41%), Gaps = 57/302 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 126
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 97  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215

Query: 187 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLI------YGFPECGIDDCIVSVSSGDIYENEVEKV 269

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DP  ++
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPRCSL 369

Query: 366 GF 367
            F
Sbjct: 370 VF 371


>gi|330840249|ref|XP_003292131.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
 gi|325077656|gb|EGC31355.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
          Length = 603

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 53/168 (31%), Positives = 83/168 (49%), Gaps = 38/168 (22%)

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
           P+L+L+P+ LGL+ +N  Y  +L   F FPQ+LG+VGGKP AS Y + VQ+++  YLDPH
Sbjct: 371 PLLILIPMRLGLDGLNSIYYQSLLEIFKFPQNLGVVGGKPRASLYFIAVQDDNLFYLDPH 430

Query: 327 DVQPVINIGKDDLEAD-------------------------------------TSTYHSD 349
            VQ  I+I   + E                                        +T+   
Sbjct: 431 TVQNHIDINNSNGEPSNFSFSSSPSSSNINIINTNNNNNNNNNNDKNNNNSFPVNTFFCS 490

Query: 350 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 397
             +  H+  +DPSL + F+C+ + DFDDF  R+  +A +    P+F++
Sbjct: 491 QTKRTHVSEVDPSLVVAFFCKSRSDFDDFVDRSKAMASQMEN-PIFSI 537



 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 48/126 (38%), Positives = 74/126 (58%), Gaps = 1/126 (0%)

Query: 87  DAAGNNGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
           D  G + + EF +DF++R+L  +YR+GF  I +++  +D GWGCMLRS QML++  LL H
Sbjct: 129 DIPGQSFIKEFLEDFTTRVLWFTYRQGFPFIDNTQYDNDCGWGCMLRSGQMLLSNLLLHH 188

Query: 146 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 205
            LG  W+K         Y  I+ +F D  ++PFSIHN+   G+  G   G W  P  + +
Sbjct: 189 ALGDDWKKSSNSTHPDVYNNIISMFLDKPSAPFSIHNIALEGQTLGKNIGEWFAPSIISQ 248

Query: 206 SWEALA 211
           + ++L 
Sbjct: 249 AIKSLV 254


>gi|154419947|ref|XP_001582989.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121917228|gb|EAY22003.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 284

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 73/261 (27%), Positives = 113/261 (43%), Gaps = 39/261 (14%)

Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
           YR     + +S +T+D GWGC  RS+Q L+ Q +L  +L R +R    + F +  V  L 
Sbjct: 25  YRYNLSDLANSLLTTDKGWGCCFRSTQGLLCQYIL--KLHRKFRSLYDQVFGQN-VNPLD 81

Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
           LF D  ++PF I NL +   A GL  G W  P  M     A  +       L C      
Sbjct: 82  LFLDIPSAPFGIQNLTKNAFAIGLPVGEWAKPSIM----AATIKLIFDTLNLSC------ 131

Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
             ++S D   +             +H            P L+L+P + GL K++  Y+  
Sbjct: 132 --IISQDLTLDSNDI---------KHTKY---------PALILIPSLFGLSKMDDSYLSF 171

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           L L      SLG V G+  ++ Y VG   E   Y DPH  +  +      +     ++  
Sbjct: 172 LLLCLCIESSLGFVSGQNASAYYFVGFDLEDFYYFDPHVTKEAV------VSPPYDSFFD 225

Query: 349 DVIRHIHLDSIDPSLAIGFYC 369
             ++ +  +SI+PS+ +GFYC
Sbjct: 226 LELKSMKKESINPSVLLGFYC 246


>gi|119623099|gb|EAX02694.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_e
           [Homo sapiens]
          Length = 231

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 66/226 (29%), Positives = 102/226 (45%), Gaps = 61/226 (26%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + +    
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEK---- 133

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
                        MCR                       +  +S D  G+R    +   +
Sbjct: 134 -------------MCR-----------------------VLPLSADTAGDRPPDSLTASN 157

Query: 250 DA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 292
            +   S +CS        W P+LL+VPL LG+ ++NP Y+   ++T
Sbjct: 158 QSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKVT 196


>gi|123479730|ref|XP_001323022.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121905878|gb|EAY10799.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 284

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 69/263 (26%), Positives = 117/263 (44%), Gaps = 39/263 (14%)

Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 168
           YR  F  I +S ++ D GWGC  RSSQ LV Q +L  RL + +       F  +    L 
Sbjct: 25  YRNNFQAIENSTLSCDSGWGCCFRSSQGLVCQYIL--RLHKNFPDLYNSTFGID-KNPLD 81

Query: 169 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 228
           LF D   +PF I N++    + GL  G+W  P  +  +++++ +       L C      
Sbjct: 82  LFLDIPEAPFGIQNIVTHANSLGLPIGNWAKPSIIASAYKSIFQ----SLHLNC------ 131

Query: 229 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 288
             +V  D                     ++ + ++   P+L+L+P + GLEK+   YI  
Sbjct: 132 --IVPQDSTF------------------IYEELESTNYPVLILIPGLFGLEKIEKPYISF 171

Query: 289 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 348
           + L+     SLG V G   ++ Y +G   +   Y DPH  +  +     D   +      
Sbjct: 172 IFLSLCMNSSLGFVSGHNDSAFYFIGFDSDYFYYFDPHVTKQALTGPPYDSLFELK---- 227

Query: 349 DVIRHIHLDSIDPSLAIGFYCRD 371
             ++ + +++I+PS+ +GFYC D
Sbjct: 228 --LKSMKIENINPSVLLGFYCDD 248


>gi|159128081|gb|EDP53196.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
           fumigatus A1163]
          Length = 226

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 71/118 (60%), Gaps = 3/118 (2%)

Query: 261 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
           G+  + P L+L+   LG++++ P Y   ++ T   PQS+GI GG+P AS Y VGVQ    
Sbjct: 20  GRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHL 79

Query: 321 IYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
            YLDPH  +P +   NI     + +  TYH+  +R IH+  +DPS+ IGF  +D++D+
Sbjct: 80  FYLDPHQTRPALPQRNIDDPYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137


>gi|71000771|ref|XP_755067.1| autophagy cysteine endopeptidase Atg4 [Aspergillus fumigatus Af293]
 gi|66852704|gb|EAL93029.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
           fumigatus Af293]
          Length = 226

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 71/118 (60%), Gaps = 3/118 (2%)

Query: 261 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
           G+  + P L+L+   LG++++ P Y   ++ T   PQS+GI GG+P AS Y VGVQ    
Sbjct: 20  GRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHL 79

Query: 321 IYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
            YLDPH  +P +   NI     + +  TYH+  +R IH+  +DPS+ IGF  +D++D+
Sbjct: 80  FYLDPHQTRPALPQRNIDDPYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137


>gi|367014015|ref|XP_003681507.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
 gi|359749168|emb|CCE92296.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
          Length = 460

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 97/334 (29%), Positives = 146/334 (43%), Gaps = 63/334 (18%)

Query: 96  EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 126
           +F  D  SR+  +YR  F PI     G S ++                        +D+G
Sbjct: 60  QFLSDVHSRLHFTYRTKFVPIPRVSDGPSPLSFHFLIRENPLTTIENAIYNPDCFNTDIG 119

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+R+ Q L+  AL    LGR +R  + +  D+E  +I+  F D+  + FSIHN +  
Sbjct: 120 WGCMIRTGQSLLGNALQIANLGRDFR--VNQGKDQEEYKIIDWFADTPQAHFSIHNFVSQ 177

Query: 187 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 245
           G K      G W GP A  RS + L   Q  + G+        I V SGD          
Sbjct: 178 GLKLSNKKPGEWFGPAATSRSIQCLVE-QFPDCGID----KCLISVSSGD---------- 222

Query: 246 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 305
              +D  R   +F+  Q   + ILLL+ + LG+  VN  Y   ++ T     S+GI GG+
Sbjct: 223 -VFEDEVRE--IFA--QKPQSRILLLLGVKLGVNAVNEYYWDDVKKTLGSKFSVGIAGGR 277

Query: 306 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 365
           P +S Y +G Q    IY DPH  QP +    +  +    T H+     + L  +DPS+ I
Sbjct: 278 PSSSLYFMGFQGNELIYFDPHTPQPSLQTSANFYD----TCHALNFGKLLLSDLDPSMLI 333

Query: 366 GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
           G     ++ +        +  EE   + +F V+Q
Sbjct: 334 GILISGEEAW-------LQWKEEVKDSKIFNVSQ 360


>gi|261335715|emb|CBH18709.1| peptidase, putative [Trypanosoma brucei gambiense DAL972]
          Length = 348

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 148/344 (43%), Gaps = 49/344 (14%)

Query: 93  GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
           G AE  +  + ++L  SYR  F+P+ +   T+D+GWGC +R+ QM++A AL+ ++ G   
Sbjct: 37  GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93

Query: 152 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
                  F+   V  L     HLF D  ++PF IH +   G  +G   GSW GP  +   
Sbjct: 94  ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149

Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
             AL                M  Y+ SG +     G  V+ + D         K      
Sbjct: 150 MGAL----------------MEDYLSSGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
            +LLL+P++LG   ++  Y   L+       ++G VGGK G++ + +G Q  + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248

Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
             Q           +DT    S     + L S   S+ +GFY    D F  F        
Sbjct: 249 YAQSAFTC------SDTQGKISGEWYTLPLTSCSTSVLLGFYIHSPDSFSQFTGD----I 298

Query: 387 EESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 430
           +++N + +F + +     V  SD +G      + D   ++S  D
Sbjct: 299 KDANSSLIFPLIE-----VTTSDCVGHIFSEDDPDVCSLVSFGD 337


>gi|257205644|emb|CAX82473.1| autophagy-related cysteine endopeptidase 2 [Schistosoma japonicum]
          Length = 632

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 48/118 (40%), Positives = 68/118 (57%), Gaps = 4/118 (3%)

Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
           ++W P+LL VPL LGL   NP Y   ++  F  P  +GI+GG P  + +IVGV  +  I 
Sbjct: 385 SNWRPLLLFVPLRLGLHNPNPCYFNAIKAVFRLPNCIGILGGSPCHAVWIVGVTGDDVIC 444

Query: 323 LDPHDVQPVINIGKDDLEAD-TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 379
           LDPH  QP    G+ +L+ D   TYH +    + L  +DPS+ +GF C  + +FDD C
Sbjct: 445 LDPHTTQPA---GRGNLKPDYDQTYHCENPIRMPLKRLDPSMVLGFLCSTEKEFDDLC 499



 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 153
           E      SR+ ++YRKGF PIG      SD GWGCM R  QM++A+A+L   LGR WR  
Sbjct: 43  EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
           P Q+    EY  +L +F D  +  +SI  +   G + G + GSW GP  + +  + L+  
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160

Query: 214 QR 215
            R
Sbjct: 161 DR 162


>gi|444730159|gb|ELW70550.1| Cysteine protease ATG4A [Tupaia chinensis]
          Length = 364

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 84/351 (23%), Positives = 142/351 (40%), Gaps = 96/351 (27%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
           L EF  D    + I  ++     G +  +SD GWGCMLR  QM++AQAL+   LGR    
Sbjct: 24  LEEF-PDTDELVWILGKQHLLKTGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRA--- 79

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
                                          Q G   G + G W GP  + +  + LA  
Sbjct: 80  -------------------------------QMGVGEGKSIGEWFGPNTVAQVLKKLALF 108

Query: 214 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF--------------- 258
               +        +A+YV   +          V I+D  + C V                
Sbjct: 109 DEWNS--------LAVYVSMDN---------TVVIEDIKKMCCVLPLSADTDTESPPDSP 151

Query: 259 -----SKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV------- 302
                SKG +     W P+LL+VPL LG+ ++NP Y+   +L  +    L +        
Sbjct: 152 TASNQSKGPSACGSAWKPLLLIVPLRLGINQINPVYVDAFKLQASCHPILIVTKEGVRRT 211

Query: 303 ---------GGKPGASTYIVGVQEESA---IYLDPHDVQPVINIGKDDLEADTSTYHSDV 350
                    G +   S  +  V  ++    I+LDPH  Q  ++  ++ +  D + +    
Sbjct: 212 RILPPKDSSGARASESLKVKHVSFKTGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQS 271

Query: 351 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 272 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 321


>gi|74026240|ref|XP_829686.1| peptidase [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
 gi|70835072|gb|EAN80574.1| peptidase, putative [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 348

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 92/344 (26%), Positives = 148/344 (43%), Gaps = 49/344 (14%)

Query: 93  GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
           G AE  +  + ++L  SYR  F+P+ +   T+D+GWGC +R+ QM++A AL+ ++ G   
Sbjct: 37  GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93

Query: 152 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
                  F+   V  L     HLF D  ++PF IH +   G  +G   GSW GP  +   
Sbjct: 94  ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149

Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 266
             AL                M  Y+ +G +     G  V+ + D         K      
Sbjct: 150 MGAL----------------MEDYLRNGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
            +LLL+P++LG   ++  Y   L+       ++G VGGK G++ + +G Q  + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248

Query: 327 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
             Q           +DT    S     + L S   S+ +GFY    D F  F        
Sbjct: 249 YAQSAFTC------SDTQGKISGEWYTLPLTSCSTSVLLGFYIHSPDSFSQFTGD----I 298

Query: 387 EESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 430
           +++N + +F + +     V  SD +G      + D   ++S  D
Sbjct: 299 KDANSSLIFPLIE-----VTTSDCVGHIFNEDDPDVCSLVSFGD 337


>gi|119493442|ref|XP_001263911.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
 gi|119412071|gb|EAW22014.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
          Length = 179

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 45/114 (39%), Positives = 70/114 (61%), Gaps = 3/114 (2%)

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           + P L+L+   LG++++ P Y   ++ T   PQS+GI GG+P AS Y VGVQ     YLD
Sbjct: 24  FRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHLFYLD 83

Query: 325 PHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
           PH  +P +   NI +   + +  TYH+  +R IH+  +DPS+ IGF  +D++D+
Sbjct: 84  PHQTRPALPQRNIDERYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137


>gi|444726263|gb|ELW66801.1| Cysteine protease ATG4C [Tupaia chinensis]
          Length = 378

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 98/386 (25%), Positives = 137/386 (35%), Gaps = 130/386 (33%)

Query: 89  AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
           AGN  + EF +DF SRI ++YR+ F PI  S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45  AGN--VEEFRRDFISRIWLTYREEFPPIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102

Query: 149 RPWRKP----------------------------------LQKPFD--REYVE------- 165
           R W  P                                  L+ P    +E +E       
Sbjct: 103 RAWTWPDALNIENSDSESWTSHTVKKFTASVEASLSGERELKTPTISLKETIEKYSDDHE 162

Query: 166 ---------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
                    I+  FGDS  + F +H L++ GK  G  AG W GP  +           R 
Sbjct: 163 IRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
               G     + IYV             V   D   +  +  +   AD   +++LVP+ L
Sbjct: 223 PDLQG-----ITIYVAQ--------DCTVYSSDVIDKQRTAMTADNADDKAVIILVPVRL 269

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G E+ N  Y+  ++ TF  P    +   K                 +DP           
Sbjct: 270 GGERTNTDYLEFVK-TFHCPSPKKMSFRK-----------------MDP----------- 300

Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 394
                                    S  IGFYCR+  DF       +K+   S+    PL
Sbjct: 301 -------------------------SCTIGFYCRNIQDFKRASEEITKMLTISSKEKYPL 335

Query: 395 FTVTQTHKK-------PVNHSDVLGE 413
           FT    H +         N  D+  E
Sbjct: 336 FTFVNGHSRDYDFTSTTTNEEDLFSE 361


>gi|145500634|ref|XP_001436300.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124403439|emb|CAK68903.1| unnamed protein product [Paramecium tetraurelia]
          Length = 406

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 92/397 (23%), Positives = 162/397 (40%), Gaps = 67/397 (16%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           I++LG  H+I  D+        + + +  Q     I I+YR+ + P+  S   SD GWGC
Sbjct: 38  IYILG--HRIDIDQF----EIEDRINKIKQLVQETIWITYRRNYPPLYQSNYISDTGWGC 91

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS------------- 176
           MLR  QM +AQ L  H      ++      D +Y  I+  F D+++              
Sbjct: 92  MLRVGQMAMAQMLKKHLKNHGDKR------DEDYDNIILAFADNDSQENKEFIEFQNSKD 145

Query: 177 ---------PFSIHNL-LQAGKAYGLAAGSWVGPYAM------------CRSWEALARCQ 214
                    PFSI  +   A K + L  G W  P  +             R+ E L    
Sbjct: 146 KQKAHNFICPFSIQKIAYLAKKEFNLDPGEWYRPNYILFLLELLHNTIPIRASENLKLSV 205

Query: 215 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 274
             ++ L    L   ++    + D +        +++      +  K       + + V  
Sbjct: 206 FNDSCLFLDQLMNRMFEAKFETDKD--------LEEQLEKTQLIGKN-----SLAIFVLT 252

Query: 275 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
            +GL++ N +Y+  L      P   GIVGG P  + YI+G   +  +YLDPH VQ   N 
Sbjct: 253 RIGLDEPNQKYLKILDEIMELPYFQGIVGGTPKRAFYILGKINDHYLYLDPHYVQEAEN- 311

Query: 335 GKDDLEADT----STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 390
            KD +  +     ++Y    I  ++   +D S+ + FY R++ +   F     ++ + S+
Sbjct: 312 -KDQINENKMFNRTSYSCKNIHLLNQKHVDTSMGLSFYIRNQSELLQFWRNMKQIKQSSD 370

Query: 391 GAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMS 427
              +F ++ +  + V++S  L E+     DD +  + 
Sbjct: 371 DFFIF-LSDSAPEYVDYSGQLEESSNKLNDDDVVFLQ 406


>gi|194374239|dbj|BAG57015.1| unnamed protein product [Homo sapiens]
          Length = 259

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/272 (25%), Positives = 114/272 (41%), Gaps = 56/272 (20%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 1   MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 61  EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 103

Query: 250 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
           D  + C V                                      P S    G +P  S
Sbjct: 104 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 126

Query: 310 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 369
                 Q +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 127 -LTASNQGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 185

Query: 370 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 186 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 216


>gi|401425377|ref|XP_003877173.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
 gi|322493418|emb|CBZ28705.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
          Length = 394

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)

Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
           ++ +YR GF+  P     I +D GWGC+LR+SQML+A  L  H  GRP  + L   FD  
Sbjct: 46  LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102

Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
                     +ET+PFSIHN++++          +  P   C   EA+ R  +    +  
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRAFKAEYWSPSQGC---EAIKRTMQG--AVKT 148

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
           + L   + VV+             CI  D  +H   F +G AD   +L  V +    +  
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
              Y+   +L    PQ LG+VGG PG S Y     +    YLDPH       + +    A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLNEGPSAA 255

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
            + T     +R +H   +D SL + F    +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289


>gi|240274226|gb|EER37743.1| cysteine protease atg4 [Ajellomyces capsulatus H143]
          Length = 454

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 76/146 (52%), Gaps = 4/146 (2%)

Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
            D  P L+L+ + LG+++V P Y   L+    +PQS+GI GG+P +S Y +G Q     Y
Sbjct: 245 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFY 304

Query: 323 LDPHDVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           LDPH  +P +       +     + +TYH+  +R +H+  +DPS+ IGF  RD+DD++ +
Sbjct: 305 LDPHHTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSW 364

Query: 379 CARASKLAEESNGAPLFTVTQTHKKP 404
                  A    G  +  V    K P
Sbjct: 365 KRSVHNRAMIGTGKAIIHVFDKEKSP 390



 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 50/119 (42%), Gaps = 24/119 (20%)

Query: 58  PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 113
           P+R+  S++     LL    H+ +    LG     +    F  DF S+I ++YR  F   
Sbjct: 85  PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144

Query: 114 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 152
               DP                +     T+D GWGCM+RS Q L+A AL    LGR  R
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRACR 203


>gi|123397031|ref|XP_001301012.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121882136|gb|EAX88082.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 297

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/223 (31%), Positives = 109/223 (48%), Gaps = 33/223 (14%)

Query: 107 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 166
            +Y KGF P+     T+D  WGC +RS Q L+ Q +   +L + +   ++  F       
Sbjct: 27  FTYHKGFSPLAGG-YTTDKNWGCCIRSGQGLLMQFV--SKLYQLYGDKIKNIFPNG--SK 81

Query: 167 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 226
             LF D   +PF IH + +  + +G+ AG WV P  +   ++ L                
Sbjct: 82  FELFFDHPQAPFGIHCICRELETFGVKAGEWVKPSMLAPVFKDLLSF------------- 128

Query: 227 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 286
             I+VV   E+G        C+   S      S G     P+LLL  L+LG +  + +Y+
Sbjct: 129 FGIHVVIA-ENG--------CLSRESLR-EALSYGH----PVLLLFTLMLGYKDFDLKYL 174

Query: 287 PTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 328
           P LRLT +   QS+G+VGG+ G + Y+VG Q+E+ +Y DPH+V
Sbjct: 175 PFLRLTLSLIYQSVGVVGGQQGKAYYLVGHQKENLLYFDPHEV 217


>gi|154281231|ref|XP_001541428.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150411607|gb|EDN06995.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 463

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 76/146 (52%), Gaps = 4/146 (2%)

Query: 263 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
            D  P L+L+ + LG+++V P Y   L+    +PQS+GI GG+P +S Y +G Q     Y
Sbjct: 253 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQASHFFY 312

Query: 323 LDPHDVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           LDPH  +P +       +     + +TYH+  +R +H+  +DPS+ IGF  RD+DD++ +
Sbjct: 313 LDPHHTRPALAYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSW 372

Query: 379 CARASKLAEESNGAPLFTVTQTHKKP 404
                  A    G  +  V    K P
Sbjct: 373 KRSVHNGAMIGTGKAIIHVFDKEKSP 398


>gi|157872135|ref|XP_001684616.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
 gi|68127686|emb|CAJ05824.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
          Length = 394

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/274 (28%), Positives = 121/274 (44%), Gaps = 33/274 (12%)

Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
           ++ +YR GF+  P     I +D GWGC+LR+SQML+A  L  H  GRP  + L   FD  
Sbjct: 46  LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102

Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
                     +ET+PFSIHN++++     +    +  P   C   EA+ R    +  +  
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRVFKAEYWSPSQGC---EAIKRT--VQGAVKT 148

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
           + L   + VV+             CI  D  +H   F +G AD   +L  V +    +  
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
              Y+   +L    PQ LG+VGG PG S Y     +    YLDPH       + +    A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLCEGLSAA 255

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
            + T     +R +H   +D SL + F    +D++
Sbjct: 256 ASVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289


>gi|367008068|ref|XP_003688763.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
 gi|357527073|emb|CCE66329.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
          Length = 356

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/263 (30%), Positives = 117/263 (44%), Gaps = 37/263 (14%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
            TSD+GWGCM+R+ Q L+A AL     G P              EI+ LF D   +PFSI
Sbjct: 84  FTSDIGWGCMIRTGQTLLANALQRTNKGTPCS------------EIIELFVDETKNPFSI 131

Query: 181 HNLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
           HN +  GK   L   G W  P    +  E L           C      + + SGD   +
Sbjct: 132 HNFITVGKDLNLVKVGEWFSPSITIQIIEKLIENNNDHGIKKC-----IVSISSGDIYEQ 186

Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN-PRYIPTLRLTFTFPQS 298
                +  +DD+    +  +K Q     ILLL  + LG+  +N  +Y   ++       +
Sbjct: 187 ---DVLDELDDSEPPAN--TKQQH----ILLLFGIKLGINTINIEKYGQDIKDITNNKYT 237

Query: 299 LGIVGGKPGASTYIVGVQE--ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHL 356
            GI GG+P +S +  G     +  +Y DPH      N   D+   D STYHS     + +
Sbjct: 238 CGISGGQPKSSLFFFGYNNTHDRILYFDPHKPN---NFTTDN---DYSTYHSTEFNELEM 291

Query: 357 DSIDPSLAIGFYCR-DKDDFDDF 378
            ++DPS+ IGF  + +K D++ F
Sbjct: 292 FNLDPSMIIGFLVKNNKADWNKF 314


>gi|146093458|ref|XP_001466840.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
 gi|134071204|emb|CAM69889.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
          Length = 394

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)

Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
           ++ +YR GF+  P     I +D GWGC+LR+SQML+A  L  H  GRP  + L   FD  
Sbjct: 46  LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102

Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
                     +ET+PFSIHN++++          +  P   C   EA+ R    +  +  
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
           + L   + VV+             CI  D  +H   F +G AD   +L  V +    +  
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
              Y+   +L    PQ LG+VGG PG S Y     +    YLDPH       + +    A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLSEGPSAA 255

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
            + T     +R +H   +D SL + F    +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289


>gi|398019156|ref|XP_003862742.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
 gi|322500973|emb|CBZ36050.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
          Length = 394

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)

Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
           ++ +YR GF+  P     I +D GWGC+LR+SQML+A  L  H  GRP  + L   FD  
Sbjct: 46  LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102

Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
                     +ET+PFSIHN++++          +  P   C   EA+ R    +  +  
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
           + L   + VV+             CI  D  +H   F +G AD   +L  V +    +  
Sbjct: 149 EQLQTRVMVVTSANG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
              Y+   +L    PQ LG+VGG PG S Y     +    YLDPH       + +    A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLSEGPSAA 255

Query: 342 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
            + T     +R +H   +D SL + F    +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289


>gi|255711728|ref|XP_002552147.1| KLTH0B08272p [Lachancea thermotolerans]
 gi|238933525|emb|CAR21709.1| KLTH0B08272p [Lachancea thermotolerans CBS 6340]
          Length = 483

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 85/257 (33%), Positives = 118/257 (45%), Gaps = 34/257 (13%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
             SD+GWGCM+R+ Q L+  AL   RL  P   P +K       +++  F D  ++PFS+
Sbjct: 144 FCSDIGWGCMIRTGQALLGNALA--RLRSP---PEEK-------QLIGWFEDRSSAPFSL 191

Query: 181 HNLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
           HN ++ G A      G W GP A  RS ++L      + GL        I   SGD   E
Sbjct: 192 HNFVREGNALSRKPPGEWFGPSATSRSIQSLVHA-FPQCGLNH----CIISTDSGDVYEE 246

Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 299
             G P++               +     ILLL+ + LGL  VN RY P ++       S+
Sbjct: 247 DVG-PIL--------------EREPQATILLLLGVKLGLNNVNSRYWPDVKHILGSSFSV 291

Query: 300 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 359
           GI GG+P +S Y  G Q +   YLDPH  Q  +     D E   S  HS     +H   +
Sbjct: 292 GIAGGRPSSSLYFFGYQGDYLFYLDPHTSQLDLASCATDNEKYESV-HSARFNKVHFSEL 350

Query: 360 DPSLAIGFYCRDKDDFD 376
           DPS+ IG   +  DD+D
Sbjct: 351 DPSMLIGVLIQGLDDWD 367


>gi|400593108|gb|EJP61110.1| peptidase family C54 [Beauveria bassiana ARSEF 2860]
          Length = 378

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 115/315 (36%), Gaps = 118/315 (37%)

Query: 91  NNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGW 127
           N    +F  DF SR  ++YR  F PI  SK                        +SD GW
Sbjct: 109 NGWPQQFITDFDSRFWMTYRNDFKPIPRSKDPKAASSMSFPMRIKYQLGDQGGFSSDSGW 168

Query: 128 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
           GCM+RS Q L+A A    RLGR WR+  QK    E ++I+ +F D   +P+SIHN +  G
Sbjct: 169 GCMIRSGQSLLANATGIVRLGRDWRRGQQK---AEEIKIMRMFADDPAAPYSIHNFVDYG 225

Query: 188 KAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 246
            +  G   G W GP A  +                                         
Sbjct: 226 SSKCGKYPGEWFGPSATSQ----------------------------------------- 244

Query: 247 CIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 303
           CI+      S  +  ++D   + P L+L+   LG++K+   Y   L      PQS+GI G
Sbjct: 245 CINPDVYEDSFMATAKSDHGFFKPTLILISTRLGIDKITQVYWEALISALQMPQSVGIAG 304

Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 363
                                                          +R +H+  +DPS+
Sbjct: 305 -----------------------------------------------LRRLHVQQMDPSM 317

Query: 364 AIGFYCRDKDDFDDF 378
            IGF  R ++++ ++
Sbjct: 318 LIGFIIRSEEEWKEW 332


>gi|47213810|emb|CAF92583.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 265

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 67/116 (57%), Gaps = 2/116 (1%)

Query: 261 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
               W  +++LVP+ LG E +NP YI  ++        +GI+GGKP  S Y +G Q+E  
Sbjct: 151 AHQSWQSVIILVPVRLGGESLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFIGFQDEQL 210

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 376
           +YLDPH  QPV+++ + +   +  ++H +  + +    +DPS  IGFY + K DF+
Sbjct: 211 LYLDPHYCQPVVDVSQVNFSLE--SFHCNSPKKMPFSRMDPSCTIGFYAKSKKDFE 264



 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 30/60 (50%), Positives = 41/60 (68%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
           +  F   F SRI ++YRK F P+  S +T+D GWGCMLRS QML+AQ LL H + R +++
Sbjct: 74  VERFRLAFVSRIWLTYRKDFPPLEGSTLTTDCGWGCMLRSGQMLLAQGLLVHLMHRVYKE 133


>gi|148693227|gb|EDL25174.1| autophagy-related 4D (yeast), isoform CRA_c [Mus musculus]
          Length = 257

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 80/176 (45%), Gaps = 44/176 (25%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S + L G C+     E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISTVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
             +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193

Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP 249


>gi|431896953|gb|ELK06217.1| Cysteine protease ATG4C [Pteropus alecto]
          Length = 378

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 98/386 (25%), Positives = 134/386 (34%), Gaps = 130/386 (33%)

Query: 89  AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
           AGN  + EF +DF SRI ++YR+ F  I  S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45  AGN--VEEFRKDFISRIWLTYREEFPSIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102

Query: 149 RPWRKP----------------------------------LQKPF------------DRE 162
           R W  P                                  L+ P             D E
Sbjct: 103 RAWTWPDALNIDNSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGRYSDDHE 162

Query: 163 YV-EILH-----LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
              EI H      FGDS  + F +H L++ GK  G  AG W GP  +           R 
Sbjct: 163 MQNEIYHRKIISWFGDSPLALFGLHQLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 276
               G     + IYV             V   D   + C+  +    D   +++LVP+ L
Sbjct: 223 PELQG-----ITIYVAQ--------DCTVYSSDVIDKQCASMAPDITDDKAVIILVPVRL 269

Query: 277 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 336
           G E+ N  Y+  ++ TF  P    +   K                 +DP           
Sbjct: 270 GGERTNIDYLEFVK-TFHCPSPKKMSFRK-----------------MDP----------- 300

Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE--ESNGAPL 394
                                    S  IGFYCR+  DF       +K+ +       PL
Sbjct: 301 -------------------------SCTIGFYCRNVQDFKRASEEITKMLKVFSKEKYPL 335

Query: 395 FTVTQTHKK-------PVNHSDVLGE 413
           FT    H +         N  D+  E
Sbjct: 336 FTFVNGHSRDYDFTSTTTNEEDLFSE 361


>gi|118349810|ref|XP_001008186.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89289953|gb|EAR87941.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 343

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 72/293 (24%), Positives = 125/293 (42%), Gaps = 37/293 (12%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
           I  SYR GF     + I SD GWGCMLRS QM+ A  LL H    P    +Q     + +
Sbjct: 27  IYFSYRSGFSHQFQNHIFSDSGWGCMLRSGQMIFANGLLRHLKENP---QIQNQLKIQNI 83

Query: 165 E-----ILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 218
                 I+  F +++  PFSI  +   A + + L  G W  P  +  S + L    +  +
Sbjct: 84  NDILLFIIKFFIENKDQPFSIQQIAAVALEEFKLEMGFWYSPNRIAYSLKKLLNNFQTFS 143

Query: 219 GLGCQS------LPMAIYVVSGDEDGERGGAPV------VCIDDASRHCSVFSKGQADWT 266
            +   S       P+          G++  + +      + I++  +   +  +    + 
Sbjct: 144 EMNIVSEVMYSDRPLYFSQCVTAMTGQKIDSTLPKQLLQILINNIEKQIKIMKQNSNKYQ 203

Query: 267 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
                  +++GL+    +Y+  L   FT   S+G           ++G+  +   YLDPH
Sbjct: 204 INKQNYKILIGLDYPEEKYLDILIKLFTHRLSIG-----------MIGLNNDKLTYLDPH 252

Query: 327 DVQPV-INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
            VQ   IN      E +  TY  + ++ I+  ++ PS+ +GFY +D +D ++F
Sbjct: 253 IVQHADINTN----EINLKTYFQEEVKQINKHALGPSVGLGFYLKDLNDLNEF 301


>gi|384493397|gb|EIE83888.1| hypothetical protein RO3G_08593 [Rhizopus delemar RA 99-880]
          Length = 194

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/158 (37%), Positives = 78/158 (49%), Gaps = 27/158 (17%)

Query: 70  IWLLGVCHKI--------AQDEALGDAAGNNGLA----------------EFNQDFSSRI 105
           IWLLG  + I        A  EA  D   N G +                +F  DF+SR+
Sbjct: 29  IWLLGCSYIIKPTDHIQQALLEAQRDLMFNKGSSENEEENNQNMHMLWPPDFYDDFTSRL 88

Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-KPFDREYV 164
            ++YR  + PI  S   +D+GWGC LRS Q L+A  L+ H LGR WR+  Q +   ++Y 
Sbjct: 89  WMTYRHNYPPIRPSSHKTDIGWGCTLRSGQSLLANTLIIHFLGRDWRRQTQNQAAWKQYS 148

Query: 165 EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 200
            I+H F D  S  +PFSIH +   GK  G   G W GP
Sbjct: 149 RIVHWFLDELSPRAPFSIHRIALLGKQLGKNIGEWFGP 186


>gi|312378951|gb|EFR25375.1| hypothetical protein AND_09326 [Anopheles darlingi]
          Length = 350

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 50/130 (38%), Positives = 70/130 (53%), Gaps = 3/130 (2%)

Query: 278 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG-- 335
           L +VNP YI  L+  F  P S G++GG+P  + Y +G   E A+YLDPH VQ V  IG  
Sbjct: 180 LNEVNPIYIEGLKKCFQLPGSCGMIGGRPNQALYFIGYVGEEALYLDPHTVQRVGCIGEK 239

Query: 336 KDDLEADT-STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 394
           ++ +E +  +T+H      I   S+DPSLA+ F C  +  FD   A   +        PL
Sbjct: 240 QESVEQEQDATFHQRHASRIAFASMDPSLAVCFLCCSRAQFDQLVAHFKERLNGGGSQPL 299

Query: 395 FTVTQTHKKP 404
           F VT+T + P
Sbjct: 300 FEVTKTRQAP 309



 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 67/103 (65%), Gaps = 2/103 (1%)

Query: 99  QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 158
           QD  SR+  +YR+GF PIG++++T+D GWGCMLR  QM++A+AL    LGR W+   ++ 
Sbjct: 72  QDVQSRLWCTYRRGFVPIGNTQLTTDKGWGCMLRCGQMVLAEALTELHLGRDWQWS-EET 130

Query: 159 FDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGP 200
            D  Y++I++ F D++ +PFS+H + L    +     G W GP
Sbjct: 131 RDATYLKIVNRFEDNKQAPFSLHQIALMGDSSEEKRIGEWFGP 173


>gi|123497568|ref|XP_001327207.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121910133|gb|EAY14984.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 296

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 70/276 (25%), Positives = 119/276 (43%), Gaps = 54/276 (19%)

Query: 107 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY--- 163
            +YR  F  I    ITSD GWGC  RS+Q L+A   L +            P D EY   
Sbjct: 30  FTYRCNFQAIQPGNITSDSGWGCCYRSAQGLIASYFLNY-----------APVDAEYFFT 78

Query: 164 ----VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
               + +  LF D    PFSI NL+   + +G+  G+W  P  +  + E++ +       
Sbjct: 79  VFNEIPMFSLFEDRVEMPFSIQNLVYRSELFGVKPGTWAKPSQLAATIESIFK------- 131

Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
                L +++ ++S D +       ++  D  +                      +LG++
Sbjct: 132 ----DLKLSV-LISKDSN-------IIPEDVKTMRAPFLLLIPI-----------LLGMK 168

Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE-ESAIYLDPHDVQPVINIGKDD 338
            V  ++IP ++ TF  P+ LG V G    S ++VG+ E ++ +Y DPH  +  +      
Sbjct: 169 DVEQKFIPFIKYTFQRPEFLGAVSGSSDFSYFLVGLSEDQNVVYFDPHVTKQAVASS--- 225

Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
              D S +     R I + S++PS  +GF+C   ++
Sbjct: 226 --FDHSEFFEVPPRGIKMKSLNPSFLLGFFCSSTEN 259


>gi|395750455|ref|XP_002828707.2| PREDICTED: cysteine protease ATG4D [Pongo abelii]
          Length = 296

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/248 (24%), Positives = 110/248 (44%), Gaps = 44/248 (17%)

Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
           +R + +I+  F D   +PF +H L++ G++ G  AG W GP         +A   R    
Sbjct: 51  ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILRKAVE 103

Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 279
              +   + +YV                    S+ C+V           L +  L +   
Sbjct: 104 SCSEVTRLVVYV--------------------SQDCTV-----------LHMRSLAIDPS 132

Query: 280 KVNPRYIPT-LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
           K     +P+ L+        LGI+GGKP  S Y +G Q++  +YLDPH  QP +++ + +
Sbjct: 133 KDRSTCLPSSLQELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQAN 192

Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLF 395
              +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +++   S+     P+F
Sbjct: 193 FPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMF 250

Query: 396 TVTQTHKK 403
           T+ + H +
Sbjct: 251 TLAEGHAQ 258


>gi|195350257|ref|XP_002041657.1| GM16788 [Drosophila sechellia]
 gi|194123430|gb|EDW45473.1| GM16788 [Drosophila sechellia]
          Length = 269

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 63/229 (27%), Positives = 105/229 (45%), Gaps = 24/229 (10%)

Query: 175 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 234
            S +SIH + Q G++   A G W+GP  + +  + L R     +        +AI+V   
Sbjct: 1   NSFYSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD 52

Query: 235 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 294
                      V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+    
Sbjct: 53  ---------STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLE 99

Query: 295 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVI 351
              S G++GG+P  + Y +G  ++  +YLDPH  Q    + +    A+     TYH    
Sbjct: 100 LDSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHA 159

Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 400
             ++  ++DPSLA+ F C+  D F+    +  +         LF ++QT
Sbjct: 160 ARLNFSAMDPSLAVCFLCKTSDSFESLLTQFKEEVLSLCSPALFEISQT 208


>gi|448509127|ref|XP_003866066.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
 gi|380350404|emb|CCG20626.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
          Length = 419

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 80/260 (30%), Positives = 117/260 (45%), Gaps = 39/260 (15%)

Query: 110 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 169
           R  FD   +   TSD GWGCM+R+SQ L+A AL          K   +      +EIL L
Sbjct: 130 RSLFD---NENFTSDAGWGCMIRTSQNLLANAL---------LKLAGEANGNVQLEILKL 177

Query: 170 FGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 227
           F D   + FSIHN ++   A  L+   G W GP A   S   L         +  Q  P 
Sbjct: 178 FQDDPNAAFSIHNFIRVASASPLSVKPGQWFGPNAASISIRQLT------IEMTDQESPT 231

Query: 228 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 287
            +  V   E+ +         DD      +  K      P+LLL P+ LG++ VN  Y  
Sbjct: 232 VVPFVYISENAD-------LYDDEIEETFLKEK-----RPLLLLFPVRLGIDHVNKYYYK 279

Query: 288 TLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPHDVQPVINIGKDDLEADTSTY 346
           ++        S+GI GGKP +S Y +G + +E+ IY DPH  Q        +   + ++Y
Sbjct: 280 SILQLLASRFSVGIAGGKPSSSFYFIGYENDENLIYFDPHLPQVF------ESPINLASY 333

Query: 347 HSDVIRHIHLDSIDPSLAIG 366
           H+     + ++ +DPS+ IG
Sbjct: 334 HTLNYNKLSIEMLDPSMMIG 353


>gi|323331874|gb|EGA73286.1| Atg4p [Saccharomyces cerevisiae AWRI796]
          Length = 347

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 69/247 (27%), Positives = 109/247 (44%), Gaps = 28/247 (11%)

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           M+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + AG  
Sbjct: 1   MIRTGQSLLGNALQILHLGRDFRVNGNESLERES-KFVNWFNDTPEAPFSLHNFVSAGTE 59

Query: 190 YG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 248
                 G W GP A  RS ++L        G     +   I  VS  +  E     V   
Sbjct: 60  LSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKVFAE 113

Query: 249 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 308
           +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+P +
Sbjct: 114 NPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGRPSS 159

Query: 309 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 368
           S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ IG  
Sbjct: 160 SLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLIGIL 213

Query: 369 CRDKDDF 375
            + + D+
Sbjct: 214 IKGEKDW 220


>gi|291059129|gb|ADD71908.1| autophagy protein 4 [Acanthamoeba castellanii]
          Length = 373

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 47/117 (40%), Positives = 68/117 (58%), Gaps = 4/117 (3%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
           F  DF SR+ ++YR  F  IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR   +
Sbjct: 147 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 206

Query: 157 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEAL 210
           +     Y E+L  F D  S  SP+SIH + + G + +    G W  P  +  +   L
Sbjct: 207 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLL 262


>gi|146161894|ref|XP_001008187.2| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|146146576|gb|EAR87942.2| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 516

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 98/400 (24%), Positives = 160/400 (40%), Gaps = 78/400 (19%)

Query: 99  QDFSSRILISYRKGFDPIGD------------SKITSDVGWGCMLRSSQMLVAQALLFHR 146
           ++F + I I+YRK F  + +            S+  SD GWGCM+R  QM  A+ L  H 
Sbjct: 71  ENFYNIIWITYRKNFPALLNMIDKANLKNQKMSEYISDTGWGCMVRVGQMAFAEGLRRHL 130

Query: 147 LGRPWRKPLQKPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSWVGPY 201
           +    +K + K  +   V I     D +     +P+SI  + + A   + L  G W  P 
Sbjct: 131 VEN--KKLVVKKKEDLRVIIEGFLDDDQKCIDFAPYSIQKISKIALSDFNLLPGEWYTPI 188

Query: 202 AMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGD-------EDGERGGAPVVCID 249
            +C     L   ++A  G   + L +A++     +V  D        D +RG    +C +
Sbjct: 189 RICYILGLLHNERKAIKG--TEDLKVAVFSSSRPIVFQDFLERMCKVDPQRGKHAQICPN 246

Query: 250 -------------DASRHCSVFSKGQ---------ADWTPILLLV-PL------------ 274
                        D   H  +  + Q         ++ TP L LV P+            
Sbjct: 247 QCRIIKQDQKSKVDHDHHKDIKLEKQNSNSEILVVSEETPKLRLVCPIHHELQYSMIVYI 306

Query: 275 --VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 332
             ++GL+   P Y+   +    F  SLG++GGKP  + Y VG  E+  IYLDPH VQ   
Sbjct: 307 VCLIGLDTPQPEYLELAKKMMDFKYSLGLIGGKPKKALYFVGRIEDEFIYLDPHYVQEFS 366

Query: 333 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 392
           N       +   TY     +     +ID S ++ +Y +D +  ++F      L  + N  
Sbjct: 367 NEKNFQSSSQLETYFCKKFQTYPSKNIDSSFSLMYYLKDLEQLEEFYQFMMGLKRDYNEH 426

Query: 393 PLFTVTQTHKKPVNHSDVLG---ETGGVPEDDSLGVMSMN 429
               +  T       S  LG   E+  +  D +L +++ N
Sbjct: 427 FFMMMEDTEP-----SFCLGDGKESSNLISDKNLNILADN 461


>gi|149020503|gb|EDL78308.1| rCG31864, isoform CRA_a [Rattus norvegicus]
          Length = 256

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 60/176 (34%), Positives = 81/176 (46%), Gaps = 45/176 (25%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S   S + L G C+     E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISSVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 152
           S +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192

Query: 153 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP 248


>gi|298712912|emb|CBJ33424.1| Autophagy-related protein 4 [Ectocarpus siliculosus]
          Length = 546

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 51/135 (37%), Positives = 75/135 (55%), Gaps = 6/135 (4%)

Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
           ++LLVPL LGL++++  YIP+L  T   PQSLG +GG+P  + + +G Q  +   LDPH 
Sbjct: 380 VVLLVPLRLGLDELSTGYIPSLLETLRVPQSLGFLGGRPNHAIFFIGAQGNTLTGLDPHT 439

Query: 328 VQPVINIGKD-DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 386
            QP  ++G+    E    + H      + +  IDPSLA+ FY  D+  F+D   R     
Sbjct: 440 TQPAADMGEGFPSERYVHSLHCQSAVSMDVHRIDPSLALAFYLPDRATFEDLIKRIG--- 496

Query: 387 EESNGAPLFTVTQTH 401
            E+N  P F+V QT 
Sbjct: 497 -ETN-PPPFSVEQTR 509



 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/138 (39%), Positives = 70/138 (50%), Gaps = 23/138 (16%)

Query: 71  WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 130
           W++G+ +   ++E            E   D  S + I+YR GF  +     T D GWGCM
Sbjct: 38  WIMGIPYTELREE------------ERRLDVFSTMWITYRSGFPKMEPYGYTDDSGWGCM 85

Query: 131 LRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGD--SETSPFSIHN 182
           LRS+QML+ QAL  H LGR WR P      L+ P   EY  ++ LF D   E + FSIHN
Sbjct: 86  LRSAQMLMTQALQRHTLGRSWRVPRTLEERLRVP---EYRTLVRLFADHPGEANLFSIHN 142

Query: 183 LLQAGKAYGLAAGSWVGP 200
           + Q G  Y    G W GP
Sbjct: 143 MCQVGIRYDKLPGEWYGP 160


>gi|72389991|ref|XP_845290.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359288|gb|AAX79730.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei]
 gi|70801825|gb|AAZ11731.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
           brucei strain 927/4 GUTat10.1]
          Length = 327

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 78/292 (26%), Positives = 136/292 (46%), Gaps = 38/292 (13%)

Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
           +S  L +YR+ FDP+  S +TSD GWGC+ R++QML+A +L         R+   +    
Sbjct: 41  NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91

Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 219
           +Y   L    D + +PFS+H +++    + L  G  + P  +A  +  EA++ C +  T 
Sbjct: 92  QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144

Query: 220 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
            G  S P+++ + V+G    E     V C    SR+             +L+L PL  G 
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187

Query: 279 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGK 336
            + ++ +   +L      P+S+G+VGG P    YI+G   +E  +YLDPH       +  
Sbjct: 188 SRYMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCKTQDALLSS 247

Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           +  E       S  +R +    +D S  +GF+   +  ++    R   L+++
Sbjct: 248 EPGETGVVKPTSSNLRSVPYGQVDTSFFLGFFVDSQSRWESLQKRIEGLSKQ 299


>gi|389602150|ref|XP_001566661.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|322505338|emb|CAM40177.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 398

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 116/273 (42%), Gaps = 31/273 (11%)

Query: 105 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 162
           ++ +YR GF+  P     I +D GWGC+LR+SQML+A  L  +  GRP  + L   FD  
Sbjct: 46  LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWAY--GRPADRRLALFFDH- 102

Query: 163 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 222
                     +ET+PFSIHNL+++          +  P   C   EA+ R    +  +  
Sbjct: 103 ---------SAETAPFSIHNLIRSVWNQRAFKAEYWSPSQGC---EAIKRTM--QDAIKT 148

Query: 223 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
           + L   + VV+             C+     H   F +G A+   +L  V +    +   
Sbjct: 149 EQLQTRVTVVTSTNG---------CVYADEVH-HTFKQG-AEVVLVLASVRVSAAAQLTQ 197

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
             Y+   +L    PQ LGIVGG PG S Y     +    YLDPH       +        
Sbjct: 198 ESYLQIEKL-MEQPQCLGIVGGVPGRSYYFFAHNQTQLFYLDPHQRTTAALLSDGPSATV 256

Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
           + T     +R +H   +D SL + F    +D++
Sbjct: 257 SVTPSVSDVRCVHWSRVDTSLFLAFAVTTRDEW 289


>gi|261328682|emb|CBH11660.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
           gambiense DAL972]
          Length = 327

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 78/292 (26%), Positives = 136/292 (46%), Gaps = 38/292 (13%)

Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
           +S  L +YR+ FDP+  S +TSD GWGC+ R++QML+A +L         R+   +    
Sbjct: 41  NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91

Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 219
           +Y   L    D + +PFS+H +++    + L  G  + P  +A  +  EA++ C +  T 
Sbjct: 92  QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144

Query: 220 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 278
            G  S P+++ + V+G    E     V C    SR+             +L+L PL  G 
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187

Query: 279 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGK 336
            + ++ +   +L      P+S+G+VGG P    YI+G   +E  +YLDPH       +  
Sbjct: 188 SRCMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCKTQDALLSG 247

Query: 337 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           +  E       S  +R +    +D S  +GF+   +  ++    R   L+++
Sbjct: 248 EPGETGVVKPTSSNLRSVPYGQVDTSFFLGFFVDSQSRWESLQKRIEGLSKQ 299


>gi|340054025|emb|CCC48319.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma vivax Y486]
          Length = 326

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 118/280 (42%), Gaps = 35/280 (12%)

Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
           +S  L++YR  F+P+  S +TSD GWGC+ R+SQML+A  L  H                
Sbjct: 41  TSFYLLTYRMNFEPLPCSTLTSDRGWGCLARASQMLLAHVLRRHAASEC----------- 89

Query: 162 EYVEILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETG 219
            +++      D   +PFS+H + +A   +G    A  W  P   C   EA+  C  +   
Sbjct: 90  -HLKFFCDMNDEHLAPFSLHCMTRAVIKHGTEFRADYW-APSQGC---EAIRSCVESAVR 144

Query: 220 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL- 278
            G  +  +++ V S     ER                +    + D + +L+LVP+  G  
Sbjct: 145 QGLLTQKLSVVVSSSGTIPER---------------EIHEHLRGDGS-VLVLVPVRCGTS 188

Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
            ++       L      P  +G+VGG P    YIVG      +YLDPH +     +  + 
Sbjct: 189 RRMTQTMFFALEHLLHIPSCMGVVGGVPNRGYYIVGTSGHRLLYLDPHCMTQNAMVSCEL 248

Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
            +    T  ++++R +  D +D S   GF     D+++  
Sbjct: 249 GKVGIVTPTTNLLRSVRWDHVDTSFFFGFLLDSLDEYEKL 288


>gi|428184439|gb|EKX53294.1| hypothetical protein GUITHDRAFT_133035 [Guillardia theta CCMP2712]
          Length = 567

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 53/146 (36%), Positives = 80/146 (54%), Gaps = 10/146 (6%)

Query: 254 HCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 311
           +CS  ++ +    W P++++VP+ LG    +      L       QSLG +GG+P  S Y
Sbjct: 406 NCSRMAQAREPCSWRPLIVVVPVRLGARSEDQH----LSRIDKHLQSLGFIGGRPRHSYY 461

Query: 312 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 371
            VGV+  +A YLDPH  QP  +I K+    + +++H      + L  IDPSLA+GFYC D
Sbjct: 462 FVGVRGYNAYYLDPHITQPYQSIRKN---INVASFHCAHPGKMSLAHIDPSLALGFYCDD 518

Query: 372 KDDFDDFCARASKLAEESNGAPLFTV 397
           K DF+D   R  +LA   +  P+ +V
Sbjct: 519 KSDFEDLIRRVEELA-AGDSHPILSV 543


>gi|345311182|ref|XP_001519565.2| PREDICTED: cysteine protease ATG4D-like, partial [Ornithorhynchus
           anatinus]
          Length = 147

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 32/136 (23%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----R 152
           F +DF SR+ ++YR+ F P+  S  TSD GWGCMLRS QML+AQ L+ H L R W     
Sbjct: 5   FQRDFVSRLWLTYRRDFPPLEGSAWTSDCGWGCMLRSGQMLLAQGLVVHLLSRDWIWAEA 64

Query: 153 KPLQKP----------------------------FDREYVEILHLFGDSETSPFSIHNLL 184
            P  KP                             +R++  I+  F D   +PFS+H L+
Sbjct: 65  GPAPKPGEHRLLKSDPGGPSRSPAPPPPAGVLQEQERQHRRIVSWFADHPQAPFSLHRLV 124

Query: 185 QAGKAYGLAAGSWVGP 200
           + G+  G  AG W GP
Sbjct: 125 RLGQGSGKRAGDWYGP 140


>gi|119623097|gb|EAX02692.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_d
           [Homo sapiens]
          Length = 172

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 45/125 (36%), Positives = 68/125 (54%), Gaps = 11/125 (8%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 33  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + +  + 
Sbjct: 82  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKMCRV 141

Query: 190 YGLAA 194
             L+A
Sbjct: 142 LPLSA 146


>gi|343472883|emb|CCD15086.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 327

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 83/290 (28%), Positives = 131/290 (45%), Gaps = 42/290 (14%)

Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
           L +YRK F+P+  S IT+D GWGC+ R+SQML+A AL         R+ +   F  +Y  
Sbjct: 45  LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMTLDFSFQYFC 95

Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
            +    D   +PFS+H ++++    G  L    W  P   C   EA++ C R+    G  
Sbjct: 96  DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRSAIHRGAL 148

Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 282
              + + V         G A  +   + +RH      G A     L+LVP+  G   ++ 
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGKDDLEA 341
            +   +L      P  +G+VGG PG   YIVG   +E  +YLDPH +     +     E+
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIVGTGGQELLLYLDPHCMTQEALVS---CES 249

Query: 342 DTSTYHSDVIRH---IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           DT+       RH   +  D +D S  IGF+    + ++D   +   L+ +
Sbjct: 250 DTAGVVRPTPRHLLCVPYDRVDTSFFIGFFVDSFELWEDLQKKIEGLSRQ 299


>gi|167381603|ref|XP_001735783.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165902089|gb|EDR28003.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 359

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 86/361 (23%), Positives = 148/361 (40%), Gaps = 54/361 (14%)

Query: 52  HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGDAAGNNGL----AEFNQDFSSRIL 106
           HE +  P   G  S     ++LGV  K  Q D+ L +      L    A      S+   
Sbjct: 25  HEDIQKPIFIGGCS----FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAPAFFRISNLFW 80

Query: 107 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYV 164
           ++YR G++ + +S +T+DVGWGC +R+ QM++A A+  + +       +    P   E +
Sbjct: 81  MTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIPTKEEIM 140

Query: 165 EILHLFGDS--ETSPFSIHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGL 220
            +L  F DS   T+P SIH++ ++        +  +++ P  + +++  L    +     
Sbjct: 141 NVLVPFIDSPNSTTPLSIHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLVNSWKL---- 196

Query: 221 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 280
                                  P+ C+  ++         +  + P L+ +P+VL    
Sbjct: 197 ----------------------CPIRCVMCSNVSIPTHELSKLPFKPTLVFLPIVL---- 230

Query: 281 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 340
            N      L+  +      GIVGG    + ++ G      +YLDPH VQP     K   E
Sbjct: 231 -NHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGFHALQFLYLDPHIVQPSF---KSFTE 286

Query: 341 ADTSTYHSDVIRHIHLDSIDPS-----LAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 395
            DT +Y         + +IDP+        GF  ++  + DDF   A ++ E SN   L 
Sbjct: 287 IDTKSYSPISTNRFSVHTIDPTKLDDFCTFGFLIKNFHEIDDFMKFAKEVFEISNDKELR 346

Query: 396 T 396
           T
Sbjct: 347 T 347


>gi|256078123|ref|XP_002575347.1| autophagin-1 (C54 family) [Schistosoma mansoni]
 gi|360045353|emb|CCD82901.1| autophagin-1 (C54 family) [Schistosoma mansoni]
          Length = 556

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 46/122 (37%), Positives = 69/122 (56%), Gaps = 4/122 (3%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 153
           E  +  +SR+ ++YRKGF PIG      SD GWGCM R  QM++A+A+L   LGR W+  
Sbjct: 37  EIARHLNSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRFHLGRSWKWS 96

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
           P Q+    EY  +L +F D  ++ +SI  +   G + G + GSW GP  + +  + L+  
Sbjct: 97  PEQE--SPEYYRLLQMFQDRRSALYSIQTITLTGVSLGKSIGSWFGPNTVAQVLKKLSVY 154

Query: 214 QR 215
            R
Sbjct: 155 DR 156



 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 46/121 (38%), Positives = 63/121 (52%), Gaps = 15/121 (12%)

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TSTYHSDVI 351
           F  P  +GI+GG P  + +IVGV ++  I LDPH  QP    G+ +L+ D   TYH D  
Sbjct: 351 FRLPHCVGILGGSPCHAVWIVGVTDDDVICLDPHTTQPA---GRGNLKPDYDQTYHCDNP 407

Query: 352 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE------SNGAPLFTVTQTHKKPV 405
             I L  +DPS+ +GF C  + +FDD C     L EE      +N  PL  +  T  +P 
Sbjct: 408 IRIPLKRLDPSMVLGFLCSTEKEFDDLC---HNLKEEVLHPSVANSWPLVEIHTT--RPS 462

Query: 406 N 406
           N
Sbjct: 463 N 463


>gi|403345460|gb|EJY72096.1| Cysteine protease family C54 putative [Oxytricha trifallax]
          Length = 823

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 43/113 (38%), Positives = 67/113 (59%), Gaps = 3/113 (2%)

Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
           IL+++P  LGL KVN  Y  +++  F    ++GI+GG+P  + Y VG Q+   I LDPH 
Sbjct: 611 ILVIIPTRLGLNKVNKEYYSSIKYVFQCRLNVGIMGGRPNQALYFVGTQKTDLICLDPHL 670

Query: 328 VQPVINIGKDDLEAD--TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           VQ  + + +++L       TYH D  + + +  +D SLA GFY +D +DF+ F
Sbjct: 671 VQDTV-LNQEELSNVELNQTYHCDQAKKLSMTKLDTSLAFGFYLKDYNDFEVF 722



 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 10/99 (10%)

Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLG-----RPWRKPLQKPFDREYVEILHLFGD---S 173
           T+DVGWGC +R  QM++ QAL+ H +G     +      QK  +  Y +I+ L  D   S
Sbjct: 394 TTDVGWGCTIRVGQMMICQALMRHLIGLDHSVKNLSSTEQKRLN--YAKIIQLIHDNDCS 451

Query: 174 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 212
           +T  FSI N+ + G  +    G W GP+A+      L R
Sbjct: 452 QTGAFSIQNIAKMGFCHDKLPGEWYGPHALTIMLRDLNR 490


>gi|183230042|ref|XP_653798.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|169803042|gb|EAL48412.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449708555|gb|EMD47997.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 359

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 144/344 (41%), Gaps = 52/344 (15%)

Query: 70  IWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRILISYRKGFDPIGDSKITS 123
            ++LGV  K  Q D+ L +      L     A F +  S+   ++YR G++ + +S +T+
Sbjct: 39  FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAATFFR-ISNLFWMTYRSGYEKLPNSSLTT 97

Query: 124 DVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVEILHLFGDS--ETSPFS 179
           DVGWGC +R+ QM++A A+  + +       +    P  +E + +L  F DS   T+P S
Sbjct: 98  DVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIPTKQEVMNVLIPFIDSPNSTTPLS 157

Query: 180 IHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
           IH++ ++        +  +++ P  + +++  L    +                      
Sbjct: 158 IHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLVNSWKL--------------------- 196

Query: 238 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
                 P+ C+  ++         +  + P L+ +P+VL     N      L+  +    
Sbjct: 197 -----CPIRCVMCSNVSIPTHELSKLPFKPTLVFLPIVL-----NHLIHSKLQQIYKSKL 246

Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
             GIVGG    + ++ G      +YLDPH VQP     K   E DT +Y         + 
Sbjct: 247 FAGIVGGMGDRAIFVFGFHALQFLYLDPHIVQPSF---KSFTEIDTKSYSPIGTNRFSVH 303

Query: 358 SIDPS-----LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 396
           +IDP+        GF  ++  + DDF   A  + E SN   L T
Sbjct: 304 TIDPTKLDDFCTFGFLIKNLHEVDDFMKLAKDVFEISNDKELRT 347


>gi|221046296|dbj|BAH14825.1| unnamed protein product [Homo sapiens]
          Length = 280

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/176 (31%), Positives = 81/176 (46%), Gaps = 44/176 (25%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 107 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 156

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 151
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 157 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 216

Query: 152 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP
Sbjct: 217 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP 272


>gi|50291183|ref|XP_448024.1| hypothetical protein [Candida glabrata CBS 138]
 gi|62899752|sp|Q6FP20.1|ATG4_CANGA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|49527335|emb|CAG60975.1| unnamed protein product [Candida glabrata]
          Length = 483

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 77/265 (29%), Positives = 122/265 (46%), Gaps = 39/265 (14%)

Query: 123 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIH 181
           +DVGWGCM+R+ Q L+  AL   R+    + +P     D +  EI  LF D+  S FS+ 
Sbjct: 135 TDVGWGCMIRTGQSLLGNAL--QRVKSTVKDQPYIYEMD-DTKEITDLFKDNTKSAFSLQ 191

Query: 182 NLLQAGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           N ++ G+ Y  +A G W GP         L +         C      I V SGD   E 
Sbjct: 192 NFVKCGRIYNKIAPGEWFGPATTATCIRYLIQENPCYGIEAC-----YISVSSGDIFKEN 246

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTP---ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 297
                              +G  D  P   IL+L+ + LGL+ V+ RY   ++     P 
Sbjct: 247 ------------------IQGMIDRYPNGNILILLGIKLGLDSVHERYWGEIKTMLESPF 288

Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
           S+GI GG+P +S Y  G  +++ ++ DPH+ Q  +    DD +    + H++    ++  
Sbjct: 289 SVGIAGGRPSSSLYFFGYFDDTLLFFDPHNSQTAL---IDDFD---ESCHTENFGKLNFS 342

Query: 358 SIDPSLAIGFY--CRDKDDFDDFCA 380
            +DPS+ +GF   C   D+F +F +
Sbjct: 343 DLDPSMLLGFLLPCSKWDEFQEFTS 367


>gi|76156435|gb|AAX27646.2| SJCHGC05841 protein [Schistosoma japonicum]
          Length = 414

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)

Query: 96  EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 153
           E      SR+ ++YRKGF PIG      SD GWGCM R  QM++A+A+L   LGR WR  
Sbjct: 43  EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 213
           P Q+    EY  +L +F D  +  +SI  +   G + G + GSW GP  + +  + L+  
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160

Query: 214 QR 215
            R
Sbjct: 161 DR 162


>gi|351695136|gb|EHA98054.1| Cysteine protease ATG4A [Heterocephalus glaber]
          Length = 356

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 67/257 (26%), Positives = 102/257 (39%), Gaps = 87/257 (33%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 79  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 127

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 189
           MLR  QM++AQAL+   LGR                                   Q G  
Sbjct: 128 MLRCGQMMLAQALICRHLGRA----------------------------------QMGVG 153

Query: 190 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 249
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 154 EGKSVGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 196

Query: 250 DASRHCSV--FSKGQAD----------------------WTPILLLVPLVLGLEKVNPRY 285
           D  + C +  FS   AD                      W P+LL+VPL LG+ ++NP Y
Sbjct: 197 DIKKMCRILPFSADTADESPPDSFITSNQSKGTSAFCPAWKPLLLIVPLRLGINQINPVY 256

Query: 286 IPTLRLTFTFPQSLGIV 302
           +   + TF   +  G V
Sbjct: 257 VDAFK-TFVDTEENGTV 272


>gi|401427503|ref|XP_003878235.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
 gi|322494482|emb|CBZ29784.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
          Length = 388

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 73/295 (24%), Positives = 123/295 (41%), Gaps = 44/295 (14%)

Query: 92  NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
           +G  EF +  + ++L  SYR  F P+ + + T+D  WGC++R++QMLV   LL +     
Sbjct: 54  DGTTEFVKAAAKKLLYFSYRNCFPPLPN-RSTTDTRWGCLVRTTQMLVGSCLLRYHCKGA 112

Query: 151 WRKPLQKPFDREYVE----ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 206
           +  P     +R+  E    I  LF D  ++P  IH +        +   S + P      
Sbjct: 113 YVLP-----ERDNAELKERISRLFMDVPSAPLGIHKVEDEAHKNSVKYASMLSP------ 161

Query: 207 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHCSVFSKGQADW 265
                     E G+   +  +A +   GD       AP   C ++ +   S      ++ 
Sbjct: 162 ---------TEAGMAIAAALIAFHAQGGD-------APFTFCCENRNIDESAVMAKLSEG 205

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
             ++L++P+VLG+  ++ +Y   L          GI GG   AS Y+ G Q  +  ++DP
Sbjct: 206 QHVILIIPVVLGIAPMSGQYERMLLKILDMKACCGIAGGFKQASLYMFGHQGRNVFFMDP 265

Query: 326 HDVQPVINIGKD--DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           H VQ     G+    LE             +     DP + +GFY     D+ +F
Sbjct: 266 HYVQRAYTSGRTVGTLEGARG--------DLAARRFDPCMVLGFYLHTPADYCEF 312


>gi|402581511|gb|EJW75459.1| peptidase family C54 containing protein [Wuchereria bancrofti]
          Length = 256

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 47/122 (38%), Positives = 64/122 (52%), Gaps = 12/122 (9%)

Query: 85  LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 143
           LG+   + G +A   +  +S +  +YRK F PIG +  T+D GWGCMLR  QML+A+ L+
Sbjct: 30  LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 89

Query: 144 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 198
              LG  W       +DR     EY  IL +F D +   FSIH +   G + G   G W 
Sbjct: 90  VRHLGHNWL------WDRDVKLTEYKRILRMFQDKKNCLFSIHQIANMGVSEGKEIGEWF 143

Query: 199 GP 200
           GP
Sbjct: 144 GP 145


>gi|432110194|gb|ELK33968.1| Cysteine protease ATG4A, partial [Myotis davidii]
          Length = 256

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 11/114 (9%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH +
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQM 129



 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 51/91 (56%), Gaps = 1/91 (1%)

Query: 311 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
           Y +    +  I+LDPH  Q  ++  +D    D + +     + +++ ++DPS+A+GF+C+
Sbjct: 124 YSIHQMGDELIFLDPHTTQTFVDTEEDGTVDDQTFHCLQSPQRMNILNLDPSVALGFFCK 183

Query: 371 DKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           ++ DFD++C+   K   + N   +F + Q H
Sbjct: 184 EEKDFDNWCSLVQKEILKEN-LRMFELVQKH 213


>gi|342181415|emb|CCC90894.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma congolense
           IL3000]
          Length = 327

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 81/290 (27%), Positives = 129/290 (44%), Gaps = 42/290 (14%)

Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 165
           L +YRK F+P+  S IT+D GWGC+ R+SQML+A AL         R+ +   F  +Y  
Sbjct: 45  LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMALDFSFQYFC 95

Query: 166 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 223
            +    D   +PFS+H ++++    G  L    W  P   C   EA++ C R     G  
Sbjct: 96  DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRRAIHRGAL 148

Query: 224 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 282
              + + V         G A  +   + +RH      G A     L+LVP+  G   ++ 
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGKDDLEA 341
            +   +L      P  +G+VGG PG   YI+G   +E  +YLDPH +     +     E+
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIIGTGGQELLLYLDPHCMTQEALVS---CES 249

Query: 342 DTSTYHSDVIRH---IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 388
           DT        RH   +  D +D S  +GF+    + ++D   +   L+ +
Sbjct: 250 DTVGVVRPTPRHLLCVPYDRVDTSFFLGFFVDSFELWEDLQKKIEGLSRQ 299


>gi|118378678|ref|XP_001022513.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89304280|gb|EAS02268.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 649

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 85/358 (23%), Positives = 145/358 (40%), Gaps = 36/358 (10%)

Query: 105 ILISYRKGFDPIGD-----SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 159
           I  SYR  F  I D       +++D GWGCM+R SQML+A+AL  H L     +  Q   
Sbjct: 145 IWFSYRNNFPLIRDVADDNQSVSNDYGWGCMIRCSQMLLAEALKRHYLNDQNIQIEQLSQ 204

Query: 160 DRE---YVEILHLFGD--SETSPFS------------IHNLLQAGKAYGLAAGSWVGPYA 202
           D E   Y  I+ LF D  SE+   +            + N       Y L     +   A
Sbjct: 205 DDEKHFYSNIIKLFLDCTSESDVLNQPGSYQDIQSKMLLNEQNLNNIYSLFGIQNICQSA 264

Query: 203 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---RHCSVFS 259
           + R ++     +   T +    +   I   S  +   + G  ++   D     +     S
Sbjct: 265 ILRQYQQ--NVKNWYTSIQVSVILQEILEESQSKLNSKLGFHILNFTDQIIFLKELEEAS 322

Query: 260 KGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 318
           + Q D    IL++V L  G+ K   ++             +G + G      YI+G QE+
Sbjct: 323 RKQNDRLNNILVMVHLKFGINKFEMQHKDYFIELLKIKNFVGALSGTETKGMYIIGFQED 382

Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR--DKDDFD 376
             I LDPH +Q     G+  L+ D  TY +   R I L+ +   +++G++ +  ++   +
Sbjct: 383 RLIVLDPHFIQKSTE-GEQGLDKDYCTYFNKTPRSISLECLSSDISLGYFIQVNEEQSIN 441

Query: 377 DFCARASKLAEESNGAPLFTV----TQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 430
            F  +   L E+ +  PL ++     +T +  +    +  E       DS+  +S N+
Sbjct: 442 QFIDQILTLNEK-HKEPLLSILNDRIETDEMEIEEHQINKEVKDQENQDSVNNISQNE 498


>gi|407852207|gb|EKG05835.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
           Clan CA, family C54, putative [Trypanosoma cruzi]
          Length = 328

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 69/285 (24%), Positives = 121/285 (42%), Gaps = 43/285 (15%)

Query: 89  AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
             NN     N   +   L++YR  F P+  S +TSD GWGC++RSSQML+A AL      
Sbjct: 28  VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81

Query: 149 RPWRKPLQKPFDREYVEILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 205
             WR          + ++     D+E ++PFS+H +++A   KA       W        
Sbjct: 82  --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWTPSQ---- 130

Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 263
                          GC+++   +     +   +R   P + +   S+ C +  +     
Sbjct: 131 ---------------GCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170

Query: 264 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
           ++  +L+L P+  G   ++      +L         +G+VGG P  S YI+G   +  +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMKFFSLEHLLHSSTCIGVVGGVPQRSYYILGTSGQRLLY 230

Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
           LDPH +     +     +A   T  + +++ +  D +D S  +GF
Sbjct: 231 LDPHCMTQEALVSSHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275


>gi|66359342|ref|XP_626849.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
 gi|46228139|gb|EAK89038.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
          Length = 348

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 94/347 (27%), Positives = 146/347 (42%), Gaps = 68/347 (19%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSK------------ITSDVGWGCMLRSSQMLVAQALLF 144
           F ++F   IL +YR  F  I  ++            I SDVGWGCM R +QM +A  +  
Sbjct: 44  FLKEFHDIILFTYRNEFKNIIITRNTVQLTKNYSKNINSDVGWGCMYRVTQMSIAHGIC- 102

Query: 145 HRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAGSWVGPYAM 203
                 + K      + E  +IL+ F D+E++ FSIHN++  G + +G+   SW+GP   
Sbjct: 103 -----QFMKRFLGNLNIE--KILNNFQDNESAKFSIHNMVNIGLSEFGIDPTSWIGPTTS 155

Query: 204 CRSWEALARCQRAETGLGCQSLPMA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 262
                 L    R+       ++ +A I  V G           +  D A +H   FS+  
Sbjct: 156 SMIANKLINDNRSIIS----NIQIASITYVEG----------TIYRDQAVKH---FSEVG 198

Query: 263 ADWTPILLLVPLVLGLEKVNPR-YIPTLRLTFTFPQSLGIVGGKPGAS--TYIVGVQEES 319
           +D    + L  + LG  K N   Y  T+       Q + I+GG   +S    IV      
Sbjct: 199 SDSCTFVWLC-MKLGTSKFNINSYKKTVISMSNVSQFICIMGGNNYSSGALLIVAFSNSF 257

Query: 320 AIYLDPH-DVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
              LDPH  V P     N  +DD      T        I+   ++ SL++ + CR+ +DF
Sbjct: 258 LYCLDPHIKVLPSFSDKNFIRDDFIQKVPT-------RIYWGELNSSLSMVYICRNLEDF 310

Query: 376 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDS 422
           DD C+  +++      + LF V       +N+ D   E   + E DS
Sbjct: 311 DDLCSNLTRI-----NSDLFEV-------INNCDF--EVKSINELDS 343


>gi|440301471|gb|ELP93857.1| hypothetical protein EIN_176840 [Entamoeba invadens IP1]
          Length = 362

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 73/335 (21%), Positives = 143/335 (42%), Gaps = 45/335 (13%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ-----DFSSRILISYRKGFDPIGDSKITSD 124
           ++LLG+ +K    +        + L +++        S+ + ++YR G++ + +S + +D
Sbjct: 39  LFLLGIEYKTTPLKKQAQELPQSSLLQYSSMAAYVRMSNLLWMTYRSGYEKLPNSSLNTD 98

Query: 125 VGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVEILHLFGD--SETSPFSI 180
           VGWGC +R+ QM+++ A+  L ++           P   E + ++  F D   +T+P SI
Sbjct: 99  VGWGCTIRAVQMMISNAMQTLVYKHDLTSSTTPYIPKQNEILNVVIPFVDFFEQTTPLSI 158

Query: 181 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           H++                       +E+    ++ ++G+   + P  +     D     
Sbjct: 159 HHV-----------------------YESRFVVEQNKSGVNYLA-PTIVAKAYSDLVNSW 194

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
               + C+  ++    +    +  + P L+ +P+++  + V  R    L+  + F    G
Sbjct: 195 KMCALRCVMASNTSIPLCDIKKEPFKPTLVFLPIIMD-QLVKSR----LQQIYKFNMFAG 249

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVI-NIGKDDLEAD---TSTYHSDVIRHIHL 356
           IV G    + YI G      ++LDPH VQP   +  K DL++      T +   I  I L
Sbjct: 250 IVSGIGDRAVYIFGFHVMRCLFLDPHTVQPAAESFTKIDLKSYAPINPTLNRFAIHSIEL 309

Query: 357 DSIDPSLAIGFYCR---DKDDFDDFCARASKLAEE 388
           D ID     GF  +   + D F+ FC     ++ E
Sbjct: 310 DKIDQFCTFGFLIKSLEEVDAFEKFCTETFDISHE 344


>gi|71425372|ref|XP_813094.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
           Brener]
 gi|70877946|gb|EAN91243.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 328

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 69/285 (24%), Positives = 121/285 (42%), Gaps = 43/285 (15%)

Query: 89  AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
             NN     N   +   L++YR  F P+  S +TSD GWGC++RSSQML+A AL      
Sbjct: 28  VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81

Query: 149 RPWRKPLQKPFDREYVEILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 205
             WR          + ++     D+E ++PFS+H +++A   KA       W        
Sbjct: 82  --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127

Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 263
                          GC+++   +     +   +R   P + +   S+ C +  +     
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170

Query: 264 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 322
           ++  +L+L P+  G   ++      +L         +G+VGG P  S YI+G   +  +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLY 230

Query: 323 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
           LDPH +     +     +A   T  + +++ +  D +D S  +GF
Sbjct: 231 LDPHCMTQEALVSGHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275


>gi|71407017|ref|XP_806004.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
           Brener]
 gi|70869620|gb|EAN84153.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
 gi|111154177|gb|ABH07410.1| autophagin-1 [Trypanosoma cruzi]
          Length = 328

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 71/287 (24%), Positives = 121/287 (42%), Gaps = 47/287 (16%)

Query: 89  AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
             NN     N   +   L++YR  F P+  S +TSD GWGC++RSSQML+A AL      
Sbjct: 28  VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81

Query: 149 RPWRKPLQKPFDREYVEILHLFGDSET---SPFSIHNLLQA--GKAYGLAAGSWVGPYAM 203
             WR      +      + H F D +T   +PFS+H +++A   KA       W      
Sbjct: 82  --WR------YSANDCRLDH-FRDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWT----- 127

Query: 204 CRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--G 261
                            GC+++   +     +   +R   P + +   S+ C +  +   
Sbjct: 128 --------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICS 168

Query: 262 QADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 320
             ++  +L+L P+  G   ++      +L         +G+VGG P  S YI+G   +  
Sbjct: 169 NLEFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRL 228

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
           +YLDPH +     +     +A   T  + +++ +  D +D S  +GF
Sbjct: 229 LYLDPHCMTQEALVSSHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275


>gi|407417199|gb|EKF38000.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
          Length = 328

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 67/269 (24%), Positives = 116/269 (43%), Gaps = 43/269 (15%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
            L++YR  F P+  S +TSD GWGC++RSSQML+A AL        WR          + 
Sbjct: 44  FLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSANDCRLDHFR 95

Query: 165 EILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
           +I     D+E ++PFS+H +++A   KA       W                       G
Sbjct: 96  DI-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT-------------------PSQG 131

Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGL- 278
           C+++   +     +   +R   P + +   S+ C +  +     ++  +L+L P+  G  
Sbjct: 132 CEAIRCCV-----NNAVDRRLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCGAS 186

Query: 279 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 338
            ++      +L         +G+VGG P  S YI+G   +  +YLDPH +     +    
Sbjct: 187 RRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLYLDPHCMTQEALVSSHA 246

Query: 339 LEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
             A   T  + +++ +  D +D S  +GF
Sbjct: 247 ERAGVVTVTASLVKSVRWDCVDTSCFLGF 275


>gi|118378680|ref|XP_001022514.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89304281|gb|EAS02269.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 371

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 75/304 (24%), Positives = 130/304 (42%), Gaps = 40/304 (13%)

Query: 99  QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
           ++ SS + +SY+K         + IT+D GWGC LR+SQM++AQ L  H   +  +  + 
Sbjct: 52  EELSSLVFLSYKKNMKEFQYLSTTITTDNGWGCSLRTSQMMLAQGLKRHLYEKRVQSFIY 111

Query: 157 KPFDREYVEILHL---FGDSET------SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 207
              D+  ++  HL   F +S +      SPF  H+LL   +A  L        Y   +  
Sbjct: 112 N--DKTKLDFQHLIMMFAESNSLENMDQSPFGFHSLL--TQAINLFQVPLKQQYTPVQGI 167

Query: 208 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 267
           +AL +          Q L  ++ +V+           V+  +D  +    + K       
Sbjct: 168 KALKQ------QFKQQKLVKSLKIVT-------SSTGVIFQEDIRQKMKNWEKS------ 208

Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
           +LL++   LG  K+N  Y+  ++        +G +GG    S ++VG   +  + LDPH 
Sbjct: 209 LLLILHFKLGTGKLNQIYVEQIKSLMDLEYFVGAIGGIKNKSLFMVGYMNDQFLSLDPHV 268

Query: 328 VQPVINIGKDDLEADTSTYHSDVIRHIHLDS---IDPSLAIGFYCRDKDDFDDFCARASK 384
            Q   N  KD L  +     S   + +  DS    +   +I FY R +  ++ F  + S 
Sbjct: 269 QQ---NACKDPLNLNDEEMSSFFPKKVRADSCVKYEGDFSISFYIRSEKQYNIFLQKISN 325

Query: 385 LAEE 388
           L ++
Sbjct: 326 LNKQ 329


>gi|398021304|ref|XP_003863815.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
 gi|322502048|emb|CBZ37132.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
          Length = 388

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 87/359 (24%), Positives = 144/359 (40%), Gaps = 50/359 (13%)

Query: 92  NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
           +G  EF +  + ++L  SYR  F P+ +   T+D  WGC++R++QMLV   LL +     
Sbjct: 54  DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGA 112

Query: 151 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
           +  P     + E  E I  LF D  ++P  IH          +   S + P         
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161

Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID----DASRHCSVFSKGQADW 265
                  E G+   +  +A +   GD           C +    D     +  S+GQ   
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGDVPF------TFCCESRNIDEPAVMAKLSEGQH-- 207

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
             ++L++P+VLG+  ++ +Y   +          GI GG   AS Y+ G Q  S  ++DP
Sbjct: 208 --VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMDP 265

Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIR-HIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           H +Q       +   +D +    +  R  +     DP + +GFY    +D+  F A    
Sbjct: 266 HYIQ-------NAYTSDRTVGTLEGARGELSARRFDPCMVLGFYLHTLEDYRVF-AEELA 317

Query: 385 LAEESNGAPLFTVTQTHKKPVNHSD------VLGETGGVPEDDSLGVMSMND-AVGNAH 436
           +A      PL +  Q  ++    SD         E G +P ++    +S N  A G  H
Sbjct: 318 VANSLVAFPLISFGQRPREGTTPSDNGVVSVAESEEGIMPHENEKSQLSPNPLAAGGGH 376


>gi|146097214|ref|XP_001468076.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
 gi|134072442|emb|CAM71152.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
          Length = 388

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 87/359 (24%), Positives = 144/359 (40%), Gaps = 50/359 (13%)

Query: 92  NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
           +G  EF +  + ++L  SYR  F P+ +   T+D  WGC++R++QMLV   LL +     
Sbjct: 54  DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGT 112

Query: 151 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
           +  P     + E  E I  LF D  ++P  IH          +   S + P         
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161

Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID----DASRHCSVFSKGQADW 265
                  E G+   +  +A +   GD           C +    D     +  S+GQ   
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGDVPF------TFCCESRNIDEPAVMAKLSEGQH-- 207

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
             ++L++P+VLG+  ++ +Y   +          GI GG   AS Y+ G Q  S  ++DP
Sbjct: 208 --VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMDP 265

Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIR-HIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 384
           H +Q       +   +D +    +  R  +     DP + +GFY    +D+  F A    
Sbjct: 266 HYIQ-------NAYTSDKTVGTLEGARGELSARRFDPCMVLGFYIHTLEDYRVF-AEELV 317

Query: 385 LAEESNGAPLFTVTQTHKKPVNHSD------VLGETGGVPEDDSLGVMSMND-AVGNAH 436
           +A      PL +  Q  ++    SD         E G +P ++    +S N  A G  H
Sbjct: 318 VANSLVAFPLISFGQRPREGTTPSDNGVVSVAESEEGIMPHENEKSQLSPNPLAAGGGH 376


>gi|148707987|gb|EDL39934.1| autophagy-related 4B (yeast), isoform CRA_c [Mus musculus]
          Length = 128

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 46/120 (38%), Positives = 64/120 (53%), Gaps = 15/120 (12%)

Query: 66  STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 123
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 19  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 65

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 66  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 125


>gi|425784144|gb|EKV21938.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
           digitatum Pd1]
          Length = 208

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 67/145 (46%), Gaps = 30/145 (20%)

Query: 85  LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------I 121
           L D A  N    F  DF SRI I+YR  F PI  +K                        
Sbjct: 59  LNDTAWPNA---FVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGF 115

Query: 122 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 181
           TSD GWGCM+RS Q L+A A     LGR WR+  +   + E  +++ +F D   +PFSIH
Sbjct: 116 TSDTGWGCMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIH 172

Query: 182 NLLQAG-KAYGLAAGSWVGPYAMCR 205
             +  G ++ G   G W GP A  +
Sbjct: 173 KFVNRGAESCGKYPGEWFGPSATAK 197


>gi|297601024|ref|NP_001050279.2| Os03g0391000 [Oryza sativa Japonica Group]
 gi|255674556|dbj|BAF12193.2| Os03g0391000, partial [Oryza sativa Japonica Group]
          Length = 81

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 35/48 (72%), Positives = 41/48 (85%)

Query: 284 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 331
           RYIP L+ T TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ V
Sbjct: 10  RYIPLLKETLTFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLV 57


>gi|322701885|gb|EFY93633.1| cysteine protease atg4 [Metarhizium acridum CQMa 102]
          Length = 255

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 52/140 (37%), Positives = 67/140 (47%), Gaps = 27/140 (19%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVG 126
           G    A F  DF+SR  ++YR  F       DP                +  S  TSD G
Sbjct: 116 GTGWPAAFLDDFASRFWMTYRSNFELIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSG 175

Query: 127 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 186
           WGCM+RS Q L+A AL    LGR WR+ +    DRE   +L LF D   +P+S+HN ++ 
Sbjct: 176 WGCMIRSGQSLLANALAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSVHNFVRH 232

Query: 187 GKAY-GLAAGSWVGPYAMCR 205
           G+ Y     G W GP A  R
Sbjct: 233 GEKYCSKYPGEWFGPSATAR 252


>gi|225554849|gb|EEH03143.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 425

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 55/172 (31%), Positives = 75/172 (43%), Gaps = 27/172 (15%)

Query: 58  PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF---- 113
           P+R+  S++     LL           LG     +    F  DF S+I ++YR  F    
Sbjct: 85  PTRSSDSATKPQRHLLPFAIHRGSTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLIP 144

Query: 114 ---DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 154
              DP                +     T+D GWGCM+RS Q L+A AL    LGR WR+ 
Sbjct: 145 KSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRRG 204

Query: 155 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCR 205
            +    +E  ++L LF D   +PFSIH  ++ G  A G   G W GP A  R
Sbjct: 205 TKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATAR 253



 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 6/109 (5%)

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD-----LEADTSTYHSDVIRHIHL 356
           + G+P +S Y +G Q     YLDPH  +P + + +D         + +TYH+  +R +H+
Sbjct: 255 IHGRPSSSHYFIGAQGSHFFYLDPHHTRPAL-VYRDAGDRPYTTEELNTYHTRRLRRLHI 313

Query: 357 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 405
             +DPS+ IGF  RD+DD++ +       A    G  +  V    K P 
Sbjct: 314 KDMDPSMLIGFLIRDEDDWNSWKRSVHNGAMIGTGKAIIHVFDKEKSPF 362


>gi|440300801|gb|ELP93248.1| hypothetical protein EIN_056230 [Entamoeba invadens IP1]
          Length = 321

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 81/323 (25%), Positives = 120/323 (37%), Gaps = 91/323 (28%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           I+ L   H    + +  DAA               I I+YR+ +  +G + +TSD GWGC
Sbjct: 38  IFGLSYTHDTPSELSFADAA---------HRIHDLITITYRQKYATLGHTYLTSDAGWGC 88

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH---------LFGDSETSPFSI 180
            +RS QML+  +++ +         L K F  EY    H         L  D E+S  SI
Sbjct: 89  AIRSVQMLLVNSIVVY---------LDKSFHPEYTSHDHIAIKNNAKQLVFDKESSVLSI 139

Query: 181 HNL-LQAGKAYGLAAGSWVGPYAMCRS--------WEALARCQRAETGLGCQSLPMAIYV 231
           HN+ +Q         G+   P + C +        WE     +R    L C         
Sbjct: 140 HNIYIQDAIIKHNPTGTNFLPPSTCATAVADLYNFWE-----KRTFDVLMCTEY------ 188

Query: 232 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 291
                           I + ++             P LL +P ++   + N      ++ 
Sbjct: 189 ----------------IPEVTQ-------------PTLLFIPRIVTKSERN-----FIQT 214

Query: 292 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 351
           T   PQS G V G   A+ Y  GVQE+   +LDPH VQ    +G          Y +  I
Sbjct: 215 TSFLPQSRGFVAGIGDAAIYCFGVQEKRVFFLDPHFVQDASEVG----------YFNRPI 264

Query: 352 RHIHLDSIDPSLAIGFYCRDKDD 374
              + D +D S   G  C +K D
Sbjct: 265 FEANFDELDNSFVFGMMCENKSD 287


>gi|238594668|ref|XP_002393548.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
 gi|215461192|gb|EEB94478.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
          Length = 142

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 49/144 (34%), Positives = 65/144 (45%), Gaps = 40/144 (27%)

Query: 83  EALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI--------------------- 121
           + +G  +G N   EF  DF+S++ ++YR  F PI D+ +                     
Sbjct: 3   DMVGTTSGANWPPEFTADFTSKVWLTYRSHFTPIRDTNLADLPLPSIFWKKWGWGLPGLG 62

Query: 122 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 176
                TSD GWGCMLR+ Q L+A AL+F  LGR WR+P   P   E             S
Sbjct: 63  GERGWTSDSGWGCMLRTGQSLLANALVFMWLGREWRRP-PAPMPTE-------------S 108

Query: 177 PFSIHNLLQAGKAYGLAAGSWVGP 200
             S+H +  AGK  G   G W GP
Sbjct: 109 YASVHRMALAGKELGKDVGQWFGP 132


>gi|157874465|ref|XP_001685715.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
 gi|68128787|emb|CAJ08920.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
          Length = 388

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 83/323 (25%), Positives = 135/323 (41%), Gaps = 52/323 (16%)

Query: 92  NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 150
           +G  EF +  + ++L  SYR  F P+  S  T+D  WGC++R++QMLV   LL +     
Sbjct: 54  DGTTEFVKVATKKLLYFSYRNCFPPL-PSGSTTDTHWGCLVRTTQMLVGTCLLRYHCKGA 112

Query: 151 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 209
           +  P  +  + E  E I  LF D  ++P  IH          +   S + P         
Sbjct: 113 YVLP--EADNAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161

Query: 210 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHC---SVFSKGQADW 265
                  E G+   +  +A     GD        P   C +  SRH    +V +K   + 
Sbjct: 162 ------TEAGMAIAAALIAFRAQGGD-------VPFTFCCE--SRHIDEPAVMAK-LLEG 205

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 325
             ++L++P+VLG+  ++ +Y   +          GI GG   AS Y+ G Q  S  ++DP
Sbjct: 206 QHVVLIIPVVLGIAPMSDQYELVMLKILDVKACCGIAGGFKQASLYMFGHQGRSVFFMDP 265

Query: 326 HDVQPVINIGKDDLEADTSTYHSDVIR----HIHLDSIDPSLAIGFYCRDKDDFDDFCAR 381
           H VQ           A TS+     +      +     DP + +GFY    +D+  F   
Sbjct: 266 HYVQ----------NAYTSSRTVGTLEGSRGELRARRFDPCMVLGFYLHTPEDYRVF--- 312

Query: 382 ASKLAEESNGAPLFTVTQTHKKP 404
           A +LA  +N   +F +    ++P
Sbjct: 313 AEELA-VANSLVVFPLISFGRRP 334


>gi|167391747|ref|XP_001739914.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165896205|gb|EDR23684.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 325

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 68/314 (21%), Positives = 129/314 (41%), Gaps = 67/314 (21%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +++LG C+    +E L     N+     N      I+ +YR+ +  +G++ ++SD GWGC
Sbjct: 36  VYILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSCLGNTYLSSDAGWGC 91

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------REYVEILHLFGDSETSPFSIHN 182
            +R++QM++   L+       ++  +Q+  D       +  ++   L  D  +S  SIHN
Sbjct: 92  AIRATQMMIVNTLVI------FKDQMQQIIDYNSFEHQQNKLQAKELIYDKISSLLSIHN 145

Query: 183 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           +   +  K +     +++ P   C +  +L +                       E  ++
Sbjct: 146 IYIQEIIKVHNPTGTNFLPPSICCIAISSLLQ-----------------------EWDKK 182

Query: 241 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 300
               + C+D    +CS          P L L+P ++   +        +  + T  QS G
Sbjct: 183 LFNCITCLDHIP-NCSY---------PTLYLIPQIITFTEHQ-----LILDSLTLSQSRG 227

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
            VGG   ++ ++ G Q  +  +LDPH VQ   + G          Y +     I L  I 
Sbjct: 228 FVGGIGESAIFVFGYQGTTLFFLDPHYVQNAGDFG----------YFNPPTYQIDLSLIS 277

Query: 361 PSLAIGFYCRDKDD 374
           PS+   F C +++D
Sbjct: 278 PSIVFAFMCYNEND 291


>gi|384496645|gb|EIE87136.1| hypothetical protein RO3G_11847 [Rhizopus delemar RA 99-880]
          Length = 224

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 5/101 (4%)

Query: 83  EALGDAAGNNGLA-----EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 137
           E + +   NN +      +F  DF+SR+ ++YR  + PI  S   +D+GWGCMLRS Q L
Sbjct: 120 EEISEEEDNNNMYLRWPLDFYDDFTSRLWMTYRHNYPPIRPSNHKTDIGWGCMLRSGQSL 179

Query: 138 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 178
           +A  L+ H LGR WR+  Q    R+ + I  L    +  PF
Sbjct: 180 LANTLIIHFLGRDWRRQTQNQTTRKELCIGFLMSYHQEHPF 220


>gi|145507452|ref|XP_001439681.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406876|emb|CAK72284.1| unnamed protein product [Paramecium tetraurelia]
          Length = 312

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/298 (23%), Positives = 125/298 (41%), Gaps = 59/298 (19%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
           FNQ   + I   YR      G  K  SD GWGC++R  QM++A AL+        R+   
Sbjct: 49  FNQKKDTLIWFCYRANIQFEG--KAISDQGWGCLVRVGQMMLANALM--------RECKI 98

Query: 157 KPFDREYVEILHLFGDSE----TSPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 210
              ++    I+HLF D++     +PFSI  +++ A     +  G W  GP  M       
Sbjct: 99  LAINKTKAMIIHLFDDNQEYSTIAPFSIQQIIKRASINLNMKIGDWYTGPKIM------- 151

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT---P 267
                                 S  ED  +    +  I+  +       + Q D +   P
Sbjct: 152 ----------------------SVIEDLNKNNMNIKQINLVNFLEQCVLESQIDLSFKKP 189

Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
            LL++  ++G + +    I  L+      Q  G + GK   + +++G Q+ +AI++DPH 
Sbjct: 190 HLLIIHAIIGDKSLGQLEIQNLQSHMQISQFAGAIIGKNNKAFFLIGFQKNNAIFMDPHY 249

Query: 328 VQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
           VQ      K ++E +        ++   L  ++ ++A+ FY  +  ++ +F  + +KL
Sbjct: 250 VQES---NKIEMECN--------LKCQPLKQLNGTIALAFYISNYMEYLEFKKQVNKL 296


>gi|403354729|gb|EJY76927.1| hypothetical protein OXYTRI_01553 [Oxytricha trifallax]
          Length = 564

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 83/357 (23%), Positives = 140/357 (39%), Gaps = 83/357 (23%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 176
           +T+D  WGC +RS+QM++A AL             Q  F      IL LF D+      S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQ------------QSTFMYPVNSILKLFDDNIRECTES 261

Query: 177 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 211
            FSI N+    LQ G+     YG+++ + +             + +C        +E + 
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGFIVFETIM 321

Query: 212 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--------- 260
           +  CQ        Q L     V++  +  E         DD +     FS+         
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLT-----FSQMGLGCDRRI 375

Query: 261 ---------------GQADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
                             +W   +L++V + LGL+K++P Y   +      PQ +G+VGG
Sbjct: 376 NYDKLPNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGG 435

Query: 305 KPGASTYIVG------VQEESAIYLDPHDVQP-VINIGKD-DLEA-DTSTYHSDVIRHIH 355
           KP  + Y  G        +   ++LDPH VQ    N+    DL+  + + +H+   R + 
Sbjct: 436 KPNKAFYFFGHIIDQDTNKVKLMFLDPHKVQDYTYNVETSYDLDVKEQAKFHTTEARLLK 495

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           +  +D  L  GF  +   DF+ F        +E     +F++ Q   +  N+S +  
Sbjct: 496 IKELDTCLGFGFLIKSLQDFNQFKTLLESNIQEDLDHSIFSLYQHESELDNNSQMFS 552


>gi|213514936|ref|NP_001135074.1| Cysteine protease ATG4A [Salmo salar]
 gi|209738482|gb|ACI70110.1| Cysteine protease ATG4A [Salmo salar]
          Length = 102

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 35/83 (42%), Positives = 49/83 (59%), Gaps = 11/83 (13%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           +W+LG C+ +  ++            E   D  SR+  +YRK F PIG +  +SD GWGC
Sbjct: 29  VWVLGECYNVKTEKT-----------ELLSDVHSRLWFTYRKKFSPIGGTGPSSDTGWGC 77

Query: 130 MLRSSQMLVAQALLFHRLGRPWR 152
           MLR  QM++AQAL+  +LGR WR
Sbjct: 78  MLRCGQMILAQALVCSQLGRAWR 100


>gi|403370248|gb|EJY84987.1| hypothetical protein OXYTRI_17161 [Oxytricha trifallax]
          Length = 564

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 81/357 (22%), Positives = 137/357 (38%), Gaps = 83/357 (23%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 176
           +T+D  WGC +RS+QM++A AL             Q  F      IL LF D+      S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQ------------QSTFMYPVNSILKLFDDNIRECTES 261

Query: 177 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 211
            FSI N+    LQ G+     YG+++ + +             + +C        +E + 
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGYIVFETIM 321

Query: 212 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--------- 260
           +  CQ        Q L     V++  +  E         DD +     FS+         
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLT-----FSQMGLGCDRRI 375

Query: 261 ---------------GQADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 304
                             +W   +L++V + LGL+K++P Y   +      PQ +G+VGG
Sbjct: 376 NYDKLPNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGG 435

Query: 305 KPGASTYIVG------VQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHSDVIRHIH 355
           KP  + Y  G        +   ++LDPH VQ     +    D    + + +H+   R + 
Sbjct: 436 KPNKAFYFFGHIIDLDTNKVKLMFLDPHKVQDYTYDVETSYDLDVKEQAKFHTTEARLLK 495

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           +  +D  L  GF  +   DF+ F        +E     +F++ Q   +  N+S +  
Sbjct: 496 IKELDTCLGFGFLIKSLQDFNQFKTLLESNIQEDLDHSIFSLYQHESELDNNSQMFS 552


>gi|119604523|gb|EAW84117.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_b
           [Homo sapiens]
          Length = 228

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 12/96 (12%)

Query: 59  SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 118
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 119 SKITSDVGWGCMLRSSQMLVAQALLFHRL--GRPWR 152
             +TSD GWGCMLRS QM++AQ LL H L  G+PWR
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGKPWR 169


>gi|154343631|ref|XP_001567761.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134065093|emb|CAM43207.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 398

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 71/300 (23%), Positives = 120/300 (40%), Gaps = 36/300 (12%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
           +  SYR  F P+ +   T+D  WGC+LR++QML+   LL +     +  P     + +  
Sbjct: 74  LYFSYRSCFPPLPNGS-TTDTRWGCLLRTTQMLIGTCLLRYHCKGAYVLPEADNAELK-A 131

Query: 165 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 224
            I  LF D  ++P  IH          +   S + P                E G+    
Sbjct: 132 NISRLFMDVPSAPLGIHRAEDEAHKNCVKYASMLSP---------------TEAGMA--- 173

Query: 225 LPMAIYVVSGDEDGERGGAPVV--CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 282
             MA  +++   +G  G  P    C +      +V +K   +   ++L++P+VLGL  ++
Sbjct: 174 --MAAALIACHAEG--GDVPFTFSCENRNIDEPAVVAK-LLEGQHVILIIPVVLGLAPLS 228

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 342
            +Y   +          GI GG   AS Y+ G Q     ++DPH +Q       D     
Sbjct: 229 DKYESMMLKILDMKACCGIAGGFKQASFYMFGHQGRKVFFMDPHYIQKAYT--SDKTAGT 286

Query: 343 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 402
                 D+         DP + +GFY    +D+  F   A +LA  ++      ++ +HK
Sbjct: 287 LYGARGDLTAR----KFDPCMVLGFYLHTLEDYRVF---AEELAVVNSLVTFPLISWSHK 339


>gi|302915349|ref|XP_003051485.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256732424|gb|EEU45772.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 355

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 69/228 (30%), Positives = 103/228 (45%), Gaps = 36/228 (15%)

Query: 79  IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRSS- 134
           +A DE   D   N    +F  DF SRI ++YR  F+ I  S   + TS +     L+S  
Sbjct: 99  LAYDEPTKD---NGWPPQFMADFESRIWMTYRSEFEAIPRSTNPQATSSLSLSMRLKSQL 155

Query: 135 ---QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
                  + +++  RLGR WR+  Q P   E  EI+ LF D   +P+S+H+ ++ G  A 
Sbjct: 156 GDQSPFSSDSMI--RLGRDWRR-GQSP--HEEREIIKLFADHPNAPYSLHSFVRHGASAC 210

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
           G   G W GP A  R  +ALA    +          + +Y          G  P V  D+
Sbjct: 211 GKYPGEWFGPSATARCIQALANSHESS---------LRVYST--------GDGPDVYEDE 253

Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
             +      +G+A + P L+LV   LG++K+ P Y   L  +   PQS
Sbjct: 254 FMKIAK--PEGEA-FHPTLILVGTRLGIDKITPVYWEALIASLQMPQS 298


>gi|14043289|gb|AAH07639.1| ATG4D protein [Homo sapiens]
 gi|16877152|gb|AAH16845.1| ATG4D protein [Homo sapiens]
 gi|119604522|gb|EAW84116.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|325464017|gb|ADZ15779.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
           construct]
          Length = 141

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 62/110 (56%), Gaps = 7/110 (6%)

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y +G Q++  +YLDPH  QP +++ + D   +  ++H    R +    +DP
Sbjct: 1   MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDP 58

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHS 408
           S  +GFY  D+ +F+  C+  +++   S+     P+FT+ + H +  +HS
Sbjct: 59  SCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQ--DHS 106


>gi|395756856|ref|XP_002834509.2| PREDICTED: cysteine protease ATG4D-like [Pongo abelii]
          Length = 141

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 59/105 (56%), Gaps = 5/105 (4%)

Query: 302 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 361
           +GGKP  S Y +G Q++  +YLDPH  QP +++ + +   +  ++H    R +    +DP
Sbjct: 1   MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE--SFHCTSPRKMAFAKMDP 58

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 403
           S  +GFY  D+ +F+  C+  +++   S+     P+FT+ + H +
Sbjct: 59  SCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQ 103


>gi|324519641|gb|ADY47439.1| Cysteine protease ATG4C, partial [Ascaris suum]
          Length = 282

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 74/146 (50%), Gaps = 22/146 (15%)

Query: 70  IWLLGVCHKIAQ----DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 125
           +WLLG  +  ++    DE +  A        F  D+ SRI ++YR    P+  S  T+D 
Sbjct: 116 LWLLGEFYFTSRPDEDDEVVFRA--------FAIDYYSRIWLTYRTELSPLPGSSKTTDC 167

Query: 126 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE-----YVEILHLFGDSETSPFSI 180
           GWGC LR+ QM++AQAL+   LGR WR    +  +R      + +I+ LFGD   +   +
Sbjct: 168 GWGCTLRTCQMMLAQALVVLHLGREWRFWGDEEANRYRCGFGHYDIVSLFGDHLDADLGL 227

Query: 181 HNLLQAGKAYGL--AAGSWVGPYAMC 204
           + L++  K      A G+W   Y+ C
Sbjct: 228 YRLMKIAKERNEHDAVGNW---YSAC 250


>gi|336259147|ref|XP_003344378.1| hypothetical protein SMAC_08321 [Sordaria macrospora k-hell]
          Length = 429

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 58/114 (50%), Gaps = 26/114 (22%)

Query: 97  FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 133
           F  DF SRI ++YR  F       DP                   +  +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239

Query: 134 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 187
            Q L+A A+L  RLGR WR+  +   D E  +I+ LF D   +PFS+HN ++ G
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYG 290



 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 46/76 (60%), Gaps = 3/76 (3%)

Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSID 360
           G+P +S Y +GVQ +   YLDPH  +P +   +D       +  T H+  +R +H+D +D
Sbjct: 302 GRPSSSHYFIGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELDTCHTRRLRQLHIDDMD 361

Query: 361 PSLAIGFYCRDKDDFD 376
           PS+ IGF  +D+DD+D
Sbjct: 362 PSMLIGFLIKDEDDWD 377


>gi|238595999|ref|XP_002393933.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
 gi|215462138|gb|EEB94863.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
          Length = 158

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 46/68 (67%)

Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
           LGL+ VNP Y  T+++ +TFPQS+GI GG+P +S Y VG Q ++  YLDPH  +P + + 
Sbjct: 1   LGLDGVNPIYYDTIKILYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPLR 60

Query: 336 KDDLEADT 343
              LE ++
Sbjct: 61  PPTLEPES 68


>gi|440291586|gb|ELP84849.1| hypothetical protein EIN_284050 [Entamoeba invadens IP1]
          Length = 352

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 67/278 (24%), Positives = 117/278 (42%), Gaps = 57/278 (20%)

Query: 109 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--- 165
           YR  F P+ ++ +TSD GWGC +RS+QMLVA A+          K     FD   V    
Sbjct: 92  YRNNFQPLPNTTLTSDSGWGCTIRSTQMLVANAI---------GKLFTNDFDTGEVTDKM 142

Query: 166 ILHLFGD--SETSPFSIHNLL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 221
           ++  F D  S   PFSIHNL   +A     +   S++ P A+  ++  + + + A    G
Sbjct: 143 VIKFFLDFFSVECPFSIHNLFLTKAILQGNINGNSFLPPSAVAAAFVEINK-KLANPKFG 201

Query: 222 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 281
            + L                          +    V+++      P ++L+P+ +  +  
Sbjct: 202 MEIL------------------------TTTFTFRVYTQ------PTIVLIPISIP-DSF 230

Query: 282 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 341
           N +    + + F+F    G+VGG    + Y  G+  +  ++LDPH V+   N   +    
Sbjct: 231 NDK----IAVIFSFYLFSGMVGGSGRKAFYFFGIHHDQLLFLDPHTVR---NTVINSCSF 283

Query: 342 DTSTYHSDV--IRHIHLDSIDPSLAIGFYCRDKDDFDD 377
           D   YH  +  ++ +    +D S  + F    + + DD
Sbjct: 284 DPQEYHPIIGDVKALSYSLLDRSAVLAFVVTSQRELDD 321


>gi|403364614|gb|EJY82073.1| hypothetical protein OXYTRI_20407 [Oxytricha trifallax]
          Length = 806

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 60/110 (54%), Gaps = 2/110 (1%)

Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
           +++++ + LGLE +   Y   L+  F+  Q +GI+GGKP  + Y VG Q++  I+LDPH 
Sbjct: 641 LMIIMTIRLGLENIEQDYHKALKACFSLRQCVGILGGKPNFALYFVGYQQDHMIFLDPHY 700

Query: 328 VQPVINIGKD--DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 375
           VQ  +   +   D E   +       + I ++S+DP + +GF  ++  D 
Sbjct: 701 VQQALTSDEQLKDQELKDTYQSQRSAKKIKMESLDPCIGVGFLIQNSKDL 750



 Score = 42.7 bits (99), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 13/72 (18%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV----EILHLFGDSETS 176
           I SD GWGCM+R  QM++A + L         K LQ+  +   +     IL +  D   +
Sbjct: 393 INSDCGWGCMIRCQQMMLANSFL---------KLLQQNHNFHDILTHDSILSMILDQLDA 443

Query: 177 PFSIHNLLQAGK 188
           PF IH + + G+
Sbjct: 444 PFGIHQITEEGR 455


>gi|167394648|ref|XP_001741038.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165894548|gb|EDR22516.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 200

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 6/100 (6%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 161
           I I+YRK    I +   T+D GWGCM+RS QM++AQ  L   LG  W+     +    + 
Sbjct: 39  IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96

Query: 162 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
            +++ I++LFGDS  S FSIH L+      G+  G W GP
Sbjct: 97  FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP 136


>gi|209880175|ref|XP_002141527.1| peptidase family C54 [Cryptosporidium muris RN66]
 gi|209557133|gb|EEA07178.1| peptidase family C54, putative [Cryptosporidium muris RN66]
          Length = 353

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 87/325 (26%), Positives = 130/325 (40%), Gaps = 47/325 (14%)

Query: 75  VCHKIAQ-DEALGDAAGNNGLAE----FNQDFSSRILISYRKGFDPIGD---------SK 120
           + + I Q D++L    GN   A+    F + F   IL SYR  F  I           S 
Sbjct: 20  IIYNIDQHDDSLIFLFGNKYDADKYDSFLKSFHEIILFSYRYNFPTIRSEWDFSIETGSS 79

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 180
           +T+D+GWGCMLR  QM +A  LL        R    K +      IL  F D E S FSI
Sbjct: 80  VTTDLGWGCMLRVIQMSLALGLL--------RYCKMKKYTYSLDYILQNFQDLEESLFSI 131

Query: 181 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 239
           H  ++ G   +      W GP +     + L +             P             
Sbjct: 132 HQFVKVGCSIFNKKPKDWFGPTSASTIADYLVKNN-----------PFLFNNFRISSILF 180

Query: 240 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN-PRYIPTLRLTF-TFPQ 297
           + G     I  ++   S  ++  ++ T   + +   LG   +N  +Y  ++   F   PQ
Sbjct: 181 KDGT----IYKSNLFQSFKNEEYSENTLTFVWLCTRLGSSALNIQKYKDSIFSIFKNVPQ 236

Query: 298 SLGIVGGKPGAST--YIVGVQEESAIYLDPH-DVQPVINIGKDDLEADTSTYHSDVIRHI 354
            + I GG   +S+   IVG  E+    LDPH  +Q    I   + E     +   V   I
Sbjct: 237 LICIAGGHNCSSSALLIVGASEKFLYCLDPHIKLQEAFVIKNFNREE----FIQQVPMRI 292

Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFC 379
             ++++PSL+  F C D DDF+  C
Sbjct: 293 SWENLNPSLSFVFCCTDIDDFNHLC 317


>gi|67470848|ref|XP_651386.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|56468115|gb|EAL46000.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
          Length = 325

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 71/316 (22%), Positives = 127/316 (40%), Gaps = 71/316 (22%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 129
           + +LG C+    +E L     N+     N      I+ +YR+ +  +G++ ++SD GWGC
Sbjct: 36  VHILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSYLGNTYLSSDAGWGC 91

Query: 130 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHN 182
            +R++QM++  AL+       ++  +Q+  D    E          L  D  +S  SIHN
Sbjct: 92  AIRATQMMIVNALVI------FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHN 145

Query: 183 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 240
           +   Q  K +     +++ P   C +  +L +                          E 
Sbjct: 146 IYIQQVIKTHNPKGTNFLPPSVCCIAISSLLQ--------------------------EW 179

Query: 241 GGAPVVCIDDASR--HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
              P  CI   +    CS          P L L+P ++   + +   + +L L+    QS
Sbjct: 180 DKKPFNCITCLNHIPSCS---------CPTLYLIPRIITFTE-HQLILDSLALS----QS 225

Query: 299 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 358
            G VGG   ++ ++ G Q  +  +LDPH VQ   + G      +  TY  D+        
Sbjct: 226 RGFVGGIGESAIFVFGCQGTTLFFLDPHYVQNAGDFGY----FNPPTYQIDI------SL 275

Query: 359 IDPSLAIGFYCRDKDD 374
           I  S+   F C ++++
Sbjct: 276 ISSSVVFAFMCYEENE 291


>gi|323450755|gb|EGB06635.1| hypothetical protein AURANDRAFT_65498 [Aureococcus anophagefferens]
          Length = 426

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 51/169 (30%), Positives = 72/169 (42%), Gaps = 50/169 (29%)

Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGA-----STYIVGVQEE---------------- 318
           ++ PRY   LR     PQS G++GG+P A     +T +    ++                
Sbjct: 234 RLEPRYAEPLRAALRLPQSAGMLGGRPRANRIFNTTSMCASSDQNLQLCFENSTRAIDPS 293

Query: 319 ------SAIY---------------LDPHDVQPVINIGKDDL---EADTSTYHSDVIRHI 354
                 +A++               LDPH VQP + +G D      A  S    D  + +
Sbjct: 294 KSGRPRAALFFPGLAARDGGADVYGLDPHTVQPALAVGDDGALGPGAAASVAPRDA-KKL 352

Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 403
             D++DPSLA+ FYC D+DDF DF  RA  L     GAPLF V     +
Sbjct: 353 AADALDPSLALAFYCADRDDFLDFVGRARALP----GAPLFEVVDAAPR 397



 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 54/117 (46%), Gaps = 15/117 (12%)

Query: 105 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 164
           +  +YR GF+ +     T D GWGCMLRS+QML+  AL   R G   R           +
Sbjct: 28  LWFTYRCGFEELAPYGFTDDAGWGCMLRSAQMLLGNAL--TRNGAAPR-----------L 74

Query: 165 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 219
               LF D+  +++PF +HN  + G  Y +  G W GP   C     L   +R   G
Sbjct: 75  ATAALFADAPGDSAPFGLHNFAKCGLRYDVLPGEWYGPGVACHVLRDLVDWRRNAPG 131


>gi|320588376|gb|EFX00845.1| cysteine protease atg4 [Grosmannia clavigera kw1407]
          Length = 348

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 62/124 (50%), Gaps = 28/124 (22%)

Query: 97  FNQDFSSRILISYRKGFDPI---------------------GD-SKITSDVGWGCMLRSS 134
           F  DF SR  ++YR GF+PI                     GD S  +SD GWGCM+RS 
Sbjct: 120 FLDDFESRFWMTYRSGFEPIARSVDPKAPATLSFTMKLKALGDQSDFSSDSGWGCMIRSG 179

Query: 135 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 194
           Q L+A A+  + LGR WR       ++   EI+ LF D   +P+SIH  +  G    +A 
Sbjct: 180 QSLLANAMAMYELGRGWRLSDGGIAEK---EIISLFADDPRAPYSIHRFVGHG---AVAC 233

Query: 195 GSWV 198
           GS++
Sbjct: 234 GSFL 237



 Score = 46.6 bits (109), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 3/61 (4%)

Query: 321 IYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 377
            YLDPH  +P +   +   E    +  + H+  +R +H+  +DPS+ IGF  RD+DD+D+
Sbjct: 238 FYLDPHHTRPGLPFHEHPSEYTQEEVGSCHTRRLRRLHIREMDPSMLIGFLIRDEDDWDN 297

Query: 378 F 378
           +
Sbjct: 298 W 298


>gi|124088531|ref|XP_001347134.1| Cysteine protease required for autophagy-like [Paramecium
           tetraurelia strain d4-2]
 gi|145474259|ref|XP_001423152.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|50057523|emb|CAH03507.1| Cysteine protease required for autophagy-like [Paramecium
           tetraurelia]
 gi|124390212|emb|CAK55754.1| unnamed protein product [Paramecium tetraurelia]
          Length = 277

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 71/291 (24%), Positives = 122/291 (41%), Gaps = 59/291 (20%)

Query: 97  FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 156
           F Q   + I  SYR      G +   SD GWGC++R  QM+VA +L+             
Sbjct: 14  FLQLKETFIWFSYRANIQYEGRA--ISDQGWGCLIRVGQMIVANSLIRESTNS------- 64

Query: 157 KPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 210
           KP D +  +I+ LF D++     +PFSI  +++ A   Y +  G W  GP  MC   + L
Sbjct: 65  KPNDLK-TKIICLFDDNQCFSTLAPFSIQQIIKRADLVYNIKIGDWYTGPKIMCLLEDLL 123

Query: 211 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW---TP 267
              Q A+T                           + I +    C +  + Q D     P
Sbjct: 124 ---QSAKT------------------------IKQLKIINFLEQCVI--EKQIDLQFKQP 154

Query: 268 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 327
            LL++  ++G ++++  ++  L+     PQ  G + GK   + +++G Q    I +DPH 
Sbjct: 155 QLLIIHAIIGNKELDQYFVAELQKHMQIPQFAGAIVGKSKKAYFLIGYQNNQGIVMDPHY 214

Query: 328 VQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           VQ          E++    +S  ++ I L     ++A+ +Y  +  D+   
Sbjct: 215 VQ----------ESNLLQLNSQ-LKCIPLKEFSGTIALCYYISNSYDYQQL 254


>gi|326665689|ref|XP_002661113.2| PREDICTED: cysteine protease ATG4D-like, partial [Danio rerio]
          Length = 149

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 11/82 (13%)

Query: 65  SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKITS 123
           S +S + LLG  ++++          + G+ E F + FSS + +SYR+GF P+  S ++S
Sbjct: 74  SKSSPVCLLGQSYQLS----------STGVRESFRRVFSSLLWMSYRRGFRPLDGSTLSS 123

Query: 124 DVGWGCMLRSSQMLVAQALLFH 145
           D GWGCMLRS+QML+AQ LL H
Sbjct: 124 DAGWGCMLRSAQMLLAQGLLLH 145


>gi|237837057|ref|XP_002367826.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
 gi|211965490|gb|EEB00686.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
          Length = 3559

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 87/175 (49%), Gaps = 16/175 (9%)

Query: 242  GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 300
            GA V C+ D S     + +G       LLL PL L   EK+NP Y+ +L      P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023

Query: 301  IVGGKPGASTYIVGVQEESAIYLDPHD-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDS 358
            +V G+   + Y +G Q+++ +YLDPH  +QP        L A T ++ +     +  + +
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQPPAL----QLPAATPSFFAGSCWKVSDVAA 3079

Query: 359  IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVL 411
            ++PSLA+ F+ R++       A   KL EE +   +  V +  +   P++  DVL
Sbjct: 3080 LNPSLAVAFFVRNERQLLGLAAALKKL-EEVDSFSMLQVVERRRPFSPLDLDDVL 3133



 Score = 44.7 bits (104), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)

Query: 43   VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 95
            +TA SM R+   V G S       R  IS    D W  G    ++ D A       + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139

Query: 96   EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 138
            E  +  +     +YR GF P+    G+ K             I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196

Query: 139  AQALLFHRL 147
             QAL  H L
Sbjct: 1197 MQALRRHFL 1205


>gi|281208441|gb|EFA82617.1| hypothetical protein PPL_04309 [Polysphondylium pallidum PN500]
          Length = 646

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 57/120 (47%), Gaps = 22/120 (18%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 153
           + EF +DFS++I +SYR+GF  IGD+   +D GWG                      W+K
Sbjct: 409 INEFLEDFSNKIWMSYRQGFPYIGDTMFENDCGWGY---------------------WKK 447

Query: 154 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALAR 212
             Q  +      I+ +F D  T+PFSIHN+   G+ + G   G W  P  +  + ++L  
Sbjct: 448 SGQNEYPELLYNIVRMFLDKPTAPFSIHNIALHGQNHLGKNVGEWFAPSNITHAIKSLVN 507



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 26/99 (26%)

Query: 301 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 360
           IVGGKP AS Y +  Q+++  YLDPH VQ  I+                       + ++
Sbjct: 541 IVGGKPRASLYFIAAQDDNLFYLDPHTVQQAID-----------------------NEVE 577

Query: 361 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 399
            SL++      K+DF DF  R+ KL  +S   PL+ + +
Sbjct: 578 FSLSVS--VETKEDFLDFLERSKKLVSKSE-FPLYNIAE 613


>gi|221481944|gb|EEE20310.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 3562

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 75/149 (50%), Gaps = 13/149 (8%)

Query: 242  GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 300
            GA V C+ D S     + +G       LLL PL L   EK+NP Y+ +L      P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023

Query: 301  IVGGKPGASTYIVGVQEESAIYLDPHD-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDS 358
            +V G+   + Y +G Q+++ +YLDPH  +QP        L A T ++ +     +  + +
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQPPAL----QLPAATPSFFAGSCWKVSDVAA 3079

Query: 359  IDPSLAIGFYCRDKDDFDDFCARASKLAE 387
            ++PSLA+ F+ R++       A   KL E
Sbjct: 3080 LNPSLAVAFFVRNERQLLGLAAALKKLEE 3108



 Score = 44.7 bits (104), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)

Query: 43   VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 95
            +TA SM R+   V G S       R  IS    D W  G    ++ D A       + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139

Query: 96   EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 138
            E  +  +     +YR GF P+    G+ K             I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196

Query: 139  AQALLFHRL 147
             QAL  H L
Sbjct: 1197 MQALRRHFL 1205


>gi|307108757|gb|EFN56996.1| hypothetical protein CHLNCDRAFT_143632 [Chlorella variabilis]
          Length = 538

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 41/79 (51%), Gaps = 5/79 (6%)

Query: 319 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           S IYLDPH VQ       D       T+  +  R + L SIDPSLA+GFYC    ++ D 
Sbjct: 331 SVIYLDPHQVQEAAACPDD-----WRTFWCETPRSMPLPSIDPSLALGFYCSSLGEYRDL 385

Query: 379 CARASKLAEESNGAPLFTV 397
           C+R   L   S GAPL  V
Sbjct: 386 CSRLEALERRSGGAPLVCV 404



 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 62/129 (48%), Gaps = 26/129 (20%)

Query: 136 MLVAQALLFHRLGRPWR----------------KPLQKPFDREYVEILHLFGDS--ETSP 177
           M++AQ L+ H LGR WR                             +L LF D+  E +P
Sbjct: 1   MILAQGLVRHVLGREWRWPEAARQQQAAAAPALAAAPAEAPPRLARLLELFWDTPAERNP 60

Query: 178 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 237
           FS+H+L +AG+A G+ AG W+GP+ MC++  A A   R       Q + + + V    E 
Sbjct: 61  FSLHSLCRAGQACGVVAGRWLGPWVMCKTLAAAAGAARR------QGVDLGLTVAVLAES 114

Query: 238 GERGGAPVV 246
           G  GGAP++
Sbjct: 115 G--GGAPLL 121



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 22/37 (59%), Positives = 26/37 (70%)

Query: 280 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 316
           K+NPRYIP L      PQS+GIVGG+P +S Y VG Q
Sbjct: 215 KLNPRYIPQLEAVLAMPQSIGIVGGRPSSSLYFVGFQ 251


>gi|221505025|gb|EEE30679.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 3554

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 65/123 (52%), Gaps = 7/123 (5%)

Query: 268  ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
             LLL PL L   EK+NP Y+ +L      P SLG+V G+   + Y +G Q+++ +YLDPH
Sbjct: 2988 CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLGMVAGRGQQAFYCIGTQQKALLYLDPH 3047

Query: 327  D-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDSIDPSLAIGFYCRDKDDFDDFCARASK 384
              +QP        L A T ++ +     +  + +++PSLA+ F+ R++       A   K
Sbjct: 3048 SGIQPPAL----QLPAATPSFFAGSCWKVSDVAALNPSLAVAFFVRNERQLLGLAAALKK 3103

Query: 385  LAE 387
            L E
Sbjct: 3104 LEE 3106



 Score = 44.7 bits (104), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)

Query: 43   VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 95
            +TA SM R+   V G S       R  IS    D W  G    ++ D A       + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139

Query: 96   EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 138
            E  +  +     +YR GF P+    G+ K             I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196

Query: 139  AQALLFHRL 147
             QAL  H L
Sbjct: 1197 MQALRRHFL 1205


>gi|193784751|dbj|BAG53904.1| unnamed protein product [Homo sapiens]
          Length = 146

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 56/117 (47%), Gaps = 1/117 (0%)

Query: 296 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 355
           P SL   G      T ++   EE  IYLDPH  QP +         D S +       + 
Sbjct: 4   PLSLSSAGSATHLPTCLILPGEE-LIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPCRMS 62

Query: 356 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 63  IAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 119


>gi|330846267|ref|XP_003294964.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
 gi|325074459|gb|EGC28510.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
          Length = 266

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 40/57 (70%), Gaps = 2/57 (3%)

Query: 91  NNGLAE--FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 145
           NN + +  F  D  S I  SYRK F PI ++ IT+D+GWGCMLR+ QM++A+ALL H
Sbjct: 205 NNNIIQSNFLDDVRSLIWFSYRKDFPPIENTTITTDIGWGCMLRTGQMILARALLKH 261


>gi|46136685|ref|XP_390034.1| hypothetical protein FG09858.1 [Gibberella zeae PH-1]
          Length = 360

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/228 (28%), Positives = 96/228 (42%), Gaps = 35/228 (15%)

Query: 79  IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRSS- 134
           +A D+ + D    +G   F  DF S+I ++YR  F+PI  S   + TS +     L+S  
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158

Query: 135 --QMLVAQALLFHRLGR-PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 190
             Q   +   +  RLGR  WR+        E   +L  F D   +P+SIH+ ++ G  A 
Sbjct: 159 GDQSPFSSDTMV-RLGRGDWRRGESV---EEECRLLKDFADDPRAPYSIHSFVRHGASAC 214

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
           G   G W GP A  R  +AL     +           +I V S       G  P V  D+
Sbjct: 215 GKYPGEWFGPSATARCIQALTNSHES-----------SIRVYST------GDGPDVYEDE 257

Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 298
                 +      D+ P L+LV   LG++K+ P Y   L      PQS
Sbjct: 258 ---FMQIAKPPGEDFHPTLVLVGTRLGIDKITPVYWEALIAALQMPQS 302


>gi|401403014|ref|XP_003881388.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325115800|emb|CBZ51355.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 3465

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 59/106 (55%), Gaps = 7/106 (6%)

Query: 268  ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 326
             LLL PL L   EK+NP Y+P+L      P S+G+V G+   + Y +G Q+++ +YLDPH
Sbjct: 2955 CLLLFPLTLCSGEKINPVYVPSLLAYLELPWSVGMVAGRGQQAFYCIGTQQKALLYLDPH 3014

Query: 327  D-VQ-PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 370
              +Q P + +      A  S +     +   + +++PSL++ F+ R
Sbjct: 3015 SGIQPPALQL----PSATPSFFAGSCWKIADVAALNPSLSVAFFVR 3056



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 17/70 (24%)

Query: 96   EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 138
            + +Q   S    +YR GF P+    G+ K             I SDVGWGC +R++QML+
Sbjct: 942  QLSQTVGSIARFTYRSGFSPMYKCCGEKKRRAGGGFEREWIAINSDVGWGCTVRAAQMLL 1001

Query: 139  AQALLFHRLG 148
             QAL  H LG
Sbjct: 1002 MQALRRHFLG 1011


>gi|340508254|gb|EGR34000.1| peptidase family c54 protein, putative [Ichthyophthirius
           multifiliis]
          Length = 209

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 20/143 (13%)

Query: 99  QDFSSRILISYRKGFDPI----GDSKI---TSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
           ++F + I ++YR+ F P+     D KI    SD GWGCM+R  QM +A+ L  H   +  
Sbjct: 24  ENFKNIIWMTYRRNFFPLLHNTKDHKIQNYISDTGWGCMVRVGQMALAEGLRHHLQQKGI 83

Query: 152 ---RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSW 207
              ++ +Q   D +       FGD   +P+SI  + + A K + L  G W  P  +C   
Sbjct: 84  YDNKRIIQAFLDND-------FGDDNIAPYSIQKICKIAYKEFQLVPGQWYSPVRICHVL 136

Query: 208 EALARCQRAETGLGCQSLPMAIY 230
             L      +  L C+ L + ++
Sbjct: 137 SLLHN--DKKQILDCEDLKVGVF 157


>gi|67482849|ref|XP_656724.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|56473943|gb|EAL51338.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449705841|gb|EMD45804.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 348

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 68/278 (24%), Positives = 114/278 (41%), Gaps = 67/278 (24%)

Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
           +S I   YR  F  + ++ +TSD GWGC +R+ QML+A A++             K F  
Sbjct: 85  TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANAII-------------KLFGS 131

Query: 162 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQR 215
           + +    ++H F D   S  P+SIH+L        G   GS   P++             
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFTTQIIVSGNPNGSSFLPFS------------- 178

Query: 216 AETGLGCQSLPMAIYVVSG--DEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILL 270
                        IY ++   ++D  R              C V +     ++   P ++
Sbjct: 179 -----------SVIYALTELVNKDFNRAF-----------ECHVITNKFLLKSINKPTIV 216

Query: 271 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 330
            +P  +  +K + R I      F+F    G+VGG    + Y  G+     ++LDPH V+P
Sbjct: 217 FIPFTIP-DKFDQRLIT----IFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRP 271

Query: 331 VI-NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
              +I K D E D     SD I+ + ++ ++ S+   F
Sbjct: 272 CASSIMKFD-EKDYIAKLSD-IKSLRINELERSVVFSF 307


>gi|307190834|gb|EFN74684.1| Cysteine protease ATG4B [Camponotus floridanus]
          Length = 93

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 19/91 (20%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIG-- 117
           I  +   IW+LG  +              N L E +   +D  S +  +YRKGF PIG  
Sbjct: 16  IPQTDEPIWILGKKY--------------NALKELDMIRRDIRSMLWFTYRKGFIPIGGC 61

Query: 118 DSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
           +S  TSD GWGCMLR  QM++AQAL+   LG
Sbjct: 62  NSTFTSDKGWGCMLRCGQMVLAQALITLHLG 92


>gi|167385012|ref|XP_001737178.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165900129|gb|EDR26546.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 348

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 66/277 (23%), Positives = 116/277 (41%), Gaps = 65/277 (23%)

Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
           +S I   YR  F  + ++ + SD GWGC +R+ QML+A A++             K F  
Sbjct: 85  TSLIYFVYRSNFSALPNTSLKSDGGWGCTIRACQMLLANAII-------------KLFGS 131

Query: 162 EYVE---ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
           + +    ++H F D  +   P+SIH+L        + +G+                    
Sbjct: 132 DNINRKTVIHWFLDFYNVECPYSIHSLFTTQI---IVSGN-------------------- 168

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR--HCSVFSKG---QADWTPILLL 271
               G   LP+++   +  E   +         D +R   C V +      +   P ++ 
Sbjct: 169 --PNGSSFLPLSVVTYALTELVNK---------DLNRIFECHVITNKFLLNSINKPTIIF 217

Query: 272 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 331
           +P  +  ++ N R I      F+F    G+VGG    + Y  G+  +  ++LDPH V+P 
Sbjct: 218 IPFTIP-DEFNQRLIS----IFSFNLFAGMVGGCKQKAFYFFGIHHDQLLFLDPHFVRPC 272

Query: 332 I-NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
             +I K D E D     SD I+ +H++ ++ S+   F
Sbjct: 273 ASSIMKFD-EKDYIAKLSD-IKSLHINELERSVVFSF 307


>gi|145521674|ref|XP_001446691.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414171|emb|CAK79294.1| unnamed protein product [Paramecium tetraurelia]
          Length = 473

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 80/340 (23%), Positives = 135/340 (39%), Gaps = 81/340 (23%)

Query: 105 ILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQAL-----LFHRLGRPWRKPLQK 157
           I  +YR+GF      DS +T+D GWGC++R  QM++A+ L      F+++      PL +
Sbjct: 52  IRFTYRQGFQAYQCQDSALTTDSGWGCVIRVGQMMMAELLKRHLKCFYKVDLFSFPPLLQ 111

Query: 158 PFDREYVEILHLFGDSE--------TSP----FSIHNLLQ-AGKAYGLAAGSWVGPYAMC 204
                  ++L +F D +        + P    FSI  +++ A K +G   G W  P  + 
Sbjct: 112 -------DVLQMFKDDDDMESQKGFSKPSKYGFSIQKIMRVAYKEWGKKPGEWYSPNQIV 164

Query: 205 RS-WEALARCQRAET-GLG-------------------------CQ----SLPMAIYVVS 233
           ++ ++ L         GLG                         CQ    S+   +  + 
Sbjct: 165 QAIYKILQEINIPYCYGLGFVPFYESQIDLRAIFQEMCMMEDCVCQKKVFSIEQFLKSLE 224

Query: 234 GDEDGERGGAPV---------VCIDDASRHC-----SVFSK--GQADWTPILLLVPLVL- 276
             E G+     V         VC +D S        ++  K   Q  + P+  +   +L 
Sbjct: 225 KLEIGKEEMVQVMHGNDSISDVCCEDQSEQNKKEIGNLLKKYICQKCFVPVRAVAVCLLS 284

Query: 277 --GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 334
             G ++ NP Y+  +R         G++GG+P  + +IVG  +   + LDPH VQ     
Sbjct: 285 RIGCDEPNPDYLQAIRQFMKKKYFAGMLGGRPKEANFIVGFVDNKFVVLDPHLVQE---- 340

Query: 335 GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
            K + E    +        +    ID SL + FY ++ DD
Sbjct: 341 AKMNPEEYIKSCFPGEALFMSDKEIDCSLGLVFYLKNLDD 380


>gi|407037690|gb|EKE38747.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 348

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 58/278 (20%), Positives = 109/278 (39%), Gaps = 67/278 (24%)

Query: 102 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 161
           +S I   YR  F  + ++ +TSD GWGC +R+ QML+A +++             K F  
Sbjct: 85  TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANSII-------------KLFGS 131

Query: 162 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 216
           + +    ++H F D   S  P+SIH+L                            +   +
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFT-------------------------TQIIVS 166

Query: 217 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILLLVP 273
           +   G   LP ++ + +  E   +         + +  C + +      +   P ++ +P
Sbjct: 167 KNPNGSSFLPFSVVIYALTELVNKDF-------NRAFECHIITNKFLLNSINKPTIVFIP 219

Query: 274 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP--- 330
             +  E     +   L   F+F    G+VGG    + Y  G+     ++LDPH V+P   
Sbjct: 220 FTIPDE-----FEQRLITIFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRPCAS 274

Query: 331 -VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 367
            +I   + D  A  S      I+ + ++ ++ S+   F
Sbjct: 275 SIIKFDEKDYIAKLSD-----IKSLRINELERSVVFSF 307


>gi|307201261|gb|EFN81130.1| Cysteine protease ATG4B [Harpegnathos saltator]
          Length = 98

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 13/81 (16%)

Query: 70  IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 127
           +W+LG  +   ++           L    +D  S +  +YRKGF PIG  +S  TSD GW
Sbjct: 23  VWILGRVYNAIKE-----------LDIIRRDIRSILWFTYRKGFVPIGGCNSTFTSDKGW 71

Query: 128 GCMLRSSQMLVAQALLFHRLG 148
           GCMLR  QM++A+AL+   LG
Sbjct: 72  GCMLRCGQMVLARALITLHLG 92


>gi|412989956|emb|CCO20598.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Bathycoccus prasinos]
          Length = 532

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 96/267 (35%), Gaps = 74/267 (27%)

Query: 191 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 250
            L  G W+ P  +C+ +  +     +   + C  L           DG  GG P    + 
Sbjct: 234 ALCPGQWMAPSEICKRYGKMMNRLDSFQNVRCLILG----------DGCGGGVPEFYPER 283

Query: 251 ASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 309
                    K  AD   +L+LVPL  G  + +NP Y+ +L+   +  + +GIVGGK  AS
Sbjct: 284 VREEM----KTHAD-KDVLILVPLRCGASDAINPEYVKSLQKFLSVRECVGIVGGKKTAS 338

Query: 310 TYIVGVQE--------------------------------------ESAIYLDPHDVQPV 331
            YIVG                                           AIYLDPH  +  
Sbjct: 339 YYIVGFTSGKKSSDSYSGGEKEEEEEEKEEEENEEDEEEEEEEEEETRAIYLDPHVAKAY 398

Query: 332 INIGKDDLEADT-STYHSDV--------IRHIHLDSIDPSLAIGFYCRDKDDFDD----- 377
           ++  +   +  T S Y+           I +    ++DPSL +GF   +  ++D+     
Sbjct: 399 VSPRERSRDESTESAYYRSFFGSASEHGILYTPFHALDPSLVVGFLVGNDTNYDEMNNAS 458

Query: 378 ------FCARASKLAEESNGAPLFTVT 398
                 F    + +  ES   PL TV 
Sbjct: 459 SSSLDAFVDVLTNIERESGSTPLITVV 485


>gi|294953189|ref|XP_002787639.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239902663|gb|EER19435.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 341

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 18/101 (17%)

Query: 105 ILISYRKGFDPI----GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 160
           IL +YR  F+PI    G + + SD GWGC +R++QML+AQA+     G+          D
Sbjct: 67  ILFTYRCAFEPIEGCVGPTSV-SDKGWGCAIRATQMLLAQAV--KMAGK----------D 113

Query: 161 REYVEILHLFGDSETSPFSIHNLLQAGK-AYGLAAGSWVGP 200
            +   +L LF DS  +P S+H +++ G+       G+W GP
Sbjct: 114 ADDSVVLSLFLDSPQAPLSLHRMVKMGQEVLAKRPGTWFGP 154


>gi|224010768|ref|XP_002294341.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969836|gb|EED88175.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 658

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 53/179 (29%), Positives = 69/179 (38%), Gaps = 52/179 (29%)

Query: 283 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-----------------QEESAIY-LD 324
           P Y  TL    +FPQS+G++GG P  + +  G                  QE    Y LD
Sbjct: 418 PTYGSTLAKLLSFPQSVGMLGGTPRHALWFYGADEVDPPTFGDDGKALNGQECGGWYGLD 477

Query: 325 PHDVQ------PVINIGKDDLEADT------------------------STYHSDVIRHI 354
           PH  Q           GKD++ +D                         +T H++  R I
Sbjct: 478 PHTTQVAPRGTRTTKYGKDEVSSDDIELNNCQWQVQLNDAYLRSLHFTPTTTHANHQRSI 537

Query: 355 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES---NGAPLFTVTQTHKKPVNHSDV 410
            L  +DPS A+GFY RD  DF  F      L++E    N  P   VT T K P    DV
Sbjct: 538 PLSKLDPSCALGFYIRDHSDFVQFTNAIDALSKEHCRPNKLPDI-VTVTEKTPNYEVDV 595



 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 20/25 (80%)

Query: 121 ITSDVGWGCMLRSSQMLVAQALLFH 145
           + SD GWGCMLRS+QM++AQ +  H
Sbjct: 133 LKSDAGWGCMLRSAQMMMAQTVRMH 157


>gi|407037201|gb|EKE38550.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 193

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 74/150 (49%), Gaps = 25/150 (16%)

Query: 52  HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRI 105
           HE V  P   G  S     ++LGV  K  Q D+ L +      L     A F +  S+  
Sbjct: 25  HEDVQKPIFVGGCS----FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAAAFFR-ISNLF 79

Query: 106 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-------LFHRLGRPWRKPLQKP 158
            ++YR G++ + +S +T+DVGWGC +R+ QM++A A+         +    P+      P
Sbjct: 80  WMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYI-----P 134

Query: 159 FDREYVEILHLFGDS--ETSPFSIHNLLQA 186
             +E + +L  F DS   T+P SIH++ ++
Sbjct: 135 TKQEVMNVLIPFIDSPNSTTPLSIHHVYES 164


>gi|407043625|gb|EKE42056.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 183

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 35/186 (18%)

Query: 28  SVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGD 87
           ++GS   +S + KRL+            L P      +  + + +LG C+    +E L  
Sbjct: 10  NIGSYFYNSMSSKRLIK-----------LQPF-----TQKNVVHILGNCYYPETNENLNH 53

Query: 88  AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
              N+     N      I+ +YR+ +  +G++ ++SD GWGC +R++QM+V  AL+    
Sbjct: 54  LTFNDA----NLKIHDLIVATYRQKYSYLGNTYLSSDAGWGCAIRATQMMVVNALVI--- 106

Query: 148 GRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHNLL--QAGKAYGLAAGSWV 198
              ++  +Q+  D    E          L  D  +S  SIHN+   Q  K +     +++
Sbjct: 107 ---FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHNIYIQQVIKTHNPKGTNFL 163

Query: 199 GPYAMC 204
            P   C
Sbjct: 164 PPSICC 169


>gi|148682816|gb|EDL14763.1| mCG116861, isoform CRA_a [Mus musculus]
          Length = 127

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 25/81 (30%), Positives = 48/81 (59%), Gaps = 1/81 (1%)

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
           I+LDPH  Q  ++I +  L  D + +     + + + ++DPS+A+GF+C+++ DFD++C+
Sbjct: 4   IFLDPHTTQTFVDIEESGLVDDQTFHCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCS 63

Query: 381 RASKLAEESNGAPLFTVTQTH 401
              K   + N   +F + Q H
Sbjct: 64  LVQKEILKEN-LRMFELVQKH 83


>gi|78070455|gb|AAI07651.1| Atg4d protein [Rattus norvegicus]
          Length = 168

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 23/86 (26%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
           +YLDPH  QP +++ + +   ++  +H    R +    +DPS  +GFY  ++ +F+  C+
Sbjct: 47  LYLDPHYCQPTVDVNQANFPLES--FHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCS 104

Query: 381 RASKLAEESNGA---PLFTVTQTHKK 403
              ++   S+     P+FTV + H +
Sbjct: 105 ELMRILSSSSVTERYPMFTVAEGHAQ 130


>gi|340500608|gb|EGR27474.1| peptidase family c54 protein, putative [Ichthyophthirius
           multifiliis]
          Length = 384

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 42/81 (51%), Gaps = 2/81 (2%)

Query: 298 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 357
           S+G++GG PG + Y +G+ +   IYLDPH +Q      K     D  TY    I  +   
Sbjct: 223 SIGMIGGVPGKAYYFLGIIDNDFIYLDPHYIQEAHQNEKTVQNID--TYFCKFINRVSQK 280

Query: 358 SIDPSLAIGFYCRDKDDFDDF 378
            ++ SLA GFY ++  + + F
Sbjct: 281 KLESSLAFGFYIKNLQELEQF 301


>gi|392343434|ref|XP_003754884.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
           norvegicus]
 gi|392355909|ref|XP_003752169.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
           norvegicus]
          Length = 126

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 24/81 (29%), Positives = 47/81 (58%), Gaps = 1/81 (1%)

Query: 321 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 380
           I+LDPH  Q  ++  +  L  D + +     + + + ++DPS+A+GF+C+++ DFD++C+
Sbjct: 4   IFLDPHTTQTFVDTEESGLVDDHTFHCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCS 63

Query: 381 RASKLAEESNGAPLFTVTQTH 401
              K   + N   +F + Q H
Sbjct: 64  LVQKEILKEN-LRMFELVQKH 83


>gi|156085180|ref|XP_001610073.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797325|gb|EDO06505.1| hypothetical protein BBOV_II005540 [Babesia bovis]
          Length = 206

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 30/135 (22%)

Query: 84  ALGDAAGNNGLAEFNQDFSSRILISYRKGFD-------------------PI-GDSKITS 123
           A+ D      L E  +DF   IL++YR+G                     P+   + I +
Sbjct: 17  AMCDQNPGPKLRERLKDF---ILLTYRRGLSIHLPRFYAGNIPKRFYGIWPLWQQTDIKT 73

Query: 124 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 183
           D GWGC LR++QM +A+AL      R    PL      +   IL LF D+  +PFS+ NL
Sbjct: 74  DRGWGCALRATQMALAEAL------RDVLSPLDN-VQEQRSRILQLFYDTTEAPFSLENL 126

Query: 184 LQAGKAYGLAAGSWV 198
           + A   +G    +W+
Sbjct: 127 VMADVEHGANVVAWI 141


>gi|328852767|gb|EGG01910.1| Hypothetical protein MELLADRAFT_123246 [Melampsora larici-populina
           98AG31]
          Length = 134

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)

Query: 267 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
           P+L+L+ +  GL++VN  P Y  T+  TFTFPQS+GI GG+P  S ++
Sbjct: 83  PVLVLMNVQSGLDRVNINPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130


>gi|328852471|gb|EGG01617.1| Hypothetical protein MELLADRAFT_92005 [Melampsora larici-populina
           98AG31]
          Length = 134

 Score = 51.2 bits (121), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)

Query: 267 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
           P+L+L+ +  GL++VN  P Y  T+  TFTFPQS+GI GG+P  S ++
Sbjct: 83  PVLVLMNVQSGLDRVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130


>gi|408392897|gb|EKJ72185.1| hypothetical protein FPSE_07642 [Fusarium pseudograminearum CS3096]
          Length = 389

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 44/78 (56%), Gaps = 3/78 (3%)

Query: 304 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSID 360
           G+P +S Y +G Q     YLDPH  +  +   +D +E    + ++ H+  +R IH+  +D
Sbjct: 262 GRPSSSHYFIGAQGSFLFYLDPHHTRVALPYREDPIEYTSEEIASCHTPRLRRIHVREMD 321

Query: 361 PSLAIGFYCRDKDDFDDF 378
           PS+ IGF  +++ D+ + 
Sbjct: 322 PSMLIGFLIQNEVDWQEL 339



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 26/94 (27%)

Query: 79  IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 116
           +A D+ + D    +G   F  DF S+I ++YR  F+PI                      
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158

Query: 117 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGR 149
           GD S  +SD GWGCM+RS Q ++A  +   RLGR
Sbjct: 159 GDQSPFSSDSGWGCMIRSGQSMLANTIAMVRLGR 192


>gi|328859149|gb|EGG08259.1| Hypothetical protein MELLADRAFT_123247 [Melampsora larici-populina
           98AG31]
          Length = 134

 Score = 51.2 bits (121), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)

Query: 267 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 312
           P+L+L+ +  GL++VN  P Y  T+  TFTFPQS+GI GG+P  S ++
Sbjct: 83  PVLVLMNVQSGLDQVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130


>gi|84994978|ref|XP_952211.1| autophagy-related peptidase [Theileria annulata strain Ankara]
 gi|65302372|emb|CAI74479.1| autophagy-related peptidase, putative [Theileria annulata]
          Length = 350

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 81/356 (22%), Positives = 126/356 (35%), Gaps = 95/356 (26%)

Query: 85  LGDAAGNNGLAEFNQDFSSR--ILISYRKG-------------------FDPIGDSK--- 120
           + +    N    +N+   SR  IL +YR G                   F P+  S    
Sbjct: 1   MSNVVRENVNVLYNKRLESRFGILFTYRYGLEYKFPRPINFKRRRLFNIFSPLNLSNGIV 60

Query: 121 -ITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------RKPLQKPFDREYVE----- 165
            I SD GWGC+LRS+QM ++QALL   LG  +         R P  +  D+  +      
Sbjct: 61  TIDSDKGWGCVLRSTQMAISQALLNLVLGPEFSVEQLEIRNRTPRNRKIDQSLLNIDTFE 120

Query: 166 -----------------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW-VGPY--AMCR 205
                            IL  F D   + FSI+N + A             GP   A+C 
Sbjct: 121 KLLNGLLDLDGVSAVSVILAQFYDDLNAVFSIYNFVIADYVLKTCTKFLHFGPTSAALC- 179

Query: 206 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 265
                     A   +   +LP+                  +   D   H S   +   + 
Sbjct: 180 ----------ASKIINDLNLPIN----------------SIAFPDGVFHISDVREILEEK 213

Query: 266 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP-GASTYIVGVQEESAIYLD 324
             +L+ V     L+++       +R  F   Q  GI+GG     S YI G   +   Y D
Sbjct: 214 RNLLVWVSNKKKLDRIER---ECVRSMFRLSQFNGIIGGNLFNKSYYIFGTTNKRLYYND 270

Query: 325 PHDV--QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 378
           PH    +   ++   D+  D   + S  ++ ++    + S  + F  +D+DDF DF
Sbjct: 271 PHLYCKKAFRSLEYVDIFRD---FTSRRVKSMNWRYFNASFTLLFLFKDRDDFQDF 323


>gi|407037202|gb|EKE38551.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 157

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 58/137 (42%), Gaps = 13/137 (9%)

Query: 265 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 324
           + P L+ +P+VL     N      L+  +      GIVGG    + ++ G      +YLD
Sbjct: 17  FKPTLVFLPIVL-----NHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGFHALQFLYLD 71

Query: 325 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS-----LAIGFYCRDKDDFDDFC 379
           PH VQP     K   E DT +Y         + +IDP+        GF  ++  + DDF 
Sbjct: 72  PHIVQPSF---KSFTEIDTKSYSPIGSNRFSVHTIDPTKLDDFCTFGFLIKNLHEVDDFM 128

Query: 380 ARASKLAEESNGAPLFT 396
             A  + E SN   L T
Sbjct: 129 KLAKDVFEISNDKELRT 145


>gi|312381461|gb|EFR27207.1| hypothetical protein AND_06241 [Anopheles darlingi]
          Length = 307

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 26/38 (68%)

Query: 94  LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 131
           +  F +DF SRI ++YR+ F  + DS  TSD GWGCM+
Sbjct: 195 IEAFRRDFVSRIWMTYRREFQTMDDSNYTSDCGWGCMI 232


>gi|145500036|ref|XP_001436002.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124403139|emb|CAK68605.1| unnamed protein product [Paramecium tetraurelia]
          Length = 469

 Score = 47.0 bits (110), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 27/116 (23%)

Query: 105 ILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQAL-----LFHRLGRPWRKPLQK 157
           I  +YR+GF      +S +T+D GWGC++R  QM++A+ L      F+ +      PL +
Sbjct: 52  IRFTYREGFQAYQCQNSTLTTDSGWGCVIRVGQMMMAELLKRHLKCFYNVNLFQFPPLMQ 111

Query: 158 PFDREYVEILHLFGDSETSP------------FSIHNLLQ-AGKAYGLAAGSWVGP 200
                  E+L LF D +               FSI  +++ A + +G   G W  P
Sbjct: 112 -------EVLQLFKDDDEMESLKVQGKPSKYGFSIQKIMRIAYEEWGKKPGEWYSP 160



 Score = 45.8 bits (107), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 51/103 (49%), Gaps = 12/103 (11%)

Query: 276 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 335
           +G ++ NP YI  +R         G++GG+P  + +IVG  ++  + LDPH VQ   N+ 
Sbjct: 286 IGCDEPNPDYIQAIRQFMKKKYFAGLLGGRPREANFIVGFVDDKFVVLDPHLVQQA-NMN 344

Query: 336 KDDLEADT----STYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 374
            ++         + + SD         ID SL + FY ++++D
Sbjct: 345 PEEYVKSCFPGEALFMSD-------KEIDCSLGLVFYLKNEED 380


>gi|307108756|gb|EFN56995.1| hypothetical protein CHLNCDRAFT_143631 [Chlorella variabilis]
          Length = 137

 Score = 46.6 bits (109), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 9/103 (8%)

Query: 33  LGSSETVKRLVTAGSMRRIHERVLGPSRTGIS-SSTSDIWLLGVCHKIAQDEALGDAAGN 91
           LG S +   L  A  + ++H+ +     +G S +  + +WLLG C+      +  +A   
Sbjct: 15  LGLSRSYYALARALRLNKLHDLLA----SGASITPDAPVWLLGQCYSCPPGAS--EAQQE 68

Query: 92  NGLAEFNQDFSSRILISYRKGFDPI--GDSKITSDVGWGCMLR 132
             LA     + S   +SYR GF  I  G + + SD GWGC LR
Sbjct: 69  EALARMLHHYQSIPWMSYRTGFTSIAAGSAHLQSDAGWGCTLR 111


>gi|294954843|ref|XP_002788322.1| hypothetical protein Pmar_PMAR026708 [Perkinsus marinus ATCC 50983]
 gi|239903634|gb|EER20118.1| hypothetical protein Pmar_PMAR026708 [Perkinsus marinus ATCC 50983]
          Length = 345

 Score = 45.1 bits (105), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 27/113 (23%), Positives = 52/113 (46%), Gaps = 26/113 (23%)

Query: 293 FTFPQSLGIVGGKPGASTYIVGVQEESA-------------------IYLDPHDVQPVIN 333
              P  +G++GG+   + Y+VGV E+                     + +DPH VQ  + 
Sbjct: 207 LKLPWCVGVIGGQSTRAHYVVGVAEKDTYLQSSTWGRSGYRQTRTDLLSIDPHFVQSAV- 265

Query: 334 IGKDDLEADTSTY-HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 385
                +EA + ++ +SD    +    ++PSL +GFY +D+ D ++  A   ++
Sbjct: 266 -----VEAQSISFKNSDEPSRLQPTKLNPSLGVGFYVKDETDLEELSAELDRV 313


>gi|167386236|ref|XP_001737678.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165899448|gb|EDR26037.1| hypothetical protein EDI_014170 [Entamoeba dispar SAW760]
          Length = 346

 Score = 44.7 bits (104), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 20/94 (21%)

Query: 89  AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
             NN +A   +  S+   ++YR GF   +    +T+D GWGC LRS QML   +L+  RL
Sbjct: 57  TSNNNIA---KHLSTMFRVTYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111

Query: 148 GRP-------WRKPLQKPF-------DREYVEIL 167
             P         + +QK F        REYV+++
Sbjct: 112 QEPNPGFGEDAAEKVQKNFIIHSMEERREYVQLI 145


>gi|195350255|ref|XP_002041656.1| GM16787 [Drosophila sechellia]
 gi|194123429|gb|EDW45472.1| GM16787 [Drosophila sechellia]
          Length = 135

 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 11/67 (16%)

Query: 63  ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 122
           I    +++W+LG  +   Q+  L             +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTNVWVLGKKYNAIQELEL-----------IRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 123 SDVGWGC 129
           +D GWG 
Sbjct: 92  TDKGWGL 98


>gi|389585790|dbj|GAB68520.1| peptidase, partial [Plasmodium cynomolgi strain B]
          Length = 894

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
           F+ R    Y KG D I  S   SD GWGCM+R  QM++A  L+ +++ + +
Sbjct: 418 FTKRKRTKYTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 466


>gi|183234005|ref|XP_652043.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|169801304|gb|EAL46674.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449707706|gb|EMD47317.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 346

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 6/63 (9%)

Query: 89  AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 147
             NN +A   +  S+   I+YR GF   +    +T+D GWGC LRS QML   +L+  RL
Sbjct: 57  TSNNNIA---KHLSTLFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111

Query: 148 GRP 150
             P
Sbjct: 112 QEP 114


>gi|294877403|ref|XP_002767983.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
 gi|239870083|gb|EER00701.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
          Length = 133

 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 30/42 (71%), Gaps = 5/42 (11%)

Query: 105 ILISYRKGFDPI----GDSKITSDVGWGCMLRSSQMLVAQAL 142
           IL +YR  F+PI    G + + SD GWGC +R++QML+AQA+
Sbjct: 67  ILFTYRCAFEPIEGCVGPTSV-SDKGWGCAIRATQMLLAQAV 107


>gi|407038566|gb|EKE39191.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 346

 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 6/62 (9%)

Query: 90  GNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 148
            NN +A   +  S+   I+YR GF   +    +T+D GWGC LRS QML   +L+  RL 
Sbjct: 58  SNNNVA---KHLSTMFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RLQ 112

Query: 149 RP 150
            P
Sbjct: 113 EP 114


>gi|221060360|ref|XP_002260825.1| peptidase [Plasmodium knowlesi strain H]
 gi|193810899|emb|CAQ42797.1| peptidase, putative [Plasmodium knowlesi strain H]
          Length = 1001

 Score = 42.4 bits (98), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 100 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
           +F++R    + KG D I  S   SD GWGCM+R  QM++A  L+ +++ + +
Sbjct: 464 NFTNRRRTKHTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 513


>gi|156102174|ref|XP_001616780.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805654|gb|EDL47053.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1007

 Score = 42.4 bits (98), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 101 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 151
           F+ R    Y KG D I  S   SD GWGCM+R  QM++A  L+ +++ + +
Sbjct: 468 FAKRKRDRYSKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 516


>gi|149030140|gb|EDL85217.1| rCG23129 [Rattus norvegicus]
          Length = 90

 Score = 41.6 bits (96), Expect = 0.89,   Method: Composition-based stats.
 Identities = 16/44 (36%), Positives = 30/44 (68%), Gaps = 1/44 (2%)

Query: 358 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 401
           ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 5   NLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 47


>gi|50303849|ref|XP_451871.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49641003|emb|CAH02264.1| KLLA0B07667p [Kluyveromyces lactis]
          Length = 1999

 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 74/173 (42%), Gaps = 20/173 (11%)

Query: 10   ASKCFS------KSTPDTPNRSLASVGSELGSSETVKRLVTAGSMRRIHERVLGPSRTGI 63
            +SKCF       KS  DT  ++L    S +  S++VKRL T   M  I  R+ G  R   
Sbjct: 1024 SSKCFEFLAKSVKSDDDTLLQALRDATSNVLFSKSVKRLQTLYKMDGI--RMDGHRRVSR 1081

Query: 64   SSSTSDIWLLGVCHKIAQDEALGDAAGNNGL-AEFNQD----FSSRILISYRKGFDPIGD 118
            S       L  +  K   DE       +N + A F +D        +LI  R+  D + D
Sbjct: 1082 SQ------LTHILFKERTDEYDRSIIDSNSIYALFKKDNVNLTKKMVLIEERRLNDYLAD 1135

Query: 119  SKITSDVGWGCMLRSSQMLVAQALLF-HRLGRPWRKPLQKPFDREYVEILHLF 170
             +   + G+ C LR  + + + A L   +  R W    ++   R+ +++L +F
Sbjct: 1136 DRYQKEAGYACALRVIRKVASTAYLRDFKSTREWYLAARENVKRQRIQLLPVF 1188


>gi|390457789|ref|XP_003732004.1| PREDICTED: cysteine protease ATG4B-like [Callithrix jacchus]
          Length = 102

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 16/51 (31%), Positives = 29/51 (56%)

Query: 362 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 412
           S+++GF+C+ +DDF+D C +  KL+      P+F + +     +   DVL 
Sbjct: 25  SISVGFFCKTEDDFNDRCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 75


>gi|426336111|ref|XP_004029547.1| PREDICTED: uncharacterized protein LOC101129491 [Gorilla gorilla
           gorilla]
          Length = 351

 Score = 40.0 bits (92), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 15/41 (36%), Positives = 25/41 (60%)

Query: 160 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 200
           +R + +I+  F D   +PF +H L++ G++ G  AG W GP
Sbjct: 51  ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP 91


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.136    0.415 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,450,998,062
Number of Sequences: 23463169
Number of extensions: 325879431
Number of successful extensions: 652308
Number of sequences better than 100.0: 783
Number of HSP's better than 100.0 without gapping: 762
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 649429
Number of HSP's gapped (non-prelim): 1361
length of query: 443
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 297
effective length of database: 8,933,572,693
effective search space: 2653271089821
effective search space used: 2653271089821
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)