BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 011418
         (486 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255576671|ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
 gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B, putative [Ricinus communis]
          Length = 489

 Score =  699 bits (1805), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/493 (73%), Positives = 413/493 (83%), Gaps = 11/493 (2%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           MKGFRE+  AS+C SK   DTPNRSL S   E GS+   S+KGSL SS F SAFSVFETY
Sbjct: 1   MKGFRERV-ASRCSSKCPVDTPNRSLTSDCLESGSN--FSTKGSLWSSFFASAFSVFETY 57

Query: 61  SESS-ASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
            ES  ASEKK  H++ NGWT+AVK++V+ GSMRRIHERVLGPSRTGISS+TSDIWLLGVC
Sbjct: 58  RESPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGVC 117

Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
           +KI++DE+ G+A   N LAEF  D+SSRIL++YR+GFD IGDSK  SDVGWGCMLRSSQM
Sbjct: 118 YKISEDES-GNADTGNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQM 176

Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
           LVAQALLFH+LGR W KP QKP D+ YVEILHLFGDSE +PFSIHNL+QAGKAY LAAGS
Sbjct: 177 LVAQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGS 236

Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
           WVGPYAMCRSWE+LAR +R E  L  QSLPMA+YVVSGDEDGERGGAPVV I+DASRHC 
Sbjct: 237 WVGPYAMCRSWESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCL 296

Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
            FS+GQADWTPILLLVPLVLGL+KVNPRYIP+L+ TFTF QSLGI+GGKPGASTYIVGVQ
Sbjct: 297 EFSRGQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGVQ 356

Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
           +++A YLDPH+VQ V+NIG+DD+EADTS+YHSD++RHI L SIDPSLAIGFYCRDKDDFD
Sbjct: 357 DDNAFYLDPHEVQSVVNIGRDDIEADTSSYHSDIVRHIPLHSIDPSLAIGFYCRDKDDFD 416

Query: 420 DFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVL-GETGGVPEDDSLGV-MSMNDAV 475
           +FC  ASKLA++S GAPLFTV   HK  KPV+H D+L  E   V EDDS+ V M +ND  
Sbjct: 417 EFCLLASKLADDSQGAPLFTVAHCHKLPKPVSHGDMLNNEDDEVQEDDSVNVMMPVNDDA 476

Query: 476 --GNAHEDDWQLL 486
             G A ED+WQLL
Sbjct: 477 EGGGAQEDEWQLL 489


>gi|359495820|ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
 gi|296086874|emb|CBI33041.3| unnamed protein product [Vitis vinifera]
          Length = 486

 Score =  681 bits (1758), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/493 (71%), Positives = 405/493 (82%), Gaps = 14/493 (2%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           MKGF EKA ASK   K+  D+ N       SE  SS++K SK SL SS+F SAFSVFET 
Sbjct: 1   MKGFCEKAVASKFSCKTKSDSSN-------SEPQSSDTKLSKVSLWSSVFASAFSVFETN 53

Query: 61  SESS--ASEKKAVHN-KSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 117
           SESS  ASEKKA+ N ++NGWT AV+++VT  SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54  SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113

Query: 118 VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 177
           +C+KI+Q+E+   A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173

Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
           QMLVAQALL HR+GR WRK   KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233

Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 297
           GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
           C  FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L  TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353

Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
           VQ+E A YLDPH+ Q V++I +++LEADTS+YH ++IRHI LDSIDPSLAIGFYCRDKDD
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDD 413

Query: 418 FDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVLGETGGVPEDDSLGVMSMNDAV 475
           FDDFC RASKLA++SNGAPLFTV   H   KP++ SD + +  G  EDDS  V+S   A 
Sbjct: 414 FDDFCIRASKLADKSNGAPLFTVAHIHSLPKPISCSDGMDDCSGFREDDSFDVVSNKGAE 473

Query: 476 G--NAHEDDWQLL 486
           G  + HEDDWQLL
Sbjct: 474 GYEHEHEDDWQLL 486


>gi|224117658|ref|XP_002331599.1| predicted protein [Populus trichocarpa]
 gi|222873995|gb|EEF11126.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  680 bits (1754), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/488 (71%), Positives = 407/488 (83%), Gaps = 8/488 (1%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           MKGFRE+   +   S ST ++PNRS  S  SELGS+++K SK SL S+ F SAFSVF+T+
Sbjct: 1   MKGFRERGFVASSKSSSTAESPNRSFTSDSSELGSADTKFSKPSLWSTFFASAFSVFDTH 60

Query: 61  SESSA-SEKKAVHNK-SNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGV 118
            +SS+ SEKKA H +  NGWT+AVK++V  GSMRRI E VLG S+TGIS++T DIWLLG 
Sbjct: 61  CDSSSTSEKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGA 120

Query: 119 CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 178
           C+KI+QD + GDAA  N LA FN DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQ
Sbjct: 121 CYKISQDNSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQ 180

Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 238
           MLVAQALLFHRLGR WRKPL KP DREYVEILHLFGDSE+S FSIHNLL+AGKAYGLAAG
Sbjct: 181 MLVAQALLFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAG 240

Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
           SWVGPYA+C SWE+L R +R ET L  QSL MA+YVVSG EDGERGGAPV+CI++A+RHC
Sbjct: 241 SWVGPYAVCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHC 300

Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
           S FSKGQ DWTPILLLVPLVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGV
Sbjct: 301 SEFSKGQEDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGV 360

Query: 359 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
           Q+E+A YLDPH+VQPV+N+ +DD+EA+TS+YH +V+RH+ LD IDPSLAIGFYCRDKDDF
Sbjct: 361 QDENAFYLDPHEVQPVVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAIGFYCRDKDDF 420

Query: 419 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNA 478
           DDFC  ASKL +ESNGAPLFTV  + +K + H     ++G V  DDSLGVM+MND  G  
Sbjct: 421 DDFCTLASKLTDESNGAPLFTVAHS-RKLLKH-----DSGEVRSDDSLGVMTMNDVEGCV 474

Query: 479 HEDDWQLL 486
           HEDDWQLL
Sbjct: 475 HEDDWQLL 482


>gi|147862867|emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  678 bits (1750), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/496 (71%), Positives = 405/496 (81%), Gaps = 17/496 (3%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           MKGF EKA ASK   K+  D+ N       SE  SS++K SK SL SS+F SAFSVFET 
Sbjct: 1   MKGFCEKAVASKFSCKTKSDSSN-------SEPQSSDTKLSKVSLWSSVFASAFSVFETN 53

Query: 61  SESS--ASEKKAVHN-KSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLG 117
           SESS  ASEKKA+ N ++NGWT AV+++VT  SMRRI ERVLG S+TGISSSTSDIWLLG
Sbjct: 54  SESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLG 113

Query: 118 VCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSS 177
           +C+KI+Q+E+   A+ +NGLAEF QDFSSRIL++YRKGF+ IGDSK+TSDV WGCMLRSS
Sbjct: 114 LCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSS 173

Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
           QMLVAQALL HR+GR WRK   KP D++Y+EILH FGDS+ S FSIHN+LQAGKAYGLAA
Sbjct: 174 QMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAA 233

Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 297
           GSWVGPYAMCRSWE LAR +R ET L CQSLPMAIY+VSGDEDGERGGAPVV I++ASRH
Sbjct: 234 GSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
           C  FSKGQ DWTPILLLVPLVLGLEKVNPRYIP+L  TFTFPQSLGI+GGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353

Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYH---SDVIRHIHLDSIDPSLAIGFYCRD 414
           VQ+E A YLDPH+ Q V++I +++LEADTS+YH   S +IRHI LDSIDPSLAIGFYCRD
Sbjct: 354 VQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRD 413

Query: 415 KDDFDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVLGETGGVPEDDSLGVMSMN 472
           KDDFDDFC RASKLA+ESNGAPLFTV   H   KP++ SD + +  G  EDDS  V+S  
Sbjct: 414 KDDFDDFCIRASKLADESNGAPLFTVAHIHSLPKPISCSDGMDDCSGFREDDSFDVVSNK 473

Query: 473 DAVG--NAHEDDWQLL 486
            A G  + HEDDWQLL
Sbjct: 474 GAEGYEHEHEDDWQLL 489


>gi|224092798|ref|XP_002309707.1| predicted protein [Populus trichocarpa]
 gi|222852610|gb|EEE90157.1| predicted protein [Populus trichocarpa]
          Length = 481

 Score =  674 bits (1738), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/488 (70%), Positives = 404/488 (82%), Gaps = 9/488 (1%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           MK FR++ GA      +T DTP  S  S  SE GS+++K SK SL SS F SAFSVF+ Y
Sbjct: 1   MKVFRDR-GAVSPSKTTTTDTPKSSFISDSSEPGSTDTKVSKPSLWSSFFASAFSVFDIY 59

Query: 61  SESSA-SEKKAVHNK-SNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGV 118
            +SS+ S  +A H + SNGWT++VK++V  G+MRRI ERVLG S+TGIS++TSDIWLLG 
Sbjct: 60  RDSSSTSHNEAPHIRHSNGWTSSVKKIVAGGTMRRIQERVLGTSKTGISNTTSDIWLLGA 119

Query: 119 CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQ 178
            +KI+QD++ G+A   N LA F++DFSSRILI+YRKGFD I DSK+TSDV WGCMLRSSQ
Sbjct: 120 RYKISQDDSSGNADATNALAAFHRDFSSRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQ 179

Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 238
           MLVAQALLFHRLGR WRKP+ KP DR+YVEILHLFGDSE S FSIHNLLQAGKAYGLAAG
Sbjct: 180 MLVAQALLFHRLGRSWRKPVDKPLDRDYVEILHLFGDSEASAFSIHNLLQAGKAYGLAAG 239

Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
           SWVGPYAMCRSWE+LAR +R ET L  Q+LPMA+YVVSG EDGERGGAPV+ I+DA+RHC
Sbjct: 240 SWVGPYAMCRSWESLARSKREETNLEYQTLPMAVYVVSGCEDGERGGAPVLSIEDAARHC 299

Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
           S FSKG+ DWTPILLLVPLVLGL+K+NPRYIP+L+ TFTFPQSLGI+GGKPGASTYIVGV
Sbjct: 300 SEFSKGREDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGV 359

Query: 359 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
           Q+E+A YLDPH+VQPV+N  +DD+EA+TS+YH DV+RHI LD IDPSLAIGFYCRDKDDF
Sbjct: 360 QDENAFYLDPHEVQPVVNFSRDDVEANTSSYHCDVVRHIPLDLIDPSLAIGFYCRDKDDF 419

Query: 419 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNA 478
           DDFC+ ASKLA+ESNGAPLFTV  ++K   +      ++  V +DD LGVM+MNDA G  
Sbjct: 420 DDFCSLASKLADESNGAPLFTVANSYKSSKH------DSSEVRDDDPLGVMTMNDAEGCL 473

Query: 479 HEDDWQLL 486
           +EDDWQLL
Sbjct: 474 NEDDWQLL 481


>gi|449442361|ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
 gi|449512710|ref|XP_004164121.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
          Length = 483

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 332/483 (68%), Positives = 379/483 (78%), Gaps = 3/483 (0%)

Query: 5   REKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESS 64
           R K   S C  +   D  +R+  SV  ELGS    SSK S  S  F+S FS+FE + +SS
Sbjct: 3   RGKDLKSTCSPEPAADAIDRTHRSVYPELGSKNHISSKASSWSGFFSSNFSIFEHHKDSS 62

Query: 65  ASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQ 124
            +EKK  H + N W A V++++T+GSMRRI ER+LG  R+G+ SS  DIWLLGVCHKI+Q
Sbjct: 63  VTEKKVFHPRHNVW-ATVRKVMTSGSMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKISQ 121

Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
           D    DAA + G+A + QDFSSRIL++YRKGF  I DSK TSDV WGCMLRSSQMLVAQA
Sbjct: 122 DHPPDDAASSPGVAGYEQDFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQA 181

Query: 185 LLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPY 244
           LLFHRLGR WRKP QKP D+EYVEILHLFGDSETS FSIHNLLQAG+AY LAAGSWVGPY
Sbjct: 182 LLFHRLGRSWRKPSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGPY 241

Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 304
           AMCRSWE L R +R    L  Q LPMAIY+VSGDEDGERGGAPV+ IDDASRHC  FSKG
Sbjct: 242 AMCRSWETLVRSKRETPILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSKG 301

Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
           Q DW+PILLLVPLVLGLEK+NPRYIP+LR TFTFPQSLGI+GGKPGASTYIVGVQ+E+A 
Sbjct: 302 QHDWSPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENAF 361

Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
           YLDPH+VQ V+NI KDDLEADTS+YH +VIRHI L+SIDPSLAIGFYCRDKDDFD+FC R
Sbjct: 362 YLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCHR 421

Query: 425 ASKLAEESNGAPLFTVTQTHK-KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDW 483
           ASKLAEES+GAPLFTV +TH   P   S  L +   + EDD  GV+ M +    +HEDDW
Sbjct: 422 ASKLAEESDGAPLFTVAETHSTNPGRQSSALNDHSRLVEDDGDGVVHMPNE-EESHEDDW 480

Query: 484 QLL 486
           Q L
Sbjct: 481 QFL 483


>gi|356531828|ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 486

 Score =  633 bits (1632), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 337/489 (68%), Positives = 386/489 (78%), Gaps = 8/489 (1%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           +KG  E+  +SKC SKS+ +T + +   V S+ GSS SK  K SL S++F S FSV ETY
Sbjct: 3   LKGLCERIVSSKCSSKSSTETVDNTQVPVYSKAGSSNSKFPKASLWSNIFTSGFSVVETY 62

Query: 61  SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
           SESSASEKKAVH++S+GW AAV+++VT GSMRR  ERVLG SRT ISSS  DIWLLGVCH
Sbjct: 63  SESSASEKKAVHSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCH 122

Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
           KI+Q E+ G    +NGLA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQML
Sbjct: 123 KISQQESSGGVDNSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQML 182

Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
           VAQALLFH+LGR WRKP+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSW
Sbjct: 183 VAQALLFHKLGRSWRKPIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSW 242

Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
           VGPYAMCR+WE LA   R +  LG   LPMAIYVVSGDEDGERGGAPVVCI+DAS+ C  
Sbjct: 243 VGPYAMCRTWEVLA---RKKNDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCFE 299

Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
           FS G A WTP+LLLVPLVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+G Q 
Sbjct: 300 FSSGLAAWTPLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGAQN 359

Query: 361 ESAIYLDPHDVQPVINIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
           E A YLDPHDVQ V+NI  D  E   TS+YH +++RHI LDSIDPSLAIGFYCRDKDDFD
Sbjct: 360 EKAFYLDPHDVQQVVNISGDTQEPTSTSSYHCNIMRHIPLDSIDPSLAIGFYCRDKDDFD 419

Query: 420 DFCARASKLAEESNGAPLFTVTQTH--KKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGN 477
           DFC++ASKLAEESNGAPLFTVTQ+    K V  +DV G+  G  E+D  G+   ND   N
Sbjct: 420 DFCSQASKLAEESNGAPLFTVTQSRSFSKQVTSNDVSGDNTGFQEEDFPGMDRGNDTGTN 479

Query: 478 AHEDDWQLL 486
             EDDWQLL
Sbjct: 480 --EDDWQLL 486


>gi|356568569|ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 485

 Score =  632 bits (1629), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 339/489 (69%), Positives = 389/489 (79%), Gaps = 9/489 (1%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           +KG  E+  +SKC SKS+ +T + +   V S+ GSS+ K  K SL SS+F S FSV ETY
Sbjct: 3   LKGLCERIVSSKCSSKSSTETVDNTQVPVYSKAGSSDCKFPKASLWSSIFTSGFSVVETY 62

Query: 61  SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
           SESSASEKKAV ++S+GW AAV+++VT GSMRR  ERVLG SRT ISSS  DIWLLGVCH
Sbjct: 63  SESSASEKKAVPSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCH 122

Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
           KI+Q E+ G    +NGLA F QDFSS+IL++YRKGFD IGD+K TSDV WGCMLRSSQML
Sbjct: 123 KISQQESTGGVDTSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQML 182

Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
           VAQALLFH+LGR WRKP+ KP D+EY+++L LFGDSE S FSIHNLLQAGK YGLA GSW
Sbjct: 183 VAQALLFHKLGRSWRKPIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSW 242

Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
           VGPYAMCR+WE LA   R +  LG   LPMAIYVVSGDEDGERGGAPVVCI+DAS+ CS 
Sbjct: 243 VGPYAMCRTWEVLA---RKKNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCSE 299

Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
           FS G A WTP+LLLVPLVLGL+KVNPRYIP LR TF FPQSLGI+GGKPGASTYI+GVQ 
Sbjct: 300 FSSGLAVWTPLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQN 359

Query: 361 ESAIYLDPHDVQPVINIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
           E A YLDPHDVQ V+NI  D  E   TS+YH +V+RHI LDSIDPSLAIGFYCRDKDDFD
Sbjct: 360 EKAFYLDPHDVQQVVNISGDTQEPTGTSSYHCNVMRHIPLDSIDPSLAIGFYCRDKDDFD 419

Query: 420 DFCARASKLAEESNGAPLFTV--TQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGN 477
           DFC++ASKLAEESNGAPLFTV  +++  K V++ DV G+  G  EDD  G+   ND V N
Sbjct: 420 DFCSQASKLAEESNGAPLFTVAKSRSFSKQVSN-DVSGDNTGFQEDDFPGMDCGNDTVTN 478

Query: 478 AHEDDWQLL 486
             EDDWQLL
Sbjct: 479 --EDDWQLL 485


>gi|357507987|ref|XP_003624282.1| Cysteine protease ATG4 [Medicago truncatula]
 gi|147742964|sp|A2Q1V6.1|ATG4_MEDTR RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|124359485|gb|ABN05923.1| Peptidase C54 [Medicago truncatula]
 gi|355499297|gb|AES80500.1| Cysteine protease ATG4 [Medicago truncatula]
          Length = 487

 Score =  610 bits (1573), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 320/488 (65%), Positives = 377/488 (77%), Gaps = 5/488 (1%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           +K   ++  A+KC SKS+ +  + +     S+ GSS+SK  K SL S+ F S FSV ETY
Sbjct: 3   LKDLCDRIVAAKCSSKSSTEIVDNTQVPASSKAGSSDSKFPKASLWSTFFTSGFSVDETY 62

Query: 61  SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
           SESS+SEKK VH++++GW AAV+++V+ GSMRR  ERVLG  RT +SSS  DIWLLGVCH
Sbjct: 63  SESSSSEKKTVHSRNSGWAAAVRKVVSGGSMRRFQERVLGSCRTDVSSSDGDIWLLGVCH 122

Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
           KI+Q E+ GD    N  A F QDF SRILI+YRKGFD I DSK TSDV WGCMLRSSQML
Sbjct: 123 KISQHESTGDVDIRNVFAAFEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQML 182

Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
           VAQALLFH+LGR WRK + KP D+EY++IL LFGDSE + FSIHNLLQAGK YGLA GSW
Sbjct: 183 VAQALLFHKLGRSWRKTVDKPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSW 242

Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
           VGPYAMCR+WE LAR QR +   G Q LPMAIYVVSGDEDGERGGAPVVCI+DA + C  
Sbjct: 243 VGPYAMCRTWEVLARNQREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCIEDACKRCLE 302

Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
           FS+G   WTP+LLLVPLVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ 
Sbjct: 303 FSRGLVPWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQN 362

Query: 361 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
           + A YLDPH+V+PV+NI  D  E +TS+YH ++ RH+ LDSIDPSLAIGFYCRDKDDFDD
Sbjct: 363 DKAFYLDPHEVKPVVNITGDTQEPNTSSYHCNISRHMPLDSIDPSLAIGFYCRDKDDFDD 422

Query: 421 FCARASKLAEESNGAPLFTVTQTHKKP--VNHSDVLGETGGVPEDDSLGVMSMNDAVGNA 478
           FC+RA+KLAEESNGAPLFTV Q+   P  V  + V G+     EDDSL +  +NDA    
Sbjct: 423 FCSRATKLAEESNGAPLFTVAQSRSLPMQVTSNSVSGDDTRFEEDDSLSMNLVNDA---G 479

Query: 479 HEDDWQLL 486
           +EDDWQ L
Sbjct: 480 NEDDWQFL 487


>gi|388514549|gb|AFK45336.1| unknown [Lotus japonicus]
          Length = 489

 Score =  585 bits (1507), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 311/489 (63%), Positives = 359/489 (73%), Gaps = 5/489 (1%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           +K F ++  A+KC SKS+ +T + S     S+ GSS+SK  K SL SS F S FSV ETY
Sbjct: 3   LKAFCDRIVAAKCSSKSSTETVDNSQVPACSKAGSSDSKFPKASLWSSFFTSGFSVIETY 62

Query: 61  SESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRI-HERVLGPSRTGISSSTSDIWLLGVC 119
           S+S ASEKKAVH++++GW    +          I     LG +     +       LGVC
Sbjct: 63  SKSPASEKKAVHSQNSGWGCCCEESCYCWLNEEIPRACTLGQAELTFQALMVIYGFLGVC 122

Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
           HK +Q E+ GD   +   A F QDFSS+IL++YRKGFD IGDSK TSDV WGCMLRSSQM
Sbjct: 123 HKFSQQESTGDVDNSTVFAAFEQDFSSKILLTYRKGFDAIGDSKYTSDVNWGCMLRSSQM 182

Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
           LVAQALLFH+LGR WRK   KP D+EY++IL  FGDSE S FSIHNLLQAGK YGLA GS
Sbjct: 183 LVAQALLFHKLGRMWRKTTDKPLDKEYLDILQHFGDSEASSFSIHNLLQAGKGYGLAVGS 242

Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
           WVGPYAMCRSWE LAR QR     G Q LPMA+YVVSGDEDGERGGAPVVCI+DASR CS
Sbjct: 243 WVGPYAMCRSWEVLARNQRETNDHGEQPLPMALYVVSGDEDGERGGAPVVCIEDASRRCS 302

Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
            FS+G A WTP+LLLVPLVLGL+KVN RYIP L+ TF FPQSLGI+GGKPGASTYI+GVQ
Sbjct: 303 EFSRGLAAWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQ 362

Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
            E A YLDPHDVQPV++I  D  + +TS+YH +++R + LDSIDPSLAIGFYCRDKDDFD
Sbjct: 363 NEKAFYLDPHDVQPVVHINGDAQDPNTSSYHCNIVRQMPLDSIDPSLAIGFYCRDKDDFD 422

Query: 420 DFCARASKLAEESNGAPLFTVTQTHKKPVNHS--DVLGETGGVPEDDSLGVMSMNDAVGN 477
           DFC+RASKLAEESNGAPLFTV Q    P   +  DV G+  G  EDDS GV  +NDA  N
Sbjct: 423 DFCSRASKLAEESNGAPLFTVAQFRSFPFQDAGYDVSGDNTGFQEDDSHGVDLLNDAGTN 482

Query: 478 AHEDDWQLL 486
             EDDWQLL
Sbjct: 483 --EDDWQLL 489


>gi|30689628|ref|NP_850412.1| cysteine protease ATG4a [Arabidopsis thaliana]
 gi|75160546|sp|Q8S929.1|ATG4A_ARATH RecName: Full=Cysteine protease ATG4a; AltName:
           Full=Autophagy-related protein 4 homolog a;
           Short=AtAPG4a; Short=Protein autophagy 4a
 gi|19912143|dbj|BAB88383.1| autophagy 4a [Arabidopsis thaliana]
 gi|110742303|dbj|BAE99076.1| hypothetical protein [Arabidopsis thaliana]
 gi|330255286|gb|AEC10380.1| cysteine protease ATG4a [Arabidopsis thaliana]
          Length = 467

 Score =  561 bits (1447), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 275/453 (60%), Positives = 350/453 (77%), Gaps = 6/453 (1%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           MK   ++    +C S S  DT ++S   + S+ G S++KS K +L S++F S+ SV + Y
Sbjct: 1   MKALCDRFVPQQCSSSSKSDTHDKS--PLVSDSGPSDNKS-KFTLWSNVFTSSSSVSQPY 57

Query: 61  SESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
            ESS S  K V    NGWTA VKR+ + +G++RR  ERVLGP+RTG+ S+TSD+WLLGVC
Sbjct: 58  RESSTSGHKQVCTTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVC 117

Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
           +KI+ DE  G+      LA    DFSS+IL++YRKGF+P  D+  TSDV WGCM+RSSQM
Sbjct: 118 YKISADENSGETDTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQM 177

Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
           L AQALLFHRLGR W K  + P ++EY+E L  FGDSE S FSIHNL+ AG +YGLAAGS
Sbjct: 178 LFAQALLFHRLGRAWTKKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGS 236

Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
           WVGPYA+CR+WE+LA  +R +T    Q+LPMA+++VSG EDGERGGAP++CI+DA++ C 
Sbjct: 237 WVGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCL 296

Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
            FSKGQ++WTPI+LLVPLVLGL+ VNPRYIP+L  TFTFPQS+GI+GGKPGASTYIVGVQ
Sbjct: 297 EFSKGQSEWTPIILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQ 356

Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
           E+   YLDPH+VQ V+ + K+  + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFD
Sbjct: 357 EDKGFYLDPHEVQQVVTVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFD 416

Query: 420 DFCARASKLAEESNGAPLFTVTQTHKKPVNHSD 452
           DFC RA KLAEESNGAPLFTVTQTH   +N S+
Sbjct: 417 DFCLRALKLAEESNGAPLFTVTQTHTA-INQSN 448


>gi|297828133|ref|XP_002881949.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
 gi|297327788|gb|EFH58208.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
          Length = 467

 Score =  549 bits (1415), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 278/445 (62%), Positives = 351/445 (78%), Gaps = 5/445 (1%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           MK   ++    +C S S  DT ++S   V S+ G S++KS K +L S++F S+ SV + Y
Sbjct: 1   MKALCDRFVPQQCSSSSKSDTHDKS--PVVSDSGPSDNKS-KFTLWSNVFTSSSSVSQPY 57

Query: 61  SESSASEKKAVHNKSNGWTAAVKRLVTA-GSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
            ESS S  K V    NGWTA VKR+  A G++RR  ERVLGP+RTG+ S+TSD+WLLGVC
Sbjct: 58  RESSTSGHKQVCTTRNGWTAFVKRVSMATGAIRRFQERVLGPNRTGLPSTTSDVWLLGVC 117

Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
           +KI++DEA G+      LA F QDFSS+IL++YR+GF+P  D+  TSDV WGCM+RSSQM
Sbjct: 118 YKISEDEASGETNTGCVLAAFQQDFSSKILMTYRRGFEPFRDTTYTSDVNWGCMIRSSQM 177

Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
           L AQALLFHRLGR W K  + P ++EY+E L  FGDSE+S FSIHNL+ AG +YGLAAGS
Sbjct: 178 LFAQALLFHRLGRSWTKKSELP-EQEYLETLEPFGDSESSAFSIHNLIIAGSSYGLAAGS 236

Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
           WVGPYA+CR+WE+LA  +R +T    Q+LPMA+++VSG EDGERGGAP++CI+DA++ C 
Sbjct: 237 WVGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCL 296

Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
            FSKGQ++WTPILLLVPLVLGL+ VNPRYIP+L  TFTFPQS+GI+GGKPGASTYIVGVQ
Sbjct: 297 EFSKGQSEWTPILLLVPLVLGLDSVNPRYIPSLIATFTFPQSVGILGGKPGASTYIVGVQ 356

Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
           E+   YLDPH+VQ V+ + K+  + DTS+YH +VIR++ L+S+DPSLA+GFYCRDKDDFD
Sbjct: 357 EDKGFYLDPHEVQQVVTVNKETPDVDTSSYHCNVIRYVPLESLDPSLALGFYCRDKDDFD 416

Query: 420 DFCARASKLAEESNGAPLFTVTQTH 444
           DFC RASKLAE+SNGAPLFT+TQTH
Sbjct: 417 DFCLRASKLAEDSNGAPLFTITQTH 441


>gi|42571227|ref|NP_973687.1| cysteine protease ATG4a [Arabidopsis thaliana]
 gi|330255287|gb|AEC10381.1| cysteine protease ATG4a [Arabidopsis thaliana]
          Length = 422

 Score =  539 bits (1388), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 254/392 (64%), Positives = 315/392 (80%), Gaps = 3/392 (0%)

Query: 62  ESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCH 120
           ESS S  K V    NGWTA VKR+ + +G++RR  ERVLGP+RTG+ S+TSD+WLLGVC+
Sbjct: 14  ESSTSGHKQVCTTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCY 73

Query: 121 KIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
           KI+ DE  G+      LA    DFSS+IL++YRKGF+P  D+  TSDV WGCM+RSSQML
Sbjct: 74  KISADENSGETDTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQML 133

Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW 240
            AQALLFHRLGR W K  + P ++EY+E L  FGDSE S FSIHNL+ AG +YGLAAGSW
Sbjct: 134 FAQALLFHRLGRAWTKKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSW 192

Query: 241 VGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV 300
           VGPYA+CR+WE+LA  +R +T    Q+LPMA+++VSG EDGERGGAP++CI+DA++ C  
Sbjct: 193 VGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLE 252

Query: 301 FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
           FSKGQ++WTPI+LLVPLVLGL+ VNPRYIP+L  TFTFPQS+GI+GGKPGASTYIVGVQE
Sbjct: 253 FSKGQSEWTPIILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQE 312

Query: 361 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
           +   YLDPH+VQ V+ + K+  + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFDD
Sbjct: 313 DKGFYLDPHEVQQVVTVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDD 372

Query: 421 FCARASKLAEESNGAPLFTVTQTHKKPVNHSD 452
           FC RA KLAEESNGAPLFTVTQTH   +N S+
Sbjct: 373 FCLRALKLAEESNGAPLFTVTQTHTA-INQSN 403


>gi|3212867|gb|AAC23418.1| unknown protein [Arabidopsis thaliana]
          Length = 451

 Score =  529 bits (1362), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 263/453 (58%), Positives = 337/453 (74%), Gaps = 22/453 (4%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           MK   ++    +C S S  DT ++S   + S+ G S++KS K +L S++F S+ SV + Y
Sbjct: 1   MKALCDRFVPQQCSSSSKSDTHDKS--PLVSDSGPSDNKS-KFTLWSNVFTSSSSVSQPY 57

Query: 61  SESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
            ESS S  K V    NGWTA VKR+ + +G++RR  ERVLGP+RTG+ S+TSD+WLLGVC
Sbjct: 58  RESSTSGHKQVCTTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVC 117

Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
           +KI+ DE  G+      LA    DFSS+IL++YRKGF+P  D+  TSDV WGCM+RSSQM
Sbjct: 118 YKISADENSGETDTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQM 177

Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
           L AQ                   ++EY+E L  FGDSE S FSIHNL+ AG +YGLAAGS
Sbjct: 178 LFAQLP-----------------EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGS 220

Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
           WVGPYA+CR+WE+LA  +R +T    Q+LPMA+++VSG EDGERGGAP++CI+DA++ C 
Sbjct: 221 WVGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCL 280

Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
            FSKGQ++WTPI+LLVPLVLGL+ VNPRYIP+L  TFTFPQS+GI+GGKPGASTYIVGVQ
Sbjct: 281 EFSKGQSEWTPIILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQ 340

Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
           E+   YLDPH+VQ V+ + K+  + DTS+YH +V+R++ L+S+DPSLA+GFYCRDKDDFD
Sbjct: 341 EDKGFYLDPHEVQQVVTVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFD 400

Query: 420 DFCARASKLAEESNGAPLFTVTQTHKKPVNHSD 452
           DFC RA KLAEESNGAPLFTVTQTH   +N S+
Sbjct: 401 DFCLRALKLAEESNGAPLFTVTQTHTA-INQSN 432


>gi|297820846|ref|XP_002878306.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
 gi|297324144|gb|EFH54565.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
          Length = 476

 Score =  526 bits (1356), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 276/487 (56%), Positives = 351/487 (72%), Gaps = 12/487 (2%)

Query: 1   MKGFREKAGASKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETY 60
           MK   ++   SKC S  T +  + S  +       S    S  +L S +  S+  V +  
Sbjct: 1   MKAICDRFVPSKCSSSCTSEKRDIS-PTSLVSDSPSSDDKSNLTLCSDVVESSSPVSQPC 59

Query: 61  SESSASEKKAVHNKSNGWTAAVKRL-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVC 119
            E+S SE K V    N WT  +K   + +G++RR  +RVLGPSRTGISSSTS+IWLLGVC
Sbjct: 60  REASTSEHKQVCTTHNSWTVILKTASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVC 119

Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
           +KI++ E+  +A     LA F QDFSS IL++YR+GF+PIGD+  TSDV WGCMLRS QM
Sbjct: 120 YKISEAESFEEADAGRVLAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQM 179

Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGS 239
           L AQALLF RLGR WRK   +P + +Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGS
Sbjct: 180 LFAQALLFQRLGRSWRKKDSEPPNEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGS 239

Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
           WVGPYA+CRSWE+LAR  + ET +  +S  MA+++VSG EDGERGGAP++CI+D ++ C 
Sbjct: 240 WVGPYAVCRSWESLARKNKEETDVKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCL 299

Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
            FS+G  +W PILLLVPLVLGL+KVNPRYIP+L  TFTFPQSLGI+GGKPGASTYIVGVQ
Sbjct: 300 EFSEGDTEWPPILLLVPLVLGLDKVNPRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQ 359

Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
           E+   YLDPHDVQ V+ + K++ + DTS+YH + +R++ L+S+DPSLA+GFYC+DKDDFD
Sbjct: 360 EDKGFYLDPHDVQQVVTVKKENQDVDTSSYHCNTLRYVPLESLDPSLALGFYCQDKDDFD 419

Query: 420 DFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAH 479
           DFC RA+KLA +SNGAPLFTVTQ+H+             G+ E  S  V+S  +  G  H
Sbjct: 420 DFCIRATKLAGDSNGAPLFTVTQSHRT---------NDCGIAETSSSTVIS-TEISGEEH 469

Query: 480 EDDWQLL 486
           EDDWQLL
Sbjct: 470 EDDWQLL 476


>gi|115461386|ref|NP_001054293.1| Os04g0682000 [Oryza sativa Japonica Group]
 gi|75143803|sp|Q7XPW8.1|ATG4B_ORYSJ RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B
 gi|32488637|emb|CAE03430.1| OSJNBa0032F06.13 [Oryza sativa Japonica Group]
 gi|82470053|gb|ABB77259.1| autophagy 4 [Oryza sativa Indica Group]
 gi|113565864|dbj|BAF16207.1| Os04g0682000 [Oryza sativa Japonica Group]
 gi|215697216|dbj|BAG91210.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 478

 Score =  522 bits (1344), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 267/453 (58%), Positives = 345/453 (76%), Gaps = 17/453 (3%)

Query: 39  KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
           K SK S+LS +FNS F++FE + +SSA++     + S  W+  ++R+V +GSM R     
Sbjct: 38  KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF---- 93

Query: 99  LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
           LG S+     ++SD+W LG C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD 
Sbjct: 94  LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150

Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
           I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE 
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEA 210

Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
             FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           GDEDGERGGAPVVCID A++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
           TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ  ++I  D++EADTS+YH   +R 
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRD 390

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDV 453
           + LD IDPSLAIGFYCRDKDDFDDFC+RA++L +++NGAPLFTV Q+    K+  N  DV
Sbjct: 391 LALDLIDPSLAIGFYCRDKDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDV 450

Query: 454 LGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
           LG +G    D ++ V  + DA G   E++WQ+L
Sbjct: 451 LGISG----DGNINVEDL-DASGETGEEEWQIL 478


>gi|147742963|sp|Q2XPP4.2|ATG4B_ORYSI RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B;
           Short=Protein autophagy 4; AltName: Full=OsAtg4
          Length = 478

 Score =  518 bits (1334), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 266/453 (58%), Positives = 343/453 (75%), Gaps = 17/453 (3%)

Query: 39  KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
           K SK S+LS +FNS F++FE + +SSA++     + S  W   ++R+V +GSM R     
Sbjct: 38  KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF---- 93

Query: 99  LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
           LG S+     ++SD+W LG C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD 
Sbjct: 94  LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150

Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
           I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE 
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEA 210

Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
             FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           GDEDGERGGAPVVCID A++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
           TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ  ++I  D++EADTS+YH   +R 
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRD 390

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDV 453
           + LD IDPSLAIGFYCRDKDDFDDFC+RA++L +++NGAPLFTV Q+    K+  N  DV
Sbjct: 391 LALDLIDPSLAIGFYCRDKDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDV 450

Query: 454 LGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
           LG +G    D ++ V  + DA G   E++WQ+L
Sbjct: 451 LGISG----DGNINVEDL-DASGETGEEEWQIL 478


>gi|222629790|gb|EEE61922.1| hypothetical protein OsJ_16662 [Oryza sativa Japonica Group]
          Length = 892

 Score =  512 bits (1318), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 258/425 (60%), Positives = 330/425 (77%), Gaps = 12/425 (2%)

Query: 39  KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
           K SK S+LS +FNS F++FE + +SSA++     + S  W+  ++R+V +GSM R     
Sbjct: 38  KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWSRVLRRIVCSGSMWRF---- 93

Query: 99  LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
           LG S+     ++SD+W LG C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD 
Sbjct: 94  LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150

Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
           I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+PL+KP++ EY+ ILH+FGDSE 
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPLEKPYNPEYIGILHMFGDSEA 210

Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
             FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           GDEDGERGGAPVVCID A++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
           TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ  ++I  D++EADTS+YH   +R 
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRD 390

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDV 453
           + LD IDPSLAIGFYCRDKDDFDDFC+RA++L +++NGAPLFTV Q+    K+  N  DV
Sbjct: 391 LALDLIDPSLAIGFYCRDKDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDV 450

Query: 454 LGETG 458
           LG +G
Sbjct: 451 LGISG 455


>gi|15232213|ref|NP_191554.1| cysteine protease ATG4b [Arabidopsis thaliana]
 gi|75182325|sp|Q9M1Y0.1|ATG4B_ARATH RecName: Full=Cysteine protease ATG4b; AltName:
           Full=Autophagy-related protein 4 homolog b;
           Short=AtAPG4b; Short=Protein autophagy 4b
 gi|7019689|emb|CAB75814.1| putative protein [Arabidopsis thaliana]
 gi|19912145|dbj|BAB88384.1| autophagy 4b [Arabidopsis thaliana]
 gi|110742150|dbj|BAE99003.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646468|gb|AEE79989.1| cysteine protease ATG4b [Arabidopsis thaliana]
          Length = 477

 Score =  509 bits (1311), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 265/463 (57%), Positives = 342/463 (73%), Gaps = 12/463 (2%)

Query: 25  SLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKR 84
           S  S+ S+  SS++KS+  +L S +  S+  V +   E+S S    V    + WT  +K 
Sbjct: 26  SPTSLVSDSASSDNKSNL-TLCSDVVASSSPVSQLCREASTSGHNPVCTTHSSWTVILKT 84

Query: 85  L-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQD 143
             + +G++RR  +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+  +A     LA F QD
Sbjct: 85  ASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQD 144

Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
           FSS IL++YR+GF+PIGD+  TSDV WGCMLRS QML AQALLF RLGR WRK   +P D
Sbjct: 145 FSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPAD 204

Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
            +Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR  + ET  
Sbjct: 205 EKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDD 264

Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
             +S  MA+++VSG EDGERGGAP++CI+D ++ C  FS+G+ +W PILLLVPLVLGL++
Sbjct: 265 KHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDR 324

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
           VNPRYIP+L  TFTFPQSLGI+GGKPGASTYIVGVQE+   YLDPHDVQ V+ + K++ +
Sbjct: 325 VNPRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQD 384

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
            DTS+YH + +R++ L+S+DPSLA+GFYC+ KDDFDDFC RA+KLA +SNGAPLFTVTQ+
Sbjct: 385 VDTSSYHCNTLRYVPLESLDPSLALGFYCQHKDDFDDFCIRATKLAGDSNGAPLFTVTQS 444

Query: 444 HKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
           H++  N   +   +        +         G  HEDDWQLL
Sbjct: 445 HRR--NDCGIAETSSSTETSTEIS--------GEEHEDDWQLL 477


>gi|218195841|gb|EEC78268.1| hypothetical protein OsI_17962 [Oryza sativa Indica Group]
          Length = 912

 Score =  508 bits (1309), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 257/425 (60%), Positives = 328/425 (77%), Gaps = 12/425 (2%)

Query: 39  KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
           K SK S+LS +FNS F++FE + +SSA++     + S  W   ++R+V +GSM R     
Sbjct: 38  KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF---- 93

Query: 99  LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
           LG S+     ++SD+W LG C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD 
Sbjct: 94  LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 150

Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
           I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE 
Sbjct: 151 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEA 210

Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
             FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVS
Sbjct: 211 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 270

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           GDEDGERGGAPVVCID A++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 271 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 330

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
           TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ  ++I  D++EADTS+YH   +R 
Sbjct: 331 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRD 390

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDV 453
           + LD IDPSLAIGFYCRDKDDFDDFC+RA++L +++NGAPLFTV Q+    K+  N  DV
Sbjct: 391 LALDLIDPSLAIGFYCRDKDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDV 450

Query: 454 LGETG 458
           LG +G
Sbjct: 451 LGISG 455


>gi|147742949|sp|A2XHJ5.1|ATG4A_ORYSI RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|125544166|gb|EAY90305.1| hypothetical protein OsI_11880 [Oryza sativa Indica Group]
          Length = 473

 Score =  504 bits (1297), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 261/452 (57%), Positives = 332/452 (73%), Gaps = 16/452 (3%)

Query: 39  KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
           K SK S+LS +F+S FS+FE + +SSA      H+ S  W+  ++R+   GSM R     
Sbjct: 34  KQSKNSILSCVFSSPFSIFEAHQDSSAHRPLKPHSGSYAWSRFLRRIACTGSMWRF---- 89

Query: 99  LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
           LG S+   + ++SD+W LG C+K++ +E    +   +G A F +DFSSRI I+YRKGFD 
Sbjct: 90  LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 146

Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
           I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+  EY+ ILH+FGDSE 
Sbjct: 147 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEA 206

Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVS 276
             FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R  R   E   G  + PMA+YVVS
Sbjct: 207 CAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVRTNREHHEAVDGNGNFPMALYVVS 266

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           GDEDGERGGAPVVCID A++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 267 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKLNPRYIPLLKETF 326

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
           TFPQSLGI+GGKPG STY+ GVQ++  +YLDPH+VQ  ++I  D+LEADTS+YH   +R 
Sbjct: 327 TFPQSLGILGGKPGTSTYVAGVQDDRVLYLDPHEVQLAVDIAADNLEADTSSYHCSTVRD 386

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGE 456
           + LD IDPSLAIGFYCRDKDDFDDFC+RAS+L +++NGAPLFTV Q+ +      +    
Sbjct: 387 LALDLIDPSLAIGFYCRDKDDFDDFCSRASELVDKANGAPLFTVMQSVQPSKQMYNEESS 446

Query: 457 TGGVPEDDSLGVMSMN--DAVGNAHEDDWQLL 486
           +G     D + ++++   D  G   E++WQ+L
Sbjct: 447 SG-----DGMDIINVEGLDGSGETGEEEWQIL 473


>gi|75138024|sp|Q75KP8.1|ATG4A_ORYSJ RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|40539014|gb|AAR87271.1| putative autophagy protein (with alternative splicing) [Oryza
           sativa Japonica Group]
 gi|108708571|gb|ABF96366.1| Peptidase family C54 containing protein, expressed [Oryza sativa
           Japonica Group]
 gi|125586519|gb|EAZ27183.1| hypothetical protein OsJ_11120 [Oryza sativa Japonica Group]
 gi|215769128|dbj|BAH01357.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 474

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 263/453 (58%), Positives = 331/453 (73%), Gaps = 18/453 (3%)

Query: 39  KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
           K  K S+LS +F+S FS+FE + +SSA+     H+ S  W+  ++R+   GSM R     
Sbjct: 35  KQLKNSILSCVFSSPFSIFEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF---- 90

Query: 99  LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
           LG S+   + ++SD+W LG C+K++ +E    +   +G A F +DFSSRI I+YRKGFD 
Sbjct: 91  LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 147

Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
           I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+  EY+ ILH+FGDSE 
Sbjct: 148 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEA 207

Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVS 276
             FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L    R   E   G  + PMA+YVVS
Sbjct: 208 CAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVS 267

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           GDEDGERGGAPVVCID A++ C  F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T 
Sbjct: 268 GDEDGERGGAPVVCIDVAAQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETL 327

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
           TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ  ++I  D+LEA TS+YH   +R 
Sbjct: 328 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYHCSTVRD 387

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSDV 453
           + LD IDPSLAIGFYCRDKDDFDDFC+RAS+L +++NGAPLFTV Q+    K+  N    
Sbjct: 388 LALDLIDPSLAIGFYCRDKDDFDDFCSRASELVDKANGAPLFTVVQSVQPSKQMYNEESS 447

Query: 454 LGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
            G+  G+   DS+ V  + D  G   E++WQ+L
Sbjct: 448 SGD--GM---DSINVEGL-DGSGETGEEEWQIL 474


>gi|90399070|emb|CAJ86292.1| H0124B04.9 [Oryza sativa Indica Group]
          Length = 1216

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 257/458 (56%), Positives = 328/458 (71%), Gaps = 45/458 (9%)

Query: 39  KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
           K SK S+LS +FNS F++FE + +SSA++     + S  W   ++R+V +GSM R     
Sbjct: 309 KQSKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF---- 364

Query: 99  LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
           LG S+     ++SD+W LG C+K++ +E+  D+   +G A F +DFSSRI I+YR+GFD 
Sbjct: 365 LGTSKV---LTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDA 421

Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
           I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WR+P +KP++ EY+ ILH+FGDSE 
Sbjct: 422 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEA 481

Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVVS 276
             FSIHNLLQAG +YGLAAGSWVGPYAMCR+W+ L R  R +  +  G +S PMA+YVVS
Sbjct: 482 CAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVS 541

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           GDEDGERGGAPVVCID A++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ TF
Sbjct: 542 GDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 601

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ------------------------ 372
           TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ                        
Sbjct: 602 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMSATVIIWLFLQYPFYAWNPFCYG 661

Query: 373 ---------PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
                      ++I  D++EADTS+YH   +R + LD IDPSLAIGFYCRDKDDFDDFC+
Sbjct: 662 SYSGVFSTSQAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRDKDDFDDFCS 721

Query: 424 RASKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETG 458
           RA++L +++NGAPLFTV Q+    K+  N  DVLG +G
Sbjct: 722 RATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLGISG 759


>gi|357166768|ref|XP_003580841.1| PREDICTED: cysteine protease ATG4B-like [Brachypodium distachyon]
          Length = 493

 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 267/465 (57%), Positives = 335/465 (72%), Gaps = 28/465 (6%)

Query: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKA--VHNKSNG----------WTAAVKRL 85
           SK  KGS+LSS+F    ++FE   +SS+S   A    NKS G          W+ A++R 
Sbjct: 41  SKHCKGSILSSVF----TIFEAQQDSSSSVAAAAACENKSPGHSSGPSYGGAWSRALRRF 96

Query: 86  VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 145
           V  GSM R     LG ++     +  D+W LG C+K + +E+  D   ++G A F +DFS
Sbjct: 97  VGGGSMWRF----LGCAKV---LTNGDVWFLGKCYKFSSEESSSDLDTDSGHAAFLEDFS 149

Query: 146 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
           SRI ++YRKGFD I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP + E
Sbjct: 150 SRIWVTYRKGFDAISDSKFTSDVNWGCMVRSSQMLVAQALMFHHLGRSWRKPSQKPCNPE 209

Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGL 263
           Y+ ILHLFGDSE   FS+HNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R  R   E   
Sbjct: 210 YIRILHLFGDSEVCAFSVHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQPEVSN 269

Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
           G +S PMA+YVVSGDEDGERGGAPVVCID A++ C  F+K Q+ W+PILLLVPLVLGL+K
Sbjct: 270 GNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKDQSTWSPILLLVPLVLGLDK 329

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
           +NPRYIP L+ TFTFPQSLGI+GGKPG STYI G+Q++ A+YLDPHDVQ  +NI  D+L+
Sbjct: 330 INPRYIPLLKETFTFPQSLGILGGKPGTSTYIAGIQDDRALYLDPHDVQMAVNIASDNLD 389

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
           ADTS+YH   +R + LD +DPSLAIGFYCRDKDDFDDFC+RAS+L  ++NGAPLFTV Q+
Sbjct: 390 ADTSSYHCSTVRDMALDLLDPSLAIGFYCRDKDDFDDFCSRASELVVKANGAPLFTVVQS 449

Query: 444 HK--KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
            +  K + + D    + G    D++ +  + D  G A E++WQ+L
Sbjct: 450 IQPSKQMYNQDDGSGSSGDGMADNINMEDL-DGSGEAGEEEWQIL 493


>gi|221137006|ref|NP_001137489.1| autophagy-related 4b [Zea mays]
 gi|194701156|gb|ACF84662.1| unknown [Zea mays]
 gi|195657359|gb|ACG48147.1| cysteine protease ATG4B [Zea mays]
 gi|216963250|gb|ACJ73914.1| autophagy-related 4b variant 1 [Zea mays]
 gi|413920007|gb|AFW59939.1| autophagy 4b variant 1Cysteine protease ATG4B [Zea mays]
          Length = 492

 Score =  486 bits (1251), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 264/462 (57%), Positives = 333/462 (72%), Gaps = 27/462 (5%)

Query: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
           S+  K S+LS +F    ++FE   + S++   A   K    S  W+  ++R V +GSM R
Sbjct: 45  SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104

Query: 94  IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
           +    LG +R     ++ D+W LG C++++ ++E  G +  ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157

Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
           RKGFD I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP+D +Y+ +LHL
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKPYDPDYIRVLHL 217

Query: 213 FGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPM 270
           FGDSE   FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R  R  A+   G ++ PM
Sbjct: 218 FGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQADAVDGKENFPM 277

Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
           A+YVVSGDEDGERGGAPV CID A++ CS F+KGQ  W+PILLL+PLVLGL+K+NPRYIP
Sbjct: 278 ALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLIPLVLGLDKINPRYIP 337

Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
            L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ  ++I  D+LEADTS+YH
Sbjct: 338 LLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAVDIAPDNLEADTSSYH 397

Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKP 447
             V+R + L+ IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGAPLFTV Q+    K+ 
Sbjct: 398 CSVVRDLALEQIDPSLAIGFYCRDKDDFDDFCSRASELAEKANGAPLFTVMQSVQPSKQM 457

Query: 448 VNHSDVLGETGG---VPEDDSLGVMSMNDAVGNAHEDDWQLL 486
               D L    G     ED  L      DA G A E +WQ+L
Sbjct: 458 YKQDDGLCCCSGSSMANEDYDL------DASGEAGE-EWQIL 492


>gi|224994902|gb|ACN76570.1| cysteine proteinase [Triticum aestivum]
          Length = 484

 Score =  486 bits (1251), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 266/455 (58%), Positives = 329/455 (72%), Gaps = 23/455 (5%)

Query: 39  KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVH-NKSNGWTAAVKRLVTAGSMRRIHER 97
           K  K S+LSS+     ++FE   + S   +   H + S  W+  ++R V  GSM R    
Sbjct: 46  KQCKASILSSVL----TIFEPDQDQSG--RSGGHASGSYAWSRVLRRFVGGGSMWRF--- 96

Query: 98  VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
            LG    G + +  D+W LG C+K++ +E+  D+    G A F +DFSSR+ I+YRKGFD
Sbjct: 97  -LG---CGKALTAGDVWFLGKCYKLSSEESSSDSDSEGGHAAFLEDFSSRVWITYRKGFD 152

Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
            I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP Q P D E+  ILHLFGDSE
Sbjct: 153 VISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPAQNPSDPEHTRILHLFGDSE 212

Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVV 275
              FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R  R +  +    +S PM +YVV
Sbjct: 213 VCAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQPEVINRNESFPMVLYVV 272

Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           SGDEDGERGGAPVVCID A++ C  F+KGQ+ W+PILLLVPLVLGL+K+NPRYIP L+ T
Sbjct: 273 SGDEDGERGGAPVVCIDVAAQLCYDFNKGQSAWSPILLLVPLVLGLDKINPRYIPLLKET 332

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           FTFPQSLGI+GGKPGASTYI GVQ++ A+YLDPH+VQ  +NI  D+LEADTS+YH   +R
Sbjct: 333 FTFPQSLGILGGKPGASTYIAGVQDDRALYLDPHEVQLAVNIASDNLEADTSSYHCSTVR 392

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSD 452
            + LD IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGAPLFTV Q+    K+  N  D
Sbjct: 393 DMPLDLIDPSLAIGFYCRDKDDFDDFCSRASELAEQANGAPLFTVVQSVQPSKQMYNQDD 452

Query: 453 VLGETG-GVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
             G +G GV   D++    + D  G   ED+WQ+L
Sbjct: 453 GSGCSGYGV--SDNIDTEDL-DGSGETGEDEWQIL 484


>gi|221137004|ref|NP_001137488.1| autophagy-related 4 [Zea mays]
 gi|195620628|gb|ACG32144.1| cysteine protease ATG4B [Zea mays]
 gi|216963236|gb|ACJ73912.1| autophagy-related 4 variant 1 [Zea mays]
 gi|219886349|gb|ACL53549.1| unknown [Zea mays]
 gi|414584729|tpg|DAA35300.1| TPA: autophagy 4a variant 2 isoform 1 [Zea mays]
 gi|414584730|tpg|DAA35301.1| TPA: autophagy 4a variant 2 isoform 2 [Zea mays]
          Length = 492

 Score =  485 bits (1248), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 261/462 (56%), Positives = 337/462 (72%), Gaps = 25/462 (5%)

Query: 36  SESKSSKGSLLSSLFNSAFSVFETYSESSASEK-KAVHNKSN----GWTAAVKRLVTAGS 90
           S S+  K S+LS +F+  F++FE   + S+S    A   KS+    G +  ++R V +GS
Sbjct: 45  SGSRQPKASILSGVFSPPFAIFEGQQQGSSSPACDARSTKSSSGSYGLSRILRRFVGSGS 104

Query: 91  MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGL-AEFNQDFSSRIL 149
           M R+    LG  R     ++SD+W LG C+K++ +E     + ++   A F +DFSSRI 
Sbjct: 105 MWRL----LGCGRV---LTSSDVWFLGKCYKVSPEEEESGDSESDSGHAAFLEDFSSRIW 157

Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
           I+YRKGFD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +
Sbjct: 158 ITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGV 217

Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQS 267
           LHLFGDSE   FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L R  R  A+   G ++
Sbjct: 218 LHLFGDSEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKEN 277

Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 327
            PMA+YVVSGDEDGERGGAPVVCID A++ CS F+KG + W+PILLLVPLVLGL+K+NPR
Sbjct: 278 FPMALYVVSGDEDGERGGAPVVCIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPR 337

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           YIP L+ TF FPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ  ++I  D+LEADTS
Sbjct: 338 YIPLLKETFMFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMTVDIALDNLEADTS 397

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---H 444
           +YH  V+R + L+ IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGAPLFTV Q+    
Sbjct: 398 SYHCSVVRALALEQIDPSLAIGFYCRDKDDFDDFCSRASELAEKANGAPLFTVVQSIEPS 457

Query: 445 KKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
           K+     D LG +G    +D       +D  G+   ++WQ+L
Sbjct: 458 KQMYKQDDGLGCSGSSMAND-------DDLDGSGEAEEWQIL 492


>gi|224994904|gb|ACN76571.1| cysteine proteinase [Triticum aestivum]
          Length = 486

 Score =  481 bits (1238), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 264/459 (57%), Positives = 327/459 (71%), Gaps = 31/459 (6%)

Query: 39  KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVH-NKSNGWTAAVKRLVTAGSMRRIHER 97
           K  K S+LSS+     ++FE   + S   +   H + S  W+  ++R V  GSM R    
Sbjct: 48  KQCKASILSSVL----TIFEPDQDQSG--RSGGHASGSYAWSRVLRRFVGGGSMWRF--- 98

Query: 98  VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
            LG    G + + +D+  LG C+K++ +E+  D+    G A F +DFSSRI I+YRKGFD
Sbjct: 99  -LG---CGKALTAADVQFLGKCYKLSSEESSSDSDSEGGHAAFLEDFSSRIWITYRKGFD 154

Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
            I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP Q P + EY+ ILHLFGDSE
Sbjct: 155 AISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPAQNPSNPEYIRILHLFGDSE 214

Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GCQSLPMAIYVV 275
              FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L R  R +  +    +S PMA+YVV
Sbjct: 215 ACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQPEVINRNESFPMALYVV 274

Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           SGDEDGERGGAPVVCID A++ C  F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T
Sbjct: 275 SGDEDGERGGAPVVCIDVAAQLCYDFNKDQSAWSPILLLVPLVLGLDKINPRYIPLLKET 334

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           FTFPQSLGI+GGKPGASTYI GVQ++ A+YLDPH+VQ  +NI  D+LEADTS+YH   +R
Sbjct: 335 FTFPQSLGILGGKPGASTYIAGVQDDRALYLDPHEVQLAVNIASDNLEADTSSYHCSTVR 394

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSD 452
            + LD IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGAPLFTV Q+    K+  N  D
Sbjct: 395 DMPLDLIDPSLAIGFYCRDKDDFDDFCSRASELAEQANGAPLFTVVQSVQPSKQMYNRDD 454

Query: 453 -----VLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
                  G +G +  +D        D  G   ED+WQ+L
Sbjct: 455 GSGCSGYGVSGNIDAEDL-------DGSGETGEDEWQIL 486


>gi|40539015|gb|AAR87272.1| putative autophagy protein (with alternative splicing) [Oryza
           sativa Japonica Group]
 gi|108708572|gb|ABF96367.1| Peptidase family C54 containing protein, expressed [Oryza sativa
           Japonica Group]
          Length = 505

 Score =  479 bits (1232), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 263/484 (54%), Positives = 331/484 (68%), Gaps = 49/484 (10%)

Query: 39  KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
           K  K S+LS +F+S FS+FE + +SSA+     H+ S  W+  ++R+   GSM R     
Sbjct: 35  KQLKNSILSCVFSSPFSIFEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF---- 90

Query: 99  LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
           LG S+   + ++SD+W LG C+K++ +E    +   +G A F +DFSSRI I+YRKGFD 
Sbjct: 91  LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 147

Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
           I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+  EY+ ILH+FGDSE 
Sbjct: 148 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEA 207

Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA--ETGLGCQSLPMAIYVVS 276
             FSIHNLLQAGK+YGLAAGSWVGPYAMCR+W+ L    R   E   G  + PMA+YVVS
Sbjct: 208 CAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVCTNREHHEAVDGNGNFPMALYVVS 267

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           GDEDGERGGAPVVCID A++ C  F+K Q+ W+PILLLVPLVLGL+K+NPRYIP L+ T 
Sbjct: 268 GDEDGERGGAPVVCIDVAAQLCCDFNKNQSTWSPILLLVPLVLGLDKLNPRYIPLLKETL 327

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
           TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ  ++I  D+LEA TS+YH   +R 
Sbjct: 328 TFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLAVDIAADNLEAGTSSYHCSTVRD 387

Query: 397 IHLDSIDPSLAIGFYCRDK-------------------------------DDFDDFCARA 425
           + LD IDPSLAIGFYCRDK                               DDFDDFC+RA
Sbjct: 388 LALDLIDPSLAIGFYCRDKGELLLPDKMLGHHLSSLQSWFSYLLCLSAYVDDFDDFCSRA 447

Query: 426 SKLAEESNGAPLFTVTQT---HKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 482
           S+L +++NGAPLFTV Q+    K+  N     G+  G+   DS+ V  + D  G   E++
Sbjct: 448 SELVDKANGAPLFTVVQSVQPSKQMYNEESSSGD--GM---DSINVEGL-DGSGETGEEE 501

Query: 483 WQLL 486
           WQ+L
Sbjct: 502 WQIL 505


>gi|194696780|gb|ACF82474.1| unknown [Zea mays]
 gi|413920008|gb|AFW59940.1| autophagy 4b variant 3 [Zea mays]
          Length = 462

 Score =  454 bits (1169), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 252/457 (55%), Positives = 312/457 (68%), Gaps = 47/457 (10%)

Query: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHER 97
           S+  K S+LS +F    ++FE   + S++   A   K    + A  R+     +RR+   
Sbjct: 45  SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRI-----LRRVS-- 97

Query: 98  VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
                                     ++E  G +  ++G A F +DFSSRI I+YRKGFD
Sbjct: 98  -------------------------PEEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFD 132

Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
            I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP+D +Y+ +LHLFGDSE
Sbjct: 133 AIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKPYDPDYIRVLHLFGDSE 192

Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVV 275
              FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R  R  A+   G ++ PMA+YVV
Sbjct: 193 ACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVV 252

Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           SGDEDGERGGAPV CID A++ CS F+KGQ  W+PILLL+PLVLGL+K+NPRYIP L+ T
Sbjct: 253 SGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLIPLVLGLDKINPRYIPLLKET 312

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           F FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ  ++I  D+LEADTS+YH  V+R
Sbjct: 313 FKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAVDIAPDNLEADTSSYHCSVVR 372

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSD 452
            + L+ IDPSLAIGFYCRDKDDFDDFC+RAS+LAE++NGAPLFTV Q+    K+     D
Sbjct: 373 DLALEQIDPSLAIGFYCRDKDDFDDFCSRASELAEKANGAPLFTVMQSVQPSKQMYKQDD 432

Query: 453 VLGETGG---VPEDDSLGVMSMNDAVGNAHEDDWQLL 486
            L    G     ED  L      DA G A E +WQ+L
Sbjct: 433 GLCCCSGSSMANEDYDL------DASGEAGE-EWQIL 462


>gi|315259988|gb|ADT92194.1| autophagy-related 4b [Zea mays]
          Length = 595

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 228/387 (58%), Positives = 289/387 (74%), Gaps = 14/387 (3%)

Query: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
           S+  K S+LS +F    ++FE   + S++   A   K    S  W+  ++R V +GSM R
Sbjct: 45  SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104

Query: 94  IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
           +    LG +R     ++ D+W LG C++++ ++E  G +  ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157

Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
           RKGFD I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP+D +Y+ +LHL
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKPYDPDYIRVLHL 217

Query: 213 FGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPM 270
           FGDSE   FSIHNLLQAG+ YGLAAGSWVGPYAMCR+W+ L R  R  A+   G ++ PM
Sbjct: 218 FGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQADAVDGKENFPM 277

Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
           A+YVVSGDEDGERGGAPV CID A++ CS F+KGQ  W+PILLL+PLVLGL+K+NPRYIP
Sbjct: 278 ALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCTWSPILLLIPLVLGLDKINPRYIP 337

Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
            L+ TF FPQSLGI+GGKPG STYI GVQE+ A+YLDPHDVQ  ++I  D+LEADTS+YH
Sbjct: 338 LLKETFKFPQSLGILGGKPGTSTYIAGVQEDRALYLDPHDVQMAVDIAPDNLEADTSSYH 397

Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDD 417
             V+R + L+ IDPSLAIGFYCRDK D
Sbjct: 398 CSVVRDLALEQIDPSLAIGFYCRDKGD 424


>gi|168010849|ref|XP_001758116.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690572|gb|EDQ76938.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 356

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 211/357 (59%), Positives = 281/357 (78%), Gaps = 4/357 (1%)

Query: 89  GSMRRIHERVLGPSRTGISSST-SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR 147
           GSMRR+ E +LGP  T  ++S+ S+IW+LG+C+K++ D    +        EF  DF+SR
Sbjct: 1   GSMRRLQELLLGPRFTAANASSGSEIWVLGLCYKVSADPN-NETLSVQAFEEFISDFTSR 59

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
           I I+YRKGF+ +G SK+TSDVGWGCMLRS QML+AQAL+ H LGR WR+   +P  + Y+
Sbjct: 60  IWITYRKGFECVGQSKLTSDVGWGCMLRSGQMLLAQALVCHYLGRSWRREPGQPCSQAYL 119

Query: 208 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL--GC 265
           +IL  FGDSE+ PFSIHNLL+AG  +GLAAGSW+GPYA+CR+ EALAR  R ++    G 
Sbjct: 120 QILQTFGDSESCPFSIHNLLEAGHPFGLAAGSWLGPYALCRTLEALARADREQSQKKGGK 179

Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 325
           ++LP A+YVVSG+ +GERGGAPV+C++D +  CS + +   +WTP+L+LVPLVLGL+KVN
Sbjct: 180 RALPFAVYVVSGEAEGERGGAPVLCVEDVATLCSKWREPTEEWTPLLVLVPLVLGLDKVN 239

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
           PRY+P+LR TFTFPQSLGI GGKPGASTY++GVQ+E A+YLDPH+ Q V+ +  ++LE D
Sbjct: 240 PRYLPSLRATFTFPQSLGIAGGKPGASTYLIGVQDEQAMYLDPHENQQVVPVTPENLELD 299

Query: 386 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
           TS+YH   +R + LD+IDPSLAIGFYCRD+ +FDD CAR+S+LA++SNGAP+FTV +
Sbjct: 300 TSSYHCSTVRRLPLDTIDPSLAIGFYCRDRAEFDDLCARSSELAKQSNGAPMFTVAE 356


>gi|216963242|gb|ACJ73913.1| autophagy-related 4a variant 2 [Zea mays]
          Length = 429

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 226/388 (58%), Positives = 289/388 (74%), Gaps = 15/388 (3%)

Query: 36  SESKSSKGSLLSSLFNSAFSVFETYSESSAS-----EKKAVHNKSNGWTAAVKRLVTAGS 90
           S S+  K S+LS +F+  F++FE   + S+S           + S G +  ++R V +GS
Sbjct: 45  SGSRQPKASILSGVFSPPFAIFEGQQQGSSSPACDARSTKSSSGSYGLSRILRRFVGSGS 104

Query: 91  MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGL-AEFNQDFSSRIL 149
           M R+    LG  R     ++SD+W LG C+K++ +E     + ++   A F +DFSSRI 
Sbjct: 105 MWRL----LGCGRV---LTSSDVWFLGKCYKVSPEEEESGDSESDSGHAAFLEDFSSRIW 157

Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
           I+YRKGFD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +
Sbjct: 158 ITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGV 217

Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQS 267
           LHLFGDSE   FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L R  R  A+   G ++
Sbjct: 218 LHLFGDSEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKEN 277

Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 327
            PMA+YVVSGDEDGERGGAPVVCID A++ CS F+KG + W+PILLLVPLVLGL+K+NPR
Sbjct: 278 FPMALYVVSGDEDGERGGAPVVCIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPR 337

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           YIP L+ TF FPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ  ++I  D+LEADTS
Sbjct: 338 YIPLLKETFMFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMTVDIALDNLEADTS 397

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDK 415
           +YH  V+R + L+ IDPSLAIGFYCRDK
Sbjct: 398 SYHCSVVRALALEQIDPSLAIGFYCRDK 425


>gi|302783857|ref|XP_002973701.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
 gi|300158739|gb|EFJ25361.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
          Length = 358

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 206/376 (54%), Positives = 271/376 (72%), Gaps = 29/376 (7%)

Query: 78  WTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDA 131
           WTAAV+R V  G +RRI E ++G       SS S IWLLG C+++      + DE   ++
Sbjct: 1   WTAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKES 58

Query: 132 AGNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR 189
             ++   +A+F  DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HR
Sbjct: 59  TSSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHR 118

Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
           LGR WR+  ++P+ REY+EILH F DS +   PFSIHN ++AG  YGLAAGSW+GPYA+C
Sbjct: 119 LGRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALC 177

Query: 248 RSWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 306
            + EALAR      G G Q    +A+YVVSGD  GERGGAPV+   D +  C        
Sbjct: 178 HAIEALAR----NDGRGRQGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC-------- 225

Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
              P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YL
Sbjct: 226 ---PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYL 282

Query: 367 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
           DPH+VQ V+++  + LE D+++YH  V+R + LD+IDPSLA+GFYCR++++ DD CARAS
Sbjct: 283 DPHEVQKVVSVSGESLEFDSASYHCSVVRKMPLDAIDPSLALGFYCRNREELDDLCARAS 342

Query: 427 KLAEESNGAPLFTVTQ 442
           +LA +SNGAP+FTV +
Sbjct: 343 ELASQSNGAPMFTVAE 358


>gi|302787965|ref|XP_002975752.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
 gi|300156753|gb|EFJ23381.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
          Length = 358

 Score =  404 bits (1037), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 206/376 (54%), Positives = 271/376 (72%), Gaps = 29/376 (7%)

Query: 78  WTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI------AQDEALGDA 131
           WTAAV+R V  G +RRI E ++G       SS S IWLLG C+++      + DE   ++
Sbjct: 1   WTAAVRRAV--GPVRRIQECLMGMRGGNGISSGSAIWLLGACYRMGASSTSSTDEEAKES 58

Query: 132 AGNN--GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR 189
             ++   +A+F  DFSSRI I+YR+GF+ IG+SK TSDVGWGCM+RS QML AQAL+ HR
Sbjct: 59  TSSSPEAVADFLLDFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHR 118

Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
           LGR WR+  ++P+ REY+EILH F DS +   PFSIHN ++AG  YGLAAGSW+GPYA+C
Sbjct: 119 LGRGWRRG-EQPYAREYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALC 177

Query: 248 RSWEALARCQRAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 306
            + EALAR      G G +    +A+YVVSGD  GERGGAPV+   D +  C        
Sbjct: 178 HAIEALAR----NDGRGREGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC-------- 225

Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
              P+L+LVPLVLGL+K+NPRY+P+LR TF FPQS+GI GGKP AS Y VGVQ++ A+YL
Sbjct: 226 ---PVLILVPLVLGLDKINPRYLPSLRATFAFPQSVGIAGGKPAASVYFVGVQDDQALYL 282

Query: 367 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
           DPH+VQ V+++  + LE D+++YH  V+R + LD+IDPSLA+GFYCR+++D DD CARAS
Sbjct: 283 DPHEVQKVVSVSGESLEFDSASYHCSVVRKMLLDAIDPSLALGFYCRNREDLDDLCARAS 342

Query: 427 KLAEESNGAPLFTVTQ 442
           +LA +SNGAP+FTV +
Sbjct: 343 ELASQSNGAPMFTVAE 358


>gi|168036750|ref|XP_001770869.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677928|gb|EDQ64393.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 346

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 197/339 (58%), Positives = 267/339 (78%), Gaps = 5/339 (1%)

Query: 107 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           SSS  +IW+LG+C+K++ D A  +A   +   EF  DFSSRI I+YRKGF+ +G+SK+TS
Sbjct: 4   SSSGGEIWVLGICYKVSAD-ANDEAVSAHAFEEFLNDFSSRIWITYRKGFESLGESKLTS 62

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           DVGWGCMLRS Q+L+AQAL+ H LGR WR+   +   +EY++IL  FGDSE+  FSIHNL
Sbjct: 63  DVGWGCMLRSGQILLAQALVCHYLGRTWRRNACQECLQEYLQILQSFGDSESCSFSIHNL 122

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARC---QRAETGLGCQSLPMAIYVVSGDEDGER 283
           L+AG+ +GLAAGSW+GPYA+CR+ EALA+    Q A+ G G ++LP A+YVVSG+ +G+R
Sbjct: 123 LEAGRPFGLAAGSWLGPYALCRTLEALAKADEDQNAKKG-GKRALPFAVYVVSGETEGDR 181

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
           GGAPV C++DA+  CS + +   +W+P+++LVPLVLGL+K+NPRY+P+LR TFT PQSLG
Sbjct: 182 GGAPVRCVEDAAVLCSKWGEATEEWSPLVVLVPLVLGLDKLNPRYLPSLRATFTLPQSLG 241

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
           + GGKPGAST+++GVQ + A+YLDPH+ Q V  +  ++LE DTS YH  V+R + LDSID
Sbjct: 242 VAGGKPGASTHLIGVQGDQAMYLDPHENQQVFAVTPENLELDTSFYHCSVVRRLPLDSID 301

Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
           PSLAIGFYCRD+ +FDD CAR+S+L ++ NGAP+FTV +
Sbjct: 302 PSLAIGFYCRDRAEFDDLCARSSELVKQYNGAPIFTVAE 340


>gi|79597805|ref|NP_850722.3| cysteine protease ATG4b [Arabidopsis thaliana]
 gi|332646467|gb|AEE79988.1| cysteine protease ATG4b [Arabidopsis thaliana]
          Length = 360

 Score =  318 bits (814), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 173/306 (56%), Positives = 226/306 (73%), Gaps = 2/306 (0%)

Query: 25  SLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKR 84
           S  S+ S+  SS++KS+  +L S +  S+  V +   E+S S    V    + WT  +K 
Sbjct: 26  SPTSLVSDSASSDNKSNL-TLCSDVVASSSPVSQLCREASTSGHNPVCTTHSSWTVILKT 84

Query: 85  L-VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQD 143
             + +G++RR  +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+  +A     LA F QD
Sbjct: 85  ASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQD 144

Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
           FSS IL++YR+GF+PIGD+  TSDV WGCMLRS QML AQALLF RLGR WRK   +P D
Sbjct: 145 FSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPAD 204

Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
            +Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR  + ET  
Sbjct: 205 EKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDD 264

Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
             +S  MA+++VSG EDGERGGAP++CI+D ++ C  FS+G+ +W PILLLVPLVLGL++
Sbjct: 265 KHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDR 324

Query: 324 VNPRYI 329
           VNP + 
Sbjct: 325 VNPSHF 330


>gi|186511209|ref|NP_001118859.1| cysteine protease ATG4b [Arabidopsis thaliana]
 gi|62318602|dbj|BAD95023.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646469|gb|AEE79990.1| cysteine protease ATG4b [Arabidopsis thaliana]
          Length = 267

 Score =  314 bits (805), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 156/244 (63%), Positives = 198/244 (81%)

Query: 86  VTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 145
           + +G++RR  +RVLGPSRTGISSSTS+IWLLGVC+KI++ E+  +A     LA F QDFS
Sbjct: 1   MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 60

Query: 146 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
           S IL++YR+GF+PIGD+  TSDV WGCMLRS QML AQALLF RLGR WRK   +P D +
Sbjct: 61  SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 120

Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
           Y+EIL LFGD+E S FSIHNL+ AG++YGLAAGSWVGPYA+CRSWE+LAR  + ET    
Sbjct: 121 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 180

Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 325
           +S  MA+++VSG EDGERGGAP++CI+D ++ C  FS+G+ +W PILLLVPLVLGL++VN
Sbjct: 181 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 240

Query: 326 PRYI 329
           PR++
Sbjct: 241 PRFV 244


>gi|457866467|dbj|BAM93578.1| autophagy related protein 4 [Vigna unguiculata]
          Length = 219

 Score =  288 bits (736), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 158/220 (71%), Positives = 178/220 (80%), Gaps = 4/220 (1%)

Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 329
           MAIYVVSGDEDGERGGAPVVCI+DA +HCS FS+GQA WTP+LLLVPLVLGL+KVNPRYI
Sbjct: 1   MAIYVVSGDEDGERGGAPVVCIEDAFKHCSEFSRGQAAWTPLLLLVPLVLGLDKVNPRYI 60

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TST 388
           P L  TF FPQSLGI+GGKPGASTYI+GVQ E A YLDPHDVQ V+NI  D  E + TS+
Sbjct: 61  PLLHSTFKFPQSLGIMGGKPGASTYIIGVQSEKAFYLDPHDVQTVVNISGDTQEPNSTSS 120

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH--KK 446
           YH +V+RHI LDSIDPSLAIGFYCRDKDDFDDFC++ASKLAEESNGAPLFTV Q+    K
Sbjct: 121 YHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKLAEESNGAPLFTVAQSRSFSK 180

Query: 447 PVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
            V+ +DV G+  G  ED  LG    +D     +EDDWQLL
Sbjct: 181 QVSGNDVSGDNTGFEEDAFLGT-DHDDNDAGTNEDDWQLL 219


>gi|413917967|gb|AFW57899.1| hypothetical protein ZEAMMB73_419246 [Zea mays]
          Length = 290

 Score =  278 bits (712), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 143/209 (68%), Positives = 172/209 (82%), Gaps = 2/209 (0%)

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           S  G GCM+RSSQMLVAQAL+FH LGR WRKP +KP++ +Y+ +L LFGDSE   FSIHN
Sbjct: 14  SLTGKGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLRLFGDSEACAFSIHN 73

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYVVSGDEDGER 283
           LLQA + YGLAAGSW+GPYAMCR+W+ L R  R  A+   G ++ PMA+YVVSGDEDGER
Sbjct: 74  LLQARRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALYVVSGDEDGER 133

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
           GGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+ TF FPQSLG
Sbjct: 134 GGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLKETFMFPQSLG 193

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
           I+GGKPG STYI GVQ++ A+YLDPH+VQ
Sbjct: 194 ILGGKPGTSTYIAGVQDDRALYLDPHEVQ 222


>gi|414869447|tpg|DAA48004.1| TPA: hypothetical protein ZEAMMB73_510335 [Zea mays]
 gi|414869466|tpg|DAA48023.1| TPA: hypothetical protein ZEAMMB73_786179 [Zea mays]
          Length = 472

 Score =  277 bits (708), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 142/205 (69%), Positives = 168/205 (81%), Gaps = 2/205 (0%)

Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 215
           FD I DSK+TSDV WGCM+RSSQMLVAQAL+FH LGR  RKP +KP++ +Y+ +LHLFGD
Sbjct: 34  FDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSCRKPPEKPYNPDYIGVLHLFGD 93

Query: 216 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIY 273
           SE   FSIHNLLQAG+ YGLAAGSW+GPYAMCR+W+ L    R  A+   G ++ PMA+Y
Sbjct: 94  SEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIHTNREQADAVDGKENFPMALY 153

Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
           VVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVPLVLGL+K+NPRYIP L+
Sbjct: 154 VVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVPLVLGLDKINPRYIPLLK 213

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGV 358
            TF FPQSL I+GGKPG STYI GV
Sbjct: 214 ETFMFPQSLCILGGKPGTSTYIAGV 238


>gi|353441084|gb|AEQ94126.1| putative cysteine protease [Elaeis guineensis]
          Length = 169

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 113/165 (68%), Positives = 130/165 (78%)

Query: 96  ERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKG 155
           + +LG S T   SSTSDIWLLG C+K++ +E+ G     NG A F +DFSSRI I+YRKG
Sbjct: 2   QELLGTSSTDALSSTSDIWLLGKCYKLSPEESSGGTDHGNGSAAFLEDFSSRIWITYRKG 61

Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGD 215
           FD IGDSK TSDV WGCM+RSSQMLVAQALLFH LGR WRKP QKP D +Y+EILHLFGD
Sbjct: 62  FDAIGDSKFTSDVRWGCMIRSSQMLVAQALLFHHLGRSWRKPSQKPHDSKYIEILHLFGD 121

Query: 216 SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
           SE   FSIHNLL+AGKAYGLAA  WVGPYAMCR+WE + R +R +
Sbjct: 122 SEACAFSIHNLLEAGKAYGLAAREWVGPYAMCRTWETITRAKREQ 166


>gi|384253649|gb|EIE27123.1| peptidase C54 [Coccomyxa subellipsoidea C-169]
          Length = 362

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 139/364 (38%), Positives = 194/364 (53%), Gaps = 44/364 (12%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
           D  SRI ++YR+GF PI  S ITSDVGWGC LRS QML+AQAL++H +GR WR+ L+  +
Sbjct: 23  DLMSRIWMTYRRGFPPICGSGITSDVGWGCTLRSGQMLLAQALVYHLVGRQWRRKLEAAY 82

Query: 203 DREYVEILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
             E  ++L  FGD   E  PFSIHN+   G+ +G+ AG W+GP  +C +   +   +   
Sbjct: 83  PEEVAQVLQWFGDQACEQRPFSIHNMCTTGQTHGVKAGDWLGPSGLCHTLADMVN-KVQP 141

Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG------QADWTPILLL 314
            GL C+     +    G      GGAPV+C    SR  + F  G      +   +     
Sbjct: 142 GGLQCR-----VVATFG------GGAPVLC---TSRLATAFEGGADRSGGEVGSSGSEES 187

Query: 315 VPLVLGLE-----------KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
            P   GL            K+NPRY   L+   T+PQS+GIVGG+P +S Y +G+Q++  
Sbjct: 188 GPAGQGLLLLIPLMLGLNGKINPRYCAQLQQLLTWPQSVGIVGGRPSSSLYFIGLQDQHV 247

Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
           +YLDPH+VQ V +       AD  TY    +R + L +IDPSLAIGFYC    DF+D C 
Sbjct: 248 LYLDPHEVQEVASEA-----ADLDTYFCSSLRLMPLANIDPSLAIGFYCSSLSDFEDLCG 302

Query: 424 RASKLAEESNGAPLFT-VTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 482
           R   L  E+  APL   V +   +P   ++ +    G+P D      S     G A+ D+
Sbjct: 303 RLRTLEAEAGCAPLVCMVDEDAGEPSWPAEEVLSDEGIPSDAD----SPAPPAGGANRDN 358

Query: 483 WQLL 486
           W++L
Sbjct: 359 WEML 362


>gi|413941968|gb|AFW74617.1| hypothetical protein ZEAMMB73_836919 [Zea mays]
          Length = 416

 Score =  222 bits (565), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 134/263 (50%), Positives = 166/263 (63%), Gaps = 56/263 (21%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
           L  F +DFSSRI I+YRKGFD I D K+TSDV WGCM+RSSQMLVAQAL+FH LGR WRK
Sbjct: 29  LQVFLEDFSSRIWITYRKGFDAISDFKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRK 88

Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
           P +K                         L++  +                         
Sbjct: 89  PPEK------------------------TLIRTNR------------------------- 99

Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
           ++A+   G ++ PM +YVVSGDEDGERGGAPVV ID A++ CS F+KG + W+PILLLVP
Sbjct: 100 EQADAVDGKENFPMELYVVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPSTWSPILLLVP 159

Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI- 375
           LVLGL+K+NPRYIP L+ TF FPQSLGI+G KPG STYI GVQ++ A+YLDPH+VQ V+ 
Sbjct: 160 LVLGLDKINPRYIPLLKETFMFPQSLGILGVKPGTSTYIAGVQDDRALYLDPHEVQMVLA 219

Query: 376 NIGKDDLEADTSTYHSDVIRHIH 398
           NI   +      T  +D I +IH
Sbjct: 220 NIKWPE------TLETDFIYNIH 236


>gi|145345840|ref|XP_001417407.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577634|gb|ABO95700.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 348

 Score =  221 bits (562), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 125/329 (37%), Positives = 182/329 (55%), Gaps = 19/329 (5%)

Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 174
           +LGV +    DE   +   ++    + +D+ SR  ++YR+GF+ +G +K  +D GWGC L
Sbjct: 1   MLGVTYWSKDDECNAEKY-DDARRAWERDWGSRCWMTYRRGFEALGRTKWRTDAGWGCTL 59

Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAY 233
           RS+QM+VA AL  H  GR WR+ ++   D E V+ +L +F D  ++PFSIH++ +   A+
Sbjct: 60  RSAQMMVANALSIHTRGRHWRRQVKAKEDDESVDHVLSMFIDDASAPFSIHSVCETTTAW 119

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG-DEDGERGGAPVVCID 292
           G   G W  P  MCR++ AL          G     +A++VV G +ED   GG P   ID
Sbjct: 120 GAPPGRWFEPSVMCRAFSALIEAN------GDLRNQIAVHVVGGQNEDDSAGGVPT--ID 171

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
           D          G+A    +LL VPLVLG+   +N RYI  LR    F QS+G++GG+P A
Sbjct: 172 DGELRAKSADVGKA----LLLFVPLVLGVGRNINTRYISQLRSIIAFKQSIGVIGGRPNA 227

Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
           S Y+VG  ++   YLDPH VQP  +  +     D  +Y+      +  + +DP+LA+GFY
Sbjct: 228 SLYLVGHSDDVFFYLDPHTVQPANSFAE---AVDFDSYYCSTPLQMRGELLDPTLALGFY 284

Query: 412 CRDKDDFDDFCARASKLAEESNGAPLFTV 440
           CRD DD D   A    LAE +  AP+  V
Sbjct: 285 CRDGDDLDSLFASVKALAEANATAPVLDV 313


>gi|281340990|gb|EFB16574.1| hypothetical protein PANDA_012287 [Ailuropoda melanoleuca]
          Length = 369

 Score =  218 bits (555), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 128/370 (34%), Positives = 187/370 (50%), Gaps = 40/370 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F PIG +  TS
Sbjct: 19  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 65

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 66  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 125

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 282
            Q G   G + G W GP  + +  + LA               +A+++   +    ED  
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 177

Query: 283 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
           R   G  P         D+SRHC+ F  G       A W P++LL+PL LGL  +N  Y+
Sbjct: 178 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 237

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +      AD S +
Sbjct: 238 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 297

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDFDD+C +  +L+      P+F + +     + 
Sbjct: 298 CRHPPSRMSIGELDPSIAVGFFCKTEDDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLA 357

Query: 450 HSDVLGETGG 459
             DVL  + G
Sbjct: 358 CPDVLNVSLG 367


>gi|432853687|ref|XP_004067831.1| PREDICTED: cysteine protease ATG4B-like [Oryzias latipes]
          Length = 390

 Score =  218 bits (554), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 120/333 (36%), Positives = 173/333 (51%), Gaps = 12/333 (3%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
           D +SR+  +YRK F PIG +  TSD GWGCMLR  QM++A+AL+   LGR WR    +  
Sbjct: 45  DVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILAEALMCRHLGRDWRWARGRRQ 104

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
             EYV IL+ F D + S +SIH + Q G   G   G W GP          A+  +W  L
Sbjct: 105 REEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRL 164

Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 313
           A     +  +  + +           D E  G    C++ A   C++  +  A W P++L
Sbjct: 165 AVHVAMDNTVIIEEIKRLCMPWLDIGDREEAGELNGCLEGA---CALVEEETALWKPLVL 221

Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
           L+PL LGL  +N  YI TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP
Sbjct: 222 LIPLRLGLSDINEAYIDTLKQCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQP 281

Query: 374 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 433
            +   +D    D + +       +H+  +DPS+A GF+CR +D+FDD+C R   L+ +  
Sbjct: 282 AVEPSEDGQVPDETYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDWCMRIRGLSCKRG 341

Query: 434 GAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
           G P+F +  +    +   D L  T    + D L
Sbjct: 342 GLPMFELVDSQPTHMVSVDALNLTPDFSDSDRL 374


>gi|301775535|ref|XP_002923195.1| PREDICTED: cysteine protease ATG4B-like [Ailuropoda melanoleuca]
          Length = 405

 Score =  217 bits (553), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 127/365 (34%), Positives = 185/365 (50%), Gaps = 40/365 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F PIG +  TS
Sbjct: 34  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPPIGGTGPTS 80

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 81  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 140

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 282
            Q G   G + G W GP  + +  + LA               +A+++   +    ED  
Sbjct: 141 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSA--------LAVHIAMDNTVVMEDIR 192

Query: 283 R---GGAPVV----CIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
           R   G  P         D+SRHC+ F  G       A W P++LL+PL LGL  +N  Y+
Sbjct: 193 RLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPWRPLVLLIPLRLGLTDINEAYV 252

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +      AD S +
Sbjct: 253 ETLKRCFMMPQSLGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAVELTDSCFIADESFH 312

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDFDD+C +  +L+      P+F + +     + 
Sbjct: 313 CRHPPSRMSIGELDPSIAVGFFCKTEDDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLA 372

Query: 450 HSDVL 454
             DVL
Sbjct: 373 CPDVL 377


>gi|156396522|ref|XP_001637442.1| predicted protein [Nematostella vectensis]
 gi|156224554|gb|EDO45379.1| predicted protein [Nematostella vectensis]
          Length = 342

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 122/343 (35%), Positives = 178/343 (51%), Gaps = 32/343 (9%)

Query: 101 PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNN----GLAEFNQDFSSRILISYRKGF 156
           P +T  +   S IWLLG C+     E   + +        L EF++ F+S I ++YR+ F
Sbjct: 12  PLKTNFNED-SPIWLLGRCYHAKNYEYTSEQSKQQCQILSLEEFHRHFTSLIWLTYRRSF 70

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---KPLQKPFDREYVEILHLF 213
             +  S +TSD GWGCMLRS QM++A  L+FH L + WR   +   +  +  Y  IL  F
Sbjct: 71  VQLNGSNLTSDCGWGCMLRSGQMMLASGLIFHFLKKDWRISGRCHSREQEHYYRVILQFF 130

Query: 214 GDS---ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
           GD    E SPFS+H L+  G+  G  AG W GP ++    E              +++  
Sbjct: 131 GDQDDEERSPFSLHRLVTLGQHTGKQAGDWYGPASVAHILE--------------KAMIS 176

Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVN 325
           A + +  D +        V ID+  R C+     Q D     W P+++LVP+ LG E +N
Sbjct: 177 ATHPLLHDINIYVAQDCTVYIDEVKRVCTHCRTHQRDCSSGKWRPVIILVPMRLGGEALN 236

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
           P YIP ++  FT  Q +GI+GG+P  S Y VG Q+E  I+LDPH  QPV++  ++     
Sbjct: 237 PIYIPCVKSLFTLDQCIGIIGGRPKHSLYFVGFQDEKMIHLDPHYCQPVVDTTQEKFP-- 294

Query: 386 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
           T ++H    R      +DPS  IGFYC   +DF+ FC  AS++
Sbjct: 295 TESFHCPNPRKTSFKKMDPSCTIGFYCSSHEDFESFCQHASEV 337


>gi|355669955|gb|AER94692.1| ATG4 autophagy related 4-like protein B [Mustela putorius furo]
          Length = 390

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 125/359 (34%), Positives = 179/359 (49%), Gaps = 26/359 (7%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 19  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 65

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 66  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQSDSYFNVLNAFIDRKDSYYSIHQI 125

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
            Q G   G + G W GP  + +  + LA      + L          V+       RG  
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 184

Query: 287 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           P         D+SRHC+ F  G       A W P++LL+PL LGL  +N  Y+ TL+  F
Sbjct: 185 PCAGATALPTDSSRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 244

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +       
Sbjct: 245 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCRHPPSR 304

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
           + +  +DPS+A+GF+C+ +DDFDD+C R  +L+      P+F + +     +   DVL 
Sbjct: 305 MGISELDPSIAVGFFCKTEDDFDDWCQRVRQLSLLGGALPMFELVEQQPSHLACPDVLN 363


>gi|410969807|ref|XP_003991383.1| PREDICTED: cysteine protease ATG4B [Felis catus]
          Length = 445

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 124/358 (34%), Positives = 177/358 (49%), Gaps = 26/358 (7%)

Query: 109 STSDIWLLGVCHKIA--QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I+  +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 74  TSEPVWILGRKYSISTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 120

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 121 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 180

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
            Q G   G + G W GP  + +  + LA      + L          V+       R G 
Sbjct: 181 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMEDIRRLCRAGL 239

Query: 287 P----VVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           P         D  RHC+ F  G       A W P++LL+PL LGL  +N  Y+ TL+  F
Sbjct: 240 PCAGAAALPADPGRHCNGFPAGAEVSNRLAPWRPLVLLIPLRLGLTDINEAYVETLKHCF 299

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 300 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFADSCFIPDESFHCQHPPSR 359

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
           + +  +DPS+A+GF+C+ ++DFDD+C R  KL+      P+F + +     +   DVL
Sbjct: 360 MGVRELDPSIAVGFFCQTEEDFDDWCQRVRKLSLLGGALPMFELVEQQPSHLACPDVL 417


>gi|27883848|ref|NP_777363.1| cysteine protease ATG4B [Mus musculus]
 gi|26324650|dbj|BAC26079.1| unnamed protein product [Mus musculus]
 gi|26327423|dbj|BAC27455.1| unnamed protein product [Mus musculus]
 gi|26344632|dbj|BAC35965.1| unnamed protein product [Mus musculus]
 gi|27763983|emb|CAD43220.1| autophagin-1 [Mus musculus]
          Length = 393

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 185/366 (50%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
               A + C+       D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRANLPCVGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ ++DF+D+C +  KL++     P+F + +     + 
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360

Query: 450 HSDVLG 455
             DVL 
Sbjct: 361 CQDVLN 366


>gi|443684303|gb|ELT88258.1| hypothetical protein CAPTEDRAFT_225251 [Capitella teleta]
          Length = 410

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 121/356 (33%), Positives = 184/356 (51%), Gaps = 34/356 (9%)

Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
           + S +W+LG  + +  D           LAE  +D  SR+ ++YRKGFDPIG S  TSD 
Sbjct: 30  TESPVWILGKQYSVLYD-----------LAELKKDVKSRLWLTYRKGFDPIGGSGPTSDQ 78

Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
           GWGCMLR  QM++AQ+L+   LGR WR    K +D +Y EIL +F D  ++ +S+  +  
Sbjct: 79  GWGCMLRCGQMMLAQSLICRHLGRDWRWTKDK-YDPKYFEILRMFQDKRSAKYSLQVIAS 137

Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD------EDGE 282
            G + G A G W GP  + +    L  C   E       + +   V+  D         +
Sbjct: 138 MGTSEGKAIGEWFGPNTISQVLRKL--CVSDEWSNLVVHVALDNTVIIDDVFCLCKSSKK 195

Query: 283 RGGAPVVCIDDASRHCSVFS-----------KGQAD-WTPILLLVPLVLGLEKVNPRYIP 330
               P+  +  A     +F+            G+ D W P+LL+VPL LGL ++NP YIP
Sbjct: 196 ESNEPIPGVHAACASALLFNGHDPTAEGHDPSGEDDSWRPLLLIVPLRLGLSEINPVYIP 255

Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
            L+   TF QS+GI+GGKP  + + +G  E+  +Y+DPH  QP +++ +   E+D S YH
Sbjct: 256 FLKTCLTFKQSVGIIGGKPNHAHWFIGFLEDELVYMDPHTTQPFVDVTQPG-ESDAS-YH 313

Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
                 + +  +DPS+A+GF+C+ + DF+D C    K        P+F + Q   +
Sbjct: 314 CSYSCRMPVSYLDPSVAVGFFCQTEADFEDLCQCIRKYILHGQKTPMFELHQRRPR 369


>gi|73994337|ref|XP_851977.1| PREDICTED: cysteine protease ATG4B [Canis lupus familiaris]
          Length = 394

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 123/358 (34%), Positives = 177/358 (49%), Gaps = 26/358 (7%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 23  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 70  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 129

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
            Q G   G + G W GP  + +  + LA      + L          V+       RG  
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSALAVHIAMDNTVVMEDIRRLCRGSL 188

Query: 287 PV----VCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           P         D+SRHC+ F  G       A W P++LL+PL LGL  +N  Y+ TL+  F
Sbjct: 189 PCAGAAALPADSSRHCNGFPAGAEVTNRLAPWRPLVLLIPLRLGLTDINEAYVETLKRCF 248

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 249 MMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFIPDESFHCQHPPSR 308

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
           + +  +DPS+A+GF+C+ + DFDD+C +  +L+      P+F + +     +   DVL
Sbjct: 309 MSIGELDPSIAVGFFCKTEGDFDDWCQQVRQLSLLGGALPMFELVEQQPSHLACPDVL 366


>gi|440798079|gb|ELR19150.1| cysteine protease, putative [Acanthamoeba castellanii str. Neff]
          Length = 434

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 127/333 (38%), Positives = 174/333 (52%), Gaps = 33/333 (9%)

Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
           A F   F S +  +YR  F  +G    TSD+GWGCMLR+ QM++AQ L  H LG  WR+ 
Sbjct: 108 ASFLTHFRSVVWCTYRAAFPRLGSDSYTSDMGWGCMLRTGQMVLAQTLTRHLLGTEWRRQ 167

Query: 198 LQK--PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
             +  P    Y +++  F D    PFS+H +  AG  YG   G W GP  M +  E L +
Sbjct: 168 SDRSSPL---YAKMVQWFADDPKQPFSLHRIAHAGLKYGKNVGEWFGPSTMAQVLEELLK 224

Query: 256 CQRAETGLG---CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-WTPI 311
            + + +GL    CQ     +Y+            P+   DD         +GQ   W P+
Sbjct: 225 -EFSPSGLRAYVCQD--GCLYLDQLRRTATAAHWPLDEDDD---------EGQGKSWAPM 272

Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
           L+++PL LGL+++N  Y P L+ TF  PQS+GI GGKP AS Y VG Q++   YLDPH V
Sbjct: 273 LIMLPLRLGLDQLNEDYAPVLKETFRIPQSVGISGGKPRASLYFVGNQDDYVFYLDPHTV 332

Query: 372 QPV---INIGKDDLEADT-STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           QP      +G      D   T+H      + +  IDPSL + FYCR+++DFDDFCARA +
Sbjct: 333 QPAPRFPEVGDVPASEDVYDTFHCSAPLRLPIRDIDPSLCLAFYCRNREDFDDFCARAIQ 392

Query: 428 LAEESNGAPLFTVTQ------THKKPVNHSDVL 454
           L+E     P+FTV +         KP  HS+ L
Sbjct: 393 LSE--GPMPIFTVAERMPDYLVRPKPPKHSEKL 423


>gi|20071131|gb|AAH27184.1| Autophagy-related 4B (yeast) [Mus musculus]
 gi|26353914|dbj|BAC40587.1| unnamed protein product [Mus musculus]
 gi|74188242|dbj|BAE25791.1| unnamed protein product [Mus musculus]
          Length = 393

 Score =  208 bits (529), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
               A + C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ ++DF+D+C +  KL++     P+F + +     + 
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360

Query: 450 HSDVLG 455
             DVL 
Sbjct: 361 CQDVLN 366


>gi|148707985|gb|EDL39932.1| autophagy-related 4B (yeast), isoform CRA_a [Mus musculus]
          Length = 390

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 19  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 65

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 66  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 125

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 126 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 177

Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
               A + C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 178 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 237

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 238 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 297

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ ++DF+D+C +  KL++     P+F + +     + 
Sbjct: 298 CQHPPSRMGIGELDPSIAVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 357

Query: 450 HSDVLG 455
             DVL 
Sbjct: 358 CQDVLN 363


>gi|61211813|sp|Q8BGE6.2|ATG4B_MOUSE RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
           cysteine endopeptidase; AltName: Full=Autophagin-1;
           AltName: Full=Autophagy-related cysteine endopeptidase
           1; AltName: Full=Autophagy-related protein 4 homolog B
          Length = 393

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 184/366 (50%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
               A + C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ ++DF+D+C +  KL++     P+F + +     + 
Sbjct: 301 CQHPPSRMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLA 360

Query: 450 HSDVLG 455
             DVL 
Sbjct: 361 CQDVLN 366


>gi|348513452|ref|XP_003444256.1| PREDICTED: cysteine protease ATG4B-like [Oreochromis niloticus]
          Length = 391

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 118/345 (34%), Positives = 178/345 (51%), Gaps = 16/345 (4%)

Query: 135 NGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
           N L E ++   D +SR+  +YRK F PIG +  TSD GWGCMLR  QM++ +AL+   LG
Sbjct: 34  NALTEKDEILSDVTSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVCRHLG 93

Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
           R WR    +    EY+ +L+ F D + S +SIH + Q G   G   G W GP  + +  +
Sbjct: 94  RDWRWAKGQKQRDEYISLLNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLK 153

Query: 252 ALARCQRAETGLGCQSLPMAIYVVS--------GDEDGERGGAPVV--CIDDASRHCSVF 301
            LA        +   ++   + +           D  GE  G   +  C++ A   C++ 
Sbjct: 154 KLAVFDTWSKVVVHVAMDNTVVIEEIKRLCMPWLDACGELEGVGELNGCLEGA---CAMA 210

Query: 302 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
            +  A W P++LL+PL LGL  +N  YI TL+  F  PQSLG++GGKP ++ Y +G   E
Sbjct: 211 EEETALWRPLVLLIPLRLGLSDINDAYIETLKQCFMLPQSLGVIGGKPNSAHYFIGYVGE 270

Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
             IYLDPH  QP +   +D    D + +       +H+  +DPS+A GF+CR +D+FDD+
Sbjct: 271 ELIYLDPHTTQPAVEPSEDSQVPDETYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDW 330

Query: 422 CARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
           C R  +L+      P+F +  +    +   D L  T    + D L
Sbjct: 331 CMRIRRLSCNRGTLPMFELVDSQPSHMVSVDTLNLTPDFSDSDRL 375


>gi|427787309|gb|JAA59106.1| Putative peptidase family c54 [Rhipicephalus pulchellus]
          Length = 517

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 119/313 (38%), Positives = 172/313 (54%), Gaps = 25/313 (7%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---K 196
           F +DFSSR+  +YR+ F PI  + ITSD GWGCMLRSSQM++AQA++ H LGR WR    
Sbjct: 181 FLEDFSSRLWFTYRREFPPIPGTDITSDCGWGCMLRSSQMMLAQAVVTHVLGRQWRYRRN 240

Query: 197 PLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EAL 253
              +  D  + +++ LFGD  +  SPFS+H L+Q G   G  AG W GP +      EAL
Sbjct: 241 NQTEASDYVHRQVVRLFGDRTASASPFSLHKLVQMGHESGKQAGDWYGPSSAAYILKEAL 300

Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS-VFSKGQADWTPIL 312
               + E  L    L + IYV              + ++D    C    S G   W  ++
Sbjct: 301 EGACQTEQLL----LDLRIYVAQD---------CTIYLEDVRALCRGTRSNGAPLWRSVI 347

Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
           +LVP+ LG E++NP YIP ++   + P  +G++GG+P  S Y +G Q E  IYLDPH VQ
Sbjct: 348 ILVPVRLGGEQLNPTYIPCVKGMLSHPNCIGVIGGRPRHSLYFLGWQGEKVIYLDPHYVQ 407

Query: 373 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA--- 429
             +++G  D   D  +YH    R +    +DPS  +GFYC+ +D+F+ F     +LA   
Sbjct: 408 EAVDVGPQDFPLD--SYHCSWPRKMSFYKMDPSCTMGFYCKTEDEFEHFVKDVKQLAVPT 465

Query: 430 EESNGAPLFTVTQ 442
           E  +  P+F V++
Sbjct: 466 ESRHEYPVFLVSE 478


>gi|149711769|ref|XP_001497815.1| PREDICTED: cysteine protease ATG4B [Equus caballus]
          Length = 393

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 126/372 (33%), Positives = 181/372 (48%), Gaps = 52/372 (13%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 269
            Q G   G + G W GP          A+  +W ALA     +  +  +        SLP
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 188

Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEK 323
            A            G A      D+ RHC+ F  G       A W P++LL+PL LGL  
Sbjct: 189 CA------------GAAAFPA--DSDRHCNGFPAGAEVTNRPAPWRPLVLLIPLRLGLTD 234

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
           +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +        
Sbjct: 235 INEAYVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCFI 294

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
            D S +       + +  +DPS+A+GF+C+ +DDF+D+C + + L+      P+F + + 
Sbjct: 295 PDESFHCQHPPSRMSIGELDPSIAVGFFCKTEDDFNDWCQQVTMLSLLGGALPMFELVEQ 354

Query: 444 HKKPVNHSDVLG 455
               +   DVL 
Sbjct: 355 QPSHLACPDVLN 366


>gi|61211768|sp|Q6DG88.2|ATG4B_DANRE RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B
          Length = 394

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 119/338 (35%), Positives = 176/338 (52%), Gaps = 18/338 (5%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
           D +SR+  +YRK F PIG +  TSD GWGCMLR  QM++ +AL+   LGR W+    +  
Sbjct: 45  DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
             EYV IL+ F D + S +SIH + Q G   G + G W GP          A+  SW  L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164

Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 308
           A     +  +  + +     +     D +RG       P     D    C++  +  A W
Sbjct: 165 AVHVAMDNTVVIEEIKR---LCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETALW 221

Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
            P++LL+PL LGL  +N  YI  L+  F  PQSLG++GGKP ++ Y +G   +  IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281

Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
           H  QP ++  +D    D S +       +H+  +DPS+A GF+C+ +DDFDD+CA+  K+
Sbjct: 282 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 341

Query: 429 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
           +    G P+F +  +    +  +DVL  T    + D L
Sbjct: 342 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 378


>gi|148237097|ref|NP_001082821.1| cysteine protease ATG4B [Danio rerio]
 gi|141795460|gb|AAI34887.1| Atg4b protein [Danio rerio]
          Length = 394

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 119/338 (35%), Positives = 176/338 (52%), Gaps = 18/338 (5%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
           D +SR+  +YRK F PIG +  TSD GWGCMLR  QM++ +AL+   LGR W+    +  
Sbjct: 45  DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 104

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
             EYV IL+ F D + S +SIH + Q G   G + G W GP          A+  SW  L
Sbjct: 105 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 164

Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 308
           A     +  +  + +     +     D +RG       P     D    C++  +  A W
Sbjct: 165 AVHVAMDNTVVIEEIKR---LCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETALW 221

Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
            P++LL+PL LGL  +N  YI  L+  F  PQSLG++GGKP ++ Y +G   +  IYLDP
Sbjct: 222 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 281

Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
           H  QP ++  +D    D S +       +H+  +DPS+A GF+C+ +DDFDD+CA+  K+
Sbjct: 282 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 341

Query: 429 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
           +    G P+F +  +    +  +DVL  T    + D L
Sbjct: 342 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 378


>gi|50369556|gb|AAH76463.1| Atg4b protein, partial [Danio rerio]
          Length = 393

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 119/338 (35%), Positives = 175/338 (51%), Gaps = 18/338 (5%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
           D +SR+  +YRK F PIG +  TSD GWGCMLR  QM++ +AL+   LGR W+    +  
Sbjct: 44  DVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWSPGQRQ 103

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEAL 253
             EYV IL+ F D + S +SIH + Q G   G + G W GP          A+  SW  L
Sbjct: 104 RPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRL 163

Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA-----PVVCIDDASRHCSVFSKGQADW 308
           A     +  +  + +           D +RG       P     D    C++  +  A W
Sbjct: 164 AVHVAMDNTVVIEEIKRLCMPWL---DFDRGACAVSEEPREMNGDLEGACALAEEETALW 220

Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
            P++LL+PL LGL  +N  YI  L+  F  PQSLG++GGKP ++ Y +G   +  IYLDP
Sbjct: 221 KPLVLLIPLRLGLSDINEAYIEPLKQCFMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDP 280

Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
           H  QP ++  +D    D S +       +H+  +DPS+A GF+C+ +DDFDD+CA+  K+
Sbjct: 281 HTTQPAVDPSEDGHFPDDSYHCQHPPCRMHICELDPSIAAGFFCQTEDDFDDWCAQIRKV 340

Query: 429 AEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
           +    G P+F +  +    +  +DVL  T    + D L
Sbjct: 341 S-NCRGLPMFELVDSQPSHLITADVLNLTPDFSDSDRL 377


>gi|397483835|ref|XP_003813096.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pan paniscus]
          Length = 405

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 127/385 (32%), Positives = 190/385 (49%), Gaps = 42/385 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 450 HSDVLGETGGVPEDDSLGVMSMNDA 474
             DVL  + G  E   + V S+ D+
Sbjct: 361 CPDVLNLSLG--ESCQVQVGSLGDS 383


>gi|298231123|ref|NP_001177212.1| cysteine protease ATG4B [Sus scrofa]
 gi|296874484|gb|ADH81747.1| autophagy related 4-like protein B [Sus scrofa]
          Length = 393

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 127/372 (34%), Positives = 179/372 (48%), Gaps = 52/372 (13%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQALL   LGR WR    +     Y  +LH F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALLCRHLGRGWRWTQWERQPDSYFSVLHAFMDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQ-SLP 269
            Q G   G + G W GP  + +         +W ALA            E    C+ SLP
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSALAVHVAMDNTVVMEEIRRLCRSSLP 188

Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEK 323
                        R GA      D+ RHC+ F            W P++LL+PL LGL  
Sbjct: 189 -------------RAGAAAFPA-DSDRHCNGFPAEAEVGPRPVPWRPLVLLIPLRLGLTD 234

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
           +N  Y  TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +    L 
Sbjct: 235 INAAYTETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVQVTDSCLI 294

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
            D S +       + +  +DPS+A+GF+C+ ++DF+D+C +  KL+      P+F + + 
Sbjct: 295 PDESFHCQHPPHRMSIAELDPSIAVGFFCQTEEDFNDWCQQVRKLSLLGGALPMFELVEQ 354

Query: 444 HKKPVNHSDVLG 455
               +   DVL 
Sbjct: 355 QPSHLACPDVLN 366


>gi|194381088|dbj|BAG64112.1| unnamed protein product [Homo sapiens]
          Length = 510

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 139 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 185

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 186 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 245

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 246 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 297

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 298 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 357

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 358 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 417

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 418 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 477

Query: 450 HSDVL 454
             DVL
Sbjct: 478 CPDVL 482


>gi|119591684|gb|EAW71278.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
          Length = 415

 Score =  205 bits (522), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 450 HSDVLGETGG 459
             DVL  + G
Sbjct: 361 CPDVLNLSLG 370


>gi|410036442|ref|XP_003950065.1| PREDICTED: cysteine protease ATG4B [Pan troglodytes]
          Length = 521

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 127/385 (32%), Positives = 189/385 (49%), Gaps = 42/385 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     + 
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476

Query: 450 HSDVLGETGGVPEDDSLGVMSMNDA 474
             DVL  + G  E   + V S+ D+
Sbjct: 477 CPDVLNLSLG--ESCQVQVGSLGDS 499


>gi|397483833|ref|XP_003813095.1| PREDICTED: cysteine protease ATG4B isoform 2 [Pan paniscus]
          Length = 468

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 448

Query: 450 HSDVLGETGG 459
             DVL  + G
Sbjct: 449 CPDVLNLSLG 458


>gi|355565356|gb|EHH21845.1| hypothetical protein EGK_04999, partial [Macaca mulatta]
          Length = 393

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 450 HSDVLG 455
             DVL 
Sbjct: 361 CPDVLN 366


>gi|30410798|ref|NP_847896.1| cysteine protease ATG4B isoform b [Homo sapiens]
          Length = 380

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360

Query: 450 HSDVLGETGG 459
             DVL  + G
Sbjct: 361 CPDVLNLSLG 370


>gi|397483831|ref|XP_003813094.1| PREDICTED: cysteine protease ATG4B isoform 1 [Pan paniscus]
          Length = 481

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 328

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 448

Query: 450 HSDVL 454
             DVL
Sbjct: 449 CPDVL 453


>gi|402889930|ref|XP_003908250.1| PREDICTED: cysteine protease ATG4B [Papio anubis]
          Length = 508

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 137 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 183

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 184 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 243

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 244 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 295

Query: 282 ERGGAPVVCID------DASRHCSVFSKG------QADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 296 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 355

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 356 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 415

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 416 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 475

Query: 450 HSDVL 454
             DVL
Sbjct: 476 CPDVL 480


>gi|380808290|gb|AFE76020.1| cysteine protease ATG4B isoform a [Macaca mulatta]
 gi|383416899|gb|AFH31663.1| cysteine protease ATG4B isoform a [Macaca mulatta]
 gi|384941198|gb|AFI34204.1| cysteine protease ATG4B isoform a [Macaca mulatta]
          Length = 393

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 450 HSDVLG 455
             DVL 
Sbjct: 361 CPDVLN 366


>gi|88192732|pdb|2D1I|A Chain A, Structure Of Human Atg4b
 gi|88192733|pdb|2D1I|B Chain B, Structure Of Human Atg4b
          Length = 398

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 181/365 (49%), Gaps = 40/365 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 27  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 73

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 74  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 133

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 134 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 185

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 186 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 245

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 246 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 305

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 306 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 365

Query: 450 HSDVL 454
             DVL
Sbjct: 366 CPDVL 370


>gi|90077212|dbj|BAE88286.1| unnamed protein product [Macaca fascicularis]
          Length = 393

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 450 HSDVLG 455
             DVL 
Sbjct: 361 CPDVLN 366


>gi|71891691|dbj|BAA76787.2| KIAA0943 protein [Homo sapiens]
          Length = 396

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 25  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 72  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 363

Query: 450 HSDVLG 455
             DVL 
Sbjct: 364 CPDVLN 369


>gi|34531319|dbj|BAC86110.1| unnamed protein product [Homo sapiens]
          Length = 468

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 183/370 (49%), Gaps = 40/370 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 110 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 156

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 157 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 216

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 217 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 268

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 269 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 328

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 329 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 388

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 389 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 448

Query: 450 HSDVLGETGG 459
             DVL  + G
Sbjct: 449 CPDVLNLSLG 458


>gi|354474222|ref|XP_003499330.1| PREDICTED: cysteine protease ATG4B-like [Cricetulus griseus]
          Length = 479

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 120/365 (32%), Positives = 181/365 (49%), Gaps = 40/365 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 108 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 154

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 155 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 214

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 215 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 266

Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYI 329
               A + C        D+ RHC+ F  G         W P++LL+PL LGL  +N  Y+
Sbjct: 267 RLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAYV 326

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 327 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 386

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C  ++DF+D+C +  KL+      P+F + +     + 
Sbjct: 387 CQHPPCRMGIGELDPSIAVGFFCETEEDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLV 446

Query: 450 HSDVL 454
             DVL
Sbjct: 447 CQDVL 451


>gi|5262636|emb|CAB45756.1| hypothetical protein [Homo sapiens]
 gi|12653857|gb|AAH00719.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Homo sapiens]
 gi|27763981|emb|CAD43219.1| autophagin-1 [Homo sapiens]
 gi|117646318|emb|CAL38626.1| hypothetical protein [synthetic construct]
 gi|119591687|gb|EAW71281.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_d
           [Homo sapiens]
 gi|123981932|gb|ABM82795.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [synthetic
           construct]
 gi|168273130|dbj|BAG10404.1| ATG4 autophagy related 4 homolog B [synthetic construct]
          Length = 393

 Score =  204 bits (520), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 450 HSDVLG 455
             DVL 
Sbjct: 361 CPDVLN 366


>gi|47132611|ref|NP_037457.3| cysteine protease ATG4B isoform a [Homo sapiens]
 gi|296434400|sp|Q9Y4P1.2|ATG4B_HUMAN RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
           cysteine endopeptidase; AltName: Full=Autophagin-1;
           AltName: Full=Autophagy-related cysteine endopeptidase
           1; AltName: Full=Autophagy-related protein 4 homolog B;
           Short=hAPG4B
 gi|62822370|gb|AAY14919.1| unknown [Homo sapiens]
          Length = 393

 Score =  204 bits (520), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360

Query: 450 HSDVLG 455
             DVL 
Sbjct: 361 CPDVLN 366


>gi|78101773|pdb|2CY7|A Chain A, The Crystal Structure Of Human Atg4b
          Length = 396

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 25  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 72  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 363

Query: 450 HSDVLG 455
             DVL 
Sbjct: 364 CPDVLN 369


>gi|66773074|ref|NP_001019605.1| cysteine protease ATG4A [Danio rerio]
 gi|66267494|gb|AAH95617.1| Zgc:111958 [Danio rerio]
          Length = 375

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 109/337 (32%), Positives = 171/337 (50%), Gaps = 33/337 (9%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG C+ +   ++           E   D  SR+  +YRK F PIG +  +SD GWGC
Sbjct: 26  VWILGACYNVKTKKS-----------ELLSDVRSRLWFTYRKKFSPIGGTGPSSDAGWGC 74

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR WR   +K   +EY  IL  F D + S +SIH + Q G  
Sbjct: 75  MLRCGQMILAQALICSHLGRDWRWDPEKHQPKEYQRILDCFLDKKDSCYSIHQMAQMGVG 134

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +++YV   +          V I+
Sbjct: 135 EGKSVGEWYGPNTVAQVLKKLALFDDWNS--------LSVYVSMDN---------TVVIE 177

Query: 293 DASRHC-----SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 347
           D  + C      + S+   DW P+LL++PL +G+  +NP YI  L+  F  PQS G++GG
Sbjct: 178 DIKKLCVRADLQLQSQQPLDWRPLLLVIPLRMGINSINPVYIQALKECFKMPQSCGVLGG 237

Query: 348 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 407
           KP  + Y +G  ++  IYLDPH  Q  ++        D S +       + + S+DPS+A
Sbjct: 238 KPNLAYYFIGFIDDELIYLDPHTTQQAVDTESGSAVDDQSFHCQRTPHRMKITSLDPSVA 297

Query: 408 IGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +GF+C+ ++DFD +C    +   +     +F + + H
Sbjct: 298 LGFFCKSEEDFDSWCDLVQQELLKKRNLRMFELVEKH 334


>gi|332815902|ref|XP_001162556.2| PREDICTED: cysteine protease ATG4B isoform 1 [Pan troglodytes]
          Length = 496

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 182/370 (49%), Gaps = 40/370 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     + 
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476

Query: 450 HSDVLGETGG 459
             DVL  + G
Sbjct: 477 CPDVLNLSLG 486


>gi|410036440|ref|XP_003309622.2| PREDICTED: cysteine protease ATG4B isoform 5 [Pan troglodytes]
          Length = 509

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 180/365 (49%), Gaps = 40/365 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 138 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 184

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 185 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 244

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 245 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 296

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 297 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 356

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 357 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 416

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     + 
Sbjct: 417 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 476

Query: 450 HSDVL 454
             DVL
Sbjct: 477 CPDVL 481


>gi|432107261|gb|ELK32675.1| Cysteine protease ATG4B [Myotis davidii]
          Length = 394

 Score =  204 bits (519), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 123/367 (33%), Positives = 179/367 (48%), Gaps = 42/367 (11%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 23  TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQALL   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 70  DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129

Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 269
            Q G   G + G W GP          A+  +W ALA     +  +  +        SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAIFDTWSALAVHIAMDNTVVMEDIRRLCRSSLP 189

Query: 270 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
            A       D +G   G P           +  +   + W P++LL+PL LGL  +N  Y
Sbjct: 190 CAEATAFPADSEGHCNGLPA---------GAEVTNRPSLWRPLVLLIPLRLGLTDINEAY 240

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           + TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +      L  D S 
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSFLIPDESF 300

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 448
           +       + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +
Sbjct: 301 HCQHPPSRMSIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHL 360

Query: 449 NHSDVLG 455
              DVL 
Sbjct: 361 ACPDVLN 367


>gi|410206608|gb|JAA00523.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
 gi|410247746|gb|JAA11840.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
 gi|410295834|gb|JAA26517.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
 gi|410352839|gb|JAA43023.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
          Length = 393

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 180/366 (49%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLA 360

Query: 450 HSDVLG 455
             DVL 
Sbjct: 361 CPDVLN 366


>gi|328707620|ref|XP_001947296.2| PREDICTED: cysteine protease ATG4B-like isoform 1 [Acyrthosiphon
           pisum]
 gi|328707622|ref|XP_003243448.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Acyrthosiphon
           pisum]
 gi|328707624|ref|XP_003243449.1| PREDICTED: cysteine protease ATG4B-like isoform 3 [Acyrthosiphon
           pisum]
 gi|328707626|ref|XP_003243450.1| PREDICTED: cysteine protease ATG4B-like isoform 4 [Acyrthosiphon
           pisum]
          Length = 402

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 110/344 (31%), Positives = 178/344 (51%), Gaps = 38/344 (11%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I  +   +W+LG  +    D           L +   D  SR+  +YRKGF  IG++  T
Sbjct: 40  IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           SD GWGCMLR  QM++ QAL+F  LGR WR    K  D +Y++IL +F D  ++P+SIH 
Sbjct: 89  SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
           +   G ++G   G W GP  + +  + LA             L   ++ V+ D       
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLA---------TMDELSSLVFHVALDN------ 192

Query: 286 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
              + I++  + C+V  +  +    W P++L++PL LG+  +NP Y+  +++ FTFPQSL
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKMCFTFPQSL 250

Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVIRHIHL 399
           G++GG+P  + Y +G      I+LDPH  Q +  +   D+E +     +YH   I  + +
Sbjct: 251 GVIGGRPNHALYFIGFVGNDVIFLDPHTTQQIGMLPNKDIETEHKIDHSYHCQQINRLPI 310

Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARAS---KLAEESNGAPLFTV 440
            ++DPSLA  F C+ ++DF+  C         +++S   PL T+
Sbjct: 311 LNMDPSLAACFMCQTENDFNALCHELKVHLVQSDQSPSQPLITI 354


>gi|344239232|gb|EGV95335.1| Cysteine protease ATG4B [Cricetulus griseus]
          Length = 394

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 181/366 (49%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 23  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 69

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 70  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 129

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 181

Query: 282 ERGGAPVVCI------DDASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYI 329
               A + C        D+ RHC+ F  G         W P++LL+PL LGL  +N  Y+
Sbjct: 182 RLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAWRPLVLLIPLRLGLTDINEAYV 241

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +
Sbjct: 242 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFH 301

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C  ++DF+D+C +  KL+      P+F + +     + 
Sbjct: 302 CQHPPCRMGIGELDPSIAVGFFCETEEDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLV 361

Query: 450 HSDVLG 455
             DVL 
Sbjct: 362 CQDVLN 367


>gi|14042685|dbj|BAB55353.1| unnamed protein product [Homo sapiens]
          Length = 380

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 122/370 (32%), Positives = 183/370 (49%), Gaps = 40/370 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  +  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCYMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLA 360

Query: 450 HSDVLGETGG 459
             DVL  + G
Sbjct: 361 CPDVLNLSLG 370


>gi|343961553|dbj|BAK62366.1| cysteine protease ATG4B [Pan troglodytes]
          Length = 393

 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 180/366 (49%), Gaps = 40/366 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     + 
Sbjct: 301 CQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGCALPMFELVEQQPSHLA 360

Query: 450 HSDVLG 455
             DVL 
Sbjct: 361 CPDVLN 366


>gi|344299096|ref|XP_003421224.1| PREDICTED: cysteine protease ATG4B [Loxodonta africana]
          Length = 420

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 119/365 (32%), Positives = 183/365 (50%), Gaps = 40/365 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 49  TSEPVWILGRKYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 95

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQALL   LGR WR   ++     Y  +LH F D + S +SIH +
Sbjct: 96  DTGWGCMLRCGQMIFAQALLCRHLGRDWRWAQRRRQPDSYFSVLHAFIDRKDSHYSIHQI 155

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGE 282
            Q G   G + G W GP  + +  + LA      +        +A+++   +    E+  
Sbjct: 156 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 207

Query: 283 R-------GGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
           R             C  D S+HC+    G       + W P++LL+PL LGL  +N  Y+
Sbjct: 208 RLCKSSTPCAGAAACPADPSQHCNGLPAGAEAAGRPSTWRPLVLLIPLRLGLTDINEAYV 267

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D + +
Sbjct: 268 ETLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELAGGFSIPDETFH 327

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  +++  +DPS+A+GF+C+ ++DF+D+C +  KL+  S   P+F + +     + 
Sbjct: 328 CQHPPCRMNIAELDPSIAVGFFCKTEEDFNDWCQQVKKLSLLSGALPMFELVEQQPSHLA 387

Query: 450 HSDVL 454
             DVL
Sbjct: 388 CPDVL 392


>gi|291415044|ref|XP_002723769.1| PREDICTED: APG4 autophagy 4 homolog B [Oryctolagus cuniculus]
          Length = 473

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 116/355 (32%), Positives = 175/355 (49%), Gaps = 21/355 (5%)

Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
           ++  +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD 
Sbjct: 103 TSEPVWILGRKYSLLTEKN-----------EILSDVASRLWFTYRKNFPAIGGTGPTSDT 151

Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
           GWGCMLR  QM+ AQAL+   LGR WR   QK     Y+ +LH F D + S +SIH + Q
Sbjct: 152 GWGCMLRCGQMIFAQALVCRHLGRDWRWTQQKRQPDSYLSVLHAFMDRKDSYYSIHQIAQ 211

Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
            G   G + G W GP  + +  + LA      + L          V+       R   P 
Sbjct: 212 MGVGEGKSVGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRSSHPC 270

Query: 289 VCIDDASR----HCSVFS-----KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
                       HC+ F        ++ W P++LL+PL LGL  +N  Y+ TL+L F  P
Sbjct: 271 AGAATPPAGADWHCNGFPASTEVTNRSPWRPLVLLIPLRLGLTDINEAYVETLKLCFRMP 330

Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHL 399
           QSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +       + +
Sbjct: 331 QSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDLCFIPDESFHCQHPPCRMSI 390

Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
             +DPS+A+GF+C+ ++DF+D+C +  KL+      P+F + +     +   DVL
Sbjct: 391 GELDPSIAVGFFCKTEEDFNDWCQQVRKLSLLGGALPMFELVEQQPPHLACPDVL 445


>gi|307174864|gb|EFN65142.1| Cysteine protease ATG4D [Camponotus floridanus]
          Length = 477

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 119/373 (31%), Positives = 176/373 (47%), Gaps = 46/373 (12%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
           S  S +WLLG C+    ++ L  A+                      N + EF +DF SR
Sbjct: 86  SKESPVWLLGQCYLKKSEDPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFVSR 145

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF----- 202
           I ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR WR   ++P      
Sbjct: 146 IWLTYRREFQILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQPIETLQQ 205

Query: 203 ---DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
              DR +  I+  FGD   SPFSIH L+  G + G  AG W GP ++         C   
Sbjct: 206 RLDDRNHRMIIKWFGDQSESPFSIHRLVLLGASAGKRAGDWYGPSSVAHLLSQAVECASK 265

Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
           ++      L  A+YV              V + D    C         W  ++LLVPL L
Sbjct: 266 QSNSNFDHL--AVYVAQD---------CAVYLQDVENICRT---PDGKWKALVLLVPLRL 311

Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
           G +K+NP Y P L    T    +G++GG+P  S Y +G Q++  I+LDPH  Q  +++ K
Sbjct: 312 GADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDVWK 371

Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK--LAEESNGAPL 437
           +D     +++H    R + L  +DPS  +GFY  +K+   DF     +  +  +    P+
Sbjct: 372 NDFSL--TSFHCTSPRKMLLSKMDPSCCVGFYFPNKEALTDFMETIQRFVIPNQKTNYPM 429

Query: 438 FTVTQTHKKPVNH 450
           F   +   K + H
Sbjct: 430 FLFCEGSGKDLQH 442


>gi|348577273|ref|XP_003474409.1| PREDICTED: cysteine protease ATG4B [Cavia porcellus]
          Length = 412

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 119/360 (33%), Positives = 178/360 (49%), Gaps = 28/360 (7%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +D+ L D A             SR+  +YR+ F  IG +  TS
Sbjct: 39  TSEPVWILGRKYSIFTEKDDILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 85

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 86  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQI 145

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
            Q G   G + G W GP  + +  + LA      + L          V+       R G 
Sbjct: 146 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-TWSSLAVHIAMDNTVVMEEIRRLCRTGL 204

Query: 287 PVV----CIDDASRHCSVF--------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
           P         DA RHC+ F         +  + W P++LL+PL LGL  +N  Y+ TL+ 
Sbjct: 205 PCAGAAALPTDADRHCNGFPTQTEVTNRQSPSLWRPLVLLIPLRLGLTDINEAYVETLKH 264

Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 394
            F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D + +     
Sbjct: 265 CFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDGCFIPDETFHCQHPP 324

Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
             + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL
Sbjct: 325 CRMGIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVL 384


>gi|395851538|ref|XP_003798310.1| PREDICTED: cysteine protease ATG4B [Otolemur garnettii]
          Length = 393

 Score =  201 bits (512), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 122/375 (32%), Positives = 180/375 (48%), Gaps = 58/375 (15%)

Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
           ++  +W+LG  + I  ++            E   D +SR+  +YRK F  IG +  TSD 
Sbjct: 22  TSEPVWILGRKYSIFTEKE-----------ELLSDVASRLWFTYRKNFPAIGGTGPTSDT 70

Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
           GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q
Sbjct: 71  GWGCMLRCGQMIFAQALVCQHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQ 130

Query: 229 AGKAYGLAAGSWVGP---------YAMCRSWEALA------------RCQR-AETGLGCQ 266
            G   G + G W GP          A+  +W +LA              +R   T L C 
Sbjct: 131 MGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSLPCG 190

Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ------ADWTPILLLVPLVLG 320
           + P +              AP        +HC+ F  G       + W P++LL+PL LG
Sbjct: 191 TAPAS------------SAAP-------DQHCNGFPAGAEVTTRLSPWRPLVLLIPLRLG 231

Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
           L  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +     
Sbjct: 232 LTDINAAYVETLKRCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEATDS 291

Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
            L  D S +       + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F +
Sbjct: 292 CLVPDESFHCQHPPCRMSIGELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFEL 351

Query: 441 TQTHKKPVNHSDVLG 455
            +     +   DVL 
Sbjct: 352 VEQPPSHLACPDVLN 366


>gi|291226947|ref|XP_002733451.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
           kowalevskii]
          Length = 356

 Score =  201 bits (512), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 114/311 (36%), Positives = 163/311 (52%), Gaps = 13/311 (4%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  + + +D +           E   D  SRI I+YRK F  IG +  TSD GWGC
Sbjct: 26  VWILGKAYHLIRDRS-----------ELLADIKSRIWITYRKNFSAIGGTGPTSDNGWGC 74

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQALL   LGR WR   ++  +  Y +IL LF D + S +SIH + Q G  
Sbjct: 75  MLRCGQMILAQALLCKHLGREWRWESREHQNETYCKILKLFLDRKDSCYSIHQIAQMGVG 134

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +    L       +     S+   I VV       R      C  
Sbjct: 135 EGKSIGQWFGPNTVAQVLRKLTLFDDWSSIAVHISMDNTI-VVEDIRKLCRTPLFTECAS 193

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
             +   S+ + G   W P++L +PL LGL ++NP Y+  L+  FT  QSLG++GGKP  +
Sbjct: 194 PKAASASLENGGTTYWKPLVLFIPLRLGLTEINPLYLDVLKKCFTLKQSLGMIGGKPNHA 253

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
            Y +G   ++ +YLDPH  QPV++I K     D  TYH      +++  +DPS+A+GF+C
Sbjct: 254 HYFIGFYGKTLVYLDPHTTQPVVDINKWASIPD-DTYHCKHPSRMNIMHLDPSIALGFFC 312

Query: 413 RDKDDFDDFCA 423
             + DFDD C 
Sbjct: 313 HCESDFDDLCT 323


>gi|410920724|ref|XP_003973833.1| PREDICTED: cysteine protease ATG4B-like [Takifugu rubripes]
          Length = 394

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 117/363 (32%), Positives = 182/363 (50%), Gaps = 27/363 (7%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
            +T  +W+LG            + +      E   D +SR+  +YRK F PIG +  TSD
Sbjct: 21  ETTEPVWILG-----------NEYSALTEKEEILSDVTSRLWFTYRKSFPPIGGTGPTSD 69

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLL 227
            GWGCMLR  QM++ QAL+   LGR WR    +   +EY+ IL+ F D + S +SIH + 
Sbjct: 70  TGWGCMLRCGQMILGQALMCRHLGRDWRWVRGQKQRQEYISILNAFIDKKDSYYSIHQIA 129

Query: 228 QAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 277
           Q G   G   G W GP          A+  +W  L      +  +  + +  + +  +  
Sbjct: 130 QMGVGEGKPIGQWYGPNTVAQVLKKLAVFDTWSRLVVHVAMDNTVVIEEIKRLCMPWLDK 189

Query: 278 DE---DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
            E   + ER G    C++ A   C++  +  A W P++LL+PL LGL  +N  YI TL+ 
Sbjct: 190 AEVFGEPERVGELNGCLEGA---CALSEEEVALWKPLVLLIPLRLGLSDINGAYIETLKK 246

Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 394
            F  PQSLG++GGKP ++ Y +G      IYLDPH  Q  +   +     D + +     
Sbjct: 247 CFMLPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQTAVEPCEHGQFPDDTYHCQHPP 306

Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
             +H+  +DPS+A+GF+CR +D+FDD+C R  +L+   +  P+F +  +    +   D +
Sbjct: 307 CRMHICELDPSIAVGFFCRTEDEFDDWCMRIRRLSCNKDNLPMFELVDSQPSHLVGVDAI 366

Query: 455 GET 457
             T
Sbjct: 367 NLT 369


>gi|74136555|ref|NP_777364.3| cysteine protease ATG4A [Mus musculus]
 gi|61211821|sp|Q8C9S8.2|ATG4A_MOUSE RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
           cysteine endopeptidase; AltName: Full=Autophagin-2;
           AltName: Full=Autophagy-related cysteine endopeptidase
           2; AltName: Full=Autophagy-related protein 4 homolog A
 gi|59809037|gb|AAH89500.1| Atg4a protein [Mus musculus]
 gi|74193939|dbj|BAE36898.1| unnamed protein product [Mus musculus]
          Length = 396

 Score =  201 bits (511), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 112/353 (31%), Positives = 176/353 (49%), Gaps = 50/353 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
           D  + C V   G AD                     W P+LL+VPL LG+ ++NP Y+  
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
            +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++I +  L  D + +  
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300

Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
              + + + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352


>gi|296488734|tpg|DAA30847.1| TPA: cysteine protease ATG4B [Bos taurus]
          Length = 390

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 127/368 (34%), Positives = 177/368 (48%), Gaps = 46/368 (12%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L  F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
            Q G   G + G W GP          A+  +W ALA             + M   VV  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALA-----------VHVAMDNTVVMA 177

Query: 278 D-EDGERGGAPVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNP 326
           D     R   P    +    D+ RHC+ F          A W P++LL+PL LGL  VN 
Sbjct: 178 DIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNA 237

Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
            Y  TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D 
Sbjct: 238 AYAGTLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDE 297

Query: 387 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
           S +       + +  +DPS+A+GF+C  +DDF+D+C + SKL+      P+F + +    
Sbjct: 298 SFHCQHPPGRMSIAELDPSIAVGFFCETEDDFNDWCQQVSKLSLLGGALPMFELVEQQPS 357

Query: 447 PVNHSDVL 454
            +   DVL
Sbjct: 358 HLACPDVL 365


>gi|47564102|ref|NP_001001170.1| cysteine protease ATG4B [Bos taurus]
 gi|61211780|sp|Q6PZ03.1|ATG4B_BOVIN RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related cysteine endopeptidase 2B;
           Short=Autophagin-2B; AltName: Full=Autophagy-related
           protein 4 homolog B; AltName: Full=bAut2B
 gi|45861660|gb|AAS78583.1| Aut2b2 [Bos taurus]
          Length = 393

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 127/368 (34%), Positives = 177/368 (48%), Gaps = 46/368 (12%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L  F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
            Q G   G + G W GP          A+  +W ALA             + M   VV  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALA-----------VHVAMDNTVVMA 177

Query: 278 D-EDGERGGAPVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNP 326
           D     R   P    +    D+ RHC+ F          A W P++LL+PL LGL  VN 
Sbjct: 178 DIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNA 237

Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
            Y  TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D 
Sbjct: 238 AYAGTLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDE 297

Query: 387 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
           S +       + +  +DPS+A+GF+C  +DDF+D+C + SKL+      P+F + +    
Sbjct: 298 SFHCQHPPGRMSIAELDPSIAVGFFCETEDDFNDWCQQVSKLSLLGGALPMFELVEQQPS 357

Query: 447 PVNHSDVL 454
            +   DVL
Sbjct: 358 HLACPDVL 365


>gi|417410350|gb|JAA51650.1| Putative cysteine protease required for autophagy, partial
           [Desmodus rotundus]
          Length = 394

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 176/366 (48%), Gaps = 42/366 (11%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 23  TSEPVWILGRRYSVFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 69

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQALL   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 70  DTGWGCMLRCGQMIFAQALLCRHLGRDWRWTQRKRQPDSYFHVLNAFIDRKDSYYSIHQI 129

Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQ--------SLP 269
            Q G   G + G W GP          A+  +W ALA     +  +  +        SLP
Sbjct: 130 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALAVHVAMDNTVVMEDIRRLCRSSLP 189

Query: 270 MA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
            A       D +G   G P           +  +   + W P++LL+PL LGL  +N  Y
Sbjct: 190 CAGASAFPADSEGHCNGFPAR---------AEVTNRPSPWRPLVLLIPLRLGLTDINEAY 240

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           + TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S 
Sbjct: 241 VETLKGCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEFTDSCSIPDESF 300

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 448
           +       + +  +DPS+A+GF+C  +DDF D+C +  KL+      P+F + +     +
Sbjct: 301 HCQHPPSRMSIGELDPSIAVGFFCETEDDFGDWCQQVKKLSLLGGALPMFELVEQQPSHL 360

Query: 449 NHSDVL 454
              DVL
Sbjct: 361 ACPDVL 366


>gi|308802424|ref|XP_003078525.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
 gi|116056978|emb|CAL51405.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
          Length = 424

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 126/364 (34%), Positives = 182/364 (50%), Gaps = 57/364 (15%)

Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 174
           + GV H   ++ + G+ +   G  E+ +D+ SR  ++YR+GF+ +G +K  +D GWGC L
Sbjct: 42  MFGVTH-WDRETSSGERSNEVGRREWERDWRSRCWMTYRRGFEALGRTKWCTDAGWGCTL 100

Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDRE----------------------------- 205
           RS+QM++A AL  H  GR WR+ +Q     E                             
Sbjct: 101 RSAQMMLANALSIHSRGRHWRREVQLVAVHENETADDGSKSPAVSFLSGVVNKLKIPQSE 160

Query: 206 --------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
                     +IL LF D   +PFSIH + +    +G   G W  P  MCR++EAL    
Sbjct: 161 RTRAGSDAQEDILRLFADEVGAPFSIHRVCEKTTEWGAPPGRWFEPSVMCRAFEALV--- 217

Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 317
            AE  LG +   + ++VVSG E GE GG P V  D+A         G+A    +LL VP+
Sbjct: 218 -AEHDLGSE---LTVHVVSGRE-GEDGGVPTV--DEAEVRAKSADVGKA----LLLFVPV 266

Query: 318 VLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
           VLG+ + +N RY+  LR    F QS+GIVGG+P +S Y+VG  ++   YLDPH VQ   +
Sbjct: 267 VLGVGRTINARYLSQLRSMMAFKQSVGIVGGRPNSSLYLVGHSDDVFFYLDPHTVQVASS 326

Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 436
           +   D E    +Y+     H+    +DP+LA+GFYCRD DD          LA  +  AP
Sbjct: 327 MVTMDFE----SYYCPTPLHVCGGDLDPTLALGFYCRDGDDVASLLVDIEALARVNATAP 382

Query: 437 LFTV 440
              +
Sbjct: 383 ALAI 386


>gi|440901286|gb|ELR52261.1| Cysteine protease ATG4B, partial [Bos grunniens mutus]
          Length = 393

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 121/358 (33%), Positives = 174/358 (48%), Gaps = 26/358 (7%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L  F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
            Q G   G + G W GP  + +  + LA      + L          V++      R   
Sbjct: 129 AQMGVGEGKSVGQWYGPNTVAQVLKKLAVFD-TWSALAVHVAMDNTVVMADIRRLCRSSL 187

Query: 287 PVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           P    +    D+ RHC+ F          A W P++LL+PL LGL  VN  Y  TL+  F
Sbjct: 188 PCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 247

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 248 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 307

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
           + +  +DPS+A+GF+C  +DDF+D+C +  KL+      P+F + +     +   DVL
Sbjct: 308 MSIAELDPSIAVGFFCETEDDFNDWCQQVRKLSLLGGALPMFELVEQQPSHLACPDVL 365


>gi|148233205|ref|NP_001088025.1| cysteine protease ATG4B [Xenopus laevis]
 gi|61211762|sp|Q640G7.1|ATG4B_XENLA RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B
 gi|52221191|gb|AAH82660.1| LOC494717 protein [Xenopus laevis]
          Length = 384

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 115/342 (33%), Positives = 169/342 (49%), Gaps = 36/342 (10%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
           D +SR+  +YR+ F  IG +  TSD GWGCMLR  QM+ AQAL+   +GR WR   QKP 
Sbjct: 45  DITSRLWFTYRRNFQAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHVGRDWRWDKQKP- 103

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
             EY+ IL  F D + S +SIH + Q G   G   G W GP  + +    LA   +  + 
Sbjct: 104 KGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKYIGQWYGPNTVAQVLRKLAVFDQWSS- 162

Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD--------------- 307
                  +A+++   +          V +D+  R C   S   +D               
Sbjct: 163 -------IAVHIAMDN---------TVVVDEIRRLCRAGSGESSDAGALSNGYTGDSDPS 206

Query: 308 ---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
              W P++LL+PL LGL ++N  YI TL+  F  PQSLG++GG+P ++ Y +G   +  I
Sbjct: 207 CAQWKPLVLLIPLRLGLSEINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDELI 266

Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
           YLDPH  Q  +         D S +       +H+  IDPS+A+GF+C  ++DF+D+C  
Sbjct: 267 YLDPHTTQLSVEPSDCSFIEDESFHCQHPPCRMHVSEIDPSIAVGFFCSSQEDFEDWCQH 326

Query: 425 ASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
             KL+      P+F V       +++ DVL  T    + D L
Sbjct: 327 IKKLSLSGGALPMFEVVDQLPLHLSNPDVLNLTPDSSDADRL 368


>gi|27763985|emb|CAD43221.1| autophagin-2 [Mus musculus]
 gi|148675648|gb|EDL07595.1| mCG64870 [Mus musculus]
          Length = 396

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 111/353 (31%), Positives = 174/353 (49%), Gaps = 50/353 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H    +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHPFKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + L       +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLTLFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
           D  + C V   G AD                     W P+LL+VPL LG+ ++NP Y+  
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTVSNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
            +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++I +  L  D + +  
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300

Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
              + + + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352


>gi|149244060|pdb|2Z0D|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-120) Complex
 gi|149244062|pdb|2Z0E|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-124) Complex
          Length = 357

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 118/353 (33%), Positives = 176/353 (49%), Gaps = 40/353 (11%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 25  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 72  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDP   QP +         D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPATTQPAVEPTDGCFIPDESFH 303

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVE 356


>gi|18181958|dbj|BAB83890.1| Apg4B [Homo sapiens]
          Length = 392

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 121/366 (33%), Positives = 181/366 (49%), Gaps = 41/366 (11%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G     + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEK-SIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 179

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 180 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 239

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 240 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 299

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN 449
                  + + ++DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     + 
Sbjct: 300 CQHPPCRMSIANLDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLA 359

Query: 450 HSDVLG 455
             DVL 
Sbjct: 360 CPDVLN 365


>gi|328874598|gb|EGG22963.1| hypothetical protein DFA_05093 [Dictyostelium fasciculatum]
          Length = 432

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 109/313 (34%), Positives = 171/313 (54%), Gaps = 11/313 (3%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHR-LGRPWR 195
           + EF +DFS+++  SYR+GF+ IGDS   +D GWGCMLRS QML+A  LL +  +G+ W+
Sbjct: 88  IEEFLEDFSNKLWCSYRQGFECIGDSLFENDCGWGCMLRSGQMLLANVLLLNSPIGKDWK 147

Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALA 254
           KP    +  ++ +++ LF D  ++PFSIHN+   G+ + G + G W  P  +  +  AL 
Sbjct: 148 KPQNGEYPEDFYKVVRLFLDRPSAPFSIHNIALHGRNHLGKSIGEWFAPSNISNAIRALV 207

Query: 255 -RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC----SVFSKGQADWT 309
            +      G   +            +   +    V   DD S +      +  +    W 
Sbjct: 208 YKYDNHLNGTSEEDSSDEEKEGKKKKGDNQCNLSVYVSDDGSLYIDQLLEIALRSDGSWM 267

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
           P+L+L+P  LG++ +N  Y   L   +TFPQ+LGIVGGKP AS Y +  Q+++  YLDPH
Sbjct: 268 PLLILIPTKLGIDTINEIYYRPLLDIYTFPQNLGIVGGKPRASLYFIASQDDNLFYLDPH 327

Query: 370 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
            VQ  I   + D +   S+Y  ++ +  ++  +DPSL I F+C  K+ F DF  R+ KL 
Sbjct: 328 TVQNSI---ESDSDFSLSSYFCNIPKKANISEVDPSLVIPFFCSTKESFLDFLERSKKL- 383

Query: 430 EESNGAPLFTVTQ 442
           E S+  PL+ + +
Sbjct: 384 ESSSEFPLYNIQE 396


>gi|260795879|ref|XP_002592932.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
 gi|229278156|gb|EEN48943.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
          Length = 380

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 116/349 (33%), Positives = 179/349 (51%), Gaps = 41/349 (11%)

Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
           W+LGV +   +D             E   D SSR+  +YRK F PIG +   SD GWGCM
Sbjct: 32  WILGVGYNTVKDRQ-----------ELQNDISSRLWFTYRKNFTPIGGTGPMSDQGWGCM 80

Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
           LR  QM++ QAL+   LGR WR      +D +Y +IL LF D + S +SIH + Q G + 
Sbjct: 81  LRCGQMMLGQALICRHLGRDWRWK-SAVYDNDYTKILQLFLDKKDSCYSIHQIAQMGVSE 139

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE----------DGER 283
           G + G W GP  + +  + LA  +   +        +AI+V   +              R
Sbjct: 140 GKSVGQWFGPNTVAQVLKKLALFEDWSS--------LAIHVAMDNTVIIDDIKKLCRSAR 191

Query: 284 GGAP------VVCIDDASRHCSVFSKGQA-DWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
              P       +C   ++   S  S+  A  W P++L++PL LGL ++NP Y   L+  F
Sbjct: 192 QPTPSQVTNSFLCNGVSAEQTSARSRSPALPWQPLMLIIPLRLGLSELNPVYTDCLKACF 251

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
           T  QSLG++GGKP  + Y +G    S +YLDPH  QP + + + ++    S++H      
Sbjct: 252 TLRQSLGMIGGKPNHAHYFIGYVGNSLVYLDPHTTQPAVEL-EGNVPIPDSSFHCTHPSR 310

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKL--AEESNGAPLFTVTQT 443
           +++  +DPS+A+GF+C+D+ DF D C    +L   +++  A +F V Q+
Sbjct: 311 MNIQDLDPSIALGFFCQDEADFADLCENMRRLIIGQKTQNA-MFEVVQS 358


>gi|224510547|pdb|2ZZP|A Chain A, The Crystal Structure Of Human Atg4b(C74s)- Lc3(1-124)
           Complex
          Length = 357

 Score =  198 bits (504), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 118/353 (33%), Positives = 176/353 (49%), Gaps = 40/353 (11%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 25  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 71

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWG MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 72  DTGWGSMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 131

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 183

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 184 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 243

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 244 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 303

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
                  + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +
Sbjct: 304 CQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVE 356


>gi|26334447|dbj|BAC30924.1| unnamed protein product [Mus musculus]
          Length = 396

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 111/353 (31%), Positives = 175/353 (49%), Gaps = 50/353 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+Y    +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYDSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
           D  + C V   G AD                     W P+LL+VPL LG+ ++NP Y+  
Sbjct: 181 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 240

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
            +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++I +  L  D + +  
Sbjct: 241 FKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDIEESGLVDDQTFHCL 300

Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
              + + + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 QSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 352


>gi|417410362|gb|JAA51656.1| Putative cysteine protease required for autophagy, partial
           [Desmodus rotundus]
          Length = 396

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 109/343 (31%), Positives = 175/343 (51%), Gaps = 27/343 (7%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
            G + G W GP          A+   W +LA     +  +  + +     V+  S D   
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADMPS 195

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
           E    P+    +A+ H    S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 196 ESSHDPL----NATNHNKAISACCPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++ +  D + +     + + + +
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQSPQRMSILN 311

Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 312 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 353


>gi|338729393|ref|XP_001490718.3| PREDICTED: cysteine protease ATG4A [Equus caballus]
          Length = 398

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 111/343 (32%), Positives = 179/343 (52%), Gaps = 27/343 (7%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDEDG 281
            G + G W GP          A+   W +LA     +  +  + +     I  +S D  G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTAG 197

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
           E   +P   ++ ++R  S  S G   W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 E---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 313

Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|189233733|ref|XP_971091.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270015047|gb|EFA11495.1| hypothetical protein TcasGA2_TC014208 [Tribolium castaneum]
          Length = 453

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 172/346 (49%), Gaps = 60/346 (17%)

Query: 108 SSTSDIWLLGVCHK-----------------IAQDEALGDAAGNNGLAEFNQDFSSRILI 150
           S  S +WLLG C++                   Q ++   ++ + G   F +DF SR+ +
Sbjct: 63  SKESPVWLLGKCYRRIESPSSDSTELGTDVAAFQSQSEIASSDDEGFEGFKKDFISRLWL 122

Query: 151 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDRE-YVE 208
           +YR+ F  +  S  +SD GWGCMLRS QML+AQAL+ H LGR WR +P  +P  RE ++E
Sbjct: 123 TYRREFPILNGSNYSSDCGWGCMLRSGQMLIAQALVCHILGRDWRWQPDHQPTTRESFIE 182

Query: 209 ------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
                 I+  FGD  S  SPFSIH L+  G+A G  AG W GP                 
Sbjct: 183 VVNHRKIIKWFGDKPSRNSPFSIHTLVALGEASGKKAGDWYGP----------------- 225

Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--------GQADWTPIL 312
            G        A    S  ED     +  VC+   ++ C+V+ K            W  ++
Sbjct: 226 -GFVAHLFRQAFKRAS--EDNYEFDSLTVCV---AQDCAVYIKDVMEECTDKNGKWKSLI 279

Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
           LL+P+ LG EK N  Y P L   F+  Q +GI+GG+P  S Y VG Q++  I+LDPH  Q
Sbjct: 280 LLIPVRLGAEKFNSIYAPCLTTLFSLKQCIGIIGGRPKHSLYFVGYQDDKLIHLDPHYCQ 339

Query: 373 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
            V+++   D     +++H    R IHL  +DPS  IGFYC  K+ F
Sbjct: 340 EVVDVWAVDFP--LTSFHCRSPRKIHLSKMDPSCCIGFYCPTKESF 383


>gi|118404310|ref|NP_001072464.1| autophagy related 4B, cysteine peptidase [Xenopus (Silurana)
           tropicalis]
 gi|115291929|gb|AAI21871.1| cysteine endopeptidase AUT-like (1O128) [Xenopus (Silurana)
           tropicalis]
          Length = 384

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 118/334 (35%), Positives = 172/334 (51%), Gaps = 20/334 (5%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
           D +SR+  +YR+ F  IG +  TSD GWGCMLR  QM+ AQALL   +GR WR   QK  
Sbjct: 45  DITSRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHIGRDWRWDKQKS- 103

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
             EY+ IL  F D + S +SIH + Q G   G   G W GP  + +    LA   +  + 
Sbjct: 104 QGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKCIGQWYGPNTVAQVLRKLAVFDQWSS- 162

Query: 263 LGCQSLPMAIYV-----VSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----WTPIL 312
                  +A+++     V  DE      A      +A   C+ ++ G +D     W P++
Sbjct: 163 -------IAVHIAMDNTVVMDEIRRLCRAGTNESSEAGALCNGYT-GVSDPSCSLWKPLV 214

Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
           LL+PL LGL  +N  YI TL+  F  PQSLG++GG+P ++ Y +G   +  IYLDPH  Q
Sbjct: 215 LLIPLRLGLSDINEAYIETLKHCFMVPQSLGVIGGRPNSAHYFIGYVGDELIYLDPHTTQ 274

Query: 373 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES 432
             +         D S +       +H+  IDPS+A+GF+CR ++DF+D+C +  KL+   
Sbjct: 275 LAVEPSDCCFVEDESFHCQHPPCRMHVSEIDPSIAVGFFCRSQEDFEDWCQQIKKLSLSG 334

Query: 433 NGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
              P+F V       +++ DVL  T    + D L
Sbjct: 335 GALPMFEVVDQLPLHLSNPDVLNLTPDSSDADRL 368


>gi|348563665|ref|XP_003467627.1| PREDICTED: cysteine protease ATG4A-like [Cavia porcellus]
          Length = 398

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 108/342 (31%), Positives = 175/342 (51%), Gaps = 25/342 (7%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
            G + G W GP          A+   W +LA     +  +  + +     V+    D   
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPFSADTAD 197

Query: 284 GGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
             +P   I  + S+  S F      W P+LL+VPL LG+ ++NP Y+   +  F  PQSL
Sbjct: 198 KSSPDSFITSNQSKDTSAFCPA---WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSL 254

Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 402
           G +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ ++
Sbjct: 255 GALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQQMNILNL 314

Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 315 DPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|449676306|ref|XP_002158689.2| PREDICTED: cysteine protease ATG4C-like [Hydra magnipapillata]
          Length = 442

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 117/344 (34%), Positives = 176/344 (51%), Gaps = 28/344 (8%)

Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNN------GLAEFNQDFSSRILISYRKGFDPIGDS 162
           S S IWLLG C+   Q E     A  N      G+  F +DFSS I +SYRK F  + +S
Sbjct: 63  SDSPIWLLGRCYYAKQAEYDSKNAVQNTQYKIHGIDCFFEDFSSLIYLSYRKHFSQLANS 122

Query: 163 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGD--SET 218
            +TSD GWGCMLR+ QML+A ALL H L   WR   +K  ++ Y+   IL  F D  S+ 
Sbjct: 123 NLTSDSGWGCMLRTGQMLLANALLIHMLKEGWRISERKYTEKNYIYRMILRFFNDENSDN 182

Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP-MAIYVVSG 277
           SPFS+H L++ G       G W GP ++  +  A          +   S P +  + V  
Sbjct: 183 SPFSLHELVRIGSK---KPGEWYGPTSVAHTLSA---------AVNLTSHPVLDTFRVYV 230

Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
             D       V+      ++C+  +  +  W  +L+LVP+ LG + +NP YIP L+   T
Sbjct: 231 ANDCTVYIKDVISTSTKCKNCTKKTCQEKFWRSMLILVPIRLGSDGLNPIYIPCLKALLT 290

Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 397
               +GI+GG+P  S Y VG Q +  I LDPH +Q  +++   +   ++   H    + +
Sbjct: 291 LDYCVGIIGGRPKHSLYFVGFQGKKLINLDPHYLQEYVDMTTQEFPVESFRCHYP--KKM 348

Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE---ESNGAPLF 438
               +DPS A+GFYCR ++DF+  C +A ++ +   +    P+F
Sbjct: 349 AFKKMDPSCAVGFYCRTREDFESLCKQAVEMLKPPMQRTEYPMF 392


>gi|197100863|ref|NP_001126588.1| cysteine protease ATG4A [Pongo abelii]
 gi|61211744|sp|Q5R699.1|ATG4A_PONAB RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|55732020|emb|CAH92717.1| hypothetical protein [Pongo abelii]
          Length = 398

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 178/356 (50%), Gaps = 53/356 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 328
           D  + C V                     SKG +     W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRVLPLGADTAGDRPPDSLTASNLSKGTSAYCSAWKPLLLIVPLRLGINQINPVY 240

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           +   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++ G++    D + 
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTGENGTVNDQTF 300

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +     + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|156395764|ref|XP_001637280.1| predicted protein [Nematostella vectensis]
 gi|156224391|gb|EDO45217.1| predicted protein [Nematostella vectensis]
          Length = 368

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 113/335 (33%), Positives = 175/335 (52%), Gaps = 41/335 (12%)

Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
           +  D+W+LG  + I Q    GD      +   N D  SRI ++YRK F  IG +  T+D 
Sbjct: 26  TEEDVWILGKRYNILQ----GD------MGYLNTDVRSRIWLTYRKNFPKIGGTGPTTDS 75

Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
           GWGCMLR  QM++AQAL+   LGR W+   +     EY++IL  F D + S +SIH + Q
Sbjct: 76  GWGCMLRCGQMMLAQALVCRHLGRDWQWDPENNTTPEYMQILEAFLDKKDSLYSIHQIAQ 135

Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
            G + G A GSW GP  + +  + L+      +        + ++V   +          
Sbjct: 136 MGVSEGKAVGSWFGPNTVAQVLKKLSAFDDWSS--------LCLHVAMDN---------T 178

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
           V I+D S           +W P++L +PL LGL ++N  Y   L+  FTF QSLGI+GG+
Sbjct: 179 VIIEDIS-----------NWRPLVLFIPLRLGLTEMNVVYNEPLKACFTFKQSLGIIGGR 227

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL-EADTSTYHSDVIRHIHLDSIDPSLA 407
           P  +TY +G    + +YLDPH  Q  +N   D+L      ++H      +++  +DPS+A
Sbjct: 228 PNHATYFIGYFGNNLVYLDPHTTQQTVN--PDELSRIPDGSFHCVYPCRMNIADVDPSVA 285

Query: 408 IGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
           +GF+C+ ++DFDD C +  K   +    P+F + +
Sbjct: 286 LGFFCKSEEDFDDLCQQIQKKIIDGKSRPMFEIAK 320


>gi|402911087|ref|XP_003918174.1| PREDICTED: cysteine protease ATG4A isoform 1 [Papio anubis]
          Length = 398

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 179/351 (50%), Gaps = 43/351 (12%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192

Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
            D  G+R    +   + +   S HC         W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 193 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 245

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++ +  D + +    
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVNDQTFHCLQS 305

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|449666316|ref|XP_002168183.2| PREDICTED: cysteine protease ATG4B-like [Hydra magnipapillata]
          Length = 436

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 115/366 (31%), Positives = 174/366 (47%), Gaps = 44/366 (12%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG   K  +D           + +FN +  ++   +YR+ F PIG +   SD GWGC
Sbjct: 31  VWILGKHFKPDED-----------MEKFNAEILTKFWFTYRRNFHPIGGTGPMSDTGWGC 79

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQALL   LGR W     +  +  Y+ ILH F D + S +SIH + Q G  
Sbjct: 80  MLRCGQMMLAQALLCRHLGRDWDWRSGRKDNEIYMMILHSFLDKKDSLYSIHQIAQMGVG 139

Query: 233 YGLAAGSWVGPYAMCRSWEALAR-------------------------CQRAETGLGCQS 267
            G   G W GP  + +  + L                           C+ +    GC  
Sbjct: 140 EGKQIGQWFGPNTVAQVIKKLVLFDDNADMAVHVAMDNTVVIEDIKKLCKSSINAWGCYG 199

Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHC-------SVFSKGQADWTPILLLVPLVLG 320
               I+  S     +    P  C  ++S+         S  S+    W P+LL +PL LG
Sbjct: 200 ECSYIHDRSSLTGNQSVSKPPHCSCESSQKLKSNRKLKSFNSEELQSWRPLLLFIPLRLG 259

Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
           L ++N  Y  +L++ FT  QSLG++GGKP  + Y +G   +  +YLDPH  Q  I   + 
Sbjct: 260 LSEINSDYYNSLKIMFTLRQSLGVIGGKPNHAHYFIGFNGDRLLYLDPHTTQQTIEPERF 319

Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
           ++  D S +H      +   S+DPS+A+GFYC  +DDFDD+C   ++L  +    P+F +
Sbjct: 320 NVIPDES-FHCVYPCFMSFQSLDPSVALGFYCHTEDDFDDWCQAVNELVVQREKRPMFEI 378

Query: 441 TQTHKK 446
            QT  +
Sbjct: 379 NQTRPR 384


>gi|403289551|ref|XP_003935915.1| PREDICTED: cysteine protease ATG4A isoform 1 [Saimiri boliviensis
           boliviensis]
          Length = 422

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 111/348 (31%), Positives = 179/348 (51%), Gaps = 37/348 (10%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 53  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 216

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
            D  G+R    +   ++ SR  S +      W P+LL+VPL LG+ ++NP Y+   +  F
Sbjct: 217 ADTPGDRPPDSLTASNE-SRGTSAYCPA---WKPLLLIVPLRLGINQINPVYVDAFKECF 272

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
             PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + 
Sbjct: 273 KMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQR 332

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 333 MNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 379


>gi|344286328|ref|XP_003414911.1| PREDICTED: cysteine protease ATG4A [Loxodonta africana]
          Length = 411

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 177/356 (49%), Gaps = 53/356 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  + +           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 42  VWILGKQHLLKTERS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 90

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 91  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 150

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 151 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 193

Query: 293 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 328
           D  + C VF                    SKG +     W P+LL+VPL LG+ ++NP Y
Sbjct: 194 DIKKMCCVFPLSAGAAGESPPAFPSASSQSKGTSACCPAWKPLLLIVPLRLGINQINPVY 253

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           +   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++ +  D + 
Sbjct: 254 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTF 313

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +     + +++ ++DPS+A+GF+C+++ DFD++C    K   + N   +F + Q H
Sbjct: 314 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCCLVQKEILKEN-LRMFELVQKH 368


>gi|350537069|ref|NP_001233457.1| cysteine protease ATG4A [Pan troglodytes]
 gi|343958112|dbj|BAK62911.1| cysteine protease ATG4A [Pan troglodytes]
 gi|410207960|gb|JAA01199.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
 gi|410248796|gb|JAA12365.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
 gi|410290856|gb|JAA24028.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
 gi|410329967|gb|JAA33930.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
          Length = 398

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 180/351 (51%), Gaps = 43/351 (12%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
            G + G W GP          A+   W +LA     +  +        C+ LP++I    
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSI---- 193

Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
            D  G+R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 194 -DTPGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 245

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|395528686|ref|XP_003766458.1| PREDICTED: cysteine protease ATG4B [Sarcophilus harrisii]
          Length = 393

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 115/364 (31%), Positives = 179/364 (49%), Gaps = 36/364 (9%)

Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
           +T  +W+LG  + I  ++            E   D +SR+  +YRK F  IG +  TSD 
Sbjct: 22  TTEPVWILGRKYTIFTEKE-----------EILSDVTSRLWFTYRKNFPAIGGTGPTSDT 70

Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
           GWGCMLR  QM+ AQAL+   LGR WR    +     Y  +L+ F D + S +SIH + Q
Sbjct: 71  GWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQ 130

Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGER 283
            G   G + G W GP  + +  + LA      +        +A+++     V  +E    
Sbjct: 131 MGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRL 182

Query: 284 GGAPVVCIDDAS--RHCSVFSKGQ----------ADWTPILLLVPLVLGLEKVNPRYIPT 331
             A   C D A+      + S G           + W P++LL+PL LGL  +N  Y  T
Sbjct: 183 CKAGFPCADGAAFPTDSELLSNGYPPAAEVTDRASPWRPLVLLIPLRLGLTDINEAYTET 242

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
           L+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +   +  +  D + +  
Sbjct: 243 LKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVESTEGGVFPDETFHCQ 302

Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS 451
                +++  +DPS+A+GF+C+ ++DF+D+C +  KL+      P+F + +      +  
Sbjct: 303 HPPCRMNIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSRIPGALPMFELVERQPSHFSCP 362

Query: 452 DVLG 455
           DVL 
Sbjct: 363 DVLN 366


>gi|146387686|pdb|2P82|A Chain A, Cysteine Protease Atg4a
 gi|146387687|pdb|2P82|B Chain B, Cysteine Protease Atg4a
 gi|146387688|pdb|2P82|C Chain C, Cysteine Protease Atg4a
 gi|146387689|pdb|2P82|D Chain D, Cysteine Protease Atg4a
          Length = 355

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 25  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 73

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 74  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 133

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 134 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 193

Query: 282 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
           +R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +  F  
Sbjct: 194 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 246

Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
           PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + ++
Sbjct: 247 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 306

Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 307 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 351


>gi|47212536|emb|CAF90552.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 366

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 111/324 (34%), Positives = 163/324 (50%), Gaps = 41/324 (12%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
           D +SR+  +YRKGF PIG +  TSD GWGCMLR  QM++ QAL+   LGR WR    +  
Sbjct: 68  DVTSRLWFTYRKGFPPIGGTGPTSDTGWGCMLRCGQMILGQALMCRHLGRDWRWVSGEEQ 127

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
             EYV IL+ F D + S +SIH + +                 +C  W   A    A  G
Sbjct: 128 RHEYVNILNAFIDKKDSYYSIHQIER-----------------LCMPWLDKAEACAASEG 170

Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
           +G             + +G   GA           C+   +  A W P++LL+PL LGL 
Sbjct: 171 VG-------------ELNGYLEGA-----------CAFSEEETALWKPLVLLIPLRLGLT 206

Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
            +N  YI TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  Q  ++  +D  
Sbjct: 207 DINEAYIETLKKCFMLPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQTAVDPCEDGT 266

Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
             D S +       +H+  +DPS+A GF+CR +D+FDD+C R  +L+   +  P+F + +
Sbjct: 267 FTDDSYHCQHPPCRMHICELDPSIAAGFFCRTEDEFDDWCMRIRRLSCNRDNLPMFELVE 326

Query: 443 THKKPVNHSDVLGETGGVPEDDSL 466
           +    +   D +  T    + + L
Sbjct: 327 SQPSHMVSVDAINLTPDFSDSERL 350


>gi|119623100|gb|EAX02695.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_f
           [Homo sapiens]
          Length = 402

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 179/351 (50%), Gaps = 43/351 (12%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 33  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 82  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 196

Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
            D  G+R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 197 ADTAGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 249

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 250 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 309

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 310 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 359


>gi|195113543|ref|XP_002001327.1| GI10728 [Drosophila mojavensis]
 gi|193917921|gb|EDW16788.1| GI10728 [Drosophila mojavensis]
          Length = 682

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 162/314 (51%), Gaps = 17/314 (5%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  L ++    G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 262 AAENQLAESPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 321

Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR   +      Y + +H      FGD  S+ SPFSIH L++ G+  G 
Sbjct: 322 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 381

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDE---DGERGGAP 287
             G W GP ++    + AL    R        S+ +A    IY+   +E     E    P
Sbjct: 382 KPGDWYGPASVSYLLKHALEHAARENADFDNISVYVAKDCTIYIQDIEELCSIPEPAPKP 441

Query: 288 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 347
            V    A R  S   K    W  +++L+PL LG +K+NP Y   L+L  +    LGI+GG
Sbjct: 442 HVPWQQAKRSTSDAPKPDQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEYCLGIIGG 501

Query: 348 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 407
           KP  S Y VG QE+  I+LDPH  Q ++++ ++       ++H    R +    +DPS  
Sbjct: 502 KPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETFP--MHSFHCKSPRKLKSSKMDPSCC 559

Query: 408 IGFYCRDKDDFDDF 421
           IGFYC  K DFD F
Sbjct: 560 IGFYCPTKTDFDSF 573


>gi|397497900|ref|XP_003819741.1| PREDICTED: cysteine protease ATG4A isoform 1 [Pan paniscus]
          Length = 398

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 179/351 (50%), Gaps = 43/351 (12%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192

Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
            D  G+R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 193 ADTPGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 245

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|30795252|ref|NP_443168.2| cysteine protease ATG4A isoform a [Homo sapiens]
 gi|426397036|ref|XP_004064734.1| PREDICTED: cysteine protease ATG4A isoform 1 [Gorilla gorilla
           gorilla]
 gi|61211859|sp|Q8WYN0.1|ATG4A_HUMAN RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
           cysteine endopeptidase; AltName: Full=Autophagin-2;
           AltName: Full=Autophagy-related cysteine endopeptidase
           2; AltName: Full=Autophagy-related protein 4 homolog A;
           Short=hAPG4A
 gi|18181956|dbj|BAB83889.1| Apg4A [Homo sapiens]
 gi|27763979|emb|CAD43218.1| autophagin-2 [Homo sapiens]
 gi|38197608|gb|AAH61696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
 gi|119623094|gb|EAX02689.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|189069378|dbj|BAG37044.1| unnamed protein product [Homo sapiens]
 gi|312151352|gb|ADQ32188.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [synthetic
           construct]
          Length = 398

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 179/351 (50%), Gaps = 43/351 (12%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192

Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
            D  G+R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 193 ADTAGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 245

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|328769729|gb|EGF79772.1| hypothetical protein BATDEDRAFT_35298 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 441

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 119/312 (38%), Positives = 168/312 (53%), Gaps = 32/312 (10%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 198
            F  DF SR+ ++YRKGF  I  +  T D GWGCMLRS QMLVA ALLFH LGR WR  L
Sbjct: 137 HFLDDFHSRLWMTYRKGFAAIKPTGYTCDSGWGCMLRSGQMLVANALLFHELGRDWR--L 194

Query: 199 QKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 254
               DR+    Y  IL  F D  TSP+SI  +   G  +    G W GP  + +  + L 
Sbjct: 195 GDSNDRDTWLTYCSILTKFLDVNTSPYSIQRIATLGIRFDKQIGEWFGPSTISQVLKVLV 254

Query: 255 RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILL 313
                      Q + + ++V     DG      +  I  A+R       G+   TP +L+
Sbjct: 255 NDD--------QRISLKVHV---SNDGVVYKNEINTILSATR-----DDGK---TPAVLI 295

Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
           ++PL LG+E +NP Y P ++  F     +GI GG+P +S + +GV  +  IYLDPH ++P
Sbjct: 296 MIPLRLGVETMNPVYYPGVKHCFAMSHCVGIAGGRPNSSLFFLGVDGDHLIYLDPHHLRP 355

Query: 374 VI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430
            +   +I    +E D  +YH + +R + + S+DPSL IGFYC    DFD  CA+ ++LA 
Sbjct: 356 SVDSRDITSYKME-DLLSYHCEKVRLLPIASMDPSLVIGFYCHSLKDFDVLCAKMTELAT 414

Query: 431 ESNGAPLFTVTQ 442
            S  APLF++ +
Sbjct: 415 GS--APLFSIEE 424


>gi|62860068|ref|NP_001016619.1| autophagy related 4A, cysteine peptidase [Xenopus (Silurana)
           tropicalis]
 gi|89269917|emb|CAJ81691.1| APG4 autophagy 4 homolog A (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
 gi|171846953|gb|AAI61565.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
 gi|213625518|gb|AAI70776.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
 gi|213627145|gb|AAI70802.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
          Length = 395

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 113/316 (35%), Positives = 163/316 (51%), Gaps = 27/316 (8%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
           D  SR+  +YRK F PIG +  +SD GWGCMLR  QM++AQAL+   LGR WR    K  
Sbjct: 52  DIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWRWEKHKEH 111

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
             EY +IL  F D +   +SIH + Q G   G + G W GP  + +  + LA      + 
Sbjct: 112 PEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNS- 170

Query: 263 LGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSK-----GQAD-WT 309
                  +A+Y      VV  D        P  C +  A+ + S +S+     GQ+  W 
Sbjct: 171 -------LAVYVSMDNTVVIEDIKTMCKYQPHSCSMAQAASYQSTWSRCRDASGQSSGWR 223

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
           P+LL+VPL LG+  +NP Y+   +  F  PQSLG +GGKP  + Y +G   +  IYLDPH
Sbjct: 224 PLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEIIYLDPH 283

Query: 370 DVQPVINIGKDDLEADTSTYHSDV-IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
             Q  +     D E    TYH       + + ++DPS+A+GF+C+D++DFD++C    K 
Sbjct: 284 TTQTFV-----DTEDQDQTYHCQKGPNSMKVLNLDPSVALGFFCKDENDFDNWCEVIEKE 338

Query: 429 AEESNGAPLFTVTQTH 444
             +     +F +T  H
Sbjct: 339 ILKHQSLRMFELTPKH 354


>gi|391340875|ref|XP_003744760.1| PREDICTED: cysteine protease ATG4D-like [Metaseiulus occidentalis]
          Length = 488

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 126/403 (31%), Positives = 191/403 (47%), Gaps = 48/403 (11%)

Query: 66  SEKKAVHNKSNGWTAAVK---RLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKI 122
           S  +AV N+  GW A +K     + +G+   I +           S    I+LLG  +  
Sbjct: 89  STSEAVKNRVRGWWANMKYGWNAMNSGAQIDISDL----------SGADPIYLLGHVYHN 138

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
             + A            F  DFS+R+  +YR+ F P+  +  TSD GWGCMLRS+QM++A
Sbjct: 139 KNNSA--------SFKNFFADFSTRLWFTYRQDFQPMQSTGHTSDSGWGCMLRSAQMMLA 190

Query: 183 QALLFHRLGRPWRKPLQKPFDREYV--EILHLFG---DSETSPFSIHNLLQAGKAYGLAA 237
           +A +FH LGR WR   Q+      V  +I+  F    D+  +PFS+HN+++A    G  A
Sbjct: 191 EAFIFHLLGRQWRWCPQQQQQEHGVHRKIIKWFSDDPDTTEAPFSVHNMVRAAAHCGKKA 250

Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQS---LPMAIYVVSGDEDGERGGAPVVCIDDA 294
           G W GP         L RC     G+         MAIYV              +   D 
Sbjct: 251 GDWFGPSTAAY---LLKRCLEEAAGVADSKEIFEQMAIYVAQD---------CTIYTQDV 298

Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
              C+  S    +W  ++LL+P+ LG E+VN  YI  ++    +   LGI+GGKP  S Y
Sbjct: 299 LDLCT--SDPNIEWKSVVLLIPVRLGGERVNVNYIHCIKEILAYQNCLGIIGGKPRHSLY 356

Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
            VG Q +  +YLDPH +Q   +  +  L    +++H    R +    +DPS  IGFYC+ 
Sbjct: 357 FVGFQGKKLVYLDPHYLQKTTDTSR--LNFSVNSFHCTTARKVSFSKLDPSATIGFYCKT 414

Query: 415 KDDFDDFCARASKLAE---ESNGAPLFTVTQTHKKPVNHSDVL 454
           + DF+ F +    + E   ++ G P+F +++     VN  + L
Sbjct: 415 RRDFESFQSIMQSVTESCPQNQGYPVFIISEGSSALVNQLNPL 457


>gi|410989157|ref|XP_004000831.1| PREDICTED: cysteine protease ATG4A isoform 1 [Felis catus]
          Length = 398

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 109/342 (31%), Positives = 178/342 (52%), Gaps = 25/342 (7%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
            G + G W GP          A+   W +LA     +  +  + +     V+    D   
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTVG 197

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
              P   ++ +++    F+   A W P+LL+VPL LG+ ++NP Y+   +  F  PQSLG
Sbjct: 198 ESTPGT-LNASNQSRGTFACCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSLG 255

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLDSI 402
            +GGKP  + Y +G   +  I+LDPH  Q  +N  +++   D  T+H     + +++ ++
Sbjct: 256 ALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVNT-EENGTVDDQTFHCLQSPQRMNILNL 314

Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 315 DPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|405953478|gb|EKC21133.1| Leucine-rich repeat-containing protein 6 [Crassostrea gigas]
          Length = 1114

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 182/364 (50%), Gaps = 32/364 (8%)

Query: 111 SDIWLLGVCHKIAQDEALGDAAGNN-------GLAEFNQDFSSRILISYRKGFDPIGDSK 163
           S +WLLG  + I   + + D             + +F QDFSS +  +YR+ F  I  +K
Sbjct: 226 SPVWLLGKFYHIKPSDLIDDDIQRGKRTRVVPNIEKFKQDFSSLLWFTYRQDFPAIPGTK 285

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV--EILHLFGD--SETS 219
           +TSD GWGCMLRS QM++A+AL  H LG  W     +  ++E    +I+  FGD   + S
Sbjct: 286 LTSDCGWGCMLRSGQMMLAKALTLHYLGPEWNVFSDQTREQETYRKQIIRWFGDYLCDES 345

Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGD 278
           PFS+H L++ GK  G   G W GP ++     E + + Q+ +T L      + +YV    
Sbjct: 346 PFSMHRLVEVGKNLGKQPGEWFGPASVAHILKETMVKGQKTQTVLS----DLCVYVSQDC 401

Query: 279 EDGERGGAPVVCI----------DDASRHCSVFSKGQADWT-PILLLVPLVLGLEKVNPR 327
              ++    + C              S H S       DW   +++L+P+ LG E++NP 
Sbjct: 402 TVYKQDIYELCCTRPRADTKFTNSTESEHESSQDASSMDWKRAVVILIPVRLGGEQLNPV 461

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           YIP ++   +    +GI+GGKP  S Y VG QE+  IYLDPH  Q V++  +        
Sbjct: 462 YIPCVKGLLSQDSCIGIIGGKPKHSLYFVGWQEDKLIYLDPHYCQDVVDTRERHFP--IQ 519

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA---EESNGAPLFTVTQTH 444
           +YH    R + +D IDPS  IGFYCR++ +F+ F  +  ++    ++    P+F  +  H
Sbjct: 520 SYHCMSPRKVSIDKIDPSCTIGFYCRNQKEFEKFVQQTEEMVAPPKQRLSYPMFVFSDGH 579

Query: 445 KKPV 448
              V
Sbjct: 580 SNEV 583


>gi|332226092|ref|XP_003262223.1| PREDICTED: cysteine protease ATG4A isoform 1 [Nomascus leucogenys]
          Length = 398

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 109/346 (31%), Positives = 178/346 (51%), Gaps = 33/346 (9%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTAG 197

Query: 282 ERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
           +R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +  F  
Sbjct: 198 DRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKECFKM 250

Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
           PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + ++
Sbjct: 251 PQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMN 310

Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 311 ILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|332026942|gb|EGI67039.1| Cysteine protease ATG4D [Acromyrmex echinatior]
          Length = 392

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 126/402 (31%), Positives = 191/402 (47%), Gaps = 49/402 (12%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
           S  S +WLLG+C+    +  L  A+                      N + EF +DF SR
Sbjct: 6   SKESPVWLLGLCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 65

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREY 206
           + ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR WR +P Q   +  +
Sbjct: 66  LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWRPEQSTDESSH 125

Query: 207 VEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRAE 260
             I+  FGD  T  SPFSIH L+  G + G  AG W GP    + +C++ E      RA 
Sbjct: 126 RMIIKWFGDQPTPESPFSIHKLVSLGASTGKRAGDWYGPSSVAHLLCQAME------RAS 179

Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 320
                +   +A+YV        +    V C  D  R              ++LLVPL LG
Sbjct: 180 EDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGGR------------KALILLVPLRLG 227

Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
            +K+NP Y P L    T    +G++GG+P  S Y +G Q++  I+LDPH  Q  +++  +
Sbjct: 228 ADKLNPVYAPCLTSLLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVDVEGN 287

Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK--LAEESNGAPLF 438
           + +   +++H    R + L  +DPS  +GFY  DK+   DF     +  +  ++   P+F
Sbjct: 288 E-KFPLTSFHCTSPRKMLLSKMDPSCCVGFYFPDKESLTDFMETIQQFVIPNQNMDYPMF 346

Query: 439 TVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHE 480
              +   K +     + E G +P     G  SM D +    E
Sbjct: 347 LFCEGSGKDLQQGIEVVE-GLLPSSSRFGHESMEDDLFECEE 387


>gi|345307034|ref|XP_001513122.2| PREDICTED: cysteine protease ATG4B-like [Ornithorhynchus anatinus]
          Length = 461

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 115/372 (30%), Positives = 177/372 (47%), Gaps = 53/372 (14%)

Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
           +T  +W+LG  + I  ++            +   D +SR+  +YRK F  IG +  TSD 
Sbjct: 91  TTEPVWILGRKYTIFTEKE-----------DILSDVTSRLWFTYRKNFPAIGGTGPTSDT 139

Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
           GWGCMLR  QM+ AQALL   LGR WR    +     Y  +L+ F D + S +SIH + Q
Sbjct: 140 GWGCMLRCGQMIFAQALLCRHLGRDWRWKKGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQ 199

Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
            G   G + G W GP  + +  + LA      +        +A+++   +          
Sbjct: 200 MGVGEGKSIGQWYGPNTVAQVLKKLAAFDTWSS--------LAVHIAMDN---------T 242

Query: 289 VCIDDASRHC--------SVF-----------------SKGQADWTPILLLVPLVLGLEK 323
           V I++  R C        S F                 +     W P++LL+PL LGL +
Sbjct: 243 VVIEEIRRLCKPNFPAGASAFPTDSEFLLNGFPSGAEVTNRPTQWKPLVLLIPLRLGLTE 302

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
           +N  YI TL+  F  PQSLG++GGKP ++ Y +G      IYLDPH  QP + I      
Sbjct: 303 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQPAVEISGSCFI 362

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
            D S +       +++  +DPS+A+GF+C+ ++DF+D+C +  KL+      P+F + + 
Sbjct: 363 PDESFHCQHPPCRMNIVELDPSIAVGFFCKTEEDFNDWCQQVKKLSLIRGALPMFELVEH 422

Query: 444 HKKPVNHSDVLG 455
                +  DVL 
Sbjct: 423 QPSHFSSPDVLN 434


>gi|15487240|emb|CAC69076.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
          Length = 398

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 110/348 (31%), Positives = 178/348 (51%), Gaps = 37/348 (10%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
            D  G+R    +    + S+  S +      W P+LL+VPL LG+ ++NP Y+   +  F
Sbjct: 193 ADTAGDRPPDSLTA-SNQSKGTSAYCTA---WKPLLLIVPLRLGINQINPVYVDAFKECF 248

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
             PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + 
Sbjct: 249 KMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQR 308

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 309 MNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|387762879|ref|NP_001248420.1| cysteine protease ATG4A [Macaca mulatta]
 gi|380809390|gb|AFE76570.1| cysteine protease ATG4A isoform a [Macaca mulatta]
 gi|383413573|gb|AFH30000.1| cysteine protease ATG4A isoform a [Macaca mulatta]
          Length = 398

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 178/351 (50%), Gaps = 43/351 (12%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 192

Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
            D  G+R    +   + +   S HC         W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 193 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 245

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 246 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 305

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 306 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|355705060|gb|EHH30985.1| Cysteine protease ATG4A, partial [Macaca mulatta]
          Length = 396

 Score =  194 bits (493), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 178/351 (50%), Gaps = 43/351 (12%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 190

Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
            D  G+R    +   + +   S HC         W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 191 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 243

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 244 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 303

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 304 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 353


>gi|355669953|gb|AER94691.1| ATG4 autophagy related 4-like protein A [Mustela putorius furo]
          Length = 408

 Score =  194 bits (493), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 109/343 (31%), Positives = 175/343 (51%), Gaps = 27/343 (7%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 39  VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 87

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 88  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 147

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 148 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTVG 207

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
           E     +    +AS        G+  W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 208 ESPPDTL----NASNQSKGTPAGRPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 263

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 264 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 323

Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 324 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 365


>gi|395854618|ref|XP_003799779.1| PREDICTED: cysteine protease ATG4A isoform 1 [Otolemur garnettii]
          Length = 398

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 109/344 (31%), Positives = 177/344 (51%), Gaps = 29/344 (8%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTAG 197

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
           E     +  ++ +       S  +  W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 ESPPGSLTALNQSKGT----SACRPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 400
           LG +GGKP  + Y +G      I+LDPH  Q  ++  +++   D  T+H     + +++ 
Sbjct: 254 LGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSPQRMNIL 312

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 313 NLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|449266947|gb|EMC77925.1| Cysteine protease ATG4B, partial [Columba livia]
          Length = 393

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 121/373 (32%), Positives = 180/373 (48%), Gaps = 39/373 (10%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD GWGC
Sbjct: 25  VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR    K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 74  MLRCGQMIFAQALVCRHLGRDWRWIKGKRQVDNYFSVLNAFVDRKDSYYSIHQIAQMGVG 133

Query: 233 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 273
            G + G W GP  + +         +W +LA            E    CQS      A  
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNAPCAGAAA 193

Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
             + + DG   G P    ++A          ++ W P++LL+PL LGL ++N  YI TL+
Sbjct: 194 CPAVESDGLYNGCP----EEAG-----VRDRRSLWKPLVLLIPLRLGLTEINEAYIETLK 244

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
             F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +    
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEHNDSGCLPDESFHCQHP 304

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDV 453
              + +  +DPS+A+GF+C  ++DF+D+C +  KL+      P+F + +      ++ DV
Sbjct: 305 PCRMSIAELDPSIAVGFFCNTEEDFNDWCQQIKKLSLVRAALPMFELVERQPSHFSNPDV 364

Query: 454 LGETGGVPEDDSL 466
           L  T    + D L
Sbjct: 365 LNLTPDSSDADRL 377


>gi|126338580|ref|XP_001366892.1| PREDICTED: cysteine protease ATG4B-like [Monodelphis domestica]
          Length = 396

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 116/375 (30%), Positives = 177/375 (47%), Gaps = 58/375 (15%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           +T  +W+LG  + I   +DE L D              +SR+  +YRK F  IG +  TS
Sbjct: 25  TTDPVWILGRKYTIFTEKDEILSDV-------------TSRLWFTYRKNFPAIGGTGPTS 71

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR    +     Y  +L+ F D + S +SIH +
Sbjct: 72  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWKQGRRQTDNYFNVLNAFIDKKDSYYSIHQI 131

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
            Q G   G + G W GP  + +  + LA      +        +A+++   +        
Sbjct: 132 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDN-------- 175

Query: 287 PVVCIDDASRHCSV-FSKGQA-------------------------DWTPILLLVPLVLG 320
             V ++D  R C   FS   A                          W P++LL+PL LG
Sbjct: 176 -TVVMEDIRRLCKANFSHTDAAALPPDSDLLSNGYPPGAEVTDRLSQWRPLVLLIPLRLG 234

Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
           L  +N  Y  TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  Q  + +   
Sbjct: 235 LTDINEAYTETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQAAVELSNG 294

Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
            +  D S +       +++  +DPS+A+GF+C+ ++DF+D+C +  KL+      P+F +
Sbjct: 295 GVIPDESFHCQHPPCRMNIGELDPSIAVGFFCKSEEDFNDWCQQVKKLSRIPGALPMFEL 354

Query: 441 TQTHKKPVNHSDVLG 455
            +      +  DVL 
Sbjct: 355 VEHQPSHFSCPDVLN 369


>gi|291407754|ref|XP_002720229.1| PREDICTED: autophagy-related cysteine endopeptidase 2 [Oryctolagus
           cuniculus]
          Length = 405

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 176/356 (49%), Gaps = 53/356 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 36  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 84

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 85  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 144

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 145 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 187

Query: 293 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 328
           D  + C V                     SKG +     W P+LL+VPL LG+ ++NP Y
Sbjct: 188 DIKKMCCVLPLSANTPGERLHDSLTASNQSKGTSACCPAWKPLLLIVPLRLGINQINPVY 247

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           +   +  F  PQSLG +GGKP  + Y +G      I+LDPH  Q  ++  ++    D + 
Sbjct: 248 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIFLDPHTTQTFVDTEENGTVDDQTF 307

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +     + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 308 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 362


>gi|224059752|ref|XP_002193231.1| PREDICTED: cysteine protease ATG4B [Taeniopygia guttata]
          Length = 393

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 122/373 (32%), Positives = 178/373 (47%), Gaps = 39/373 (10%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD GWGC
Sbjct: 25  VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR    K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 74  MLRCGQMIFAQALVCRHLGRDWRWIKGKRQMDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133

Query: 233 YGLAAGSWVGPYAMCR---------SWEALA-------RCQRAETGLGCQS---LPMAIY 273
            G + G W GP  + +         +W +LA            E    CQS      A  
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSHVPCAGAAA 193

Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
             + + D    G P    +D         +  A W P++LL+PL LGL ++N  YI TL+
Sbjct: 194 CPALESDVLYNGCP----EDVG-----LRERLALWKPLVLLIPLRLGLTEINEAYIETLK 244

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
             F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +  G      D S +    
Sbjct: 245 HCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPGDSGCLPDESFHCQHP 304

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDV 453
              + +  +DPS+A+GF+C  + DF+D+C +  KL+      P+F + +      ++ DV
Sbjct: 305 PCRMSIAELDPSIAVGFFCNTEADFNDWCQQIKKLSLVRGALPMFELVERQPSHFSNPDV 364

Query: 454 LGETGGVPEDDSL 466
           L  T    + D L
Sbjct: 365 LNLTPDSSDADRL 377


>gi|326925776|ref|XP_003209085.1| PREDICTED: cysteine protease ATG4B-like [Meleagris gallopavo]
          Length = 393

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 119/383 (31%), Positives = 177/383 (46%), Gaps = 59/383 (15%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD GWGC
Sbjct: 25  VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR    K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 74  MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVG 133

Query: 233 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 267
            G + G W GP  + +         +W +LA                 CQ   +  G  +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193

Query: 268 LPMA----IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
            P      +Y    +E G R    +                   W P++LL+PL LGL +
Sbjct: 194 CPTVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
           +N  YI TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +        
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCL 294

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
            D S +       + +  +DPS+A+GF+C  ++DF+D+C +  KL+      P+F + + 
Sbjct: 295 PDESFHCQHPPCRMSIAELDPSIAVGFFCHTEEDFNDWCHQIKKLSLVRGALPMFELVER 354

Query: 444 HKKPVNHSDVLGETGGVPEDDSL 466
                ++ DVL  T    + D L
Sbjct: 355 QPSHFSNPDVLNLTPDSSDADRL 377


>gi|354500801|ref|XP_003512485.1| PREDICTED: cysteine protease ATG4A-like [Cricetulus griseus]
 gi|344251116|gb|EGW07220.1| Cysteine protease ATG4A [Cricetulus griseus]
          Length = 398

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 175/356 (49%), Gaps = 53/356 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLRTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKG--QAD----------------------WTPILLLVPLVLGLEKVNPRY 328
           D  + C V   G   AD                      W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCCVLPVGAHTADESPPDSLPASSQGKGPSATCPAWKPLLLIVPLRLGINQINPVY 240

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           I   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  +  +  D + 
Sbjct: 241 IEAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEESGIVDDETF 300

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +     + + + ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 301 HCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 355


>gi|47087191|ref|NP_998738.1| cysteine protease ATG4B [Gallus gallus]
 gi|61211779|sp|Q6PZ02.1|ATG4B_CHICK RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related cysteine endopeptidase 2B;
           Short=Autophagin-2B; Short=cAut2B; AltName:
           Full=Autophagy-related protein 4 homolog B
 gi|45861662|gb|AAS78584.1| AUT2B [Gallus gallus]
          Length = 393

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 119/383 (31%), Positives = 177/383 (46%), Gaps = 59/383 (15%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD GWGC
Sbjct: 25  VWILGRKYSVFTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR    K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 74  MLRCGQMIFAQALVCRHLGRDWRWIKGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVG 133

Query: 233 YGLAAGSWVGPYAMCR---------SWEALA----------------RCQRAETGLGCQS 267
            G + G W GP  + +         +W +LA                 CQ   +  G  +
Sbjct: 134 EGKSIGQWYGPNTVAQVLKKLATFDTWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAA 193

Query: 268 LPMA----IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
            P      +Y    +E G R    +                   W P++LL+PL LGL +
Sbjct: 194 CPAVEADVLYNGYPEEAGVRDKLSL-------------------WKPLVLLIPLRLGLTE 234

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
           +N  YI TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +        
Sbjct: 235 INEAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCL 294

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
            D S +       + +  +DPS+A+GF+C  ++DF+D+C +  KL+      P+F + + 
Sbjct: 295 PDESFHCQHPPCRMSIAELDPSIAVGFFCHTEEDFNDWCHQIKKLSLVRGALPMFELVER 354

Query: 444 HKKPVNHSDVLGETGGVPEDDSL 466
                ++ DVL  T    + D L
Sbjct: 355 QPSHFSNPDVLNLTPDSSDADRL 377


>gi|47564112|ref|NP_001001171.1| cysteine protease ATG4A [Bos taurus]
 gi|61211781|sp|Q6PZ05.1|ATG4A_BOVIN RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related cysteine endopeptidase 2A;
           Short=Autophagin-2A; AltName: Full=Autophagy-related
           protein 4 homolog A; AltName: Full=bAut2A
 gi|45861656|gb|AAS78581.1| Aut2a [Bos taurus]
          Length = 398

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 111/344 (32%), Positives = 175/344 (50%), Gaps = 29/344 (8%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 281
            G + G W GP          A+   W +LA     +  +  + +      +S   D   
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
           ER    +     AS      S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 400
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  +++  AD  T+H     + +++ 
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFHCLQPPQRMNIL 312

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 313 NLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|349605276|gb|AEQ00569.1| Cysteine protease ATG4A-like protein, partial [Equus caballus]
          Length = 369

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 113/345 (32%), Positives = 182/345 (52%), Gaps = 33/345 (9%)

Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
           W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGCM
Sbjct: 1   WILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 49

Query: 174 LRSSQMLVAQALLFHRLGRP--WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
           LR  QM++AQAL+   LGR   W K  ++P  +EY  IL  F D +   +SIH + Q G 
Sbjct: 50  LRCGQMMLAQALICRHLGRDLNWEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGV 107

Query: 232 AYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDED 280
             G + G W GP          A+   W +LA     +  +  + +     I  +S D  
Sbjct: 108 GEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCCILPLSADTA 167

Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
           GE   +P   ++ ++R  S  S G   W P+LL+VPL LG+ ++NP Y+   +  F  PQ
Sbjct: 168 GE---SPPSSLNASNRSKST-SAGWPAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQ 223

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHL 399
           SLG +GGKP  + Y +G   +  I+LDPH  Q  ++  +++   D  T+H     + +++
Sbjct: 224 SLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSPQRMNI 282

Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
            ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 283 LNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 326


>gi|345807894|ref|XP_538136.3| PREDICTED: cysteine protease ATG4A [Canis lupus familiaris]
          Length = 398

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 109/343 (31%), Positives = 179/343 (52%), Gaps = 27/343 (7%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D  +R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDIRARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   REY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPREYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVV--SGDEDG 281
            G + G W GP          A+   W +LA     +  +  + +     V+  S D  G
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAIYVSMDNTVVIEDIKKMCCVLPLSADTIG 197

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
           E   +P+  ++ +++  S  +   A W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 E---SPLNTLNASNQSKSAPASCPA-WKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQSPQRMNILN 313

Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|296470926|tpg|DAA13041.1| TPA: cysteine protease ATG4A [Bos taurus]
          Length = 396

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 111/344 (32%), Positives = 175/344 (50%), Gaps = 29/344 (8%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 281
            G + G W GP          A+   W +LA     +  +  + +      +S   D   
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
           ER    +     AS      S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLD 400
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  +++  AD  T+H     + +++ 
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTADDQTFHCLQPPQRMNIL 312

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 313 NLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|351713264|gb|EHB16183.1| Cysteine protease ATG4B [Heterocephalus glaber]
          Length = 475

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 120/367 (32%), Positives = 180/367 (49%), Gaps = 40/367 (10%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TS
Sbjct: 100 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTS 146

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 147 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFLDRKDSYYSIHQI 206

Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
            Q G   G + G W GP          A+  +W +LA        +    +   I  +  
Sbjct: 207 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLA----VHVAMDNTVVMEEIRRLCR 262

Query: 278 DEDGERGGAPVVCIDDASRHCSVF----------SKGQADWTPILLLVPLVLGLEKVNPR 327
                 G A +    DA RHC+ F          S   + W P++LL+PL LGL  +N  
Sbjct: 263 SSLPCSGAAALPA--DADRHCNGFPAPMEVTSRPSPSPSPWRPLVLLIPLRLGLTDINEA 320

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           Y+ TL+  F  PQSLG++GGKP ++ Y +G   +  IYLDPH  QP + +       D +
Sbjct: 321 YVETLKRCFMMPQSLGVIGGKPNSAHYFIGYVGKELIYLDPHTTQPAVELTDGCFIPDET 380

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
            +       + +  +DPS+A+GF+C+ +DDF D+C +  KL+ +    P+F + +     
Sbjct: 381 FHCQHPPCRMGIGELDPSIAVGFFCKTEDDFRDWCQQVRKLSLQGGALPMFELVEQQPSH 440

Query: 448 VNHSDVL 454
           +   DVL
Sbjct: 441 LACPDVL 447


>gi|440790872|gb|ELR12135.1| autophagy protein 4, putative [Acanthamoeba castellanii str. Neff]
          Length = 510

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 115/346 (33%), Positives = 169/346 (48%), Gaps = 62/346 (17%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
           F  DF SR+ ++YR  F  IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR   +
Sbjct: 115 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 174

Query: 200 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR- 255
           +     Y E+L  F D  S  SP+SIH + + G + +    G W  P  +  +   L   
Sbjct: 175 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLLVTE 233

Query: 256 ---------------CQRAETGLGC---------QSLPMAIYVV---------------S 276
                            R E    C         Q  P+ +                  S
Sbjct: 234 HSPNGLKMYVPKDGIIYRKEVYQLCAVQPADGPAQHSPLRVDDDGGDTDHDGDTDGLESS 293

Query: 277 GDEDGERGGAP-----VVCIDDASRHCSVFSKGQAD------------WTPILLLVPLVL 319
            D      G P     +   D +S H  + S  +++            W P+++LVP+ L
Sbjct: 294 TDSMRHSHGNPGVPSTIEAGDYSSSHAELMSSAESECESLDDNFTELTWHPVIILVPVRL 353

Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
           G++ +NP YIPTL+  F+FPQ LG++GGKP +S Y VG Q+   +Y+DPH VQP + +  
Sbjct: 354 GIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMDPHFVQPTVKMDD 413

Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           D L     +Y  ++ + +  D IDPSLA+GF C  + +FDDFC  A
Sbjct: 414 DPL-FPIESYRMEIPQAMSFDDIDPSLALGFLCSSQAEFDDFCLNA 458


>gi|332375955|gb|AEE63118.1| unknown [Dendroctonus ponderosae]
          Length = 370

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 125/347 (36%), Positives = 169/347 (48%), Gaps = 44/347 (12%)

Query: 106 ISSSTSDIWLLGV-CHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK- 163
           I  ST  +WLLG   H I            N L    QD  S++  +YRK F PIG S  
Sbjct: 26  IPQSTEPVWLLGKKYHAI------------NELNTIRQDIVSKLWFTYRKDFVPIGGSDG 73

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFS 222
            TSD GWGCMLR  QM++ QAL+   LGR W+  P  +  D  Y+ IL  F DS  +PFS
Sbjct: 74  KTSDKGWGCMLRCGQMVLGQALMSIHLGRDWQWNPTTR--DATYLSILKKFEDSRKAPFS 131

Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
           IH +   G + G   G W GP  + +  + L +              +AI+V   +    
Sbjct: 132 IHQIASMGISEGKEVGQWFGPNTVAQVLKKLVKFDEGND--------VAIHVALDN---- 179

Query: 283 RGGAPVVCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
                VV I +    C   SK  AD     W P+LL+VPL LGL ++N  Y+  L+  F 
Sbjct: 180 -----VVIISEIRDLC--LSKETADVSTPHWKPLLLIVPLRLGLTQMNSIYLGGLKQCFQ 232

Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL--EADTS-TYHSDVI 394
           F QSLGI+GGKP ++ Y +G      IY DPH  Q   ++G  D   E D   +YH    
Sbjct: 233 FKQSLGIIGGKPNSALYFIGYVGNEVIYFDPHTTQKAGSVGNKDTSEEKDVDLSYHCKHA 292

Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 441
             + +  +DPS+A+ F CR + DF+D C        ++   PLF V+
Sbjct: 293 SRMSMLGMDPSVAVCFLCRSEADFNDLCQNIKDQLIKTESQPLFEVS 339


>gi|281342750|gb|EFB18334.1| hypothetical protein PANDA_015152 [Ailuropoda melanoleuca]
          Length = 373

 Score =  191 bits (485), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 112/357 (31%), Positives = 176/357 (49%), Gaps = 55/357 (15%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178

Query: 293 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 328
           D  + C V                     SKG       W P+LL+VPL LG+ ++NP Y
Sbjct: 179 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 238

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           +   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  +++   D  T
Sbjct: 239 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQT 297

Query: 389 YHS-DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +H     + + + ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 298 FHCLQSPQRMSILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 353


>gi|91083193|ref|XP_972923.1| PREDICTED: similar to Autophagy-specific protein, putative
           [Tribolium castaneum]
 gi|270006970|gb|EFA03418.1| hypothetical protein TcasGA2_TC013405 [Tribolium castaneum]
          Length = 366

 Score =  191 bits (485), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 117/321 (36%), Positives = 166/321 (51%), Gaps = 26/321 (8%)

Query: 135 NGLAEFN---QDFSSRILISYRKGFDPIG-DSKITSDVGWGCMLRSSQMLVAQALLFHRL 190
           N L E +   QD  S+I  +YRK F PIG D  +T+D GWGCMLR  QM++AQAL+   L
Sbjct: 33  NALQELDTIRQDILSKIWFTYRKNFVPIGGDEGLTTDKGWGCMLRCGQMVLAQALVTLHL 92

Query: 191 GRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
           GR W  +P  K  D  Y++IL  F D   +PFSIH +   G +     G W GP  + + 
Sbjct: 93  GRDWVWEPETK--DSTYLKILSKFVDKRQAPFSIHQIAMMGVSENKEVGQWFGPNTVAQV 150

Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
            + L +           +L   + +    E         +C+   S  CS       DW 
Sbjct: 151 LKKLVKYDEWSAIEMHIALDNTLIISDIRE---------LCLSQGSDGCS-----SGDWK 196

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
           P+LL+VPL LGL+++NP Y   L+  F F QSLG++GGKP  + Y +G   +  IYLDPH
Sbjct: 197 PLLLIVPLRLGLQEINPIYASGLKKCFQFKQSLGVIGGKPNLALYFIGHVGDEVIYLDPH 256

Query: 370 DVQP---VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
             Q    V +   ++     STYH      I++ S+DPS+A+ F+C  + +F+D C    
Sbjct: 257 TTQKSGSVESKETEEEIELDSTYHCKYASRINILSMDPSVAVCFFCNTEGEFNDLCHSIK 316

Query: 427 KLAEESNGAPLFTVTQTHKKP 447
           K   E    PLF +  T++KP
Sbjct: 317 KDLIEPEKQPLFEI--TYEKP 335


>gi|301780424|ref|XP_002925628.1| PREDICTED: cysteine protease ATG4A-like [Ailuropoda melanoleuca]
          Length = 429

 Score =  191 bits (484), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 174/356 (48%), Gaps = 53/356 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 60  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 108

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 109 MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 168

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 169 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 211

Query: 293 DASRHCSVF--------------------SKGQAD----WTPILLLVPLVLGLEKVNPRY 328
           D  + C V                     SKG       W P+LL+VPL LG+ ++NP Y
Sbjct: 212 DIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPLLLIVPLRLGINQINPVY 271

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           +   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + 
Sbjct: 272 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTF 331

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +     + + + ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 332 HCLQSPQRMSILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 386


>gi|426257739|ref|XP_004022480.1| PREDICTED: cysteine protease ATG4A [Ovis aries]
          Length = 398

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 112/357 (31%), Positives = 178/357 (49%), Gaps = 55/357 (15%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHC--------------------SVFSKGQAD----WTPILLLVPLVLGLEKVNPRY 328
           D  + C                    S  SKG +     W P+LL+VPL LG+ ++NP Y
Sbjct: 181 DIKKMCRTLSLSADTPAERPLESLTASTQSKGPSACCTAWKPLLLIVPLRLGINQINPVY 240

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           +   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  +++   D  T
Sbjct: 241 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDT-EENGTVDDQT 299

Query: 389 YHS-DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +H     + +++ ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 300 FHCLQPPQRMNILNLDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|151554833|gb|AAI47963.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Bos taurus]
          Length = 398

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 108/343 (31%), Positives = 172/343 (50%), Gaps = 27/343 (7%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVS--GDEDG 281
            G + G W GP          A+   W +LA     +  +  + +      +S   D   
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 197

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
           ER    +     AS      S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 198 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 253

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 254 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILN 313

Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 314 LDPSVALGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 355


>gi|225709006|gb|ACO10349.1| Cysteine protease ATG4B [Caligus rogercresseyi]
          Length = 381

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 111/345 (32%), Positives = 176/345 (51%), Gaps = 44/345 (12%)

Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
           S S +W+LG            + +  + + E N +  SR L +YRK F  I DS  TSD 
Sbjct: 28  SDSPVWILG-----------NELSARDDVEELNSEVLSRFLFTYRKEFLEIEDSGYTSDS 76

Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD----REYVEILHLFGDSETSPFSIH 224
           GWGCMLR  QM++A+AL    LGR W+   Q+  D    ++Y++IL LF DS+ +P+S+H
Sbjct: 77  GWGCMLRCGQMVLAEALQRVSLGREWKWSSQETLDNDQSQKYLQILKLFQDSKAAPYSLH 136

Query: 225 NLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
            +   G++       G+W GP  +    + L +   +ET     + P+ ++V   +    
Sbjct: 137 QIALMGESIQSKKPVGTWFGPNTIA---QVLRKLSVSET-----TNPIRVHVAMDN---- 184

Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
                 V +D+    C  F    +   P+LL +PL LGL ++NP Y   L+  F FPQ L
Sbjct: 185 -----TVIVDEIKESCG-FIGDPSQGKPLLLFIPLRLGLTEINPIYFQDLKECFEFPQIL 238

Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPH-----DVQPVINIGKDDLEADTSTYHSDVIRHI 397
           G++GG+P  + Y +G  +   IYLDPH         V+ +G     ++  TYH+D    +
Sbjct: 239 GVIGGRPNHALYFIGYMDNELIYLDPHVATQTSTPQVVTLGG----SEDKTYHTDRAYRM 294

Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
               +DPSL++ F C+D+ +F+D C R        + +PLF + +
Sbjct: 295 DFKDLDPSLSLCFLCKDESEFEDMCERFLFKLIRGHNSPLFEICR 339


>gi|187282046|ref|NP_001119770.1| uncharacterized protein LOC678769 [Rattus norvegicus]
 gi|169642267|gb|AAI60890.1| LOC678769 protein [Rattus norvegicus]
          Length = 406

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 112/364 (30%), Positives = 175/364 (48%), Gaps = 61/364 (16%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 327
           D  + C V   G AD                         W P+LL+VPL LG+ ++NP 
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           YI   +  F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  +  L  D +
Sbjct: 241 YIEAFKECFKMPQSLGALGGKPNNAYYFIGSLGDELIFLDPHTTQTFVDTEESGLVDDHT 300

Query: 388 TYHSDVIRHIHLDSIDPSLAI-------GFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
            +     + + + ++DPS+A+       GF+C+++ DFD++C+   K   + N   +F +
Sbjct: 301 FHCLQSPQRMSILNLDPSVALVGQGAFMGFFCKEEKDFDNWCSLVQKEILKEN-LRMFEL 359

Query: 441 TQTH 444
            Q H
Sbjct: 360 VQKH 363


>gi|345329187|ref|XP_003431344.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4A-like
           [Ornithorhynchus anatinus]
          Length = 436

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 172/359 (47%), Gaps = 54/359 (15%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 68  VWILGRQHHLKAEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 116

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W     K    EY +IL  F D +   +SIH + Q G  
Sbjct: 117 MLRCGQMMLAQALICRHLGRDWCWEKHKKQPEEYHKILQCFLDRKDCCYSIHQMAQMGVG 176

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 177 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 219

Query: 293 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 328
           D  + C +  +G                         A W P+LL+VPL LG+  +NP Y
Sbjct: 220 DIKKMCRLLPQGSGMAQDGPPLHLSALGRSKNASGYCAIWKPLLLIVPLRLGINHINPIY 279

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 280 IDAFKECFKTPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQTFVDTEENGQVDDHSF 339

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
           +     + + + ++DPS+A+GF+C+++ DFD++C+   K         +F + Q  K+P
Sbjct: 340 HCQQAPQRMKIMNLDPSVALGFFCKEEKDFDNWCSLVQKEILRQQSLRMFELVQ--KRP 396


>gi|198438023|ref|XP_002129793.1| PREDICTED: similar to CG6194 CG6194-PA [Ciona intestinalis]
          Length = 517

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 113/346 (32%), Positives = 169/346 (48%), Gaps = 39/346 (11%)

Query: 111 SDIWLLGVCHKIAQDEALGDAAGN----------------NGLAEFNQDFSSRILISYRK 154
           S +WLLG C+ + +     D + N                  L  F  DF S++  +YRK
Sbjct: 67  SPLWLLGKCYHLKKPSLSSDTSENAEGSQQSTSESYNMLPKHLKLFLVDFHSKLWFTYRK 126

Query: 155 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYV--EILH 211
           GF  + D+ +TSD GWGCMLR++QM++AQ+ + H LGR WR  P +   ++  +   I+ 
Sbjct: 127 GFPTLNDTNLTSDTGWGCMLRTAQMMIAQSFIVHLLGRNWRWTPSRLSMEQSDIHRNIIT 186

Query: 212 LFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
            F D +    PFS+H L + G +Y    G+W GP       +    C + +T L    L 
Sbjct: 187 WFLDEQNIRCPFSLHQLTEIGLSYRCKPGNWYGPNTAAYIMQDALECAKGKTEL----LN 242

Query: 270 MAIYVVSGDEDGERGGAPVVC-----IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
             +  ++ D          +C       DA    S  S  ++    +++L+P+ LG   +
Sbjct: 243 NIMVYIAQDSTVYIDDVIEMCEWKNTASDADLKTSTTSSNRS----VIVLIPVRLGEATL 298

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG--KDDL 382
           NP YIP ++   T  QS+GI+GGKP  S Y +G Q+E   YLDPH  Q   +    K+DL
Sbjct: 299 NPIYIPCIQSMLTLDQSVGIMGGKPKHSLYFIGFQDEYLFYLDPHYCQQADHPAAFKNDL 358

Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
                 YH +  R  ++  +DPS  +GFYCRD  DF  F   A+K 
Sbjct: 359 ---LQNYHCNSPRKTNISKMDPSCCLGFYCRDYKDFQSFVCEANKF 401


>gi|195995623|ref|XP_002107680.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
 gi|190588456|gb|EDV28478.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
          Length = 385

 Score =  189 bits (479), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 117/352 (33%), Positives = 167/352 (47%), Gaps = 54/352 (15%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ---DFSSRILISYRKGFDPIGDSKITSDVG 169
           +WLLG C+              N L EF++   D +S+   +YRK + PIG    TSD G
Sbjct: 25  VWLLGCCY--------------NPLEEFDKLIADINSKFWFTYRKNYPPIGGIGPTSDKG 70

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCMLR  QM++ QAL+   LGR WR    K     Y +IL LF DS+ S +SIH + Q 
Sbjct: 71  WGCMLRCGQMILGQALVMRHLGRDWRWFKNKEQLANYWKILKLFLDSKDSLYSIHQIAQM 130

Query: 230 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
           G + G     W GP    +  + L                M +YV   +         +V
Sbjct: 131 GVSEGKKISQWFGPNTAAQVLKKLIMFDEWSQ--------MGVYVAMDN---------IV 173

Query: 290 CIDDASR----HCSVFSKGQA--------------DWTPILLLVPLVLGLEKVNPRYIPT 331
            IDD  +    H +  S+G A               W P+LL +PL LGL  +NP Y   
Sbjct: 174 VIDDIKKICHNHITRTSQGNAANSDAQGSSNEQSNAWKPLLLFIPLRLGLTDLNPIYKDK 233

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
           L   F    +LGI+GGKP ++ Y +G+Q +  +YLDPH VQ  + + K +      TYH 
Sbjct: 234 LNKCFRIKNTLGIIGGKPNSAHYFIGIQGDYLLYLDPHTVQETVKV-KPNCPFSDKTYHQ 292

Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA-EESNGAPLFTVTQ 442
                +H   +DPS+A+GFY   +++F++ C   + +    S   PLF V +
Sbjct: 293 KGTNRLHFSYMDPSVALGFYSATEEEFNELCRDFTDVCILNSAQPPLFEVVE 344


>gi|426339171|ref|XP_004033533.1| PREDICTED: cysteine protease ATG4B isoform 3 [Gorilla gorilla
           gorilla]
          Length = 379

 Score =  188 bits (477), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 109/323 (33%), Positives = 163/323 (50%), Gaps = 25/323 (7%)

Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
           L S+R+  +  G +  TSD GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  
Sbjct: 37  LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96

Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
           +L+ F D + S +SIH + Q G   G + G W GP  + +  + LA      +       
Sbjct: 97  VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149

Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
            +A+++     V  +E        V C        D+ RHC+ F  G       + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208

Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
           +LL+PL LGL  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268

Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
           QP +         D S +       + +  +DPS+A+GF+C+ +DDF+D+C +  KL+  
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLL 328

Query: 432 SNGAPLFTVTQTHKKPVNHSDVL 454
               P+F + +     +   DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351


>gi|357620505|gb|EHJ72670.1| putative Autophagy-specific protein [Danaus plexippus]
          Length = 383

 Score =  188 bits (477), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 116/350 (33%), Positives = 177/350 (50%), Gaps = 34/350 (9%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I  +  ++W+LG  +   QD           L    +D +S I  +YRKGF PIGD  +T
Sbjct: 22  IPETKDNVWVLGKKYSAIQD-----------LERIRRDITSVIWCTYRKGFVPIGDEGLT 70

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 223
           SD GWGCMLR  QM++  AL+   L   W   +  P  R+  Y++I+    + + +P+SI
Sbjct: 71  SDKGWGCMLRCGQMVLGVALIKVHLSADW---VWTPETRDPTYLKIVQRLEERKQAPYSI 127

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H +   G   G   G W GP  + +  + L    +  +        + I+V   +   + 
Sbjct: 128 HQVALMGACEGKEVGQWFGPNTVAQVLKKLVVYDKWSS--------LVIHVALDNTVVKE 179

Query: 284 GGAPVVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
                  +++    CS    G   +DW P+LL+VPL LGL ++NP Y+  L++ F  PQS
Sbjct: 180 DILQQCIVNNDRGDCSENVDGFVVSDWMPLLLIVPLRLGLSEINPIYMEGLKICFQSPQS 239

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQP---VINIGKDDLEADTSTYHSDVIRHIH 398
           +G++GGKP  + Y++G   +  IYLDPH  Q    V N   D+ +    TYH      I 
Sbjct: 240 IGVIGGKPNQALYLIGCVGDEVIYLDPHTTQKSGLVENKLTDEQKEMDCTYHCKYASRIP 299

Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQTHKKP 447
           + S+DPS+A+ F CR + DFD+ C    K L +ES   PLF + +  K+P
Sbjct: 300 ILSMDPSVAVCFLCRTRSDFDELCELIEKRLMQESQ--PLFEICE--KRP 345


>gi|397483837|ref|XP_003813097.1| PREDICTED: cysteine protease ATG4B isoform 4 [Pan paniscus]
          Length = 379

 Score =  188 bits (477), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 109/323 (33%), Positives = 163/323 (50%), Gaps = 25/323 (7%)

Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
           L S+R+  +  G +  TSD GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  
Sbjct: 37  LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96

Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
           +L+ F D + S +SIH + Q G   G + G W GP  + +  + LA      +       
Sbjct: 97  VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149

Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
            +A+++     V  +E        V C        D+ RHC+ F  G       + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPL 208

Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
           +LL+PL LGL  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268

Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
           QP +         D S +       + +  +DPS+A+GF+C+ +DDF+D+C +  KL+  
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLL 328

Query: 432 SNGAPLFTVTQTHKKPVNHSDVL 454
               P+F + +     +   DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351


>gi|325184648|emb|CCA19140.1| cysteine protease family C54 putative [Albugo laibachii Nc14]
          Length = 459

 Score =  187 bits (476), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 125/370 (33%), Positives = 184/370 (49%), Gaps = 48/370 (12%)

Query: 107 SSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           S ++S +WLLG C+   QD    D+  +     ++  F S +  +YR+ F+ +     TS
Sbjct: 68  SQNSSKLWLLGDCYS-PQDFDNFDSMKD----AYHDAFESILWYTYRRDFETMVPYDFTS 122

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----LQKPFDREYVEILHLFGDS--ETS 219
           D GWGCMLRS+QML+++A   + LG  W+ P     L+ P  + YV++L  F DS     
Sbjct: 123 DAGWGCMLRSAQMLLSEAFKRNMLGIKWKIPARSEDLELP--KVYVKLLKWFVDSFDTEC 180

Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
            +SIHN+ + G  Y    G W GP          A+  R    L  Q  P    V+   +
Sbjct: 181 KYSIHNITRIGMQYDKLPGEWYGP-------TTAAQALRDLVNLHAQESPECNLVMYVPQ 233

Query: 280 DGERGGAPV--VCI---DDASRHCSVFSKGQADWT---------------------PILL 313
           DG      V  +CI   D  +   +V  + Q+D T                      +L+
Sbjct: 234 DGVVYTKDVNELCISHLDQENTFVNVNEETQSDGTFPDPLLHPPTDRDNSEKMWQKSLLI 293

Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
           L+PL LGL+ +NPRY+P ++  F FPQ++GI+GGK G S Y VG  +     LDPHD+ P
Sbjct: 294 LIPLRLGLDSINPRYLPAIQRVFEFPQNVGIIGGKKGHSVYFVGTFDSKLQLLDPHDIHP 353

Query: 374 VINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES 432
             ++      A    T HS +   + L SIDPSLA+GFYC D+ D+ DF  R  ++  E 
Sbjct: 354 TADLNTAFPTATHLRTVHSRLPLEMSLGSIDPSLALGFYCSDRKDYLDFVDRVDRVQSEL 413

Query: 433 NGAPLFTVTQ 442
            GA  F++ +
Sbjct: 414 GGALPFSIAK 423


>gi|340369400|ref|XP_003383236.1| PREDICTED: cysteine protease ATG4A-like [Amphimedon queenslandica]
          Length = 394

 Score =  187 bits (475), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 108/312 (34%), Positives = 159/312 (50%), Gaps = 40/312 (12%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           ++LLGV + + +D A            F +D  SR   +YRK F PIGD+  TSD GWGC
Sbjct: 45  VYLLGVKYDLPRDGA-----------SFVEDLQSRFWFTYRKNFRPIGDTGYTSDSGWGC 93

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
            LR  QML+   LL   LGR WR       D +Y +IL +F D   S +SI  +   G  
Sbjct: 94  TLRCGQMLLGHTLLLRHLGRDWRWSPSSSNDYKYQKILRMFLDYRDSEYSIQMIALQGAD 153

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
           +G + G W GP  + ++ + LA        +  Q   +A+YV             +V ID
Sbjct: 154 FGRSVGQWFGPNNVAQAIKRLA--------VHDQWSEVAVYVAMD---------MLVVID 196

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D S           ++ P+L+ +PL LG E+ N  Y   ++  F   QS+GI+GGKP  +
Sbjct: 197 DIS-----------NFRPVLVFIPLRLGQERFNMEYKEAVKACFAVRQSVGIIGGKPRHA 245

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
            +  G  ++  IYLDPH  Q  + +    + +D STYH+  I  +H+  +DPSLA+GF+C
Sbjct: 246 LWFTGYHDDYLIYLDPHKTQSCVTLPDAGIVSD-STYHTTQIERLHISELDPSLALGFFC 304

Query: 413 RDKDDFDDFCAR 424
           + + D DD C +
Sbjct: 305 QTEADLDDLCDK 316


>gi|339249735|ref|XP_003373855.1| cysteine protease ATG4B [Trichinella spiralis]
 gi|316969943|gb|EFV53966.1| cysteine protease ATG4B [Trichinella spiralis]
          Length = 410

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 123/381 (32%), Positives = 179/381 (46%), Gaps = 64/381 (16%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           +  S  ++W++G   ++ Q +   D           ++  SR+  +YRK F PIG +   
Sbjct: 28  LFKSGGEVWIVG---RVWQTQDFDD---------IKKEIRSRMWFTYRKSFSPIGGTGPI 75

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIH 224
           SD GWGCMLR  QML+AQAL+   LGR W+  P  +  D  YV IL +F D +   +SIH
Sbjct: 76  SDSGWGCMLRCGQMLLAQALICRHLGREWQWSPSCR--DEAYVRILRMFQDKKNELYSIH 133

Query: 225 NLLQAGKAYGLAAGSWVGP---------YAMCRSWEALA----------------RCQR- 258
            + + G++ G   G W GP          A+   W +LA                 C R 
Sbjct: 134 MIAKMGESEGKEIGKWFGPSTIAHVIKKLAIYDDWSSLAVHVAMDNVIVQEDVKKLCSRE 193

Query: 259 ---AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
              A      Q  P  I V    ED  +    V C + +S            W P+LL++
Sbjct: 194 VFDALRKRLLQEEPSEI-VADWFEDARKDNKKVDCANLSS-----------PWKPLLLIL 241

Query: 316 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
           P+ LGL ++NP YIP L+  F    ++G++GGKP  + Y +G  ++  +YLDPH  Q  +
Sbjct: 242 PMRLGLSELNPCYIPALKEFFACKYNIGMIGGKPNHALYFIGAYKDRLVYLDPHWCQTFV 301

Query: 376 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN-- 433
           ++       D S+YHS  I  I  + IDPSLAI FY   + +FDDFC  A ++    N  
Sbjct: 302 DLDVSMDLFDDSSYHSAFILDISFNEIDPSLAIAFYINTEAEFDDFCTFAKQVCLVGNFR 361

Query: 434 ------GAPLFTVTQTHKKPV 448
                    LF V Q +  P+
Sbjct: 362 CFSSGSMVQLFQVLQKYPNPL 382


>gi|297669945|ref|XP_002813144.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pongo abelii]
          Length = 378

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 109/323 (33%), Positives = 162/323 (50%), Gaps = 25/323 (7%)

Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
           L S+R+  +  G +  TSD GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  
Sbjct: 36  LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 95

Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
           +L+ F D + S +SIH + Q G   G + G W GP  + +  + LA      +       
Sbjct: 96  VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 148

Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
            +A+++     V  +E        V C        D+ RHC+ F  G       + W P+
Sbjct: 149 -LAVHIAMDNTVVMEEIRRLCRNSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 207

Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
           +LL+PL LGL  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  
Sbjct: 208 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 267

Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
           QP +         D S +       + +  +DPS+A+GF+C+ +DDF D+C +  KL+  
Sbjct: 268 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLL 327

Query: 432 SNGAPLFTVTQTHKKPVNHSDVL 454
               P+F + +     +   DVL
Sbjct: 328 GGALPMFELVEQQPSHLACPDVL 350


>gi|241999098|ref|XP_002434192.1| cystein protease, putative [Ixodes scapularis]
 gi|215495951|gb|EEC05592.1| cystein protease, putative [Ixodes scapularis]
          Length = 382

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 112/332 (33%), Positives = 167/332 (50%), Gaps = 43/332 (12%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
           L +   D +S+I ++YRK F  IG +  TSD GWGCMLR  QM++AQAL+   LGR WR 
Sbjct: 35  LDDLRSDVTSKIWLTYRKNFPAIGGTGPTSDSGWGCMLRCGQMVLAQALMRRHLGREWRW 94

Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
           +P  K  +++Y+ IL +F D +   FSIH + Q G + G   G W GP  +      LA 
Sbjct: 95  EPGTK--NKDYLYILRMFQDKKNCTFSIHQIAQMGVSEGKTVGEWFGPNTVAHVLRKLAI 152

Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR-HCSVFS------------ 302
             +  +        +AI+V   +          V I++ S+  C +++            
Sbjct: 153 FDKWSS--------LAIHVAMDN---------TVIINEISKFRCHIWAAADGLVRNRTNS 195

Query: 303 ------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
                   +  W P+LL +PL LGL ++N  Y   L+ TF   QSLG++GGKP  + Y +
Sbjct: 196 EPSRPANSEGSWKPLLLFIPLRLGLSEINRIYAFGLKRTFALKQSLGMIGGKPNHALYFI 255

Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
           GV E+  I+LDPH  Q   ++  D    D  +YH      +++  +DPS+A+ FY   + 
Sbjct: 256 GVVEDELIFLDPHTTQLACDLDVD--SPDDQSYHCAHASRMNISELDPSVALCFYMATES 313

Query: 417 DFDDFCARASKLAEESNGAPLFTVTQTHKKPV 448
           DFD +C    K        PLF +TQ   +PV
Sbjct: 314 DFDVWCNLVQKHLISRMQQPLFEITQ--DRPV 343


>gi|348666332|gb|EGZ06159.1| hypothetical protein PHYSODRAFT_532364 [Phytophthora sojae]
          Length = 398

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 115/309 (37%), Positives = 161/309 (52%), Gaps = 31/309 (10%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 197
           + + F + +  +YR+ F  +     TSD GWGCMLRS+QML+ QAL    LGR WR P  
Sbjct: 41  YKRSFEAILWFTYRRDFPQMTPYDFTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPAL 100

Query: 198 ----LQKPFDREYVEILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
               +      +YV +L  F DS      +SIH++++ G  Y    G W GP    +   
Sbjct: 101 FEAEIDARLPDKYVTLLRWFADSPDIECRYSIHHMVKLGMQYDKLPGEWYGPTTAAQVLR 160

Query: 252 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV-------FSKG 304
            L    R E G       +A+YV    ++G      VV  DD +R C          ++ 
Sbjct: 161 DLVNLHRREFGG-----ELAMYV---PQEG------VVYTDDVTRLCFFDPLLHPPTAED 206

Query: 305 QADW-TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
            +DW T +L+L+PL LGL++VN RY+P L  TF FPQS+GI+GGK G S Y VG Q++  
Sbjct: 207 SSDWSTALLILIPLRLGLDQVNERYVPALEKTFAFPQSVGIIGGKKGHSVYFVGTQQDQL 266

Query: 364 IYLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
             LDPHDV P   +      A    T HS     +++  IDPSLA+GF C ++ D++DF 
Sbjct: 267 HLLDPHDVHPAPELNPAFPTATHLRTVHSSRPLVMNVTGIDPSLALGFLCDNRADYEDFE 326

Query: 423 ARASKLAEE 431
            R   L +E
Sbjct: 327 RRVRILHDE 335


>gi|383860522|ref|XP_003705738.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like
           [Megachile rotundata]
          Length = 518

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 119/387 (30%), Positives = 178/387 (45%), Gaps = 58/387 (14%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
           S  S +WLLG  ++   +E L  A+                      + + EF +DF+SR
Sbjct: 126 SKESPVWLLGKIYRKKPEEFLEKASEAEKTLDTGSEISLAMDAISFEDSIEEFKKDFTSR 185

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
           + ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR WR    +P   E  
Sbjct: 186 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRWQPDQPIKTEQQ 245

Query: 208 E--------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
           +        I+  FGD     SPFSIH L+  G  +G  AG W GP        ++A   
Sbjct: 246 KLDESNHRFIIQSFGDLPERISPFSIHTLVSLGALWGKRAGDWYGP-------SSVAHLL 298

Query: 258 RAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILL 313
                   + LP    +A+YV              V + D    C +       W  ++L
Sbjct: 299 SQAVEHAAEHLPIFSNLAVYVAQD---------CAVYLQDVESVCQM---PDGKWKSLIL 346

Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
            VPL LG +K+NP Y   L    T    +G++GG+P  S Y +G QE+  I LDPH  Q 
Sbjct: 347 FVPLRLGTDKLNPVYTSCLTHLLTLDTCIGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE 406

Query: 374 VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL---AE 430
            +++ KD+     +++H    R + +  +DPS  +GFY  DK+ F +F   A       +
Sbjct: 407 TVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHDKNQFTNFMEIAPSYLVPED 464

Query: 431 ESNGAPLFTVTQTHKKPVNHSDVLGET 457
           E    P+F   +   K ++    + ET
Sbjct: 465 EKVDYPMFLFCEGSGKDLHQQIEIAET 491


>gi|213626921|gb|AAI70397.1| APG4A protein [Xenopus laevis]
          Length = 395

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 160/321 (49%), Gaps = 23/321 (7%)

Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
            +   D  SR+  +YRK F PIG +  +SD GWGCMLR  QM++AQAL+   LGR W+  
Sbjct: 43  CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALICQHLGRDWQWE 102

Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
             K    EY +IL  F D +   +SIH + Q G   G + G W GP  + +  + LA   
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162

Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 305
              +        +A+Y VS D          +C        +  A+ H   +S+ +    
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213

Query: 306 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
             + W P+LL+VPL LG+  +NP Y+   +  F  PQSLG +GGKP  + Y +G   +  
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273

Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
           IYLDPH  Q  ++  +     D + +       + +  +DPS+A+GF+C+D+++F+++C 
Sbjct: 274 IYLDPHTTQTFVDTEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 333

Query: 424 RASKLAEESNGAPLFTVTQTH 444
              K   +     +F +   H
Sbjct: 334 VIEKEILKHQSLRMFELIPKH 354


>gi|163914473|ref|NP_001106295.1| APG4A protein [Xenopus laevis]
 gi|161611704|gb|AAI55873.1| APG4A protein [Xenopus laevis]
          Length = 395

 Score =  185 bits (469), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 159/321 (49%), Gaps = 23/321 (7%)

Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
            +   D  SR+  +YRK F PIG +  +SD GWGCMLR  QM++AQAL+   LGR W+  
Sbjct: 43  CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 102

Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
             K    EY +IL  F D +   +SIH + Q G   G + G W GP  + +  + LA   
Sbjct: 103 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 162

Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 305
              +        +A+Y VS D          +C        +  A+ H   +S+ +    
Sbjct: 163 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 213

Query: 306 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
             + W P+LL+VPL LG+  +NP Y+   +  F  PQSLG +GGKP  + Y +G   +  
Sbjct: 214 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 273

Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
           IYLDPH  Q  +   +     D + +       + +  +DPS+A+GF+C+D+++F+++C 
Sbjct: 274 IYLDPHTTQTFVETEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 333

Query: 424 RASKLAEESNGAPLFTVTQTH 444
              K   +     +F +   H
Sbjct: 334 VIEKEILKHQSLRMFELIPKH 354


>gi|50417810|gb|AAH78135.1| APG4A protein, partial [Xenopus laevis]
          Length = 392

 Score =  185 bits (469), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 159/321 (49%), Gaps = 23/321 (7%)

Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
            +   D  SR+  +YRK F PIG +  +SD GWGCMLR  QM++AQAL+   LGR W+  
Sbjct: 40  CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWE 99

Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
             K    EY +IL  F D +   +SIH + Q G   G + G W GP  + +  + LA   
Sbjct: 100 KHKEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 159

Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC--------IDDASRHCSVFSKGQ---- 305
              +        +A+Y VS D          +C        +  A+ H   +S+ +    
Sbjct: 160 EWNS--------LAVY-VSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLE 210

Query: 306 --ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
             + W P+LL+VPL LG+  +NP Y+   +  F  PQSLG +GGKP  + Y +G   +  
Sbjct: 211 QSSGWRPLLLIVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEI 270

Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
           IYLDPH  Q  +   +     D + +       + +  +DPS+A+GF+C+D+++F+++C 
Sbjct: 271 IYLDPHTTQTFVETEEAGTVQDQTYHCQKGPNSMKVLKLDPSVALGFFCKDENEFNNWCE 330

Query: 424 RASKLAEESNGAPLFTVTQTH 444
              K   +     +F +   H
Sbjct: 331 VIEKEILKHQSLRMFELIPKH 351


>gi|452977855|gb|EME77619.1| hypothetical protein MYCFIDRAFT_191078 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 445

 Score =  184 bits (468), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 111/307 (36%), Positives = 159/307 (51%), Gaps = 45/307 (14%)

Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 175
           +EF  DF SR+ I+YR  F PI  S                        TSD GWGCM+R
Sbjct: 109 SEFLDDFESRVWITYRDAFPPIPKSSHPAAASKMSFTTKLRNFTNQAGFTSDTGWGCMIR 168

Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
           S Q L+A  ++ HRLGR WRK  +   +RE+ +IL LF D+  +PFSIH  ++ G +A G
Sbjct: 169 SGQSLLANTIVVHRLGRDWRKGQK---EREHKDILSLFADTPDAPFSIHKFVEHGAQACG 225

Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
              G W GP        A ARC RA T    Q+  + +Y    D D        V ID A
Sbjct: 226 TYPGEWFGP-------NATARCLRALTDKYHQA-GLRVYARPNDSD--------VYID-A 268

Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
               +       ++ P L+++ + LG+EKV P Y   L+     PQS+GI GG+P +S Y
Sbjct: 269 LTATATQKDANDEFQPTLIVLGIRLGIEKVTPAYHAALKAALELPQSMGIAGGRPSSSHY 328

Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
            VG Q ++  YLDPH  +P+++      + DT   H+  +R + L  +DPS+ +GF  R 
Sbjct: 329 FVGHQGDNFFYLDPHTTRPMLSPQPSAEDVDTC--HTRRVRRLSLAEMDPSMLLGFLVRS 386

Query: 415 KDDFDDF 421
           K+DF+++
Sbjct: 387 KEDFEEW 393


>gi|194389756|dbj|BAG60394.1| unnamed protein product [Homo sapiens]
          Length = 379

 Score =  184 bits (468), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 108/323 (33%), Positives = 162/323 (50%), Gaps = 25/323 (7%)

Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
           L S+R+  +  G +  TSD GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  
Sbjct: 37  LASHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFS 96

Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
           +L+ F D + S +SIH + Q G   G + G W GP  + +  + LA      +       
Sbjct: 97  VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS------- 149

Query: 269 PMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFSKGQ------ADWTPI 311
            +A+++     V  +E        V C        D+ RHC+ F  G       + W P+
Sbjct: 150 -LAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPL 208

Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
           +LL+PL LGL  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  
Sbjct: 209 VLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTT 268

Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
           QP +         D S +       + +  +DPS+A+G +C+ +DDF+D+C +  KL+  
Sbjct: 269 QPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGSFCKTEDDFNDWCQQVKKLSLL 328

Query: 432 SNGAPLFTVTQTHKKPVNHSDVL 454
               P+F + +     +   DVL
Sbjct: 329 GGALPMFELVEQQPSHLACPDVL 351


>gi|406042044|gb|AFS31124.1| autophagy related protein Atg4-like protein, partial [Spodoptera
           litura]
          Length = 365

 Score =  184 bits (467), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 116/351 (33%), Positives = 176/351 (50%), Gaps = 33/351 (9%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I  +   +W+LG  +   QD           L    +D +S I  +YRKGF PIGD  +T
Sbjct: 5   IPQTKESVWILGKKYSAIQD-----------LDRIRRDITSIIWCTYRKGFIPIGDEGLT 53

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPFSI 223
           SD GWGCMLR  QM++  AL+   L   W   +  P  R+  Y++I+  F + + +P+SI
Sbjct: 54  SDKGWGCMLRCGQMVLGVALVRVHLSADW---VWTPETRDPTYLKIIQRFEERKQAPYSI 110

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H +   G + G   G W GP  + +  + L    +  +        + I+V   +   + 
Sbjct: 111 HQVALMGASEGKQVGQWFGPNTVAQVLKKLTVYDKWSS--------LVIHVALDNTVVKE 162

Query: 284 GGAPVVCIDDASRHCSVFSKGQA-DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
                  +++    CS        DW P+LL+VPL LGL ++NP YI  L++ F  PQS+
Sbjct: 163 DILQQCVVNNDRGDCSAAPDSLVTDWMPLLLIVPLRLGLSEINPIYIDGLKICFQCPQSI 222

Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQP---VINIGKDDLEADTSTYHSDVIRHIHL 399
           G++GGKP  + Y+VG   +  IYLDPH  Q    V     D+ +    +YH      I +
Sbjct: 223 GVIGGKPNQALYLVGCVGDEVIYLDPHTTQRSGLVETKTTDEQKEMDWSYHCKYASRIPM 282

Query: 400 DSIDPSLAIGFYCRDKDDFDDFCAR-ASKLAEESNGAPLFTVTQTHKKPVN 449
            ++DPS+A+ F CR K DF++ CA   +KL  ES   PLF   +  K+P +
Sbjct: 283 LAMDPSVAVCFLCRTKRDFEELCATIETKLMCESQ--PLFETCE--KRPAH 329


>gi|426218487|ref|XP_004003478.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4B [Ovis
           aries]
          Length = 454

 Score =  184 bits (467), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 117/358 (32%), Positives = 168/358 (46%), Gaps = 42/358 (11%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 69  TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 115

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +                 
Sbjct: 116 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYCRVPP--------------- 160

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
            Q G   G + G W GP  + +  + LA    A + L          V++      R G 
Sbjct: 161 -QMGVGEGKSIGQWYGPNTVAQVLKKLAVFD-AWSALAVHVAMDNTVVMADVRRLCRSGL 218

Query: 287 PVVCID----DASRHCSVFSKG------QADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
           P    +    D+ RHC+ F  G       A W P++LL+PL LGL  VN  Y  TL+  F
Sbjct: 219 PCAGAEAFPADSERHCNGFPAGAEGGECTAPWRPLVLLIPLRLGLADVNAAYAGTLKHCF 278

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
             PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +       
Sbjct: 279 RMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDESFHCQHPPGR 338

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
           + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL
Sbjct: 339 MSITELDPSIAVGFFCKTEDDFNDWCQQVRKLSLLGGALPMFELVEQQPSHLACPDVL 396


>gi|307205961|gb|EFN84087.1| Cysteine protease ATG4D [Harpegnathos saltator]
          Length = 456

 Score =  184 bits (467), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 117/377 (31%), Positives = 179/377 (47%), Gaps = 51/377 (13%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
           S  S +WLLG C+    ++ L +A+                      N + EF +DF+SR
Sbjct: 62  SKESPVWLLGQCYLKKSEDPLENASEALEPEGTGSQVSLAMDATNFENTIEEFKRDFASR 121

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KP-------LQ 199
           + ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR W+ +P        Q
Sbjct: 122 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWKWRPEQSIENTQQ 181

Query: 200 KPFDREYVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
              D  +  I+  F D     SPFSIH L+  G + G  AG W GP ++      L++  
Sbjct: 182 MRDDSNHRMIIKWFADQSKPESPFSIHRLVSLGASTGKRAGDWYGPNSVAH---LLSQAV 238

Query: 258 RAETGLGCQSLP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
                L    L  +A+YV              V + D    C     G   W  ++LLVP
Sbjct: 239 ERTGELPNSKLSRLAVYVAQD---------CAVYMQDVEEVCRTSDGG---WKSLILLVP 286

Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
           L+LG +K+NP Y P +    T    +G++GG+P  S Y +G Q++  I+LDPH  Q  ++
Sbjct: 287 LMLGTDKLNPVYAPCVTSLLTLDACIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQETVD 346

Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA- 435
           + K++     +++H    R + L  +DPS  +GFY  +++   DF          SN   
Sbjct: 347 VSKENFPL--TSFHCTSPRKMLLSKMDPSCCVGFYFPNRESLTDFMETIHSFVIPSNQKT 404

Query: 436 --PLFTVTQTHKKPVNH 450
             P+F   +  KK +  
Sbjct: 405 DYPMFLFCEGSKKDLQQ 421


>gi|195054945|ref|XP_001994383.1| GH16873 [Drosophila grimshawi]
 gi|193892146|gb|EDV91012.1| GH16873 [Drosophila grimshawi]
          Length = 673

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 113/318 (35%), Positives = 161/318 (50%), Gaps = 21/318 (6%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  + +     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 253 AAENQVTECPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLIA 312

Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR   +      Y + +H      FGD  S+ SPFSIH L++ G+  G 
Sbjct: 313 QGLICHFLGRSWRYDPESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGK 372

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
             G W GP ++    + AL    +        S+ +A    IY+   ++     E     
Sbjct: 373 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAQDCTIYMQDVEQQCSIPEPAPKQ 432

Query: 288 VVCIDDASRHCSVFSK----GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
            V    A +  S   K     Q  W  +++L+PL LG +K+NP Y   L+L  +    LG
Sbjct: 433 HVPWQHAKKSTSDAPKLDQPPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 492

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
           I+GGKP  S Y VG QE+  I+LDPH  Q ++++ ++       ++H    R I    +D
Sbjct: 493 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETF--SMHSFHCKSPRKIKSSKMD 550

Query: 404 PSLAIGFYCRDKDDFDDF 421
           PS  IGFYC  K DFD F
Sbjct: 551 PSCCIGFYCATKTDFDSF 568


>gi|301104974|ref|XP_002901571.1| cysteine protease family C54, putative [Phytophthora infestans
           T30-4]
 gi|262100575|gb|EEY58627.1| cysteine protease family C54, putative [Phytophthora infestans
           T30-4]
          Length = 392

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 118/338 (34%), Positives = 172/338 (50%), Gaps = 26/338 (7%)

Query: 104 TGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK 163
           T  ++ ++ +WLLG   K   D A  D         + + F S +  +YR+ +  +   +
Sbjct: 14  TPSAALSAPVWLLG---KRYDDVAAVD------FDAYKRSFESILWFTYRRDYPAMTPYE 64

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGDSE 217
            TSD GWGCMLRS+QML+ QAL    LGR WR P      +       YV++L  F DS 
Sbjct: 65  HTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPALFETEIDARLPETYVQLLRWFADSP 124

Query: 218 TSP--FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
                +SIH +++ G  Y    G W GP    +    L    R E G           VV
Sbjct: 125 DVECRYSIHQMVKLGVQYDKLPGEWYGPTTAAQVLRDLVNLHRREFGGELSMYVPQEGVV 184

Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADW-TPILLLVPLVLGLEKVNPRYIPTLRL 334
             D+  +      +C  D   H    ++ ++DW T +L+L+PL LGL++VN RY+P ++ 
Sbjct: 185 YSDDVAK------LCFFDPLLHPPT-TEDKSDWSTALLILIPLRLGLDQVNERYVPAIQK 237

Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA-DTSTYHSDV 393
           +F FPQS+GI+GGK G S Y VG Q++    LDPHDV P   +      A    T HS  
Sbjct: 238 SFAFPQSVGIIGGKKGHSVYFVGTQQDQLHLLDPHDVHPAPELNTAFPTATHLRTVHSSR 297

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
              +++ +IDPSLA+GF C ++ D++DF  R   L +E
Sbjct: 298 PLVMNVTTIDPSLALGFLCENRVDYEDFERRVRILHDE 335


>gi|291202714|dbj|BAI82576.1| autophagy-related 4 [Haemaphysalis longicornis]
          Length = 387

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 110/338 (32%), Positives = 167/338 (49%), Gaps = 41/338 (12%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
           L +   D +S+I ++YR+ F  I  +  TSD GWGCMLR  QM VA+AL+   L R W+ 
Sbjct: 41  LDDLRSDVTSKIWLTYRRNFPAISGTDYTSDTGWGCMLRCGQMAVAEALMRRHLRRGWQW 100

Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
            P  +  D  Y+ +L +F D +   FSIH + Q G + G A G W GP  +      LA 
Sbjct: 101 APGIR--DESYLRVLRMFQDKKNCTFSIHQIAQMGVSEGKAVGQWFGPNTVAHVLRKLAA 158

Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA--------- 306
             +  +        +AI+V   +         VV +DD  + C + +  ++         
Sbjct: 159 FDKWSS--------LAIHVAMDN---------VVIMDDIRKVCRLEATAESGVRNRAEPA 201

Query: 307 --------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
                    W P+LL +PL LGL ++NP Y   L+ TF   QSLGI+GGKP  + YI+GV
Sbjct: 202 GLAAAAAESWKPLLLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYIIGV 261

Query: 359 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
             +  ++LDPH  Q  +++  D    D  +YH      + +  +DPS+A+ FY   + +F
Sbjct: 262 VGDDLVFLDPHTTQLAVDL--DTEFPDDESYHCAHASRMDIGQLDPSIALCFYLPTEAEF 319

Query: 419 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGE 456
           D +C  A K        PLF +T+   +P+   D + E
Sbjct: 320 DSWCNLAHKHLISEMSQPLFEITE--HRPLGWPDFVDE 355


>gi|354475125|ref|XP_003499780.1| PREDICTED: cysteine protease ATG4D [Cricetulus griseus]
 gi|344240088|gb|EGV96191.1| Cysteine protease ATG4D [Cricetulus griseus]
          Length = 474

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 117/382 (30%), Positives = 179/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +    E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----VHLCGRRYHFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQKP------------- 201
             +TSD GWGCMLRS QM++AQ LL H L R WR        P + P             
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLAPPEMPGPASPSRYRGPGR 193

Query: 202 --------------FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
                          DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 HVPPRWTQGTLEMEQDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R      C  +P  +  VS D          V   D +R  S +    A+
Sbjct: 250 ---SVVAHILRKAVE-KCSEVPRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 296

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  IGFY  ++ +F+  C+   +
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTIGFYAGNRKEFETLCSELMR 414

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436


>gi|225718596|gb|ACO15144.1| Cysteine protease ATG4B [Caligus clemensi]
          Length = 390

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/374 (31%), Positives = 184/374 (49%), Gaps = 54/374 (14%)

Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
           S S +W+LG            +    N +AE N +  SR+L +YRK F  I  S  TSD 
Sbjct: 28  SDSPVWILG-----------NELCARNDIAELNSEVLSRLLFTYRKEFSEIDGSGYTSDS 76

Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 222
           GWGCMLR  QM++ +AL    LGR W+        + +    +Y++IL+LF DS+ +P+S
Sbjct: 77  GWGCMLRCGQMVLGEALQRISLGRDWKWDHKVDNEVDEDLKGKYLKILNLFQDSKVAPYS 136

Query: 223 IHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
           IH +   G++       G+W GP  + +  + L+  ++        ++P+ ++V   +  
Sbjct: 137 IHQIALMGESIQSKKPVGTWFGPNTVAQVLKKLSFFEK--------TVPIRLHVAMDN-- 186

Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
                   V ID+    C  F  G ++  P+LL +PL LGL ++NP Y   L+  F FPQ
Sbjct: 187 -------TVIIDEIKESCG-FVGGDSE-KPLLLFIPLRLGLTEINPIYFQDLKECFEFPQ 237

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT------STYHSDVI 394
            LG++GG+P  + Y +G  +   IYLDPH     I+        DT       T+H++  
Sbjct: 238 ILGVIGGRPNHALYFIGYVDNELIYLDPH-----ISTQSASSTVDTFGGPQDQTHHTERA 292

Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHS 451
             +    +DPSL++ F CR++ +F+D C R        + +PLF + +    H  P+  S
Sbjct: 293 YRMDFKDLDPSLSLCFLCRNESEFEDMCERFLFKLIRGHNSPLFEICRQRPEHLMPLPLS 352

Query: 452 DVLGE--TGGVPED 463
             L       VPE+
Sbjct: 353 SSLNSDLPNAVPEE 366


>gi|347971093|ref|XP_554420.4| AGAP004023-PA [Anopheles gambiae str. PEST]
 gi|333469628|gb|EAL39379.4| AGAP004023-PA [Anopheles gambiae str. PEST]
          Length = 606

 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 115/318 (36%), Positives = 161/318 (50%), Gaps = 34/318 (10%)

Query: 136 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
           G+  F +DF SRI ++YR+ F  + DS  TSD GWGCM+RS QML+AQ L+ H LGR WR
Sbjct: 195 GIDAFRRDFISRIWMTYRREFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLVAHFLGRSWR 254

Query: 196 KPLQKPFDRE---YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 250
             +      E   + +++  FGD  S+TSPFSIH L+  GK  G   G W GP A+    
Sbjct: 255 WDVSMFTAYEESIHRKVIRWFGDTSSKTSPFSIHTLVALGKESGKKPGDWYGPGAVAHLL 314

Query: 251 EALARCQRAET----GLGCQ-SLPMAIYV--------VSGDEDG---ERGGAPVVCIDDA 294
               R    E     G+    +   A+Y+        V     G   +R GAP      +
Sbjct: 315 RQAVRLAAQEITDLDGINVYVAQDCAVYIQDILDECTVPATPAGAPWQRKGAPGGTNSSS 374

Query: 295 SRHCSVFSKG-----------QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
           S   +  S              A W  ++LLVPL LG +K+NP Y   L+   +    +G
Sbjct: 375 STAHTERSGATSCAEGDEDVQSAHWKSLILLVPLRLGTDKLNPIYNECLKAMLSLDYCIG 434

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
           I+GG+P  S Y VG QE+  I+LDPH  Q ++++ +D+     +++H    R + L  +D
Sbjct: 435 IIGGRPKHSLYFVGYQEDKLIHLDPHYCQDMVDVNQDNFP--VASFHCKSPRKMKLSKMD 492

Query: 404 PSLAIGFYCRDKDDFDDF 421
           PS  IGFYC  K DF  F
Sbjct: 493 PSCCIGFYCETKKDFYKF 510


>gi|350426238|ref|XP_003494376.1| PREDICTED: cysteine protease ATG4D-like [Bombus impatiens]
          Length = 486

 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 112/343 (32%), Positives = 167/343 (48%), Gaps = 30/343 (8%)

Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
           A+   +  +G+ EF +DF+SR+ ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191

Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
            H LGR WR  + +P   E  +        I+  FGD    TSPFSIH L+  G  +G  
Sbjct: 192 CHFLGREWRWQVDQPLKTEQQKLDEHNHRLIIKSFGDLPDSTSPFSIHTLVSLGALWGKR 251

Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 296
           AG W GP ++          Q AE      +L  A+YV              V + D   
Sbjct: 252 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 299

Query: 297 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
            C +       W  ++L VPL LG +K+NP Y   L    T    +G++GG+P  S Y +
Sbjct: 300 VCQM---PDGKWKSLILFVPLRLGADKLNPVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 356

Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
           G QE+  I LDPH  Q  +++ KD+     +++H    R + +  +DPS  +GFY  +K 
Sbjct: 357 GFQEDKLINLDPHYCQETVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHNKM 414

Query: 417 DFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGE 456
            F +F   A       +E    P+F   +   K +     + E
Sbjct: 415 QFTNFMEIAPSYLVPEDEKVDYPMFLFCEGSGKDLQQKIEIAE 457


>gi|357612380|gb|EHJ67950.1| autophagy related protein Atg4-like protein [Danaus plexippus]
          Length = 354

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 113/328 (34%), Positives = 164/328 (50%), Gaps = 49/328 (14%)

Query: 136 GLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
           G+  F  DF S+I ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR WR
Sbjct: 8   GIEGFKSDFISKIWMTYRREFPTMSGSSFTTDCGWGCMLRSGQMMLAQALVCHFLGRSWR 67

Query: 196 ---KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 244
              KP+Q    RE+ E      I+  FGD  S  SP SIH ++  G+A G   G W GP 
Sbjct: 68  WSEKPIQN--GREFQEDCLHRMIIKWFGDKSSVNSPLSIHQMVTLGEALGKKPGDWYGP- 124

Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV-------VCIDDASRH 297
                  ++A C +            ++ V +  E+ E     V       + I D   H
Sbjct: 125 ------ASVAHCLK------------SVMVEASKENYEFDKLEVYVAQDSTIYIQDVYTH 166

Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
           C +       W  ++LLVP+ LG E++NP Y P L    T    +GI+GG+P  S Y VG
Sbjct: 167 CRL---PNGCWKSLILLVPVKLGTERLNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVG 223

Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
            Q++  I+LDPH  Q ++++ + +      T+H    R + +  +DPS  IGFY +   D
Sbjct: 224 YQDDRLIHLDPHYCQEMVDVWQPNFSLQ--TFHCRSPRKMPISKMDPSCCIGFYLQTHHD 281

Query: 418 FDDFCARASKL-----AEESNGAPLFTV 440
           F+ F    +          SN  P+FT+
Sbjct: 282 FETFVNVINTFLTPQGVSSSNEYPMFTL 309


>gi|37991904|gb|AAR06350.1| putative autophagy, 3'-partial [Oryza sativa Japonica Group]
          Length = 207

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 94/179 (52%), Positives = 124/179 (69%), Gaps = 7/179 (3%)

Query: 39  KSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERV 98
           K  K S+LS +F+S FS+FE + +SSA+     H+ S  W+  ++R+   GSM R     
Sbjct: 35  KQLKNSILSCVFSSPFSIFEAHQDSSANRSLKPHSGSYAWSRFLRRIACTGSMWRF---- 90

Query: 99  LGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDP 158
           LG S+   + ++SD+W LG C+K++ +E    +   +G A F +DFSSRI I+YRKGFD 
Sbjct: 91  LGASK---ALTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDA 147

Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSE 217
           I DSK TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP QKP+  EY+ ILH+FGDSE
Sbjct: 148 ISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSE 206


>gi|242007959|ref|XP_002424782.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
 gi|212508305|gb|EEB12044.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
          Length = 388

 Score =  182 bits (462), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 128/389 (32%), Positives = 196/389 (50%), Gaps = 39/389 (10%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I  +   +W+LG  +   +D           +     D  S++  +YRKGF PIGDS +T
Sbjct: 21  IPQTREPVWILGRKYDAGRD-----------VTAIRSDIKSKLWFTYRKGFVPIGDSGLT 69

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 223
           SD GWGCMLR  QM++AQAL+   LGR WR  K  ++P   EY+ IL +F D++T+ +SI
Sbjct: 70  SDKGWGCMLRCGQMVLAQALVCLHLGRDWRWKKDSKEP---EYLRILKMFEDTKTATYSI 126

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H +   G + G   G W GP  + +  + L+   +  + +   +L   I V       +R
Sbjct: 127 HQIALMGVSEGKDVGQWFGPNTVTQVLKKLSVYDKWSSIVIHVALDNTIIVNDIKSLCQR 186

Query: 284 GGAPVVCIDDASRHCS-----VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
               V  ID +++  S     V+      W P+LL+VPL LGL ++NP Y+  L+  FTF
Sbjct: 187 NEQSV--IDSSAQKHSPLNEPVYFNSARKWKPLLLVVPLRLGLSEINPVYLNGLKTCFTF 244

Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS---TYHSDVIR 395
            QSLG++GGKP  + Y +G   E  IYLDPH  QPV  +   +L  + +   +YH     
Sbjct: 245 RQSLGVIGGKPNHALYFIGCVGEHVIYLDPHTTQPVSIVDGKELSYEKTADLSYHCPRAS 304

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT---HKKPVNHSD 452
              +  +DPS+A+ F+C  + +FD  C +  +   +S   PLF +T     H  PV +  
Sbjct: 305 RSRILDMDPSVAVCFFCSSEVEFDILCQQIQEKLIKSEKQPLFEITLNKPRHWIPVEN-- 362

Query: 453 VLGETGGVPEDDSLGVMSMNDAVGNAHED 481
                   P + +L +     +  N+ ED
Sbjct: 363 --------PVERTLNLQDYERSFENSDED 383


>gi|380015613|ref|XP_003691794.1| PREDICTED: cysteine protease ATG4D-like [Apis florea]
          Length = 486

 Score =  181 bits (458), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 112/347 (32%), Positives = 167/347 (48%), Gaps = 38/347 (10%)

Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
           A+   +  +G+ EF +DF+SR+ ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+
Sbjct: 132 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 191

Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
            H LGR WR    +P   E  +        I+  FGD    TSPFSIH L+  G  +G  
Sbjct: 192 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 251

Query: 237 AGSWVGP----YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
           AG W GP    + + ++ E  A    A   L       A+YV              V + 
Sbjct: 252 AGDWYGPSSVAHLLSQAVENAAERHPAFNNL-------AVYVAQD---------CAVYLQ 295

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D    C         W  ++L VPL LG +K+NP Y   L    T    +G++GG+P  S
Sbjct: 296 DIENVCQT---PDGKWKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 352

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
            Y +G QE+  I LDPH  Q  +++ KD+     +++H    R + +  +DPS  +GFY 
Sbjct: 353 LYFIGFQEDKLINLDPHYCQETVDVLKDNFSL--TSFHCTSPRKMLISKMDPSCCVGFYF 410

Query: 413 RDKDDFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGE 456
            +K  F +F   A       +E    P+F   +   K ++    + E
Sbjct: 411 HNKMQFTNFMEIAPSYLVPEDEKIDYPMFLFCEGSGKDLHQKIEIAE 457


>gi|327267215|ref|XP_003218398.1| PREDICTED: cysteine protease ATG4B-like [Anolis carolinensis]
          Length = 393

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 119/401 (29%), Positives = 186/401 (46%), Gaps = 61/401 (15%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD GWGC
Sbjct: 25  VWILGRKYSVLTEKE-----------EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGC 73

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR    K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 74  MLRCGQMIFAQALICRHLGRDWRWSKGKKQTDSYYNVLNAFIDKKDSYYSIHQIAQMGVG 133

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +    LA      +        +A+++   +          V ++
Sbjct: 134 EGKSIGQWYGPNTVAQVLRKLASFDTWSS--------LAVHIAMDN---------TVVME 176

Query: 293 DASRHC---------SVFSKGQADW------------------TPILLLVPLVLGLEKVN 325
           +  R C         S F   + D+                   P++LL+PL LGL  +N
Sbjct: 177 EIRRLCKPSCPCPGASAFPAAEPDFLSNGYPEGAECTDRLLLWKPLVLLIPLRLGLTDIN 236

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
             YI TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D
Sbjct: 237 EAYIETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPMDSCYIPD 296

Query: 386 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 445
            S +       + +  +DPS+A+GF+C  ++DF+D+C R  KL+      P+F + +   
Sbjct: 297 ESFHCQHPPCRMSIAELDPSIAVGFFCNSEEDFNDWCQRIKKLSLIRGALPMFELVEHQP 356

Query: 446 KPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
              +  DVL  T    + D L      +   ++ ++D+++L
Sbjct: 357 SHFSSPDVLNLTPDSSDADRL------ERFFDSEDEDFEIL 391


>gi|328786958|ref|XP_393739.4| PREDICTED: cysteine protease ATG4D-like [Apis mellifera]
          Length = 525

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 114/347 (32%), Positives = 171/347 (49%), Gaps = 38/347 (10%)

Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
           A+   +  +G+ EF +DF+SR+ ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+
Sbjct: 171 AMDAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALV 230

Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
            H LGR WR    +P   E  +        I+  FGD    TSPFSIH L+  G  +G  
Sbjct: 231 CHFLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKR 290

Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLP----MAIYVVSGDEDGERGGAPVVCID 292
           AG W GP ++     A    Q  E  +  +  P    +A+YV              V + 
Sbjct: 291 AGDWYGPSSV-----AHLLSQAVENAV--ERHPAFNNLAVYVAQD---------CAVYLQ 334

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D    C   S G+  W  ++L VPL LG +K+NP Y   L    T    +G++GG+P  S
Sbjct: 335 DIENVCQT-SDGK--WKSLILFVPLRLGADKLNPVYTSCLTHLLTLDTCIGVIGGRPRHS 391

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
            Y +G QE+  I LDPH  Q  +++ KD+     +++H    R + +  +DPS  +GFY 
Sbjct: 392 LYFIGFQEDKLINLDPHYCQETVDVLKDNFSL--TSFHCTSPRKMLISKMDPSCCVGFYF 449

Query: 413 RDKDDFDDFCARASKL---AEESNGAPLFTVTQTHKKPVNHSDVLGE 456
            +K  F +F   A       +E    P+F   +   K ++    + E
Sbjct: 450 HNKMQFTNFMEIAPSYLVPEDEKIDYPMFLFCEGSGKDLHQKIEIAE 496


>gi|213513159|ref|NP_001133247.1| cysteine protease ATG4B [Salmo salar]
 gi|209147572|gb|ACI32896.1| Cysteine protease ATG4B [Salmo salar]
 gi|223647372|gb|ACN10444.1| Cysteine protease ATG4B [Salmo salar]
          Length = 397

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 115/312 (36%), Positives = 166/312 (53%), Gaps = 18/312 (5%)

Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
           +SR+  +YRK F PIG +  TSD GWGCMLR  QM++ +AL+   LGR WR    +    
Sbjct: 47  TSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALVRRHLGRDWRWVRSQSQRE 106

Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEALAR 255
           +Y+ IL+ F D +   +S+H + Q G   G + G W GP          A+  SW  L  
Sbjct: 107 DYISILNAFLDKKDGYYSLHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDSWSRLTV 166

Query: 256 CQRAETGLGCQS-----LPMAIYVVSGDEDGERG-GAPVVCIDDASRHCSVFSKGQADWT 309
               +  +  +      +P   Y  +   D + G   P  C++ A   C++  +  A W 
Sbjct: 167 HVAMDNTVVIEEIKRLCMPWLDYGGAACVDLQGGMPEPNGCLEGA---CALAEEETALWK 223

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
           P+LLL+PL LGL  +N  YI TL+  F  PQSLG++GGKP  + Y +G   E  IYLDPH
Sbjct: 224 PLLLLIPLRLGLSDINEAYIETLKQCFQLPQSLGVIGGKPNHAHYFIGYVGEELIYLDPH 283

Query: 370 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
             QP +   +D    D + +       +H+  IDPS+A+GF+CR +DDFDD+C R  KL+
Sbjct: 284 TTQPAVEPCEDSQVPDDTYHCQHPPCRMHICEIDPSIAVGFFCRTEDDFDDWCMRFRKLS 343

Query: 430 EESNGAPLFTVT 441
               G P+F + 
Sbjct: 344 HTRAGLPMFELV 355


>gi|449268268|gb|EMC79138.1| Cysteine protease ATG4C [Columba livia]
          Length = 459

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 120/405 (29%), Positives = 181/405 (44%), Gaps = 82/405 (20%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 155
           S  S ++LLG C+    DE+ G+ +  G+N           + EF +DF SRI ++YR+ 
Sbjct: 36  SRNSPVFLLGKCYHFKTDES-GELSTDGSNFDKINTEISGNVEEFRKDFISRIWLTYREE 94

Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 194
           F  I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                     
Sbjct: 95  FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIDSSDSESWTAHTV 154

Query: 195 --------------RKP----------LQKPFDRE-------YVEILHLFGDSETSPFSI 223
                         R+P          L++ +D         + +I+  FGDS  + F +
Sbjct: 155 KKLTASFEASLTAEREPKILSNHHRGTLKRNWDESERRNEVYHRKIISWFGDSPLTAFGL 214

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H L++ GK  G  AG W GP  +           R     G     + IYV         
Sbjct: 215 HQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTIYVAQD------ 263

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
               V   D   R CS    G+AD   +++LVP+ LG E+ N  Y+  ++   +    +G
Sbjct: 264 --CTVYSSDVIDRQCSFMDSGEADTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVG 321

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
           I+GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +D
Sbjct: 322 IIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMD 379

Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
           PS  IGFYCR   DF+      +K+ + S+    PLFT  + H +
Sbjct: 380 PSCTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 424


>gi|22658287|gb|AAH30861.1| Autophagy-related 4D (yeast) [Mus musculus]
 gi|74152222|dbj|BAE32395.1| unnamed protein product [Mus musculus]
          Length = 474

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 175/382 (45%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S   S + L G C+            G   + +F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQQFQRDFVSRLWLTYRRDFPPLAG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
             +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193

Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R      C  +   +  VS D          V   D +R  S +    A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ +     +  ++H    R +    +DPS  +GFY  ++ +F+  C+   +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436


>gi|195401363|ref|XP_002059283.1| GJ16311 [Drosophila virilis]
 gi|194156157|gb|EDW71341.1| GJ16311 [Drosophila virilis]
          Length = 397

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 111/388 (28%), Positives = 183/388 (47%), Gaps = 48/388 (12%)

Query: 91  MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 141
           M  + E  LGP             I    +D+WLLG  +   Q+  L             
Sbjct: 13  MDSVFEAYLGPDSMLAGAVGEPEDIPKRNTDVWLLGKRYNAIQELEL-----------IR 61

Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
           +D  SR+  +YR GF P+G+ ++T+D GWGCMLR  QM++AQAL+   LGR W      P
Sbjct: 62  RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTP 118

Query: 202 --FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
              D  Y++I++ F D+  S +SIH +   G++   A G W+GP  + +  + L R    
Sbjct: 119 DCRDATYLKIVNRFEDTRKSFYSIHQIALTGESQNKAVGEWLGPNTVAQILKILVRFDDW 178

Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
            +        + ++V              V +D+    C   S   + W P+LL+VPL L
Sbjct: 179 SS--------LVVHVAMDS---------TVVLDEIYTRCQEVSA--STWKPLLLIVPLRL 219

Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
           G+  +NP YIP L+       S G++GG+P  + Y +G  ++  +YLDPH  Q   ++ +
Sbjct: 220 GISDINPMYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGSVAQ 279

Query: 380 DDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 436
               A+     +YH      +   ++DPSLA+ F C+ ++ FD+   +  +         
Sbjct: 280 KTTAAEQELDESYHQKYAARLSFGAMDPSLAVCFLCKTRNSFDELLQQLRQEVLSLCTPA 339

Query: 437 LFTVTQTHKKPVNHSDVLGETGGVPEDD 464
           LF ++Q+     + +D + E   +P+ D
Sbjct: 340 LFEISQSRAVDWDTADDI-EWPAMPDID 366


>gi|346466653|gb|AEO33171.1| hypothetical protein [Amblyomma maculatum]
          Length = 401

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 110/327 (33%), Positives = 168/327 (51%), Gaps = 18/327 (5%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
           L +   D +S+I ++YRK F  I  +  TSD GWGCMLR  QM++A+AL+   LG+ W+ 
Sbjct: 54  LDDLRNDVTSKIWLTYRKNFPAISGTDHTSDTGWGCMLRCGQMVIAEALMRRHLGKGWQW 113

Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
            P  +  D  Y+ +L +F D +   +SIH + Q G + G A G W GP  +      L+ 
Sbjct: 114 APGIR--DENYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKAVGQWFGPNTIAHVLRKLSA 171

Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA-----DWTP 310
             +  + L        + V+       R   P V  DD  RH    S G A      W P
Sbjct: 172 FDKW-SSLAVHVAMDNVVVMDDIRKICRVETPAV--DDGVRH-RTQSHGLACASAVSWKP 227

Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
           +LL +PL LGL ++NP Y   L+ TF   QS+GI+GGKP  + +I+GV  +  ++LDPH 
Sbjct: 228 LLLFIPLRLGLNEINPVYYCGLKRTFALKQSVGIIGGKPNHALFIIGVVGDDLVFLDPHT 287

Query: 371 VQPVINIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
            Q  +++   D+E  +  +YH      + +  +DPS+A+ FY   + +FD +C  A K  
Sbjct: 288 TQLAVDL---DVEFPEDESYHCAHASRMDIGQLDPSIALCFYLPTECEFDSWCNLAHKHL 344

Query: 430 EESNGAPLFTVTQTHKKPVNHSDVLGE 456
                 PLF +T+  ++P+   D   E
Sbjct: 345 ITQMKQPLFEITE--ERPLGWPDFTEE 369


>gi|427783027|gb|JAA56965.1| Putative cysteine protease required for autophagy [Rhipicephalus
           pulchellus]
          Length = 390

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 164/328 (50%), Gaps = 44/328 (13%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
           L +   + +S+I ++YRK F  I  +  TSD GWGCMLR  QM+VA+A++   LG+ W+ 
Sbjct: 41  LDDLRSNITSKIWLTYRKNFPAISGTDYTSDTGWGCMLRCGQMVVAEAVMRRHLGKDWQW 100

Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
            P  K  D +Y+ +L +F D +   +SIH + Q G + G   G W GP  +      L+ 
Sbjct: 101 SPGTK--DEKYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKEVGQWFGPNTIAHVLRKLST 158

Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK------------ 303
             +  +        +A++V   +         VV +DD  + C V +             
Sbjct: 159 FDKWSS--------LAMHVAMDN---------VVVMDDIRKICRVETTTDVEDGIRNRTQ 201

Query: 304 --------GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
                   G   W P++L +PL LGL ++NP Y   L+ TF   QSLGI+GGKP  + YI
Sbjct: 202 SHGGPAAAGARSWKPLVLFIPLRLGLSEINPIYYCGLKRTFALKQSLGIIGGKPNHALYI 261

Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
           +GV  +  ++LDPH  Q  +++   D+E  +  +YH      + +  +DPS+A+ FY   
Sbjct: 262 IGVVGDDLVFLDPHTTQLAVDL---DVECPEDESYHCAHASRMDIGQLDPSIALCFYMAT 318

Query: 415 KDDFDDFCARASKLAEESNGAPLFTVTQ 442
           + +FD +C  A K        PLF +T+
Sbjct: 319 EAEFDSWCNLAHKHLISQMKQPLFEITE 346


>gi|29135261|ref|NP_705811.8| cysteine protease ATG4D [Mus musculus]
 gi|61211815|sp|Q8BGV9.1|ATG4D_MOUSE RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
           cysteine endopeptidase; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related cysteine endopeptidase
           4; AltName: Full=Autophagy-related protein 4 homolog D
 gi|26331508|dbj|BAC29484.1| unnamed protein product [Mus musculus]
 gi|26348941|dbj|BAC38110.1| unnamed protein product [Mus musculus]
 gi|27763977|emb|CAC85952.1| APG4-D protein [Mus musculus]
 gi|47125055|gb|AAH69851.1| Autophagy-related 4D (yeast) [Mus musculus]
 gi|148693226|gb|EDL25173.1| autophagy-related 4D (yeast), isoform CRA_b [Mus musculus]
          Length = 474

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 174/382 (45%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S   S + L G C+            G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
             +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193

Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R      C  +   +  VS D          V   D +R  S +    A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ +     +  ++H    R +    +DPS  +GFY  ++ +F+  C+   +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436


>gi|26349259|dbj|BAC38269.1| unnamed protein product [Mus musculus]
          Length = 474

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 174/382 (45%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S   S + L G C+            G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISTVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
             +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193

Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP---- 249

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R      C  +   +  VS D          V   D +R  S +    A+
Sbjct: 250 ---SVVAHILRKAVE-SCSEVSRLVVYVSQDC--------TVYKADVARLLS-WPDPTAE 296

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ +     +  ++H    R +    +DPS  +GFY  ++ +F+  C+   +
Sbjct: 357 PHYCQPTVDVSQPSFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 414

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FTV + H +
Sbjct: 415 ILSSSSVTERYPMFTVAEGHAQ 436


>gi|405972565|gb|EKC37327.1| Cysteine protease ATG4B [Crassostrea gigas]
          Length = 405

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 105/335 (31%), Positives = 156/335 (46%), Gaps = 47/335 (14%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--K 196
           E   DF S+I  +YRK F  IG +  T D GWGCMLR  QM++AQAL+   LGR W+  K
Sbjct: 46  ELKGDFLSKIWCTYRKNFPAIGGTGPTCDGGWGCMLRCGQMMLAQALVVRHLGRDWKWNK 105

Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
             Q   D+ Y  IL +F D +++ +SI  +   G + G   GSW GP  + +  + LA  
Sbjct: 106 NCQ---DQTYKRILQMFADKKSANYSIQQIASMGVSEGKPVGSWFGPNTVAQVLKKLAVY 162

Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----------- 305
               +          +  ++ D          VC DD    C +    Q           
Sbjct: 163 DEWSS---------IVIHIAMDNTVIENDIKSVCKDDGKSTCDIIGVRQLKHESAATGRS 213

Query: 306 --------------------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
                                 W P+LL++PL LGL ++N  Y+ +L+   +FPQS+GI+
Sbjct: 214 KKSSQDSSKQDKNKQNAVDVKSWKPLLLVIPLRLGLTEINSVYVQSLKACLSFPQSVGII 273

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           GGKP  + + VG   +  IYLDPH  Q   ++  D       +YH      +++  +DPS
Sbjct: 274 GGKPNHAHWFVGYMSDKLIYLDPHTTQLCEDL--DSPNFSDESYHCPYPSTMNVMELDPS 331

Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
           +A+GFYC  + +FDD      K    S+  P+F +
Sbjct: 332 IALGFYCGTEKEFDDLTQSVQKFVVGSSKTPMFEL 366


>gi|194213171|ref|XP_001491090.2| PREDICTED: cysteine protease ATG4D [Equus caballus]
          Length = 424

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 179/383 (46%), Gaps = 67/383 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E+ GD      +  F +DF+SR+ ++YR+ F P+  
Sbjct: 33  SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 82

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 83  GCLTSDCGWGCMLRSGQMMLAQGLLLHYLPRDWTWAEGAGLGPPEPVGLSSPNRYRGPAR 142

Query: 196 ---------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 246
                     P     +R + +I+  F D   +PF +H L++ G++ G  AG W GP   
Sbjct: 143 WMAPTLGPGAPPSWSRERRHRQIVSWFADHPRAPFGLHQLVELGQSSGKKAGDWYGP--- 199

Query: 247 CRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQA 306
                 +A   R       +   + +YV       +   A +V   D +          A
Sbjct: 200 ----SLVAHILRKAVESCAEVTRLVVYVSQDCTVYKADVARLVARPDPT----------A 245

Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
           +W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YL
Sbjct: 246 EWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYL 305

Query: 367 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
           DPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +
Sbjct: 306 DPHYCQPTVDVSRADFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELT 363

Query: 427 KLAEESNGA---PLFTVTQTHKK 446
           ++   S+     P+FT+ + H +
Sbjct: 364 RVLSSSSATERYPMFTLAEGHAQ 386


>gi|296804856|ref|XP_002843276.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
 gi|238845878|gb|EEQ35540.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
          Length = 473

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 112/317 (35%), Positives = 160/317 (50%), Gaps = 47/317 (14%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLRSS 177
           F  DF SR+ I+YR  F PI  +                        TSD GWGCM+RS 
Sbjct: 138 FLDDFESRLWITYRSHFPPIPKTGGSSSSSMPLGVRLRSQLIDTQGFTSDTGWGCMIRSG 197

Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLA 236
           Q L+A  LLF RLGR WR+  Q   ++E  E+L LF D   +PFSIH  +Q G  A G  
Sbjct: 198 QSLLANTLLFLRLGRGWRRGSQ---EQEESELLSLFADHPRAPFSIHRFVQHGATACGKC 254

Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCIDDAS 295
            G W GP A  +  +ALA         G     + +Y+ S G +  ER    + C     
Sbjct: 255 PGEWFGPAAAAQCIQALAN--------GHPQAGLNVYITSDGSDIYERQFREIACR---- 302

Query: 296 RHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
               +   G+ D   P L+L+ + LG+++V P Y  +L+    FPQS+GI GG+P +S Y
Sbjct: 303 ---GLGEDGEDDSIKPTLILLGVRLGIDRVTPVYWESLKEVIRFPQSVGIAGGRPSSSHY 359

Query: 355 IVGVQEESAIYLDPHDVQPVI---NIGKDDLE-ADTSTYHSDVIRHIHLDSIDPSLAIGF 410
            +  Q ++  YLDPH  +P +     G+D     + STYH+  +R +H+  +DPS+ IGF
Sbjct: 360 FIATQGDTFFYLDPHQTRPSLPPRTAGEDVYSPGELSTYHTRRLRRLHIREMDPSMLIGF 419

Query: 411 YCRDKDDFDDFCARASK 427
             RD+ D++D   R  +
Sbjct: 420 LVRDEGDWEDLKGRIRR 436


>gi|321472665|gb|EFX83634.1| hypothetical protein DAPPUDRAFT_194862 [Daphnia pulex]
          Length = 389

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 178/362 (49%), Gaps = 34/362 (9%)

Query: 83  KRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQ 142
           KR++ A      +E  +   R G   +   +W+LG            +      L E N 
Sbjct: 23  KRMLEACEAFVTYESGIILERQGFEVNDEPVWILG-----------REYDTKTKLDELNS 71

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--WRKPLQK 200
           D  SR+L++YR+ F PIGDS +TSD GWGCMLR  QM+VAQAL+   LGR   W     +
Sbjct: 72  DVKSRLLLTYRRNFPPIGDSGMTSDRGWGCMLRCGQMVVAQALINQHLGRQPFWPVGDDQ 131

Query: 201 PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
                Y +IL LF D +T+ +SIH L Q G + G   G W GP  + +  + L+      
Sbjct: 132 RTTESYKKILKLFEDKKTAVYSIHQLAQMGVSEGKEIGQWFGPNTVAQVLKKLSEYDEWS 191

Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC--SVFSKGQADWTPILLLVPLV 318
                    + I+V   +          V I++  + C   +     + W+P+LL+VPL 
Sbjct: 192 A--------LKIHVAMDN---------AVVIEEIEQLCHKKITPTETSTWSPLLLVVPLR 234

Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
           LGL  +NP YI +L+     PQS+G++GGKP  + Y +G   +  ++LDPH  Q  I++ 
Sbjct: 235 LGLLNINPIYIDSLKACLQMPQSIGMIGGKPSQALYFIGYVGDDVVFLDPHLTQNAIDLD 294

Query: 379 KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 438
           +D  E D S+YH      I   S+DPSLA+ F C    ++ D   +   ++E      LF
Sbjct: 295 ED--EFDDSSYHPATCARISFQSMDPSLAVCFSCTTHSEWKDLLRQFKDMSEIGKKQNLF 352

Query: 439 TV 440
            V
Sbjct: 353 EV 354


>gi|195118032|ref|XP_002003544.1| GI17971 [Drosophila mojavensis]
 gi|193914119|gb|EDW12986.1| GI17971 [Drosophila mojavensis]
          Length = 382

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 111/388 (28%), Positives = 184/388 (47%), Gaps = 48/388 (12%)

Query: 91  MRRIHERVLGPSRT---------GISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 141
           M  + E  LGP             I    +++WLLG  +   Q+           L    
Sbjct: 13  MDSVFEAYLGPDGVLAGAVGEIEDIPKRNTNVWLLGKRYNAIQE-----------LEPIR 61

Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
           +D  SR+  +YR GF P+G+ ++T+D GWGCMLR  QM++AQAL+   LGR W      P
Sbjct: 62  RDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTP 118

Query: 202 --FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
              D  Y++I++ F D+  S +SIH +   G++   A G W+GP  + +  + L R    
Sbjct: 119 DCRDATYLKIVNRFEDTRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKILVRFDDW 178

Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
            +        +A++V              V +DD    C      ++ W P+LL+VPL L
Sbjct: 179 SS--------LAVHVAMDS---------TVVLDDIYTCCQ--ESSESSWKPLLLIVPLRL 219

Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
           G+  +NP YIP L+       S G++GG+P  + Y +G  ++  +YLDPH  Q    + +
Sbjct: 220 GITDINPIYIPALKRCLELSSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGAVAQ 279

Query: 380 DDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 436
               A+     +YH      +   ++DPSLA+ F C+ +D F++   +  +     +   
Sbjct: 280 KTTAAERELDESYHQKYAARLSFGAMDPSLAVCFLCKTRDSFEELLQQLRQDVLTLSTPA 339

Query: 437 LFTVTQTHKKPVNHSDVLGETGGVPEDD 464
           LF ++Q+     + +D + E   +P+ D
Sbjct: 340 LFEISQSRAVDWDTADDI-EWPAMPDID 366


>gi|178057055|ref|NP_001116551.1| cysteine protease ATG4D [Sus scrofa]
 gi|61211337|sp|Q684M2.1|ATG4D_PIG RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related protein 4 homolog D
 gi|51870495|emb|CAG15153.1| AUT-like 4, cysteine endopeptidase [Sus scrofa]
          Length = 469

 Score =  178 bits (451), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 110/378 (29%), Positives = 178/378 (47%), Gaps = 62/378 (16%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------KPLQKPF----------- 202
             +TSD GWGCMLRS QM++AQ LL H L R W          P   P            
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSQGVGLGPPESSPNRYRGPAHWMPP 192

Query: 203 -----------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
                      +R + +I+  F D   +PF +H L++ G++ G  AG W GP        
Sbjct: 193 HWVQAAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------S 245

Query: 252 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 311
            +A   R       +   + +YV       +   A +V   D +          A+W  +
Sbjct: 246 LVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKAV 295

Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
           ++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDPH  
Sbjct: 296 VILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYC 355

Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
           QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +++   
Sbjct: 356 QPTVDVSQADFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTRVLSS 413

Query: 432 SNGA---PLFTVTQTHKK 446
           S+     P+FT+ + H +
Sbjct: 414 SSATERYPMFTLVEGHAQ 431


>gi|440891575|gb|ELR45180.1| Cysteine protease ATG4A, partial [Bos grunniens mutus]
          Length = 408

 Score =  177 bits (450), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 108/355 (30%), Positives = 172/355 (48%), Gaps = 39/355 (10%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYV--VSGDEDG 281
            G + G W GP          A+   W +LA     +  +  + +        +S D   
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPA 195

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
           ER    +     AS      S     W P+LL+VPL LG+ ++NP Y+   +  F  PQS
Sbjct: 196 ERPLESLT----ASNQSKGPSACCTAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQS 251

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
           LG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ +
Sbjct: 252 LGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILN 311

Query: 402 IDPSLAI------------GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +DPS+A+            GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 312 LDPSVALVVLSCLLLLPPKGFFCKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 365


>gi|149642765|ref|NP_001092616.1| cysteine protease ATG4D [Bos taurus]
 gi|148744285|gb|AAI42400.1| ATG4D protein [Bos taurus]
          Length = 472

 Score =  177 bits (450), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 179/381 (46%), Gaps = 65/381 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 198
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192

Query: 199 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
                  Q P    +R + +I+  F D   +PF +H L++ G+  G  AG W GP     
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247

Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
               +A   R       +   + +YV       +   A +V   D +          A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295

Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
             +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355

Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
           H  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +++
Sbjct: 356 HYCQPTVDVSQADFPLE--SFHCTSPRRMAFAKMDPSCTVGFYAGDRKEFETLCSELTRV 413

Query: 429 AEESNGA---PLFTVTQTHKK 446
              S+     P+FT+ + H +
Sbjct: 414 LSSSSATERYPMFTLVEGHAQ 434


>gi|195570668|ref|XP_002103326.1| GD20357 [Drosophila simulans]
 gi|194199253|gb|EDX12829.1| GD20357 [Drosophila simulans]
          Length = 703

 Score =  177 bits (450), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343

Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463

Query: 288 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
            V    A R  +   K   Q  W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 464 HVPWQQAKRPQAETPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 581

Query: 406 LAIGFYCRDKDDFDDF 421
             IGFYC  K DFD+F
Sbjct: 582 CCIGFYCATKSDFDNF 597


>gi|320169048|gb|EFW45947.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 918

 Score =  177 bits (449), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 120/351 (34%), Positives = 172/351 (49%), Gaps = 37/351 (10%)

Query: 107 SSSTSDIWLLGVCHKIAQDEALGDAAGNNG-----LAEFNQDFSSRILISYRKGFDPIGD 161
           S S S IW+LG C+   + E  G     +      + +F  DF + +  SYRK F+ I  
Sbjct: 260 SISDSPIWMLGNCYSGKELECNGHTENKHNKRSRHICKFFADFQTLVCFSYRKDFERIPG 319

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQKPFDREYVEILHLFG 214
           SK T+D GWGC LRS+QMLVA+AL+    GR WR        PL    + +   I+ LF 
Sbjct: 320 SKHTTDCGWGCTLRSAQMLVAEALVLQIFGRRWRIEDRSCPAPLSSSKEDQLRLIIRLFQ 379

Query: 215 DSET--SPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
           D     SPFSIHN++Q G + +   AG W GP ++ R +  L     A      ++    
Sbjct: 380 DQLRLDSPFSIHNIVQHGCQLFDKRAGDWFGPASVVRVFADLINQAYAMHQSPFRAYQAI 439

Query: 272 IYVVSGDEDGERGGAPVVCID-DASRHCSVFSKGQADWT-------------------PI 311
            +++  D   E    P    D + S   S       D T                   P+
Sbjct: 440 DHIIYRDLVAELCSGPDAVRDLEFSTPTSTSESVSTDETVTPSASTSQSPPVLPPPFIPL 499

Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
           L+L+PL LGL ++N  YIP L+      Q +GI+GG+P  S Y VG QE++ I+ DPH  
Sbjct: 500 LILMPLRLGLNEINRMYIPCLKALLMCAQCVGIIGGRPRHSLYFVGYQEDNVIFADPHGC 559

Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
           +  +++ +      T T+HS V   I    +DPS+AIGF C+++ DFDD C
Sbjct: 560 KRFVDMQQTSFP--TETFHSAVPNKIPFTHMDPSMAIGFLCQNQADFDDLC 608


>gi|431918972|gb|ELK17839.1| Cysteine protease ATG4D [Pteropus alecto]
          Length = 442

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 182/385 (47%), Gaps = 66/385 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 52  SRTSFSKLSS----VHLCGRRYRFETEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 101

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR----------KP--LQKPF------- 202
             +TSD GWGCMLRS QM++AQ LL H L R W           +P  L  P+       
Sbjct: 102 GYLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWVKGVGLDPPEPSRLASPYWHHGPAC 161

Query: 203 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
                          +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 162 WIPPHWTQGSPELEQERRHRQIVSWFADHPKAPFGLHQLVELGQSSGKKAGDWYGP---- 217

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 218 ---SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT----------AE 264

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + +   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 325 PHYCQPTVDVSQANFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTR 382

Query: 428 LAEESNGA---PLFTVTQTHKKPVN 449
           +   S+     P+FT+ + H +  N
Sbjct: 383 VLSSSSATERYPMFTLAEGHAQDHN 407


>gi|195328749|ref|XP_002031074.1| GM25780 [Drosophila sechellia]
 gi|194120017|gb|EDW42060.1| GM25780 [Drosophila sechellia]
          Length = 703

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML A
Sbjct: 284 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 343

Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 344 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 403

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 404 KPGDWYGPASVSYLLKHALEHASQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 463

Query: 288 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
            V    A R  +   K   Q  W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 464 HVPWQKAKRPQAENPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 523

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 524 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 581

Query: 406 LAIGFYCRDKDDFDDF 421
             IGFYC  K DFD+F
Sbjct: 582 CCIGFYCATKSDFDNF 597


>gi|24647125|ref|NP_650452.1| CG6194 [Drosophila melanogaster]
 gi|23171357|gb|AAF55180.2| CG6194 [Drosophila melanogaster]
 gi|261490735|gb|ACX83596.1| RE44406p [Drosophila melanogaster]
          Length = 668

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 166/316 (52%), Gaps = 19/316 (6%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML A
Sbjct: 249 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 308

Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 309 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 368

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 369 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 428

Query: 288 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
            V    A R  +  +K   Q  W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 429 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 488

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 489 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 546

Query: 406 LAIGFYCRDKDDFDDF 421
             IGFYC  K DFD+F
Sbjct: 547 CCIGFYCATKSDFDNF 562


>gi|296485832|tpg|DAA27947.1| TPA: APG4 autophagy 4 homolog D [Bos taurus]
          Length = 472

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 179/381 (46%), Gaps = 65/381 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPL- 198
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWCQGAGLGPSEPPGLGSPSRRRGPAR 192

Query: 199 -------QKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
                  Q P    +R + +I+  F D   +PF +H L++ G+  G  AG W GP     
Sbjct: 193 WLPPRWAQAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQGSGKKAGDWYGP----- 247

Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
               +A   R       +   + +YV       +   A +V   D +          A+W
Sbjct: 248 --SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEW 295

Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
             +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDP
Sbjct: 296 KSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDP 355

Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
           H  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +++
Sbjct: 356 HYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRV 413

Query: 429 AEESNGA---PLFTVTQTHKK 446
              S+     P+FT+ + H +
Sbjct: 414 LSSSSATERYPMFTLVEGHAQ 434


>gi|340722130|ref|XP_003399462.1| PREDICTED: cysteine protease ATG4D-like [Bombus terrestris]
          Length = 485

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 106/305 (34%), Positives = 153/305 (50%), Gaps = 27/305 (8%)

Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
           A+   +  + + EF +DF+SR+ ++YR+ F  +  S  TSD GWGCMLRS QM++AQAL+
Sbjct: 131 AMDAISFEDSIEEFKKDFTSRLWLTYRREFPILNGSTFTSDCGWGCMLRSGQMMLAQALV 190

Query: 187 FHRLGRPWRKPLQKPFDREYVE--------ILHLFGD--SETSPFSIHNLLQAGKAYGLA 236
            H LGR WR  + +P   E  +        I+  FGD    TSPFSIH L+  G   G  
Sbjct: 191 CHFLGREWRWQVDQPLKTEQQKLDEYNHRLIIKSFGDLPDSTSPFSIHTLVSLGALSGKR 250

Query: 237 AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 296
           AG W GP ++          Q AE      +L  A+YV              V + D   
Sbjct: 251 AGDWYGPSSVAHLLSQAVE-QAAERHPVFSNL--AVYVAQD---------CAVYLQDVEN 298

Query: 297 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
            C +       W  ++L VPL LG +K+N  Y   L    T    +G++GG+P  S Y +
Sbjct: 299 VCQM---PDGKWKSLILFVPLRLGADKLNLVYASCLTHLLTLNTCIGVIGGRPRHSLYFI 355

Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
           G QE+  I LDPH  Q  +++ KD+     +++H    R + +  +DPS  +GFY  DK 
Sbjct: 356 GFQEDKLINLDPHYCQETVDVLKDNFPL--TSFHCTSPRKMLISKMDPSCCVGFYFHDKM 413

Query: 417 DFDDF 421
            F +F
Sbjct: 414 QFTNF 418


>gi|17862242|gb|AAL39598.1| LD17482p [Drosophila melanogaster]
          Length = 653

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 166/316 (52%), Gaps = 19/316 (6%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML A
Sbjct: 234 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFA 293

Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 294 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 353

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 354 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 413

Query: 288 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
            V    A R  +  +K   Q  W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 414 HVPWQQAKRPQAETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGIL 473

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 474 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 531

Query: 406 LAIGFYCRDKDDFDDF 421
             IGFYC  K DFD+F
Sbjct: 532 CCIGFYCATKSDFDNF 547


>gi|195444549|ref|XP_002069918.1| GK11310 [Drosophila willistoni]
 gi|194166003|gb|EDW80904.1| GK11310 [Drosophila willistoni]
          Length = 676

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 111/328 (33%), Positives = 156/328 (47%), Gaps = 43/328 (13%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  +G+     G+  F +DF SR+ ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 258 AVENQVGETPWEEGIEGFRRDFYSRLWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 317

Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L+  G A G 
Sbjct: 318 QGLIVHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVSLGTALGK 377

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP ++      L       T        +++YV              + I D  
Sbjct: 378 KPGDWYGPASVSY---LLKHALEHATQENADFDNISVYVAKD---------CTIYIQDIE 425

Query: 296 RHCSV----------------------FSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
             CS+                          Q  W  +++L+PL LG +KVNP Y   L+
Sbjct: 426 DQCSIPEPAPKQTHVPWQQMKRPSLNEHQPDQQHWKSVIILIPLRLGTDKVNPAYAHCLK 485

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
           L  +    LGI+GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H   
Sbjct: 486 LLLSTENCLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQENF--SMQSFHCKS 543

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
            R I    +DPS  IGFYC  K DFD  
Sbjct: 544 PRKIKTSKMDPSCCIGFYCATKSDFDSL 571


>gi|194901010|ref|XP_001980048.1| GG20629 [Drosophila erecta]
 gi|190651751|gb|EDV49006.1| GG20629 [Drosophila erecta]
          Length = 708

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 165/316 (52%), Gaps = 19/316 (6%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 289 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 348

Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 349 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 408

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 409 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 468

Query: 288 VVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
            V    A R  +   K   Q  W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 469 HVPWQQAKRPQAETPKTEQQQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 528

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 529 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 586

Query: 406 LAIGFYCRDKDDFDDF 421
             IGFYC  K DFD F
Sbjct: 587 CCIGFYCATKSDFDSF 602


>gi|210032083|ref|NP_001094483.2| autophagy-related 4D [Rattus norvegicus]
 gi|149020504|gb|EDL78309.1| rCG31864, isoform CRA_b [Rattus norvegicus]
          Length = 473

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 176/382 (46%), Gaps = 67/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S   S + L G C+            G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
           S +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192

Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 248

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R      C  +   +  VS D          V   D +R  S +    A+
Sbjct: 249 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 295

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + +   +  ++H    R +    +DPS  +GFY  ++ +F+  C+   +
Sbjct: 356 PHYCQPTVDVNQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 413

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FTV + H +
Sbjct: 414 ILSSSSVTERYPMFTVAEGHAQ 435


>gi|340383455|ref|XP_003390233.1| PREDICTED: cysteine protease ATG4D-like [Amphimedon queenslandica]
          Length = 437

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 167/356 (46%), Gaps = 54/356 (15%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
           S+ S + +LG  +   +D           +  F   F S   ++YR GF PI  S +T+D
Sbjct: 61  SNNSPVLVLGKLYIPERDTKPQSEGIPRHILMFMDHFYSLPWMTYRCGFSPILSSSLTTD 120

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR-----------KPLQKPFDREYVEILHLFGDS 216
            GWGCM+RS QML+A  L  H LGR WR               K ++   V IL  FGDS
Sbjct: 121 CGWGCMVRSGQMLLATVLHLHFLGRDWRLSSSDVTGHKIHRQVKNWNNYVVLILSWFGDS 180

Query: 217 ETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIY 273
           E+   PFSIH L++A   +G   G W GP  +      L R C R           + IY
Sbjct: 181 ESELCPFSIHRLMEAAYYHGNKPGDWFGPSQV----SILIRDCVRRALREHINLQKLNIY 236

Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW------TPILLLVPLVLGLEKVNPR 327
           V                    S  C+V+ K   D         +L+LVP+ LG E +NP 
Sbjct: 237 V--------------------SHDCTVYIKDVQDIFESDLDQSLLVLVPVRLGSESLNPI 276

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           YIP ++       ++GI+GG+P  S + +G Q+E+ I+LDPH  Q  +N+ + D   D S
Sbjct: 277 YIPCVKALLALDHTVGIIGGRPKHSVFFIGFQDENLIHLDPHYSQTAVNMTRTDF--DVS 334

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
           +YH    + I +  +DPS  +GFYC   +DF+ F   A K+         FTVT T
Sbjct: 335 SYHCRSPKKIPVTKMDPSCTLGFYCHTLEDFNHFRIEAEKVT--------FTVTPT 382


>gi|452837994|gb|EME39935.1| hypothetical protein DOTSEDRAFT_47435 [Dothistroma septosporum
           NZE10]
          Length = 442

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/354 (31%), Positives = 169/354 (47%), Gaps = 56/354 (15%)

Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 175
           +EF +D  S+I ++YR  F PI  S                        TSD GWGCM+R
Sbjct: 111 SEFLEDVESKIWLTYRNNFPPIPKSSEAAATSAMSFTTKLRNFANKDGFTSDTGWGCMIR 170

Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
           S Q L+A A+L HRLGR WR+  +   +REY +IL LF D+  SP SIH  ++ G +A G
Sbjct: 171 SGQSLLANAILIHRLGRDWRRGDK---EREYKDILSLFADTPESPLSIHKFVEHGAQACG 227

Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP--MAIYVVSGDEDGERGGAPVVCID 292
              G W GP A  R   AL   +  E GL   S P    +YV                  
Sbjct: 228 TYPGEWFGPNATARCIRALTE-KYHEAGLQVYSRPNDSDVYV------------------ 268

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D+    +        + P L+++ + LG+EKV P Y   L+      QS+GI GG+P +S
Sbjct: 269 DSLMQTAAQKDADDKFQPTLIVLGIRLGIEKVTPAYHAALKAALELSQSVGIAGGRPSSS 328

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
            Y +G Q ++  YLDPH  +P+++     L  D ++ H+  +R + +  +DPS+ +GF  
Sbjct: 329 HYFIGHQGDNFFYLDPHTTRPMLS--PQPLAEDINSCHTRRVRRLGIAEMDPSMLLGFLI 386

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSL 466
           R KD+F+ +    S++       P   +   H+    +S      G V E ++L
Sbjct: 387 RSKDEFEQWRKSISEI-------PGKAIIHIHETEPKYSTGTERAGAVDEVETL 433


>gi|321472016|gb|EFX82987.1| hypothetical protein DAPPUDRAFT_302128 [Daphnia pulex]
          Length = 405

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/336 (33%), Positives = 166/336 (49%), Gaps = 39/336 (11%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
           S  S IWLLG  +  +       +   N       DF SRI ++YRK F  +  S  TSD
Sbjct: 18  SKDSPIWLLGRIYHQSHKTDDSSSLPTNNFEALKSDFFSRIWLTYRKEFPVLNGSYYTSD 77

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPFDREYVEILHLFGD--S 216
            GWGCMLRS QML+AQAL+ H LGR WR         + LQ+   R    I+  FGD  S
Sbjct: 78  CGWGCMLRSGQMLLAQALVCHFLGRDWRWNESGAQEQQTLQESLHR---MIVQWFGDKPS 134

Query: 217 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
              P SIH ++  G  + G   G W GP ++  S+      QRA T    +   + +Y+ 
Sbjct: 135 PACPLSIHQMVSQGHISAGKRPGDWYGPSSV--SYIIKQILQRA-TDTYPELDTLRVYIA 191

Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQAD----------WTPILLLVPLVLGLEKVN 325
                        V +DD  + CS     + +          W  ++LL+PL LG E++N
Sbjct: 192 QD---------CTVYLDDVKQSCSKICNYECEETDYELIDDQWKSLILLIPLRLGGERMN 242

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
           P Y   L+   +  Q +GI+GGKP  S Y +G Q++  I+LDPH+ Q ++++   +   +
Sbjct: 243 PTYDSCLKGLLSLEQCIGIIGGKPKHSQYFIGWQDDYLIHLDPHNCQEMVDVLIPNF--N 300

Query: 386 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
             ++H   +R   L  +DPS  +GFY R + +FD+F
Sbjct: 301 LKSFHCHELRKTALKQVDPSCCVGFYLRSQREFDEF 336


>gi|194764839|ref|XP_001964535.1| GF23235 [Drosophila ananassae]
 gi|190614807|gb|EDV30331.1| GF23235 [Drosophila ananassae]
          Length = 668

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/316 (35%), Positives = 167/316 (52%), Gaps = 19/316 (6%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 248 AVENQVGEHPWEEGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLA 307

Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H +GR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 308 QGLICHFMGRTWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGENLGK 367

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 368 KPGDWYGPASVSYLLKHALEHAAQENADFDNISIYVAKDCTIYLQDIEDQCSVPEPAPKP 427

Query: 288 VVCIDDASRHCSVFSKG--QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
            V    A R  +  SK   Q  W  +++L+PL LG +K+N  Y   L+L  +    LGI+
Sbjct: 428 NVPWQQAKRPQAEVSKTEHQQHWKALIVLIPLRLGSDKLNLAYAHCLKLLLSTEHCLGII 487

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++   +  ++H    R +    +DPS
Sbjct: 488 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENFSLN--SFHCKSPRKLKSSKMDPS 545

Query: 406 LAIGFYCRDKDDFDDF 421
             IGFYC  K DFD+F
Sbjct: 546 CCIGFYCATKSDFDNF 561


>gi|195539710|gb|AAI68141.1| Atg4d protein [Rattus norvegicus]
          Length = 442

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 176/382 (46%), Gaps = 67/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S   S + L G C+            G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 53  SRTSFSK-ISSVHLCGRCYHFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 102

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
           S +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 103 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 161

Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 162 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 217

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R      C  +   +  VS D          V   D +R  S +    A+
Sbjct: 218 ---SVVAHILRKAVE-SCSEVTRLVVYVSQDC--------TVYKADVARLVS-WPDPTAE 264

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 265 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 324

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + +   +  ++H    R +    +DPS  +GFY  ++ +F+  C+   +
Sbjct: 325 PHYCQPTVDVNQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMR 382

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FTV + H +
Sbjct: 383 ILSSSSVTERYPMFTVAEGHAQ 404


>gi|213390042|gb|ACJ46060.1| autophagy related protein Atg4-like protein [Bombyx mori]
          Length = 355

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 106/297 (35%), Positives = 151/297 (50%), Gaps = 27/297 (9%)

Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
            G+  F  DF S+I ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR W
Sbjct: 15  EGIEGFKSDFVSKIWMTYRREFPTMTGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRSW 74

Query: 195 RKPLQKPFD--REYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPY 244
           R   +KP    RE+ E      I+  FGD  S  SP SIH ++  G+A G   G W GP 
Sbjct: 75  RWLPEKPIQNAREFQEDCLHRKIIKWFGDKSSVNSPLSIHQMVSLGEALGKKPGDWYGPA 134

Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 304
           ++    ++L      E     +   + +YV              V I D    C +    
Sbjct: 135 SVAHCLKSLIASASKENY---EFDHLEVYVAQDS---------TVYIQDIYSMCQLL--- 179

Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
              W  ++LLVP+ LG EK NP Y P L    T    +GI+GG+P  S Y VG Q++  I
Sbjct: 180 HGAWKSLILLVPVKLGTEKFNPIYGPCLTSLLTLDFCIGIIGGRPKHSLYFVGYQDDKLI 239

Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           +LDPH  Q ++++ + +      ++H    R + L  +DPS  IGFY   + DF+ F
Sbjct: 240 HLDPHYCQEMVDVWQPNFSL--QSFHCRSPRKMPLAKMDPSCCIGFYLGTQHDFETF 294


>gi|348550913|ref|XP_003461275.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like [Cavia
           porcellus]
          Length = 474

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 112/382 (29%), Positives = 179/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S   S I+L G  ++           G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSK-LSSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 133 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWAEGPGLGSPELPGTASPSPGRSPAR 192

Query: 195 ----RKPLQKP-FDRE--YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R P   P  ++E  + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 193 WVPPRWPRGAPELEQELRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 248

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   +A+YV       +   A +V   D +          A+
Sbjct: 249 ---SLVAHILRKAVESSSEVTRLAVYVSQDCTVYKADVAHLVASRDPT----------AE 295

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  ++ +F+  CA  ++
Sbjct: 356 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCAELTR 413

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 414 ILSCSSATERYPMFTLAEGHAQ 435


>gi|195394658|ref|XP_002055959.1| GJ10670 [Drosophila virilis]
 gi|194142668|gb|EDW59071.1| GJ10670 [Drosophila virilis]
          Length = 672

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/318 (35%), Positives = 163/318 (51%), Gaps = 21/318 (6%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  + D+    G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 252 AAENQMADSPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 311

Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 312 QGLICHFLGRSWRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGEQLGK 371

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
             G W GP ++    + AL    +        S+ +A    IY+   +E     E    P
Sbjct: 372 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEEQCSIPEPAPKP 431

Query: 288 VVCIDDASRH----CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
            V     S+          + Q  W  +++L+PL LG +K+NP Y   L+L  +    LG
Sbjct: 432 HVPWQMTSKKPASDAPKLDQPQQHWKSLIVLIPLRLGTDKLNPVYAHCLKLLLSTEHCLG 491

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
           I+GGKP  S Y VG QE+  I+LDPH  Q ++++ ++       ++H    R +    +D
Sbjct: 492 IIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDVNQETF--SMQSFHCKSPRKLKSSKMD 549

Query: 404 PSLAIGFYCRDKDDFDDF 421
           PS  IGFYC  K DFD F
Sbjct: 550 PSCCIGFYCATKTDFDSF 567


>gi|154300262|ref|XP_001550547.1| hypothetical protein BC1G_11320 [Botryotinia fuckeliana B05.10]
 gi|166990615|sp|A6SDQ3.1|ATG4_BOTFB RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|347841273|emb|CCD55845.1| similar to cysteine protease atg4 [Botryotinia fuckeliana]
          Length = 439

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 106/310 (34%), Positives = 159/310 (51%), Gaps = 51/310 (16%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
           F  DF ++I ++YR  F  I  S+                        TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
            Q L+A ALL  R+GR WR+ +    +R+   IL LF D   +P+SIH  ++ G  A G 
Sbjct: 163 GQSLLANALLTLRMGREWRRGVSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  +AL+  Q            + +Y+ +GD      G+ V       
Sbjct: 220 HPGEWFGPSATARCIQALSNSQAKSE--------LRVYI-TGD------GSDVY----ED 260

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
           +  S+     +D+TP L+LV   LGL+K+ P Y   L+ +   PQS+GI GG+P +S Y 
Sbjct: 261 KFMSIAKPNHSDFTPTLILVGTRLGLDKITPVYWEALKYSLQMPQSVGIAGGRPSSSHYF 320

Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLE----ADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
           +GVQE    YLDPH  +P +   KD++E     D  + H+  +R +H+  +DPS+ I F 
Sbjct: 321 IGVQESDFFYLDPHQTRPALPY-KDNVEDYTTEDIDSCHTRRLRRLHIKEMDPSMLIAFL 379

Query: 412 CRDKDDFDDF 421
            RD++D++++
Sbjct: 380 IRDENDWNEW 389


>gi|195501322|ref|XP_002097748.1| GE26385 [Drosophila yakuba]
 gi|194183849|gb|EDW97460.1| GE26385 [Drosophila yakuba]
          Length = 706

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 113/316 (35%), Positives = 165/316 (52%), Gaps = 19/316 (6%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  +G+     G+  F +DF SRI ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 287 AVENQVGEQPWEEGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 346

Query: 183 QALLFHRLGRPWRKPLQKPFDREYVEILH-----LFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR   +      Y + +H      FGD  S++SPFSIH L++ G+  G 
Sbjct: 347 QGLICHFLGRSWRYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGK 406

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 407 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKP 466

Query: 288 VVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
            V    A R  +   K +    W  +++L+PL LG +K+NP Y   L+L  +    LGI+
Sbjct: 467 HVPWQQAKRPQAETPKTEQHQHWKSVIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLGII 526

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           GGKP  S Y VG QE+  I+LDPH  Q ++++ +++      ++H    R +    +DPS
Sbjct: 527 GGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDVNQENF--SLHSFHCKSPRKLKASKMDPS 584

Query: 406 LAIGFYCRDKDDFDDF 421
             IGFYC  K DFD F
Sbjct: 585 CCIGFYCATKSDFDSF 600


>gi|291414155|ref|XP_002723329.1| PREDICTED: APG4 autophagy 4 homolog D [Oryctolagus cuniculus]
          Length = 408

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 117/401 (29%), Positives = 187/401 (46%), Gaps = 69/401 (17%)

Query: 89  GSMRRIHE---RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS 145
           G  RR  E   R    SRT  S  +S    + VC +  + E  GD      +  F +DF 
Sbjct: 4   GGARRPREHGGRWAVKSRTSFSKISS----VHVCGRRYRFEGEGD------IQRFQRDFV 53

Query: 146 SRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------- 194
           SR+ ++YR+ F P+    +TSD GWGCMLRS QM++AQ+LL H L R W           
Sbjct: 54  SRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQSLLLHFLPRDWTWAEGLGSAEP 113

Query: 195 ---------RKPL------------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
                    R P             +   +R + +I+  F D   +PF +H L++ G++ 
Sbjct: 114 AGSASPSRYRGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPGAPFGLHRLVELGQSS 173

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
           G  AG W GP         +A   R       +   + +YV       +   A +V   D
Sbjct: 174 GKKAGDWYGP-------SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPD 226

Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
            +          A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S 
Sbjct: 227 PT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRLELCLGIMGGKPRHSL 276

Query: 354 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 413
           Y +G Q++  +YLDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  
Sbjct: 277 YFIGYQDDFLLYLDPHYCQPTVDVSQTDFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAG 334

Query: 414 DKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHS 451
            + +F+  C+  +++   S+     P+FT+ + H +  +HS
Sbjct: 335 GRKEFETLCSELTRVLGSSSATERYPMFTLAEGHAQ--DHS 373


>gi|148228573|ref|NP_001085611.1| cysteine protease ATG4A [Xenopus laevis]
 gi|61211771|sp|Q6GPU1.1|ATG4A_XENLA RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|49115669|gb|AAH73017.1| MGC82614 protein [Xenopus laevis]
          Length = 397

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 107/320 (33%), Positives = 163/320 (50%), Gaps = 21/320 (6%)

Query: 138 AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
            +   D  SR+  +YRK F PIG +  +SD GWGCMLR  QM++AQAL+   LGR WR  
Sbjct: 45  CDLQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALVCQHLGRDWRWE 104

Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
             K    EY +IL  F D +   +SIH + Q G   G + G W GP  + +  + LA   
Sbjct: 105 KHKNHPEEYQQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFD 164

Query: 258 RAETGLGCQSLPMAIY------VVSGDEDGERGGAPVVC-IDDASRHCSVFSKGQ----- 305
              +        +A+Y      VV  D        P  C +  A+ H S +S+ +     
Sbjct: 165 EWNS--------LAVYVSMDNTVVVEDIKTMCKYQPQSCSMAQAASHQSTWSRCRDTSGH 216

Query: 306 -ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
            + W P+LL+VPL LG+  +NP Y+   +  F  PQSLG +GGKP  + Y +G   +  I
Sbjct: 217 CSGWRPLLLVVPLRLGINHINPVYVDAFKACFKMPQSLGALGGKPNHAYYFIGFSGDEII 276

Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
           YLDPH  Q  ++  +     D + +       + + ++DPS+A+GF+C+D++DF+++C  
Sbjct: 277 YLDPHTTQTFVDTEEAGTVQDQTYHCQKGPNSMKVLNLDPSVALGFFCKDENDFNNWCEV 336

Query: 425 ASKLAEESNGAPLFTVTQTH 444
             K   +     +F +T  H
Sbjct: 337 IEKEILKHQSLRMFELTPKH 356


>gi|428170513|gb|EKX39437.1| hypothetical protein GUITHDRAFT_143439 [Guillardia theta CCMP2712]
          Length = 332

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 150/285 (52%), Gaps = 40/285 (14%)

Query: 113 IWLLGVCHKIA------------QDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG 160
           +WLLGV + +A             ++ + D + N     F  D  SR+  SYR  F PI 
Sbjct: 70  VWLLGVRYTLAPPPMGQRGEGRETEQTVVDESQN-----FKLDMWSRLWFSYRYNFHPIS 124

Query: 161 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR---EYVEILHLFGDSE 217
            +++T+D GWGCM+RS QML+ QAL+ H LGR WR      ++    +Y ++L +F D  
Sbjct: 125 GTELTTDTGWGCMIRSGQMLIGQALVHHHLGRDWRLSHTSKYNELPSDYRKVLEMFLDHP 184

Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC-QSLPMAIYVVS 276
            +P SIH+ ++AG+  G  AG+W GP  +C ++  L     A   LG   +L +  Y   
Sbjct: 185 CAPLSIHSFVRAGQQVGKKAGTWFGPNTVCSAFSKL----HAGGALGSDNNLQLLAY--- 237

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
              DG  G       D+           QA   P+ +L+P  LG+  V+P YIP +   F
Sbjct: 238 ---DGNDG-------DNTIYKSEALELLQAG--PLFILLPTRLGVSSVDPSYIPKISHVF 285

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
           +FPQSLG +GGKP ++ Y +  Q E+  YLDPH  QP+INI + +
Sbjct: 286 SFPQSLGFIGGKPSSAHYFIASQGEAVYYLDPHTPQPLINISEKE 330


>gi|170032510|ref|XP_001844124.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167872594|gb|EDS35977.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 628

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 115/345 (33%), Positives = 164/345 (47%), Gaps = 59/345 (17%)

Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
           + G+  F +DF SR+ ++YRK F  + DS  TSD GWGCM+RS QML+AQ L+ H LGR 
Sbjct: 188 DEGIEAFKRDFISRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLITHFLGRG 247

Query: 194 WR-----KPLQKPFDREYVE------ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSW 240
           WR     + L+  FD    E      I+  FGD  S TSPFSIH L+  GK  G   G W
Sbjct: 248 WRWDPSQEGLRLNFDSLQYEDGIHRKIIRWFGDTSSRTSPFSIHTLVALGKEAGKKPGDW 307

Query: 241 VGPYAMCRSWEALARCQRAET----GLGCQ-SLPMAIYVVSGDEDGERGGAPVV------ 289
            GP ++        +    E     G+    +   A+Y+    ++      P V      
Sbjct: 308 YGPGSVAHLLRQAVKLAAKEITDLDGINVYVAQDCAVYIQDILDECTVSTTPSVAPWQKK 367

Query: 290 ------CIDDASR------------------HCSVF---------SKGQADWTPILLLVP 316
                 C D  S+                  H + F         S   + W  ++LLVP
Sbjct: 368 MSSAAACTDSPSQATTPRVGATASCSSSSSPHATGFVAPSDTADESAPGSHWKSLILLVP 427

Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
           L LG EK+NP Y   L+   +    +GI+GG+P  S + VG QE+  I+LDPH  Q +++
Sbjct: 428 LRLGTEKLNPIYNDCLKAMLSLDNCIGIIGGRPKHSLFFVGYQEDKLIHLDPHYCQDMVD 487

Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           + +++     S++H    R + L  +DPS  IGFYC  + DF  F
Sbjct: 488 VNQENFPV--SSFHCKSPRKMKLSKMDPSCCIGFYCATRKDFFKF 530


>gi|194759168|ref|XP_001961821.1| GF15159 [Drosophila ananassae]
 gi|190615518|gb|EDV31042.1| GF15159 [Drosophila ananassae]
          Length = 402

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 106/357 (29%), Positives = 172/357 (48%), Gaps = 42/357 (11%)

Query: 94  IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYR 153
           + + V G     I    +D+W+LG  +   Q+  L             +D  SR+  +YR
Sbjct: 31  VGQAVGGGESEDIPRRNTDVWVLGKRYNAIQELEL-----------IRRDIQSRLWCTYR 79

Query: 154 KGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILH 211
            GF P+G+ ++T+D GWGCMLR  QM++AQAL+   LGR W      P  R+  Y++I++
Sbjct: 80  CGFAPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPECRDATYLKIVN 136

Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
            F D + S +SIH +   G++   A G W+GP  + +  + L R              +A
Sbjct: 137 RFEDVKNSCYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVRFD--------DWCSLA 188

Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
           ++V              V +DD    C    +    W P+LL++PL LG+  +NP Y+P 
Sbjct: 189 VHVAMDS---------TVVLDDIYSLC----REGDSWKPLLLVIPLRLGITDINPMYVPA 235

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----DLEADTS 387
           L+       S G++GG+P  + Y +G  ++  +YLDPH  Q    +G+     + E D  
Sbjct: 236 LKRCLELDSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGTVGQKTGVGEQEYD-E 294

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           TYH      ++  ++DPSLA+ F C+  D F+    +  +         LF ++QT 
Sbjct: 295 TYHQKHAARLNFSAMDPSLAVCFLCKTSDSFESLLTKFRQEVLGLCSPALFEISQTR 351


>gi|431822415|ref|NP_001258915.1| cysteine protease ATG4A isoform 1 [Gallus gallus]
          Length = 397

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 171/356 (48%), Gaps = 52/356 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H + +D++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W+    K    EY  ILH F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 328
           D  + C                  S   + +        W P+LL++PL LG+  +NP Y
Sbjct: 181 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 240

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 241 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 300

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +       + + ++DPS+A+GF+C+++ DFD++C+   K   +     +F + Q H
Sbjct: 301 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 356


>gi|398389911|ref|XP_003848416.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
 gi|339468291|gb|EGP83392.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
          Length = 440

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 155/307 (50%), Gaps = 45/307 (14%)

Query: 138 AEFNQDFSSRILISYRKGFDPI----------------------GDSKITSDVGWGCMLR 175
           ++F  DF SR+ ++YR  F PI                           TSD GWGCM+R
Sbjct: 109 SQFLDDFESRVWMTYRNNFPPIQKASDPAATSNMSFATKLRSLANQGNFTSDTGWGCMIR 168

Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
           S Q L+A  ++  RLGR WR+  +   ++++ EIL +F D+  +PFSIH  ++ G  A G
Sbjct: 169 SGQSLLANTVVMLRLGRDWRRGQK---EKQHHEILSMFADTPEAPFSIHKFVEHGASACG 225

Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
              G W GP        A ARC RA T      + + +Y    D D        V ID  
Sbjct: 226 TYPGEWFGP-------SATARCIRALTE-KYHDVGLRVYARPNDSD--------VYIDTL 269

Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
           +   +  S  +  ++P L+++ + LG+EKV P Y   L+     PQS+GI GG+P +S Y
Sbjct: 270 TATTTQHSASET-FSPTLIVLGVRLGIEKVTPAYHAALKSILELPQSVGIAGGRPSSSHY 328

Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
            VG Q +   YLDPH  +P++         D  + H+  IR + +  +DPS+ +GF  RD
Sbjct: 329 FVGHQGDHFFYLDPHTTRPMLTAQP--TAEDVESCHTRRIRRLSIAEMDPSMLLGFLVRD 386

Query: 415 KDDFDDF 421
           K+DF+D+
Sbjct: 387 KEDFEDW 393


>gi|341885317|gb|EGT41252.1| hypothetical protein CAEBREN_15768 [Caenorhabditis brenneri]
          Length = 457

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 111/368 (30%), Positives = 170/368 (46%), Gaps = 60/368 (16%)

Query: 127 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 185
           ALG +    +G+    +  SSR   +YRK F PIG +  TSD GWGCMLR +QML+ + L
Sbjct: 34  ALGKEITEEDGIEAMKKYMSSRFWFTYRKDFSPIGGTGPTSDQGWGCMLRCAQMLLGEVL 93

Query: 186 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
           L   +GR +   ++      Y +IL +F D + + +SIH + Q G   G     W GP  
Sbjct: 94  LRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIHQIAQMGVTEGKEISKWFGPNT 152

Query: 246 MCR---------SWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 289
             +          W  +A     +  L  + +L MA    S D      E+G+       
Sbjct: 153 AAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTYPSEDAVKLIMENGQ------- 205

Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
                 +H +  +  + +W P+LL++PL LGL  +N  Y+P ++  F  PQ +GI+GGKP
Sbjct: 206 ----VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCYLPAIQEFFKLPQCVGIIGGKP 261

Query: 350 GASTYIVGVQEESAIYLDPHDVQPV------------------INIGK-DDLE------- 383
             + Y VG+      YLDPH  +P                    N  + +DLE       
Sbjct: 262 NLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTESEQHDTNFSELEDLEPLPSQTS 321

Query: 384 -----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 438
                 D STYH  +++ +  +SIDPSLA+  +C  ++DFD+ C    K    ++  P+F
Sbjct: 322 DVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESREDFDNLCEELQKTTLPASKPPMF 381

Query: 439 TVTQTHKK 446
              +   K
Sbjct: 382 EFLEKRPK 389


>gi|326925485|ref|XP_003208945.1| PREDICTED: cysteine protease ATG4C-like [Meleagris gallopavo]
          Length = 458

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 117/403 (29%), Positives = 173/403 (42%), Gaps = 79/403 (19%)

Query: 108 SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 156
           S  S ++LLG C+    DE+  L     N           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155

Query: 195 -------------RKPLQKPFDREYV-----------EILH-----LFGDSETSPFSIHN 225
                        R+P      +E +           E+ H      FGDS  + F +H 
Sbjct: 156 KLTASLEASLTAEREPRILSNHQERIRRNCGDGEMRDEVYHRKIISWFGDSPLAAFGLHQ 215

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
           L++ GK  G  AG W GP  +           R     G     + +YV           
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQ--------D 262

Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
             V   D   R CS    G+ D   +++LVP+ LG E+ N  Y+  ++   +    +GI+
Sbjct: 263 CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGII 322

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DPS
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDPS 380

Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
             IGFYCR   DF+      +K+ + S+    PLFT  + H +
Sbjct: 381 CTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 423


>gi|431822417|ref|NP_001258916.1| cysteine protease ATG4A isoform 2 [Gallus gallus]
 gi|61211756|sp|Q5ZIW7.1|ATG4A_CHICK RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|53134379|emb|CAG32326.1| hypothetical protein RCJMB04_23b20 [Gallus gallus]
          Length = 380

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 171/356 (48%), Gaps = 52/356 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H + +D++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 12  VWILGRQHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 60

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W+    K    EY  ILH F D +   +SIH + Q G  
Sbjct: 61  MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVG 120

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 121 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 163

Query: 293 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 328
           D  + C                  S   + +        W P+LL++PL LG+  +NP Y
Sbjct: 164 DIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCTGWKPLLLIIPLRLGINHINPVY 223

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 224 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 283

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +       + + ++DPS+A+GF+C+++ DFD++C+   K   +     +F + Q H
Sbjct: 284 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 339


>gi|301772016|ref|XP_002921445.1| PREDICTED: cysteine protease ATG4D-like [Ailuropoda melanoleuca]
          Length = 445

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 111/384 (28%), Positives = 181/384 (47%), Gaps = 70/384 (18%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 55  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 104

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 105 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSAPSPSEPSGLASPNRYRGPAR 164

Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
                       L++  +R + +I+  F D   +PF +H L++ G++ G  AG W GP  
Sbjct: 165 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 220

Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
                  +A   R       +   + +YV       +   A +V   D +          
Sbjct: 221 -----SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT---------- 265

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 266 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 325

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 326 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSEL 383

Query: 426 SKLAEESNGA---PLFTVTQTHKK 446
           +++   S+     P+FT+ + H +
Sbjct: 384 TRVLSSSSATERYPMFTLAEGHAQ 407


>gi|281337397|gb|EFB12981.1| hypothetical protein PANDA_010312 [Ailuropoda melanoleuca]
          Length = 428

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 111/384 (28%), Positives = 181/384 (47%), Gaps = 70/384 (18%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 38  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 87

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 88  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSAPSPSEPSGLASPNRYRGPAR 147

Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
                       L++  +R + +I+  F D   +PF +H L++ G++ G  AG W GP  
Sbjct: 148 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 203

Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
                  +A   R       +   + +YV       +   A +V   D +          
Sbjct: 204 -----SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT---------- 248

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 249 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 308

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 309 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSEL 366

Query: 426 SKLAEESNGA---PLFTVTQTHKK 446
           +++   S+     P+FT+ + H +
Sbjct: 367 TRVLSSSSATERYPMFTLAEGHAQ 390


>gi|324506823|gb|ADY42901.1| Cysteine protease ATG4B [Ascaris suum]
          Length = 433

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 118/391 (30%), Positives = 176/391 (45%), Gaps = 62/391 (15%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           +  S + ++LLG  HK     A GD    + + E+    +SR+  +YRK F PIG +  T
Sbjct: 20  VFDSNTPVYLLG--HKFP---ARGDM---DSIKEY---VTSRLWFTYRKNFMPIGGTGPT 68

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           SD GWGCMLR  QML+AQAL+   LG  W        + +Y  IL +F D +  PFS+H 
Sbjct: 69  SDQGWGCMLRCGQMLLAQALIVRHLGTEWMWDRDNK-EEDYKRILRMFQDKKCCPFSLHQ 127

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALA---RCQRAETGLGCQSLPMAIYVVS------ 276
           + Q G +     G W GP    +  + L       R    +   +L +A  V +      
Sbjct: 128 IAQMGVSERKQIGEWFGPNTAAQVLKKLVVYDDWSRLAVHVALDNLLIASDVRTMAHTRP 187

Query: 277 --------------GDEDGERGGAPVVCIDDASRHCSVFS-----------KGQADWTPI 311
                          +E G   G   +C   + + C + S           + +  W P+
Sbjct: 188 PSRLSSRHTTENEQSEESGNASGGNSLCSFGSVKMCMLQSALMKECDENPVEDEEQWRPL 247

Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
           L++VPL LGL  +N  Y+P +   F  PQ  GI+GG+P  + Y +G+  E  IYLDPH  
Sbjct: 248 LIIVPLRLGLTSINRCYLPAIEAFFQLPQCTGIIGGRPNHALYFIGIAGEQLIYLDPHVC 307

Query: 372 QPVINIG----------------KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDK 415
           Q  I++                 K     D S+YH   + HI  DS DPSLA+ F CR +
Sbjct: 308 QAAIDLDERCASLQQQDGFVEVVKSTDIFDDSSYHCPFLLHIAYDSADPSLALSFICRTE 367

Query: 416 DDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
           ++++            ++  PLF + +T  K
Sbjct: 368 EEYEHLANNLKTKVLPASSPPLFELLETRPK 398


>gi|410950450|ref|XP_003981918.1| PREDICTED: cysteine protease ATG4D, partial [Felis catus]
          Length = 423

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 185/389 (47%), Gaps = 72/389 (18%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E+ GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 33  SRTSFSKISS----VHLCGRRYRFESEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 82

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 83  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWSEASGLGPSEPSGLASPNRYRGPAR 142

Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
                       L++  +R + +I+  F D   +PF +H L++ G++ G  AG W GP  
Sbjct: 143 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 198

Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
                  +A   R       +   + +YV       +   A +V   D +          
Sbjct: 199 -----SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT---------- 243

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 244 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 303

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 304 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 361

Query: 426 SKLAEESNGA---PLFTVTQTHKKPVNHS 451
           +++   S+     P+FT+ + H +  +HS
Sbjct: 362 TRVLSCSSATERYPMFTLAEGHAQ--DHS 388


>gi|395850895|ref|XP_003798008.1| PREDICTED: cysteine protease ATG4D [Otolemur garnettii]
          Length = 471

 Score =  175 bits (443), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 104/349 (29%), Positives = 165/349 (47%), Gaps = 54/349 (15%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
           G   +  F +DF SR+  +YR+ F P+    +TSD GWGCMLRS QM++AQ LL H L R
Sbjct: 104 GEGDIQRFQRDFVSRLWFTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 163

Query: 193 PW--------------------RKPLQKPFDR------------EYVEILHLFGDSETSP 220
            W                    R P +    R            ++ +I+  F D   +P
Sbjct: 164 DWTWAEGRGLGPPELLASPSQYRVPARWMPPRWAQGTPELEQEHQHRQIVSWFADHPQAP 223

Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
           FS+H L++ G++ G  AG W GP         +A   R       +   + +YV      
Sbjct: 224 FSLHRLVELGQSLGKKAGDWYGP-------SVVAHILRKAVESCSEVTHLVVYVSQDCTV 276

Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
            +   A +V   D +          A+W  +++LVP+ LG E +NP Y+P ++       
Sbjct: 277 YKADVARLVARPDPT----------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSEL 326

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
            LGI+GGKP  S Y +G Q++  +YLDPH  QP ++I + D   +  ++H    R +   
Sbjct: 327 CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDISQADFPLE--SFHCTAPRKMAFT 384

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 446
            +DPS  +GFY   K +F+  C+  +++   S+     P+FT+ + H +
Sbjct: 385 KMDPSCTVGFYAGGKKEFETLCSELTRVLSSSSAMERYPMFTLAEGHAQ 433


>gi|449508713|ref|XP_002198788.2| PREDICTED: cysteine protease ATG4C [Taeniopygia guttata]
          Length = 456

 Score =  175 bits (443), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 117/402 (29%), Positives = 179/402 (44%), Gaps = 79/402 (19%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAA--GNN----------GLAEFNQDFSSRILISYRKG 155
           S  S ++LLG C+    +E+ G+ +  G+N           + EF +DF SRI ++YR+ 
Sbjct: 36  SRNSPVFLLGKCYHFKTEES-GELSTDGSNFDKISTEISGNVEEFRKDFISRIWLTYREE 94

Query: 156 FDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------- 194
           F  I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                     
Sbjct: 95  FPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPEALDMESCDWESWTSSTV 154

Query: 195 ---------------------RKPLQKPFD----REYV---EILHLFGDSETSPFSIHNL 226
                                R P ++ +D    R  V   +I+  FGDS  + F +H L
Sbjct: 155 RKLTASLEASLTAERDPKVLARPPARRDWDGTEKRNEVYHRKIISWFGDSPLAAFGLHQL 214

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGA 286
           ++ GK  G  AG W GP  +           R     G     + +YV            
Sbjct: 215 IEYGKKSGKMAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD--------C 261

Query: 287 PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
            V   D   R CS+   G+A    +++L P+ LG E+ N  Y+  ++   +    +GI+G
Sbjct: 262 TVYSSDVIDRQCSLVDSGKAGTKAVIILFPVRLGGERTNTDYLEFVKGILSLEYCVGIIG 321

Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 406
           GKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DPS 
Sbjct: 322 GKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDPSC 379

Query: 407 AIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
            IGFYCR   DF+      +K+ + S+    PLFT  + H +
Sbjct: 380 TIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 421


>gi|322785465|gb|EFZ12136.1| hypothetical protein SINV_15051 [Solenopsis invicta]
          Length = 505

 Score =  174 bits (442), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 119/369 (32%), Positives = 173/369 (46%), Gaps = 66/369 (17%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAG--------------------NNGLAEFNQDFSSR 147
           S  S +WLLG C+    +  L  A+                      N + EF +DF SR
Sbjct: 80  SKESPVWLLGQCYLKKSEYPLERASEALEPVGTGSQVSLAMDATNFENTIEEFKRDFMSR 139

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR-PWR-KPLQKPFDRE 205
           + ++YR+ F  +  S  T+D GWGCMLRS QM++AQAL+ H LGR  WR +P Q   +  
Sbjct: 140 LWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRGQWRWRPEQLTDESS 199

Query: 206 YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSWEALARCQRA 259
           +  I+  FGD  T  SPFSIH L+  G + G  AG W GP    + +C++ E      RA
Sbjct: 200 HRMIIKWFGDQLTPESPFSIHKLVVLGASTGKRAGDWYGPSSVAHLLCQAME------RA 253

Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
                 +   +A+YV        +    V C  D  R              ++LLVPL L
Sbjct: 254 SEDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGRRKA------------LILLVPLRL 301

Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ------- 372
           G +K+NP Y P L    T    +G++GG+P  S Y +G Q++  I+LDPH  Q       
Sbjct: 302 GADKLNPVYAPCLTALLTLDTCIGVIGGRPRHSLYFIGYQDDKLIHLDPHYCQNEFYFRI 361

Query: 373 --------PVINIGKD-DLEAD----TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
                   P + I +  D+E +     +++H    R + L  +DPS  +GFY  DK+   
Sbjct: 362 LLSITDSLPYLFIQETVDVEGNEKFPLTSFHCTSPRKMLLSKMDPSCCVGFYFPDKESLT 421

Query: 420 DFCARASKL 428
           DF     ++
Sbjct: 422 DFMETIQRI 430


>gi|57101974|ref|XP_542069.1| PREDICTED: cysteine protease ATG4D isoform 1 [Canis lupus
           familiaris]
          Length = 473

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 111/384 (28%), Positives = 181/384 (47%), Gaps = 70/384 (18%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGPGLGPSEPAGLASPNRYRGPAR 192

Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
                       L++  +R + +I+  F D   +PF +H L++ G++ G  AG W GP  
Sbjct: 193 WMPPRWAQGTPELEQ--ERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-- 248

Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
                  +A   R       +   + +YV       +   A +V   D +          
Sbjct: 249 -----SLVAHILRKAVESCSEITRLVVYVSQDCTVYKADVARLVARPDPT---------- 293

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 294 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 353

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 354 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSEL 411

Query: 426 SKLAEESNGA---PLFTVTQTHKK 446
           +++   S+     P+FT+ + H +
Sbjct: 412 TRVLSSSSATERYPMFTLAEGHAQ 435


>gi|344282757|ref|XP_003413139.1| PREDICTED: cysteine protease ATG4D-like [Loxodonta africana]
          Length = 473

 Score =  174 bits (441), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 112/382 (29%), Positives = 181/382 (47%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S   S ++L G  ++    E+ GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSK-ISSVYLCGHRYRF---ESEGD------IQRFQRDFMSRLWLTYRRDFPPLAG 132

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKPLQ 199
             +TSD GWGCMLRS QML+AQ LL H L R W                      R P +
Sbjct: 133 GCLTSDCGWGCMLRSGQMLLAQGLLLHFLPRDWTWAEGSGLGPPELSGSASPSRYRGPAR 192

Query: 200 K----------PFDREY--VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
           +            ++E+   +I+  F D   +PF +H L+  G++ G  AG W GP    
Sbjct: 193 RVPPHWAQCTPELEQEHWHRQIVSWFADHPQAPFGLHRLVALGQSSGKKAGDWYGP---- 248

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D           +A+
Sbjct: 249 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDP----------KAE 295

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 296 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 355

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 356 PHYCQPSVDVSQADFSLE--SFHCTSPRKMAFTKMDPSCTVGFYAGDRKEFETLCSELTR 413

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 414 VLSSSSATERYPMFTLAEGHAQ 435


>gi|410226434|gb|JAA10436.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
 gi|410263516|gb|JAA19724.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
 gi|410294648|gb|JAA25924.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
 gi|410328737|gb|JAA33315.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
          Length = 474

 Score =  174 bits (441), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436


>gi|402904206|ref|XP_003914938.1| PREDICTED: cysteine protease ATG4D isoform 1 [Papio anubis]
          Length = 474

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436


>gi|157126425|ref|XP_001660889.1| hypothetical protein AaeL_AAEL010516 [Aedes aegypti]
 gi|108873276|gb|EAT37501.1| AAEL010516-PA [Aedes aegypti]
          Length = 583

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 112/334 (33%), Positives = 154/334 (46%), Gaps = 65/334 (19%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR- 195
           +  F +DF +R+ ++YRK F  + DS  TSD GWGCM+RS QML+AQ LL H LGR WR 
Sbjct: 167 IEAFKRDFVTRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLLVHFLGRNWRW 226

Query: 196 ----KPLQKPF------DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 243
               + L+  +      D  + +I+  FGD  S TSPFSIH L+  GK  G   G W GP
Sbjct: 227 DATAESLRMNYHSLNYEDNVHRKIIRWFGDTSSRTSPFSIHTLVALGKETGKKPGDWYGP 286

Query: 244 YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC---IDDASRHCSV 300
                   ++A   R    L  Q +         D DG        C   I D    C+V
Sbjct: 287 -------GSVAHLLRQAVKLAAQEI--------SDLDGVNVYVAQDCAVYIQDIIDECTV 331

Query: 301 FS---------------------------------KGQADWTPILLLVPLVLGLEKVNPR 327
            +                                      W  ++LLVPL LG EK+NP 
Sbjct: 332 SAGPTLAPWQKKSPGSSSSSTTSTSNSNPTTSSSTDSTDHWKSLILLVPLRLGAEKLNPI 391

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           Y   L+   +    +GI+GG+P  S Y VG QE+  I+LDPH  Q ++++   +     +
Sbjct: 392 YSDCLKAMLSLDNCIGIIGGRPKHSLYFVGFQEDKLIHLDPHYCQDMVDVVNQE-NFPVA 450

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           ++H    R + L  +DPS  IGFYC  + DF  F
Sbjct: 451 SFHCKSPRKMKLSKMDPSCCIGFYCETRKDFFKF 484


>gi|397476490|ref|XP_003809632.1| PREDICTED: cysteine protease ATG4D isoform 1 [Pan paniscus]
          Length = 474

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 415 VLSSSSAMERYPMFTLAEGHAQ 436


>gi|296232881|ref|XP_002761778.1| PREDICTED: cysteine protease ATG4D [Callithrix jacchus]
          Length = 474

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR---------KPLQKPF---------- 202
             +TSD GWGCMLRS QM++AQ LL H L R W            L  P           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGPASPSWYHGPAR 193

Query: 203 ---------------DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
                          +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPCWAQGAPELEQERRHRQIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D S          A+
Sbjct: 250 ---SLVAHILRKAVESSSEVTRLLVYVSQDCTVYKADVARLVARPDPS----------AE 296

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WNSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + +   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQANFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S      P+FT+ + H +
Sbjct: 415 VLSSSAATERYPMFTLAEGHAQ 436


>gi|395512609|ref|XP_003760528.1| PREDICTED: cysteine protease ATG4D [Sarcophilus harrisii]
          Length = 453

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 168/361 (46%), Gaps = 56/361 (15%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
           G   +  F +DF SR+ ++YR+ F P+    +TSD GWGCMLRS QML+AQ LL H   R
Sbjct: 84  GEGDIQRFQRDFVSRLWLTYRRDFPPLEGGSLTSDCGWGCMLRSGQMLLAQGLLLHFFSR 143

Query: 193 PW-----------RKPL---------------------QKPFDRE--YVEILHLFGDSET 218
            W           R+P                       + F++E  +  I+  F D   
Sbjct: 144 DWTWSEAVLHPGPREPELLRTMSPSRVGPPGPPAGALSPREFEQEEQHRRIVSWFADQPG 203

Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 278
           +PF +H L++ G++ G  AG W GP         +A   R       +   + +YV    
Sbjct: 204 APFGLHRLVELGRSSGKRAGDWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDC 256

Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
              +   A +V   D S           +W  I++LVP+ LG E +NP Y+P ++     
Sbjct: 257 TVYKADVAQLVAQPDPS----------TEWKSIVILVPVRLGGETLNPVYVPCVKELLRL 306

Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
              +GI+GGKP  S Y +G Q++  +YLDPH  QP ++  ++    +  ++H    R + 
Sbjct: 307 ELCIGIIGGKPRHSLYFIGYQDDFLLYLDPHYCQPFVDTSQESFPLE--SFHCTSPRKMA 364

Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHSDVLG 455
              +DPS  IGFY  ++ +F+  C   +++   S+     P+FT+++ H +     +V  
Sbjct: 365 FSRMDPSCTIGFYAGNRKEFELLCLELTRVLNSSSATERYPMFTLSEGHAQEYGLEEVCS 424

Query: 456 E 456
           +
Sbjct: 425 Q 425


>gi|417401539|gb|JAA47652.1| Putative cysteine protease required for autophagy [Desmodus
           rotundus]
          Length = 473

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 111/384 (28%), Positives = 180/384 (46%), Gaps = 70/384 (18%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +    E+ GD      +  F +DF SR+ ++YR+ F P   
Sbjct: 83  SRTRFSKISS----VHLCGRRYCFESEGD------IQRFQRDFVSRLWLTYRRDFPPFAG 132

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 133 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWARGASLSPPEPSGLASSNRYRGPAH 192

Query: 195 --------RKP-LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
                   R P L++  +R + +I+  F D   +PF +H L++ G++ G  AG W GP  
Sbjct: 193 CMTPCWAQRAPELEQ--ERRHRQIVSWFADHPQAPFGLHQLVELGQSSGKKAGDWYGP-- 248

Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
                  +A   R       +   + +YV       +   A +V   D +          
Sbjct: 249 -----SLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVARPDPT---------- 293

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 294 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 353

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 354 LDPHYCQPAVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 411

Query: 426 SKLAEESNGA---PLFTVTQTHKK 446
           +++   S+     P+FT+ + H +
Sbjct: 412 TRVLSSSSTTERYPMFTLAEGHAQ 435


>gi|397476492|ref|XP_003809633.1| PREDICTED: cysteine protease ATG4D isoform 2 [Pan paniscus]
          Length = 411

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 112/387 (28%), Positives = 181/387 (46%), Gaps = 68/387 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 21  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 71  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 234 WKSVIILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351

Query: 428 LAEESNGA---PLFTVTQTHKKPVNHS 451
           +   S+     P+FT+ + H +  +HS
Sbjct: 352 VLSSSSAMERYPMFTLAEGHAQ--DHS 376


>gi|402904208|ref|XP_003914939.1| PREDICTED: cysteine protease ATG4D isoform 2 [Papio anubis]
          Length = 411

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 21  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 71  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 130

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ 373


>gi|114675367|ref|XP_512373.2| PREDICTED: cysteine protease ATG4D [Pan troglodytes]
          Length = 411

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 21  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 71  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 187 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ 373


>gi|195051960|ref|XP_001993206.1| GH13687 [Drosophila grimshawi]
 gi|193900265|gb|EDV99131.1| GH13687 [Drosophila grimshawi]
          Length = 393

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 106/364 (29%), Positives = 177/364 (48%), Gaps = 39/364 (10%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I    +++WLLG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPKRNANVWLLGKRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFVPLGEVQLT 91

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 223
           +D GWGCMLR  QM++AQAL+   LGR W      P   D  Y++I++ F D+  S +SI
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIELHLGRDW---FWTPDCRDTTYLKIVNRFEDTRKSFYSI 148

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H +   G++   A G W+GP  + +  + L R           SL + + + S       
Sbjct: 149 HQIALMGESQNKAVGEWLGPNTVAQILKILVRFD------DWSSLNVHVAMDS------- 195

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
                V +DD    C      ++ W P+LL+VPL LG+  +NP Y+P L+       S G
Sbjct: 196 ----TVVLDDIFTLCQ--EPSESAWKPLLLIVPLRLGISDINPIYVPALKRCLELNSSCG 249

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 400
           ++GG+P  + Y +G  ++  +YLDPH  Q    + +    A+     +YH      +   
Sbjct: 250 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRAGAVAQKTTAAEQELDESYHQKYAARLSFA 309

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGV 460
           ++DPSLA+ F C+ +D F++   +  +         LF ++Q+     + +D + E   +
Sbjct: 310 AMDPSLAVCFLCKTRDSFNELLQQLRQEVLSLCTPALFEISQSRAVDWDTADDI-EWPAM 368

Query: 461 PEDD 464
           P+ D
Sbjct: 369 PDID 372


>gi|297276108|ref|XP_002801111.1| PREDICTED: cysteine protease ATG4D-like isoform 2 [Macaca mulatta]
          Length = 497

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 107 SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 156

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 157 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 216

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 217 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 272

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 273 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 319

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 320 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 379

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 380 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 437

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 438 VLGSSSATERYPMFTLAEGHAQ 459


>gi|327306465|ref|XP_003237924.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
 gi|326460922|gb|EGD86375.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
          Length = 454

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 111/318 (34%), Positives = 160/318 (50%), Gaps = 58/318 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 174
           +F  DF S++ I+YR  F PI        GDS I                TSD GWGCM+
Sbjct: 119 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSISLGVRLRSQLIDTQGFTSDTGWGCMI 178

Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
           RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +  G  A 
Sbjct: 179 RSGQALLANTLLFIRLGRDWRRGSKL---QEESELVSLFADHPRAPFSIHRFVHHGATAC 235

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 292
           G   G W GP A  +  +AL +    + GL        +Y+ S G +  E+    V C +
Sbjct: 236 GKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVACDE 287

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
                            P L+L+ + LG+++V P Y  +L+    FPQS+GI GG+P +S
Sbjct: 288 SGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGRPSSS 335

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHLDSID 403
            Y +  Q +S  YLDPH  +P +    +   D E+      + STYH+  +R +H+  +D
Sbjct: 336 HYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHIREMD 395

Query: 404 PSLAIGFYCRDKDDFDDF 421
           PS+ IGF  RD+DD++D 
Sbjct: 396 PSMLIGFLVRDEDDWEDL 413


>gi|109123366|ref|XP_001101860.1| PREDICTED: cysteine protease ATG4D-like isoform 1 [Macaca mulatta]
          Length = 474

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 193

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 415 VLGSSSATERYPMFTLAEGHAQ 436


>gi|432099562|gb|ELK28703.1| Cysteine protease ATG4D, partial [Myotis davidii]
          Length = 392

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 114/392 (29%), Positives = 187/392 (47%), Gaps = 69/392 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +    E+ GD      +  F +DF+SR+ ++YR+ F P+  
Sbjct: 5   SRTSFSKISS----VHLCGRRYCFESEGD------IQRFQRDFASRLWLTYRRDFPPLAG 54

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 55  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGAGLSPPEPSGLASPNRHHGLAH 114

Query: 196 -KPLQ-----KPFDREYV--EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
            KP +        ++E+   +I+  F D   +PF +H L++ G+++G  AG W GP    
Sbjct: 115 WKPPRWAQGAPELEQEHWHRQIVSWFADHPQAPFGLHQLVELGQSWGKKAGDWYGP---- 170

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 171 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDCT----------AE 217

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++S +YLD
Sbjct: 218 WKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDSLLYLD 277

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ +     +  ++H    R +    +DPS  +GFY  ++ +F+  C+  ++
Sbjct: 278 PHYCQPTVDVSQAGFPLE--SFHCTSPRKMAFTKMDPSCTVGFYAGNRKEFETLCSELTR 335

Query: 428 LAEESNGA---PLFTVTQTHKKPVNHS-DVLG 455
           +   S      P+FT+ + H +  +HS D LG
Sbjct: 336 VLSSSAATQRYPMFTLAEGHAQ--DHSLDNLG 365


>gi|380796527|gb|AFE70139.1| cysteine protease ATG4D, partial [Macaca mulatta]
          Length = 439

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 178/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 49  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 98

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 99  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 158

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 159 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 214

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 215 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 261

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 262 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 321

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 322 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 379

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 380 VLGSSSATERYPMFTLAEGHAQ 401


>gi|195158262|ref|XP_002020011.1| GL13755 [Drosophila persimilis]
 gi|194116780|gb|EDW38823.1| GL13755 [Drosophila persimilis]
          Length = 678

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 114/318 (35%), Positives = 166/318 (52%), Gaps = 21/318 (6%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  +G+     G+  F +DF SR+ ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310

Query: 183 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR      L   + D  + +I+  FGD  S++SPFSIH L++ G+  G 
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430

Query: 288 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
            V    A R  +         Q  W  +++L+PL LG +K+NP Y   L+L  +    LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
           I+GGKP  S Y VG QE+  I+LDPH  Q +++I ++       ++H    R + +  +D
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQEHF--SLHSFHCKSARKLKVSKMD 548

Query: 404 PSLAIGFYCRDKDDFDDF 421
           PS  IGFYC  K DFD F
Sbjct: 549 PSCCIGFYCATKTDFDSF 566


>gi|332266032|ref|XP_003282019.1| PREDICTED: cysteine protease ATG4B [Nomascus leucogenys]
          Length = 518

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 172/367 (46%), Gaps = 42/367 (11%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 145 TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 191

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 192 DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 251

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 252 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 303

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 304 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 363

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 364 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 423

Query: 390 HSDVIRHIHLDSIDPSLAI--GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
                  + +  +DPS+A+  G +   +    + C    +L+      P+F + +     
Sbjct: 424 CQHPPCRMSIAELDPSIAVVRGGHRSTQAFCAECCLGMKQLSLLGGALPMFELVEQQPSH 483

Query: 448 VNHSDVL 454
           +   DVL
Sbjct: 484 LACPDVL 490


>gi|390177147|ref|XP_001357920.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
 gi|388858923|gb|EAL27056.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
          Length = 676

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 114/318 (35%), Positives = 166/318 (52%), Gaps = 21/318 (6%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVA 182
           A +  +G+     G+  F +DF SR+ ++YR+ F  +  S  TSD GWGCMLRS QML+A
Sbjct: 251 AVENQVGEQPWEEGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLA 310

Query: 183 QALLFHRLGRPWR----KPLQKPF-DREYVEILHLFGD--SETSPFSIHNLLQAGKAYGL 235
           Q L+ H LGR WR      L   + D  + +I+  FGD  S++SPFSIH L++ G+  G 
Sbjct: 311 QGLICHFLGRSWRYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGK 370

Query: 236 AAGSWVGPYAMCRSWE-ALARCQRAETGLGCQSLPMA----IYVVSGDED---GERGGAP 287
             G W GP ++    + AL    +        S+ +A    IY+   ++     E    P
Sbjct: 371 KPGDWYGPASVSYLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKP 430

Query: 288 VVCIDDASRHCSVF----SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
            V    A R  +         Q  W  +++L+PL LG +K+NP Y   L+L  +    LG
Sbjct: 431 HVPWQQAKRPQAEAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAHCLKLLLSTEHCLG 490

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
           I+GGKP  S Y VG QE+  I+LDPH  Q +++I ++       ++H    R + +  +D
Sbjct: 491 IIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDINQEHF--SLHSFHCKSARKLKVSKMD 548

Query: 404 PSLAIGFYCRDKDDFDDF 421
           PS  IGFYC  K DFD F
Sbjct: 549 PSCCIGFYCATKTDFDSF 566


>gi|27903825|ref|NP_116274.3| cysteine protease ATG4D [Homo sapiens]
 gi|61211809|sp|Q86TL0.1|ATG4D_HUMAN RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
           cysteine endopeptidase; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related cysteine endopeptidase
           4; AltName: Full=Autophagy-related protein 4 homolog D
 gi|27763975|emb|CAC85951.1| APG4-D protein [Homo sapiens]
 gi|46362497|gb|AAH68992.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [Homo sapiens]
 gi|119604524|gb|EAW84118.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
 gi|312151144|gb|ADQ32084.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
           construct]
          Length = 474

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R           + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436


>gi|194378178|dbj|BAG57839.1| unnamed protein product [Homo sapiens]
          Length = 411

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 112/387 (28%), Positives = 180/387 (46%), Gaps = 68/387 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 21  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 70

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 71  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 130

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 131 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 186

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R           + +YV       +   A +V   D +          A+
Sbjct: 187 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 233

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 234 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 293

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 294 PHYCQPTVDVSQADFPLE--SFHCTSPRRMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 351

Query: 428 LAEESNGA---PLFTVTQTHKKPVNHS 451
           +   S+     P+FT+ + H +  +HS
Sbjct: 352 VLSSSSATERYPMFTLAEGHAQ--DHS 376


>gi|62898327|dbj|BAD97103.1| APG4 autophagy 4 homolog D variant [Homo sapiens]
          Length = 474

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPAR 193

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 194 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 249

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R           + +YV       +   A +V   D +          A+
Sbjct: 250 ---SLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 296

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 297 WMSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLD 356

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 357 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 414

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 415 VLSSSSATERYPMFTLAEGHAQ 436


>gi|326924562|ref|XP_003208495.1| PREDICTED: cysteine protease ATG4A-like, partial [Meleagris
           gallopavo]
          Length = 421

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 170/356 (47%), Gaps = 52/356 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H + +D++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 53  VWILGRRHHLNEDKS-----------KLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGC 101

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W+    K    EY  IL  F D +   +SIH + Q G  
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 161

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204

Query: 293 DASRHC------------------SVFSKGQ------ADWTPILLLVPLVLGLEKVNPRY 328
           D  + C                  S   + +        W P+LL++PL LG+  +NP Y
Sbjct: 205 DIKKMCWSPPQSSSTAHSSAHLHRSALGRNRNTAGLCTGWKPLLLIIPLRLGINHINPVY 264

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 265 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDQSF 324

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +       + + ++DPS+A+GF+C+++ DFD++C+   K   +     +F + Q H
Sbjct: 325 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLQMFELVQKH 380


>gi|327277326|ref|XP_003223416.1| PREDICTED: cysteine protease ATG4A-like [Anolis carolinensis]
          Length = 385

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 108/347 (31%), Positives = 171/347 (49%), Gaps = 35/347 (10%)

Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
           W+LG  H++  +++           +   D S+R+  +YR+ F PIG +  +SD GWGCM
Sbjct: 17  WILGRQHQLKTEKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGCM 65

Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
           LR  QM++AQAL+   LGR W     K    EY  IL  F D +   +SIH + Q G   
Sbjct: 66  LRCGQMMLAQALICRHLGRDWHWEEHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGE 125

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGER------ 283
           G + G W GP  + +  + LA      +        +A+YV   +    ED ++      
Sbjct: 126 GKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCRLPN 177

Query: 284 GGAPVVCIDDASRHCSVFSKGQAD------WTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
              P V       H S+ S+ ++       W P+LL++PL LG+  +NP Y+   +  F 
Sbjct: 178 QNCPPVAHCSPLSHQSLLSRNRSPGGFCCGWKPLLLIIPLRLGINHINPVYVDAFKECFK 237

Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 397
            PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S +       +
Sbjct: 238 MPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQLFVDSEENSTVDDRSFHCQQAPHRM 297

Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
            + ++DPS+A+GF+C+++ DFD +C+   K   +     +F + Q H
Sbjct: 298 KIMNLDPSVALGFFCKEEKDFDTWCSLVQKEIHKQQSLRMFELIQKH 344


>gi|351710014|gb|EHB12933.1| Cysteine protease ATG4D [Heterocephalus glaber]
          Length = 607

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 108/382 (28%), Positives = 177/382 (46%), Gaps = 66/382 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S   S I+L G  ++           G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 216 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 265

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 266 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWMWIEGPGLAHPELPGSASSSQGRGPAR 325

Query: 198 ----------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
                     L++  +  + +I+  F D   +P  +H L++ G++ G  AG W GP    
Sbjct: 326 WMPPSCPWGALEREQELRHRQIVSWFADHPRAPLGLHRLVELGQSSGKKAGDWYGP---- 381

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   +A+YV       +   A +V   D +          A+
Sbjct: 382 ---SLVAHILRKAVESSSELTHLAVYVSQDCTVYKADVAHLVASPDPA----------AE 428

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 429 WKSVVILVPVRLGGETLNPVYVPGVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLD 488

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  ++ + +  C+  ++
Sbjct: 489 PHYCQPTVDVSQADFSLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKELETLCSELTR 546

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 547 ILSSSSATERYPMFTLVEGHAQ 568



 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 39/93 (41%), Positives = 52/93 (55%), Gaps = 10/93 (10%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S   S I+L G  ++           G   +  F +DF SR+ ++YR+ F P+  
Sbjct: 130 SRTSFSK-ISSIYLCGRRYRFE---------GEGDIQRFQRDFVSRLWLTYRRDFPPLAG 179

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
             +TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 180 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGW 212


>gi|453080987|gb|EMF09037.1| putative cysteine protease atg4 [Mycosphaerella populorum SO2202]
          Length = 447

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 111/332 (33%), Positives = 158/332 (47%), Gaps = 49/332 (14%)

Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLR 175
           ++F  DF SRI I+YR GF PI  S                        TSD GWGCM+R
Sbjct: 110 SDFIDDFESRIWITYRDGFPPIAKSTDPAAGSKMSFTTKLRSLTNQQGFTSDTGWGCMIR 169

Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYG 234
           S Q L+A  +L HRLGR WRK  ++    E+  IL LF D+  +PFSIH  ++ G +A G
Sbjct: 170 SGQSLLANTILLHRLGRDWRKGQKQ---EEHKNILSLFADTPEAPFSIHKFVEHGAQACG 226

Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
              G W GP        A ARC RA T        + +Y    D D            DA
Sbjct: 227 TYPGEWFGP-------NATARCLRALTD-KYHGAGLRVYARPNDSD---------VYADA 269

Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
               +        + P L+++ + LG+EKV   Y   L+     PQS+GI GG+P +S Y
Sbjct: 270 LIETATQKDADDKFQPTLIVLGIRLGIEKVTSAYHVALKAALELPQSVGIAGGRPSSSHY 329

Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
            +G Q +S  YLDPH  + +++        D  T H+  IR + L  +DPS+ +GF  R 
Sbjct: 330 FLGHQGDSFFYLDPHTTRHMLSPQPS--AEDIETCHTRRIRKLPLSEMDPSMLLGFLVRS 387

Query: 415 KDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
           +++F+++     K   E  G  +  + +T  K
Sbjct: 388 QEEFEEW----RKAVLEMPGKAIIHIHETEPK 415


>gi|449273759|gb|EMC83168.1| Cysteine protease ATG4A, partial [Columba livia]
          Length = 395

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 168/356 (47%), Gaps = 52/356 (14%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  D++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W+    K    EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 135

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 178

Query: 293 DASRHCSVFSKGQ------------------------ADWTPILLLVPLVLGLEKVNPRY 328
           D  + C    +G                           W P+LL++PL LG+  +NP Y
Sbjct: 179 DIKKMCWSPPQGSGAAHSSAHLHRSALGRTKNAAGFCTGWKPLLLIIPLRLGINHINPVY 238

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           I   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D S 
Sbjct: 239 IDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDESF 298

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +       + + ++DPS+A+GF+C+++ DFD++C+   K   +     +F + Q H
Sbjct: 299 HCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 354


>gi|449498615|ref|XP_002197397.2| PREDICTED: cysteine protease ATG4A [Taeniopygia guttata]
          Length = 412

 Score =  172 bits (435), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 109/358 (30%), Positives = 170/358 (47%), Gaps = 56/358 (15%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  D++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGRQHHLNTDKS-----------KLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W+    K    EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWQWEKHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQAD--------------------------WTPILLLVPLVLGLEKVNP 326
           D  + C  +S  Q+                           W P+LL++PL LG+  +NP
Sbjct: 181 DIKKMC--WSPAQSSSVAHSSAHVHRSALGQNKNTAGLCPGWKPLLLIIPLRLGINHINP 238

Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
            YI   +  F  PQSLG +GGKP  + Y +G      IYLDPH  Q  ++  ++    D 
Sbjct: 239 VYIDAFKECFKMPQSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSFVDSEENGTVDDK 298

Query: 387 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           S +       + + ++DPS+A+GF+C+++ DFD++C+   K   +     +F + Q H
Sbjct: 299 SFHCQQAPHRMKIMNLDPSVALGFFCKEECDFDNWCSLVQKEILKQQSLRMFELVQKH 356


>gi|350631770|gb|EHA20141.1| hypothetical protein ASPNIDRAFT_178675 [Aspergillus niger ATCC
           1015]
          Length = 384

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 121/378 (32%), Positives = 180/378 (47%), Gaps = 54/378 (14%)

Query: 92  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 151
           +RI + +  P         S IW LG+ +   +D A      +     F  DF SRI ++
Sbjct: 11  KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKD-ANTRETQHAWPESFLLDFESRIWMT 69

Query: 152 YRKGFDPI----GDSK-------------------ITSDVGWGCMLRSSQMLVAQALLFH 188
           YR  F PI    GD K                    TSD GWGCM+RS Q L+A AL   
Sbjct: 70  YRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTGWGCMIRSGQSLLANALSML 129

Query: 189 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMC 247
            LGR WR+  +  F+ E  ++L LF D+ T+PFS+H  ++ G ++ G   G W GP A  
Sbjct: 130 VLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKYPGEWFGPSATA 186

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
           +  EAL+          C +  + +YV +   +  +         D +R+ S        
Sbjct: 187 KCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK-----FMDIARNTS------GA 227

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           + P L+L+   LG++ + P Y   L+    FPQS+GI GG+P AS Y VG Q     YLD
Sbjct: 228 FQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGRPSASHYFVGAQGSHLFYLD 287

Query: 368 PHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
           PH  +P +     G+   + +  TYH+  +R IH+  +DPS+ IGF  R+++D+ D+  R
Sbjct: 288 PHYTRPALPDRQEGELYSKEEVDTYHTRRLRRIHVRDMDPSMLIGFLIRNQEDWADWLKR 347

Query: 425 ASKLAEESNGAPLFTVTQ 442
                E   G P+  V +
Sbjct: 348 ----IEAVKGRPIIHVLK 361


>gi|334326299|ref|XP_001366933.2| PREDICTED: cysteine protease ATG4D [Monodelphis domestica]
          Length = 482

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 105/398 (26%), Positives = 176/398 (44%), Gaps = 77/398 (19%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
           +S S +  + +C +  Q E  GD      +  F +DF+SR+ ++YR+ F P+    +TSD
Sbjct: 79  TSFSKLSTVHLCGRRYQFEGEGD------IQRFQKDFASRLWLTYRRDFPPLDGGSLTSD 132

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------------- 195
            GWGCMLRS QML+AQ LL H   R W                                 
Sbjct: 133 CGWGCMLRSGQMLLAQGLLLHFFSRDWTWAEAVLPPSPRESELFRSMSPSRSGASWQRGS 192

Query: 196 -----------------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAG 238
                             P Q   + ++  I+  F D   +PF +H L++ G++ G  AG
Sbjct: 193 STASGLGRATWSTGGTLSPRQLEQEEQHRRIVSWFADQPGAPFGLHRLVELGRSSGKRAG 252

Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
            W GP         +A   R       +   + +YV       +   A ++   D S   
Sbjct: 253 DWYGP-------SVVAHILRKAVESSSEVAQLEVYVSQDCTVYKADVAQLMAQPDPS--- 302

Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
                   +W  +++LVP+ LG E +NP Y+P ++        +GI+GGKP  S Y +G 
Sbjct: 303 -------TEWKSVIILVPVRLGGETLNPVYVPCVKELLRLDLCIGIIGGKPRHSLYFIGY 355

Query: 359 QEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
           Q++  +YLDPH  QP ++  ++    +  ++H    R +    +DPS  IGFY  ++ +F
Sbjct: 356 QDDFLLYLDPHYCQPCVDTSQERFPLE--SFHCTSPRKMAFSRMDPSCTIGFYAGNRKEF 413

Query: 419 DDFCARASKLAEESNGA---PLFTVTQTHKKPVNHSDV 453
           +  C   +++   S+     P+FT+++ H +  +  +V
Sbjct: 414 EMLCLELTRVLNSSSATERYPMFTLSEGHAQEYSLEEV 451


>gi|395840680|ref|XP_003793181.1| PREDICTED: cysteine protease ATG4C [Otolemur garnettii]
          Length = 457

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 123/422 (29%), Positives = 181/422 (42%), Gaps = 80/422 (18%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      ++E L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENEMLSARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPGALNIENSDSESWTSHTVK 155

Query: 198 ---------------LQKP-------------FDREYVEILH-----LFGDSETSPFSIH 224
                          L+ P             +     EI H      FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETMRKYSDYHETRNEIYHRKIVSWFGDSPLAFFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           +     G     + IYV    +D    
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEAKHPDLQG-----ITIYVA---QDCTVY 267

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
            + V+    ASR     S+G  D   +++LVP+ LG E+ NP Y+  ++   +    +GI
Sbjct: 268 NSDVIDTQSASRT----SEGAED-KAVIILVPVRLGGERTNPDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  QP +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQPFVDVSVKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKKPVNHSDVLGETGGVPE 462
           S  IGFYCR+  DF+      +K+   S     PLFT    H +  + +          E
Sbjct: 381 SCTIGFYCRNVQDFERTSEEITKMLRISAKEKYPLFTFVNGHSRDYDFTSTTTNEDLFSE 440

Query: 463 DD 464
           D+
Sbjct: 441 DE 442


>gi|340931831|gb|EGS19364.1| cysteine protease-like protein [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 494

 Score =  171 bits (433), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 120/363 (33%), Positives = 173/363 (47%), Gaps = 65/363 (17%)

Query: 127 ALGDAAGNNG---LAEFNQDFSSRILISYRKGF-------DPIGDSKIT----------- 165
           A GDA G         F  DF SRI ++YR GF       DP   S ++           
Sbjct: 139 AYGDADGTTDGGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSALSFSMRLKTSFGA 198

Query: 166 ------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 219
                 SD GWGCM+RS Q L+A ALL  RLGR WR+      +RE   IL LF D   +
Sbjct: 199 DQAGFSSDTGWGCMIRSGQSLLANALLISRLGREWRRGQNPKAERE---ILSLFADDPRA 255

Query: 220 PFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 278
           P+S+HN ++ G +A G   G W GP A  R  +ALA    +E         + +Y     
Sbjct: 256 PYSLHNFVKHGAEACGKFPGEWFGPSATARCIQALANKHESE---------LRVYST--- 303

Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
                G  P V  D      ++ +     + P L+LV   LG++K+N  Y   L  T   
Sbjct: 304 -----GDLPDVYEDS---FMAIANPDGQHFHPTLVLVCTRLGIDKINKVYEQALISTLQM 355

Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIR 395
            QS+GI GG+P  S Y +GVQ++   YLDPH  +P++      +D  + +  + H+  +R
Sbjct: 356 EQSIGIAGGRPSQSHYFIGVQDQWLFYLDPHYPRPMLPYRENPEDYTQEEVDSCHTRRLR 415

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
           H+H++ +DPS+ IGF  +D+DD+D + +    +     G  + TV+        H   LG
Sbjct: 416 HLHVEDLDPSMLIGFLIKDEDDWDTWKSAVKHV----QGKAIITVSP-------HDPALG 464

Query: 456 ETG 458
            TG
Sbjct: 465 GTG 467


>gi|334350077|ref|XP_001376474.2| PREDICTED: cysteine protease ATG4A-like [Monodelphis domestica]
          Length = 417

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 115/373 (30%), Positives = 182/373 (48%), Gaps = 25/373 (6%)

Query: 82  VKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN 141
           V   V  G  R I     GP    +  +   +W+LG  + +         A     ++  
Sbjct: 19  VTLCVFPGVKRHITILSDGPEE--LPETDEPVWILGKQYDLQ--------AVITEKSKLL 68

Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
            D S+R+  +YR+ F PIG +  +SD GWGCMLR  QM++AQAL+   LGR W   +Q+ 
Sbjct: 69  SDISARLWFTYRRKFSPIGGTGPSSDSGWGCMLRCGQMMLAQALICKHLGRDWCWEMQQE 128

Query: 202 FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------YAMCRSWEA 252
              EY  IL  F D +   +SIH + Q G   G + G W GP          A+   W +
Sbjct: 129 QPEEYHRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNS 188

Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 312
           LA     +  +  + +    ++       +   +P   +D  S H    S G   W P+L
Sbjct: 189 LAVYVSMDNTVVIEDIKKLCHMCPSHLTHDSSPSPGNGLDQ-STHLPEPSPG---WKPLL 244

Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
           L++PL LG+ ++NP YI   +  F  PQSLG +GGKP ++ Y +G      IYLDPH  Q
Sbjct: 245 LIIPLRLGINQINPVYIDAFKECFKMPQSLGALGGKPNSAYYFIGFLGNELIYLDPHTTQ 304

Query: 373 PVINIGKDDLEADTSTYHSDVIRH-IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
             ++  ++D   D  ++H     H + + ++DPS+A+GF+ ++++DFD++C    K   +
Sbjct: 305 TFVD-SEEDGTVDDQSFHCQQSPHRMQILNLDPSVALGFFFKEEEDFDNWCRLVQKEILK 363

Query: 432 SNGAPLFTVTQTH 444
                +F + Q H
Sbjct: 364 PQSLQMFELVQKH 376


>gi|225685095|gb|EEH23379.1| peptidase family C54 [Paracoccidioides brasiliensis Pb03]
          Length = 508

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 110/345 (31%), Positives = 164/345 (47%), Gaps = 51/345 (14%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVG 169
           G++  A F  DF S+I ++YR GF  I  S                         T+D G
Sbjct: 138 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 197

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+RS Q L+A AL    LGR WR+  +   D+E   +L LF D   +PFSIH  ++ 
Sbjct: 198 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 254

Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G  A G   G W GP A  R  +AL+          C+   + +YV S   D        
Sbjct: 255 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 298

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +D  R  +     +A   P L+L+ + LG+++V P Y   L+    +PQS+GI GG+
Sbjct: 299 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 357

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           P +S Y +G Q     YLDPH  +P +     G+   E + ++YH+  +R +H+  +DPS
Sbjct: 358 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 417

Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNH 450
           + IGF  +D+DD+ D+      +A    G  +  V+     P  H
Sbjct: 418 MLIGFLIKDEDDWADWKRNVGSVA----GKAIVHVSDKENSPFGH 458


>gi|406862068|gb|EKD15120.1| putative cysteine protease atg4 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 441

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 111/307 (36%), Positives = 153/307 (49%), Gaps = 50/307 (16%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
           F  DF S+I ++YR  F  I  S+                        TSD GWGCM+RS
Sbjct: 103 FLDDFESKIWLTYRSQFPAIPKSQDPKALSSMSLSVRLRSQLVDQAGFTSDTGWGCMIRS 162

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
            Q L+A AL+  R+GR WR+       +E   I+ LF D+ T+P+SIHN ++ G A  G 
Sbjct: 163 GQSLLANALVMLRMGRDWRR--GSSASQEERSIISLFADTPTAPYSIHNFVEHGAAACGK 220

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGDEDGERGGAPVVCIDDA 294
             G W GP A  R  +ALA         G QS  + +YV   G E  E     +   D  
Sbjct: 221 HPGEWFGPSATARCIQALAN--------GHQSPELRVYVTGDGLEVYEDSFMKIAKPD-- 270

Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
                    GQA + P L+LV   LGL+K+ P Y   L+ +   PQSLGI GG+P +S Y
Sbjct: 271 ---------GQA-FIPTLILVGTRLGLDKITPVYWEALKSSLQIPQSLGIAGGQPSSSHY 320

Query: 355 IVGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
            +GVQ     YLDPH  +P + +    +D  + D  + H+  +R IH+  +DPS+ I F 
Sbjct: 321 FIGVQGHHFFYLDPHQTRPALPLPDNIEDYSQEDIDSCHTRRLRRIHIKEMDPSMLIAFL 380

Query: 412 CRDKDDF 418
            RD+DD+
Sbjct: 381 IRDEDDW 387


>gi|315047608|ref|XP_003173179.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
 gi|311343565|gb|EFR02768.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
          Length = 471

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 110/324 (33%), Positives = 160/324 (49%), Gaps = 58/324 (17%)

Query: 139 EFNQDFSSRILISYRKGFDPI--------GDSKI----------------TSDVGWGCML 174
           +F  DF SR+ I+YR  F PI         DS +                TSD GWGCM+
Sbjct: 136 QFLDDFESRLWITYRSQFPPIPKMPKTGSSDSSMPLGVRLRSQLIDTQGFTSDTGWGCMI 195

Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
           RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +Q G  A 
Sbjct: 196 RSGQALLANTLLFLRLGRDWRRGSKI---QEESELVSLFADHPRAPFSIHRFVQHGATAC 252

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCID 292
           G   G W GP A  +  +AL +    + GL        +YV + G +  ER    V C +
Sbjct: 253 GKCPGEWFGPSAAAQCIQALVKSN-PQAGL-------RVYVTNDGSDIYERQFREVACDE 304

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
             S              P L+L+ + LG+++V P Y  +L+    +PQS+GI GG+P +S
Sbjct: 305 SGS------------IKPTLILLGVRLGIDRVTPIYWDSLKALLHYPQSVGIAGGRPSSS 352

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---------ADTSTYHSDVIRHIHLDSID 403
            Y +  Q +S  YLDPH  +P +    +  E          + STYH+  +R +H+  +D
Sbjct: 353 HYFIATQGDSFFYLDPHQTRPCLAPRSEPTEDEESHPYSPEELSTYHTRRLRRLHVREMD 412

Query: 404 PSLAIGFYCRDKDDFDDFCARASK 427
           PS+ IG   RD+ D++D  +R  +
Sbjct: 413 PSMLIGLLVRDEGDWEDLKSRVKE 436


>gi|145245643|ref|XP_001395089.1| cysteine protease atg4 [Aspergillus niger CBS 513.88]
 gi|166990612|sp|A2QY50.1|ATG4_ASPNC RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|134079795|emb|CAK40930.1| unnamed protein product [Aspergillus niger]
          Length = 404

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 122/397 (30%), Positives = 181/397 (45%), Gaps = 72/397 (18%)

Query: 92  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 139
           +RI + +  P         S IW LG+ +   +D    +    N   E            
Sbjct: 11  KRIVQYLWDPEPRNDEDPNSSIWCLGIEYHPDKDANTRETPDKNNTRENVMGTTNYRKPS 70

Query: 140 -------FNQDFSSRILISYRKGFDPI----GDSK-------------------ITSDVG 169
                  F  DF SRI ++YR  F PI    GD K                    TSD G
Sbjct: 71  EHAWPESFLLDFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTG 130

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+RS Q L+A AL    LGR WR+  +  F+ E  ++L LF D+ T+PFS+H  ++ 
Sbjct: 131 WGCMIRSGQSLLANALSMLVLGRDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKH 187

Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G ++ G   G W GP A  +  EAL+          C +  + +YV +   +  +     
Sbjct: 188 GAESCGKYPGEWFGPSATAKCIEALSS--------QCGNPTLKVYVSNDTSEVYQDK--- 236

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
               D +R+ S        + P L+L+   LG++ + P Y   L+    FPQS+GI GG+
Sbjct: 237 --FMDIARNTS------GAFQPTLILLGTRLGIDNITPVYWDGLKAALQFPQSVGIAGGR 288

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           P AS Y VG Q     YLDPH  +P +     G+   + +  TYH+  +R IH+  +DPS
Sbjct: 289 PSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVDTYHTRRLRRIHVRDMDPS 348

Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
           + IGF  R+++D+ D+  R     E   G P+  V +
Sbjct: 349 MLIGFLIRNQEDWADWLKR----IEAVKGRPIIHVLK 381


>gi|119591686|gb|EAW71280.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
          Length = 354

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 107/319 (33%), Positives = 155/319 (48%), Gaps = 40/319 (12%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
            TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFH 300

Query: 390 HSDVIRHIHLDSIDPSLAI 408
                  + +  +DPS+A+
Sbjct: 301 CQHPPCRMSIAELDPSIAV 319


>gi|348529755|ref|XP_003452378.1| PREDICTED: cysteine protease ATG4C-like [Oreochromis niloticus]
          Length = 478

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 120/416 (28%), Positives = 179/416 (43%), Gaps = 84/416 (20%)

Query: 108 SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLA-----EFNQDFSSRILISYRKGFDPIGD 161
           S  S + LLG C H  A+DE     A    L       F +DF+SR+ ++YR+ F P+  
Sbjct: 36  SRNSPVLLLGKCYHFKAEDEESPTEASVEDLVMGDVDAFRRDFASRVWLTYREEFSPLPG 95

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE------------- 205
           S +TSD GWGCMLR+ QM++AQ L+ H LGR   W + L  +P D E             
Sbjct: 96  STLTSDCGWGCMLRAGQMMLAQGLMLHFLGRDWTWSEALTLQPLDTETWTTTAAKRLVAS 155

Query: 206 ---------------------------------------YVEILHLFGDSETSPFSIHNL 226
                                                  +  ++  FGDS ++P  +H L
Sbjct: 156 LEASLQGVPGPSVRSSSPQAQALSLGSAEEADAHLKEMYHRTLVSWFGDSPSTPLGLHRL 215

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA--IYVVSGD------ 278
           ++ G   G  AG W GP  +    +     +  + GL C +  ++    V S D      
Sbjct: 216 VRLGLTMGKQAGDWYGPAVVAHILKKAVE-EAMDPGLACITAYVSQDCTVYSADVVDCHR 274

Query: 279 ------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
                    E   AP +  +D   H S   + +A    +++LVP+ LG EK NP Y    
Sbjct: 275 APRAERTSDETPDAPTLPQNDQPAHASTLPESRA----VIILVPVRLGGEKTNPEYFDFA 330

Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSD 392
           +   +    +GI+GGKP  + Y VG Q++S IY+DPH  Q  +++   D      +YH  
Sbjct: 331 KSILSLEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSYHCP 388

Query: 393 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKK 446
             + +    +DPS  +GFY R   D++      SKL + S     P FT  Q H +
Sbjct: 389 SPKKMPFSKMDPSCTVGFYSRSVQDYERISQELSKLLQPSAKEKYPAFTFVQGHGR 444


>gi|320166566|gb|EFW43465.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 336

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/295 (32%), Positives = 149/295 (50%), Gaps = 25/295 (8%)

Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
           ++YR  F  I DS   +D GWGCMLR  QML+A+A+    LG+ W    +K   +E    
Sbjct: 36  MTYRNHFAQIADSYYNTDAGWGCMLRCGQMLLARAMTVQHLGKNWAPTSRKQRHQEMARF 95

Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
           L LF D+  +PFSIH + + G+A G   G W GP  + +  + L   QR+   + C    
Sbjct: 96  LPLFFDTPAAPFSIHRIAERGEALGKTIGQWFGPNTVAQVLKNLVNSQRSSLIVHCA--- 152

Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRY 328
               V++  E   +  A    + D  +H             +L+LVP+ LGL + +NP Y
Sbjct: 153 -MDGVLNRTEASTQLAA---ALSDGKKHS------------LLVLVPIRLGLNQSINPVY 196

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ-PVINIGKDDLEADTS 387
           IP L+ T   PQ LGI+GGKP A+ + VG   E+ +YLDPH VQ   + +  D +E    
Sbjct: 197 IPALKATLELPQCLGIIGGKPNAAHFFVGTVNENVLYLDPHVVQDAAMELTPDTVE---- 252

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
           ++   V+  + +  +DPS+   + C    + +D   R+ ++  +  G  LF V +
Sbjct: 253 SFSVAVLSKMAISDVDPSMCAAYLCSSVAELEDLGKRSKQITSQFRGYGLFDVIE 307


>gi|242814606|ref|XP_002486401.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218714740|gb|EED14163.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 454

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 110/337 (32%), Positives = 155/337 (45%), Gaps = 51/337 (15%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 175
           F  DF  RI ++YR GF PI  S+                         TSD GWGCM+R
Sbjct: 117 FLDDFECRIWMTYRSGFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 176

Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 234
           S Q L+A AL   RLGR WR+        E   +L LF D   +PFSIH  ++ G  Y G
Sbjct: 177 SGQSLLANALAISRLGRDWRRGSNS---TEENRLLSLFADDPAAPFSIHKFVRHGALYCG 233

Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
              G W GP A     +AL+  +  + G       M +YV S +          V  + +
Sbjct: 234 KHPGEWFGPSATATCIQALSD-EYKDAG-------MNVYVSSDNTYVYEDKFKAVAYNQS 285

Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
            R             P L+L+   LG++++ P Y   L      PQ+LGI GG+P AS Y
Sbjct: 286 DRM-----------RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQALGIAGGRPSASHY 334

Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
            +GVQ     YLDPH  +P +     DL   + +  + H+  +R IH+D +DPS+ +GF 
Sbjct: 335 FIGVQNSFFFYLDPHHTRPALPYKTGDLAYTQEEIDSCHTRRLRRIHIDDMDPSMLVGFL 394

Query: 412 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV 448
            RD++D+ D+  R +    E NG  +  +  T   P 
Sbjct: 395 IRDENDWMDWKRRITSSRPE-NGKAIIHIVDTKNVPT 430


>gi|443730776|gb|ELU16134.1| hypothetical protein CAPTEDRAFT_228011 [Capitella teleta]
          Length = 450

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 121/370 (32%), Positives = 171/370 (46%), Gaps = 50/370 (13%)

Query: 111 SDIWLLGVCHKIAQDEALGDAAGNNG------LAEFNQDFSSRILISYRKGFDPIGDSKI 164
           S I LLG C+  ++ E        N          F +DFSS+I  +YRK F  +  S +
Sbjct: 82  SPIILLGKCYCCSKSEKEDQRRQPNNSNILTTFDRFKRDFSSKIWFTYRKDFPKLYGSPL 141

Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGD--SETS 219
           TSDVGWGCMLR++QM++AQAL+ H LGR W     +   +E   + +I+ LFGD     S
Sbjct: 142 TSDVGWGCMLRTAQMIIAQALVMHYLGRDWTIHHTQQNRKETMLHRQIIRLFGDFPGNDS 201

Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
           PFSI  L++ G  +G   G W GP ++                          YVV    
Sbjct: 202 PFSIQALVRIGVDHGKRPGDWYGPASVA-------------------------YVVRDAI 236

Query: 280 DGERGGAPV---VCIDDASRHCSVFSKGQAD-----WTPILLLVPLVLGLEKVNPRYIPT 331
           +      P+   VC+  A   C+V+ +   D     W  +++LVP+ LG E +NP Y   
Sbjct: 237 NQVPDFHPLLSQVCVYVAP-DCTVYIQDVIDLCTQHWKAVVILVPVRLGGEALNPIYSQC 295

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
           ++        LGI+GG+P  S Y VG QEE  +YLDPH  Q  ++    D    TSTYH 
Sbjct: 296 VQSLLAHELCLGIIGGRPKHSLYFVGWQEEKLLYLDPHFCQDTVDTRFRDFP--TSTYHC 353

Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA---EESNGAPLFTVTQTHKKPV 448
              R + L  +DPS  +GFY      F+       KL    ++    PLF         +
Sbjct: 354 LSPRKLALQKMDPSCTLGFYIPTHAAFNRLVKDMQKLVTPPKDQGIYPLFVFQDGRSIDI 413

Query: 449 NHSDVLGETG 458
            HS +  E+ 
Sbjct: 414 EHSHIKPESN 423


>gi|390365223|ref|XP_785967.3| PREDICTED: cysteine protease ATG4B-like [Strongylocentrotus
           purpuratus]
          Length = 390

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 105/332 (31%), Positives = 159/332 (47%), Gaps = 50/332 (15%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           IW+LG  + ++Q +            E   D  SR+  +YRKGF  IG +  T+D GWGC
Sbjct: 48  IWILGKKYDLSQHQL-----------EARLDVLSRLWFTYRKGFSNIGGTGPTTDQGWGC 96

Query: 173 MLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
           MLR  QM++AQAL++  LGR WR +P ++  D  Y++IL LF D + S FSIH + Q G 
Sbjct: 97  MLRCGQMMLAQALVYKHLGRDWRWRPQEQ--DETYLKILQLFLDKKDSCFSIHQIAQMGV 154

Query: 232 AYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
             G   G W GP  + +         SW  LA     +  +  + +     V S  E+  
Sbjct: 155 GEGKKVGDWFGPNTVGQVIRKLSPFDSWSDLAVHVALDNTVVIEDIRKLCTVNSTTEETS 214

Query: 283 RGGAPV--------------------------VCIDDASRHCSVFSKGQADWTPILLLVP 316
             G+                            + + +     +  S G   W  + L++P
Sbjct: 215 SEGSKTGSERRKRTSSSENIRHKMQLSPENTNIQLPNGLMEGACVSPGGVSWRSLFLIIP 274

Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
           L LGL ++N  Y+  L+  FT PQSLG++GGKP  + Y +GV  +  +YLDPH  QP  +
Sbjct: 275 LRLGLNEINTVYMQRLKRCFTLPQSLGVIGGKPNHAHYFIGVLGDEMVYLDPHTTQPAAD 334

Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           I K     D S +H +    + + ++DPS+ +
Sbjct: 335 IDKWAFLQDES-FHCEHASRMPIKNLDPSIGL 365


>gi|212545090|ref|XP_002152699.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           marneffei ATCC 18224]
 gi|210065668|gb|EEA19762.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           marneffei ATCC 18224]
          Length = 489

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 112/355 (31%), Positives = 161/355 (45%), Gaps = 53/355 (14%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK------------------------ITSDVGWGCMLR 175
           F  DF S+I ++YR  F PI  S+                         TSD GWGCM+R
Sbjct: 153 FLDDFESKIWMTYRSNFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIR 212

Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 234
           S QML+A AL   RLGR WR+        E  ++L LF D   +PFSIH  ++ G  Y G
Sbjct: 213 SGQMLLANALAISRLGRDWRRVSHT---TEENKLLSLFADDPAAPFSIHRFVRHGALYCG 269

Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
              G W GP A     +AL+   +           M +YV S               +D 
Sbjct: 270 KHPGEWFGPSATATCIQALSEEYKVAG--------MNVYVSSDS---------TYVYEDK 312

Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
            +  +    G     P L+L+   LG++++ P Y   L      PQSLGI GG+P +S Y
Sbjct: 313 FKAVAYNQPGHM--RPTLILLGTRLGIDRITPVYRKGLEDLLKLPQSLGIAGGRPSSSHY 370

Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
            +GVQ     YLDPH  +P +    D    +    + H+  +R IH+D +DPS+ +GF  
Sbjct: 371 FIGVQNSFFFYLDPHHTRPALPHKVDSAYTQEQVDSCHTRRLRRIHIDDMDPSMLVGFLI 430

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP---VNHSDVLGETGGVPEDD 464
           RD++D+ D+  R +  + E NG  +  +  T   P   +     L E   + +DD
Sbjct: 431 RDENDWIDWKRRIAS-SREGNGKAIIHIIDTESVPTPTMEREAALDEVEALDDDD 484


>gi|312073335|ref|XP_003139474.1| hypothetical protein LOAG_03889 [Loa loa]
 gi|307765357|gb|EFO24591.1| hypothetical protein LOAG_03889 [Loa loa]
          Length = 458

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 123/422 (29%), Positives = 183/422 (43%), Gaps = 77/422 (18%)

Query: 93  RIHE---RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSR-- 147
           R+HE   R+    +  +S        L     +A++ AL D+  N  +    + F+SR  
Sbjct: 11  RVHEEAKRLFADWKPAVSKMLETYLTLDPSFSVAENYALFDS--NLPIYLLGEKFTSRRD 68

Query: 148 -----------ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
                      +  +YRK F PIG    T+D GWGCMLR  QML+A+ L+   LGR W  
Sbjct: 69  MERIKDIMASLLWFTYRKNFQPIGGIGPTTDQGWGCMLRCGQMLLARVLIVRHLGRNWL- 127

Query: 197 PLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR--- 248
                +DR     EY  IL +F D + S FSIH +   G + G   G W GP    +   
Sbjct: 128 -----WDRDIKLAEYKRILRMFQDKKNSLFSIHQIAHMGVSEGKNIGEWFGPNTTAQVLK 182

Query: 249 ------SWEALARCQRAETGLGCQSL-PMAI----YVVSG----------DEDGERGGAP 287
                  W  LA     +  L    +  MA     Y  SG          D       A 
Sbjct: 183 KLVIYDQWSRLAVHVALDNVLITSDIRTMAFTRPPYRKSGSRRETGSDYNDNHDAVNPAE 242

Query: 288 VVCIDDASRH--------CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
                +++R          S +     +W P+L+++PL LGL  +N  Y P ++  F  P
Sbjct: 243 AEIFPESTRSPTRSETSSISSYGGNSEEWRPLLIIIPLRLGLSTINRCYFPAIQAFFQLP 302

Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG---------------KDDLEA 384
           Q +GI+GG+P  + Y  G+ + + +YLDPH  Q  +++                K+D E 
Sbjct: 303 QCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQDFVDLDETTATRDERDGYVEIKND-EF 361

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
             STYH   I    +D +DPSLA+GF C  +DD+++   R       ++  PLF + +T 
Sbjct: 362 RDSTYHCPFILTTKIDKVDPSLALGFLCHTEDDYNELAQRLRTHLLPASTPPLFEMLETR 421

Query: 445 KK 446
            K
Sbjct: 422 PK 423


>gi|425778592|gb|EKV16710.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
           digitatum PHI26]
          Length = 401

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 123/421 (29%), Positives = 186/421 (44%), Gaps = 75/421 (17%)

Query: 92  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE------------ 139
           +RI +    P  T    + S IW LG   + A  +   D A NN  +             
Sbjct: 9   KRIVQYFWDPEPTNNVPAAS-IWCLG--KEYAPPQPFSDPATNNPHSSSGQPDASTLNDT 65

Query: 140 -----FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWG 171
                F  DF SRI I+YR  F PI  +K                        TSD GWG
Sbjct: 66  AWPNAFVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGFTSDTGWG 125

Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 230
           CM+RS Q L+A A     LGR WR+  +   + E  +++ +F D   +PFSIH  +  G 
Sbjct: 126 CMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIHKFVNRGA 182

Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
           ++ G   G W GP A  +  + L+    A          + +YV +   D          
Sbjct: 183 ESCGKYPGEWFGPSATAKCIQLLSTQSEAHR--------LRVYVTNDTSD---------V 225

Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
            +D   H S    G     P L+L+   LG+E V P Y   LR   T+PQS+GI GG+P 
Sbjct: 226 YEDKFAHVSHDRSGCIQ--PTLILIGTRLGIENVTPAYWDGLRAALTYPQSVGIAGGRPS 283

Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAI 408
           AS Y +G Q+    +LDPH  +P      D+L  + +  +Y++  +R IH+  +DPS+ I
Sbjct: 284 ASHYFLGAQDCHLFFLDPHTTRPATPYRPDELYTQEELDSYYTSRLRRIHIKDMDPSMLI 343

Query: 409 GFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVN---HSDVLGETGGVPEDDS 465
           GF  +D++D+ D+     K  + + G P+  +     +P N    ++ L E   + + D 
Sbjct: 344 GFLIKDEEDWADW----KKRVQSTPGQPIVHMLPCQHQPDNGQGRAEALDEVEALDDSDE 399

Query: 466 L 466
           +
Sbjct: 400 I 400


>gi|166990662|sp|A7F045.2|ATG4_SCLS1 RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
          Length = 439

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 104/309 (33%), Positives = 150/309 (48%), Gaps = 49/309 (15%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
           F  DF ++I ++YR  F  I  S+                        TSD GWGCM+RS
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRS 162

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
            Q L+A ALL  R+GR WR+      +R+   IL LF D   +P+SIH  ++ G  A G 
Sbjct: 163 GQSLLANALLTLRMGREWRRGSSSNEERK---ILSLFADDPRAPYSIHKFVEHGASACGK 219

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP        A ARC +A T    +S  + +Y+     D           +D  
Sbjct: 220 HPGEWFGP-------SAAARCIQALTNSQVES-ELRVYITGDGSD---------VYEDT- 261

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
              S+       +TP L+LV   LGL+K+ P Y   L+ +   PQS+GI GG+P +S Y 
Sbjct: 262 -FMSIAKPNSTKFTPTLILVGTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYF 320

Query: 356 VGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           +GVQE    YLDPH  +P +      +D    D  + H+  +R +H+  +DPS+ I F  
Sbjct: 321 IGVQESDFFYLDPHQTRPALPFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDPSMLIAFLI 380

Query: 413 RDKDDFDDF 421
           RD++D+ D+
Sbjct: 381 RDENDWKDW 389


>gi|14042153|dbj|BAB55127.1| unnamed protein product [Homo sapiens]
          Length = 331

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 158/319 (49%), Gaps = 27/319 (8%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           V+C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VLCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 292

Query: 456 ETGGVPEDDSLGVMSMNDA 474
            + G  E   + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309


>gi|403291503|ref|XP_003936827.1| PREDICTED: cysteine protease ATG4B [Saimiri boliviensis
           boliviensis]
          Length = 319

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 99/300 (33%), Positives = 149/300 (49%), Gaps = 25/300 (8%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           V C        DA+RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADANRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDSCFIPDESFHCQHPPC 232

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292


>gi|345564445|gb|EGX47408.1| hypothetical protein AOL_s00083g501 [Arthrobotrys oligospora ATCC
           24927]
          Length = 444

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 110/301 (36%), Positives = 160/301 (53%), Gaps = 45/301 (14%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-------------------ITSDVGWGCMLRSSQML 180
           F  DF ++  ++YR  F PI  S                     TSD GWGCM+RS Q +
Sbjct: 111 FLDDFDAKFWMTYRSAFPPIPLSTTSRNMTLATRIRSLADQEGFTSDTGWGCMIRSGQCV 170

Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGS 239
           +A A+   +LGR WR+  + P  +E   IL LF D   +PFS+HN ++ G+A  G+  G 
Sbjct: 171 LANAISLLKLGRDWRRG-KSP--QEEQHILSLFADDPRAPFSLHNFVKYGEASCGVYPGE 227

Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCS 299
           W GP A  R  +ALA    A+   G Q     +Y+ +GD     GG      +DA R  +
Sbjct: 228 WFGPSATARCIQALA----AQHDEGLQ-----VYI-TGD-----GGD---VYEDAFRKIA 269

Query: 300 VFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
           +   G   + P L+LV + LG+E+V P Y   L+ +   PQS+GI GG+P AS Y +GVQ
Sbjct: 270 ISDDGV--FHPTLVLVGIRLGIERVTPVYWEALKSSLMMPQSVGIAGGRPSASHYFIGVQ 327

Query: 360 EESAIYLDPHDVQPVINIGKD-DLEADTSTY-HSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
            +S  YLDPH+ +P++   KD D  A+   + H+  +R +HL  +DPS+ + F  RD  D
Sbjct: 328 GQSLFYLDPHNTRPLLPYRKDSDYTAEEIEFCHTRKLRRLHLREMDPSMLLAFLIRDDRD 387

Query: 418 F 418
           +
Sbjct: 388 W 388


>gi|298231125|ref|NP_001177213.1| cysteine protease ATG4C [Sus scrofa]
 gi|296874486|gb|ADH81748.1| autophagy related 4-like protein C [Sus scrofa]
          Length = 458

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 116/409 (28%), Positives = 174/409 (42%), Gaps = 80/409 (19%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKLLPARSGCTIKDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
             +  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQLEGSALTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPDALNIENSDSESWTSNTAK 155

Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
                          L+ P             DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGRYSDDREKQNEIYHRKIISWFGDSPLTLFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   + C+  +    D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCASMAPDNTDDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +  + +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKSSSKEKYPLFTFVNAHSRDYDFT 429


>gi|296206033|ref|XP_002750034.1| PREDICTED: cysteine protease ATG4B isoform 2 [Callithrix jacchus]
          Length = 319

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 99/300 (33%), Positives = 148/300 (49%), Gaps = 25/300 (8%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           V C        DA RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADADRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDSCFIPDESFHCQHPPC 232

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292


>gi|355757609|gb|EHH61134.1| Cysteine protease ATG4A, partial [Macaca fascicularis]
          Length = 396

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 106/351 (30%), Positives = 168/351 (47%), Gaps = 43/351 (12%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 135

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 136 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 190

Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
            D  G+R    +   + +   S HC         W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 191 ADTAGDRPLDYLTASNQSKGTSAHCPA-------WKPLLLIVPLRLGINQINPVYVDAFK 243

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 244 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 303

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
            + +++ ++DPS+A+    R     D  C  + +   + N   +F + Q H
Sbjct: 304 PQRMNILNLDPSVALVGIRRLSGPGDTMCTVSPQEILKEN-LRMFELVQKH 353


>gi|209969827|ref|NP_001123274.2| autophagy-specific gene 4 [Nasonia vitripennis]
          Length = 405

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 122/364 (33%), Positives = 177/364 (48%), Gaps = 31/364 (8%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD--SK 163
           I  + + +W+LG  +   +D           +    +D  SR+  +YRKGF PIG   S 
Sbjct: 46  IPQTENSVWVLGKKYNAKKD-----------IDAIRRDIRSRLWFTYRKGFVPIGGFGST 94

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE--YVEILHLFGDSETSPF 221
            TSD GWGCMLR  QM++ QAL+   LGR WR     P  R   Y+ IL  F D   +P+
Sbjct: 95  FTSDKGWGCMLRCGQMVLGQALISLHLGRDWR---WTPETRSSTYLNILRRFEDRRAAPY 151

Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
           SIH +   G + G   G W GP  + +  + L       +     +L   + V    +  
Sbjct: 152 SIHQIALMGASEGKDVGQWFGPNTIAQVLKKLVVYDDWSSITIHVALDNTLVVNDVVQQC 211

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
              GA    +D          K  + W P+LLL+PL LGL ++NP YI  L+ +F FPQS
Sbjct: 212 RVEGATTAEVDGEKPL-----KAPSQWKPLLLLIPLRLGLNEINPIYINGLKTSFQFPQS 266

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV--INIGKDDLEADT-STYHSDVIRHIH 398
           LG++GGKP  + Y +G   +  I+LDPH  Q    ++   DD EA+  +TYH  +   I 
Sbjct: 267 LGLIGGKPSHALYFIGYVGDEVIFLDPHTTQRAGSVDQKSDDNEAEVDATYHCKIASRIP 326

Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS---DVLG 455
           +  +DPS+A+ F+C  + DF   C         +   PLF + Q  ++P + S   DV  
Sbjct: 327 ITGMDPSVALCFFCATEKDFMSLCRLMQDELIGNEKQPLFELCQ--ERPASWSPAEDVAA 384

Query: 456 ETGG 459
           E  G
Sbjct: 385 EALG 388


>gi|239614382|gb|EEQ91369.1| cysteine protease atg4 [Ajellomyces dermatitidis ER-3]
 gi|327351393|gb|EGE80250.1| cysteine protease atg4 [Ajellomyces dermatitidis ATCC 18188]
          Length = 494

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 109/342 (31%), Positives = 162/342 (47%), Gaps = 54/342 (15%)

Query: 138 AEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCML 174
           A F  DF S+I ++YR  F       DP   S +T                +D GWGCM+
Sbjct: 117 AAFLDDFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMI 176

Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
           RS Q L+A AL    LGR WR+  +    +E   +L LF D   +PFSIH  ++ G  A 
Sbjct: 177 RSGQSLLANALAILFLGREWRRGTKV---KEESNLLSLFADDPRAPFSIHRFVEHGASAC 233

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
           G   G W GP A  R  +AL+          C+   + +YV S   D           +D
Sbjct: 234 GKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD---------VYED 276

Query: 294 ASRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
             R  ++ S G      D  P L+L+ + LG+++V P Y   L+    +PQ++GI GG+P
Sbjct: 277 RFR--AIASGGGTGTSTDIRPTLILLGIRLGIDRVTPVYWEALKAVLKYPQAVGIAGGRP 334

Query: 350 GASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
            +S Y +G Q     YLDPH  +P     + + +   + + +TYH+  +R +H+  +DPS
Sbjct: 335 SSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHTRRLRRLHIKDMDPS 394

Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
           + IGF  RD+DD+D++       A   NG  +  V      P
Sbjct: 395 MLIGFLIRDEDDWDNWKRNVRGGAVTGNGKAIIHVFDKETSP 436


>gi|332232054|ref|XP_003265216.1| PREDICTED: cysteine protease ATG4C isoform 1 [Nomascus leucogenys]
 gi|332232056|ref|XP_003265217.1| PREDICTED: cysteine protease ATG4C isoform 2 [Nomascus leucogenys]
          Length = 458

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 177/421 (42%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIADHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
                        + L+ P             D E      + +I+  FGDS  +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLAPFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|149507363|ref|XP_001514370.1| PREDICTED: cysteine protease ATG4C [Ornithorhynchus anatinus]
          Length = 459

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 114/404 (28%), Positives = 173/404 (42%), Gaps = 80/404 (19%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNN-----------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +E  G    +N            + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKSEEDDGIPVRSNWAPEDPAVISGNVDEFRKDFVSRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
            P+G S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PPMGASGLTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWTWPAALDMENSDSESWTSHTVK 155

Query: 198 -LQKPFDREYV--------------------------------EILHLFGDSETSPFSIH 224
            L   F+  +V                                +I+  FGDS  + F +H
Sbjct: 156 KLTASFEASWVGERDPRPPSASRNAPRGSGSVRDEMRNEGFHRKIISWFGDSPRTYFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L + GK  G  AG W GP  +           R     G     + +YV          
Sbjct: 216 QLTEYGKKSGKTAGDWYGPAVVAHILRKAVEEVRHPDLQG-----LTVYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +    G+ D   +L+LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVTDKLRASTDSGKTDDKAVLILVPVRLGGERTNIDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
           S  +GFYCR+  DF+      +K+ + S+    PLFT  + H +
Sbjct: 381 SCTVGFYCRNVQDFERASEEITKVLKASSKEKYPLFTFVKGHSR 424


>gi|261195783|ref|XP_002624295.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
 gi|239587428|gb|EEQ70071.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
          Length = 494

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 109/342 (31%), Positives = 162/342 (47%), Gaps = 54/342 (15%)

Query: 138 AEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCML 174
           A F  DF S+I ++YR  F       DP   S +T                +D GWGCM+
Sbjct: 117 AAFLDDFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMI 176

Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
           RS Q L+A AL    LGR WR+  +    +E   +L LF D   +PFSIH  ++ G  A 
Sbjct: 177 RSGQSLLANALAILFLGREWRRGTKV---KEESNLLSLFADDPRAPFSIHRFVEHGASAC 233

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
           G   G W GP A  R  +AL+          C+   + +YV S   D           +D
Sbjct: 234 GKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD---------VYED 276

Query: 294 ASRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
             R  ++ S G      D  P L+L+ + LG+++V P Y   L+    +PQ++GI GG+P
Sbjct: 277 RFR--AIASGGGTGTSTDIRPTLILLGIRLGIDRVTPVYWEALKAVLKYPQAVGIAGGRP 334

Query: 350 GASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
            +S Y +G Q     YLDPH  +P     + + +   + + +TYH+  +R +H+  +DPS
Sbjct: 335 SSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHTRRLRRLHIKDMDPS 394

Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
           + IGF  RD+DD+D++       A   NG  +  V      P
Sbjct: 395 MLIGFLIRDEDDWDNWKRNLRGGAVTGNGKAIIHVFDKETSP 436


>gi|317151014|ref|XP_001824388.2| cysteine protease atg4 [Aspergillus oryzae RIB40]
          Length = 402

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 120/382 (31%), Positives = 169/382 (44%), Gaps = 66/382 (17%)

Query: 92  RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 134
           +RI + +  P         + IW LGV +     KI       QDE       + D   +
Sbjct: 11  KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 70

Query: 135 NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 171
                F  DF S+I ++YR  F PI                            TSD GWG
Sbjct: 71  GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 130

Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 230
           CM+RS Q L+A A+L   LGR WR+  +     E   +L LF D   +P SIH  ++ G 
Sbjct: 131 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 187

Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
           ++ G   G W GP A  R  EAL+          C ++   +YV +   D        V 
Sbjct: 188 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 231

Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
            D   R   V   G     P L+L+   LG++ V P Y   L+     PQS+GI GG+P 
Sbjct: 232 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 288

Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLA 407
           AS Y +G Q     YLDPH  +P +    D     + + STYH+  +R IH+  +DPS+ 
Sbjct: 289 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDMDPSML 348

Query: 408 IGFYCRDKDDFDDFCARASKLA 429
           IGF  R++DD++D+  R   + 
Sbjct: 349 IGFLVRNEDDWEDWKGRVGSVV 370


>gi|295657177|ref|XP_002789160.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226284504|gb|EEH40070.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 601

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 111/345 (32%), Positives = 165/345 (47%), Gaps = 51/345 (14%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVG 169
           G++  A F  DF S+I ++YR GF       DP   S +T                +D G
Sbjct: 229 GHDWPAPFLDDFESKIWLTYRSGFPFIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 288

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+RS Q L+A AL    LGR WR+  +   D+E   +L LF D   +PFSIH  ++ 
Sbjct: 289 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 345

Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G  A G   G W GP A  R  +AL+          C+   + +YV S   D        
Sbjct: 346 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 389

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +D  R  +     +A   P L+L+ + LG+++V P Y   L+    +PQS+GI GG+
Sbjct: 390 -VYEDRFRTIASGGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 448

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           P +S Y +G Q     YLDPH  +P +     G+   E + ++YH+  +R +H+  +DPS
Sbjct: 449 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 508

Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNH 450
           + IGF  +D+DD+ D+      +A    G  +  V      P  H
Sbjct: 509 MLIGFLIKDEDDWADWKRNVGSVA----GKAIVHVFDKENSPFGH 549


>gi|198417051|ref|XP_002128504.1| PREDICTED: similar to autophagy-related cysteine endopeptidase 2
           [Ciona intestinalis]
          Length = 422

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 168/359 (46%), Gaps = 58/359 (16%)

Query: 112 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWG 171
           +IW+LG    +  + AL           F +   S +  +YRKG+ PIG +  TSD GWG
Sbjct: 39  NIWVLGSRFHLPHERAL-----------FLEHIKSFLWFTYRKGYTPIGGTGPTSDSGWG 87

Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
           CMLR  QML+A+AL    + + W+    KP    Y  ILH   D  +S +SIH + Q G 
Sbjct: 88  CMLRCGQMLLARALAELTMDKDWKWTEDKPQPPPYKRILHQLSDERSSCYSIHQIAQMGV 147

Query: 232 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
             G   G W GP  + +    L++  +           +AI+V   +          VCI
Sbjct: 148 EEGKEVGQWFGPNTISQVLRRLSQFDQENV--------LAIHVAMDN---------TVCI 190

Query: 292 DDASRHCSVFSKGQAD----------------------------WTPILLLVPLVLGLEK 323
           +D  R CS     Q +                            W P+LLL+PL LGL +
Sbjct: 191 EDIERLCSTTPTTQYEGACSSTCKPDRTKCNGDSPNVSPTSDDFWRPLLLLIPLRLGLSE 250

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG--KDD 381
           +NP Y   L+    + +S+G++GGKP  + Y +G  E+S I+LDPH  QP + +     +
Sbjct: 251 INPVYFTHLKECLHWKESVGVIGGKPNHAYYFLGCSEDSMIFLDPHTTQPYVKLPDITSN 310

Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
              D +T+H D    + L ++DPSLA+GF C  +  F D C +  ++ +     PLF V
Sbjct: 311 ERYDDTTFHCDTPGRMLLTNLDPSLALGFICTTRGSFCDLCHKVKQMVKTPTSFPLFEV 369


>gi|348586836|ref|XP_003479174.1| PREDICTED: cysteine protease ATG4C-like [Cavia porcellus]
          Length = 435

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 125/437 (28%), Positives = 190/437 (43%), Gaps = 82/437 (18%)

Query: 94  IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDE-----------ALGDAAGNNGLAEFNQ 142
           +H R +  ++T  S + S + LLG C+    ++           A+ D      + EF +
Sbjct: 1   MHTRWVLKTKTYFSRN-SPVLLLGKCYHFKYEDEHKMLTARSGCAIEDRVIAGNVDEFRK 59

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP--------- 193
           DF SRI ++YR+ F PI  S +++D GWGC LR+ QML+AQ L+ H LGR          
Sbjct: 60  DFISRIWLTYREEFPPIEGSALSTDCGWGCTLRTGQMLLAQGLVLHFLGRAWIWPDALNI 119

Query: 194 -------WRKPLQKPFD--------------------REYVE----------------IL 210
                  W     K F                     +E +E                I+
Sbjct: 120 ENLDSESWTSHTVKKFAASFEASLSGERQLGTPALSLKETMEKYPNPHEVRDEVYHRKII 179

Query: 211 HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
             FGDS ++ F +H L++ G+  G  AG W GP  +           R     G     +
Sbjct: 180 SWFGDSPSALFGLHQLIECGRRSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----I 234

Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
            +YV    +D     + V+    ASR       G AD   +++LVP+ LG E+ N  Y+ 
Sbjct: 235 TVYVA---QDCTVYNSDVIDKQSASR-----PAGNADDKAVIILVPVRLGGERTNTDYLE 286

Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
            ++   +    +GI+GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H
Sbjct: 287 FVKGVLSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFH 344

Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPV 448
               + +    +DPS  IGFYCR+  DF       +K+ + S+    PLFT    H K  
Sbjct: 345 CPSPKKMSFRKMDPSCTIGFYCRNVQDFQRASEEITKMLKMSSKEKYPLFTFVHGHSKDY 404

Query: 449 NH-SDVLGETGGVPEDD 464
           +  S V  E     +DD
Sbjct: 405 DFTSTVANEEDLFSQDD 421


>gi|238506146|ref|XP_002384275.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
           NRRL3357]
 gi|220690389|gb|EED46739.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
           NRRL3357]
          Length = 439

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 120/382 (31%), Positives = 169/382 (44%), Gaps = 66/382 (17%)

Query: 92  RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA------QDE------ALGDAAGN 134
           +RI + +  P         + IW LGV +     KI       QDE       + D   +
Sbjct: 47  KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDQDELEAGTSKIDDVTAH 106

Query: 135 NGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITSDVGWG 171
                F  DF S+I ++YR  F PI                            TSD GWG
Sbjct: 107 GWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWG 166

Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG- 230
           CM+RS Q L+A A+L   LGR WR+  +     E   +L LF D   +P SIH  ++ G 
Sbjct: 167 CMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRFVKYGA 223

Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
           ++ G   G W GP A  R  EAL+          C ++   +YV +   D        V 
Sbjct: 224 ESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD--------VY 267

Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
            D   R   V   G     P L+L+   LG++ V P Y   L+     PQS+GI GG+P 
Sbjct: 268 EDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIAGGRPS 324

Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSIDPSLA 407
           AS Y +G Q     YLDPH  +P +    D     + + STYH+  +R IH+  +DPS+ 
Sbjct: 325 ASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDMDPSML 384

Query: 408 IGFYCRDKDDFDDFCARASKLA 429
           IGF  R++DD++D+  R   + 
Sbjct: 385 IGFLVRNEDDWEDWKGRVGSVV 406


>gi|226294409|gb|EEH49829.1| cysteine protease atg4 [Paracoccidioides brasiliensis Pb18]
          Length = 513

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 106/324 (32%), Positives = 158/324 (48%), Gaps = 47/324 (14%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVG 169
           G++  A F  DF S+I ++YR GF  I  S                         T+D G
Sbjct: 143 GHDWPAPFLDDFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTG 202

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+RS Q L+A AL    LGR WR+  +   D+E   +L LF D   +PFSIH  ++ 
Sbjct: 203 WGCMIRSGQSLLASALSILSLGRDWRRGTKT--DQE-SNLLSLFADDPKAPFSIHRFVEY 259

Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G  A G   G W GP A  R  +AL+          C+   + +YV S   D        
Sbjct: 260 GASACGKYPGEWFGPSATARCIQALSS--------ECKHAGLNVYVTSDGSD-------- 303

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +D  R  +     +A   P L+L+ + LG+++V P Y   L+    +PQS+GI GG+
Sbjct: 304 -VYEDRFRTIASSGATEAGIHPTLILLGIRLGIDRVTPVYWEALKDVLKYPQSVGIAGGR 362

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINI---GKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           P +S Y +G Q     YLDPH  +P +     G+   E + ++YH+  +R +H+  +DPS
Sbjct: 363 PSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDPS 422

Query: 406 LAIGFYCRDKDDFDDFCARASKLA 429
           + IGF  +D+DD+ D+      +A
Sbjct: 423 MLIGFLIKDEDDWADWKRNVGSVA 446


>gi|119591685|gb|EAW71279.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_b
           [Homo sapiens]
          Length = 331

 Score =  168 bits (425), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 157/319 (49%), Gaps = 27/319 (8%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292

Query: 456 ETGGVPEDDSLGVMSMNDA 474
            + G  E   + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309


>gi|149709514|ref|XP_001500964.1| PREDICTED: cysteine protease ATG4C [Equus caballus]
          Length = 458

 Score =  167 bits (424), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 117/422 (27%), Positives = 179/422 (42%), Gaps = 80/422 (18%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE           + D      + EF +DF+SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHIIAGNVEEFRKDFTSRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDFESWTSNTVK 155

Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
                        + L+ P             D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSEERELKTPTISLKETIGRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   + C+  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCASMASDHADDKAVIILVPVRLGGERTNTDYLDFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHSDVLGETGGVPE 462
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +  + +    +   +  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTAAKEDDLFS 440

Query: 463 DD 464
           +D
Sbjct: 441 ED 442


>gi|326470473|gb|EGD94482.1| hypothetical protein TESG_01998 [Trichophyton tonsurans CBS 112818]
          Length = 469

 Score =  167 bits (424), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 108/328 (32%), Positives = 159/328 (48%), Gaps = 62/328 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 170
           +F  DF S++ I+YR  F PI  +                              TSD GW
Sbjct: 130 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 189

Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
           GCM+RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +  G
Sbjct: 190 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 246

Query: 231 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 288
             A G   G W GP A  +  +AL +    + GL        +Y+ S G +  E+    V
Sbjct: 247 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 298

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
            C +                 P L+L+ + LG+++V P Y  +L+    FPQS+GI GG+
Sbjct: 299 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 346

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHL 399
           P +S Y +  Q +S  YLDPH  +P +    +   D E+      + STYH+  +R +H+
Sbjct: 347 PSSSHYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHI 406

Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARASK 427
             +DPS+ IGF  RD+DD++D   R  +
Sbjct: 407 REMDPSMLIGFLVRDEDDWEDLKRRVRE 434


>gi|326478657|gb|EGE02667.1| cysteine protease atg4 [Trichophyton equinum CBS 127.97]
          Length = 454

 Score =  167 bits (424), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 108/328 (32%), Positives = 159/328 (48%), Gaps = 62/328 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK----------------------------ITSDVGW 170
           +F  DF S++ I+YR  F PI  +                              TSD GW
Sbjct: 115 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 174

Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
           GCM+RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +  G
Sbjct: 175 GCMIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHG 231

Query: 231 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPV 288
             A G   G W GP A  +  +AL +    + GL        +Y+ S G +  E+    V
Sbjct: 232 ATACGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEV 283

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
            C +                 P L+L+ + LG+++V P Y  +L+    FPQS+GI GG+
Sbjct: 284 ACDESGGIQ------------PTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGGR 331

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKD---DLEA------DTSTYHSDVIRHIHL 399
           P +S Y +  Q +S  YLDPH  +P +    +   D E+      + STYH+  +R +H+
Sbjct: 332 PSSSHYFIATQGDSFFYLDPHQTRPCLTPRAESTGDEESHPYSPEELSTYHTRRLRRLHI 391

Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARASK 427
             +DPS+ IGF  RD+DD++D   R  +
Sbjct: 392 REMDPSMLIGFLVRDEDDWEDLKRRVRE 419


>gi|426215654|ref|XP_004002085.1| PREDICTED: cysteine protease ATG4C isoform 1 [Ovis aries]
 gi|426215656|ref|XP_004002086.1| PREDICTED: cysteine protease ATG4C isoform 2 [Ovis aries]
          Length = 458

 Score =  167 bits (424), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 117/409 (28%), Positives = 175/409 (42%), Gaps = 80/409 (19%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +DE L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
                          L+ P             DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   + C+  +    +   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +  + +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFT 429


>gi|83773128|dbj|BAE63255.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|325504923|dbj|BAJ83603.1| cysteine protease Atg4 [Aspergillus oryzae]
          Length = 356

 Score =  167 bits (424), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 111/342 (32%), Positives = 159/342 (46%), Gaps = 32/342 (9%)

Query: 92  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 151
           +RI + +  P         + IW LGV +     +   +   +N  A      + RI   
Sbjct: 11  KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70

Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 211
                DP G    TSD GWGCM+RS Q L+A A+L   LGR WR+  +     E   +L 
Sbjct: 71  L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121

Query: 212 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
           LF D   +P SIH  ++ G ++ G   G W GP A  R  EAL+          C ++  
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173

Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
            +YV +   D        V  D   R   V   G     P L+L+   LG++ V P Y  
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222

Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 387
            L+     PQS+GI GG+P AS Y +G Q     YLDPH  +P +    D     + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
           TYH+  +R IH+  +DPS+ IGF  R++DD++D+  R   + 
Sbjct: 283 TYHTRRLRRIHIQDMDPSMLIGFLVRNEDDWEDWKGRVGSVV 324


>gi|149037474|gb|EDL91905.1| autophagy-related 4B (yeast), isoform CRA_b [Rattus norvegicus]
          Length = 319

 Score =  167 bits (424), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 96/300 (32%), Positives = 150/300 (50%), Gaps = 25/300 (8%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVG 60

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
            G + G W GP  + +  + LA      +        +A+++     V  +E  +   A 
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEISKLCRAS 112

Query: 288 VVCIDDAS------RHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           + C   A+      RHC+    G         W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 LPCAGAAALSMESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP + +       D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPC 232

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
            + +  +DPS+A+GF+C+ ++DF+D+C +  KL++     P+F + +     +   DVL 
Sbjct: 233 RMGIGELDPSIAVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 292


>gi|148698945|gb|EDL30892.1| autophagy-related 4C (yeast), isoform CRA_b [Mus musculus]
          Length = 466

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 119/409 (29%), Positives = 174/409 (42%), Gaps = 80/409 (19%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE          A+ D      + EF +DF SRI ++YR+ F
Sbjct: 44  SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 103

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 104 PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 163

Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
            F          DRE                          + +I+  FGDS  + F +H
Sbjct: 164 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 223

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 224 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 271

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 272 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 330

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 331 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 388

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H K  + +
Sbjct: 389 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 437


>gi|225543220|ref|NP_778194.3| cysteine protease ATG4C [Mus musculus]
 gi|225543224|ref|NP_001139439.1| cysteine protease ATG4C [Mus musculus]
 gi|341940254|sp|Q811C2.2|ATG4C_MOUSE RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
           cysteine endopeptidase; AltName: Full=Autophagin-3;
           AltName: Full=Autophagy-related cysteine endopeptidase
           3; AltName: Full=Autophagy-related protein 4 homolog C
          Length = 458

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 119/409 (29%), Positives = 174/409 (42%), Gaps = 80/409 (19%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE          A+ D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155

Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
            F          DRE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H K  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429


>gi|431912280|gb|ELK14417.1| Cysteine protease ATG4B [Pteropus alecto]
          Length = 431

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 124/406 (30%), Positives = 185/406 (45%), Gaps = 43/406 (10%)

Query: 91  MRRIHERVLGPSRTGISSSTSDI--WLLGVCHKIAQDEALGDAAGNNGLA--EFNQDFSS 146
           MR    R   P R+ +SS+  +   W      +++    L     +      E   D +S
Sbjct: 1   MRPGPRRSCTPRRSALSSTLGEASDWCTAAAREVSAVSGLSQLQQDESYEKDEILSDVAS 60

Query: 147 RILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY 206
           R+  +YRK F  IG +  TSD GWGCMLR  QM+ AQAL+   LGR WR   +K     Y
Sbjct: 61  RLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSY 120

Query: 207 VEILHLFGDSETSPFSIHNLL------QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
             +L  F D + S +SIH +       +  +       S +GP  +C+S+ A+   +R  
Sbjct: 121 FSVLRAFMDRKDSYYSIHQIAPVHPQSRFWRQSASVRTSVLGP-QLCQSFAAVRLSRRRR 179

Query: 261 TGLGCQSLP--MAIYVVSGDEDGERGGAPVVCIDD--ASRHCSVFSKG--------QADW 308
             L   S P  +A++               V ++D  A RHC+    G           W
Sbjct: 180 WELVTLSSPGKLAVFDTWSALAVHIAMDNTVVMEDISADRHCNGVPAGAEVTHRPPLPPW 239

Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLT-------------------FTFPQSLGIVGGKP 349
            P++LL+PL LGL  +N  Y+ TL+L                    F  PQSLG++GGKP
Sbjct: 240 RPLVLLIPLRLGLTDINEAYVGTLKLASTLVGLCSAAASLPLRQHCFMMPQSLGVIGGKP 299

Query: 350 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 409
            ++ Y +G   E  IYLDPH  QP + +       D S +       + +  +DPS+A G
Sbjct: 300 NSAHYFIGYVGEELIYLDPHTTQPAVEVADRRSIPDESFHCQHPPSRMRIGELDPSIA-G 358

Query: 410 FYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
           F+C+ +DDFDD+C +  KL+      P+F + +     +   DVL 
Sbjct: 359 FFCQTEDDFDDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 404


>gi|296208133|ref|XP_002750954.1| PREDICTED: cysteine protease ATG4C [Callithrix jacchus]
          Length = 458

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 118/421 (28%), Positives = 178/421 (42%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
                        + L+ P             D E      + +++  FGDS  +PF +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEIRNEIYHRKVISWFGDSPLAPFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLQFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|391868733|gb|EIT77943.1| cysteine protease required for autophagy - Apg4p/Aut2p [Aspergillus
           oryzae 3.042]
          Length = 357

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 111/342 (32%), Positives = 159/342 (46%), Gaps = 32/342 (9%)

Query: 92  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 151
           +RI + +  P         + IW LGV +     +   +   +N  A      + RI   
Sbjct: 11  KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPDNDEANHPMTLTVRIRTQ 70

Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 211
                DP G    TSD GWGCM+RS Q L+A A+L   LGR WR+  +     E   +L 
Sbjct: 71  L---MDPQG---FTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLS 121

Query: 212 LFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
           LF D   +P SIH  ++ G ++ G   G W GP A  R  EAL+          C ++  
Sbjct: 122 LFADHPDAPLSIHRFVKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAP 173

Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
            +YV +   D        V  D   R   V   G     P L+L+   LG++ V P Y  
Sbjct: 174 RVYVTNDTSD--------VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWD 222

Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTS 387
            L+     PQS+GI GG+P AS Y +G Q     YLDPH  +P +    D     + + S
Sbjct: 223 GLKAVLQLPQSVGIAGGRPSASHYFIGTQGPYFFYLDPHTTRPAVPYSIDGRLLSKTEIS 282

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
           TYH+  +R IH+  +DPS+ IGF  R++DD++D+  R   + 
Sbjct: 283 TYHTRRLRRIHIQDMDPSMLIGFLVRNEDDWEDWKGRVGSVV 324


>gi|27763971|emb|CAC85555.1| Apg4-C protein [Mus musculus]
 gi|148698944|gb|EDL30891.1| autophagy-related 4C (yeast), isoform CRA_a [Mus musculus]
          Length = 458

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 119/409 (29%), Positives = 174/409 (42%), Gaps = 80/409 (19%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE          A+ D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155

Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
            F          DRE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNEAYHRKIISWFGDSPVAVFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H K  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429


>gi|296489147|tpg|DAA31260.1| TPA: APG4 autophagy 4 homolog C [Bos taurus]
          Length = 458

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
                          L+ P             DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKMERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   + C+  +    +   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|301764643|ref|XP_002917740.1| PREDICTED: cysteine protease ATG4C-like [Ailuropoda melanoleuca]
 gi|281350282|gb|EFB25866.1| hypothetical protein PANDA_006093 [Ailuropoda melanoleuca]
          Length = 458

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 115/409 (28%), Positives = 171/409 (41%), Gaps = 80/409 (19%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 198 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 224
                                 QK   R Y +            I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTVSQKETIRRYSDDHEMQNEIYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEETRHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   + C+  +    D   +++L+P+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +  + +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFT 429


>gi|194384462|dbj|BAG59391.1| unnamed protein product [Homo sapiens]
          Length = 319

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 98/300 (32%), Positives = 148/300 (49%), Gaps = 25/300 (8%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDSTVVMEEIRRLCRTS 112

Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 292


>gi|325091702|gb|EGC45012.1| cysteine protease [Ajellomyces capsulatus H88]
          Length = 508

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 173/382 (45%), Gaps = 57/382 (14%)

Query: 101 PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 156
           P+R+  S++     LL    H+ +    LG     +    F  DF S+I ++YR  F   
Sbjct: 85  PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144

Query: 157 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
               DP                +     T+D GWGCM+RS Q L+A AL    LGR WR+
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRR 204

Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 255
             +    +E  ++L LF D   +PFSIH  ++ G  A G   G W GP A  R  +AL+ 
Sbjct: 205 GTKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATARCIQALSS 261

Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG------QADWT 309
                    C+   + +YV S   D           +D  R  ++ S G        D  
Sbjct: 262 --------ECEHAGLNVYVTSDGSD---------VYEDRFR--AIASAGGTGAGTSTDVH 302

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
           P L+L+ + LG+++V P Y   L+    +PQS+GI GG+P +S Y +G Q     YLDPH
Sbjct: 303 PTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFYLDPH 362

Query: 370 DVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
             +P +       +     + +TYH+  +R +H+  +DPS+ IGF  RD+DD++ +    
Sbjct: 363 HTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSWKRSV 422

Query: 426 SKLAEESNGAPLFTVTQTHKKP 447
              A    G  +  V    K P
Sbjct: 423 HNRAMIGTGKAIIHVFDKEKSP 444


>gi|440902657|gb|ELR53425.1| Cysteine protease ATG4C [Bos grunniens mutus]
          Length = 458

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
                          L+ P             DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIHHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   + C+  +    +   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|426339167|ref|XP_004033531.1| PREDICTED: cysteine protease ATG4B isoform 1 [Gorilla gorilla
           gorilla]
 gi|426339169|ref|XP_004033532.1| PREDICTED: cysteine protease ATG4B isoform 2 [Gorilla gorilla
           gorilla]
 gi|221045722|dbj|BAH14538.1| unnamed protein product [Homo sapiens]
          Length = 319

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 98/300 (32%), Positives = 148/300 (49%), Gaps = 25/300 (8%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
            + +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292


>gi|342321655|gb|EGU13587.1| Cysteine protease ATG4 [Rhodotorula glutinis ATCC 204091]
          Length = 1119

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 134/455 (29%), Positives = 185/455 (40%), Gaps = 131/455 (28%)

Query: 134  NNGLAEFNQDFSSRILISYRKGF-----DPIGDSK------------------------- 163
            N   A F  D  SRI ++YR GF     DP   S                          
Sbjct: 644  NGWPAAFYHDSYSRIALTYRSGFPIIPCDPSSSSTGVVQGMLNNLSMSIGRGGHRGPSPT 703

Query: 164  -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-----------KPFDREYV 207
                 ++SD GWGCMLR+ Q L+A AL+   LGR WR+PL             P    Y 
Sbjct: 704  NAEGGLSSDTGWGCMLRTGQSLLANALVKVHLGRDWRRPLPLGDFITSSTSPVPSAATYA 763

Query: 208  EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
             IL LF D  S  SPFS+H   Q GK  G   G W GP     + + L            
Sbjct: 764  RILSLFLDDPSPISPFSVHRFAQQGKVLGKEIGEWFGPSTAAGAIKTLVNAYE------- 816

Query: 266  QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---W-TPILLLVPLVLGL 321
               P  + VVS             C+D       V +    D   W TP+L+L+ + LG+
Sbjct: 817  ---PAGLKVVS-------------CVDGTVYESEVVAASTKDGEKWKTPVLVLINVRLGI 860

Query: 322  EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN--IGK 379
            + VNP Y   ++  F  PQS+GI GG+P +S Y VG Q  S  Y+DPH  +P +   +  
Sbjct: 861  DGVNPIYYEAIKGIFRLPQSVGIAGGRPSSSYYFVGAQANSLFYIDPHHPRPAVPLVLPP 920

Query: 380  DD-------------LEADT----------------------STYHSDVIRHIHLDSIDP 404
            DD               ADT                      +TYH+D +R   L S+DP
Sbjct: 921  DDSLVRAAQHLPLTPSTADTPAKESARQLDDFLLAAYPDAAWATYHTDKVRKCALSSLDP 980

Query: 405  SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHS---------DVLG 455
            S+ +GF   D+ D+ DF  R  +L++ S+  P+F +  +    +  S           L 
Sbjct: 981  SMLLGFLVEDERDWQDFRLRVQELSQASS--PIFAIAPSPPSWMRRSTSSAAPATVSALS 1038

Query: 456  ETGGVPEDDSLGVMSMN-----DAVGNAHEDDWQL 485
             T G   DDS   ++       D+ G +  +DW+L
Sbjct: 1039 PTIG---DDSFSEVAGEDVADADSAGFSEPEDWEL 1070


>gi|395733089|ref|XP_002813143.2| PREDICTED: cysteine protease ATG4B isoform 2 [Pongo abelii]
          Length = 331

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 156/319 (48%), Gaps = 27/319 (8%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRNS 112

Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
            + +  +DPS+A+GF+C+ +DDF D+C +  KL+      P+F + +     +   DVL 
Sbjct: 233 RMSIAELDPSIAVGFFCKTEDDFSDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 292

Query: 456 ETGGVPEDDSLGVMSMNDA 474
            + G  E   + V S+ D+
Sbjct: 293 LSLG--ESCQVQVGSLGDS 309


>gi|121704590|ref|XP_001270558.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
 gi|166990611|sp|A1CJ08.1|ATG4_ASPCL RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|119398704|gb|EAW09132.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
          Length = 400

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 106/313 (33%), Positives = 150/313 (47%), Gaps = 49/313 (15%)

Query: 139 EFNQDFSSRILISYRKGFDPIG----------------------DSK-ITSDVGWGCMLR 175
           EF  D  SRI I+YR  F PI                       DS+  TSD GWGCM+R
Sbjct: 75  EFLDDVESRIWITYRSNFTPIPKPPNQEANPAMTLTVHLRSQLMDSQGFTSDTGWGCMIR 134

Query: 176 SSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-G 234
           S Q L+A A+L   LGR WR+  +     +  ++LH F D   +PFSIH  +Q G  +  
Sbjct: 135 SGQSLLANAMLILLLGRDWRRGTEA---GKEAQLLHQFADHPEAPFSIHRFVQHGAEFCN 191

Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
              G W GP A  R  +AL     A+ G    S  + +Y+     D        +  D  
Sbjct: 192 KYPGEWFGPSATARCIQALV----AQQG----SSELRVYITDDTAD--------IYEDKF 235

Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
           +R   +      D+ P L+LV   LG++ V P Y   L+     PQS+GI GG+P AS Y
Sbjct: 236 AR---IAQAEHGDFIPTLILVGTRLGIDHVTPAYWDALKEALQLPQSVGIAGGRPSASHY 292

Query: 355 IVGVQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
            +GV  +   YLDPH  +P     ++       + +TYH+  +R IH+  +DPS+ IGF 
Sbjct: 293 FIGVHGQYLFYLDPHHTRPASLHQDVNDTLTHEEVNTYHTRRLRRIHIKDMDPSMLIGFI 352

Query: 412 CRDKDDFDDFCAR 424
            R ++D+ D+  R
Sbjct: 353 IRSREDWTDWKTR 365


>gi|37748391|gb|AAH58981.1| Autophagy-related 4C (yeast) [Mus musculus]
          Length = 458

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 118/409 (28%), Positives = 174/409 (42%), Gaps = 80/409 (19%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE          A+ D      + EF +DF SR+ ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRLWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155

Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
            F          DRE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGDSPVAVFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H K  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSKDFDFT 429


>gi|166990665|sp|Q2U5B0.2|ATG4_ASPOR RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
          Length = 407

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 120/387 (31%), Positives = 169/387 (43%), Gaps = 71/387 (18%)

Query: 92  RRIHERVLGPSRTGISSSTSDIWLLGVCH-----KIA-----------QDE------ALG 129
           +RI + +  P         + IW LGV +     KI            QDE       + 
Sbjct: 11  KRIVQYIWDPEPRNDEEPDASIWCLGVEYAPQPQKITANTTPGKLGNYQDELEAGTSKID 70

Query: 130 DAAGNNGLAEFNQDFSSRILISYRKGFDPI-----------------------GDSKITS 166
           D   +     F  DF S+I ++YR  F PI                            TS
Sbjct: 71  DVTAHGWPEAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTS 130

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCM+RS Q L+A A+L   LGR WR+  +     E   +L LF D   +P SIH  
Sbjct: 131 DTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKA---EEEARLLSLFADHPDAPLSIHRF 187

Query: 227 LQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
           ++ G ++ G   G W GP A  R  EAL+          C ++   +YV +   D     
Sbjct: 188 VKYGAESCGKHPGEWFGPSATARCIEALS--------AQCGNIAPRVYVTNDTSD----- 234

Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
              V  D   R   V   G     P L+L+   LG++ V P Y   L+     PQS+GI 
Sbjct: 235 ---VYEDSFLR---VARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQLPQSVGIA 288

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVIRHIHLDSI 402
           GG+P AS Y +G Q     YLDPH  +P +    D     + + STYH+  +R IH+  +
Sbjct: 289 GGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEISTYHTRRLRRIHIQDM 348

Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLA 429
           DPS+ IGF  R++DD++D+  R   + 
Sbjct: 349 DPSMLIGFLVRNEDDWEDWKGRVGSVV 375


>gi|417401291|gb|JAA47536.1| Putative cysteine protease required for autophagy [Desmodus
           rotundus]
          Length = 458

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 117/404 (28%), Positives = 172/404 (42%), Gaps = 80/404 (19%)

Query: 108 SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C H   +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-----------------LQ 199
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                 ++
Sbjct: 96  PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 200 K-----------------------------PFDRE------YVEILHLFGDSETSPFSIH 224
           K                             P DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGRYPDDREMQNEVYHRKIISWFGDSPVALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +    D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQRASMTSDNTDGKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +
Sbjct: 381 SCTIGFYCRNVQDFQRASEEITKMLKISSKEKYPLFTFVNGHSR 424


>gi|340709295|ref|XP_003393246.1| PREDICTED: cysteine protease ATG4B-like isoform 1 [Bombus
           terrestris]
          Length = 383

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 112/323 (34%), Positives = 163/323 (50%), Gaps = 24/323 (7%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
           I  +   +W+LG  +   ++           L    +D  S++  +YRK F PIG  +S 
Sbjct: 16  IPQTDEPVWILGKKYNAIRE-----------LDIIRRDIRSKLWFTYRKNFVPIGGYNST 64

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
            TSD GWGCMLR  QM++ QAL+   LGR W+   +   +  Y++IL  F D  T+ FSI
Sbjct: 65  FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWTAETR-NSTYLKILERFEDKRTAAFSI 123

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H +   G + G   G W GP  + +  + L       +     +L   + V    +    
Sbjct: 124 HQIASMGASEGKEVGQWFGPNTIAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
            G   V  D A     V  K  + W P+LLL+PL LGL ++NP YI  L+ +F  PQSLG
Sbjct: 184 EGGTTVEADGA-----VPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLG 238

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHL 399
           ++GGKP  + Y +G  E   IYLDPH  Q   ++GK    +++E D +TYH      I +
Sbjct: 239 VIGGKPNLALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPI 297

Query: 400 DSIDPSLAIGFYCRDKDDFDDFC 422
             IDPS+A+ F+C  + DF   C
Sbjct: 298 TGIDPSVALCFFCATEKDFKSLC 320


>gi|397475554|ref|XP_003809200.1| PREDICTED: cysteine protease ATG4C isoform 1 [Pan paniscus]
 gi|397475556|ref|XP_003809201.1| PREDICTED: cysteine protease ATG4C isoform 2 [Pan paniscus]
          Length = 458

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 120/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
            F          +RE+                           +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF       +K+ E S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLEFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|391335597|ref|XP_003742176.1| PREDICTED: cysteine protease ATG4B-like [Metaseiulus occidentalis]
          Length = 393

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 163/307 (53%), Gaps = 33/307 (10%)

Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW------R 195
           + FSS +  +YRK F  IG    TSD GWGCMLR+ QM++ QAL+   LGR W      R
Sbjct: 79  KSFSSMLWFTYRKNFAAIGGDGPTSDTGWGCMLRAGQMMLGQALIRKHLGRSWMWTSDDR 138

Query: 196 KPLQKPFDRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 254
            P     DRE Y+ IL +F D +++ FSIH +   G + G A G W GP  + ++ + L 
Sbjct: 139 LP-----DRENYLRILRMFQDKKSATFSIHQISLMGLSEGKAVGEWFGPNTVAQALKKLV 193

Query: 255 RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLL 314
           +              M ++V   +         ++ + D    C   +K    W P+LL+
Sbjct: 194 QYDHWS--------EMKLHVAMDN---------IIILSDIKSLCC--AKESNKWRPLLLV 234

Query: 315 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
           VPL LGL ++N  Y   +  +F    SLGI+GG+P  + Y +G+Q E  ++LDPH     
Sbjct: 235 VPLRLGLSEINDIYTNAVLNSFKMKHSLGIIGGRPSHALYFIGIQREELVFLDPHTTHNY 294

Query: 375 INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG 434
           +++  D+   + STYH    + + + ++DPS+A+ FY  D+D+ D +  +A +L  +++G
Sbjct: 295 VDL--DEEPYNDSTYHCQRAQRMKISNMDPSIAMCFYIGDEDELDQWRVQAKELLVDNSG 352

Query: 435 APLFTVT 441
             LF +T
Sbjct: 353 HMLFEIT 359


>gi|350425106|ref|XP_003494013.1| PREDICTED: cysteine protease ATG4B-like [Bombus impatiens]
          Length = 383

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 112/323 (34%), Positives = 163/323 (50%), Gaps = 24/323 (7%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
           I  +   +W+LG  +   ++           L    +D  S++  +YRK F PIG  +S 
Sbjct: 16  IPQTDEPVWILGKKYNAIRE-----------LDIIRRDIRSKLWFTYRKNFVPIGGYNST 64

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
            TSD GWGCMLR  QM++ QAL+   LGR W+   +   +  Y++IL  F D  T+ FSI
Sbjct: 65  FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWTAETR-NSTYLKILERFEDKRTAAFSI 123

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H +   G + G   G W GP  + +  + L       +     +L   + V    +    
Sbjct: 124 HQIASMGASEGKEVGQWFGPNTIAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
            G   V  D A     V  K  + W P+LLL+PL LGL ++NP YI  L+ +F  PQSLG
Sbjct: 184 EGGTTVEADGA-----VPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLG 238

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHL 399
           ++GGKP  + Y +G  E   IYLDPH  Q   ++GK    +++E D +TYH      I +
Sbjct: 239 VIGGKPNLALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPI 297

Query: 400 DSIDPSLAIGFYCRDKDDFDDFC 422
             IDPS+A+ F+C  + DF   C
Sbjct: 298 TGIDPSVALCFFCATEKDFKSLC 320


>gi|340709297|ref|XP_003393247.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Bombus
           terrestris]
          Length = 386

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 112/323 (34%), Positives = 163/323 (50%), Gaps = 24/323 (7%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
           I  +   +W+LG  +   ++           L    +D  S++  +YRK F PIG  +S 
Sbjct: 19  IPQTDEPVWILGKKYNAIRE-----------LDIIRRDIRSKLWFTYRKNFVPIGGYNST 67

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
            TSD GWGCMLR  QM++ QAL+   LGR W+   +   +  Y++IL  F D  T+ FSI
Sbjct: 68  FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWTAETR-NSTYLKILERFEDKRTAAFSI 126

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H +   G + G   G W GP  + +  + L       +     +L   + V    +    
Sbjct: 127 HQIASMGASEGKEVGQWFGPNTIAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 186

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
            G   V  D A     V  K  + W P+LLL+PL LGL ++NP YI  L+ +F  PQSLG
Sbjct: 187 EGGTTVEADGA-----VPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFKIPQSLG 241

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHL 399
           ++GGKP  + Y +G  E   IYLDPH  Q   ++GK    +++E D +TYH      I +
Sbjct: 242 VIGGKPNLALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMD-ATYHCKSSSRIPI 300

Query: 400 DSIDPSLAIGFYCRDKDDFDDFC 422
             IDPS+A+ F+C  + DF   C
Sbjct: 301 TGIDPSVALCFFCATEKDFKSLC 323


>gi|74147895|dbj|BAE22307.1| unnamed protein product [Mus musculus]
          Length = 458

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 118/409 (28%), Positives = 174/409 (42%), Gaps = 80/409 (19%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE          A+ D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDESKMLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENADSDSWTSNTVK 155

Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
            F          DRE                          + +I+  FG+S  + F +H
Sbjct: 156 KFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNEAYHRKIISWFGNSPVAVFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H K  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKILKISSKEKYPLFTFVNGHSKDFDFT 429


>gi|378731837|gb|EHY58296.1| autophagy-like protein 4 [Exophiala dermatitidis NIH/UT8656]
          Length = 480

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 106/328 (32%), Positives = 153/328 (46%), Gaps = 56/328 (17%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK----------------------ITSDVGWGCMLRSS 177
           F  DF SRI ++YR  F PI  S+                       TSD GWGCM+RS 
Sbjct: 114 FLDDFESRIWMTYRSNFTPIPRSQEPSRASSMSFSVRLRNLTEREGFTSDTGWGCMIRSG 173

Query: 178 QMLVAQALLFHRLGRPWRK-------------PLQKPFDREYVEILHLFGDSETSPFSIH 224
           Q L+A  L+   LGR WR+                    +   EIL LF DS  +PFSIH
Sbjct: 174 QSLLANTLMLLHLGRDWRRDHTHTPTTSDSKPSSSSSSTKREAEILSLFADSPDAPFSIH 233

Query: 225 NLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
             +Q G  A G   G W GP        A A C R E    C +  + +YV     +   
Sbjct: 234 RFVQHGASACGKHPGQWFGP-------SATASCIR-ELSTECAAAGLRVYVTPSASE--- 282

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
                   +D  R  +  S       P L+L  + LGL+++ P Y   L+ + T+PQS+G
Sbjct: 283 ------LYEDRFRSIAAASPSDPTIKPTLILFGIRLGLDRITPVYHEALKSSLTYPQSIG 336

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLD 400
           I GG+P +S Y VG Q +   YLDPH+ +P +       D  E + +T H+  +R + ++
Sbjct: 337 IAGGRPSSSHYFVGCQGDLFFYLDPHETRPALPHHASPADYSEEEIATCHTRRLRGLRIN 396

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKL 428
            +DPS+ IGF  +D+ D++D+  R  ++
Sbjct: 397 EMDPSMLIGFLIKDEADWEDWKRRIKEV 424


>gi|126723748|ref|NP_001075911.1| cysteine protease ATG4C [Bos taurus]
 gi|126010621|gb|AAI33599.1| ATG4C protein [Bos taurus]
          Length = 458

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 175/421 (41%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENELLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 198 ---------------LQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
                          L+ P             DRE      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNEIYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L+  GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIAYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLKG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   + C+  +    +   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTNDKAVIILVPVRLGGERTNADYLDFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|355669957|gb|AER94693.1| ATG4 autophagy related 4-like protein C [Mustela putorius furo]
          Length = 396

 Score =  165 bits (418), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 112/386 (29%), Positives = 163/386 (42%), Gaps = 78/386 (20%)

Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
           AGN  + EF +DF SRI ++YR+ F  I  S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 11  AGN--VEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 68

Query: 192 RPWRKP----------------------------------------LQKPFDREYVE--- 208
           R W  P                                         QK   R Y +   
Sbjct: 69  RAWTWPDALNIENSDSESWTSNTVKKFTASFEASLSGEGELKTPTVSQKEAIRRYSDDHE 128

Query: 209 ---------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
                    I+  FGDS  + F +H L++ GK  G  AG W GP  +           R 
Sbjct: 129 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 188

Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
               G     + IYV             V   D   + C+  +    D   +++L+P+ L
Sbjct: 189 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRL 235

Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
           G E+ N  Y+  ++   +    +GI+GGKP  S Y  G Q++S IY+DPH  Q  +++  
Sbjct: 236 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 295

Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 437
            D   +  T+H    + +    +DPS  IGFYCR+  DF       +K+ + S+    PL
Sbjct: 296 KDFPLE--TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPL 353

Query: 438 FTVTQTHKK-------PVNHSDVLGE 456
           FT    H +         N  D+  E
Sbjct: 354 FTFVNGHSRDYDFTSTTTNEEDLFSE 379


>gi|164660504|ref|XP_001731375.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
 gi|159105275|gb|EDP44161.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
          Length = 651

 Score =  165 bits (418), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 149/511 (29%), Positives = 220/511 (43%), Gaps = 98/511 (19%)

Query: 22  PNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAA 81
           P+   AS  SE+ S+ S S      ++  +S+       S+ SA E     ++ +     
Sbjct: 177 PSEDTASAASEVLSTSSYSPDTPSTATAVDSSHQ-----SDPSAKETPLCPSQMHSSQQP 231

Query: 82  VKRLVTAGSMRRIHERVLGPSRTGISSST-------SDIWLLGVCHKIAQDEALGDAAGN 134
           +       ++  + E VLG S T  +S T       +  W L   H +         A  
Sbjct: 232 ISDHQPVSTLLSLVEAVLGSSDTLPTSVTWLAHQLKARGWELLASHGVPYTSPTAHTAFP 291

Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
                 +  F   + +++R  F        TSDVGWGCMLRS Q ++A AL+   LGR W
Sbjct: 292 GVWHSVHAVFQHILSLTHRTCF--------TSDVGWGCMLRSVQSMLANALIRVHLGRHW 343

Query: 195 RKPLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP----YAMCR 248
           R+  ++    +Y  IL  F D  S   PFSIH L+  G+  G+ AG W GP    +A+C+
Sbjct: 344 RRRAKQKTHPQYARILSWFMDDPSLECPFSIHRLVDEGQRLGVQAGDWFGPSTAAFALCK 403

Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD- 307
             +A   C     GLG          V    DG      VV         + F+ G++D 
Sbjct: 404 LIQAYDAC-----GLG----------VVVTNDGMLYKEQVVA--------ASFAPGRSDP 440

Query: 308 WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
           WT P+L+L+   LGL++V P Y P L+ +FT PQS+G+VGG+P +S Y VGVQ E  + L
Sbjct: 441 WTRPVLILLVQRLGLDQVPPHYRPALKQSFTMPQSVGVVGGRPRSSLYFVGVQREHLLCL 500

Query: 367 DPHDVQPVINI------------GKDDLEADTSTYHSDVIRHIHLDS------------- 401
           DPH V+P +                 DL +  S +  +      LDS             
Sbjct: 501 DPHHVRPCVPFRSPPRMTRASVGASTDLASTVSPWFEEAYTAEELDSFHTPHTSLLPISQ 560

Query: 402 IDPSLAIGFYCRDKDDFDDFCAR----ASKLAEESNGAPLF----------------TVT 441
           +DPS+ +GF C    D  D  AR     ++L + ++  P +                   
Sbjct: 561 MDPSMLLGFVCEQASDLIDLQARIESSETRLFDVADNMPSYYRLSMSMGGEGEGDDDDNH 620

Query: 442 QTHKKPVNHSDVLGETGGVPE--DDSLGVMS 470
           +THK    HSD +    GV +  DDS   M+
Sbjct: 621 RTHKAEDGHSDRVAAHSGVGDNVDDSGWTMA 651


>gi|367047453|ref|XP_003654106.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
 gi|347001369|gb|AEO67770.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
          Length = 454

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 110/309 (35%), Positives = 153/309 (49%), Gaps = 50/309 (16%)

Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
           F  DF SRI ++YR GF       DP  +S ++                SD GWGCM+RS
Sbjct: 118 FLDDFESRIWMTYRTGFELIPRSTDPRANSALSFAMRLKTSFGDQTGFSSDTGWGCMIRS 177

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
            Q L+A AL   RLGR WR+      +RE   IL LF D   +P+S+HN ++ G A  G 
Sbjct: 178 GQSLLANALQISRLGRDWRRATDPDAERE---ILSLFADDPRAPYSLHNFVKHGAAACGK 234

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  EALA   + E+ L   S                G  P V  D   
Sbjct: 235 YPGEWFGPSATARCIEALA--NQHESSLRVYST---------------GDLPDVYEDS-- 275

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
              +V +     + P L+LV   LG++K+N  Y   L  T    QS+GI GG+P +S Y 
Sbjct: 276 -FMAVANPDGEHFHPTLILVCTRLGIDKINQVYEEALISTLQMEQSIGIAGGRPSSSHYF 334

Query: 356 VGVQEESAIYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           VGVQ +   YLDPH  +P +      +D    +  + H+  +RH+H++ +DPS+ IGF  
Sbjct: 335 VGVQGQWLFYLDPHHPRPALPYREAPEDYTSEELGSCHTRRLRHLHVEDMDPSMLIGFLI 394

Query: 413 RDKDDFDDF 421
           +D+DD+D +
Sbjct: 395 KDEDDWDTW 403


>gi|395530478|ref|XP_003767321.1| PREDICTED: cysteine protease ATG4C [Sarcophilus harrisii]
          Length = 458

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 116/421 (27%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 154
           S  S + LLG C+    +E    A       G N        + EF +DF SRI ++YR+
Sbjct: 36  SRNSPVLLLGKCYHFKSEEENDPAPVQPQWVGENEPVVVSGNVEEFRRDFISRIWLTYRE 95

Query: 155 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR------------------- 195
            F  I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                    
Sbjct: 96  EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDVDNSDSESWTSHT 155

Query: 196 ---------------------KPLQKPFDRE----------YVEILHLFGDSETSPFSIH 224
                                 P+++P  R           + +I+  F DS  + F +H
Sbjct: 156 VKKLTASLEASLTGERAAQDPSPIKEPPRRGSDDGGGEESCHRKIVSWFADSPLACFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEHGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   + CS       +   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYKADVIDKQCSSMDPENTEDKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESN--GAPLFTVTQTHKK-------PVNHSDVLG 455
           S  +GFYCR+  DF+      +K+ + S+    PLFT  + H +       P N  D+  
Sbjct: 381 SCTVGFYCRNIQDFERASEEITKVLKASSREKYPLFTFVKGHARDYDFTCTPTNEDDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|355669960|gb|AER94694.1| ATG4 autophagy related 4-like protein D [Mustela putorius furo]
          Length = 388

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 171/368 (46%), Gaps = 67/368 (18%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 50  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 99

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------RKP-- 197
             +TSD GWGCMLRS QM++AQ LL H L R W                      R P  
Sbjct: 100 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGSGLGPSEPSGLASPNRYRGPAR 159

Query: 198 ------------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
                       L++  +R + +I+  F D   +PF +H L   G++ G  AG W GP  
Sbjct: 160 WVPPRWAHGTPELEQ--ERRHRQIVSWFADHPRAPFGLHRLGGLGQSSGKKAGDWYGP-- 215

Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
                  +A   R       +   + +YV       +   A +V   D +          
Sbjct: 216 -----SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT---------- 260

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 261 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 320

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 321 LDPHYCQPTVDVTQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDQKEFETLCSEL 378

Query: 426 SKLAEESN 433
           +++   S+
Sbjct: 379 TRVLSSSS 386


>gi|335774946|gb|AEH58408.1| cysteine protease ATG4C-like protein, partial [Equus caballus]
          Length = 400

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 165/374 (44%), Gaps = 71/374 (18%)

Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
           AGN  + EF +DF+SRI ++YR+ F  I  S +T+D GWGC +R+ QML+AQ L+ H LG
Sbjct: 15  AGN--VEEFRKDFTSRIWLTYREEFPQIEGSTLTTDCGWGCTVRTGQMLLAQGLILHFLG 72

Query: 192 RPW----------------------------------RKPLQKPF------------DRE 205
           R W                                   + L+ P             D E
Sbjct: 73  RAWTWPDALNIENSDFESWTSNTVKKFTASFEASLSEERELKTPTISLKETIGRYSDDHE 132

Query: 206 ------YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
                 + +I+  FGDS  + F +H L++ GK  G  AG W GP  +           R 
Sbjct: 133 MRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 192

Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
               G     + IYV             V   D   + C+  +   AD   +++LVP+ L
Sbjct: 193 PDLQG-----ITIYVAQD--------CTVYSSDVIDKQCASMASDHADDKAVIILVPVRL 239

Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
           G E+ N  Y+  ++   +    +GI+GGKP  S Y  G Q++S IY+DPH  Q  +++  
Sbjct: 240 GGERTNTDYLDFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSI 299

Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 437
            D   +  T+H    + +    +DPS  IGFYCR+  DF       +K+ + S+    PL
Sbjct: 300 KDFPLE--TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPL 357

Query: 438 FTVTQTHKKPVNHS 451
           FT    H +  + +
Sbjct: 358 FTFVNGHSRDYDFT 371


>gi|116283594|gb|AAH18678.1| ATG4C protein [Homo sapiens]
          Length = 451

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGSVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
            F          +RE+                           +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGEERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|367032280|ref|XP_003665423.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
           42464]
 gi|347012694|gb|AEO60178.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
           42464]
          Length = 456

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 114/321 (35%), Positives = 163/321 (50%), Gaps = 57/321 (17%)

Query: 130 DAAGNNGLAE-FNQDFSSRILISYRKGF-------DP---------------IGD-SKIT 165
           +++G++G    F  DF SRI ++YR GF       DP               +GD +  T
Sbjct: 111 ESSGDSGWPPAFLDDFESRIWMTYRTGFELIPRSTDPRATSSFSIAMRLKTTLGDQTGFT 170

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           SD GWGCM+RS Q L+A ALL  RLGR WR+      +R    IL LF D   +P+S+HN
Sbjct: 171 SDTGWGCMIRSGQSLLANALLISRLGRDWRRMTDPDAERP---ILALFADDSRAPYSLHN 227

Query: 226 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            ++ G+ A G   G W GP A  R  +ALA   + E+ L   S                G
Sbjct: 228 FVKHGELACGKYPGEWFGPSATARCIQALA--NKHESSLRVYST---------------G 270

Query: 285 GAPVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
             P V  D      S  +  + D   + P L+LV   LG++K+N  Y+  L  T    QS
Sbjct: 271 DLPDVYED------SFMATAKPDGETFHPTLILVCTRLGIDKINQVYVEALISTLQMEQS 324

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI--GKDDLEADT-STYHSDVIRHIH 398
           +GI GG+P +S Y VGVQ +   YLDPH  +P +      DD  ++   + H+  +R +H
Sbjct: 325 IGIAGGRPASSHYFVGVQGQWLFYLDPHHPRPKLPYRENPDDYTSEELDSCHTRRLRRLH 384

Query: 399 LDSIDPSLAIGFYCRDKDDFD 419
           ++ +DPS+ IGF  +D+DD+D
Sbjct: 385 VEDMDPSMLIGFLIKDEDDWD 405


>gi|30410844|ref|NP_116241.2| cysteine protease ATG4C [Homo sapiens]
 gi|30410846|ref|NP_835739.1| cysteine protease ATG4C [Homo sapiens]
 gi|114556947|ref|XP_001159883.1| PREDICTED: cysteine protease ATG4C isoform 4 [Pan troglodytes]
 gi|114556951|ref|XP_001159976.1| PREDICTED: cysteine protease ATG4C isoform 6 [Pan troglodytes]
 gi|61211867|sp|Q96DT6.1|ATG4C_HUMAN RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
           cysteine endopeptidase; AltName: Full=Autophagin-3;
           AltName: Full=Autophagy-related cysteine endopeptidase
           3; AltName: Full=Autophagy-related protein 4 homolog C
 gi|14625875|emb|CAC43939.1| putative autophagy-related cysteine endopeptidase [Homo sapiens]
 gi|21542522|gb|AAH33024.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [Homo sapiens]
 gi|27763973|emb|CAC85556.1| Apg4-C protein [Homo sapiens]
 gi|119626984|gb|EAX06579.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|119626985|gb|EAX06580.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|123983334|gb|ABM83408.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
           construct]
 gi|123998035|gb|ABM86619.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
           construct]
 gi|410220598|gb|JAA07518.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410220600|gb|JAA07519.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410267918|gb|JAA21925.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410291226|gb|JAA24213.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410291228|gb|JAA24214.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410335203|gb|JAA36548.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410335205|gb|JAA36549.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
          Length = 458

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 176/421 (41%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
            F          +RE+                           +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|45861658|gb|AAS78582.1| Aut2B1 [Bos taurus]
          Length = 342

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 111/322 (34%), Positives = 151/322 (46%), Gaps = 46/322 (14%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + +   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSVLTEKDEILADVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L  F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYCSVLQAFLDRKDSCYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGP---------YAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
            Q G   G + G W GP          A+  +W ALA             + M   VV  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSALA-----------VHVAMDNTVVMA 177

Query: 278 D-EDGERGGAPVVCID----DASRHCSVF------SKGQADWTPILLLVPLVLGLEKVNP 326
           D     R   P    +    D+ RHC+ F          A W P++LL+PL LGL  VN 
Sbjct: 178 DIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPWRPLVLLIPLRLGLADVNA 237

Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
            Y  TL+  F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D 
Sbjct: 238 AYAGTLKHCFRMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVPDE 297

Query: 387 STYHSDVIRHIHLDSIDPSLAI 408
           S +       + +  +DPS+A+
Sbjct: 298 SFHCQHPPGRMSIAELDPSIAV 319


>gi|194853882|ref|XP_001968241.1| GG24763 [Drosophila erecta]
 gi|190660108|gb|EDV57300.1| GG24763 [Drosophila erecta]
          Length = 411

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 104/342 (30%), Positives = 165/342 (48%), Gaps = 36/342 (10%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           +D GWGCMLR  QM++AQAL+   LGR W        D  Y++I++ F D   S +SIH 
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-ADCRDATYLKIVNRFEDVRNSFYSIHQ 150

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
           + Q G++   A G W+GP  + +  + L R     +        +AI+V           
Sbjct: 151 IAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194

Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
              V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+       S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCGMI 249

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 402
           GG+P  + Y +G  ++  +YLDPH  Q    +G+    A+     TYH      +   ++
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAVAEQDYDETYHQKHAARLSFSAM 309

Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           DPSLA+ F C+  D F+    +  +         LF ++QT 
Sbjct: 310 DPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351


>gi|297664749|ref|XP_002810790.1| PREDICTED: cysteine protease ATG4C [Pongo abelii]
          Length = 458

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 118/421 (28%), Positives = 177/421 (42%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
                        + L+ P             D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|195470405|ref|XP_002087497.1| GE17286 [Drosophila yakuba]
 gi|194173598|gb|EDW87209.1| GE17286 [Drosophila yakuba]
          Length = 411

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 110/343 (32%), Positives = 170/343 (49%), Gaps = 42/343 (12%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I    +D+W+LG  +   Q+  L             +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTDVWVLGKKYNAIQELEL-----------IRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           +D GWGCMLR  QM++AQAL+   LGR W        D  Y++I++ F D   S +SIH 
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIDLHLGRDWFWT-SDCRDATYLKIVNRFEDVRNSYYSIHQ 150

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
           + Q G+    A G W+GP  + +  + L R     +        +AI+V           
Sbjct: 151 IAQMGETQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD-------- 194

Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
              V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+       S G++
Sbjct: 195 -STVVLDDVYSSC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELESSCGMI 249

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 402
           GG+P  + Y +G  ++  +YLDPH  Q    +G+    A+     TYH      +   ++
Sbjct: 250 GGRPNQALYFLGYVDDEVLYLDPHTTQRTGVVGQKTAAAEQDYDETYHQKHAARLSFSAM 309

Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEE--SNGAP-LFTVTQ 442
           DPSLA+ F C+  D F+   A  +KL EE  S  +P LF ++Q
Sbjct: 310 DPSLAVCFLCKTSDSFE---ALLTKLKEEVLSLCSPALFEISQ 349


>gi|380092671|emb|CCC09424.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 515

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 108/307 (35%), Positives = 150/307 (48%), Gaps = 50/307 (16%)

Query: 140 FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 176
           F  DF SRI ++YR  F       DP                   +  +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
            Q L+A A+L  RLGR WR+  +   D E  +I+ LF D   +PFS+HN ++ G  A G 
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYGATACGK 296

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  +AL      E+GL   S                G  P V  D   
Sbjct: 297 YPGEWFGPLATARCIQALT--DEKESGLRVYST---------------GDLPDVYEDS-- 337

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
              +V +     + P L+LV   LG++K+N  Y   L  T   PQS+GI GG+P +S Y 
Sbjct: 338 -FMAVANPDGRGFQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 396

Query: 356 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           +GVQ +   YLDPH  +P +   +D       +  T H+  +R +H+D +DPS+ IGF  
Sbjct: 397 IGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELDTCHTRRLRQLHIDDMDPSMLIGFLI 456

Query: 413 RDKDDFD 419
           +D+DD+D
Sbjct: 457 KDEDDWD 463


>gi|166990663|sp|Q2HH40.2|ATG4_CHAGB RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
          Length = 448

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 107/310 (34%), Positives = 154/310 (49%), Gaps = 56/310 (18%)

Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
           F  DF SRI ++YR GF+PI                      GD +  +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 235
            Q L+A ALL  +LGR WR+      +R    I+ LF D   +P+S+ N ++ G  A G 
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAERN---IVALFADDARAPYSLQNFVKHGAIACGK 229

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  +ALA    +          + IY          G  P V  D   
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269

Query: 296 RHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
              S  +  + D   + P L+LV   LG++K+NP Y   L  T    QS+GI GG+P +S
Sbjct: 270 ---SFLATARPDGETFHPTLILVCTRLGIDKINPVYEEALISTLQMEQSIGIAGGRPSSS 326

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIG 409
            Y VGVQ +   YLDPH  +P +   ++ L     +  + H+  +R++H++ +DPS+ IG
Sbjct: 327 HYFVGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSCHTRRLRYLHVEDMDPSMLIG 386

Query: 410 FYCRDKDDFD 419
           F  +D+DD+D
Sbjct: 387 FLIQDEDDWD 396


>gi|403257906|ref|XP_003921531.1| PREDICTED: cysteine protease ATG4C [Saimiri boliviensis
           boliviensis]
          Length = 458

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 118/421 (28%), Positives = 177/421 (42%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKMLPATSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
                        + L+ P             D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNEMYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|442625102|ref|NP_001259852.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
 gi|440213106|gb|AGB92389.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
          Length = 410

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 223
           +D GWGCMLR  QM++AQAL+   LGR W      P   D  Y++I++ F D   S +SI
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H + Q G++   A G W+GP  + +  + L R     +        +AI+V         
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
                V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+       S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 400
           ++GG+P  + Y +G  ++  +YLDPH  Q    + +    A+     TYH      ++  
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           ++DPSLA+ F C+  D F+    +  +         LF ++QT 
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351


>gi|383861144|ref|XP_003706046.1| PREDICTED: cysteine protease ATG4B-like [Megachile rotundata]
          Length = 384

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 111/329 (33%), Positives = 163/329 (49%), Gaps = 50/329 (15%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 170
           +W+LG  +   ++           L    +D  S++  +YRKGF PIG   S  TSD GW
Sbjct: 23  VWILGKQYNAIKE-----------LDAIRRDIRSKLWFTYRKGFVPIGGYTSTFTSDKGW 71

Query: 171 GCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           GCMLR  QM++ QAL+   LGR W+  P  +  +  Y++IL  F D  T+PFSIH +   
Sbjct: 72  GCMLRCGQMVLGQALIILHLGRDWQWTPETR--NSTYLKILERFEDRRTAPFSIHQIASM 129

Query: 230 GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
           G + G   G W GP  + +  + L       +        + I+V   +          +
Sbjct: 130 GASEGKEVGQWFGPNTIAQVLKKLVVYDDWSS--------ITIHVALDN---------TL 172

Query: 290 CIDDASRHCSVFS------------KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
            ++D  R C V              K  + W P+LLL+PL LGL ++NP YI  L+ +F 
Sbjct: 173 IVNDILRQCRVEGGTTAEADGNIPLKAPSQWKPLLLLIPLRLGLSEINPIYINGLKTSFK 232

Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDV 393
            PQSLG++GGKP  + Y +G      IYLDPH  Q   ++ K    +++E D +TYH   
Sbjct: 233 IPQSLGVIGGKPNLALYFIGCVGNEVIYLDPHTTQRSGSVDKKLEEEEIEMD-ATYHCKF 291

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
              I +  IDPS+A+ F+C  + DF   C
Sbjct: 292 ASRIPITGIDPSVALCFFCATERDFKSLC 320


>gi|19920488|ref|NP_608563.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
 gi|7296129|gb|AAF51423.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
 gi|16198037|gb|AAL13802.1| LD26292p [Drosophila melanogaster]
 gi|220945806|gb|ACL85446.1| Atg4-PA [synthetic construct]
 gi|220955642|gb|ACL90364.1| Atg4-PA [synthetic construct]
          Length = 411

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 223
           +D GWGCMLR  QM++AQAL+   LGR W      P   D  Y++I++ F D   S +SI
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H + Q G++   A G W+GP  + +  + L R     +        +AI+V         
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
                V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+       S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 400
           ++GG+P  + Y +G  ++  +YLDPH  Q    + +    A+     TYH      ++  
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           ++DPSLA+ F C+  D F+    +  +         LF ++QT 
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKLKEEVLSLCSPALFEISQTR 351


>gi|66529516|ref|XP_624577.1| PREDICTED: cysteine protease ATG4B [Apis mellifera]
          Length = 382

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 110/332 (33%), Positives = 161/332 (48%), Gaps = 42/332 (12%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
           I  +   +W+LG  +   ++           L    +D  S++  +YRK F PIG  +S 
Sbjct: 16  IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
            TSD GWGCMLR  QM++ QAL+   LGR W+  L+   +  Y++IL  F D   +PFSI
Sbjct: 65  FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWSLETR-NSTYLKILERFEDKRNAPFSI 123

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 274
           H +   G + G   G W GP  + +          W ++      +  L    +     V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183

Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
             G      G AP+              K  + W P+LLL+PL LGL ++NP YI  L+ 
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229

Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 390
           +F  PQSLG++GGKP  + Y +G      IYLDPH  Q   ++ K    +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288

Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
                 I +  IDPS+A+ F+C  + DF   C
Sbjct: 289 CKFSGRIPIIEIDPSVALCFFCATEKDFKSLC 320


>gi|442757637|gb|JAA70977.1| Putative cysteine protease required for autophagy [Ixodes ricinus]
          Length = 458

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 115/404 (28%), Positives = 169/404 (41%), Gaps = 80/404 (19%)

Query: 108 SSTSDIWLLGVC-HKIAQDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C H   +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKCEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------------------ 198
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPYALSIENSDSESRTSHTVK 155

Query: 199 ----------------------------QKPFDRE------YVEILHLFGDSETSPFSIH 224
                                       + P D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEAPLSGARELKSPTVSLKETIGRYPDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +    D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQRASMASDNTDDKAVIILVPVRLGGERTNTDYLEFIKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +
Sbjct: 381 SCTIGFYCRNIQDFKRASEEITKMLKISSKEKYPLFTFVNGHSR 424


>gi|195575679|ref|XP_002077704.1| GD23066 [Drosophila simulans]
 gi|194189713|gb|EDX03289.1| GD23066 [Drosophila simulans]
          Length = 411

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 104/344 (30%), Positives = 166/344 (48%), Gaps = 40/344 (11%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTDVWVLGKKYNAIQE-----------LELIRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP--FDREYVEILHLFGDSETSPFSI 223
           +D GWGCMLR  QM++AQAL+   LGR W      P   D  Y++I++ F D   S +SI
Sbjct: 92  TDKGWGCMLRCGQMVLAQALIDLHLGRDW---FWTPDCRDATYLKIVNRFEDVRNSFYSI 148

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H + Q G++   A G W+GP  + +  + L R     +        +AI+V         
Sbjct: 149 HQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD------ 194

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
                V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+       S G
Sbjct: 195 ---STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDSSCG 247

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLD 400
           ++GG+P  + Y +G  ++  +YLDPH  Q    + +    A+     TYH      ++  
Sbjct: 248 MIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFS 307

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           ++DPSLA+ F C+  D F+    +  +         LF ++QT 
Sbjct: 308 AMDPSLAVCFLCKTSDSFESLLTKFKEEVLSLCSPALFEISQTR 351


>gi|402854773|ref|XP_003892029.1| PREDICTED: cysteine protease ATG4C isoform 1 [Papio anubis]
 gi|402854775|ref|XP_003892030.1| PREDICTED: cysteine protease ATG4C isoform 2 [Papio anubis]
          Length = 458

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAGSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
            F          +RE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKSILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMAFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 ETGGVPEDDSLGVMSMNDAV 475
           E     E   L   SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456


>gi|380023311|ref|XP_003695467.1| PREDICTED: cysteine protease ATG4B-like [Apis florea]
          Length = 382

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 110/332 (33%), Positives = 161/332 (48%), Gaps = 42/332 (12%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
           I  +   +W+LG  +   ++           L    +D  S++  +YRK F PIG  +S 
Sbjct: 16  IPQTDEPVWVLGKKYNAIRE-----------LDAIRRDIRSKLWFTYRKNFVPIGGYNST 64

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
            TSD GWGCMLR  QM++ QAL+   LGR W+  L+   +  Y++IL  F D   +PFSI
Sbjct: 65  FTSDKGWGCMLRCGQMVLGQALIILHLGRDWQWNLETR-NSTYLKILERFEDKRNAPFSI 123

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIYV 274
           H +   G + G   G W GP  + +          W ++      +  L    +     V
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTVAQVLKKLVVFDEWSSITIHVALDNTLIVNDILKQCRV 183

Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
             G      G AP+              K  + W P+LLL+PL LGL ++NP YI  L+ 
Sbjct: 184 EGGTTVEADGDAPL--------------KAPSQWKPLLLLIPLRLGLSEINPIYINGLKT 229

Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYH 390
           +F  PQSLG++GGKP  + Y +G      IYLDPH  Q   ++ K    +++E D +TYH
Sbjct: 230 SFKIPQSLGVIGGKPTHALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMD-ATYH 288

Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
                 I +  IDPS+A+ F+C  + DF   C
Sbjct: 289 CKFSGRIPIIEIDPSVALCFFCATEKDFKSLC 320


>gi|189091768|ref|XP_001929717.1| hypothetical protein [Podospora anserina S mat+]
 gi|188219237|emb|CAP49217.1| unnamed protein product [Podospora anserina S mat+]
          Length = 508

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 151/307 (49%), Gaps = 50/307 (16%)

Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
           F  DF SRI ++YR GF+ I                      GD +  +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
            Q L+A A+L  R GR WR+      +RE   I+ LF D   +P+SI N +  G A  G 
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIERE---IVCLFADDPRAPYSIQNFVNHGAAACGK 289

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  +ALA+   +          + +Y+            P V  D+  
Sbjct: 290 YPGEWFGPSATARCIQALAKKHDSS---------LRVYLTRD--------LPEVYEDN-- 330

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
              S  +     + P L+LV   LG++K+NP Y   L  T   PQ++GI GG+P +S Y 
Sbjct: 331 -FMSTANPDGNHFHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSSHYF 389

Query: 356 VGVQEESAIYLDPHDVQPVINIGK---DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           +G Q +   YLDPH  +P +   +   D    +  + H+  +RH+H++ +DPS+ IGF  
Sbjct: 390 IGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSCHTRRLRHLHVEDMDPSMLIGFLI 449

Query: 413 RDKDDFD 419
           +D+DD+D
Sbjct: 450 KDEDDWD 456


>gi|344278625|ref|XP_003411094.1| PREDICTED: cysteine protease ATG4C [Loxodonta africana]
          Length = 458

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 115/404 (28%), Positives = 172/404 (42%), Gaps = 80/404 (19%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE          A+ D   +  + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKMLPAISSCAIEDCVISGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 201 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 224
            F                              D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGESELKTPSISLKKTIGKYSDDHEMRNEIYHRKIVSWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKAGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   + C+  +    D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQCASMASDNPDNKAVIILVPVRLGGERTNVDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
           S  IGFYC++  DF+      +K+ + S+    PLFT    H +
Sbjct: 381 SCTIGFYCQNVQDFERASEEITKMLKVSSKEKYPLFTFVNGHSR 424


>gi|383872484|ref|NP_001244816.1| cysteine protease ATG4C [Macaca mulatta]
 gi|355745338|gb|EHH49963.1| hypothetical protein EGM_00712 [Macaca fascicularis]
 gi|380788509|gb|AFE66130.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
 gi|383413101|gb|AFH29764.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
          Length = 458

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
            F          +RE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 ETGGVPEDDSLGVMSMNDAV 475
           E     E   L   SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456


>gi|449303631|gb|EMC99638.1| hypothetical protein BAUCODRAFT_344306 [Baudoinia compniacensis
           UAMH 10762]
          Length = 446

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 109/326 (33%), Positives = 165/326 (50%), Gaps = 65/326 (19%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------- 163
           A++EALG        AEF  D  +RI ++YR  F PI  S                    
Sbjct: 103 AEEEALG------WPAEFMDDMEARIWLTYRNNFPPIAKSSDPSAGSAMSFSTKLRNIGN 156

Query: 164 ---ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 220
               TSD GWGCM+RS Q L+A +L   +LGR WR+  +   + +Y  ++ LF D+  +P
Sbjct: 157 SGGFTSDAGWGCMIRSGQTLLANSLATLKLGRDWRRGQK---EDDYKHLISLFADTPEAP 213

Query: 221 FSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
           FSIH  ++ G +A G   G W GP A  RS +AL    R + GL   + P         +
Sbjct: 214 FSIHKFVEHGAQACGKHPGEWFGPSATARSVQALTEKYR-DVGLRVYARP---------D 263

Query: 280 DGERGGAPVVCIDDASRHCSVF-SKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRL 334
           DG+      V +D      S+F + GQ D    + P L+++ + LG++++ P Y   L+ 
Sbjct: 264 DGD------VYVD------SLFATAGQMDANDEFQPTLIVLGIRLGIDRITPVYHAALKA 311

Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI--NIGKDDLEADTSTYHSD 392
           T   PQS+GI GG+P +S Y VG Q ++  YLDPH  +  I  N   +DL    ++ H+ 
Sbjct: 312 TLEMPQSVGIAGGRPSSSHYFVGHQGDNFFYLDPHTTRQAIPQNPSAEDL----ASCHTR 367

Query: 393 VIRHIHLDSIDPSLAIGFYCRDKDDF 418
            +R + +  +DPS+ +GF    K++F
Sbjct: 368 RLRRLKIAEMDPSMLLGFLIHSKEEF 393


>gi|166990618|sp|A7KAI3.1|ATG4_PICAN RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|129714817|gb|ABO31288.1| Atg4p [Ogataea angusta]
          Length = 509

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 126/404 (31%), Positives = 187/404 (46%), Gaps = 80/404 (19%)

Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 163
           L  + HK   D+A    A  +   EF +D  SRI ++YR GF  I  ++           
Sbjct: 51  LRTLFHKFKPDQAADTEA--SWPREFLRDVHSRIWLTYRSGFPLIKRAEDGPSPLSFGSL 108

Query: 164 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
                         T+D GWGCM+R+SQ L+A +LL  RLGR WR    +   + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANSLLQLRLGRGWRYDQTRECAK-HAEIV 167

Query: 211 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
             F D  T+PFSIHN ++ G    G   G W GP A  RS + L      +TGL      
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKTGLKV---- 223

Query: 270 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 325
              +  SGD  ED                   +F   Q  A+  P+L+L  + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQQGAELRPVLILAGIRLGVKNVN 263

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----- 380
           P Y   L+ T  +PQS+GI GG+P +S Y  G Q +   YLDPH  Q  + I  +     
Sbjct: 264 PLYWDFLKKTLGWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHES 323

Query: 381 -------DLEA--DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
                  ++E+  D  + H++ IR +HLD +DPS+ +G    ++  +D   A    +   
Sbjct: 324 PDPNHYVEVESGLDLDSVHTNKIRKLHLDQMDPSMLVGLLVENRASYD---ALKHSINSH 380

Query: 432 SNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD--SLGVMSMND 473
             G+    V  +  +PV  +     +GG+ E +   LGV+SMN+
Sbjct: 381 DQGSRFLNVYDS--RPVLAAK---SSGGLEESEFVDLGVLSMNE 419


>gi|320581937|gb|EFW96156.1| cysteine protease ATG4, putative [Ogataea parapolymorpha DL-1]
          Length = 509

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 126/404 (31%), Positives = 186/404 (46%), Gaps = 80/404 (19%)

Query: 115 LLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK----------- 163
           L  + HK  QD+A    A  +   EF  D  SRI ++YR GF  I  ++           
Sbjct: 51  LRTLFHKFKQDQAAETEA--SWPREFLGDVHSRIWLTYRSGFPLIRRAEDGPSPLSFGSL 108

Query: 164 -------------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
                         T+D GWGCM+R+SQ L+A  LL  RLGR WR    +   + + EI+
Sbjct: 109 IRGTVDLATVTKGFTTDAGWGCMIRTSQSLLANGLLQLRLGRGWRYDQTRECAK-HAEIV 167

Query: 211 HLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
             F D  T+PFSIHN ++ G    G   G W GP A  RS + L      + GL      
Sbjct: 168 SWFVDIPTAPFSIHNFVEQGANCAGKKPGEWFGPSAAARSIQVLCEANYDKIGLKV---- 223

Query: 270 MAIYVVSGD--EDGERGGAPVVCIDDASRHCSVFSKGQ--ADWTPILLLVPLVLGLEKVN 325
              +  SGD  ED                   +F   Q  A+  P+L+L  + LG++ VN
Sbjct: 224 --YFTASGDIYED------------------ELFELAQEGAELRPVLILAGIRLGVKNVN 263

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD----- 380
           P Y   L+ T ++PQS+GI GG+P +S Y  G Q +   YLDPH  Q  + I  +     
Sbjct: 264 PLYWDFLKKTLSWPQSVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHES 323

Query: 381 -------DLEA--DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
                  ++E+  D  + H++ IR +HLD +DPS+ +G    ++  +D   A    +   
Sbjct: 324 PDPNHYVEVESGLDLDSVHTNKIRKLHLDQMDPSMLVGLLVENRASYD---ALKHNINAH 380

Query: 432 SNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD--SLGVMSMND 473
             G+    V  +  +PV  +     +GG+ E +   LGV+SMN+
Sbjct: 381 DQGSRFLNVYDS--RPVLAAK---SSGGLEESEFVDLGVLSMNE 419


>gi|157115549|ref|XP_001658259.1| Autophagy-specific protein, putative [Aedes aegypti]
 gi|108876876|gb|EAT41101.1| AAEL007228-PA [Aedes aegypti]
          Length = 389

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 114/343 (33%), Positives = 172/343 (50%), Gaps = 42/343 (12%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  +   +D  L             +D  +R+  +YR+GF PIG S++T+D GWGC
Sbjct: 28  VWILGKSYSATEDLDL-----------IRRDVQTRLWCTYRRGFVPIGGSQLTTDKGWGC 76

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL    LGR W    +   +  Y++I++ F DS+ +PFS+H +   G++
Sbjct: 77  MLRCGQMVLAQALTQLHLGRDWSWTPETT-NETYLKIVNRFEDSKAAPFSLHQIALTGES 135

Query: 233 -YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
                 G W GP  + +  + L +              + I+V   +          +  
Sbjct: 136 SEEKRVGEWFGPNTVAQVLKKLVKFD--------DWCSLVIHVALDN---------TLAT 178

Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
           D+    C V       W P+LL++PL LGL ++NP Y+  L+  F    + G+VGG+P  
Sbjct: 179 DEVLELC-VDRSNPDSWKPLLLIIPLRLGLSEINPIYVDGLKKCFELAGNCGMVGGRPNQ 237

Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLA 407
           + Y +G   + A+YLDPH VQ    IG     D+ E D  T+H    R I+   +DPSLA
Sbjct: 238 ALYFIGYVADEALYLDPHTVQRSGTIGSKRDPDERELD-ETFHQKYARRINFKGMDPSLA 296

Query: 408 IGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKP 447
           + F C  + DFDD   R     E+ NG    PLF VT+T + P
Sbjct: 297 LCFLCATRKDFDDLIQR---FKEDLNGGGCQPLFEVTKTRQAP 336


>gi|147905876|ref|NP_001088249.1| cysteine protease ATG4C [Xenopus laevis]
 gi|61211751|sp|Q5XH30.1|ATG4C_XENLA RecName: Full=Cysteine protease ATG4C; AltName:
           Full=Autophagy-related protein 4 homolog C
 gi|54038152|gb|AAH84245.1| LOC495080 protein [Xenopus laevis]
          Length = 450

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 113/403 (28%), Positives = 171/403 (42%), Gaps = 95/403 (23%)

Query: 110 TSDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFD 157
            S ++LLG C+    +++    D   N+G          + EF +DF SRI ++YRK F 
Sbjct: 38  NSPVFLLGKCYHFKYEDSGVTADDCSNSGSDSKEDLSGNVDEFRKDFISRIWLTYRKEFP 97

Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----------------------- 194
            I  S  T+D GWGC LR+ QML+AQ LL H LGR W                       
Sbjct: 98  QIESSSWTTDCGWGCTLRTGQMLLAQGLLVHFLGRDWTWTEALDIFCSESDFWTANTARK 157

Query: 195 -------------------RKPLQKPFDREYVEILH-----LFGDSETSPFSIHNLLQAG 230
                              ++PLQ    + Y E LH      F D   + F +H L++ G
Sbjct: 158 LDPSLEKSSPENEEYVSLGKQPLQNSEKKRYSEDLHRKIISWFADYPLAYFGLHQLVKLG 217

Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVC 290
           K  G  AG W GP  +      L R    E+                  D E  G  +  
Sbjct: 218 KNSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYV 256

Query: 291 IDDASRHCSVFSKGQADW-------TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
             D    C++++    D          +++LVP+ LG E+ N  Y   ++   +    +G
Sbjct: 257 AQD----CTIYNADVYDLQCNKGNEKAVVILVPVRLGGERTNMEYFEYVKGILSLEFCIG 312

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
           I+GGKP  S Y VG Q++S IY+DPH  Q  +++   +   +  ++H    + +    +D
Sbjct: 313 IIGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSIKNFPLE--SFHCPSPKKMSFKKMD 370

Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTH 444
           PS  +GFYCR+  +F+      +K+ + S     PLFT    H
Sbjct: 371 PSCTVGFYCRNAREFEKAAEELTKVLKSSTKQNYPLFTFVNGH 413


>gi|291398772|ref|XP_002715996.1| PREDICTED: APG4 autophagy 4 homolog C [Oryctolagus cuniculus]
          Length = 458

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 113/404 (27%), Positives = 170/404 (42%), Gaps = 80/404 (19%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 195 ------------RKPLQKPF------------DRE------YVEILHLFGDSETSPFSIH 224
                        + L+ P             D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPTICLKETIGKCSEDHETENEICHRKIISWFGDSPLAAFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + +YV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITVYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +    D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQSASMTSDNTDDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKMSSKEKYPLFTFVNGHSR 424


>gi|118094640|ref|XP_422520.2| PREDICTED: cysteine protease ATG4C [Gallus gallus]
          Length = 459

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 113/404 (27%), Positives = 168/404 (41%), Gaps = 80/404 (19%)

Query: 108 SSTSDIWLLGVCHKIAQDEA--LGDAAGN---------NGLAEFNQDFSSRILISYRKGF 156
           S  S ++LLG C+    DE+  L     N           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVFLLGKCYHFKSDESGELSTEGSNFDKINTEISGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQA-------------------------------- 184
             I  S +T+D GWGC LR+ QML+AQ                                 
Sbjct: 96  PQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWVWPDALDIENSDSESWTAHTVK 155

Query: 185 -----------------LLFHRLGRPWRKPLQKPFDREYV---EILHLFGDSETSPFSIH 224
                            +L H   R  R+       R  V   +I+  FGDS  + F +H
Sbjct: 156 KLTASLEASLTAEREPKILSHHQERTLRRDCGDSEMRNEVYHRKIISWFGDSPLAAFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + +YV          
Sbjct: 216 QLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG-----VTVYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   R CS    G+ D   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDRQCSFMDSGETDTKAVIILVPVRLGGERTNMDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFKKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
           S  IGFYCR   DF+      +K+ + S+    PLFT  + H +
Sbjct: 381 SCTIGFYCRTVQDFEKASEEITKMLKSSSKEKYPLFTFVKGHSR 424


>gi|355558068|gb|EHH14848.1| hypothetical protein EGK_00836 [Macaca mulatta]
          Length = 458

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 91/440 (20%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
            F          +RE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -FSVYNCDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKFSSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 ETGGVPEDDSLGVMSMNDAV 475
           E     E   L   SM + V
Sbjct: 441 E----DEKKRLKRFSMEEFV 456


>gi|50344862|ref|NP_001002103.1| cysteine protease ATG4C [Danio rerio]
 gi|47938047|gb|AAH71514.1| Autophagy-related 4C (yeast) [Danio rerio]
          Length = 463

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 118/408 (28%), Positives = 177/408 (43%), Gaps = 81/408 (19%)

Query: 108 SSTSDIWLLGVCH--KIAQDE--------ALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
           S  S ++LLG C+  K+  DE        AL D      + EF +DF+SR+ ++YR+ F 
Sbjct: 36  SRNSPVFLLGKCYHFKVVDDENPTESTAEALDDDVVTGNVDEFRKDFTSRVWLTYREEFP 95

Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQK---------- 200
            +  S  TSD GWGC LR+ QM++AQALL H LGR W+       +PL            
Sbjct: 96  ALPGSSFTSDCGWGCTLRAGQMILAQALLLHILGRDWKWSEALSLEPLDTETWTSSAARR 155

Query: 201 ---------------------PFDREYVE------------ILHLFGDSETSPFSIHNLL 227
                                P   E  E            I+  FGD  ++   I+ L+
Sbjct: 156 LVATLEASIQGERAQASQPLCPVQGEAEEADSYLKETYHRTIVSWFGDGPSAQLGIYKLV 215

Query: 228 QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 287
           + G   G  AG W GP         +A   R        ++   I V    +D     A 
Sbjct: 216 ELGMTSGKQAGDWYGP-------AVVAHILRKAVDEAVDAMLKGIRVYVA-QDCTVYSAD 267

Query: 288 VVCIDDASRHCSVFSKGQA-------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
           V  ID  S      S  Q        D   +++L+P+ LG EK+NP Y+  ++   +   
Sbjct: 268 V--IDSHSTRTESHSDPQGLDSGASPDSRAVVILIPVRLGGEKINPEYLNFVKSILSLEY 325

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
            +GI+GGKP  + Y VG Q++S IY+DPH  Q  +++   D      ++H    + +   
Sbjct: 326 CIGIIGGKPKQAYYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSPKKMSFS 383

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK 446
            +DPS  IGFY +  + F+      SK+ + S+    P FT+ + H K
Sbjct: 384 KMDPSCTIGFYSKSVEHFEKIANELSKILQPSSKEKYPAFTIMKGHGK 431


>gi|380485578|emb|CCF39271.1| cysteine protease atg4 [Colletotrichum higginsianum]
          Length = 454

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 103/306 (33%), Positives = 142/306 (46%), Gaps = 49/306 (16%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
           F  DF S+  ++YR  F  I  S                         TSD GWGCM+RS
Sbjct: 118 FLDDFESKFWMTYRSEFQAIAKSTDPRASSTLSFSMRIKSQLVDQNGFTSDSGWGCMIRS 177

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 235
            Q L+A A+    LGR WR+  Q P D    ++L  F D   +P+SIH  +Q G  A G 
Sbjct: 178 GQSLLANAMAAINLGRDWRR-GQNPEDER--KLLSWFADDPRAPYSIHQFVQHGAVACGK 234

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  +ALA  Q  +        P+ +Y          G  P V  D   
Sbjct: 235 YPGEWFGPSATARCIQALANAQEQQ--------PLRVYST--------GDGPDVYED--- 275

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
           +   +     + + P L+LV   LG++K+ P Y   L      PQS+GI GG+P +S Y 
Sbjct: 276 KFMEIAKPDGSRFNPTLILVGTRLGIDKITPVYWEALIAALQMPQSVGIAGGRPASSHYF 335

Query: 356 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           +G Q     YLDPH  +P +    D     EAD  T H+  +R +H+  +DPS+ +GF  
Sbjct: 336 IGAQGSYLFYLDPHHTRPALPFHTDPSHYSEADVDTVHTRRLRRLHVRELDPSMLVGFLI 395

Query: 413 RDKDDF 418
           RD+DD+
Sbjct: 396 RDEDDW 401


>gi|168693565|ref|NP_001108301.1| uncharacterized protein LOC100137698 [Xenopus laevis]
 gi|163915830|gb|AAI57741.1| LOC100137698 protein [Xenopus laevis]
          Length = 468

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 99/354 (27%), Positives = 161/354 (45%), Gaps = 59/354 (16%)

Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
           ++ +  F +DF SR+ ++YR+ F  +  + +T+D GWGCM+RS QML+AQ LL H L R 
Sbjct: 92  DDEIDRFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLMHLLSRE 151

Query: 194 W----------------------RKPL-------------------QKPF-DREYVEILH 211
           W                      R PL                   + P  ++ +  I+ 
Sbjct: 152 WTWPEALYTHFVEMEPIRSSSPSRMPLSSLATSHSASDCWPHAHSSRAPHGNQVHRNIIR 211

Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
            F D  ++PF +H ++  G  +G  AG W GP         +A   +      C+   ++
Sbjct: 212 WFSDHPSAPFGLHRMVALGSIFGKKAGDWYGP-------SIVAHIIKKAIETSCEVAELS 264

Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
           +YV S D    +     +   D     +    G+A    +++LVP  LG E  NP Y   
Sbjct: 265 VYV-SQDCTVYKADIEQLFAGDVPHAETSRDAGKA----VIILVPARLGGETFNPVYKHC 319

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
           L+     P  LGI+GGKP  S Y +G Q+   +YLDPH  Q  I+  ++D   +  ++H 
Sbjct: 320 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYSQSYIDTSRNDFPLE--SFHC 377

Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQ 442
           +  R I +  +DPS    FY +++DDF   C    K+    +     P+F++++
Sbjct: 378 NTPRKISITRMDPSCTFAFYAQNRDDFGKLCDHLMKVLHSPHAEEKYPIFSISE 431


>gi|327264155|ref|XP_003216881.1| PREDICTED: cysteine protease ATG4D-like [Anolis carolinensis]
          Length = 585

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 99/359 (27%), Positives = 157/359 (43%), Gaps = 69/359 (19%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----- 194
           F +DF+SRI ++YR+ F  +  +  T+D GWGCMLRS QML+AQ L+ H LG+ W     
Sbjct: 198 FQKDFASRIWLTYRRDFQQLEGTMWTTDCGWGCMLRSGQMLLAQGLIVHFLGKDWTWPDA 257

Query: 195 ------------------------------------------------RKPLQKPFDREY 206
                                                           R P +   +R +
Sbjct: 258 LHTPGLVEMEPMKATHLPYPSTSSSHQGPSIPTDRSRGPWELRAPRHTRSPDELEKERYH 317

Query: 207 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
            +I+  F D   + F IH L+  G + G  AG W GP            C        C 
Sbjct: 318 RKIISWFADRPQAHFGIHRLVSLGHSSGKKAGDWYGPSVAAHIIRKAVDC--------CS 369

Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 326
                +  VS D    +G   V  + + S   + +  G A W  +++LVP+ LG E  NP
Sbjct: 370 EAGNLVVYVSQDCTVYKGD--VANLANKSEDRTAWDPG-AVWKAVIILVPMRLGGEAFNP 426

Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
            Y+  ++        +GI+GGKP  S Y VG Q+++ +YLDPH  QP ++  K++   + 
Sbjct: 427 AYVDCVKELLKLEFCIGIIGGKPRHSLYFVGYQDDALLYLDPHYCQPFVDTTKENFPLE- 485

Query: 387 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQ 442
            ++H +  R      +DPS  IGFY   + +F++ C   +++   S      P+F++ +
Sbjct: 486 -SFHCNSPRKTAFTKVDPSCTIGFYAHHRTEFEELCLHLTQVLNSSTAKEKYPMFSIVE 543


>gi|73956170|ref|XP_852273.1| PREDICTED: cysteine protease ATG4C isoform 2 [Canis lupus
           familiaris]
 gi|73956176|ref|XP_865426.1| PREDICTED: cysteine protease ATG4C isoform 4 [Canis lupus
           familiaris]
          Length = 458

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 117/421 (27%), Positives = 172/421 (40%), Gaps = 87/421 (20%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKFEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
             I  S  T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEGSAFTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSDSWTSNTVK 155

Query: 201 PF------------------------------DRE------YVEILHLFGDSETSPFSIH 224
            F                              D E      + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGESELKTPTVSQKETIRRHSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   + C+  +    D   +++L+P+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHSDVLG 455
           S  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  D+  
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEEDLFS 440

Query: 456 E 456
           E
Sbjct: 441 E 441


>gi|14042698|dbj|BAB55356.1| unnamed protein product [Homo sapiens]
          Length = 446

 Score =  161 bits (407), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 115/397 (28%), Positives = 170/397 (42%), Gaps = 80/397 (20%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
            F          +RE+                           +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFT 439
           S  IGFYCR+  DF       +K+ + S+    PLFT
Sbjct: 381 SCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFT 417


>gi|432855098|ref|XP_004068071.1| PREDICTED: cysteine protease ATG4C-like [Oryzias latipes]
          Length = 482

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 120/424 (28%), Positives = 179/424 (42%), Gaps = 96/424 (22%)

Query: 108 SSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAE---------FNQDFSSRILISYRKGFD 157
           S  S + LLG C H  A D+   D A      E         F +DF+SR+ ++YR+ F 
Sbjct: 36  SRNSPVLLLGRCYHFKADDDGSADEASCREPEEGFSMGNVEAFRKDFTSRVWLTYREEFP 95

Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------KPLQ----------- 199
           P+  S +T+D GWGC+LR+ QM++AQAL+ H LGR W        +PL            
Sbjct: 96  PLPGSTLTTDCGWGCLLRAGQMMLAQALVLHFLGRDWTWSEALTLQPLDTETWTASAAKR 155

Query: 200 -------------KPFDREYVE-----------------------ILHLFGDSETSPFSI 223
                        K  DR++ E                       I+  FGD+ ++   +
Sbjct: 156 LVASLEASLQGSPKNSDRQHSEPQSSSQGSAEEAEAHLKEMYHRTIISWFGDTSSALLGL 215

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG------------------- 264
           H L++ G   G  AG+W GP  +    +     +  ++GL                    
Sbjct: 216 HRLVRLGLTMGKNAGNWYGPAVVAHILKKAVE-EAMDSGLAGITAYVSQDCTVYSADVAD 274

Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
           C   P A            GG P    +D     S+    QA    +++L+P+ LG EK+
Sbjct: 275 CHKPPSARQASVSPPIA--GGGP--SKEDQPGSASILPDSQA----VIILIPVRLGGEKI 326

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
           NP Y   ++   +    +GI+GGKP  + Y VG Q++S IY+DPH  Q  +++   D   
Sbjct: 327 NPEYFEFVKNILSVEYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSNGDFP- 385

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQ 442
              ++H    + I    +DPS  IGFY R   D+D      SKL + S     P FT  Q
Sbjct: 386 -LQSFHCPSPKKIPFTRMDPSCTIGFYSRSLQDYDRIREELSKLLQPSTKEKYPAFTFVQ 444

Query: 443 THKK 446
            H +
Sbjct: 445 GHGR 448


>gi|212645205|ref|NP_493375.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
 gi|193247781|emb|CAB54483.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
          Length = 454

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 105/369 (28%), Positives = 168/369 (45%), Gaps = 59/369 (15%)

Query: 127 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 185
           ALG + +  +G+    +  +SR   +YR+ F PIG +  ++D GWGCMLR +QML+ + L
Sbjct: 39  ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 98

Query: 186 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
           L   +GR +   ++K     Y +IL +F D + + +SIH + Q G   G     W GP  
Sbjct: 99  LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 157

Query: 246 MCR---------SWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 289
             +          W  +A     +  L  + ++ MA    S D      E+G        
Sbjct: 158 AAQVMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENG-------- 209

Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
            + D +R          +W P+LL++PL LGL  +NP Y+  ++  F  PQ +GI+GG+P
Sbjct: 210 -LVDKNRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRP 268

Query: 350 GASTYIVGVQEESAIYLDPHDVQPVI-----------------NIGKDDLE--------- 383
             + Y VG+      YLDPH  +P                   ++G   LE         
Sbjct: 269 NHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDVGFSHLEELVPLPSQT 328

Query: 384 ------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 437
                  D STYH  ++  I  +++DPSLA+  +C  +D+F++ C    K    ++  P+
Sbjct: 329 ADVYTKMDDSTYHCQMMLWIEYENVDPSLALAMFCETRDEFENLCETLQKTTLPASQPPM 388

Query: 438 FTVTQTHKK 446
           F   Q   K
Sbjct: 389 FEFLQRRPK 397


>gi|126305934|ref|XP_001364974.1| PREDICTED: cysteine protease ATG4C [Monodelphis domestica]
          Length = 460

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 122/423 (28%), Positives = 183/423 (43%), Gaps = 89/423 (21%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDA------AGNN-------GLAEFNQDFSSRILISYRK 154
           S  S + LLG C+    +E    A      AG N        + EF +DF SRI ++YR+
Sbjct: 36  SRNSPVLLLGKCYHFKSEEENDPAPVGSGWAGENEHVVIYGNVEEFRRDFISRIWLTYRE 95

Query: 155 GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------- 197
            F  I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                 
Sbjct: 96  EFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALDIENSDSASWTSHT 155

Query: 198 ------------------------LQKPF-----DRE------YVEILHLFGDSETSPFS 222
                                   L++P      D E      + +I+  FGDS  + F 
Sbjct: 156 VKKLTASFEASLTGERTPKVPPSILKEPRRTGSEDEEGRNELCHRKIISWFGDSPLACFG 215

Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
           +H L++ GK  G  AG W GP  +           R     G     + IYV    +D  
Sbjct: 216 LHQLIEYGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVA---QDCT 267

Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
              A V+     S      ++ +A    I+LLVP+ LG E+ N  Y+  ++   +    +
Sbjct: 268 VYKADVIDKQGISAGLET-TEDKA----IILLVPVRLGGERTNMDYLDFVKGILSLEYCV 322

Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 402
           GI+GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +
Sbjct: 323 GIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--SFHCPSPKKMSFRKM 380

Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESN--GAPLFTVTQTHKK-------PVNHSDV 453
           DPS  +GFYCR+  DF+      +++ + S+    PLFT  + H +       P N  D+
Sbjct: 381 DPSCTVGFYCRNAQDFERASEELTQVLKASSREKYPLFTFVKGHARDYDFTSTPTNEDDL 440

Query: 454 LGE 456
             E
Sbjct: 441 FSE 443


>gi|47222154|emb|CAG11580.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 440

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 115/410 (28%), Positives = 174/410 (42%), Gaps = 81/410 (19%)

Query: 108 SSTSDIWLLGVCH--KIAQDEALGDAAGN--------NGLAEFNQDFSSRILISYRKGFD 157
           S  S + LLG C+  K+ +DE + +A             + +F +DF SRI ++YR+ F 
Sbjct: 36  SRNSPVLLLGKCYHFKVEEDEGVAEACCEASDEEDVVGNVEDFRRDFGSRIWLTYREEFP 95

Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDRE--------- 205
           P+  S +TSD GWGCMLR+ QM++AQALL H +GR W   R    +P D E         
Sbjct: 96  PLPGSTLTSDCGWGCMLRAGQMMLAQALLLHFMGRDWTWSRTMSLQPLDTETWTTSAAKR 155

Query: 206 ----------------------------------YVE-------ILHLFGDSETSPFSIH 224
                                             +VE       ++  FGDS ++ F +H
Sbjct: 156 LVASLESSLQGSPGPSDNRGPQNQAAGSAEEAGAHVEGEAFHRTLVSWFGDSPSAQFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSW-----EALARCQRAETGLGCQSLPMAIYVVSGDE 279
            ++  G   G  AG W GP  +         EAL       T    Q   +    V    
Sbjct: 216 RMVHLGLEMGKQAGEWYGPAVVAHILKKAVEEALDPSLAGITAYVSQDCTVYSADVIDGH 275

Query: 280 DGERGGAP-----VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
                 +P     V  +   ++  S     +A    +++LVP+ LG EK NP Y    + 
Sbjct: 276 KASTSASPESSDDVTLLSPNNQAASALPDSRA----VIILVPVRLGGEKTNPDYFNLAKS 331

Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 394
             +    +GI+GGKP  + Y VG Q++S IY+DPH  Q  +++   D      ++H    
Sbjct: 332 ILSLDYCIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSP 389

Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQ 442
           + +    +DPS  +GFY R   DF+      +KL + S+    P F   Q
Sbjct: 390 KKMPFTKMDPSCTLGFYSRSAQDFEKIKQELTKLLQPSSKEKYPAFIFVQ 439


>gi|310801857|gb|EFQ36750.1| peptidase family C54 [Glomerella graminicola M1.001]
          Length = 454

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 107/329 (32%), Positives = 152/329 (46%), Gaps = 54/329 (16%)

Query: 122 IAQDEALGDAAGNNG--LAEFNQDFSSRILISYRKGFDPIGDSK---------------- 163
           +A DE   D +G +G     F  DF S+  ++YR  F  I  S                 
Sbjct: 101 LAYDE---DYSGQDGGWPTAFLDDFESKFWMTYRSEFPAIAKSTDPRASSALSFSMRIKS 157

Query: 164 -------ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216
                   +SD GWGCM+RS Q L+A A+    LGR WR+   +  +R+   +L LF D 
Sbjct: 158 QLVDQNGFSSDSGWGCMIRSGQSLLANAMAVINLGRDWRRGQNQEEERK---LLSLFADD 214

Query: 217 ETSPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
             +P+SIH  +Q G  A G   G W GP A  R  +ALA  Q  +        P+ +Y  
Sbjct: 215 PRAPYSIHQFVQHGAVACGKYPGEWFGPSATARCIQALANAQMHQ--------PLRVYST 266

Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
               D        +   D SR           + P L+LV   LG++K+ P Y   L   
Sbjct: 267 GDGPDVYEDKFMKIAKPDGSR-----------FHPTLILVGTRLGIDKITPVYWEALIAA 315

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSD 392
              PQS+GI GG+P +S Y +G Q     YLDPH  +P +    +     EAD  T H+ 
Sbjct: 316 LQMPQSVGIAGGRPSSSHYFIGAQGSYLFYLDPHHTRPALPFHMNPSLYSEADVDTVHTR 375

Query: 393 VIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
            +R +H+  +DPS+ IGF   D+DD+D++
Sbjct: 376 RLRRLHVRELDPSMLIGFLILDEDDWDEW 404


>gi|336467357|gb|EGO55521.1| hypothetical protein NEUTE1DRAFT_85886 [Neurospora tetrasperma FGSC
           2508]
 gi|350288001|gb|EGZ69237.1| hypothetical protein NEUTE2DRAFT_94213 [Neurospora tetrasperma FGSC
           2509]
          Length = 506

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 108/309 (34%), Positives = 151/309 (48%), Gaps = 50/309 (16%)

Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
           F  DF SRI ++YR  F       DP   S ++                SD GWGCM+RS
Sbjct: 171 FLDDFESRIWMTYRTDFAFIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 230

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
            Q L+A A+L  RLGR WR+      D E  +I+ LF D   +P+S+HN ++ G  A G 
Sbjct: 231 GQSLLANAILIARLGREWRRGTD--LDAE-KDIIALFADDPRAPYSLHNFVKYGATACGK 287

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  +ALA     ++GL   S                G  P V  D   
Sbjct: 288 YPGEWFGPSATARCIQALA--DEKQSGLRVYST---------------GDLPDVYEDS-- 328

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
              +V +     + P L+LV   LG++K+N  Y   L  T   PQS+GI GG+P +S Y 
Sbjct: 329 -FMAVANPDGRGFQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 387

Query: 356 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           VGVQ +   YLDPH  +P +   +D       +  T H+  +R +H+  +DPS+ IGF  
Sbjct: 388 VGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELDTCHTRRLRQLHIGDMDPSMLIGFLI 447

Query: 413 RDKDDFDDF 421
           +D+DD+D +
Sbjct: 448 KDEDDWDTW 456


>gi|453230621|ref|NP_001263575.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
 gi|412974713|emb|CCO25637.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
          Length = 481

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 105/369 (28%), Positives = 168/369 (45%), Gaps = 59/369 (15%)

Query: 127 ALG-DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL 185
           ALG + +  +G+    +  +SR   +YR+ F PIG +  ++D GWGCMLR +QML+ + L
Sbjct: 66  ALGKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVL 125

Query: 186 LFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-- 243
           L   +GR +   ++K     Y +IL +F D + + +SIH + Q G   G     W GP  
Sbjct: 126 LRRHIGRHFEWDIEKT-SEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNT 184

Query: 244 -------YAMCRSWEALARCQRAETGLGCQ-SLPMAIYVVSGD------EDGERGGAPVV 289
                    +   W  +A     +  L  + ++ MA    S D      E+G        
Sbjct: 185 AAQVMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENG-------- 236

Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
            + D +R          +W P+LL++PL LGL  +NP Y+  ++  F  PQ +GI+GG+P
Sbjct: 237 -LVDKNRLSLSPGNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEFFKIPQCVGIIGGRP 295

Query: 350 GASTYIVGVQEESAIYLDPHDVQPVI-----------------NIGKDDLE--------- 383
             + Y VG+      YLDPH  +P                   ++G   LE         
Sbjct: 296 NHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDVGFSHLEELVPLPSQT 355

Query: 384 ------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 437
                  D STYH  ++  I  +++DPSLA+  +C  +D+F++ C    K    ++  P+
Sbjct: 356 ADVYTKMDDSTYHCQMMLWIEYENVDPSLALAMFCETRDEFENLCETLQKTTLPASQPPM 415

Query: 438 FTVTQTHKK 446
           F   Q   K
Sbjct: 416 FEFLQRRPK 424


>gi|255945233|ref|XP_002563384.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|166990617|sp|A7KAL5.1|ATG4_PENCW RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|129561973|gb|ABO31075.1| Atg4p [Penicillium chrysogenum]
 gi|211588119|emb|CAP86190.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 401

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 116/380 (30%), Positives = 171/380 (45%), Gaps = 75/380 (19%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAE-----------------FNQDFSSRILISYRKG 155
           IW LG   + A  +   D A NN  +                  F  DF SRI I+YR  
Sbjct: 29  IWCLG--REYAPSQPPSDPASNNPRSPSRQPNASTLNDTTWPKAFLSDFGSRIWITYRSN 86

Query: 156 FDPIGDSK-----------------------ITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
           F PI  +K                        TSD GWGCM+RS Q L+A       LGR
Sbjct: 87  FTPIPRTKTPEATSSMTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQSLLANTFSVLLLGR 146

Query: 193 PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWE 251
            WR+  +     E  +++ +F D   +PFSIH  +  G ++ G   G W GP        
Sbjct: 147 DWRRGEKV---EEESKLISMFADHPEAPFSIHRFVNRGAESCGKYPGEWFGP-------S 196

Query: 252 ALARCQRAETGLGCQS-LP-MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
           A A+C +    L  QS +P + +Y+ +   D           +D   H +    G+    
Sbjct: 197 ATAKCIQL---LSTQSEVPQLRVYLTNDTSD---------VYEDKFAHVAHDESGRIQ-- 242

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
           P L+L+   LG++ V P Y   LR   T+PQS+GI GG+P AS Y VG Q+    +LDPH
Sbjct: 243 PTLILIGTRLGIDNVTPAYWDGLRAALTYPQSVGIAGGRPSASHYFVGAQDCHLFFLDPH 302

Query: 370 DVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
             +P      D L  + +  +Y++  +R IH+  +DPS+ IGF  +D+DD+ D+     K
Sbjct: 303 TTRPATLYRPDGLYTQEELDSYYTSRLRRIHIKDMDPSMLIGFLVKDEDDWADW----KK 358

Query: 428 LAEESNGAPLFTVTQTHKKP 447
               + G P+  +  +  +P
Sbjct: 359 RIRSTPGQPIVHIFPSQHQP 378


>gi|85067704|ref|XP_959438.1| hypothetical protein NCU02433 [Neurospora crassa OR74A]
 gi|62899773|sp|Q7S3X7.1|ATG4_NEUCR RecName: Full=Probable cysteine protease atg-4; AltName:
           Full=Autophagy-related protein 4
 gi|28920860|gb|EAA30202.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 506

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 108/309 (34%), Positives = 151/309 (48%), Gaps = 50/309 (16%)

Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
           F  DF SRI ++YR  F       DP   S ++                SD GWGCM+RS
Sbjct: 171 FLDDFESRIWMTYRTDFALIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 230

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
            Q L+A A+L  RLGR WR+      D E  +I+ LF D   +P+S+HN ++ G  A G 
Sbjct: 231 GQSLLANAILIARLGREWRRGTD--LDAEK-DIIALFADDPRAPYSLHNFVKYGATACGK 287

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  +ALA     ++GL   S                G  P V  D   
Sbjct: 288 YPGEWFGPSATARCIQALA--DEKQSGLRVYST---------------GDLPDVYEDS-- 328

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
              +V +     + P L+LV   LG++K+N  Y   L  T   PQS+GI GG+P +S Y 
Sbjct: 329 -FMAVANPDGRGFQPTLILVCTRLGIDKINQVYEEALISTLQLPQSIGIAGGRPSSSHYF 387

Query: 356 VGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           VGVQ +   YLDPH  +P +   +D       +  T H+  +R +H+  +DPS+ IGF  
Sbjct: 388 VGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELDTCHTRRLRQLHIGDMDPSMLIGFLI 447

Query: 413 RDKDDFDDF 421
           +D+DD+D +
Sbjct: 448 KDEDDWDTW 456


>gi|387015378|gb|AFJ49808.1| Cysteine protease ATG4C-like [Crotalus adamanteus]
          Length = 457

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 113/402 (28%), Positives = 170/402 (42%), Gaps = 78/402 (19%)

Query: 108 SSTSDIWLLGVCHKIAQDEA-----------LGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S ++LLG C+    DE            + D + +  + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVFLLGKCYHFKSDEPSDQSPNGSCDDMTDESFSRNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------------------- 194
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                      
Sbjct: 96  PQITGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWANAFVFENPESESWTSQTVK 155

Query: 195 -----------------------RKPLQKPFDREYVE------ILHLFGDSETSPFSIHN 225
                                  + P++     E VE      I+  F DS  + F +H 
Sbjct: 156 KLTASLETSLIGEREFRSQSTHPKSPIRNQETEESVEEQYHRRIISWFADSPFANFGLHR 215

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
           L++ GK  G  AG W GP  +      L R +  E     +   + IYV       +   
Sbjct: 216 LIEYGKKSGKIAGDWYGPAVVAH----LLR-KAVEKARDPELQGITIYVAQDCTVYKSDV 270

Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
              +C    S   SV S        I++L+P+ LG E+ N  Y   ++   +    +GI+
Sbjct: 271 IDALCPFTDSEKTSVKS--------IIILIPVRLGGERTNMEYFEFVKGILSLDYCIGII 322

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  ++H    + +    +DPS
Sbjct: 323 GGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSVKDFPLE--SFHCPSPKKMSFKKMDPS 380

Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESN-GAPLFTVTQTHKK 446
             IG YC D   F+      +K+ + S    PLFT    H +
Sbjct: 381 CTIGLYCPDMQGFERAAEEITKILKLSKEKYPLFTFVNGHSR 422


>gi|354470829|ref|XP_003497647.1| PREDICTED: cysteine protease ATG4C [Cricetulus griseus]
          Length = 458

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 115/409 (28%), Positives = 173/409 (42%), Gaps = 80/409 (19%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKMLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP----------------WRKPLQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR                 W     K
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIENSDSDSWTSNTVK 155

Query: 201 PF----------DRE--------------------------YVEILHLFGDSETSPFSIH 224
            F          +RE                          + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELRTTALSLKETIGKYSDDHAVQNEIYHRKIISWFGDSPVAVFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTNSSTSGDARDKAVIILVPVRLGGERTNTDYLEFVKGVLSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKILKISSKEKYPLFTFVNGHSRDFDFT 429


>gi|268570274|ref|XP_002640735.1| Hypothetical protein CBG19805 [Caenorhabditis briggsae]
          Length = 481

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 122/469 (26%), Positives = 195/469 (41%), Gaps = 104/469 (22%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           IS  T  IW LG            + +  +G+    +  +SR   +YR+ F PIG +  +
Sbjct: 25  ISIDTFPIWALG-----------KEISKEDGIDAMKKYMTSRFWFTYRRNFSPIGGTGPS 73

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           +D  WGCMLR +QML+ + LL   +GR +   ++K  D  Y +IL +F D + + +SIH 
Sbjct: 74  TDQYWGCMLRCAQMLLGEVLLRRHIGRHFEWDIEKTSDV-YEKILQMFFDEKDALYSIHQ 132

Query: 226 LLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQ-SLPMAIYVV 275
           + Q G + G     W GP    +          W  +A     +  L  Q +L MA    
Sbjct: 133 IAQMGVSEGKEVSEWFGPNTAAQVIKKLTIFDDWSNIAVHVALDNILVKQDALTMATTYP 192

Query: 276 SGDE----DGERG-------GAPVVCID-DASRHCSVFSKGQAD-------------WTP 310
           S D      GE G        + ++C++ D  +    F  G  +             W P
Sbjct: 193 SEDAVKLIMGEFGFKSDRISSSHIICMNLDYFKKLLNFENGLVEKHYTSTVPANGTEWRP 252

Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
           +LL++PL LGL  +N  Y+  ++  F  PQ +GI+GGKP  + Y VG+      YLDPH 
Sbjct: 253 LLLMIPLRLGLTSINSCYLSAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHH 312

Query: 371 VQP--------------------------VINIGKDDLE---------------ADTSTY 389
            +P                          + + G  +LE                + STY
Sbjct: 313 CRPKTSKFFVEKEQQQQSSGDSTPEKVEKIDDNGFHELEDLEPLPSQTSDVYTKMNDSTY 372

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPV- 448
           H  +++ +  DSIDPSLA+  +C  +++F++ C    K    ++  P+F   +   K + 
Sbjct: 373 HCQMMQWMEYDSIDPSLALALFCETREEFENLCDELQKTTLTASNPPMFEFLEKRPKYLP 432

Query: 449 ---------------NHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 482
                             D+      + ED  +  +S+ DA   A  DD
Sbjct: 433 KFEPYTGVSMKIEMKEFDDIGAANSKIDEDFEVLDVSVEDAETGAEADD 481


>gi|121934653|sp|Q0U199.1|ATG4_PHANO RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
          Length = 467

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 116/354 (32%), Positives = 158/354 (44%), Gaps = 87/354 (24%)

Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
           N  + F  DF SR+ ++YR GF PI  S+                      TSD G+GCM
Sbjct: 91  NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150

Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
           +RS Q ++A AL   RLGR WR   +   D+++ EIL LF D   +PFSIH  ++ G A 
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209

Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G   G W GP A  R  + LA   R E GL        +YV SGD      GA V   +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVYV-SGD------GADVY--E 252

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  +  +V   G   W P L+LV   LG++K+ P Y   L+ +   PQS+GI GG+P AS
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKASLQIPQSIGIAGGRPSAS 310

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEA---------------------------- 384
            Y VGVQ  +  YLDPH  +P++      L A                            
Sbjct: 311 HYFVGVQGNNFYYLDPHSTRPLLPFHPPSLAAATSDTPNLTASTTSVSSTTSSTTIVPPA 370

Query: 385 -----------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
                            D ST H+  IR + +  +DPS+ + F    + D+ D+
Sbjct: 371 DSIPAPSDPRQSLYPPSDLSTCHTRRIRRLQIREMDPSMLLAFLVTSEADYQDW 424


>gi|407917424|gb|EKG10733.1| Peptidase C54 [Macrophomina phaseolina MS6]
          Length = 437

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 107/342 (31%), Positives = 157/342 (45%), Gaps = 54/342 (15%)

Query: 130 DAAGNNGL-AEFNQDFSSRILISYRKGFDPIGDSK-----------------------IT 165
           D+  N G  + F  DF +R+ I+YR  F  I  S+                        +
Sbjct: 94  DSDANGGWPSPFLDDFEARVWITYRSNFAAIPKSQDPNATTAMSFSVRFRNQISNQGGFS 153

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           SD GWGCM+RS Q L+A AL   RLGR WR+      +R    IL LF D   +PFSIH 
Sbjct: 154 SDTGWGCMIRSGQSLLANALQVLRLGRAWRRGQDSQGERR---ILSLFADDPKAPFSIHR 210

Query: 226 LLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            ++ G  A G   G W GP A  R  +AL+         G +   + +Y+     D    
Sbjct: 211 FVEHGAVACGKHPGEWFGPSATARCIQALSN--------GYEDAGLRVYITGDGSD---- 258

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
                  +D+     V       + P L+LV + LG+++V P Y   L+ +    QS+GI
Sbjct: 259 -----VYEDS--FMKVAKDANNTFHPTLVLVGIRLGIDRVTPVYWEALKASLQLSQSIGI 311

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDS 401
            GG+P AS Y VG Q     YLDPH  +P + +     D  + D  + H+  +R +H+  
Sbjct: 312 AGGRPSASHYFVGTQGSYFFYLDPHTTRPFLPLHSDLSDYTQEDIDSCHTRRLRRLHVKE 371

Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
           +DPS+ I F  RD+ D+ ++     K   E +G P+  V  +
Sbjct: 372 MDPSMLIAFLIRDETDWQNW----RKAVAEVHGKPVIHVADS 409


>gi|308490628|ref|XP_003107506.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
 gi|308251874|gb|EFO95826.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
          Length = 478

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 174/386 (45%), Gaps = 82/386 (21%)

Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
           +GL    +  +SR+  +YR+ F PIG +  ++D GWGCMLR +QML+ + LL   +GR +
Sbjct: 47  DGLEAMKKYMTSRLWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLRRHIGRHF 106

Query: 195 RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR------ 248
              ++K     Y +IL +F D + + +SIH + Q G   G     W GP    +      
Sbjct: 107 EWDIEKT-SEVYDKILQMFFDEKDALYSIHQIAQMGVTEGKKVSEWFGPNTAAQVIKKLT 165

Query: 249 ---SWEALARCQRAETGLGCQ-SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSV---- 300
               W  +A     +  L  + +L MA    S +       + +  + +  ++ ++    
Sbjct: 166 IFDDWSNIAVHVALDNILVKEDALTMATTYPSDN------ASYIFAVHNFLKYFTLNLTF 219

Query: 301 --FSK-GQ-----------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
             F++ GQ            DW P+L+++PL LGL  +NP Y+P ++  F  PQ +GI+G
Sbjct: 220 PNFAENGQIEKPRPSSGCTTDWRPLLVMIPLRLGLTSINPCYLPAIQKFFELPQCVGIIG 279

Query: 347 GKPGASTYIVGVQEESAIYLDPH-----------------------------DVQPVINI 377
           GKP  + Y VG+      YLDPH                             D+Q  I+ 
Sbjct: 280 GKPNLAHYFVGIAGTKLFYLDPHHCRAKTTKRDAGVTTNTMISSITTTDAQLDIQNQIDD 339

Query: 378 GK----DDLE------------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
                 +DLE             D STYH  +++ +  +SIDPSLA+  +C  + DFD  
Sbjct: 340 SDFHKLEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEYESIDPSLALALFCETRQDFDTL 399

Query: 422 CARASKLAEESNGAPLFTVTQTHKKP 447
           C    K    S+  P+F   +  K+P
Sbjct: 400 CEELQKTTLPSSVPPMFEFLE--KRP 423


>gi|53132082|emb|CAG31871.1| hypothetical protein RCJMB04_12m14 [Gallus gallus]
          Length = 343

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 100/303 (33%), Positives = 139/303 (45%), Gaps = 48/303 (15%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 198
           E   D +SR+  +YRK F  IG +  TSD GWGCMLR  QM+ AQAL+   LGR WR   
Sbjct: 40  EILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWIK 99

Query: 199 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR---------S 249
            K     Y  +L+ F D + S +SIH + Q G   G + G W GP  + +         +
Sbjct: 100 GKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFDT 159

Query: 250 WEALA----------------RCQRAETGLGCQSLPM----AIYVVSGDEDGERGGAPVV 289
           W +LA                 CQ   +  G  + P      +Y    +E G R    + 
Sbjct: 160 WSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPAVEADVLYNGYPEEAGVRDKLSL- 218

Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
                             W P++LL+PL LGL ++N  YI TL+  F  PQSLG++GGKP
Sbjct: 219 ------------------WKPLVLLIPLRLGLTEINEAYIETLKHCFMMPQSLGVIGGKP 260

Query: 350 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 409
            ++ Y +G   E  IYLDPH  QP +         D S +       + +  +DPS+A+ 
Sbjct: 261 NSAHYFIGYVGEELIYLDPHTTQPAVEPSDSGCLPDESFHCQHPPCRMSIAELDPSIAVV 320

Query: 410 FYC 412
             C
Sbjct: 321 CSC 323


>gi|297265289|ref|XP_002799164.1| PREDICTED: cysteine protease ATG4B-like [Macaca mulatta]
          Length = 358

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 109/340 (32%), Positives = 165/340 (48%), Gaps = 44/340 (12%)

Query: 137 LAEFNQDF---SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
            AE+ +DF   S  + I  RK     G +  TSD GWGCMLR  QM+ AQAL+   LGR 
Sbjct: 13  FAEY-EDFPETSEPVWILGRKYSIFTGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRD 71

Query: 194 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 253
           WR   +K     Y  +L+ F D + S +SIH + Q G   G + G W GP  + +  + L
Sbjct: 72  WRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKL 131

Query: 254 ARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAPVVCID------DASRHCSVFS 302
           A      +        +A+++     V  +E        V C        D+ RHC+ F 
Sbjct: 132 AVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFP 183

Query: 303 KGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
            G       + W P++LL+PL LGL  +N  Y+ TL+  F  PQSLG++GGKP ++ Y +
Sbjct: 184 AGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFI 243

Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP--SLAIGFYCRD 414
           G   ES+ +  P        +G   L A       + + H   + ++P  S A+GF+C+ 
Sbjct: 244 GYVGESSSHRVP--------VGLCPLRA-----FCEQVPHARCNIVEPEGSRALGFFCKT 290

Query: 415 KDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVL 454
           +DDF+D+C +  KL+      P+F + +     +   DVL
Sbjct: 291 EDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVL 330


>gi|410921904|ref|XP_003974423.1| PREDICTED: cysteine protease ATG4C-like [Takifugu rubripes]
          Length = 468

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 115/408 (28%), Positives = 173/408 (42%), Gaps = 71/408 (17%)

Query: 108 SSTSDIWLLGVCHKI------AQDEALGDAAGNNGLA----EFNQDFSSRILISYRKGFD 157
           S  S + LLG C+         Q EA  +A+   G+     +F +DF SRI ++YR+ F 
Sbjct: 29  SRNSPVLLLGKCYHFKAEEDEGQTEACREASDEEGVMGNVEDFRRDFGSRIWLTYREEFP 88

Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR--PWRKPLQ-KPFDRE--------- 205
           P+  S +TSD GWGCMLR+ QM++AQALL H LGR   W   +  +P D E         
Sbjct: 89  PLPGSSLTSDCGWGCMLRAGQMMLAQALLLHFLGRDWTWSGAMSLQPLDTETWTTSAAKR 148

Query: 206 ----------------------------------------YVEILHLFGDSETSPFSIHN 225
                                                   +  ++  FGDS ++ F +H 
Sbjct: 149 LVASLESSLQASPGPSDPVVSQRQVAGSGEEAGVHTDGGFHRTLVSWFGDSPSAQFGLHR 208

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS-LPMAIYVVSGDEDGERG 284
           +++ G A G  AG W GP  +    +      R     G  S +     V S D      
Sbjct: 209 MVRLGLAMGKRAGEWYGPAVVAHILKKAVEEARDPCLAGISSYVSQDCTVYSADVIDSHK 268

Query: 285 GAPVVCID----DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
            +     +     +S H S  +    D   +++LVP+ LG EK NP Y    +   +   
Sbjct: 269 ASASAAAERPDVTSSSHNSQPASASPDSRAVIILVPVRLGGEKTNPDYFNLAKSFLSLDY 328

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
            +GI+GGKP  + Y VG Q++S IY+DPH  Q  +++   D      ++H    + +   
Sbjct: 329 CIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSTSDFP--LQSFHCPSPKKMPFT 386

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTHKK 446
            +DPS   GFY R   DF+      ++L + S     P F   Q H +
Sbjct: 387 KMDPSCTFGFYSRSAQDFERIKHELTELLQPSAKEKYPAFIFVQGHGR 434


>gi|62899783|sp|Q86ZL5.1|ATG4_PODAS RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|27802993|emb|CAD60696.1| unnamed protein product [Podospora anserina]
          Length = 500

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 64/310 (20%)

Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
           F  DF SRI ++YR GF+ I                      GD +  +SD GWGCM+RS
Sbjct: 173 FLDDFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRS 232

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
            Q L+A A+L  R GR WR+      +RE   I+ LF D   +P+SI N +  G A  G 
Sbjct: 233 GQSLLANAMLISRAGRAWRRTTNPDIERE---IVCLFADDPRAPYSIQNFVNHGAAACGK 289

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI---YVVSGDEDGERGGAPVVCID 292
             G W GP        A ARC  +      + LP      ++ + + DG           
Sbjct: 290 YPGEWFGP-------SATARCIHSLRVYLTRDLPEVYEDNFMSTANPDGNH--------- 333

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
                          + P L+LV   LG++K+NP Y   L  T   PQ++GI GG+P +S
Sbjct: 334 ---------------FHPTLILVSTRLGIDKINPIYHEALISTLQLPQAIGIAGGRPSSS 378

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGK---DDLEADTSTYHSDVIRHIHLDSIDPSLAIG 409
            Y +G Q +   YLDPH  +P +   +   D    +  + H+  +RH+H++ +DPS+ IG
Sbjct: 379 HYFIGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSCHTRRLRHLHVEDMDPSMLIG 438

Query: 410 FYCRDKDDFD 419
           F  +D+DD+D
Sbjct: 439 FLIKDEDDWD 448


>gi|402080175|gb|EJT75320.1| cysteine protease ATG4 [Gaeumannomyces graminis var. tritici
           R3-111a-1]
          Length = 468

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 105/313 (33%), Positives = 145/313 (46%), Gaps = 56/313 (17%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
           F  DF SRI +SYR GF PI  S                         T+D GWGCM+R+
Sbjct: 128 FLDDFESRIWVSYRSGFPPIPRSTDPAATSRMSFAMRLKTMTDQQAAFTTDSGWGCMIRT 187

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
            Q L+A  LL HRLGR WR+  +   +R+   +L LF D   +P+SIH  ++ G A  G 
Sbjct: 188 GQSLLANTLLSHRLGRGWRRGEKSDEERK---LLSLFADDPRAPYSIHKFVEHGAAKCGK 244

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  EALA               + +Y          G  P V  D   
Sbjct: 245 YPGEWFGPSATARCIEALANTNEKT---------LRVYST--------GDLPDVYEDS-- 285

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
               V       + P L+LV   LG++K+N  Y  +L  T   PQS+GI GG+P +S Y 
Sbjct: 286 -FMEVARPDGKTFHPTLILVSTRLGIDKINQVYWESLTATLQMPQSVGIAGGRPSSSHYF 344

Query: 356 VGVQE------ESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 406
           VG Q        +  YLDPH  +P +    D      +D  + H+  +R +H+  +DPS+
Sbjct: 345 VGAQRSDEDQGSNLFYLDPHHTRPALPYFDDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 404

Query: 407 AIGFYCRDKDDFD 419
            IGF   D++D++
Sbjct: 405 LIGFLITDEEDWE 417


>gi|119195519|ref|XP_001248363.1| cysteine protease atg4 [Coccidioides immitis RS]
 gi|303321428|ref|XP_003070708.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
           SOWgp]
 gi|121769827|sp|Q1E5M9.1|ATG4_COCIM RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|240110405|gb|EER28563.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
           SOWgp]
 gi|320040173|gb|EFW22106.1| cysteine protease atg4 [Coccidioides posadasii str. Silveira]
 gi|392862420|gb|EAS36938.2| cysteine protease atg4 [Coccidioides immitis RS]
          Length = 432

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 100/297 (33%), Positives = 138/297 (46%), Gaps = 50/297 (16%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
           F  DF S+   +YR  F  I  S+                        T+D GWGCM+RS
Sbjct: 105 FLDDFESKFWFTYRSNFPAIPKSRDPDTPLALTLSVRLRSQFLDTHGFTADTGWGCMIRS 164

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
            Q L+A AL    LGR WR+  +    +E  E+L LF D+  +PFSIH  +  G  A G 
Sbjct: 165 GQSLLANALSILNLGRDWRRGSKI---KEECELLSLFADNPQAPFSIHRFVDYGASACGK 221

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  EAL+          C+   + +YV+S   D        +   D  
Sbjct: 222 HPGEWFGPSATARCIEALSN--------ECKHTDLNVYVMSDGSDVHEDQFRQIAGPDGI 273

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
           R             P L+L+ + LG+E V P Y   LR    +PQS+GI GG+P +S Y 
Sbjct: 274 R-------------PTLILLGVRLGIESVTPVYWEALRAIIRYPQSVGIAGGRPSSSLYF 320

Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDL--EADTSTYHSDVIRHIHLDSIDPSLAIGF 410
           +GVQ     YLDPH  +P ++   D      +  TYH+  +R +H+  +DPS+ IGF
Sbjct: 321 IGVQGPYFFYLDPHHTRPAVSWNPDSTLSPENLDTYHTRRLRRLHIREMDPSMLIGF 377


>gi|157818033|ref|NP_001101418.1| cysteine protease ATG4C [Rattus norvegicus]
 gi|149044549|gb|EDL97808.1| similar to APG4 autophagy 4 homolog C [Rattus norvegicus]
          Length = 458

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 115/409 (28%), Positives = 172/409 (42%), Gaps = 80/409 (19%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE          A+ D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDESKVLPARSGCAIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG------------------------- 191
             I  S +T+D GWGC LR+ QML+AQ L+ H LG                         
Sbjct: 96  PQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALHIESSDSDSWTSNTIH 155

Query: 192 -------------RPWRKPL--------QKPFDRE------YVEILHLFGDSETSPFSIH 224
                        R  R P         + P D        + +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELRTPAVSLKETSGKHPDDHAVQSEIYHRQIISWFGDSPVAVFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 RLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----LTIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +     + G A    +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNSDVIDKQTDSVTAGDARDKAVIILVPVRLGGERTNIDYLEFVKGVLSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +    +DP
Sbjct: 323 IGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFRKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNHS 451
           S  IGFYCR+  DF+      +K+ + S+    PLFT    H +  + +
Sbjct: 381 SCTIGFYCRNVQDFERASEEITKMLKISSKEKYPLFTFVNGHSRDFDFT 429


>gi|30109219|gb|AAH41862.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
 gi|119623096|gb|EAX02691.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
 gi|119623098|gb|EAX02693.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
          Length = 321

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 92/291 (31%), Positives = 147/291 (50%), Gaps = 32/291 (10%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 1   MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60

Query: 233 YGLAAGSWVGP---------YAMCRSWEALARCQRAETGL-------GCQSLPMAIYVVS 276
            G + G W GP          A+   W +LA     +  +        C+ LP+     S
Sbjct: 61  EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPL-----S 115

Query: 277 GDEDGERGGAPVVCIDDA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
            D  G+R    +   + +   S +CS        W P+LL+VPL LG+ ++NP Y+   +
Sbjct: 116 ADTAGDRPPDSLTASNQSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFK 168

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393
             F  PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++    D + +    
Sbjct: 169 ECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQS 228

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
            + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 229 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 278


>gi|74665877|sp|Q4U3V5.1|ATG4_CRYPA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|66576169|gb|AAY51673.1| putative cysteine protease Atg4 [Cryphonectria parasitica]
          Length = 459

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 109/332 (32%), Positives = 153/332 (46%), Gaps = 60/332 (18%)

Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 163
           +A DE L DA        F  DF SR+ ++YR  F+PI  S                   
Sbjct: 109 LAYDELLEDAGWP---IAFLDDFESRVWMTYRSEFEPISKSNDPRASAALSFAMRLRTLA 165

Query: 164 ----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 219
                +SD GWGCM+RS Q L+A  L+  +LGR WR+       R+  EIL  F D   +
Sbjct: 166 DQGGFSSDTGWGCMIRSGQSLLANTLVICQLGRDWRRGKAA---RQEREILARFADDPRA 222

Query: 220 PFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD 278
           P+S+HN ++ G  A G   G W GP A  R  +ALA    +          + +Y     
Sbjct: 223 PYSLHNFVRHGAVACGKFPGEWFGPSATARCIQALANSNESS---------LRVYST--- 270

Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
                G  P V  D      +V       + P L+LV   LG++K+N  Y   L  T   
Sbjct: 271 -----GDLPDVYEDS---FMAVAKPDGETFHPTLILVGTRLGIDKINQVYWEALTATLQM 322

Query: 339 PQSLGIVGGKPGASTYIVGVQEES--------AIYLDPHDVQPVINIGKDDLEA---DTS 387
           PQS+GI GG+P AS Y +G Q             YLDPH  +P +   +D  +    D +
Sbjct: 323 PQSVGIAGGRPSASHYFIGAQRSGDAYEPGSYLFYLDPHCTRPALPFHEDVDQYTSDDIN 382

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
           T H+  +R +H+  +DPS+ IGF  +D+DD+D
Sbjct: 383 TCHTRRLRRLHVRDMDPSMLIGFLIKDEDDWD 414


>gi|170036509|ref|XP_001846106.1| Autophagy-specific protein [Culex quinquefasciatus]
 gi|167879174|gb|EDS42557.1| Autophagy-specific protein [Culex quinquefasciatus]
          Length = 379

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 115/336 (34%), Positives = 172/336 (51%), Gaps = 30/336 (8%)

Query: 120 HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
           H+I     L +A     L +  +D  SR+  +YR+GF PIG S+ TSD GWGCMLR  QM
Sbjct: 13  HRIRCIFGLSNALETLDLDQIRRDVQSRLWCTYRRGFVPIGGSQHTSDKGWGCMLRCGQM 72

Query: 180 LVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAG 238
           ++AQALL   LGR W    +   D  Y+ I++ F D++ +PFS+H +   G++      G
Sbjct: 73  VLAQALLQLHLGRDWEWTAETR-DETYLRIVNRFEDNKAAPFSLHQIALTGESSEEKRVG 131

Query: 239 SWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHC 298
            W GP  + +  + L +              + ++V              +  D+    C
Sbjct: 132 EWFGPNTVAQVLKKLVKFD--------DWCSVVVHVALD---------STLATDEVVELC 174

Query: 299 SVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
              S     W P+LL++PL LGL ++NP Y+  L+  F    + G++GG+P  + Y +G 
Sbjct: 175 EDKSDAGTSWKPLLLIIPLRLGLSEINPIYVAGLKKCFELAGNCGMIGGRPNQALYFIGY 234

Query: 359 QEESAIYLDPHDVQPVINIGK----DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
             + A++LDPH VQ   NIG     D+ E D S +H    R I+  ++DPSLA+ F C  
Sbjct: 235 VGDEALFLDPHTVQRSGNIGDKTGLDEREMDES-FHQRYARRINFKAMDPSLALCFLCAT 293

Query: 415 KDDFDDFCARASKLAEESNGAP---LFTVTQTHKKP 447
           + +FDD  AR    AE+ NG     LF VT+T + P
Sbjct: 294 RTEFDDLLAR---FAEDLNGGSCQGLFEVTKTRQAP 326


>gi|342877133|gb|EGU78640.1| hypothetical protein FOXB_10826 [Fusarium oxysporum Fo5176]
          Length = 449

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 111/327 (33%), Positives = 155/327 (47%), Gaps = 53/327 (16%)

Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 159
           +A D+   D    +G   F  DF SRI ++YR  FDPI                      
Sbjct: 99  LAYDDQSNDGGWPSG---FITDFESRIWMTYRSEFDPIPRSTNPQATSSLSLSMRLKSQL 155

Query: 160 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
           GD S  +SD GWGCM+RS Q L+A  +   RLGR WR   Q     E   IL  F D   
Sbjct: 156 GDQSPFSSDSGWGCMIRSGQSLLANTIALVRLGRDWR---QGQSLEEECRILKDFADDPR 212

Query: 219 SPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
           +P+SIH+ ++ G  A G   G W GP A  R  +ALA                +I V S 
Sbjct: 213 APYSIHSFVRHGASACGKYPGEWFGPSATARCIQALANSHEP-----------SIRVYST 261

Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
                 G  P V  DD  +  +    G+A + P L+LV   LGL+K+ P Y   L     
Sbjct: 262 ------GDGPDVYEDDFMKIAN--PTGEA-FHPTLVLVGTRLGLDKITPVYWEALIAALQ 312

Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVI 394
            PQS+GI GG+P +S Y +G Q     YLDPH  +P +   ++ ++    +  + H+  +
Sbjct: 313 MPQSVGIAGGRPSSSHYFIGSQGSFLFYLDPHHTRPALPYHENPMDYTSEEIESCHTARL 372

Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           R IH+  +DPS+ IGF  R ++D+ D+
Sbjct: 373 RRIHVREMDPSMLIGFLIRSEEDWQDW 399


>gi|440638438|gb|ELR08357.1| hypothetical protein GMDG_03152 [Geomyces destructans 20631-21]
          Length = 448

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 101/307 (32%), Positives = 148/307 (48%), Gaps = 49/307 (15%)

Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKIT----------------SDVGWGCMLRS 176
           F  DF S++  SYR GF       DP   S ++                SD GWGCM+RS
Sbjct: 108 FLDDFESKLRFSYRTGFPVIPRSEDPKASSTMSFSVRLRSQLSDQGGFSSDTGWGCMIRS 167

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
            Q L+A +++  RL R WR+ + +  +RE   I+ LF D   +P+SIH  ++ G +A G 
Sbjct: 168 GQSLLANSMVILRLSRGWRRGVGRDKERE---IVSLFADDPRAPYSIHKFVEHGAEACGK 224

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  + LA+          +S  + +Y+     D  + G          
Sbjct: 225 YPGQWFGPSATARCIQELAKRH--------ESADVRVYITGDGSDVYKDG---------- 266

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
              SV      ++ P L+LV   LG++KV P Y   L+ +   PQS+GI GG+P +S Y 
Sbjct: 267 -FMSVAKPDGVNFKPTLILVGTRLGIDKVTPVYWEALKASLQMPQSVGIAGGRPSSSHYF 325

Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           VGVQ     YLDPH     I    D  E   A+  + H+  +R + +  +DPS+ IGF  
Sbjct: 326 VGVQGSHFFYLDPHQTMAAIPFHTDVDEYTPAEIDSCHTRRLRRLDIKEMDPSMLIGFLI 385

Query: 413 RDKDDFD 419
           RD+ D++
Sbjct: 386 RDEKDWE 392


>gi|50543736|ref|XP_500034.1| YALI0A13277p [Yarrowia lipolytica]
 gi|62899740|sp|Q6CH28.1|ATG4_YARLI RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|49645899|emb|CAG83963.1| YALI0A13277p [Yarrowia lipolytica CLIB122]
          Length = 545

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 124/393 (31%), Positives = 167/393 (42%), Gaps = 98/393 (24%)

Query: 139 EFNQDFSSRILISYRKGF--------------------------DPIGDSKITSDVGWGC 172
           +F  D  SRI +SYR GF                          DP G    TSDVGWGC
Sbjct: 64  DFLADVQSRIWLSYRTGFPLIPKSDGSGTIHLGKLKNMIRGGGFDPRG---YTSDVGWGC 120

Query: 173 MLRSSQMLVAQALLFHRLGRPWR----------------------------KPLQKPFDR 204
           M+R+SQ L+A ALLF  LGR WR                            K  +     
Sbjct: 121 MIRTSQSLLANALLFRHLGRGWRWNKGDDFVYLSEGNTESRGGESRNGGANKEQETAVSE 180

Query: 205 EYV----EILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRA 259
           E       I+  F DS  SPFSIH  ++ G KA    AG W GP A   S  AL      
Sbjct: 181 ETAVSEETIISWFLDSPDSPFSIHKFVRHGEKACSTPAGDWFGPSAAGSSIYAL------ 234

Query: 260 ETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
                C   P   + +Y      +G  GG   V  D+      +   G     P+L+L  
Sbjct: 235 -----CNEFPDSGLKVYY-----NGNGGGD--VYEDE------LLETG----FPLLVLCG 272

Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
           L LG++ VNP Y  +LR   + PQS+GI GG+P  S Y  G Q E   YLDPH  +P + 
Sbjct: 273 LRLGIDNVNPIYWDSLRQMLSLPQSVGIAGGRPFTSHYFFGFQGEQLFYLDPHQPKPAVK 332

Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP 436
                 + DT+++HS  I  +HL  +DPS+ +GFY   + D++ F    +   E+++   
Sbjct: 333 T----TDKDTTSFHSSRIWKLHLKEMDPSMLVGFYITSEADWETFKGSLTASKEKTSSQI 388

Query: 437 LFTVTQTHKKP-VNHSDVLGETGGVPEDDSLGV 468
           +      H  P  +  D     GG  +DD + V
Sbjct: 389 VHIHPSRHNIPSFDEEDEYVSIGGASDDDFVDV 421


>gi|158296556|ref|XP_316946.4| AGAP008497-PA [Anopheles gambiae str. PEST]
 gi|157014766|gb|EAA12240.4| AGAP008497-PA [Anopheles gambiae str. PEST]
          Length = 389

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 117/346 (33%), Positives = 170/346 (49%), Gaps = 34/346 (9%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I  +   +W+LG  +  + D           L    QD  SR+  +YR+GF PIG++++T
Sbjct: 21  IPKTNDTVWILGKQYNASDD-----------LEAIRQDVQSRLWCTYRRGFVPIGNTQLT 69

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           +D GWGCMLR  QM++AQALL   LGR W    +   D  Y+ I++ F DS+ +PFS+H 
Sbjct: 70  TDKGWGCMLRCGQMVLAQALLQLHLGRDWVWEAETR-DDIYLNIVNRFEDSKQAPFSLHQ 128

Query: 226 L-LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
           + L    +     G W GP  + +  + L +         C+   + I+V   +      
Sbjct: 129 IALMGDSSEEKRIGEWFGPNTVAQVLKKLVKFDD-----WCR---LVIHVALDN------ 174

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
               V  D+    C V  K    W P+LL++PL LGL +VNP YI  L+  F  P S G+
Sbjct: 175 ---TVATDEIVELC-VDKKEPEAWKPLLLIIPLRLGLSEVNPIYIEGLKKCFQLPGSCGM 230

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDS 401
           +GG+P  + Y +G     A+YLDPH VQ V  +G     A+     T+H      I   S
Sbjct: 231 IGGRPNQALYFIGYVGGEALYLDPHTVQRVGTVGSKQDPAEQELDETFHQRYASRISFTS 290

Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
           +DPSLA+ F C  +  FD   AR +          LF VT+T + P
Sbjct: 291 MDPSLAVCFLCVSRQQFDQLVARFNDSVNGGTSQALFEVTKTRQAP 336


>gi|332029697|gb|EGI69576.1| Cysteine protease ATG4B [Acromyrmex echinatior]
          Length = 383

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 108/353 (30%), Positives = 168/353 (47%), Gaps = 48/353 (13%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
           I  +   +W+LG  +   ++           L    +D  S++  +YRKGF PIG  +S 
Sbjct: 16  IPPTDEPVWILGRKYNAIKE-----------LDAIRRDIRSKLWFTYRKGFVPIGGCNST 64

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
            TSD GWGCMLR  QM++AQAL+   LG+ W+  + +  +  Y++IL  F D   + FSI
Sbjct: 65  FTSDKGWGCMLRCGQMVLAQALITLHLGKDWQW-MPETKNNTYLKILRRFEDKRAAAFSI 123

Query: 224 HNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           H +   G + G   G W GP  + +  + L       +        + I+V   +     
Sbjct: 124 HQIALMGASEGKEVGQWFGPNTIAQVLKKLIVYDEWSS--------LTIHVALDN----- 170

Query: 284 GGAPVVCIDDASRHCSVFS------------KGQADWTPILLLVPLVLGLEKVNPRYIPT 331
                + ++D  R C V              +  + W P+LLL+PL LGL ++NP YI  
Sbjct: 171 ----TLIVNDILRQCRVEGGVTAEADGEIPLRAPSQWKPLLLLIPLRLGLSEINPVYING 226

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV----INIGKDDLEADTS 387
           L+ +F   QSLG++GGKP  + Y +G   +  IYLDPH  Q        I ++++E D S
Sbjct: 227 LKTSFKISQSLGVIGGKPNLALYFIGCVGDEVIYLDPHTTQKSGSIEDKISEEEIEMDIS 286

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
            YH      I +  +DPS+A+ F+C  + +F   C    +        PLF +
Sbjct: 287 -YHCKSASRIPITGMDPSVALCFFCATEKEFKSLCKSMQEELILPEKQPLFEL 338


>gi|403356037|gb|EJY77606.1| Cysteine protease family C54 putative [Oxytricha trifallax]
 gi|403376523|gb|EJY88241.1| Cysteine protease family C54 putative [Oxytricha trifallax]
          Length = 480

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 159/312 (50%), Gaps = 41/312 (13%)

Query: 144 FSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFH----RLGRPWRKPL 198
           F S    +YR   + PIG S   SD GWGCM+R+ QML+ QA++ H     L   + + +
Sbjct: 154 FKSVTWFTYRNELELPIGSSTYHSDAGWGCMVRTGQMLLFQAMMRHVFEDNLKYEYIEKI 213

Query: 199 QKPFDREYVEILHLF---GDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
            + +  EY+ +L LF   G+ + SP+SI N+   G       G W GP A+    + L +
Sbjct: 214 TE-YREEYLNLLRLFQDNGEGQFSPYSIQNIAFQGLKIDRKPGDWYGPQAISIVLKRLTK 272

Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP-ILLL 314
             +          P+  + +             VC++  + + +V  +   DWT  + ++
Sbjct: 273 IYK----------PVKQFTM------------YVCLE-GNIYLNVIQEKSKDWTQSVFIV 309

Query: 315 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES--AIYLDPHDVQ 372
           +PL LGL  + P Y+ +++  FTFPQ++GI GG+  ++ Y +G+ + S   IYLDPH VQ
Sbjct: 310 IPLRLGLNYIEPEYLSSVKKVFTFPQNVGIAGGRENSALYFIGISDSSNNLIYLDPHLVQ 369

Query: 373 ---PVINIGKDD-LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
              P  N+  ++      S++H    + + L+ +  S+AIGFY RD +DF DF  R   L
Sbjct: 370 KSVPTCNMQTNEQFYQYESSFHCTKFKKMPLNRMCTSVAIGFYIRDYNDFLDFQTRIKSL 429

Query: 429 AEESNGAPLFTV 440
           +   N   +FTV
Sbjct: 430 SSGENS--IFTV 439


>gi|358336800|dbj|GAA27956.2| autophagy-related protein 4 [Clonorchis sinensis]
          Length = 507

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 105/304 (34%), Positives = 148/304 (48%), Gaps = 52/304 (17%)

Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-----KPLQKPFDREYVEILHLFGD--SE 217
           TSD GWGCM+RS QML+AQ L+ H LGR WR      P++ P D  + +++  F D  S+
Sbjct: 183 TSDSGWGCMIRSGQMLLAQTLMIHLLGRDWRAFRGTSPIKTPEDHLHRQLIRWFHDCWSQ 242

Query: 218 TSPFSIHNLLQAGKAYGLAAGSWVGPYAMC-----------RSWEALARCQ--------- 257
            SPFS+H L+QA    G   GSW GP  +C           R +E LAR           
Sbjct: 243 ESPFSLHRLVQAS---GQLPGSWFGPATLCSALVKVMSDASRRFEELARVHIYWVRDRVI 299

Query: 258 -RAET-----GLGCQSLPMAIYVVSGDEDGERGGA-------PVVCIDD---ASRHCSVF 301
            R E      G   +  P  +      E+ +   +       P   + D   +S   ++F
Sbjct: 300 YREEIMNLARGQPVRRKPGRLNFTDFSENFQHCCSQECSPPIPPTYLQDGIQSSPSTTLF 359

Query: 302 SKGQADWTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE 360
                    ++LL+P+ LGL+K ++ RY+P +      P  +GI+GG+P  S YI+G Q 
Sbjct: 360 PSHA-----VILLLPIRLGLDKRIDARYVPMVCRLVRDPCFVGIIGGRPRHSIYILGCQN 414

Query: 361 ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
              I+LDPH  QPV+    D  E +  T+H  V R I    +DPS A+GFYCR + D  D
Sbjct: 415 TQLIHLDPHFTQPVVRNVVDSEEFNVKTWHCLVPRVIEAAKLDPSCAVGFYCRSRGDLSD 474

Query: 421 FCAR 424
              R
Sbjct: 475 LLER 478


>gi|313228003|emb|CBY23152.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 102/289 (35%), Positives = 143/289 (49%), Gaps = 29/289 (10%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
           L +   DF SR+  +YR+ F  IG S  TSD GWGCMLR+ QMLVA+ LL  RLGR +  
Sbjct: 39  LEDIQGDFQSRLWFTYRRNFASIGGSGPTSDQGWGCMLRAGQMLVAECLLRQRLGRNYVW 98

Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
                 D  Y EIL LF D+ ++  S+  + L    A   A G W GP  M    + L R
Sbjct: 99  SESSIEDERYTEILELFRDTHSAELSLQQIALTGATAEKRAVGEWFGPNTMA---QVLKR 155

Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
             ++      +SL   + V             VV ++D S    + + G+   TP++L++
Sbjct: 156 ITKS------RSLGFGVTVAMDS---------VVSVEDVS--AEIINGGKP--TPLVLMI 196

Query: 316 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA----IYLDPHDV 371
           PL LGL  VN  Y+  L++       +GI+GGKP  + Y VG QE       +YLDPH  
Sbjct: 197 PLRLGLNSVNEIYVNPLKIFLASKYCVGIMGGKPNQAHYFVGYQETVEDTWLLYLDPHTT 256

Query: 372 Q--PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
           Q  PV        E    + H+D +  I    +DPSLA+GF+    ++F
Sbjct: 257 QQSPVSVNNNMPFEQFDKSLHTDKLCWIKALKLDPSLAVGFFFNTVEEF 305


>gi|355755452|gb|EHH59199.1| Cysteine protease ATG4D, partial [Macaca fascicularis]
          Length = 427

 Score =  154 bits (390), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 106/387 (27%), Positives = 172/387 (44%), Gaps = 68/387 (17%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 37  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 86

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 87  GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPAR 146

Query: 195 ----RKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMC 247
               R     P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP    
Sbjct: 147 WMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP---- 202

Query: 248 RSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD 307
                +A   R       +   + +YV       +   A +V   D +          A+
Sbjct: 203 ---SLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AE 249

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G          
Sbjct: 250 WKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGXXXXXXXXXX 309

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
               QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 310 XXXCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 367

Query: 428 LAEESNGA---PLFTVTQTHKKPVNHS 451
           +   S+     P+FT+ + H +  +HS
Sbjct: 368 VLGSSSATERYPMFTLAEGHAQ--DHS 392


>gi|56118282|ref|NP_001007883.1| cysteine protease ATG4C [Xenopus (Silurana) tropicalis]
 gi|61211764|sp|Q68EP9.1|ATG4C_XENTR RecName: Full=Cysteine protease ATG4C; AltName:
           Full=Autophagy-related protein 4 homolog C
 gi|51258902|gb|AAH80152.1| apg4c protein [Xenopus (Silurana) tropicalis]
 gi|89269108|emb|CAJ81923.1| APG4 autophagy 4 homolog C (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
          Length = 450

 Score =  154 bits (390), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 110/402 (27%), Positives = 168/402 (41%), Gaps = 95/402 (23%)

Query: 111 SDIWLLGVCHKIAQDEA--LGDAAGNNG----------LAEFNQDFSSRILISYRKGFDP 158
           S ++LLG C+    +++    D   N+G          + EF +DF SRI ++YR+ F  
Sbjct: 39  SPVFLLGKCYHFKYEDSSVTSDGGSNSGSESKEDLSGNVDEFRKDFISRIWLTYREEFPQ 98

Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW------------------------ 194
           I  S  T+D GWGC LR+ QML+AQ L+ H LGR W                        
Sbjct: 99  IETSSWTTDCGWGCTLRTGQMLLAQGLIVHFLGRDWTWTEALDIFSSESEFWTANTARKL 158

Query: 195 ------------------RKPLQKPFDREYVEILH-----LFGDSETSPFSIHNLLQAGK 231
                             ++PL     +   E  H      F D   + F +H L++ GK
Sbjct: 159 TPSLETSFSENNECVSSNKQPLHNCDKKSNSEDFHQKIISWFADYPLAYFGLHQLVKLGK 218

Query: 232 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
             G  AG W GP  +      L R    E+                  D E  G  +   
Sbjct: 219 NSGKVAGDWYGPAVVSH----LLRKAIEESS-----------------DPELQGITIYVA 257

Query: 292 DDASRHCSVFSKGQADW-------TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
            D    C+++S    D          +++LVP+ LG E+ N  Y   ++   +    +GI
Sbjct: 258 QD----CTIYSADVYDLQCNKGTEKAVVILVPVRLGGERTNMEYFEFVKGILSLEFCIGI 313

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y VG Q++S IY+DPH  Q  +++   +   +  ++H    + +    +DP
Sbjct: 314 IGGKPKQSYYFVGFQDDSLIYMDPHYCQSFVDVSVKNFPLE--SFHCPSPKKMSFKKMDP 371

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEES--NGAPLFTVTQTH 444
           S  IGFYCR+  +F+      +K+ + S     PLFT    H
Sbjct: 372 SCTIGFYCRNAREFEKAAEELTKVLKSSTKQNYPLFTFVNGH 413


>gi|327270876|ref|XP_003220214.1| PREDICTED: cysteine protease ATG4C-like [Anolis carolinensis]
          Length = 459

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 118/437 (27%), Positives = 182/437 (41%), Gaps = 84/437 (19%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAA-GNN----------GLAEFNQDFSSRILISYRKGF 156
           S  S ++LLG C+    DE    +  G+N           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVFLLGKCYHFKTDEPTEQSPNGSNYDVTEEEVSRNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR--------------------K 196
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W                     K
Sbjct: 96  PQIKGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDWTWPDALVNENPESESWTSHTVK 155

Query: 197 PLQKPFDREYV--------------------------------EILHLFGDSETSPFSIH 224
            L   F+   +                                +I+  FGDS  + F +H
Sbjct: 156 KLTASFEASLIGEKEFKNQSIPPRQIRKRDWGKRESRDEHYHRKIVSWFGDSPLANFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ G   G  AG W GP  +      L R +  E     +   + +YV          
Sbjct: 216 RLIEYGNKSGKMAGDWYGPAVVAH----LLR-KAVEEAKDPELQGITVYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D     CS+    +     +++L+P+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYKSDVVEMQCSLKDSEKPGAKSVIILIPVRLGGERTNMEYLEFVKGILSLEYCIGI 322

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           VGG+P  S Y  G Q++S IY+DPH  Q  +++   +   +  ++H    + +    +DP
Sbjct: 323 VGGRPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKNFPLE--SFHCPSPKKMSFKKMDP 380

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKKPVNH--SDVLGETGGV 460
           S  IG YC +   F+      +K+ + S+    PLFT    H K  +   S V  E    
Sbjct: 381 SCTIGLYCPNVQGFERASEEITKILKASSKEKYPLFTFVNGHSKDYDFMMSPVQEEKALF 440

Query: 461 PEDDS--LGVMSMNDAV 475
            ED++  L   S  D V
Sbjct: 441 SEDENKKLKRFSTEDFV 457


>gi|357528776|sp|Q5B7L0.2|ATG4_EMENI RecName: Full=Cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|259485537|tpe|CBF82642.1| TPA: Cysteine protease atg4 (EC 3.4.22.-)(Autophagy-related protein
           4) [Source:UniProtKB/Swiss-Prot;Acc:Q5B7L0] [Aspergillus
           nidulans FGSC A4]
          Length = 402

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 120/390 (30%), Positives = 178/390 (45%), Gaps = 68/390 (17%)

Query: 92  RRIHERVLGPSRTGISSSTSDIWLLGV-----CHKIAQDEALGDAAGNN--------GLA 138
           +RI + +  P         S IW LG      C +   DE+     G          G  
Sbjct: 11  KRIIQYIWDPEPKNDEEPGSPIWCLGTRYPPQCVEETADESRNPDHGQQQNTNTSAPGWP 70

Query: 139 E-FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 174
           E F  DF S+I ++YR  F PI                            TSD GWGCM+
Sbjct: 71  EAFLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMI 130

Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 233
           RS Q L+A ++    LGR WR+  +     E  ++L LF DS  +PFSIH+ ++ G  + 
Sbjct: 131 RSGQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFC 187

Query: 234 GLAAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
           G   G W GP A  R  + LA R  ++          + +Y+   + D  +     V  D
Sbjct: 188 GKHPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRD 238

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           +         KG     P L+L+ L LG++++   Y   L+     PQS+GI GG+P AS
Sbjct: 239 E---------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSAS 287

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
            Y V VQ     YLDPH+ +P +   +     E + +TYH+  +R +++  +DPS+ IGF
Sbjct: 288 HYFVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDPSMLIGF 347

Query: 411 YCRDKDDFDDFCARASKLAEESNGAPLFTV 440
             RD+DD++D+ AR   L     G P+ T+
Sbjct: 348 LIRDEDDWEDWKARIMSL----EGKPIITI 373


>gi|195159572|ref|XP_002020652.1| GL15485 [Drosophila persimilis]
 gi|194117602|gb|EDW39645.1| GL15485 [Drosophila persimilis]
          Length = 409

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 108/345 (31%), Positives = 166/345 (48%), Gaps = 42/345 (12%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           +D GWGCMLR  QM++AQAL+   LGR W    +   D  Y++I++ F D   S +SIH 
Sbjct: 92  TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
           +   G++   A G W+GP  + +  + L         L      + ++V           
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMDS------- 195

Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
              V +DD    C    +G A W P+LL++PL LG+  +NP YIP L+       S G++
Sbjct: 196 --TVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 402
           GG+P  + Y +G  E+  +YLDPH  Q    +G+     +     TYH      +   ++
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQKTGVVGQKTSSGEQEHDETYHQKHAARLSFSAM 309

Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTH 444
           DPSLA+ F C+  D F        KL +E  G     LF ++QT 
Sbjct: 310 DPSLAVCFLCKTSDSFQQL---LDKLRQEVLGMCSPALFEISQTR 351


>gi|125986465|ref|XP_001356996.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
 gi|54645322|gb|EAL34062.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
          Length = 409

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 104/342 (30%), Positives = 163/342 (47%), Gaps = 36/342 (10%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I    +D+W+LG  +   Q+           L    +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPKRNTDVWVLGRRYNAIQE-----------LEVIRRDIQSRLWCTYRHGFMPLGEVQLT 91

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           +D GWGCMLR  QM++AQAL+   LGR W    +   D  Y++I++ F D   S +SIH 
Sbjct: 92  TDRGWGCMLRCGQMVLAQALIDLHLGRDWFWTPECQ-DATYLKIVNRFEDVRKSYYSIHQ 150

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
           +   G++   A G W+GP  + +  + L         L      + ++V           
Sbjct: 151 IALMGESQNKAVGEWLGPNTVAQILKKLV--------LFDDWCSLVVHVAMDS------- 195

Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
              V +DD    C    +G A W P+LL++PL LG+  +NP YIP L+       S G++
Sbjct: 196 --TVVLDDVYSLC---LEGDA-WKPLLLIIPLRLGISDINPIYIPALKRCLELDSSCGMI 249

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHIHLDSI 402
           GG+P  + Y +G  E+  +YLDPH  Q    +G+     +     TYH      +   ++
Sbjct: 250 GGRPNQALYFLGYVEDEVLYLDPHTTQRTGVVGQKTSSGEQEHDETYHQKHAARLSFSAM 309

Query: 403 DPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           DPSLA+ F C+  D F     +  +         LF ++QT 
Sbjct: 310 DPSLAVCFLCKTSDSFQQLLEKLRQEVLGMCSPALFEISQTR 351


>gi|341903727|gb|EGT59662.1| CBN-ATG-4.1 protein [Caenorhabditis brenneri]
          Length = 433

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 97/329 (29%), Positives = 150/329 (45%), Gaps = 59/329 (17%)

Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 224
           TSD GWGCMLR +QML+ + LL   +GR +   ++      Y +IL +F D + + +SIH
Sbjct: 49  TSDQGWGCMLRCAQMLLGEVLLRRHIGRHFEWDIETT-SVVYEKILQMFFDEKDALYSIH 107

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQ-SLPMAIYV 274
            + Q G   G     W GP    +          W  +A     +  L  + +L MA   
Sbjct: 108 QIAQMGVTEGKEISKWFGPNTAAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTY 167

Query: 275 VSGD------EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
            S D      E+G+             +H +  +  + +W P+LL++PL LGL  +N  Y
Sbjct: 168 PSEDAVKLIMENGQ-----------VEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCY 216

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV-------------- 374
           +P ++  F  PQ +GI+GGKP  + Y VG+      YLDPH  +P               
Sbjct: 217 LPAIQEFFKLPQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTES 276

Query: 375 ----INIGK-DDLE------------ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
                N  + +DLE             D STYH  +++ +  +SIDPSLA+  +C  ++D
Sbjct: 277 EQHDTNFSELEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESRED 336

Query: 418 FDDFCARASKLAEESNGAPLFTVTQTHKK 446
           FD+ C    K    ++  P+F   +   K
Sbjct: 337 FDNLCQELQKTTLPASKPPMFEFLEKRPK 365


>gi|346975631|gb|EGY19083.1| peptidase family C54 protein [Verticillium dahliae VdLs.17]
          Length = 449

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 102/327 (31%), Positives = 149/327 (45%), Gaps = 52/327 (15%)

Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK------------------ 163
           +A DEA+    G    + F  DF S+  ++YR  F+PI  S                   
Sbjct: 98  LAYDEAMNQDGG--WPSAFLDDFESKFWMTYRSDFEPIAKSTDPRAASVLSLSMRIKSQF 155

Query: 164 -----ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
                 +SD GWGCM+RS Q L+A A+    LGR WR+ +    +R+   +L  F D   
Sbjct: 156 MDQAGYSSDSGWGCMIRSGQSLLANAMAVLDLGRDWRRGVAAEKERQ---LLSKFADDPK 212

Query: 219 SPFSIHNLLQAGK-AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
           +P+SIH  +Q G  A G   G W GP A  R  +AL                + +Y    
Sbjct: 213 APYSIHRFVQHGAVACGKYPGEWFGPSATARCIQALVNANEPH---------LRVYST-- 261

Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
                 G  P V  D   R   +       + P L+LV   LG++K+ P Y   L     
Sbjct: 262 ------GDGPDVYED---RFFDIAKPSGETFHPTLILVGTRLGIDKITPVYWDALIAALQ 312

Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL---EADTSTYHSDVI 394
            PQS+GI GG+P +S Y +G Q     YLDPH  +  +   +D     +AD  + H+  +
Sbjct: 313 MPQSIGIAGGRPSSSHYFIGAQGSFLFYLDPHHTRTALPYYQDPTLYAQADVDSVHTRRL 372

Query: 395 RHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           R +H+  +DPS+ IGF   D+DD+D++
Sbjct: 373 RRLHVREMDPSMLIGFVIHDEDDWDEW 399


>gi|308491308|ref|XP_003107845.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
 gi|308249792|gb|EFO93744.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
          Length = 518

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 155/318 (48%), Gaps = 49/318 (15%)

Query: 130 DAAG-NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
           DA G ++G  +F  D+ SR+ I+YR  F P+ ++  T+D GWGCM+R++QM+VAQA++ +
Sbjct: 159 DANGVSSGFEDFCSDYYSRLWITYRTDFAPLLNTDTTTDCGWGCMIRTTQMMVAQAIMLN 218

Query: 189 RLGRPWRKPLQKP-----------FDREYVE---ILHLFGDSETSPFSIHNLLQ--AGKA 232
           R GR WR   +K            FDRE ++   IL LF D  +SP  IH +++  A + 
Sbjct: 219 RFGREWRFVRRKKSYVTINGEETDFDREKIKEWMILKLFEDKPSSPLGIHRMVEISAKEK 278

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
              A GSW  P       EA+   ++A        L  +I  ++GD       A  + I 
Sbjct: 279 GKKAVGSWYSPS------EAVFIMKKA--------LTESISPLTGD------TAMYLSI- 317

Query: 293 DASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
           D   H         +W   L+LV +V LG  ++NP Y+P L   F+    LG+ GG+P  
Sbjct: 318 DGRVHIRDIEVETKNWMKTLILVIVVRLGAAELNPIYVPHLMRLFSMESCLGVTGGRPDH 377

Query: 352 STYIVGVQEESAIYLDPHDVQPVINI----------GKDDLEADTSTYHSDVIRHIHLDS 401
           S + VG   +  IYLDPH     I I           K   +    +YH  ++  +H   
Sbjct: 378 SCWFVGFYGDQIIYLDPHVAHEYIPIDMNFNVNMTDNKKSKKCPERSYHCRLLSKMHFLD 437

Query: 402 IDPSLAIGFYCRDKDDFD 419
           +DPS A+ F    ++ FD
Sbjct: 438 MDPSCALCFRFESREQFD 455


>gi|296415785|ref|XP_002837566.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295633439|emb|CAZ81757.1| unnamed protein product [Tuber melanosporum]
          Length = 409

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 101/313 (32%), Positives = 150/313 (47%), Gaps = 49/313 (15%)

Query: 140 FNQDFSSRILISYRKGFDPI---------------------GDSKITSDVGWGCMLRSSQ 178
           F +DF S + ++YR  F PI                          TSD GWGCM+RS Q
Sbjct: 86  FLEDFESTLWMTYRSDFKPIPRVADYNDKLTFLTSIRSHLDKAEGFTSDSGWGCMIRSGQ 145

Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAA 237
            ++A AL   RLGR WR+ + KP   E   +L LF D   +PFSIH  ++ G+   G   
Sbjct: 146 AVIANALAHLRLGRGWRRGM-KP--EEEKRLLALFADDPRAPFSIHKFVRHGEVECGKNP 202

Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRH 297
           G W GP        A A C +A T    +   + +Y  + ++  E     V  ++     
Sbjct: 203 GEWFGP-------SAAAMCIQALTH-AYEPAGLRVYQTNSNDLYEEDFRKVAVVN----- 249

Query: 298 CSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
             VF        P L+L  + LG+E++   Y   L      PQ++GI GG+P +S Y + 
Sbjct: 250 -GVFK-------PTLVLAGIRLGIERITNIYYEPLAACLRMPQTVGIAGGRPSSSHYFIA 301

Query: 358 VQEESAIYLDPHDVQPVINIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
           VQ E+  YLDPH  +P++      +D  E +  T H+  IR +H+  +DPS+ I F  RD
Sbjct: 302 VQGENFFYLDPHTCRPILPFKENPQDYTEEEVDTCHTRRIRRLHIREMDPSMLIAFLIRD 361

Query: 415 KDDFDDFCARASK 427
           + D++D+  R S+
Sbjct: 362 EADWEDWQRRISE 374


>gi|17544636|ref|NP_502208.1| Protein ATG-4.2 [Caenorhabditis elegans]
 gi|5824904|emb|CAB54515.1| Protein ATG-4.2 [Caenorhabditis elegans]
          Length = 521

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 106/340 (31%), Positives = 159/340 (46%), Gaps = 56/340 (16%)

Query: 111 SDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 170
           +D+  LG  +  + DE+       +G   F  D+ SR+ I+YR  F  + D+  T+D GW
Sbjct: 146 NDVVFLGRRYSTSVDES----GLRSGFENFCSDYYSRLWITYRTDFPALLDTDTTTDCGW 201

Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQK-----------PFDREYVE---ILHLFGDS 216
           GCM+R++QM+VAQA++ +R GR WR   +K            FDRE ++   IL LF D 
Sbjct: 202 GCMIRTTQMMVAQAIMVNRFGRDWRFTRRKRSHVAAHGDEDDFDREKIQEWMILKLFEDK 261

Query: 217 ETSPFSIHNLL---QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 273
            T+P  IH ++     GK    A GSW  P       EA+   ++A   L   S P+   
Sbjct: 262 PTAPLGIHKMVGIAAMGKGKK-AVGSWYSPS------EAVFIMKKA---LTESSSPLT-- 309

Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPRYIPTL 332
                     G   ++   D   H         +W   L+LV +V LG  ++NP Y+P L
Sbjct: 310 ----------GNTAMLLSIDGRVHIRDIEVETKNWMKKLILVIVVRLGAAELNPIYVPHL 359

Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH--------DVQPVINI----GKD 380
              F     LGI GG+P  S++ VG   +  IYLDPH        D+ P  N+     K 
Sbjct: 360 MRLFAMESCLGITGGRPDHSSWFVGYYGDQIIYLDPHVAHEYIPIDINPNTNVVDSDSKK 419

Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
             +    +YH  ++  +H   +DPS A+ F    ++ FD+
Sbjct: 420 AKKCPEKSYHCRLLSKMHFFDMDPSCALCFQFESREQFDN 459


>gi|389637385|ref|XP_003716330.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
 gi|148887340|sp|Q523C3.2|ATG4_MAGO7 RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|351642149|gb|EHA50011.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
          Length = 491

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
           F  DF SRI ++YR GF+PI  S                         T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
            Q L+A +LL  RLGR WR+  Q P   E  ++L LF D   +P+SIHN +  G A  G 
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R   ALA                 +Y          G  P V  D   
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
               V       + P L+L+   LG++K+N  Y  +L  T   PQS+GI GG+P +S Y 
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367

Query: 356 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 406
           VG Q           YLDPH  +P +   +D      +D  + H+  +R +H+  +DPS+
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 427

Query: 407 AIGFYCRDKDDF 418
            IGF   D++++
Sbjct: 428 LIGFLILDEENW 439


>gi|393247625|gb|EJD55132.1| hypothetical protein AURDEDRAFT_78065 [Auricularia delicata
           TFB-10046 SS5]
          Length = 989

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 99/284 (34%), Positives = 134/284 (47%), Gaps = 47/284 (16%)

Query: 140 FNQDFSSRILISYRKGFDPI-----------------------------GDSKITSDVGW 170
           F  DF+SR+ ++YR  F PI                             G+   TSD GW
Sbjct: 314 FYADFTSRVWLTYRSQFSPIHDCPLSACKGKDLESLDANPPKRTFWPGSGEKTWTSDAGW 373

Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPL---QKPFDREYVEILHLFGDSET--SPFSIHN 225
           GCMLR+ Q L+A  L+   LGR WR+P      P    YV+IL  F D+ +  +PFS+H 
Sbjct: 374 GCMLRTGQSLLANTLIHLHLGRDWRRPAINSASPEFATYVKILTWFFDAPSVHAPFSVHR 433

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALAR-CQRAETGLGCQSLPMAIYVVSGDEDGERG 284
           +  +GK +G   G W GP     +   L     RA+ G+      +A+  V  + D    
Sbjct: 434 MAMSGKDFGKDVGQWFGPSTAAGAIRTLVHDFPRAQLGVA-----IAVDGVLYETDIYSA 488

Query: 285 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
               +   D +R  S F +    W    +L+LV   LGL+ VNP Y   L+  FTFPQSL
Sbjct: 489 SHYPMSSADGARRASGFKRHPGRWGNRAVLVLVATRLGLDGVNPIYYENLKTIFTFPQSL 548

Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI-----GKDD 381
           GI GG+P +S Y VG Q  S  YLDPH  +P + +     G DD
Sbjct: 549 GIAGGRPSSSYYFVGSQGNSLFYLDPHHTRPAVPLRTPPPGDDD 592



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 42/71 (59%), Gaps = 2/71 (2%)

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           D  T+H D +R + L  +DPS+ +GF CRD+ D+ DF  R +++++  +   LF++ +  
Sbjct: 699 DLKTFHCDRVRKMPLSGLDPSMLLGFLCRDEQDWKDFRRRMAEISKGRDT--LFSIQEEP 756

Query: 445 KKPVNHSDVLG 455
               + SD +G
Sbjct: 757 PSWPSDSDDMG 767


>gi|440478911|gb|ELQ59709.1| cysteine protease atg4 [Magnaporthe oryzae P131]
          Length = 572

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
           F  DF SRI ++YR GF+PI  S                         T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
            Q L+A +LL  RLGR WR+  Q P   E  ++L LF D   +P+SIHN +  G A  G 
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R   ALA                 +Y          G  P V  D   
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
               V       + P L+L+   LG++K+N  Y  +L  T   PQS+GI GG+P +S Y 
Sbjct: 390 -FMEVAKSDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448

Query: 356 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 406
           VG Q           YLDPH  +P +   +D      +D  + H+  +R +H+  +DPS+
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 508

Query: 407 AIGFYCRDKDDF 418
            IGF   D++++
Sbjct: 509 LIGFLILDEENW 520


>gi|440467300|gb|ELQ36530.1| cysteine protease atg4 [Magnaporthe oryzae Y34]
          Length = 572

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 143/312 (45%), Gaps = 56/312 (17%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
           F  DF SRI ++YR GF+PI  S                         T+D GWGCM+R+
Sbjct: 232 FLNDFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 291

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
            Q L+A +LL  RLGR WR+  Q P   E  ++L LF D   +P+SIHN +  G A  G 
Sbjct: 292 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 348

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R   ALA                 +Y          G  P V  D   
Sbjct: 349 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 389

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
               V       + P L+L+   LG++K+N  Y  +L  T   PQS+GI GG+P +S Y 
Sbjct: 390 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 448

Query: 356 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 406
           VG Q           YLDPH  +P +   +D      +D  + H+  +R +H+  +DPS+
Sbjct: 449 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 508

Query: 407 AIGFYCRDKDDF 418
            IGF   D++++
Sbjct: 509 LIGFLILDEENW 520


>gi|358369016|dbj|GAA85631.1| autophagy cysteine endopeptidase Atg4 [Aspergillus kawachii IFO
           4308]
          Length = 378

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 113/375 (30%), Positives = 172/375 (45%), Gaps = 54/375 (14%)

Query: 92  RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQD-EALGDAAGNNGLAE----------- 139
           +RI + +  P        TS IW LG+ +   +D    G+    N   +           
Sbjct: 11  KRIVQYLWDPEPRNDEDPTSSIWCLGIEYHPEKDVSPRGETPDKNSARDNTTGTTNYRKP 70

Query: 140 --------FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
                   F  DF SRI ++YR  F PI   ++  D     M   S  L+A AL    LG
Sbjct: 71  SEHAWPESFLLDFESRIWMTYRSNFPPI--PRVEGDDKSASMTLGS--LLANALSTLVLG 126

Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSW 250
           R WR+  +  F+ E  ++L LF D+ T+PFS+H  ++ G ++ G   G W GP A  +  
Sbjct: 127 RDWRRGAR--FEEE-SQLLSLFADTPTAPFSVHRFVKHGAESCGKFPGEWFGPSATAKCI 183

Query: 251 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 310
           EAL+          C S  + +YV +   +  +            R  +V       + P
Sbjct: 184 EALSS--------QCGSPTLKVYVSNDTSEVYQ-----------DRFMNVARNSSGVFQP 224

Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
            L+L+   LG++ + P Y   L+ T   PQS+GI GG+P AS Y VG Q     YLDPH 
Sbjct: 225 TLILLGTRLGIDHITPVYWDGLKATLQLPQSVGIAGGRPSASHYFVGAQGSHLFYLDPHY 284

Query: 371 VQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
            +P +     G+   + +  TYH+  +R IH+  +DPS+ IGF  RD++D+DD+  R   
Sbjct: 285 TRPALPDRQGGELYSKEEVDTYHTRRLRRIHVRDMDPSMLIGFLIRDQEDWDDWLNRIQA 344

Query: 428 LAEESNGAPLFTVTQ 442
           +     G P+  V +
Sbjct: 345 V----KGRPIIHVLK 355


>gi|328351041|emb|CCA37441.1| autophagy-related protein 4 [Komagataella pastoris CBS 7435]
          Length = 758

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 111/357 (31%), Positives = 150/357 (42%), Gaps = 98/357 (27%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 178
           F  D  S+I ++YR GF PI   K                      TSD GWGCM+R+SQ
Sbjct: 65  FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124

Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 237
            L+A ALLF  LGR W    + P + E+  I+  F D    PFSIHN +Q G K      
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184

Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 294
           G W GP A  R+ + L           C+  P   + +Y  S             C D  
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221

Query: 295 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
                  + G +D +TPIL+L+ + LG+EKVNP Y  +LR   +  QS+GI GG+P +S 
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281

Query: 354 YIVGVQEESAIYLDPHDVQPVINIG------------KDDLEA----------------- 384
           Y  G Q +   YLDPH  Q  +  G            K D  A                 
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTKKTDENAAGQYPVSNTDSNNETNH 341

Query: 385 --------------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
                               D  + H+  +  +HL  +DPS+ IGF    +DDF+D+
Sbjct: 342 DDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSHMDPSMLIGFLITSEDDFNDW 398


>gi|410967384|ref|XP_003990200.1| PREDICTED: cysteine protease ATG4C [Felis catus]
          Length = 459

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 117/425 (27%), Positives = 171/425 (40%), Gaps = 94/425 (22%)

Query: 108 SSTSDIWLLGVCHKIA-QDE----------ALGDAAGNNGLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+    +DE           + D      + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDENKLLPARSGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP------------------- 197
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                   
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSNTVK 155

Query: 198 ---------------------LQKPFDREYVE------------ILHLFGDSETSPFSIH 224
                                 QK   R Y +            I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGERELKTPAVSQKETIRRYSDDHEMRNEIYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQ-------- 262

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   + C+  +    D   +++L+P+ LG E+ N  Y+  ++      ++L I
Sbjct: 263 DCTVYSSDVIDKQCTSMASDNTDDKAVIILIPVRLGGERTNTDYLDFVKGIL---RALNI 319

Query: 345 VG----GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
           V      KP  S Y  G Q++S IY+DPH  Q  +++   D   +  T+H    + +   
Sbjct: 320 VWVLLVAKPKQSYYFAGFQDDSLIYMDPHYCQSFVDVSIKDFPLE--TFHCPSPKKMSFR 377

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PLFTVTQTHKK-------PVNHS 451
            +DPS  IGFYCR+  DF       +K+ + S+    PLFT    H +         N  
Sbjct: 378 KMDPSCTIGFYCRNVQDFKRASEEITKMLKISSKEKYPLFTFVNGHSRDYDFTSTTTNEE 437

Query: 452 DVLGE 456
           D+  E
Sbjct: 438 DLFSE 442


>gi|148691993|gb|EDL23940.1| mCG3720 [Mus musculus]
          Length = 318

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 87/266 (32%), Positives = 127/266 (47%), Gaps = 49/266 (18%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 77  VWILGKQHPLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 125

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 126 MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 185

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 186 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 228

Query: 293 DASRHCSVFSKGQAD---------------------WTPILLLVPLVLGLEKVNPRYIPT 331
           D  + C V   G AD                     W P+LL+VPL LG+ ++NP Y+  
Sbjct: 229 DIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLIVPLRLGINQINPVYVEA 288

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVG 357
            +  F  PQSLG +GGKP  + Y +G
Sbjct: 289 FKECFKMPQSLGALGGKPNNAYYFIG 314


>gi|254567087|ref|XP_002490654.1| Conserved cysteine protease required for autophagy [Komagataella
           pastoris GS115]
 gi|238030450|emb|CAY68374.1| Conserved cysteine protease required for autophagy [Komagataella
           pastoris GS115]
          Length = 531

 Score =  151 bits (381), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 111/357 (31%), Positives = 150/357 (42%), Gaps = 98/357 (27%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 178
           F  D  S+I ++YR GF PI   K                      TSD GWGCM+R+SQ
Sbjct: 65  FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124

Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 237
            L+A ALLF  LGR W    + P + E+  I+  F D    PFSIHN +Q G K      
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184

Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 294
           G W GP A  R+ + L           C+  P   + +Y  S             C D  
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221

Query: 295 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
                  + G +D +TPIL+L+ + LG+EKVNP Y  +LR   +  QS+GI GG+P +S 
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNPVYWDSLRECLSLKQSVGIAGGRPCSSH 281

Query: 354 YIVGVQEESAIYLDPHDVQPVINIG------------KDDLEA----------------- 384
           Y  G Q +   YLDPH  Q  +  G            K D  A                 
Sbjct: 282 YFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTKKTDENAAGQYPVSNTDSNNETNH 341

Query: 385 --------------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
                               D  + H+  +  +HL  +DPS+ IGF    +DDF+D+
Sbjct: 342 DDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSHMDPSMLIGFLITSEDDFNDW 398


>gi|210063823|gb|ACJ06587.1| ATG4 protein [Magnaporthe oryzae]
          Length = 491

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 144/312 (46%), Gaps = 56/312 (17%)

Query: 140 FNQDFSSRILISYRKGF-------DPIGDSKI----------------TSDVGWGCMLRS 176
           F  DF SRI ++YR GF       DP   S++                T+D GWGCM+R+
Sbjct: 151 FLNDFESRIWMTYRSGFESIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRT 210

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
            Q L+A +LL  RLGR WR+  Q P   E  ++L LF D   +P+SIHN +  G A  G 
Sbjct: 211 GQSLLANSLLTCRLGRSWRR-GQAP--DEERKLLSLFADDPRAPYSIHNFVAHGAAKCGK 267

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R   ALA                 +Y          G  P V  D   
Sbjct: 268 YPGEWFGPSATARCIHALANATENS---------FRVYST--------GDLPDVYEDS-- 308

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
               V       + P L+L+   LG++K+N  Y  +L  T   PQS+GI GG+P +S Y 
Sbjct: 309 -FMEVAKPDGKTFHPTLILISTRLGIDKINQVYWESLTATLQLPQSVGIAGGRPSSSHYF 367

Query: 356 VGVQEESA------IYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSL 406
           VG Q           YLDPH  +P +   +D      +D  + H+  +R +H+  +DPS+
Sbjct: 368 VGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSCHTRRLRRLHIREMDPSM 427

Query: 407 AIGFYCRDKDDF 418
            IGF   D++++
Sbjct: 428 LIGFLILDEENW 439


>gi|67526025|ref|XP_661074.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
 gi|40743824|gb|EAA63010.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
          Length = 379

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 106/328 (32%), Positives = 159/328 (48%), Gaps = 54/328 (16%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCMLRS 176
           F  DF S+I ++YR  F PI                            TSD GWGCM+RS
Sbjct: 50  FLLDFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMIRS 109

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GL 235
            Q L+A ++    LGR WR+  +     E  ++L LF DS  +PFSIH+ ++ G  + G 
Sbjct: 110 GQSLLANSMAILLLGRDWRRGERL---EEEGKLLSLFADSPHAPFSIHSFVKHGADFCGK 166

Query: 236 AAGSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
             G W GP A  R  + LA R  ++          + +Y+   + D  +     V  D+ 
Sbjct: 167 HPGEWFGPTATARCIQGLAARYDQSN---------LQVYIADDNSDVHQDKFMSVSRDE- 216

Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
                   KG     P L+L+ L LG++++   Y   L+     PQS+GI GG+P AS Y
Sbjct: 217 --------KGTV--RPTLILLGLRLGIDRITAVYWNGLKAVLQLPQSVGIAGGRPSASHY 266

Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDD--LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
            V VQ     YLDPH+ +P +   +     E + +TYH+  +R +++  +DPS+ IGF  
Sbjct: 267 FVAVQGSHFFYLDPHNTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDPSMLIGFLI 326

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTV 440
           RD+DD++D+ AR   L     G P+ T+
Sbjct: 327 RDEDDWEDWKARIMSL----EGKPIITI 350


>gi|358381369|gb|EHK19044.1| hypothetical protein TRIVIDRAFT_181799 [Trichoderma virens Gv29-8]
          Length = 451

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 98/309 (31%), Positives = 146/309 (47%), Gaps = 50/309 (16%)

Query: 140 FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 176
           F +D +++  ++YR GFDPI  S                         +SD GWGCM+RS
Sbjct: 117 FLEDMAAKFWMTYRSGFDPIAKSVDPRATSALSFAVRIKSTLSDPTGFSSDSGWGCMIRS 176

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
            Q L+A  +   +LGR WR+       +E  +++ +F D   +P+SIHN ++ G  A G 
Sbjct: 177 GQSLLATTIGILQLGRDWRR---GKCQQEERQLISMFADDPRAPYSIHNFVRHGATACGK 233

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP        A A+C +A T      LP+ +Y  +  +D        +   D  
Sbjct: 234 FPGEWFGP-------SATAQCIQALTS--ASGLPLKVYSPNDGQDVYEDSFMKIAKPD-- 282

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
                   GQ D+ P L+L+   LG++K+ P Y   L      PQS+GI GG+P +S Y 
Sbjct: 283 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLLAALQMPQSVGIAGGRPSSSHYF 333

Query: 356 VGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           VG Q     YLDPH  +  I    D     E D  + H+  +R +HL  +DPS+ IGF  
Sbjct: 334 VGSQGSYLFYLDPHHTRKAIPYHADVTKYTEEDIESCHTSRLRRLHLKEMDPSMLIGFLI 393

Query: 413 RDKDDFDDF 421
           R + D+ ++
Sbjct: 394 RTESDWSEW 402


>gi|343428793|emb|CBQ72338.1| related to ATG4-essential for autophagy [Sporisorium reilianum SRZ2]
          Length = 1505

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 102/339 (30%), Positives = 155/339 (45%), Gaps = 77/339 (22%)

Query: 164  ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE------------------ 205
            +T+D GWGCMLR+ Q L+A AL+   LGR W +  + P  R+                  
Sbjct: 785  LTTDSGWGCMLRTGQSLLANALINVHLGRSWMR--EAPPARQLEFLQELANLSLDTSAEK 842

Query: 206  ---------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
                           Y++IL  F D  S   PF +H + + GK  G   G W GP     
Sbjct: 843  QSLLEWRQKRARHSTYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAG 902

Query: 249  SWEALARCQRAETGLGCQSLPMAIYVVSGDE-DGERGGAPVVCIDDASRHCSVFSKGQAD 307
            + + L   +  + GL  +     ++ +  DE     G +  +    AS   +   KG   
Sbjct: 903  AIKQLV-SEFPDAGLAVELAHDGVFYL--DEVRAAAGASRQLGKGRASATGTNGRKGDTA 959

Query: 308  WT---PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
             T   P+L+L+ + LGL+ VNP Y  +++ TF+FP S+GI GG+P +S Y +G Q  S  
Sbjct: 960  LTWHKPVLILIGIRLGLDSVNPIYYESVKATFSFPHSVGIAGGRPSSSYYFMGHQGNSLF 1019

Query: 365  YLDPHDVQPVINI------------------------GKDD---------LEADTSTYHS 391
            YLDPH+V+P + +                          DD          EA TST+H 
Sbjct: 1020 YLDPHNVRPAVALRFPPSTFPAAVPRQLDIAHRFAFEEHDDEDEWWSHAYTEAQTSTFHC 1079

Query: 392  DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430
            D +R + + S+DPS+ +GF  +D++D  D CAR   L++
Sbjct: 1080 DKVRRMPIKSLDPSMLLGFLVKDEEDLADLCARIKALSK 1118


>gi|358390472|gb|EHK39877.1| hypothetical protein TRIATDRAFT_208244 [Trichoderma atroviride IMI
           206040]
          Length = 452

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 103/317 (32%), Positives = 150/317 (47%), Gaps = 52/317 (16%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVG 169
           G    A F +D SS+  ++YR GF+PI  S                         +SD G
Sbjct: 113 GTGWPAGFVEDMSSKFWMTYRSGFEPIPKSVDPKAASALSFSMRIKSTLSDSAGFSSDSG 172

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+RS Q L+A  +   RLGR WR+   +  +R    ++ +F D   +P+SIHN ++ 
Sbjct: 173 WGCMIRSGQSLLATTIGILRLGRDWRRDQSQEEERH---LISMFADDPRAPYSIHNFVRH 229

Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G  A G   G W GP        A A+C +A T      L + IY  +  +D        
Sbjct: 230 GATACGKYPGEWFGP-------SATAQCIQALTS--SSGLSLNIYSPNDGQD-------- 272

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
             + + S      S GQ  + P L+L+   LG++K+ P Y   L      PQS+GI GG+
Sbjct: 273 --VYEDSFMKIAKSDGQT-FNPTLILIRTRLGIDKITPIYWDALIAALHMPQSVGIAGGR 329

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL----EADTSTYHSDVIRHIHLDSIDP 404
           P +S Y VG Q     YLDPH  +  I    DD+    E D  + H+  +R IH+  +DP
Sbjct: 330 PASSHYFVGSQGSYLFYLDPHHTRKAIPY-HDDVTKYTEEDIESCHTSRLRRIHIKEMDP 388

Query: 405 SLAIGFYCRDKDDFDDF 421
           S+ IGF  R + D+ ++
Sbjct: 389 SMLIGFLIRTESDWTEW 405


>gi|452004375|gb|EMD96831.1| hypothetical protein COCHEDRAFT_1123524 [Cochliobolus
           heterostrophus C5]
          Length = 471

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 160/358 (44%), Gaps = 91/358 (25%)

Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
           N  + F  DF SRI ++YR GF  I  S+                      TSD G+GCM
Sbjct: 91  NWPSAFLDDFESRIWMTYRSGFMAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150

Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
           +RS Q ++A AL   RLGR WR    KP  +E+ EIL LF D   +PFSIH  ++ G A 
Sbjct: 151 IRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAV 209

Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G   G W GP A  R  + LA   R E GL        +YV SGD      GA V   +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYV-SGD------GADVY--E 252

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  +  ++   GQ  W P L+LV   LG++K+ P Y   L+ +    QS+GI GG+P AS
Sbjct: 253 DKLKEVAIDDDGQ--WQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310

Query: 353 TYIVGVQEESAIYLDPHDVQPVI--------------NIGKDDLE--------------- 383
            Y V  Q  +  YLDPH  +P++              N  ++ L                
Sbjct: 311 HYFVATQGNNFFYLDPHSTRPLLPYRPPPSSTENESQNQSQNQLAVPSSLDASATSNSSS 370

Query: 384 ------------ADTSTY--------HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
                       +D +TY        H+  IR + +  +DPS+ I F     DD++++
Sbjct: 371 TTIVPSATPTDGSDRTTYSEEELATCHTRRIRRLQIREMDPSMLIAFLITSADDYENW 428


>gi|189194545|ref|XP_001933611.1| peptidase family C54 protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187979175|gb|EDU45801.1| peptidase family C54 protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 470

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 112/357 (31%), Positives = 159/357 (44%), Gaps = 90/357 (25%)

Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
           N  + F  DF SRI ++YR GF PI  S+                      TSD G+GCM
Sbjct: 91  NWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCM 150

Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
           +RS Q ++A AL   RLGR WR   ++P  +E+ +++ +F D   +PFSIH  ++ G A 
Sbjct: 151 IRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDVVAMFADDPRAPFSIHRFVEHGAAV 209

Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G   G W GP A  R  + L    R E GL        +YV SGD      GA V   +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLVHKNR-EAGL-------KVYV-SGD------GADVY--E 252

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  +  +V   G+  W P L+LV   LG++K+ P Y   L+ +    QS+GI GG+P AS
Sbjct: 253 DKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGRPSAS 310

Query: 353 TYIVGVQEESAIYLDPHDVQPVINI------------GKDDLE----------------- 383
            Y V  Q  +  YLDPH  +P++                  LE                 
Sbjct: 311 HYFVATQANNFFYLDPHSTRPLLPYRPSSSSTEEQVAAPSTLEASATSVTSTSSSTTIVP 370

Query: 384 -ADTSTYHSDV------------------IRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
            A+  T  SDV                  IR + +  +DPS+ + F    +DD++D+
Sbjct: 371 SANEVTAPSDVSKPSGYSLEELATCHTRRIRRLQIREMDPSMLLAFLITSEDDYEDW 427


>gi|330935035|ref|XP_003304808.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
 gi|311318464|gb|EFQ87127.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
          Length = 470

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 110/361 (30%), Positives = 158/361 (43%), Gaps = 90/361 (24%)

Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVG 169
           A   N  + F  DF SRI ++YR GF PI  S+                      TSD G
Sbjct: 87  AQYGNWPSAFLDDFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           +GCM+RS Q ++A AL   RLGR WR   ++P  +E+ +I+ +F D   +PFSIH  ++ 
Sbjct: 147 FGCMIRSGQCILANALQILRLGRDWRY-QEQPDAKEHCDIVAMFADDPRAPFSIHRFVEH 205

Query: 230 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G A  G   G W GP A  R  + L   +  E GL        +YV SGD      GA V
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLVH-KNKEVGL-------KVYV-SGD------GADV 250

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +D  +  +V   G+  W P L+LV   LG++K+ P Y   L+ +    QS+GI GG+
Sbjct: 251 Y--EDKLKEIAVDDDGE--WHPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGR 306

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD----------------------- 385
           P AS Y V  Q  +  YLDPH  +P++         +                       
Sbjct: 307 PSASHYFVATQANNFFYLDPHSTRPLLPYRPSSWSTEEQASAPSTLEASATSATSTSSST 366

Query: 386 -----------------TSTY--------HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
                            TS Y        H+  IR + +  +DPS+ + F    +DD++D
Sbjct: 367 TIVPSANEVTAPSDASRTSGYSPEELATCHTRRIRRLQIREMDPSMLLAFLITSEDDYED 426

Query: 421 F 421
           +
Sbjct: 427 W 427


>gi|71022117|ref|XP_761289.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
 gi|46097783|gb|EAK83016.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
          Length = 1541

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 109/367 (29%), Positives = 165/367 (44%), Gaps = 86/367 (23%)

Query: 155  GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK--PLQKPFD--------- 203
            GF   G   +T+D GWGCMLR+ Q L+A ALL   LGR W +  P  +  D         
Sbjct: 814  GFSRAG---LTTDSGWGCMLRTGQSLLANALLNVHLGRSWLREAPPMRQMDFLEQLASLS 870

Query: 204  -------------RE-------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWV 241
                         RE       Y++IL  F D  S   PF +H + + GK  G   G W 
Sbjct: 871  LDSSVEMQSLQEWREKRARHAAYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWF 930

Query: 242  GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 301
            GP     + + L   +  + G+  +     ++ +  DE     GA         R     
Sbjct: 931  GPSTAAGAIKQLV-TEFPDAGIAVELAHDGVFYL--DEVRLAAGARSALQSGKGR----- 982

Query: 302  SKGQADWT---PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV 358
             +G A  T   P+++L+ + LGL+ VNP Y  +++ TF+FP S+GI GG+P +S Y +G 
Sbjct: 983  -QGDAAVTWRRPVVILIGIRLGLDSVNPIYYESVKETFSFPHSVGIAGGRPSSSYYFMGH 1041

Query: 359  QEESAIYLDPHDVQPVINI------------------------GKDD---------LEAD 385
            Q  S  YLDPH+V+P + +                         KDD          EA 
Sbjct: 1042 QGNSLFYLDPHNVRPAVALRYPPSTFPTAVPHQLDVAHRFALEDKDDELEWWSHAYTEAQ 1101

Query: 386  TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 445
            TST+H + +R + + S+DPS+ +GF  +D++D  D C R   L +      +F+  ++  
Sbjct: 1102 TSTFHCEKVRRMPIKSLDPSMLLGFLVKDEEDLMDLCTRIKGLPKT-----IFSFAESAP 1156

Query: 446  KPVNHSD 452
            K V+  D
Sbjct: 1157 KWVDDDD 1163


>gi|443893810|dbj|GAC71266.1| cysteine protease [Pseudozyma antarctica T-34]
          Length = 1509

 Score =  148 bits (374), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 115/392 (29%), Positives = 172/392 (43%), Gaps = 88/392 (22%)

Query: 164  ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----QKPFDRE-------------- 205
            +T+D GWGCMLR+ Q L+A AL+   LGR W++      Q  F  E              
Sbjct: 776  LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRETAPKSQIEFFEELANASLDASAENQS 835

Query: 206  -------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 250
                         Y+ IL  F D  S   PF +H + + GK  G   G W GP     + 
Sbjct: 836  LASWRERRARHATYIRILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAGAI 895

Query: 251  EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ----A 306
            + L      E G+  +     ++ +    D  R  A        SR   + S  +    A
Sbjct: 896  KQLV-FDFPEAGIAVELAHDGVFYL----DEVRAAASAST--GKSRASGMLSGNRRAETA 948

Query: 307  DWT-PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
             W  P+L+L+ + LGLE VNP Y  +++ TF+FPQS+GI GG+P +S Y +G Q  S  Y
Sbjct: 949  VWRRPVLILIGIRLGLETVNPIYYESVKATFSFPQSVGIAGGRPSSSYYFMGHQGNSLFY 1008

Query: 366  LDPHDVQPVINI------------------------GKDD---------LEADTSTYHSD 392
            LDPH+V+P + +                         +DD          EA TST+H +
Sbjct: 1009 LDPHNVRPAVPLRYPPTTFPAAAPSRFDVSHRYALEDRDDEDEWWSHAYTEAQTSTFHCE 1068

Query: 393  VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSD 452
             +R + + S+DPS+ +GF  +D++   D CAR   L +      +F+  ++  K V+  D
Sbjct: 1069 KVRRMPIKSLDPSMLLGFLVKDEEALVDLCARIKALPKT-----IFSFAESAPKWVDDDD 1123

Query: 453  V--LGETGGVPEDDSLGVMSMNDAVGNAHEDD 482
                 E+   P  D  G    +D VG   + D
Sbjct: 1124 FDPSMESFSEPSADEAG---SDDDVGKGEDQD 1152


>gi|402219068|gb|EJT99143.1| hypothetical protein DACRYDRAFT_70366 [Dacryopinax sp. DJM-731 SS1]
          Length = 1093

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 153/319 (47%), Gaps = 42/319 (13%)

Query: 160 GDSKITSDVGWGCMLRSSQMLVAQALL-------------FHRLGRPWRKPLQKPFDRE- 205
           G   +TSD GWGCMLR+ QML+A +L+              +    P   P +   DR+ 
Sbjct: 431 GRGDLTSDAGWGCMLRTGQMLLANSLVALHVPPLPPNPVYINNFPAPSLPPSET--DRQR 488

Query: 206 ---YVEILHLFGDSET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
              YV+IL  F D  +   PFS+H L  AG   G   G W GP     S + L     A 
Sbjct: 489 FEAYVKILVWFLDDPSIWCPFSVHRLALAGADMGREVGQWFGPSIAAGSIKKLVSAFPA- 547

Query: 261 TGLGCQSLP------MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPIL 312
            GLG    P       A++  S         + +    D        ++ + +W    +L
Sbjct: 548 CGLGVVVPPDQIIHETAVFTASHTPTLPSSASSLSNTRDREARERA-NRMKEEWGDRAVL 606

Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
           +L+ L LG+E V P Y  +++  FTFPQ++GI GG+P +S Y VG Q +   YLDPH  +
Sbjct: 607 ILIGLRLGIEGVTPIYYDSVKALFTFPQTVGIAGGRPSSSYYFVGTQGDHLFYLDPHSTR 666

Query: 373 PVINI-----GKDDLE-----ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
           P + +     G  D       ++  T+HSD +R +H+  +DPS+  GF  R+ +++ D  
Sbjct: 667 PAVPLRVPTDGPYDATGQFTLSEMKTFHSDKVRKMHISGLDPSMLCGFIVRNVEEWRDLR 726

Query: 423 ARASKLAEESNG-APLFTV 440
           AR   LA+   G AP+FT+
Sbjct: 727 ARVDALAKSKGGKAPIFTI 745


>gi|388856806|emb|CCF49593.1| related to ATG4-essential for autophagy [Ustilago hordei]
          Length = 1572

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 167/387 (43%), Gaps = 114/387 (29%)

Query: 155  GFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK---PL-QKPFDRE----- 205
            GF   G   +T+D GWGCMLR+ Q L+A AL+   LGR W++   PL Q+ F  E     
Sbjct: 824  GFSRAG---LTTDSGWGCMLRTGQSLLANALINVHLGRSWQRDAPPLRQQQFLEELAGLS 880

Query: 206  ----------------------YVEILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWV 241
                                  Y++IL  F D  S   PF +H + + GK  G   G W 
Sbjct: 881  IADAAEKESLQEWRQKRARHATYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWF 940

Query: 242  GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD-------A 294
            GP     + + L               P A   V    DG      V  +D+       +
Sbjct: 941  GPSTASGAIKQL-----------VSEFPQAGIAVELARDG------VFYLDEVRAAASAS 983

Query: 295  SRHCSVFSKGQAD---------------WT-PILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
            +   SV S G+A                W  P+L+L+ + LGLE VNP Y  +++ TF+F
Sbjct: 984  ASAASVQSGGKARSSGAASGSRKGEGLIWRRPVLILIGIRLGLESVNPIYYESVKATFSF 1043

Query: 339  PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI--------------------- 377
            P S+GI GG+P +S Y +G Q  S  YLDPH+V+P + +                     
Sbjct: 1044 PHSVGIAGGRPSSSYYFMGHQGNSLFYLDPHNVRPAVPLRYPPSTFPDAVPRHLGIAHRF 1103

Query: 378  ---GKDD---------LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
                KDD          E  TST+H + +R + + S+DPS+ +GF  +D++   D CAR 
Sbjct: 1104 VLEDKDDEDEWWSHAYSEVQTSTFHCEKVRRMPIKSLDPSMLLGFLVKDEESLQDLCARI 1163

Query: 426  SKLAEESNGAPLFTVTQTHKKPVNHSD 452
              L +      +F+  ++  K V+  D
Sbjct: 1164 KALPKT-----IFSFAESAPKWVDDDD 1185


>gi|353227348|emb|CCA77858.1| hypothetical protein PIIN_00505 [Piriformospora indica DSM 11827]
          Length = 1257

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 98/309 (31%), Positives = 142/309 (45%), Gaps = 61/309 (19%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 164
           F  D++SR+ ++YR  F PI D+ +                                   
Sbjct: 317 FYSDYTSRVWLTYRNTFPPIRDTALSCLEPVASRSTHNNSSSTDISQPLPSPSKPRWPWS 376

Query: 165 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE---ILHLFGD- 215
                TSD GWGCMLR+ Q L+A AL+   L R WR+P    +  +YV+   IL  F D 
Sbjct: 377 GEKGWTSDAGWGCMLRTGQSLLANALIHLHLSRSWRRPTHPSYSPDYVQYVRILTWFLDN 436

Query: 216 -SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC--------- 265
            S  +PF IH +  AGK  G   GSW GP     + + L   +  + GL           
Sbjct: 437 PSPLAPFGIHRMALAGKELGKEVGSWFGPSTAAGAIKRLV-GEFEDAGLEVALAVDSVVY 495

Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEK 323
           QS   A    S +++G  G +  V    + +      +G   W   P+L+LV + LG++ 
Sbjct: 496 QSDVYAASAASRNQNGVEGDSKTVGTSKSRKKG----QGPPKWGNRPVLILVGIRLGIDG 551

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
           VNP Y  +++  FTFPQ++GI GG+P +S Y VG Q +S  YLDPH  +P I +      
Sbjct: 552 VNPIYYESVKTLFTFPQTVGIAGGRPSSSYYFVGAQGDSLFYLDPHHTRPAIPLRPPPAF 611

Query: 384 ADTSTYHSD 392
            +TS   +D
Sbjct: 612 DETSIISTD 620



 Score = 42.7 bits (99), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 16/53 (30%), Positives = 34/53 (64%), Gaps = 2/53 (3%)

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
           T+H + +R + L ++DPS+ +GF CR+++++ D   R +++A       +F+V
Sbjct: 794 TFHCERVRKMPLSALDPSMLLGFLCRNEEEWKDLRERLAEMARTKKA--IFSV 844


>gi|348511374|ref|XP_003443219.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
          Length = 459

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 98/342 (28%), Positives = 154/342 (45%), Gaps = 59/342 (17%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-------------- 185
           F + F+S +  +YR+GF P+  S +T+D GWGC+LRSSQML+AQ L              
Sbjct: 98  FRRCFASLLWFTYRRGFRPLPGSSLTTDSGWGCVLRSSQMLLAQGLLLHLMSPGWTWSGN 157

Query: 186 ---------LFHRLGR---------------PWRKPLQKPFDREYVEILHLFGDSETSPF 221
                    L H +                  W   L +P +     IL  F D+ T+PF
Sbjct: 158 QRVVKDDMDLIHSVNDGFSSSERESKRSRHLSWGSILDRPTEGTPRRILRWFADNPTAPF 217

Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
            IH L++ GK+ G  AG W GP          A   R         LP  +  V+ D   
Sbjct: 218 GIHRLVELGKSSGKKAGDWYGP-------SIAAHILRKAVEASVVDLPNLVAYVAQD--- 267

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
                  + + D  + C         W  +L+LVP+ LG + +NP YI +++        
Sbjct: 268 -----CTIYLQDVRKLCE--RPLPQHWKSVLILVPVRLGGQDLNPSYITSVKKLLMLECC 320

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
           +GI+GGKP  S + VG Q++  +YLDPH  QP +++ K+       ++H    R +    
Sbjct: 321 IGIIGGKPKHSLFFVGFQDDHLLYLDPHYCQPTVDVTKN---FPLESFHCKNPRKMPFSR 377

Query: 402 IDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQ 442
           +DPS  IGFY + + +F+  C   ++ ++  +   P+F   +
Sbjct: 378 MDPSCTIGFYAKGQMEFESLCTSVNEAVSASAETYPMFIFEE 419


>gi|256071261|ref|XP_002571959.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
 gi|353229490|emb|CCD75661.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
          Length = 376

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 104/317 (32%), Positives = 155/317 (48%), Gaps = 42/317 (13%)

Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFS 222
           TSD GWGCM R  QML+AQAL+ H LGR WR    +      ++I+  F DS +  SP S
Sbjct: 67  TSDCGWGCMFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLS 126

Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSG 277
           +H L+Q         G W GP ++C    A+ R     + L  +   + +Y     V+  
Sbjct: 127 LHRLVQMSDR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYR 180

Query: 278 DE--DGERG------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLG 320
           +E  D  RG        P +   D   H +++ + Q+D          T ILLL+PL+ G
Sbjct: 181 EEIIDLARGLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFG 236

Query: 321 L-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
              ++NPRYI  +   F+ P  +G++GG+   S+Y VG Q  S IYLDPH  QP  N+  
Sbjct: 237 KGNRINPRYIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNS 296

Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 439
                D  ++H  + + +   +++PS A+GFYCR + +  D   R   L   S+      
Sbjct: 297 PKFSVD--SWHCPIPKTMSAANLNPSCAVGFYCRTRGELSDLIDRLPILMSVSDNLQ--- 351

Query: 440 VTQTHKKPVNHS-DVLG 455
              T  +PV  + +VLG
Sbjct: 352 -ASTRSRPVAFTVEVLG 367


>gi|451855330|gb|EMD68622.1| hypothetical protein COCSADRAFT_79257 [Cochliobolus sativus ND90Pr]
          Length = 473

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/263 (36%), Positives = 122/263 (46%), Gaps = 42/263 (15%)

Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVG 169
           A   N  + F  DF SRI ++YR GF  I  S+                      TSD G
Sbjct: 87  AQYGNWPSAFLDDFESRIWMTYRSGFTAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           +GCM+RS Q ++A AL   RLGR WR    KP  +E+ EIL LF D   +PFSIH  ++ 
Sbjct: 147 FGCMIRSGQSILANALQILRLGRDWRY-QDKPTAKEHCEILSLFADDPRAPFSIHRFVEH 205

Query: 230 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G A  G   G W GP A  R  + LA   R E GL        +YV     D        
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------RVYVSGDGADVYEDKLKE 257

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
           V IDD             +W P L+LV   LG++K+ P Y   L+ +    QS+GI GG+
Sbjct: 258 VAIDD-----------DGEWQPTLILVGTRLGIDKITPVYWEALKASLQMKQSIGIAGGR 306

Query: 349 PGASTYIVGVQEESAIYLDPHDV 371
           P AS Y V  Q  +  YLDPH  
Sbjct: 307 PSASHYFVATQGNNFFYLDPHST 329


>gi|444525500|gb|ELV14047.1| Cysteine protease ATG4D [Tupaia chinensis]
          Length = 431

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 104/384 (27%), Positives = 171/384 (44%), Gaps = 89/384 (23%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 60  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 109

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW--------------------------- 194
             +TSD GWGCMLRS QM++AQ LL H L R W                           
Sbjct: 110 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSRSASPSRYHGPAH 169

Query: 195 -RKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYA 245
            R P        L++  +R + +I+  F D   +PF +H L++ G++ G  AG W GP  
Sbjct: 170 WRPPRWAQGTPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-- 225

Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
                  +A   R       +   + +YV    +D     A VV +              
Sbjct: 226 -----SLVAHILRKAVESCSEVTRLVVYV---SQDCTVYKADVVRL-------VARPDPA 270

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           A+W  +++LVP+ LG E +NP Y+P ++L  T P                    ++  +Y
Sbjct: 271 AEWKSVVILVPVRLGGETLNPVYVPCVKLMPTPP-------------------TDDFLLY 311

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 312 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFEMLCSEL 369

Query: 426 SKLAEESNGA---PLFTVTQTHKK 446
           +++   S+     P+FT+ + H +
Sbjct: 370 TRVLSSSSATERYPMFTLAEGHAQ 393


>gi|355750993|gb|EHH55320.1| hypothetical protein EGM_04504, partial [Macaca fascicularis]
          Length = 268

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 92/268 (34%), Positives = 133/268 (49%), Gaps = 40/268 (14%)

Query: 109 STSDIWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           ++  +W+LG  + I   +DE L D A             SR+  +YRK F  IG +  TS
Sbjct: 22  TSEPVWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRKNFPAIGGTGPTS 68

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 69  DTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQI 128

Query: 227 LQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDG 281
            Q G   G + G W GP  + +  + LA      +        +A+++     V  +E  
Sbjct: 129 AQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIR 180

Query: 282 ERGGAPVVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYI 329
                 V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+
Sbjct: 181 RLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYV 240

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVG 357
            TL+  F  PQSLG++GGKP ++ Y +G
Sbjct: 241 ETLKHCFMMPQSLGVIGGKPNSAHYFIG 268


>gi|432871194|ref|XP_004071879.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
          Length = 452

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 96/371 (25%), Positives = 167/371 (45%), Gaps = 65/371 (17%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
           S +S + LLG  +++ +DEA  +         F + F+S + ++YR+GF  +  S +T+D
Sbjct: 70  SKSSPLILLGKSYEL-KDEANKE--------RFRRSFASLLWLTYRRGFPQLAGSSLTTD 120

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPL----------------------------- 198
            GWGC+LR+ QML+A+ LL H +   W   +                             
Sbjct: 121 SGWGCVLRTGQMLLARGLLTHLMPPGWMWSVWYRAVKDDLDLPHHADCTDCKSNMRCRYQ 180

Query: 199 ------QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
                  +P +  + +++  F D   +PF IH L++ G + G  AG W GP  +      
Sbjct: 181 SLGSLYDRPLEAMHRKVVSWFADHPKAPFGIHRLVELGASSGKKAGDWYGPSIVA---HI 237

Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPIL 312
           L +   A        LP  +  V+ D          + + D    C         W  ++
Sbjct: 238 LQKAVAASV-----DLPNLVVYVAQD--------CTIYLQDVRGLCE--RPPPHSWKSVI 282

Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
           +LVP+ LG + +NP YI  ++        +GI+GG+P  S + VG Q++  +YLDPH  Q
Sbjct: 283 ILVPVRLGGQDLNPSYISCVKKLLELQCCIGIIGGRPKHSLFFVGFQDDQLLYLDPHYCQ 342

Query: 373 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES 432
             +N+ K++   +  ++H    R +    +DPS  IGFY   + + +  C   +++   S
Sbjct: 343 LTVNVTKENFPLE--SFHCKYPRKMPFSRMDPSCTIGFYASGQQELELLCTNVNEVVSTS 400

Query: 433 -NGAPLFTVTQ 442
             G P+F  ++
Sbjct: 401 AEGYPMFIFSE 411


>gi|340518098|gb|EGR48340.1| protease required for autophagy [Trichoderma reesei QM6a]
          Length = 450

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/312 (31%), Positives = 148/312 (47%), Gaps = 50/312 (16%)

Query: 140 FNQDFSSRILISYRKGFDPIGDS-----------------------KITSDVGWGCMLRS 176
           F +D +++  ++YR GF+PI  S                         +SD GWGCM+RS
Sbjct: 115 FTEDMAAKFWMTYRSGFEPIPKSVDPRATSALSFSVRIKSTLTDPTGFSSDSGWGCMIRS 174

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGL 235
            Q L+A  +   +LGR WR+   +   +E   ++ +F D   +PFSIHN ++ G  A G 
Sbjct: 175 GQSLLATTIATLQLGRDWRRGKNQ---QEERRLISMFADDPRAPFSIHNFVRHGATACGK 231

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP        A A+C +A T      L + +Y  +  +D        V   D  
Sbjct: 232 FPGEWFGP-------SATAQCIQALTS--SSDLDLHVYSPNDGQDVYEDSFMKVAKPD-- 280

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
                   GQ D+ P L+L+   LG++K+ P Y   L  T   PQS+GI GG+P +S Y 
Sbjct: 281 --------GQ-DFHPTLILIRTRLGIDKITPIYWEPLIATLQMPQSVGIAGGRPSSSHYF 331

Query: 356 VGVQEESAIYLDPHDVQPVINIGKD---DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           VG Q     YLDPH  +  +   +D     + D  + H+  +R +H+  +DPS+ IGF  
Sbjct: 332 VGSQGSYLFYLDPHHTRKALPYHEDVANYTDEDIDSCHTSRLRRLHVKEMDPSMLIGFLI 391

Query: 413 RDKDDFDDFCAR 424
           R + D+ ++  R
Sbjct: 392 RSESDWAEWRQR 403


>gi|358056752|dbj|GAA97415.1| hypothetical protein E5Q_04093 [Mixia osmundae IAM 14324]
          Length = 1202

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 114/390 (29%), Positives = 161/390 (41%), Gaps = 110/390 (28%)

Query: 140 FNQDFSSRILISYRKGFDPI---------------------------GDSKITSDVGWGC 172
           F +DF+SRI ++YR GF PI                            +  +++D GWGC
Sbjct: 545 FYEDFTSRIQLTYRAGFPPIPTTVSNGPATTAFNAVLSSLTGRSPLQANDGLSTDAGWGC 604

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPL----QKPFDRE----------YVEILHLFGD--S 216
           MLR+ Q L+A AL F  LGR WR+      + P   E          Y  +L  F D  S
Sbjct: 605 MLRTGQSLLANALAFVHLGRDWRRTCSSSDESPDIPEESRSLEHFETYARLLTWFLDDPS 664

Query: 217 ETSPFSIHNLLQAGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV- 274
              PFS+H     GK  G    G W GP     + + LA      +     +L +A+ V 
Sbjct: 665 PLCPFSVHRFAVVGKEQGGKEIGEWFGPSTAAGAIKHLA------SNFAPANLGVAVSVD 718

Query: 275 --VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
             V   +       P      A R     S   +   P+L+L+   LGL+KVNP Y  ++
Sbjct: 719 GTVYRSDVQAAANPPFSEPATAGRQDPAPSVRTSWQRPVLILINARLGLDKVNPLYYESI 778

Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH----------------------- 369
           +   +FPQS+GI GG+P +S Y VGVQ+ S  Y+DPH                       
Sbjct: 779 KAALSFPQSVGISGGRPSSSYYFVGVQQNSVYYIDPHHTKPAIPFRQPPPDIAALAAELP 838

Query: 370 -DVQPVINIGKDDL----------------EADTST-----------------YHSDVIR 395
            D+   +N  +  L                E D +T                 +H D +R
Sbjct: 839 LDIHSPLNAWQRSLGDSLPPTPGAEPPAPDECDDATRLRAWFANEYDETCFGSFHCDRVR 898

Query: 396 HIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
            + L  +DPS+ IGF CRD+ D+DD  +RA
Sbjct: 899 KMPLSGLDPSMLIGFLCRDEADWDDLQSRA 928


>gi|448112117|ref|XP_004202013.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
 gi|359465002|emb|CCE88707.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
          Length = 480

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 98/330 (29%), Positives = 154/330 (46%), Gaps = 60/330 (18%)

Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 159
           LG   G+    E ++D  SRI  +YR GF+PI                            
Sbjct: 69  LGRRYGSGSKEEMDKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128

Query: 160 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 213
                  +   T+DVGWGCM+R+SQML+A A+    LGR +        ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAIQLLLLGRGFT--YADSSEKKHSDIIDMF 186

Query: 214 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
            D   +PFS+HN ++A     L    G W GP A   S + L + Q  E+     S P  
Sbjct: 187 TDDPKAPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQFDES-----SSPRF 241

Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
             ++S   D           DD  +   +  + +     IL+L+P+ LGL KV+P Y  +
Sbjct: 242 RVIISESCD---------IYDD--KIGKLLQENEDAEGAILILLPVRLGLNKVSPYYHNS 290

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
           L   F+ PQ +GI GGKP +S Y  G    + +YLDPH  Q V         +   T+H+
Sbjct: 291 LSSLFSSPQLVGIAGGKPSSSYYFFGSHNGNLLYLDPHYPQSV------KASSIYDTFHT 344

Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
             ++ + ++ +DPS+ IG   + K+D++ F
Sbjct: 345 HNVQSLKIEDMDPSMLIGILIKSKEDYESF 374


>gi|113931596|ref|NP_001039246.1| autophagy related 4D, cysteine peptidase [Xenopus (Silurana)
           tropicalis]
 gi|89273389|emb|CAJ82151.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
 gi|114108226|gb|AAI22932.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
          Length = 470

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 106/385 (27%), Positives = 172/385 (44%), Gaps = 79/385 (20%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
           S ++ ++LLG  +    D+ +           F +DF SR+ ++YR+ F  +  + +T+D
Sbjct: 76  SRSAPVYLLGERYYFRLDDEID---------RFQKDFVSRVWLTYRRDFPALEGTALTTD 126

Query: 168 VGWGCMLRSSQMLV---------------AQALLFH------------------------ 188
            GWGCM+RS QML+               ++AL  H                        
Sbjct: 127 CGWGCMIRSGQMLLAQGLLLHLLSREWTWSEALYTHFVEMEPIRSSSPSSMPLSLATDHS 186

Query: 189 -RLGRPWRKPLQKPFDRE-YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAM 246
            R  +P     + P+  E +  I+  F D  ++PF +H ++  G  +G  AG W GP   
Sbjct: 187 GRHSQPQTHCSRAPYGGEVHQNIVSWFSDHASAPFGLHRMVALGSIFGKRAGDWYGP--- 243

Query: 247 CRSWEALARCQRAETGLGCQSLPMAIYVVSG----DEDGERGGAPVVCIDDASRHCSVFS 302
                 +A   +       +   +++YV         D E+  A  V   D SR      
Sbjct: 244 ----SIVAHIIKKAIESSSEVPDLSVYVSQDCTVYKADIEQLFAGEVPHTDTSR-----G 294

Query: 303 KGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEES 362
            G+A    +++LVP  LG E  NP Y   L+     P  LGI+GGKP  S Y +G Q+  
Sbjct: 295 AGKA----VIILVPARLGGETFNPVYKHCLKEFLRMPSCLGIIGGKPKHSLYFIGYQDNY 350

Query: 363 AIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
            +YLDPH  QP I+  +D+   +  ++H +  R + +  +DPS    FY +++DDF   C
Sbjct: 351 LLYLDPHYCQPYIDTSRDNFPLE--SFHCNAPRKLSITRMDPSCTFAFYAKNRDDFGKLC 408

Query: 423 ARASKL-----AEESNGAPLFTVTQ 442
              SK+     AEE    P+F++++
Sbjct: 409 EHLSKVLHSPQAEEK--YPIFSISE 431


>gi|389750681|gb|EIM91754.1| hypothetical protein STEHIDRAFT_88418 [Stereum hirsutum FP-91666
           SS1]
          Length = 1286

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 102/312 (32%), Positives = 145/312 (46%), Gaps = 57/312 (18%)

Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKIT---------------------------- 165
           NN    F  DF+SR+ ++YR  F PI DS +T                            
Sbjct: 333 NNWPPVFYSDFTSRVWLTYRSHFQPIRDSTLTALESEQANMAHAGPVIMASSPPTKKWGW 392

Query: 166 ---------SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLF 213
                    SD GWGCMLR+ Q L+A AL+   LGR WR+P    +  +Y   V++L  F
Sbjct: 393 PGSGEKGWTSDAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQMLTWF 452

Query: 214 GDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
            DS T   PFS+H +  AGK  G   G W GP     + + L      E GLG     +A
Sbjct: 453 FDSPTPHCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVS---IA 508

Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
                   D      P +    + +  +    G+A    +L+L+ + LGL+ VNP Y  T
Sbjct: 509 SDSQIFQSDVFAASHPPMDSPSSKKKLASTWGGRA----VLVLIGIRLGLDGVNPIYYET 564

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
           ++  +TFPQS+GI GG+P +S Y VG Q ++  YLDPH  +P +      L    ST  +
Sbjct: 565 IKALYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAV-----PLRPPPST--N 617

Query: 392 DVIRHIHLDSID 403
           D++  I  +SI+
Sbjct: 618 DIVLDISRESIE 629



 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 51/91 (56%), Gaps = 15/91 (16%)

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
           A+  T+H + +R + L  +DPS+ +GF CRD+ D++DF AR + L++        T+   
Sbjct: 836 AELKTFHCERVRKMPLSGLDPSMLVGFLCRDEGDWEDFKARVADLSKTHK-----TIFSI 890

Query: 444 HKKPVNH-SDVLGETGGVPEDDSLGVMSMND 473
           H +P ++ SD          +D LG+ SM++
Sbjct: 891 HDEPPSYPSD---------SEDHLGLESMSE 912


>gi|195437827|ref|XP_002066841.1| GK24338 [Drosophila willistoni]
 gi|194162926|gb|EDW77827.1| GK24338 [Drosophila willistoni]
          Length = 400

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 94/316 (29%), Positives = 157/316 (49%), Gaps = 28/316 (8%)

Query: 135 NGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
           N + E +   +D  SR+  +YR  F P+G+ ++T+D GWGCMLR  QM++AQAL+   LG
Sbjct: 52  NAIQELDLIRRDIQSRLWCTYRHSFVPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLG 111

Query: 192 RPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWE 251
           R W     +  D  Y++I++ F D+  S +S+H +   G++     G W+GP  + +  +
Sbjct: 112 REWYWT-SECRDATYLKIVNRFEDARKSYYSLHQIALMGESQNKMVGEWLGPNTVAQILK 170

Query: 252 ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI 311
            L  C      L        I+V              V +DD        S+    W P+
Sbjct: 171 KLV-CFDDWCSL-------VIHVAMDS---------TVVLDDIYS----LSQDGESWKPL 209

Query: 312 LLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
           LL++PL LG+  +NP Y+P L+  F    S G++GG+P  + Y VG  ++  +YLDPH  
Sbjct: 210 LLIIPLRLGITDINPIYVPALKRCFELESSCGMIGGRPNQALYFVGYVDDEVLYLDPHTT 269

Query: 372 QPVINIGKDDLEADT---STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
           Q    +G+    A+     TYH      ++  ++DPSLA+ F C+ +  F+    +  + 
Sbjct: 270 QRTGAVGQKTTTAEQELDETYHQKYAARLNFSAMDPSLAVCFICKTQSSFELLLKQLREE 329

Query: 429 AEESNGAPLFTVTQTH 444
               +   LF ++++ 
Sbjct: 330 VLTLSSPALFEISKSR 345


>gi|426230580|ref|XP_004009345.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Ovis
           aries]
          Length = 438

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 107/353 (30%), Positives = 165/353 (46%), Gaps = 35/353 (9%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 83  SRTSFSKISS----VHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 132

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
             +TSD GWGCMLRS QM++AQ LL H L R W    Q                    P 
Sbjct: 133 GTLTSDCGWGCMLRSGQMMLAQGLLLHLLPRDWTWS-QGAGLGPAEPPGLGSPSPGPGPX 191

Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
                   G+A G  AG W GP         +A   R      C  +   +  VS D   
Sbjct: 192 XXXXXXSWGRAPGKKAGDWYGP-------SLVAHILRKAVE-SCSEVTRLVVYVSQDC-- 241

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
                  V   D +R  +  S   A+W  +++LVP+ LG E +NP Y+P ++        
Sbjct: 242 ------TVYKADVARLVAR-SDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELC 294

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
           LGI+GG P  S Y +G Q++  +YLDPH  QP +++ + D   +  ++H    R +    
Sbjct: 295 LGIMGGTPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAK 352

Query: 402 IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHS 451
           +DPS  +GFY  D+ +F+  C+  +++   S+     P+FT+ + H +  +HS
Sbjct: 353 MDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLVEGHAQ--DHS 403


>gi|322795203|gb|EFZ18025.1| hypothetical protein SINV_08608 [Solenopsis invicta]
          Length = 403

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 110/371 (29%), Positives = 166/371 (44%), Gaps = 64/371 (17%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSK 163
           I  +   +W+LG  +   ++           L    +D  S++  +YRKGF PIG  +S 
Sbjct: 16  IPQTDEPVWILGRKYNAIKE-----------LDAIRRDIRSKLWFTYRKGFIPIGGCNST 64

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFS 222
            TSD GWGCMLR  QM++AQAL+   LG+ W+  P  K  +  Y++IL  F D   + FS
Sbjct: 65  FTSDKGWGCMLRCGQMVLAQALITLHLGKDWQWMPETK--NNTYLKILSRFEDKRAAAFS 122

Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCR---------SWEALARCQRAETGLGCQSLPMAIY 273
           IH +   G + G   G W GP  + +          W +L      +  L    +     
Sbjct: 123 IHQIALTGASEGKEVGQWFGPNTIAQVLKKLIVYDEWSSLTIHVALDNTLIVNDILKQCR 182

Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
           +  G+     G  P+              K  + W P+LLL+PL LGL ++NP YI  L+
Sbjct: 183 IEGGETAEADGEVPL--------------KAPSQWKPLLLLIPLRLGLSEINPVYINGLK 228

Query: 334 L--------------------TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
           +                    +F   QSLG++GGKP  + Y +G   +  IYLDPH  Q 
Sbjct: 229 VKFKILCMQKKKYICIQFFQTSFKISQSLGVIGGKPNLALYFIGCVGDEVIYLDPHTTQR 288

Query: 374 V----INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
                  I ++++E D  TYH      I +  +DPS+A+ F+C  + +F   C    +  
Sbjct: 289 SGSVEDKISEEEIEMDI-TYHCKSASRIPITGMDPSVALCFFCATEKEFMSLCKSMQEEL 347

Query: 430 EESNGAPLFTV 440
                 PLF +
Sbjct: 348 ILPEKQPLFEL 358


>gi|148226916|ref|NP_001087417.1| cysteine protease ATG4D [Xenopus laevis]
 gi|61211765|sp|Q68FJ9.1|ATG4D_XENLA RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related protein 4 homolog D
 gi|51260960|gb|AAH79754.1| MGC84754 protein [Xenopus laevis]
          Length = 469

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 103/356 (28%), Positives = 161/356 (45%), Gaps = 63/356 (17%)

Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
           ++ +  F +DF SR+ ++YR+ F  +  + +T+D GWGCM+RS QML+AQ LL H L R 
Sbjct: 93  DDEIERFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSRE 152

Query: 194 WR--KPLQKPF----------------------------------------DREYVEILH 211
           W   + L + F                                        D+ +  I+ 
Sbjct: 153 WTWSEALYRHFVEMEPIRSSSPPSMPLSSLATGHSAGDYQPHTQCSGAPHGDQVHRNIMR 212

Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
            F D   SPF +H L+  G  +G  AG W GP         +A   +       +   ++
Sbjct: 213 WFSDHPGSPFGLHQLVTLGSIFGKKAGDWYGP-------SIVAHIIKKAIETSSEVPELS 265

Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
           +YV S D    +     +   D     +    G+A    +++LVP+ LG E  NP Y   
Sbjct: 266 VYV-SQDCTVYKADIEQLFAGDVPHAETSRGAGKA----VIILVPVRLGGETFNPVYKHC 320

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
           L+     P  LGI+GGKP  S Y +G Q+   +YLDPH  QP I+  K+D   +  ++H 
Sbjct: 321 LKEFLRMPSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQPYIDTSKNDFPLE--SFHC 378

Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL-----AEESNGAPLFTVTQ 442
           +  R I +  +DPS    FY ++ +DF   C    K+     AEE    P+F++++
Sbjct: 379 NSPRKISITRMDPSCTFAFYAKNSEDFGKLCDHLMKVLHSPRAEEK--YPIFSISE 432


>gi|339252578|ref|XP_003371512.1| cysteine protease ATG4B [Trichinella spiralis]
 gi|316968242|gb|EFV52545.1| cysteine protease ATG4B [Trichinella spiralis]
          Length = 414

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 115/378 (30%), Positives = 177/378 (46%), Gaps = 63/378 (16%)

Query: 109 STSDIWLLGVCHKIAQDE-------ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           S S IWLLG   + A++E          + +    L++F +DF +RI  +YR GF  I  
Sbjct: 45  SHSPIWLLG--KQYAKNEPRPNLRRGFDENSAVGKLSDFLEDFRTRIWFTYRHGFPCIPG 102

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDREYVEILHLFGDSET 218
           +K  +D GWGC +RS QML+A+ +L H LGR W   +  L +     + +++ LF D+ T
Sbjct: 103 TKFDNDCGWGCTIRSGQMLLAETMLRHYLGRDWLLGQSGLPEDEALMHRKVIGLFCDNLT 162

Query: 219 SPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSG 277
           SPFS+HNL+Q G+  +G  AGSW GP ++ +  + +A     E GL      +A++V+  
Sbjct: 163 SPFSLHNLVQVGQQLFGKQAGSWYGPVSVLQILQ-VAMNNAIERGL---VEGLAVHVIGD 218

Query: 278 DE----DGERGG-----APV----------------VCIDDASRHCSV------------ 300
            E    D ER G     APV                    D  R  SV            
Sbjct: 219 GELIIDDVERLGCGLTLAPVPRRGPENDLADRQPKSSSYLDLRRLTSVSNGDLLPSHDGE 278

Query: 301 ------FSKGQADWTP-ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
                 F      W+  +L+L+PL LG+EK N  Y   L+   +    +G++GG+     
Sbjct: 279 SIGSTEFVDETRSWSRGVLVLLPLRLGVEKFNQLYSDHLKRVLSTKFCVGVIGGRHHKCY 338

Query: 354 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 413
           Y  G   +  I LDPH  QP ++  +  +     ++H    +   +  IDP  +IGFY R
Sbjct: 339 YFCGWHTDYLIRLDPHYSQPAVDATQPGVS--LHSFHCKYPKKTLIADIDPWCSIGFYIR 396

Query: 414 DKDDFDDFCARASKLAEE 431
           ++ +   F A  S++  E
Sbjct: 397 NRLELQSFLADISEVGFE 414


>gi|403296347|ref|XP_003939073.1| PREDICTED: cysteine protease ATG4D [Saimiri boliviensis
           boliviensis]
          Length = 463

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 109/409 (26%), Positives = 173/409 (42%), Gaps = 101/409 (24%)

Query: 76  NGWTAAVKRLV-TAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGN 134
           NG   AV R++  AG    +       SRT  S  +S    + +C +  + E  GD    
Sbjct: 80  NGIAVAVMRVLHLAGRCPHVSPGWAVKSRTSFSKISS----IHLCGRRYRFEGEGD---- 131

Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
             +  F +DF SR+ ++YR+ F P+    +TSD GWGCMLRS QM++AQ LL H L R W
Sbjct: 132 --IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDW 189

Query: 195 R---------KPLQKPF-------------------------DREYVEILHLFGDSETSP 220
                       L  P                          +R + +I+  F D   +P
Sbjct: 190 TWAEGTGLGPPELSGPASPSRYHGPARWMPPCWAQGAPELEQERRHRQIVSWFADHPQAP 249

Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
           F +H L++ G++ G  AG W GP         +A   R       +   + +YV      
Sbjct: 250 FGLHRLVELGQSSGKKAGDWYGP-------SLVAHILRKAVESSSEVTRLVVYV------ 296

Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
                         S+ C+    G+   TP L  +                LR       
Sbjct: 297 --------------SQDCT----GKGTCTPSLQEL----------------LRCELC--- 319

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
            LGI+GGKP  S Y +G Q++  +YLDPH  QP +++ + +   +  ++H    R +   
Sbjct: 320 -LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE--SFHCTSPRKMAFA 376

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 446
            +DPS  +GFY  D+ +F+  C+  +++   S+     P+FT+ + H +
Sbjct: 377 KMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQ 425


>gi|410918329|ref|XP_003972638.1| PREDICTED: cysteine protease ATG4D-like [Takifugu rubripes]
          Length = 499

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 115/387 (29%), Positives = 175/387 (45%), Gaps = 76/387 (19%)

Query: 120 HKIAQDEALGDAAGNNG---LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC---- 172
           +KI+    LGD+   N    +  F   F SRI ++YRK F  +  S  T+D GWGC    
Sbjct: 83  NKISPVTILGDSYLLNSEDEVERFRLAFVSRIWLTYRKEFPQLEGSTWTTDCGWGCMLRS 142

Query: 173 --MLRSSQMLV-----------AQAL------LFH-----RLG----------------- 191
             ML +  +LV           AQ L      +F      R G                 
Sbjct: 143 GQMLLAQGLLVHLMPRGWTWPDAQPLTDVDLEVFRPRSPARAGGVPIPSFASPRGPSTPE 202

Query: 192 RPW----------RKPLQKPFDRE----YVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
           RP           +K L+   DR+    + +++  FGD  T+PF IH L++ GK+ G  A
Sbjct: 203 RPLLSEQATKCSRKKRLESVQDRQAEPTHQKLVFWFGDQPTAPFGIHQLVEIGKSAGKKA 262

Query: 238 GSWVGPYAMCRSW-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR 296
           G W GP  +     +A+AR     +        + +YV   D    +     +C    S+
Sbjct: 263 GDWYGPAIVAHILRKAVARASAVHS--------LVVYVAQ-DCTVYKEDVMHLCDPTPSQ 313

Query: 297 HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIV 356
             S     QA W  +++LVP+ LG E +NP YI  ++        +GI+GGKP  S Y V
Sbjct: 314 TPSDPLSHQA-WKSVIILVPVRLGGECLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFV 372

Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
           G Q+E  +YLDPH  QPV+++ +  + +   ++H +  + +  + +DPS  IGFY + K 
Sbjct: 373 GFQDEQLLYLDPHYCQPVVDVSQ--VNSSLESFHCNAPKKMPFNRMDPSCTIGFYAKSKK 430

Query: 417 DFDDFC-ARASKLAEESNGAPLFTVTQ 442
           DF+  C A  + L+      PLFT  +
Sbjct: 431 DFESLCSAVGTALSSSKERYPLFTFIE 457


>gi|396482697|ref|XP_003841525.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
 gi|312218100|emb|CBX98046.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
          Length = 462

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 109/353 (30%), Positives = 152/353 (43%), Gaps = 82/353 (23%)

Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVG 169
           A   N  + F  DF SRI ++YR GF  I  S+                      TSD G
Sbjct: 87  AQYGNWPSAFLDDFESRIWMTYRSGFPVIQKSQDPKATSAMSFRVRMQNLASPGFTSDTG 146

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           +GCM+RS Q ++A AL   RLGR WR     P  +E+  IL LF D   +PFSIH  ++ 
Sbjct: 147 FGCMIRSGQCILANALQTLRLGRDWRY-QDDPTAQEHCNILSLFADDPQAPFSIHRFVEH 205

Query: 230 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G A  G   G W GP A  R  + L   +  E GL        +YV SGD      GA V
Sbjct: 206 GAAVCGKYPGEWFGPSAAARCIQDLVH-KYKEAGL-------RVYV-SGD------GADV 250

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +D  +  +V   G+  W P L+LV   LG++K+ P Y   L+ +    QS+GI GG+
Sbjct: 251 Y--EDKLKQVAVEEDGE--WIPTLILVGTRLGIDKITPVYWEALKASLQMKQSMGIAGGR 306

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVI----NIGKDDLEA-------------------- 384
           P AS Y V  Q     YLDPH  +P +        D+                       
Sbjct: 307 PSASHYFVATQANHFFYLDPHSTRPHLPYRPPTSSDETTTQLASSITSTSSSTTIVPSAS 366

Query: 385 ----------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
                           D S+ H+  IR + +  +DPS+ + F    ++D++ +
Sbjct: 367 SLPPRSPPEPSTYTLDDISSCHTRRIRRLQIREMDPSMLLAFLVTSQEDYEKW 419


>gi|216963264|gb|ACJ73916.1| autophagy-related 4b variant 4 [Zea mays]
          Length = 208

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 77/168 (45%), Positives = 109/168 (64%), Gaps = 12/168 (7%)

Query: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
           S+  K S+LS +F    ++FE   + S++   A   K    S  W+  ++R V +GSM R
Sbjct: 45  SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104

Query: 94  IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
           +    LG +R     ++ D+W LG C++++ ++E  G +  ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157

Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
           RKGFD I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 205


>gi|216963276|gb|ACJ73918.1| autophagy-related 4b variant 6 [Zea mays]
          Length = 271

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 77/168 (45%), Positives = 109/168 (64%), Gaps = 12/168 (7%)

Query: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
           S+  K S+LS +F    ++FE   + S++   A   K    S  W+  ++R V +GSM R
Sbjct: 45  SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104

Query: 94  IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
           +    LG +R     ++ D+W LG C++++ ++E  G +  ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157

Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
           RKGFD I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 205


>gi|216963270|gb|ACJ73917.1| autophagy-related 4b variant 5 [Zea mays]
          Length = 292

 Score =  141 bits (355), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 77/168 (45%), Positives = 109/168 (64%), Gaps = 12/168 (7%)

Query: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNK----SNGWTAAVKRLVTAGSMRR 93
           S+  K S+LS +F    ++FE   + S++   A   K    S  W+  ++R V +GSM R
Sbjct: 45  SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRILRRFVGSGSMWR 104

Query: 94  IHERVLGPSRTGISSSTSDIWLLGVCHKIA-QDEALGDAAGNNGLAEFNQDFSSRILISY 152
           +    LG +R     ++ D+W LG C++++ ++E  G +  ++G A F +DFSSRI I+Y
Sbjct: 105 L----LGCARV---LTSGDVWFLGKCYRVSPEEEESGGSDSDSGHAAFLEDFSSRIWITY 157

Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
           RKGFD I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 158 RKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 205


>gi|393219109|gb|EJD04597.1| hypothetical protein FOMMEDRAFT_133827 [Fomitiporia mediterranea
           MF3/22]
          Length = 1147

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 99/298 (33%), Positives = 135/298 (45%), Gaps = 62/298 (20%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 164
           G N    F  DFSSR+ ++YR  + PI D  +                            
Sbjct: 335 GANWPPGFYSDFSSRVWLTYRSHYPPIRDQTLAQLEAEASGQIPLQPVSASPRKWHILGS 394

Query: 165 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDS 216
                TSD GWGCMLR+ Q L+A AL+   LGR WR+P Q  +  +   YV+IL  F DS
Sbjct: 395 GEKGWTSDSGWGCMLRTGQSLLANALIHLHLGRDWRRPPQPVYTVDYATYVKILTWFFDS 454

Query: 217 ET--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV 274
                PFS+H +  AGK  G   G W GP     + + +     AE GLG  S+     V
Sbjct: 455 TDIHCPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTVVHA-FAEAGLGV-SVATDGVV 512

Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD---------------W--TPILLLVPL 317
              D        P +      RH  + +   +                W   P+L+LV +
Sbjct: 513 YETDVLAASNAGPYMY-----RHSRMATSSPSTRRRRSAQQQQSMMSIWGQRPVLVLVGI 567

Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
            LG++ VNP Y   ++  FTFPQS+GI GG+P +S Y VGVQ ++  YLDPH  +P +
Sbjct: 568 RLGIDCVNPVYYDAVKALFTFPQSVGIAGGRPSSSYYFVGVQTDNLFYLDPHHSRPSV 625



 Score = 47.0 bits (110), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 18/42 (42%), Positives = 29/42 (69%)

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
           T+H D +R + L S+DPS+ IGF CRD+ D+ D   R ++++
Sbjct: 728 TFHCDRVRKMPLSSLDPSMLIGFLCRDERDWKDLRERVTEMS 769


>gi|355703136|gb|EHH29627.1| Cysteine protease ATG4D [Macaca mulatta]
          Length = 511

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 163/382 (42%), Gaps = 80/382 (20%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
           +S S I  + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+    +TSD
Sbjct: 129 TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 182

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPW-------------------------------RK 196
            GWGCMLRS QM++AQ LL H L R W                               R 
Sbjct: 183 CGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRW 242

Query: 197 PLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 253
               P    +R + +I+  F D   +PF +H L++ G++ G  AG W GP         +
Sbjct: 243 AQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLV 295

Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS------KGQAD 307
           A   R      C  +   +  VS D       +PV     +     +        + +  
Sbjct: 296 AHILRKAVE-SCSEVTRLVVYVSQDCTAAEASSPVSDTPASGPLHLLPLLLGVLFQQRCR 354

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W    L V  +L  E                   LGI+GGKP  S Y +G Q++  +YLD
Sbjct: 355 W----LFVCELLRCELC-----------------LGIMGGKPRHSLYFIGYQDDFLLYLD 393

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  ++
Sbjct: 394 PHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTR 451

Query: 428 LAEESNGA---PLFTVTQTHKK 446
           +   S+     P+FT+ + H +
Sbjct: 452 VLGSSSATERYPMFTLAEGHAQ 473


>gi|426329870|ref|XP_004025954.1| PREDICTED: cysteine protease ATG4C [Gorilla gorilla gorilla]
          Length = 491

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/452 (25%), Positives = 176/452 (38%), Gaps = 116/452 (25%)

Query: 108 SSTSDIWLLGVCHKIA---QDEALGDAAGNN--------GLAEFNQDFSSRILISYRKGF 156
           S  S + LLG C+      +D+ L   +G           + EF +DF SRI ++YR+ F
Sbjct: 36  SRNSPVLLLGKCYHFKYEDEDKTLPTESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEF 95

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP----------------LQK 200
             I  S +T+D GWGC LR+ QML+AQ L+ H LGR W  P                  K
Sbjct: 96  PQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVK 155

Query: 201 PF----------DREYV--------------------------EILHLFGDSETSPFSIH 224
            F          +RE+                           +I+  FGDS  + F +H
Sbjct: 156 KFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLH 215

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            L++ GK  G  AG W GP  +           R     G     + IYV          
Sbjct: 216 QLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG-----ITIYVAQD------- 263

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              V   D   +  +  +   AD   +++LVP+ LG E+ N  Y+  ++   +    +GI
Sbjct: 264 -CTVYNYDVIDKQSASMTSDNADDKAVIILVPVRLGGERTNTDYLEFVKGILSLEYCVGI 322

Query: 345 VGGKPGASTYIVGVQE----------------ESAIYLDPHDVQPVINIGKDDLEADT-- 386
           +GGKP  S Y  G QE                ++ + L+  + +P +  G +D   +   
Sbjct: 323 IGGKPKQSYYFAGFQENEVQRSSMNSLKQKSSKNNLKLEGSEKRPQMGFGSEDEFKNILL 382

Query: 387 -------------STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 433
                         T+H    + +    +DPS  IGFYCR+  DF+      +K+ + S+
Sbjct: 383 DHVQAFGPPSYPRLTFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFERASEEITKMLKFSS 442

Query: 434 GA--PLFTVTQTHKK-------PVNHSDVLGE 456
               PLFT    H +         N  D+  E
Sbjct: 443 KEKYPLFTFVNGHSRDYDFTSTTTNEEDLFSE 474


>gi|409077121|gb|EKM77488.1| hypothetical protein AGABI1DRAFT_108018 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 1355

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 97/292 (33%), Positives = 133/292 (45%), Gaps = 65/292 (22%)

Query: 140 FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 165
           F  DF SRI ++YR  F  PI DS +T                                 
Sbjct: 334 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 393

Query: 166 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSE 217
                SD GWGCMLR+ Q L+A AL+   LGR WRKP    +  +Y   V+IL  F D+ 
Sbjct: 394 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 453

Query: 218 T--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
           +  +PFS+H +  AGK +G   G W GP     + + L               P +   V
Sbjct: 454 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 502

Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQAD--------W--TPILLLVPLVLGLEKVN 325
           S  +DG      V     A    +  +  ++         W   P+L+LV L LG++ VN
Sbjct: 503 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 562

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
           P Y  T++  FT PQS+GI GG+PG+S Y VG Q ++  YLDPH  +P I +
Sbjct: 563 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 614


>gi|426191859|gb|EKV41798.1| hypothetical protein AGABI2DRAFT_123279 [Agaricus bisporus var.
           bisporus H97]
          Length = 1261

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 101/292 (34%), Positives = 134/292 (45%), Gaps = 65/292 (22%)

Query: 140 FNQDFSSRILISYRKGF-DPIGDSKIT--------------------------------- 165
           F  DF SRI ++YR  F  PI DS +T                                 
Sbjct: 247 FYIDFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGG 306

Query: 166 -----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSE 217
                SD GWGCMLR+ Q L+A AL+   LGR WRKP    +  +Y   V+IL  F D+ 
Sbjct: 307 EKSWSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTP 366

Query: 218 T--SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
           +  +PFS+H +  AGK +G   G W GP     + + L               P +   V
Sbjct: 367 SPDAPFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRL-----------VNEFPRSGVGV 415

Query: 276 SGDEDGERGGAPVVCIDDA-------SRHCSVFSKGQA-DW--TPILLLVPLVLGLEKVN 325
           S  +DG      V     A       +   S  S  QA  W   P+L+LV L LG++ VN
Sbjct: 416 SVAKDGVLSQTDVFLASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVN 475

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
           P Y  T++  FT PQS+GI GG+PG+S Y VG Q ++  YLDPH  +P I +
Sbjct: 476 PIYYETIKTLFTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPHHTRPAIPL 527


>gi|302674653|ref|XP_003027011.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
 gi|300100696|gb|EFI92108.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
          Length = 858

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 94/275 (34%), Positives = 132/275 (48%), Gaps = 58/275 (21%)

Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPI------------------------GDSKITS 166
           AA +    EF  DF+SR+ ++YR GF PI                        G   +TS
Sbjct: 148 AAASGWPQEFFSDFASRLWLTYRSGFAPIRDMALEELEPVRGGALSTLTSALTGRRGLTS 207

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIH 224
           D GWGCMLR+ Q L+A AL+   +GR             Y+ ++ LF DS +  +PFS+H
Sbjct: 208 DAGWGCMLRTGQSLLANALVVAWMGRGALA--------LYIHLISLFLDSPSPSAPFSVH 259

Query: 225 NLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            +  AG+A G   G W GP     + +AL      + GLG          V+  EDG   
Sbjct: 260 RMALAGRALGKDVGQWFGPSTAAGAIKALVNAY-PDAGLG----------VAIAEDG--- 305

Query: 285 GAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
               V      R      + + +W   P+L+L+ + LGL+ VNP Y  T++  +TFPQSL
Sbjct: 306 ----VVYQTQRRQ----KEREREWGDQPVLVLLGIRLGLDGVNPIYYDTIKQLYTFPQSL 357

Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
           GI GG+P +S Y VG Q     YLDPH  +P + +
Sbjct: 358 GIAGGRPSSSYYFVGAQAGDLFYLDPHHARPTVPL 392



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 33/47 (70%)

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430
           A+T T+H + +R + +  +DPS+ IGF C+D+ D++D+  R SKL +
Sbjct: 537 AETRTFHCERVRKMPMSGLDPSMLIGFLCKDRADWEDWRTRVSKLPK 583


>gi|392572178|gb|EIW65350.1| hypothetical protein TRAVEDRAFT_33890 [Trametes versicolor
           FP-101664 SS1]
          Length = 997

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 91/282 (32%), Positives = 133/282 (47%), Gaps = 58/282 (20%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 166
           F  DF+SRI ++YR  F PI D+ +                                 T+
Sbjct: 298 FYADFTSRIWLTYRSQFFPIRDTTLAALDAELMDNPTGVPSSPPTKKWNWPLGGEKGWTT 357

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 221
           D GWGCMLR+ Q L+A AL+   LGR WR+P    +  +Y   V+I+  F D+ +   PF
Sbjct: 358 DAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQIVTWFLDNPSPLCPF 417

Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
           S+H +   GK  G   G W GP     + + L             + P A   V+   DG
Sbjct: 418 SVHRMALVGKDLGKDVGQWFGPSTAAGAIKTL-----------VHAFPEATLGVANAVDG 466

Query: 282 ERGGAPVVCIDDASRHCSVFSK----GQADW--TPILLLVPLVLGLEKVNPRYIPTLRLT 335
               + V     ASR     ++     + DW    +L+L+ + LG+E VNP Y  T++  
Sbjct: 467 TLYESDVYA---ASRSVMYSTRRHGHARMDWGDRAVLVLIGIRLGIEGVNPLYYNTIKTL 523

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
           +TFPQS+GI GG+P +S Y VG Q ++  YLDPH  +P + +
Sbjct: 524 YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPL 565



 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 2/73 (2%)

Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
           + +  T+H D +R + L  +DPS+ +GF C+D+ ++ D   R ++L    N   +F++  
Sbjct: 696 QTELKTFHCDRVRKMPLSGLDPSMLLGFLCKDEAEWLDLKDRIAELFR--NNKSIFSLAN 753

Query: 443 THKKPVNHSDVLG 455
              +  + SD +G
Sbjct: 754 EPPQYPSDSDDMG 766


>gi|116179672|ref|XP_001219685.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
 gi|88184761|gb|EAQ92229.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
          Length = 425

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 99/307 (32%), Positives = 142/307 (46%), Gaps = 73/307 (23%)

Query: 140 FNQDFSSRILISYRKGFDPI----------------------GD-SKITSDVGWGCMLRS 176
           F  DF SRI ++YR GF+PI                      GD +  +SD GWGCM+RS
Sbjct: 113 FLDDFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRS 172

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK-AYGL 235
            Q L+A ALL  +LGR WR+      +R    I+ LF D   +P+S+ N ++ G  A G 
Sbjct: 173 GQSLLANALLISQLGRDWRRTTDPGAER---NIVALFADDARAPYSLQNFVKHGAIACGK 229

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
             G W GP A  R  +ALA    +          + IY          G  P V  D   
Sbjct: 230 HPGEWFGPSATARCIQALADQHESS---------LRIYST--------GDLPDVYED--- 269

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
              S  +  + D                    + PTL L     QS+GI GG+P +S Y 
Sbjct: 270 ---SFLATARPD-----------------GETFHPTLIL---MEQSIGIAGGRPSSSHYF 306

Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           VGVQ +   YLDPH  +P +   ++ L     +  + H+  +R++H++ +DPS+ IGF  
Sbjct: 307 VGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSCHTRRLRYLHVEDMDPSMLIGFLI 366

Query: 413 RDKDDFD 419
           +D+DD+D
Sbjct: 367 QDEDDWD 373


>gi|449551395|gb|EMD42359.1| ATG4-like protein [Ceriporiopsis subvermispora B]
          Length = 988

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 94/272 (34%), Positives = 126/272 (46%), Gaps = 57/272 (20%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 167
           F  DF+SRI ++YR  F PI D+ +                                TSD
Sbjct: 305 FYSDFTSRIWVTYRSQFQPIRDTTLSALELELGESTAVATSPQPKKWNWPLGGEKGWTSD 364

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD------REYVEILHLFGDSETS-- 219
            GWGCMLR+ Q L+A  LL   LGR WR+P   P+         YV+IL  F D+ +   
Sbjct: 365 AGWGCMLRTGQSLLANTLLHLHLGRDWRRP---PYPICTADYATYVQILTWFFDNPSPLC 421

Query: 220 PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
           PFS+H +   GK  G   G W GP     + + L      E GLG  S+     +   D 
Sbjct: 422 PFSVHRMALVGKELGKEVGQWFGPSTAAGAIKTLVHA-FPEAGLGV-SVATDSVIYQSD- 478

Query: 280 DGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
                    V     S   S    G++ W    +L+LV + LGL+ VNP Y  T++  +T
Sbjct: 479 ---------VYTASRSNLGSPRRNGRSGWGDRAVLVLVGIRLGLDGVNPIYYDTIKALYT 529

Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
           FPQS+GI GG+P +S Y VG Q ++  YLDPH
Sbjct: 530 FPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 561



 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 49/96 (51%), Gaps = 15/96 (15%)

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
           T+H + IR + L  +DPS+ IGF C+D++D+ D   R + L+         T+     +P
Sbjct: 693 TFHCERIRKMPLSGLDPSMLIGFLCKDEEDWLDLRKRITDLSRTHK-----TIFSIQDEP 747

Query: 448 VN-HSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDD 482
            N  SD          DD++G+ S+++   +  ED+
Sbjct: 748 PNWPSD---------SDDNMGLESISEPDIDMPEDE 774


>gi|410989159|ref|XP_004000832.1| PREDICTED: cysteine protease ATG4A isoform 2 [Felis catus]
          Length = 336

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 94/334 (28%), Positives = 149/334 (44%), Gaps = 71/334 (21%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG-GKPGA 351
           D  + C V                                      P S   VG   PG 
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTVGESTPG- 202

Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLDSIDPSLAIGF 410
            T     Q +  I+LDPH  Q  +N  +++   D  T+H     + +++ ++DPS+A+GF
Sbjct: 203 -TLNASNQSDELIFLDPHTTQTFVNT-EENGTVDDQTFHCLQSPQRMNILNLDPSVALGF 260

Query: 411 YCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 261 FCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|350595874|ref|XP_003484197.1| PREDICTED: cysteine protease ATG4A [Sus scrofa]
          Length = 393

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 93/347 (26%), Positives = 158/347 (45%), Gaps = 68/347 (19%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PI          W  
Sbjct: 57  VWILGKQHLLKTEKS-----------KLLADISARLWFTYRRKFSPID---------WN- 95

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
                                W K  ++P  +EY  IL  F D +   +SIH + Q G  
Sbjct: 96  ---------------------WEKQKEQP--KEYQRILQCFLDRKDCCYSIHQMAQMGVG 132

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD----EDGERGGAPV 288
            G + G W GP  + +  + LA      +        +A+YV   +    ED ++     
Sbjct: 133 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDNTVVIEDIKKMCCAS 184

Query: 289 VCIDDA-------SRHCSVFSKG----QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
               DA       S + S  SKG    +  W P+LL+VPL LG+ ++NP Y+   +  F 
Sbjct: 185 ALSADAAVESRRDSLNASTQSKGPSACRPAWKPLLLIVPLRLGINQINPVYVDAFKECFK 244

Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 397
            PQSLG +GGKP  + Y +G   +  I+LDPH  Q  ++  ++ +  D + +     + +
Sbjct: 245 MPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQPPQRM 304

Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           ++ ++DPS+A+GF+C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 305 NILNLDPSVALGFFCQEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 350


>gi|302684483|ref|XP_003031922.1| hypothetical protein SCHCODRAFT_109321 [Schizophyllum commune H4-8]
 gi|300105615|gb|EFI97019.1| hypothetical protein SCHCODRAFT_109321, partial [Schizophyllum
           commune H4-8]
          Length = 602

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 102/311 (32%), Positives = 144/311 (46%), Gaps = 82/311 (26%)

Query: 112 DIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------- 164
           +IWL+GVCH               G  +F  DF++RI ++YR GF+ I D ++       
Sbjct: 114 EIWLMGVCHA-------------PGAPDFYADFATRIWLTYRSGFELIRDRQLIDLPPPV 160

Query: 165 ------------------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
                                   +SD GWGCMLR+ Q L+A ALL    GR WR+  + 
Sbjct: 161 ASLDGHLQGEWATDEAEPPGAYGFSSDSGWGCMLRTGQSLLANALLTAWFGRDWRRISEV 220

Query: 201 PFDRE--YVEILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
              +   YV +L LF D+   T+PFSIH +  AGK  G   G W GP     + + L   
Sbjct: 221 ETHQHSLYVHLLSLFLDTPHPTAPFSIHRMALAGKQLGKDIGQWFGPSTAAGAIKNL--- 277

Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT------- 309
                     + P+A            G   VV +D A     VF+   ++W+       
Sbjct: 278 --------VSAYPLA------------GIGVVVGMDGALSKSEVFTASHSEWSDEEAALD 317

Query: 310 ----PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
               P+L+L+ L LGL++VNP Y  T++  FTFPQS+GI GG+P +S + VG Q    IY
Sbjct: 318 WGDRPVLILLNLRLGLDRVNPIYHDTIKALFTFPQSVGIAGGRPCSSYHFVGAQGSDLIY 377

Query: 366 LDPHDVQPVIN 376
           LDPH  +  + 
Sbjct: 378 LDPHHTRNTVR 388



 Score = 41.6 bits (96), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 30/49 (61%)

Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
            AD +T+H    + + + + DPS+  GF C+D  D+DD+ AR S+L  +
Sbjct: 524 HADLATFHCTNPKMMPISAQDPSMLAGFLCKDIADWDDWRARMSRLPNQ 572


>gi|431905146|gb|ELK10197.1| Cysteine protease ATG4A [Pteropus alecto]
          Length = 342

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 93/333 (27%), Positives = 150/333 (45%), Gaps = 69/333 (20%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFS-SRILISYRKGFDPIGDSKITSDVGWG 171
           +W+LG  H +  D           L E    F+ +  L ++  G  P      +SD GWG
Sbjct: 35  VWILGKQHLLKTD----------SLPEIISHFTETSELTAHDGGTGP------SSDAGWG 78

Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGK 231
           CMLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + +   
Sbjct: 79  CMLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKM-- 136

Query: 232 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
                                            C  LP++  + + +  G          
Sbjct: 137 ---------------------------------CCILPLSADIATENPSGS--------- 154

Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
            +AS H    S     W P+LL+VPL LG+ ++NP Y+   +       SLG +GGKP  
Sbjct: 155 PNASNHSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFK-------SLGALGGKPNN 207

Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
           + Y +G   +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+
Sbjct: 208 AYYFIGFLGDELIFLDPHTTQTFVDTEENGTVDDQTFHCLQPPQRMNILNLDPSVALGFF 267

Query: 412 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           C+++ DFD +C+   K   + N   +F + Q H
Sbjct: 268 CKEEKDFDSWCSLVQKEILKEN-LRMFELVQKH 299


>gi|294654609|ref|XP_456671.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
 gi|218511938|sp|Q6BYP8.2|ATG4_DEBHA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|199429011|emb|CAG84627.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
          Length = 492

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 99/343 (28%), Positives = 156/343 (45%), Gaps = 79/343 (23%)

Query: 130 DAAGNNGLAEFNQDFSSRILISYRKGFDPIG----------------------------- 160
           D + ++G+ E  QD  S+I ++YR GF+PI                              
Sbjct: 77  DISVDDGVIE--QDIYSKIWLTYRTGFEPIAKCLDGPQPLSFVQSMVFNRNPISSTFNNF 134

Query: 161 -----DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW-----RKPLQKPFDREYVEIL 210
                +   T+DVGWGCM+R+SQ L+A       LGR +     R P        + EI+
Sbjct: 135 HGLLDNDNFTTDVGWGCMIRTSQALLANTYQLLFLGRGFSYGRDRSP-------RHDEII 187

Query: 211 HLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
            +F D   +PFS+HN ++      L    G W GP A   S + L           C + 
Sbjct: 188 DMFMDEPRAPFSLHNFIKVASESPLKVKPGQWFGPNAASLSIKRL-----------CDN- 235

Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP----ILLLVPLVLGLEKV 324
              +Y  +G      G   VV  + ++ +  + ++      P    IL+L+P+ LG++KV
Sbjct: 236 ---VYESNG-----TGRVKVVISESSNLYDDIITQMFTTLNPVPDAILVLLPVRLGIDKV 287

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
           NP Y  ++       QS+GI GGKP +S Y  G +    +YLDPH  Q V N       +
Sbjct: 288 NPLYHASVLELLALRQSVGIAGGKPSSSFYFFGYKGNDLLYLDPHYPQFVRN-----KTS 342

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
              TYH++  + + +D +DPS+ IG   +D +D++DF +  +K
Sbjct: 343 VYDTYHTNSYQKLSVDDMDPSMMIGILIKDINDYEDFKSSCTK 385


>gi|444518589|gb|ELV12252.1| Cysteine protease ATG4B, partial [Tupaia chinensis]
          Length = 324

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 87/300 (29%), Positives = 130/300 (43%), Gaps = 56/300 (18%)

Query: 109 STSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDV 168
           ++  +W+LG  + +  ++            E   D +SR+  +YRK F  IG +  TSD 
Sbjct: 26  TSEPVWILGRKYSVLTEKE-----------EILSDVASRLWFTYRKNFPAIGGTGPTSDT 74

Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
           GWGCMLR  QM+ AQAL+   LGR WR          Y  +L+ F D + S +SIH + Q
Sbjct: 75  GWGCMLRCGQMIFAQALVCRHLGRDWRWAQWTQQPDSYFNVLNAFIDRKDSYYSIHQIAQ 134

Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
            G   G + G W GP  + +  + LA                                  
Sbjct: 135 MGVGEGKSIGQWYGPNTVAQVLKKLA---------------------------------- 160

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
                      VF    +    I +   +V G   +N  Y+ TL+  F  PQSLG++GGK
Sbjct: 161 -----------VFDTWSSLAVHIAMDNTVVTGEININEAYVETLKHCFMMPQSLGVIGGK 209

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P ++ Y +G   +  IYLDPH  QP + +    L  D S +       + +  +DPS+A+
Sbjct: 210 PNSAHYFIGYVGDELIYLDPHTTQPAVELTDSCLVPDESFHCQHPPSRMSIRELDPSIAV 269


>gi|19115683|ref|NP_594771.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe
           972h-]
 gi|62899818|sp|Q9P373.1|ATG4_SCHPO RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|9588465|emb|CAC00556.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe]
          Length = 320

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 99/341 (29%), Positives = 144/341 (42%), Gaps = 53/341 (15%)

Query: 91  MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 150
           M R  ER L  + T      + IW LG  +KI   +            +F  D  S I I
Sbjct: 4   MARFLERYLHFAPTNTEPPGTLIWFLGHSYKIEDSQ---------WPEKFLYDSFSLITI 54

Query: 151 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
           +YR G +  G   +TSD GWGCM+RS+Q L+A  L   R+  P         +++  EIL
Sbjct: 55  TYRSGIE--GLENMTSDTGWGCMIRSTQTLLANCL---RICYP---------EKQLKEIL 100

Query: 211 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
            LF D  ++PFSIH  +  GK    +  G W GP   C     +AR            +P
Sbjct: 101 ALFADEPSAPFSIHQFVTMGKTLCDINPGQWFGPTTSC---SCVARLSDQNP-----DVP 152

Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 329
           + +YV        R     V                    P+LLL+P  LG++ +N  Y 
Sbjct: 153 LHVYVARNGNAIYRDQLSKVSF------------------PVLLLIPTRLGIDSINESYY 194

Query: 330 PTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY 389
             L   F     +GI GG+P ++ Y    Q +   YLDPH         +    A   T+
Sbjct: 195 DQLLQVFEIRSFVGITGGRPRSAHYFYARQNQYFFYLDPHCTHFAHTTTQ---PASEETF 251

Query: 390 HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430
           HS  +R + +  +DP +  GF  RD++++  F A     A+
Sbjct: 252 HSATLRRVAIQDLDPCMIFGFLIRDEEEWHSFEANQKYFAD 292


>gi|402911089|ref|XP_003918175.1| PREDICTED: cysteine protease ATG4A isoform 2 [Papio anubis]
          Length = 336

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 88/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  + C V                                      P S    G +P   
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRP-LD 202

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
                 Q +  I+LDPH  Q  ++  ++ +  D + +     + +++ ++DPS+A+GF+C
Sbjct: 203 YLTASNQSDELIFLDPHTTQTFVDTEENGMVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|395854620|ref|XP_003799780.1| PREDICTED: cysteine protease ATG4A isoform 2 [Otolemur garnettii]
          Length = 336

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 91/333 (27%), Positives = 147/333 (44%), Gaps = 69/333 (20%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  + C V                                      P S    G  P  S
Sbjct: 181 DIKKMCCV-------------------------------------LPSSADTAGESPPGS 203

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS-DVIRHIHLDSIDPSLAIGFY 411
              +  Q    I+LDPH  Q  ++  +++   D  T+H     + +++ ++DPS+A+GF+
Sbjct: 204 LTALN-QSNELIFLDPHTTQTFVDT-EENGTVDDQTFHCLQSPQRMNILNLDPSVALGFF 261

Query: 412 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 262 CKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|119623101|gb|EAX02696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_g
           [Homo sapiens]
          Length = 340

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 33  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 82  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 141

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 142 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 184

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  + C V                                      P S    G +P  S
Sbjct: 185 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 207

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
                 Q +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 208 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 266

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 267 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 297


>gi|332226094|ref|XP_003262224.1| PREDICTED: cysteine protease ATG4A isoform 2 [Nomascus leucogenys]
          Length = 336

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  + C V                                      P S    G +P  S
Sbjct: 181 DIKKMCCV-------------------------------------LPLSADTAGDRPPDS 203

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
                 Q +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|30795248|ref|NP_840054.1| cysteine protease ATG4A isoform b [Homo sapiens]
 gi|426397038|ref|XP_004064735.1| PREDICTED: cysteine protease ATG4A isoform 2 [Gorilla gorilla
           gorilla]
 gi|15487242|emb|CAC69077.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
 gi|119623095|gb|EAX02690.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_b
           [Homo sapiens]
          Length = 336

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  + C V                                      P S    G +P  S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 203

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
                 Q +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|448114689|ref|XP_004202639.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
 gi|359383507|emb|CCE79423.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
          Length = 480

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 96/330 (29%), Positives = 150/330 (45%), Gaps = 60/330 (18%)

Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------------- 159
           LG   G++   E  +D  SRI  +YR GF+PI                            
Sbjct: 69  LGRRYGSSSKEEMEKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALD 128

Query: 160 ------GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 213
                  +   T+DVGWGCM+R+SQML+A A     LGR +        ++++ +I+ +F
Sbjct: 129 NIHGLFNNQNFTTDVGWGCMIRTSQMLLANAFQLLLLGRDF--AYVDGSEKKHSDIIDMF 186

Query: 214 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
            D   +PFS+HN ++A     L    G W GP A   S + L  C+    G    S    
Sbjct: 187 TDEPKTPFSLHNFIKAASDSPLKVKPGEWFGPNAASISIKRL--CKSQFDGSVSPSF-RV 243

Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
           I   S D   ++ G  +  I+++                IL+L+P+ LGL KV+P Y  +
Sbjct: 244 IISESCDIYDDKIGKLLQEIENSE-------------DAILILLPVRLGLNKVSPYYHDS 290

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
           L   F   Q +GI GGKP +S Y  G      +YLDPH  Q +         +   T+H+
Sbjct: 291 LSSLFCSSQLVGIAGGKPSSSYYFFGSHNGHLLYLDPHYPQSM------KASSIYDTFHT 344

Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           + ++ + ++ +DPS+ IG   + K+D++ F
Sbjct: 345 NKVQSLKIEDMDPSMLIGILIKSKEDYESF 374


>gi|336381646|gb|EGO22797.1| cysteine protease required for autophagy [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 992

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 98/284 (34%), Positives = 134/284 (47%), Gaps = 49/284 (17%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 164
           G+N    F  DF+SRI ++YR  F PI DS +                            
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350

Query: 165 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGD 215
                 TSD GWGCMLR+ Q L+A ALL   LGR WR+P       +Y   V+I+  F D
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPVHTTDYATYVQIITWFFD 410

Query: 216 --SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 273
             S  SPFS+H +  AGK  G   G W GP     + + L      E GLG       + 
Sbjct: 411 TPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGVI 469

Query: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333
             S     +   A    I    RH  V   G+A    +++L+ + LGL+ VNP Y  T++
Sbjct: 470 FQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTIK 520

Query: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
             +TFPQS+GI GG+P +S Y +G Q ++  YLDPH  +P + +
Sbjct: 521 ALYTFPQSVGIAGGRPSSSYYFMGSQADNLFYLDPHHARPAVPL 564



 Score = 45.1 bits (105), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 41/84 (48%), Gaps = 6/84 (7%)

Query: 357 GVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
           G  E +   LDP     V     D L     T+H D +R + +  +DPS+ +GF C+D++
Sbjct: 681 GDSEGAGEALDPMAEHYVNAYSPDQLR----TFHCDRVRKMPMSGLDPSMLLGFLCKDEN 736

Query: 417 DFDDFCARASKLAEESNGAPLFTV 440
           D+ DF  R + L        +FTV
Sbjct: 737 DWFDFRRRVNDLMHRHKT--IFTV 758


>gi|409050837|gb|EKM60313.1| hypothetical protein PHACADRAFT_179659 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 1009

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 98/290 (33%), Positives = 132/290 (45%), Gaps = 61/290 (21%)

Query: 140 FNQDFSSRILISYRKGFDPI-------------------------------GDSKITSDV 168
           F  DF+SRI ++YR  F PI                               GD   +SD 
Sbjct: 308 FYADFTSRIWLTYRSQFLPIRDMSLEELNAAPESAALSTGSQAKKWSWSLSGDKCWSSDA 367

Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFSI 223
           GWGCMLR+ Q L+A AL+   LGR WRKP       +Y   ++I+  F D  +   PFS+
Sbjct: 368 GWGCMLRTGQSLLANALIHVHLGRDWRKPPHPVPTSDYATYIQIITWFFDDPSLLCPFSV 427

Query: 224 HNLLQAGKAYGLAAGSWVGP--------YAMCRSWEALARCQRAETGLGCQSLPMA---I 272
           H +   GK  G+  G W GP        Y    S  ++   Q A   L   + P A   I
Sbjct: 428 HRMALVGKQLGVKVGQWFGPSTAAGAIKYVSAHS--SMVPNQPARRTL-VHAFPEAGLGI 484

Query: 273 YVVSGD---EDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPR 327
           YV +      D E   A    I    RH          W   P+L+L+   LG++ VNP 
Sbjct: 485 YVAADGGTIYDSEVFAASHSGIGSPRRHTRRV------WGDRPVLILIGHRLGIDGVNPI 538

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
           Y  TL+  +T+PQS+GI GG+P +S Y VG Q ++  YLDPH  +P I +
Sbjct: 539 YYDTLKTLYTWPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPTIPL 588



 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 36/51 (70%), Gaps = 1/51 (1%)

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLF 438
           T+H D +R + L S+DPS+ IGF C+D+ ++ D  +R ++L+ +S  +P+F
Sbjct: 728 TFHCDRVRKMPLSSLDPSMLIGFLCKDESEWQDLKSRINELSRKSK-SPVF 777


>gi|397497902|ref|XP_003819742.1| PREDICTED: cysteine protease ATG4A isoform 2 [Pan paniscus]
          Length = 336

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  + C V                                      P S    G +P  S
Sbjct: 181 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 203

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
                 Q +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 204 -LTASNQSDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 262

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 263 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 293


>gi|348520913|ref|XP_003447971.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
          Length = 500

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 74/239 (30%), Positives = 119/239 (49%), Gaps = 13/239 (5%)

Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
           ++  FGD   +PF +H L+  GK  G  AG W GP         +A   R          
Sbjct: 235 LVTWFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGP-------SVVAHILRKAVDKTSVVT 287

Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
            +A+YV    +D       VV + D S + +       DW  +++LVP+ LG E +NP Y
Sbjct: 288 NLAVYVA---QDCTVYKEDVVRLCDRSLNQTSSDPSSQDWKSVIILVPVRLGGEALNPSY 344

Query: 329 IPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTST 388
           I  ++        +GI+GGKP  S Y +G Q+E  +YLDPH  QPV+++ + +   +  +
Sbjct: 345 IDCVKNFLKLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDVSQINFSLE--S 402

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFTVTQTHKK 446
           +H    + +  + +DPS  IGFY ++K DF+  C+  S+ L+      P+FT  + H +
Sbjct: 403 FHCSSPKKMPFNRMDPSCTIGFYAKNKKDFESLCSAVSEALSSSKEKYPVFTFVEGHSQ 461



 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 30/61 (49%), Positives = 38/61 (62%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
           +  F   F SRI ++YR+ F  +  S  T+D GWGCMLRS QML+AQ LL H + R W  
Sbjct: 104 VERFRLAFVSRIWLTYRREFPQLEGSTWTTDCGWGCMLRSGQMLLAQGLLVHLMPRDWVW 163

Query: 197 P 197
           P
Sbjct: 164 P 164


>gi|260949671|ref|XP_002619132.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
 gi|238846704|gb|EEQ36168.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
          Length = 340

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 97/318 (30%), Positives = 147/318 (46%), Gaps = 63/318 (19%)

Query: 137 LAEFNQDFSSRILISYRKGFDPI------------------------------GDSKITS 166
           L E     +SR+  +YR GF+PI                               +   ++
Sbjct: 52  LEEIYPVINSRLWFTYRAGFEPIQKAEDGPSPLAFLKSMIFNVRPSMALGGLFDNQNYST 111

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-ILHLFGDSETSPFSIHN 225
           DVGWGCM+R+SQ L+A AL    LGR  + P       E VE I+ LFGD  T PFS+HN
Sbjct: 112 DVGWGCMIRTSQSLLANALQMLILGRDHQSPQAIQSAPEKVEKIIQLFGDDYTCPFSLHN 171

Query: 226 LLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
            ++   A  L    G W GP A   S + L  C + E+     ++ ++I       D E 
Sbjct: 172 FIKVASASPLKVKPGEWFGPSAASLSIKRL--CAKFESN-EIPNINVSICESCNLYDEEI 228

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
            G              +F + +   +P+L+L PL LG++K+N  Y P+L       QS+G
Sbjct: 229 RG--------------IFEESE---SPLLILFPLRLGIDKINSIYYPSLLQLLALKQSVG 271

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
           I GGKP +S Y  G Q  + +YLDPH++Q           +D  TYH+   + + + ++D
Sbjct: 272 IAGGKPSSSYYFFGFQGSNLLYLDPHNLQAA--------SSDPGTYHTSKFQTLSISNLD 323

Query: 404 PSLAIGFYCRDKDDFDDF 421
           P  A   +  ++  +DD+
Sbjct: 324 PLNAC--WSVNQMTYDDY 339


>gi|403289553|ref|XP_003935916.1| PREDICTED: cysteine protease ATG4A isoform 2 [Saimiri boliviensis
           boliviensis]
          Length = 360

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 88/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 53  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  + C V                                      P S    G +P  S
Sbjct: 205 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 227

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
                 + +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 228 -LTASNESDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 286

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 287 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 317


>gi|170109871|ref|XP_001886142.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
 gi|164639072|gb|EDR03346.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
          Length = 1039

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/278 (34%), Positives = 138/278 (49%), Gaps = 51/278 (18%)

Query: 140 FNQDFSSRILISYRKGFD-PIGDSKI-------------------------------TSD 167
           F  DF+SRI ++YR  F  PI D+++                               +SD
Sbjct: 336 FYIDFTSRIWLTYRSHFPTPIKDTRLADLCGDAAPEIANSPTTVKTRPWNWGGEKTWSSD 395

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDREYVEILHLFGDSET--SPFS 222
            GWGCMLR+ Q L+A AL+   LGR WR+P   +Q      YV+I+  F D+    +PFS
Sbjct: 396 TGWGCMLRTGQSLLANALVHMHLGRDWRRPPYPVQTADYATYVQIVTWFLDTPAPEAPFS 455

Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
           +H +  AGK +G   G W GP     + + L      E+GLG          VS   DG 
Sbjct: 456 VHRMALAGKEFGTDVGQWFGPSVAAGAIKTLVNS-FPESGLG----------VSVATDGT 504

Query: 283 RGGAPVVCI---DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
              + V  +   + +SR             P+LLL+ + LG+E VNP Y  T++L +TFP
Sbjct: 505 LFQSDVFAVSHGEMSSRSPRRIKTTTWGHRPVLLLLGIRLGIEGVNPIYYETIKLLYTFP 564

Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
           QS+GI GG+P +S Y VG Q ++  YLDPH+ +P I +
Sbjct: 565 QSVGIAGGRPSSSYYFVGSQADNLFYLDPHNTRPAIPL 602



 Score = 43.5 bits (101), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/43 (39%), Positives = 29/43 (67%)

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430
           T+H + +R + L  +DPS+ IGF CRD+ ++ DF  R ++L +
Sbjct: 739 TFHCERVRKMPLSGLDPSMLIGFLCRDEAEWWDFKKRVAELPK 781


>gi|14041938|dbj|BAB55042.1| unnamed protein product [Homo sapiens]
          Length = 280

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 84/254 (33%), Positives = 123/254 (48%), Gaps = 25/254 (9%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH + Q G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----VSGDEDGERGGAP 287
            G + G W GP  + +  + LA      +        +A+++     V  +E        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSS--------LAVHIAMDNTVVMEEIRRLCRTS 112

Query: 288 VVCID------DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
           V C        D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  
Sbjct: 113 VPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC 172

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIR 395
           F  PQSLG++GGKP ++ Y +G   E  IYLDPH  QP +         D S +      
Sbjct: 173 FMMPQSLGVIGGKPNSAHYFIGYVGEGLIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPC 232

Query: 396 HIHLDSIDPSLAIG 409
            + +  +DPS+A+G
Sbjct: 233 RMSIAELDPSIAVG 246


>gi|296236154|ref|XP_002763201.1| PREDICTED: uncharacterized protein LOC100409486 [Callithrix
           jacchus]
          Length = 360

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 146/332 (43%), Gaps = 67/332 (20%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 53  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 101

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 102 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 161

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 162 EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 204

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  + C V                                      P S    G +P  S
Sbjct: 205 DIKKMCRV-------------------------------------LPLSADTPGDRPPDS 227

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
                  +E  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 228 LTASNRSDE-LIFLDPHTTQTFVDAEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 286

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 287 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 317


>gi|302498547|ref|XP_003011271.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
           benhamiae CBS 112371]
 gi|291174820|gb|EFE30631.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
           benhamiae CBS 112371]
          Length = 437

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/316 (30%), Positives = 140/316 (44%), Gaps = 85/316 (26%)

Query: 139 EFNQDFSSRILISYRKGFDPI--------GDSK-----------------ITSDVGWGCM 173
           +F  DF S++ I+YR  F PI        GDS                   TSD GWGCM
Sbjct: 145 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSISLGVRLRSQLIDTQGFTSDTGWGCM 204

Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KA 232
           +RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +  G  A
Sbjct: 205 IRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGATA 261

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS-GDEDGERGGAPVVCI 291
            G   G W GP A  +  +AL +    + GL        +Y+ S G +  E+    V C 
Sbjct: 262 CGKCPGEWFGPSAASQCIQALVKSN-PQVGL-------RVYITSDGSDIYEKQFKEVACD 313

Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
           +                 P L+L+ + LG+++V P Y  +L+    FPQS+GI G +   
Sbjct: 314 ESG-----------GGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAGPE--- 359

Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
                                            + STYH+  +R +H+  +DPS+ IGF 
Sbjct: 360 ---------------------------------ELSTYHTRRLRRLHVREMDPSMLIGFL 386

Query: 412 CRDKDDFDDFCARASK 427
            RD+DD++D   R  +
Sbjct: 387 VRDEDDWEDLKQRVRE 402


>gi|432845798|ref|XP_004065858.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
          Length = 497

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 78/233 (33%), Positives = 120/233 (51%), Gaps = 13/233 (5%)

Query: 208 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 267
           +++ LFGD   +PF +H L+  GK  G  AG W GP  +      + R   A+T +G QS
Sbjct: 231 KLVTLFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGPSVVAH----ILRKAVAKTSVG-QS 285

Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPR 327
           L  A+YV    +D       V+ + D S    V       W  +++LVP+ LG E +NP 
Sbjct: 286 L--AVYVA---QDCTVYKEDVLQLCDPSLSQRVADPSSQAWKSVIILVPVRLGGEALNPS 340

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           YI  ++   +    +GI+GGKP  S Y +G Q+E  +YLDPH  QPV++  + +   +  
Sbjct: 341 YIECVKNILSLDCCIGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDFTQANFSLE-- 398

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEESNGAPLFT 439
           ++H    + +    +DPS  IGFY R K+DF+  C+     L+      P+FT
Sbjct: 399 SFHCSSPKKMPFSRMDPSCTIGFYARTKEDFESMCSVVGMVLSSSKEKYPIFT 451



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 11/73 (15%)

Query: 108 SSTSDIWLLGVCHKI-AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITS 166
           + TS I++LG  + + ++DE          +  F  DF SRI ++YR+ F  +  S +T+
Sbjct: 87  NKTSPIFVLGHAYLLNSEDE----------VERFRLDFVSRIWLTYRREFPQLEGSTLTT 136

Query: 167 DVGWGCMLRSSQM 179
           D GWGCMLRS QM
Sbjct: 137 DCGWGCMLRSGQM 149


>gi|241729578|ref|XP_002404604.1| cysteine protease, putative [Ixodes scapularis]
 gi|215505492|gb|EEC14986.1| cysteine protease, putative [Ixodes scapularis]
          Length = 433

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 120/414 (28%), Positives = 171/414 (41%), Gaps = 102/414 (24%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIGDSKITSDVG 169
           IWLLGV +     +  G +A  +  A F+   +DFSSR+  +YR+ F  I  + I +D G
Sbjct: 36  IWLLGVIYHRKMTQFYGASAVVDDGASFDAFLEDFSSRLWFTYRREFPAIPGTDIRTDCG 95

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWR------------------KPLQKPF-----DREY 206
           WGCMLRSSQM++AQA + H LGR WR                   PL++ F     D   
Sbjct: 96  WGCMLRSSQMILAQAFVMHLLGRQWRWQQVHTEAGEVRLPRHALWPLREGFRCTGGDGTA 155

Query: 207 VEIL----------HLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGP----YAMCRSW 250
           V +             FGD    ++PFS+HNL+Q G+  G  AG W GP    Y +  + 
Sbjct: 156 VLVRCSPKPVNDPPRWFGDKADASTPFSLHNLVQRGRESGKKAGDWYGPSSVAYILKDAL 215

Query: 251 E-ALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
           E A  R QR           + IYV              + +DD +  CS  S       
Sbjct: 216 EDAAHRDQRLAQ--------LCIYVAQD---------CTIYMDDVTALCSAGSTEGVT-- 256

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS--------------LGIVGGKPGASTYI 355
                        +  PR +   R  F+  Q+                +   K G S  +
Sbjct: 257 ------------HRRLPRTVFARREMFSGGQTQRMCIHSSWLHLFVFFVCFLKYGISFLL 304

Query: 356 -VGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
            +   EE  IYLDPH  Q ++++   D   D  ++H    R +    IDPS  IGFYC+ 
Sbjct: 305 QLSAAEEKVIYLDPHYCQEMVDVNSQDFPLD--SFHCSWPRKMSFSRIDPSCTIGFYCKT 362

Query: 415 KDDFDDFCARASKLA---EESNGAPLFTV--------TQTHKKPVNHSDVLGET 457
           K D +DF     +L    +  +  P+F +        T T K+P     VL + 
Sbjct: 363 KHDLEDFTKNIRELTVPKQMRHEYPVFLISEGSCSDHTDTEKRPEEIVHVLQDV 416


>gi|388581514|gb|EIM21822.1| hypothetical protein WALSEDRAFT_68740 [Wallemia sebi CBS 633.66]
          Length = 603

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 96/310 (30%), Positives = 137/310 (44%), Gaps = 63/310 (20%)

Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGF------DPIGDS------------------- 162
           LG+   NN  ++   DF SRI  +YR  F      DP+ D                    
Sbjct: 55  LGNLYDNN--SDLLDDFQSRIWCTYRSNFCQISLNDPMMDDLGLAKMQTLSSKPSHWLLR 112

Query: 163 --KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF--------DREYV---EI 209
                +D GWGCMLR+SQ L+A  L    LGR WR+    PF         +EYV   ++
Sbjct: 113 ERTFNTDQGWGCMLRTSQSLLANTLQIMLLGRQWRR---NPFVDLTDYAKRKEYVNLIKL 169

Query: 210 LHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 267
           L+LF D  S  SPFS+H +   GK+ G   G W GP     + + L   Q  +  L   S
Sbjct: 170 LNLFMDNPSTLSPFSVHRMAVVGKSLGKEVGEWFGPSTAALAIKHLVNNQ-TDINLSV-S 227

Query: 268 LPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVN 325
           +     +   D     GG                    ++W   P+L+LV + LGL+ ++
Sbjct: 228 VASDSVIYKSDVYQASGGTSTT--------------ADSEWGNKPVLILVGVRLGLDGIH 273

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
           PRY  TL+        +GI GG+P +S Y  G Q +S  Y+DPH ++P INI     E +
Sbjct: 274 PRYYETLKAFLRMQSCVGIAGGRPSSSYYFFGYQSDSLFYVDPHIMKPTINIKTPPTEGE 333

Query: 386 TSTYHSDVIR 395
             T   +++R
Sbjct: 334 LKTEIENLLR 343



 Score = 42.0 bits (97), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 37/62 (59%), Gaps = 5/62 (8%)

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
           A  STY  D  R +++  +DPS+ IGF  +D+++F +F  +  +L ++     +F+V  +
Sbjct: 470 ASISTYFCDKPRKMNISQMDPSMLIGFLVKDENEFFEFVNQIKELPQQ-----VFSVADS 524

Query: 444 HK 445
           H+
Sbjct: 525 HR 526


>gi|395545675|ref|XP_003774724.1| PREDICTED: cysteine protease ATG4A [Sarcophilus harrisii]
          Length = 431

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 74/260 (28%), Positives = 126/260 (48%), Gaps = 15/260 (5%)

Query: 194 WRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP---------Y 244
           W K  ++P   EY  IL  F D +   +SIH + Q G   G + G W GP          
Sbjct: 137 WEKHQEQP--EEYQRILKCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 194

Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG 304
           A+   W +LA     +  +  + +    ++   D   +   +    +D  +  C   + G
Sbjct: 195 ALFDEWNSLAVYVSMDNTVVIEDIKKMCHMCPSDLTHDSSSSSYNGLD-WNTDCPGQTSG 253

Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
              W P+LL+VPL LG+ ++NP Y    +  F  PQSLG +GGKP ++ Y +G   +  I
Sbjct: 254 ---WKPLLLIVPLRLGINQINPIYADAFKECFKMPQSLGALGGKPNSAYYFIGFLGDELI 310

Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
           YLDPH  Q  ++  ++    D S +       + + ++DPS+A+GF+C++++DFD++C  
Sbjct: 311 YLDPHTTQTFVDTEENGTVNDQSFHCQQSPPRMKILNLDPSVALGFFCKEEEDFDNWCGL 370

Query: 425 ASKLAEESNGAPLFTVTQTH 444
             K   +     +F + + H
Sbjct: 371 VQKEILKPQSLQMFELVEKH 390


>gi|37360148|dbj|BAC98052.1| mKIAA0943 protein [Mus musculus]
 gi|148707989|gb|EDL39936.1| autophagy-related 4B (yeast), isoform CRA_d [Mus musculus]
          Length = 266

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 60/169 (35%), Positives = 95/169 (56%), Gaps = 6/169 (3%)

Query: 293 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
           D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  F  PQSLG++G
Sbjct: 71  DSERHCNGFPAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 130

Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 406
           GKP ++ Y +G   E  IYLDPH  QP + +       D S +       + +  +DPS+
Sbjct: 131 GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPSRMGIGELDPSI 190

Query: 407 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
           A+GF+C+ ++DF+D+C +  KL++     P+F + +     +   DVL 
Sbjct: 191 AVGFFCKKEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 239


>gi|322707969|gb|EFY99546.1| ATG4 protein [Metarhizium anisopliae ARSEF 23]
          Length = 430

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 105/355 (29%), Positives = 153/355 (43%), Gaps = 73/355 (20%)

Query: 138 AEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVGWGCML 174
           A F  DF+SR  ++YR  F       DP                +  S  TSD GWGCM+
Sbjct: 121 AAFLDDFASRFWMTYRSNFEIIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSGWGCMI 180

Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 233
           RS Q L+A A+    LGR WR+ +    DRE   +L LF D   +P+SIHN ++ G+ Y 
Sbjct: 181 RSGQSLLANAMAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSIHNFVRHGEKYC 237

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
               G W GP A  R  + L   ++ E         + IY          G  P +  D+
Sbjct: 238 SKYPGEWFGPSATARCIQDLVNSRKQE---------LRIYST--------GDGPDIYEDN 280

Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
             +   +       + P L+LV   LG++K+ P Y   L  +    QS+GI GG+P +S 
Sbjct: 281 FMK---IAKPDGEVFHPTLVLVGTRLGIDKITPVYWEALIASVQMSQSVGIAGGRPSSSH 337

Query: 354 YIVGVQEESAIYLDPHDVQPVINIGKDDLEA---DTSTYHSDVIRHIHLDSIDPSLAIGF 410
           Y VG Q     YLDPH  +  +    D       D  + H+  +R IH+  +DP+     
Sbjct: 338 YFVGSQGHFLFYLDPHHTRKALPYYSDVARYTIDDMDSCHTSRLRRIHVREMDPN----- 392

Query: 411 YCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDS 465
                      C  A+++ + +  + +  V          SD  GE GG+P D S
Sbjct: 393 -----------CHPANEIRDATGRSVIDEVELL-------SDEDGEDGGIPHDKS 429


>gi|156042330|ref|XP_001587722.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980]
 gi|154695349|gb|EDN95087.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 414

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 93/289 (32%), Positives = 134/289 (46%), Gaps = 34/289 (11%)

Query: 140 FNQDFSSRILISYRKGFDPIG---DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
           F  DF ++I ++YR  F  I    D K  S +     LRS   LV Q       G  W  
Sbjct: 103 FLDDFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRS--QLVDQGGFTSDTG--WGC 158

Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 255
                   E  +IL LF D   +P+SIH  ++ G  A G   G W GP        A AR
Sbjct: 159 SSSN----EERKILSLFADDPRAPYSIHKFVEHGASACGKHPGEWFGP-------SAAAR 207

Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
           C +A T    +S  + +Y+     D           +D     S+       +TP L+LV
Sbjct: 208 CIQALTNSQVES-ELRVYITGDGSD---------VYEDT--FMSIAKPNSTKFTPTLILV 255

Query: 316 PLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
              LGL+K+ P Y   L+ +   PQS+GI GG+P +S Y +GVQE    YLDPH  +P +
Sbjct: 256 GTRLGLDKITPVYWEALKSSLQMPQSVGIAGGRPSSSHYFIGVQESDFFYLDPHQTRPAL 315

Query: 376 NIG---KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
                 +D    D  + H+  +R +H+  +DPS+ I F  RD++D+ D+
Sbjct: 316 PFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDPSMLIAFLIRDENDWKDW 364


>gi|189515077|ref|XP_001333093.2| PREDICTED: cysteine protease ATG4D-like [Danio rerio]
          Length = 485

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 79/255 (30%), Positives = 123/255 (48%), Gaps = 27/255 (10%)

Query: 193 PWRKPLQKP---FDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
           P R P   P    D  + +++  FGD  ++PF +H L++ GK  G  AG W GP  +   
Sbjct: 210 PARCPSASPDPQVDALHRKVVSCFGDHPSAPFGVHQLVELGKESGKRAGDWYGPSVVAHM 269

Query: 250 W-EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
             +A+AR    E         +A+YV              V  +D    C     G   W
Sbjct: 270 LRKAVARAAEFED--------LAVYVAQDC---------TVYKEDVMSLCESSGVG---W 309

Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
             +++LVP+ LG E +NP YI  ++        +GI+GGKP  S + VG Q+E  +YLDP
Sbjct: 310 KSVVILVPVRLGGESLNPSYIECVKNILKLKCCIGIIGGKPKHSLFFVGFQDEQLLYLDP 369

Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK- 427
           H  QPV+++ + +   +  ++H +  R ++   +DPS  IG Y R K DF+  C   S+ 
Sbjct: 370 HYCQPVVDVTQANFSLE--SFHCNSPRKMNFSRMDPSCTIGLYARSKTDFESLCTAVSEA 427

Query: 428 LAEESNGAPLFTVTQ 442
           L+      P+FT  +
Sbjct: 428 LSSSKEKYPIFTFVE 442



 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 30/63 (47%), Positives = 40/63 (63%), Gaps = 1/63 (1%)

Query: 134 NNGLAE-FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
           N G  E F Q F S + ++YR+ F  +  S +T+D GWGCMLRS QM++AQ LL H +  
Sbjct: 92  NEGEVERFRQTFVSCVWLTYRREFPQLDGSSLTTDCGWGCMLRSGQMMLAQGLLLHLMPT 151

Query: 193 PWR 195
            WR
Sbjct: 152 DWR 154


>gi|441628985|ref|XP_004093160.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Nomascus
           leucogenys]
          Length = 441

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/328 (26%), Positives = 150/328 (45%), Gaps = 36/328 (10%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW--GCMLRSSQML-VAQALLFHR 189
           G      F  DF SR+ ++YR     +    I  D  W  G  L   ++   A    +H 
Sbjct: 98  GEGEHTAFPADFVSRLWLTYRXXXHCLTMCSIPPDWTWAEGTGLGPPELSGSASPSRYHG 157

Query: 190 LGRPWRKP--------LQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 241
             R W  P        L++  +R + +I+  F D   +PF +H L++ G++ G  AG W 
Sbjct: 158 PAR-WMPPRWAQGAPELEQ--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWY 214

Query: 242 GPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF 301
           GP         +A   R       +   + +YV       +   A +V   D +      
Sbjct: 215 GP-------SLVAHILRKAVESCSEVTRLVVYVSQTCSMYKADVARLVARPDPT------ 261

Query: 302 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
               A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++
Sbjct: 262 ----AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCQLCLGIMGGKPRHSLYFIGYQDD 317

Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
             +YLDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  
Sbjct: 318 FLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETL 375

Query: 422 CARASKLAEESNGA---PLFTVTQTHKK 446
           C+  +++   S+     P+FT+ + H +
Sbjct: 376 CSELTRVLSSSSAMERYPMFTLAEGHAQ 403


>gi|328722655|ref|XP_003247627.1| PREDICTED: cysteine protease ATG4B-like [Acyrthosiphon pisum]
          Length = 252

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 72/233 (30%), Positives = 118/233 (50%), Gaps = 32/233 (13%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I  +   +W+LG  +    D           L +   D  SR+  +YRKGF  IG++  T
Sbjct: 40  IPQTVDPVWILGKKYSTIID-----------LQQIRNDIQSRLWFTYRKGFVQIGNTNFT 88

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           SD GWGCMLR  QM++ QAL+F  LGR WR    K  D +Y++IL +F D  ++P+SIH 
Sbjct: 89  SDRGWGCMLRCGQMVIGQALIFLHLGRDWRWDPDKR-DIDYLKILRMFEDKRSAPYSIHQ 147

Query: 226 LLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGG 285
           +   G ++G   G W GP  + +  + LA             L   ++ V+ D       
Sbjct: 148 IALMGVSHGKQVGEWFGPNTIAQVLKKLA---------TMDELSSLVFHVALDN------ 192

Query: 286 APVVCIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLT 335
              + I++  + C+V  +  +    W P++L++PL LG+  +NP Y+  ++++
Sbjct: 193 --TLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQGVKVS 243


>gi|256071263|ref|XP_002571960.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
 gi|353229491|emb|CCD75662.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
          Length = 302

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 97/309 (31%), Positives = 148/309 (47%), Gaps = 42/309 (13%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET--SPFSIHNLLQAG 230
           M R  QML+AQAL+ H LGR WR    +      ++I+  F DS +  SP S+H L+Q  
Sbjct: 1   MFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLSLHRLVQMS 60

Query: 231 KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGDE--DGER 283
                  G W GP ++C    A+ R     + L  +   + +Y     V+  +E  D  R
Sbjct: 61  DR---KPGEWCGPSSIC---SAILRVMAKGSSLDSRLSQVQVYLARDRVIYREEIIDLAR 114

Query: 284 G------GAPVVCIDDASRHCSVFSKGQADW---------TPILLLVPLVLGL-EKVNPR 327
           G        P +   D   H +++ + Q+D          T ILLL+PL+ G   ++NPR
Sbjct: 115 GLHTSYQYQPKIYFTD---HTALY-RSQSDQTNDSHSFKPTAILLLIPLMFGKGNRINPR 170

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           YI  +   F+ P  +G++GG+   S+Y VG Q  S IYLDPH  QP  N+       D  
Sbjct: 171 YIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPTQNLNSPKFSVD-- 228

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
           ++H  + + +   +++PS A+GFYCR + +  D   R   L   S+         T  +P
Sbjct: 229 SWHCPIPKTMSAANLNPSCAVGFYCRTRGELSDLIDRLPILMSVSDNLQ----ASTRSRP 284

Query: 448 VNHS-DVLG 455
           V  + +VLG
Sbjct: 285 VAFTVEVLG 293


>gi|392586633|gb|EIW75969.1| hypothetical protein CONPUDRAFT_111807 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 1038

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 139/318 (43%), Gaps = 74/318 (23%)

Query: 123 AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI------------------ 164
           +Q  A     G +   EF  DF+SRI ++YR  F PI DS +                  
Sbjct: 271 SQSPASEKHPGQDWAPEFYADFTSRIWLTYRNQFAPIRDSTLSTLESDQTREPCTEMSSP 330

Query: 165 --------------TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YV 207
                         T+D GWGCMLR+ Q L+A ALL   LGR WR+P    +  +   YV
Sbjct: 331 SPKSRRWFGGEKGWTTDTGWGCMLRTGQTLLANALLHLHLGRDWRRPPYPLYTEDYATYV 390

Query: 208 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
           +I+  F DS    +PFS+H +  AGK  G   G W GP     + + L +    + GLG 
Sbjct: 391 QIITWFLDSPLPQAPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKRLVQA-FPDAGLGV 449

Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------W--TPIL 312
                                  V  D A     V+S    D           W    +L
Sbjct: 450 ----------------------AVASDGALYQTDVYSASYVDVGSPRNVRKLRWGGRAVL 487

Query: 313 LLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQ 372
           +L  + LG+  VNP Y  T++  F  PQS+GI GG+P +S Y +GVQ ++ IYLDPH  +
Sbjct: 488 VLFGIRLGINGVNPIYYDTIKGLFEIPQSVGIAGGRPSSSYYFMGVQGDNLIYLDPHHAR 547

Query: 373 PVINIGKDDLEADTSTYH 390
           P I + +   EAD    H
Sbjct: 548 PAIPL-RPLPEADEGNQH 564



 Score = 45.8 bits (107), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 38/69 (55%), Gaps = 5/69 (7%)

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
           A+  T+H D +R + L  +DPS+ +GF C+D++D+ DF  R + L   +      T+   
Sbjct: 716 AELKTFHCDRVRKMPLSGLDPSMLLGFLCQDEEDWIDFRHRITDLMHRNK-----TIFAI 770

Query: 444 HKKPVNHSD 452
             +P N S+
Sbjct: 771 QDEPPNWSE 779


>gi|90080692|dbj|BAE89827.1| unnamed protein product [Macaca fascicularis]
          Length = 263

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 62/169 (36%), Positives = 93/169 (55%), Gaps = 6/169 (3%)

Query: 293 DASRHCSVFSKGQ------ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
           D+ RHC+ F  G       + W P++LL+PL LGL  +N  Y+ TL+  F  PQSLG++G
Sbjct: 68  DSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 127

Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 406
           GKP ++ Y VG   E  IYLDPH  QP +         D S +       + +  +DPS+
Sbjct: 128 GKPNSAHYFVGYVGEELIYLDPHTTQPAVEPTGSCFIPDESFHCQHPPCRMSIAELDPSI 187

Query: 407 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
           A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 188 AVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 236


>gi|16551551|dbj|BAB71121.1| unnamed protein product [Homo sapiens]
          Length = 330

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 141/318 (44%), Gaps = 62/318 (19%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWR------------------------------------K 196
           MLRS QM++AQ LL H L R W                                      
Sbjct: 1   MLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60

Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
            L++  +R + +I+  F D   +PF +H L++ G++ G  AG W GP         +A  
Sbjct: 61  ELER--ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHI 111

Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
            R           + +YV       +   A +V   D +          A+W  +++LVP
Sbjct: 112 LRKAVESCSDVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVP 161

Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
           + LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDPH  QP ++
Sbjct: 162 VRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVD 221

Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA- 435
           + + D   +  ++H    R +     DPS  +GFY  D+ +F   C+  +++   S+   
Sbjct: 222 VSQADFPLE--SFHCTSPRKMAFAKTDPSCTVGFYAGDRKEFGTLCSELTRVLSSSSATE 279

Query: 436 --PLFTVTQTHKKPVNHS 451
             P+FT+ + H +  +HS
Sbjct: 280 RYPMFTLAEGHAQ--DHS 295


>gi|354544955|emb|CCE41680.1| hypothetical protein CPAR2_802300 [Candida parapsilosis]
          Length = 423

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 90/274 (32%), Positives = 136/274 (49%), Gaps = 44/274 (16%)

Query: 161 DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSP 220
           +   TSD GWGCM+R+SQ L+A ALL  +L     +  Q       ++IL LF D  TSP
Sbjct: 138 NDNFTSDAGWGCMIRTSQNLLAIALL--KLSEEHNESAQ-------LDILKLFQDDPTSP 188

Query: 221 FSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALA-RCQRAETGLGCQSLPMAIYVVSG 277
           FS+HN ++   +  L    G W GP A   S + L    ++ ET       P  I  V  
Sbjct: 189 FSLHNFIRVASSSPLLVKPGQWFGPNAASLSIKKLTIEAKKLET-------PGEIPYVYI 241

Query: 278 DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
            E+ +         DD      +F++ Q    P+LLL P+ LG+++VN  Y  ++    +
Sbjct: 242 SENAD-------LFDDEIE--DLFNEEQK---PLLLLFPVRLGIDQVNKYYYKSILQLLS 289

Query: 338 FPQSLGIVGGKPGASTYIVGVQEES-AIYLDPHDVQPV---INIGKDDLEADTSTYHSDV 393
            P S+GI GGKP +S Y +G + E+  +Y DPH  Q V   INI         +TYH+  
Sbjct: 290 LPYSVGIAGGKPSSSFYFIGYENENHLLYFDPHLPQVVEAPINI---------TTYHTAN 340

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
              + ++ +DPS+ IG   +  D++ +F    S+
Sbjct: 341 YNKLDIEMVDPSMMIGVLLKSMDEYKEFKQDCSE 374


>gi|406606786|emb|CCH41822.1| putative cysteine protease atg4 [Wickerhamomyces ciferrii]
          Length = 592

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 107/347 (30%), Positives = 155/347 (44%), Gaps = 62/347 (17%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG------- 160
           S   DIW     H  A+D    D   N    EF  D  +RI ++YR  F PI        
Sbjct: 75  SGLKDIWQTLRFH-TAEDNEKDDL--NKWPQEFIDDVYTRIWLTYRTKFSPIDRDPEGPS 131

Query: 161 ----------------DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
                           +   T+D GWGCM+R+SQ L+A ALL   +GR WR       + 
Sbjct: 132 PLSLNFFLRGQNYDLDNEHFTTDCGWGCMIRTSQSLLANALLNLHIGRDWR--YTGELNE 189

Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGL 263
            + EI+  F D  + PFSIH ++  GK       G W GP A  RS ++L          
Sbjct: 190 MHNEIVSWFIDCPSHPFSIHKIVDKGKLLSNKKPGEWFGPSAAARSIQSL---------- 239

Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
            C      + V  G + G+     V  +  A     VF        PIL+L+ L LG++ 
Sbjct: 240 -CNEFDSGVKVYIGSDSGDIYENDVFKV--AKDENGVFK-------PILILLGLRLGIDN 289

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
           +NP Y  +L+      +S+GI GG+P  S Y  G Q +   YLDPH  QP + +  D L+
Sbjct: 290 INPVYWDSLKAILNSKESIGIAGGRPSTSHYFFGFQGDHLFYLDPHLPQPAL-LHDDQLD 348

Query: 384 A------------DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
                        D ++ H+  +R IHL  +DPS+ +GF  +D++++
Sbjct: 349 TSVSESTEIVSSLDVNSVHTKKLRKIHLSEVDPSMLLGFLIKDENEW 395


>gi|67967551|dbj|BAE00258.1| unnamed protein product [Macaca fascicularis]
          Length = 330

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 82/311 (26%), Positives = 139/311 (44%), Gaps = 56/311 (18%)

Query: 173 MLRSSQMLVAQALLFHRLGRPW----------------------------------RKPL 198
           MLRS QM++AQ LL H L R W                                  +   
Sbjct: 1   MLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60

Query: 199 QKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 258
           +   +R + +I+  F D   +PF +H L++ G++ G  AG W GP         +A   R
Sbjct: 61  ELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILR 113

Query: 259 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 318
                  +   + +YV       +   A +V   D +          A+W  +++LVP+ 
Sbjct: 114 KAVESCSEVTRLVVYVSQDCTVYKADVARLVARPDPT----------AEWKSVVILVPVR 163

Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
           LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDPH  QP +++ 
Sbjct: 164 LGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVS 223

Query: 379 KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--- 435
           + D   +  ++H    R +    +DPS  +G Y  D+ +F+  C+  +++   S+     
Sbjct: 224 QADFPLE--SFHCTSPRKMAFAKMDPSCTVGSYAGDRKEFETLCSELTRVLGSSSATERY 281

Query: 436 PLFTVTQTHKK 446
           P+FT+ + H +
Sbjct: 282 PMFTLAEGHAQ 292


>gi|395323681|gb|EJF56143.1| hypothetical protein DICSQDRAFT_113447 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 999

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 91/278 (32%), Positives = 133/278 (47%), Gaps = 50/278 (17%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKI---------------------------------TS 166
           F  DF+SRI ++YR  F PI D+ +                                 TS
Sbjct: 303 FYADFTSRIWLTYRSQFFPIRDTTLAALEQEVHDSPTGLPSSPPSKRWNWPIGGEKGWTS 362

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSETS--PF 221
           D GWGCMLR+ Q L+A ALL   LGR WR+P    +  +Y   V+I+  F D+ +   PF
Sbjct: 363 DAGWGCMLRTGQSLLANALLHLHLGRDWRRPPHPVYTADYAMYVQIVTWFLDTPSPLCPF 422

Query: 222 SIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
           S+H +   GK  G   G W GP     + + L      + GLG     +A+   S   + 
Sbjct: 423 SVHRMALVGKDLGKEVGQWFGPSTAAGAIKTLVHS-FPDAGLG-----VAVASDSTLYES 476

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
           +   A    +    RH       + +W    +L+L+ + LG+E VNP Y  T++  +TFP
Sbjct: 477 DVYAASRSSVYSTRRH----GHPRMEWGDRAVLILIGIRLGIEGVNPLYYNTIKTLYTFP 532

Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
           Q++GI GG+P +S Y VG Q ++  YLDPH  +P I +
Sbjct: 533 QTVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAIPL 570



 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 15/46 (32%), Positives = 29/46 (63%)

Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
           + +  T+H D +R + L  +DPS+ +GF C+D+ ++ D   R ++L
Sbjct: 699 QTELKTFHCDRVRKMPLSGLDPSMLLGFLCKDEAEWLDLKERITEL 744


>gi|444321667|ref|XP_004181489.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
 gi|387514534|emb|CCH61970.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
          Length = 577

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 145/320 (45%), Gaps = 64/320 (20%)

Query: 138 AEFNQDFSSRILISYRKGFDPI-----GDSKI------------------------TSDV 168
            EF +D  SR++ +YR  F PI     G S I                        T+D+
Sbjct: 127 VEFLEDCKSRLIFTYRTNFSPIERAPDGPSPINVSVLFRDTLFNTVNHVLNNPNSFTTDI 186

Query: 169 GWGCMLRSSQMLVAQALLFHRLGRPWR------KPLQKPFDREYVEILHLFGDSETSPFS 222
           GWGCM+R+ Q L+  AL    LGR +R       P  K    E  +I+  F D+   PFS
Sbjct: 187 GWGCMIRTGQSLLGNALQIINLGRNFRINNQSNNPNTKNIKEE--DIIEWFYDNPNKPFS 244

Query: 223 IHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDG 281
           IH  +  G +      G W GP   C + ++L   +  E G+        + V SGD   
Sbjct: 245 IHKFVDKGMRISDKKPGEWFGPSTTCTAIQSLIY-EFPECGID----ECILSVSSGD--- 296

Query: 282 ERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
                  +  D+ + H   F K +   T IL+L+ + LG++K+N  Y   ++       S
Sbjct: 297 -------IYEDEINEH---FQKNEN--TIILILLGVKLGIDKINQCYFNDIKDILNSRYS 344

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
            GI GG+P +S Y  G   E   Y DPH  +P + + +D   +  ST +S ++    +  
Sbjct: 345 CGISGGRPSSSLYFFGHMNEYLYYFDPH--KPQLQLNEDFKNSCHSTDYSKIL----ISE 398

Query: 402 IDPSLAIGFYCRDKDDFDDF 421
           IDPS+ IGFY + K D+D+F
Sbjct: 399 IDPSMLIGFYLKGKKDWDNF 418


>gi|71043632|ref|NP_001020882.1| cysteine protease ATG4B [Rattus norvegicus]
 gi|68533688|gb|AAH98833.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Rattus
           norvegicus]
          Length = 224

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 58/169 (34%), Positives = 93/169 (55%), Gaps = 6/169 (3%)

Query: 293 DASRHCSVFSKGQA------DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
           ++ RHC+    G         W P++LL+PL LGL  +N  Y+ TL+  F  PQSLG++G
Sbjct: 29  ESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIG 88

Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 406
           GKP ++ Y +G   E  IYLDPH  QP + +       D S +       + +  +DPS+
Sbjct: 89  GKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDSCFIPDESFHCQHPPCRMGIGELDPSI 148

Query: 407 AIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
           A+GF+C+ ++DF+D+C +  KL++     P+F + +     +   DVL 
Sbjct: 149 AVGFFCKTEEDFNDWCQQVKKLSQLGGALPMFELVEQQPSHLACQDVLN 197


>gi|440789707|gb|ELR11008.1| cysteine protease atg4a, putative, partial [Acanthamoeba
           castellanii str. Neff]
          Length = 180

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 55/118 (46%), Positives = 80/118 (67%), Gaps = 1/118 (0%)

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W P+++LVP+ LG++ +NP YIPTL+  F+FPQ LG++GGKP +S Y VG Q+   +Y+D
Sbjct: 11  WHPVIILVPVRLGIQCLNPIYIPTLKAFFSFPQCLGVIGGKPHSSFYFVGYQDNKVLYMD 70

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           PH VQP + +  D L     +Y  ++ + +  D IDPSLA+GF C  + +FDDFC  A
Sbjct: 71  PHFVQPTVKMDDDPLFP-IESYRMEIPQAMSFDDIDPSLALGFLCSSQAEFDDFCLNA 127


>gi|299738612|ref|XP_001834660.2| cysteine protease [Coprinopsis cinerea okayama7#130]
 gi|298403389|gb|EAU87108.2| cysteine protease [Coprinopsis cinerea okayama7#130]
          Length = 1034

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/269 (34%), Positives = 128/269 (47%), Gaps = 50/269 (18%)

Query: 140 FNQDFSSRILISYRKGF-DPIGDSKI-------------------------------TSD 167
           F  DF+SRI ++YR  F  PI D ++                               +SD
Sbjct: 302 FYIDFTSRIWLTYRSHFPQPIKDGRLADLCGGPQPEPVASPVTKKSPWHWVGGEKSWSSD 361

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY---VEILHLFGDSET--SPFS 222
            GWGCMLR+ Q L+A AL+   LGR WRKP       +Y   V IL  F D+    +PFS
Sbjct: 362 SGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVMTADYATYVHILTWFLDTPAPEAPFS 421

Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
           +H +  AGK  G   G W GP     + +AL      E G+G     +A+ V     DG 
Sbjct: 422 VHRMALAGKELGTDVGQWFGPSVAAGAIKALVNS-FPEAGIG-----VAVAV-----DGV 470

Query: 283 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
                V              + +  W   P+LLL+ + LG+E VNP Y  T+++ +TFPQ
Sbjct: 471 LYQTDVHAASHGDHFGRTPRRHKRSWGDRPVLLLLGIRLGIEGVNPIYYDTIKMLYTFPQ 530

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPH 369
           S+GI GG+P +S Y VG Q ++  YLDPH
Sbjct: 531 SVGIAGGRPSSSYYFVGSQADNLFYLDPH 559



 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 16/45 (35%), Positives = 28/45 (62%)

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
           A+  T+H + +R + L  +DPS+ +GF CRD+ ++ D   R + L
Sbjct: 711 AELKTFHCERVRKMPLSGLDPSMLLGFLCRDEAEWVDLRKRVAGL 755


>gi|149022064|gb|EDL78958.1| rCG26842 [Rattus norvegicus]
          Length = 246

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 76/246 (30%), Positives = 112/246 (45%), Gaps = 53/246 (21%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKPHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWNWERQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 137

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 138 EGKSIGEWFGPNTVAQVIKKLALFDEWNS--------LAVYVSMDN---------TVVIE 180

Query: 293 DASRHCSVFSKGQAD-------------------------WTPILLLVPLVLGLEKVNPR 327
           D  + C V   G AD                         W P+LL+VPL LG+ ++NP 
Sbjct: 181 DIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKPLLLIVPLRLGINQINPV 240

Query: 328 YIPTLR 333
           YI   +
Sbjct: 241 YIEAFK 246


>gi|149020505|gb|EDL78310.1| rCG31864, isoform CRA_c [Rattus norvegicus]
          Length = 337

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 121/247 (48%), Gaps = 22/247 (8%)

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
           DR +  I+  F D   +PF +H L++ G++ G  AG W GP         +A   R    
Sbjct: 72  DRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP-------SVVAHILRKAVE 124

Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
             C  +   +  VS D    +         D +R  S +    A+W  +++LVP+ LG E
Sbjct: 125 -SCSEVTRLVVYVSQDCTVYKA--------DVARLVS-WPDPTAEWKSVVILVPVRLGGE 174

Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
            +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDPH  QP +++ + + 
Sbjct: 175 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVNQANF 234

Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFT 439
             +  ++H    R +    +DPS  +GFY  ++ +F+  C+   ++   S+     P+FT
Sbjct: 235 PLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMRILSSSSVTERYPMFT 292

Query: 440 VTQTHKK 446
           V + H +
Sbjct: 293 VAEGHAQ 299


>gi|344304092|gb|EGW34341.1| hypothetical protein SPAPADRAFT_59751, partial [Spathaspora
           passalidarum NRRL Y-27907]
          Length = 363

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 84/280 (30%), Positives = 133/280 (47%), Gaps = 43/280 (15%)

Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
           F+ R+  + R  FD        SDVGWGCM+R+SQ L+A AL+           LQ   +
Sbjct: 104 FNKRLFTTVRSLFD---SENFNSDVGWGCMIRTSQSLLANALM----------KLQPSAE 150

Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 261
            E   +++LF D+  S FS+HN ++      L    G W GP A   S + L    + +T
Sbjct: 151 HE---VINLFQDNIASAFSLHNFIRVASESPLEVKPGQWFGPNAASLSTKKLLDGMKGKT 207

Query: 262 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
             G +   + I   S   D E        I++     SV           L+L P+ LG+
Sbjct: 208 IQGVKYPHVFISENSDLYDEE--------IEELLVESSV-----------LILFPVRLGI 248

Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
           + VN  Y  ++      P ++GI GGKP +S Y +G Q++  +Y DPH  Q   N     
Sbjct: 249 DNVNSYYYDSIFQLLACPFTVGISGGKPSSSFYFLGYQDQDLLYFDPHSPQLYEN----- 303

Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
              + +TYH++  + +H+  +DPS+ +G   +DK ++ +F
Sbjct: 304 -PINYTTYHTNNYQRLHIHMLDPSMMVGILVKDKSEYKEF 342


>gi|50307871|ref|XP_453929.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|62899744|sp|Q6CQ60.1|ATG4_KLULA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|49643063|emb|CAH01025.1| KLLA0D19536p [Kluyveromyces lactis]
          Length = 450

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 95/309 (30%), Positives = 138/309 (44%), Gaps = 53/309 (17%)

Query: 143 DFSSRILISYRKGFDPI-----GDSKIT------------------------SDVGWGCM 173
           D  SR+  +YR  F PI     G S I                         SD+GWGCM
Sbjct: 64  DVHSRVFFTYRTQFTPIRRNENGPSPINFTLFFRDNPINTLENALTDPDSFYSDIGWGCM 123

Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KA 232
           +R+ Q L+A A+   +L R +R    +  D E + ++  F D    P S+HN ++A  K 
Sbjct: 124 IRTGQALLANAIQRVKLAREFRINASRIDDNE-LNLIRWFQDDVKYPLSLHNFVKAEEKI 182

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G+  G W GP A  RS + L      E    C      I   S D          +  D
Sbjct: 183 SGMKPGQWFGPSATARSIKTLI-----EGFPLCGIKNCIISTQSAD----------IYED 227

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           + +R   +F K +     +LLL  + LG++K+N  Y   +    + P S+GI GGKP +S
Sbjct: 228 EVTR---IFHKDRD--ANLLLLFAVRLGVDKINSLYWKDIFKILSSPYSVGIAGGKPSSS 282

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
            Y  G Q E+  YLDPH+ Q   ++  DDLE   S  H      +H+   DPS+ +G   
Sbjct: 283 LYFFGYQNENLFYLDPHNTQQS-SLMMDDLEFYRSC-HGHKFNKLHISETDPSMLLGMLI 340

Query: 413 RDKDDFDDF 421
             K+++D F
Sbjct: 341 SGKNEWDQF 349


>gi|403413274|emb|CCL99974.1| predicted protein [Fibroporia radiculosa]
          Length = 994

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/277 (32%), Positives = 129/277 (46%), Gaps = 49/277 (17%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKI--------------------------------TSD 167
           F  DF+SRI ++YR  F+PI D+ +                                TSD
Sbjct: 309 FYSDFTSRIWLTYRSQFEPIRDTSLSALNYDMDERAAPTSSPQPKRWNWGLGGEKGWTSD 368

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGD--SETSPFS 222
            GWGCMLR+ Q L+A ALL   LGR WR+P    +  +   YV+I+  F D  S   PFS
Sbjct: 369 SGWGCMLRTGQSLLANALLHLHLGRDWRRPPYPIYTADFATYVQIISWFLDDPSPLCPFS 428

Query: 223 IHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
           +H +   GK  G   G W GP     + + L      E GLG     +A+  V    D  
Sbjct: 429 VHRMALVGKELGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVS---VAVDGVIYQSDVY 484

Query: 283 RGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
                 + +    +H      G+  W    +L+L+ + LG++ VNP Y   ++  +T PQ
Sbjct: 485 AVSRSTMGLGSPRKH------GRPSWGDRAVLVLIGIRLGIDGVNPIYYDLIKALYTLPQ 538

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
           +LGI GG+P +S Y VG Q  +  YLDPH  +P I +
Sbjct: 539 TLGIAGGRPSSSYYFVGSQANNLFYLDPHHARPTIPL 575



 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 52/100 (52%), Gaps = 16/100 (16%)

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
           T+H D +R + L  +DPS+ IGF C+D++D+ D   R ++L              THK+ 
Sbjct: 711 TFHCDRVRKMPLSGLDPSMLIGFLCKDENDWIDLRRRLTELF------------NTHKRH 758

Query: 448 VNHSDVLGETGGVPED--DSLGVMSMNDAVGNAHEDDWQL 485
           +    +  E    P D  D++G+ S+++   +  E+D +L
Sbjct: 759 IFS--IQDEPPNWPSDSEDNIGLESISEPDIDLPEEDDEL 796


>gi|150864470|ref|XP_001383296.2| hypothetical protein PICST_30446 [Scheffersomyces stipitis CBS
           6054]
 gi|166990661|sp|A3LQU0.2|ATG4_PICST RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|149385726|gb|ABN65267.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 514

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 95/323 (29%), Positives = 153/323 (47%), Gaps = 43/323 (13%)

Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
           FS  +L + +   + I     T+DVGWGCM+R+SQ L+A    F RL       L K  D
Sbjct: 138 FSKSLLYNLQNFNNFIEKENFTTDVGWGCMIRTSQSLLANT--FVRL-------LDKQSD 188

Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAET 261
                I+ LF D+  +PFS+HN ++   +  L    G W GP A   S + L  C     
Sbjct: 189 -----IIALFNDTYLAPFSLHNFIRVASSSPLKVKPGEWFGPNAASLSIKRL--CDGYYD 241

Query: 262 GLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
               +++   I V+  +            ++ ++      +KG      +L+L+P+ LG+
Sbjct: 242 NSTSETILPRINVLISESTDLYDSQIAQLLEPSTE-----TKG------LLVLLPVRLGI 290

Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
           + +N  Y  +L    +  QS+GI GGKP +S Y  G Q+ S IY+DPH  Q    I   D
Sbjct: 291 DSINSYYFSSLLHLLSLEQSVGIAGGKPSSSFYFFGYQDNSLIYMDPHSAQ----IFSSD 346

Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF---CARASKLAEESNGAPLF 438
           +  D STY++   + + +  +DPS+ IG + RD   +++F   C  A+      +     
Sbjct: 347 I--DMSTYYATRYQRVDIGKLDPSMLIGVFIRDLTSYENFKKSCLDAANKIVHFHATERS 404

Query: 439 TVTQTHKK-----PVNHSDVLGE 456
           TV ++ +K      +N SD+  E
Sbjct: 405 TVPESRRKNSEFVNINRSDLKDE 427


>gi|219129924|ref|XP_002185127.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403306|gb|EEC43259.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 557

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/346 (28%), Positives = 141/346 (40%), Gaps = 48/346 (13%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL---- 198
           D  S    +YR  F  I    ITSD GWGCMLRS+QM++ QAL  H   R WR P     
Sbjct: 171 DERSLFWFTYRCDFPEIAPYNITSDAGWGCMLRSAQMMLGQALRLHFKSRDWRPPQLLAR 230

Query: 199 --QKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALAR 255
             Q  F R  +     +  S  S +S+HN++ AG   Y    G W GP   C     L  
Sbjct: 231 RRQDSFIRSVLTWFADYPSSSESVYSLHNMVAAGLSKYDKLPGEWYGPGTACYVMRDLVH 290

Query: 256 CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLV 315
               +  LG   L   I+ V     G      +           +  K +          
Sbjct: 291 IHEKQQALGKTRLDRRIFRVYVAPQGTVYRDTIHAFMTTEARVRIEEKKKVKEQTQPQAH 350

Query: 316 PLVLGLEK---------------------------VNPRYIPTLRLTFTFPQSLGIVGGK 348
           PL L  E+                           +N  Y+ +L  TF+ PQS+G++GG+
Sbjct: 351 PLDLEWEEELMESANTVEWDTALLLLVPLRLGLTSLNEEYVQSLAHTFSLPQSVGVLGGR 410

Query: 349 PGASTYIVGVQEE-SAIY-LDPHDVQ--PVINIGKDDLEADTSTYHS-DVIRHIHLD--- 400
           P  + +  G Q++ S I+ LDPH VQ  P     + + +A +    S D +R  H     
Sbjct: 411 PRGARWFYGAQKDGSKIFGLDPHTVQTAPGRQTARVNGQASSVVELSDDYLRSCHTTCPE 470

Query: 401 -----SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAP-LFTV 440
                 +DPS+A+GFYCR + D +          +E +  P LF+V
Sbjct: 471 MFPFCKMDPSIALGFYCRTRADLNHVLNSMGAWQKEHSSIPELFSV 516


>gi|148693225|gb|EDL25172.1| autophagy-related 4D (yeast), isoform CRA_a [Mus musculus]
          Length = 296

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 120/247 (48%), Gaps = 22/247 (8%)

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
           DR +  I+  F D   +PF +H L++ G++ G  AG W GP         +A   R    
Sbjct: 31  DRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP-------SVVAHILRKAVE 83

Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
             C  +   +  VS D    +         D +R  S +    A+W  +++LVP+ LG E
Sbjct: 84  -SCSEVSRLVVYVSQDCTVYKA--------DVARLLS-WPDPTAEWKSVVILVPVRLGGE 133

Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
            +NP Y+P ++        LGI+GGKP  S Y +G Q++  +YLDPH  QP +++ +   
Sbjct: 134 TLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQPSF 193

Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFT 439
             +  ++H    R +    +DPS  +GFY  ++ +F+  C+   ++   S+     P+FT
Sbjct: 194 PLE--SFHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCSELMRILSSSSVTERYPMFT 251

Query: 440 VTQTHKK 446
           V + H +
Sbjct: 252 VAEGHAQ 258


>gi|241958330|ref|XP_002421884.1| cysteine protease, putative [Candida dubliniensis CD36]
 gi|223645229|emb|CAX39828.1| cysteine protease, putative [Candida dubliniensis CD36]
          Length = 443

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 94/330 (28%), Positives = 143/330 (43%), Gaps = 73/330 (22%)

Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------- 162
           LG    N+  A  N    S++ +SYR GF+PI  S                         
Sbjct: 69  LGQIFDNSNAA--NNYIESKLWLSYRCGFEPIPKSIDGPQPIHFFPSIIFNRTTIYSNFA 126

Query: 163 ---------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 213
                      TSD GWGCM+R+SQ L+A  LL             K + +   EI+ LF
Sbjct: 127 NLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEQEIVKLF 173

Query: 214 GDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
            D   SPFSIHN ++   +  L    G W GP A   S + L    + +   G    P  
Sbjct: 174 QDDTKSPFSIHNFIRVASSSPLHVKPGEWFGPNAASLSIKRLTNELQDQEINGIN--PPR 231

Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
           +++    +            DD  R   VF+K +++   +++L P+ LG++KVN  Y  +
Sbjct: 232 VFISENSD----------LFDDEIR--DVFAKEKSN--SVIILFPIRLGIDKVNSYYYNS 277

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
           +    +   S GI GGKP +S Y +G ++   IY DPH  Q V      +   +  +YHS
Sbjct: 278 IFHLLSSKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQIV------ETPFNMDSYHS 331

Query: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
                +++  +DPS+ IG    + D++ DF
Sbjct: 332 TNYNTLNISLLDPSMMIGILVTNIDEYIDF 361


>gi|392574855|gb|EIW67990.1| hypothetical protein TREMEDRAFT_63874 [Tremella mesenterica DSM
           1558]
          Length = 1159

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 82/248 (33%), Positives = 112/248 (45%), Gaps = 51/248 (20%)

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------------REYVEIL 210
           +T+D GWGCMLR+ Q L+A AL+   LGR WR P Q                   YV IL
Sbjct: 580 LTTDAGWGCMLRTGQSLLANALIHLHLGRDWRVPSQPQVPPTSAAHLAELEAYSSYVRIL 639

Query: 211 HLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
             F D  S   PFS+H +   GK  G   G W GP     + + L             S 
Sbjct: 640 SWFLDDPSPLCPFSVHRIALIGKELGKEVGEWFGPSTAAGALKTL-----------VNSF 688

Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQAD-----------------W--T 309
           P +   V+   D       +V   D     ++ S G +D                 W   
Sbjct: 689 PPSGMAVATAVDS------IVYKSDVYSASNLQSTGWSDESAPPRRQSSSSRSSTSWGNR 742

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
            +L+L+ + LGL+ VNP Y  +++  FTFPQS+GI GG+P +S Y VG Q  S +YLDPH
Sbjct: 743 AVLVLIGIRLGLDGVNPLYYESIKALFTFPQSVGIAGGRPSSSYYFVGTQANSLVYLDPH 802

Query: 370 DVQPVINI 377
             +P + +
Sbjct: 803 FTRPAVPL 810



 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 40/60 (66%), Gaps = 5/60 (8%)

Query: 383  EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
            +A   T+H D +R I L  +DPS+ +GF C+D+ DF+DFC+R ++L ++     +FT+ +
Sbjct: 962  KAQLGTFHCDKVRKIPLSGLDPSMLLGFVCKDEADFEDFCSRVAQLPQK-----IFTIQE 1016


>gi|68485607|ref|XP_713286.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
 gi|46434768|gb|EAK94169.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
          Length = 446

 Score =  122 bits (305), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 98/353 (27%), Positives = 148/353 (41%), Gaps = 76/353 (21%)

Query: 141 NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 166
           N    S++ +SYR GF+PI  S                                    TS
Sbjct: 80  NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCM+R+SQ L+A  LL             K + +   EI+ LF D  +SPFSIHN 
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDGTSSPFSIHNF 186

Query: 227 LQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
           ++      L    G W GP A   S + L      +  L    +P     +S + D    
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLTNELLQDQELDGIRIPRVF--ISENSD---- 240

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
                  DD  R   VF+K ++    +L+L P+ LG++KVN  Y  ++        S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKS--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
            GGKP +S Y +G ++   IY DPH  Q V      +   +  +YH+     +++  +DP
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYHTTNYNRLNISLLDP 345

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGET 457
           S+ IG    + D++ DF    S   + +N    F     H  PV    ++ ++
Sbjct: 346 SMMIGILVTNIDEYIDF---KSSCIDNNNKIVHF---HPHTLPVQQDSIINQS 392


>gi|156839152|ref|XP_001643270.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
           70294]
 gi|166990653|sp|A7TQN1.1|ATG4_VANPO RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|156113873|gb|EDO15412.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 411

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 89/309 (28%), Positives = 142/309 (45%), Gaps = 57/309 (18%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 170
           F  D  SRI  +YR  F PI  S                                +D+GW
Sbjct: 74  FLSDVISRIHFTYRTKFIPIARSDDGPSPLRINFLIGDNPFNAIENAIYNPNCFNTDIGW 133

Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
           GCM+R+ Q L+A A+    LGR +R       + +  +I+  F D+   PFS+HN ++ G
Sbjct: 134 GCMIRTGQSLLANAIQIAILGREFRVN-DGDVNEQERKIISWFMDTPDEPFSLHNFVKKG 192

Query: 231 -KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
            +      G W GP A  RS ++L   Q  + G+    + ++   +  DE          
Sbjct: 193 CELSSKKPGEWFGPAATSRSIQSLVE-QFPDCGIDRCIVSVSSADIFKDE---------- 241

Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
            I+D      +F   +  ++ ILLL+ + LG++KVN  Y+  +R       S+GI GG+P
Sbjct: 242 -IND------IFKNKR--YSNILLLMGVKLGVDKVNEYYLKDIRKILESRYSVGISGGRP 292

Query: 350 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 409
            +S Y  G Q+++ +Y DPH  QP        +E+   T H+D    I++  +DPS+ IG
Sbjct: 293 SSSLYFFGYQDDTLLYFDPHKPQPST------IESLLETCHTDNFDKINISDMDPSMLIG 346

Query: 410 FYCRDKDDF 418
              + +DD+
Sbjct: 347 VLLQGEDDW 355


>gi|145481079|ref|XP_001426562.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124393637|emb|CAK59164.1| unnamed protein product [Paramecium tetraurelia]
          Length = 391

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 89/326 (27%), Positives = 148/326 (45%), Gaps = 42/326 (12%)

Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
           Q +S  I  +YRK F  I +S+ TSD GWGCMLRS QM+ AQ L  H      R+  Q  
Sbjct: 51  QIYSRTIWFTYRKNFPQILNSQQTSDAGWGCMLRSGQMIWAQILRVH-----IRQKKQHS 105

Query: 202 FDREYVEILHLFGDSE---------------TSPFSIHNLLQAGK-AYGLAAGSWVGPYA 245
            D +Y ++L  F D +                SP+SI  +    +  + +    W  P  
Sbjct: 106 KDYQY-KLLCAFSDDDDDEHKKMFTDNFKLCLSPYSIQKIEAISQIKFSMKPCQWYRPDQ 164

Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIY--VVSGDEDGERGGAPVVC-----------ID 292
           +  +   L + ++ E   G + L + I   ++      E  G  + C             
Sbjct: 165 ILNALSLLHQQKQLE---GSEDLEITISDSLLYDRLYSEMYGLKMDCEHIVNEIKQDKNK 221

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           + S+ C++  K       I  +  +  GL+++N  Y+P L      PQ  GI+GG+   +
Sbjct: 222 EISKICNICQKKDPKALAIFFITRI--GLDEINKEYLPFLNDLIDLPQFQGIIGGRDDKA 279

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
            YI+G   +  IYLDPH +Q  IN G   +  D  T+    +++I+ + + PS+A+GFYC
Sbjct: 280 YYILGRVNKRLIYLDPHYIQEHINRGNVVMLKD--TFFCKDVKYINEEQMSPSIALGFYC 337

Query: 413 RDKDDFDDFCARASKLAEESNGAPLF 438
           +++ + D F     ++ +  +    F
Sbjct: 338 QNQSELDKFFNSIEQIKKNYDNEKTF 363


>gi|68485712|ref|XP_713234.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
 gi|71152285|sp|Q59UG3.1|ATG4_CANAL RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|46434715|gb|EAK94117.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
          Length = 446

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 92/317 (29%), Positives = 136/317 (42%), Gaps = 70/317 (22%)

Query: 141 NQDFSSRILISYRKGFDPIGDS----------------------------------KITS 166
           N    S++ +SYR GF+PI  S                                    TS
Sbjct: 80  NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTS 139

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGCM+R+SQ L+A  LL             K + +   EI+ LF D  +SPFSIHN 
Sbjct: 140 DAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKLFQDDTSSPFSIHNF 186

Query: 227 LQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
           ++      L    G W GP A   S + LA     +  +    +P     +S + D    
Sbjct: 187 IRVASLSPLHVKPGEWFGPNAASLSIKRLASELLQDQEIDGIKIPRVF--ISENSD---- 240

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
                  DD  R   VF+K +     +L+L P+ LG++KVN  Y  ++        S GI
Sbjct: 241 -----LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYNSIFHLLASKYSCGI 291

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
            GGKP +S Y +G ++   IY DPH  Q V      +   +  +YH+     +++  +DP
Sbjct: 292 AGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYHTTNYNRLNISLLDP 345

Query: 405 SLAIGFYCRDKDDFDDF 421
           S+ IG    + D++ DF
Sbjct: 346 SMMIGILVTNIDEYIDF 362


>gi|268536436|ref|XP_002633353.1| Hypothetical protein CBG06097 [Caenorhabditis briggsae]
          Length = 411

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 88/285 (30%), Positives = 135/285 (47%), Gaps = 54/285 (18%)

Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP-----------FDREYVE---IL 210
           T+D GWGCM+R++QM+VAQA++ +R GR WR   +K            FD E ++   IL
Sbjct: 88  TTDCGWGCMIRTTQMMVAQAIMINRFGRNWRFVRRKKSHVTVNGEETEFDTEKMKEWMIL 147

Query: 211 HLFGDSETSPFSIHNLLQ-AGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
            LF D  ++P  IH +++ A +  G  A G W  P       EA+   ++A T       
Sbjct: 148 KLFEDKPSAPLGIHKMIEIAAREKGKRAVGCWYSPS------EAVFIMKKAITESASPLT 201

Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV-LGLEKVNPR 327
              +  +S D     G   +  ++  ++H          WT  L+LV +V LG  ++N  
Sbjct: 202 GDTVMYLSID-----GRVHIRDLEVETKH----------WTKTLMLVIVVRLGAAELNRI 246

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           Y+P L   F+    LGI GG+P  S + VG   +  IYLDPH     I I   D++ +TS
Sbjct: 247 YVPHLMRLFSMDSCLGITGGRPDHSCWFVGYYGDQVIYLDPHVAHEYIPI---DMDFNTS 303

Query: 388 -------------TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
                        +YH  ++  +H   +DPS A+ F    ++ FD
Sbjct: 304 QEDPKKPKKCPERSYHCRLLSKMHFLDMDPSCALCFRFESREQFD 348


>gi|344229797|gb|EGV61682.1| hypothetical protein CANTEDRAFT_115142 [Candida tenuis ATCC 10573]
          Length = 408

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 79/268 (29%), Positives = 128/268 (47%), Gaps = 37/268 (13%)

Query: 159 IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSET 218
           I +   T+DVGWGCM+R+SQ L+A           +++ + +   +E +++L  F DSE 
Sbjct: 123 IDNENFTTDVGWGCMIRTSQSLLANT---------YKRMISEDAQQE-IQLLDQFKDSEA 172

Query: 219 SPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVS 276
           +PFS+HN ++      L    G W GP A   S + L     ++   G   LP    ++S
Sbjct: 173 APFSLHNFIRVANESPLQVKPGQWFGPNAASLSIQRLCNLVNSKENFG---LPGLSVLIS 229

Query: 277 GDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
            + D           DD  +   +  K Q+    +L+L+P+ LG++K N  Y  ++    
Sbjct: 230 ENSD---------LYDDKVQEF-LDKKKQS----LLILLPIRLGIDKTNEFYYSSILQLL 275

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
              QS+GI GGKP +S Y  G   +  +YLDPH  Q           A  ++YH+   + 
Sbjct: 276 NCKQSVGIAGGKPSSSFYFFGYDNDELLYLDPHYPQ--------GTNAGYNSYHTPRYQR 327

Query: 397 IHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
           + +  +DPS+ IG    D  D++ F A 
Sbjct: 328 LTISQLDPSMMIGILVDDLQDYNTFKAE 355


>gi|238879782|gb|EEQ43420.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 446

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 95/331 (28%), Positives = 140/331 (42%), Gaps = 72/331 (21%)

Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS------------------------ 162
            LG    N   A  N    S++ +SYR GF+PI  S                        
Sbjct: 68  VLGQTFDNFDTA--NDYIESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNF 125

Query: 163 ----------KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
                       TSD GWGCM+R+SQ L+A  LL             K + +   EI+ L
Sbjct: 126 ANLKSLFDKENFTSDAGWGCMIRTSQNLLANTLL-------------KLYPKNEPEIVKL 172

Query: 213 FGDSETSPFSIHNLLQAGKAYGL--AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
           F D  +SPFSIHN ++      L   +G W GP A   S + L      +  +    +P 
Sbjct: 173 FQDGTSSPFSIHNFIRVASLSPLHVKSGEWFGPNAASLSIKRLTSELLQDQEIDGIKIPR 232

Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
               +S + D           DD  R   VF+K +     +L+L P+ LG++KVN  Y  
Sbjct: 233 VF--ISENSD---------LFDDEIR--DVFAKEKN--ASVLILFPIRLGIDKVNSYYYN 277

Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYH 390
           ++        S GI GGKP +S Y +G ++   IY DPH  Q V      +   +  +YH
Sbjct: 278 SIFHLLASKYSCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV------ETPINMDSYH 331

Query: 391 SDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           +     +++  +DPS+ IG    + D++ DF
Sbjct: 332 TTNYNRLNISLLDPSMMIGILVTNIDEYIDF 362


>gi|429850312|gb|ELA25600.1| cysteine protease atg4 [Colletotrichum gloeosporioides Nara gc5]
          Length = 411

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 88/310 (28%), Positives = 124/310 (40%), Gaps = 83/310 (26%)

Query: 138 AEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVGWGCML 174
           A F  DF S+  ++YR  F       DP                +  S  +SD GWGCM+
Sbjct: 109 AAFLDDFESKFWMTYRSEFELIAKSTDPRASSALSLSMRIKSQLVDQSGFSSDSGWGCMI 168

Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYG 234
           RS QML+A A+    LGR                                       A G
Sbjct: 169 RSGQMLLANAMAITNLGR--------------------------------------VACG 190

Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDA 294
              G W GP A  R  ++L   Q   +        + +Y          G  P V  D  
Sbjct: 191 KYPGEWFGPSATARCIQSLTNAQEQPS--------LRVYST--------GDGPDVYED-- 232

Query: 295 SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
            +   +       + P L+LV   LG++K+ P Y   L      PQS+GI GG+P AS Y
Sbjct: 233 -KFMKIAKPDGTRFHPTLILVGTRLGIDKITPVYWDALIAALQMPQSVGIAGGRPSASHY 291

Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
            +G Q     YLDPH  +P +    D     +AD  T H+  +R +H+  +DPS+ IGF 
Sbjct: 292 FIGAQGSFLFYLDPHHTRPALPYHSDPSRYTDADIDTAHTRRLRRLHVREMDPSMLIGFL 351

Query: 412 CRDKDDFDDF 421
            +D DD+ ++
Sbjct: 352 IKDDDDWSEW 361


>gi|66810578|ref|XP_638996.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
 gi|60467622|gb|EAL65643.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
          Length = 551

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 59/161 (36%), Positives = 93/161 (57%), Gaps = 6/161 (3%)

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W P+L+L+P+ LGL+ +N  Y  +L   F FPQ+LG+VGGKP AS Y +  Q+++  YLD
Sbjct: 383 WEPLLILIPMRLGLDGLNSIYHSSLLEIFKFPQNLGVVGGKPRASLYFIAAQDDNLFYLD 442

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH VQ  I + ++  +   +T+     +  H+  +DPSL + F+C+ KDDF+DF  R+ K
Sbjct: 443 PHTVQNHIEV-ENGSKFPLNTFFCSTTKRTHVSEVDPSLVVAFFCKTKDDFNDFVERSKK 501

Query: 428 LAEESNGAPLFTVTQTHKKPVNHSDV----LGETGGVPEDD 464
           +  +    P+F++        +  D     + ETGG   DD
Sbjct: 502 MTSQMEN-PIFSIFDNEPDYDSSRDYEYEEIDETGGETSDD 541



 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 69/119 (57%), Gaps = 5/119 (4%)

Query: 137 LAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
           + EF  DF++R+L  +YR+GF  I D+   +D GWGCMLRS QML++  LL + LG  W+
Sbjct: 140 IKEFLNDFTTRVLWFTYRQGFPCIDDTMYDNDCGWGCMLRSGQMLLSNVLLHNILGDEWK 199

Query: 196 KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA 254
           +         + +I+ +F D  ++PFSIHN+   G+  G   G W  P  + ++ + L 
Sbjct: 200 RSSSAT----HPDIISMFLDKPSAPFSIHNIAMEGQNLGKNIGEWFAPSIISQTIKILV 254


>gi|167393590|ref|XP_001740639.1| cysteine protease atg4 [Entamoeba dispar SAW760]
 gi|165895180|gb|EDR22930.1| cysteine protease atg4, putative [Entamoeba dispar SAW760]
          Length = 332

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 91/308 (29%), Positives = 139/308 (45%), Gaps = 38/308 (12%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 204
           I I+YRK    I +   T+D GWGCM+RS QM++AQ  L   LG  W+     +    + 
Sbjct: 39  IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96

Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
            +++ I++LFGDS  S FSIH L+      G+  G W GP        + A    AE   
Sbjct: 97  FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP--------SFASDIAAEHIN 148

Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
             +      YV    + G   G  +             SK +  + P ++ VPL LG E 
Sbjct: 149 EMRVFRTRGYVA---KLGSIVGPKI----------EELSKDEVGFNPCIIFVPLRLGPES 195

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
               + P L+  F  PQ +G++GGKPG + Y       +  +LDPH  Q  I     D++
Sbjct: 196 PENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAI-----DMK 250

Query: 384 ADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVT 441
            D S  +Y     + ++   IDPS+++ F  +  +D++ F     K  E    + LF   
Sbjct: 251 GDWSYQSYFCKDNKSMNYSKIDPSISLVFLVKHVNDYEHF----KKSFENKTFSKLFIFK 306

Query: 442 QTHKKPVN 449
              +K +N
Sbjct: 307 NEIEKKLN 314


>gi|390594065|gb|EIN03481.1| hypothetical protein PUNSTDRAFT_56214 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 1093

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 93/274 (33%), Positives = 127/274 (46%), Gaps = 55/274 (20%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 164
           F  DF+SR+ ++YR  F PI D+ +                                   
Sbjct: 369 FYADFTSRVWVTYRSHFQPIRDTTLSALESDFGEQAQSANTSGNSVVSGSPSSGRRWWGG 428

Query: 165 ----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-LQKPFDR--EYVEILHLFGDSE 217
               TSD GWGCMLR+ Q L+A ALL   LGR WR+P   +P      YV++L  F DS 
Sbjct: 429 EKGWTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPSYPQPTAAYASYVQLLTWFFDSP 488

Query: 218 TS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV 275
           +   PFS+H +  AGK  G   G W GP     + + L     A  G G         VV
Sbjct: 489 SPLCPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVH---AFPGGGLGVAVAVDGVV 545

Query: 276 SGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
              +      +P     D+ RH    + G      +L+L+ + LGL+ VNP Y  T++  
Sbjct: 546 YETDVFSASHSP-----DSRRHHRTSTWGDRG---VLILIGIRLGLDGVNPIYYDTIKEL 597

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
           +T+PQS+GI GG+P +S Y VG Q +S  YLDPH
Sbjct: 598 YTWPQSVGIAGGRPSSSYYFVGSQADSLFYLDPH 631



 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 37/57 (64%), Gaps = 2/57 (3%)

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
           A+  T+H + +R + L  +DPS+ IGF CRD++++ D  AR + +A++    P+F V
Sbjct: 779 AELRTFHCERVRKMPLSGLDPSMLIGFLCRDEEEWRDLRARIANMAKKFK--PIFAV 833


>gi|281210274|gb|EFA84441.1| autophagy protein 4 [Polysphondylium pallidum PN500]
          Length = 734

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 62/163 (38%), Positives = 91/163 (55%), Gaps = 13/163 (7%)

Query: 281 GERGGA---PVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFT 337
           GE  G+   P+ C D  S  C         W  I++LVP+ LGL+K+N  Y   ++    
Sbjct: 515 GENSGSFKDPLTCSDFFSSSCI-----PQRWKSIIILVPIKLGLDKLNEVYFREIKSMLE 569

Query: 338 FPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHI 397
            PQS+G++GGKP  S Y VG Q+E  IYLDPH V   ++    +    + +YH  V + +
Sbjct: 570 LPQSIGLIGGKPKQSFYFVGYQDEHIIYLDPHFVHDTVSPNDINF---SDSYHHCVPQKM 626

Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
            +  +DPS+AIGFYC  + DF+DFC R  ++  E  G P+ +V
Sbjct: 627 LISQLDPSMAIGFYCHTQSDFEDFCVRIKEI--EKRGFPVVSV 667



 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 31/47 (65%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQM 179
            N  +  F  DF + +  SYRK F PI ++ IT+D+GWGCM+R+ QM
Sbjct: 269 ANQEIDRFIADFKNILWFSYRKDFAPIENTNITTDIGWGCMVRTGQM 315


>gi|58260832|ref|XP_567826.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|134117209|ref|XP_772831.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|338817600|sp|P0CQ11.1|ATG4_CRYNB RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|338817601|sp|P0CQ10.1|ATG4_CRYNJ RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|50255449|gb|EAL18184.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57229907|gb|AAW46309.1| conserved hypothetical protein [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 1193

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 82/240 (34%), Positives = 110/240 (45%), Gaps = 28/240 (11%)

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 208
           +TSD GWGCMLR+ Q L+  AL+   LGR WR P       E               Y +
Sbjct: 562 LTSDAGWGCMLRTGQSLLVNALIHIHLGRDWRVPSTPASFSEATTTQEIAALKDYAKYAQ 621

Query: 209 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
           +L  F D  S   PFS+H +   GK  G   G W GP     + + LA    A  G+   
Sbjct: 622 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 680

Query: 267 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSKGQADW--TPILLLVPL 317
           +   +I      Y  S    D     +P        R     +K +  W    +L+LV +
Sbjct: 681 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRRGDNEAK-EEKWGKRAVLILVGV 739

Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
            LGL+ VNP Y  +++  FTFPQS+GI GG+P +S Y VG Q     YLDPH  +P I +
Sbjct: 740 RLGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 799



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 5/53 (9%)

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
           TYH + I+ + L  +DPS+ +GF C+D+DDF+DF  R ++L ++     +FTV
Sbjct: 952 TYHCEKIKKMPLSGLDPSMLLGFVCKDEDDFEDFVERVAQLPKK-----IFTV 999


>gi|366995231|ref|XP_003677379.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
 gi|342303248|emb|CCC71026.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
          Length = 495

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 90/321 (28%), Positives = 145/321 (45%), Gaps = 74/321 (23%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVGW 170
           F +D  +R+  +YR  F PI  S                                +D+GW
Sbjct: 75  FLKDVVTRLHFTYRTRFKPIMKSPEGPSPLNFSLVIRENPIDVIENAITNPDCFNTDIGW 134

Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--ILHLFGDSETSPFSIHNLLQ 228
           GCM+R+ Q L+   L   RLGR +R     P +++  E  I+  F D+   PFS+H  + 
Sbjct: 135 GCMIRTGQSLLGNTLQIVRLGRDFR---YDPENKDISENRIIEWFIDAPEKPFSLHQFIT 191

Query: 229 AG-KAYGLAAGSWVGPYAMCRSWEALAR----CQRAETGLGCQSLPMAIYVVSGDEDGER 283
            G +  G   G W GP A  RS ++L R    C  AE           + V SGD     
Sbjct: 192 EGMELSGKNPGEWFGPAATARSIQSLIRKFPDCGIAEC---------LVSVSSGD----- 237

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
                +  D+  +   VF+  + +   +L+L+ + LGL  VN  Y  ++R   +   S+G
Sbjct: 238 -----IYSDEVKQ---VFADNKKN---LLILLGVKLGLNAVNECYWDSIRHILSSKYSVG 286

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY---HSDVIRHIHLD 400
           I GG+P +S Y  G + +  +Y DPH  QP        LE +  +Y   H++    + ++
Sbjct: 287 ISGGRPSSSLYFFGYEGDELLYFDPHSPQP-------SLEENNVSYKSCHTNKYGKLLMN 339

Query: 401 SIDPSLAIGFYCRDKDDFDDF 421
            +DPS+ +GF  R ++D+++F
Sbjct: 340 DMDPSMLLGFLIRGQEDWENF 360


>gi|45185039|ref|NP_982756.1| ABL191Wp [Ashbya gossypii ATCC 10895]
 gi|62899767|sp|Q75E61.1|ATG4_ASHGO RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|44980675|gb|AAS50580.1| ABL191Wp [Ashbya gossypii ATCC 10895]
 gi|374105958|gb|AEY94868.1| FABL191Wp [Ashbya gossypii FDAG1]
          Length = 521

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 138/311 (44%), Gaps = 54/311 (17%)

Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 169
           EF  D  +R+  +YR  F PI     G S ++                        +D+G
Sbjct: 115 EFLADVHTRLHFTYRTRFVPIPRHPNGPSPMSISVMLRDNPLNVIENVLNNPDCFQTDIG 174

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+A AL    LGR +R       + E + I+  F D    PFS+H  +Q 
Sbjct: 175 WGCMIRTGQSLLANALQRACLGRDFRIDDNAANEHE-LRIIKWFEDDPKYPFSLHKFVQE 233

Query: 230 GKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G +  G   G W GP A  RS +AL     A     C      I   SGD          
Sbjct: 234 GFSLSGKKPGEWFGPSATSRSIQALVAKFPA-----CGIAHCVISTDSGD---------- 278

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
           V +D+      +F    +    +LLL+ + LG++ VN  Y   +R   +   S+GI GG+
Sbjct: 279 VYMDEVE---PLFRADPS--AAVLLLLCVRLGVDVVNEVYWEHIRHILSSEHSVGIAGGR 333

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT-STYHSDVIRHIHLDSIDPSLA 407
           P +S Y  G Q+E   YLDPH  +P +N+     + D   + H+     +H+  IDPS+ 
Sbjct: 334 PSSSLYFFGYQDEHLFYLDPH--KPQLNLASYQQDLDLFRSVHTQRFNKVHMSDIDPSML 391

Query: 408 IGFYCRDKDDF 418
           IG     KDD+
Sbjct: 392 IGILLNGKDDW 402


>gi|328868883|gb|EGG17261.1| autophagy protein 4 [Dictyostelium fasciculatum]
          Length = 616

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 53/140 (37%), Positives = 84/140 (60%), Gaps = 5/140 (3%)

Query: 304 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
            Q++W  +++LVP+ LGL+K+N  Y   ++     P S+G++GGKP  S Y VG Q+E  
Sbjct: 426 NQSNWKSLIILVPVKLGLDKLNEIYFSGIKAMLQMPSSIGLIGGKPKQSFYFVGFQDEHI 485

Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
           IYLDPH V   I+    +     ++YH  + + +H   IDPS+A GFYC    DF+ FC 
Sbjct: 486 IYLDPHFVHDTIHPFDSNF---LNSYHDCIPQKMHFSQIDPSMAFGFYCHTYKDFEQFCI 542

Query: 424 RASKLAEESNGAPLFTVTQT 443
           R  ++  E++G P+ ++ +T
Sbjct: 543 RIKEI--EASGFPILSIGET 560



 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 61/100 (61%), Gaps = 7/100 (7%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR---P 193
           +  F +DF S +  SYRK F  I ++ IT+D+GWGCMLR+ QM++A+ALL H       P
Sbjct: 194 VERFLEDFKSILWFSYRKDFPSIENTSITTDIGWGCMLRTGQMILARALLKHFYNNENIP 253

Query: 194 WRKPLQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGK 231
           + + ++   + +Y +I+  F D  S+ + +SIH ++   K
Sbjct: 254 YGEKIKT--NSKYKKIMSWFCDYPSKENFYSIHQIVHKNK 291


>gi|213403524|ref|XP_002172534.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
 gi|212000581|gb|EEB06241.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
          Length = 314

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 99/341 (29%), Positives = 140/341 (41%), Gaps = 57/341 (16%)

Query: 91  MRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILI 150
           M  I ER L    T    S + IW LG  H  A +      A       F QD    + +
Sbjct: 4   MSHILERYLRMFPTNHEPSGTFIWSLG--HSYATETGKWPEA-------FVQDTYDLLSL 54

Query: 151 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEIL 210
           +YRK     G    +SD GWGCM+RS Q ++A  L   R  +P   P+ K        IL
Sbjct: 55  TYRKCI--AGMECFSSDAGWGCMIRSMQTMLANCL---RRVQP-SLPVHK--------IL 100

Query: 211 HLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
           H F D   +  S+H  + AG     +  G+W GP  +      L           C + P
Sbjct: 101 HYFADEANAYLSLHQFVDAGHTLCNITPGNWFGPATVSHCAAHL-----------CSTHP 149

Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPI--LLLVPLVLGLEKVNPR 327
                V    DG                 ++  + Q   TP   LLL  L LG++ ++  
Sbjct: 150 QVGLNVCVSHDG-----------------AIMYRDQLRNTPYPRLLLFTLRLGIDTIHTS 192

Query: 328 YIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           Y   L    T PQ++GIVGG+P A+ Y    Q +   YLDPH  Q        D  A  S
Sbjct: 193 YYEQLCHVLTIPQAIGIVGGRPRAAHYFYACQSQWFFYLDPHTTQTAHTF---DNPAPNS 249

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
           ++H   +R + ++ +DP + +GF    ++   DF  R  KL
Sbjct: 250 SFHVTTLRRLRINELDPCMVLGFAITSEECQTDFEQRIVKL 290


>gi|330840629|ref|XP_003292315.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
 gi|325077457|gb|EGC31168.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
          Length = 465

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 52/134 (38%), Positives = 86/134 (64%), Gaps = 5/134 (3%)

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++++PL LG++++N  YI  L+   + PQSLG +GGKP  S Y +G Q++  IYLD
Sbjct: 217 WKSLIIMIPLKLGVDRINTSYIRKLKSILSIPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 276

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH VQ  ++   ++    + T+   + + +   +IDPSL++GFYC+DK  FDD C R SK
Sbjct: 277 PHFVQDTVDPSSNNY---SETFCGCIPQKMSFSNIDPSLSVGFYCKDKSSFDDLCDRLSK 333

Query: 428 LAEESNGAPLFTVT 441
           L  E++  P+ +++
Sbjct: 334 L--ENDEFPIISIS 345


>gi|405119256|gb|AFR94029.1| peptidase family C54 protein [Cryptococcus neoformans var. grubii
           H99]
          Length = 1185

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 81/239 (33%), Positives = 113/239 (47%), Gaps = 26/239 (10%)

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL------QKPFDRE---------YVE 208
           +TSD GWGCMLR+ Q L+  AL+   LGR WR P       +   ++E         Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLVNALIHVHLGRDWRVPSTPASFSEATTNQETAALKDYAKYAQ 619

Query: 209 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
           +L  F D  S   PFS+H +   GK  G   G W GP     + + LA    A  G+   
Sbjct: 620 MLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANS-FAPCGVAVA 678

Query: 267 SLPMAI------YVVSG-DEDGERGGAPVVCIDDASRHCSVFSK-GQADWTPILLLVPLV 318
           +   +I      Y  S    D     +P        R     +K G+     +L+LV + 
Sbjct: 679 TATDSIIYKSDVYTASNLPSDDWNSISPTFNSSKKKRGGDNKAKEGKWGKRAVLILVGIR 738

Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
           LGL+ VNP Y  +++  FTFPQS+GI GG+P +S Y +G Q     YLDPH  +P I +
Sbjct: 739 LGLDGVNPIYYDSIKALFTFPQSVGIAGGRPSSSYYFIGSQANHLFYLDPHLTRPAIPL 797



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 5/53 (9%)

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
           TYH + I+ + L  +DPS+ +GF C+D+DDF+DF  R ++L ++     +FTV
Sbjct: 946 TYHCEKIKKMPLSGLDPSMLLGFVCKDEDDFEDFVERVAQLPKK-----IFTV 993


>gi|321263995|ref|XP_003196715.1| hypothetical protein CGB_K2500C [Cryptococcus gattii WM276]
 gi|317463192|gb|ADV24928.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 1188

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 79/239 (33%), Positives = 109/239 (45%), Gaps = 26/239 (10%)

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---------------YVE 208
           +TSD GWGCMLR+ Q L+  AL+   LGR WR P       E               Y +
Sbjct: 560 LTSDAGWGCMLRTGQSLLINALIHVHLGRDWRLPSTPATFSEATTSQEIAALKDYAKYAQ 619

Query: 209 ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
           ++  F D  S   PFS+H +   GK  G   G W GP     + + LA    A  G+   
Sbjct: 620 MVSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGTLKTLANS-FAPCGIAVA 678

Query: 267 SLPMAI------YVVSG--DEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 318
           +   +I      Y  S    +D  R            RH +   +G+     +L+LV + 
Sbjct: 679 TATDSIIYRSDVYAASNLPSDDWNRISPTFNPSRKKKRHNAEAKEGKWGERAVLILVGIR 738

Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
           LGL+ VNP Y  +++  FTFPQ+ G  GG+P +S Y VG Q     YLDPH  +P I +
Sbjct: 739 LGLDGVNPIYYDSIKALFTFPQAGGSAGGRPSSSYYFVGSQANHLFYLDPHLTRPAIPL 797



 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 34/53 (64%), Gaps = 5/53 (9%)

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
           TYH + I+ + L  +DPS+ +GF C+ +DDF++F  R + L ++     +FTV
Sbjct: 947 TYHCEKIKKMPLSGLDPSMLLGFVCKSEDDFENFVERVALLPKK-----IFTV 994


>gi|159465677|ref|XP_001691049.1| autophagy protein [Chlamydomonas reinhardtii]
 gi|158279735|gb|EDP05495.1| autophagy protein [Chlamydomonas reinhardtii]
          Length = 484

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 62/167 (37%), Positives = 86/167 (51%), Gaps = 21/167 (12%)

Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
           G++K+NP YIP L+   ++PQS+GIVGG+P AS Y+ GVQ+ S IYLDPH+ Q  +    
Sbjct: 339 GMDKINPVYIPQLQQVLSWPQSVGIVGGRPSASLYVCGVQDASFIYLDPHEAQLALG--- 395

Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 439
                   TY  DV+R +    +DPSLAIGF C    + +D  AR   LA + + APL T
Sbjct: 396 --------TYFCDVVRVLPSAQLDPSLAIGFVCTSSAELEDLFARLQALATQHSSAPLMT 447

Query: 440 VTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486
           +T      V          G   D       +    G    D+W+L+
Sbjct: 448 LTTGSGAAV----------GCGSDADFTDDVLEGGTGQQQLDEWELV 484



 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 48/117 (41%), Positives = 65/117 (55%), Gaps = 6/117 (5%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
           DF SR+  +YRK F  +G S +TSDVGWGC LRS QML+A+     R G   R  L + +
Sbjct: 49  DFRSRMWCTYRKDFPALGPSLLTSDVGWGCTLRSGQMLLAEVRHGWRAGAMMRVALGRDW 108

Query: 203 DR-----EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEAL 253
            R     E V  ++    D   +P SIH +  AG   G+  G W+GP+ +C+  EAL
Sbjct: 109 QRCSDNLEAVRPVVAALLDCAEAPLSIHRICDAGGPAGIVPGRWLGPWMLCKGLEAL 165


>gi|254584596|ref|XP_002497866.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
 gi|238940759|emb|CAR28933.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
          Length = 489

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 94/321 (29%), Positives = 145/321 (45%), Gaps = 53/321 (16%)

Query: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT-------------------- 165
           +  +N   +F  D  SR+  +YR  F PI     G S ++                    
Sbjct: 69  SKNSNENPDFLSDVRSRLHFTYRTRFMPIPAVPGGPSPLSFHFLIRENPINAIENAINNP 128

Query: 166 ----SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
               +DVGWGCM+R+ Q L+  AL   RLGR +R  +      E + I+  F D   +PF
Sbjct: 129 ACFNTDVGWGCMIRTGQSLLGNALQIARLGRGYR--IGSELKPEEISIIDWFVDIPDAPF 186

Query: 222 SIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
           SIHN +  G +      G W GP A  RS ++L R  +      CQ     I V SGD  
Sbjct: 187 SIHNFVSKGMELSSKRPGEWFGPAATSRSIQSLIRGFKQCGIDDCQ-----ISVSSGD-- 239

Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
                   V  +D  +   VF++ +   + ILLL+ + LG+  VN  Y   ++       
Sbjct: 240 --------VYEEDVMK---VFNESKD--SRILLLLGVKLGINAVNEFYWNDIKRLLGSKF 286

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
           S+GI GG+P +S Y +G Q    +YLDPH  QP ++    +  +   + HS     + + 
Sbjct: 287 SVGIAGGRPSSSLYFIGYQGNELLYLDPHTAQPFLSPSHQE-RSFYDSCHSSNYGKLAIQ 345

Query: 401 SIDPSLAIGFYCRDKDDFDDF 421
            +DPS+ IG     +++F ++
Sbjct: 346 DLDPSMLIGILISGEEEFKEW 366


>gi|365988214|ref|XP_003670938.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
 gi|343769709|emb|CCD25695.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
          Length = 427

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 84/253 (33%), Positives = 119/253 (47%), Gaps = 30/253 (11%)

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDSETSPFSI 223
           +D+GWGCM+R+ Q L+  AL    LGR WR           +  EI   F D+   PFS+
Sbjct: 55  TDIGWGCMIRTGQSLLGNALQLRNLGRDWRFDDNTDLKMTEKSNEIASWFMDTPEKPFSL 114

Query: 224 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGD---- 278
           H  +  G +  G   G W GP A  RS ++L   +  E G+        I V SGD    
Sbjct: 115 HRFISKGMQLSGKKPGEWFGPAATARSIQSLVH-EFPECGID----KCLISVSSGDIYKT 169

Query: 279 --EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTF 336
             ED    G           H      GQ D T IL+L+ + LG+E +N  Y  ++R   
Sbjct: 170 EVEDVFNEG-----------HTGEARNGQKDKT-ILILLGVKLGIETINRCYWDSIRRIL 217

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRH 396
           +   S+GI GG+P +S Y  G Q +  +Y DPH  QP  +  K+DL  +T   H+     
Sbjct: 218 SSEYSIGIAGGRPSSSLYFFGYQGDELLYFDPHSPQPSYD--KNDLFYETC--HTTNFGK 273

Query: 397 IHLDSIDPSLAIG 409
           + L  +DPS+ +G
Sbjct: 274 LSLADMDPSMLLG 286


>gi|62899792|sp|Q8NJJ3.1|ATG4_PICPA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4; AltName:
           Full=Pexophagy zeocin-resistant mutant protein 8
 gi|21585563|gb|AAL25849.1| Paz8 [Komagataella pastoris]
          Length = 533

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 87/267 (32%), Positives = 117/267 (43%), Gaps = 50/267 (18%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCMLRSSQ 178
           F  D  S+I ++YR GF PI   K                      TSD GWGCM+R+SQ
Sbjct: 65  FIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGFTSDAGWGCMIRTSQ 124

Query: 179 MLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAA 237
            L+A ALLF  LGR W    + P + E+  I+  F D    PFSIHN +Q G K      
Sbjct: 125 SLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIHNFVQQGIKCCDKKP 184

Query: 238 GSWVGPYAMCRSWEALARCQRAETGLGCQSLP---MAIYVVSGDEDGERGGAPVVCIDDA 294
           G W GP A  R+ + L           C+  P   + +Y  S             C D  
Sbjct: 185 GEWFGPSAASRAIKNL-----------CKEYPPCGLRVYFSSD------------CGDVY 221

Query: 295 SRHCSVFSKGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
                  + G +D +TPIL+L+ + LG+EKVN      LR   +  QS+GI G K     
Sbjct: 222 DTEVRELAYGDSDTFTPILVLLGIRLGVEKVNLYIGDLLRECLSLKQSVGISGRKTSFLA 281

Query: 354 YI-VGVQEESAIYLDPHDVQPVINIGK 379
            + +G Q +   YL P   +  +  GK
Sbjct: 282 LLSIGFQGDYLFYLIPTFPKKALTFGK 308


>gi|183230788|ref|XP_001913481.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|169802747|gb|EDS89733.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449704540|gb|EMD44766.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 330

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 89/309 (28%), Positives = 138/309 (44%), Gaps = 40/309 (12%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 204
           I I+YRK    I +   T+D GWGCM+RS QM +AQ  L   LG  W+     +    + 
Sbjct: 39  IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCINTERNI 96

Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARCQRAETG 262
            +++ I++LFGDS  S FSIH L+      G+  G W GP +A   + E +   +   T 
Sbjct: 97  FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEMRVFRT- 155

Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
                               RG    +     S+   +   G   + P ++ VPL LG E
Sbjct: 156 --------------------RGYVAKLGSIIGSKIEELIKDG-GGFNPCIIFVPLRLGPE 194

Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
                + P L+  F  PQ +G++GGKPG + Y       +  +LDPH  Q  I     D+
Sbjct: 195 SPENEFKPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTTQNAI-----DM 249

Query: 383 EADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
           + D S  +Y     + +    +DPS+++ F  +  +D++ F     K  E    + LFT 
Sbjct: 250 KGDWSYQSYFCKDNKSMLYSKMDPSISLVFLVKHANDYEHF----KKSFENKTFSKLFTF 305

Query: 441 TQTHKKPVN 449
               +K +N
Sbjct: 306 KDETEKELN 314


>gi|385305819|gb|EIF49766.1| cysteine protease atg4 [Dekkera bruxellensis AWRI1499]
          Length = 476

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/323 (28%), Positives = 150/323 (46%), Gaps = 56/323 (17%)

Query: 138 AEFNQDFSSRILISYRKGF-----DPIGDSKI-------------------TSDVGWGCM 173
           ++F  D ++R+  +YR GF     DP G S +                   T+D GWGCM
Sbjct: 91  SDFISDVATRLWFTYRSGFPVIKRDPDGPSPLSLGSLFRGTLDVKNASIGFTTDSGWGCM 150

Query: 174 LRSSQMLVAQALLFHRLGRPWRK-PLQKP---------FDREYVEILHLFGDSETSPFSI 223
           +R+SQ L+A ALL   +GR WR  P + P         +++++ +I+  F D   +PFSI
Sbjct: 151 IRTSQSLLANALLNLHVGRKWRYIPAENPNGETEYAKKYEKQW-QIITWFADFPWAPFSI 209

Query: 224 HNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
             +++ G  +     G W GP A  RS   L +    ++   C+   +  Y+  G+ D  
Sbjct: 210 QQIVRYGSEHCNKKPGEWFGPSAASRSIVYLCK----QSYKACK---LNTYLTEGNGD-- 260

Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
                    +D     S     +  + P L+L  + LG+  VNP Y   L+   +  QS+
Sbjct: 261 -------IYEDELLXVSCPEGTENGFRPTLILSGVRLGVXXVNPVYWAFLKKLLSIHQSV 313

Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI---NIGKDDLEAD-TSTYHSDVIRHIH 398
           GI GG+P +S Y  G Q ++  Y+DPH  Q  +   ++   D   +  ++ H+  IR + 
Sbjct: 314 GIAGGRPSSSHYFFGYQGDNLFYMDPHTPQTALLADHVDDADYRXEYVASVHTKRIRKLG 373

Query: 399 LDSIDPSLAIGFYCRDKDDFDDF 421
           L  +DPS+ IG      +D+ + 
Sbjct: 374 LCEMDPSMLIGLLVTSLEDYKEL 396


>gi|407043540|gb|EKE42005.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 330

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/315 (28%), Positives = 141/315 (44%), Gaps = 41/315 (13%)

Query: 143 DFSSR-ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---L 198
           DF+   I I+YRK    I +   T+D GWGCM+RS QM +AQ  L   LG  W+     +
Sbjct: 33  DFARHTIWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCI 90

Query: 199 QKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP-YAMCRSWEALARC 256
               +  +++ I++LFGDS  S FSIH L+      G+  G W GP +A   + E +   
Sbjct: 91  NTERNIFHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEM 150

Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVP 316
           +   T                     RG    +     S+   +   G   + P ++ VP
Sbjct: 151 RVFRT---------------------RGYVAKLGSIIGSKIEELIKDG-GGFNPCIIFVP 188

Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVIN 376
           L LG E     + P L+  F  PQ +G++GGKPG + Y       +  +LDPH  Q  I 
Sbjct: 189 LRLGPESPENEFRPLLKTIFDIPQCMGMIGGKPGYAHYFHTFDGINLYFLDPHTTQNAI- 247

Query: 377 IGKDDLEADTS--TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG 434
               D++ D S  +Y     + +    +DPS+++ F  +  +D++ F     K  E    
Sbjct: 248 ----DMKGDWSYQSYFCKDNKSMLYSKMDPSISLVFLVKHANDYEHF----KKSFENKTF 299

Query: 435 APLFTVTQTHKKPVN 449
           + LFT     +K +N
Sbjct: 300 SKLFTFKDETEKELN 314


>gi|340508502|gb|EGR34192.1| hypothetical protein IMG5_021070 [Ichthyophthirius multifiliis]
          Length = 285

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 85/291 (29%), Positives = 127/291 (43%), Gaps = 44/291 (15%)

Query: 144 FSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
           F S I I+YR+ F P+     +  SD GWGCM+R  QM +A+ L              K 
Sbjct: 2   FESIIWITYRRKFPPLKAPQYEYISDTGWGCMIRVGQMALAEGL--------------KR 47

Query: 202 FDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEALARCQRAE 260
           F  +  EI+ LF D + S FSI N+ +AGK  + L AG W  P  +C   + L   +   
Sbjct: 48  FQIKEDEIIDLFQDKKDSLFSIQNICEAGKEEFKLEAGDWFNPIRICYILQILNEKK--- 104

Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 320
              G + L   I  +S D         ++  +D     S    G      ++L +   LG
Sbjct: 105 ---GFKDL--KIRTISSDR--------ILIFEDLEMEFSSEKNG------LILFLVCKLG 145

Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
           LEK    Y+      F +  S+G++GGKP  + + VG  E+  IYLDPH VQ       +
Sbjct: 146 LEKTEENYLKIALKIFDYKNSIGMIGGKPKKALFFVGRIEDQLIYLDPHYVQDF-----N 200

Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
               D ++Y       +    ID S+    +  +K++   F     +L EE
Sbjct: 201 QNNVDQNSYFCKNYAVLDQKKIDSSIGNVLFFENKEELKMFFQFLDQLKEE 251


>gi|149246610|ref|XP_001527730.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|166990616|sp|A5DSB4.1|ATG4_LODEL RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|146447684|gb|EDK42072.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 523

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 85/289 (29%), Positives = 135/289 (46%), Gaps = 43/289 (14%)

Query: 164 ITSDVGWGCMLRSSQMLVAQALL--FHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
            TSD GWGCM+R+SQ L+A ALL  FH  G    +P      +   +++ LF D+ ++PF
Sbjct: 179 FTSDAGWGCMIRTSQNLLANALLRLFHTTGG---QPQNFAVTKTEADVIELFQDTLSAPF 235

Query: 222 SIHNLLQAGKAYGL--AAGSWVGPYA-------MCRSWEALARCQRAETGLGCQS---LP 269
           S+HN ++A  +  L    G W GP A       +   +  + + +R+E   G  S   +P
Sbjct: 236 SLHNFIKAANSLSLNIKPGQWFGPSAASLSIKKLVNDYNLIQQERRSERDSGRDSGHKVP 295

Query: 270 M-----------AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG-----QADWTPILL 313
                       +      D   +R   P V +   S +C ++        + +  PIL 
Sbjct: 296 TPNLKLHSKSADSDSDSDSDAISKRNSIPYVYV---SENCDLYDDEINAIFELEQRPILF 352

Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPHDVQ 372
           L P+ LG+E+VN  Y  ++        S+GI GGKP +S Y +G + E+  IY DPH  Q
Sbjct: 353 LFPIRLGIEQVNKYYYSSILQILASKFSVGIAGGKPSSSFYFIGYEGEDDLIYFDPHLPQ 412

Query: 373 PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
            V          +  +YH+     + +D +DPS+ IG      D++ +F
Sbjct: 413 IV------QTPVNLESYHTSEYSKLKIDQLDPSMMIGILIETIDEYQEF 455


>gi|145549650|ref|XP_001460504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428334|emb|CAK93107.1| unnamed protein product [Paramecium tetraurelia]
          Length = 402

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/327 (26%), Positives = 139/327 (42%), Gaps = 45/327 (13%)

Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPWRKP--LQKP 201
           SS I  SYRK       S +TSD GWGCM+R +QM +AQ +  +H   +P +    ++  
Sbjct: 71  SSIIWFSYRKKIPQFQISSLTSDTGWGCMIRVAQMALAQVIRHYHSFTQPEQLIVLIRHF 130

Query: 202 FDREYVEILHLFGDSETS-------PFSIHNLLQAGKA-YGLAAGSWVGPYAMCRSWEAL 253
            D +  E+++     + +       PFSI  ++   K  +    G W  P  +  +   L
Sbjct: 131 LDDDDDELINFIKQDQKNQVQYYHAPFSIQKIVYHAKVEFKKEPGDWYKPNEILETLNYL 190

Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP--- 310
            +  +        SL M IY+                + DA +    + KG  +W     
Sbjct: 191 FKYSQY-------SLNMQIYI---------NYQCAFILQDAIKQMFNYDKGNQEWLKECI 234

Query: 311 -------------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVG 357
                        I + +P  +GL++VN  Y+  L +  T P   GI+GG    + YIVG
Sbjct: 235 KNNNQFISQHDKGIAIFLPARIGLQRVNQDYLEVLNILMTLPYFQGIIGGVTNRAFYIVG 294

Query: 358 VQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
             ++  IYLDPH VQ   N   +DL    ++Y    I+ IH  SIDPS+ +    R+  +
Sbjct: 295 RIQDYLIYLDPHFVQNAQNF--EDLSKTQASYTCQNIQLIHNKSIDPSIVVCLCVRNGLE 352

Query: 418 FDDFCARASKLAEESNGAPLFTVTQTH 444
             D     + + +E       ++  T+
Sbjct: 353 LLDLWHSLNHMKQEFQEFFFISILDTN 379


>gi|326430141|gb|EGD75711.1| pyruvate water dikinase [Salpingoeca sp. ATCC 50818]
          Length = 1055

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 142/308 (46%), Gaps = 46/308 (14%)

Query: 148 ILISYRKGFDPI-GDSKITSDVGWGCMLRSSQMLVAQALLFHRL--GRPWRKPLQKPFDR 204
           + ++YRKG+DPI GD+++TSD GWGC  RS QML+AQAL+ +     R  R    +P   
Sbjct: 603 VWLTYRKGYDPIHGDAQLTSDTGWGCTYRSGQMLLAQALMSNAEPSARMQRLEGVRPSTW 662

Query: 205 EYVE----ILHLFGDSE--TSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR 258
           ++ E    +L +F DS    + FSI ++ +         G W+ P               
Sbjct: 663 QHEETKRAVLSMFQDSHDPAAFFSIQHMAETSFVVRKKPGQWLSP--------------- 707

Query: 259 AETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLV 318
                    + + I  ++  E G R    V  ++D          G+  W P LL++PL 
Sbjct: 708 -------SEVALIIRRLNPPETGMR----VRIVNDTLLSTRRILAGEP-WMPTLLMIPLR 755

Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE--SAIYLDPHDVQPVIN 376
            GL+ + P  +P     F +P  +G +GGKPG++ Y VG+  +    +YLDPH  +  ++
Sbjct: 756 AGLDTLQPESVPAFVAFFDWPWCVGAIGGKPGSAYYYVGIDHDRRRVLYLDPHTTRSRLD 815

Query: 377 IGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNG-- 434
           +     +A   T   D ++ + +     S+ +G +  +  D  +   R  +  E+ +G  
Sbjct: 816 LSN---QAAEKTCVPDKLKSMDMSKSCSSICVGLFLPELRDLTELVQRYKR--EQLSGMW 870

Query: 435 -APLFTVT 441
             PLF V 
Sbjct: 871 STPLFHVV 878


>gi|302833489|ref|XP_002948308.1| autophagy protein [Volvox carteri f. nagariensis]
 gi|300266528|gb|EFJ50715.1| autophagy protein [Volvox carteri f. nagariensis]
          Length = 391

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 85/155 (54%), Gaps = 18/155 (11%)

Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
           G++K+NP Y+P L+   T+PQS+GIVGG+P AS Y+ GVQ+ S ++LDPH+ QP +  G 
Sbjct: 216 GMDKINPVYLPQLQRILTWPQSVGIVGGRPSASLYLCGVQDSSFLFLDPHEAQPTVRWGI 275

Query: 380 DDLEADT-----------------STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
                 T                 +TY  D +R +   ++DPS+AIGF C    D +D  
Sbjct: 276 AGDAGHTKEAGNGGSAVVLPASSLATYFCDTVRLMPATALDPSMAIGFLCMGAADLEDLF 335

Query: 423 ARASKLAEESNGAPLFTVTQ-THKKPVNHSDVLGE 456
            R   LA+E + APL T+T  T +  V   D  GE
Sbjct: 336 TRLDALAKEHSLAPLMTLTSGTAQAGVGLEDDFGE 370


>gi|255082892|ref|XP_002504432.1| predicted protein [Micromonas sp. RCC299]
 gi|226519700|gb|ACO65690.1| predicted protein [Micromonas sp. RCC299]
          Length = 196

 Score =  112 bits (281), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 54/122 (44%), Positives = 75/122 (61%), Gaps = 11/122 (9%)

Query: 308 WTPILLLVPLVLGLEK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
           W P+++LVPLVLGL++ VNPRY+P +      PQS+GI+GGKP AS Y VG Q+E   YL
Sbjct: 75  WAPLVILVPLVLGLDRCVNPRYVPGIVRMLGLPQSVGILGGKPCASLYFVGAQDEELFYL 134

Query: 367 DPHDVQPVINIGK----------DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKD 416
           DPH VQ  + + +          +     T TYH   + H++   +DPS+ +GFYCR + 
Sbjct: 135 DPHTVQLAVPLEQIWGCAQTGSPESGPFPTETYHCRSVLHMNARELDPSMVLGFYCRTRA 194

Query: 417 DF 418
           DF
Sbjct: 195 DF 196



 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 26/45 (57%), Positives = 31/45 (68%)

Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
           F SR+ I+YR+GF  IG    T+D GWGC LRS QML+A AL  H
Sbjct: 1   FHSRVWITYRRGFPQIGGGTYTTDAGWGCTLRSGQMLLANALQSH 45


>gi|443917360|gb|ELU38094.1| peptidase family c54 domain-containing protein [Rhizoctonia solani
           AG-1 IA]
          Length = 808

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 92/287 (32%), Positives = 126/287 (43%), Gaps = 71/287 (24%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKI----------------------------------- 164
           F +DF+S I ++YR  + PI D+ +                                   
Sbjct: 142 FYEDFTSLIWLTYRSHYTPIRDTSLESLAPLGPCDMEMAPAHLVPASPRRWNWPGSADKS 201

Query: 165 -TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE---YVEILHLFGDSET-- 218
            TSD GWGCMLR+ Q L+A AL+   LGR WR+P    F  E   YV+IL  F D+ +  
Sbjct: 202 WTSDAGWGCMLRTGQSLLANALIHLHLGRNWRRPHYPMFAEEHAVYVKILTWFFDTPSPL 261

Query: 219 SPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALA----RCQRAETGLGCQSLPMAIYV 274
           +PF +H +  AGKA G   G+W GP     S + LA     CQ +   L       A  V
Sbjct: 262 APFGVHRMALAGKALGKDVGTWFGPSTAAGSIKTLAHAFPECQLS-VSLAVDGTVFASDV 320

Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGLEKVNPRYIPTL 332
            +    G      V     +       SK  G+A    +L+LV + LGL+ VNP Y   L
Sbjct: 321 YAASHMGM-----VTTSGRSISSRRSASKWGGRA----VLILVNIRLGLDNVNPIYYDAL 371

Query: 333 RLTFTFPQSLGIVGGKP--GASTYIVGVQEESAIYLDPHDVQPVINI 377
           ++            G+P  G+S Y VG Q +S  YLDPH  +P I +
Sbjct: 372 KV------------GRPRQGSSYYFVGSQADSLFYLDPHHTRPYIPL 406



 Score = 45.4 bits (106), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 17/46 (36%), Positives = 30/46 (65%)

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
           A+  T+H D +R + + ++DPS+ +GF CRD  D+ DF  R + ++
Sbjct: 519 AELRTFHCDRVRKMPMSALDPSMLLGFLCRDDADWKDFRTRVADVS 564


>gi|167526339|ref|XP_001747503.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773949|gb|EDQ87583.1| predicted protein [Monosiga brevicollis MX1]
          Length = 355

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 142/323 (43%), Gaps = 32/323 (9%)

Query: 145 SSRILISYRKGFDPIGDS-KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
           S+ +  +YR     IGDS +  +D GWGC LR  QM+V +AL      R + K L  P +
Sbjct: 52  SAFLWFTYRNSEYAIGDSPRHKTDRGWGCTLRVGQMIVGEALQRCHCPRDYDK-LSYPSE 110

Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
              + IL  F D      S+H +    K  G  AG W  P  +          Q A   +
Sbjct: 111 AARMSILKEFEDRPDRVLSVHAMAMQSKFVGKRAGQWHTPTDVAHVLRLAVNEQEA---M 167

Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
           G Q     ++V             +V +DD  +   +F   +A     LL VPL LG++ 
Sbjct: 168 GLQ-----VHVAMD---------SMVVLDDLRK---LFRADRA----TLLFVPLRLGIDI 206

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
           V    IP ++  F  P +LGI+GG+PGA+ Y +G  + + + LDPH  Q  +  G  D  
Sbjct: 207 VQAEMIPAVKRFFHSPSALGIMGGRPGAAHYFIGYMDHNLLLLDPHTTQDPLRAGSQDAL 266

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV--T 441
                    +   + LD +DP++ + F   D++    F    +   EE+ G  LF++  T
Sbjct: 267 VSCRCSRPML---LDLDKVDPTMCLAFLLTDEESLQRFADDYNASVEET-GVRLFSMLDT 322

Query: 442 QTHKKPVNHSDVLGETGGVPEDD 464
           ++    V  +  L E     +DD
Sbjct: 323 KSFASSVAVASSLAEEEEFSDDD 345


>gi|440297742|gb|ELP90383.1| cysteine protease atg4, putative [Entamoeba invadens IP1]
          Length = 330

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 80/278 (28%), Positives = 127/278 (45%), Gaps = 29/278 (10%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---RKPLQKPFDR 204
           I ++YRK    +   + TSD GWGCM+RS QM +AQ+ +   +G  W   +   Q   ++
Sbjct: 38  IWVTYRKNMKELPGGR-TSDSGWGCMIRSMQMALAQSFVSLVMGNSWKFTKTGFQVERNK 96

Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGL 263
            ++  I++LFGD   S FSIHNL+      G+  G W GP     S+ +        T  
Sbjct: 97  FHLRCIINLFGDGPGSLFSIHNLISRSTTRGVGDGKWWGP-----SFASEIAADHLNT-- 149

Query: 264 GCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEK 323
                   I+V        R G  V             S+   +  P ++ VPL LG   
Sbjct: 150 --------IHVFRTRGYVARLGRIV------KPDILDISEDNGNILPTIIFVPLRLGPVN 195

Query: 324 VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE 383
               + P L+  F  PQ +G+VGGKP  + +          YLDPH  Q  +++   D  
Sbjct: 196 AEEDFRPILKKVFDIPQCVGMVGGKPNLAFFFHTFDGNLLYYLDPHTTQNAVSM---DGG 252

Query: 384 ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
               +Y  + ++ +   ++DPS+++ F  ++KDDF+ F
Sbjct: 253 WSAESYFCNDVKSMKYKNLDPSVSLLFLIKNKDDFNKF 290


>gi|223590151|sp|A5DEF7.2|ATG4_PICGU RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|190345638|gb|EDK37561.2| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 402

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 94/328 (28%), Positives = 139/328 (42%), Gaps = 89/328 (27%)

Query: 136 GLAEFNQDFSSRILISYRKGFDPI---------------------------------GDS 162
           G +E  +    R  +SYR GF+PI                                  + 
Sbjct: 75  GDSEVQKQVKKRYWMSYRSGFEPIKKHEDGPSPLSFVQSMIFNKNVGNTFANIHSLVDND 134

Query: 163 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 221
             T+DVGWGCM+R+SQ ++A A+                 DR   E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177

Query: 222 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 278
           S+HN ++      L    G W GP A   S + L   + + T     ++P+++ V  SGD
Sbjct: 178 SLHNFVKVASDSPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232

Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
                        DD           Q    P+LLL+PL LG++ VN  Y  +L      
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270

Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
           PQS GI GGKP +S Y  G Q  S +YLDPH  Q V         A   +YHS   + + 
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV--------SAGVGSYHSSSYQKLD 322

Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARAS 426
           +  +DPS+  G   ++ +D+ D   R +
Sbjct: 323 ISDMDPSMMAGIVLKNNEDYTDLKRRTT 350


>gi|291238482|ref|XP_002739158.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
           kowalevskii]
          Length = 338

 Score =  111 bits (278), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 51/143 (35%), Positives = 83/143 (58%), Gaps = 5/143 (3%)

Query: 302 SKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
           S+    W  +++L+P+ LG E++NP YI  ++  FT    +GI+GGKP  S Y +G QE+
Sbjct: 156 SRSSQLWCSVIILIPVRLGGEELNPVYISCIKSLFTLKHCIGIIGGKPKHSLYFIGFQED 215

Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
             I+LDPH  Q V+++   D      ++H    R + L  +DPS  IGFYC+ +DDF +F
Sbjct: 216 KLIHLDPHLCQDVVDMRSRDFPL--QSFHCMSPRKMSLMKMDPSCTIGFYCKTQDDFKEF 273

Query: 422 CARASKLAEESNGA---PLFTVT 441
           C+ A ++ + +      P+F  +
Sbjct: 274 CSYAQEVLDSTKHVGDYPMFIFS 296



 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 40/100 (40%), Positives = 62/100 (62%), Gaps = 6/100 (6%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDE--ALGDAAGNNGLA---EFNQDFSSRILISYRKGF 156
           S+T  S  T  IWLLG C+    D+      +A ++ L     F +DF+SR+ ++YR+ F
Sbjct: 42  SQTNFSYHTP-IWLLGECYHHRPDDPNETEQSAEDDCLTPMERFKRDFTSRLWLTYRREF 100

Query: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
             +  + +T+D GWGCMLRS QM++AQ+ L H LGR +++
Sbjct: 101 QQLAGTSLTTDCGWGCMLRSGQMMLAQSFLTHFLGRVYKQ 140


>gi|342186623|emb|CCC96110.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 388

 Score =  111 bits (278), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 127/279 (45%), Gaps = 39/279 (13%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
           +  SYR+ F+P+ +   TSDVGWGC +R+ QM++A A + +R G           D   V
Sbjct: 94  LYFSYRRQFEPLRNGA-TSDVGWGCTIRACQMMLAWAFMRYRNGG------SVTMDDNVV 146

Query: 208 EIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
           + L      LF D  T+PF IH +   G  +G+  G W GP  M +   AL    R+  G
Sbjct: 147 DSLKEFTQRLFYDVPTAPFGIHAMTNEGVRHGVTCGMWFGPTPMAKVIGALNEAYRSSGG 206

Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
            G + L  +        D + G   VV     S+H             ++LL+P+ LG +
Sbjct: 207 EGPEVLVAS--------DRQIGVQDVVVRLQRSQH-------------VVLLIPVKLGPQ 245

Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
            V+  Y   L+  F    S+G VGG+  ++ +  G Q +  I+LDPH VQ  +       
Sbjct: 246 TVSVTYANALKRFFEMGSSIGAVGGEKNSAYFFFGYQGDKIIHLDPHYVQCALT------ 299

Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
             +++   +   R + +   + S  +GFY    D+ D F
Sbjct: 300 SPNSNGTLAGTWRSLPVMQCNTSALLGFYVSSCDELDQF 338


>gi|146420060|ref|XP_001485988.1| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 402

 Score =  111 bits (278), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 94/328 (28%), Positives = 139/328 (42%), Gaps = 89/328 (27%)

Query: 136 GLAEFNQDFSSRILISYRKGFDPI---------------------------------GDS 162
           G  E  +    R  +SYR GF+PI                                  + 
Sbjct: 75  GDLEVQKQVKKRYWMSYRLGFEPIKKHEDGPLPLSFVQSMIFNKNVGNTFANIHSLVDND 134

Query: 163 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI-LHLFGDSETSPF 221
             T+DVGWGCM+R+SQ ++A A+                 DR   E+ + LF D+ ++ F
Sbjct: 135 NFTTDVGWGCMIRTSQSVLANAI-----------------DRAGYEVDVELFADTSSAAF 177

Query: 222 SIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVV-SGD 278
           S+HN ++      L    G W GP A   S + L   + + T     ++P+++ V  SGD
Sbjct: 178 SLHNFVKVASDLPLRVRPGQWFGPSAASLSIKRLCEARNSST-----NVPLSVLVCESGD 232

Query: 279 EDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTF 338
                        DD           Q    P+LLL+PL LG++ VN  Y  +L      
Sbjct: 233 -----------IYDD-----------QIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEV 270

Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
           PQS GI GGKP +S Y  G Q  S +YLDPH  Q V         A   +YHS + + + 
Sbjct: 271 PQSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNV--------SAGVGSYHSSLYQKLD 322

Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARAS 426
           +  +DPS+  G   ++ +D+ D   R +
Sbjct: 323 ISDMDPSMMAGIVLKNNEDYTDLKRRTT 350


>gi|169622773|ref|XP_001804795.1| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
 gi|160704853|gb|EAT78153.2| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
          Length = 357

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 110/230 (47%), Gaps = 42/230 (18%)

Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSK---------------------ITSDVGWGCM 173
           N  + F  DF SR+ ++YR GF PI  S+                      TSD G+GCM
Sbjct: 91  NWPSAFLDDFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCM 150

Query: 174 LRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY 233
           +RS Q ++A AL   RLGR WR   +   D+++ EIL LF D   +PFSIH  ++ G A 
Sbjct: 151 IRSGQCILANALQILRLGRDWRW-QENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAV 209

Query: 234 -GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G   G W GP A  R  + LA   R E GL        +Y VSGD      GA V   +
Sbjct: 210 CGKYPGEWFGPSAAARCIQDLANKHR-EAGL-------KVY-VSGD------GADVY--E 252

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
           D  +  +V   G   W P L+LV   LG++K+ P Y   L++    P  L
Sbjct: 253 DKLKQVAVDEDGL--WQPTLILVGTRLGIDKITPVYWEALKIREMDPSML 300


>gi|406698456|gb|EKD01693.1| hypothetical protein A1Q2_04064 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 1295

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 117/281 (41%), Gaps = 49/281 (17%)

Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
           D   G  A N GL+       SR       G+   G+  +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559

Query: 185 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 228
           L+   LGR WR P QKP                  YV +L  F D  S   PFS+H    
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619

Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
            GK  G   G W GP     + + LA            S P     V    DG    + V
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLA-----------NSFPPCGLSVVSAADGSVFRSEV 668

Query: 289 VCIDDASRHCSVFSKGQADWTP------------ILLLVPLVLGLEKVNPRYIPTLRLTF 336
                AS   + ++ G     P            +L+++P  LGL+ VNP Y   ++   
Sbjct: 669 Y---QASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK--- 722

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
               S+GI GG+P +S Y V  Q  S  YLDPH  +P + +
Sbjct: 723 ----SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 31/49 (63%)

Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
           E    T+H D ++ + L  +DPS+ +GF C ++ +F+DFC R S+L  +
Sbjct: 931 ETALKTFHCDRVKKLPLSGLDPSMLLGFLCTNEAEFEDFCERVSRLPHK 979


>gi|216963257|gb|ACJ73915.1| autophagy-related 4b variant 3 [Zea mays]
          Length = 178

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 65/163 (39%), Positives = 88/163 (53%), Gaps = 32/163 (19%)

Query: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHER 97
           S+  K S+LS +F    ++FE   + S++   A   K    + A  R+     +RR+   
Sbjct: 45  SRQPKASVLSGVFAPPLAIFEGQQQVSSTPCDASSTKPPSGSYAWSRI-----LRRVS-- 97

Query: 98  VLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFD 157
                                     ++E  G +  ++G A F +DFSSRI I+YRKGFD
Sbjct: 98  -------------------------PEEEESGGSDSDSGHAAFLEDFSSRIWITYRKGFD 132

Query: 158 PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
            I  SK+TSDV WGCM+RSSQMLVAQAL+FH LGR WRKP +K
Sbjct: 133 AIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEK 175


>gi|401886473|gb|EJT50506.1| hypothetical protein A1Q1_00204 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 1295

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 117/281 (41%), Gaps = 49/281 (17%)

Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
           D   G  A N GL+       SR       G+   G+  +TSD GWGCMLR+ Q ++A A
Sbjct: 500 DAYFGAFAQNEGLSHSQTMMPSRQSGGGAWGWVKGGERGLTSDAGWGCMLRTGQSMLANA 559

Query: 185 LLFHRLGRPWRKPLQKPFDRE--------------YVEILHLFGD--SETSPFSIHNLLQ 228
           L+   LGR WR P QKP                  YV +L  F D  S   PFS+H    
Sbjct: 560 LIHLHLGRGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFAL 619

Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
            GK  G   G W GP     + + LA            S P     V    DG    + V
Sbjct: 620 IGKELGKEVGEWFGPSTAAGALKTLA-----------NSFPPCGLSVVSAADGSVFRSEV 668

Query: 289 VCIDDASRHCSVFSKGQADWTP------------ILLLVPLVLGLEKVNPRYIPTLRLTF 336
                AS   + ++ G     P            +L+++P  LGL+ VNP Y   ++   
Sbjct: 669 Y---QASNLPTDWTTGAKPSRPNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK--- 722

Query: 337 TFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
               S+GI GG+P +S Y V  Q  S  YLDPH  +P + +
Sbjct: 723 ----SVGIAGGRPSSSYYFVASQANSLFYLDPHFTRPAVPL 759



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 31/49 (63%)

Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
           E    T+H D ++ + L  +DPS+ +GF C ++ +F+DFC R S+L  +
Sbjct: 931 ETALKTFHCDRVKKLPLSGLDPSMLLGFLCTNEAEFEDFCERVSRLPHK 979


>gi|258566559|ref|XP_002584024.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237907725|gb|EEP82126.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 377

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 91/327 (27%), Positives = 128/327 (39%), Gaps = 105/327 (32%)

Query: 138 AEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGWGCML 174
           A F  DF SRI I+YR  F  I  SK                        T+D GWGCM+
Sbjct: 90  AAFLDDFESRIWITYRSNFPAIPKSKDPNAQQALTFSVRLRSQLLDTRGFTTDTGWGCMI 149

Query: 175 RSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY- 233
           RS Q L+A ALL  +LGR WR+  +     + + +L LF D   +PFSIH  ++ G A  
Sbjct: 150 RSGQSLLANALLIQKLGRDWRRGSET---GKEIALLSLFADRPQAPFSIHRFVEHGAAAC 206

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
           G   G W GP        A ARC        C+   + +YV S   D           +D
Sbjct: 207 GKHPGEWFGP-------SATARCIDE-----CEHAGLNVYVTSDGSD---------VHED 245

Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAST 353
             R  +    G  D  P L+L+ + LG++ + P Y   L+    +PQS+GI G       
Sbjct: 246 KFRQIA----GLDDIKPTLILLGVRLGIDSITPVYWDALKAIIQYPQSVGIAG------- 294

Query: 354 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 413
                                                      +H+  +DPS+ IGF  +
Sbjct: 295 ------------------------------------------RLHIKEMDPSMLIGFLIK 312

Query: 414 DKDDFDDFCARASKLAEESNGAPLFTV 440
           + DD+ D+  R       + G P+  V
Sbjct: 313 NNDDWHDWKHR----VRSAPGKPIIHV 335


>gi|336368847|gb|EGN97189.1| cysteine protease required for autophagy [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 873

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 108/353 (30%), Positives = 149/353 (42%), Gaps = 76/353 (21%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGFDPIGDSKI---------------------------- 164
           G+N    F  DF+SRI ++YR  F PI DS +                            
Sbjct: 291 GSNWPPVFYADFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPV 350

Query: 165 ------TSDVGWGCMLRSSQMLVAQALLFHRLGRP-WRKPLQKPFDRE---YVEILHLFG 214
                 TSD GWGCMLR+ Q L+A ALL   LGR  WR+P       +   YV+I+  F 
Sbjct: 351 GGEKGWTSDAGWGCMLRTGQSLLANALLHLHLGRADWRRPPYPVHTTDYATYVQIITWFF 410

Query: 215 D--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAI 272
           D  S  SPFS+H +  AGK  G   G W GP     + + L      E GLG       +
Sbjct: 411 DTPSPQSPFSVHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHA-FPEAGLGVSVASDGV 469

Query: 273 YVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
              S     +   A    I    RH  V   G+A    +++L+ + LGL+ VNP Y  T+
Sbjct: 470 IFQS-----DVYAASNAYIGSPRRHAKVSWGGRA----VIVLIGIRLGLDGVNPIYYDTI 520

Query: 333 RLT-----------FTFPQSLGIVGGKPGASTYIV----------GVQEESAIYLDPHDV 371
           +++            T P + G     P AS  I           G  E +   LDP   
Sbjct: 521 KVSIRTLRPYRWILMTVPYTSGFNASLP-ASPEISSDMDVRELGWGDSEGAGEALDPMAE 579

Query: 372 QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
             V     D L     T+H D +R + +  +DPS+ +GF C+D++D+ DF  R
Sbjct: 580 HYVNAYSPDQLR----TFHCDRVRKMPMSGLDPSMLLGFLCKDENDWFDFRRR 628


>gi|37362688|ref|NP_014176.2| Atg4p [Saccharomyces cerevisiae S288c]
 gi|61252248|sp|P53867.2|ATG4_YEAST RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|166990654|sp|A6ZRL7.1|ATG4_YEAS7 RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|1173491|gb|AAA86498.1| ORF494 [Saccharomyces cerevisiae]
 gi|151944321|gb|EDN62599.1| cysteine protease [Saccharomyces cerevisiae YJM789]
 gi|190409197|gb|EDV12462.1| anchor protein [Saccharomyces cerevisiae RM11-1a]
 gi|285814439|tpg|DAA10333.1| TPA: Atg4p [Saccharomyces cerevisiae S288c]
 gi|323352870|gb|EGA85172.1| Atg4p [Saccharomyces cerevisiae VL3]
 gi|392297128|gb|EIW08229.1| Atg4p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 494

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 409 GFYCRDKDDF 418
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|349580723|dbj|GAA25882.1| K7_Atg4p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 494

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 409 GFYCRDKDDF 418
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|365763488|gb|EHN05016.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 494

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 409 GFYCRDKDDF 418
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|167521501|ref|XP_001745089.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776703|gb|EDQ90322.1| predicted protein [Monosiga brevicollis MX1]
          Length = 392

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 86/310 (27%), Positives = 142/310 (45%), Gaps = 49/310 (15%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPL 198
           +   D ++RI  +YRK F P+  S+ T+DVGWGCMLR  QM++A  L+           +
Sbjct: 119 QLEDDVATRIWFTYRKDFPPLPSSRRTTDVGWGCMLRCGQMILATTLM----------AV 168

Query: 199 QKPFDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGPYAMCRSWEALARCQ 257
            +P       + HL        +++ N  L+AG+  G ++   VG   + +   ALA+  
Sbjct: 169 LQP------RVHHLLK------YTMENHHLKAGRFQGPSS---VGSALLHQVPSALAQLN 213

Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 317
           +       + + +  Y  S            + I D  R      +GQA++ PI+L++PL
Sbjct: 214 QFRD----EEVKLRTYFASD----------TLVILDQLRP----EEGQAEFEPIMLVLPL 255

Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
            LG+EK+ P+Y   L+L    P  +G +GG    + YI G Q      LDPH     +  
Sbjct: 256 RLGIEKIGPQYHARLQLLLRQPWCMGFIGGHDKRAMYIFGYQGHQYFGLDPHRCSAAVAQ 315

Query: 378 GKDDLEAD----TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK-LAEES 432
              +L         ++H+  +  I  D +DPSLA+    R  ++ DD  +   +  +E+ 
Sbjct: 316 STAELRDRWVEVRDSFHTSKLSGIERDDLDPSLAVFLLARTAEELDDMLSVIGQPTSEDR 375

Query: 433 NGAPLFTVTQ 442
            G  L +V Q
Sbjct: 376 PGPALVSVVQ 385


>gi|323307493|gb|EGA60764.1| Atg4p [Saccharomyces cerevisiae FostersO]
          Length = 494

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 409 GFYCRDKDDF 418
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|323346814|gb|EGA81093.1| Atg4p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 494

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 409 GFYCRDKDDF 418
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|256272398|gb|EEU07381.1| Atg4p [Saccharomyces cerevisiae JAY291]
          Length = 494

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 409 GFYCRDKDDF 418
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|323335883|gb|EGA77161.1| Atg4p [Saccharomyces cerevisiae Vin13]
          Length = 494

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 257

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 409 GFYCRDKDDF 418
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|260823874|ref|XP_002606893.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
 gi|229292238|gb|EEN62903.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
          Length = 384

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 51/139 (36%), Positives = 80/139 (57%), Gaps = 6/139 (4%)

Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
           +W  +++L+P+ LG E +NP Y P ++  FT    LG++GG+P  S Y VG QE+  I+L
Sbjct: 203 NWCSVIILIPVRLGGESLNPIYEPCIKGLFTMDHCLGVIGGRPKHSLYFVGFQEDKLIHL 262

Query: 367 DPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
           DPH  Q V+++   D   +  ++H    R + +  +DPS  IGFYCR +DDF+ FC   +
Sbjct: 263 DPHFCQEVVDMTPRDFPLE--SFHCMNPRKMSIARMDPSCTIGFYCRTRDDFNKFCTTVT 320

Query: 427 KLAEESNGA----PLFTVT 441
           +      G     P+F V+
Sbjct: 321 EEMLRQPGPKADYPMFIVS 339



 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 7/91 (7%)

Query: 113 IWLLGVCHKIAQDE------ALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKIT 165
           IWL GVC+    +E       L D+       E F +DF+S++ ++YR+ F  +  S  T
Sbjct: 88  IWLQGVCYHRRNEELTKELEPLTDSDRRLYTMELFKRDFASKVWLTYRREFPQLAGSMFT 147

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
           +D GWGCMLRS QML+A  L+ H LGR +++
Sbjct: 148 TDCGWGCMLRSGQMLLAGGLVMHFLGRVYKQ 178


>gi|323303340|gb|EGA57136.1| Atg4p [Saccharomyces cerevisiae FostersB]
          Length = 494

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 85  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 144

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 203

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGBIYENEVEKV 257

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 258 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 303

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 409 GFYCRDKDDF 418
           G   + + D+
Sbjct: 358 GILIKGEKDW 367


>gi|1183991|emb|CAA93375.1| N1274 [Saccharomyces cerevisiae]
 gi|1302243|emb|CAA96126.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 506

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 97  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 369

Query: 409 GFYCRDKDDF 418
           G   + + D+
Sbjct: 370 GILIKGEKDW 379


>gi|259149141|emb|CAY82383.1| Atg4p [Saccharomyces cerevisiae EC1118]
          Length = 506

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 129/310 (41%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 97  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKV 269

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ I
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLI 369

Query: 409 GFYCRDKDDF 418
           G   + + D+
Sbjct: 370 GILIKGEKDW 379


>gi|145526665|ref|XP_001449138.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416715|emb|CAK81741.1| unnamed protein product [Paramecium tetraurelia]
          Length = 406

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 94/373 (25%), Positives = 159/373 (42%), Gaps = 60/373 (16%)

Query: 135 NGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
           N + +  QD    I I+YR+ F P+  S   SD GWGCMLR  QM +AQ L  H      
Sbjct: 57  NKIKQLVQD---TIWITYRRNFPPLYQSNYISDTGWGCMLRVGQMAMAQMLKKHLKNHGD 113

Query: 195 RKPLQKPFDREYVEILHLFGDSETS----------------------PFSIHNL-LQAGK 231
           ++      D +Y  IL  F D+++                       PFSI  +   A K
Sbjct: 114 KR------DEDYDNILLAFADNDSQECKEFIEFQNKKEKQKVHNFICPFSIQKIAYLAKK 167

Query: 232 AYGLAAGSWVGP-YAM-----------CRSWEALARCQRAETGLGCQSLPMAIYVVSGDE 279
            + L  G W  P Y +            R+ E L      ++ L    L   ++ +  + 
Sbjct: 168 EFNLDPGEWYKPNYILFLLEELHNTIPIRASENLKLSVFNDSCLFLDQLMNRMFDIKFET 227

Query: 280 DGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFP 339
           D +        +++      + SK       + + V   +GL++ N +Y+  L      P
Sbjct: 228 DKD--------LEEQLEKTQLKSKN-----SLAIFVLTRIGLDEPNQKYLKVLDELMELP 274

Query: 340 QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK--DDLEADTSTYHSDVIRHI 397
              GIVGG P  + YI+G   +  IYLDPH VQ   N G+  ++   + ++Y    I  +
Sbjct: 275 YFQGIVGGTPKRAFYILGRINDHYIYLDPHYVQEAENKGQIIENKMFNRTSYSCKYIHLL 334

Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGET 457
           +   +D S+ + +Y R+K +   F     K+ ++S+   +F ++ T  + V++S+ L E+
Sbjct: 335 NQKHVDTSMGLSYYIRNKSELLQFWRDMKKIKQKSDDFFIF-LSDTTPEYVDYSNQLEES 393

Query: 458 GGVPEDDSLGVMS 470
                DD +  + 
Sbjct: 394 SNKLNDDDVVFLQ 406


>gi|401624007|gb|EJS42084.1| atg4p [Saccharomyces arboricola H-6]
          Length = 494

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 86/310 (27%), Positives = 132/310 (42%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 169
           EF  D  SR+  +YR  F PI     G S ++                        +D+G
Sbjct: 85  EFLLDVRSRVNFTYRTRFIPIPRAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 144

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R   +K   RE  +I+  F D+  +PFSIHN +  
Sbjct: 145 WGCMIRTGQSLLGNALQILHLGRDFRVDNEKSLKRES-KIVTWFNDTPEAPFSIHNFVST 203

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L           C      + V SGD    +     
Sbjct: 204 GTELSDKRPGEWFGPAATARSIQSLIYGFPE-----CGITDCVVSVSSGDI--YQNEVEK 256

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
           + +++               + IL L+ + LG+  VN  Y  ++       +S+GI GG+
Sbjct: 257 IYVENPD-------------SIILFLLGVKLGINAVNESYRESICGILNSARSVGIAGGR 303

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    +Y DPH  QP +       E+   + H+     + L  +DPS+ I
Sbjct: 304 PSSSLYFFGYQGNQFLYFDPHIPQPAVE------ESFVESCHTSKFGKLQLSEMDPSMLI 357

Query: 409 GFYCRDKDDF 418
           G   + ++D+
Sbjct: 358 GVLIKGEEDW 367


>gi|358339268|dbj|GAA47364.1| autophagy-related protein 4 [Clonorchis sinensis]
          Length = 700

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 59/149 (39%), Positives = 84/149 (56%), Gaps = 10/149 (6%)

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAI 364
           A W P+LL +PL LGL + NP Y   ++     P S+GI+GG+P  + +IVG   +E  +
Sbjct: 259 ATWRPLLLFIPLRLGLHQPNPCYFNAIKAILQIPHSIGIMGGRPSHAVWIVGTAGDEDLL 318

Query: 365 YLDPHDVQPVINIGKDDLEA-DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
            LDPH  QP     +DDL A D  T+H D    + L+ +DPS+ IGF C  +D+FD  CA
Sbjct: 319 CLDPHTTQPA---SQDDLTAEDDVTHHCDCPVRLPLERLDPSMVIGFVCTTEDEFDQLCA 375

Query: 424 RASK---LAEESNGAPLFTVTQTHKKPVN 449
              +     E + G PLF V ++  +P N
Sbjct: 376 HLERDVLSVETTCGHPLFEVHKS--RPSN 402



 Score = 41.6 bits (96), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 35/66 (53%), Gaps = 3/66 (4%)

Query: 179 MLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
           M++A+A+    LG+ WR  P  +  D  Y  +  +F D ++S +SI N+   G A     
Sbjct: 1   MMLAEAITRIHLGKDWRWTPGCQ--DEAYCRLRRMFQDHKSSLYSIQNITMLGMALDKPI 58

Query: 238 GSWVGP 243
           GSW GP
Sbjct: 59  GSWFGP 64


>gi|402593880|gb|EJW87807.1| hypothetical protein WUBG_01286, partial [Wuchereria bancrofti]
          Length = 216

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 52/154 (33%), Positives = 84/154 (54%), Gaps = 14/154 (9%)

Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
           +W P+L+++PL LGL  +N  Y P ++  F  PQ +GI+GG+P  + Y  G+ + + +YL
Sbjct: 28  EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 87

Query: 367 DPHDVQPVINIG--------KDDL------EADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
           DPH  Q  +++         +DD       E   STYH   I    +D +DPSLA+GF+C
Sbjct: 88  DPHFCQNFVDLDETTTTRDERDDYVEIKNDEFKDSTYHCPFILSTKIDKVDPSLALGFFC 147

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
             +DD+++   R       ++  PLF + +T  K
Sbjct: 148 HTEDDYNELAKRLRTHLLPASTPPLFEMLETRPK 181


>gi|390344344|ref|XP_786847.3| PREDICTED: uncharacterized protein LOC581768 [Strongylocentrotus
           purpuratus]
          Length = 1018

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 57/144 (39%), Positives = 81/144 (56%), Gaps = 10/144 (6%)

Query: 113 IWLLGVC-HKIAQDEALGDAAGNNGLAE-----FNQDFSSRILISYRKGFDPIGDSKITS 166
           IW LG C H+  +D       G + +       F QDFSSR+ ++YR+ F  +  S  TS
Sbjct: 346 IWFLGKCYHQRPEDPDPERPPGMDSVRSMVIEMFKQDFSSRLWMTYRREFPTLAGSNFTS 405

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWR--KPLQKPFDREYVEILHLFGDS--ETSPFS 222
           D GWGCMLRS QM++A +L+ H LGR W   KP  +   + + +I+  FGD   + SPFS
Sbjct: 406 DCGWGCMLRSGQMMLAHSLILHFLGREWNIYKPQTQEMLQFHRQIVRWFGDQPLDMSPFS 465

Query: 223 IHNLLQAGKAYGLAAGSWVGPYAM 246
           +H L+  G+  G   G W GP ++
Sbjct: 466 VHRLVGIGQNNGKKVGDWYGPSSV 489



 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 54/154 (35%), Positives = 83/154 (53%), Gaps = 6/154 (3%)

Query: 291 IDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPG 350
           ID +    S  ++G   W  +++++P+ LG ++VNP YI  ++  FT    LGI+GGKP 
Sbjct: 819 IDPSRSRTSTSTEGGKPWCAVVIMIPVRLGGDEVNPVYIRPIQSLFTLESCLGIIGGKPK 878

Query: 351 ASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
            S + VG QEE  I+LDPH  Q V+++   D      ++H    R + +  +DPS  IGF
Sbjct: 879 HSLFFVGFQEEKLIHLDPHYCQQVVDMKTRDFPL--WSFHCMSPRKMSISKMDPSCTIGF 936

Query: 411 YCRDKDDFDDFCAR----ASKLAEESNGAPLFTV 440
           Y R ++ F+  C       S L   S+  P+F V
Sbjct: 937 YIRTEEQFEQLCKELPTVVSPLGSHSSDYPMFIV 970


>gi|149422017|ref|XP_001518728.1| PREDICTED: cysteine protease ATG4D-like [Ornithorhynchus anatinus]
          Length = 286

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 50/141 (35%), Positives = 79/141 (56%), Gaps = 3/141 (2%)

Query: 305 QADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAI 364
           +A+W  I++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +
Sbjct: 109 EAEWKSIIILVPVRLGGETLNPAYMPCIKELLRMEPCLGIIGGKPKHSLYFIGYQDDFLL 168

Query: 365 YLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
           YLDPH  QP ++  KD    +  ++H    R +    +DPS  +GFY   + DF+  C++
Sbjct: 169 YLDPHYCQPCVDTMKDSFPLE--SFHCTAPRKLPFAKMDPSCTVGFYAGTRKDFEALCSQ 226

Query: 425 -ASKLAEESNGAPLFTVTQTH 444
               L   +   P+FTV + H
Sbjct: 227 LLQALNSTATRYPMFTVAEGH 247


>gi|170572866|ref|XP_001892265.1| Peptidase family C54 containing protein [Brugia malayi]
 gi|158602497|gb|EDP38912.1| Peptidase family C54 containing protein [Brugia malayi]
          Length = 440

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 52/155 (33%), Positives = 83/155 (53%), Gaps = 16/155 (10%)

Query: 307 DWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYL 366
           +W P+L+++PL LGL  +N  Y P ++  F  PQ +GI+GG+P  + Y  G+ + + +YL
Sbjct: 252 EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELPQCVGIIGGRPNHALYFCGIVDNNLLYL 311

Query: 367 DPHDVQPVINIG---------------KDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
           DPH  Q  +++                K+D E   STYH   I    +D +DPSLA+GF+
Sbjct: 312 DPHFCQNFVDLDEATTTKDERGDYVEIKND-EFRDSTYHCPFILSTKIDKVDPSLALGFF 370

Query: 412 CRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
           C  +DD+ +   R       ++  PLF + +T  K
Sbjct: 371 CHTEDDYSELANRLRTHLLPASTPPLFEMLETRPK 405



 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 44/122 (36%), Positives = 57/122 (46%), Gaps = 28/122 (22%)

Query: 128 LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
           LG+   + G +A   +  +S +  +YRK F PIG +  T+D GWGCMLR  QML+A+ L+
Sbjct: 59  LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 118

Query: 187 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 241
              LGR W       +DR     EY  IL   G SE                G   G W 
Sbjct: 119 VRHLGRNWL------WDRDVMLTEYKRILPNMGVSE----------------GKEIGEWF 156

Query: 242 GP 243
           GP
Sbjct: 157 GP 158


>gi|145553267|ref|XP_001462308.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124430147|emb|CAK94935.1| unnamed protein product [Paramecium tetraurelia]
          Length = 389

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 93/340 (27%), Positives = 137/340 (40%), Gaps = 45/340 (13%)

Query: 130 DAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-LFH 188
           D A +  + +    F   I  SYR     +  S +TSD GWGCMLR  QM + Q +  F+
Sbjct: 47  DLAVDQKMEKLKSLFEGTIWFSYRSKILQLQYSTLTSDTGWGCMLRVGQMAMCQQIKYFY 106

Query: 189 RLGRPWRKPLQKPFDREYVEILHLFGDSE-------------------TSPFSIHNLL-Q 228
            L             +E  E++  F D++                    SPFSI  ++ Q
Sbjct: 107 NLSSS----------QELTELIQQFADNDEEELSKFMDRNDGDQTIQYKSPFSIQKIVVQ 156

Query: 229 AGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED---GERGG 285
                  + G W  P  +    + L R  + +  L         +++S        + GG
Sbjct: 157 TKLELQKSPGEWYKPNDILFVLKYLFRYSKYQKNLRMHINHENAFILSDVISLMFNKNGG 216

Query: 286 APVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIV 345
                  D         KGQ D   + + +   +GL+  N  Y+  L    T+PQ  GI+
Sbjct: 217 -------DEEWLKEQIEKGQNDEFGVSIFILTRIGLDTCNQEYLKVLNDIMTYPQFQGIL 269

Query: 346 GGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS 405
           GG P  + YI+G      IYLDPH VQ   N    ++E D S+Y    I+ I  + +DPS
Sbjct: 270 GGFPNKALYILGRVGNYYIYLDPHYVQNAQNY--QEMENDRSSYTCQSIQLIDSNQLDPS 327

Query: 406 LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT-VTQTH 444
           +AI F C           R  K  +  NG   F  +T+TH
Sbjct: 328 MAISF-CVKNALDLLDLWRRLKQTKSENGESFFMALTETH 366


>gi|407408842|gb|EKF32115.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
          Length = 357

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/277 (26%), Positives = 125/277 (45%), Gaps = 35/277 (12%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 204
           +  SYR    P+ +   T+D+ WGCM+R+ QM++A A + +   G P    + LQ+   R
Sbjct: 74  LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGNERLQELRAR 132

Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
                  LF D  ++PF IH +   G  +G+  G W GP  + ++  AL           
Sbjct: 133 TQT----LFCDVPSAPFGIHAITSEGTKHGVKCGEWFGPTPIAKTLNAL----------- 177

Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
                MA Y+ +G E     G  V+   +         +       ++LL+P++LG+  +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEQVKELLRQSMHVVLLIPVMLGIRVI 227

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
           + +Y   ++       S+GI+GGK  ++ ++ G Q++   +LDPH VQP      +  E 
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHRVQPAFTSSGNSGEL 287

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
             +       R +   S D S+ +GFY    D F  F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSFAVF 318


>gi|403216261|emb|CCK70758.1| hypothetical protein KNAG_0F00890 [Kazachstania naganishii CBS
           8797]
          Length = 448

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 157/363 (43%), Gaps = 68/363 (18%)

Query: 135 NGLAEFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------ 165
           N   +F +D  +R+  +YR  F PI     G S I+                        
Sbjct: 38  NEKMQFYRDVCTRLNFTYRTKFVPISRSPDGPSPISFQLMIRDGPLSVIENALLHPDCFN 97

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHN 225
           +D+GWGCM+R+ Q L+  AL   R GR +R       D    +I+  F D+  +PFS+HN
Sbjct: 98  TDIGWGCMIRTGQSLLGNALQRLRHGREFRVTESTHDD----DIIQWFKDTPDAPFSLHN 153

Query: 226 LLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERG 284
            ++ G +   +  G W GP A  RS ++L  C   + G+        I  VS  +  ++ 
Sbjct: 154 FVKKGVELADMKPGQWFGPAATSRSIQSLI-CNFPQCGID-----HCIVSVSSADIYKQD 207

Query: 285 GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGI 344
              +   D  S               +L+L  + LG+  VN  Y   +R       S+GI
Sbjct: 208 VEDMFDADPDSN--------------LLILFGVKLGVSAVNASYWEDIRRLLNSKFSVGI 253

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
            GG+P +S Y  G Q +  +Y DPH  QP +    DD  A  +T HS     + L  +DP
Sbjct: 254 AGGRPSSSLYFFGYQNQELLYFDPHTPQPSL---IDD--AAFNTCHSIEFGKLELRDMDP 308

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDD 464
           S+ IG     + D++++    ++  E S    +F + +   +   + DV  +  G   D+
Sbjct: 309 SMLIGIMIEGERDWENW----ARFTETSK---IFNILEERSEDCINVDV--DIDGDENDE 359

Query: 465 SLG 467
           ++G
Sbjct: 360 NIG 362


>gi|340059839|emb|CCC54236.1| putative peptidase [Trypanosoma vivax Y486]
          Length = 354

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 133/299 (44%), Gaps = 32/299 (10%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA-LLFHRLGRPWRKPLQKPFDREY 206
           +  SYR GF P+ +   T+DV WGC++R++QML+AQA + F   G  +         RE 
Sbjct: 69  LYFSYRCGFTPLSNGS-TTDVAWGCVVRAAQMLLAQAHMRFFNSGHAFVDGSALQILREK 127

Query: 207 VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
           V+   LF D  ++PF IH +    + YG+A G W G     ++  +L +      G G  
Sbjct: 128 VQ--PLFLDDPSAPFGIHAMTSEAEKYGVACGQWFGMTPAAKTIASLCQQHSLRGGNG-- 183

Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNP 326
             P  +  V    D E     V  +   SR              ++LL+P VLGL++++ 
Sbjct: 184 --PAVLVFV----DREVSALKVRDLLSHSRQ-------------VVLLIPAVLGLDRISV 224

Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT 386
           +Y   L         +G++GG+  ++ Y VG Q  + IYLDPH  Q          E  T
Sbjct: 225 KYSKMLIRCLEMESCIGVIGGRKSSALYFVGHQSNNIIYLDPHRAQRAFTEVASPGEL-T 283

Query: 387 STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK 445
             +H      + + +   S+  GFY    + F  F A   + A  +   PL +V  + +
Sbjct: 284 GAWHL-----LPVTACSTSILFGFYIDSLESFKQFEADMLE-ANSALAFPLISVATSER 336


>gi|426387285|ref|XP_004060104.1| PREDICTED: cysteine protease ATG4D [Gorilla gorilla gorilla]
          Length = 362

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 86/149 (57%), Gaps = 7/149 (4%)

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 183 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 242

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 243 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 300

Query: 426 SKLAEESNGA---PLFTVTQTHKKPVNHS 451
           +++   S+     P+FT+ + H +  +HS
Sbjct: 301 TRVLSSSSATERYPMFTLAEGHAQ--DHS 327



 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
           +S S I  + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+    +TSD
Sbjct: 88  TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 141

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR 195
            GWGCMLRS QM++AQ LL H L R ++
Sbjct: 142 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 169


>gi|119604525|gb|EAW84119.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_d
           [Homo sapiens]
          Length = 360

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 86/149 (57%), Gaps = 7/149 (4%)

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 181 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLY 240

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 241 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 298

Query: 426 SKLAEESNGA---PLFTVTQTHKKPVNHS 451
           +++   S+     P+FT+ + H +  +HS
Sbjct: 299 TRVLSSSSATERYPMFTLAEGHAQ--DHS 325



 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
           +S S I  + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+    +TSD
Sbjct: 86  TSFSKISSIHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPGGCLTSD 139

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR 195
            GWGCMLRS QM++AQ LL H L R ++
Sbjct: 140 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 167


>gi|123407417|ref|XP_001303004.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121884346|gb|EAX90074.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 298

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 81/304 (26%), Positives = 135/304 (44%), Gaps = 46/304 (15%)

Query: 151 SYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVE 208
           +Y K F P+     T+D  WGC +RS+Q L+ Q +  L+  LG   R     P + +Y  
Sbjct: 28  TYHKNFAPL-QGGFTTDKNWGCCIRSAQGLIMQFITKLYKHLGDDIRNIF--PTNSKY-- 82

Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSL 268
              LF D   SPF + ++    ++YG+  G WV P  +    + +    R          
Sbjct: 83  --ELFYDLPHSPFGLPHICAELQSYGVMPGEWVKPSLLAPVIKEIMNFFRI--------- 131

Query: 269 PMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRY 328
                             PVV  +       V ++  +   P+LLL  L+LG E    +Y
Sbjct: 132 ------------------PVVIAEHGCLSREVLNEALSHNIPVLLLFTLMLGYENFELKY 173

Query: 329 IPTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTS 387
           +P L+LT +   QS+G+VGG+ G + +IVG Q+E  +Y DPHDV    +I K D     +
Sbjct: 174 LPFLKLTLSLIYQSVGVVGGQQGKAYFIVGHQKEKLLYFDPHDVNE--SITKID---QIN 228

Query: 388 TYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
                 ++ +  D++  S+ +GF+  +  D ++       L  +S   P+  V +  +  
Sbjct: 229 QLFKPPLKVMPADTLSSSMLVGFFITNLQDAEEL----PMLLNQSGECPIHIVDKIEEAK 284

Query: 448 VNHS 451
             H+
Sbjct: 285 ETHT 288


>gi|151556001|gb|AAI49850.1| ATG4D protein [Bos taurus]
          Length = 359

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 86/149 (57%), Gaps = 7/149 (4%)

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           A+W  +++LVP+ LG E +NP Y+P ++        LGI+GGKP  S Y +G Q++  +Y
Sbjct: 180 AEWKSVVILVPVRLGGETLNPVYVPCVKELLRSELCLGIMGGKPRHSLYFIGYQDDFLLY 239

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARA 425
           LDPH  QP +++ + D   +  ++H    R +    +DPS  +GFY  D+ +F+  C+  
Sbjct: 240 LDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSEL 297

Query: 426 SKLAEESNGA---PLFTVTQTHKKPVNHS 451
           +++   S+     P+FT+ + H +  +HS
Sbjct: 298 TRVLSSSSATERYPMFTLVEGHAQ--DHS 324



 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 53/88 (60%), Gaps = 6/88 (6%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSD 167
           +S S I  + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+    +TSD
Sbjct: 85  TSFSKISSVHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLAGGSLTSD 138

Query: 168 VGWGCMLRSSQMLVAQALLFHRLGRPWR 195
            GWGCMLRS QM++AQ LL H L R ++
Sbjct: 139 CGWGCMLRSGQMMLAQGLLLHFLPRVYK 166


>gi|66822477|ref|XP_644593.1| autophagy protein 4 [Dictyostelium discoideum AX4]
 gi|66822607|ref|XP_644658.1| autophagy protein 4 [Dictyostelium discoideum AX4]
 gi|74857708|sp|Q557H7.1|ATG4_DICDI RecName: Full=Cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|60472726|gb|EAL70676.1| autophagy protein 4 [Dictyostelium discoideum AX4]
 gi|60472781|gb|EAL70731.1| autophagy protein 4 [Dictyostelium discoideum AX4]
          Length = 745

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 81/135 (60%), Gaps = 5/135 (3%)

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++++PL LG +K+N  YI  L+L    PQSLG +GGKP  S Y +G Q++  IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH VQ  +N    D    ++TY   + + +    +DPSL+IGFYCRD+  F+D C R S 
Sbjct: 563 PHFVQESVNPNSFDY---SNTYSGCIPQKMPFTQLDPSLSIGFYCRDQASFEDLCDRLSV 619

Query: 428 LAEESNGAPLFTVTQ 442
           +   +   P+ +V Q
Sbjct: 620 I--NNCEFPIISVCQ 632



 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 197
           F  D +S I  SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H        P  
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289

Query: 198 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 232
             +KP    Y ++L  F D  S+   + IH ++   +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326


>gi|28395487|gb|AAO39081.1| autophagy protein 4 [Dictyostelium discoideum]
          Length = 745

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 81/135 (60%), Gaps = 5/135 (3%)

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           W  +++++PL LG +K+N  YI  L+L    PQSLG +GGKP  S Y +G Q++  IYLD
Sbjct: 503 WKSLIIMIPLKLGADKLNSTYIEKLKLLLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLD 562

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASK 427
           PH VQ  +N    D    ++TY   + + +    +DPSL+IGFYCRD+  F+D C R S 
Sbjct: 563 PHFVQESVNPNSFDY---SNTYSGCIPQKMPFTQLDPSLSIGFYCRDQASFEDLCDRLSV 619

Query: 428 LAEESNGAPLFTVTQ 442
           +   +   P+ +V Q
Sbjct: 620 I--NNCEFPIISVCQ 632



 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 6/98 (6%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP-- 197
           F  D +S I  SYRK F PI ++ IT+D+GWGCMLR+ QM++A+AL+ H        P  
Sbjct: 230 FLSDVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEI 289

Query: 198 -LQKPFDREYVEILHLFGD--SETSPFSIHNLLQAGKA 232
             +KP    Y ++L  F D  S+   + IH ++   +A
Sbjct: 290 ERKKP-HSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQA 326


>gi|363754893|ref|XP_003647662.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356891299|gb|AET40845.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 469

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 92/311 (29%), Positives = 137/311 (44%), Gaps = 52/311 (16%)

Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKI------------------------TSDVG 169
           EF +D +SR+  +YR  F PI     G S +                         +D+G
Sbjct: 62  EFLKDVNSRLHFTYRTRFAPIPRHIDGPSPMRISILLRDNPLNVIENVLNNLDCFQTDIG 121

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ 228
           WGCM+R+ Q L+A AL    LGR +R        +   ++I+  F D+   PFS+H  +Q
Sbjct: 122 WGCMIRTGQSLLANALQLANLGRDFRISGSDSDINEVEMKIIRWFEDNPKHPFSLHKFVQ 181

Query: 229 AG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAP 287
            G K  G   G W GP A+ RS  +L           C        ++S D       + 
Sbjct: 182 EGYKLSGKKPGEWFGPSAISRSIRSLVMKFPGSGIDHC--------IISTD-------SA 226

Query: 288 VVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGG 347
            V +D+         K        LLL+ + LG++  N  Y   ++   +  QS+GI GG
Sbjct: 227 DVYLDEIDPLFRANPKANV-----LLLLGVRLGVDFTNEYYWDDIKNILSSSQSVGISGG 281

Query: 348 KPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLA 407
           +P +S Y  G Q +   YLDPH VQ  + + + D E    + H      IHL +IDPS+ 
Sbjct: 282 RPSSSLYFFGYQGDYLFYLDPHKVQLNLALYESD-EERFHSVHPQTFNKIHLSAIDPSML 340

Query: 408 IGFYCRDKDDF 418
           +GF    +DD+
Sbjct: 341 LGFLLTGEDDW 351


>gi|255722127|ref|XP_002545998.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240136487|gb|EER36040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 444

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 91/312 (29%), Positives = 138/312 (44%), Gaps = 71/312 (22%)

Query: 146 SRILISYRKGFDPIGDSK----------------------------------ITSDVGWG 171
           SR+ +SYR GFDPI  ++                                   TSD GWG
Sbjct: 84  SRLWLSYRCGFDPIPKAEDGPQPIQFFPSIIFNKTTIYSNFANLKSLFDKENFTSDAGWG 143

Query: 172 CMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AG 230
           CM+R+SQ L+A  LL              P D +  +++ LF D+++SPFSIHN ++ AG
Sbjct: 144 CMIRTSQNLLANTLL-----------QLLPPDSKQ-DVIGLFQDNQSSPFSIHNFIKVAG 191

Query: 231 KA-YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
           ++   +  G W GP A   S + L    + +   G +   + I   S   DGE       
Sbjct: 192 ESPLQVKPGQWFGPNAASLSIKRLTDTLQDKEIKGVKYPKVFISENSDLYDGEINEI--- 248

Query: 290 CIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP 349
                     +  +G++    +L+L P+ LG++KVN  Y  ++        S GI GGKP
Sbjct: 249 ----------LSEEGRS----VLVLFPIRLGIDKVNSYYYDSIFQVLKSKFSCGISGGKP 294

Query: 350 GASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIG 409
            +S Y +G      IY DPH  Q V N        +  +YH+     +++  +DPS+ IG
Sbjct: 295 SSSFYFLGYDNSDLIYFDPHLPQLVEN------PINIESYHTRNYNRLNISLLDPSMMIG 348

Query: 410 FYCRDKDDFDDF 421
              R  DD+ +F
Sbjct: 349 ILLRSMDDYLEF 360


>gi|365758760|gb|EHN00587.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 485

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 128/310 (41%), Gaps = 57/310 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 76  EFLLDVRSRVNFTYRTRFVPIARAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIG 135

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R      F RE   I++ F D+  +PFS+HN +  
Sbjct: 136 WGCMIRTGQSLLGNALQILHLGRDFRVDEDDDFRRE-SRIVNWFNDTPEAPFSLHNFVST 194

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS + L      E G+        + V SG  D        
Sbjct: 195 GTELSDKRPGEWFGPAATARSIQYLIY-GFPECGINA----CIVSVSSG--DIYENEVEE 247

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
           V +D+ +             + IL L+ + LG+  VN  Y  ++        S+GI GG+
Sbjct: 248 VFVDNPN-------------SSILFLLGVKLGINAVNESYRESICGILNSAWSVGIAGGR 294

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++  ++ H+     + L  +DPS+ I
Sbjct: 295 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVNSCHTSKFGRLQLSEMDPSMLI 348

Query: 409 GFYCRDKDDF 418
           G   + + D+
Sbjct: 349 GVLIKGEKDW 358


>gi|410075557|ref|XP_003955361.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
 gi|372461943|emb|CCF56226.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
          Length = 463

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 90/312 (28%), Positives = 129/312 (41%), Gaps = 67/312 (21%)

Query: 135 NGLAEFNQDF----SSRILISYRKGFDPIGDSK--------------------------- 163
           N  +  NQDF    +SR+  +YR  F PI  S                            
Sbjct: 52  NRNSNLNQDFLSDVNSRLAFTYRTKFQPILRSSEGPSPLNFRMIFRDNPINTLENVINNP 111

Query: 164 --ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
               +D+GWGCM+R+ Q L+  AL   +LGR +R  L      +  EI+  F D+   PF
Sbjct: 112 DCFNTDIGWGCMIRTGQSLLGNALQLAKLGRHFR--LDNKMGIKDDEIISWFRDTTQEPF 169

Query: 222 SIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
           SIH  ++ G K      G W GP A   S ++L   +  E G+        + V SGD  
Sbjct: 170 SIHKFVEKGNKLANKKPGEWFGPAATSISIQSLIE-EFPECGID----KCLVSVSSGD-- 222

Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
                      +D  R   +F +     + IL L+ + LGL+ VN  Y   +        
Sbjct: 223 ---------IFEDDVRE--IFEENMD--SKILFLMGVKLGLDAVNSFYWEDILNILDSKF 269

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTY---HSDVIRHI 397
           S+GI GG+P +S Y  G Q    +Y DPH  QP +         D S Y   H+     +
Sbjct: 270 SVGIAGGRPSSSLYFFGHQGNELLYFDPHRPQPSL--------VDPSVYETCHTTNFGKL 321

Query: 398 HLDSIDPSLAIG 409
            +  +DPS+ IG
Sbjct: 322 DIKDMDPSMLIG 333


>gi|118390095|ref|XP_001028038.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89309808|gb|EAS07796.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 1216

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 87/362 (24%), Positives = 154/362 (42%), Gaps = 79/362 (21%)

Query: 142 QDFSSRILISYRKGFDPIGDSKI-------TSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
           Q + + IL +YRK F P+   KI       TSD GWGCM+R+ QM+ AQ +  H     +
Sbjct: 257 QIYQNTILFTYRKNFYPLLKDKINDPQKNQTSDAGWGCMIRAGQMIFAQTIKRHLKKTDY 316

Query: 195 RKP----------LQKPFDRE----YVEILHLFGDSETSPFSIHNLL-QAGKAYGLAAGS 239
            +           L++   +E    Y+     +      P+SIH +  +A   Y +  G 
Sbjct: 317 IEQHQLINIIIGFLEEEEVQEGGKGYIFNQQSYIQDRIRPYSIHQITNRAFCKYKIQPGQ 376

Query: 240 WVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED-----------GERGGAPV 288
           W  P  +    + L +  + +   G ++L + ++  S D+            G +G   +
Sbjct: 377 WYTPNQIAIILKELHKKNKIK---GTENLKIDVH--SSDKPIIFEKILQTLLGRQGKINL 431

Query: 289 VC--------------IDDA------------SRHCSVFSKGQADWT------------- 309
            C               DD+                S + + + D T             
Sbjct: 432 NCNHENQQSRNSINQDQDDSFEKIMPPNQQEIEEFSSQYEESKEDQTDNLCCKDCFKTDN 491

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
            + LL+P  LGL++++P +I  L+   +  QS+G++GGKP  + Y +G   +  +YLDPH
Sbjct: 492 KLFLLLPCRLGLDEISPIHIEILKKLLSLKQSVGMIGGKPNKAHYFLGFVGDDLLYLDPH 551

Query: 370 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
            ++  +   K+DL  + S+Y  + +  + ++ I  SL  GFY    D+ + F     +L 
Sbjct: 552 YIKECVR--KEDLMENISSYFEEDVFKMPINKISTSLVFGFYFSGVDELNKFYKFLRQLE 609

Query: 430 EE 431
           +E
Sbjct: 610 KE 611


>gi|145510316|ref|XP_001441091.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408330|emb|CAK73694.1| unnamed protein product [Paramecium tetraurelia]
          Length = 392

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 143/340 (42%), Gaps = 41/340 (12%)

Query: 129 GDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
            DA     + +  Q  S  I  SYRK       S +TSD GWGCM+R +QM +AQ +   
Sbjct: 46  NDADIEQRIEKVKQTCSKIIWFSYRKNIPKFQVSSLTSDTGWGCMIRVAQMALAQII--- 102

Query: 189 RLGRPWRKPLQ-----KPF----DREYVEILHLFGDSET----SPFSIHNLLQAGKA-YG 234
           R    ++KP Q     + F    D E  + +  F  ++     +PFSI  ++   K    
Sbjct: 103 RYYNYFKKPEQLIVLIRHFIDDDDNELTDFIQQFHKNQNQYYHAPFSIQKIVHYAKVELK 162

Query: 235 LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYV-----------VSGDEDGER 283
              G W     + ++ + L +  +        SL M IY+           +    + + 
Sbjct: 163 KEPGDWYKSDEILQTLDYLFKYSQY-------SLNMEIYINYDCAFILQDAIQQMFNQQE 215

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
           G    + + + +++ + F     D   I + +P  +GL+ +N  Y+  L      P   G
Sbjct: 216 GNE--IWLKERAKNNNQFDL--QDHKGICIFLPTRIGLQNINKDYLEVLNQIIALPYFQG 271

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
           ++GG    + Y VG  ++  IYLDPH VQ   N   DDL  + ++Y    I+ IH   ID
Sbjct: 272 MIGGVSKRALYFVGRIQDYLIYLDPHFVQNAQNF--DDLSKNQASYTCQNIQLIHNSLID 329

Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQT 443
           PS+ +    R+  +  D         +E      F++ +T
Sbjct: 330 PSIVVCLCIRNALELLDLWQIFQHFKQEYQDLFFFSLLET 369


>gi|407848120|gb|EKG03593.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
           Clan CA, family C54, putative [Trypanosoma cruzi]
          Length = 357

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 73/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG---RPWRKPLQKPFDR 204
           +  SYR    P+ +   T+D+ WGCM+R+ QM++A A + +  G   R   + LQ+   R
Sbjct: 74  LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPRIGSERLQELRAR 132

Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
                  LF D  ++PF IH +   G  +G+  G W GP  + ++  AL           
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177

Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
                MA Y+ +G E     G  ++   +         +     T ++LL+P++LG+  +
Sbjct: 178 -----MASYLATGGE-----GPVILAFPERQIFLEEVKELLRQSTHVVLLIPVMLGICVI 227

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
           + +Y   ++       S+GI+GGK  ++ ++ G Q++   +LDPH VQP         E 
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 287

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
             +       R +   S D S+ +GFY    D    F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSLSVF 318


>gi|302657364|ref|XP_003020406.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
           verrucosum HKI 0517]
 gi|291184236|gb|EFE39788.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
           verrucosum HKI 0517]
          Length = 398

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/235 (31%), Positives = 104/235 (44%), Gaps = 48/235 (20%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK--------------------------ITSDVGWGC 172
           +F  DF S++ I+YR  F PI  +                            TSD GWGC
Sbjct: 185 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSSISLGVRLRSQLIDTQGFTSDTGWGC 244

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-K 231
           M+RS Q L+A  LLF RLGR WR+  +    +E  E++ LF D   +PFSIH  +  G  
Sbjct: 245 MIRSGQALLANTLLFLRLGRDWRRGSKV---QEESELVSLFADHPRAPFSIHRFVHHGAT 301

Query: 232 AYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
           A G   G W GP A  +  +AL +    + GL             G +  E+    V C 
Sbjct: 302 ACGKCPGEWFGPSAASQCIQALVK-SNPQVGL------RVCITSDGSDIYEKQFKEVACD 354

Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
           +                 P L+L+ + LG+++V P Y  +L+    FPQS+GI G
Sbjct: 355 ESG-----------GGIQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQSVGIAG 398


>gi|111154179|gb|ABH07411.1| autophagin-2 [Trypanosoma cruzi]
          Length = 351

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 204
           +  SYR    P+ +   T+D+ WGCM+R+ QM++A A + +   G P    + LQ+   R
Sbjct: 68  LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 126

Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
                  LF D  ++PF IH +   G  +G+  G W GP  + ++  AL           
Sbjct: 127 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 171

Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
                MA Y+ +G E     G  V+   +         +     T ++LL+P++LG+  +
Sbjct: 172 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 221

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
           + +Y   ++       S+GI+GGK  ++ ++ G Q++   +LDPH VQP         E 
Sbjct: 222 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 281

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
             +       R +   S D S+ +GFY    D    F
Sbjct: 282 TCAR------RVLPTTSYDTSMTLGFYISSLDSLALF 312


>gi|71415152|ref|XP_809652.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
           Brener]
 gi|70874068|gb|EAN87801.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 357

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/277 (26%), Positives = 124/277 (44%), Gaps = 35/277 (12%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL-FHRLGRPW--RKPLQKPFDR 204
           +  SYR    P+ +   T+D+ WGCM+R+ QM++A A + +   G P    + LQ+   R
Sbjct: 74  LYFSYRNRIVPLMNGA-TTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 132

Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
                  LF D  ++PF IH +   G  +G+  G W GP  + ++  AL           
Sbjct: 133 TQT----LFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSAL----------- 177

Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
                MA Y+ +G E     G  V+   +         +     T ++LL+P++LG+  +
Sbjct: 178 -----MASYLAAGGE-----GPVVLAFPERQIFLEEVKELLRQSTHVVLLIPVMLGIRVI 227

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
           + +Y   ++       S+GI+GGK  ++ ++ G Q++   +LDPH VQP         E 
Sbjct: 228 SEKYSQLMKRCLEMESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQPAFTSSGSPGEL 287

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
             +       R +   S D S+ +GFY    D    F
Sbjct: 288 TCAR------RVLPTTSYDTSMTLGFYISSLDSLALF 318


>gi|330840249|ref|XP_003292131.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
 gi|325077656|gb|EGC31355.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
          Length = 603

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 53/168 (31%), Positives = 83/168 (49%), Gaps = 38/168 (22%)

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
           P+L+L+P+ LGL+ +N  Y  +L   F FPQ+LG+VGGKP AS Y + VQ+++  YLDPH
Sbjct: 371 PLLILIPMRLGLDGLNSIYYQSLLEIFKFPQNLGVVGGKPRASLYFIAVQDDNLFYLDPH 430

Query: 370 DVQPVINIGKDDLEAD-------------------------------------TSTYHSD 392
            VQ  I+I   + E                                        +T+   
Sbjct: 431 TVQNHIDINNSNGEPSNFSFSSSPSSSNINIINTNNNNNNNNNNDKNNNNSFPVNTFFCS 490

Query: 393 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTV 440
             +  H+  +DPSL + F+C+ + DFDDF  R+  +A +    P+F++
Sbjct: 491 QTKRTHVSEVDPSLVVAFFCKSRSDFDDFVDRSKAMASQMEN-PIFSI 537



 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 48/126 (38%), Positives = 74/126 (58%), Gaps = 1/126 (0%)

Query: 130 DAAGNNGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
           D  G + + EF +DF++R+L  +YR+GF  I +++  +D GWGCMLRS QML++  LL H
Sbjct: 129 DIPGQSFIKEFLEDFTTRVLWFTYRQGFPFIDNTQYDNDCGWGCMLRSGQMLLSNLLLHH 188

Query: 189 RLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
            LG  W+K         Y  I+ +F D  ++PFSIHN+   G+  G   G W  P  + +
Sbjct: 189 ALGDDWKKSSNSTHPDVYNNIISMFLDKPSAPFSIHNIALEGQTLGKNIGEWFAPSIISQ 248

Query: 249 SWEALA 254
           + ++L 
Sbjct: 249 AIKSLV 254


>gi|207341865|gb|EDZ69806.1| YNL223Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 371

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 81/302 (26%), Positives = 124/302 (41%), Gaps = 57/302 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-----------------------------ITSDVG 169
           EF  D  SR+  +YR  F PI  +                                +D+G
Sbjct: 97  EFLLDVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIG 156

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + A
Sbjct: 157 WGCMIRTGQSLLGNALQILHLGRDFRVNGNESLERE-SKFVNWFNDTPEAPFSLHNFVSA 215

Query: 230 GKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G        G W GP A  RS ++L        G     +   I  VS  +  E     V
Sbjct: 216 GTELSDKRPGEWFGPAATARSIQSLI------YGFPECGIDDCIVSVSSGDIYENEVEKV 269

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+
Sbjct: 270 FAENPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGR 315

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DP  ++
Sbjct: 316 PSSSLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPRCSL 369

Query: 409 GF 410
            F
Sbjct: 370 VF 371


>gi|154419947|ref|XP_001582989.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121917228|gb|EAY22003.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 284

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 73/261 (27%), Positives = 113/261 (43%), Gaps = 39/261 (14%)

Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 211
           YR     + +S +T+D GWGC  RS+Q L+ Q +L  +L R +R    + F +  V  L 
Sbjct: 25  YRYNLSDLANSLLTTDKGWGCCFRSTQGLLCQYIL--KLHRKFRSLYDQVFGQN-VNPLD 81

Query: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271
           LF D  ++PF I NL +   A GL  G W  P  M     A  +       L C      
Sbjct: 82  LFLDIPSAPFGIQNLTKNAFAIGLPVGEWAKPSIM----AATIKLIFDTLNLSC------ 131

Query: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331
             ++S D   +             +H            P L+L+P + GL K++  Y+  
Sbjct: 132 --IISQDLTLDSNDI---------KHTKY---------PALILIPSLFGLSKMDDSYLSF 171

Query: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHS 391
           L L      SLG V G+  ++ Y VG   E   Y DPH  +  +      +     ++  
Sbjct: 172 LLLCLCIESSLGFVSGQNASAYYFVGFDLEDFYYFDPHVTKEAV------VSPPYDSFFD 225

Query: 392 DVIRHIHLDSIDPSLAIGFYC 412
             ++ +  +SI+PS+ +GFYC
Sbjct: 226 LELKSMKKESINPSVLLGFYC 246


>gi|119623099|gb|EAX02694.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_e
           [Homo sapiens]
          Length = 231

 Score =  101 bits (252), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 66/226 (29%), Positives = 102/226 (45%), Gaps = 61/226 (26%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 29  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + +    
Sbjct: 78  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEK---- 133

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
                        MCR                       +  +S D  G+R    +   +
Sbjct: 134 -------------MCR-----------------------VLPLSADTAGDRPPDSLTASN 157

Query: 293 DA---SRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLT 335
            +   S +CS        W P+LL+VPL LG+ ++NP Y+   ++T
Sbjct: 158 QSKGTSAYCSA-------WKPLLLIVPLRLGINQINPVYVDAFKVT 196


>gi|159128081|gb|EDP53196.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
           fumigatus A1163]
          Length = 226

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 71/118 (60%), Gaps = 3/118 (2%)

Query: 304 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
           G+  + P L+L+   LG++++ P Y   ++ T   PQS+GI GG+P AS Y VGVQ    
Sbjct: 20  GRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHL 79

Query: 364 IYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
            YLDPH  +P +   NI     + +  TYH+  +R IH+  +DPS+ IGF  +D++D+
Sbjct: 80  FYLDPHQTRPALPQRNIDDPYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137


>gi|71000771|ref|XP_755067.1| autophagy cysteine endopeptidase Atg4 [Aspergillus fumigatus Af293]
 gi|66852704|gb|EAL93029.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
           fumigatus Af293]
          Length = 226

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 71/118 (60%), Gaps = 3/118 (2%)

Query: 304 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
           G+  + P L+L+   LG++++ P Y   ++ T   PQS+GI GG+P AS Y VGVQ    
Sbjct: 20  GRGSFRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHL 79

Query: 364 IYLDPHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
            YLDPH  +P +   NI     + +  TYH+  +R IH+  +DPS+ IGF  +D++D+
Sbjct: 80  FYLDPHQTRPALPQRNIDDPYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137


>gi|257205644|emb|CAX82473.1| autophagy-related cysteine endopeptidase 2 [Schistosoma japonicum]
          Length = 632

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 48/118 (40%), Positives = 68/118 (57%), Gaps = 4/118 (3%)

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           ++W P+LL VPL LGL   NP Y   ++  F  P  +GI+GG P  + +IVGV  +  I 
Sbjct: 385 SNWRPLLLFVPLRLGLHNPNPCYFNAIKAVFRLPNCIGILGGSPCHAVWIVGVTGDDVIC 444

Query: 366 LDPHDVQPVINIGKDDLEAD-TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFC 422
           LDPH  QP    G+ +L+ D   TYH +    + L  +DPS+ +GF C  + +FDD C
Sbjct: 445 LDPHTTQPA---GRGNLKPDYDQTYHCENPIRMPLKRLDPSMVLGFLCSTEKEFDDLC 499



 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 196
           E      SR+ ++YRKGF PIG      SD GWGCM R  QM++A+A+L   LGR WR  
Sbjct: 43  EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102

Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
           P Q+    EY  +L +F D  +  +SI  +   G + G + GSW GP  + +  + L+  
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160

Query: 257 QR 258
            R
Sbjct: 161 DR 162


>gi|123479730|ref|XP_001323022.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121905878|gb|EAY10799.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 284

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 71/274 (25%), Positives = 120/274 (43%), Gaps = 39/274 (14%)

Query: 141 NQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQK 200
           NQ  +      YR  F  I +S ++ D GWGC  RSSQ LV Q +L  RL + +      
Sbjct: 14  NQILAEIPRFCYRNNFQAIENSTLSCDSGWGCCFRSSQGLVCQYIL--RLHKNFPDLYNS 71

Query: 201 PFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAE 260
            F  +    L LF D   +PF I N++    + GL  G+W  P  +  +++++ +     
Sbjct: 72  TFGID-KNPLDLFLDIPEAPFGIQNIVTHANSLGLPIGNWAKPSIIASAYKSIFQ----S 126

Query: 261 TGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG 320
             L C        +V  D                     ++ + ++   P+L+L+P + G
Sbjct: 127 LHLNC--------IVPQDSTF------------------IYEELESTNYPVLILIPGLFG 160

Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKD 380
           LEK+   YI  + L+     SLG V G   ++ Y +G   +   Y DPH  +  +     
Sbjct: 161 LEKIEKPYISFIFLSLCMNSSLGFVSGHNDSAFYFIGFDSDYFYYFDPHVTKQALTGPPY 220

Query: 381 DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
           D   +        ++ + +++I+PS+ +GFYC D
Sbjct: 221 DSLFELK------LKSMKIENINPSVLLGFYCDD 248


>gi|261335715|emb|CBH18709.1| peptidase, putative [Trypanosoma brucei gambiense DAL972]
          Length = 348

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 148/344 (43%), Gaps = 49/344 (14%)

Query: 136 GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
           G AE  +  + ++L  SYR  F+P+ +   T+D+GWGC +R+ QM++A AL+ ++ G   
Sbjct: 37  GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93

Query: 195 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
                  F+   V  L     HLF D  ++PF IH +   G  +G   GSW GP  +   
Sbjct: 94  ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149

Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
             AL                M  Y+ SG +     G  V+ + D         K      
Sbjct: 150 MGAL----------------MEDYLSSGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
            +LLL+P++LG   ++  Y   L+       ++G VGGK G++ + +G Q  + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248

Query: 370 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
             Q           +DT    S     + L S   S+ +GFY    D F  F        
Sbjct: 249 YAQSAFTC------SDTQGKISGEWYTLPLTSCSTSVLLGFYIHSPDSFSQFTGD----I 298

Query: 430 EESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 473
           +++N + +F + +     V  SD +G      + D   ++S  D
Sbjct: 299 KDANSSLIFPLIE-----VTTSDCVGHIFSEDDPDVCSLVSFGD 337


>gi|444730159|gb|ELW70550.1| Cysteine protease ATG4A [Tupaia chinensis]
          Length = 364

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 85/352 (24%), Positives = 143/352 (40%), Gaps = 98/352 (27%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
           L EF  D    + I  ++     G +  +SD GWGCMLR  QM++AQAL+   LGR    
Sbjct: 24  LEEF-PDTDELVWILGKQHLLKTGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRA--- 79

Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
                                          Q G   G + G W GP  + +  + LA  
Sbjct: 80  -------------------------------QMGVGEGKSIGEWFGPNTVAQVLKKLALF 108

Query: 257 QRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVF--------------- 301
               +        +A+YV   +          V I+D  + C V                
Sbjct: 109 DEWNS--------LAVYVSMDN---------TVVIEDIKKMCCVLPLSADTDTESPPDSP 151

Query: 302 -----SKGQAD----WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
                SKG +     W P+LL+VPL LG+ ++NP Y+   +L  +    + IV  +    
Sbjct: 152 TASNQSKGPSACGSAWKPLLLIVPLRLGINQINPVYVDAFKLQASC-HPILIVTKEGVRR 210

Query: 353 TYIVGVQEESA--------------------IYLDPHDVQPVINIGKDDLEADTSTYHSD 392
           T I+  ++ S                     I+LDPH  Q  ++  ++ +  D + +   
Sbjct: 211 TRILPPKDSSGARASESLKVKHVSFKTGDELIFLDPHTTQTFVDTEENGMVDDQTFHCLQ 270

Query: 393 VIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
             + +++ ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 271 SPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 321


>gi|367014015|ref|XP_003681507.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
 gi|359749168|emb|CCE92296.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
          Length = 460

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 92/301 (30%), Positives = 134/301 (44%), Gaps = 56/301 (18%)

Query: 139 EFNQDFSSRILISYRKGFDPI-----GDSKIT------------------------SDVG 169
           +F  D  SR+  +YR  F PI     G S ++                        +D+G
Sbjct: 60  QFLSDVHSRLHFTYRTKFVPIPRVSDGPSPLSFHFLIRENPLTTIENAIYNPDCFNTDIG 119

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+R+ Q L+  AL    LGR +R  + +  D+E  +I+  F D+  + FSIHN +  
Sbjct: 120 WGCMIRTGQSLLGNALQIANLGRDFR--VNQGKDQEEYKIIDWFADTPQAHFSIHNFVSQ 177

Query: 230 G-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPV 288
           G K      G W GP A  RS + L   Q  + G+        I V SGD          
Sbjct: 178 GLKLSNKKPGEWFGPAATSRSIQCLVE-QFPDCGID----KCLISVSSGD---------- 222

Query: 289 VCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGK 348
              +D  R   +F+  Q   + ILLL+ + LG+  VN  Y   ++ T     S+GI GG+
Sbjct: 223 -VFEDEVRE--IFA--QKPQSRILLLLGVKLGVNAVNEYYWDDVKKTLGSKFSVGIAGGR 277

Query: 349 PGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAI 408
           P +S Y +G Q    IY DPH  QP +    +  +    T H+     + L  +DPS+ I
Sbjct: 278 PSSSLYFMGFQGNELIYFDPHTPQPSLQTSANFYD----TCHALNFGKLLLSDLDPSMLI 333

Query: 409 G 409
           G
Sbjct: 334 G 334


>gi|444726263|gb|ELW66801.1| Cysteine protease ATG4C [Tupaia chinensis]
          Length = 378

 Score = 99.0 bits (245), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 98/386 (25%), Positives = 137/386 (35%), Gaps = 130/386 (33%)

Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
           AGN  + EF +DF SRI ++YR+ F PI  S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45  AGN--VEEFRRDFISRIWLTYREEFPPIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102

Query: 192 RPWRKP----------------------------------LQKPFD--REYVE------- 208
           R W  P                                  L+ P    +E +E       
Sbjct: 103 RAWTWPDALNIENSDSESWTSHTVKKFTASVEASLSGERELKTPTISLKETIEKYSDDHE 162

Query: 209 ---------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
                    I+  FGDS  + F +H L++ GK  G  AG W GP  +           R 
Sbjct: 163 IRNEIYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222

Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
               G     + IYV             V   D   +  +  +   AD   +++LVP+ L
Sbjct: 223 PDLQG-----ITIYVAQ--------DCTVYSSDVIDKQRTAMTADNADDKAVIILVPVRL 269

Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
           G E+ N  Y+  ++ TF  P    +   K                 +DP           
Sbjct: 270 GGERTNTDYLEFVK-TFHCPSPKKMSFRK-----------------MDP----------- 300

Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA--PL 437
                                    S  IGFYCR+  DF       +K+   S+    PL
Sbjct: 301 -------------------------SCTIGFYCRNIQDFKRASEEITKMLTISSKEKYPL 335

Query: 438 FTVTQTHKK-------PVNHSDVLGE 456
           FT    H +         N  D+  E
Sbjct: 336 FTFVNGHSRDYDFTSTTTNEEDLFSE 361


>gi|119493442|ref|XP_001263911.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
 gi|119412071|gb|EAW22014.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
          Length = 179

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 45/114 (39%), Positives = 70/114 (61%), Gaps = 3/114 (2%)

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           + P L+L+   LG++++ P Y   ++ T   PQS+GI GG+P AS Y VGVQ     YLD
Sbjct: 24  FRPTLILIGTRLGIDRITPVYWDAVKTTLQLPQSVGIAGGRPSASHYFVGVQGSHLFYLD 83

Query: 368 PHDVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
           PH  +P +   NI +   + +  TYH+  +R IH+  +DPS+ IGF  +D++D+
Sbjct: 84  PHQTRPALPQRNIDERYTDEEIETYHTRRLRRIHIRDMDPSMLIGFIIKDREDW 137


>gi|74026240|ref|XP_829686.1| peptidase [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
 gi|70835072|gb|EAN80574.1| peptidase, putative [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 348

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 92/344 (26%), Positives = 148/344 (43%), Gaps = 49/344 (14%)

Query: 136 GLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
           G AE  +  + ++L  SYR  F+P+ +   T+D+GWGC +R+ QM++A AL+ ++ G   
Sbjct: 37  GTAEMVKLAACKLLYFSYRCQFEPLRNGS-TTDIGWGCTIRAGQMMLAHALMRYKNGG-- 93

Query: 195 RKPLQKPFDREYVEIL-----HLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
                  F+   V  L     HLF D  ++PF IH +   G  +G   GSW GP  +   
Sbjct: 94  ----GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVV 149

Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT 309
             AL                M  Y+ +G +     G  V+ + D         K      
Sbjct: 150 MGAL----------------MEDYLRNGGQ-----GPDVLVLRDRQVMEDEVRKILLLSK 188

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
            +LLL+P++LG   ++  Y   L+       ++G VGGK G++ + +G Q  + I LDPH
Sbjct: 189 HVLLLIPVMLGPHHISEGYAKLLKRCLRMESTVGAVGGKEGSAFFFMGYQGGNLIVLDPH 248

Query: 370 DVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
             Q           +DT    S     + L S   S+ +GFY    D F  F        
Sbjct: 249 YAQSAFTC------SDTQGKISGEWYTLPLTSCSTSVLLGFYIHSPDSFSQFTGD----I 298

Query: 430 EESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 473
           +++N + +F + +     V  SD +G      + D   ++S  D
Sbjct: 299 KDANSSLIFPLIE-----VTTSDCVGHIFNEDDPDVCSLVSFGD 337


>gi|240274226|gb|EER37743.1| cysteine protease atg4 [Ajellomyces capsulatus H143]
          Length = 454

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 76/146 (52%), Gaps = 4/146 (2%)

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
            D  P L+L+ + LG+++V P Y   L+    +PQS+GI GG+P +S Y +G Q     Y
Sbjct: 245 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQGSHFFY 304

Query: 366 LDPHDVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           LDPH  +P +       +     + +TYH+  +R +H+  +DPS+ IGF  RD+DD++ +
Sbjct: 305 LDPHHTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSW 364

Query: 422 CARASKLAEESNGAPLFTVTQTHKKP 447
                  A    G  +  V    K P
Sbjct: 365 KRSVHNRAMIGTGKAIIHVFDKEKSP 390



 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 50/119 (42%), Gaps = 24/119 (20%)

Query: 101 PSRTGISSSTSDIWLLGVC-HKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF--- 156
           P+R+  S++     LL    H+ +    LG     +    F  DF S+I ++YR  F   
Sbjct: 85  PTRSSDSATKPQRHLLPFAIHRGSTTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLI 144

Query: 157 ----DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR 195
               DP                +     T+D GWGCM+RS Q L+A AL    LGR  R
Sbjct: 145 PKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRACR 203


>gi|145500634|ref|XP_001436300.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124403439|emb|CAK68903.1| unnamed protein product [Paramecium tetraurelia]
          Length = 406

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 93/397 (23%), Positives = 163/397 (41%), Gaps = 67/397 (16%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           I++LG  H+I  D+        + + +  Q     I I+YR+ + P+  S   SD GWGC
Sbjct: 38  IYILG--HRIDIDQF----EIEDRINKIKQLVQETIWITYRRNYPPLYQSNYISDTGWGC 91

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS------------- 219
           MLR  QM +AQ L  H      ++      D +Y  I+  F D+++              
Sbjct: 92  MLRVGQMAMAQMLKKHLKNHGDKR------DEDYDNIILAFADNDSQENKEFIEFQNSKD 145

Query: 220 ---------PFSIHNL-LQAGKAYGLAAGSWVGP-YAM-----------CRSWEALARCQ 257
                    PFSI  +   A K + L  G W  P Y +            R+ E L    
Sbjct: 146 KQKAHNFICPFSIQKIAYLAKKEFNLDPGEWYRPNYILFLLELLHNTIPIRASENLKLSV 205

Query: 258 RAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPL 317
             ++ L    L   ++    + D +        +++      +  K       + + V  
Sbjct: 206 FNDSCLFLDQLMNRMFEAKFETDKD--------LEEQLEKTQLIGKN-----SLAIFVLT 252

Query: 318 VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
            +GL++ N +Y+  L      P   GIVGG P  + YI+G   +  +YLDPH VQ   N 
Sbjct: 253 RIGLDEPNQKYLKILDEIMELPYFQGIVGGTPKRAFYILGKINDHYLYLDPHYVQEAEN- 311

Query: 378 GKDDLEADT----STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESN 433
            KD +  +     ++Y    I  ++   +D S+ + FY R++ +   F     ++ + S+
Sbjct: 312 -KDQINENKMFNRTSYSCKNIHLLNQKHVDTSMGLSFYIRNQSELLQFWRNMKQIKQSSD 370

Query: 434 GAPLFTVTQTHKKPVNHSDVLGETGGVPEDDSLGVMS 470
              +F ++ +  + V++S  L E+     DD +  + 
Sbjct: 371 DFFIF-LSDSAPEYVDYSGQLEESSNKLNDDDVVFLQ 406


>gi|194374239|dbj|BAG57015.1| unnamed protein product [Homo sapiens]
          Length = 259

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/272 (25%), Positives = 114/272 (41%), Gaps = 56/272 (20%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + Q G  
Sbjct: 1   MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 61  EGKSIGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 103

Query: 293 DASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
           D  + C V                                      P S    G +P  S
Sbjct: 104 DIKKMCRV-------------------------------------LPLSADTAGDRPPDS 126

Query: 353 TYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
                 Q +  I+LDPH  Q  ++  ++    D + +     + +++ ++DPS+A+GF+C
Sbjct: 127 -LTASNQGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFC 185

Query: 413 RDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +++ DFD++C+   K   + N   +F + Q H
Sbjct: 186 KEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 216


>gi|401425377|ref|XP_003877173.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
 gi|322493418|emb|CBZ28705.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
          Length = 394

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)

Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
           ++ +YR GF+  P     I +D GWGC+LR+SQML+A  L  H  GRP  + L   FD  
Sbjct: 46  LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102

Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
                     +ET+PFSIHN++++          +  P   C   EA+ R  +    +  
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRAFKAEYWSPSQGC---EAIKRTMQG--AVKT 148

Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
           + L   + VV+             CI  D  +H   F +G AD   +L  V +    +  
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
              Y+   +L    PQ LG+VGG PG S Y     +    YLDPH       + +    A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLNEGPSAA 255

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
            + T     +R +H   +D SL + F    +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289


>gi|123397031|ref|XP_001301012.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121882136|gb|EAX88082.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 297

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/223 (31%), Positives = 109/223 (48%), Gaps = 33/223 (14%)

Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEI 209
            +Y KGF P+     T+D  WGC +RS Q L+ Q +   +L + +   ++  F       
Sbjct: 27  FTYHKGFSPLAGG-YTTDKNWGCCIRSGQGLLMQFV--SKLYQLYGDKIKNIFPNG--SK 81

Query: 210 LHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLP 269
             LF D   +PF IH + +  + +G+ AG WV P  +   ++ L                
Sbjct: 82  FELFFDHPQAPFGIHCICRELETFGVKAGEWVKPSMLAPVFKDLLSF------------- 128

Query: 270 MAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYI 329
             I+VV   E+G        C+   S      S G     P+LLL  L+LG +  + +Y+
Sbjct: 129 FGIHVVIA-ENG--------CLSRESLR-EALSYGH----PVLLLFTLMLGYKDFDLKYL 174

Query: 330 PTLRLTFTFP-QSLGIVGGKPGASTYIVGVQEESAIYLDPHDV 371
           P LRLT +   QS+G+VGG+ G + Y+VG Q+E+ +Y DPH+V
Sbjct: 175 PFLRLTLSLIYQSVGVVGGQQGKAYYLVGHQKENLLYFDPHEV 217


>gi|154281231|ref|XP_001541428.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150411607|gb|EDN06995.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 463

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 76/146 (52%), Gaps = 4/146 (2%)

Query: 306 ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
            D  P L+L+ + LG+++V P Y   L+    +PQS+GI GG+P +S Y +G Q     Y
Sbjct: 253 TDVHPTLILLGIRLGIDRVTPVYWEALKAVLKYPQSVGIAGGRPSSSHYFIGAQASHFFY 312

Query: 366 LDPHDVQPVINI----GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           LDPH  +P +       +     + +TYH+  +R +H+  +DPS+ IGF  RD+DD++ +
Sbjct: 313 LDPHHTRPALAYHDAGDRPYTTEELNTYHTRRLRRLHIKDMDPSMLIGFLIRDEDDWNSW 372

Query: 422 CARASKLAEESNGAPLFTVTQTHKKP 447
                  A    G  +  V    K P
Sbjct: 373 KRSVHNGAMIGTGKAIIHVFDKEKSP 398


>gi|367008068|ref|XP_003688763.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
 gi|357527073|emb|CCE66329.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
          Length = 356

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/262 (30%), Positives = 117/262 (44%), Gaps = 37/262 (14%)

Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 224
           TSD+GWGCM+R+ Q L+A AL     G P              EI+ LF D   +PFSIH
Sbjct: 85  TSDIGWGCMIRTGQTLLANALQRTNKGTPCS------------EIIELFVDETKNPFSIH 132

Query: 225 NLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           N +  GK   L   G W  P    +  E L           C      + + SGD   + 
Sbjct: 133 NFITVGKDLNLVKVGEWFSPSITIQIIEKLIENNNDHGIKKC-----IVSISSGDIYEQ- 186

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN-PRYIPTLRLTFTFPQSL 342
               +  +DD+    +  +K Q     ILLL  + LG+  +N  +Y   ++       + 
Sbjct: 187 --DVLDELDDSEPPAN--TKQQH----ILLLFGIKLGINTINIEKYGQDIKDITNNKYTC 238

Query: 343 GIVGGKPGASTYIVGVQE--ESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
           GI GG+P +S +  G     +  +Y DPH      N   D+   D STYHS     + + 
Sbjct: 239 GISGGQPKSSLFFFGYNNTHDRILYFDPHKPN---NFTTDN---DYSTYHSTEFNELEMF 292

Query: 401 SIDPSLAIGFYCR-DKDDFDDF 421
           ++DPS+ IGF  + +K D++ F
Sbjct: 293 NLDPSMIIGFLVKNNKADWNKF 314


>gi|157872135|ref|XP_001684616.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
 gi|68127686|emb|CAJ05824.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
          Length = 394

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/274 (28%), Positives = 121/274 (44%), Gaps = 33/274 (12%)

Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
           ++ +YR GF+  P     I +D GWGC+LR+SQML+A  L  H  GRP  + L   FD  
Sbjct: 46  LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRKLSLFFDH- 102

Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
                     +ET+PFSIHN++++     +    +  P   C   EA+ R    +  +  
Sbjct: 103 ---------SAETAPFSIHNMIRSVWNRRVFKAEYWSPSQGC---EAIKRT--VQGAVKT 148

Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
           + L   + VV+             CI  D  +H   F +G AD   +L  V +    +  
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
              Y+   +L    PQ LG+VGG PG S Y     +    YLDPH       + +    A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLCEGLSAA 255

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
            + T     +R +H   +D SL + F    +D++
Sbjct: 256 ASVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289


>gi|146093458|ref|XP_001466840.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
 gi|134071204|emb|CAM69889.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
          Length = 394

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)

Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
           ++ +YR GF+  P     I +D GWGC+LR+SQML+A  L  H  GRP  + L   FD  
Sbjct: 46  LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102

Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
                     +ET+PFSIHN++++          +  P   C   EA+ R    +  +  
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148

Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
           + L   + VV+             CI  D  +H   F +G AD   +L  V +    +  
Sbjct: 149 EQLQTRVMVVTSTNG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
              Y+   +L    PQ LG+VGG PG S Y     +    YLDPH       + +    A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLSEGPSAA 255

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
            + T     +R +H   +D SL + F    +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289


>gi|398019156|ref|XP_003862742.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
 gi|322500973|emb|CBZ36050.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
          Length = 394

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 33/274 (12%)

Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
           ++ +YR GF+  P     I +D GWGC+LR+SQML+A  L  H  GRP  + L   FD  
Sbjct: 46  LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWVH--GRPADRRLSLFFDH- 102

Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
                     +ET+PFSIHN++++          +  P   C   EA+ R    +  +  
Sbjct: 103 ---------SAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGC---EAIKRT--VQGAVKT 148

Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCI-DDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
           + L   + VV+             CI  D  +H   F +G AD   +L  V +    +  
Sbjct: 149 EQLQTRVMVVTSANG---------CIYADEVQH--TFKQG-ADVVLVLASVRVSAAAQLT 196

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
              Y+   +L    PQ LG+VGG PG S Y     +    YLDPH       + +    A
Sbjct: 197 QESYLQIEKL-MEQPQCLGVVGGVPGRSYYFFAHNQTQLFYLDPHQRTAAALLSEGPSAA 255

Query: 385 DTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
            + T     +R +H   +D SL + F    +D++
Sbjct: 256 VSVTPSVADVRCVHWSRVDTSLFLAFAVTTRDEW 289


>gi|148693227|gb|EDL25174.1| autophagy-related 4D (yeast), isoform CRA_c [Mus musculus]
          Length = 257

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 80/176 (45%), Gaps = 44/176 (25%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S   S + L G C+     E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISTVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
             +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 GSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRCRGPGR 193

Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP
Sbjct: 194 RGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGP 249


>gi|400593108|gb|EJP61110.1| peptidase family C54 [Beauveria bassiana ARSEF 2860]
          Length = 378

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 115/315 (36%), Gaps = 118/315 (37%)

Query: 134 NNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------ITSDVGW 170
           N    +F  DF SR  ++YR  F PI  SK                        +SD GW
Sbjct: 109 NGWPQQFITDFDSRFWMTYRNDFKPIPRSKDPKAASSMSFPMRIKYQLGDQGGFSSDSGW 168

Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
           GCM+RS Q L+A A    RLGR WR+  QK    E ++I+ +F D   +P+SIHN +  G
Sbjct: 169 GCMIRSGQSLLANATGIVRLGRDWRRGQQK---AEEIKIMRMFADDPAAPYSIHNFVDYG 225

Query: 231 KAY-GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV 289
            +  G   G W GP A  +                                         
Sbjct: 226 SSKCGKYPGEWFGPSATSQ----------------------------------------- 244

Query: 290 CIDDASRHCSVFSKGQAD---WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVG 346
           CI+      S  +  ++D   + P L+L+   LG++K+   Y   L      PQS+GI G
Sbjct: 245 CINPDVYEDSFMATAKSDHGFFKPTLILISTRLGIDKITQVYWEALISALQMPQSVGIAG 304

Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSL 406
                                                          +R +H+  +DPS+
Sbjct: 305 -----------------------------------------------LRRLHVQQMDPSM 317

Query: 407 AIGFYCRDKDDFDDF 421
            IGF  R ++++ ++
Sbjct: 318 LIGFIIRSEEEWKEW 332


>gi|47213810|emb|CAF92583.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 265

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 67/116 (57%), Gaps = 2/116 (1%)

Query: 304 GQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESA 363
               W  +++LVP+ LG E +NP YI  ++        +GI+GGKP  S Y +G Q+E  
Sbjct: 151 AHQSWQSVIILVPVRLGGESLNPSYIECVKNILKLDCCIGIIGGKPKHSLYFIGFQDEQL 210

Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFD 419
           +YLDPH  QPV+++ + +   +  ++H +  + +    +DPS  IGFY + K DF+
Sbjct: 211 LYLDPHYCQPVVDVSQVNFSLE--SFHCNSPKKMPFSRMDPSCTIGFYAKSKKDFE 264



 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 12/132 (9%)

Query: 65  ASEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQ 124
           A    A +N   GWT  VK   T   +  +   +LG S    S  T    L  +C  ++ 
Sbjct: 14  AKLMSAWNNVKYGWT--VKSKTTFNKLSPV--TILGHSYLLNSEGT----LFFICLILSS 65

Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
              L      + +  F   F SRI ++YRK F P+  S +T+D GWGCMLRS QML+AQ 
Sbjct: 66  FCCLN----LDEVERFRLAFVSRIWLTYRKDFPPLEGSTLTTDCGWGCMLRSGQMLLAQG 121

Query: 185 LLFHRLGRPWRK 196
           LL H + R +++
Sbjct: 122 LLVHLMHRVYKE 133


>gi|255711728|ref|XP_002552147.1| KLTH0B08272p [Lachancea thermotolerans]
 gi|238933525|emb|CAR21709.1| KLTH0B08272p [Lachancea thermotolerans CBS 6340]
          Length = 483

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 85/257 (33%), Positives = 118/257 (45%), Gaps = 34/257 (13%)

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
             SD+GWGCM+R+ Q L+  AL   RL  P   P +K       +++  F D  ++PFS+
Sbjct: 144 FCSDIGWGCMIRTGQALLGNALA--RLRSP---PEEK-------QLIGWFEDRSSAPFSL 191

Query: 224 HNLLQAGKAYGLA-AGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
           HN ++ G A      G W GP A  RS ++L      + GL        I   SGD   E
Sbjct: 192 HNFVREGNALSRKPPGEWFGPSATSRSIQSLVHA-FPQCGLNH----CIISTDSGDVYEE 246

Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSL 342
             G P++               +     ILLL+ + LGL  VN RY P ++       S+
Sbjct: 247 DVG-PIL--------------EREPQATILLLLGVKLGLNNVNSRYWPDVKHILGSSFSV 291

Query: 343 GIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSI 402
           GI GG+P +S Y  G Q +   YLDPH  Q  +     D E   S  HS     +H   +
Sbjct: 292 GIAGGRPSSSLYFFGYQGDYLFYLDPHTSQLDLASCATDNEKYESV-HSARFNKVHFSEL 350

Query: 403 DPSLAIGFYCRDKDDFD 419
           DPS+ IG   +  DD+D
Sbjct: 351 DPSMLIGVLIQGLDDWD 367


>gi|431896953|gb|ELK06217.1| Cysteine protease ATG4C [Pteropus alecto]
          Length = 378

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 98/386 (25%), Positives = 134/386 (34%), Gaps = 130/386 (33%)

Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
           AGN  + EF +DF SRI ++YR+ F  I  S +T+D GWGC LR+ QML+AQ L+ H LG
Sbjct: 45  AGN--VEEFRKDFISRIWLTYREEFPSIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLG 102

Query: 192 RPWRKP----------------------------------LQKPF------------DRE 205
           R W  P                                  L+ P             D E
Sbjct: 103 RAWTWPDALNIDNSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGRYSDDHE 162

Query: 206 YV-EILH-----LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
              EI H      FGDS  + F +H L++ GK  G  AG W GP  +           R 
Sbjct: 163 MQNEIYHRKIISWFGDSPLALFGLHQLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARH 222

Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL 319
               G     + IYV             V   D   + C+  +    D   +++LVP+ L
Sbjct: 223 PELQG-----ITIYVAQ--------DCTVYSSDVIDKQCASMAPDITDDKAVIILVPVRL 269

Query: 320 GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGK 379
           G E+ N  Y+  ++ TF  P    +   K                 +DP           
Sbjct: 270 GGERTNIDYLEFVK-TFHCPSPKKMSFRK-----------------MDP----------- 300

Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE--ESNGAPL 437
                                    S  IGFYCR+  DF       +K+ +       PL
Sbjct: 301 -------------------------SCTIGFYCRNVQDFKRASEEITKMLKVFSKEKYPL 335

Query: 438 FTVTQTHKK-------PVNHSDVLGE 456
           FT    H +         N  D+  E
Sbjct: 336 FTFVNGHSRDYDFTSTTTNEEDLFSE 361


>gi|384493397|gb|EIE83888.1| hypothetical protein RO3G_08593 [Rhizopus delemar RA 99-880]
          Length = 194

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/158 (37%), Positives = 78/158 (49%), Gaps = 27/158 (17%)

Query: 113 IWLLGVCHKI--------AQDEALGDAAGNNGLA----------------EFNQDFSSRI 148
           IWLLG  + I        A  EA  D   N G +                +F  DF+SR+
Sbjct: 29  IWLLGCSYIIKPTDHIQQALLEAQRDLMFNKGSSENEEENNQNMHMLWPPDFYDDFTSRL 88

Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ-KPFDREYV 207
            ++YR  + PI  S   +D+GWGC LRS Q L+A  L+ H LGR WR+  Q +   ++Y 
Sbjct: 89  WMTYRHNYPPIRPSSHKTDIGWGCTLRSGQSLLANTLIIHFLGRDWRRQTQNQAAWKQYS 148

Query: 208 EILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGP 243
            I+H F D  S  +PFSIH +   GK  G   G W GP
Sbjct: 149 RIVHWFLDELSPRAPFSIHRIALLGKQLGKNIGEWFGP 186


>gi|312378951|gb|EFR25375.1| hypothetical protein AND_09326 [Anopheles darlingi]
          Length = 350

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 50/130 (38%), Positives = 70/130 (53%), Gaps = 3/130 (2%)

Query: 321 LEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG-- 378
           L +VNP YI  L+  F  P S G++GG+P  + Y +G   E A+YLDPH VQ V  IG  
Sbjct: 180 LNEVNPIYIEGLKKCFQLPGSCGMIGGRPNQALYFIGYVGEEALYLDPHTVQRVGCIGEK 239

Query: 379 KDDLEADT-STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPL 437
           ++ +E +  +T+H      I   S+DPSLA+ F C  +  FD   A   +        PL
Sbjct: 240 QESVEQEQDATFHQRHASRIAFASMDPSLAVCFLCCSRAQFDQLVAHFKERLNGGGSQPL 299

Query: 438 FTVTQTHKKP 447
           F VT+T + P
Sbjct: 300 FEVTKTRQAP 309



 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 67/103 (65%), Gaps = 2/103 (1%)

Query: 142 QDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKP 201
           QD  SR+  +YR+GF PIG++++T+D GWGCMLR  QM++A+AL    LGR W+   ++ 
Sbjct: 72  QDVQSRLWCTYRRGFVPIGNTQLTTDKGWGCMLRCGQMVLAEALTELHLGRDWQWS-EET 130

Query: 202 FDREYVEILHLFGDSETSPFSIHNL-LQAGKAYGLAAGSWVGP 243
            D  Y++I++ F D++ +PFS+H + L    +     G W GP
Sbjct: 131 RDATYLKIVNRFEDNKQAPFSLHQIALMGDSSEEKRIGEWFGP 173


>gi|123497568|ref|XP_001327207.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121910133|gb|EAY14984.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 296

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 70/271 (25%), Positives = 117/271 (43%), Gaps = 54/271 (19%)

Query: 150 ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREY--- 206
            +YR  F  I    ITSD GWGC  RS+Q L+A   L +            P D EY   
Sbjct: 30  FTYRCNFQAIQPGNITSDSGWGCCYRSAQGLIASYFLNY-----------APVDAEYFFT 78

Query: 207 ----VEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
               + +  LF D    PFSI NL+   + +G+  G+W  P  +  + E++ +       
Sbjct: 79  VFNEIPMFSLFEDRVEMPFSIQNLVYRSELFGVKPGTWAKPSQLAATIESIFK------- 131

Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
                L +++ ++S D +       ++  D  +                      +LG++
Sbjct: 132 ----DLKLSV-LISKDSN-------IIPEDVKTMRAPFLLLIPI-----------LLGMK 168

Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQE-ESAIYLDPHDVQPVINIGKDD 381
            V  ++IP ++ TF  P+ LG V G    S ++VG+ E ++ +Y DPH  +  +      
Sbjct: 169 DVEQKFIPFIKYTFQRPEFLGAVSGSSDFSYFLVGLSEDQNVVYFDPHVTKQAVASS--- 225

Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYC 412
              D S +     R I + S++PS  +GF+C
Sbjct: 226 --FDHSEFFEVPPRGIKMKSLNPSFLLGFFC 254


>gi|118349810|ref|XP_001008186.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89289953|gb|EAR87941.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 343

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 72/293 (24%), Positives = 125/293 (42%), Gaps = 37/293 (12%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
           I  SYR GF     + I SD GWGCMLRS QM+ A  LL H    P    +Q     + +
Sbjct: 27  IYFSYRSGFSHQFQNHIFSDSGWGCMLRSGQMIFANGLLRHLKENP---QIQNQLKIQNI 83

Query: 208 E-----ILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSWEALARCQRAET 261
                 I+  F +++  PFSI  +   A + + L  G W  P  +  S + L    +  +
Sbjct: 84  NDILLFIIKFFIENKDQPFSIQQIAAVALEEFKLEMGFWYSPNRIAYSLKKLLNNFQTFS 143

Query: 262 GLGCQS------LPMAIYVVSGDEDGERGGAPV------VCIDDASRHCSVFSKGQADWT 309
            +   S       P+          G++  + +      + I++  +   +  +    + 
Sbjct: 144 EMNIVSEVMYSDRPLYFSQCVTAMTGQKIDSTLPKQLLQILINNIEKQIKIMKQNSNKYQ 203

Query: 310 PILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
                  +++GL+    +Y+  L   FT   S+G           ++G+  +   YLDPH
Sbjct: 204 INKQNYKILIGLDYPEEKYLDILIKLFTHRLSIG-----------MIGLNNDKLTYLDPH 252

Query: 370 DVQPV-INIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
            VQ   IN      E +  TY  + ++ I+  ++ PS+ +GFY +D +D ++F
Sbjct: 253 IVQHADINTN----EINLKTYFQEEVKQINKHALGPSVGLGFYLKDLNDLNEF 301


>gi|291059129|gb|ADD71908.1| autophagy protein 4 [Acanthamoeba castellanii]
          Length = 373

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 47/117 (40%), Positives = 68/117 (58%), Gaps = 4/117 (3%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
           F  DF SR+ ++YR  F  IG++ + +D+GWGCMLR+ QML+AQAL+ H LGR WR   +
Sbjct: 147 FLTDFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAE 206

Query: 200 KPFDREYVEILHLFGD--SETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCRSWEAL 253
           +     Y E+L  F D  S  SP+SIH + + G + +    G W  P  +  +   L
Sbjct: 207 ENM-MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLL 262


>gi|395750455|ref|XP_002828707.2| PREDICTED: cysteine protease ATG4D [Pongo abelii]
          Length = 296

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 61/248 (24%), Positives = 110/248 (44%), Gaps = 44/248 (17%)

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
           +R + +I+  F D   +PF +H L++ G++ G  AG W GP         +A   R    
Sbjct: 51  ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP-------SLVAHILRKAVE 103

Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLE 322
              +   + +YV                    S+ C+V           L +  L +   
Sbjct: 104 SCSEVTRLVVYV--------------------SQDCTV-----------LHMRSLAIDPS 132

Query: 323 KVNPRYIPT-LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
           K     +P+ L+        LGI+GGKP  S Y +G Q++  +YLDPH  QP +++ + +
Sbjct: 133 KDRSTCLPSSLQELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQAN 192

Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLF 438
              +  ++H    R +    +DPS  +GFY  D+ +F+  C+  +++   S+     P+F
Sbjct: 193 FPLE--SFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMF 250

Query: 439 TVTQTHKK 446
           T+ + H +
Sbjct: 251 TLAEGHAQ 258


>gi|448509127|ref|XP_003866066.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
 gi|380350404|emb|CCG20626.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
          Length = 419

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 80/260 (30%), Positives = 117/260 (45%), Gaps = 39/260 (15%)

Query: 153 RKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHL 212
           R  FD   +   TSD GWGCM+R+SQ L+A AL          K   +      +EIL L
Sbjct: 130 RSLFD---NENFTSDAGWGCMIRTSQNLLANAL---------LKLAGEANGNVQLEILKL 177

Query: 213 FGDSETSPFSIHNLLQAGKAYGLAA--GSWVGPYAMCRSWEALARCQRAETGLGCQSLPM 270
           F D   + FSIHN ++   A  L+   G W GP A   S   L         +  Q  P 
Sbjct: 178 FQDDPNAAFSIHNFIRVASASPLSVKPGQWFGPNAASISIRQLT------IEMTDQESPT 231

Query: 271 AIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIP 330
            +  V   E+ +         DD      +  K      P+LLL P+ LG++ VN  Y  
Sbjct: 232 VVPFVYISENAD-------LYDDEIEETFLKEK-----RPLLLLFPVRLGIDHVNKYYYK 279

Query: 331 TLRLTFTFPQSLGIVGGKPGASTYIVGVQ-EESAIYLDPHDVQPVINIGKDDLEADTSTY 389
           ++        S+GI GGKP +S Y +G + +E+ IY DPH  Q        +   + ++Y
Sbjct: 280 SILQLLASRFSVGIAGGKPSSSFYFIGYENDENLIYFDPHLPQVF------ESPINLASY 333

Query: 390 HSDVIRHIHLDSIDPSLAIG 409
           H+     + ++ +DPS+ IG
Sbjct: 334 HTLNYNKLSIEMLDPSMMIG 353


>gi|195350257|ref|XP_002041657.1| GM16788 [Drosophila sechellia]
 gi|194123430|gb|EDW45473.1| GM16788 [Drosophila sechellia]
          Length = 269

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 62/227 (27%), Positives = 104/227 (45%), Gaps = 24/227 (10%)

Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
           +SIH + Q G++   A G W+GP  + +  + L R     +        +AI+V      
Sbjct: 4   YSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSS--------LAIHVAMD--- 52

Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
                   V +DD    C    +    W P+LL++PL LG+  +NP Y+P L+       
Sbjct: 53  ------STVVLDDVYASC----REGGSWKPLLLIIPLRLGITDINPLYVPALKRCLELDS 102

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADT---STYHSDVIRHI 397
           S G++GG+P  + Y +G  ++  +YLDPH  Q    + +    A+     TYH      +
Sbjct: 103 SCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARL 162

Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           +  ++DPSLA+ F C+  D F+    +  +         LF ++QT 
Sbjct: 163 NFSAMDPSLAVCFLCKTSDSFESLLTQFKEEVLSLCSPALFEISQTR 209


>gi|149020503|gb|EDL78308.1| rCG31864, isoform CRA_a [Rattus norvegicus]
          Length = 256

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 60/176 (34%), Positives = 81/176 (46%), Gaps = 45/176 (25%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S   S + L G C+     E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSK-ISSVHLCGRCYHF---EGEGD------IQRFQRDFVSRLWLTYRRDFPPLAG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-------------------------- 195
           S +TSD GWGCMLRS QM++AQ LL H L R WR                          
Sbjct: 134 S-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWRWVEGTGLASSEMPGPASPSRYRGPGR 192

Query: 196 --------KPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
                     L+   DR +  I+  F D   +PF +H L++ G++ G  AG W GP
Sbjct: 193 RGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGP 248


>gi|146161894|ref|XP_001008187.2| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|146146576|gb|EAR87942.2| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 516

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 98/400 (24%), Positives = 160/400 (40%), Gaps = 78/400 (19%)

Query: 142 QDFSSRILISYRKGFDPIGD------------SKITSDVGWGCMLRSSQMLVAQALLFHR 189
           ++F + I I+YRK F  + +            S+  SD GWGCM+R  QM  A+ L  H 
Sbjct: 71  ENFYNIIWITYRKNFPALLNMIDKANLKNQKMSEYISDTGWGCMVRVGQMAFAEGLRRHL 130

Query: 190 LGRPWRKPLQKPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSWVGPY 244
           +    +K + K  +   V I     D +     +P+SI  + + A   + L  G W  P 
Sbjct: 131 VEN--KKLVVKKKEDLRVIIEGFLDDDQKCIDFAPYSIQKISKIALSDFNLLPGEWYTPI 188

Query: 245 AMCRSWEALARCQRAETGLGCQSLPMAIY-----VVSGD-------EDGERGGAPVVCID 292
            +C     L   ++A  G   + L +A++     +V  D        D +RG    +C +
Sbjct: 189 RICYILGLLHNERKAIKG--TEDLKVAVFSSSRPIVFQDFLERMCKVDPQRGKHAQICPN 246

Query: 293 -------------DASRHCSVFSKGQ---------ADWTPILLLV-PL------------ 317
                        D   H  +  + Q         ++ TP L LV P+            
Sbjct: 247 QCRIIKQDQKSKVDHDHHKDIKLEKQNSNSEILVVSEETPKLRLVCPIHHELQYSMIVYI 306

Query: 318 --VLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI 375
             ++GL+   P Y+   +    F  SLG++GGKP  + Y VG  E+  IYLDPH VQ   
Sbjct: 307 VCLIGLDTPQPEYLELAKKMMDFKYSLGLIGGKPKKALYFVGRIEDEFIYLDPHYVQEFS 366

Query: 376 NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGA 435
           N       +   TY     +     +ID S ++ +Y +D +  ++F      L  + N  
Sbjct: 367 NEKNFQSSSQLETYFCKKFQTYPSKNIDSSFSLMYYLKDLEQLEEFYQFMMGLKRDYNEH 426

Query: 436 PLFTVTQTHKKPVNHSDVLG---ETGGVPEDDSLGVMSMN 472
               +  T       S  LG   E+  +  D +L +++ N
Sbjct: 427 FFMMMEDTEP-----SFCLGDGKESSNLISDKNLNILADN 461


>gi|323331874|gb|EGA73286.1| Atg4p [Saccharomyces cerevisiae AWRI796]
          Length = 347

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 69/247 (27%), Positives = 109/247 (44%), Gaps = 28/247 (11%)

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           M+R+ Q L+  AL    LGR +R    +  +RE  + ++ F D+  +PFS+HN + AG  
Sbjct: 1   MIRTGQSLLGNALQILHLGRDFRVNGNESLERES-KFVNWFNDTPEAPFSLHNFVSAGTE 59

Query: 233 YG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCI 291
                 G W GP A  RS ++L        G     +   I  VS  +  E     V   
Sbjct: 60  LSDKRPGEWFGPAATARSIQSLIY------GFPECGIDDCIVSVSSGDIYENEVEKVFAE 113

Query: 292 DDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGA 351
           +  SR              IL L+ + LG+  VN  Y  ++    +  QS+GI GG+P +
Sbjct: 114 NPNSR--------------ILFLLGVKLGINAVNESYRESICGILSSTQSVGIAGGRPSS 159

Query: 352 STYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFY 411
           S Y  G Q    ++ DPH  QP +       ++   + H+     + L  +DPS+ IG  
Sbjct: 160 SLYFFGYQGNEFLHFDPHIPQPAVE------DSFVESCHTSKFGKLQLSEMDPSMLIGIL 213

Query: 412 CRDKDDF 418
            + + D+
Sbjct: 214 IKGEKDW 220


>gi|298712912|emb|CBJ33424.1| Autophagy-related protein 4 [Ectocarpus siliculosus]
          Length = 546

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 55/149 (36%), Positives = 79/149 (53%), Gaps = 12/149 (8%)

Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
           ++LLVPL LGL++++  YIP+L  T   PQSLG +GG+P  + + +G Q  +   LDPH 
Sbjct: 380 VVLLVPLRLGLDELSTGYIPSLLETLRVPQSLGFLGGRPNHAIFFIGAQGNTLTGLDPHT 439

Query: 371 VQPVINIGKD-DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLA 429
            QP  ++G+    E    + H      + +  IDPSLA+ FY  D+  F+D   R     
Sbjct: 440 TQPAADMGEGFPSERYVHSLHCQSAVSMDVHRIDPSLALAFYLPDRATFEDLIKRIG--- 496

Query: 430 EESNGAPLFTVTQTHKKPVNHSDVLGETG 458
            E+N  P F+V QT        D  GE G
Sbjct: 497 -ETN-PPPFSVEQTRP------DYEGEMG 517



 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/138 (39%), Positives = 70/138 (50%), Gaps = 23/138 (16%)

Query: 114 WLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCM 173
           W++G+ +   ++E            E   D  S + I+YR GF  +     T D GWGCM
Sbjct: 38  WIMGIPYTELREE------------ERRLDVFSTMWITYRSGFPKMEPYGYTDDSGWGCM 85

Query: 174 LRSSQMLVAQALLFHRLGRPWRKP------LQKPFDREYVEILHLFGD--SETSPFSIHN 225
           LRS+QML+ QAL  H LGR WR P      L+ P   EY  ++ LF D   E + FSIHN
Sbjct: 86  LRSAQMLMTQALQRHTLGRSWRVPRTLEERLRVP---EYRTLVRLFADHPGEANLFSIHN 142

Query: 226 LLQAGKAYGLAAGSWVGP 243
           + Q G  Y    G W GP
Sbjct: 143 MCQVGIRYDKLPGEWYGP 160


>gi|389602150|ref|XP_001566661.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|322505338|emb|CAM40177.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 398

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 116/273 (42%), Gaps = 31/273 (11%)

Query: 148 ILISYRKGFD--PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDRE 205
           ++ +YR GF+  P     I +D GWGC+LR+SQML+A  L  +  GRP  + L   FD  
Sbjct: 46  LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFLWAY--GRPADRRLALFFDH- 102

Query: 206 YVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGC 265
                     +ET+PFSIHNL+++          +  P   C   EA+ R    +  +  
Sbjct: 103 ---------SAETAPFSIHNLIRSVWNQRAFKAEYWSPSQGC---EAIKRTM--QDAIKT 148

Query: 266 QSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN 325
           + L   + VV+             C+     H   F +G A+   +L  V +    +   
Sbjct: 149 EQLQTRVTVVTSTNG---------CVYADEVH-HTFKQG-AEVVLVLASVRVSAAAQLTQ 197

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD 385
             Y+   +L    PQ LGIVGG PG S Y     +    YLDPH       +        
Sbjct: 198 ESYLQIEKL-MEQPQCLGIVGGVPGRSYYFFAHNQTQLFYLDPHQRTTAALLSDGPSATV 256

Query: 386 TSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
           + T     +R +H   +D SL + F    +D++
Sbjct: 257 SVTPSVSDVRCVHWSRVDTSLFLAFAVTTRDEW 289


>gi|72389991|ref|XP_845290.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359288|gb|AAX79730.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei]
 gi|70801825|gb|AAZ11731.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
           brucei strain 927/4 GUTat10.1]
          Length = 327

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 78/292 (26%), Positives = 136/292 (46%), Gaps = 38/292 (13%)

Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
           +S  L +YR+ FDP+  S +TSD GWGC+ R++QML+A +L         R+   +    
Sbjct: 41  NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91

Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 262
           +Y   L    D + +PFS+H +++    + L  G  + P  +A  +  EA++ C +  T 
Sbjct: 92  QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144

Query: 263 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
            G  S P+++ + V+G    E     V C    SR+             +L+L PL  G 
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187

Query: 322 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGK 379
            + ++ +   +L      P+S+G+VGG P    YI+G   +E  +YLDPH       +  
Sbjct: 188 SRYMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCKTQDALLSS 247

Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
           +  E       S  +R +    +D S  +GF+   +  ++    R   L+++
Sbjct: 248 EPGETGVVKPTSSNLRSVPYGQVDTSFFLGFFVDSQSRWESLQKRIEGLSKQ 299


>gi|261328682|emb|CBH11660.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
           gambiense DAL972]
          Length = 327

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 78/292 (26%), Positives = 136/292 (46%), Gaps = 38/292 (13%)

Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
           +S  L +YR+ FDP+  S +TSD GWGC+ R++QML+A +L         R+   +    
Sbjct: 41  NSFYLFTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL---------RRHSAQDCKL 91

Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP--YAMCRSWEALARCQRAETG 262
           +Y   L    D + +PFS+H +++    + L  G  + P  +A  +  EA++ C +  T 
Sbjct: 92  QYFADL---DDEQVAPFSLHCMVR----HILKQGESLRPVYWAPSQGCEAISGCVKRATE 144

Query: 263 LGCQSLPMAIYV-VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL 321
            G  S P+++ + V+G    E     V C    SR+             +L+L PL  G 
Sbjct: 145 RGILSSPLSVVITVAGAVPAEE----VSCHLKESRN-------------VLILAPLRCGA 187

Query: 322 EK-VNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGK 379
            + ++ +   +L      P+S+G+VGG P    YI+G   +E  +YLDPH       +  
Sbjct: 188 SRCMSQKMFLSLEHLLLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCKTQDALLSG 247

Query: 380 DDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
           +  E       S  +R +    +D S  +GF+   +  ++    R   L+++
Sbjct: 248 EPGETGVVKPTSSNLRSVPYGQVDTSFFLGFFVDSQSRWESLQKRIEGLSKQ 299


>gi|428184439|gb|EKX53294.1| hypothetical protein GUITHDRAFT_133035 [Guillardia theta CCMP2712]
          Length = 567

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 53/146 (36%), Positives = 80/146 (54%), Gaps = 10/146 (6%)

Query: 297 HCSVFSKGQ--ADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTY 354
           +CS  ++ +    W P++++VP+ LG    +      L       QSLG +GG+P  S Y
Sbjct: 406 NCSRMAQAREPCSWRPLIVVVPVRLGARSEDQH----LSRIDKHLQSLGFIGGRPRHSYY 461

Query: 355 IVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRD 414
            VGV+  +A YLDPH  QP  +I K+    + +++H      + L  IDPSLA+GFYC D
Sbjct: 462 FVGVRGYNAYYLDPHITQPYQSIRKN---INVASFHCAHPGKMSLAHIDPSLALGFYCDD 518

Query: 415 KDDFDDFCARASKLAEESNGAPLFTV 440
           K DF+D   R  +LA   +  P+ +V
Sbjct: 519 KSDFEDLIRRVEELA-AGDSHPILSV 543


>gi|340054025|emb|CCC48319.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma vivax Y486]
          Length = 326

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 118/280 (42%), Gaps = 35/280 (12%)

Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
           +S  L++YR  F+P+  S +TSD GWGC+ R+SQML+A  L  H                
Sbjct: 41  TSFYLLTYRMNFEPLPCSTLTSDRGWGCLARASQMLLAHVLRRHAASEC----------- 89

Query: 205 EYVEILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETG 262
            +++      D   +PFS+H + +A   +G    A  W  P   C   EA+  C  +   
Sbjct: 90  -HLKFFCDMNDEHLAPFSLHCMTRAVIKHGTEFRADYW-APSQGC---EAIRSCVESAVR 144

Query: 263 LGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGL- 321
            G  +  +++ V S     ER                +    + D + +L+LVP+  G  
Sbjct: 145 QGLLTQKLSVVVSSSGTIPER---------------EIHEHLRGDGS-VLVLVPVRCGTS 188

Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
            ++       L      P  +G+VGG P    YIVG      +YLDPH +     +  + 
Sbjct: 189 RRMTQTMFFALEHLLHIPSCMGVVGGVPNRGYYIVGTSGHRLLYLDPHCMTQNAMVSCEL 248

Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
            +    T  ++++R +  D +D S   GF     D+++  
Sbjct: 249 GKVGIVTPTTNLLRSVRWDHVDTSFFFGFLLDSLDEYEKL 288


>gi|345311182|ref|XP_001519565.2| PREDICTED: cysteine protease ATG4D-like, partial [Ornithorhynchus
           anatinus]
          Length = 147

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 32/136 (23%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW----R 195
           F +DF SR+ ++YR+ F P+  S  TSD GWGCMLRS QML+AQ L+ H L R W     
Sbjct: 5   FQRDFVSRLWLTYRRDFPPLEGSAWTSDCGWGCMLRSGQMLLAQGLVVHLLSRDWIWAEA 64

Query: 196 KPLQKP----------------------------FDREYVEILHLFGDSETSPFSIHNLL 227
            P  KP                             +R++  I+  F D   +PFS+H L+
Sbjct: 65  GPAPKPGEHRLLKSDPGGPSRSPAPPPPAGVLQEQERQHRRIVSWFADHPQAPFSLHRLV 124

Query: 228 QAGKAYGLAAGSWVGP 243
           + G+  G  AG W GP
Sbjct: 125 RLGQGSGKRAGDWYGP 140


>gi|119623097|gb|EAX02692.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_d
           [Homo sapiens]
          Length = 172

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 45/127 (35%), Positives = 69/127 (54%), Gaps = 11/127 (8%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 33  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 81

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH + +  + 
Sbjct: 82  MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMEKMCRV 141

Query: 233 YGLAAGS 239
             L+A +
Sbjct: 142 LPLSADT 148


>gi|167381603|ref|XP_001735783.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165902089|gb|EDR28003.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 359

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 92/389 (23%), Positives = 156/389 (40%), Gaps = 72/389 (18%)

Query: 80  AAVKRLVTAGSMRRI--------HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGD 130
           A  ++LV  GS   +        HE +  P   G  S     ++LGV  K  Q D+ L +
Sbjct: 2   AYFQKLVQHGSYNILSKFYNQIGHEDIQKPIFIGGCS----FYILGVEFKTKQMDKQLAE 57

Query: 131 AAGNNGL----AEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL- 185
                 L    A      S+   ++YR G++ + +S +T+DVGWGC +R+ QM++A A+ 
Sbjct: 58  QPPEVYLQYSSAPAFFRISNLFWMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAME 117

Query: 186 ------LFHRLGRPWRKPLQKPFDREYVEILHLFGDS--ETSPFSIHNLLQAGKAY--GL 235
                   +    P+      P   E + +L  F DS   T+P SIH++ ++        
Sbjct: 118 TIVYSGALNNTQTPYI-----PTKEEIMNVLVPFIDSPNSTTPLSIHHVYESRFVVEKNK 172

Query: 236 AAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS 295
           +  +++ P  + +++  L    +                            P+ C+  ++
Sbjct: 173 SGVNYLAPSVVAKAYSGLVNSWKL--------------------------CPIRCVMCSN 206

Query: 296 RHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
                    +  + P L+ +P+VL     N      L+  +      GIVGG    + ++
Sbjct: 207 VSIPTHELSKLPFKPTLVFLPIVL-----NHLIHSKLQQIYKSKLFAGIVGGMGDRAIFV 261

Query: 356 VGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS-----LAIGF 410
            G      +YLDPH VQP     K   E DT +Y         + +IDP+        GF
Sbjct: 262 FGFHALQFLYLDPHIVQPSF---KSFTEIDTKSYSPISTNRFSVHTIDPTKLDDFCTFGF 318

Query: 411 YCRDKDDFDDFCARASKLAEESNGAPLFT 439
             ++  + DDF   A ++ E SN   L T
Sbjct: 319 LIKNFHEIDDFMKFAKEVFEISNDKELRT 347


>gi|256078123|ref|XP_002575347.1| autophagin-1 (C54 family) [Schistosoma mansoni]
 gi|360045353|emb|CCD82901.1| autophagin-1 (C54 family) [Schistosoma mansoni]
          Length = 556

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 46/122 (37%), Positives = 69/122 (56%), Gaps = 4/122 (3%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 196
           E  +  +SR+ ++YRKGF PIG      SD GWGCM R  QM++A+A+L   LGR W+  
Sbjct: 37  EIARHLNSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRFHLGRSWKWS 96

Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
           P Q+    EY  +L +F D  ++ +SI  +   G + G + GSW GP  + +  + L+  
Sbjct: 97  PEQE--SPEYYRLLQMFQDRRSALYSIQTITLTGVSLGKSIGSWFGPNTVAQVLKKLSVY 154

Query: 257 QR 258
            R
Sbjct: 155 DR 156



 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/122 (37%), Positives = 63/122 (51%), Gaps = 15/122 (12%)

Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEAD-TSTYHSDV 393
            F  P  +GI+GG P  + +IVGV ++  I LDPH  QP    G+ +L+ D   TYH D 
Sbjct: 350 VFRLPHCVGILGGSPCHAVWIVGVTDDDVICLDPHTTQPA---GRGNLKPDYDQTYHCDN 406

Query: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE------SNGAPLFTVTQTHKKP 447
              I L  +DPS+ +GF C  + +FDD C     L EE      +N  PL  +  T  +P
Sbjct: 407 PIRIPLKRLDPSMVLGFLCSTEKEFDDLC---HNLKEEVLHPSVANSWPLVEIHTT--RP 461

Query: 448 VN 449
            N
Sbjct: 462 SN 463


>gi|403345460|gb|EJY72096.1| Cysteine protease family C54 putative [Oxytricha trifallax]
          Length = 823

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 43/113 (38%), Positives = 67/113 (59%), Gaps = 3/113 (2%)

Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
           IL+++P  LGL KVN  Y  +++  F    ++GI+GG+P  + Y VG Q+   I LDPH 
Sbjct: 611 ILVIIPTRLGLNKVNKEYYSSIKYVFQCRLNVGIMGGRPNQALYFVGTQKTDLICLDPHL 670

Query: 371 VQPVINIGKDDLE--ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           VQ  + + +++L       TYH D  + + +  +D SLA GFY +D +DF+ F
Sbjct: 671 VQDTV-LNQEELSNVELNQTYHCDQAKKLSMTKLDTSLAFGFYLKDYNDFEVF 722



 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 10/99 (10%)

Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLG-----RPWRKPLQKPFDREYVEILHLFGD---S 216
           T+DVGWGC +R  QM++ QAL+ H +G     +      QK  +  Y +I+ L  D   S
Sbjct: 394 TTDVGWGCTIRVGQMMICQALMRHLIGLDHSVKNLSSTEQKRLN--YAKIIQLIHDNDCS 451

Query: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALAR 255
           +T  FSI N+ + G  +    G W GP+A+      L R
Sbjct: 452 QTGAFSIQNIAKMGFCHDKLPGEWYGPHALTIMLRDLNR 490


>gi|343472883|emb|CCD15086.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 327

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 83/290 (28%), Positives = 131/290 (45%), Gaps = 42/290 (14%)

Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
           L +YRK F+P+  S IT+D GWGC+ R+SQML+A AL         R+ +   F  +Y  
Sbjct: 45  LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMTLDFSFQYFC 95

Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
            +    D   +PFS+H ++++    G  L    W  P   C   EA++ C R+    G  
Sbjct: 96  DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRSAIHRGAL 148

Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 325
              + + V         G A  +   + +RH      G A     L+LVP+  G   ++ 
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGKDDLEA 384
            +   +L      P  +G+VGG PG   YIVG   +E  +YLDPH +     +     E+
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIVGTGGQELLLYLDPHCMTQEALVS---CES 249

Query: 385 DTSTYHSDVIRH---IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
           DT+       RH   +  D +D S  IGF+    + ++D   +   L+ +
Sbjct: 250 DTAGVVRPTPRHLLCVPYDRVDTSFFIGFFVDSFELWEDLQKKIEGLSRQ 299


>gi|221046296|dbj|BAH14825.1| unnamed protein product [Homo sapiens]
          Length = 280

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 60/199 (30%), Positives = 89/199 (44%), Gaps = 44/199 (22%)

Query: 79  TAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
            A ++ L  AG    +    +  SRT  S  +S    + +C +  + E  GD      + 
Sbjct: 84  VAVMQVLHLAGRCPYVSPGWVVKSRTSFSKISS----IHLCGRRYRFEGEGD------IQ 133

Query: 139 EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---- 194
            F +DF SR+ ++YR+ F P+    +TSD GWGCMLRS QM++AQ LL H L R W    
Sbjct: 134 RFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAE 193

Query: 195 ---------------------------RKPLQKP---FDREYVEILHLFGDSETSPFSIH 224
                                      R     P    +R + +I+  F D   +PF +H
Sbjct: 194 GMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLH 253

Query: 225 NLLQAGKAYGLAAGSWVGP 243
            L++ G++ G  AG W GP
Sbjct: 254 RLVELGQSSGKKAGDWYGP 272


>gi|183230042|ref|XP_653798.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|169803042|gb|EAL48412.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449708555|gb|EMD47997.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 359

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 144/344 (41%), Gaps = 52/344 (15%)

Query: 113 IWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRILISYRKGFDPIGDSKITS 166
            ++LGV  K  Q D+ L +      L     A F +  S+   ++YR G++ + +S +T+
Sbjct: 39  FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAATFFR-ISNLFWMTYRSGYEKLPNSSLTT 97

Query: 167 DVGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVEILHLFGDS--ETSPFS 222
           DVGWGC +R+ QM++A A+  + +       +    P  +E + +L  F DS   T+P S
Sbjct: 98  DVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIPTKQEVMNVLIPFIDSPNSTTPLS 157

Query: 223 IHNLLQAGKAY--GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
           IH++ ++        +  +++ P  + +++  L    +                      
Sbjct: 158 IHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLVNSWKL--------------------- 196

Query: 281 GERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
                 P+ C+  ++         +  + P L+ +P+VL     N      L+  +    
Sbjct: 197 -----CPIRCVMCSNVSIPTHELSKLPFKPTLVFLPIVL-----NHLIHSKLQQIYKSKL 246

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
             GIVGG    + ++ G      +YLDPH VQP     K   E DT +Y         + 
Sbjct: 247 FAGIVGGMGDRAIFVFGFHALQFLYLDPHIVQPSF---KSFTEIDTKSYSPIGTNRFSVH 303

Query: 401 SIDPS-----LAIGFYCRDKDDFDDFCARASKLAEESNGAPLFT 439
           +IDP+        GF  ++  + DDF   A  + E SN   L T
Sbjct: 304 TIDPTKLDDFCTFGFLIKNLHEVDDFMKLAKDVFEISNDKELRT 347


>gi|76156435|gb|AAX27646.2| SJCHGC05841 protein [Schistosoma japonicum]
          Length = 414

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 4/122 (3%)

Query: 139 EFNQDFSSRILISYRKGFDPIGDSK-ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWR-K 196
           E      SR+ ++YRKGF PIG      SD GWGCM R  QM++A+A+L   LGR WR  
Sbjct: 43  EIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSWRWS 102

Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARC 256
           P Q+    EY  +L +F D  +  +SI  +   G + G + GSW GP  + +  + L+  
Sbjct: 103 PEQE--SPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSVY 160

Query: 257 QR 258
            R
Sbjct: 161 DR 162


>gi|50291183|ref|XP_448024.1| hypothetical protein [Candida glabrata CBS 138]
 gi|62899752|sp|Q6FP20.1|ATG4_CANGA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|49527335|emb|CAG60975.1| unnamed protein product [Candida glabrata]
          Length = 483

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 77/265 (29%), Positives = 122/265 (46%), Gaps = 39/265 (14%)

Query: 166 SDVGWGCMLRSSQMLVAQALLFHRLGRPWR-KPLQKPFDREYVEILHLFGDSETSPFSIH 224
           +DVGWGCM+R+ Q L+  AL   R+    + +P     D +  EI  LF D+  S FS+ 
Sbjct: 135 TDVGWGCMIRTGQSLLGNAL--QRVKSTVKDQPYIYEMD-DTKEITDLFKDNTKSAFSLQ 191

Query: 225 NLLQAGKAYG-LAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           N ++ G+ Y  +A G W GP         L +         C      I V SGD   E 
Sbjct: 192 NFVKCGRIYNKIAPGEWFGPATTATCIRYLIQENPCYGIEAC-----YISVSSGDIFKEN 246

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTP---ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQ 340
                              +G  D  P   IL+L+ + LGL+ V+ RY   ++     P 
Sbjct: 247 ------------------IQGMIDRYPNGNILILLGIKLGLDSVHERYWGEIKTMLESPF 288

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
           S+GI GG+P +S Y  G  +++ ++ DPH+ Q  +    DD +    + H++    ++  
Sbjct: 289 SVGIAGGRPSSSLYFFGYFDDTLLFFDPHNSQTAL---IDDFD---ESCHTENFGKLNFS 342

Query: 401 SIDPSLAIGFY--CRDKDDFDDFCA 423
            +DPS+ +GF   C   D+F +F +
Sbjct: 343 DLDPSMLLGFLLPCSKWDEFQEFTS 367


>gi|351695136|gb|EHA98054.1| Cysteine protease ATG4A [Heterocephalus glaber]
          Length = 356

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 67/257 (26%), Positives = 102/257 (39%), Gaps = 87/257 (33%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 79  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 127

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA 232
           MLR  QM++AQAL+   LGR                                   Q G  
Sbjct: 128 MLRCGQMMLAQALICRHLGRA----------------------------------QMGVG 153

Query: 233 YGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCID 292
            G + G W GP  + +  + LA      +        +A+YV   +          V I+
Sbjct: 154 EGKSVGEWFGPNTVAQVLKKLALFDEWNS--------LAVYVSMDN---------TVVIE 196

Query: 293 DASRHCSV--FSKGQAD----------------------WTPILLLVPLVLGLEKVNPRY 328
           D  + C +  FS   AD                      W P+LL+VPL LG+ ++NP Y
Sbjct: 197 DIKKMCRILPFSADTADESPPDSFITSNQSKGTSAFCPAWKPLLLIVPLRLGINQINPVY 256

Query: 329 IPTLRLTFTFPQSLGIV 345
           +   + TF   +  G V
Sbjct: 257 VDAFK-TFVDTEENGTV 272


>gi|401427503|ref|XP_003878235.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
 gi|322494482|emb|CBZ29784.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
          Length = 388

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 73/295 (24%), Positives = 123/295 (41%), Gaps = 44/295 (14%)

Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
           +G  EF +  + ++L  SYR  F P+ + + T+D  WGC++R++QMLV   LL +     
Sbjct: 54  DGTTEFVKAAAKKLLYFSYRNCFPPLPN-RSTTDTRWGCLVRTTQMLVGSCLLRYHCKGA 112

Query: 194 WRKPLQKPFDREYVE----ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRS 249
           +  P     +R+  E    I  LF D  ++P  IH +        +   S + P      
Sbjct: 113 YVLP-----ERDNAELKERISRLFMDVPSAPLGIHKVEDEAHKNSVKYASMLSP------ 161

Query: 250 WEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHCSVFSKGQADW 308
                     E G+   +  +A +   GD       AP   C ++ +   S      ++ 
Sbjct: 162 ---------TEAGMAIAAALIAFHAQGGD-------APFTFCCENRNIDESAVMAKLSEG 205

Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
             ++L++P+VLG+  ++ +Y   L          GI GG   AS Y+ G Q  +  ++DP
Sbjct: 206 QHVILIIPVVLGIAPMSGQYERMLLKILDMKACCGIAGGFKQASLYMFGHQGRNVFFMDP 265

Query: 369 HDVQPVINIGKD--DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           H VQ     G+    LE             +     DP + +GFY     D+ +F
Sbjct: 266 HYVQRAYTSGRTVGTLEGARG--------DLAARRFDPCMVLGFYLHTPADYCEF 312


>gi|402581511|gb|EJW75459.1| peptidase family C54 containing protein [Wuchereria bancrofti]
          Length = 256

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 47/122 (38%), Positives = 64/122 (52%), Gaps = 12/122 (9%)

Query: 128 LGDAAGNNG-LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALL 186
           LG+   + G +A   +  +S +  +YRK F PIG +  T+D GWGCMLR  QML+A+ L+
Sbjct: 30  LGEKFTSRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLI 89

Query: 187 FHRLGRPWRKPLQKPFDR-----EYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWV 241
              LG  W       +DR     EY  IL +F D +   FSIH +   G + G   G W 
Sbjct: 90  VRHLGHNWL------WDRDVKLTEYKRILRMFQDKKNCLFSIHQIANMGVSEGKEIGEWF 143

Query: 242 GP 243
           GP
Sbjct: 144 GP 145


>gi|432110194|gb|ELK33968.1| Cysteine protease ATG4A, partial [Myotis davidii]
          Length = 256

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 11/114 (9%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG  H +  +++           +   D S+R+  +YR+ F PIG +  +SD GWGC
Sbjct: 27  VWILGKQHLLKTEKS-----------KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGC 75

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           MLR  QM++AQAL+   LGR W    QK   +EY  IL  F D +   +SIH +
Sbjct: 76  MLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDCCYSIHQM 129



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 51/91 (56%), Gaps = 1/91 (1%)

Query: 354 YIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 413
           Y +    +  I+LDPH  Q  ++  +D    D + +     + +++ ++DPS+A+GF+C+
Sbjct: 124 YSIHQMGDELIFLDPHTTQTFVDTEEDGTVDDQTFHCLQSPQRMNILNLDPSVALGFFCK 183

Query: 414 DKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           ++ DFD++C+   K   + N   +F + Q H
Sbjct: 184 EEKDFDNWCSLVQKEILKEN-LRMFELVQKH 213


>gi|118378678|ref|XP_001022513.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89304280|gb|EAS02268.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 649

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 85/358 (23%), Positives = 145/358 (40%), Gaps = 36/358 (10%)

Query: 148 ILISYRKGFDPIGD-----SKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPF 202
           I  SYR  F  I D       +++D GWGCM+R SQML+A+AL  H L     +  Q   
Sbjct: 145 IWFSYRNNFPLIRDVADDNQSVSNDYGWGCMIRCSQMLLAEALKRHYLNDQNIQIEQLSQ 204

Query: 203 DRE---YVEILHLFGD--SETSPFS------------IHNLLQAGKAYGLAAGSWVGPYA 245
           D E   Y  I+ LF D  SE+   +            + N       Y L     +   A
Sbjct: 205 DDEKHFYSNIIKLFLDCTSESDVLNQPGSYQDIQSKMLLNEQNLNNIYSLFGIQNICQSA 264

Query: 246 MCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---RHCSVFS 302
           + R ++     +   T +    +   I   S  +   + G  ++   D     +     S
Sbjct: 265 ILRQYQQ--NVKNWYTSIQVSVILQEILEESQSKLNSKLGFHILNFTDQIIFLKELEEAS 322

Query: 303 KGQAD-WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEE 361
           + Q D    IL++V L  G+ K   ++             +G + G      YI+G QE+
Sbjct: 323 RKQNDRLNNILVMVHLKFGINKFEMQHKDYFIELLKIKNFVGALSGTETKGMYIIGFQED 382

Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR--DKDDFD 419
             I LDPH +Q     G+  L+ D  TY +   R I L+ +   +++G++ +  ++   +
Sbjct: 383 RLIVLDPHFIQKSTE-GEQGLDKDYCTYFNKTPRSISLECLSSDISLGYFIQVNEEQSIN 441

Query: 420 DFCARASKLAEESNGAPLFTV----TQTHKKPVNHSDVLGETGGVPEDDSLGVMSMND 473
            F  +   L E+ +  PL ++     +T +  +    +  E       DS+  +S N+
Sbjct: 442 QFIDQILTLNEK-HKEPLLSILNDRIETDEMEIEEHQINKEVKDQENQDSVNNISQNE 498


>gi|342181415|emb|CCC90894.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma congolense
           IL3000]
          Length = 327

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 81/290 (27%), Positives = 129/290 (44%), Gaps = 42/290 (14%)

Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE 208
           L +YRK F+P+  S IT+D GWGC+ R+SQML+A AL         R+ +   F  +Y  
Sbjct: 45  LFTYRKDFEPLPRSVITTDKGWGCLARASQMLLACAL---------RRHMALDFSFQYFC 95

Query: 209 ILHLFGDSETSPFSIHNLLQAGKAYG--LAAGSWVGPYAMCRSWEALARCQRAETGLGCQ 266
            +    D   +PFS+H ++++    G  L    W  P   C   EA++ C R     G  
Sbjct: 96  DI---DDERIAPFSLHCMVRSVLRPGEDLRPVYWT-PSQGC---EAISGCVRRAIHRGAL 148

Query: 267 SLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLG-LEKVN 325
              + + V         G A  +   + +RH      G A     L+LVP+  G   ++ 
Sbjct: 149 HSQLRVVV---------GAAGAIPKHEVNRHLE--DSGNA-----LILVPVRCGTTRRMT 192

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-QEESAIYLDPHDVQPVINIGKDDLEA 384
            +   +L      P  +G+VGG PG   YI+G   +E  +YLDPH +     +     E+
Sbjct: 193 QKMFLSLEHLLLTPMCVGMVGGVPGRCYYIIGTGGQELLLYLDPHCMTQEALVS---CES 249

Query: 385 DTSTYHSDVIRH---IHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEE 431
           DT        RH   +  D +D S  +GF+    + ++D   +   L+ +
Sbjct: 250 DTVGVVRPTPRHLLCVPYDRVDTSFFLGFFVDSFELWEDLQKKIEGLSRQ 299


>gi|66359342|ref|XP_626849.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
 gi|46228139|gb|EAK89038.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
          Length = 348

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 94/347 (27%), Positives = 146/347 (42%), Gaps = 68/347 (19%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSK------------ITSDVGWGCMLRSSQMLVAQALLF 187
           F ++F   IL +YR  F  I  ++            I SDVGWGCM R +QM +A  +  
Sbjct: 44  FLKEFHDIILFTYRNEFKNIIITRNTVQLTKNYSKNINSDVGWGCMYRVTQMSIAHGIC- 102

Query: 188 HRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKA-YGLAAGSWVGPYAM 246
                 + K      + E  +IL+ F D+E++ FSIHN++  G + +G+   SW+GP   
Sbjct: 103 -----QFMKRFLGNLNIE--KILNNFQDNESAKFSIHNMVNIGLSEFGIDPTSWIGPTTS 155

Query: 247 CRSWEALARCQRAETGLGCQSLPMA-IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQ 305
                 L    R+       ++ +A I  V G           +  D A +H   FS+  
Sbjct: 156 SMIANKLINDNRSIIS----NIQIASITYVEG----------TIYRDQAVKH---FSEVG 198

Query: 306 ADWTPILLLVPLVLGLEKVNPR-YIPTLRLTFTFPQSLGIVGGKPGAS--TYIVGVQEES 362
           +D    + L  + LG  K N   Y  T+       Q + I+GG   +S    IV      
Sbjct: 199 SDSCTFVWLC-MKLGTSKFNINSYKKTVISMSNVSQFICIMGGNNYSSGALLIVAFSNSF 257

Query: 363 AIYLDPH-DVQPVI---NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
              LDPH  V P     N  +DD      T        I+   ++ SL++ + CR+ +DF
Sbjct: 258 LYCLDPHIKVLPSFSDKNFIRDDFIQKVPT-------RIYWGELNSSLSMVYICRNLEDF 310

Query: 419 DDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLGETGGVPEDDS 465
           DD C+  +++      + LF V       +N+ D   E   + E DS
Sbjct: 311 DDLCSNLTRI-----NSDLFEV-------INNCDF--EVKSINELDS 343


>gi|398021304|ref|XP_003863815.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
 gi|322502048|emb|CBZ37132.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
          Length = 388

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 88/360 (24%), Positives = 145/360 (40%), Gaps = 52/360 (14%)

Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
           +G  EF +  + ++L  SYR  F P+ +   T+D  WGC++R++QMLV   LL +     
Sbjct: 54  DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGA 112

Query: 194 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
           +  P     + E  E I  LF D  ++P  IH          +   S + P         
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161

Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CID----DASRHCSVFSKGQAD 307
                  E G+   +  +A +   GD        P   C +    D     +  S+GQ  
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGD-------VPFTFCCESRNIDEPAVMAKLSEGQH- 207

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
              ++L++P+VLG+  ++ +Y   +          GI GG   AS Y+ G Q  S  ++D
Sbjct: 208 ---VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMD 264

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIR-HIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
           PH +Q       +   +D +    +  R  +     DP + +GFY    +D+  F A   
Sbjct: 265 PHYIQ-------NAYTSDRTVGTLEGARGELSARRFDPCMVLGFYLHTLEDYRVF-AEEL 316

Query: 427 KLAEESNGAPLFTVTQTHKKPVNHSD------VLGETGGVPEDDSLGVMSMND-AVGNAH 479
            +A      PL +  Q  ++    SD         E G +P ++    +S N  A G  H
Sbjct: 317 AVANSLVAFPLISFGQRPREGTTPSDNGVVSVAESEEGIMPHENEKSQLSPNPLAAGGGH 376


>gi|440301471|gb|ELP93857.1| hypothetical protein EIN_176840 [Entamoeba invadens IP1]
          Length = 362

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 82/346 (23%), Positives = 146/346 (42%), Gaps = 67/346 (19%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQ-----DFSSRILISYRKGFDPIGDSKITSD 167
           ++LLG+ +K    +        + L +++        S+ + ++YR G++ + +S + +D
Sbjct: 39  LFLLGIEYKTTPLKKQAQELPQSSLLQYSSMAAYVRMSNLLWMTYRSGYEKLPNSSLNTD 98

Query: 168 VGWGCMLRSSQMLVAQAL--LFHRLGRPWRKPLQKPFDREYVEILHLFGD--SETSPFSI 223
           VGWGC +R+ QM+++ A+  L ++           P   E + ++  F D   +T+P SI
Sbjct: 99  VGWGCTIRAVQMMISNAMQTLVYKHDLTSSTTPYIPKQNEILNVVIPFVDFFEQTTPLSI 158

Query: 224 HNLLQ---------AGKAYGLAAGSWVGPYA-MCRSWEALA-RCQRAETGLGCQSLPMAI 272
           H++ +         +G  Y LA       Y+ +  SW+  A RC  A       S+P+  
Sbjct: 159 HHVYESRFVVEQNKSGVNY-LAPTIVAKAYSDLVNSWKMCALRCVMASNT----SIPL-- 211

Query: 273 YVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTL 332
                                    C +    +  + P L+ +P+++  + V  R    L
Sbjct: 212 -------------------------CDI---KKEPFKPTLVFLPIIMD-QLVKSR----L 238

Query: 333 RLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVI-NIGKDDLEAD---TST 388
           +  + F    GIV G    + YI G      ++LDPH VQP   +  K DL++      T
Sbjct: 239 QQIYKFNMFAGIVSGIGDRAVYIFGFHVMRCLFLDPHTVQPAAESFTKIDLKSYAPINPT 298

Query: 389 YHSDVIRHIHLDSIDPSLAIGFYCR---DKDDFDDFCARASKLAEE 431
            +   I  I LD ID     GF  +   + D F+ FC     ++ E
Sbjct: 299 LNRFAIHSIELDKIDQFCTFGFLIKSLEEVDAFEKFCTETFDISHE 344


>gi|407852207|gb|EKG05835.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
           Clan CA, family C54, putative [Trypanosoma cruzi]
          Length = 328

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 69/285 (24%), Positives = 121/285 (42%), Gaps = 43/285 (15%)

Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
             NN     N   +   L++YR  F P+  S +TSD GWGC++RSSQML+A AL      
Sbjct: 28  VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81

Query: 192 RPWRKPLQKPFDREYVEILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 248
             WR          + ++     D+E ++PFS+H +++A   KA       W        
Sbjct: 82  --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127

Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 306
                          GC+++   +     +   +R   P + +   S+ C +  +     
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170

Query: 307 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           ++  +L+L P+  G   ++      +L         +G+VGG P  S YI+G   +  +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMKFFSLEHLLHSSTCIGVVGGVPQRSYYILGTSGQRLLY 230

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
           LDPH +     +     +A   T  + +++ +  D +D S  +GF
Sbjct: 231 LDPHCMTQEALVSSHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275


>gi|71407017|ref|XP_806004.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
           Brener]
 gi|70869620|gb|EAN84153.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
 gi|111154177|gb|ABH07410.1| autophagin-1 [Trypanosoma cruzi]
          Length = 328

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 74/291 (25%), Positives = 124/291 (42%), Gaps = 48/291 (16%)

Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLF 187
           LG  A NN     N   +   L++YR  F P+  S +TSD GWGC++RSSQML+A AL  
Sbjct: 25  LGRVA-NNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL-- 81

Query: 188 HRLGRPWRKPLQKPFDREYVEILHLFGDSET---SPFSIHNLLQA--GKAYGLAAGSWVG 242
                 WR      +      + H F D +T   +PFS+H +++A   KA       W  
Sbjct: 82  ------WR------YSANDCRLDH-FRDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWT- 127

Query: 243 PYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFS 302
                                GC+++   +     +   +R   P + +   S+ C +  
Sbjct: 128 ------------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAR 164

Query: 303 K--GQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
           +     ++  +L+L P+  G   ++      +L         +G+VGG P  S YI+G  
Sbjct: 165 EICSNLEFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTS 224

Query: 360 EESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
            +  +YLDPH +     +     +A   T  + +++ +  D +D S  +GF
Sbjct: 225 GQRLLYLDPHCMTQEALVSSHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275


>gi|71425372|ref|XP_813094.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
           Brener]
 gi|70877946|gb|EAN91243.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 328

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 69/285 (24%), Positives = 121/285 (42%), Gaps = 43/285 (15%)

Query: 132 AGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
             NN     N   +   L++YR  F P+  S +TSD GWGC++RSSQML+A AL      
Sbjct: 28  VANNDKELVNILRNGFFLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL------ 81

Query: 192 RPWRKPLQKPFDREYVEILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCR 248
             WR          + ++     D+E ++PFS+H +++A   KA       W        
Sbjct: 82  --WRYSANDCRLDHFCDM-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT------- 127

Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQA 306
                          GC+++   +     +   +R   P + +   S+ C +  +     
Sbjct: 128 ------------PSQGCEAIRCCV-----NNAVDRKLIPPIRVVVCSQGCLLAREICSNL 170

Query: 307 DWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365
           ++  +L+L P+  G   ++      +L         +G+VGG P  S YI+G   +  +Y
Sbjct: 171 EFGTVLILAPMRCGASRRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLY 230

Query: 366 LDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
           LDPH +     +     +A   T  + +++ +  D +D S  +GF
Sbjct: 231 LDPHCMTQEALVSGHAEKAGVVTVTASLVKSVRWDCVDTSCFLGF 275


>gi|146097214|ref|XP_001468076.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
 gi|134072442|emb|CAM71152.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
          Length = 388

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 88/360 (24%), Positives = 145/360 (40%), Gaps = 52/360 (14%)

Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
           +G  EF +  + ++L  SYR  F P+ +   T+D  WGC++R++QMLV   LL +     
Sbjct: 54  DGTTEFVKAATKKLLYFSYRNCFPPLPNGS-TTDTRWGCLVRTTQMLVGTCLLRYHCQGT 112

Query: 194 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
           +  P     + E  E I  LF D  ++P  IH          +   S + P         
Sbjct: 113 YVLPEAD--NAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161

Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CID----DASRHCSVFSKGQAD 307
                  E G+   +  +A +   GD        P   C +    D     +  S+GQ  
Sbjct: 162 ------TEAGMAIAAALIAFHAQGGD-------VPFTFCCESRNIDEPAVMAKLSEGQH- 207

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
              ++L++P+VLG+  ++ +Y   +          GI GG   AS Y+ G Q  S  ++D
Sbjct: 208 ---VILIIPVVLGIAPMSDQYERMMLKILDMKACCGIAGGLKRASLYMFGHQGRSVFFMD 264

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIR-HIHLDSIDPSLAIGFYCRDKDDFDDFCARAS 426
           PH +Q       +   +D +    +  R  +     DP + +GFY    +D+  F A   
Sbjct: 265 PHYIQ-------NAYTSDKTVGTLEGARGELSARRFDPCMVLGFYIHTLEDYRVF-AEEL 316

Query: 427 KLAEESNGAPLFTVTQTHKKPVNHSD------VLGETGGVPEDDSLGVMSMND-AVGNAH 479
            +A      PL +  Q  ++    SD         E G +P ++    +S N  A G  H
Sbjct: 317 VVANSLVAFPLISFGQRPREGTTPSDNGVVSVAESEEGIMPHENEKSQLSPNPLAAGGGH 376


>gi|148707987|gb|EDL39934.1| autophagy-related 4B (yeast), isoform CRA_c [Mus musculus]
          Length = 128

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 46/116 (39%), Positives = 62/116 (53%), Gaps = 15/116 (12%)

Query: 113 IWLLGVCHKI--AQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGW 170
           +W+LG  + I   +DE L D A             SR+  +YR+ F  IG +  TSD GW
Sbjct: 23  VWILGRKYSIFTEKDEILSDVA-------------SRLWFTYRRNFPAIGGTGPTSDTGW 69

Query: 171 GCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           GCMLR  QM+ AQAL+   LGR WR   +K     Y  +L+ F D + S +SIH +
Sbjct: 70  GCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFNVLNAFLDRKDSYYSIHQI 125


>gi|118378680|ref|XP_001022514.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89304281|gb|EAS02269.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 371

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 77/306 (25%), Positives = 131/306 (42%), Gaps = 44/306 (14%)

Query: 142 QDFSSRILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
           ++ SS + +SY+K         + IT+D GWGC LR+SQM++AQ L  H     + K +Q
Sbjct: 52  EELSSLVFLSYKKNMKEFQYLSTTITTDNGWGCSLRTSQMMLAQGLKRH----LYEKRVQ 107

Query: 200 KPF--DREYVEILHL---FGDSET------SPFSIHNLLQAGKAYGLAAGSWVGPYAMCR 248
                D+  ++  HL   F +S +      SPF  H+LL   +A  L        Y   +
Sbjct: 108 SFIYNDKTKLDFQHLIMMFAESNSLENMDQSPFGFHSLL--TQAINLFQVPLKQQYTPVQ 165

Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
             +AL +          Q L  ++ +V+           V+  +D  +    + K     
Sbjct: 166 GIKALKQ------QFKQQKLVKSLKIVT-------SSTGVIFQEDIRQKMKNWEKS---- 208

Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
             +LL++   LG  K+N  Y+  ++        +G +GG    S ++VG   +  + LDP
Sbjct: 209 --LLLILHFKLGTGKLNQIYVEQIKSLMDLEYFVGAIGGIKNKSLFMVGYMNDQFLSLDP 266

Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS---IDPSLAIGFYCRDKDDFDDFCARA 425
           H  Q   N  KD L  +     S   + +  DS    +   +I FY R +  ++ F  + 
Sbjct: 267 HVQQ---NACKDPLNLNDEEMSSFFPKKVRADSCVKYEGDFSISFYIRSEKQYNIFLQKI 323

Query: 426 SKLAEE 431
           S L ++
Sbjct: 324 SNLNKQ 329


>gi|407417199|gb|EKF38000.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
          Length = 328

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 67/269 (24%), Positives = 116/269 (43%), Gaps = 43/269 (15%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
            L++YR  F P+  S +TSD GWGC++RSSQML+A AL        WR          + 
Sbjct: 44  FLLTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSANDCRLDHFR 95

Query: 208 EILHLFGDSE-TSPFSIHNLLQA--GKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
           +I     D+E ++PFS+H +++A   KA       W                       G
Sbjct: 96  DI-----DTEDSTPFSLHKMVRAVMKKADVFRPEYWT-------------------PSQG 131

Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSK--GQADWTPILLLVPLVLGL- 321
           C+++   +     +   +R   P + +   S+ C +  +     ++  +L+L P+  G  
Sbjct: 132 CEAIRCCV-----NNAVDRRLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCGAS 186

Query: 322 EKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD 381
            ++      +L         +G+VGG P  S YI+G   +  +YLDPH +     +    
Sbjct: 187 RRMTQMMFFSLEHLLHSSACIGVVGGVPQRSYYILGTSGQRLLYLDPHCMTQEALVSSHA 246

Query: 382 LEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
             A   T  + +++ +  D +D S  +GF
Sbjct: 247 ERAGVVTVTASLVKSVRWDCVDTSCFLGF 275


>gi|297601024|ref|NP_001050279.2| Os03g0391000 [Oryza sativa Japonica Group]
 gi|255674556|dbj|BAF12193.2| Os03g0391000, partial [Oryza sativa Japonica Group]
          Length = 81

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 35/48 (72%), Positives = 41/48 (85%)

Query: 327 RYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
           RYIP L+ T TFPQSLGI+GGKPG STYI GVQ++ A+YLDPH+VQ V
Sbjct: 10  RYIPLLKETLTFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLV 57


>gi|322701885|gb|EFY93633.1| cysteine protease atg4 [Metarhizium acridum CQMa 102]
          Length = 255

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 52/140 (37%), Positives = 67/140 (47%), Gaps = 27/140 (19%)

Query: 133 GNNGLAEFNQDFSSRILISYRKGF-------DP----------------IGDSKITSDVG 169
           G    A F  DF+SR  ++YR  F       DP                +  S  TSD G
Sbjct: 116 GTGWPAAFLDDFASRFWMTYRSNFELIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSG 175

Query: 170 WGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQA 229
           WGCM+RS Q L+A AL    LGR WR+ +    DRE   +L LF D   +P+S+HN ++ 
Sbjct: 176 WGCMIRSGQSLLANALAVLDLGRDWRRGMLP--DRER-RLLALFADDPRAPYSVHNFVRH 232

Query: 230 GKAY-GLAAGSWVGPYAMCR 248
           G+ Y     G W GP A  R
Sbjct: 233 GEKYCSKYPGEWFGPSATAR 252


>gi|440300801|gb|ELP93248.1| hypothetical protein EIN_056230 [Entamoeba invadens IP1]
          Length = 321

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 81/323 (25%), Positives = 120/323 (37%), Gaps = 91/323 (28%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           I+ L   H    + +  DAA               I I+YR+ +  +G + +TSD GWGC
Sbjct: 38  IFGLSYTHDTPSELSFADAA---------HRIHDLITITYRQKYATLGHTYLTSDAGWGC 88

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH---------LFGDSETSPFSI 223
            +RS QML+  +++ +         L K F  EY    H         L  D E+S  SI
Sbjct: 89  AIRSVQMLLVNSIVVY---------LDKSFHPEYTSHDHIAIKNNAKQLVFDKESSVLSI 139

Query: 224 HNL-LQAGKAYGLAAGSWVGPYAMCRS--------WEALARCQRAETGLGCQSLPMAIYV 274
           HN+ +Q         G+   P + C +        WE     +R    L C         
Sbjct: 140 HNIYIQDAIIKHNPTGTNFLPPSTCATAVADLYNFWE-----KRTFDVLMCTEY------ 188

Query: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334
                           I + ++             P LL +P ++   + N      ++ 
Sbjct: 189 ----------------IPEVTQ-------------PTLLFIPRIVTKSERN-----FIQT 214

Query: 335 TFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVI 394
           T   PQS G V G   A+ Y  GVQE+   +LDPH VQ    +G          Y +  I
Sbjct: 215 TSFLPQSRGFVAGIGDAAIYCFGVQEKRVFFLDPHFVQDASEVG----------YFNRPI 264

Query: 395 RHIHLDSIDPSLAIGFYCRDKDD 417
              + D +D S   G  C +K D
Sbjct: 265 FEANFDELDNSFVFGMMCENKSD 287


>gi|425784144|gb|EKV21938.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
           digitatum Pd1]
          Length = 208

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 67/145 (46%), Gaps = 30/145 (20%)

Query: 128 LGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSK-----------------------I 164
           L D A  N    F  DF SRI I+YR  F PI  +K                        
Sbjct: 59  LNDTAWPNA---FVSDFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGF 115

Query: 165 TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIH 224
           TSD GWGCM+RS Q L+A A     LGR WR+  +   + E  +++ +F D   +PFSIH
Sbjct: 116 TSDTGWGCMIRSGQSLLANAFSVLLLGRDWRRGEK---EEEESKLISMFADHPEAPFSIH 172

Query: 225 NLLQAG-KAYGLAAGSWVGPYAMCR 248
             +  G ++ G   G W GP A  +
Sbjct: 173 KFVNRGAESCGKYPGEWFGPSATAK 197


>gi|225554849|gb|EEH03143.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 425

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 55/172 (31%), Positives = 75/172 (43%), Gaps = 27/172 (15%)

Query: 101 PSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF---- 156
           P+R+  S++     LL           LG     +    F  DF S+I ++YR  F    
Sbjct: 85  PTRSSDSATKPQRHLLPFAIHRGSTSPLGQQGQQHWPDAFLDDFESKIWLTYRSNFPLIP 144

Query: 157 ---DP----------------IGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP 197
              DP                +     T+D GWGCM+RS Q L+A AL    LGR WR+ 
Sbjct: 145 KSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQSLLANALAILSLGRDWRRG 204

Query: 198 LQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAYGLAAGSWVGPYAMCR 248
            +    +E  ++L LF D   +PFSIH  ++ G  A G   G W GP A  R
Sbjct: 205 TKI---KEESKLLSLFADDPKAPFSIHRFVEHGASACGKYPGEWFGPSATAR 253



 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 54/108 (50%), Gaps = 6/108 (5%)

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD-----LEADTSTYHSDVIRHIHL 399
           + G+P +S Y +G Q     YLDPH  +P + + +D         + +TYH+  +R +H+
Sbjct: 255 IHGRPSSSHYFIGAQGSHFFYLDPHHTRPAL-VYRDAGDRPYTTEELNTYHTRRLRRLHI 313

Query: 400 DSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP 447
             +DPS+ IGF  RD+DD++ +       A    G  +  V    K P
Sbjct: 314 KDMDPSMLIGFLIRDEDDWNSWKRSVHNGAMIGTGKAIIHVFDKEKSP 361


>gi|238594668|ref|XP_002393548.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
 gi|215461192|gb|EEB94478.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
          Length = 142

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 49/144 (34%), Positives = 65/144 (45%), Gaps = 40/144 (27%)

Query: 126 EALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKI--------------------- 164
           + +G  +G N   EF  DF+S++ ++YR  F PI D+ +                     
Sbjct: 3   DMVGTTSGANWPPEFTADFTSKVWLTYRSHFTPIRDTNLADLPLPSIFWKKWGWGLPGLG 62

Query: 165 -----TSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETS 219
                TSD GWGCMLR+ Q L+A AL+F  LGR WR+P   P   E             S
Sbjct: 63  GERGWTSDSGWGCMLRTGQSLLANALVFMWLGREWRRP-PAPMPTE-------------S 108

Query: 220 PFSIHNLLQAGKAYGLAAGSWVGP 243
             S+H +  AGK  G   G W GP
Sbjct: 109 YASVHRMALAGKELGKDVGQWFGP 132


>gi|157874465|ref|XP_001685715.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
 gi|68128787|emb|CAJ08920.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
          Length = 388

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 83/323 (25%), Positives = 135/323 (41%), Gaps = 52/323 (16%)

Query: 135 NGLAEFNQDFSSRIL-ISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRP 193
           +G  EF +  + ++L  SYR  F P+  S  T+D  WGC++R++QMLV   LL +     
Sbjct: 54  DGTTEFVKVATKKLLYFSYRNCFPPL-PSGSTTDTHWGCLVRTTQMLVGTCLLRYHCKGA 112

Query: 194 WRKPLQKPFDREYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEA 252
           +  P  +  + E  E I  LF D  ++P  IH          +   S + P         
Sbjct: 113 YVLP--EADNAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSP--------- 161

Query: 253 LARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVV-CIDDASRHC---SVFSKGQADW 308
                  E G+   +  +A     GD        P   C +  SRH    +V +K   + 
Sbjct: 162 ------TEAGMAIAAALIAFRAQGGD-------VPFTFCCE--SRHIDEPAVMAK-LLEG 205

Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDP 368
             ++L++P+VLG+  ++ +Y   +          GI GG   AS Y+ G Q  S  ++DP
Sbjct: 206 QHVVLIIPVVLGIAPMSDQYELVMLKILDVKACCGIAGGFKQASLYMFGHQGRSVFFMDP 265

Query: 369 HDVQPVINIGKDDLEADTSTYHSDVIR----HIHLDSIDPSLAIGFYCRDKDDFDDFCAR 424
           H VQ           A TS+     +      +     DP + +GFY    +D+  F   
Sbjct: 266 HYVQ----------NAYTSSRTVGTLEGSRGELRARRFDPCMVLGFYLHTPEDYRVF--- 312

Query: 425 ASKLAEESNGAPLFTVTQTHKKP 447
           A +LA  +N   +F +    ++P
Sbjct: 313 AEELA-VANSLVVFPLISFGRRP 334


>gi|384496645|gb|EIE87136.1| hypothetical protein RO3G_11847 [Rhizopus delemar RA 99-880]
          Length = 224

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 5/101 (4%)

Query: 126 EALGDAAGNNGLA-----EFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQML 180
           E + +   NN +      +F  DF+SR+ ++YR  + PI  S   +D+GWGCMLRS Q L
Sbjct: 120 EEISEEEDNNNMYLRWPLDFYDDFTSRLWMTYRHNYPPIRPSNHKTDIGWGCMLRSGQSL 179

Query: 181 VAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPF 221
           +A  L+ H LGR WR+  Q    R+ + I  L    +  PF
Sbjct: 180 LANTLIIHFLGRDWRRQTQNQTTRKELCIGFLMSYHQEHPF 220


>gi|167391747|ref|XP_001739914.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165896205|gb|EDR23684.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 325

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 68/314 (21%), Positives = 129/314 (41%), Gaps = 67/314 (21%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +++LG C+    +E L     N+     N      I+ +YR+ +  +G++ ++SD GWGC
Sbjct: 36  VYILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSCLGNTYLSSDAGWGC 91

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFD-------REYVEILHLFGDSETSPFSIHN 225
            +R++QM++   L+       ++  +Q+  D       +  ++   L  D  +S  SIHN
Sbjct: 92  AIRATQMMIVNTLVI------FKDQMQQIIDYNSFEHQQNKLQAKELIYDKISSLLSIHN 145

Query: 226 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           +   +  K +     +++ P   C +  +L +                       E  ++
Sbjct: 146 IYIQEIIKVHNPTGTNFLPPSICCIAISSLLQ-----------------------EWDKK 182

Query: 284 GGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLG 343
               + C+D    +CS          P L L+P ++   +        +  + T  QS G
Sbjct: 183 LFNCITCLDHIP-NCSY---------PTLYLIPQIITFTEHQ-----LILDSLTLSQSRG 227

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
            VGG   ++ ++ G Q  +  +LDPH VQ   + G          Y +     I L  I 
Sbjct: 228 FVGGIGESAIFVFGYQGTTLFFLDPHYVQNAGDFG----------YFNPPTYQIDLSLIS 277

Query: 404 PSLAIGFYCRDKDD 417
           PS+   F C +++D
Sbjct: 278 PSIVFAFMCYNEND 291


>gi|119604523|gb|EAW84117.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_b
           [Homo sapiens]
          Length = 228

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 12/96 (12%)

Query: 102 SRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGD 161
           SRT  S  +S    + +C +  + E  GD      +  F +DF SR+ ++YR+ F P+  
Sbjct: 84  SRTSFSKISS----IHLCGRRYRFEGEGD------IQRFQRDFVSRLWLTYRRDFPPLPG 133

Query: 162 SKITSDVGWGCMLRSSQMLVAQALLFHRL--GRPWR 195
             +TSD GWGCMLRS QM++AQ LL H L  G+PWR
Sbjct: 134 GCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGKPWR 169


>gi|145507452|ref|XP_001439681.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406876|emb|CAK72284.1| unnamed protein product [Paramecium tetraurelia]
          Length = 312

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 70/298 (23%), Positives = 125/298 (41%), Gaps = 59/298 (19%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
           FNQ   + I   YR      G  K  SD GWGC++R  QM++A AL+        R+   
Sbjct: 49  FNQKKDTLIWFCYRANIQFEG--KAISDQGWGCLVRVGQMMLANALM--------RECKI 98

Query: 200 KPFDREYVEILHLFGDSE----TSPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 253
              ++    I+HLF D++     +PFSI  +++ A     +  G W  GP  M       
Sbjct: 99  LAINKTKAMIIHLFDDNQEYSTIAPFSIQQIIKRASINLNMKIGDWYTGPKIM------- 151

Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWT---P 310
                                 S  ED  +    +  I+  +       + Q D +   P
Sbjct: 152 ----------------------SVIEDLNKNNMNIKQINLVNFLEQCVLESQIDLSFKKP 189

Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
            LL++  ++G + +    I  L+      Q  G + GK   + +++G Q+ +AI++DPH 
Sbjct: 190 HLLIIHAIIGDKSLGQLEIQNLQSHMQISQFAGAIIGKNNKAFFLIGFQKNNAIFMDPHY 249

Query: 371 VQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
           VQ      K ++E +        ++   L  ++ ++A+ FY  +  ++ +F  + +KL
Sbjct: 250 VQES---NKIEMECN--------LKCQPLKQLNGTIALAFYISNYMEYLEFKKQVNKL 296


>gi|213514936|ref|NP_001135074.1| Cysteine protease ATG4A [Salmo salar]
 gi|209738482|gb|ACI70110.1| Cysteine protease ATG4A [Salmo salar]
          Length = 102

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 35/83 (42%), Positives = 49/83 (59%), Gaps = 11/83 (13%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           +W+LG C+ +  ++            E   D  SR+  +YRK F PIG +  +SD GWGC
Sbjct: 29  VWVLGECYNVKTEKT-----------ELLSDVHSRLWFTYRKKFSPIGGTGPSSDTGWGC 77

Query: 173 MLRSSQMLVAQALLFHRLGRPWR 195
           MLR  QM++AQAL+  +LGR WR
Sbjct: 78  MLRCGQMILAQALVCSQLGRAWR 100


>gi|403354729|gb|EJY76927.1| hypothetical protein OXYTRI_01553 [Oxytricha trifallax]
          Length = 564

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 83/352 (23%), Positives = 139/352 (39%), Gaps = 73/352 (20%)

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 219
           +T+D  WGC +RS+QM++A AL             Q  F      IL LF D+      S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQ------------QSTFMYPVNSILKLFDDNIRECTES 261

Query: 220 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 254
            FSI N+    LQ G+     YG+++ + +             + +C        +E + 
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGFIVFETIM 321

Query: 255 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---------RHCSV--F 301
           +  CQ        Q L     V++  +  E         DD +         R  +    
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLTFSQMGLGCDRRINYDKL 380

Query: 302 SKGQADWTP---------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
                D  P         +L++V + LGL+K++P Y   +      PQ +G+VGGKP  +
Sbjct: 381 PNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGGKPNKA 440

Query: 353 TYIVG------VQEESAIYLDPHDVQP-VINIGKD-DLEA-DTSTYHSDVIRHIHLDSID 403
            Y  G        +   ++LDPH VQ    N+    DL+  + + +H+   R + +  +D
Sbjct: 441 FYFFGHIIDQDTNKVKLMFLDPHKVQDYTYNVETSYDLDVKEQAKFHTTEARLLKIKELD 500

Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
             L  GF  +   DF+ F        +E     +F++ Q   +  N+S +  
Sbjct: 501 TCLGFGFLIKSLQDFNQFKTLLESNIQEDLDHSIFSLYQHESELDNNSQMFS 552


>gi|403370248|gb|EJY84987.1| hypothetical protein OXYTRI_17161 [Oxytricha trifallax]
          Length = 564

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 81/352 (23%), Positives = 136/352 (38%), Gaps = 73/352 (20%)

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS----ETS 219
           +T+D  WGC +RS+QM++A AL             Q  F      IL LF D+      S
Sbjct: 214 LTTDCNWGCTIRSAQMMIANALQ------------QSTFMYPVNSILKLFDDNIRECTES 261

Query: 220 PFSIHNL----LQAGKA----YGLAAGSWV-----------GPYAMCR------SWEALA 254
            FSI N+    LQ G+     YG+++ + +             + +C        +E + 
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGYIVFETIM 321

Query: 255 R--CQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDAS---------RHCSV--F 301
           +  CQ        Q L     V++  +  E         DD +         R  +    
Sbjct: 322 KKGCQLVNEKQD-QQLQKDSIVLNQKDQSEYDPQNRENYDDLTFSQMGLGCDRRINYDKL 380

Query: 302 SKGQADWTP---------ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
                D  P         +L++V + LGL+K++P Y   +      PQ +G+VGGKP  +
Sbjct: 381 PNMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKYMQMPQFVGLVGGKPNKA 440

Query: 353 TYIVG------VQEESAIYLDPHDVQPV---INIGKDDLEADTSTYHSDVIRHIHLDSID 403
            Y  G        +   ++LDPH VQ     +    D    + + +H+   R + +  +D
Sbjct: 441 FYFFGHIIDLDTNKVKLMFLDPHKVQDYTYDVETSYDLDVKEQAKFHTTEARLLKIKELD 500

Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
             L  GF  +   DF+ F        +E     +F++ Q   +  N+S +  
Sbjct: 501 TCLGFGFLIKSLQDFNQFKTLLESNIQEDLDHSIFSLYQHESELDNNSQMFS 552


>gi|154343631|ref|XP_001567761.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134065093|emb|CAM43207.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 398

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 73/303 (24%), Positives = 121/303 (39%), Gaps = 42/303 (13%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
           +  SYR  F P+ +   T+D  WGC+LR++QML+   LL +     +  P     + +  
Sbjct: 74  LYFSYRSCFPPLPNGS-TTDTRWGCLLRTTQMLIGTCLLRYHCKGAYVLPEADNAELK-A 131

Query: 208 EILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQS 267
            I  LF D  ++P  IH          +   S + P                E G+    
Sbjct: 132 NISRLFMDVPSAPLGIHRAEDEAHKNCVKYASMLSP---------------TEAGMA--- 173

Query: 268 LPMAIYVVSGDEDGERGGAPVV--CIDDASRHCSVFSK---GQADWTPILLLVPLVLGLE 322
             MA  +++   +G  G  P    C +      +V +K   GQ     ++L++P+VLGL 
Sbjct: 174 --MAAALIACHAEG--GDVPFTFSCENRNIDEPAVVAKLLEGQH----VILIIPVVLGLA 225

Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDL 382
            ++ +Y   +          GI GG   AS Y+ G Q     ++DPH +Q       D  
Sbjct: 226 PLSDKYESMMLKILDMKACCGIAGGFKQASFYMFGHQGRKVFFMDPHYIQKAYT--SDKT 283

Query: 383 EADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
                    D+         DP + +GFY    +D+  F   A +LA  ++      ++ 
Sbjct: 284 AGTLYGARGDLTAR----KFDPCMVLGFYLHTLEDYRVF---AEELAVVNSLVTFPLISW 336

Query: 443 THK 445
           +HK
Sbjct: 337 SHK 339


>gi|302915349|ref|XP_003051485.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256732424|gb|EEU45772.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 355

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 69/228 (30%), Positives = 103/228 (45%), Gaps = 36/228 (15%)

Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRSS- 177
           +A DE   D   N    +F  DF SRI ++YR  F+ I  S   + TS +     L+S  
Sbjct: 99  LAYDEPTKD---NGWPPQFMADFESRIWMTYRSEFEAIPRSTNPQATSSLSLSMRLKSQL 155

Query: 178 ---QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
                  + +++  RLGR WR+  Q P   E  EI+ LF D   +P+S+H+ ++ G  A 
Sbjct: 156 GDQSPFSSDSMI--RLGRDWRR-GQSP--HEEREIIKLFADHPNAPYSLHSFVRHGASAC 210

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
           G   G W GP A  R  +ALA    +          + +Y          G  P V  D+
Sbjct: 211 GKYPGEWFGPSATARCIQALANSHESS---------LRVYST--------GDGPDVYEDE 253

Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
             +      +G+A + P L+LV   LG++K+ P Y   L  +   PQS
Sbjct: 254 FMKIAK--PEGEA-FHPTLILVGTRLGIDKITPVYWEALIASLQMPQS 298


>gi|14043289|gb|AAH07639.1| ATG4D protein [Homo sapiens]
 gi|16877152|gb|AAH16845.1| ATG4D protein [Homo sapiens]
 gi|119604522|gb|EAW84116.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|325464017|gb|ADZ15779.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
           construct]
          Length = 141

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 62/110 (56%), Gaps = 7/110 (6%)

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y +G Q++  +YLDPH  QP +++ + D   +  ++H    R +    +DP
Sbjct: 1   MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--SFHCTSPRKMAFAKMDP 58

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKKPVNHS 451
           S  +GFY  D+ +F+  C+  +++   S+     P+FT+ + H +  +HS
Sbjct: 59  SCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQ--DHS 106


>gi|324519641|gb|ADY47439.1| Cysteine protease ATG4C, partial [Ascaris suum]
          Length = 282

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 72/250 (28%), Positives = 114/250 (45%), Gaps = 33/250 (13%)

Query: 20  DTPNRSLASVG-SELGSSESKSSKGSLLSSLFNSAFSVFETYS---ESSASEKKAVHNKS 75
           D  +R   ++G  E+  + SK S G+LLSS  N+  S     S    S  S         
Sbjct: 12  DGSDREQLTIGDCEVCDTTSKYSVGALLSSAANATSSKISRASINLRSLLSGSATKKTND 71

Query: 76  NGWTAAVKRLVTAGSMRRIHERV---LGPSRTGISSST----SDIWLLGVCHKIAQ---- 124
           +  + +   +  + S+R+  + V       R  IS S     + +WLLG  +  ++    
Sbjct: 72  DDVSTSESDIAISSSVRQKFDNVWFSFVYGRWRISRSKYKKKAPLWLLGEFYFTSRPDED 131

Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQA 184
           DE +  A        F  D+ SRI ++YR    P+  S  T+D GWGC LR+ QM++AQA
Sbjct: 132 DEVVFRA--------FAIDYYSRIWLTYRTELSPLPGSSKTTDCGWGCTLRTCQMMLAQA 183

Query: 185 LLFHRLGRPWRKPLQKPFDRE-----YVEILHLFGDSETSPFSIHNLLQAGKAYGL--AA 237
           L+   LGR WR    +  +R      + +I+ LFGD   +   ++ L++  K      A 
Sbjct: 184 LVVLHLGREWRFWGDEEANRYRCGFGHYDIVSLFGDHLDADLGLYRLMKIAKERNEHDAV 243

Query: 238 GSWVGPYAMC 247
           G+W   Y+ C
Sbjct: 244 GNW---YSAC 250


>gi|395756856|ref|XP_002834509.2| PREDICTED: cysteine protease ATG4D-like [Pongo abelii]
          Length = 141

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 59/105 (56%), Gaps = 5/105 (4%)

Query: 345 VGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDP 404
           +GGKP  S Y +G Q++  +YLDPH  QP +++ + +   +  ++H    R +    +DP
Sbjct: 1   MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE--SFHCTSPRKMAFAKMDP 58

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGA---PLFTVTQTHKK 446
           S  +GFY  D+ +F+  C+  +++   S+     P+FT+ + H +
Sbjct: 59  SCTVGFYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQ 103


>gi|336259147|ref|XP_003344378.1| hypothetical protein SMAC_08321 [Sordaria macrospora k-hell]
          Length = 429

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 58/114 (50%), Gaps = 26/114 (22%)

Query: 140 FNQDFSSRILISYRKGF-------DPIGD----------------SKITSDVGWGCMLRS 176
           F  DF SRI ++YR  F       DP                   +  +SD GWGCM+RS
Sbjct: 180 FLDDFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRS 239

Query: 177 SQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG 230
            Q L+A A+L  RLGR WR+  +   D E  +I+ LF D   +PFS+HN ++ G
Sbjct: 240 GQSLLANAILVARLGREWRR--ETDLDAEK-DIIALFADDPRAPFSLHNFVKYG 290



 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 46/76 (60%), Gaps = 3/76 (3%)

Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDD---LEADTSTYHSDVIRHIHLDSID 403
           G+P +S Y +GVQ +   YLDPH  +P +   +D       +  T H+  +R +H+D +D
Sbjct: 302 GRPSSSHYFIGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELDTCHTRRLRQLHIDDMD 361

Query: 404 PSLAIGFYCRDKDDFD 419
           PS+ IGF  +D+DD+D
Sbjct: 362 PSMLIGFLIKDEDDWD 377


>gi|238595999|ref|XP_002393933.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
 gi|215462138|gb|EEB94863.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
          Length = 158

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 46/68 (67%)

Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
           LGL+ VNP Y  T+++ +TFPQS+GI GG+P +S Y VG Q ++  YLDPH  +P + + 
Sbjct: 1   LGLDGVNPIYYDTIKILYTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHHARPAVPLR 60

Query: 379 KDDLEADT 386
              LE ++
Sbjct: 61  PPTLEPES 68


>gi|403364614|gb|EJY82073.1| hypothetical protein OXYTRI_20407 [Oxytricha trifallax]
          Length = 806

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 60/110 (54%), Gaps = 2/110 (1%)

Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
           +++++ + LGLE +   Y   L+  F+  Q +GI+GGKP  + Y VG Q++  I+LDPH 
Sbjct: 641 LMIIMTIRLGLENIEQDYHKALKACFSLRQCVGILGGKPNFALYFVGYQQDHMIFLDPHY 700

Query: 371 VQPVINIGKD--DLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
           VQ  +   +   D E   +       + I ++S+DP + +GF  ++  D 
Sbjct: 701 VQQALTSDEQLKDQELKDTYQSQRSAKKIKMESLDPCIGVGFLIQNSKDL 750



 Score = 43.1 bits (100), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 13/72 (18%)

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV----EILHLFGDSETS 219
           I SD GWGCM+R  QM++A + L         K LQ+  +   +     IL +  D   +
Sbjct: 393 INSDCGWGCMIRCQQMMLANSFL---------KLLQQNHNFHDILTHDSILSMILDQLDA 443

Query: 220 PFSIHNLLQAGK 231
           PF IH + + G+
Sbjct: 444 PFGIHQITEEGR 455


>gi|440291586|gb|ELP84849.1| hypothetical protein EIN_284050 [Entamoeba invadens IP1]
          Length = 352

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 67/278 (24%), Positives = 117/278 (42%), Gaps = 57/278 (20%)

Query: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE--- 208
           YR  F P+ ++ +TSD GWGC +RS+QMLVA A+          K     FD   V    
Sbjct: 92  YRNNFQPLPNTTLTSDSGWGCTIRSTQMLVANAI---------GKLFTNDFDTGEVTDKM 142

Query: 209 ILHLFGD--SETSPFSIHNLL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLG 264
           ++  F D  S   PFSIHNL   +A     +   S++ P A+  ++  + + + A    G
Sbjct: 143 VIKFFLDFFSVECPFSIHNLFLTKAILQGNINGNSFLPPSAVAAAFVEINK-KLANPKFG 201

Query: 265 CQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKV 324
            + L                          +    V+++      P ++L+P+ +  +  
Sbjct: 202 MEIL------------------------TTTFTFRVYTQ------PTIVLIPISIP-DSF 230

Query: 325 NPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEA 384
           N +    + + F+F    G+VGG    + Y  G+  +  ++LDPH V+   N   +    
Sbjct: 231 NDK----IAVIFSFYLFSGMVGGSGRKAFYFFGIHHDQLLFLDPHTVR---NTVINSCSF 283

Query: 385 DTSTYHSDV--IRHIHLDSIDPSLAIGFYCRDKDDFDD 420
           D   YH  +  ++ +    +D S  + F    + + DD
Sbjct: 284 DPQEYHPIIGDVKALSYSLLDRSAVLAFVVTSQRELDD 321


>gi|323450755|gb|EGB06635.1| hypothetical protein AURANDRAFT_65498 [Aureococcus anophagefferens]
          Length = 426

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 51/169 (30%), Positives = 69/169 (40%), Gaps = 50/169 (29%)

Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGA------------------------------- 351
           ++ PRY   LR     PQS G++GG+P A                               
Sbjct: 234 RLEPRYAEPLRAALRLPQSAGMLGGRPRANRIFNTTSMCASSDQNLQLCFENSTRAIDPS 293

Query: 352 -------STYIVGVQEESA---IY-LDPHDVQPVINIGKDDL---EADTSTYHSDVIRHI 397
                  + +  G+        +Y LDPH VQP + +G D      A  S    D  + +
Sbjct: 294 KSGRPRAALFFPGLAARDGGADVYGLDPHTVQPALAVGDDGALGPGAAASVAPRDA-KKL 352

Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKK 446
             D++DPSLA+ FYC D+DDF DF  RA  L     GAPLF V     +
Sbjct: 353 AADALDPSLALAFYCADRDDFLDFVGRARALP----GAPLFEVVDAAPR 397



 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 54/117 (46%), Gaps = 15/117 (12%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYV 207
           +  +YR GF+ +     T D GWGCMLRS+QML+  AL   R G   R           +
Sbjct: 28  LWFTYRCGFEELAPYGFTDDAGWGCMLRSAQMLLGNAL--TRNGAAPR-----------L 74

Query: 208 EILHLFGDS--ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETG 262
               LF D+  +++PF +HN  + G  Y +  G W GP   C     L   +R   G
Sbjct: 75  ATAALFADAPGDSAPFGLHNFAKCGLRYDVLPGEWYGPGVACHVLRDLVDWRRNAPG 131


>gi|167394648|ref|XP_001741038.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165894548|gb|EDR22516.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 200

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 6/100 (6%)

Query: 148 ILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKP---LQKPFDR 204
           I I+YRK    I +   T+D GWGCM+RS QM++AQ  L   LG  W+     +    + 
Sbjct: 39  IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNWKYENNCMNTERNI 96

Query: 205 EYVE-ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
            +++ I++LFGDS  S FSIH L+      G+  G W GP
Sbjct: 97  FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGP 136


>gi|320588376|gb|EFX00845.1| cysteine protease atg4 [Grosmannia clavigera kw1407]
          Length = 348

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 62/124 (50%), Gaps = 28/124 (22%)

Query: 140 FNQDFSSRILISYRKGFDPI---------------------GD-SKITSDVGWGCMLRSS 177
           F  DF SR  ++YR GF+PI                     GD S  +SD GWGCM+RS 
Sbjct: 120 FLDDFESRFWMTYRSGFEPIARSVDPKAPATLSFTMKLKALGDQSDFSSDSGWGCMIRSG 179

Query: 178 QMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAA 237
           Q L+A A+  + LGR WR       ++   EI+ LF D   +P+SIH  +  G    +A 
Sbjct: 180 QSLLANAMAMYELGRGWRLSDGGIAEK---EIISLFADDPRAPYSIHRFVGHG---AVAC 233

Query: 238 GSWV 241
           GS++
Sbjct: 234 GSFL 237



 Score = 46.6 bits (109), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 3/61 (4%)

Query: 364 IYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDD 420
            YLDPH  +P +   +   E    +  + H+  +R +H+  +DPS+ IGF  RD+DD+D+
Sbjct: 238 FYLDPHHTRPGLPFHEHPSEYTQEEVGSCHTRRLRRLHIREMDPSMLIGFLIRDEDDWDN 297

Query: 421 F 421
           +
Sbjct: 298 W 298


>gi|67470848|ref|XP_651386.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|56468115|gb|EAL46000.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
          Length = 325

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 71/316 (22%), Positives = 127/316 (40%), Gaps = 71/316 (22%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           + +LG C+    +E L     N+     N      I+ +YR+ +  +G++ ++SD GWGC
Sbjct: 36  VHILGNCYYPETNENLNHLTFNDA----NIKIHDLIVATYRQKYSYLGNTYLSSDAGWGC 91

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHN 225
            +R++QM++  AL+       ++  +Q+  D    E          L  D  +S  SIHN
Sbjct: 92  AIRATQMMIVNALVI------FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHN 145

Query: 226 LL--QAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGER 283
           +   Q  K +     +++ P   C +  +L +                          E 
Sbjct: 146 IYIQQVIKTHNPKGTNFLPPSVCCIAISSLLQ--------------------------EW 179

Query: 284 GGAPVVCIDDASR--HCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
              P  CI   +    CS          P L L+P ++   + +   + +L L+    QS
Sbjct: 180 DKKPFNCITCLNHIPSCS---------CPTLYLIPRIITFTE-HQLILDSLALS----QS 225

Query: 342 LGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDS 401
            G VGG   ++ ++ G Q  +  +LDPH VQ   + G      +  TY  D+        
Sbjct: 226 RGFVGGIGESAIFVFGCQGTTLFFLDPHYVQNAGDFGY----FNPPTYQIDI------SL 275

Query: 402 IDPSLAIGFYCRDKDD 417
           I  S+   F C ++++
Sbjct: 276 ISSSVVFAFMCYEENE 291


>gi|209880175|ref|XP_002141527.1| peptidase family C54 [Cryptosporidium muris RN66]
 gi|209557133|gb|EEA07178.1| peptidase family C54, putative [Cryptosporidium muris RN66]
          Length = 353

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 87/325 (26%), Positives = 130/325 (40%), Gaps = 47/325 (14%)

Query: 118 VCHKIAQ-DEALGDAAGNNGLAE----FNQDFSSRILISYRKGFDPIGD---------SK 163
           + + I Q D++L    GN   A+    F + F   IL SYR  F  I           S 
Sbjct: 20  IIYNIDQHDDSLIFLFGNKYDADKYDSFLKSFHEIILFSYRYNFPTIRSEWDFSIETGSS 79

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSI 223
           +T+D+GWGCMLR  QM +A  LL        R    K +      IL  F D E S FSI
Sbjct: 80  VTTDLGWGCMLRVIQMSLALGLL--------RYCKMKKYTYSLDYILQNFQDLEESLFSI 131

Query: 224 HNLLQAG-KAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGE 282
           H  ++ G   +      W GP +     + L +             P             
Sbjct: 132 HQFVKVGCSIFNKKPKDWFGPTSASTIADYLVKNN-----------PFLFNNFRISSILF 180

Query: 283 RGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVN-PRYIPTLRLTF-TFPQ 340
           + G     I  ++   S  ++  ++ T   + +   LG   +N  +Y  ++   F   PQ
Sbjct: 181 KDGT----IYKSNLFQSFKNEEYSENTLTFVWLCTRLGSSALNIQKYKDSIFSIFKNVPQ 236

Query: 341 SLGIVGGKPGAST--YIVGVQEESAIYLDPH-DVQPVINIGKDDLEADTSTYHSDVIRHI 397
            + I GG   +S+   IVG  E+    LDPH  +Q    I   + E     +   V   I
Sbjct: 237 LICIAGGHNCSSSALLIVGASEKFLYCLDPHIKLQEAFVIKNFNREE----FIQQVPMRI 292

Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFC 422
             ++++PSL+  F C D DDF+  C
Sbjct: 293 SWENLNPSLSFVFCCTDIDDFNHLC 317


>gi|124088531|ref|XP_001347134.1| Cysteine protease required for autophagy-like [Paramecium
           tetraurelia strain d4-2]
 gi|145474259|ref|XP_001423152.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|50057523|emb|CAH03507.1| Cysteine protease required for autophagy-like [Paramecium
           tetraurelia]
 gi|124390212|emb|CAK55754.1| unnamed protein product [Paramecium tetraurelia]
          Length = 277

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 71/291 (24%), Positives = 122/291 (41%), Gaps = 59/291 (20%)

Query: 140 FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQ 199
           F Q   + I  SYR      G +   SD GWGC++R  QM+VA +L+             
Sbjct: 14  FLQLKETFIWFSYRANIQYEGRA--ISDQGWGCLIRVGQMIVANSLIRESTNS------- 64

Query: 200 KPFDREYVEILHLFGDSET----SPFSIHNLLQ-AGKAYGLAAGSW-VGPYAMCRSWEAL 253
           KP D +  +I+ LF D++     +PFSI  +++ A   Y +  G W  GP  MC   + L
Sbjct: 65  KPNDLK-TKIICLFDDNQCFSTLAPFSIQQIIKRADLVYNIKIGDWYTGPKIMCLLEDLL 123

Query: 254 ARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW---TP 310
              Q A+T                           + I +    C +  + Q D     P
Sbjct: 124 ---QSAKT------------------------IKQLKIINFLEQCVI--EKQIDLQFKQP 154

Query: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370
            LL++  ++G ++++  ++  L+     PQ  G + GK   + +++G Q    I +DPH 
Sbjct: 155 QLLIIHAIIGNKELDQYFVAELQKHMQIPQFAGAIVGKSKKAYFLIGYQNNQGIVMDPHY 214

Query: 371 VQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           VQ          E++    +S  ++ I L     ++A+ +Y  +  D+   
Sbjct: 215 VQ----------ESNLLQLNSQ-LKCIPLKEFSGTIALCYYISNSYDYQQL 254


>gi|326665689|ref|XP_002661113.2| PREDICTED: cysteine protease ATG4D-like, partial [Danio rerio]
          Length = 149

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 11/82 (13%)

Query: 108 SSTSDIWLLGVCHKIAQDEALGDAAGNNGLAE-FNQDFSSRILISYRKGFDPIGDSKITS 166
           S +S + LLG  ++++          + G+ E F + FSS + +SYR+GF P+  S ++S
Sbjct: 74  SKSSPVCLLGQSYQLS----------STGVRESFRRVFSSLLWMSYRRGFRPLDGSTLSS 123

Query: 167 DVGWGCMLRSSQMLVAQALLFH 188
           D GWGCMLRS+QML+AQ LL H
Sbjct: 124 DAGWGCMLRSAQMLLAQGLLLH 145


>gi|237837057|ref|XP_002367826.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
 gi|211965490|gb|EEB00686.1| hypothetical protein TGME49_006450 [Toxoplasma gondii ME49]
          Length = 3559

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 87/175 (49%), Gaps = 16/175 (9%)

Query: 285  GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 343
            GA V C+ D S     + +G       LLL PL L   EK+NP Y+ +L      P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023

Query: 344  IVGGKPGASTYIVGVQEESAIYLDPHD-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDS 401
            +V G+   + Y +G Q+++ +YLDPH  +QP        L A T ++ +     +  + +
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQPPAL----QLPAATPSFFAGSCWKVSDVAA 3079

Query: 402  IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHK--KPVNHSDVL 454
            ++PSLA+ F+ R++       A   KL EE +   +  V +  +   P++  DVL
Sbjct: 3080 LNPSLAVAFFVRNERQLLGLAAALKKL-EEVDSFSMLQVVERRRPFSPLDLDDVL 3133



 Score = 45.1 bits (105), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)

Query: 86   VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
            +TA SM R+   V G S       R  IS    D W  G    ++ D A       + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139

Query: 139  EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
            E  +  +     +YR GF P+    G+ K             I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIAR---FTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196

Query: 182  AQALLFHRL 190
             QAL  H L
Sbjct: 1197 MQALRRHFL 1205


>gi|281208441|gb|EFA82617.1| hypothetical protein PPL_04309 [Polysphondylium pallidum PN500]
          Length = 646

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 57/120 (47%), Gaps = 22/120 (18%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRK 196
           + EF +DFS++I +SYR+GF  IGD+   +D GWG                      W+K
Sbjct: 409 INEFLEDFSNKIWMSYRQGFPYIGDTMFENDCGWGY---------------------WKK 447

Query: 197 PLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALAR 255
             Q  +      I+ +F D  T+PFSIHN+   G+ + G   G W  P  +  + ++L  
Sbjct: 448 SGQNEYPELLYNIVRMFLDKPTAPFSIHNIALHGQNHLGKNVGEWFAPSNITHAIKSLVN 507



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 26/99 (26%)

Query: 344 IVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSID 403
           IVGGKP AS Y +  Q+++  YLDPH VQ  I+                       + ++
Sbjct: 541 IVGGKPRASLYFIAAQDDNLFYLDPHTVQQAID-----------------------NEVE 577

Query: 404 PSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQ 442
            SL++      K+DF DF  R+ KL  +S   PL+ + +
Sbjct: 578 FSLSVS--VETKEDFLDFLERSKKLVSKSE-FPLYNIAE 613


>gi|221481944|gb|EEE20310.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 3562

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 53/174 (30%), Positives = 87/174 (50%), Gaps = 15/174 (8%)

Query: 285  GAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLG 343
            GA V C+ D S     + +G       LLL PL L   EK+NP Y+ +L      P SLG
Sbjct: 2970 GAAVDCLRDDSCADVPWRRG------CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLG 3023

Query: 344  IVGGKPGASTYIVGVQEESAIYLDPHD-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDS 401
            +V G+   + Y +G Q+++ +YLDPH  +QP        L A T ++ +     +  + +
Sbjct: 3024 MVAGRGQQAFYCIGTQQKALLYLDPHSGIQPPAL----QLPAATPSFFAGSCWKVSDVAA 3079

Query: 402  IDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
            ++PSLA+ F+ R++       A   KL EE +   +  V +  ++P +  D+ G
Sbjct: 3080 LNPSLAVAFFVRNERQLLGLAAALKKL-EEVDSFSMLQVVE-RRRPFSPLDLDG 3131



 Score = 45.1 bits (105), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)

Query: 86   VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
            +TA SM R+   V G S       R  IS    D W  G    ++ D A       + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139

Query: 139  EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
            E  +  +     +YR GF P+    G+ K             I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196

Query: 182  AQALLFHRL 190
             QAL  H L
Sbjct: 1197 MQALRRHFL 1205


>gi|307108757|gb|EFN56996.1| hypothetical protein CHLNCDRAFT_143632 [Chlorella variabilis]
          Length = 538

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 41/79 (51%), Gaps = 5/79 (6%)

Query: 362 SAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           S IYLDPH VQ       D       T+  +  R + L SIDPSLA+GFYC    ++ D 
Sbjct: 331 SVIYLDPHQVQEAAACPDD-----WRTFWCETPRSMPLPSIDPSLALGFYCSSLGEYRDL 385

Query: 422 CARASKLAEESNGAPLFTV 440
           C+R   L   S GAPL  V
Sbjct: 386 CSRLEALERRSGGAPLVCV 404



 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 62/129 (48%), Gaps = 26/129 (20%)

Query: 179 MLVAQALLFHRLGRPWR----------------KPLQKPFDREYVEILHLFGDS--ETSP 220
           M++AQ L+ H LGR WR                             +L LF D+  E +P
Sbjct: 1   MILAQGLVRHVLGREWRWPEAARQQQAAAAPALAAAPAEAPPRLARLLELFWDTPAERNP 60

Query: 221 FSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDED 280
           FS+H+L +AG+A G+ AG W+GP+ MC++  A A   R       Q + + + V    E 
Sbjct: 61  FSLHSLCRAGQACGVVAGRWLGPWVMCKTLAAAAGAARR------QGVDLGLTVAVLAES 114

Query: 281 GERGGAPVV 289
           G  GGAP++
Sbjct: 115 G--GGAPLL 121



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 22/37 (59%), Positives = 26/37 (70%)

Query: 323 KVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQ 359
           K+NPRYIP L      PQS+GIVGG+P +S Y VG Q
Sbjct: 215 KLNPRYIPQLEAVLAMPQSIGIVGGRPSSSLYFVGFQ 251


>gi|221505025|gb|EEE30679.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 3554

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 77/148 (52%), Gaps = 9/148 (6%)

Query: 311  ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
             LLL PL L   EK+NP Y+ +L      P SLG+V G+   + Y +G Q+++ +YLDPH
Sbjct: 2988 CLLLFPLTLCSGEKINPVYVHSLLAYLELPWSLGMVAGRGQQAFYCIGTQQKALLYLDPH 3047

Query: 370  D-VQPVINIGKDDLEADTSTYHSDVIRHIH-LDSIDPSLAIGFYCRDKDDFDDFCARASK 427
              +QP        L A T ++ +     +  + +++PSLA+ F+ R++       A   K
Sbjct: 3048 SGIQPPAL----QLPAATPSFFAGSCWKVSDVAALNPSLAVAFFVRNERQLLGLAAALKK 3103

Query: 428  LAEESNGAPLFTVTQTHKKPVNHSDVLG 455
            L EE +   +  V +  ++P +  D+ G
Sbjct: 3104 L-EEVDSFSMLQVVE-RRRPFSPLDLDG 3129



 Score = 45.4 bits (106), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 31/129 (24%)

Query: 86   VTAGSMRRIHERVLGPS-------RTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLA 138
            +TA SM R+   V G S       R  IS    D W  G    ++ D A       + LA
Sbjct: 1084 LTALSMDRLGVAVAGRSNKRRRLFRLPISLPGGDPWPAGRVGCVSSDAA----EVQHKLA 1139

Query: 139  EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
            E  +  +     +YR GF P+    G+ K             I SDVGWGC +R++QML+
Sbjct: 1140 ETVRAIA---RFTYRSGFAPMYKCCGEKKRRVGPGFEREWIAINSDVGWGCTVRAAQMLL 1196

Query: 182  AQALLFHRL 190
             QAL  H L
Sbjct: 1197 MQALRRHFL 1205


>gi|330846267|ref|XP_003294964.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
 gi|325074459|gb|EGC28510.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
          Length = 266

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 40/57 (70%), Gaps = 2/57 (3%)

Query: 134 NNGLAE--FNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFH 188
           NN + +  F  D  S I  SYRK F PI ++ IT+D+GWGCMLR+ QM++A+ALL H
Sbjct: 205 NNNIIQSNFLDDVRSLIWFSYRKDFPPIENTTITTDIGWGCMLRTGQMILARALLKH 261


>gi|193784751|dbj|BAG53904.1| unnamed protein product [Homo sapiens]
          Length = 146

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 56/117 (47%), Gaps = 1/117 (0%)

Query: 339 PQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIH 398
           P SL   G      T ++   EE  IYLDPH  QP +         D S +       + 
Sbjct: 4   PLSLSSAGSATHLPTCLILPGEE-LIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPCRMS 62

Query: 399 LDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
           +  +DPS+A+GF+C+ +DDF+D+C +  KL+      P+F + +     +   DVL 
Sbjct: 63  IAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLN 119


>gi|401403014|ref|XP_003881388.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325115800|emb|CBZ51355.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 3465

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 59/106 (55%), Gaps = 7/106 (6%)

Query: 311  ILLLVPLVL-GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPH 369
             LLL PL L   EK+NP Y+P+L      P S+G+V G+   + Y +G Q+++ +YLDPH
Sbjct: 2955 CLLLFPLTLCSGEKINPVYVPSLLAYLELPWSVGMVAGRGQQAFYCIGTQQKALLYLDPH 3014

Query: 370  D-VQ-PVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCR 413
              +Q P + +      A  S +     +   + +++PSL++ F+ R
Sbjct: 3015 SGIQPPALQL----PSATPSFFAGSCWKIADVAALNPSLSVAFFVR 3056



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 17/70 (24%)

Query: 139  EFNQDFSSRILISYRKGFDPI----GDSK-------------ITSDVGWGCMLRSSQMLV 181
            + +Q   S    +YR GF P+    G+ K             I SDVGWGC +R++QML+
Sbjct: 942  QLSQTVGSIARFTYRSGFSPMYKCCGEKKRRAGGGFEREWIAINSDVGWGCTVRAAQMLL 1001

Query: 182  AQALLFHRLG 191
             QAL  H LG
Sbjct: 1002 MQALRRHFLG 1011


>gi|340508254|gb|EGR34000.1| peptidase family c54 protein, putative [Ichthyophthirius
           multifiliis]
          Length = 209

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 20/143 (13%)

Query: 142 QDFSSRILISYRKGFDPI----GDSKI---TSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
           ++F + I ++YR+ F P+     D KI    SD GWGCM+R  QM +A+ L  H   +  
Sbjct: 24  ENFKNIIWMTYRRNFFPLLHNTKDHKIQNYISDTGWGCMVRVGQMALAEGLRHHLQQKGI 83

Query: 195 ---RKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQ-AGKAYGLAAGSWVGPYAMCRSW 250
              ++ +Q   D +       FGD   +P+SI  + + A K + L  G W  P  +C   
Sbjct: 84  YDNKRIIQAFLDND-------FGDDNIAPYSIQKICKIAYKEFQLVPGQWYSPVRICHVL 136

Query: 251 EALARCQRAETGLGCQSLPMAIY 273
             L      +  L C+ L + ++
Sbjct: 137 SLLHN--DKKQILDCEDLKVGVF 157


>gi|46136685|ref|XP_390034.1| hypothetical protein FG09858.1 [Gibberella zeae PH-1]
          Length = 360

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 66/228 (28%), Positives = 96/228 (42%), Gaps = 35/228 (15%)

Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDS---KITSDVGWGCMLRSS- 177
           +A D+ + D    +G   F  DF S+I ++YR  F+PI  S   + TS +     L+S  
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158

Query: 178 --QMLVAQALLFHRLGR-PWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAG-KAY 233
             Q   +   +  RLGR  WR+        E   +L  F D   +P+SIH+ ++ G  A 
Sbjct: 159 GDQSPFSSDTMV-RLGRGDWRRGESV---EEECRLLKDFADDPRAPYSIHSFVRHGASAC 214

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
           G   G W GP A  R  +AL     +           +I V S       G  P V  D+
Sbjct: 215 GKYPGEWFGPSATARCIQALTNSHES-----------SIRVYST------GDGPDVYEDE 257

Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQS 341
                 +      D+ P L+LV   LG++K+ P Y   L      PQS
Sbjct: 258 ---FMQIAKPPGEDFHPTLVLVGTRLGIDKITPVYWEALIAALQMPQS 302


>gi|307190834|gb|EFN74684.1| Cysteine protease ATG4B [Camponotus floridanus]
          Length = 93

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 19/91 (20%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFN---QDFSSRILISYRKGFDPIG-- 160
           I  +   IW+LG  +              N L E +   +D  S +  +YRKGF PIG  
Sbjct: 16  IPQTDEPIWILGKKY--------------NALKELDMIRRDIRSMLWFTYRKGFIPIGGC 61

Query: 161 DSKITSDVGWGCMLRSSQMLVAQALLFHRLG 191
           +S  TSD GWGCMLR  QM++AQAL+   LG
Sbjct: 62  NSTFTSDKGWGCMLRCGQMVLAQALITLHLG 92


>gi|67482849|ref|XP_656724.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|56473943|gb|EAL51338.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449705841|gb|EMD45804.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 348

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 68/278 (24%), Positives = 114/278 (41%), Gaps = 67/278 (24%)

Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
           +S I   YR  F  + ++ +TSD GWGC +R+ QML+A A++             K F  
Sbjct: 85  TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANAII-------------KLFGS 131

Query: 205 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAY-GLAAGSWVGPYAMCRSWEALARCQR 258
           + +    ++H F D   S  P+SIH+L        G   GS   P++             
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFTTQIIVSGNPNGSSFLPFS------------- 178

Query: 259 AETGLGCQSLPMAIYVVSG--DEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILL 313
                        IY ++   ++D  R              C V +     ++   P ++
Sbjct: 179 -----------SVIYALTELVNKDFNRAF-----------ECHVITNKFLLKSINKPTIV 216

Query: 314 LVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP 373
            +P  +  +K + R I      F+F    G+VGG    + Y  G+     ++LDPH V+P
Sbjct: 217 FIPFTIP-DKFDQRLIT----IFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRP 271

Query: 374 VI-NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
              +I K D E D     SD I+ + ++ ++ S+   F
Sbjct: 272 CASSIMKFD-EKDYIAKLSD-IKSLRINELERSVVFSF 307


>gi|167385012|ref|XP_001737178.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165900129|gb|EDR26546.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 348

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 66/277 (23%), Positives = 116/277 (41%), Gaps = 65/277 (23%)

Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
           +S I   YR  F  + ++ + SD GWGC +R+ QML+A A++             K F  
Sbjct: 85  TSLIYFVYRSNFSALPNTSLKSDGGWGCTIRACQMLLANAII-------------KLFGS 131

Query: 205 EYVE---ILHLFGD--SETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
           + +    ++H F D  +   P+SIH+L        + +G+                    
Sbjct: 132 DNINRKTVIHWFLDFYNVECPYSIHSLFTTQI---IVSGN-------------------- 168

Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASR--HCSVFSKG---QADWTPILLL 314
               G   LP+++   +  E   +         D +R   C V +      +   P ++ 
Sbjct: 169 --PNGSSFLPLSVVTYALTELVNK---------DLNRIFECHVITNKFLLNSINKPTIIF 217

Query: 315 VPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPV 374
           +P  +  ++ N R I      F+F    G+VGG    + Y  G+  +  ++LDPH V+P 
Sbjct: 218 IPFTIP-DEFNQRLIS----IFSFNLFAGMVGGCKQKAFYFFGIHHDQLLFLDPHFVRPC 272

Query: 375 I-NIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
             +I K D E D     SD I+ +H++ ++ S+   F
Sbjct: 273 ASSIMKFD-EKDYIAKLSD-IKSLHINELERSVVFSF 307


>gi|145521674|ref|XP_001446691.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414171|emb|CAK79294.1| unnamed protein product [Paramecium tetraurelia]
          Length = 473

 Score = 61.6 bits (148), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 80/340 (23%), Positives = 135/340 (39%), Gaps = 81/340 (23%)

Query: 148 ILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQAL-----LFHRLGRPWRKPLQK 200
           I  +YR+GF      DS +T+D GWGC++R  QM++A+ L      F+++      PL +
Sbjct: 52  IRFTYRQGFQAYQCQDSALTTDSGWGCVIRVGQMMMAELLKRHLKCFYKVDLFSFPPLLQ 111

Query: 201 PFDREYVEILHLFGDSE--------TSP----FSIHNLLQ-AGKAYGLAAGSWVGPYAMC 247
                  ++L +F D +        + P    FSI  +++ A K +G   G W  P  + 
Sbjct: 112 -------DVLQMFKDDDDMESQKGFSKPSKYGFSIQKIMRVAYKEWGKKPGEWYSPNQIV 164

Query: 248 RS-WEALARCQRAET-GLG-------------------------CQ----SLPMAIYVVS 276
           ++ ++ L         GLG                         CQ    S+   +  + 
Sbjct: 165 QAIYKILQEINIPYCYGLGFVPFYESQIDLRAIFQEMCMMEDCVCQKKVFSIEQFLKSLE 224

Query: 277 GDEDGERGGAPV---------VCIDDASRHC-----SVFSK--GQADWTPILLLVPLVL- 319
             E G+     V         VC +D S        ++  K   Q  + P+  +   +L 
Sbjct: 225 KLEIGKEEMVQVMHGNDSISDVCCEDQSEQNKKEIGNLLKKYICQKCFVPVRAVAVCLLS 284

Query: 320 --GLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINI 377
             G ++ NP Y+  +R         G++GG+P  + +IVG  +   + LDPH VQ     
Sbjct: 285 RIGCDEPNPDYLQAIRQFMKKKYFAGMLGGRPKEANFIVGFVDNKFVVLDPHLVQE---- 340

Query: 378 GKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDD 417
            K + E    +        +    ID SL + FY ++ DD
Sbjct: 341 AKMNPEEYIKSCFPGEALFMSDKEIDCSLGLVFYLKNLDD 380


>gi|407037690|gb|EKE38747.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 348

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/278 (20%), Positives = 109/278 (39%), Gaps = 67/278 (24%)

Query: 145 SSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDR 204
           +S I   YR  F  + ++ +TSD GWGC +R+ QML+A +++             K F  
Sbjct: 85  TSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANSII-------------KLFGS 131

Query: 205 EYVE---ILHLFGDSETS--PFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRA 259
           + +    ++H F D   S  P+SIH+L                            +   +
Sbjct: 132 DNINRKTVIHWFLDFYNSECPYSIHSLFT-------------------------TQIIVS 166

Query: 260 ETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKG---QADWTPILLLVP 316
           +   G   LP ++ + +  E   +         + +  C + +      +   P ++ +P
Sbjct: 167 KNPNGSSFLPFSVVIYALTELVNKDF-------NRAFECHIITNKFLLNSINKPTIVFIP 219

Query: 317 LVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQP--- 373
             +  E     +   L   F+F    G+VGG    + Y  G+     ++LDPH V+P   
Sbjct: 220 FTIPDE-----FEQRLITIFSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDPHFVRPCAS 274

Query: 374 -VINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGF 410
            +I   + D  A  S      I+ + ++ ++ S+   F
Sbjct: 275 SIIKFDEKDYIAKLSD-----IKSLRINELERSVVFSF 307


>gi|412989956|emb|CCO20598.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Bathycoccus prasinos]
          Length = 532

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 98/267 (36%), Gaps = 74/267 (27%)

Query: 234 GLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDD 293
            L  G W+ P  +C+ +  +    R ++    + L +         DG  GG P    + 
Sbjct: 234 ALCPGQWMAPSEICKRYGKMM--NRLDSFQNVRCLILG--------DGCGGGVPEFYPER 283

Query: 294 ASRHCSVFSKGQADWTPILLLVPLVLGL-EKVNPRYIPTLRLTFTFPQSLGIVGGKPGAS 352
                    K  AD   +L+LVPL  G  + +NP Y+ +L+   +  + +GIVGGK  AS
Sbjct: 284 VREEM----KTHAD-KDVLILVPLRCGASDAINPEYVKSLQKFLSVRECVGIVGGKKTAS 338

Query: 353 TYIVGVQE--------------------------------------ESAIYLDPHDVQPV 374
            YIVG                                           AIYLDPH  +  
Sbjct: 339 YYIVGFTSGKKSSDSYSGGEKEEEEEEKEEEENEEDEEEEEEEEEETRAIYLDPHVAKAY 398

Query: 375 INIGKDDLEADT-STYHSDV--------IRHIHLDSIDPSLAIGFYCRDKDDFDD----- 420
           ++  +   +  T S Y+           I +    ++DPSL +GF   +  ++D+     
Sbjct: 399 VSPRERSRDESTESAYYRSFFGSASEHGILYTPFHALDPSLVVGFLVGNDTNYDEMNNAS 458

Query: 421 ------FCARASKLAEESNGAPLFTVT 441
                 F    + +  ES   PL TV 
Sbjct: 459 SSSLDAFVDVLTNIERESGSTPLITVV 485


>gi|307201261|gb|EFN81130.1| Cysteine protease ATG4B [Harpegnathos saltator]
          Length = 98

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 13/81 (16%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIG--DSKITSDVGW 170
           +W+LG  +   ++           L    +D  S +  +YRKGF PIG  +S  TSD GW
Sbjct: 23  VWILGRVYNAIKE-----------LDIIRRDIRSILWFTYRKGFVPIGGCNSTFTSDKGW 71

Query: 171 GCMLRSSQMLVAQALLFHRLG 191
           GCMLR  QM++A+AL+   LG
Sbjct: 72  GCMLRCGQMVLARALITLHLG 92


>gi|294953189|ref|XP_002787639.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239902663|gb|EER19435.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 341

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 18/101 (17%)

Query: 148 ILISYRKGFDPI----GDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFD 203
           IL +YR  F+PI    G + + SD GWGC +R++QML+AQA+     G+          D
Sbjct: 67  ILFTYRCAFEPIEGCVGPTSV-SDKGWGCAIRATQMLLAQAV--KMAGK----------D 113

Query: 204 REYVEILHLFGDSETSPFSIHNLLQAGK-AYGLAAGSWVGP 243
            +   +L LF DS  +P S+H +++ G+       G+W GP
Sbjct: 114 ADDSVVLSLFLDSPQAPLSLHRMVKMGQEVLAKRPGTWFGP 154


>gi|407037201|gb|EKE38550.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 193

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 74/150 (49%), Gaps = 25/150 (16%)

Query: 95  HERVLGPSRTGISSSTSDIWLLGVCHKIAQ-DEALGDAAGNNGL-----AEFNQDFSSRI 148
           HE V  P   G  S     ++LGV  K  Q D+ L +      L     A F +  S+  
Sbjct: 25  HEDVQKPIFVGGCS----FYILGVEFKTKQMDKQLAEQPPEVYLQYSSAAAFFR-ISNLF 79

Query: 149 LISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQAL-------LFHRLGRPWRKPLQKP 201
            ++YR G++ + +S +T+DVGWGC +R+ QM++A A+         +    P+      P
Sbjct: 80  WMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYI-----P 134

Query: 202 FDREYVEILHLFGDS--ETSPFSIHNLLQA 229
             +E + +L  F DS   T+P SIH++ ++
Sbjct: 135 TKQEVMNVLIPFIDSPNSTTPLSIHHVYES 164


>gi|224010768|ref|XP_002294341.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969836|gb|EED88175.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 658

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 53/179 (29%), Positives = 69/179 (38%), Gaps = 52/179 (29%)

Query: 326 PRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGV-----------------QEESAIY-LD 367
           P Y  TL    +FPQS+G++GG P  + +  G                  QE    Y LD
Sbjct: 418 PTYGSTLAKLLSFPQSVGMLGGTPRHALWFYGADEVDPPTFGDDGKALNGQECGGWYGLD 477

Query: 368 PHDVQ------PVINIGKDDLEADT------------------------STYHSDVIRHI 397
           PH  Q           GKD++ +D                         +T H++  R I
Sbjct: 478 PHTTQVAPRGTRTTKYGKDEVSSDDIELNNCQWQVQLNDAYLRSLHFTPTTTHANHQRSI 537

Query: 398 HLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEES---NGAPLFTVTQTHKKPVNHSDV 453
            L  +DPS A+GFY RD  DF  F      L++E    N  P   VT T K P    DV
Sbjct: 538 PLSKLDPSCALGFYIRDHSDFVQFTNAIDALSKEHCRPNKLPDI-VTVTEKTPNYEVDV 595



 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 20/25 (80%)

Query: 164 ITSDVGWGCMLRSSQMLVAQALLFH 188
           + SD GWGCMLRS+QM++AQ +  H
Sbjct: 133 LKSDAGWGCMLRSAQMMMAQTVRMH 157


>gi|148682816|gb|EDL14763.1| mCG116861, isoform CRA_a [Mus musculus]
          Length = 127

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 25/81 (30%), Positives = 48/81 (59%), Gaps = 1/81 (1%)

Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
           I+LDPH  Q  ++I +  L  D + +     + + + ++DPS+A+GF+C+++ DFD++C+
Sbjct: 4   IFLDPHTTQTFVDIEESGLVDDQTFHCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCS 63

Query: 424 RASKLAEESNGAPLFTVTQTH 444
              K   + N   +F + Q H
Sbjct: 64  LVQKEILKEN-LRMFELVQKH 83


>gi|407043625|gb|EKE42056.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 183

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/144 (25%), Positives = 66/144 (45%), Gaps = 19/144 (13%)

Query: 113 IWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGC 172
           + +LG C+    +E L     N+     N      I+ +YR+ +  +G++ ++SD GWGC
Sbjct: 36  VHILGNCYYPETNENLNHLTFNDA----NLKIHDLIVATYRQKYSYLGNTYLSSDAGWGC 91

Query: 173 MLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVE-------ILHLFGDSETSPFSIHN 225
            +R++QM+V  AL+       ++  +Q+  D    E          L  D  +S  SIHN
Sbjct: 92  AIRATQMMVVNALVI------FKDQMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHN 145

Query: 226 LL--QAGKAYGLAAGSWVGPYAMC 247
           +   Q  K +     +++ P   C
Sbjct: 146 IYIQQVIKTHNPKGTNFLPPSICC 169


>gi|78070455|gb|AAI07651.1| Atg4d protein [Rattus norvegicus]
          Length = 168

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/86 (26%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
           +YLDPH  QP +++ + +   ++  +H    R +    +DPS  +GFY  ++ +F+  C+
Sbjct: 47  LYLDPHYCQPTVDVNQANFPLES--FHCTSPRKMAFAKMDPSCTVGFYAGNRKEFETLCS 104

Query: 424 RASKLAEESNGA---PLFTVTQTHKK 446
              ++   S+     P+FTV + H +
Sbjct: 105 ELMRILSSSSVTERYPMFTVAEGHAQ 130


>gi|340500608|gb|EGR27474.1| peptidase family c54 protein, putative [Ichthyophthirius
           multifiliis]
          Length = 384

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 42/81 (51%), Gaps = 2/81 (2%)

Query: 341 SLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLD 400
           S+G++GG PG + Y +G+ +   IYLDPH +Q      K     D  TY    I  +   
Sbjct: 223 SIGMIGGVPGKAYYFLGIIDNDFIYLDPHYIQEAHQNEKTVQNID--TYFCKFINRVSQK 280

Query: 401 SIDPSLAIGFYCRDKDDFDDF 421
            ++ SLA GFY ++  + + F
Sbjct: 281 KLESSLAFGFYIKNLQELEQF 301


>gi|156085180|ref|XP_001610073.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797325|gb|EDO06505.1| hypothetical protein BBOV_II005540 [Babesia bovis]
          Length = 206

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 30/135 (22%)

Query: 127 ALGDAAGNNGLAEFNQDFSSRILISYRKGFD-------------------PI-GDSKITS 166
           A+ D      L E  +DF   IL++YR+G                     P+   + I +
Sbjct: 17  AMCDQNPGPKLRERLKDF---ILLTYRRGLSIHLPRFYAGNIPKRFYGIWPLWQQTDIKT 73

Query: 167 DVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNL 226
           D GWGC LR++QM +A+AL      R    PL      +   IL LF D+  +PFS+ NL
Sbjct: 74  DRGWGCALRATQMALAEAL------RDVLSPLDN-VQEQRSRILQLFYDTTEAPFSLENL 126

Query: 227 LQAGKAYGLAAGSWV 241
           + A   +G    +W+
Sbjct: 127 VMADVEHGANVVAWI 141


>gi|392343434|ref|XP_003754884.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
           norvegicus]
 gi|392355909|ref|XP_003752169.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
           norvegicus]
          Length = 126

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 24/81 (29%), Positives = 47/81 (58%), Gaps = 1/81 (1%)

Query: 364 IYLDPHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCA 423
           I+LDPH  Q  ++  +  L  D + +     + + + ++DPS+A+GF+C+++ DFD++C+
Sbjct: 4   IFLDPHTTQTFVDTEESGLVDDHTFHCLQSPQRMSILNLDPSVALGFFCKEEKDFDNWCS 63

Query: 424 RASKLAEESNGAPLFTVTQTH 444
              K   + N   +F + Q H
Sbjct: 64  LVQKEILKEN-LRMFELVQKH 83


>gi|408392897|gb|EKJ72185.1| hypothetical protein FPSE_07642 [Fusarium pseudograminearum CS3096]
          Length = 389

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 44/78 (56%), Gaps = 3/78 (3%)

Query: 347 GKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLE---ADTSTYHSDVIRHIHLDSID 403
           G+P +S Y +G Q     YLDPH  +  +   +D +E    + ++ H+  +R IH+  +D
Sbjct: 262 GRPSSSHYFIGAQGSFLFYLDPHHTRVALPYREDPIEYTSEEIASCHTPRLRRIHVREMD 321

Query: 404 PSLAIGFYCRDKDDFDDF 421
           PS+ IGF  +++ D+ + 
Sbjct: 322 PSMLIGFLIQNEVDWQEL 339



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 26/94 (27%)

Query: 122 IAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI---------------------- 159
           +A D+ + D    +G   F  DF S+I ++YR  F+PI                      
Sbjct: 102 LAYDDPVVDGGWPSG---FISDFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQL 158

Query: 160 GD-SKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
           GD S  +SD GWGCM+RS Q ++A  +   RLGR
Sbjct: 159 GDQSPFSSDSGWGCMIRSGQSMLANTIAMVRLGR 192


>gi|328852767|gb|EGG01910.1| Hypothetical protein MELLADRAFT_123246 [Melampsora larici-populina
           98AG31]
          Length = 134

 Score = 51.2 bits (121), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)

Query: 310 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
           P+L+L+ +  GL++VN  P Y  T+  TFTFPQS+GI GG+P  S ++
Sbjct: 83  PVLVLMNVQSGLDRVNINPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130


>gi|328852471|gb|EGG01617.1| Hypothetical protein MELLADRAFT_92005 [Melampsora larici-populina
           98AG31]
          Length = 134

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)

Query: 310 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
           P+L+L+ +  GL++VN  P Y  T+  TFTFPQS+GI GG+P  S ++
Sbjct: 83  PVLVLMNVQSGLDRVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130


>gi|407037202|gb|EKE38551.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 157

 Score = 50.4 bits (119), Expect = 0.002,   Method: Composition-based stats.
 Identities = 40/137 (29%), Positives = 58/137 (42%), Gaps = 13/137 (9%)

Query: 308 WTPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLD 367
           + P L+ +P+VL     N      L+  +      GIVGG    + ++ G      +YLD
Sbjct: 17  FKPTLVFLPIVL-----NHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGFHALQFLYLD 71

Query: 368 PHDVQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPS-----LAIGFYCRDKDDFDDFC 422
           PH VQP     K   E DT +Y         + +IDP+        GF  ++  + DDF 
Sbjct: 72  PHIVQPSF---KSFTEIDTKSYSPIGSNRFSVHTIDPTKLDDFCTFGFLIKNLHEVDDFM 128

Query: 423 ARASKLAEESNGAPLFT 439
             A  + E SN   L T
Sbjct: 129 KLAKDVFEISNDKELRT 145


>gi|328859149|gb|EGG08259.1| Hypothetical protein MELLADRAFT_123247 [Melampsora larici-populina
           98AG31]
          Length = 134

 Score = 50.4 bits (119), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/48 (47%), Positives = 34/48 (70%), Gaps = 2/48 (4%)

Query: 310 PILLLVPLVLGLEKVN--PRYIPTLRLTFTFPQSLGIVGGKPGASTYI 355
           P+L+L+ +  GL++VN  P Y  T+  TFTFPQS+GI GG+P  S ++
Sbjct: 83  PVLVLMNVQSGLDQVNISPSYCKTIEATFTFPQSVGIAGGRPSQSLFL 130


>gi|84994978|ref|XP_952211.1| autophagy-related peptidase [Theileria annulata strain Ankara]
 gi|65302372|emb|CAI74479.1| autophagy-related peptidase, putative [Theileria annulata]
          Length = 350

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 81/356 (22%), Positives = 126/356 (35%), Gaps = 95/356 (26%)

Query: 128 LGDAAGNNGLAEFNQDFSSR--ILISYRKG-------------------FDPI----GDS 162
           + +    N    +N+   SR  IL +YR G                   F P+    G  
Sbjct: 1   MSNVVRENVNVLYNKRLESRFGILFTYRYGLEYKFPRPINFKRRRLFNIFSPLNLSNGIV 60

Query: 163 KITSDVGWGCMLRSSQMLVAQALLFHRLGRPW---------RKPLQKPFDREYVE----- 208
            I SD GWGC+LRS+QM ++QALL   LG  +         R P  +  D+  +      
Sbjct: 61  TIDSDKGWGCVLRSTQMAISQALLNLVLGPEFSVEQLEIRNRTPRNRKIDQSLLNIDTFE 120

Query: 209 -----------------ILHLFGDSETSPFSIHNLLQAGKAYGLAAGSW-VGPY--AMCR 248
                            IL  F D   + FSI+N + A             GP   A+C 
Sbjct: 121 KLLNGLLDLDGVSAVSVILAQFYDDLNAVFSIYNFVIADYVLKTCTKFLHFGPTSAALC- 179

Query: 249 SWEALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADW 308
                     A   +   +LP+                  +   D   H S   +   + 
Sbjct: 180 ----------ASKIINDLNLPIN----------------SIAFPDGVFHISDVREILEEK 213

Query: 309 TPILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKP-GASTYIVGVQEESAIYLD 367
             +L+ V     L+++       +R  F   Q  GI+GG     S YI G   +   Y D
Sbjct: 214 RNLLVWVSNKKKLDRIER---ECVRSMFRLSQFNGIIGGNLFNKSYYIFGTTNKRLYYND 270

Query: 368 PHDV--QPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDF 421
           PH    +   ++   D+  D   + S  ++ ++    + S  + F  +D+DDF DF
Sbjct: 271 PHLYCKKAFRSLEYVDIFRD---FTSRRVKSMNWRYFNASFTLLFLFKDRDDFQDF 323


>gi|312381461|gb|EFR27207.1| hypothetical protein AND_06241 [Anopheles darlingi]
          Length = 307

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 26/38 (68%)

Query: 137 LAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCML 174
           +  F +DF SRI ++YR+ F  + DS  TSD GWGCM+
Sbjct: 195 IEAFRRDFVSRIWMTYRREFQTMDDSNYTSDCGWGCMI 232


>gi|145500036|ref|XP_001436002.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124403139|emb|CAK68605.1| unnamed protein product [Paramecium tetraurelia]
          Length = 469

 Score = 47.4 bits (111), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 27/116 (23%)

Query: 148 ILISYRKGFDPIG--DSKITSDVGWGCMLRSSQMLVAQAL-----LFHRLGRPWRKPLQK 200
           I  +YR+GF      +S +T+D GWGC++R  QM++A+ L      F+ +      PL +
Sbjct: 52  IRFTYREGFQAYQCQNSTLTTDSGWGCVIRVGQMMMAELLKRHLKCFYNVNLFQFPPLMQ 111

Query: 201 PFDREYVEILHLFGDSETSP------------FSIHNLLQ-AGKAYGLAAGSWVGP 243
                  E+L LF D +               FSI  +++ A + +G   G W  P
Sbjct: 112 -------EVLQLFKDDDEMESLKVQGKPSKYGFSIQKIMRIAYEEWGKKPGEWYSP 160



 Score = 45.8 bits (107), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 51/104 (49%), Gaps = 12/104 (11%)

Query: 319 LGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIG 378
           +G ++ NP YI  +R         G++GG+P  + +IVG  ++  + LDPH VQ   N+ 
Sbjct: 286 IGCDEPNPDYIQAIRQFMKKKYFAGLLGGRPREANFIVGFVDDKFVVLDPHLVQQA-NMN 344

Query: 379 KDDLEADT----STYHSDVIRHIHLDSIDPSLAIGFYCRDKDDF 418
            ++         + + SD         ID SL + FY ++++D 
Sbjct: 345 PEEYVKSCFPGEALFMSD-------KEIDCSLGLVFYLKNEEDL 381


>gi|307108756|gb|EFN56995.1| hypothetical protein CHLNCDRAFT_143631 [Chlorella variabilis]
          Length = 137

 Score = 46.2 bits (108), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 9/113 (7%)

Query: 66  SEKKAVHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGIS-SSTSDIWLLGVCHKIAQ 124
           +E  AV   S G + +   L  A  + ++H+ +     +G S +  + +WLLG C+    
Sbjct: 5   AELSAVDKLSLGLSRSYYALARALRLNKLHDLLA----SGASITPDAPVWLLGQCYSCPP 60

Query: 125 DEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPI--GDSKITSDVGWGCMLR 175
             +  +A     LA     + S   +SYR GF  I  G + + SD GWGC LR
Sbjct: 61  GAS--EAQQEEALARMLHHYQSIPWMSYRTGFTSIAAGSAHLQSDAGWGCTLR 111


>gi|294954843|ref|XP_002788322.1| hypothetical protein Pmar_PMAR026708 [Perkinsus marinus ATCC 50983]
 gi|239903634|gb|EER20118.1| hypothetical protein Pmar_PMAR026708 [Perkinsus marinus ATCC 50983]
          Length = 345

 Score = 45.1 bits (105), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 27/113 (23%), Positives = 52/113 (46%), Gaps = 26/113 (23%)

Query: 336 FTFPQSLGIVGGKPGASTYIVGVQEESA-------------------IYLDPHDVQPVIN 376
              P  +G++GG+   + Y+VGV E+                     + +DPH VQ  + 
Sbjct: 207 LKLPWCVGVIGGQSTRAHYVVGVAEKDTYLQSSTWGRSGYRQTRTDLLSIDPHFVQSAV- 265

Query: 377 IGKDDLEADTSTY-HSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKL 428
                +EA + ++ +SD    +    ++PSL +GFY +D+ D ++  A   ++
Sbjct: 266 -----VEAQSISFKNSDEPSRLQPTKLNPSLGVGFYVKDETDLEELSAELDRV 313


>gi|167386236|ref|XP_001737678.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165899448|gb|EDR26037.1| hypothetical protein EDI_014170 [Entamoeba dispar SAW760]
          Length = 346

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 20/94 (21%)

Query: 132 AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 190
             NN +A   +  S+   ++YR GF   +    +T+D GWGC LRS QML   +L+  RL
Sbjct: 57  TSNNNIA---KHLSTMFRVTYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111

Query: 191 GRP-------WRKPLQKPF-------DREYVEIL 210
             P         + +QK F        REYV+++
Sbjct: 112 QEPNPGFGEDAAEKVQKNFIIHSMEERREYVQLI 145


>gi|195350255|ref|XP_002041656.1| GM16787 [Drosophila sechellia]
 gi|194123429|gb|EDW45472.1| GM16787 [Drosophila sechellia]
          Length = 135

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 11/67 (16%)

Query: 106 ISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKIT 165
           I    +++W+LG  +   Q+  L             +D  SR+  +YR GF P+G+ ++T
Sbjct: 43  IPRRNTNVWVLGKKYNAIQELEL-----------IRRDIQSRLWCTYRHGFSPLGEVQLT 91

Query: 166 SDVGWGC 172
           +D GWG 
Sbjct: 92  TDKGWGL 98


>gi|389585790|dbj|GAB68520.1| peptidase, partial [Plasmodium cynomolgi strain B]
          Length = 894

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
           F+ R    Y KG D I  S   SD GWGCM+R  QM++A  L+ +++ + +
Sbjct: 418 FTKRKRTKYTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 466


>gi|294877403|ref|XP_002767983.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
 gi|239870083|gb|EER00701.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
          Length = 133

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 30/42 (71%), Gaps = 5/42 (11%)

Query: 148 ILISYRKGFDPI----GDSKITSDVGWGCMLRSSQMLVAQAL 185
           IL +YR  F+PI    G + + SD GWGC +R++QML+AQA+
Sbjct: 67  ILFTYRCAFEPIEGCVGPTSV-SDKGWGCAIRATQMLLAQAV 107


>gi|183234005|ref|XP_652043.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|169801304|gb|EAL46674.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449707706|gb|EMD47317.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 346

 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 6/63 (9%)

Query: 132 AGNNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 190
             NN +A   +  S+   I+YR GF   +    +T+D GWGC LRS QML   +L+  RL
Sbjct: 57  TSNNNIA---KHLSTLFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RL 111

Query: 191 GRP 193
             P
Sbjct: 112 QEP 114


>gi|407038566|gb|EKE39191.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 346

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 34/61 (55%), Gaps = 6/61 (9%)

Query: 134 NNGLAEFNQDFSSRILISYRKGFD-PIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGR 192
           NN +A   +  S+   I+YR GF   +    +T+D GWGC LRS QML   +L+  RL  
Sbjct: 59  NNNVA---KHLSTMFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLI--RLQE 113

Query: 193 P 193
           P
Sbjct: 114 P 114


>gi|221060360|ref|XP_002260825.1| peptidase [Plasmodium knowlesi strain H]
 gi|193810899|emb|CAQ42797.1| peptidase, putative [Plasmodium knowlesi strain H]
          Length = 1001

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 143 DFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
           +F++R    + KG D I  S   SD GWGCM+R  QM++A  L+ +++ + +
Sbjct: 464 NFTNRRRTKHTKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 513


>gi|156102174|ref|XP_001616780.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805654|gb|EDL47053.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1007

 Score = 42.4 bits (98), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 144 FSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPW 194
           F+ R    Y KG D I  S   SD GWGCM+R  QM++A  L+ +++ + +
Sbjct: 468 FAKRKRDRYSKGDDTI--SIYMSDTGWGCMIRVVQMVLANILIKYKVSKKY 516


>gi|149030140|gb|EDL85217.1| rCG23129 [Rattus norvegicus]
          Length = 90

 Score = 41.6 bits (96), Expect = 1.0,   Method: Composition-based stats.
 Identities = 16/44 (36%), Positives = 30/44 (68%), Gaps = 1/44 (2%)

Query: 401 SIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTH 444
           ++DPS+A+GF+C+++ DFD++C+   K   + N   +F + Q H
Sbjct: 5   NLDPSVALGFFCKEEKDFDNWCSLVQKEILKEN-LRMFELVQKH 47


>gi|390457789|ref|XP_003732004.1| PREDICTED: cysteine protease ATG4B-like [Callithrix jacchus]
          Length = 102

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 16/51 (31%), Positives = 29/51 (56%)

Query: 405 SLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKPVNHSDVLG 455
           S+++GF+C+ +DDF+D C +  KL+      P+F + +     +   DVL 
Sbjct: 25  SISVGFFCKTEDDFNDRCQQVKKLSLLGGALPMFELVEQQPSHLACPDVLN 75


>gi|426336111|ref|XP_004029547.1| PREDICTED: uncharacterized protein LOC101129491 [Gorilla gorilla
           gorilla]
          Length = 351

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 15/41 (36%), Positives = 25/41 (60%)

Query: 203 DREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGP 243
           +R + +I+  F D   +PF +H L++ G++ G  AG W GP
Sbjct: 51  ERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGP 91


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.133    0.403 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,942,788,756
Number of Sequences: 23463169
Number of extensions: 340944330
Number of successful extensions: 731588
Number of sequences better than 100.0: 787
Number of HSP's better than 100.0 without gapping: 759
Number of HSP's successfully gapped in prelim test: 28
Number of HSP's that attempted gapping in prelim test: 728680
Number of HSP's gapped (non-prelim): 1371
length of query: 486
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 339
effective length of database: 8,910,109,524
effective search space: 3020527128636
effective search space used: 3020527128636
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 79 (35.0 bits)