BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy10465
         (309 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|328707620|ref|XP_001947296.2| PREDICTED: cysteine protease ATG4B-like isoform 1 [Acyrthosiphon
           pisum]
 gi|328707622|ref|XP_003243448.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Acyrthosiphon
           pisum]
 gi|328707624|ref|XP_003243449.1| PREDICTED: cysteine protease ATG4B-like isoform 3 [Acyrthosiphon
           pisum]
 gi|328707626|ref|XP_003243450.1| PREDICTED: cysteine protease ATG4B-like isoform 4 [Acyrthosiphon
           pisum]
          Length = 402

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 184/287 (64%), Positives = 223/287 (77%), Gaps = 28/287 (9%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           DL+QIR DI SRLWFTYRKGFV IG++  T+D+GWGCMLRCGQMVI QAL+FLHLGRDW+
Sbjct: 59  DLQQIRNDIQSRLWFTYRKGFVQIGNTNFTSDRGWGCMLRCGQMVIGQALIFLHLGRDWR 118

Query: 71  WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
           W+ + ++  YLKIL+MFED+R+APYSIHQIAL G S GK VGEWFGPNT+AQVL+KLA  
Sbjct: 119 WDPDKRDIDYLKILRMFEDKRSAPYSIHQIALMGVSHGKQVGEWFGPNTIAQVLKKLATM 178

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ-WQPLVLVIPLRLGIQDINPVYIN 189
           D+ SS+VFHVALDNTLV+N+VKKLCT  ++ +S+ Q W+PLVLVIPLRLGI  INP Y+ 
Sbjct: 179 DELSSLVFHVALDNTLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQ 238

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
           G+K C                           FTFPQSLGVIGG+PNHALYFIG+VGNDV
Sbjct: 239 GVKMC---------------------------FTFPQSLGVIGGRPNHALYFIGFVGNDV 271

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
           IFLDPHT Q IG + +K+ ++E K+D +YHC Q +RL IL+MDPS+A
Sbjct: 272 IFLDPHTTQQIGMLPNKDIETEHKIDHSYHCQQINRLPILNMDPSLA 318


>gi|91083193|ref|XP_972923.1| PREDICTED: similar to Autophagy-specific protein, putative
           [Tribolium castaneum]
 gi|270006970|gb|EFA03418.1| hypothetical protein TcasGA2_TC013405 [Tribolium castaneum]
          Length = 366

 Score =  352 bits (903), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 171/290 (58%), Positives = 214/290 (73%), Gaps = 29/290 (10%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIG-DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
           Q+L+ IR+DI S++WFTYRK FVPIG D GLTTDKGWGCMLRCGQMV+AQAL+ LHLGRD
Sbjct: 36  QELDTIRQDILSKIWFTYRKNFVPIGGDEGLTTDKGWGCMLRCGQMVLAQALVTLHLGRD 95

Query: 69  WQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
           W W   +K+  YLKIL  F D+R AP+SIHQIA+ G SE K VG+WFGPNTVAQVL+KL 
Sbjct: 96  WVWEPETKDSTYLKILSKFVDKRQAPFSIHQIAMMGVSENKEVGQWFGPNTVAQVLKKLV 155

Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLC-TTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
           KYD+WS+I  H+ALDNTL+++ +++LC +      S+  W+PL+L++PLRLG+Q+INP+Y
Sbjct: 156 KYDEWSAIEMHIALDNTLIISDIRELCLSQGSDGCSSGDWKPLLLIVPLRLGLQEINPIY 215

Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
            +G+KKC                           F F QSLGVIGGKPN ALYFIG+VG+
Sbjct: 216 ASGLKKC---------------------------FQFKQSLGVIGGKPNLALYFIGHVGD 248

Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           +VI+LDPHT Q  G V  KE + E +LDSTYHC  ASR++IL MDPS+AV
Sbjct: 249 EVIYLDPHTTQKSGSVESKETEEEIELDSTYHCKYASRINILSMDPSVAV 298


>gi|242007959|ref|XP_002424782.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
 gi|212508305|gb|EEB12044.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
          Length = 388

 Score =  346 bits (888), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 183/306 (59%), Positives = 216/306 (70%), Gaps = 45/306 (14%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +D+  IR DI S+LWFTYRKGFVPIGDSGLT+DKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 39  RDVTAIRSDIKSKLWFTYRKGFVPIGDSGLTSDKGWGCMLRCGQMVLAQALVCLHLGRDW 98

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           +W  +SKE  YL+ILKMFED +TA YSIHQIAL G SEGK VG+WFGPNTV QVL+KL+ 
Sbjct: 99  RWKKDSKEPEYLRILKMFEDTKTATYSIHQIALMGVSEGKDVGQWFGPNTVTQVLKKLSV 158

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA------------------SSNPQWQPLV 171
           YD WSSIV HVALDNT++VN +K LC  N+++                  +S  +W+PL+
Sbjct: 159 YDKWSSIVIHVALDNTIIVNDIKSLCQRNEQSVIDSSAQKHSPLNEPVYFNSARKWKPLL 218

Query: 172 LVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVI 231
           LV+PLRLG+ +INPVY+NG+K C                           FTF QSLGVI
Sbjct: 219 LVVPLRLGLSEINPVYLNGLKTC---------------------------FTFRQSLGVI 251

Query: 232 GGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
           GGKPNHALYFIG VG  VI+LDPHT Q +  V  KE   EK  D +YHCP+ASR  IL M
Sbjct: 252 GGKPNHALYFIGCVGEHVIYLDPHTTQPVSIVDGKELSYEKTADLSYHCPRASRSRILDM 311

Query: 292 DPSIAV 297
           DPS+AV
Sbjct: 312 DPSVAV 317


>gi|332375955|gb|AEE63118.1| unknown [Dendroctonus ponderosae]
          Length = 370

 Score =  346 bits (887), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 162/302 (53%), Positives = 212/302 (70%), Gaps = 30/302 (9%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDS-GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +L  IR+DI S+LWFTYRK FVPIG S G T+DKGWGCMLRCGQMV+ QAL+ +HLGRDW
Sbjct: 45  ELNTIRQDIVSKLWFTYRKDFVPIGGSDGKTSDKGWGCMLRCGQMVLGQALMSIHLGRDW 104

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           QWN  +++  YL ILK FED R AP+SIHQIA  G SEGK VG+WFGPNTVAQVL+KL K
Sbjct: 105 QWNPTTRDATYLSILKKFEDSRKAPFSIHQIASMGISEGKEVGQWFGPNTVAQVLKKLVK 164

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRAS-SNPQWQPLVLVIPLRLGIQDINPVYI 188
           +D+ + +  HVALDN +++++++ LC + + A  S P W+PL+L++PLRLG+  +N +Y+
Sbjct: 165 FDEGNDVAIHVALDNVVIISEIRDLCLSKETADVSTPHWKPLLLIVPLRLGLTQMNSIYL 224

Query: 189 NGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
            G+K+C                           F F QSLG+IGGKPN ALYFIGYVGN+
Sbjct: 225 GGLKQC---------------------------FQFKQSLGIIGGKPNSALYFIGYVGNE 257

Query: 249 VIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVVSQ-RSYSDYK 307
           VI+ DPHT Q  G V +K+   EK +D +YHC  ASR+ +L MDPS+AV    RS +D+ 
Sbjct: 258 VIYFDPHTTQKAGSVGNKDTSEEKDVDLSYHCKHASRMSMLGMDPSVAVCFLCRSEADFN 317

Query: 308 NV 309
           ++
Sbjct: 318 DL 319


>gi|209969827|ref|NP_001123274.2| autophagy-specific gene 4 [Nasonia vitripennis]
          Length = 405

 Score =  343 bits (879), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 179/302 (59%), Positives = 213/302 (70%), Gaps = 39/302 (12%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSG--LTTDKGWGCMLRCGQMVIAQALLFLHL 65
           + +D++ IRRDI SRLWFTYRKGFVPIG  G   T+DKGWGCMLRCGQMV+ QAL+ LHL
Sbjct: 62  AKKDIDAIRRDIRSRLWFTYRKGFVPIGGFGSTFTSDKGWGCMLRCGQMVLGQALISLHL 121

Query: 66  GRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
           GRDW+W   ++   YL IL+ FEDRR APYSIHQIAL GASEGK VG+WFGPNT+AQVL+
Sbjct: 122 GRDWRWTPETRSSTYLNILRRFEDRRAAPYSIHQIALMGASEGKDVGQWFGPNTIAQVLK 181

Query: 126 KLAKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIP 175
           KL  YDDWSSI  HVALDNTLVVN V + C     TT +     P     QW+PL+L+IP
Sbjct: 182 KLVVYDDWSSITIHVALDNTLVVNDVVQQCRVEGATTAEVDGEKPLKAPSQWKPLLLLIP 241

Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
           LRLG+ +INP+YING+K                             F FPQSLG+IGGKP
Sbjct: 242 LRLGLNEINPIYINGLKT---------------------------SFQFPQSLGLIGGKP 274

Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
           +HALYFIGYVG++VIFLDPHT Q  G V  K  D+E ++D+TYHC  ASR+ I  MDPS+
Sbjct: 275 SHALYFIGYVGDEVIFLDPHTTQRAGSVDQKSDDNEAEVDATYHCKIASRIPITGMDPSV 334

Query: 296 AV 297
           A+
Sbjct: 335 AL 336


>gi|357620505|gb|EHJ72670.1| putative Autophagy-specific protein [Danaus plexippus]
          Length = 383

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 167/310 (53%), Positives = 208/310 (67%), Gaps = 37/310 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           QDLE+IRRDITS +W TYRKGFVPIGD GLT+DKGWGCMLRCGQMV+  AL+ +HL  DW
Sbjct: 40  QDLERIRRDITSVIWCTYRKGFVPIGDEGLTSDKGWGCMLRCGQMVLGVALIKVHLSADW 99

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W   +++  YLKI++  E+R+ APYSIHQ+AL GA EGK VG+WFGPNTVAQVL+KL  
Sbjct: 100 VWTPETRDPTYLKIVQRLEERKQAPYSIHQVALMGACEGKEVGQWFGPNTVAQVLKKLVV 159

Query: 130 YDDWSSIVFHVALDNTLV---------VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI 180
           YD WSS+V HVALDNT+V         VN  +  C+ N        W PL+L++PLRLG+
Sbjct: 160 YDKWSSLVIHVALDNTVVKEDILQQCIVNNDRGDCSENVDGFVVSDWMPLLLIVPLRLGL 219

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
            +INP+Y+ G+K C                           F  PQS+GVIGGKPN ALY
Sbjct: 220 SEINPIYMEGLKIC---------------------------FQSPQSIGVIGGKPNQALY 252

Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVVSQ 300
            IG VG++VI+LDPHT Q  G V +K  D +K++D TYHC  ASR+ IL MDPS+AV   
Sbjct: 253 LIGCVGDEVIYLDPHTTQKSGLVENKLTDEQKEMDCTYHCKYASRIPILSMDPSVAVCFL 312

Query: 301 -RSYSDYKNV 309
            R+ SD+  +
Sbjct: 313 CRTRSDFDEL 322


>gi|406042044|gb|AFS31124.1| autophagy related protein Atg4-like protein, partial [Spodoptera
           litura]
          Length = 365

 Score =  338 bits (866), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 162/296 (54%), Positives = 205/296 (69%), Gaps = 35/296 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           QDL++IRRDITS +W TYRKGF+PIGD GLT+DKGWGCMLRCGQMV+  AL+ +HL  DW
Sbjct: 23  QDLDRIRRDITSIIWCTYRKGFIPIGDEGLTSDKGWGCMLRCGQMVLGVALVRVHLSADW 82

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W   +++  YLKI++ FE+R+ APYSIHQ+AL GASEGK VG+WFGPNTVAQVL+KL  
Sbjct: 83  VWTPETRDPTYLKIIQRFEERKQAPYSIHQVALMGASEGKQVGQWFGPNTVAQVLKKLTV 142

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNK---RASSNP-----QWQPLVLVIPLRLGIQ 181
           YD WSS+V HVALDNT+V   + + C  N      S+ P      W PL+L++PLRLG+ 
Sbjct: 143 YDKWSSLVIHVALDNTVVKEDILQQCVVNNDRGDCSAAPDSLVTDWMPLLLIVPLRLGLS 202

Query: 182 DINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYF 241
           +INP+YI+G+K C                           F  PQS+GVIGGKPN ALY 
Sbjct: 203 EINPIYIDGLKIC---------------------------FQCPQSIGVIGGKPNQALYL 235

Query: 242 IGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           +G VG++VI+LDPHT Q  G V  K  D +K++D +YHC  ASR+ +L MDPS+AV
Sbjct: 236 VGCVGDEVIYLDPHTTQRSGLVETKTTDEQKEMDWSYHCKYASRIPMLAMDPSVAV 291


>gi|380023311|ref|XP_003695467.1| PREDICTED: cysteine protease ATG4B-like [Apis florea]
          Length = 382

 Score =  331 bits (849), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 171/300 (57%), Positives = 212/300 (70%), Gaps = 39/300 (13%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           ++L+ IRRDI S+LWFTYRK FVPIG  +S  T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 34  RELDAIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 93

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           DWQWN+ ++   YLKIL+ FED+R AP+SIHQIAL GASEGK VG+WFGPNTVAQVL+KL
Sbjct: 94  DWQWNLETRNSTYLKILERFEDKRNAPFSIHQIALMGASEGKEVGQWFGPNTVAQVLKKL 153

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIPLR 177
             +D+WSSI  HVALDNTL+VN + K C     TT +     P     QW+PL+L+IPLR
Sbjct: 154 VVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGDAPLKAPSQWKPLLLLIPLR 213

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG+ +INP+YING+K                             F  PQSLGVIGGKP H
Sbjct: 214 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPTH 246

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ALYFIG VGN+VI+LDPHT Q  G V  K ++ E ++D+TYHC  + R+ I+ +DPS+A+
Sbjct: 247 ALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMDATYHCKFSGRIPIIEIDPSVAL 306


>gi|383861144|ref|XP_003706046.1| PREDICTED: cysteine protease ATG4B-like [Megachile rotundata]
          Length = 384

 Score =  330 bits (845), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 174/300 (58%), Positives = 209/300 (69%), Gaps = 39/300 (13%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGD--SGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           ++L+ IRRDI S+LWFTYRKGFVPIG   S  T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 34  KELDAIRRDIRSKLWFTYRKGFVPIGGYTSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 93

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           DWQW   ++   YLKIL+ FEDRRTAP+SIHQIA  GASEGK VG+WFGPNT+AQVL+KL
Sbjct: 94  DWQWTPETRNSTYLKILERFEDRRTAPFSIHQIASMGASEGKEVGQWFGPNTIAQVLKKL 153

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCT----TNKRASSN------PQWQPLVLVIPLR 177
             YDDWSSI  HVALDNTL+VN + + C     T   A  N       QW+PL+L+IPLR
Sbjct: 154 VVYDDWSSITIHVALDNTLIVNDILRQCRVEGGTTAEADGNIPLKAPSQWKPLLLLIPLR 213

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG+ +INP+YING+K                             F  PQSLGVIGGKPN 
Sbjct: 214 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPNL 246

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ALYFIG VGN+VI+LDPHT Q  G V  K ++ E ++D+TYHC  ASR+ I  +DPS+A+
Sbjct: 247 ALYFIGCVGNEVIYLDPHTTQRSGSVDKKLEEEEIEMDATYHCKFASRIPITGIDPSVAL 306


>gi|66529516|ref|XP_624577.1| PREDICTED: cysteine protease ATG4B [Apis mellifera]
          Length = 382

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 170/300 (56%), Positives = 212/300 (70%), Gaps = 39/300 (13%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           ++L+ IRRDI S+LWFTYRK FVPIG  +S  T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 34  RELDAIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 93

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           DWQW++ ++   YLKIL+ FED+R AP+SIHQIAL GASEGK VG+WFGPNTVAQVL+KL
Sbjct: 94  DWQWSLETRNSTYLKILERFEDKRNAPFSIHQIALMGASEGKEVGQWFGPNTVAQVLKKL 153

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIPLR 177
             +D+WSSI  HVALDNTL+VN + K C     TT +     P     QW+PL+L+IPLR
Sbjct: 154 VVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGDAPLKAPSQWKPLLLLIPLR 213

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG+ +INP+YING+K                             F  PQSLGVIGGKP H
Sbjct: 214 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPTH 246

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ALYFIG VGN+VI+LDPHT Q  G V  K ++ E ++D+TYHC  + R+ I+ +DPS+A+
Sbjct: 247 ALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMDATYHCKFSGRIPIIEIDPSVAL 306


>gi|322795203|gb|EFZ18025.1| hypothetical protein SINV_08608 [Solenopsis invicta]
          Length = 403

 Score =  329 bits (843), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 176/308 (57%), Positives = 215/308 (69%), Gaps = 35/308 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           ++L+ IRRDI S+LWFTYRKGF+PIG  +S  T+DKGWGCMLRCGQMV+AQAL+ LHLG+
Sbjct: 34  KELDAIRRDIRSKLWFTYRKGFIPIGGCNSTFTSDKGWGCMLRCGQMVLAQALITLHLGK 93

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           DWQW   +K   YLKIL  FED+R A +SIHQIALTGASEGK VG+WFGPNT+AQVL+KL
Sbjct: 94  DWQWMPETKNNTYLKILSRFEDKRAAAFSIHQIALTGASEGKEVGQWFGPNTIAQVLKKL 153

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------QWQPLVLVIPLR 177
             YD+WSS+  HVALDNTL+VN + K C      ++            QW+PL+L+IPLR
Sbjct: 154 IVYDEWSSLTIHVALDNTLIVNDILKQCRIEGGETAEADGEVPLKAPSQWKPLLLLIPLR 213

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY--------EFTFPQSLG 229
           LG+ +INPVYING+K  +           KIL     MQ  +Y         F   QSLG
Sbjct: 214 LGLSEINPVYINGLKVKF-----------KILC----MQKKKYICIQFFQTSFKISQSLG 258

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
           VIGGKPN ALYFIG VG++VI+LDPHT Q  G V DK  + E ++D TYHC  ASR+ I 
Sbjct: 259 VIGGKPNLALYFIGCVGDEVIYLDPHTTQRSGSVEDKISEEEIEMDITYHCKSASRIPIT 318

Query: 290 HMDPSIAV 297
            MDPS+A+
Sbjct: 319 GMDPSVAL 326


>gi|332029697|gb|EGI69576.1| Cysteine protease ATG4B [Acromyrmex echinatior]
          Length = 383

 Score =  321 bits (823), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 167/300 (55%), Positives = 207/300 (69%), Gaps = 39/300 (13%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           ++L+ IRRDI S+LWFTYRKGFVPIG  +S  T+DKGWGCMLRCGQMV+AQAL+ LHLG+
Sbjct: 34  KELDAIRRDIRSKLWFTYRKGFVPIGGCNSTFTSDKGWGCMLRCGQMVLAQALITLHLGK 93

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           DWQW   +K   YLKIL+ FED+R A +SIHQIAL GASEGK VG+WFGPNT+AQVL+KL
Sbjct: 94  DWQWMPETKNNTYLKILRRFEDKRAAAFSIHQIALMGASEGKEVGQWFGPNTIAQVLKKL 153

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------QWQPLVLVIPLR 177
             YD+WSS+  HVALDNTL+VN + + C      ++            QW+PL+L+IPLR
Sbjct: 154 IVYDEWSSLTIHVALDNTLIVNDILRQCRVEGGVTAEADGEIPLRAPSQWKPLLLLIPLR 213

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG+ +INPVYING+K                             F   QSLGVIGGKPN 
Sbjct: 214 LGLSEINPVYINGLKT---------------------------SFKISQSLGVIGGKPNL 246

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ALYFIG VG++VI+LDPHT Q  G + DK  + E ++D +YHC  ASR+ I  MDPS+A+
Sbjct: 247 ALYFIGCVGDEVIYLDPHTTQKSGSIEDKISEEEIEMDISYHCKSASRIPITGMDPSVAL 306


>gi|195401363|ref|XP_002059283.1| GJ16311 [Drosophila virilis]
 gi|194156157|gb|EDW71341.1| GJ16311 [Drosophila virilis]
          Length = 397

 Score =  320 bits (821), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 154/288 (53%), Positives = 198/288 (68%), Gaps = 31/288 (10%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+LE IRRDI SRLW TYR GFVP+G+  LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 55  QELELIRRDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIELHLGRDW 114

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W  + ++  YLKI+  FED R + YSIHQIALTG S+ KAVGEW GPNTVAQ+L+ L +
Sbjct: 115 FWTPDCRDATYLKIVNRFEDTRKSFYSIHQIALTGESQNKAVGEWLGPNTVAQILKILVR 174

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDWSS+V HVA+D+T+V++++   C    +  S   W+PL+L++PLRLGI DINP+YI 
Sbjct: 175 FDDWSSLVVHVAMDSTVVLDEIYTRC----QEVSASTWKPLLLIVPLRLGISDINPMYIP 230

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C  L                             S G+IGG+PN ALYF+GYV ++V
Sbjct: 231 ALKRCLEL---------------------------SSSCGMIGGRPNQALYFLGYVDDEV 263

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q  G V  K   +E++LD +YH   A+RL    MDPS+AV
Sbjct: 264 LYLDPHTTQRAGSVAQKTTAAEQELDESYHQKYAARLSFGAMDPSLAV 311


>gi|340709295|ref|XP_003393246.1| PREDICTED: cysteine protease ATG4B-like isoform 1 [Bombus
           terrestris]
          Length = 383

 Score =  320 bits (820), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 168/300 (56%), Positives = 209/300 (69%), Gaps = 39/300 (13%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           ++L+ IRRDI S+LWFTYRK FVPIG  +S  T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 34  RELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 93

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           DWQW   ++   YLKIL+ FED+RTA +SIHQIA  GASEGK VG+WFGPNT+AQVL+KL
Sbjct: 94  DWQWTAETRNSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQVLKKL 153

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIPLR 177
             +D+WSSI  HVALDNTL+VN + K C     TT +   + P     QW+PL+L+IPLR
Sbjct: 154 VVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGAVPLKAPSQWKPLLLLIPLR 213

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG+ +INP+YING+K                             F  PQSLGVIGGKPN 
Sbjct: 214 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPNL 246

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ALYFIG V N+VI+LDPHT Q  G V  K ++ E ++D+TYHC  +SR+ I  +DPS+A+
Sbjct: 247 ALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMDATYHCKSSSRIPITGIDPSVAL 306


>gi|350425106|ref|XP_003494013.1| PREDICTED: cysteine protease ATG4B-like [Bombus impatiens]
          Length = 383

 Score =  320 bits (820), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 168/300 (56%), Positives = 209/300 (69%), Gaps = 39/300 (13%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           ++L+ IRRDI S+LWFTYRK FVPIG  +S  T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 34  RELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 93

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           DWQW   ++   YLKIL+ FED+RTA +SIHQIA  GASEGK VG+WFGPNT+AQVL+KL
Sbjct: 94  DWQWTAETRNSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQVLKKL 153

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIPLR 177
             +D+WSSI  HVALDNTL+VN + K C     TT +   + P     QW+PL+L+IPLR
Sbjct: 154 VVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGAVPLKAPSQWKPLLLLIPLR 213

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG+ +INP+YING+K                             F  PQSLGVIGGKPN 
Sbjct: 214 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPNL 246

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ALYFIG V N+VI+LDPHT Q  G V  K ++ E ++D+TYHC  +SR+ I  +DPS+A+
Sbjct: 247 ALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMDATYHCKSSSRIPITGIDPSVAL 306


>gi|195118032|ref|XP_002003544.1| GI17971 [Drosophila mojavensis]
 gi|193914119|gb|EDW12986.1| GI17971 [Drosophila mojavensis]
          Length = 382

 Score =  320 bits (819), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 153/288 (53%), Positives = 196/288 (68%), Gaps = 31/288 (10%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+LE IRRDI SRLW TYR GFVP+G+  LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 55  QELEPIRRDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 114

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W  + ++  YLKI+  FED R + YSIHQIAL G S+ KAVGEW GPNTVAQ+L+ L +
Sbjct: 115 FWTPDCRDATYLKIVNRFEDTRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKILVR 174

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDWSS+  HVA+D+T+V++ +   C    + SS   W+PL+L++PLRLGI DINP+YI 
Sbjct: 175 FDDWSSLAVHVAMDSTVVLDDIYTCC----QESSESSWKPLLLIVPLRLGITDINPIYIP 230

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C  L                             S G+IGG+PN ALYF+GYV ++V
Sbjct: 231 ALKRCLEL---------------------------SSSCGMIGGRPNQALYFLGYVDDEV 263

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q  G V  K   +E++LD +YH   A+RL    MDPS+AV
Sbjct: 264 LYLDPHTTQRAGAVAQKTTAAERELDESYHQKYAARLSFGAMDPSLAV 311


>gi|340709297|ref|XP_003393247.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Bombus
           terrestris]
          Length = 386

 Score =  319 bits (818), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 168/300 (56%), Positives = 209/300 (69%), Gaps = 39/300 (13%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           ++L+ IRRDI S+LWFTYRK FVPIG  +S  T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 37  RELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 96

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           DWQW   ++   YLKIL+ FED+RTA +SIHQIA  GASEGK VG+WFGPNT+AQVL+KL
Sbjct: 97  DWQWTAETRNSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQVLKKL 156

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIPLR 177
             +D+WSSI  HVALDNTL+VN + K C     TT +   + P     QW+PL+L+IPLR
Sbjct: 157 VVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGAVPLKAPSQWKPLLLLIPLR 216

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG+ +INP+YING+K                             F  PQSLGVIGGKPN 
Sbjct: 217 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPNL 249

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ALYFIG V N+VI+LDPHT Q  G V  K ++ E ++D+TYHC  +SR+ I  +DPS+A+
Sbjct: 250 ALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMDATYHCKSSSRIPITGIDPSVAL 309


>gi|195051960|ref|XP_001993206.1| GH13687 [Drosophila grimshawi]
 gi|193900265|gb|EDV99131.1| GH13687 [Drosophila grimshawi]
          Length = 393

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 152/288 (52%), Positives = 196/288 (68%), Gaps = 31/288 (10%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+LE IRRDI SRLW TYR GFVP+G+  LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61  QELEVIRRDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIELHLGRDW 120

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W  + ++  YLKI+  FED R + YSIHQIAL G S+ KAVGEW GPNTVAQ+L+ L +
Sbjct: 121 FWTPDCRDTTYLKIVNRFEDTRKSFYSIHQIALMGESQNKAVGEWLGPNTVAQILKILVR 180

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDWSS+  HVA+D+T+V++ +  LC    +  S   W+PL+L++PLRLGI DINP+Y+ 
Sbjct: 181 FDDWSSLNVHVAMDSTVVLDDIFTLC----QEPSESAWKPLLLIVPLRLGISDINPIYVP 236

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C  L                             S G+IGG+PN ALYF+GYV ++V
Sbjct: 237 ALKRCLEL---------------------------NSSCGMIGGRPNQALYFLGYVDDEV 269

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q  G V  K   +E++LD +YH   A+RL    MDPS+AV
Sbjct: 270 LYLDPHTTQRAGAVAQKTTAAEQELDESYHQKYAARLSFAAMDPSLAV 317


>gi|194759168|ref|XP_001961821.1| GF15159 [Drosophila ananassae]
 gi|190615518|gb|EDV31042.1| GF15159 [Drosophila ananassae]
          Length = 402

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 153/296 (51%), Positives = 193/296 (65%), Gaps = 33/296 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+LE IRRDI SRLW TYR GF P+G+  LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61  QELELIRRDIQSRLWCTYRCGFAPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W    ++  YLKI+  FED + + YSIHQIAL G S+ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTPECRDATYLKIVNRFEDVKNSCYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVR 180

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDW S+  HVA+D+T+V++ +  LC           W+PL+LVIPLRLGI DINP+Y+ 
Sbjct: 181 FDDWCSLAVHVAMDSTVVLDDIYSLCREGD------SWKPLLLVIPLRLGITDINPMYVP 234

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C  L                             S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVDDEV 267

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVVSQRSYSD 305
           ++LDPHT Q  G V  K    E++ D TYH   A+RL+   MDPS+AV      SD
Sbjct: 268 LYLDPHTTQRTGTVGQKTGVGEQEYDETYHQKHAARLNFSAMDPSLAVCFLCKTSD 323


>gi|157115549|ref|XP_001658259.1| Autophagy-specific protein, putative [Aedes aegypti]
 gi|108876876|gb|EAT41101.1| AAEL007228-PA [Aedes aegypti]
          Length = 389

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 157/292 (53%), Positives = 202/292 (69%), Gaps = 33/292 (11%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           + +DL+ IRRD+ +RLW TYR+GFVPIG S LTTDKGWGCMLRCGQMV+AQAL  LHLGR
Sbjct: 37  ATEDLDLIRRDVQTRLWCTYRRGFVPIGGSQLTTDKGWGCMLRCGQMVLAQALTQLHLGR 96

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASEGKAVGEWFGPNTVAQVLRK 126
           DW W   +  E YLKI+  FED + AP+S+HQIALTG +SE K VGEWFGPNTVAQVL+K
Sbjct: 97  DWSWTPETTNETYLKIVNRFEDSKAAPFSLHQIALTGESSEEKRVGEWFGPNTVAQVLKK 156

Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ-WQPLVLVIPLRLGIQDINP 185
           L K+DDW S+V HVALDNTL  ++V +LC       SNP  W+PL+L+IPLRLG+ +INP
Sbjct: 157 LVKFDDWCSLVIHVALDNTLATDEVLELCVDR----SNPDSWKPLLLIIPLRLGLSEINP 212

Query: 186 VYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYV 245
           +Y++G+KKC+ L                             + G++GG+PN ALYFIGYV
Sbjct: 213 IYVDGLKKCFEL---------------------------AGNCGMVGGRPNQALYFIGYV 245

Query: 246 GNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            ++ ++LDPHT Q  G +  K    E++LD T+H   A R++   MDPS+A+
Sbjct: 246 ADEALYLDPHTVQRSGTIGSKRDPDERELDETFHQKYARRINFKGMDPSLAL 297


>gi|427783027|gb|JAA56965.1| Putative cysteine protease required for autophagy [Rhipicephalus
           pulchellus]
          Length = 390

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 144/308 (46%), Positives = 197/308 (63%), Gaps = 50/308 (16%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           +  +L+ +R +ITS++W TYRK F  I  +  T+D GWGCMLRCGQMV+A+A++  HLG+
Sbjct: 37  TFHELDDLRSNITSKIWLTYRKNFPAISGTDYTSDTGWGCMLRCGQMVVAEAVMRRHLGK 96

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           DWQW+  +K+E YL++L+MF+D++   YSIHQIA  G SEGK VG+WFGPNT+A VLRKL
Sbjct: 97  DWQWSPGTKDEKYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKEVGQWFGPNTIAHVLRKL 156

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC----TTNKR--------------ASSNPQWQP 169
           + +D WSS+  HVA+DN +V++ ++K+C    TT+                A+    W+P
Sbjct: 157 STFDKWSSLAMHVAMDNVVVMDDIRKICRVETTTDVEDGIRNRTQSHGGPAAAGARSWKP 216

Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
           LVL IPLRLG+ +INP+Y  G+K+ +AL                            QSLG
Sbjct: 217 LVLFIPLRLGLSEINPIYYCGLKRTFAL---------------------------KQSLG 249

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
           +IGGKPNHALY IG VG+D++FLDPHT Q        + D E   D +YHC  ASR+ I 
Sbjct: 250 IIGGKPNHALYIIGVVGDDLVFLDPHTTQ-----LAVDLDVECPEDESYHCAHASRMDIG 304

Query: 290 HMDPSIAV 297
            +DPSIA+
Sbjct: 305 QLDPSIAL 312


>gi|260795879|ref|XP_002592932.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
 gi|229278156|gb|EEN48943.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
          Length = 380

 Score =  297 bits (761), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 147/313 (46%), Positives = 199/313 (63%), Gaps = 57/313 (18%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +D ++++ DI+SRLWFTYRK F PIG +G  +D+GWGCMLRCGQM++ QAL+  HLGRDW
Sbjct: 42  KDRQELQNDISSRLWFTYRKNFTPIGGTGPMSDQGWGCMLRCGQMMLGQALICRHLGRDW 101

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           +W     +  Y KIL++F D++ + YSIHQIA  G SEGK+VG+WFGPNTVAQVL+KLA 
Sbjct: 102 RWKSAVYDNDYTKILQLFLDKKDSCYSIHQIAQMGVSEGKSVGQWFGPNTVAQVLKKLAL 161

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLC------------------------TTNKRASSNP 165
           ++DWSS+  HVA+DNT++++ +KKLC                        T+ +  S   
Sbjct: 162 FEDWSSLAIHVAMDNTVIIDDIKKLCRSARQPTPSQVTNSFLCNGVSAEQTSARSRSPAL 221

Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
            WQPL+L+IPLRLG+ ++NPVY + +K C                           FT  
Sbjct: 222 PWQPLMLIIPLRLGLSELNPVYTDCLKAC---------------------------FTLR 254

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL-DSTYHCPQAS 284
           QSLG+IGGKPNHA YFIGYVGN +++LDPHT Q        E +    + DS++HC   S
Sbjct: 255 QSLGMIGGKPNHAHYFIGYVGNSLVYLDPHTTQPA-----VELEGNVPIPDSSFHCTHPS 309

Query: 285 RLHILHMDPSIAV 297
           R++I  +DPSIA+
Sbjct: 310 RMNIQDLDPSIAL 322


>gi|291226947|ref|XP_002733451.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
           kowalevskii]
          Length = 356

 Score =  296 bits (759), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 151/303 (49%), Positives = 197/303 (65%), Gaps = 46/303 (15%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +D  ++  DI SR+W TYRK F  IG +G T+D GWGCMLRCGQM++AQALL  HLGR+W
Sbjct: 37  RDRSELLADIKSRIWITYRKNFSAIGGTGPTSDNGWGCMLRCGQMILAQALLCKHLGREW 96

Query: 70  QWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
           +W     + E Y KILK+F DR+ + YSIHQIA  G  EGK++G+WFGPNTVAQVLRKL 
Sbjct: 97  RWESREHQNETYCKILKLFLDRKDSCYSIHQIAQMGVGEGKSIGQWFGPNTVAQVLRKLT 156

Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLCTT--------NKRASSNPQ------WQPLVLVI 174
            +DDWSSI  H+++DNT+VV  ++KLC T         K AS++ +      W+PLVL I
Sbjct: 157 LFDDWSSIAVHISMDNTIVVEDIRKLCRTPLFTECASPKAASASLENGGTTYWKPLVLFI 216

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
           PLRLG+ +INP+Y++ +KKC                           FT  QSLG+IGGK
Sbjct: 217 PLRLGLTEINPLYLDVLKKC---------------------------FTLKQSLGMIGGK 249

Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
           PNHA YFIG+ G  +++LDPHT Q    V D  + +    D TYHC   SR++I+H+DPS
Sbjct: 250 PNHAHYFIGFYGKTLVYLDPHTTQP---VVDINKWASIP-DDTYHCKHPSRMNIMHLDPS 305

Query: 295 IAV 297
           IA+
Sbjct: 306 IAL 308


>gi|241999098|ref|XP_002434192.1| cystein protease, putative [Ixodes scapularis]
 gi|215495951|gb|EEC05592.1| cystein protease, putative [Ixodes scapularis]
          Length = 382

 Score =  296 bits (758), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 145/305 (47%), Positives = 196/305 (64%), Gaps = 49/305 (16%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
            +L+ +R D+TS++W TYRK F  IG +G T+D GWGCMLRCGQMV+AQAL+  HLGR+W
Sbjct: 33  HELDDLRSDVTSKIWLTYRKNFPAIGGTGPTSDSGWGCMLRCGQMVLAQALMRRHLGREW 92

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           +W   +K + YL IL+MF+D++   +SIHQIA  G SEGK VGEWFGPNTVA VLRKLA 
Sbjct: 93  RWEPGTKNKDYLYILRMFQDKKNCTFSIHQIAQMGVSEGKTVGEWFGPNTVAHVLRKLAI 152

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKL-----------------CTTNKRASSNPQWQPLVL 172
           +D WSS+  HVA+DNT+++N++ K                     ++ A+S   W+PL+L
Sbjct: 153 FDKWSSLAIHVAMDNTVIINEISKFRCHIWAAADGLVRNRTNSEPSRPANSEGSWKPLLL 212

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
            IPLRLG+ +IN +Y  G+K+ +AL                            QSLG+IG
Sbjct: 213 FIPLRLGLSEINRIYAFGLKRTFAL---------------------------KQSLGMIG 245

Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           GKPNHALYFIG V +++IFLDPHT Q + C    + D +   D +YHC  ASR++I  +D
Sbjct: 246 GKPNHALYFIGVVEDELIFLDPHTTQ-LAC----DLDVDSPDDQSYHCAHASRMNISELD 300

Query: 293 PSIAV 297
           PS+A+
Sbjct: 301 PSVAL 305


>gi|291202714|dbj|BAI82576.1| autophagy-related 4 [Haemaphysalis longicornis]
          Length = 387

 Score =  294 bits (752), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 140/303 (46%), Positives = 191/303 (63%), Gaps = 47/303 (15%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
            +L+ +R D+TS++W TYR+ F  I  +  T+D GWGCMLRCGQM +A+AL+  HL R W
Sbjct: 39  HELDDLRSDVTSKIWLTYRRNFPAISGTDYTSDTGWGCMLRCGQMAVAEALMRRHLRRGW 98

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           QW    ++E+YL++L+MF+D++   +SIHQIA  G SEGKAVG+WFGPNTVA VLRKLA 
Sbjct: 99  QWAPGIRDESYLRVLRMFQDKKNCTFSIHQIAQMGVSEGKAVGQWFGPNTVAHVLRKLAA 158

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSN---------------PQWQPLVLVI 174
           +D WSS+  HVA+DN ++++ ++K+C     A S                  W+PL+L I
Sbjct: 159 FDKWSSLAIHVAMDNVVIMDDIRKVCRLEATAESGVRNRAEPAGLAAAAAESWKPLLLFI 218

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
           PLRLG+ +INP+Y  G+K+ +AL                            QSLG+IGGK
Sbjct: 219 PLRLGLSEINPIYYCGLKRTFAL---------------------------KQSLGIIGGK 251

Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
           PNHALY IG VG+D++FLDPHT Q        + D+E   D +YHC  ASR+ I  +DPS
Sbjct: 252 PNHALYIIGVVGDDLVFLDPHTTQ-----LAVDLDTEFPDDESYHCAHASRMDIGQLDPS 306

Query: 295 IAV 297
           IA+
Sbjct: 307 IAL 309


>gi|346466653|gb|AEO33171.1| hypothetical protein [Amblyomma maculatum]
          Length = 401

 Score =  293 bits (749), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 142/304 (46%), Positives = 192/304 (63%), Gaps = 48/304 (15%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
            +L+ +R D+TS++W TYRK F  I  +  T+D GWGCMLRCGQMVIA+AL+  HLG+ W
Sbjct: 52  HELDDLRNDVTSKIWLTYRKNFPAISGTDHTSDTGWGCMLRCGQMVIAEALMRRHLGKGW 111

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           QW    ++E YL++L+MF+D++   YSIHQIA  G SEGKAVG+WFGPNT+A VLRKL+ 
Sbjct: 112 QWAPGIRDENYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKAVGQWFGPNTIAHVLRKLSA 171

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA----------------SSNPQWQPLVLV 173
           +D WSS+  HVA+DN +V++ ++K+C     A                +S   W+PL+L 
Sbjct: 172 FDKWSSLAVHVAMDNVVVMDDIRKICRVETPAVDDGVRHRTQSHGLACASAVSWKPLLLF 231

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           IPLRLG+ +INPVY  G+K+ +AL                            QS+G+IGG
Sbjct: 232 IPLRLGLNEINPVYYCGLKRTFAL---------------------------KQSVGIIGG 264

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           KPNHAL+ IG VG+D++FLDPHT Q        + D E   D +YHC  ASR+ I  +DP
Sbjct: 265 KPNHALFIIGVVGDDLVFLDPHTTQ-----LAVDLDVEFPEDESYHCAHASRMDIGQLDP 319

Query: 294 SIAV 297
           SIA+
Sbjct: 320 SIAL 323


>gi|443684303|gb|ELT88258.1| hypothetical protein CAPTEDRAFT_225251 [Capitella teleta]
          Length = 410

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 141/318 (44%), Positives = 201/318 (63%), Gaps = 63/318 (19%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           DL ++++D+ SRLW TYRKGF PIG SG T+D+GWGCMLRCGQM++AQ+L+  HLGRDW+
Sbjct: 46  DLAELKKDVKSRLWLTYRKGFDPIGGSGPTSDQGWGCMLRCGQMMLAQSLICRHLGRDWR 105

Query: 71  WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
           W  +  +  Y +IL+MF+D+R+A YS+  IA  G SEGKA+GEWFGPNT++QVLRKL   
Sbjct: 106 WTKDKYDPKYFEILRMFQDKRSAKYSLQVIASMGTSEGKAIGEWFGPNTISQVLRKLCVS 165

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP------------------------- 165
           D+WS++V HVALDNT++++ V  LC ++K+ S+ P                         
Sbjct: 166 DEWSNLVVHVALDNTVIIDDVFCLCKSSKKESNEPIPGVHAACASALLFNGHDPTAEGHD 225

Query: 166 ------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPR 219
                  W+PL+L++PLRLG+ +INPVYI  +K C                         
Sbjct: 226 PSGEDDSWRPLLLIVPLRLGLSEINPVYIPFLKTC------------------------- 260

Query: 220 YEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYH 279
              TF QS+G+IGGKPNHA +FIG++ ++++++DPHT Q      D  Q  E   D++YH
Sbjct: 261 --LTFKQSVGIIGGKPNHAHWFIGFLEDELVYMDPHTTQPF---VDVTQPGES--DASYH 313

Query: 280 CPQASRLHILHMDPSIAV 297
           C  + R+ + ++DPS+AV
Sbjct: 314 CSYSCRMPVSYLDPSVAV 331


>gi|125986465|ref|XP_001356996.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
 gi|54645322|gb|EAL34062.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
          Length = 409

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 153/288 (53%), Positives = 191/288 (66%), Gaps = 33/288 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+LE IRRDI SRLW TYR GF+P+G+  LTTD+GWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61  QELEVIRRDIQSRLWCTYRHGFMPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLGRDW 120

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W    ++  YLKI+  FED R + YSIHQIAL G S+ KAVGEW GPNTVAQ+L+KL  
Sbjct: 121 FWTPECQDATYLKIVNRFEDVRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVL 180

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDW S+V HVA+D+T+V++ V  LC           W+PL+L+IPLRLGI DINP+YI 
Sbjct: 181 FDDWCSLVVHVAMDSTVVLDDVYSLCLEGD------AWKPLLLIIPLRLGISDINPIYIP 234

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C  L                             S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVEDEV 267

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q  G V  K    E++ D TYH   A+RL    MDPS+AV
Sbjct: 268 LYLDPHTTQRTGVVGQKTSSGEQEHDETYHQKHAARLSFSAMDPSLAV 315


>gi|195159572|ref|XP_002020652.1| GL15485 [Drosophila persimilis]
 gi|194117602|gb|EDW39645.1| GL15485 [Drosophila persimilis]
          Length = 409

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 153/288 (53%), Positives = 191/288 (66%), Gaps = 33/288 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+LE IRRDI SRLW TYR GF+P+G+  LTTD+GWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61  QELEVIRRDIQSRLWCTYRHGFMPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLGRDW 120

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W    ++  YLKI+  FED R + YSIHQIAL G S+ KAVGEW GPNTVAQ+L+KL  
Sbjct: 121 FWTPECQDATYLKIVNRFEDVRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVL 180

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDW S+V HVA+D+T+V++ V  LC           W+PL+L+IPLRLGI DINP+YI 
Sbjct: 181 FDDWCSLVVHVAMDSTVVLDDVYSLCLEGD------AWKPLLLIIPLRLGISDINPIYIP 234

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C  L                             S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVEDEV 267

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q  G V  K    E++ D TYH   A+RL    MDPS+AV
Sbjct: 268 LYLDPHTTQKTGVVGQKTSSGEQEHDETYHQKHAARLSFSAMDPSLAV 315


>gi|158296556|ref|XP_316946.4| AGAP008497-PA [Anopheles gambiae str. PEST]
 gi|157014766|gb|EAA12240.4| AGAP008497-PA [Anopheles gambiae str. PEST]
          Length = 389

 Score =  292 bits (747), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 156/291 (53%), Positives = 204/291 (70%), Gaps = 31/291 (10%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           +  DLE IR+D+ SRLW TYR+GFVPIG++ LTTDKGWGCMLRCGQMV+AQALL LHLGR
Sbjct: 37  ASDDLEAIRQDVQSRLWCTYRRGFVPIGNTQLTTDKGWGCMLRCGQMVLAQALLQLHLGR 96

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASEGKAVGEWFGPNTVAQVLRK 126
           DW W   ++++ YL I+  FED + AP+S+HQIAL G +SE K +GEWFGPNTVAQVL+K
Sbjct: 97  DWVWEAETRDDIYLNIVNRFEDSKQAPFSLHQIALMGDSSEEKRIGEWFGPNTVAQVLKK 156

Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPV 186
           L K+DDW  +V HVALDNT+  +++ +LC   K   +   W+PL+L+IPLRLG+ ++NP+
Sbjct: 157 LVKFDDWCRLVIHVALDNTVATDEIVELCVDKKEPEA---WKPLLLIIPLRLGLSEVNPI 213

Query: 187 YINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG 246
           YI G+KKC+ L                           P S G+IGG+PN ALYFIGYVG
Sbjct: 214 YIEGLKKCFQL---------------------------PGSCGMIGGRPNQALYFIGYVG 246

Query: 247 NDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            + ++LDPHT Q +G V  K+  +E++LD T+H   ASR+    MDPS+AV
Sbjct: 247 GEALYLDPHTVQRVGTVGSKQDPAEQELDETFHQRYASRISFTSMDPSLAV 297


>gi|170036509|ref|XP_001846106.1| Autophagy-specific protein [Culex quinquefasciatus]
 gi|167879174|gb|EDS42557.1| Autophagy-specific protein [Culex quinquefasciatus]
          Length = 379

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 157/296 (53%), Positives = 208/296 (70%), Gaps = 30/296 (10%)

Query: 4   ANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
           +N L   DL+QIRRD+ SRLW TYR+GFVPIG S  T+DKGWGCMLRCGQMV+AQALL L
Sbjct: 22  SNALETLDLDQIRRDVQSRLWCTYRRGFVPIGGSQHTSDKGWGCMLRCGQMVLAQALLQL 81

Query: 64  HLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASEGKAVGEWFGPNTVAQ 122
           HLGRDW+W   +++E YL+I+  FED + AP+S+HQIALTG +SE K VGEWFGPNTVAQ
Sbjct: 82  HLGRDWEWTAETRDETYLRIVNRFEDNKAAPFSLHQIALTGESSEEKRVGEWFGPNTVAQ 141

Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQD 182
           VL+KL K+DDW S+V HVALD+TL  ++V +LC    ++ +   W+PL+L+IPLRLG+ +
Sbjct: 142 VLKKLVKFDDWCSVVVHVALDSTLATDEVVELC--EDKSDAGTSWKPLLLIIPLRLGLSE 199

Query: 183 INPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFI 242
           INP+Y+ G+KKC+ L                             + G+IGG+PN ALYFI
Sbjct: 200 INPIYVAGLKKCFELA---------------------------GNCGMIGGRPNQALYFI 232

Query: 243 GYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
           GYVG++ +FLDPHT Q  G + DK    E+++D ++H   A R++   MDPS+A+ 
Sbjct: 233 GYVGDEALFLDPHTVQRSGNIGDKTGLDEREMDESFHQRYARRINFKAMDPSLALC 288


>gi|390365223|ref|XP_785967.3| PREDICTED: cysteine protease ATG4B-like [Strongylocentrotus
           purpuratus]
          Length = 390

 Score =  291 bits (744), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 154/345 (44%), Positives = 202/345 (58%), Gaps = 82/345 (23%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
           LS   LE  R D+ SRLWFTYRKGF  IG +G TTD+GWGCMLRCGQM++AQAL++ HLG
Sbjct: 57  LSQHQLEA-RLDVLSRLWFTYRKGFSNIGGTGPTTDQGWGCMLRCGQMMLAQALVYKHLG 115

Query: 67  RDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
           RDW+W    ++E YLKIL++F D++ + +SIHQIA  G  EGK VG+WFGPNTV QV+RK
Sbjct: 116 RDWRWRPQEQDETYLKILQLFLDKKDSCFSIHQIAQMGVGEGKKVGDWFGPNTVGQVIRK 175

Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN------------------KRASSNP--- 165
           L+ +D WS +  HVALDNT+V+  ++KLCT N                  KR SS+    
Sbjct: 176 LSPFDSWSDLAVHVALDNTVVIEDIRKLCTVNSTTEETSSEGSKTGSERRKRTSSSENIR 235

Query: 166 -----------------------------QWQPLVLVIPLRLGIQDINPVYINGIKKCYA 196
                                         W+ L L+IPLRLG+ +IN VY+  +K+C  
Sbjct: 236 HKMQLSPENTNIQLPNGLMEGACVSPGGVSWRSLFLIIPLRLGLNEINTVYMQRLKRC-- 293

Query: 197 LPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
                                    FT PQSLGVIGGKPNHA YFIG +G+++++LDPHT
Sbjct: 294 -------------------------FTLPQSLGVIGGKPNHAHYFIGVLGDEMVYLDPHT 328

Query: 257 NQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVVSQR 301
            Q    + DK    +   D ++HC  ASR+ I ++DPSI +VS +
Sbjct: 329 TQPAADI-DKWAFLQ---DESFHCEHASRMPIKNLDPSIGLVSTK 369


>gi|148233205|ref|NP_001088025.1| cysteine protease ATG4B [Xenopus laevis]
 gi|61211762|sp|Q640G7.1|ATG4B_XENLA RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B
 gi|52221191|gb|AAH82660.1| LOC494717 protein [Xenopus laevis]
          Length = 384

 Score =  291 bits (744), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 148/315 (46%), Positives = 193/315 (61%), Gaps = 52/315 (16%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           EQ+  DITSRLWFTYR+ F  IG +G T+D GWGCMLRCGQM+ AQAL+  H+GRDW+W+
Sbjct: 40  EQLLNDITSRLWFTYRRNFQAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHVGRDWRWD 99

Query: 73  VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
               +  YL IL  F D++ + YSIHQIA  G  EGK +G+W+GPNTVAQVLRKLA +D 
Sbjct: 100 KQKPKGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKYIGQWYGPNTVAQVLRKLAVFDQ 159

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSN----------------PQWQPLVLVIPL 176
           WSSI  H+A+DNT+VV+++++LC      SS+                 QW+PLVL+IPL
Sbjct: 160 WSSIAVHIAMDNTVVVDEIRRLCRAGSGESSDAGALSNGYTGDSDPSCAQWKPLVLLIPL 219

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
           RLG+ +IN  YI  +K C                           F  PQSLGVIGG+PN
Sbjct: 220 RLGLSEINEAYIETLKHC---------------------------FMVPQSLGVIGGRPN 252

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSI 295
            A YFIGYVG+++I+LDPHT Q    +  +  D     D ++HC     R+H+  +DPSI
Sbjct: 253 SAHYFIGYVGDELIYLDPHTTQ----LSVEPSDCSFIEDESFHCQHPPCRMHVSEIDPSI 308

Query: 296 AV----VSQRSYSDY 306
           AV     SQ  + D+
Sbjct: 309 AVGFFCSSQEDFEDW 323


>gi|195575679|ref|XP_002077704.1| GD23066 [Drosophila simulans]
 gi|194189713|gb|EDX03289.1| GD23066 [Drosophila simulans]
          Length = 411

 Score =  290 bits (743), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 151/288 (52%), Positives = 191/288 (66%), Gaps = 33/288 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+LE IRRDI SRLW TYR GF P+G+  LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61  QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W  + ++  YLKI+  FED R + YSIHQIA  G S+ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTPDCRDATYLKIVNRFEDVRNSFYSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVR 180

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDWSS+  HVA+D+T+V++ V   C           W+PL+L+IPLRLGI DINP+Y+ 
Sbjct: 181 FDDWSSLAIHVAMDSTVVLDDVYASCREGG------SWKPLLLIIPLRLGITDINPLYVP 234

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C  L                             S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVDDEV 267

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q  G V  K   +E+  D TYH   A+RL+   MDPS+AV
Sbjct: 268 LYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFSAMDPSLAV 315


>gi|19920488|ref|NP_608563.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
 gi|7296129|gb|AAF51423.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
 gi|16198037|gb|AAL13802.1| LD26292p [Drosophila melanogaster]
 gi|220945806|gb|ACL85446.1| Atg4-PA [synthetic construct]
 gi|220955642|gb|ACL90364.1| Atg4-PA [synthetic construct]
          Length = 411

 Score =  290 bits (743), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 151/288 (52%), Positives = 191/288 (66%), Gaps = 33/288 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+LE IRRDI SRLW TYR GF P+G+  LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61  QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W  + ++  YLKI+  FED R + YSIHQIA  G S+ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTPDCRDATYLKIVNRFEDVRNSFYSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVR 180

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDWSS+  HVA+D+T+V++ V   C           W+PL+L+IPLRLGI DINP+Y+ 
Sbjct: 181 FDDWSSLAIHVAMDSTVVLDDVYASC------REGGSWKPLLLIIPLRLGITDINPLYVP 234

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C  L                             S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVDDEV 267

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q  G V  K   +E+  D TYH   A+RL+   MDPS+AV
Sbjct: 268 LYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFSAMDPSLAV 315


>gi|442625102|ref|NP_001259852.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
 gi|440213106|gb|AGB92389.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
          Length = 410

 Score =  290 bits (743), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 151/288 (52%), Positives = 191/288 (66%), Gaps = 33/288 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+LE IRRDI SRLW TYR GF P+G+  LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61  QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W  + ++  YLKI+  FED R + YSIHQIA  G S+ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTPDCRDATYLKIVNRFEDVRNSFYSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVR 180

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDWSS+  HVA+D+T+V++ V   C           W+PL+L+IPLRLGI DINP+Y+ 
Sbjct: 181 FDDWSSLAIHVAMDSTVVLDDVYASC------REGGSWKPLLLIIPLRLGITDINPLYVP 234

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C  L                             S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVDDEV 267

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q  G V  K   +E+  D TYH   A+RL+   MDPS+AV
Sbjct: 268 LYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFSAMDPSLAV 315


>gi|118404310|ref|NP_001072464.1| autophagy related 4B, cysteine peptidase [Xenopus (Silurana)
           tropicalis]
 gi|115291929|gb|AAI21871.1| cysteine endopeptidase AUT-like (1O128) [Xenopus (Silurana)
           tropicalis]
          Length = 384

 Score =  290 bits (743), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 148/314 (47%), Positives = 192/314 (61%), Gaps = 49/314 (15%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           EQ+  DITSRLWFTYR+ F  IG +G T+D GWGCMLRCGQM+ AQALL  H+GRDW+W+
Sbjct: 40  EQLLNDITSRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHIGRDWRWD 99

Query: 73  VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
               +  YL IL  F D++ + YSIHQIA  G  EGK +G+W+GPNTVAQVLRKLA +D 
Sbjct: 100 KQKSQGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKCIGQWYGPNTVAQVLRKLAVFDQ 159

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ----------------WQPLVLVIPL 176
           WSSI  H+A+DNT+V++++++LC      SS                   W+PLVL+IPL
Sbjct: 160 WSSIAVHIAMDNTVVMDEIRRLCRAGTNESSEAGALCNGYTGVSDPSCSLWKPLVLLIPL 219

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
           RLG+ DIN  YI  +K C                           F  PQSLGVIGG+PN
Sbjct: 220 RLGLSDINEAYIETLKHC---------------------------FMVPQSLGVIGGRPN 252

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSI 295
            A YFIGYVG+++I+LDPHT Q    +  +  D     D ++HC     R+H+  +DPSI
Sbjct: 253 SAHYFIGYVGDELIYLDPHTTQ----LAVEPSDCCFVEDESFHCQHPPCRMHVSEIDPSI 308

Query: 296 AV-VSQRSYSDYKN 308
           AV    RS  D+++
Sbjct: 309 AVGFFCRSQEDFED 322


>gi|194853882|ref|XP_001968241.1| GG24763 [Drosophila erecta]
 gi|190660108|gb|EDV57300.1| GG24763 [Drosophila erecta]
          Length = 411

 Score =  290 bits (742), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 151/288 (52%), Positives = 190/288 (65%), Gaps = 33/288 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+LE IRRDI SRLW TYR GF P+G+  LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61  QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W  + ++  YLKI+  FED R + YSIHQIA  G S+ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTADCRDATYLKIVNRFEDVRNSFYSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVR 180

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDWSS+  HVA+D+T+V++ V   C           W+PL+L+IPLRLGI DINP+Y+ 
Sbjct: 181 FDDWSSLAIHVAMDSTVVLDDVYSSC------REGGSWKPLLLIIPLRLGITDINPLYVP 234

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C  L                             S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVDDEV 267

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q  G V  K   +E+  D TYH   A+RL    MDPS+AV
Sbjct: 268 LYLDPHTTQRTGVVGQKTAVAEQDYDETYHQKHAARLSFSAMDPSLAV 315


>gi|195437827|ref|XP_002066841.1| GK24338 [Drosophila willistoni]
 gi|194162926|gb|EDW77827.1| GK24338 [Drosophila willistoni]
          Length = 400

 Score =  290 bits (741), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 147/288 (51%), Positives = 192/288 (66%), Gaps = 33/288 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+L+ IRRDI SRLW TYR  FVP+G+  LTTD+GWGCMLRCGQMV+AQAL+ LHLGR+W
Sbjct: 55  QELDLIRRDIQSRLWCTYRHSFVPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLGREW 114

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W    ++  YLKI+  FED R + YS+HQIAL G S+ K VGEW GPNTVAQ+L+KL  
Sbjct: 115 YWTSECRDATYLKIVNRFEDARKSYYSLHQIALMGESQNKMVGEWLGPNTVAQILKKLVC 174

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDW S+V HVA+D+T+V++ +  L      +     W+PL+L+IPLRLGI DINP+Y+ 
Sbjct: 175 FDDWCSLVIHVAMDSTVVLDDIYSL------SQDGESWKPLLLIIPLRLGITDINPIYVP 228

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C                           F    S G+IGG+PN ALYF+GYV ++V
Sbjct: 229 ALKRC---------------------------FELESSCGMIGGRPNQALYFVGYVDDEV 261

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q  G V  K   +E++LD TYH   A+RL+   MDPS+AV
Sbjct: 262 LYLDPHTTQRTGAVGQKTTTAEQELDETYHQKYAARLNFSAMDPSLAV 309


>gi|195470405|ref|XP_002087497.1| GE17286 [Drosophila yakuba]
 gi|194173598|gb|EDW87209.1| GE17286 [Drosophila yakuba]
          Length = 411

 Score =  290 bits (741), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 150/288 (52%), Positives = 190/288 (65%), Gaps = 33/288 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+LE IRRDI SRLW TYR GF P+G+  LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61  QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            W  + ++  YLKI+  FED R + YSIHQIA  G ++ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTSDCRDATYLKIVNRFEDVRNSYYSIHQIAQMGETQNKAVGEWLGPNTVAQILKKLVR 180

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDWSS+  HVA+D+T+V++ V   C           W+PL+L+IPLRLGI DINP+Y+ 
Sbjct: 181 FDDWSSLAIHVAMDSTVVLDDVYSSC------REGGSWKPLLLIIPLRLGITDINPLYVP 234

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C  L                             S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------ESSCGMIGGRPNQALYFLGYVDDEV 267

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q  G V  K   +E+  D TYH   A+RL    MDPS+AV
Sbjct: 268 LYLDPHTTQRTGVVGQKTAAAEQDYDETYHQKHAARLSFSAMDPSLAV 315


>gi|405972565|gb|EKC37327.1| Cysteine protease ATG4B [Crassostrea gigas]
          Length = 405

 Score =  289 bits (740), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 140/324 (43%), Positives = 196/324 (60%), Gaps = 69/324 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D ++++ D  S++W TYRK F  IG +G T D GWGCMLRCGQM++AQAL+  HLGRDW+
Sbjct: 43  DRDELKGDFLSKIWCTYRKNFPAIGGTGPTCDGGWGCMLRCGQMMLAQALVVRHLGRDWK 102

Query: 71  WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
           WN N +++ Y +IL+MF D+++A YSI QIA  G SEGK VG WFGPNTVAQVL+KLA Y
Sbjct: 103 WNKNCQDQTYKRILQMFADKKSANYSIQQIASMGVSEGKPVGSWFGPNTVAQVLKKLAVY 162

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSN-------------------------- 164
           D+WSSIV H+A+DNT++ N +K +C  + +++ +                          
Sbjct: 163 DEWSSIVIHIAMDNTVIENDIKSVCKDDGKSTCDIIGVRQLKHESAATGRSKKSSQDSSK 222

Query: 165 -----------PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
                        W+PL+LVIPLRLG+ +IN VY+  +K C                   
Sbjct: 223 QDKNKQNAVDVKSWKPLLLVIPLRLGLTEINSVYVQSLKAC------------------- 263

Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKK 273
                    +FPQS+G+IGGKPNHA +F+GY+ + +I+LDPHT Q       ++ DS   
Sbjct: 264 --------LSFPQSVGIIGGKPNHAHWFVGYMSDKLIYLDPHTTQLC-----EDLDSPNF 310

Query: 274 LDSTYHCPQASRLHILHMDPSIAV 297
            D +YHCP  S ++++ +DPSIA+
Sbjct: 311 SDESYHCPYPSTMNVMELDPSIAL 334


>gi|355757609|gb|EHH61134.1| Cysteine protease ATG4A, partial [Macaca fascicularis]
          Length = 396

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 144/315 (45%), Positives = 201/315 (63%), Gaps = 55/315 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 42  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 101

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 102 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 161

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C        T   R      +SN         P W+PL
Sbjct: 162 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPLDYLTASNQSKGTSAHCPAWKPL 221

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 222 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 254

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 255 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 310

Query: 290 HMDPSIAVVSQRSYS 304
           ++DPS+A+V  R  S
Sbjct: 311 NLDPSVALVGIRRLS 325


>gi|395854618|ref|XP_003799779.1| PREDICTED: cysteine protease ATG4A isoform 1 [Otolemur garnettii]
          Length = 398

 Score =  287 bits (734), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 138/308 (44%), Positives = 196/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCT----------------------TNKRASSNPQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                       +   ++  P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTAGESPPGSLTALNQSKGTSACRPAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++GN++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGNELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|338729393|ref|XP_001490718.3| PREDICTED: cysteine protease ATG4A [Equus caballus]
          Length = 398

 Score =  286 bits (732), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 139/308 (45%), Positives = 199/308 (64%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLC-------------------TTNKRASSN---PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                    +N+  S++   P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCILPLSADTAGESPPSSLNASNRSKSTSAGWPAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|291407754|ref|XP_002720229.1| PREDICTED: autophagy-related cysteine endopeptidase 2 [Oryctolagus
           cuniculus]
          Length = 405

 Score =  286 bits (732), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 198/308 (64%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 51  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 110

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 111 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 170

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR--------------ASSNPQWQPL 170
           W+S+  +V++DNT+V+  +KK+C        T  +R              ++  P W+PL
Sbjct: 171 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSANTPGERLHDSLTASNQSKGTSACCPAWKPL 230

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 231 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 263

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++GN++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 264 LGGKPNNAYYFIGFLGNELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 319

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 320 NLDPSVAL 327


>gi|328722655|ref|XP_003247627.1| PREDICTED: cysteine protease ATG4B-like [Acyrthosiphon pisum]
          Length = 252

 Score =  286 bits (731), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 130/183 (71%), Positives = 157/183 (85%), Gaps = 1/183 (0%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           DL+QIR DI SRLWFTYRKGFV IG++  T+D+GWGCMLRCGQMVI QAL+FLHLGRDW+
Sbjct: 59  DLQQIRNDIQSRLWFTYRKGFVQIGNTNFTSDRGWGCMLRCGQMVIGQALIFLHLGRDWR 118

Query: 71  WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
           W+ + ++  YLKIL+MFED+R+APYSIHQIAL G S GK VGEWFGPNT+AQVL+KLA  
Sbjct: 119 WDPDKRDIDYLKILRMFEDKRSAPYSIHQIALMGVSHGKQVGEWFGPNTIAQVLKKLATM 178

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ-WQPLVLVIPLRLGIQDINPVYIN 189
           D+ SS+VFHVALDNTLV+N+VKKLCT  ++ +S+ Q W+PLVLVIPLRLGI  INP Y+ 
Sbjct: 179 DELSSLVFHVALDNTLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQ 238

Query: 190 GIK 192
           G+K
Sbjct: 239 GVK 241


>gi|348513452|ref|XP_003444256.1| PREDICTED: cysteine protease ATG4B-like [Oreochromis niloticus]
          Length = 391

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 148/320 (46%), Positives = 194/320 (60%), Gaps = 57/320 (17%)

Query: 2   RHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL 61
           +  N L+ +D  +I  D+TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ +AL+
Sbjct: 31  KEYNALTEKD--EILSDVTSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALV 88

Query: 62  FLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
             HLGRDW+W    K+ + Y+ +L  F D++ + YSIHQIA  G  EGK +G+W+GPNTV
Sbjct: 89  CRHLGRDWRWAKGQKQRDEYISLLNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTV 148

Query: 121 AQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ-------------- 166
           AQVL+KLA +D WS +V HVA+DNT+V+ ++K+LC     A    +              
Sbjct: 149 AQVLKKLAVFDTWSKVVVHVAMDNTVVIEEIKRLCMPWLDACGELEGVGELNGCLEGACA 208

Query: 167 --------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
                   W+PLVL+IPLRLG+ DIN  YI  +K+C+ LP                    
Sbjct: 209 MAEEETALWRPLVLLIPLRLGLSDINDAYIETLKQCFMLP-------------------- 248

Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTY 278
                  QSLGVIGGKPN A YFIGYVG ++I+LDPHT Q      +  +DS+   D TY
Sbjct: 249 -------QSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQP---AVEPSEDSQVP-DETY 297

Query: 279 HCPQ-ASRLHILHMDPSIAV 297
           HC     R+HI  +DPSIA 
Sbjct: 298 HCQHPPCRMHICELDPSIAA 317


>gi|47564112|ref|NP_001001171.1| cysteine protease ATG4A [Bos taurus]
 gi|61211781|sp|Q6PZ05.1|ATG4A_BOVIN RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related cysteine endopeptidase 2A;
           Short=Autophagin-2A; AltName: Full=Autophagy-related
           protein 4 homolog A; AltName: Full=bAut2A
 gi|45861656|gb|AAS78581.1| Aut2a [Bos taurus]
          Length = 398

 Score =  285 bits (730), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 138/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
           W+S+  +V++DNT+V+  +KK+C T   ++  P                       W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPAERPLESLTASNQSKGPSACCTAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q   R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTAD-DQTFHCLQPPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|403289551|ref|XP_003935915.1| PREDICTED: cysteine protease ATG4A isoform 1 [Saimiri boliviensis
           boliviensis]
          Length = 422

 Score =  285 bits (730), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 139/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 68  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 127

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 128 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 187

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------------------TTNKRASSN--PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                    +   R +S   P W+PL
Sbjct: 188 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTPGDRPPDSLTASNESRGTSAYCPAWKPL 247

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 248 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 280

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 281 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 336

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 337 NLDPSVAL 344


>gi|348563665|ref|XP_003467627.1| PREDICTED: cysteine protease ATG4A-like [Cavia porcellus]
          Length = 398

 Score =  285 bits (730), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 138/308 (44%), Positives = 199/308 (64%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLC-------------------TTNKRASSN---PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                   T+N+   ++   P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPFSADTADKSSPDSFITSNQSKDTSAFCPAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  +++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQQMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|296470926|tpg|DAA13041.1| TPA: cysteine protease ATG4A [Bos taurus]
          Length = 396

 Score =  285 bits (730), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 138/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
           W+S+  +V++DNT+V+  +KK+C T   ++  P                       W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPAERPLESLTASNQSKGPSACCTAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q   R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTAD-DQTFHCLQPPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|187282046|ref|NP_001119770.1| uncharacterized protein LOC678769 [Rattus norvegicus]
 gi|169642267|gb|AAI60890.1| LOC678769 protein [Rattus norvegicus]
          Length = 406

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 139/318 (43%), Positives = 200/318 (62%), Gaps = 56/318 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQV++KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLC---------------------TTNKRASSNP--QWQP 169
           W+S+  +V++DNT+V+  +KK+C                     ++  + +S P   W+P
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKP 223

Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
           L+L++PLRLGI  INPVYI   K+C                           F  PQSLG
Sbjct: 224 LLLIVPLRLGINQINPVYIEAFKEC---------------------------FKMPQSLG 256

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHI 288
            +GGKPN+A YFIG +G+++IFLDPHT Q     +   ++S    D T+HC Q+  R+ I
Sbjct: 257 ALGGKPNNAYYFIGSLGDELIFLDPHTTQT----FVDTEESGLVDDHTFHCLQSPQRMSI 312

Query: 289 LHMDPSIAVVSQRSYSDY 306
           L++DPS+A+V Q ++  +
Sbjct: 313 LNLDPSVALVGQGAFMGF 330


>gi|345807894|ref|XP_538136.3| PREDICTED: cysteine protease ATG4A [Canis lupus familiaris]
          Length = 398

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 138/308 (44%), Positives = 194/308 (62%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI +RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDIRARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++   Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPREYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCT----------------------TNKRASSNPQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                       +    +S P W+PL
Sbjct: 164 WNSLAIYVSMDNTVVIEDIKKMCCVLPLSADTIGESPLNTLNASNQSKSAPASCPAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|426257739|ref|XP_004022480.1| PREDICTED: cysteine protease ATG4A [Ovis aries]
          Length = 398

 Score =  285 bits (728), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 138/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
           W+S+  +V++DNT+V+  +KK+C T   ++  P                       W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPAERPLESLTASTQSKGPSACCTAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q   R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQPPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|151554833|gb|AAI47963.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Bos taurus]
          Length = 398

 Score =  285 bits (728), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 138/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
           W+S+  +V++DNT+V+  +KK+C T   ++  P                       W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPAERPLESLTASNQSKGPSACCTAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q   R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQPPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|387762879|ref|NP_001248420.1| cysteine protease ATG4A [Macaca mulatta]
 gi|380809390|gb|AFE76570.1| cysteine protease ATG4A isoform a [Macaca mulatta]
 gi|383413573|gb|AFH30000.1| cysteine protease ATG4A isoform a [Macaca mulatta]
          Length = 398

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 141/308 (45%), Positives = 198/308 (64%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C        T   R      +SN         P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPLDYLTASNQSKGTSAHCPAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|355705060|gb|EHH30985.1| Cysteine protease ATG4A, partial [Macaca mulatta]
          Length = 396

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 141/308 (45%), Positives = 198/308 (64%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 42  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 101

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 102 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 161

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C        T   R      +SN         P W+PL
Sbjct: 162 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPLDYLTASNQSKGTSAHCPAWKPL 221

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 222 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 254

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 255 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 310

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 311 NLDPSVAL 318


>gi|410920724|ref|XP_003973833.1| PREDICTED: cysteine protease ATG4B-like [Takifugu rubripes]
          Length = 394

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 149/312 (47%), Positives = 187/312 (59%), Gaps = 58/312 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+I  D+TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ QAL+  HLGRDW+W 
Sbjct: 40  EEILSDVTSRLWFTYRKSFPPIGGTGPTSDTGWGCMLRCGQMILGQALMCRHLGRDWRWV 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              K+ + Y+ IL  F D++ + YSIHQIA  G  EGK +G+W+GPNTVAQVL+KLA +D
Sbjct: 100 RGQKQRQEYISILNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCT---TNKRASSNPQ---------------------- 166
            WS +V HVA+DNT+V+ ++K+LC            P+                      
Sbjct: 160 TWSRLVVHVAMDNTVVIEEIKRLCMPWLDKAEVFGEPERVGELNGCLEGACALSEEEVAL 219

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W+PLVL+IPLRLG+ DIN  YI  +KKC+ LP                           Q
Sbjct: 220 WKPLVLLIPLRLGLSDINGAYIETLKKCFMLP---------------------------Q 252

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
           SLGVIGGKPN A YFIGYVG ++I+LDPHT Q      +  Q      D TYHC     R
Sbjct: 253 SLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQTAVEPCEHGQFP----DDTYHCQHPPCR 308

Query: 286 LHILHMDPSIAV 297
           +HI  +DPSIAV
Sbjct: 309 MHICELDPSIAV 320


>gi|344286328|ref|XP_003414911.1| PREDICTED: cysteine protease ATG4A [Loxodonta africana]
          Length = 411

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 137/308 (44%), Positives = 196/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 57  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 116

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 117 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 176

Query: 133 WSSIVFHVALDNTLVVNQVKKLCT----------------------TNKRASSNPQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                       +   ++  P W+PL
Sbjct: 177 WNSLAVYVSMDNTVVIEDIKKMCCVFPLSAGAAGESPPAFPSASSQSKGTSACCPAWKPL 236

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 237 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 269

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 270 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGMVD-DQTFHCLQSPQRMNIL 325

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 326 NLDPSVAL 333


>gi|355669953|gb|AER94691.1| ATG4 autophagy related 4-like protein A [Mustela putorius furo]
          Length = 408

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 196/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 54  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 113

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 114 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 173

Query: 133 WSSIVFHVALDNTLVVNQVKKLC----------------TTNKRASSN------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                T N    S       P W+PL
Sbjct: 174 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTVGESPPDTLNASNQSKGTPAGRPAWKPL 233

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 234 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 266

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 267 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 322

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 323 NLDPSVAL 330


>gi|402911087|ref|XP_003918174.1| PREDICTED: cysteine protease ATG4A isoform 1 [Papio anubis]
          Length = 398

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 141/308 (45%), Positives = 198/308 (64%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C        T   R      +SN         P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPLDYLTASNQSKGTSAHCPAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGMVN-DQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|417410362|gb|JAA51656.1| Putative cysteine protease required for autophagy, partial
           [Desmodus rotundus]
          Length = 396

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 42  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 101

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 102 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 161

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------------------TTNKRASSN--PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                    T + +A S   P W+PL
Sbjct: 162 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSADMPSESSHDPLNATNHNKAISACCPAWKPL 221

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 222 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 254

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R+ IL
Sbjct: 255 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGMVD-DQTFHCLQSPQRMSIL 310

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 311 NLDPSVAL 318


>gi|410989157|ref|XP_004000831.1| PREDICTED: cysteine protease ATG4A isoform 1 [Felis catus]
          Length = 398

 Score =  283 bits (725), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 138/308 (44%), Positives = 197/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLC-------------TTNKRASSN---------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C             T     +SN         P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTVGESTPGTLNASNQSRGTFACCPAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q     +   +++    D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQ----TFVNTEENGTVDDQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|345329187|ref|XP_003431344.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4A-like
           [Ornithorhynchus anatinus]
          Length = 436

 Score =  283 bits (725), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 196/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 83  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWCWEK 142

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
           + K+ E Y KIL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 143 HKKQPEEYHKILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 202

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ----------------------WQPL 170
           W+S+  +V++DNT+V+  +KK+C    + S   Q                      W+PL
Sbjct: 203 WNSLAVYVSMDNTVVIEDIKKMCRLLPQGSGMAQDGPPLHLSALGRSKNASGYCAIWKPL 262

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INP+YI+  K+C                           F  PQSLG 
Sbjct: 263 LLIVPLRLGINHINPIYIDAFKEC---------------------------FKTPQSLGA 295

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++GN++I+LDPHT Q      D E++ +   D ++HC QA  R+ I+
Sbjct: 296 LGGKPNNAYYFIGFLGNELIYLDPHTTQTF---VDTEENGQVD-DHSFHCQQAPQRMKIM 351

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 352 NLDPSVAL 359


>gi|301780424|ref|XP_002925628.1| PREDICTED: cysteine protease ATG4A-like [Ailuropoda melanoleuca]
          Length = 429

 Score =  283 bits (725), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 195/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 75  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 134

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 135 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 194

Query: 133 WSSIVFHVALDNTLVVNQVKKLC----------------TTNKRASSN------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                T N    S       P W+PL
Sbjct: 195 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPL 254

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 255 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 287

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R+ IL
Sbjct: 288 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMSIL 343

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 344 NLDPSVAL 351


>gi|281342750|gb|EFB18334.1| hypothetical protein PANDA_015152 [Ailuropoda melanoleuca]
          Length = 373

 Score =  283 bits (725), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 195/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 42  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 101

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 102 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 161

Query: 133 WSSIVFHVALDNTLVVNQVKKLC----------------TTNKRASSN------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                T N    S       P W+PL
Sbjct: 162 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPL 221

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 222 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 254

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R+ IL
Sbjct: 255 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMSIL 310

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 311 NLDPSVAL 318


>gi|397497900|ref|XP_003819741.1| PREDICTED: cysteine protease ATG4A isoform 1 [Pan paniscus]
          Length = 398

 Score =  283 bits (724), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 137/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
           W+S+  +V++DNT+V+  +KK+C     ++  P                       W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTPGDRPPDSLTASNQSKGTSAYCSAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|354500801|ref|XP_003512485.1| PREDICTED: cysteine protease ATG4A-like [Cricetulus griseus]
 gi|344251116|gb|EGW07220.1| Cysteine protease ATG4A [Cricetulus griseus]
          Length = 398

 Score =  283 bits (723), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 137/308 (44%), Positives = 194/308 (62%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTT----------------------NKRASSNPQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                           +++ P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAHTADESPPDSLPASSQGKGPSATCPAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVYI   K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYIEAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q     +   ++S    D T+HC Q+  R+ IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQT----FVDTEESGIVDDETFHCLQSPQRMSIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|62860068|ref|NP_001016619.1| autophagy related 4A, cysteine peptidase [Xenopus (Silurana)
           tropicalis]
 gi|89269917|emb|CAJ81691.1| APG4 autophagy 4 homolog A (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
 gi|171846953|gb|AAI61565.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
 gi|213625518|gb|AAI70776.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
 gi|213627145|gb|AAI70802.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
          Length = 395

 Score =  282 bits (721), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 137/305 (44%), Positives = 189/305 (61%), Gaps = 57/305 (18%)

Query: 15  IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
           ++ DI SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW+W  +
Sbjct: 49  LQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWRWEKH 108

Query: 75  SKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
            +  E Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+W
Sbjct: 109 KEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 168

Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ--------------------WQPLVLV 173
           +S+  +V++DNT+V+  +K +C     + S  Q                    W+PL+L+
Sbjct: 169 NSLAVYVSMDNTVVIEDIKTMCKYQPHSCSMAQAASYQSTWSRCRDASGQSSGWRPLLLI 228

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PLRLGI  INPVY++  K C                           F  PQSLG +GG
Sbjct: 229 VPLRLGINHINPVYVDAFKAC---------------------------FKMPQSLGALGG 261

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
           KPNHA YFIG+ G+++I+LDPHT Q      D++Q        TYHC +  + + +L++D
Sbjct: 262 KPNHAYYFIGFSGDEIIYLDPHTTQTFVDTEDQDQ--------TYHCQKGPNSMKVLNLD 313

Query: 293 PSIAV 297
           PS+A+
Sbjct: 314 PSVAL 318


>gi|432853687|ref|XP_004067831.1| PREDICTED: cysteine protease ATG4B-like [Oryzias latipes]
          Length = 390

 Score =  282 bits (721), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 144/308 (46%), Positives = 188/308 (61%), Gaps = 54/308 (17%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           EQ+  D+ SRLWFTYRK F PIG +G T+D GWGCMLRCGQM++A+AL+  HLGRDW+W 
Sbjct: 40  EQLLSDVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILAEALMCRHLGRDWRWA 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ E Y+ IL  F D++ + YSIHQIA  G  EGK +G+W+GPNTVAQVL+KLA +D
Sbjct: 100 RGRRQREEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCT----TNKRASS-----------------NPQWQPL 170
            WS +  HVA+DNT+++ ++K+LC        R  +                    W+PL
Sbjct: 160 TWSRLAVHVAMDNTVIIEEIKRLCMPWLDIGDREEAGELNGCLEGACALVEEETALWKPL 219

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           VL+IPLRLG+ DIN  YI+ +K+C+ L                           PQSLGV
Sbjct: 220 VLLIPLRLGLSDINEAYIDTLKQCFML---------------------------PQSLGV 252

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
           IGGKPN A YFIGYVG ++I+LDPHT Q      +  +D +   D TYHC     R+HI 
Sbjct: 253 IGGKPNSAHYFIGYVGEELIYLDPHTTQP---AVEPSEDGQVP-DETYHCQHPPCRMHIC 308

Query: 290 HMDPSIAV 297
            +DPSIA 
Sbjct: 309 ELDPSIAA 316


>gi|332226092|ref|XP_003262223.1| PREDICTED: cysteine protease ATG4A isoform 1 [Nomascus leucogenys]
          Length = 398

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 138/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCT----TNKRASSNP------------------QWQPL 170
           W+S+  +V++DNT+V+  +KK+C     +   A   P                   W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|30795252|ref|NP_443168.2| cysteine protease ATG4A isoform a [Homo sapiens]
 gi|426397036|ref|XP_004064734.1| PREDICTED: cysteine protease ATG4A isoform 1 [Gorilla gorilla
           gorilla]
 gi|61211859|sp|Q8WYN0.1|ATG4A_HUMAN RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
           cysteine endopeptidase; AltName: Full=Autophagin-2;
           AltName: Full=Autophagy-related cysteine endopeptidase
           2; AltName: Full=Autophagy-related protein 4 homolog A;
           Short=hAPG4A
 gi|18181956|dbj|BAB83889.1| Apg4A [Homo sapiens]
 gi|27763979|emb|CAD43218.1| autophagin-2 [Homo sapiens]
 gi|38197608|gb|AAH61696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
 gi|119623094|gb|EAX02689.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|189069378|dbj|BAG37044.1| unnamed protein product [Homo sapiens]
 gi|312151352|gb|ADQ32188.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [synthetic
           construct]
          Length = 398

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C        T   R      +SN           W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|350537069|ref|NP_001233457.1| cysteine protease ATG4A [Pan troglodytes]
 gi|343958112|dbj|BAK62911.1| cysteine protease ATG4A [Pan troglodytes]
 gi|410207960|gb|JAA01199.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
 gi|410248796|gb|JAA12365.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
 gi|410290856|gb|JAA24028.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
 gi|410329967|gb|JAA33930.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
          Length = 398

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 137/308 (44%), Positives = 194/308 (62%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
           W+S+  +V++DNT+V+  +KK+C     +   P                       W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSIDTPGDRPPDSLTASNQSKGTSAYCSAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|119623100|gb|EAX02695.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_f
           [Homo sapiens]
          Length = 402

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 48  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 107

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 108 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 167

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C        T   R      +SN           W+PL
Sbjct: 168 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPL 227

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 228 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 260

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 261 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 316

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 317 NLDPSVAL 324


>gi|15487240|emb|CAC69076.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
          Length = 398

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSNPQ---------WQPL 170
           W+S+  +V++DNT+V+  +KK+C        T   R      +SN           W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCTAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|163914473|ref|NP_001106295.1| APG4A protein [Xenopus laevis]
 gi|161611704|gb|AAI55873.1| APG4A protein [Xenopus laevis]
          Length = 395

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 137/305 (44%), Positives = 193/305 (63%), Gaps = 53/305 (17%)

Query: 15  IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
           ++ DI SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDWQW  +
Sbjct: 45  LQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWEKH 104

Query: 75  SKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
            +  E Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+W
Sbjct: 105 KEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 164

Query: 134 SSIVFHVALDNTLVVNQVKKLC------------TTNKRASSNPQ--------WQPLVLV 173
           +S+  +V++DNT+V+  +K +C             +++R  S  +        W+PL+L+
Sbjct: 165 NSLAVYVSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLEQSSGWRPLLLI 224

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PLRLGI  INPVY++  K C                           F  PQSLG +GG
Sbjct: 225 VPLRLGINHINPVYVDAFKAC---------------------------FKMPQSLGALGG 257

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
           KPNHA YFIG+ G+++I+LDPHT Q     + + +++    D TYHC +  + + +L +D
Sbjct: 258 KPNHAYYFIGFSGDEIIYLDPHTTQ----TFVETEEAGTVQDQTYHCQKGPNSMKVLKLD 313

Query: 293 PSIAV 297
           PS+A+
Sbjct: 314 PSVAL 318


>gi|50417810|gb|AAH78135.1| APG4A protein, partial [Xenopus laevis]
          Length = 392

 Score =  281 bits (719), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 137/305 (44%), Positives = 193/305 (63%), Gaps = 53/305 (17%)

Query: 15  IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
           ++ DI SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDWQW  +
Sbjct: 42  LQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWEKH 101

Query: 75  SKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
            +  E Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+W
Sbjct: 102 KEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 161

Query: 134 SSIVFHVALDNTLVVNQVKKLC------------TTNKRASSNPQ--------WQPLVLV 173
           +S+  +V++DNT+V+  +K +C             +++R  S  +        W+PL+L+
Sbjct: 162 NSLAVYVSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLEQSSGWRPLLLI 221

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PLRLGI  INPVY++  K C                           F  PQSLG +GG
Sbjct: 222 VPLRLGINHINPVYVDAFKAC---------------------------FKMPQSLGALGG 254

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
           KPNHA YFIG+ G+++I+LDPHT Q     + + +++    D TYHC +  + + +L +D
Sbjct: 255 KPNHAYYFIGFSGDEIIYLDPHTTQ----TFVETEEAGTVQDQTYHCQKGPNSMKVLKLD 310

Query: 293 PSIAV 297
           PS+A+
Sbjct: 311 PSVAL 315


>gi|74136555|ref|NP_777364.3| cysteine protease ATG4A [Mus musculus]
 gi|61211821|sp|Q8C9S8.2|ATG4A_MOUSE RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
           cysteine endopeptidase; AltName: Full=Autophagin-2;
           AltName: Full=Autophagy-related cysteine endopeptidase
           2; AltName: Full=Autophagy-related protein 4 homolog A
 gi|59809037|gb|AAH89500.1| Atg4a protein [Mus musculus]
 gi|74193939|dbj|BAE36898.1| unnamed protein product [Mus musculus]
          Length = 396

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 135/305 (44%), Positives = 194/305 (63%), Gaps = 52/305 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQV++KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP-------------------QWQPLVLV 173
           W+S+  +V++DNT+V+  +KK+C      +++P                    W+PL+L+
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLI 223

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PLRLGI  INPVY+   K+C                           F  PQSLG +GG
Sbjct: 224 VPLRLGINQINPVYVEAFKEC---------------------------FKMPQSLGALGG 256

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
           KPN+A YFIG++G+++IFLDPHT Q     +   ++S    D T+HC Q+  R+ IL++D
Sbjct: 257 KPNNAYYFIGFLGDELIFLDPHTTQT----FVDIEESGLVDDQTFHCLQSPQRMSILNLD 312

Query: 293 PSIAV 297
           PS+A+
Sbjct: 313 PSVAL 317


>gi|349605276|gb|AEQ00569.1| Cysteine protease ATG4A-like protein, partial [Equus caballus]
          Length = 369

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 138/308 (44%), Positives = 198/308 (64%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRD  W  
Sbjct: 15  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDLNWEK 74

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 75  QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 134

Query: 133 WSSIVFHVALDNTLVVNQVKKLC-------------------TTNKRASSN---PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C                    +N+  S++   P W+PL
Sbjct: 135 WNSLAVYVSMDNTVVIEDIKKMCCILPLSADTAGESPPSSLNASNRSKSTSAGWPAWKPL 194

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 195 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 227

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 228 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 283

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 284 NLDPSVAL 291


>gi|440891575|gb|ELR45180.1| Cysteine protease ATG4A, partial [Bos grunniens mutus]
          Length = 408

 Score =  281 bits (718), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 137/305 (44%), Positives = 192/305 (62%), Gaps = 55/305 (18%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 42  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 101

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 102 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 161

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
           W+S+  +V++DNT+V+  +KK+C T   ++  P                       W+PL
Sbjct: 162 WNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPAERPLESLTASNQSKGPSACCTAWKPL 221

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 222 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 254

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q   R++IL
Sbjct: 255 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQPPQRMNIL 310

Query: 290 HMDPS 294
           ++DPS
Sbjct: 311 NLDPS 315


>gi|146387686|pdb|2P82|A Chain A, Cysteine Protease Atg4a
 gi|146387687|pdb|2P82|B Chain B, Cysteine Protease Atg4a
 gi|146387688|pdb|2P82|C Chain C, Cysteine Protease Atg4a
 gi|146387689|pdb|2P82|D Chain D, Cysteine Protease Atg4a
          Length = 355

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 40  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 99

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 100 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 159

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C        T   R      +SN           W+PL
Sbjct: 160 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPL 219

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 220 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 252

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL
Sbjct: 253 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 308

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 309 NLDPSVAL 316


>gi|213626921|gb|AAI70397.1| APG4A protein [Xenopus laevis]
          Length = 395

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 139/305 (45%), Positives = 192/305 (62%), Gaps = 53/305 (17%)

Query: 15  IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
           ++ DI SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDWQW  +
Sbjct: 45  LQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALICQHLGRDWQWEKH 104

Query: 75  SKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
            +  E Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+W
Sbjct: 105 KEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 164

Query: 134 SSIVFHVALDNTLVVNQVKKLC------------TTNKRASSNPQ--------WQPLVLV 173
           +S+  +V++DNT+V+  +K +C             +++R  S  +        W+PL+L+
Sbjct: 165 NSLAVYVSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLEQSSGWRPLLLI 224

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PLRLGI  INPVY++  K C                           F  PQSLG +GG
Sbjct: 225 VPLRLGINHINPVYVDAFKAC---------------------------FKMPQSLGALGG 257

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
           KPNHA YFIG+ G+++I+LDPHT Q      D E+    + D TYHC +  + + +L +D
Sbjct: 258 KPNHAYYFIGFSGDEIIYLDPHTTQTF---VDTEEAGTVQ-DQTYHCQKGPNSMKVLKLD 313

Query: 293 PSIAV 297
           PS+A+
Sbjct: 314 PSVAL 318


>gi|321472665|gb|EFX83634.1| hypothetical protein DAPPUDRAFT_194862 [Daphnia pulex]
          Length = 389

 Score =  279 bits (714), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 147/302 (48%), Positives = 204/302 (67%), Gaps = 36/302 (11%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           L+++  D+ SRL  TYR+ F PIGDSG+T+D+GWGCMLRCGQMV+AQAL+  HLGR   W
Sbjct: 66  LDELNSDVKSRLLLTYRRNFPPIGDSGMTSDRGWGCMLRCGQMVVAQALINQHLGRQPFW 125

Query: 72  NVNSKE---EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
            V   +   E+Y KILK+FED++TA YSIHQ+A  G SEGK +G+WFGPNTVAQVL+KL+
Sbjct: 126 PVGDDQRTTESYKKILKLFEDKKTAVYSIHQLAQMGVSEGKEIGQWFGPNTVAQVLKKLS 185

Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYI 188
           +YD+WS++  HVA+DN +V+ ++++LC      +    W PL+LV+PLRLG+ +INP+YI
Sbjct: 186 EYDEWSALKIHVAMDNAVVIEEIEQLCHKKITPTETSTWSPLLLVVPLRLGLLNINPIYI 245

Query: 189 NGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
           + +K C  +                           PQS+G+IGGKP+ ALYFIGYVG+D
Sbjct: 246 DSLKACLQM---------------------------PQSIGMIGGKPSQALYFIGYVGDD 278

Query: 249 VIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV-SQRSYSDYK 307
           V+FLDPH  QN   + + E D     DS+YH    +R+    MDPS+AV  S  ++S++K
Sbjct: 279 VVFLDPHLTQNAIDLDEDEFD-----DSSYHPATCARISFQSMDPSLAVCFSCTTHSEWK 333

Query: 308 NV 309
           ++
Sbjct: 334 DL 335


>gi|395851538|ref|XP_003798310.1| PREDICTED: cysteine protease ATG4B [Otolemur garnettii]
          Length = 393

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E++  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  EELLSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCQHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ------------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+    + P                         W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSLPCGTAPASSAAPDQHCNGFPAGAEVTTRLSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K+C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINAAYVETLKRC---------------------------FRMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EATDSCLVPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIGELDPSIAV 319


>gi|27763985|emb|CAD43221.1| autophagin-2 [Mus musculus]
 gi|148675648|gb|EDL07595.1| mCG64870 [Mus musculus]
          Length = 396

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 134/305 (43%), Positives = 193/305 (63%), Gaps = 52/305 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQV++KL  +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLTLFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP-------------------QWQPLVLV 173
           W+S+  +V++DNT+V+  +KK+C      +++P                    W+PL+L+
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAADPAGDFLTVSNQSRDTSVPCSAWKPLLLI 223

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PLRLGI  INPVY+   K+C                           F  PQSLG +GG
Sbjct: 224 VPLRLGINQINPVYVEAFKEC---------------------------FKMPQSLGALGG 256

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
           KPN+A YFIG++G+++IFLDPHT Q     +   ++S    D T+HC Q+  R+ IL++D
Sbjct: 257 KPNNAYYFIGFLGDELIFLDPHTTQT----FVDIEESGLVDDQTFHCLQSPQRMSILNLD 312

Query: 293 PSIAV 297
           PS+A+
Sbjct: 313 PSVAL 317


>gi|197100863|ref|NP_001126588.1| cysteine protease ATG4A [Pongo abelii]
 gi|61211744|sp|Q5R699.1|ATG4A_PONAB RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|55732020|emb|CAH92717.1| hypothetical protein [Pongo abelii]
          Length = 398

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 139/308 (45%), Positives = 196/308 (63%), Gaps = 55/308 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
           W+S+  +V++DNT+V+  +KK+C        T   R      +SN           W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLGADTAGDRPPDSLTASNLSKGTSAYCSAWKPL 223

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +L++PLRLGI  INPVY++  K+C                           F  PQSLG 
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
           +GGKPN+A YFIG++G+++IFLDPHT Q      D  ++     D T+HC Q+  R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTGENGTVN-DQTFHCLQSPQRMNIL 312

Query: 290 HMDPSIAV 297
           ++DPS+A+
Sbjct: 313 NLDPSVAL 320


>gi|345307034|ref|XP_001513122.2| PREDICTED: cysteine protease ATG4B-like [Ornithorhynchus anatinus]
          Length = 461

 Score =  278 bits (711), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 145/310 (46%), Positives = 188/310 (60%), Gaps = 56/310 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E I  D+TSRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQALL  HLGRDW+W 
Sbjct: 109 EDILSDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHLGRDWRWK 168

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ + Y  +L  F D++ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 169 KGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAAFD 228

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN--KRASSNP---------------------QWQ 168
            WSS+  H+A+DNT+V+ ++++LC  N    AS+ P                     QW+
Sbjct: 229 TWSSLAVHIAMDNTVVIEEIRRLCKPNFPAGASAFPTDSEFLLNGFPSGAEVTNRPTQWK 288

Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
           PLVL+IPLRLG+ +IN  YI  +K C                           F  PQSL
Sbjct: 289 PLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQSL 321

Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLH 287
           GVIGGKPN A YFIGYVG ++I+LDPHT Q    +      S    D ++HC     R++
Sbjct: 322 GVIGGKPNSAHYFIGYVGGELIYLDPHTTQPAVEI----SGSCFIPDESFHCQHPPCRMN 377

Query: 288 ILHMDPSIAV 297
           I+ +DPSIAV
Sbjct: 378 IVELDPSIAV 387


>gi|26334447|dbj|BAC30924.1| unnamed protein product [Mus musculus]
          Length = 396

 Score =  278 bits (711), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 134/305 (43%), Positives = 193/305 (63%), Gaps = 52/305 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQV++KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP-------------------QWQPLVLV 173
           W+S+  + ++DNT+V+  +KK+C      +++P                    W+PL+L+
Sbjct: 164 WNSLAVYDSMDNTVVIEDIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLI 223

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PLRLGI  INPVY+   K+C                           F  PQSLG +GG
Sbjct: 224 VPLRLGINQINPVYVEAFKEC---------------------------FKMPQSLGALGG 256

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
           KPN+A YFIG++G+++IFLDPHT Q    +    ++S    D T+HC Q+  R+ IL++D
Sbjct: 257 KPNNAYYFIGFLGDELIFLDPHTTQTFVDI----EESGLVDDQTFHCLQSPQRMSILNLD 312

Query: 293 PSIAV 297
           PS+A+
Sbjct: 313 PSVAL 317


>gi|148237097|ref|NP_001082821.1| cysteine protease ATG4B [Danio rerio]
 gi|141795460|gb|AAI34887.1| Atg4b protein [Danio rerio]
          Length = 394

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 195/324 (60%), Gaps = 61/324 (18%)

Query: 2   RHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL 61
           R  + L+ +D   I  D+TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ +AL+
Sbjct: 31  RQFSALTEKD--DILADVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALI 88

Query: 62  FLHLGRDWQWNVNSKEE-AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
             HLGRDW+W+   ++   Y+ IL  F D++ + YSIHQIA  G  EGK++G+W+GPNTV
Sbjct: 89  CRHLGRDWKWSPGQRQRPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTV 148

Query: 121 AQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT----NKRA---SSNPQ------- 166
           AQVL+KLA +D WS +  HVA+DNT+V+ ++K+LC      ++ A   S  P+       
Sbjct: 149 AQVLKKLAVFDSWSRLAVHVAMDNTVVIEEIKRLCMPWLDFDRGACAVSEEPREMNGDLE 208

Query: 167 ------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                       W+PLVL+IPLRLG+ DIN  YI  +K+C                    
Sbjct: 209 GACALAEEETALWKPLVLLIPLRLGLSDINEAYIEPLKQC-------------------- 248

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                  F  PQSLGVIGGKPN A YFIG+VG+++I+LDPHT Q      D  +D     
Sbjct: 249 -------FMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDPHTTQP---AVDPSEDGHFP- 297

Query: 275 DSTYHCPQ-ASRLHILHMDPSIAV 297
           D +YHC     R+HI  +DPSIA 
Sbjct: 298 DDSYHCQHPPCRMHICELDPSIAA 321


>gi|53132082|emb|CAG31871.1| hypothetical protein RCJMB04_12m14 [Gallus gallus]
          Length = 343

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 144/313 (46%), Positives = 189/313 (60%), Gaps = 58/313 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+I  D+TSRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 39  EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWI 98

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ + Y  +L  F D++ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99  KGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFD 158

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------- 166
            WSS+  H+A+DNT+V+ ++++LC +N     A++ P                       
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPAVEADVLYNGYPEEAGVRDKLSL 218

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W+PLVL+IPLRLG+ +IN  YI  +K C                           F  PQ
Sbjct: 219 WKPLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQ 251

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
           SLGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC     R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPSDSGCLPDESFHCQHPPCR 307

Query: 286 LHILHMDPSIAVV 298
           + I  +DPSIAVV
Sbjct: 308 MSIAELDPSIAVV 320


>gi|66773074|ref|NP_001019605.1| cysteine protease ATG4A [Danio rerio]
 gi|66267494|gb|AAH95617.1| Zgc:111958 [Danio rerio]
          Length = 375

 Score =  277 bits (709), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 138/289 (47%), Positives = 188/289 (65%), Gaps = 36/289 (12%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  D+ SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW+W+ 
Sbjct: 41  ELLSDVRSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMILAQALICSHLGRDWRWDP 100

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
              + + Y +IL  F D++ + YSIHQ+A  G  EGK+VGEW+GPNTVAQVL+KLA +DD
Sbjct: 101 EKHQPKEYQRILDCFLDKKDSCYSIHQMAQMGVGEGKSVGEWYGPNTVAQVLKKLALFDD 160

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTN--KRASSNP-QWQPLVLVIPLRLGIQDINPVYIN 189
           W+S+  +V++DNT+V+  +KKLC     +  S  P  W+PL+LVIPLR+GI  INPVYI 
Sbjct: 161 WNSLSVYVSMDNTVVIEDIKKLCVRADLQLQSQQPLDWRPLLLVIPLRMGINSINPVYIQ 220

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+C                           F  PQS GV+GGKPN A YFIG++ +++
Sbjct: 221 ALKEC---------------------------FKMPQSCGVLGGKPNLAYYFIGFIDDEL 253

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RLHILHMDPSIAV 297
           I+LDPHT Q      D E  S    D ++HC +   R+ I  +DPS+A+
Sbjct: 254 IYLDPHTTQQ---AVDTESGSAVD-DQSFHCQRTPHRMKITSLDPSVAL 298


>gi|50369556|gb|AAH76463.1| Atg4b protein, partial [Danio rerio]
          Length = 393

 Score =  276 bits (707), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 144/313 (46%), Positives = 190/313 (60%), Gaps = 59/313 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           + I  D+TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ +AL+  HLGRDW+W+
Sbjct: 39  DDILADVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWS 98

Query: 73  VNSKEE-AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++   Y+ IL  F D++ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99  PGQRQRPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 158

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTT----NKRA---SSNPQ------------------ 166
            WS +  HVA+DNT+V+ ++K+LC      ++ A   S  P+                  
Sbjct: 159 SWSRLAVHVAMDNTVVIEEIKRLCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETA 218

Query: 167 -WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
            W+PLVL+IPLRLG+ DIN  YI  +K+C                           F  P
Sbjct: 219 LWKPLVLLIPLRLGLSDINEAYIEPLKQC---------------------------FMMP 251

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-AS 284
           QSLGVIGGKPN A YFIG+VG+++I+LDPHT Q      D  +D     D +YHC     
Sbjct: 252 QSLGVIGGKPNSAHYFIGFVGDELIYLDPHTTQP---AVDPSEDGHFP-DDSYHCQHPPC 307

Query: 285 RLHILHMDPSIAV 297
           R+HI  +DPSIA 
Sbjct: 308 RMHICELDPSIAA 320


>gi|61211768|sp|Q6DG88.2|ATG4B_DANRE RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B
          Length = 394

 Score =  276 bits (707), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 144/313 (46%), Positives = 190/313 (60%), Gaps = 59/313 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           + I  D+TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ +AL+  HLGRDW+W+
Sbjct: 40  DDILADVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWS 99

Query: 73  VNSKEE-AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++   Y+ IL  F D++ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 PGQRQRPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTT----NKRA---SSNPQ------------------ 166
            WS +  HVA+DNT+V+ ++K+LC      ++ A   S  P+                  
Sbjct: 160 SWSRLAVHVAMDNTVVIEEIKRLCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETA 219

Query: 167 -WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
            W+PLVL+IPLRLG+ DIN  YI  +K+C                           F  P
Sbjct: 220 LWKPLVLLIPLRLGLSDINEAYIEPLKQC---------------------------FMMP 252

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-AS 284
           QSLGVIGGKPN A YFIG+VG+++I+LDPHT Q      D  +D     D +YHC     
Sbjct: 253 QSLGVIGGKPNSAHYFIGFVGDELIYLDPHTTQP---AVDPSEDGHFP-DDSYHCQHPPC 308

Query: 285 RLHILHMDPSIAV 297
           R+HI  +DPSIA 
Sbjct: 309 RMHICELDPSIAA 321


>gi|301775535|ref|XP_002923195.1| PREDICTED: cysteine protease ATG4B-like [Ailuropoda melanoleuca]
          Length = 405

 Score =  276 bits (706), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 189/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F PIG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 52  DEILSDVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 111

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 112 QRKRQPDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 171

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCT--------------TNKRASSNPQ----------W 167
            WS++  H+A+DNT+V+  +++LC+              +++  +  P           W
Sbjct: 172 TWSALAVHIAMDNTVVMEDIRRLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPW 231

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K+C                           F  PQS
Sbjct: 232 RPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FMMPQS 264

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGY G ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 265 LGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAV----ELTDSCFIADESFHCRHPPSRM 320

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 321 SIGELDPSIAV 331


>gi|332266032|ref|XP_003282019.1| PREDICTED: cysteine protease ATG4B [Nomascus leucogenys]
          Length = 518

 Score =  276 bits (706), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 142/312 (45%), Positives = 188/312 (60%), Gaps = 57/312 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 163 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 222

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 223 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 282

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 283 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 342

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 343 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 375

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 376 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 431

Query: 287 HILHMDPSIAVV 298
            I  +DPSIAVV
Sbjct: 432 SIAELDPSIAVV 443


>gi|47087191|ref|NP_998738.1| cysteine protease ATG4B [Gallus gallus]
 gi|61211779|sp|Q6PZ02.1|ATG4B_CHICK RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related cysteine endopeptidase 2B;
           Short=Autophagin-2B; Short=cAut2B; AltName:
           Full=Autophagy-related protein 4 homolog B
 gi|45861662|gb|AAS78584.1| AUT2B [Gallus gallus]
          Length = 393

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 143/312 (45%), Positives = 188/312 (60%), Gaps = 58/312 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+I  D+TSRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 39  EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWI 98

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ + Y  +L  F D++ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99  KGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFD 158

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------- 166
            WSS+  H+A+DNT+V+ ++++LC +N     A++ P                       
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPAVEADVLYNGYPEEAGVRDKLSL 218

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W+PLVL+IPLRLG+ +IN  YI  +K C                           F  PQ
Sbjct: 219 WKPLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQ 251

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
           SLGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC     R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPSDSGCLPDESFHCQHPPCR 307

Query: 286 LHILHMDPSIAV 297
           + I  +DPSIAV
Sbjct: 308 MSIAELDPSIAV 319


>gi|156395764|ref|XP_001637280.1| predicted protein [Nematostella vectensis]
 gi|156224391|gb|EDO45217.1| predicted protein [Nematostella vectensis]
          Length = 368

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 136/288 (47%), Positives = 181/288 (62%), Gaps = 45/288 (15%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+  +  D+ SR+W TYRK F  IG +G TTD GWGCMLRCGQM++AQAL+  HLGRDWQ
Sbjct: 43  DMGYLNTDVRSRIWLTYRKNFPKIGGTGPTTDSGWGCMLRCGQMMLAQALVCRHLGRDWQ 102

Query: 71  WN-VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           W+  N+    Y++IL+ F D++ + YSIHQIA  G SEGKAVG WFGPNTVAQVL+KL+ 
Sbjct: 103 WDPENNTTPEYMQILEAFLDKKDSLYSIHQIAQMGVSEGKAVGSWFGPNTVAQVLKKLSA 162

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
           +DDWSS+  HVA+DNT+++  +               W+PLVL IPLRLG+ ++N VY  
Sbjct: 163 FDDWSSLCLHVAMDNTVIIEDISN-------------WRPLVLFIPLRLGLTEMNVVYNE 209

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K C                           FTF QSLG+IGG+PNHA YFIGY GN++
Sbjct: 210 PLKAC---------------------------FTFKQSLGIIGGRPNHATYFIGYFGNNL 242

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPHT Q          +  +  D ++HC    R++I  +DPS+A+
Sbjct: 243 VYLDPHTTQQTV----NPDELSRIPDGSFHCVYPCRMNIADVDPSVAL 286


>gi|326925776|ref|XP_003209085.1| PREDICTED: cysteine protease ATG4B-like [Meleagris gallopavo]
          Length = 393

 Score =  275 bits (704), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 143/312 (45%), Positives = 188/312 (60%), Gaps = 58/312 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+I  D+TSRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 39  EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWI 98

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ + Y  +L  F D++ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99  KGKRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFD 158

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------- 166
            WSS+  H+A+DNT+V+ ++++LC +N     A++ P                       
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPTVEADVLYNGYPEEAGVRDKLSL 218

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W+PLVL+IPLRLG+ +IN  YI  +K C                           F  PQ
Sbjct: 219 WKPLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQ 251

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
           SLGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC     R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPSDSGCLPDESFHCQHPPCR 307

Query: 286 LHILHMDPSIAV 297
           + I  +DPSIAV
Sbjct: 308 MSIAELDPSIAV 319


>gi|449266947|gb|EMC77925.1| Cysteine protease ATG4B, partial [Columba livia]
          Length = 393

 Score =  275 bits (704), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 143/312 (45%), Positives = 187/312 (59%), Gaps = 58/312 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+I  D+TSRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 39  EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWI 98

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ + Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99  KGKRQVDNYFSVLNAFVDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFD 158

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------------SSNPQ---------- 166
            WSS+  H+A+DNT+V+ ++++LC +N                  +  P+          
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCQSNAPCAGAAACPAVESDGLYNGCPEEAGVRDRRSL 218

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W+PLVL+IPLRLG+ +IN  YI  +K C                           F  PQ
Sbjct: 219 WKPLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQ 251

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
           SLGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC     R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EHNDSGCLPDESFHCQHPPCR 307

Query: 286 LHILHMDPSIAV 297
           + I  +DPSIAV
Sbjct: 308 MSIAELDPSIAV 319


>gi|417410350|gb|JAA51650.1| Putative cysteine protease required for autophagy, partial
           [Desmodus rotundus]
          Length = 394

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 188/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQALL  HLGRDW+W 
Sbjct: 41  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHLGRDWRWT 100

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 101 QRKRQPDSYFHVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 160

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WS++  HVA+DNT+V+  +++LC ++     AS+ P                      W
Sbjct: 161 TWSALAVHVAMDNTVVMEDIRRLCRSSLPCAGASAFPADSEGHCNGFPARAEVTNRPSPW 220

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 221 RPLVLLIPLRLGLTDINEAYVETLKGC---------------------------FMMPQS 253

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 254 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EFTDSCSIPDESFHCQHPPSRM 309

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 310 SIGELDPSIAV 320


>gi|281340990|gb|EFB16574.1| hypothetical protein PANDA_012287 [Ailuropoda melanoleuca]
          Length = 369

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 189/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F PIG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 37  DEILSDVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 96

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 97  QRKRQPDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 156

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCT--------------TNKRASSNPQ----------W 167
            WS++  H+A+DNT+V+  +++LC+              +++  +  P           W
Sbjct: 157 TWSALAVHIAMDNTVVMEDIRRLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPW 216

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K+C                           F  PQS
Sbjct: 217 RPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FMMPQS 249

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGY G ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 250 LGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPA----VELTDSCFIADESFHCRHPPSRM 305

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 306 SIGELDPSIAV 316


>gi|119591686|gb|EAW71280.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
          Length = 354

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 142/312 (45%), Positives = 188/312 (60%), Gaps = 57/312 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAVV 298
            I  +DPSIAVV
Sbjct: 309 SIAELDPSIAVV 320


>gi|20071131|gb|AAH27184.1| Autophagy-related 4B (yeast) [Mus musculus]
 gi|26353914|dbj|BAC40587.1| unnamed protein product [Mus musculus]
 gi|74188242|dbj|BAE25791.1| unnamed protein product [Mus musculus]
          Length = 393

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYR+ F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------KRASSNPQ-W 167
            WSS+  H+A+DNT+V+ ++++LC  N                          ++ P  W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPSRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 GIGELDPSIAV 319


>gi|27883848|ref|NP_777363.1| cysteine protease ATG4B [Mus musculus]
 gi|26324650|dbj|BAC26079.1| unnamed protein product [Mus musculus]
 gi|26327423|dbj|BAC27455.1| unnamed protein product [Mus musculus]
 gi|26344632|dbj|BAC35965.1| unnamed protein product [Mus musculus]
 gi|27763983|emb|CAD43220.1| autophagin-1 [Mus musculus]
          Length = 393

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYR+ F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------KRASSNPQ-W 167
            WSS+  H+A+DNT+V+ ++++LC  N                          ++ P  W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRANLPCVGAAALPTDSERHCNGFPAGAEVTNRPSAW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPSRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 GIGELDPSIAV 319


>gi|61211813|sp|Q8BGE6.2|ATG4B_MOUSE RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
           cysteine endopeptidase; AltName: Full=Autophagin-1;
           AltName: Full=Autophagy-related cysteine endopeptidase
           1; AltName: Full=Autophagy-related protein 4 homolog B
          Length = 393

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYR+ F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------KRASSNPQ-W 167
            WSS+  H+A+DNT+V+ ++++LC  N                          ++ P  W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPSRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 GIGELDPSIAV 319


>gi|410036440|ref|XP_003309622.2| PREDICTED: cysteine protease ATG4B isoform 5 [Pan troglodytes]
          Length = 509

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 142/316 (44%), Positives = 187/316 (59%), Gaps = 67/316 (21%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 156 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 215

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 216 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 275

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 276 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 335

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 336 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 368

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-----GCVYDKEQDSEKKLDSTYHCPQ 282
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       GC            D ++HC  
Sbjct: 369 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP---------DESFHCQH 419

Query: 283 -ASRLHILHMDPSIAV 297
              R+ I  +DPSIAV
Sbjct: 420 PPCRMSIAELDPSIAV 435


>gi|119591684|gb|EAW71278.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
          Length = 415

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|14042685|dbj|BAB55353.1| unnamed protein product [Homo sapiens]
          Length = 380

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K CY +                           PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHCYMM---------------------------PQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|410036442|ref|XP_003950065.1| PREDICTED: cysteine protease ATG4B [Pan troglodytes]
          Length = 521

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 142/316 (44%), Positives = 187/316 (59%), Gaps = 67/316 (21%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 156 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 215

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 216 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 275

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 276 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 335

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 336 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 368

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-----GCVYDKEQDSEKKLDSTYHCPQ 282
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       GC            D ++HC  
Sbjct: 369 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP---------DESFHCQH 419

Query: 283 -ASRLHILHMDPSIAV 297
              R+ I  +DPSIAV
Sbjct: 420 PPCRMSIAELDPSIAV 435


>gi|148707985|gb|EDL39932.1| autophagy-related 4B (yeast), isoform CRA_a [Mus musculus]
          Length = 390

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYR+ F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 37  DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 96

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 97  QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 156

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------KRASSNPQ-W 167
            WSS+  H+A+DNT+V+ ++++LC  N                          ++ P  W
Sbjct: 157 TWSSLAVHIAMDNTVVMEEIRRLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAW 216

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 217 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 249

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 250 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPSRM 305

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 306 GIGELDPSIAV 316


>gi|194381088|dbj|BAG64112.1| unnamed protein product [Homo sapiens]
          Length = 510

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 142/316 (44%), Positives = 187/316 (59%), Gaps = 67/316 (21%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 157 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 216

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 217 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 276

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 277 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 336

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 337 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 369

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-----GCVYDKEQDSEKKLDSTYHCPQ 282
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       GC            D ++HC  
Sbjct: 370 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP---------DESFHCQH 420

Query: 283 -ASRLHILHMDPSIAV 297
              R+ I  +DPSIAV
Sbjct: 421 PPCRMSIAELDPSIAV 436


>gi|332815902|ref|XP_001162556.2| PREDICTED: cysteine protease ATG4B isoform 1 [Pan troglodytes]
          Length = 496

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 156 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 215

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 216 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 275

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 276 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 335

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 336 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 368

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 369 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 424

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 425 SIAELDPSIAV 435


>gi|343961553|dbj|BAK62366.1| cysteine protease ATG4B [Pan troglodytes]
          Length = 393

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|410206608|gb|JAA00523.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
 gi|410247746|gb|JAA11840.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
 gi|410295834|gb|JAA26517.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
 gi|410352839|gb|JAA43023.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
          Length = 393

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|432107261|gb|ELK32675.1| Cysteine protease ATG4B [Myotis davidii]
          Length = 394

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 142/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQALL  HLGRDW+W 
Sbjct: 41  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHLGRDWRWT 100

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 101 QRKRQPDSYFHVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAIFD 160

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WS++  H+A+DNT+V+  +++LC ++     A++ P                      W
Sbjct: 161 TWSALAVHIAMDNTVVMEDIRRLCRSSLPCAEATAFPADSEGHCNGLPAGAEVTNRPSLW 220

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 221 RPLVLLIPLRLGLTDINEAYVETLKGC---------------------------FMMPQS 253

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 254 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EFTDSFLIPDESFHCQHPPSRM 309

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 310 SIGELDPSIAV 320


>gi|344299096|ref|XP_003421224.1| PREDICTED: cysteine protease ATG4B [Loxodonta africana]
          Length = 420

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 186/311 (59%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQALL  HLGRDW+W 
Sbjct: 67  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHLGRDWRWA 126

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 127 QRRRQPDSYFSVLHAFIDRKDSHYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 186

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------KRASSNPQ-W 167
            WSS+  H+A+DNT+V+ ++++LC ++                         A+  P  W
Sbjct: 187 TWSSLAVHIAMDNTVVMEEIRRLCKSSTPCAGAAACPADPSQHCNGLPAGAEAAGRPSTW 246

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 247 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FRMPQS 279

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +        D T+HC     R+
Sbjct: 280 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VELAGGFSIPDETFHCQHPPCRM 335

Query: 287 HILHMDPSIAV 297
           +I  +DPSIAV
Sbjct: 336 NIAELDPSIAV 346


>gi|47132611|ref|NP_037457.3| cysteine protease ATG4B isoform a [Homo sapiens]
 gi|296434400|sp|Q9Y4P1.2|ATG4B_HUMAN RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
           cysteine endopeptidase; AltName: Full=Autophagin-1;
           AltName: Full=Autophagy-related cysteine endopeptidase
           1; AltName: Full=Autophagy-related protein 4 homolog B;
           Short=hAPG4B
 gi|62822370|gb|AAY14919.1| unknown [Homo sapiens]
          Length = 393

 Score =  273 bits (699), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|397483831|ref|XP_003813094.1| PREDICTED: cysteine protease ATG4B isoform 1 [Pan paniscus]
          Length = 481

 Score =  273 bits (699), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 142/316 (44%), Positives = 187/316 (59%), Gaps = 67/316 (21%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 128 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 187

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 188 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 247

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 248 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 307

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 308 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 340

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-----GCVYDKEQDSEKKLDSTYHCPQ 282
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       GC            D ++HC  
Sbjct: 341 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP---------DESFHCQH 391

Query: 283 -ASRLHILHMDPSIAV 297
              R+ I  +DPSIAV
Sbjct: 392 PPCRMSIAELDPSIAV 407


>gi|5262636|emb|CAB45756.1| hypothetical protein [Homo sapiens]
 gi|12653857|gb|AAH00719.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Homo sapiens]
 gi|27763981|emb|CAD43219.1| autophagin-1 [Homo sapiens]
 gi|117646318|emb|CAL38626.1| hypothetical protein [synthetic construct]
 gi|119591687|gb|EAW71281.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_d
           [Homo sapiens]
 gi|123981932|gb|ABM82795.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [synthetic
           construct]
 gi|168273130|dbj|BAG10404.1| ATG4 autophagy related 4 homolog B [synthetic construct]
          Length = 393

 Score =  273 bits (699), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|71891691|dbj|BAA76787.2| KIAA0943 protein [Homo sapiens]
          Length = 396

 Score =  273 bits (699), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 43  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 102

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 103 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 162

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 163 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 222

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 223 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 255

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 256 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 311

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 312 SIAELDPSIAV 322


>gi|397483835|ref|XP_003813096.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pan paniscus]
          Length = 405

 Score =  273 bits (699), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|88192732|pdb|2D1I|A Chain A, Structure Of Human Atg4b
 gi|88192733|pdb|2D1I|B Chain B, Structure Of Human Atg4b
          Length = 398

 Score =  273 bits (699), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 45  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 104

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 105 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 164

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 165 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 224

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 225 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 257

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 258 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 313

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 314 SIAELDPSIAV 324


>gi|78101773|pdb|2CY7|A Chain A, The Crystal Structure Of Human Atg4b
          Length = 396

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 43  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 102

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 103 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 162

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 163 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 222

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 223 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 255

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 256 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 311

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 312 SIAELDPSIAV 322


>gi|397483833|ref|XP_003813095.1| PREDICTED: cysteine protease ATG4B isoform 2 [Pan paniscus]
          Length = 468

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 128 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 187

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 188 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 247

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 248 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 307

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 308 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 340

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 341 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 396

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 397 SIAELDPSIAV 407


>gi|149711769|ref|XP_001497815.1| PREDICTED: cysteine protease ATG4B [Equus caballus]
          Length = 393

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 189/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WS++  H+A+DNT+V+  +++LC ++     A++ P                      W
Sbjct: 160 TWSALAVHIAMDNTVVMEDIRRLCRSSLPCAGAAAFPADSDRHCNGFPAGAEVTNRPAPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K+C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EFTDSCFIPDESFHCQHPPSRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIGELDPSIAV 319


>gi|34531319|dbj|BAC86110.1| unnamed protein product [Homo sapiens]
          Length = 468

 Score =  273 bits (698), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 128 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 187

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 188 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 247

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 248 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 307

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 308 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 340

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 341 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 396

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 397 SIAELDPSIAV 407


>gi|30410798|ref|NP_847896.1| cysteine protease ATG4B isoform b [Homo sapiens]
          Length = 380

 Score =  273 bits (698), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|402889930|ref|XP_003908250.1| PREDICTED: cysteine protease ATG4B [Papio anubis]
          Length = 508

 Score =  273 bits (698), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 155 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 214

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 215 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 274

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 275 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 334

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 335 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 367

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +   S    D ++HC     R+
Sbjct: 368 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTGSCFIPDESFHCQHPPCRM 423

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 424 SIAELDPSIAV 434


>gi|355565356|gb|EHH21845.1| hypothetical protein EGK_04999, partial [Macaca mulatta]
          Length = 393

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +   S    D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTGSCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|90077212|dbj|BAE88286.1| unnamed protein product [Macaca fascicularis]
          Length = 393

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +   S    D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTGSCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|380808290|gb|AFE76020.1| cysteine protease ATG4B isoform a [Macaca mulatta]
 gi|383416899|gb|AFH31663.1| cysteine protease ATG4B isoform a [Macaca mulatta]
 gi|384941198|gb|AFI34204.1| cysteine protease ATG4B isoform a [Macaca mulatta]
          Length = 393

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +   S    D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTGSCFIPDESFHCQHPPCRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|73994337|ref|XP_851977.1| PREDICTED: cysteine protease ATG4B [Canis lupus familiaris]
          Length = 394

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 41  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 100

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 101 QRKRQPDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 160

Query: 132 DWSSIVFHVALDNTLVVNQVKKLC--------------TTNKRASSNPQ----------W 167
            WS++  H+A+DNT+V+  +++LC               +++  +  P           W
Sbjct: 161 TWSALAVHIAMDNTVVMEDIRRLCRGSLPCAGAAALPADSSRHCNGFPAGAEVTNRLAPW 220

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K+C                           F  PQS
Sbjct: 221 RPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FMMPQS 253

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 254 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EFTDSCFIPDESFHCQHPPSRM 309

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 310 SIGELDPSIAV 320


>gi|355669955|gb|AER94692.1| ATG4 autophagy related 4-like protein B [Mustela putorius furo]
          Length = 390

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 37  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 96

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 97  QRKRQSDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 156

Query: 132 DWSSIVFHVALDNTLVVNQVKKLC--------------TTNKRASSNPQ----------W 167
            WS++  H+A+DNT+V+  +++LC               +++  +  P           W
Sbjct: 157 TWSALAVHIAMDNTVVMEDIRRLCRGSLPCAGATALPTDSSRHCNGFPAGAEVTNRPAPW 216

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K+C                           F  PQS
Sbjct: 217 RPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FMMPQS 249

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 250 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCRHPPSRM 305

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 306 GISELDPSIAV 316


>gi|395528686|ref|XP_003766458.1| PREDICTED: cysteine protease ATG4B [Sarcophilus harrisii]
          Length = 393

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 143/313 (45%), Positives = 187/313 (59%), Gaps = 61/313 (19%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+I  D+TSRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  EEILSDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWK 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ + Y  +L  F D++ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCT------------TNKRASSN------------PQW 167
            WSS+  H+A+DNT+V+ ++++LC             T+    SN              W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCKAGFPCADGAAFPTDSELLSNGYPPAAEVTDRASPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y   +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYTETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL--DSTYHCPQ-AS 284
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q         + +E  +  D T+HC     
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQ------PAVESTEGGVFPDETFHCQHPPC 306

Query: 285 RLHILHMDPSIAV 297
           R++I  +DPSIAV
Sbjct: 307 RMNIGELDPSIAV 319


>gi|126338580|ref|XP_001366892.1| PREDICTED: cysteine protease ATG4B-like [Monodelphis domestica]
          Length = 396

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 145/330 (43%), Positives = 194/330 (58%), Gaps = 73/330 (22%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+TSRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 43  DEILSDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWK 102

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ + Y  +L  F D++ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 103 QGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 162

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNP---------------------QW 167
            WSS+  H+A+DNT+V+  +++LC  N     A++ P                     QW
Sbjct: 163 TWSSLAVHIAMDNTVVMEDIRRLCKANFSHTDAAALPPDSDLLSNGYPPGAEVTDRLSQW 222

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y   +K C                           F  PQS
Sbjct: 223 RPLVLLIPLRLGLTDINEAYTETLKHC---------------------------FMMPQS 255

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ------NIGCVYDKEQDSEKKLDSTYHCP 281
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q      N G + D+          ++HC 
Sbjct: 256 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQAAVELSNGGVIPDE----------SFHCQ 305

Query: 282 Q-ASRLHILHMDPSIAV----VSQRSYSDY 306
               R++I  +DPSIAV     S+  ++D+
Sbjct: 306 HPPCRMNIGELDPSIAVGFFCKSEEDFNDW 335


>gi|354474222|ref|XP_003499330.1| PREDICTED: cysteine protease ATG4B-like [Cricetulus griseus]
          Length = 479

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYR+ F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 126 DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 185

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 186 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 245

Query: 132 DWSSIVFHVALDNTLVVNQVKKLC-------------TTNKR-----------ASSNPQW 167
            WSS+  H+A+DNT+V+ ++++LC             T ++R           A+    W
Sbjct: 246 TWSSLAVHIAMDNTVVMEEIRRLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAW 305

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 306 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 338

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC     R+
Sbjct: 339 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VELTDSCFIPDESFHCQHPPCRM 394

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 395 GIGELDPSIAV 405


>gi|298231123|ref|NP_001177212.1| cysteine protease ATG4B [Sus scrofa]
 gi|296874484|gb|ADH81747.1| autophagy related 4-like protein B [Sus scrofa]
          Length = 393

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 142/311 (45%), Positives = 185/311 (59%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQALL  HLGR W+W 
Sbjct: 40  DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHLGRGWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QWERQPDSYFSVLHAFMDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAAFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---------------------KRASSNPQ---W 167
            WS++  HVA+DNT+V+ ++++LC ++                       A   P+   W
Sbjct: 160 TWSALAVHVAMDNTVVMEEIRRLCRSSLPRAGAAAFPADSDRHCNGFPAEAEVGPRPVPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y   +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINAAYTETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q    V     DS    D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVQV----TDSCLIPDESFHCQHPPHRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|291415044|ref|XP_002723769.1| PREDICTED: APG4 autophagy 4 homolog B [Oryctolagus cuniculus]
          Length = 473

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 140/309 (45%), Positives = 185/309 (59%), Gaps = 56/309 (18%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           +I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W  
Sbjct: 122 EILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQ 181

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ ++YL +L  F DR+ + YSIHQIA  G  EGK+VG+W+GPNTVAQVL+KLA +D 
Sbjct: 182 QKRQPDSYLSVLHAFMDRKDSYYSIHQIAQMGVGEGKSVGQWYGPNTVAQVLKKLAVFDT 241

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTT-----------------------NKRASSNPQWQP 169
           WSS+  H+A+DNT+V+ ++++LC +                       +   ++   W+P
Sbjct: 242 WSSLAVHIAMDNTVVMEEIRRLCRSSHPCAGAATPPAGADWHCNGFPASTEVTNRSPWRP 301

Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
           LVL+IPLRLG+ DIN  Y+  +K C                           F  PQSLG
Sbjct: 302 LVLLIPLRLGLTDINEAYVETLKLC---------------------------FRMPQSLG 334

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHI 288
           VIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+ I
Sbjct: 335 VIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VELTDLCFIPDESFHCQHPPCRMSI 390

Query: 289 LHMDPSIAV 297
             +DPSIAV
Sbjct: 391 GELDPSIAV 399


>gi|344239232|gb|EGV95335.1| Cysteine protease ATG4B [Cricetulus griseus]
          Length = 394

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYR+ F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 41  DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 100

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 101 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 160

Query: 132 DWSSIVFHVALDNTLVVNQVKKLC-------------TTNKR-----------ASSNPQW 167
            WSS+  H+A+DNT+V+ ++++LC             T ++R           A+    W
Sbjct: 161 TWSSLAVHIAMDNTVVMEEIRRLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAW 220

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 221 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 253

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC     R+
Sbjct: 254 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPCRM 309

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 310 GIGELDPSIAV 320


>gi|45861658|gb|AAS78582.1| Aut2B1 [Bos taurus]
          Length = 342

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 140/312 (44%), Positives = 186/312 (59%), Gaps = 57/312 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L+ F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYCSVLQAFLDRKDSCYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WS++  HVA+DNT+V+  +++LC ++     A + P                      W
Sbjct: 160 TWSALAVHVAMDNTVVMADIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ D+N  Y   +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLADVNAAYAGTLKHC---------------------------FRMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q      D+        D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVP----DESFHCQHPPGRM 308

Query: 287 HILHMDPSIAVV 298
            I  +DPSIAVV
Sbjct: 309 SIAELDPSIAVV 320


>gi|410969807|ref|XP_003991383.1| PREDICTED: cysteine protease ATG4B [Felis catus]
          Length = 445

 Score =  271 bits (692), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 186/311 (59%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 92  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 151

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 152 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 211

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WS++  HVA+DNT+V+  +++LC        A++ P                      W
Sbjct: 212 TWSALAVHVAMDNTVVMEDIRRLCRAGLPCAGAAALPADPGRHCNGFPAGAEVSNRLAPW 271

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 272 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 304

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC    SR+
Sbjct: 305 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EFADSCFIPDESFHCQHPPSRM 360

Query: 287 HILHMDPSIAV 297
            +  +DPSIAV
Sbjct: 361 GVRELDPSIAV 371


>gi|440901286|gb|ELR52261.1| Cysteine protease ATG4B, partial [Bos grunniens mutus]
          Length = 393

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 185/311 (59%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L+ F DR+ + YSIHQIA  G  EGK+VG+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYCSVLQAFLDRKDSCYSIHQIAQMGVGEGKSVGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WS++  HVA+DNT+V+  +++LC ++     A + P                      W
Sbjct: 160 TWSALAVHVAMDNTVVMADIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ D+N  Y   +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLADVNAAYAGTLKHC---------------------------FRMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q      D+        D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADR----CPVPDESFHCQHPPGRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|224059752|ref|XP_002193231.1| PREDICTED: cysteine protease ATG4B [Taeniopygia guttata]
          Length = 393

 Score =  270 bits (690), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 141/312 (45%), Positives = 187/312 (59%), Gaps = 58/312 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+I  D+TSRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 39  EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWI 98

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ + Y  +L  F D++ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99  KGKRQMDNYFNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFD 158

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------------SSNPQ---------- 166
            WSS+  H+A+DNT+V+ ++++LC ++                  +  P+          
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCQSHVPCAGAAACPALESDVLYNGCPEDVGLRERLAL 218

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W+PLVL+IPLRLG+ +IN  YI  +K C                           F  PQ
Sbjct: 219 WKPLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQ 251

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
           SLGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC     R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPGDSGCLPDESFHCQHPPCR 307

Query: 286 LHILHMDPSIAV 297
           + I  +DPSIAV
Sbjct: 308 MSIAELDPSIAV 319


>gi|47564102|ref|NP_001001170.1| cysteine protease ATG4B [Bos taurus]
 gi|61211780|sp|Q6PZ03.1|ATG4B_BOVIN RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related cysteine endopeptidase 2B;
           Short=Autophagin-2B; AltName: Full=Autophagy-related
           protein 4 homolog B; AltName: Full=bAut2B
 gi|45861660|gb|AAS78583.1| Aut2b2 [Bos taurus]
          Length = 393

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 139/311 (44%), Positives = 185/311 (59%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L+ F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYCSVLQAFLDRKDSCYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WS++  HVA+DNT+V+  +++LC ++     A + P                      W
Sbjct: 160 TWSALAVHVAMDNTVVMADIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ D+N  Y   +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLADVNAAYAGTLKHC---------------------------FRMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q      D+        D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADR----CPVPDESFHCQHPPGRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|296488734|tpg|DAA30847.1| TPA: cysteine protease ATG4B [Bos taurus]
          Length = 390

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 139/311 (44%), Positives = 185/311 (59%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L+ F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYCSVLQAFLDRKDSCYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WS++  HVA+DNT+V+  +++LC ++     A + P                      W
Sbjct: 160 TWSALAVHVAMDNTVVMADIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ D+N  Y   +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLADVNAAYAGTLKHC---------------------------FRMPQS 252

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q      D+        D ++HC     R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADR----CPVPDESFHCQHPPGRM 308

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 309 SIAELDPSIAV 319


>gi|348577273|ref|XP_003474409.1| PREDICTED: cysteine protease ATG4B [Cavia porcellus]
          Length = 412

 Score =  270 bits (689), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 142/320 (44%), Positives = 186/320 (58%), Gaps = 73/320 (22%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           + I  D+ SRLWFTYR+ F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 57  DDILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWA 116

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 117 QRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 176

Query: 132 DWSSIVFHVALDNTLVVNQVKKLC----------------------------TTNKRASS 163
            WSS+  H+A+DNT+V+ ++++LC                             TN+++ S
Sbjct: 177 TWSSLAVHIAMDNTVVMEEIRRLCRTGLPCAGAAALPTDADRHCNGFPTQTEVTNRQSPS 236

Query: 164 NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
              W+PLVL+IPLRLG+ DIN  Y+  +K C                           F 
Sbjct: 237 --LWRPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FM 267

Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-----GCVYDKEQDSEKKLDSTY 278
            PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q       GC            D T+
Sbjct: 268 MPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDGCFIP---------DETF 318

Query: 279 HCPQ-ASRLHILHMDPSIAV 297
           HC     R+ I  +DPSIAV
Sbjct: 319 HCQHPPCRMGIGELDPSIAV 338


>gi|149244060|pdb|2Z0D|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-120) Complex
 gi|149244062|pdb|2Z0E|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-124) Complex
          Length = 357

 Score =  270 bits (689), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 186/311 (59%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 43  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 102

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 103 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 162

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 163 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 222

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 223 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 255

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDP T Q       +  D     D ++HC     R+
Sbjct: 256 LGVIGGKPNSAHYFIGYVGEELIYLDPATTQPA----VEPTDGCFIPDESFHCQHPPCRM 311

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 312 SIAELDPSIAV 322


>gi|327267215|ref|XP_003218398.1| PREDICTED: cysteine protease ATG4B-like [Anolis carolinensis]
          Length = 393

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 143/312 (45%), Positives = 186/312 (59%), Gaps = 58/312 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+I  D+TSRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W+
Sbjct: 39  EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHLGRDWRWS 98

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              K+ ++Y  +L  F D++ + YSIHQIA  G  EGK++G+W+GPNTVAQVLRKLA +D
Sbjct: 99  KGKKQTDSYYNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLRKLASFD 158

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQWQPLVL---------------- 172
            WSS+  H+A+DNT+V+ ++++LC  +     AS+ P  +P  L                
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCKPSCPCPGASAFPAAEPDFLSNGYPEGAECTDRLLL 218

Query: 173 ------VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
                 +IPLRLG+ DIN  YI  +K C                           F  PQ
Sbjct: 219 WKPLVLLIPLRLGLTDINEAYIETLKHC---------------------------FMMPQ 251

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
           SLGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC     R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPMDSCYIPDESFHCQHPPCR 307

Query: 286 LHILHMDPSIAV 297
           + I  +DPSIAV
Sbjct: 308 MSIAELDPSIAV 319


>gi|224510547|pdb|2ZZP|A Chain A, The Crystal Structure Of Human Atg4b(C74s)- Lc3(1-124)
           Complex
          Length = 357

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 186/311 (59%), Gaps = 57/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWG MLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 43  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGSMLRCGQMIFAQALVCRHLGRDWRWT 102

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 103 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 162

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 163 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 222

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 223 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 255

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 256 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 311

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 312 SIAELDPSIAV 322


>gi|449498615|ref|XP_002197397.2| PREDICTED: cysteine protease ATG4A [Taeniopygia guttata]
          Length = 412

 Score =  268 bits (685), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 145/311 (46%), Positives = 200/311 (64%), Gaps = 55/311 (17%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D  ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDWQ
Sbjct: 41  DKSKLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWQ 100

Query: 71  WNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           W  + K+ E Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA 
Sbjct: 101 WEKHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLAL 160

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTT---------------------NKR-ASSNPQW 167
           +D+W+S+  +V++DNT+V+  +KK+C +                     NK  A   P W
Sbjct: 161 FDEWNSLAVYVSMDNTVVIEDIKKMCWSPAQSSSVAHSSAHVHRSALGQNKNTAGLCPGW 220

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PL+L+IPLRLGI  INPVYI+  K+C                           F  PQS
Sbjct: 221 KPLLLIIPLRLGINHINPVYIDAFKEC---------------------------FKMPQS 253

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RL 286
           LG +GGKPN+A YFIG++GN++I+LDPHT Q+     D E++     D ++HC QA  R+
Sbjct: 254 LGALGGKPNNAYYFIGFLGNELIYLDPHTTQSF---VDSEENGTVD-DKSFHCQQAPHRM 309

Query: 287 HILHMDPSIAV 297
            I+++DPS+A+
Sbjct: 310 KIMNLDPSVAL 320


>gi|18181958|dbj|BAB83890.1| Apg4B [Homo sapiens]
          Length = 392

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 187/311 (60%), Gaps = 58/311 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  E K++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGE-KSIGQWYGPNTVAQVLKKLAVFD 158

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 218

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 219 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 251

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  D     D ++HC     R+
Sbjct: 252 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 307

Query: 287 HILHMDPSIAV 297
            I ++DPSIAV
Sbjct: 308 SIANLDPSIAV 318


>gi|334350077|ref|XP_001376474.2| PREDICTED: cysteine protease ATG4A-like [Monodelphis domestica]
          Length = 417

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 141/306 (46%), Positives = 194/306 (63%), Gaps = 53/306 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W +
Sbjct: 66  KLLSDISARLWFTYRRKFSPIGGTGPSSDSGWGCMLRCGQMMLAQALICKHLGRDWCWEM 125

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ E Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 126 QQEQPEEYHRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 185

Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------------------TTNKRASSNPQWQPLVL 172
           W+S+  +V++DNT+V+  +KKLC                     +      +P W+PL+L
Sbjct: 186 WNSLAVYVSMDNTVVIEDIKKLCHMCPSHLTHDSSPSPGNGLDQSTHLPEPSPGWKPLLL 245

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           +IPLRLGI  INPVYI+  K+C                           F  PQSLG +G
Sbjct: 246 IIPLRLGINQINPVYIDAFKEC---------------------------FKMPQSLGALG 278

Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RLHILHM 291
           GKPN A YFIG++GN++I+LDPHT Q      D E+D     D ++HC Q+  R+ IL++
Sbjct: 279 GKPNSAYYFIGFLGNELIYLDPHTTQTF---VDSEEDGTVD-DQSFHCQQSPHRMQILNL 334

Query: 292 DPSIAV 297
           DPS+A+
Sbjct: 335 DPSVAL 340


>gi|449273759|gb|EMC83168.1| Cysteine protease ATG4A, partial [Columba livia]
          Length = 395

 Score =  265 bits (678), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 142/311 (45%), Positives = 197/311 (63%), Gaps = 55/311 (17%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D  ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDWQ
Sbjct: 39  DKSKLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWQ 98

Query: 71  WNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           W  + ++ E Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA 
Sbjct: 99  WEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLAL 158

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLC----------------------TTNKRASSNPQW 167
           +D+W+S+  +V++DNT+V+  +KK+C                       T   A     W
Sbjct: 159 FDEWNSLAVYVSMDNTVVIEDIKKMCWSPPQGSGAAHSSAHLHRSALGRTKNAAGFCTGW 218

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PL+L+IPLRLGI  INPVYI+  K+C                           F  PQS
Sbjct: 219 KPLLLIIPLRLGINHINPVYIDAFKEC---------------------------FKMPQS 251

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRL 286
           LG +GGKPN+A YFIG++GN++I+LDPHT Q+     D E++     D ++HC QA  R+
Sbjct: 252 LGALGGKPNNAYYFIGFLGNELIYLDPHTTQSF---VDSEENGTVD-DESFHCQQAPHRM 307

Query: 287 HILHMDPSIAV 297
            I+++DPS+A+
Sbjct: 308 KIMNLDPSVAL 318


>gi|431822415|ref|NP_001258915.1| cysteine protease ATG4A isoform 1 [Gallus gallus]
          Length = 397

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 142/313 (45%), Positives = 201/313 (64%), Gaps = 55/313 (17%)

Query: 9   HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
           ++D  ++  D+++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRD
Sbjct: 39  NEDKSKLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRD 98

Query: 69  WQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           WQW  + K+ E Y +IL  F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KL
Sbjct: 99  WQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 158

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTT---------------------NKRASS-NP 165
           A +D+W+S+  +V++DNT+V+  +KK+C +                     N+ A+    
Sbjct: 159 ALFDEWNSLAVYVSMDNTVVIEDIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCT 218

Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
            W+PL+L+IPLRLGI  INPVYI+  K+C                           F  P
Sbjct: 219 GWKPLLLIIPLRLGINHINPVYIDAFKEC---------------------------FKMP 251

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-S 284
           QSLG +GGKPN+A YFIG++GN++I+LDPHT Q+     D E++     D ++HC QA  
Sbjct: 252 QSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSF---VDSEENGTVD-DQSFHCQQAPH 307

Query: 285 RLHILHMDPSIAV 297
           R+ I+++DPS+A+
Sbjct: 308 RMKIMNLDPSVAL 320


>gi|431822417|ref|NP_001258916.1| cysteine protease ATG4A isoform 2 [Gallus gallus]
 gi|61211756|sp|Q5ZIW7.1|ATG4A_CHICK RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|53134379|emb|CAG32326.1| hypothetical protein RCJMB04_23b20 [Gallus gallus]
          Length = 380

 Score =  265 bits (676), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 142/313 (45%), Positives = 201/313 (64%), Gaps = 55/313 (17%)

Query: 9   HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
           ++D  ++  D+++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRD
Sbjct: 22  NEDKSKLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRD 81

Query: 69  WQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           WQW  + K+ E Y +IL  F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KL
Sbjct: 82  WQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 141

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTT---------------------NKRASS-NP 165
           A +D+W+S+  +V++DNT+V+  +KK+C +                     N+ A+    
Sbjct: 142 ALFDEWNSLAVYVSMDNTVVIEDIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCT 201

Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
            W+PL+L+IPLRLGI  INPVYI+  K+C                           F  P
Sbjct: 202 GWKPLLLIIPLRLGINHINPVYIDAFKEC---------------------------FKMP 234

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-S 284
           QSLG +GGKPN+A YFIG++GN++I+LDPHT Q+     D E++     D ++HC QA  
Sbjct: 235 QSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSF---VDSEENGTVD-DQSFHCQQAPH 290

Query: 285 RLHILHMDPSIAV 297
           R+ I+++DPS+A+
Sbjct: 291 RMKIMNLDPSVAL 303


>gi|391335597|ref|XP_003742176.1| PREDICTED: cysteine protease ATG4B-like [Metaseiulus occidentalis]
          Length = 393

 Score =  265 bits (676), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 140/290 (48%), Positives = 187/290 (64%), Gaps = 39/290 (13%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D E +R+  +S LWFTYRK F  IG  G T+D GWGCMLR GQM++ QAL+  HLGR W 
Sbjct: 73  DFEYVRKSFSSMLWFTYRKNFAAIGGDGPTSDTGWGCMLRAGQMMLGQALIRKHLGRSWM 132

Query: 71  WNVNSK---EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           W  + +    E YL+IL+MF+D+++A +SIHQI+L G SEGKAVGEWFGPNTVAQ L+KL
Sbjct: 133 WTSDDRLPDRENYLRILRMFQDKKSATFSIHQISLMGLSEGKAVGEWFGPNTVAQALKKL 192

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
            +YD WS +  HVA+DN ++++ +K LC     A  + +W+PL+LV+PLRLG+ +IN +Y
Sbjct: 193 VQYDHWSEMKLHVAMDNIIILSDIKSLCC----AKESNKWRPLLLVVPLRLGLSEINDIY 248

Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
            N                  +L+S          F    SLG+IGG+P+HALYFIG    
Sbjct: 249 TNA-----------------VLNS----------FKMKHSLGIIGGRPSHALYFIGIQRE 281

Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           +++FLDPHT  N       + D E   DSTYHC +A R+ I +MDPSIA+
Sbjct: 282 ELVFLDPHTTHNY-----VDLDEEPYNDSTYHCQRAQRMKISNMDPSIAM 326


>gi|339249735|ref|XP_003373855.1| cysteine protease ATG4B [Trichinella spiralis]
 gi|316969943|gb|EFV53966.1| cysteine protease ATG4B [Trichinella spiralis]
          Length = 410

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 134/321 (41%), Positives = 192/321 (59%), Gaps = 62/321 (19%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           QD + I+++I SR+WFTYRK F PIG +G  +D GWGCMLRCGQM++AQAL+  HLGR+W
Sbjct: 45  QDFDDIKKEIRSRMWFTYRKSFSPIGGTGPISDSGWGCMLRCGQMLLAQALICRHLGREW 104

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           QW+ + ++EAY++IL+MF+D++   YSIH IA  G SEGK +G+WFGP+T+A V++KLA 
Sbjct: 105 QWSPSCRDEAYVRILRMFQDKKNELYSIHMIAKMGESEGKEIGKWFGPSTIAHVIKKLAI 164

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCT-----------------------------TNKR 160
           YDDWSS+  HVA+DN +V   VKKLC+                              NK+
Sbjct: 165 YDDWSSLAVHVAMDNVIVQEDVKKLCSREVFDALRKRLLQEEPSEIVADWFEDARKDNKK 224

Query: 161 ---ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQT 217
              A+ +  W+PL+L++P+RLG+ ++NP YI  +K+ +A                YN   
Sbjct: 225 VDCANLSSPWKPLLLILPMRLGLSELNPCYIPALKEFFA--------------CKYN--- 267

Query: 218 PRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDST 277
                     +G+IGGKPNHALYFIG   + +++LDPH  Q      D +   +   DS+
Sbjct: 268 ----------IGMIGGKPNHALYFIGAYKDRLVYLDPHWCQTF---VDLDVSMDLFDDSS 314

Query: 278 YHCPQASRLHILHMDPSIAVV 298
           YH      +    +DPS+A+ 
Sbjct: 315 YHSAFILDISFNEIDPSLAIA 335


>gi|324506823|gb|ADY42901.1| Cysteine protease ATG4B [Ascaris suum]
          Length = 433

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 138/349 (39%), Positives = 189/349 (54%), Gaps = 89/349 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D++ I+  +TSRLWFTYRK F+PIG +G T+D+GWGCMLRCGQM++AQAL+  HLG +W 
Sbjct: 39  DMDSIKEYVTSRLWFTYRKNFMPIGGTGPTSDQGWGCMLRCGQMLLAQALIVRHLGTEWM 98

Query: 71  WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
           W+ ++KEE Y +IL+MF+D++  P+S+HQIA  G SE K +GEWFGPNT AQVL+KL  Y
Sbjct: 99  WDRDNKEEDYKRILRMFQDKKCCPFSLHQIAQMGVSERKQIGEWFGPNTAAQVLKKLVVY 158

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTN-------------------------------- 158
           DDWS +  HVALDN L+ + V+ +  T                                 
Sbjct: 159 DDWSRLAVHVALDNLLIASDVRTMAHTRPPSRLSSRHTTENEQSEESGNASGGNSLCSFG 218

Query: 159 ------------KRASSNP-----QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISP 201
                       K    NP     QW+PL++++PLRLG+  IN  Y+  I+  + L    
Sbjct: 219 SVKMCMLQSALMKECDENPVEDEEQWRPLLIIVPLRLGLTSINRCYLPAIEAFFQL---- 274

Query: 202 VYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ--- 258
                                  PQ  G+IGG+PNHALYFIG  G  +I+LDPH  Q   
Sbjct: 275 -----------------------PQCTGIIGGRPNHALYFIGIAGEQLIYLDPHVCQAAI 311

Query: 259 --NIGCVYDKEQDSEKKL--------DSTYHCPQASRLHILHMDPSIAV 297
             +  C   ++QD   ++        DS+YHCP    +     DPS+A+
Sbjct: 312 DLDERCASLQQQDGFVEVVKSTDIFDDSSYHCPFLLHIAYDSADPSLAL 360


>gi|326924562|ref|XP_003208495.1| PREDICTED: cysteine protease ATG4A-like, partial [Meleagris
           gallopavo]
          Length = 421

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 140/313 (44%), Positives = 198/313 (63%), Gaps = 55/313 (17%)

Query: 9   HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
           ++D  ++  D+++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRD
Sbjct: 63  NEDKSKLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRD 122

Query: 69  WQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           WQW  + ++ E Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KL
Sbjct: 123 WQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 182

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC----------------------TTNKRASSNP 165
           A +D+W+S+  +V++DNT+V+  +KK+C                           A    
Sbjct: 183 ALFDEWNSLAVYVSMDNTVVIEDIKKMCWSPPQSSSTAHSSAHLHRSALGRNRNTAGLCT 242

Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
            W+PL+L+IPLRLGI  INPVYI+  K+C                           F  P
Sbjct: 243 GWKPLLLIIPLRLGINHINPVYIDAFKEC---------------------------FKMP 275

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS- 284
           QSLG +GGKPN+A YFIG++GN++I+LDPHT Q+     D E++     D ++HC QA  
Sbjct: 276 QSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSF---VDSEENGTVD-DQSFHCQQAPH 331

Query: 285 RLHILHMDPSIAV 297
           R+ I+++DPS+A+
Sbjct: 332 RMKIMNLDPSVAL 344


>gi|195995623|ref|XP_002107680.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
 gi|190588456|gb|EDV28478.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
          Length = 385

 Score =  263 bits (671), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 136/305 (44%), Positives = 187/305 (61%), Gaps = 48/305 (15%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           ++ +++  DI S+ WFTYRK + PIG  G T+DKGWGCMLRCGQM++ QAL+  HLGRDW
Sbjct: 36  EEFDKLIADINSKFWFTYRKNYPPIGGIGPTSDKGWGCMLRCGQMILGQALVMRHLGRDW 95

Query: 70  QWNVNSKEEA-YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
           +W  N ++ A Y KILK+F D + + YSIHQIA  G SEGK + +WFGPNT AQVL+KL 
Sbjct: 96  RWFKNKEQLANYWKILKLFLDSKDSLYSIHQIAQMGVSEGKKISQWFGPNTAAQVLKKLI 155

Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLC-------------TTNKRASSNPQ---WQPLVL 172
            +D+WS +  +VA+DN +V++ +KK+C              ++ + SSN Q   W+PL+L
Sbjct: 156 MFDEWSQMGVYVAMDNIVVIDDIKKICHNHITRTSQGNAANSDAQGSSNEQSNAWKPLLL 215

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
            IPLRLG+ D+NP+Y + + KC                           F    +LG+IG
Sbjct: 216 FIPLRLGLTDLNPIYKDKLNKC---------------------------FRIKNTLGIIG 248

Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           GKPN A YFIG  G+ +++LDPHT Q    V      S+K    TYH    +RLH  +MD
Sbjct: 249 GKPNSAHYFIGIQGDYLLYLDPHTVQETVKVKPNCPFSDK----TYHQKGTNRLHFSYMD 304

Query: 293 PSIAV 297
           PS+A+
Sbjct: 305 PSVAL 309


>gi|351713264|gb|EHB16183.1| Cysteine protease ATG4B [Heterocephalus glaber]
          Length = 475

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 139/320 (43%), Positives = 182/320 (56%), Gaps = 71/320 (22%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYR+ F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 118 DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWA 177

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 178 QRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 237

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQP---------------------- 169
            WSS+  HVA+DNT+V+ ++++LC ++   S                             
Sbjct: 238 TWSSLAVHVAMDNTVVMEEIRRLCRSSLPCSGAAALPADADRHCNGFPAPMEVTSRPSPS 297

Query: 170 ------LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
                 LVL+IPLRLG+ DIN  Y+  +K+C                           F 
Sbjct: 298 PSPWRPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FM 330

Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-----NIGCVYDKEQDSEKKLDSTY 278
            PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q       GC            D T+
Sbjct: 331 MPQSLGVIGGKPNSAHYFIGYVGKELIYLDPHTTQPAVELTDGCFIP---------DETF 381

Query: 279 HCPQ-ASRLHILHMDPSIAV 297
           HC     R+ I  +DPSIAV
Sbjct: 382 HCQHPPCRMGIGELDPSIAV 401


>gi|225709006|gb|ACO10349.1| Cysteine protease ATG4B [Caligus rogercresseyi]
          Length = 381

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 136/302 (45%), Positives = 187/302 (61%), Gaps = 40/302 (13%)

Query: 4   ANKLS-HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLF 62
            N+LS   D+E++  ++ SR  FTYRK F+ I DSG T+D GWGCMLRCGQMV+A+AL  
Sbjct: 36  GNELSARDDVEELNSEVLSRFLFTYRKEFLEIEDSGYTSDSGWGCMLRCGQMVLAEALQR 95

Query: 63  LHLGRDWQWNV-----NSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS--EGKAVGEWF 115
           + LGR+W+W+      N + + YL+ILK+F+D + APYS+HQIAL G S    K VG WF
Sbjct: 96  VSLGREWKWSSQETLDNDQSQKYLQILKLFQDSKAAPYSLHQIALMGESIQSKKPVGTWF 155

Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
           GPNT+AQVLRKL+  +  + I  HVA+DNT++V+++K+ C      S   Q +PL+L IP
Sbjct: 156 GPNTIAQVLRKLSVSETTNPIRVHVAMDNTVIVDEIKESCGFIGDPS---QGKPLLLFIP 212

Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
           LRLG+ +INP+Y   +K+C                           F FPQ LGVIGG+P
Sbjct: 213 LRLGLTEINPIYFQDLKEC---------------------------FEFPQILGVIGGRP 245

Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
           NHALYFIGY+ N++I+LDPH                +  D TYH  +A R+    +DPS+
Sbjct: 246 NHALYFIGYMDNELIYLDPHVATQTSTPQVVTLGGSE--DKTYHTDRAYRMDFKDLDPSL 303

Query: 296 AV 297
           ++
Sbjct: 304 SL 305


>gi|327277326|ref|XP_003223416.1| PREDICTED: cysteine protease ATG4A-like [Anolis carolinensis]
          Length = 385

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 141/305 (46%), Positives = 195/305 (63%), Gaps = 56/305 (18%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  + K+
Sbjct: 35  DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWHWEEHKKQ 94

Query: 78  -EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSI 136
            E Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+W+S+
Sbjct: 95  PEEYHRILRCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL 154

Query: 137 VFHVALDNTLVVNQVKKLCTTNKR-----ASSNP------------------QWQPLVLV 173
             +V++DNT+V+  +KK+C    +     A  +P                   W+PL+L+
Sbjct: 155 AVYVSMDNTVVIEDIKKMCRLPNQNCPPVAHCSPLSHQSLLSRNRSPGGFCCGWKPLLLI 214

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           IPLRLGI  INPVY++  K+C                           F  PQSLG +GG
Sbjct: 215 IPLRLGINHINPVYVDAFKEC---------------------------FKMPQSLGALGG 247

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RLHILHMD 292
           KPN+A YFIG++GN++I+LDPHT Q      D E++S    D ++HC QA  R+ I+++D
Sbjct: 248 KPNNAYYFIGFLGNELIYLDPHTTQ---LFVDSEENSTVD-DRSFHCQQAPHRMKIMNLD 303

Query: 293 PSIAV 297
           PS+A+
Sbjct: 304 PSVAL 308


>gi|148228573|ref|NP_001085611.1| cysteine protease ATG4A [Xenopus laevis]
 gi|61211771|sp|Q6GPU1.1|ATG4A_XENLA RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|49115669|gb|AAH73017.1| MGC82614 protein [Xenopus laevis]
          Length = 397

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 140/305 (45%), Positives = 190/305 (62%), Gaps = 53/305 (17%)

Query: 15  IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
           ++ DI SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW+W  +
Sbjct: 47  LQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALVCQHLGRDWRWEKH 106

Query: 75  SKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
               E Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+W
Sbjct: 107 KNHPEEYQQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 166

Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ--------------------WQPLVLV 173
           +S+  +V++DNT+VV  +K +C    ++ S  Q                    W+PL+LV
Sbjct: 167 NSLAVYVSMDNTVVVEDIKTMCKYQPQSCSMAQAASHQSTWSRCRDTSGHCSGWRPLLLV 226

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PLRLGI  INPVY++  K C                           F  PQSLG +GG
Sbjct: 227 VPLRLGINHINPVYVDAFKAC---------------------------FKMPQSLGALGG 259

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
           KPNHA YFIG+ G+++I+LDPHT Q      D E+    + D TYHC +  + + +L++D
Sbjct: 260 KPNHAYYFIGFSGDEIIYLDPHTTQTF---VDTEEAGTVQ-DQTYHCQKGPNSMKVLNLD 315

Query: 293 PSIAV 297
           PS+A+
Sbjct: 316 PSVAL 320


>gi|225718596|gb|ACO15144.1| Cysteine protease ATG4B [Caligus clemensi]
          Length = 390

 Score =  253 bits (647), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 135/313 (43%), Positives = 189/313 (60%), Gaps = 45/313 (14%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           +  D+ ++  ++ SRL FTYRK F  I  SG T+D GWGCMLRCGQMV+ +AL  + LGR
Sbjct: 41  ARNDIAELNSEVLSRLLFTYRKEFSEIDGSGYTSDSGWGCMLRCGQMVLGEALQRISLGR 100

Query: 68  DWQWNVNSKEEA-------YLKILKMFEDRRTAPYSIHQIALTGAS--EGKAVGEWFGPN 118
           DW+W+     E        YLKIL +F+D + APYSIHQIAL G S    K VG WFGPN
Sbjct: 101 DWKWDHKVDNEVDEDLKGKYLKILNLFQDSKVAPYSIHQIALMGESIQSKKPVGTWFGPN 160

Query: 119 TVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRL 178
           TVAQVL+KL+ ++    I  HVA+DNT++++++K+ C      S     +PL+L IPLRL
Sbjct: 161 TVAQVLKKLSFFEKTVPIRLHVAMDNTVIIDEIKESCGFVGGDSE----KPLLLFIPLRL 216

Query: 179 GIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHA 238
           G+ +INP+Y   +K+C                           F FPQ LGVIGG+PNHA
Sbjct: 217 GLTEINPIYFQDLKEC---------------------------FEFPQILGVIGGRPNHA 249

Query: 239 LYFIGYVGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           LYFIGYV N++I+LDPH + Q+     D     +   D T+H  +A R+    +DPS+++
Sbjct: 250 LYFIGYVDNELIYLDPHISTQSASSTVDTFGGPQ---DQTHHTERAYRMDFKDLDPSLSL 306

Query: 298 VSQ-RSYSDYKNV 309
               R+ S+++++
Sbjct: 307 CFLCRNESEFEDM 319


>gi|213513159|ref|NP_001133247.1| cysteine protease ATG4B [Salmo salar]
 gi|209147572|gb|ACI32896.1| Cysteine protease ATG4B [Salmo salar]
 gi|223647372|gb|ACN10444.1| Cysteine protease ATG4B [Salmo salar]
          Length = 397

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 136/305 (44%), Positives = 186/305 (60%), Gaps = 22/305 (7%)

Query: 2   RHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL 61
           +  N L+ +  E I   +TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ +AL+
Sbjct: 31  KEFNALTEK--EDILSHVTSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALV 88

Query: 62  FLHLGRDWQW-NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
             HLGRDW+W    S+ E Y+ IL  F D++   YS+HQIA  G  EGK++G+W+GPNTV
Sbjct: 89  RRHLGRDWRWVRSQSQREDYISILNAFLDKKDGYYSLHQIAQMGVGEGKSIGQWYGPNTV 148

Query: 121 AQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI 180
           AQVL+KLA +D WS +  HVA+DNT+V+ ++K+LC                  + L+ G+
Sbjct: 149 AQVLKKLAVFDSWSRLTVHVAMDNTVVIEEIKRLCMPWLDYGG-------AACVDLQGGM 201

Query: 181 QDINPVYINGI----KKCYALPISPVYDMVKILSSTYN---MQTPRYEFTFPQSLGVIGG 233
            + N           ++        +   +++  S  N   ++T +  F  PQSLGVIGG
Sbjct: 202 PEPNGCLEGACALAEEETALWKPLLLLIPLRLGLSDINEAYIETLKQCFQLPQSLGVIGG 261

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMD 292
           KPNHA YFIGYVG ++I+LDPHT Q      +  +DS+   D TYHC     R+HI  +D
Sbjct: 262 KPNHAHYFIGYVGEELIYLDPHTTQP---AVEPCEDSQVP-DDTYHCQHPPCRMHICEID 317

Query: 293 PSIAV 297
           PSIAV
Sbjct: 318 PSIAV 322


>gi|312378951|gb|EFR25375.1| hypothetical protein AND_09326 [Anopheles darlingi]
          Length = 350

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 129/293 (44%), Positives = 169/293 (57%), Gaps = 84/293 (28%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
           +SH D+E IR+D+ SRLW TYR+GFVPIG++ LTTDKGWGCMLRCGQMV+A+AL  LHLG
Sbjct: 62  ISHADIEAIRQDVQSRLWCTYRRGFVPIGNTQLTTDKGWGCMLRCGQMVLAEALTELHLG 121

Query: 67  RDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASEGKAVGEWFGPNTVAQVLR 125
           RDWQW+  +++  YLKI+  FED + AP+S+HQIAL G +SE K +GEWFGPNTVAQVL 
Sbjct: 122 RDWQWSEETRDATYLKIVNRFEDNKQAPFSLHQIALMGDSSEEKRIGEWFGPNTVAQVLN 181

Query: 126 KLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINP 185
           ++                                    NP                    
Sbjct: 182 EV------------------------------------NP-------------------- 185

Query: 186 VYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYV 245
           +YI G+KKC+ L                           P S G+IGG+PN ALYFIGYV
Sbjct: 186 IYIEGLKKCFQL---------------------------PGSCGMIGGRPNQALYFIGYV 218

Query: 246 GNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
           G + ++LDPHT Q +GC+ +K++  E++ D+T+H   ASR+    MDPS+AV 
Sbjct: 219 GEEALYLDPHTVQRVGCIGEKQESVEQEQDATFHQRHASRIAFASMDPSLAVC 271


>gi|453230621|ref|NP_001263575.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
 gi|412974713|emb|CCO25637.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
          Length = 481

 Score =  249 bits (635), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 134/347 (38%), Positives = 188/347 (54%), Gaps = 79/347 (22%)

Query: 4   ANKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLF 62
             ++S +D +E +++ +TSR WFTYR+ F PIG +G +TD+GWGCMLRC QM++ + LL 
Sbjct: 68  GKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLR 127

Query: 63  LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
            H+GR ++W++    E Y KIL+MF D + A YSIHQIA  G +EGK V +WFGPNT AQ
Sbjct: 128 RHIGRHFEWDIEKTSEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNTAAQ 187

Query: 123 VLRKLAKYDDWSSIVFHVALDNTLV------------VNQVKKLCTTNKRASSN------ 164
           V++KL  +DDWS+I  HVALDN LV                 KL   N     N      
Sbjct: 188 VMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENGLVDKNRLSLSP 247

Query: 165 ----PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
               P+W+PL+L+IPLRLG+  INP Y++ I++                           
Sbjct: 248 GNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEF-------------------------- 281

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------------TNQNI 260
            F  PQ +G+IGG+PNHALYF+G  G+ + +LDPH                    T  ++
Sbjct: 282 -FKIPQCVGIIGGRPNHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDV 340

Query: 261 GCVYDKE--------QDSEKKL-DSTYHCPQASRLHILHMDPSIAVV 298
           G  + +E         D   K+ DSTYHC     +   ++DPS+A+ 
Sbjct: 341 GFSHLEELVPLPSQTADVYTKMDDSTYHCQMMLWIEYENVDPSLALA 387


>gi|212645205|ref|NP_493375.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
 gi|193247781|emb|CAB54483.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
          Length = 454

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 133/347 (38%), Positives = 189/347 (54%), Gaps = 79/347 (22%)

Query: 4   ANKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLF 62
             ++S +D +E +++ +TSR WFTYR+ F PIG +G +TD+GWGCMLRC QM++ + LL 
Sbjct: 41  GKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLR 100

Query: 63  LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
            H+GR ++W++    E Y KIL+MF D + A YSIHQIA  G +EGK V +WFGPNT AQ
Sbjct: 101 RHIGRHFEWDIEKTSEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNTAAQ 160

Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT------------NKRASSN------ 164
           V++KL  +DDWS+I  HVALDN LV      + T+            N     N      
Sbjct: 161 VMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENGLVDKNRLSLSP 220

Query: 165 ----PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
               P+W+PL+L+IPLRLG+  INP Y++ I++                           
Sbjct: 221 GNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEF-------------------------- 254

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------------TNQNI 260
            F  PQ +G+IGG+PNHALYF+G  G+ + +LDPH                    T  ++
Sbjct: 255 -FKIPQCVGIIGGRPNHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDV 313

Query: 261 GCVYDKE--------QDSEKKL-DSTYHCPQASRLHILHMDPSIAVV 298
           G  + +E         D   K+ DSTYHC     +   ++DPS+A+ 
Sbjct: 314 GFSHLEELVPLPSQTADVYTKMDDSTYHCQMMLWIEYENVDPSLALA 360


>gi|312073335|ref|XP_003139474.1| hypothetical protein LOAG_03889 [Loa loa]
 gi|307765357|gb|EFO24591.1| hypothetical protein LOAG_03889 [Loa loa]
          Length = 458

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 187/348 (53%), Gaps = 85/348 (24%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S +D+E+I+  + S LWFTYRK F PIG  G TTD+GWGCMLRCGQM++A+ L+  HLGR
Sbjct: 65  SRRDMERIKDIMASLLWFTYRKNFQPIGGIGPTTDQGWGCMLRCGQMLLARVLIVRHLGR 124

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           +W W+ + K   Y +IL+MF+D++ + +SIHQIA  G SEGK +GEWFGPNT AQVL+KL
Sbjct: 125 NWLWDRDIKLAEYKRILRMFQDKKNSLFSIHQIAHMGVSEGKNIGEWFGPNTTAQVLKKL 184

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTT------------------NKRASSNP---- 165
             YD WS +  HVALDN L+ + ++ +  T                  +   + NP    
Sbjct: 185 VIYDQWSRLAVHVALDNVLITSDIRTMAFTRPPYRKSGSRRETGSDYNDNHDAVNPAEAE 244

Query: 166 -------------------------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
                                    +W+PL+++IPLRLG+  IN  Y   I+  + L   
Sbjct: 245 IFPESTRSPTRSETSSISSYGGNSEEWRPLLIIIPLRLGLSTINRCYFPAIQAFFQL--- 301

Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI 260
                                   PQ +G+IGG+PNHALYF G V N++++LDPH  Q+ 
Sbjct: 302 ------------------------PQCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQDF 337

Query: 261 -----GCVYDKEQD------SEKKLDSTYHCPQASRLHILHMDPSIAV 297
                      E+D      +++  DSTYHCP      I  +DPS+A+
Sbjct: 338 VDLDETTATRDERDGYVEIKNDEFRDSTYHCPFILTTKIDKVDPSLAL 385


>gi|194389756|dbj|BAG60394.1| unnamed protein product [Homo sapiens]
          Length = 379

 Score =  245 bits (625), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 130/300 (43%), Positives = 176/300 (58%), Gaps = 57/300 (19%)

Query: 26  TYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKIL 84
           ++R+     G +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L
Sbjct: 39  SHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVL 98

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDN 144
             F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DN
Sbjct: 99  NAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDN 158

Query: 145 TLVVNQVKKLCTTN---KRASSNPQ---------------------WQPLVLVIPLRLGI 180
           T+V+ ++++LC T+     A++ P                      W+PLVL+IPLRLG+
Sbjct: 159 TVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGL 218

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
            DIN  Y+  +K C                           F  PQSLGVIGGKPN A Y
Sbjct: 219 TDINEAYVETLKHC---------------------------FMMPQSLGVIGGKPNSAHY 251

Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAVVS 299
           FIGYVG ++I+LDPHT Q       +  D     D ++HC     R+ I  +DPSIAV S
Sbjct: 252 FIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGS 307


>gi|397483837|ref|XP_003813097.1| PREDICTED: cysteine protease ATG4B isoform 4 [Pan paniscus]
          Length = 379

 Score =  244 bits (623), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 129/298 (43%), Positives = 175/298 (58%), Gaps = 57/298 (19%)

Query: 26  TYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKIL 84
           ++R+     G +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L
Sbjct: 39  SHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVL 98

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDN 144
             F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DN
Sbjct: 99  NAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDN 158

Query: 145 TLVVNQVKKLCTTN---KRASSNPQ---------------------WQPLVLVIPLRLGI 180
           T+V+ ++++LC T+     A++ P                      W+PLVL+IPLRLG+
Sbjct: 159 TVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGL 218

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
            DIN  Y+  +K C                           F  PQSLGVIGGKPN A Y
Sbjct: 219 TDINEAYVETLKHC---------------------------FMMPQSLGVIGGKPNSAHY 251

Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
           FIGYVG ++I+LDPHT Q       +  D     D ++HC     R+ I  +DPSIAV
Sbjct: 252 FIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 305


>gi|426339171|ref|XP_004033533.1| PREDICTED: cysteine protease ATG4B isoform 3 [Gorilla gorilla
           gorilla]
          Length = 379

 Score =  244 bits (623), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 129/298 (43%), Positives = 175/298 (58%), Gaps = 57/298 (19%)

Query: 26  TYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKIL 84
           ++R+     G +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L
Sbjct: 39  SHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVL 98

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDN 144
             F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DN
Sbjct: 99  NAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDN 158

Query: 145 TLVVNQVKKLCTTN---KRASSNPQ---------------------WQPLVLVIPLRLGI 180
           T+V+ ++++LC T+     A++ P                      W+PLVL+IPLRLG+
Sbjct: 159 TVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGL 218

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
            DIN  Y+  +K C                           F  PQSLGVIGGKPN A Y
Sbjct: 219 TDINEAYVETLKHC---------------------------FMMPQSLGVIGGKPNSAHY 251

Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
           FIGYVG ++I+LDPHT Q       +  D     D ++HC     R+ I  +DPSIAV
Sbjct: 252 FIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 305


>gi|449666316|ref|XP_002168183.2| PREDICTED: cysteine protease ATG4B-like [Hydra magnipapillata]
          Length = 436

 Score =  244 bits (623), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 134/336 (39%), Positives = 182/336 (54%), Gaps = 79/336 (23%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +D+E+   +I ++ WFTYR+ F PIG +G  +D GWGCMLRCGQM++AQALL  HLGRDW
Sbjct: 42  EDMEKFNAEILTKFWFTYRRNFHPIGGTGPMSDTGWGCMLRCGQMMLAQALLCRHLGRDW 101

Query: 70  QWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
            W    K+ E Y+ IL  F D++ + YSIHQIA  G  EGK +G+WFGPNTVAQV++KL 
Sbjct: 102 DWRSGRKDNEIYMMILHSFLDKKDSLYSIHQIAQMGVGEGKQIGQWFGPNTVAQVIKKLV 161

Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLC----------------------TTNKRASSNPQ 166
            +DD + +  HVA+DNT+V+  +KKLC                      T N+  S  P 
Sbjct: 162 LFDDNADMAVHVAMDNTVVIEDIKKLCKSSINAWGCYGECSYIHDRSSLTGNQSVSKPPH 221

Query: 167 -------------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISP 201
                                    W+PL+L IPLRLG+ +IN  Y              
Sbjct: 222 CSCESSQKLKSNRKLKSFNSEELQSWRPLLLFIPLRLGLSEINSDY-------------- 267

Query: 202 VYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIG 261
            Y+ +KI+            FT  QSLGVIGGKPNHA YFIG+ G+ +++LDPHT Q   
Sbjct: 268 -YNSLKIM------------FTLRQSLGVIGGKPNHAHYFIGFNGDRLLYLDPHTTQQT- 313

Query: 262 CVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
               + +      D ++HC     +    +DPS+A+
Sbjct: 314 ---IEPERFNVIPDESFHCVYPCFMSFQSLDPSVAL 346


>gi|148691993|gb|EDL23940.1| mCG3720 [Mus musculus]
          Length = 318

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 114/254 (44%), Positives = 160/254 (62%), Gaps = 47/254 (18%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 92  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 151

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQV++KLA +D+
Sbjct: 152 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLALFDE 211

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP-------------------QWQPLVLV 173
           W+S+  +V++DNT+V+  +KK+C      +++P                    W+PL+L+
Sbjct: 212 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLI 271

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PLRLGI  INPVY+   K+C                           F  PQSLG +GG
Sbjct: 272 VPLRLGINQINPVYVEAFKEC---------------------------FKMPQSLGALGG 304

Query: 234 KPNHALYFIGYVGN 247
           KPN+A YFIG++G 
Sbjct: 305 KPNNAYYFIGFLGK 318


>gi|297669945|ref|XP_002813144.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pongo abelii]
          Length = 378

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 127/298 (42%), Positives = 174/298 (58%), Gaps = 57/298 (19%)

Query: 26  TYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKIL 84
           ++R+     G +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L
Sbjct: 38  SHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVL 97

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDN 144
             F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DN
Sbjct: 98  NAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDN 157

Query: 145 TLVVNQVKKLC--------------TTNKRASSNPQ----------WQPLVLVIPLRLGI 180
           T+V+ ++++LC               +++  +  P           W+PLVL+IPLRLG+
Sbjct: 158 TVVMEEIRRLCRNSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGL 217

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
            DIN  Y+  +K C                           F  PQSLGVIGGKPN A Y
Sbjct: 218 TDINEAYVETLKHC---------------------------FMMPQSLGVIGGKPNSAHY 250

Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
           FIGYVG ++I+LDPHT Q       +  D     D ++HC     R+ I  +DPSIAV
Sbjct: 251 FIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 304


>gi|444518589|gb|ELV12252.1| Cysteine protease ATG4B, partial [Tupaia chinensis]
          Length = 324

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 128/289 (44%), Positives = 168/289 (58%), Gaps = 63/289 (21%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 44  EEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWA 103

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
             +++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 104 QWTQQPDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 163

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
            WSS+  H+A+DNT+V  ++                              +IN  Y+  +
Sbjct: 164 TWSSLAVHIAMDNTVVTGEI------------------------------NINEAYVETL 193

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
           K C                           F  PQSLGVIGGKPN A YFIGYVG+++I+
Sbjct: 194 KHC---------------------------FMMPQSLGVIGGKPNSAHYFIGYVGDELIY 226

Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAVVS 299
           LDPHT Q       +  DS    D ++HC    SR+ I  +DPSIAV+S
Sbjct: 227 LDPHTTQPAV----ELTDSCLVPDESFHCQHPPSRMSIRELDPSIAVLS 271


>gi|426218487|ref|XP_004003478.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4B [Ovis
           aries]
          Length = 454

 Score =  240 bits (612), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 127/311 (40%), Positives = 169/311 (54%), Gaps = 73/311 (23%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 87  DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWA 146

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y ++                    G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 147 QRKRQPDSYCRVPPQM----------------GVGEGKSIGQWYGPNTVAQVLKKLAVFD 190

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN------------------------KRASSNPQW 167
            WS++  HVA+DNT+V+  V++LC +                         +       W
Sbjct: 191 AWSALAVHVAMDNTVVMADVRRLCRSGLPCAGAEAFPADSERHCNGFPAGAEGGECTAPW 250

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ D+N  Y   +K C                           F  PQS
Sbjct: 251 RPLVLLIPLRLGLADVNAAYAGTLKHC---------------------------FRMPQS 283

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
           LGVIGGKPN A YFIGYVG ++I+LDPHT Q      D+        D ++HC     R+
Sbjct: 284 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVP----DESFHCQHPPGRM 339

Query: 287 HILHMDPSIAV 297
            I  +DPSIAV
Sbjct: 340 SITELDPSIAV 350


>gi|355750993|gb|EHH55320.1| hypothetical protein EGM_04504, partial [Macaca fascicularis]
          Length = 268

 Score =  238 bits (608), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 119/256 (46%), Positives = 157/256 (61%), Gaps = 52/256 (20%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 40  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
              ++ ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
            WSS+  H+A+DNT+V+ ++++LC T+     A++ P                      W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPW 219

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           +PLVL+IPLRLG+ DIN  Y+  +K C                           F  PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252

Query: 228 LGVIGGKPNHALYFIG 243
           LGVIGGKPN A YFIG
Sbjct: 253 LGVIGGKPNSAHYFIG 268


>gi|340369400|ref|XP_003383236.1| PREDICTED: cysteine protease ATG4A-like [Amphimedon queenslandica]
          Length = 394

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 123/289 (42%), Positives = 168/289 (58%), Gaps = 45/289 (15%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +D      D+ SR WFTYRK F PIGD+G T+D GWGC LRCGQM++   LL  HLGRDW
Sbjct: 56  RDGASFVEDLQSRFWFTYRKNFRPIGDTGYTSDSGWGCTLRCGQMLLGHTLLLRHLGRDW 115

Query: 70  QWNVNSKEE-AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
           +W+ +S  +  Y KIL+MF D R + YSI  IAL GA  G++VG+WFGPN VAQ +++LA
Sbjct: 116 RWSPSSSNDYKYQKILRMFLDYRDSEYSIQMIALQGADFGRSVGQWFGPNNVAQAIKRLA 175

Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYI 188
            +D WS +  +VA+D  +V++ +                +P+++ IPLRLG +  N  Y 
Sbjct: 176 VHDQWSEVAVYVAMDMLVVIDDISNF-------------RPVLVFIPLRLGQERFNMEYK 222

Query: 189 NGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
             +K C+A+                            QS+G+IGGKP HAL+F GY  + 
Sbjct: 223 EAVKACFAV---------------------------RQSVGIIGGKPRHALWFTGYHDDY 255

Query: 249 VIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           +I+LDPH  Q+  CV     D+    DSTYH  Q  RLHI  +DPS+A+
Sbjct: 256 LIYLDPHKTQS--CV--TLPDAGIVSDSTYHTTQIERLHISELDPSLAL 300


>gi|341885317|gb|EGT41252.1| hypothetical protein CAEBREN_15768 [Caenorhabditis brenneri]
          Length = 457

 Score =  237 bits (604), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 131/356 (36%), Positives = 187/356 (52%), Gaps = 77/356 (21%)

Query: 4   ANKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLF 62
             +++ +D +E +++ ++SR WFTYRK F PIG +G T+D+GWGCMLRC QM++ + LL 
Sbjct: 36  GKEITEEDGIEAMKKYMSSRFWFTYRKDFSPIGGTGPTSDQGWGCMLRCAQMLLGEVLLR 95

Query: 63  LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
            H+GR ++W++ +    Y KIL+MF D + A YSIHQIA  G +EGK + +WFGPNT AQ
Sbjct: 96  RHIGRHFEWDIETTSVVYEKILQMFFDEKDALYSIHQIAQMGVTEGKEISKWFGPNTAAQ 155

Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKR--------------------AS 162
           VL+KL  +DDWS++  HVALDN LV      + TT                        S
Sbjct: 156 VLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTYPSEDAVKLIMENGQVEKHYATITS 215

Query: 163 SNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEF 222
              +W+PL+L+IPLRLG+  IN  Y+  I++ + L                         
Sbjct: 216 KEGEWRPLLLMIPLRLGLTSINTCYLPAIQEFFKL------------------------- 250

Query: 223 TFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT---------------------NQNIG 261
             PQ +G+IGGKPN A YF+G  G  + +LDPH                      + N  
Sbjct: 251 --PQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTESEQHDTNFS 308

Query: 262 CVYDKE----QDSE---KKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
            + D E    Q S+   K  DSTYHC     +    +DPS+A+ +   S  D+ N+
Sbjct: 309 ELEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESREDFDNL 364


>gi|308490628|ref|XP_003107506.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
 gi|308251874|gb|EFO95826.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
          Length = 478

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 136/366 (37%), Positives = 180/366 (49%), Gaps = 106/366 (28%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           LE +++ +TSRLWFTYR+ F PIG +G +TD+GWGCMLRC QM++ + LL  H+GR ++W
Sbjct: 49  LEAMKKYMTSRLWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLRRHIGRHFEW 108

Query: 72  NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
           ++    E Y KIL+MF D + A YSIHQIA  G +EGK V EWFGPNT AQV++KL  +D
Sbjct: 109 DIEKTSEVYDKILQMFFDEKDALYSIHQIAQMGVTEGKKVSEWFGPNTAAQVIKKLTIFD 168

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTT---------------------------------- 157
           DWS+I  HVALDN LV      + TT                                  
Sbjct: 169 DWSNIAVHVALDNILVKEDALTMATTYPSDNASYIFAVHNFLKYFTLNLTFPNFAENGQI 228

Query: 158 -NKRASS--NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
              R SS     W+PL+++IPLRLG+  INP Y+  I+K + L                 
Sbjct: 229 EKPRPSSGCTTDWRPLLVMIPLRLGLTSINPCYLPAIQKFFEL----------------- 271

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------TNQNI 260
                     PQ +G+IGGKPN A YF+G  G  + +LDPH              TN  I
Sbjct: 272 ----------PQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHHCRAKTTKRDAGVTTNTMI 321

Query: 261 GCV-----------------YDKEQDSE-----------KKLDSTYHCPQASRLHILHMD 292
             +                 + K +D E           K  DSTYHC     +    +D
Sbjct: 322 SSITTTDAQLDIQNQIDDSDFHKLEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEYESID 381

Query: 293 PSIAVV 298
           PS+A+ 
Sbjct: 382 PSLALA 387


>gi|30109219|gb|AAH41862.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
 gi|119623096|gb|EAX02691.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
 gi|119623098|gb|EAX02693.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
          Length = 321

 Score =  234 bits (596), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 121/274 (44%), Positives = 170/274 (62%), Gaps = 55/274 (20%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
           MLRCGQM++AQAL+  HLGRDW W    ++ + Y +IL+ F DR+   YSIHQ+A  G  
Sbjct: 1   MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60

Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC--------TTN 158
           EGK++GEWFGPNTVAQVL+KLA +D+W+S+  +V++DNT+V+  +KK+C        T  
Sbjct: 61  EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 120

Query: 159 KR-----ASSN---------PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYD 204
            R      +SN           W+PL+L++PLRLGI  INPVY++  K+C          
Sbjct: 121 DRPPDSLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQINPVYVDAFKEC---------- 170

Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
                            F  PQSLG +GGKPN+A YFIG++G+++IFLDPHT Q      
Sbjct: 171 -----------------FKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTF---V 210

Query: 265 DKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           D E++     D T+HC Q+  R++IL++DPS+A+
Sbjct: 211 DTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 243


>gi|296206033|ref|XP_002750034.1| PREDICTED: cysteine protease ATG4B isoform 2 [Callithrix jacchus]
          Length = 319

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 121/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
           MLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L  F DR+ + YSIHQIA  G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN-------- 158
           EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DNT+V+ ++++LC T+        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATA 120

Query: 159 ---------------KRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
                             +S P  W+PLVL+IPLRLG+ DIN  Y+  +K C        
Sbjct: 121 FPADADRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
                              F  PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q    
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212

Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
              +  DS    D ++HC     R+ I  +DPSIAV
Sbjct: 213 ---EPTDSCFIPDESFHCQHPPCRMSIAELDPSIAV 245


>gi|403291503|ref|XP_003936827.1| PREDICTED: cysteine protease ATG4B [Saimiri boliviensis
           boliviensis]
          Length = 319

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 121/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
           MLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L  F DR+ + YSIHQIA  G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN-------- 158
           EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DNT+V+ ++++LC T+        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATA 120

Query: 159 ---------------KRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
                             +S P  W+PLVL+IPLRLG+ DIN  Y+  +K C        
Sbjct: 121 FPADANRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
                              F  PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q    
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212

Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
              +  DS    D ++HC     R+ I  +DPSIAV
Sbjct: 213 ---EPTDSCFIPDESFHCQHPPCRMSIAELDPSIAV 245


>gi|14042153|dbj|BAB55127.1| unnamed protein product [Homo sapiens]
          Length = 331

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 120/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
           MLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L  F DR+ + YSIHQIA  G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KRASS 163
           EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DNT+V+ ++++LC T+     A++
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVLCAGATA 120

Query: 164 NPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
            P                      W+PLVL+IPLRLG+ DIN  Y+  +K C        
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
                              F  PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q    
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212

Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
              +  D     D ++HC     R+ I  +DPSIAV
Sbjct: 213 ---EPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 245


>gi|426339167|ref|XP_004033531.1| PREDICTED: cysteine protease ATG4B isoform 1 [Gorilla gorilla
           gorilla]
 gi|426339169|ref|XP_004033532.1| PREDICTED: cysteine protease ATG4B isoform 2 [Gorilla gorilla
           gorilla]
 gi|221045722|dbj|BAH14538.1| unnamed protein product [Homo sapiens]
          Length = 319

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 120/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
           MLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L  F DR+ + YSIHQIA  G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KRASS 163
           EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DNT+V+ ++++LC T+     A++
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATA 120

Query: 164 NPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
            P                      W+PLVL+IPLRLG+ DIN  Y+  +K C        
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
                              F  PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q    
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212

Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
              +  D     D ++HC     R+ I  +DPSIAV
Sbjct: 213 ---EPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 245


>gi|119591685|gb|EAW71279.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_b
           [Homo sapiens]
          Length = 331

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 120/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
           MLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L  F DR+ + YSIHQIA  G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KRASS 163
           EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DNT+V+ ++++LC T+     A++
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATA 120

Query: 164 NPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
            P                      W+PLVL+IPLRLG+ DIN  Y+  +K C        
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
                              F  PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q    
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212

Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
              +  D     D ++HC     R+ I  +DPSIAV
Sbjct: 213 ---EPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 245


>gi|14041938|dbj|BAB55042.1| unnamed protein product [Homo sapiens]
          Length = 280

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 121/279 (43%), Positives = 161/279 (57%), Gaps = 57/279 (20%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
           MLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L  F DR+ + YSIHQIA  G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KRASS 163
           EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DNT+V+ ++++LC T+     A++
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATA 120

Query: 164 NPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
            P                      W+PLVL+IPLRLG+ DIN  Y+  +K C        
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
                              F  PQSLGVIGGKPN A YFIGYVG  +I+LDPHT Q    
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEGLIYLDPHTTQPA-- 211

Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAVVSQ 300
              +  D     D ++HC     R+ I  +DPSIAV  Q
Sbjct: 212 --VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGKQ 248


>gi|149037474|gb|EDL91905.1| autophagy-related 4B (yeast), isoform CRA_b [Rattus norvegicus]
          Length = 319

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 120/276 (43%), Positives = 159/276 (57%), Gaps = 57/276 (20%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
           MLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L  F DR+ + YSIHQIA  G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVG 60

Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN-------- 158
           EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DNT+V+ ++ KLC  +        
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEISKLCRASLPCAGAAA 120

Query: 159 ---------------KRASSNP-QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
                             ++ P  W+PLVL+IPLRLG+ DIN  Y+  +K C        
Sbjct: 121 LSMESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
                              F  PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q    
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212

Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
              +  DS    D ++HC     R+ I  +DPSIAV
Sbjct: 213 ---ELTDSCFIPDESFHCQHPPCRMGIGELDPSIAV 245


>gi|194384462|dbj|BAG59391.1| unnamed protein product [Homo sapiens]
          Length = 319

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 119/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
           MLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L  F DR+ + YSIHQIA  G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KRASS 163
           EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+D+T+V+ ++++LC T+     A++
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDSTVVMEEIRRLCRTSVPCAGATA 120

Query: 164 NPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
            P                      W+PLVL+IPLRLG+ DIN  Y+  +K C        
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
                              F  PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q    
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212

Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
              +  D     D ++HC     R+ I  +DPSIAV
Sbjct: 213 ---EPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 245


>gi|395733089|ref|XP_002813143.2| PREDICTED: cysteine protease ATG4B isoform 2 [Pongo abelii]
          Length = 331

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 118/276 (42%), Positives = 160/276 (57%), Gaps = 57/276 (20%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
           MLRCGQM+ AQAL+  HLGRDW+W    ++ ++Y  +L  F DR+ + YSIHQIA  G  
Sbjct: 1   MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60

Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC----------- 155
           EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DNT+V+ ++++LC           
Sbjct: 61  EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRNSVPCAGATA 120

Query: 156 ---TTNKRASSNPQ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
               +++  +  P           W+PLVL+IPLRLG+ DIN  Y+  +K C        
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
                              F  PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q    
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212

Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
              +  D     D ++HC     R+ I  +DPSIAV
Sbjct: 213 ---EPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 245


>gi|198417051|ref|XP_002128504.1| PREDICTED: similar to autophagy-related cysteine endopeptidase 2
           [Ciona intestinalis]
          Length = 422

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 125/306 (40%), Positives = 181/306 (59%), Gaps = 55/306 (17%)

Query: 19  ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE 78
           I S LWFTYRKG+ PIG +G T+D GWGCMLRCGQM++A+AL  L + +DW+W  +  + 
Sbjct: 60  IKSFLWFTYRKGYTPIGGTGPTSDSGWGCMLRCGQMLLARALAELTMDKDWKWTEDKPQP 119

Query: 79  A-YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
             Y +IL    D R++ YSIHQIA  G  EGK VG+WFGPNT++QVLR+L+++D  + + 
Sbjct: 120 PPYKRILHQLSDERSSCYSIHQIAQMGVEEGKEVGQWFGPNTISQVLRRLSQFDQENVLA 179

Query: 138 FHVALDNTLVVNQVKKLCTT--------------------------NKRASSNPQWQPLV 171
            HVA+DNT+ +  +++LC+T                          N   +S+  W+PL+
Sbjct: 180 IHVAMDNTVCIEDIERLCSTTPTTQYEGACSSTCKPDRTKCNGDSPNVSPTSDDFWRPLL 239

Query: 172 LVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVI 231
           L+IPLRLG+ +INPVY   +K+C                             + +S+GVI
Sbjct: 240 LLIPLRLGLSEINPVYFTHLKEC---------------------------LHWKESVGVI 272

Query: 232 GGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
           GGKPNHA YF+G   + +IFLDPHT Q    + D   + E+  D+T+HC    R+ + ++
Sbjct: 273 GGKPNHAYYFLGCSEDSMIFLDPHTTQPYVKLPDITSN-ERYDDTTFHCDTPGRMLLTNL 331

Query: 292 DPSIAV 297
           DPS+A+
Sbjct: 332 DPSLAL 337


>gi|268570274|ref|XP_002640735.1| Hypothetical protein CBG19805 [Caenorhabditis briggsae]
          Length = 481

 Score =  223 bits (568), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 122/327 (37%), Positives = 172/327 (52%), Gaps = 80/327 (24%)

Query: 4   ANKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLF 62
             ++S +D ++ +++ +TSR WFTYR+ F PIG +G +TD+ WGCMLRC QM++ + LL 
Sbjct: 36  GKEISKEDGIDAMKKYMTSRFWFTYRRNFSPIGGTGPSTDQYWGCMLRCAQMLLGEVLLR 95

Query: 63  LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
            H+GR ++W++    + Y KIL+MF D + A YSIHQIA  G SEGK V EWFGPNT AQ
Sbjct: 96  RHIGRHFEWDIEKTSDVYEKILQMFFDEKDALYSIHQIAQMGVSEGKEVSEWFGPNTAAQ 155

Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT-----------------NKRASSN- 164
           V++KL  +DDWS+I  HVALDN LV      + TT                 + R SS+ 
Sbjct: 156 VIKKLTIFDDWSNIAVHVALDNILVKQDALTMATTYPSEDAVKLIMGEFGFKSDRISSSH 215

Query: 165 --------------------------------PQWQPLVLVIPLRLGIQDINPVYINGIK 192
                                            +W+PL+L+IPLRLG+  IN  Y++ I+
Sbjct: 216 IICMNLDYFKKLLNFENGLVEKHYTSTVPANGTEWRPLLLMIPLRLGLTSINSCYLSAIQ 275

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
           + + L                           PQ +G+IGGKPN A YF+G  G  + +L
Sbjct: 276 EFFKL---------------------------PQCVGIIGGKPNLAHYFVGIAGTKLFYL 308

Query: 253 DPHTNQNIGCVY--DKEQDSEKKLDST 277
           DPH  +     +  +KEQ  +   DST
Sbjct: 309 DPHHCRPKTSKFFVEKEQQQQSSGDST 335


>gi|297265289|ref|XP_002799164.1| PREDICTED: cysteine protease ATG4B-like [Macaca mulatta]
          Length = 358

 Score =  221 bits (563), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 112/255 (43%), Positives = 151/255 (59%), Gaps = 52/255 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           + +  +W   RK  +  G +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W    ++
Sbjct: 21  ETSEPVWILGRKYSIFTGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQ 80

Query: 78  -EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSI 136
            ++Y  +L  F DR+ + YSIHQIA  G  EGK++G+W+GPNTVAQVL+KLA +D WSS+
Sbjct: 81  PDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSL 140

Query: 137 VFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------WQPLVL 172
             H+A+DNT+V+ ++++LC T+     A++ P                      W+PLVL
Sbjct: 141 AVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVL 200

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           +IPLRLG+ DIN  Y+  +K C                           F  PQSLGVIG
Sbjct: 201 LIPLRLGLTDINEAYVETLKHC---------------------------FMMPQSLGVIG 233

Query: 233 GKPNHALYFIGYVGN 247
           GKPN A YFIGYVG 
Sbjct: 234 GKPNSAHYFIGYVGE 248


>gi|350595874|ref|XP_003484197.1| PREDICTED: cysteine protease ATG4A [Sus scrofa]
          Length = 393

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 118/304 (38%), Positives = 167/304 (54%), Gaps = 88/304 (28%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           DI++RLWFTYR+ F PI                                 DW W    ++
Sbjct: 76  DISARLWFTYRRKFSPI---------------------------------DWNWEKQKEQ 102

Query: 78  -EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSI 136
            + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+W+S+
Sbjct: 103 PKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL 162

Query: 137 VFHVALDNTLVVNQVKKLCTT----------NKRASSN------------PQWQPLVLVI 174
             +V++DNT+V+  +KK+C            ++R S N            P W+PL+L++
Sbjct: 163 AVYVSMDNTVVIEDIKKMCCASALSADAAVESRRDSLNASTQSKGPSACRPAWKPLLLIV 222

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
           PLRLGI  INPVY++  K+C                           F  PQSLG +GGK
Sbjct: 223 PLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGALGGK 255

Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDP 293
           PN+A YFIG++G+++IFLDPHT Q      D E++     D T+HC Q   R++IL++DP
Sbjct: 256 PNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGMVD-DQTFHCLQPPQRMNILNLDP 311

Query: 294 SIAV 297
           S+A+
Sbjct: 312 SVAL 315


>gi|170572866|ref|XP_001892265.1| Peptidase family C54 containing protein [Brugia malayi]
 gi|158602497|gb|EDP38912.1| Peptidase family C54 containing protein [Brugia malayi]
          Length = 440

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 125/361 (34%), Positives = 177/361 (49%), Gaps = 103/361 (28%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S  D+ +++  + S LWFTYRK F PIG +G TTD+GWGCMLRCGQM++A+ L+  HLGR
Sbjct: 65  SRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLIVRHLGR 124

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           +W W+ +     Y +IL                   G SEGK +GEWFGPNT AQVL+KL
Sbjct: 125 NWLWDRDVMLTEYKRILPNM----------------GVSEGKEIGEWFGPNTAAQVLKKL 168

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTN----------------------------- 158
             YD WS +  HVALDN L+ + ++ +  T                              
Sbjct: 169 VIYDQWSRLTVHVALDNVLITSDIRTMAFTRPPYRRSRRETESDYNDNLGTIDPTEAEIL 228

Query: 159 KRASSNP----------------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
            +++ +P                +W+PL+++IPLRLG+  IN  Y   I+  + L     
Sbjct: 229 PKSTRSPTRSETSSISSYSGVSEEWRPLLIIIPLRLGLNTINRCYFPAIQAFFEL----- 283

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-- 260
                                 PQ +G+IGG+PNHALYF G V N++++LDPH  QN   
Sbjct: 284 ----------------------PQCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQNFVD 321

Query: 261 ----GCVYDKEQD-----SEKKLDSTYHCPQASRLHILHMDPSIAVV----SQRSYSDYK 307
                   D+  D     +++  DSTYHCP      I  +DPS+A+     ++  YS+  
Sbjct: 322 LDEATTTKDERGDYVEIKNDEFRDSTYHCPFILSTKIDKVDPSLALGFFCHTEDDYSELA 381

Query: 308 N 308
           N
Sbjct: 382 N 382


>gi|149022064|gb|EDL78958.1| rCG26842 [Rattus norvegicus]
          Length = 246

 Score =  215 bits (547), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 98/203 (48%), Positives = 139/203 (68%), Gaps = 24/203 (11%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQV++KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLC---------------------TTNKRASSNP--QWQP 169
           W+S+  +V++DNT+V+  +KK+C                     ++  + +S P   W+P
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKP 223

Query: 170 LVLVIPLRLGIQDINPVYINGIK 192
           L+L++PLRLGI  INPVYI   K
Sbjct: 224 LLLIVPLRLGINQINPVYIEAFK 246


>gi|431912280|gb|ELK14417.1| Cysteine protease ATG4B [Pteropus alecto]
          Length = 431

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 124/334 (37%), Positives = 182/334 (54%), Gaps = 54/334 (16%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYRK F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 52  DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 111

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIA--------------LTGASEGKAVGEWFGP 117
              ++ ++Y  +L+ F DR+ + YSIHQIA              +  +  G  + + F  
Sbjct: 112 QRKRQPDSYFSVLRAFMDRKDSYYSIHQIAPVHPQSRFWRQSASVRTSVLGPQLCQSFAA 171

Query: 118 NTVAQVLR----------KLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
             +++  R          KLA +D WS++  H+A+DNT+V+  +    + ++  +  P  
Sbjct: 172 VRLSRRRRWELVTLSSPGKLAVFDTWSALAVHIAMDNTVVMEDI----SADRHCNGVPAG 227

Query: 168 QPLV------------LVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
             +             L+IPLRLG+ DIN  Y+  +K    L        V + S+  ++
Sbjct: 228 AEVTHRPPLPPWRPLVLLIPLRLGLTDINEAYVGTLKLASTL--------VGLCSAAASL 279

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
              ++ F  PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q    V D+        D
Sbjct: 280 PLRQHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEVADRRSIP----D 335

Query: 276 STYHCPQ-ASRLHILHMDPSIAVVSQRSYSDYKN 308
            ++HC    SR+ I  +DPSIA    ++  D+ +
Sbjct: 336 ESFHCQHPPSRMRIGELDPSIAGFFCQTEDDFDD 369


>gi|341903727|gb|EGT59662.1| CBN-ATG-4.1 protein [Caenorhabditis brenneri]
          Length = 433

 Score =  207 bits (526), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 118/324 (36%), Positives = 164/324 (50%), Gaps = 76/324 (23%)

Query: 35  GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAP 94
           G +G T+D+GWGCMLRC QM++ + LL  H+GR ++W++ +    Y KIL+MF D + A 
Sbjct: 44  GGTGPTSDQGWGCMLRCAQMLLGEVLLRRHIGRHFEWDIETTSVVYEKILQMFFDEKDAL 103

Query: 95  YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKL 154
           YSIHQIA  G +EGK + +WFGPNT AQVL+KL  +DDWS++  HVALDN LV      +
Sbjct: 104 YSIHQIAQMGVTEGKEISKWFGPNTAAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTM 163

Query: 155 CTTNKR--------------------ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            TT                        S   +W+PL+L+IPLRLG+  IN  Y+  I++ 
Sbjct: 164 ATTYPSEDAVKLIMENGQVEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCYLPAIQEF 223

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
           + L                           PQ +G+IGGKPN A YF+G  G  + +LDP
Sbjct: 224 FKL---------------------------PQCVGIIGGKPNLAHYFVGIAGTKLFYLDP 256

Query: 255 H---------------------TNQNIGCVYDKE----QDSE---KKLDSTYHCPQASRL 286
           H                      + N   + D E    Q S+   K  DSTYHC     +
Sbjct: 257 HYCRPKTSKVFAEKEPSTESEQHDTNFSELEDLEPLPSQTSDVYTKMDDSTYHCQMMQWM 316

Query: 287 HILHMDPSIAV-VSQRSYSDYKNV 309
               +DPS+A+ +   S  D+ N+
Sbjct: 317 EFESIDPSLALALFCESREDFDNL 340


>gi|47212536|emb|CAF90552.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 366

 Score =  204 bits (520), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 117/286 (40%), Positives = 154/286 (53%), Gaps = 61/286 (21%)

Query: 15  IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
           I  D+TSRLWFTYRKGF PIG +G T+D GWGCMLRCGQM++ QAL+  HLGRDW+W V+
Sbjct: 65  ILSDVTSRLWFTYRKGFPPIGGTGPTSDTGWGCMLRCGQMILGQALMCRHLGRDWRW-VS 123

Query: 75  SKEE--AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
            +E+   Y+ IL  F D++ + YSIHQI        +    W            + + + 
Sbjct: 124 GEEQRHEYVNILNAFIDKKDSYYSIHQIE-------RLCMPWLDKAEACAASEGVGELNG 176

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
           +                 ++  C  ++  ++   W+PLVL+IPLRLG+ DIN  YI  +K
Sbjct: 177 Y-----------------LEGACAFSEEETA--LWKPLVLLIPLRLGLTDINEAYIETLK 217

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
           KC+ L                           PQSLGVIGGKPN A YFIGYVG ++I+L
Sbjct: 218 KCFML---------------------------PQSLGVIGGKPNSAHYFIGYVGEELIYL 250

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
           DPHT Q      D  +D     D +YHC     R+HI  +DPSIA 
Sbjct: 251 DPHTTQT---AVDPCEDG-TFTDDSYHCQHPPCRMHICELDPSIAA 292


>gi|313228003|emb|CBY23152.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 117/295 (39%), Positives = 170/295 (57%), Gaps = 41/295 (13%)

Query: 9   HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
           +++LE I+ D  SRLWFTYR+ F  IG SG T+D+GWGCMLR GQM++A+ LL   LGR+
Sbjct: 36  YEELEDIQGDFQSRLWFTYRRNFASIGGSGPTSDQGWGCMLRAGQMLVAECLLRQRLGRN 95

Query: 69  WQWNVNS-KEEAYLKILKMFEDRRTAPYSIHQIALTGAS-EGKAVGEWFGPNTVAQVLRK 126
           + W+ +S ++E Y +IL++F D  +A  S+ QIALTGA+ E +AVGEWFGPNT+AQVL++
Sbjct: 96  YVWSESSIEDERYTEILELFRDTHSAELSLQQIALTGATAEKRAVGEWFGPNTMAQVLKR 155

Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPV 186
           + K       V  VA+D+ + V  V        + +      PLVL+IPLRLG+  +N +
Sbjct: 156 ITKSRSLGFGV-TVAMDSVVSVEDVSAEIINGGKPT------PLVLMIPLRLGLNSVNEI 208

Query: 187 YINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG 246
           Y+N +K                L+S Y              +G++GGKPN A YF+GY  
Sbjct: 209 YVNPLK--------------IFLASKY-------------CVGIMGGKPNQAHYFVGYQE 241

Query: 247 ND----VIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
                 +++LDPHT Q      +     E + D + H  +   +  L +DPS+AV
Sbjct: 242 TVEDTWLLYLDPHTTQQSPVSVNNNMPFE-QFDKSLHTDKLCWIKALKLDPSLAV 295


>gi|328769729|gb|EGF79772.1| hypothetical protein BATDEDRAFT_35298 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 441

 Score =  201 bits (511), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 106/285 (37%), Positives = 156/285 (54%), Gaps = 38/285 (13%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           D  SRLW TYRKGF  I  +G T D GWGCMLR GQM++A ALLF  LGRDW+   ++  
Sbjct: 141 DFHSRLWMTYRKGFAAIKPTGYTCDSGWGCMLRSGQMLVANALLFHELGRDWRLGDSNDR 200

Query: 78  E---AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWS 134
           +    Y  IL  F D  T+PYSI +IA  G    K +GEWFGP+T++QVL+ L   D   
Sbjct: 201 DTWLTYCSILTKFLDVNTSPYSIQRIATLGIRFDKQIGEWFGPSTISQVLKVLVNDDQRI 260

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
           S+  HV+ D  +  N++  + +  +     P    ++++IPLRLG++ +NPVY  G+K C
Sbjct: 261 SLKVHVSNDGVVYKNEINTILSATRDDGKTPA---VLIMIPLRLGVETMNPVYYPGVKHC 317

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
           +A+                              +G+ GG+PN +L+F+G  G+ +I+LDP
Sbjct: 318 FAM---------------------------SHCVGIAGGRPNSSLFFLGVDGDHLIYLDP 350

Query: 255 HTNQNIGCVYDKEQDSEKKLDS--TYHCPQASRLHILHMDPSIAV 297
           H   ++    D    +  K++   +YHC +   L I  MDPS+ +
Sbjct: 351 H---HLRPSVDSRDITSYKMEDLLSYHCEKVRLLPIASMDPSLVI 392


>gi|395545675|ref|XP_003774724.1| PREDICTED: cysteine protease ATG4A [Sarcophilus harrisii]
          Length = 431

 Score =  200 bits (509), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 105/255 (41%), Positives = 151/255 (59%), Gaps = 53/255 (20%)

Query: 65  LGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
           L  DW W  + ++ E Y +ILK F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQV
Sbjct: 131 LQADWGWEKHQEQPEEYQRILKCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQV 190

Query: 124 LRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC-------TTNKRASS------------- 163
           L+KLA +D+W+S+  +V++DNT+V+  +KK+C       T +  +SS             
Sbjct: 191 LKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCHMCPSDLTHDSSSSSYNGLDWNTDCPGQ 250

Query: 164 NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
              W+PL+L++PLRLGI  INP+Y +  K+C                           F 
Sbjct: 251 TSGWKPLLLIVPLRLGINQINPIYADAFKEC---------------------------FK 283

Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA 283
            PQSLG +GGKPN A YFIG++G+++I+LDPHT Q      D E++     D ++HC Q+
Sbjct: 284 MPQSLGALGGKPNSAYYFIGFLGDELIYLDPHTTQTF---VDTEENGTVN-DQSFHCQQS 339

Query: 284 -SRLHILHMDPSIAV 297
             R+ IL++DPS+A+
Sbjct: 340 PPRMKILNLDPSVAL 354


>gi|156396522|ref|XP_001637442.1| predicted protein [Nematostella vectensis]
 gi|156224554|gb|EDO45379.1| predicted protein [Nematostella vectensis]
          Length = 342

 Score =  196 bits (499), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 106/299 (35%), Positives = 163/299 (54%), Gaps = 45/299 (15%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ- 70
           LE+  R  TS +W TYR+ FV +  S LT+D GWGCMLR GQM++A  L+F  L +DW+ 
Sbjct: 51  LEEFHRHFTSLIWLTYRRSFVQLNGSNLTSDCGWGCMLRSGQMMLASGLIFHFLKKDWRI 110

Query: 71  ---WNVNSKEEAYLKILKMF---EDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVL 124
               +   +E  Y  IL+ F   +D   +P+S+H++   G   GK  G+W+GP +VA +L
Sbjct: 111 SGRCHSREQEHYYRVILQFFGDQDDEERSPFSLHRLVTLGQHTGKQAGDWYGPASVAHIL 170

Query: 125 RKL---AKYDDWSSIVFHVALDNTLVVNQVKKLCT---TNKRASSNPQWQPLVLVIPLRL 178
            K    A +     I  +VA D T+ +++VK++CT   T++R  S+ +W+P+++++P+RL
Sbjct: 171 EKAMISATHPLLHDINIYVAQDCTVYIDEVKRVCTHCRTHQRDCSSGKWRPVIILVPMRL 230

Query: 179 GIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHA 238
           G + +NP+YI  +K                             FT  Q +G+IGG+P H+
Sbjct: 231 GGEALNPIYIPCVKSL---------------------------FTLDQCIGIIGGRPKHS 263

Query: 239 LYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           LYF+G+    +I LDPH  Q    V D  Q  EK    ++HCP   +     MDPS  +
Sbjct: 264 LYFVGFQDEKMIHLDPHYCQP---VVDTTQ--EKFPTESFHCPNPRKTSFKKMDPSCTI 317


>gi|403289553|ref|XP_003935916.1| PREDICTED: cysteine protease ATG4A isoform 2 [Saimiri boliviensis
           boliviensis]
          Length = 360

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 112/286 (39%), Positives = 158/286 (55%), Gaps = 73/286 (25%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 68  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 127

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 128 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 187

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
           W+S+  +V++DNT+V                                I+DI        K
Sbjct: 188 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 208

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
            C  LP+S                TP             G +P  +L       +++IFL
Sbjct: 209 MCRVLPLS--------------ADTP-------------GDRPPDSLT-ASNESDELIFL 240

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           DPHT Q      D E++     D T+HC Q+  R++IL++DPS+A+
Sbjct: 241 DPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 282


>gi|296236154|ref|XP_002763201.1| PREDICTED: uncharacterized protein LOC100409486 [Callithrix
           jacchus]
          Length = 360

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 112/286 (39%), Positives = 158/286 (55%), Gaps = 73/286 (25%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 68  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 127

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 128 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 187

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
           W+S+  +V++DNT+V                                I+DI        K
Sbjct: 188 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 208

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
            C  LP+S                TP             G +P  +L       +++IFL
Sbjct: 209 MCRVLPLS--------------ADTP-------------GDRPPDSLT-ASNRSDELIFL 240

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           DPHT Q      D E++     D T+HC Q+  R++IL++DPS+A+
Sbjct: 241 DPHTTQTF---VDAEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 282


>gi|397497902|ref|XP_003819742.1| PREDICTED: cysteine protease ATG4A isoform 2 [Pan paniscus]
          Length = 336

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 112/286 (39%), Positives = 158/286 (55%), Gaps = 73/286 (25%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
           W+S+  +V++DNT+V                                I+DI        K
Sbjct: 164 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 184

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
            C  LP+S                TP             G +P  +L       +++IFL
Sbjct: 185 MCRVLPLS--------------ADTP-------------GDRPPDSLT-ASNQSDELIFL 216

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           DPHT Q      D E++     D T+HC Q+  R++IL++DPS+A+
Sbjct: 217 DPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 258


>gi|119623101|gb|EAX02696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_g
           [Homo sapiens]
          Length = 340

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 110/286 (38%), Positives = 156/286 (54%), Gaps = 73/286 (25%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 48  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 107

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 108 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 167

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
           W+S+  +V++DNT+V                                I+DI        K
Sbjct: 168 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 188

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
            C  LP+S                               G +P  +L       +++IFL
Sbjct: 189 MCRVLPLSA---------------------------DTAGDRPPDSLT-ASNQSDELIFL 220

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           DPHT Q      D E++     D T+HC Q+  R++IL++DPS+A+
Sbjct: 221 DPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 262


>gi|332226094|ref|XP_003262224.1| PREDICTED: cysteine protease ATG4A isoform 2 [Nomascus leucogenys]
          Length = 336

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 110/286 (38%), Positives = 156/286 (54%), Gaps = 73/286 (25%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
           W+S+  +V++DNT+V                                I+DI        K
Sbjct: 164 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 184

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
            C  LP+S                               G +P  +L       +++IFL
Sbjct: 185 MCCVLPLSA---------------------------DTAGDRPPDSLT-ASNQSDELIFL 216

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           DPHT Q      D E++     D T+HC Q+  R++IL++DPS+A+
Sbjct: 217 DPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 258


>gi|30795248|ref|NP_840054.1| cysteine protease ATG4A isoform b [Homo sapiens]
 gi|426397038|ref|XP_004064735.1| PREDICTED: cysteine protease ATG4A isoform 2 [Gorilla gorilla
           gorilla]
 gi|15487242|emb|CAC69077.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
 gi|119623095|gb|EAX02690.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_b
           [Homo sapiens]
          Length = 336

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 110/286 (38%), Positives = 156/286 (54%), Gaps = 73/286 (25%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
           W+S+  +V++DNT+V                                I+DI        K
Sbjct: 164 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 184

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
            C  LP+S                               G +P  +L       +++IFL
Sbjct: 185 MCRVLPLSA---------------------------DTAGDRPPDSLT-ASNQSDELIFL 216

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           DPHT Q      D E++     D T+HC Q+  R++IL++DPS+A+
Sbjct: 217 DPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 258


>gi|410989159|ref|XP_004000832.1| PREDICTED: cysteine protease ATG4A isoform 2 [Felis catus]
          Length = 336

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 109/286 (38%), Positives = 156/286 (54%), Gaps = 73/286 (25%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
           W+S+  +V++DNT+V                                I+DI        K
Sbjct: 164 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 184

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
            C  LP S         + T    TP          G +               +++IFL
Sbjct: 185 MCCVLPSS---------ADTVGESTP----------GTLNASNQ---------SDELIFL 216

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           DPHT Q     +   +++    D T+HC Q+  R++IL++DPS+A+
Sbjct: 217 DPHTTQ----TFVNTEENGTVDDQTFHCLQSPQRMNILNLDPSVAL 258


>gi|402911089|ref|XP_003918175.1| PREDICTED: cysteine protease ATG4A isoform 2 [Papio anubis]
          Length = 336

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 110/286 (38%), Positives = 155/286 (54%), Gaps = 73/286 (25%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
           W+S+  +V++DNT+V                                I+DI        K
Sbjct: 164 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 184

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
            C  LP+S                               G +P   L       +++IFL
Sbjct: 185 MCRVLPLSA---------------------------DTAGDRPLDYLT-ASNQSDELIFL 216

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           DPHT Q      D E++     D T+HC Q+  R++IL++DPS+A+
Sbjct: 217 DPHTTQTF---VDTEENGMVN-DQTFHCLQSPQRMNILNLDPSVAL 258


>gi|395854620|ref|XP_003799780.1| PREDICTED: cysteine protease ATG4A isoform 2 [Otolemur garnettii]
          Length = 336

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 110/286 (38%), Positives = 153/286 (53%), Gaps = 73/286 (25%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W  
Sbjct: 44  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103

Query: 74  NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             ++ + Y +IL+ F DR+   YSIHQ+A  G  EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
           W+S+  +V++D                                        N V I  IK
Sbjct: 164 WNSLAVYVSMD----------------------------------------NTVVIEDIK 183

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
           K           M  +L               P S    G  P  +L  +    N++IFL
Sbjct: 184 K-----------MCCVL---------------PSSADTAGESPPGSLTALNQ-SNELIFL 216

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           DPHT Q      D E++     D T+HC Q+  R++IL++DPS+A+
Sbjct: 217 DPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNILNLDPSVAL 258


>gi|256078123|ref|XP_002575347.1| autophagin-1 (C54 family) [Schistosoma mansoni]
 gi|360045353|emb|CCD82901.1| autophagin-1 (C54 family) [Schistosoma mansoni]
          Length = 556

 Score =  186 bits (473), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 78/153 (50%), Positives = 114/153 (74%), Gaps = 1/153 (0%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGD-SGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
           S  D E+I R + SRLW TYRKGF PIG  +G  +D GWGCM RCGQM++A+A+L  HLG
Sbjct: 31  SLDDREEIARHLNSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRFHLG 90

Query: 67  RDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
           R W+W+   +   Y ++L+MF+DRR+A YSI  I LTG S GK++G WFGPNTVAQVL+K
Sbjct: 91  RSWKWSPEQESPEYYRLLQMFQDRRSALYSIQTITLTGVSLGKSIGSWFGPNTVAQVLKK 150

Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNK 159
           L+ YD W+++  H+++++ ++++++K LC  ++
Sbjct: 151 LSVYDRWTNLFIHISVEDGIIIDEIKSLCCQHR 183



 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 42/76 (55%), Gaps = 5/76 (6%)

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
           F  P  +G++GG P HA++ +G   +DVI LDPHT Q  G       + +   D TYHC 
Sbjct: 351 FRLPHCVGILGGSPCHAVWIVGVTDDDVICLDPHTTQPAG-----RGNLKPDYDQTYHCD 405

Query: 282 QASRLHILHMDPSIAV 297
              R+ +  +DPS+ +
Sbjct: 406 NPIRIPLKRLDPSMVL 421


>gi|307174864|gb|EFN65142.1| Cysteine protease ATG4D [Camponotus floridanus]
          Length = 477

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 104/301 (34%), Positives = 158/301 (52%), Gaps = 52/301 (17%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           +E+ +RD  SR+W TYR+ F  +  S  TTD GWGCMLR GQM++AQAL+   LGR+W+W
Sbjct: 135 IEEFKRDFVSRIWLTYRREFQILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRW 194

Query: 72  NVNSKEEA---------YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
                 E          +  I+K F D+  +P+SIH++ L GAS GK  G+W+GP++VA 
Sbjct: 195 RPEQPIETLQQRLDDRNHRMIIKWFGDQSESPFSIHRLVLLGASAGKRAGDWYGPSSVAH 254

Query: 123 VLRKLAKY------DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
           +L +  +        ++  +  +VA D  + +  V+ +C T      + +W+ LVL++PL
Sbjct: 255 LLSQAVECASKQSNSNFDHLAVYVAQDCAVYLQDVENICRT-----PDGKWKALVLLVPL 309

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
           RLG   +NPVY                     L+S   + T          +GVIGG+P 
Sbjct: 310 RLGADKLNPVY------------------APCLTSLLTLDT---------CIGVIGGRPR 342

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
           H+LYFIGY  + +I LDPH  Q    V+  +        +++HC    ++ +  MDPS  
Sbjct: 343 HSLYFIGYQDDKLIHLDPHYCQETVDVWKNDFSL-----TSFHCTSPRKMLLSKMDPSCC 397

Query: 297 V 297
           V
Sbjct: 398 V 398


>gi|402581511|gb|EJW75459.1| peptidase family C54 containing protein [Wuchereria bancrofti]
          Length = 256

 Score =  184 bits (467), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 80/150 (53%), Positives = 108/150 (72%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S  D+ +++  + S LWFTYRK F PIG +G TTD+GWGCMLRCGQM++A+ L+  HLG 
Sbjct: 36  SRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLIVRHLGH 95

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           +W W+ + K   Y +IL+MF+D++   +SIHQIA  G SEGK +GEWFGPNT AQVL+KL
Sbjct: 96  NWLWDRDVKLTEYKRILRMFQDKKNCLFSIHQIANMGVSEGKEIGEWFGPNTAAQVLKKL 155

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTT 157
             YD WS +  HVALDN L+ + ++ +  T
Sbjct: 156 VIYDQWSRLTVHVALDNVLITSDIRTMAFT 185


>gi|320166566|gb|EFW43465.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 336

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 105/287 (36%), Positives = 153/287 (53%), Gaps = 39/287 (13%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D + +   + S  W TYR  F  I DS   TD GWGCMLRCGQM++A+A+   HLG++W 
Sbjct: 22  DEQALEHAVRSFPWMTYRNHFAQIADSYYNTDAGWGCMLRCGQMLLARAMTVQHLGKNWA 81

Query: 71  -WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
             +   + +   + L +F D   AP+SIH+IA  G + GK +G+WFGPNTVAQVL+ L  
Sbjct: 82  PTSRKQRHQEMARFLPLFFDTPAAPFSIHRIAERGEALGKTIGQWFGPNTVAQVLKNLVN 141

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI-QDINPVYI 188
               SS++ H A+D   V+N+ +   T    A S+ +   L++++P+RLG+ Q INPVYI
Sbjct: 142 -SQRSSLIVHCAMDG--VLNRTEA-STQLAAALSDGKKHSLLVLVPIRLGLNQSINPVYI 197

Query: 189 NGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
             +K    L                           PQ LG+IGGKPN A +F+G V  +
Sbjct: 198 PALKATLEL---------------------------PQCLGIIGGKPNAAHFFVGTVNEN 230

Query: 249 VIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
           V++LDPH       V D   +       ++     S++ I  +DPS+
Sbjct: 231 VLYLDPHV------VQDAAMELTPDTVESFSVAVLSKMAISDVDPSM 271


>gi|391340875|ref|XP_003744760.1| PREDICTED: cysteine protease ATG4D-like [Metaseiulus occidentalis]
          Length = 488

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 101/326 (30%), Positives = 170/326 (52%), Gaps = 60/326 (18%)

Query: 5   NKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH 64
           NK +    +    D ++RLWFTYR+ F P+  +G T+D GWGCMLR  QM++A+A +F  
Sbjct: 138 NKNNSASFKNFFADFSTRLWFTYRQDFQPMQSTGHTSDSGWGCMLRSAQMMLAEAFIFHL 197

Query: 65  LGRDWQWNVNSKEE---AYLKILKMFE---DRRTAPYSIHQIALTGASEGKAVGEWFGPN 118
           LGR W+W    +++    + KI+K F    D   AP+S+H +    A  GK  G+WFGP+
Sbjct: 198 LGRQWRWCPQQQQQEHGVHRKIIKWFSDDPDTTEAPFSVHNMVRAAAHCGKKAGDWFGPS 257

Query: 119 TVAQVLRKLAKY--------DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPL 170
           T A +L++  +         + +  +  +VA D T+    V  LCT++     N +W+ +
Sbjct: 258 TAAYLLKRCLEEAAGVADSKEIFEQMAIYVAQDCTIYTQDVLDLCTSDP----NIEWKSV 313

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           VL+IP+RLG + +N  YI+ IK+  A                           +   LG+
Sbjct: 314 VLLIPVRLGGERVNVNYIHCIKEILA---------------------------YQNCLGI 346

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD---STYHCPQASRLH 287
           IGGKP H+LYF+G+ G  +++LDPH        Y ++     +L+   +++HC  A ++ 
Sbjct: 347 IGGKPRHSLYFVGFQGKKLVYLDPH--------YLQKTTDTSRLNFSVNSFHCTTARKVS 398

Query: 288 ILHMDPSIAV----VSQRSYSDYKNV 309
              +DPS  +     ++R +  ++++
Sbjct: 399 FSKLDPSATIGFYCKTRRDFESFQSI 424


>gi|332026942|gb|EGI67039.1| Cysteine protease ATG4D [Acromyrmex echinatior]
          Length = 392

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 107/296 (36%), Positives = 162/296 (54%), Gaps = 46/296 (15%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           +E+ +RD  SRLW TYR+ F  +  S  TTD GWGCMLR GQM++AQAL+   LGR+W+W
Sbjct: 55  IEEFKRDFMSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRW 114

Query: 72  --NVNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
               ++ E ++  I+K F D+ T  +P+SIH++   GAS GK  G+W+GP++VA +L + 
Sbjct: 115 RPEQSTDESSHRMIIKWFGDQPTPESPFSIHKLVSLGASTGKRAGDWYGPSSVAHLLCQA 174

Query: 128 AK--YDDWSS----IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQ 181
            +   +D +S    +  +VA D  + +  V+ +C T          + L+L++PLRLG  
Sbjct: 175 MERASEDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGGR-----KALILLVPLRLGAD 229

Query: 182 DINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYF 241
            +NPVY                     L+S   + T          +GVIGG+P H+LYF
Sbjct: 230 KLNPVY------------------APCLTSLLTLDT---------CIGVIGGRPRHSLYF 262

Query: 242 IGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           IGY  + +I LDPH  Q    V    + +EK   +++HC    ++ +  MDPS  V
Sbjct: 263 IGYQDDKLIHLDPHYCQETVDV----EGNEKFPLTSFHCTSPRKMLLSKMDPSCCV 314


>gi|76156435|gb|AAX27646.2| SJCHGC05841 protein [Schistosoma japonicum]
          Length = 414

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 74/146 (50%), Positives = 110/146 (75%), Gaps = 1/146 (0%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGD-SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           D E+I   + SRLW TYRKGF PIG  +G  +D GWGCM RCGQM++A+A+L +HLGR W
Sbjct: 40  DREEIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSW 99

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           +W+   +   Y ++L+MF+DRR+  YSI  I LTG S GK++G WFGPNT+AQVL+KL+ 
Sbjct: 100 RWSPEQESPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSV 159

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLC 155
           YD W+++  H+++++ ++++++K LC
Sbjct: 160 YDRWTNLFVHISVEDGIIIDEIKSLC 185



 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 31/52 (59%), Gaps = 3/52 (5%)

Query: 144 NTLVVNQVKKLCTTNKRASSNP---QWQPLVLVIPLRLGIQDINPVYINGIK 192
           N +       +C ++  +S+NP    W+PL+L +PLRLG+ + NP Y N IK
Sbjct: 361 NQINSTTAASVCESSSLSSTNPPSSNWRPLLLFVPLRLGLHNPNPCYFNAIK 412


>gi|328786958|ref|XP_393739.4| PREDICTED: cysteine protease ATG4D-like [Apis mellifera]
          Length = 525

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 105/310 (33%), Positives = 166/310 (53%), Gaps = 54/310 (17%)

Query: 5   NKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
           + +S +D +E+ ++D TSRLW TYR+ F  +  S  TTD GWGCMLR GQM++AQAL+  
Sbjct: 173 DAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCH 232

Query: 64  HLGRDWQWNVNS---------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
            LGR+W+W  +           E  +  I+K F D   RT+P+SIH +   GA  GK  G
Sbjct: 233 FLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKRAG 292

Query: 113 EWFGPNTVAQVLRK-----LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
           +W+GP++VA +L +     + ++  ++++  +VA D  + +  ++ +C T     S+ +W
Sbjct: 293 DWYGPSSVAHLLSQAVENAVERHPAFNNLAVYVAQDCAVYLQDIENVCQT-----SDGKW 347

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           + L+L +PLRLG   +NPVY + +                            +  T    
Sbjct: 348 KSLILFVPLRLGADKLNPVYTSCLT---------------------------HLLTLDTC 380

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           +GVIGG+P H+LYFIG+  + +I LDPH  Q      D  +D+     +++HC    ++ 
Sbjct: 381 IGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE---TVDVLKDNFSL--TSFHCTSPRKML 435

Query: 288 ILHMDPSIAV 297
           I  MDPS  V
Sbjct: 436 ISKMDPSCCV 445


>gi|257205644|emb|CAX82473.1| autophagy-related cysteine endopeptidase 2 [Schistosoma japonicum]
          Length = 632

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 74/146 (50%), Positives = 110/146 (75%), Gaps = 1/146 (0%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGD-SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           D E+I   + SRLW TYRKGF PIG  +G  +D GWGCM RCGQM++A+A+L +HLGR W
Sbjct: 40  DREEIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSW 99

Query: 70  QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
           +W+   +   Y ++L+MF+DRR+  YSI  I LTG S GK++G WFGPNT+AQVL+KL+ 
Sbjct: 100 RWSPEQESPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSV 159

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLC 155
           YD W+++  H+++++ ++++++K LC
Sbjct: 160 YDRWTNLFVHISVEDGIIIDEIKSLC 185



 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 49/157 (31%), Positives = 74/157 (47%), Gaps = 35/157 (22%)

Query: 144 NTLVVNQVKKLCTTNKRASSNP---QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
           N +       +C ++  +S+NP    W+PL+L +PLRLG+ + NP Y N IK        
Sbjct: 361 NQINSTTAASVCESSSLSSTNPPSSNWRPLLLFVPLRLGLHNPNPCYFNAIKAV------ 414

Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI 260
                                F  P  +G++GG P HA++ +G  G+DVI LDPHT Q  
Sbjct: 415 ---------------------FRLPNCIGILGGSPCHAVWIVGVTGDDVICLDPHTTQPA 453

Query: 261 GCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           G       + +   D TYHC    R+ +  +DPS+ +
Sbjct: 454 G-----RGNLKPDYDQTYHCENPIRMPLKRLDPSMVL 485


>gi|189233733|ref|XP_971091.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270015047|gb|EFA11495.1| hypothetical protein TcasGA2_TC014208 [Tribolium castaneum]
          Length = 453

 Score =  180 bits (457), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 103/304 (33%), Positives = 160/304 (52%), Gaps = 53/304 (17%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +  E  ++D  SRLW TYR+ F  +  S  ++D GWGCMLR GQM+IAQAL+   LGRDW
Sbjct: 107 EGFEGFKKDFISRLWLTYRREFPILNGSNYSSDCGWGCMLRSGQMLIAQALVCHILGRDW 166

Query: 70  QWNVN---SKEEAYL------KILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPN 118
           +W  +   +  E+++      KI+K F D+  R +P+SIH +   G + GK  G+W+GP 
Sbjct: 167 RWQPDHQPTTRESFIEVVNHRKIIKWFGDKPSRNSPFSIHTLVALGEASGKKAGDWYGPG 226

Query: 119 TVAQVLR---KLAKYDDWS--SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
            VA + R   K A  D++   S+   VA D  + +  V + CT       N +W+ L+L+
Sbjct: 227 FVAHLFRQAFKRASEDNYEFDSLTVCVAQDCAVYIKDVMEECTDK-----NGKWKSLILL 281

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           IP+RLG +  N +Y            +P    +               F+  Q +G+IGG
Sbjct: 282 IPVRLGAEKFNSIY------------APCLTTL---------------FSLKQCIGIIGG 314

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P H+LYF+GY  + +I LDPH  Q +  V+  +        +++HC    ++H+  MDP
Sbjct: 315 RPKHSLYFVGYQDDKLIHLDPHYCQEVVDVWAVDFPL-----TSFHCRSPRKIHLSKMDP 369

Query: 294 SIAV 297
           S  +
Sbjct: 370 SCCI 373


>gi|427787309|gb|JAA59106.1| Putative peptidase family c54 [Rhipicephalus pulchellus]
          Length = 517

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 101/296 (34%), Positives = 161/296 (54%), Gaps = 44/296 (14%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E    D +SRLWFTYR+ F PI  + +T+D GWGCMLR  QM++AQA++   LGR W++ 
Sbjct: 179 ELFLEDFSSRLWFTYRREFPPIPGTDITSDCGWGCMLRSSQMMLAQAVVTHVLGRQWRYR 238

Query: 73  VNSKEEA----YLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
            N++ EA    + +++++F DR    +P+S+H++   G   GK  G+W+GP++ A +L++
Sbjct: 239 RNNQTEASDYVHRQVVRLFGDRTASASPFSLHKLVQMGHESGKQAGDWYGPSSAAYILKE 298

Query: 127 LAKYDDWSS-----IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQ 181
             +    +      +  +VA D T+ +  V+ LC    R++  P W+ +++++P+RLG +
Sbjct: 299 ALEGACQTEQLLLDLRIYVAQDCTIYLEDVRALC-RGTRSNGAPLWRSVIILVPVRLGGE 357

Query: 182 DINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYF 241
            +NP YI  +K                              + P  +GVIGG+P H+LYF
Sbjct: 358 QLNPTYIPCVKGM---------------------------LSHPNCIGVIGGRPRHSLYF 390

Query: 242 IGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           +G+ G  VI+LDPH  Q    V    QD    LDS YHC    ++    MDPS  +
Sbjct: 391 LGWQGEKVIYLDPHYVQE--AVDVGPQDF--PLDS-YHCSWPRKMSFYKMDPSCTM 441


>gi|157126425|ref|XP_001660889.1| hypothetical protein AaeL_AAEL010516 [Aedes aegypti]
 gi|108873276|gb|EAT37501.1| AAEL010516-PA [Aedes aegypti]
          Length = 583

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 101/338 (29%), Positives = 163/338 (48%), Gaps = 81/338 (23%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +D+E  +RD  +RLW TYRK F  + DS  T+D GWGCM+R GQM++AQ LL   LGR+W
Sbjct: 165 EDIEAFKRDFVTRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLLVHFLGRNW 224

Query: 70  QWN------------VNSKEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWF 115
           +W+            +N ++  + KI++ F D   RT+P+SIH +   G   GK  G+W+
Sbjct: 225 RWDATAESLRMNYHSLNYEDNVHRKIIRWFGDTSSRTSPFSIHTLVALGKETGKKPGDWY 284

Query: 116 GPNTVAQVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----- 165
           GP +VA +LR+  K       D   +  +VA D  + +  +   CT +   +  P     
Sbjct: 285 GPGSVAHLLRQAVKLAAQEISDLDGVNVYVAQDCAVYIQDIIDECTVSAGPTLAPWQKKS 344

Query: 166 --------------------------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
                                      W+ L+L++PLRLG + +NP+Y + +K   +L  
Sbjct: 345 PGSSSSSTTSTSNSNPTTSSSTDSTDHWKSLILLVPLRLGAEKLNPIYSDCLKAMLSLD- 403

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
                                       +G+IGG+P H+LYF+G+  + +I LDPH  Q+
Sbjct: 404 --------------------------NCIGIIGGRPKHSLYFVGFQEDKLIHLDPHYCQD 437

Query: 260 IGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           +  V ++E        +++HC    ++ +  MDPS  +
Sbjct: 438 MVDVVNQENFPV----ASFHCKSPRKMKLSKMDPSCCI 471


>gi|307205961|gb|EFN84087.1| Cysteine protease ATG4D [Harpegnathos saltator]
          Length = 456

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 103/303 (33%), Positives = 152/303 (50%), Gaps = 54/303 (17%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           +E+ +RD  SRLW TYR+ F  +  S  TTD GWGCMLR GQM++AQAL+   LGR+W+W
Sbjct: 111 IEEFKRDFASRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWKW 170

Query: 72  NVNSKEEA---------YLKILKMFEDRR--TAPYSIHQIALTGASEGKAVGEWFGPNTV 120
                 E          +  I+K F D+    +P+SIH++   GAS GK  G+W+GPN+V
Sbjct: 171 RPEQSIENTQQMRDDSNHRMIIKWFADQSKPESPFSIHRLVSLGASTGKRAGDWYGPNSV 230

Query: 121 AQVLRKLAKY------DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
           A +L +  +          S +  +VA D  + +  V+++C T     S+  W+ L+L++
Sbjct: 231 AHLLSQAVERTGELPNSKLSRLAVYVAQDCAVYMQDVEEVCRT-----SDGGWKSLILLV 285

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
           PL LG   +NPVY   +                               T    +GVIGG+
Sbjct: 286 PLMLGTDKLNPVYAPCVTSL---------------------------LTLDACIGVIGGR 318

Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
           P H+LYFIGY  + +I LDPH      C    +   E    +++HC    ++ +  MDPS
Sbjct: 319 PRHSLYFIGYQDDKLIHLDPHY-----CQETVDVSKENFPLTSFHCTSPRKMLLSKMDPS 373

Query: 295 IAV 297
             V
Sbjct: 374 CCV 376


>gi|328874598|gb|EGG22963.1| hypothetical protein DFA_05093 [Dictyostelium fasciculatum]
          Length = 432

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 106/314 (33%), Positives = 168/314 (53%), Gaps = 66/314 (21%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH-LGRDWQ 70
           +E+   D +++LW +YR+GF  IGDS    D GWGCMLR GQM++A  LL    +G+DW+
Sbjct: 88  IEEFLEDFSNKLWCSYRQGFECIGDSLFENDCGWGCMLRSGQMLLANVLLLNSPIGKDWK 147

Query: 71  WNVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLA 128
              N +  E + K++++F DR +AP+SIH IAL G +  GK++GEWF P+ ++  +R L 
Sbjct: 148 KPQNGEYPEDFYKVVRLFLDRPSAPFSIHNIALHGRNHLGKSIGEWFAPSNISNAIRALV 207

Query: 129 -KY------------------------DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASS 163
            KY                        D+  ++  +V+ D +L ++Q+ ++        S
Sbjct: 208 YKYDNHLNGTSEEDSSDEEKEGKKKKGDNQCNLSVYVSDDGSLYIDQLLEIALR-----S 262

Query: 164 NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
           +  W PL+++IP +LGI  IN +Y             P+ D+                +T
Sbjct: 263 DGSWMPLLILIPTKLGIDTINEIYYR-----------PLLDI----------------YT 295

Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA 283
           FPQ+LG++GGKP  +LYFI    +++ +LDPHT QN       E DS+  L S+Y C   
Sbjct: 296 FPQNLGIVGGKPRASLYFIASQDDNLFYLDPHTVQN-----SIESDSDFSL-SSYFCNIP 349

Query: 284 SRLHILHMDPSIAV 297
            + +I  +DPS+ +
Sbjct: 350 KKANISEVDPSLVI 363


>gi|380015613|ref|XP_003691794.1| PREDICTED: cysteine protease ATG4D-like [Apis florea]
          Length = 486

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 104/310 (33%), Positives = 164/310 (52%), Gaps = 54/310 (17%)

Query: 5   NKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
           + +S +D +E+ ++D TSRLW TYR+ F  +  S  TTD GWGCMLR GQM++AQAL+  
Sbjct: 134 DAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCH 193

Query: 64  HLGRDWQWNVNS---------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
            LGR+W+W  +           E  +  I+K F D   RT+P+SIH +   GA  GK  G
Sbjct: 194 FLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKRAG 253

Query: 113 EWFGPNTVAQVLRKLA-----KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
           +W+GP++VA +L +       ++  ++++  +VA D  + +  ++ +C T      + +W
Sbjct: 254 DWYGPSSVAHLLSQAVENAAERHPAFNNLAVYVAQDCAVYLQDIENVCQT-----PDGKW 308

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           + L+L +PLRLG   +NPVY + +                            +  T    
Sbjct: 309 KSLILFVPLRLGADKLNPVYTSCLT---------------------------HLLTLDTC 341

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           +GVIGG+P H+LYFIG+  + +I LDPH  Q      D  +D+     +++HC    ++ 
Sbjct: 342 IGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE---TVDVLKDNFSL--TSFHCTSPRKML 396

Query: 288 ILHMDPSIAV 297
           I  MDPS  V
Sbjct: 397 ISKMDPSCCV 406


>gi|194213171|ref|XP_001491090.2| PREDICTED: cysteine protease ATG4D [Equus caballus]
          Length = 424

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 104/327 (31%), Positives = 158/327 (48%), Gaps = 72/327 (22%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S  D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL  +L R
Sbjct: 54  SEGDIQRFQRDFASRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHYLPR 113

Query: 68  DWQWNVN------------------------------------SKEEAYLKILKMFEDRR 91
           DW W                                       S+E  + +I+  F D  
Sbjct: 114 DWTWAEGAGLGPPEPVGLSSPNRYRGPARWMAPTLGPGAPPSWSRERRHRQIVSWFADHP 173

Query: 92  TAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQ 150
            AP+ +HQ+   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    
Sbjct: 174 RAPFGLHQLVELGQSSGKKAGDWYGPSLVAHILRKAVESCAEVTRLVVYVSQDCTVYKAD 233

Query: 151 VKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILS 210
           V +L     R     +W+ +V+++P+RLG + +NPVY+  +K              ++L 
Sbjct: 234 VARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLR 276

Query: 211 STYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDS 270
           S                LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   
Sbjct: 277 SEL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSRADFPL 323

Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
           E     ++HC    ++    MDPS  V
Sbjct: 324 E-----SFHCTSPRKMAFTKMDPSCTV 345


>gi|350426238|ref|XP_003494376.1| PREDICTED: cysteine protease ATG4D-like [Bombus impatiens]
          Length = 486

 Score =  177 bits (448), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 105/310 (33%), Positives = 163/310 (52%), Gaps = 54/310 (17%)

Query: 5   NKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
           + +S +D +E+ ++D TSRLW TYR+ F  +  S  TTD GWGCMLR GQM++AQAL+  
Sbjct: 134 DAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCH 193

Query: 64  HLGRDWQWNVNS---------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
            LGR+W+W V+           E  +  I+K F D    T+P+SIH +   GA  GK  G
Sbjct: 194 FLGREWRWQVDQPLKTEQQKLDEHNHRLIIKSFGDLPDSTSPFSIHTLVSLGALWGKRAG 253

Query: 113 EWFGPNTVAQVLRKLA-----KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
           +W+GP++VA +L +       ++  +S++  +VA D  + +  V+ +C        + +W
Sbjct: 254 DWYGPSSVAHLLSQAVEQAAERHPVFSNLAVYVAQDCAVYLQDVENVCQM-----PDGKW 308

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           + L+L +PLRLG   +NPVY + +                            +  T    
Sbjct: 309 KSLILFVPLRLGADKLNPVYASCLT---------------------------HLLTLNTC 341

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           +GVIGG+P H+LYFIG+  + +I LDPH  Q      D  +D+     +++HC    ++ 
Sbjct: 342 IGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE---TVDVLKDNFPL--TSFHCTSPRKML 396

Query: 288 ILHMDPSIAV 297
           I  MDPS  V
Sbjct: 397 ISKMDPSCCV 406


>gi|417401539|gb|JAA47652.1| Putative cysteine protease required for autophagy [Desmodus
           rotundus]
          Length = 473

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 105/326 (32%), Positives = 159/326 (48%), Gaps = 71/326 (21%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S  D+++ +RD  SRLW TYR+ F P     LT+D GWGCMLR GQM++AQ LL   L R
Sbjct: 104 SEGDIQRFQRDFVSRLWLTYRRDFPPFAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 163

Query: 68  DWQWNVNS-----------------------------------KEEAYLKILKMFEDRRT 92
           DW W   +                                   +E  + +I+  F D   
Sbjct: 164 DWTWARGASLSPPEPSGLASSNRYRGPAHCMTPCWAQRAPELEQERRHRQIVSWFADHPQ 223

Query: 93  APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQV 151
           AP+ +HQ+   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V
Sbjct: 224 APFGLHQLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADV 283

Query: 152 KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
            +L     R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S
Sbjct: 284 ARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRS 326

Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
                           LG++GGKP H+LYFIGY  + +++LDPH  Q      D  Q ++
Sbjct: 327 EL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---AVDVSQ-AD 369

Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
             L+S +HC    ++    MDPS  V
Sbjct: 370 FPLES-FHCTSPRKMAFAKMDPSCTV 394


>gi|410950450|ref|XP_003981918.1| PREDICTED: cysteine protease ATG4D, partial [Felis catus]
          Length = 423

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 103/326 (31%), Positives = 158/326 (48%), Gaps = 71/326 (21%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S  D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L R
Sbjct: 54  SEGDIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 113

Query: 68  DWQWNVNS-----------------------------------KEEAYLKILKMFEDRRT 92
           DW W+  S                                   +E  + +I+  F D   
Sbjct: 114 DWTWSEASGLGPSEPSGLASPNRYRGPARWMPPRWAQGTPELEQERRHRQIVSWFADHPQ 173

Query: 93  APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQV 151
           AP+ +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V
Sbjct: 174 APFGLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADV 233

Query: 152 KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
            +L     R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S
Sbjct: 234 ARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRS 276

Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
                           LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E
Sbjct: 277 EL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE 323

Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
                ++HC    ++    MDPS  V
Sbjct: 324 -----SFHCTSPRKMAFAKMDPSCTV 344


>gi|395850895|ref|XP_003798008.1| PREDICTED: cysteine protease ATG4D [Otolemur garnettii]
          Length = 471

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 105/321 (32%), Positives = 160/321 (49%), Gaps = 69/321 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLWFTYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 107 DIQRFQRDFVSRLWFTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 166

Query: 71  WNVN---------------------------------SKEEAYLKILKMFEDRRTAPYSI 97
           W                                     +E  + +I+  F D   AP+S+
Sbjct: 167 WAEGRGLGPPELLASPSQYRVPARWMPPRWAQGTPELEQEHQHRQIVSWFADHPQAPFSL 226

Query: 98  HQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
           H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L  
Sbjct: 227 HRLVELGQSLGKKAGDWYGPSVVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVA 286

Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
              R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S     
Sbjct: 287 ---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL--- 326

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
                      LG++GGKP H+LYFIGY  + +++LDPH  Q      D  Q ++  L+S
Sbjct: 327 ----------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDISQ-ADFPLES 372

Query: 277 TYHCPQASRLHILHMDPSIAV 297
            +HC    ++    MDPS  V
Sbjct: 373 -FHCTAPRKMAFTKMDPSCTV 392


>gi|178057055|ref|NP_001116551.1| cysteine protease ATG4D [Sus scrofa]
 gi|61211337|sp|Q684M2.1|ATG4D_PIG RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related protein 4 homolog D
 gi|51870495|emb|CAG15153.1| AUT-like 4, cysteine endopeptidase [Sus scrofa]
          Length = 469

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 156/319 (48%), Gaps = 67/319 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 166

Query: 71  WNVN-------------------------------SKEEAYLKILKMFEDRRTAPYSIHQ 99
           W+                                  +E  + +I+  F D   AP+ +H+
Sbjct: 167 WSQGVGLGPPESSPNRYRGPAHWMPPHWVQAAPELEQERRHRQIVSWFADHPRAPFGLHR 226

Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN 158
           +   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L    
Sbjct: 227 LVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVA-- 284

Query: 159 KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
            R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S       
Sbjct: 285 -RPDPTAEWKAVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL----- 324

Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTY 278
                    LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E     ++
Sbjct: 325 --------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE-----SF 371

Query: 279 HCPQASRLHILHMDPSIAV 297
           HC    ++    MDPS  V
Sbjct: 372 HCTSPRKMAFTKMDPSCTV 390


>gi|344282757|ref|XP_003413139.1| PREDICTED: cysteine protease ATG4D-like [Loxodonta africana]
          Length = 473

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 103/326 (31%), Positives = 157/326 (48%), Gaps = 71/326 (21%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S  D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L R
Sbjct: 104 SEGDIQRFQRDFMSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMLLAQGLLLHFLPR 163

Query: 68  DWQWNVNS-----------------------------------KEEAYLKILKMFEDRRT 92
           DW W   S                                   +E  + +I+  F D   
Sbjct: 164 DWTWAEGSGLGPPELSGSASPSRYRGPARRVPPHWAQCTPELEQEHWHRQIVSWFADHPQ 223

Query: 93  APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQV 151
           AP+ +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V
Sbjct: 224 APFGLHRLVALGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADV 283

Query: 152 KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
            +L     R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S
Sbjct: 284 ARLVA---RPDPKAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRS 326

Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
                           LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E
Sbjct: 327 EL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPSVDVSQADFSLE 373

Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
                ++HC    ++    MDPS  V
Sbjct: 374 -----SFHCTSPRKMAFTKMDPSCTV 394


>gi|291414155|ref|XP_002723329.1| PREDICTED: APG4 autophagy 4 homolog D [Oryctolagus cuniculus]
          Length = 408

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 104/321 (32%), Positives = 160/321 (49%), Gaps = 69/321 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ+LL   L RDW 
Sbjct: 44  DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQSLLLHFLPRDWT 103

Query: 71  W--NVNSKEEA-------------------------------YLKILKMFEDRRTAPYSI 97
           W   + S E A                               + +I+  F D   AP+ +
Sbjct: 104 WAEGLGSAEPAGSASPSRYRGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPGAPFGL 163

Query: 98  HQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
           H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L  
Sbjct: 164 HRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVA 223

Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
              R     +W+ +V+++P+RLG + +NPVY+  +K+   L +                 
Sbjct: 224 ---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELLRLEL----------------- 263

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
                      LG++GGKP H+LYFIGY  + +++LDPH  Q      D  Q ++  L+S
Sbjct: 264 ----------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQ-TDFPLES 309

Query: 277 TYHCPQASRLHILHMDPSIAV 297
            +HC    ++    MDPS  V
Sbjct: 310 -FHCTSPRKMAFAKMDPSCTV 329


>gi|194378178|dbj|BAG57839.1| unnamed protein product [Homo sapiens]
          Length = 411

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 154/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 45  DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 104

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 105 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 164

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    D + +V +V+ D T+    V +L
Sbjct: 165 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARL 224

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 225 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 262

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 263 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 311

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    R+    MDPS  V
Sbjct: 312 --SFHCTSPRRMAFAKMDPSCTV 332


>gi|395512609|ref|XP_003760528.1| PREDICTED: cysteine protease ATG4D [Sarcophilus harrisii]
          Length = 453

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 100/323 (30%), Positives = 159/323 (49%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL     RDW 
Sbjct: 87  DIQRFQRDFVSRLWLTYRRDFPPLEGGSLTSDCGWGCMLRSGQMLLAQGLLLHFFSRDWT 146

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W+                                      +EE + +I+  F D+  AP+
Sbjct: 147 WSEAVLHPGPREPELLRTMSPSRVGPPGPPAGALSPREFEQEEQHRRIVSWFADQPGAPF 206

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +  +V+ D T+    V +L
Sbjct: 207 GLHRLVELGRSSGKRAGDWYGPSVVAHILRKAVESSSEVAQLEVYVSQDCTVYKADVAQL 266

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                +   + +W+ +V+++P+RLG + +NPVY+  +K+   L +               
Sbjct: 267 VA---QPDPSTEWKSIVILVPVRLGGETLNPVYVPCVKELLRLEL--------------- 308

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        +G+IGGKP H+LYFIGY  + +++LDPH  Q      D  Q+S   L
Sbjct: 309 ------------CIGIIGGKPRHSLYFIGYQDDFLLYLDPHYCQPF---VDTSQES-FPL 352

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
           +S +HC    ++    MDPS  +
Sbjct: 353 ES-FHCTSPRKMAFSRMDPSCTI 374


>gi|281337397|gb|EFB12981.1| hypothetical protein PANDA_010312 [Ailuropoda melanoleuca]
          Length = 428

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 156/323 (48%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 62  DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 121

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   S                                   +E  + +I+  F D   AP+
Sbjct: 122 WAEGSAPSPSEPSGLASPNRYRGPARWMPPRWAQGTPELEQERRHRQIVSWFADHPQAPF 181

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 182 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARL 241

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S   
Sbjct: 242 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 283

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 284 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 328

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 329 --SFHCTSPRKMAFAKMDPSCTV 349


>gi|351695136|gb|EHA98054.1| Cysteine protease ATG4A [Heterocephalus glaber]
          Length = 356

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 86/201 (42%), Positives = 120/201 (59%), Gaps = 55/201 (27%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           ++  DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGR      
Sbjct: 94  KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGR------ 147

Query: 74  NSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
                                      A  G  EGK+VGEWFGPNTVAQVL+KLA +D+W
Sbjct: 148 ---------------------------AQMGVGEGKSVGEWFGPNTVAQVLKKLALFDEW 180

Query: 134 SSIVFHVALDNTLVVNQVKKLC-------------------TTNKRASSN---PQWQPLV 171
           +S+  +V++DNT+V+  +KK+C                   T+N+   ++   P W+PL+
Sbjct: 181 NSLAVYVSMDNTVVIEDIKKMCRILPFSADTADESPPDSFITSNQSKGTSAFCPAWKPLL 240

Query: 172 LVIPLRLGIQDINPVYINGIK 192
           L++PLRLGI  INPVY++  K
Sbjct: 241 LIVPLRLGINQINPVYVDAFK 261


>gi|195113543|ref|XP_002001327.1| GI10728 [Drosophila mojavensis]
 gi|193917921|gb|EDW16788.1| GI10728 [Drosophila mojavensis]
          Length = 682

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 104/319 (32%), Positives = 165/319 (51%), Gaps = 63/319 (19%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM++AQ L+   LGR W
Sbjct: 274 EGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 333

Query: 70  QWNVNS------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++  S      ++  + KI+K F D   + +P+SIH +   G   GK  G+W+GP +V+
Sbjct: 334 RYDAESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGKKPGDWYGPASVS 393

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCT-------------TNKRASS 163
            +L+   ++      D+ +I  +VA D T+ +  +++LC+               KR++S
Sbjct: 394 YLLKHALEHAARENADFDNISVYVAKDCTIYIQDIEELCSIPEPAPKPHVPWQQAKRSTS 453

Query: 164 NP-----QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
           +       W+ L+++IPLRLG   +NPVY + +K               +LS+ Y     
Sbjct: 454 DAPKPDQHWKSLIVLIPLRLGSDKLNPVYAHCLK--------------LLLSTEY----- 494

Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTY 278
                    LG+IGGKP H+LYF+G+  + +I LDPH  Q +  V     + E     ++
Sbjct: 495 --------CLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDV-----NQETFPMHSF 541

Query: 279 HCPQASRLHILHMDPSIAV 297
           HC    +L    MDPS  +
Sbjct: 542 HCKSPRKLKSSKMDPSCCI 560


>gi|301772016|ref|XP_002921445.1| PREDICTED: cysteine protease ATG4D-like [Ailuropoda melanoleuca]
          Length = 445

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 156/323 (48%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 79  DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 138

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   S                                   +E  + +I+  F D   AP+
Sbjct: 139 WAEGSAPSPSEPSGLASPNRYRGPARWMPPRWAQGTPELEQERRHRQIVSWFADHPQAPF 198

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 199 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARL 258

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S   
Sbjct: 259 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 300

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 301 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 345

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 346 --SFHCTSPRKMAFAKMDPSCTV 366


>gi|432099562|gb|ELK28703.1| Cysteine protease ATG4D, partial [Myotis davidii]
          Length = 392

 Score =  174 bits (441), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 103/326 (31%), Positives = 156/326 (47%), Gaps = 71/326 (21%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S  D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L R
Sbjct: 26  SEGDIQRFQRDFASRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 85

Query: 68  DWQWNVNS-----------------------------------KEEAYLKILKMFEDRRT 92
           DW W   +                                   +E  + +I+  F D   
Sbjct: 86  DWTWAEGAGLSPPEPSGLASPNRHHGLAHWKPPRWAQGAPELEQEHWHRQIVSWFADHPQ 145

Query: 93  APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQV 151
           AP+ +HQ+   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V
Sbjct: 146 APFGLHQLVELGQSWGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADV 205

Query: 152 KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
            +L     R     +W+ +V+++P+RLG + +NPVY+  +K+              +L S
Sbjct: 206 ARLVA---RPDCTAEWKSVVILVPVRLGGETLNPVYVPCVKE--------------LLRS 248

Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
                           LG++GGKP H+LYFIGY  + +++LDPH  Q    V       E
Sbjct: 249 EL-------------CLGIMGGKPRHSLYFIGYQDDSLLYLDPHYCQPTVDVSQAGFPLE 295

Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
                ++HC    ++    MDPS  V
Sbjct: 296 -----SFHCTSPRKMAFTKMDPSCTV 316


>gi|383860522|ref|XP_003705738.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like
           [Megachile rotundata]
          Length = 518

 Score =  174 bits (441), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 103/310 (33%), Positives = 162/310 (52%), Gaps = 54/310 (17%)

Query: 5   NKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
           + +S +D +E+ ++D TSRLW TYR+ F  +  S  TTD GWGCMLR GQM++AQAL+  
Sbjct: 167 DAISFEDSIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCH 226

Query: 64  HLGRDWQWNVNS---------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
            LGR+W+W  +           E  +  I++ F D   R +P+SIH +   GA  GK  G
Sbjct: 227 FLGREWRWQPDQPIKTEQQKLDESNHRFIIQSFGDLPERISPFSIHTLVSLGALWGKRAG 286

Query: 113 EWFGPNTVAQVLRKLAKYDD-----WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
           +W+GP++VA +L +  ++       +S++  +VA D  + +  V+ +C        + +W
Sbjct: 287 DWYGPSSVAHLLSQAVEHAAEHLPIFSNLAVYVAQDCAVYLQDVESVCQM-----PDGKW 341

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           + L+L +PLRLG   +NPVY + +                            +  T    
Sbjct: 342 KSLILFVPLRLGTDKLNPVYTSCLT---------------------------HLLTLDTC 374

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           +GVIGG+P H+LYFIG+  + +I LDPH  Q      D  +D+     +++HC    ++ 
Sbjct: 375 IGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE---TVDVLKDNFPL--TSFHCTSPRKML 429

Query: 288 ILHMDPSIAV 297
           I  MDPS  V
Sbjct: 430 ISKMDPSCCV 439


>gi|149642765|ref|NP_001092616.1| cysteine protease ATG4D [Bos taurus]
 gi|148744285|gb|AAI42400.1| ATG4D protein [Bos taurus]
          Length = 472

 Score =  174 bits (440), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 101/322 (31%), Positives = 155/322 (48%), Gaps = 70/322 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 166

Query: 71  WNVNS----------------------------------KEEAYLKILKMFEDRRTAPYS 96
           W   +                                  +E  + +I+  F D   AP+ 
Sbjct: 167 WCQGAGLGPSEPPGLGSPSRRRGPARWLPPRWAQAPELEQERRHRQIVSWFADHPRAPFG 226

Query: 97  IHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
           +H++   G   GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L 
Sbjct: 227 LHRLVELGQGSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLV 286

Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
               R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S    
Sbjct: 287 A---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL-- 327

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
                       LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E    
Sbjct: 328 -----------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE---- 372

Query: 276 STYHCPQASRLHILHMDPSIAV 297
            ++HC    R+    MDPS  V
Sbjct: 373 -SFHCTSPRRMAFAKMDPSCTV 393


>gi|355669960|gb|AER94694.1| ATG4 autophagy related 4-like protein D [Mustela putorius furo]
          Length = 388

 Score =  174 bits (440), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 156/323 (48%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 74  DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 133

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   S                                   +E  + +I+  F D   AP+
Sbjct: 134 WAEGSGLGPSEPSGLASPNRYRGPARWVPPRWAHGTPELEQERRHRQIVSWFADHPRAPF 193

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 194 GLHRLGGLGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 253

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S   
Sbjct: 254 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 295

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 296 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVTQADFPLE--- 340

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 341 --SFHCTSPRKMAFAKMDPSCTV 361


>gi|27903825|ref|NP_116274.3| cysteine protease ATG4D [Homo sapiens]
 gi|61211809|sp|Q86TL0.1|ATG4D_HUMAN RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
           cysteine endopeptidase; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related cysteine endopeptidase
           4; AltName: Full=Autophagy-related protein 4 homolog D
 gi|27763975|emb|CAC85951.1| APG4-D protein [Homo sapiens]
 gi|46362497|gb|AAH68992.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [Homo sapiens]
 gi|119604524|gb|EAW84118.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_c
           [Homo sapiens]
 gi|312151144|gb|ADQ32084.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
           construct]
          Length = 474

 Score =  174 bits (440), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 154/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 168 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    D + +V +V+ D T+    V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARL 287

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 288 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395


>gi|57101974|ref|XP_542069.1| PREDICTED: cysteine protease ATG4D isoform 1 [Canis lupus
           familiaris]
          Length = 473

 Score =  173 bits (439), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 166

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 167 WAEGPGLGPSEPAGLASPNRYRGPARWMPPRWAQGTPELEQERRHRQIVSWFADHPQAPF 226

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 227 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEITRLVVYVSQDCTVYKADVARL 286

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S   
Sbjct: 287 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 328

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 329 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 373

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 374 --SFHCTSPRKMAFAKMDPSCTV 394


>gi|62898327|dbj|BAD97103.1| APG4 autophagy 4 homolog D variant [Homo sapiens]
          Length = 474

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 153/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 168 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    D + +V +V+ D T+    V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARL 287

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W  +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 288 VA---RPDPTAEWMSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395


>gi|348550913|ref|XP_003461275.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like [Cavia
           porcellus]
          Length = 474

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 103/325 (31%), Positives = 156/325 (48%), Gaps = 75/325 (23%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWM 166

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 167 WAEGPGLGSPELPGTASPSPGRSPARWVPPRWPRGAPELEQELRHRQIVSWFADHPRAPF 226

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +  +V+ D T+    V  L
Sbjct: 227 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESSSEVTRLAVYVSQDCTVYKADVAHL 286

Query: 155 CTTNKRASSNP--QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
                 AS +P  +W+ +V+++P+RLG + +NPVY+ G+K+                   
Sbjct: 287 V-----ASRDPTAEWKSVVILVPVRLGGETLNPVYVPGVKELL----------------- 324

Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEK 272
                 R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E 
Sbjct: 325 ------RSELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE- 373

Query: 273 KLDSTYHCPQASRLHILHMDPSIAV 297
               ++HC    ++    MDPS  V
Sbjct: 374 ----SFHCTSPRKMAFAKMDPSCTV 394


>gi|444730159|gb|ELW70550.1| Cysteine protease ATG4A [Tupaia chinensis]
          Length = 364

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 104/306 (33%), Positives = 154/306 (50%), Gaps = 74/306 (24%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           D    +W   ++  +  G +G ++D GWGCMLRCGQM++AQAL+  HLGR          
Sbjct: 29  DTDELVWILGKQHLLKTGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGR---------- 78

Query: 78  EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
                                  A  G  EGK++GEWFGPNTVAQVL+KLA +D+W+S+ 
Sbjct: 79  -----------------------AQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLA 115

Query: 138 FHVALDNTLVVNQVKKLCT-----------------TNKRASSNPQ-----WQPLVLVIP 175
            +V++DNT+V+  +KK+C                  T    S  P      W+PL+L++P
Sbjct: 116 VYVSMDNTVVIEDIKKMCCVLPLSADTDTESPPDSPTASNQSKGPSACGSAWKPLLLIVP 175

Query: 176 LRLGIQDINPVYINGIK---KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           LRLGI  INPVY++  K    C+     P+  + K       +  P+       S G   
Sbjct: 176 LRLGINQINPVYVDAFKLQASCH-----PILIVTKEGVRRTRILPPK------DSSGARA 224

Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHM 291
            +     +     G+++IFLDPHT Q      D E++     D T+HC Q+  R++IL++
Sbjct: 225 SESLKVKHVSFKTGDELIFLDPHTTQTF---VDTEENGMVD-DQTFHCLQSPQRMNILNL 280

Query: 292 DPSIAV 297
           DPS+A+
Sbjct: 281 DPSVAL 286


>gi|402904206|ref|XP_003914938.1| PREDICTED: cysteine protease ATG4D isoform 1 [Papio anubis]
          Length = 474

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   +E  + +I+  F D   AP+
Sbjct: 168 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 287

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 288 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395


>gi|402904208|ref|XP_003914939.1| PREDICTED: cysteine protease ATG4D isoform 2 [Papio anubis]
          Length = 411

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 45  DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 104

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   +E  + +I+  F D   AP+
Sbjct: 105 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 164

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 165 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 224

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 225 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 262

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 263 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 311

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 312 --SFHCTSPRKMAFAKMDPSCTV 332


>gi|22658287|gb|AAH30861.1| Autophagy-related 4D (yeast) [Mus musculus]
 gi|74152222|dbj|BAE32395.1| unnamed protein product [Mus musculus]
          Length = 474

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 104/323 (32%), Positives = 160/323 (49%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D++Q +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW+
Sbjct: 108 DIQQFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 167

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   ++  + +I+  F D   AP+
Sbjct: 168 WVEGTGLASSEMPGPASPSRCRGPGRRGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + S +V +V+ D T+    V +L
Sbjct: 228 GLHRLVELGRSSGKKAGDWYGPSVVAHILRKAVESCSEVSRLVVYVSQDCTVYKADVARL 287

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
            +     +   +W+ +V+++P+RLG + +NPVY+  +K              ++L S   
Sbjct: 288 LSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 329

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG++GGKP H+LYFIGY  + +++LDPH  Q      D  Q S   L
Sbjct: 330 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQPS-FPL 373

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
           +S +HC    ++    MDPS  V
Sbjct: 374 ES-FHCTSPRKMAFAKMDPSCTV 395


>gi|340722130|ref|XP_003399462.1| PREDICTED: cysteine protease ATG4D-like [Bombus terrestris]
          Length = 485

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 103/310 (33%), Positives = 162/310 (52%), Gaps = 54/310 (17%)

Query: 5   NKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
           + +S +D +E+ ++D TSRLW TYR+ F  +  S  T+D GWGCMLR GQM++AQAL+  
Sbjct: 133 DAISFEDSIEEFKKDFTSRLWLTYRREFPILNGSTFTSDCGWGCMLRSGQMMLAQALVCH 192

Query: 64  HLGRDWQWNVNS---------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
            LGR+W+W V+           E  +  I+K F D    T+P+SIH +   GA  GK  G
Sbjct: 193 FLGREWRWQVDQPLKTEQQKLDEYNHRLIIKSFGDLPDSTSPFSIHTLVSLGALSGKRAG 252

Query: 113 EWFGPNTVAQVLRKLA-----KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
           +W+GP++VA +L +       ++  +S++  +VA D  + +  V+ +C        + +W
Sbjct: 253 DWYGPSSVAHLLSQAVEQAAERHPVFSNLAVYVAQDCAVYLQDVENVCQM-----PDGKW 307

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           + L+L +PLRLG   +N VY + +                            +  T    
Sbjct: 308 KSLILFVPLRLGADKLNLVYASCLT---------------------------HLLTLNTC 340

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           +GVIGG+P H+LYFIG+  + +I LDPH  Q      D  +D+     +++HC    ++ 
Sbjct: 341 IGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE---TVDVLKDNFPL--TSFHCTSPRKML 395

Query: 288 ILHMDPSIAV 297
           I  MDPS  V
Sbjct: 396 ISKMDPSCCV 405


>gi|431918972|gb|ELK17839.1| Cysteine protease ATG4D [Pteropus alecto]
          Length = 442

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 154/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 76  DIQRFQRDFVSRLWLTYRRDFPPLAGGYLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWM 135

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 136 WVKGVGLDPPEPSRLASPYWHHGPACWIPPHWTQGSPELEQERRHRQIVSWFADHPKAPF 195

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +HQ+   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 196 GLHQLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARL 255

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S   
Sbjct: 256 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 297

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG++GGKP H+LYFIGY  + +++LDPH  Q    V       E   
Sbjct: 298 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE--- 342

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 343 --SFHCTSPRKMAFTKMDPSCTV 363


>gi|296485832|tpg|DAA27947.1| TPA: APG4 autophagy 4 homolog D [Bos taurus]
          Length = 472

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 100/322 (31%), Positives = 155/322 (48%), Gaps = 70/322 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 166

Query: 71  WNVNS----------------------------------KEEAYLKILKMFEDRRTAPYS 96
           W   +                                  +E  + +I+  F D   AP+ 
Sbjct: 167 WCQGAGLGPSEPPGLGSPSRRRGPARWLPPRWAQAPELEQERRHRQIVSWFADHPRAPFG 226

Query: 97  IHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
           +H++   G   GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L 
Sbjct: 227 LHRLVELGQGSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLV 286

Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
               R     +W+ +V+++P+RLG + +NPVY+  +K              ++L S    
Sbjct: 287 A---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL-- 327

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
                       LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E    
Sbjct: 328 -----------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE---- 372

Query: 276 STYHCPQASRLHILHMDPSIAV 297
            ++HC    ++    MDPS  V
Sbjct: 373 -SFHCTSPRKMAFAKMDPSCTV 393


>gi|109123366|ref|XP_001101860.1| PREDICTED: cysteine protease ATG4D-like isoform 1 [Macaca mulatta]
          Length = 474

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   +E  + +I+  F D   AP+
Sbjct: 168 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 287

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 288 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395


>gi|297276108|ref|XP_002801111.1| PREDICTED: cysteine protease ATG4D-like isoform 2 [Macaca mulatta]
          Length = 497

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 131 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 190

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   +E  + +I+  F D   AP+
Sbjct: 191 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 250

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 251 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 310

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 311 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 348

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 349 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 397

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 398 --SFHCTSPRKMAFAKMDPSCTV 418


>gi|410226434|gb|JAA10436.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
 gi|410263516|gb|JAA19724.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
 gi|410294648|gb|JAA25924.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
 gi|410328737|gb|JAA33315.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
          Length = 474

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 101/323 (31%), Positives = 154/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 168 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 287

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 288 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395


>gi|114675367|ref|XP_512373.2| PREDICTED: cysteine protease ATG4D [Pan troglodytes]
          Length = 411

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 101/323 (31%), Positives = 154/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 45  DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 104

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 105 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 164

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 165 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 224

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 225 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 262

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 263 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 311

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 312 --SFHCTSPRKMAFAKMDPSCTV 332


>gi|347971093|ref|XP_554420.4| AGAP004023-PA [Anopheles gambiae str. PEST]
 gi|333469628|gb|EAL39379.4| AGAP004023-PA [Anopheles gambiae str. PEST]
          Length = 606

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 103/339 (30%), Positives = 164/339 (48%), Gaps = 80/339 (23%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
           +  + ++  RRD  SR+W TYR+ F  + DS  T+D GWGCM+R GQM++AQ L+   LG
Sbjct: 191 VEEEGIDAFRRDFISRIWMTYRREFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLVAHFLG 250

Query: 67  RDWQWNVNS----KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
           R W+W+V+     +E  + K+++ F D   +T+P+SIH +   G   GK  G+W+GP  V
Sbjct: 251 RSWRWDVSMFTAYEESIHRKVIRWFGDTSSKTSPFSIHTLVALGKESGKKPGDWYGPGAV 310

Query: 121 AQVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCT------------------T 157
           A +LR+  +       D   I  +VA D  + +  +   CT                  T
Sbjct: 311 AHLLRQAVRLAAQEITDLDGINVYVAQDCAVYIQDILDECTVPATPAGAPWQRKGAPGGT 370

Query: 158 NKRASS-------------------NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
           N  +S+                   +  W+ L+L++PLRLG   +NP+Y   +K      
Sbjct: 371 NSSSSTAHTERSGATSCAEGDEDVQSAHWKSLILLVPLRLGTDKLNPIYNECLK------ 424

Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
                    +LS  Y              +G+IGG+P H+LYF+GY  + +I LDPH  Q
Sbjct: 425 --------AMLSLDY-------------CIGIIGGRPKHSLYFVGYQEDKLIHLDPHYCQ 463

Query: 259 NIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++    D  QD+     +++HC    ++ +  MDPS  +
Sbjct: 464 DM---VDVNQDNFPV--ASFHCKSPRKMKLSKMDPSCCI 497


>gi|380796527|gb|AFE70139.1| cysteine protease ATG4D, partial [Macaca mulatta]
          Length = 439

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 73  DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 132

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   +E  + +I+  F D   AP+
Sbjct: 133 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 192

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 193 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 252

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 253 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 290

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 291 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 339

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 340 --SFHCTSPRKMAFAKMDPSCTV 360


>gi|351710014|gb|EHB12933.1| Cysteine protease ATG4D [Heterocephalus glaber]
          Length = 607

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 157/323 (48%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 240 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWM 299

Query: 71  W-------------NVNS----------------------KEEAYLKILKMFEDRRTAPY 95
           W             + +S                      +E  + +I+  F D   AP 
Sbjct: 300 WIEGPGLAHPELPGSASSSQGRGPARWMPPSCPWGALEREQELRHRQIVSWFADHPRAPL 359

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +  +V+ D T+    V  L
Sbjct: 360 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESSSELTHLAVYVSQDCTVYKADVAHL 419

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
             +   A+   +W+ +V+++P+RLG + +NPVY+ G+K+                     
Sbjct: 420 VASPDPAA---EWKSVVILVPVRLGGETLNPVYVPGVKELL------------------- 457

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 458 ----RSELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFSLE--- 506

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 507 --SFHCTSPRKMAFAKMDPSCTV 527



 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 32/59 (54%), Positives = 41/59 (69%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L R W
Sbjct: 154 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGW 212


>gi|397476492|ref|XP_003809633.1| PREDICTED: cysteine protease ATG4D isoform 2 [Pan paniscus]
          Length = 411

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 100/323 (30%), Positives = 154/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 45  DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 104

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 105 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 164

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 165 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 224

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +++++P+RLG + +NPVY+  +K+                     
Sbjct: 225 VA---RPDPTAEWKSVIILVPVRLGGETLNPVYVPCVKELL------------------- 262

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 263 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 311

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 312 --SFHCTSPRKMAFAKMDPSCTV 332


>gi|440798079|gb|ELR19150.1| cysteine protease, putative [Acanthamoeba castellanii str. Neff]
          Length = 434

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 99/291 (34%), Positives = 146/291 (50%), Gaps = 45/291 (15%)

Query: 21  SRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAY 80
           S +W TYR  F  +G    T+D GWGCMLR GQMV+AQ L    LG +W+   +     Y
Sbjct: 116 SVVWCTYRAAFPRLGSDSYTSDMGWGCMLRTGQMVLAQTLTRHLLGTEWRRQSDRSSPLY 175

Query: 81  LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHV 140
            K+++ F D    P+S+H+IA  G   GK VGEWFGP+T+AQVL +L K    S +  +V
Sbjct: 176 AKMVQWFADDPKQPFSLHRIAHAGLKYGKNVGEWFGPSTMAQVLEELLKEFSPSGLRAYV 235

Query: 141 ALDNTLVVNQVKKLCTT-------NKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
             D  L ++Q+++  T        +        W P+++++PLRLG+  +N  Y   +K+
Sbjct: 236 CQDGCLYLDQLRRTATAAHWPLDEDDDEGQGKSWAPMLIMLPLRLGLDQLNEDYAPVLKE 295

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                                       F  PQS+G+ GGKP  +LYF+G   + V +LD
Sbjct: 296 T---------------------------FRIPQSVGISGGKPRASLYFVGNQDDYVFYLD 328

Query: 254 PHTNQ------NIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
           PHT Q       +G V      + + +  T+HC    RL I  +DPS+ + 
Sbjct: 329 PHTVQPAPRFPEVGDV-----PASEDVYDTFHCSAPLRLPIRDIDPSLCLA 374


>gi|397476490|ref|XP_003809632.1| PREDICTED: cysteine protease ATG4D isoform 1 [Pan paniscus]
          Length = 474

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 100/323 (30%), Positives = 154/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 168 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 287

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +++++P+RLG + +NPVY+  +K+                     
Sbjct: 288 VA---RPDPTAEWKSVIILVPVRLGGETLNPVYVPCVKELL------------------- 325

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395


>gi|195350257|ref|XP_002041657.1| GM16788 [Drosophila sechellia]
 gi|194123430|gb|EDW45473.1| GM16788 [Drosophila sechellia]
          Length = 269

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 95/203 (46%), Positives = 125/203 (61%), Gaps = 33/203 (16%)

Query: 95  YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKL 154
           YSIHQIA  G S+ KAVGEW GPNTVAQ+L+KL ++DDWSS+  HVA+D+T+V++ V   
Sbjct: 4   YSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSSLAIHVAMDSTVVLDDVYAS 63

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
           C           W+PL+L+IPLRLGI DINP+Y+  +K+C  L                 
Sbjct: 64  CREGG------SWKPLLLIIPLRLGITDINPLYVPALKRCLEL----------------- 100

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                       S G+IGG+PN ALYF+GYV ++V++LDPHT Q  G V  K   +E+  
Sbjct: 101 ----------DSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDY 150

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
           D TYH   A+RL+   MDPS+AV
Sbjct: 151 DETYHQKHAARLNFSAMDPSLAV 173


>gi|354475125|ref|XP_003499780.1| PREDICTED: cysteine protease ATG4D [Cricetulus griseus]
 gi|344240088|gb|EGV96191.1| Cysteine protease ATG4D [Cricetulus griseus]
          Length = 474

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 99/323 (30%), Positives = 157/323 (48%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 167

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   ++  + +I+  F D   AP+
Sbjct: 168 WVEGTGLAPPEMPGPASPSRYRGPGRHVPPRWTQGTLEMEQDRWHRRIVSWFADHPQAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK + K  +   +V +V+ D T+    V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSVVAHILRKAVEKCSEVPRLVVYVSQDCTVYKADVARL 287

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
            +     +   +W+ +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 288 VSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E   
Sbjct: 326 ----RSELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  +
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTI 395


>gi|322785465|gb|EFZ12136.1| hypothetical protein SINV_15051 [Solenopsis invicta]
          Length = 505

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 107/314 (34%), Positives = 168/314 (53%), Gaps = 60/314 (19%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR-DWQ 70
           +E+ +RD  SRLW TYR+ F  +  S  TTD GWGCMLR GQM++AQAL+   LGR  W+
Sbjct: 129 IEEFKRDFMSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRGQWR 188

Query: 71  WNVN--SKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
           W     + E ++  I+K F D+ T  +P+SIH++ + GAS GK  G+W+GP++VA +L +
Sbjct: 189 WRPEQLTDESSHRMIIKWFGDQLTPESPFSIHKLVVLGASTGKRAGDWYGPSSVAHLLCQ 248

Query: 127 LAKY--DDWSS----IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI 180
             +   +D +S    +  +VA D  + +  V+ +C T      + + + L+L++PLRLG 
Sbjct: 249 AMERASEDPNSKLNQLAVYVAQDCAVYMQDVENVCCT-----PDGRRKALILLVPLRLGA 303

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
             +NPVY                     L++   + T          +GVIGG+P H+LY
Sbjct: 304 DKLNPVY------------------APCLTALLTLDT---------CIGVIGGRPRHSLY 336

Query: 241 FIGYVGNDVIFLDPHTNQN-------------IGCVYDKE----QDSEKKLDSTYHCPQA 283
           FIGY  + +I LDPH  QN             +  ++ +E    + +EK   +++HC   
Sbjct: 337 FIGYQDDKLIHLDPHYCQNEFYFRILLSITDSLPYLFIQETVDVEGNEKFPLTSFHCTSP 396

Query: 284 SRLHILHMDPSIAV 297
            ++ +  MDPS  V
Sbjct: 397 RKMLLSKMDPSCCV 410


>gi|26349259|dbj|BAC38269.1| unnamed protein product [Mus musculus]
          Length = 474

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 160/323 (49%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 167

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   ++  + +I+  F D   AP+
Sbjct: 168 WVEGTGLASSEMPGPASPSRCRGPGRRGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + S +V +V+ D T+    V +L
Sbjct: 228 GLHRLVELGRSSGKKAGDWYGPSVVAHILRKAVESCSEVSRLVVYVSQDCTVYKADVARL 287

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
            +     +   +W+ +V+++P+RLG + +NPVY+  +K              ++L S   
Sbjct: 288 LSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 329

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG++GGKP H+LYFIGY  + +++LDPH  Q      D  Q S   L
Sbjct: 330 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQPS-FPL 373

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
           +S +HC    ++    MDPS  V
Sbjct: 374 ES-FHCTSPRKMAFAKMDPSCTV 395


>gi|29135261|ref|NP_705811.8| cysteine protease ATG4D [Mus musculus]
 gi|61211815|sp|Q8BGV9.1|ATG4D_MOUSE RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
           cysteine endopeptidase; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related cysteine endopeptidase
           4; AltName: Full=Autophagy-related protein 4 homolog D
 gi|26331508|dbj|BAC29484.1| unnamed protein product [Mus musculus]
 gi|26348941|dbj|BAC38110.1| unnamed protein product [Mus musculus]
 gi|27763977|emb|CAC85952.1| APG4-D protein [Mus musculus]
 gi|47125055|gb|AAH69851.1| Autophagy-related 4D (yeast) [Mus musculus]
 gi|148693226|gb|EDL25173.1| autophagy-related 4D (yeast), isoform CRA_b [Mus musculus]
          Length = 474

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 160/323 (49%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 167

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   ++  + +I+  F D   AP+
Sbjct: 168 WVEGTGLASSEMPGPASPSRCRGPGRRGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + S +V +V+ D T+    V +L
Sbjct: 228 GLHRLVELGRSSGKKAGDWYGPSVVAHILRKAVESCSEVSRLVVYVSQDCTVYKADVARL 287

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
            +     +   +W+ +V+++P+RLG + +NPVY+  +K              ++L S   
Sbjct: 288 LSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 329

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG++GGKP H+LYFIGY  + +++LDPH  Q      D  Q S   L
Sbjct: 330 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQPS-FPL 373

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
           +S +HC    ++    MDPS  V
Sbjct: 374 ES-FHCTSPRKMAFAKMDPSCTV 395


>gi|296232881|ref|XP_002761778.1| PREDICTED: cysteine protease ATG4D [Callithrix jacchus]
          Length = 474

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 100/323 (30%), Positives = 154/323 (47%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   +E  + +I+  F D   AP+
Sbjct: 168 WAEGTGLGPPELSGPASPSWYHGPARWMPPCWAQGAPELEQERRHRQIVSWFADHPQAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + ++ +V+ D T+    V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESSSEVTRLLVYVSQDCTVYKADVARL 287

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R   + +W  +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 288 VA---RPDPSAEWNSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V       E   
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE--- 374

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395


>gi|168010849|ref|XP_001758116.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690572|gb|EDQ76938.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 356

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 111/311 (35%), Positives = 157/311 (50%), Gaps = 52/311 (16%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
           LS Q  E+   D TSR+W TYRKGF  +G S LT+D GWGCMLR GQM++AQAL+  +LG
Sbjct: 44  LSVQAFEEFISDFTSRIWITYRKGFECVGQSKLTSDVGWGCMLRSGQMLLAQALVCHYLG 103

Query: 67  RDWQWNVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
           R W+        +AYL+IL+ F D  + P+SIH +   G   G A G W GP  + + L 
Sbjct: 104 RSWRREPGQPCSQAYLQILQTFGDSESCPFSIHNLLEAGHPFGLAAGSWLGPYALCRTLE 163

Query: 126 KLAKYDDWSS-----------IVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQ 166
            LA+ D   S            V+ V+ +          L V  V  LC+  +  +   +
Sbjct: 164 ALARADREQSQKKGGKRALPFAVYVVSGEAEGERGGAPVLCVEDVATLCSKWREPTE--E 221

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W PL++++PL LG+  +NP Y+  +                           R  FTFPQ
Sbjct: 222 WTPLLVLVPLVLGLDKVNPRYLPSL---------------------------RATFTFPQ 254

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
           SLG+ GGKP  + Y IG      ++LDPH NQ +  V  +  + +    S+YHC    RL
Sbjct: 255 SLGIAGGKPGASTYLIGVQDEQAMYLDPHENQQVVPVTPENLELDT---SSYHCSTVRRL 311

Query: 287 HILHMDPSIAV 297
            +  +DPS+A+
Sbjct: 312 PLDTIDPSLAI 322


>gi|428170513|gb|EKX39437.1| hypothetical protein GUITHDRAFT_143439 [Guillardia theta CCMP2712]
          Length = 332

 Score =  170 bits (431), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 103/266 (38%), Positives = 131/266 (49%), Gaps = 48/266 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           +  + D+ SRLWF+YR  F PI  + LTTD GWGCM+R GQM+I QAL+  HLGRDW+ +
Sbjct: 102 QNFKLDMWSRLWFSYRYNFHPISGTELTTDTGWGCMIRSGQMLIGQALVHHHLGRDWRLS 161

Query: 73  VNSK----EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
             SK       Y K+L+MF D   AP SIH     G   GK  G WFGPNTV     KL 
Sbjct: 162 HTSKYNELPSDYRKVLEMFLDHPCAPLSIHSFVRAGQQVGKKAGTWFGPNTVCSAFSKLH 221

Query: 129 KYDDWSSIVFHVAL-------DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQ 181
                 S      L       DNT+  ++  +L           Q  PL +++P RLG+ 
Sbjct: 222 AGGALGSDNNLQLLAYDGNDGDNTIYKSEALELL----------QAGPLFILLPTRLGVS 271

Query: 182 DINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYF 241
            ++P YI  I                              F+FPQSLG IGGKP+ A YF
Sbjct: 272 SVDPSYIPKISHV---------------------------FSFPQSLGFIGGKPSSAHYF 304

Query: 242 IGYVGNDVIFLDPHTNQNIGCVYDKE 267
           I   G  V +LDPHT Q +  + +KE
Sbjct: 305 IASQGEAVYYLDPHTPQPLINISEKE 330


>gi|198438023|ref|XP_002129793.1| PREDICTED: similar to CG6194 CG6194-PA [Ciona intestinalis]
          Length = 517

 Score =  170 bits (431), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 98/310 (31%), Positives = 158/310 (50%), Gaps = 50/310 (16%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN--- 74
           D  S+LWFTYRKGF  + D+ LT+D GWGCMLR  QM+IAQ+ +   LGR+W+W  +   
Sbjct: 115 DFHSKLWFTYRKGFPTLNDTNLTSDTGWGCMLRTAQMMIAQSFIVHLLGRNWRWTPSRLS 174

Query: 75  -SKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---- 127
             + + +  I+  F D +    P+S+HQ+   G S     G W+GPNT A +++      
Sbjct: 175 MEQSDIHRNIITWFLDEQNIRCPFSLHQLTEIGLSYRCKPGNWYGPNTAAYIMQDALECA 234

Query: 128 -AKYDDWSSIVFHVALDNTLVVNQVKKLCT-------TNKRASSNPQWQPLVLVIPLRLG 179
             K +  ++I+ ++A D+T+ ++ V ++C         + + S+    + ++++IP+RLG
Sbjct: 235 KGKTELLNNIMVYIAQDSTVYIDDVIEMCEWKNTASDADLKTSTTSSNRSVIVLIPVRLG 294

Query: 180 IQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHAL 239
              +NP+YI  I+                              T  QS+G++GGKP H+L
Sbjct: 295 EATLNPIYIPCIQSM---------------------------LTLDQSVGIMGGKPKHSL 327

Query: 240 YFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-V 298
           YFIG+    + +LDPH  Q      D     +  L   YHC    + +I  MDPS  +  
Sbjct: 328 YFIGFQDEYLFYLDPHYCQQA----DHPAAFKNDLLQNYHCNSPRKTNISKMDPSCCLGF 383

Query: 299 SQRSYSDYKN 308
             R Y D+++
Sbjct: 384 YCRDYKDFQS 393


>gi|443730776|gb|ELU16134.1| hypothetical protein CAPTEDRAFT_228011 [Capitella teleta]
          Length = 450

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 101/297 (34%), Positives = 158/297 (53%), Gaps = 51/297 (17%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW-- 69
            ++ +RD +S++WFTYRK F  +  S LT+D GWGCMLR  QM+IAQAL+  +LGRDW  
Sbjct: 114 FDRFKRDFSSKIWFTYRKDFPKLYGSPLTSDVGWGCMLRTAQMIIAQALVMHYLGRDWTI 173

Query: 70  -QWNVNSKEEA-YLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
                N KE   + +I+++F D     +P+SI  +   G   GK  G+W+GP +VA V+R
Sbjct: 174 HHTQQNRKETMLHRQIIRLFGDFPGNDSPFSIQALVRIGVDHGKRPGDWYGPASVAYVVR 233

Query: 126 K-LAKYDDW----SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI 180
             + +  D+    S +  +VA D T+ +  V  LCT +        W+ +V+++P+RLG 
Sbjct: 234 DAINQVPDFHPLLSQVCVYVAPDCTVYIQDVIDLCTQH--------WKAVVILVPVRLGG 285

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
           + +NP+Y   ++   A  +                            LG+IGG+P H+LY
Sbjct: 286 EALNPIYSQCVQSLLAHELC---------------------------LGIIGGRPKHSLY 318

Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           F+G+    +++LDPH  Q+   V  + +D      STYHC    +L +  MDPS  +
Sbjct: 319 FVGWQEEKLLYLDPHFCQDT--VDTRFRDFPT---STYHCLSPRKLALQKMDPSCTL 370


>gi|432871194|ref|XP_004071879.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
          Length = 452

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 99/322 (30%), Positives = 153/322 (47%), Gaps = 73/322 (22%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+ RR   S LW TYR+GF  +  S LTTD GWGC+LR GQM++A+ LL   +   W W+
Sbjct: 91  ERFRRSFASLLWLTYRRGFPQLAGSSLTTDSGWGCVLRTGQMLLARGLLTHLMPPGWMWS 150

Query: 73  V------------------------------------NSKEEAYLKILKMFEDRRTAPYS 96
           V                                       E  + K++  F D   AP+ 
Sbjct: 151 VWYRAVKDDLDLPHHADCTDCKSNMRCRYQSLGSLYDRPLEAMHRKVVSWFADHPKAPFG 210

Query: 97  IHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
           IH++   GAS GK  G+W+GP+ VA +L+K +A   D  ++V +VA D T+ +  V+ LC
Sbjct: 211 IHRLVELGASSGKKAGDWYGPSIVAHILQKAVAASVDLPNLVVYVAQDCTIYLQDVRGLC 270

Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
                 S    W+ +++++P+RLG QD+NP YI+ +KK   L                  
Sbjct: 271 ERPPPHS----WKSVIILVPVRLGGQDLNPSYISCVKKLLELQC---------------- 310

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
                       +G+IGG+P H+L+F+G+  + +++LDPH  Q    V  +    E    
Sbjct: 311 -----------CIGIIGGRPKHSLFFVGFQDDQLLYLDPHYCQLTVNVTKENFPLE---- 355

Query: 276 STYHCPQASRLHILHMDPSIAV 297
            ++HC    ++    MDPS  +
Sbjct: 356 -SFHCKYPRKMPFSRMDPSCTI 376


>gi|195054945|ref|XP_001994383.1| GH16873 [Drosophila grimshawi]
 gi|193892146|gb|EDV91012.1| GH16873 [Drosophila grimshawi]
          Length = 673

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 104/323 (32%), Positives = 162/323 (50%), Gaps = 67/323 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM+IAQ L+   LGR W
Sbjct: 265 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLIAQGLICHFLGRSW 324

Query: 70  QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++  S+      +  + KI+K F D   + +P+SIH +   G   GK  G+W+GP +V+
Sbjct: 325 RYDPESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGKKPGDWYGPASVS 384

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCT--------------TNKRAS 162
            +L+   ++      D+ +I  +VA D T+ +  V++ C+                K  S
Sbjct: 385 YLLKHALEHAAQENADFDNISVYVAQDCTIYMQDVEQQCSIPEPAPKQHVPWQHAKKSTS 444

Query: 163 SNPQ--------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
             P+        W+ L+++IPLRLG   +NPVY +                +K+L ST +
Sbjct: 445 DAPKLDQPPQQHWKSLIVLIPLRLGTDKLNPVYAH---------------CLKLLLSTEH 489

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG+IGGKP H+LYF+G+  + +I LDPH  Q +  V     + E   
Sbjct: 490 ------------CLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDV-----NQETFS 532

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  +
Sbjct: 533 MHSFHCKSPRKIKSSKMDPSCCI 555


>gi|321472016|gb|EFX82987.1| hypothetical protein DAPPUDRAFT_302128 [Daphnia pulex]
          Length = 405

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 104/331 (31%), Positives = 173/331 (52%), Gaps = 59/331 (17%)

Query: 4   ANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
           ++ L   + E ++ D  SR+W TYRK F  +  S  T+D GWGCMLR GQM++AQAL+  
Sbjct: 39  SSSLPTNNFEALKSDFFSRIWLTYRKEFPVLNGSYYTSDCGWGCMLRSGQMLLAQALVCH 98

Query: 64  HLGRDWQWNVNSKEEA-------YLKILKMFEDRRT--APYSIHQIALTG-ASEGKAVGE 113
            LGRDW+WN +  +E        +  I++ F D+ +   P SIHQ+   G  S GK  G+
Sbjct: 99  FLGRDWRWNESGAQEQQTLQESLHRMIVQWFGDKPSPACPLSIHQMVSQGHISAGKRPGD 158

Query: 114 WFGPNTVAQVLRKLAK-----YDDWSSIVFHVALDNTLVVNQVKKLCT--TNKRASS--- 163
           W+GP++V+ +++++ +     Y +  ++  ++A D T+ ++ VK+ C+   N        
Sbjct: 159 WYGPSSVSYIIKQILQRATDTYPELDTLRVYIAQDCTVYLDDVKQSCSKICNYECEETDY 218

Query: 164 ---NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
              + QW+ L+L+IPLRLG + +NP Y + +K   +L                       
Sbjct: 219 ELIDDQWKSLILLIPLRLGGERMNPTYDSCLKGLLSLE---------------------- 256

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
                Q +G+IGGKP H+ YFIG+  + +I LDPH  Q +  V     + +     ++HC
Sbjct: 257 -----QCIGIIGGKPKHSQYFIGWQDDYLIHLDPHNCQEMVDVLIPNFNLK-----SFHC 306

Query: 281 PQASRLHILHMDPSIAV----VSQRSYSDYK 307
            +  +  +  +DPS  V     SQR + +++
Sbjct: 307 HELRKTALKQVDPSCCVGFYLRSQREFDEFR 337


>gi|334326299|ref|XP_001366933.2| PREDICTED: cysteine protease ATG4D [Monodelphis domestica]
          Length = 482

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 96/338 (28%), Positives = 157/338 (46%), Gaps = 86/338 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ ++D  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL     RDW 
Sbjct: 101 DIQRFQKDFASRLWLTYRRDFPPLDGGSLTSDCGWGCMLRSGQMLLAQGLLLHFFSRDWT 160

Query: 71  WNVN--------------------------------------------------SKEEAY 80
           W                                                      +EE +
Sbjct: 161 WAEAVLPPSPRESELFRSMSPSRSGASWQRGSSTASGLGRATWSTGGTLSPRQLEQEEQH 220

Query: 81  LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFH 139
            +I+  F D+  AP+ +H++   G S GK  G+W+GP+ VA +LRK +    + + +  +
Sbjct: 221 RRIVSWFADQPGAPFGLHRLVELGRSSGKRAGDWYGPSVVAHILRKAVESSSEVAQLEVY 280

Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
           V+ D T+    V +L     +   + +W+ +++++P+RLG + +NPVY+  +K+   L +
Sbjct: 281 VSQDCTVYKADVAQLMA---QPDPSTEWKSVIILVPVRLGGETLNPVYVPCVKELLRLDL 337

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
                                       +G+IGGKP H+LYFIGY  + +++LDPH  Q 
Sbjct: 338 ---------------------------CIGIIGGKPRHSLYFIGYQDDFLLYLDPHYCQP 370

Query: 260 IGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
             CV   +   E+    ++HC    ++    MDPS  +
Sbjct: 371 --CV---DTSQERFPLESFHCTSPRKMAFSRMDPSCTI 403


>gi|210032083|ref|NP_001094483.2| autophagy-related 4D [Rattus norvegicus]
 gi|149020504|gb|EDL78309.1| rCG31864, isoform CRA_b [Rattus norvegicus]
          Length = 473

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 104/323 (32%), Positives = 161/323 (49%), Gaps = 72/323 (22%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+  S LT+D GWGCMLR GQM++AQ LL   L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGS-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 166

Query: 71  W----NVNSKEEA-------------------------------YLKILKMFEDRRTAPY 95
           W     + S E                                 + +I+  F D   AP+
Sbjct: 167 WVEGTGLASSEMPGPASPSRYRGPGRRGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPF 226

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 227 GLHRLVELGQSSGKKAGDWYGPSVVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 286

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
            +     +   +W+ +V+++P+RLG + +NPVY+  +K              ++L S   
Sbjct: 287 VSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 328

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG++GGKP H+LYFIGY  + +++LDPH  Q      D  Q +   L
Sbjct: 329 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVNQ-ANFPL 372

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
           +S +HC    ++    MDPS  V
Sbjct: 373 ES-FHCTSPRKMAFAKMDPSCTV 394


>gi|195539710|gb|AAI68141.1| Atg4d protein [Rattus norvegicus]
          Length = 442

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 104/323 (32%), Positives = 161/323 (49%), Gaps = 72/323 (22%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+  S LT+D GWGCMLR GQM++AQ LL   L RDW+
Sbjct: 77  DIQRFQRDFVSRLWLTYRRDFPPLAGS-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 135

Query: 71  W----NVNSKEEA-------------------------------YLKILKMFEDRRTAPY 95
           W     + S E                                 + +I+  F D   AP+
Sbjct: 136 WVEGTGLASSEMPGPASPSRYRGPGRRGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPF 195

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 196 GLHRLVELGQSSGKKAGDWYGPSVVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 255

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
            +     +   +W+ +V+++P+RLG + +NPVY+  +K              ++L S   
Sbjct: 256 VSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 297

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG++GGKP H+LYFIGY  + +++LDPH  Q      D  Q +   L
Sbjct: 298 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVNQ-ANFPL 341

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
           +S +HC    ++    MDPS  V
Sbjct: 342 ES-FHCTSPRKMAFAKMDPSCTV 363


>gi|213390042|gb|ACJ46060.1| autophagy related protein Atg4-like protein [Bombyx mori]
          Length = 355

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 100/314 (31%), Positives = 159/314 (50%), Gaps = 61/314 (19%)

Query: 8   SHQDLEQI--------RRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQA 59
           +HQ +EQI        + D  S++W TYR+ F  +  S  TTD GWGCMLR GQM++AQA
Sbjct: 5   AHQPMEQIYGEGIEGFKSDFVSKIWMTYRREFPTMTGSTFTTDCGWGCMLRSGQMMLAQA 64

Query: 60  LLFLHLGRDWQW-------NVNSKEEAYL--KILKMFEDRRT--APYSIHQIALTGASEG 108
           L+   LGR W+W       N    +E  L  KI+K F D+ +  +P SIHQ+   G + G
Sbjct: 65  LVCHFLGRSWRWLPEKPIQNAREFQEDCLHRKIIKWFGDKSSVNSPLSIHQMVSLGEALG 124

Query: 109 KAVGEWFGPNTVAQVLRKLAKYD-----DWSSIVFHVALDNTLVVNQVKKLCTTNKRASS 163
           K  G+W+GP +VA  L+ L         ++  +  +VA D+T+ +  +  +C     A  
Sbjct: 125 KKPGDWYGPASVAHCLKSLIASASKENYEFDHLEVYVAQDSTVYIQDIYSMCQLLHGA-- 182

Query: 164 NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
              W+ L+L++P++LG +  NP+Y             P   +  +L+  +          
Sbjct: 183 ---WKSLILLVPVKLGTEKFNPIY------------GPC--LTSLLTLDF---------- 215

Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA 283
               +G+IGG+P H+LYF+GY  + +I LDPH  Q +  V+      +     ++HC   
Sbjct: 216 ---CIGIIGGRPKHSLYFVGYQDDKLIHLDPHYCQEMVDVWQPNFSLQ-----SFHCRSP 267

Query: 284 SRLHILHMDPSIAV 297
            ++ +  MDPS  +
Sbjct: 268 RKMPLAKMDPSCCI 281


>gi|30689628|ref|NP_850412.1| cysteine protease ATG4a [Arabidopsis thaliana]
 gi|75160546|sp|Q8S929.1|ATG4A_ARATH RecName: Full=Cysteine protease ATG4a; AltName:
           Full=Autophagy-related protein 4 homolog a;
           Short=AtAPG4a; Short=Protein autophagy 4a
 gi|19912143|dbj|BAB88383.1| autophagy 4a [Arabidopsis thaliana]
 gi|110742303|dbj|BAE99076.1| hypothetical protein [Arabidopsis thaliana]
 gi|330255286|gb|AEC10380.1| cysteine protease ATG4a [Arabidopsis thaliana]
          Length = 467

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 96/303 (31%), Positives = 143/303 (47%), Gaps = 49/303 (16%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           L  ++ D +S++  TYRKGF P  D+  T+D  WGCM+R  QM+ AQALLF  LGR W  
Sbjct: 135 LAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWTK 194

Query: 72  NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA--- 128
                E+ YL+ L+ F D   + +SIH + + GAS G A G W GP  + +    LA   
Sbjct: 195 KSELPEQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLACKK 254

Query: 129 -KYDDWSSIVFHVAL-------------DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
            K  D  +    +A+                L +    K C    +  S  +W P++L++
Sbjct: 255 RKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQS--EWTPIILLV 312

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
           PL LG+  +NP YI  +                              FTFPQS+G++GGK
Sbjct: 313 PLVLGLDSVNPRYIPSLVA---------------------------TFTFPQSVGILGGK 345

Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
           P  + Y +G   +   +LDPH  Q +  V  +  D +    S+YHC     + +  +DPS
Sbjct: 346 PGASTYIVGVQEDKGFYLDPHEVQQVVTVNKETPDVDT---SSYHCNVLRYVPLESLDPS 402

Query: 295 IAV 297
           +A+
Sbjct: 403 LAL 405


>gi|168036750|ref|XP_001770869.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677928|gb|EDQ64393.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 346

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 107/312 (34%), Positives = 158/312 (50%), Gaps = 54/312 (17%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
           +S    E+   D +SR+W TYRKGF  +G+S LT+D GWGCMLR GQ+++AQAL+  +LG
Sbjct: 28  VSAHAFEEFLNDFSSRIWITYRKGFESLGESKLTSDVGWGCMLRSGQILLAQALVCHYLG 87

Query: 67  RDWQWNVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
           R W+ N   +  + YL+IL+ F D  +  +SIH +   G   G A G W GP  + + L 
Sbjct: 88  RTWRRNACQECLQEYLQILQSFGDSESCSFSIHNLLEAGRPFGLAAGSWLGPYALCRTLE 147

Query: 126 KLAKYDDWSSI-----------VFHVALDN--------TLVVNQVKKLCTTNKRASSNPQ 166
            LAK D+  +            V+ V+ +            V     LC+  K   +  +
Sbjct: 148 ALAKADEDQNAKKGGKRALPFAVYVVSGETEGDRGGAPVRCVEDAAVLCS--KWGEATEE 205

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W PLV+++PL LG+  +NP Y+  +                           R  FT PQ
Sbjct: 206 WSPLVVLVPLVLGLDKLNPRYLPSL---------------------------RATFTLPQ 238

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDST-YHCPQASR 285
           SLGV GGKP  + + IG  G+  ++LDPH NQ +  V  +  +    LD++ YHC    R
Sbjct: 239 SLGVAGGKPGASTHLIGVQGDQAMYLDPHENQQVFAVTPENLE----LDTSFYHCSVVRR 294

Query: 286 LHILHMDPSIAV 297
           L +  +DPS+A+
Sbjct: 295 LPLDSIDPSLAI 306


>gi|348511374|ref|XP_003443219.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
          Length = 459

 Score =  163 bits (413), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 152/328 (46%), Gaps = 82/328 (25%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           +  RR   S LWFTYR+GF P+  S LTTD GWGC+LR  QM++AQ LL   +   W W+
Sbjct: 96  QHFRRCFASLLWFTYRRGFRPLPGSSLTTDSGWGCVLRSSQMLLAQGLLLHLMSPGWTWS 155

Query: 73  VNSK---------------------------------------EEAYLKILKMFEDRRTA 93
            N +                                       E    +IL+ F D  TA
Sbjct: 156 GNQRVVKDDMDLIHSVNDGFSSSERESKRSRHLSWGSILDRPTEGTPRRILRWFADNPTA 215

Query: 94  PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL--AKYDDWSSIVFHVALDNTLVVNQV 151
           P+ IH++   G S GK  G+W+GP+  A +LRK   A   D  ++V +VA D T+ +  V
Sbjct: 216 PFGIHRLVELGKSSGKKAGDWYGPSIAAHILRKAVEASVVDLPNLVAYVAQDCTIYLQDV 275

Query: 152 KKLCTTNKRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILS 210
           +KLC         PQ W+ +++++P+RLG QD+NP YI  +KK   L             
Sbjct: 276 RKLC-----ERPLPQHWKSVLILVPVRLGGQDLNPSYITSVKKLLMLEC----------- 319

Query: 211 STYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDS 270
                            +G+IGGKP H+L+F+G+  + +++LDPH  Q          D 
Sbjct: 320 ----------------CIGIIGGKPKHSLFFVGFQDDHLLYLDPHYCQPT-------VDV 356

Query: 271 EKKLD-STYHCPQASRLHILHMDPSIAV 297
            K     ++HC    ++    MDPS  +
Sbjct: 357 TKNFPLESFHCKNPRKMPFSRMDPSCTI 384


>gi|195394658|ref|XP_002055959.1| GJ10670 [Drosophila virilis]
 gi|194142668|gb|EDW59071.1| GJ10670 [Drosophila virilis]
          Length = 672

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 105/323 (32%), Positives = 165/323 (51%), Gaps = 67/323 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM++AQ L+   LGR W
Sbjct: 264 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 323

Query: 70  QWNVNS------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++  S      ++  + KI+K F D   +++P+SIH +   G   GK  G+W+GP +V+
Sbjct: 324 RYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGEQLGKKPGDWYGPASVS 383

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLC---------------TTNKRA 161
            +L+   ++      D+ +I  +VA D T+ +  +++ C               T+ K A
Sbjct: 384 YLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEEQCSIPEPAPKPHVPWQMTSKKPA 443

Query: 162 SSNPQ-------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
           S  P+       W+ L+++IPLRLG   +NPVY +                +K+L ST +
Sbjct: 444 SDAPKLDQPQQHWKSLIVLIPLRLGTDKLNPVYAH---------------CLKLLLSTEH 488

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG+IGGKP H+LYF+G+  + +I LDPH  Q +  V     + E   
Sbjct: 489 ------------CLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDV-----NQETFS 531

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    +L    MDPS  +
Sbjct: 532 MQSFHCKSPRKLKSSKMDPSCCI 554


>gi|42571227|ref|NP_973687.1| cysteine protease ATG4a [Arabidopsis thaliana]
 gi|330255287|gb|AEC10381.1| cysteine protease ATG4a [Arabidopsis thaliana]
          Length = 422

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 96/303 (31%), Positives = 143/303 (47%), Gaps = 49/303 (16%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           L  ++ D +S++  TYRKGF P  D+  T+D  WGCM+R  QM+ AQALLF  LGR W  
Sbjct: 90  LAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWTK 149

Query: 72  NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA--- 128
                E+ YL+ L+ F D   + +SIH + + GAS G A G W GP  + +    LA   
Sbjct: 150 KSELPEQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLACKK 209

Query: 129 -KYDDWSSIVFHVAL-------------DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
            K  D  +    +A+                L +    K C    +  S  +W P++L++
Sbjct: 210 RKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQS--EWTPIILLV 267

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
           PL LG+  +NP YI  +                              FTFPQS+G++GGK
Sbjct: 268 PLVLGLDSVNPRYIPSLVA---------------------------TFTFPQSVGILGGK 300

Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
           P  + Y +G   +   +LDPH  Q +  V  +  D +    S+YHC     + +  +DPS
Sbjct: 301 PGASTYIVGVQEDKGFYLDPHEVQQVVTVNKETPDVDT---SSYHCNVLRYVPLESLDPS 357

Query: 295 IAV 297
           +A+
Sbjct: 358 LAL 360


>gi|357612380|gb|EHJ67950.1| autophagy related protein Atg4-like protein [Danaus plexippus]
          Length = 354

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 96/303 (31%), Positives = 151/303 (49%), Gaps = 52/303 (17%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  + D  S++W TYR+ F  +  S  TTD GWGCMLR GQM++AQAL+   LGR W
Sbjct: 7   EGIEGFKSDFISKIWMTYRREFPTMSGSSFTTDCGWGCMLRSGQMMLAQALVCHFLGRSW 66

Query: 70  QW------NVNSKEEAYLK--ILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNT 119
           +W      N    +E  L   I+K F D+ +  +P SIHQ+   G + GK  G+W+GP +
Sbjct: 67  RWSEKPIQNGREFQEDCLHRMIIKWFGDKSSVNSPLSIHQMVTLGEALGKKPGDWYGPAS 126

Query: 120 VAQVLRKL-----AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
           VA  L+ +      +  ++  +  +VA D+T+ +  V   C        N  W+ L+L++
Sbjct: 127 VAHCLKSVMVEASKENYEFDKLEVYVAQDSTIYIQDVYTHCRL-----PNGCWKSLILLV 181

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
           P++LG + +NP+Y   +     L                              +G+IGG+
Sbjct: 182 PVKLGTERLNPIYGPCLTSLLTLDFC---------------------------IGIIGGR 214

Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
           P H+LYF+GY  + +I LDPH  Q +  V+      +     T+HC    ++ I  MDPS
Sbjct: 215 PKHSLYFVGYQDDRLIHLDPHYCQEMVDVWQPNFSLQ-----TFHCRSPRKMPISKMDPS 269

Query: 295 IAV 297
             +
Sbjct: 270 CCI 272


>gi|315047608|ref|XP_003173179.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
 gi|311343565|gb|EFR02768.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
          Length = 471

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 100/317 (31%), Positives = 154/317 (48%), Gaps = 68/317 (21%)

Query: 14  QIRRDITSRLWFTYRKGFVPI------GDS------------------GLTTDKGWGCML 49
           Q   D  SRLW TYR  F PI      G S                  G T+D GWGCM+
Sbjct: 136 QFLDDFESRLWITYRSQFPPIPKMPKTGSSDSSMPLGVRLRSQLIDTQGFTSDTGWGCMI 195

Query: 50  RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-G 108
           R GQ ++A  LLFL LGRDW+     +EE+  +++ +F D   AP+SIH+    GA+  G
Sbjct: 196 RSGQALLANTLLFLRLGRDWRRGSKIQEES--ELVSLFADHPRAPFSIHRFVQHGATACG 253

Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
           K  GEWFGP+  AQ ++ L K +  + +  +V  D + +  +  +    ++  S     +
Sbjct: 254 KCPGEWFGPSAAAQCIQALVKSNPQAGLRVYVTNDGSDIYERQFREVACDESGS----IK 309

Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
           P ++++ +RLGI  + P+Y               +D +K L              +PQS+
Sbjct: 310 PTLILLGVRLGIDRVTPIY---------------WDSLKAL------------LHYPQSV 342

Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--------STYHC 280
           G+ GG+P+ + YFI   G+   +LDPH  Q   C+  + + +E +          STYH 
Sbjct: 343 GIAGGRPSSSHYFIATQGDSFFYLDPH--QTRPCLAPRSEPTEDEESHPYSPEELSTYHT 400

Query: 281 PQASRLHILHMDPSIAV 297
            +  RLH+  MDPS+ +
Sbjct: 401 RRLRRLHVREMDPSMLI 417


>gi|195444549|ref|XP_002069918.1| GK11310 [Drosophila willistoni]
 gi|194166003|gb|EDW80904.1| GK11310 [Drosophila willistoni]
          Length = 676

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 103/322 (31%), Positives = 162/322 (50%), Gaps = 65/322 (20%)

Query: 9   HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
            + +E  RRD  SRLW TYR+ F  +  S  T+D GWGCMLR GQM++AQ L+   LGR 
Sbjct: 269 EEGIEGFRRDFYSRLWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLIVHFLGRS 328

Query: 69  WQWNVNS------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
           W+++  S      ++  + KI+K F D   +++P+SIH +   G + GK  G+W+GP +V
Sbjct: 329 WRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVSLGTALGKKPGDWYGPASV 388

Query: 121 AQVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTN--------------KRA 161
           + +L+   ++      D+ +I  +VA D T+ +  ++  C+                KR 
Sbjct: 389 SYLLKHALEHATQENADFDNISVYVAKDCTIYIQDIEDQCSIPEPAPKQTHVPWQQMKRP 448

Query: 162 SSNP------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
           S N        W+ ++++IPLRLG   +NP Y +                +K+L ST N 
Sbjct: 449 SLNEHQPDQQHWKSVIILIPLRLGTDKVNPAYAH---------------CLKLLLSTEN- 492

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
                       LG+IGGKP H+LYF+G+  + +I LDPH  Q +  V     + E    
Sbjct: 493 -----------CLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDV-----NQENFSM 536

Query: 276 STYHCPQASRLHILHMDPSIAV 297
            ++HC    ++    MDPS  +
Sbjct: 537 QSFHCKSPRKIKTSKMDPSCCI 558


>gi|90080692|dbj|BAE89827.1| unnamed protein product [Macaca fascicularis]
          Length = 263

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 89/219 (40%), Positives = 121/219 (55%), Gaps = 56/219 (25%)

Query: 104 GASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KR 160
           G  EGK++G+W+GPNTVAQVL+KLA +D WSS+  H+A+DNT+V+ ++++LC T+     
Sbjct: 2   GVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAG 61

Query: 161 ASSNPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
           A++ P                      W+PLVL+IPLRLG+ DIN  Y+  +K C     
Sbjct: 62  ATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC----- 116

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
                                 F  PQSLGVIGGKPN A YF+GYVG ++I+LDPHT Q 
Sbjct: 117 ----------------------FMMPQSLGVIGGKPNSAHYFVGYVGEELIYLDPHTTQP 154

Query: 260 IGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
                 +   S    D ++HC     R+ I  +DPSIAV
Sbjct: 155 AV----EPTGSCFIPDESFHCQHPPCRMSIAELDPSIAV 189


>gi|327306465|ref|XP_003237924.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
 gi|326460922|gb|EGD86375.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
          Length = 454

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 100/317 (31%), Positives = 152/317 (47%), Gaps = 68/317 (21%)

Query: 14  QIRRDITSRLWFTYRKGFVPI--------GDS----------------GLTTDKGWGCML 49
           Q   D  S+LW TYR  F PI        GDS                G T+D GWGCM+
Sbjct: 119 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSISLGVRLRSQLIDTQGFTSDTGWGCMI 178

Query: 50  RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-G 108
           R GQ ++A  LLF+ LGRDW+     +EE+  +++ +F D   AP+SIH+    GA+  G
Sbjct: 179 RSGQALLANTLLFIRLGRDWRRGSKLQEES--ELVSLFADHPRAPFSIHRFVHHGATACG 236

Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
           K  GEWFGP+  +Q ++ L K +    +  ++  D + +  +  K    ++        Q
Sbjct: 237 KCPGEWFGPSAASQCIQALVKSNPQVGLRVYITSDGSDIYEKQFKEVACDESGG----IQ 292

Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
           P ++++ +RLGI  + PVY               +D +K L              FPQS+
Sbjct: 293 PTLILLGVRLGIDRVTPVY---------------WDSLKAL------------LRFPQSV 325

Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--------STYHC 280
           G+ GG+P+ + YFI   G+   +LDPH  Q   C+  + + +  +          STYH 
Sbjct: 326 GIAGGRPSSSHYFIATQGDSFFYLDPH--QTRPCLTPRAESTGDEESHPYSPEELSTYHT 383

Query: 281 PQASRLHILHMDPSIAV 297
            +  RLHI  MDPS+ +
Sbjct: 384 RRLRRLHIREMDPSMLI 400


>gi|195158262|ref|XP_002020011.1| GL13755 [Drosophila persimilis]
 gi|194116780|gb|EDW38823.1| GL13755 [Drosophila persimilis]
          Length = 678

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 167/323 (51%), Gaps = 67/323 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM++AQ L+   LGR W
Sbjct: 263 EGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 322

Query: 70  QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++ +S+      +  + KI+K F D   +++P+SIH +   G + GK  G+W+GP +V+
Sbjct: 323 RYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGKKPGDWYGPASVS 382

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
            +L+   ++      D+ +I  +VA D T+ +  ++  C+  + A         +  PQ 
Sbjct: 383 YLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKPHVPWQQAKRPQA 442

Query: 167 ------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                       W+ L+++IPLRLG   +NPVY +                +K+L ST +
Sbjct: 443 EAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEH 487

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG+IGGKP H+LYF+G+  + +I LDPH  Q +  +     + E   
Sbjct: 488 ------------CLGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDI-----NQEHFS 530

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC  A +L +  MDPS  +
Sbjct: 531 LHSFHCKSARKLKVSKMDPSCCI 553


>gi|390177147|ref|XP_001357920.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
 gi|388858923|gb|EAL27056.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
          Length = 676

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 167/323 (51%), Gaps = 67/323 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM++AQ L+   LGR W
Sbjct: 263 EGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 322

Query: 70  QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++ +S+      +  + KI+K F D   +++P+SIH +   G + GK  G+W+GP +V+
Sbjct: 323 RYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGKKPGDWYGPASVS 382

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
            +L+   ++      D+ +I  +VA D T+ +  ++  C+  + A         +  PQ 
Sbjct: 383 YLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKPHVPWQQAKRPQA 442

Query: 167 ------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                       W+ L+++IPLRLG   +NPVY +                +K+L ST +
Sbjct: 443 EAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEH 487

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                        LG+IGGKP H+LYF+G+  + +I LDPH  Q +  +     + E   
Sbjct: 488 ------------CLGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDI-----NQEHFS 530

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC  A +L +  MDPS  +
Sbjct: 531 LHSFHCKSARKLKVSKMDPSCCI 553


>gi|452977855|gb|EME77619.1| hypothetical protein MYCFIDRAFT_191078 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 445

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 98/302 (32%), Positives = 144/302 (47%), Gaps = 61/302 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS----------------------GLTTDKGWGCMLRCGQMV 55
           D  SR+W TYR  F PI  S                      G T+D GWGCM+R GQ +
Sbjct: 114 DFESRVWITYRDAFPPIPKSSHPAAASKMSFTTKLRNFTNQAGFTSDTGWGCMIRSGQSL 173

Query: 56  IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEW 114
           +A  ++   LGRDW+     KE  +  IL +F D   AP+SIH+    GA   G   GEW
Sbjct: 174 LANTIVVHRLGRDWR--KGQKEREHKDILSLFADTPDAPFSIHKFVEHGAQACGTYPGEW 231

Query: 115 FGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           FGPN  A+ LR L  KY      V+    D+ + ++    L  T  +  +N ++QP ++V
Sbjct: 232 FGPNATARCLRALTDKYHQAGLRVYARPNDSDVYID---ALTATATQKDANDEFQPTLIV 288

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           + +RLGI+ + P Y   +K    L                           PQS+G+ GG
Sbjct: 289 LGIRLGIEKVTPAYHAALKAALEL---------------------------PQSMGIAGG 321

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YF+G+ G++  +LDPHT + +       Q S + +D T H  +  RL +  MDP
Sbjct: 322 RPSSSHYFVGHQGDNFFYLDPHTTRPMLS----PQPSAEDVD-TCHTRRVRRLSLAEMDP 376

Query: 294 SI 295
           S+
Sbjct: 377 SM 378


>gi|405953478|gb|EKC21133.1| Leucine-rich repeat-containing protein 6 [Crassostrea gigas]
          Length = 1114

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 103/317 (32%), Positives = 166/317 (52%), Gaps = 64/317 (20%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ ++D +S LWFTYR+ F  I  + LT+D GWGCMLR GQM++A+AL   +LG   +
Sbjct: 258 NIEKFKQDFSSLLWFTYRQDFPAIPGTKLTSDCGWGCMLRSGQMMLAKALTLHYLGP--E 315

Query: 71  WNVNS----KEEAYLK-ILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
           WNV S    ++E Y K I++ F D     +P+S+H++   G + GK  GEWFGP +VA +
Sbjct: 316 WNVFSDQTREQETYRKQIIRWFGDYLCDESPFSMHRLVEVGKNLGKQPGEWFGPASVAHI 375

Query: 124 LRKLAKYDD-----WSSIVFHVALDNTLVVNQVKKLCTTNKRA----------------- 161
           L++            S +  +V+ D T+    + +LC T  RA                 
Sbjct: 376 LKETMVKGQKTQTVLSDLCVYVSQDCTVYKQDIYELCCTRPRADTKFTNSTESEHESSQD 435

Query: 162 SSNPQWQ-PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
           +S+  W+  +V++IP+RLG + +NPVYI  +K               +LS          
Sbjct: 436 ASSMDWKRAVVILIPVRLGGEQLNPVYIPCVKG--------------LLSQD-------- 473

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
                  +G+IGGKP H+LYF+G+  + +I+LDPH  Q++    ++    +     +YHC
Sbjct: 474 -----SCIGIIGGKPKHSLYFVGWQEDKLIYLDPHYCQDVVDTRERHFPIQ-----SYHC 523

Query: 281 PQASRLHILHMDPSIAV 297
               ++ I  +DPS  +
Sbjct: 524 MSPRKVSIDKIDPSCTI 540


>gi|297820846|ref|XP_002878306.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
 gi|297324144|gb|EFH54565.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
          Length = 476

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 103/304 (33%), Positives = 150/304 (49%), Gaps = 50/304 (16%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW-Q 70
           L   R+D +S +  TYR+GF PIGD+  T+D  WGCMLR GQM+ AQALLF  LGR W +
Sbjct: 137 LAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRK 196

Query: 71  WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
            +     E YL+IL++F D   + +SIH + L G S G A G W GP  V +    LA+ 
Sbjct: 197 KDSEPPNEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARK 256

Query: 131 DDWSSIVFHVALDNT-----------------LVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           +   + V H +                     L +  V K C   + +  + +W P++L+
Sbjct: 257 NKEETDVKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCL--EFSEGDTEWPPILLL 314

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PL LG+  +NP YI  +                              FTFPQSLG++GG
Sbjct: 315 VPLVLGLDKVNPRYIPSLIA---------------------------TFTFPQSLGILGG 347

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           KP  + Y +G   +   +LDPH  Q +  V  + QD +    S+YHC     + +  +DP
Sbjct: 348 KPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVDT---SSYHCNTLRYVPLESLDP 404

Query: 294 SIAV 297
           S+A+
Sbjct: 405 SLAL 408


>gi|145245643|ref|XP_001395089.1| cysteine protease atg4 [Aspergillus niger CBS 513.88]
 gi|166990612|sp|A2QY50.1|ATG4_ASPNC RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|134079795|emb|CAK40930.1| unnamed protein product [Aspergillus niger]
          Length = 404

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 101/305 (33%), Positives = 153/305 (50%), Gaps = 59/305 (19%)

Query: 18  DITSRLWFTYRKGFVPI----GDS-------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR  F PI    GD                    G T+D GWGCM+R GQ 
Sbjct: 81  DFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTGWGCMIRSGQS 140

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A AL  L LGRDW+     +EE+  ++L +F D  TAP+S+H+    GA S GK  GE
Sbjct: 141 LLANALSMLVLGRDWRRGARFEEES--QLLSLFADTPTAPFSVHRFVKHGAESCGKYPGE 198

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ +  L+      ++  +V+ D + V     K     +  S    +QP +++
Sbjct: 199 WFGPSATAKCIEALSSQCGNPTLKVYVSNDTSEVYQD--KFMDIARNTSG--AFQPTLIL 254

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI +I PVY +G+K                               FPQS+G+ GG
Sbjct: 255 LGTRLGIDNITPVYWDGLKAA---------------------------LQFPQSVGIAGG 287

Query: 234 KPNHALYFIGYVGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YF+G  G+ + +LDPH T   +    + E  S++++D TYH  +  R+H+  MD
Sbjct: 288 RPSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVD-TYHTRRLRRIHVRDMD 346

Query: 293 PSIAV 297
           PS+ +
Sbjct: 347 PSMLI 351


>gi|398389911|ref|XP_003848416.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
 gi|339468291|gb|EGP83392.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
          Length = 440

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 102/335 (30%), Positives = 155/335 (46%), Gaps = 68/335 (20%)

Query: 4   ANKLSHQDL---EQIRRDITSRLWFTYRKGFVPI----------------------GDSG 38
           +  +  +DL    Q   D  SR+W TYR  F PI                          
Sbjct: 97  SKSMEEEDLGWPSQFLDDFESRVWMTYRNNFPPIQKASDPAATSNMSFATKLRSLANQGN 156

Query: 39  LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
            T+D GWGCM+R GQ ++A  ++ L LGRDW+     KE+ + +IL MF D   AP+SIH
Sbjct: 157 FTSDTGWGCMIRSGQSLLANTVVMLRLGRDWR--RGQKEKQHHEILSMFADTPEAPFSIH 214

Query: 99  QIALTGASE-GKAVGEWFGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCT 156
           +    GAS  G   GEWFGP+  A+ +R L  KY D    V+    D+ + ++    L  
Sbjct: 215 KFVEHGASACGTYPGEWFGPSATARCIRALTEKYHDVGLRVYARPNDSDVYID---TLTA 271

Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
           T  + S++  + P ++V+ +RLGI+ + P Y   +K    L                   
Sbjct: 272 TTTQHSASETFSPTLIVLGVRLGIEKVTPAYHAALKSILEL------------------- 312

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
                   PQS+G+ GG+P+ + YF+G+ G+   +LDPHT + +       +D E     
Sbjct: 313 --------PQSVGIAGGRPSSSHYFVGHQGDHFFYLDPHTTRPMLTAQPTAEDVE----- 359

Query: 277 TYHCPQASRLHILHMDPSI----AVVSQRSYSDYK 307
           + H  +  RL I  MDPS+     V  +  + D+K
Sbjct: 360 SCHTRRIRRLSIAEMDPSMLLGFLVRDKEDFEDWK 394


>gi|350631770|gb|EHA20141.1| hypothetical protein ASPNIDRAFT_178675 [Aspergillus niger ATCC
           1015]
          Length = 384

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 102/310 (32%), Positives = 154/310 (49%), Gaps = 59/310 (19%)

Query: 13  EQIRRDITSRLWFTYRKGFVPI----GDS-------------------GLTTDKGWGCML 49
           E    D  SR+W TYR  F PI    GD                    G T+D GWGCM+
Sbjct: 56  ESFLLDFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTGWGCMI 115

Query: 50  RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEG 108
           R GQ ++A AL  L LGRDW+     +EE+  ++L +F D  TAP+S+H+    GA S G
Sbjct: 116 RSGQSLLANALSMLVLGRDWRRGARFEEES--QLLSLFADTPTAPFSVHRFVKHGAESCG 173

Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
           K  GEWFGP+  A+ +  L+      ++  +V+ D + V     K     +  S    +Q
Sbjct: 174 KYPGEWFGPSATAKCIEALSSQCGNPTLKVYVSNDTSEVYQD--KFMDIARNTSG--AFQ 229

Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
           P ++++  RLGI +I PVY +G+K                               FPQS+
Sbjct: 230 PTLILLGTRLGIDNITPVYWDGLKAA---------------------------LQFPQSV 262

Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           G+ GG+P+ + YF+G  G+ + +LDPH T   +    + E  S++++D TYH  +  R+H
Sbjct: 263 GIAGGRPSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVD-TYHTRRLRRIH 321

Query: 288 ILHMDPSIAV 297
           +  MDPS+ +
Sbjct: 322 VRDMDPSMLI 331


>gi|449303631|gb|EMC99638.1| hypothetical protein BAUCODRAFT_344306 [Baudoinia compniacensis
           UAMH 10762]
          Length = 446

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 97/302 (32%), Positives = 146/302 (48%), Gaps = 61/302 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS----------------------GLTTDKGWGCMLRCGQMV 55
           D+ +R+W TYR  F PI  S                      G T+D GWGCM+R GQ +
Sbjct: 117 DMEARIWLTYRNNFPPIAKSSDPSAGSAMSFSTKLRNIGNSGGFTSDAGWGCMIRSGQTL 176

Query: 56  IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEW 114
           +A +L  L LGRDW+     KE+ Y  ++ +F D   AP+SIH+    GA   GK  GEW
Sbjct: 177 LANSLATLKLGRDWR--RGQKEDDYKHLISLFADTPEAPFSIHKFVEHGAQACGKHPGEW 234

Query: 115 FGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           FGP+  A+ ++ L  KY D    V+    D  + V+    L  T  +  +N ++QP ++V
Sbjct: 235 FGPSATARSVQALTEKYRDVGLRVYARPDDGDVYVD---SLFATAGQMDANDEFQPTLIV 291

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           + +RLGI  I PVY   +K    +                           PQS+G+ GG
Sbjct: 292 LGIRLGIDRITPVYHAALKATLEM---------------------------PQSVGIAGG 324

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YF+G+ G++  +LDPHT +         Q+   +  ++ H  +  RL I  MDP
Sbjct: 325 RPSSSHYFVGHQGDNFFYLDPHTTRQA-----IPQNPSAEDLASCHTRRLRRLKIAEMDP 379

Query: 294 SI 295
           S+
Sbjct: 380 SM 381


>gi|15232213|ref|NP_191554.1| cysteine protease ATG4b [Arabidopsis thaliana]
 gi|75182325|sp|Q9M1Y0.1|ATG4B_ARATH RecName: Full=Cysteine protease ATG4b; AltName:
           Full=Autophagy-related protein 4 homolog b;
           Short=AtAPG4b; Short=Protein autophagy 4b
 gi|7019689|emb|CAB75814.1| putative protein [Arabidopsis thaliana]
 gi|19912145|dbj|BAB88384.1| autophagy 4b [Arabidopsis thaliana]
 gi|110742150|dbj|BAE99003.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646468|gb|AEE79989.1| cysteine protease ATG4b [Arabidopsis thaliana]
          Length = 477

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 103/304 (33%), Positives = 151/304 (49%), Gaps = 50/304 (16%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW-Q 70
           L   R+D +S +  TYR+GF PIGD+  T+D  WGCMLR GQM+ AQALLF  LGR W +
Sbjct: 138 LAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRK 197

Query: 71  WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
            +    +E YL+IL++F D   + +SIH + L G S G A G W GP  V +    LA+ 
Sbjct: 198 KDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARK 257

Query: 131 DDWS--------SIVFHVALDNT---------LVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           +           S+  H+   +          L +  V K C   + +    +W P++L+
Sbjct: 258 NKEETDDKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCL--EFSEGETEWPPILLL 315

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PL LG+  +NP YI  +                              FTFPQSLG++GG
Sbjct: 316 VPLVLGLDRVNPRYIPSLIA---------------------------TFTFPQSLGILGG 348

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           KP  + Y +G   +   +LDPH  Q +  V  + QD +    S+YHC     + +  +DP
Sbjct: 349 KPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVDT---SSYHCNTLRYVPLESLDP 405

Query: 294 SIAV 297
           S+A+
Sbjct: 406 SLAL 409


>gi|308491308|ref|XP_003107845.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
 gi|308249792|gb|EFO93744.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
          Length = 518

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 92/320 (28%), Positives = 157/320 (49%), Gaps = 57/320 (17%)

Query: 2   RHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL 61
           + AN +S    E    D  SRLW TYR  F P+ ++  TTD GWGCM+R  QM++AQA++
Sbjct: 158 KDANGVS-SGFEDFCSDYYSRLWITYRTDFAPLLNTDTTTDCGWGCMIRTTQMMVAQAIM 216

Query: 62  FLHLGRDWQW--------NVNSKEEAYLK-------ILKMFEDRRTAPYSIHQIALTGAS 106
               GR+W++         +N +E  + +       ILK+FED+ ++P  IH++    A 
Sbjct: 217 LNRFGREWRFVRRKKSYVTINGEETDFDREKIKEWMILKLFEDKPSSPLGIHRMVEISAK 276

Query: 107 E--GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSN 164
           E   KAVG W+ P+    +++K       +  +  +  D  + ++   ++   +    + 
Sbjct: 277 EKGKKAVGSWYSPSEAVFIMKKAL-----TESISPLTGDTAMYLSIDGRVHIRDIEVETK 331

Query: 165 PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
              + L+LVI +RLG  ++NP+Y+  + +                            F+ 
Sbjct: 332 NWMKTLILVIVVRLGAAELNPIYVPHLMRL---------------------------FSM 364

Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT-------NQNIGCVYDKEQDSEKKLDST 277
              LGV GG+P+H+ +F+G+ G+ +I+LDPH        + N        + S+K  + +
Sbjct: 365 ESCLGVTGGRPDHSCWFVGFYGDQIIYLDPHVAHEYIPIDMNFNVNMTDNKKSKKCPERS 424

Query: 278 YHCPQASRLHILHMDPSIAV 297
           YHC   S++H L MDPS A+
Sbjct: 425 YHCRLLSKMHFLDMDPSCAL 444


>gi|126723748|ref|NP_001075911.1| cysteine protease ATG4C [Bos taurus]
 gi|126010621|gb|AAI33599.1| ATG4C protein [Bos taurus]
          Length = 458

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 109/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIAYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLK 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K CT+   AS N   + +++++P+RLG +  N  Y++ +K 
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTNDKAVIILVPVRLGGERTNADYLDFVK- 311

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|296489147|tpg|DAA31260.1| TPA: APG4 autophagy 4 homolog C [Bos taurus]
          Length = 458

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 109/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPTISLKEKMERYSDDREMQNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLK 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K CT+   AS N   + +++++P+RLG +  N  Y++ +K 
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTNDKAVIILVPVRLGGERTNADYLDFVK- 311

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|425778592|gb|EKV16710.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
           digitatum PHI26]
          Length = 401

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 99/318 (31%), Positives = 152/318 (47%), Gaps = 62/318 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR  F PI  +                       G T+D GWGCM+R GQ 
Sbjct: 74  DFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQS 133

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A A   L LGRDW+     KEE   K++ MF D   AP+SIH+    GA S GK  GE
Sbjct: 134 LLANAFSVLLLGRDWR--RGEKEEEESKLISMFADHPEAPFSIHKFVNRGAESCGKYPGE 191

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L+   +   +  +V  D + V        + ++        QP +++
Sbjct: 192 WFGPSATAKCIQLLSTQSEAHRLRVYVTNDTSDVYEDKFAHVSHDRSGCI----QPTLIL 247

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           I  RLGI+++ P Y +G+                           R   T+PQS+G+ GG
Sbjct: 248 IGTRLGIENVTPAYWDGL---------------------------RAALTYPQSVGIAGG 280

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YF+G     + FLDPHT +        E  ++++LDS Y+  +  R+HI  MDP
Sbjct: 281 RPSASHYFLGAQDCHLFFLDPHTTRPATPYRPDELYTQEELDS-YYTSRLRRIHIKDMDP 339

Query: 294 SI----AVVSQRSYSDYK 307
           S+     +  +  ++D+K
Sbjct: 340 SMLIGFLIKDEEDWADWK 357


>gi|195328749|ref|XP_002031074.1| GM25780 [Drosophila sechellia]
 gi|194120017|gb|EDW42060.1| GM25780 [Drosophila sechellia]
          Length = 703

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 103/321 (32%), Positives = 163/321 (50%), Gaps = 65/321 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM+ AQ L+   LGR W
Sbjct: 296 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFAQGLICHFLGRSW 355

Query: 70  QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++  S+      +  + KI+K F D   +++P+SIH +   G   GK  G+W+GP +V+
Sbjct: 356 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 415

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA--------------S 162
            +L+   ++      D+ +I  +VA D T+ +  ++  C+  + A              +
Sbjct: 416 YLLKHALEHASQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQKAKRPQA 475

Query: 163 SNPQ------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
            NP+      W+ L+++IPLRLG   +NPVY +                +K+L ST +  
Sbjct: 476 ENPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEHC- 519

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
                      LG+IGGKP H+LYF+G+  + +I LDPH  Q +  V     + E     
Sbjct: 520 -----------LGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 563

Query: 277 TYHCPQASRLHILHMDPSIAV 297
           ++HC    +L    MDPS  +
Sbjct: 564 SFHCKSPRKLKASKMDPSCCI 584


>gi|154300262|ref|XP_001550547.1| hypothetical protein BC1G_11320 [Botryotinia fuckeliana B05.10]
 gi|166990615|sp|A6SDQ3.1|ATG4_BOTFB RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|347841273|emb|CCD55845.1| similar to cysteine protease atg4 [Botryotinia fuckeliana]
          Length = 439

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 98/305 (32%), Positives = 143/305 (46%), Gaps = 57/305 (18%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  +++W TYR  F  I  S                       G T+D GWGCM+R GQ 
Sbjct: 106 DFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRSGQS 165

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A ALL L +GR+W+  V+S EE   KIL +F D   APYSIH+    GAS  GK  GE
Sbjct: 166 LLANALLTLRMGREWRRGVSSNEER--KILSLFADDPRAPYSIHKFVEHGASACGKHPGE 223

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L+     S +  ++  D + V     K  +  K   S+  + P +++
Sbjct: 224 WFGPSATARCIQALSNSQAKSELRVYITGDGSDVYED--KFMSIAKPNHSD--FTPTLIL 279

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLG+  I PVY   +K                           Y    PQS+G+ GG
Sbjct: 280 VGTRLGLDKITPVYWEALK---------------------------YSLQMPQSVGIAGG 312

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YFIG   +D  +LDPH  +      D  +D   +   + H  +  RLHI  MDP
Sbjct: 313 RPSSSHYFIGVQESDFFYLDPHQTRPALPYKDNVEDYTTEDIDSCHTRRLRRLHIKEMDP 372

Query: 294 SIAVV 298
           S+ + 
Sbjct: 373 SMLIA 377


>gi|355669957|gb|AER94693.1| ATG4 autophagy related 4-like protein C [Mustela putorius furo]
          Length = 396

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 110/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 13  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 72

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 73  WPDALNIENSDSESWTSNTVKKFTASFEASLSGEGELKTPTVSQKEAIRRYSDDHEMRNE 132

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 133 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 192

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K CT+   AS N   + ++++IP+RLG +  N  Y++ +K 
Sbjct: 193 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTDDKAVIILIPVRLGGERTNTDYLDFVK- 249

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 250 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 283

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 284 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 335


>gi|426215654|ref|XP_004002085.1| PREDICTED: cysteine protease ATG4C isoform 1 [Ovis aries]
 gi|426215656|ref|XP_004002086.1| PREDICTED: cysteine protease ATG4C isoform 2 [Ovis aries]
          Length = 458

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 109/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLK 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K CT+   AS N   + +++++P+RLG +  N  Y++ +K 
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTNDKAVIILVPVRLGGERTNADYLDFVK- 311

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|296804856|ref|XP_002843276.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
 gi|238845878|gb|EEQ35540.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
          Length = 473

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 100/307 (32%), Positives = 147/307 (47%), Gaps = 57/307 (18%)

Query: 18  DITSRLWFTYRKGFVPI----GDS------------------GLTTDKGWGCMLRCGQMV 55
           D  SRLW TYR  F PI    G S                  G T+D GWGCM+R GQ +
Sbjct: 141 DFESRLWITYRSHFPPIPKTGGSSSSSMPLGVRLRSQLIDTQGFTSDTGWGCMIRSGQSL 200

Query: 56  IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEW 114
           +A  LLFL LGR W+     +EE+  ++L +F D   AP+SIH+    GA+  GK  GEW
Sbjct: 201 LANTLLFLRLGRGWRRGSQEQEES--ELLSLFADHPRAPFSIHRFVQHGATACGKCPGEW 258

Query: 115 FGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVN-QVKKL-CTTNKRASSNPQWQPLVL 172
           FGP   AQ ++ LA     + +  ++  D + +   Q +++ C        +   +P ++
Sbjct: 259 FGPAAAAQCIQALANGHPQAGLNVYITSDGSDIYERQFREIACRGLGEDGEDDSIKPTLI 318

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           ++ +RLGI  + PVY   +K+                              FPQS+G+ G
Sbjct: 319 LLGVRLGIDRVTPVYWESLKEV---------------------------IRFPQSVGIAG 351

Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD--SEKKLDSTYHCPQASRLHILH 290
           G+P+ + YFI   G+   +LDPH  +         +D  S  +L STYH  +  RLHI  
Sbjct: 352 GRPSSSHYFIATQGDTFFYLDPHQTRPSLPPRTAGEDVYSPGEL-STYHTRRLRRLHIRE 410

Query: 291 MDPSIAV 297
           MDPS+ +
Sbjct: 411 MDPSMLI 417


>gi|453080987|gb|EMF09037.1| putative cysteine protease atg4 [Mycosphaerella populorum SO2202]
          Length = 447

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 149/320 (46%), Gaps = 65/320 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS----------------------GLTTDKGWGCMLRCGQMV 55
           D  SR+W TYR GF PI  S                      G T+D GWGCM+R GQ +
Sbjct: 115 DFESRIWITYRDGFPPIAKSTDPAAGSKMSFTTKLRSLTNQQGFTSDTGWGCMIRSGQSL 174

Query: 56  IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEW 114
           +A  +L   LGRDW+     K+E +  IL +F D   AP+SIH+    GA   G   GEW
Sbjct: 175 LANTILLHRLGRDWR--KGQKQEEHKNILSLFADTPEAPFSIHKFVEHGAQACGTYPGEW 232

Query: 115 FGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           FGPN  A+ LR L  KY      V+    D+ +  +    L  T  +  ++ ++QP ++V
Sbjct: 233 FGPNATARCLRALTDKYHGAGLRVYARPNDSDVYAD---ALIETATQKDADDKFQPTLIV 289

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           + +RLGI+ +   Y   +K    L                           PQS+G+ GG
Sbjct: 290 LGIRLGIEKVTSAYHVALKAALEL---------------------------PQSVGIAGG 322

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YF+G+ G+   +LDPHT +++       +D E     T H  +  +L +  MDP
Sbjct: 323 RPSSSHYFLGHQGDSFFYLDPHTTRHMLSPQPSAEDIE-----TCHTRRIRKLPLSEMDP 377

Query: 294 SI----AVVSQRSYSDYKNV 309
           S+     V SQ  + +++  
Sbjct: 378 SMLLGFLVRSQEEFEEWRKA 397


>gi|440902657|gb|ELR53425.1| Cysteine protease ATG4C [Bos grunniens mutus]
          Length = 458

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 109/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IHHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLK 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K CT+   AS N   + +++++P+RLG +  N  Y++ +K 
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTNDKAVIILVPVRLGGERTNADYLDFVK- 311

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|195570668|ref|XP_002103326.1| GD20357 [Drosophila simulans]
 gi|194199253|gb|EDX12829.1| GD20357 [Drosophila simulans]
          Length = 703

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 103/321 (32%), Positives = 162/321 (50%), Gaps = 65/321 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM+ AQ L+   LGR W
Sbjct: 296 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFAQGLICHFLGRSW 355

Query: 70  QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++  S+      +  + KI+K F D   +++P+SIH +   G   GK  G+W+GP +V+
Sbjct: 356 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 415

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
            +L+   ++      D+ +I  +VA D T+ +  ++  C+  + A         +  PQ 
Sbjct: 416 YLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQQAKRPQA 475

Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
                     W+ L+++IPLRLG   +NPVY +                +K+L ST +  
Sbjct: 476 ETPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEHC- 519

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
                      LG+IGGKP H+LYF+G+  + +I LDPH  Q +  V     + E     
Sbjct: 520 -----------LGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 563

Query: 277 TYHCPQASRLHILHMDPSIAV 297
           ++HC    +L    MDPS  +
Sbjct: 564 SFHCKSPRKLKASKMDPSCCI 584


>gi|326478657|gb|EGE02667.1| cysteine protease atg4 [Trichophyton equinum CBS 127.97]
          Length = 454

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 98/321 (30%), Positives = 150/321 (46%), Gaps = 72/321 (22%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDS----------------------------GLTTDKGW 45
           Q   D  S+LW TYR  F PI  +                            G T+D GW
Sbjct: 115 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 174

Query: 46  GCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA 105
           GCM+R GQ ++A  LLFL LGRDW+     +EE+  +++ +F D   AP+SIH+    GA
Sbjct: 175 GCMIRSGQALLANTLLFLRLGRDWRRGSKVQEES--ELVSLFADHPRAPFSIHRFVHHGA 232

Query: 106 SE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSN 164
           +  GK  GEWFGP+  +Q ++ L K +    +  ++  D + +  +  K    ++     
Sbjct: 233 TACGKCPGEWFGPSAASQCIQALVKSNPQVGLRVYITSDGSDIYEKQFKEVACDESGGI- 291

Query: 165 PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
              QP ++++ +RLGI  + PVY               +D +K L              F
Sbjct: 292 ---QPTLILLGVRLGIDRVTPVY---------------WDSLKAL------------LRF 321

Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--------S 276
           PQS+G+ GG+P+ + YFI   G+   +LDPH  Q   C+  + + +  +          S
Sbjct: 322 PQSVGIAGGRPSSSHYFIATQGDSFFYLDPH--QTRPCLTPRAESTGDEESHPYSPEELS 379

Query: 277 TYHCPQASRLHILHMDPSIAV 297
           TYH  +  RLHI  MDPS+ +
Sbjct: 380 TYHTRRLRRLHIREMDPSMLI 400


>gi|344278625|ref|XP_003411094.1| PREDICTED: cysteine protease ATG4C [Loxodonta africana]
          Length = 458

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 106/344 (30%), Positives = 155/344 (45%), Gaps = 91/344 (26%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGESELKTPSISLKKTIGKYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIVSWFGDSPLALFGLHQLIEYGKKAGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K C +   AS NP  + +++++P+RLG +  N  Y+  +K 
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQCAS--MASDNPDNKAVIILVPVRLGGERTNVDYLEFVKG 312

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 313 --------------ILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|121934653|sp|Q0U199.1|ATG4_PHANO RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
          Length = 467

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 87/265 (32%), Positives = 128/265 (48%), Gaps = 53/265 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
           D  SR+W TYR GF PI  S                     G T+D G+GCM+R GQ ++
Sbjct: 99  DFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCMIRSGQCIL 158

Query: 57  AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
           A AL  L LGRDW+W  N  ++ + +IL +F D   AP+SIH+    GA+  GK  GEWF
Sbjct: 159 ANALQILRLGRDWRWQENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAVCGKYPGEWF 218

Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
           GP+  A+ ++ LA     + +  +V+ D   V     K    ++       WQP ++++ 
Sbjct: 219 GPSAAARCIQDLANKHREAGLKVYVSGDGADVYEDKLKQVAVDEDG----LWQPTLILVG 274

Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
            RLGI  I PVY   +K                                PQS+G+ GG+P
Sbjct: 275 TRLGIDKITPVYWEALKAS---------------------------LQIPQSIGIAGGRP 307

Query: 236 NHALYFIGYVGNDVIFLDPHTNQNI 260
           + + YF+G  GN+  +LDPH+ + +
Sbjct: 308 SASHYFVGVQGNNFYYLDPHSTRPL 332


>gi|326470473|gb|EGD94482.1| hypothetical protein TESG_01998 [Trichophyton tonsurans CBS 112818]
          Length = 469

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 98/321 (30%), Positives = 150/321 (46%), Gaps = 72/321 (22%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDS----------------------------GLTTDKGW 45
           Q   D  S+LW TYR  F PI  +                            G T+D GW
Sbjct: 130 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 189

Query: 46  GCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA 105
           GCM+R GQ ++A  LLFL LGRDW+     +EE+  +++ +F D   AP+SIH+    GA
Sbjct: 190 GCMIRSGQALLANTLLFLRLGRDWRRGSKVQEES--ELVSLFADHPRAPFSIHRFVHHGA 247

Query: 106 SE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSN 164
           +  GK  GEWFGP+  +Q ++ L K +    +  ++  D + +  +  K    ++     
Sbjct: 248 TACGKCPGEWFGPSAASQCIQALVKSNPQVGLRVYITSDGSDIYEKQFKEVACDESGGI- 306

Query: 165 PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
              QP ++++ +RLGI  + PVY               +D +K L              F
Sbjct: 307 ---QPTLILLGVRLGIDRVTPVY---------------WDSLKAL------------LRF 336

Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--------S 276
           PQS+G+ GG+P+ + YFI   G+   +LDPH  Q   C+  + + +  +          S
Sbjct: 337 PQSVGIAGGRPSSSHYFIATQGDSFFYLDPH--QTRPCLTPRAESTGDEESHPYSPEELS 394

Query: 277 TYHCPQASRLHILHMDPSIAV 297
           TYH  +  RLHI  MDPS+ +
Sbjct: 395 TYHTRRLRRLHIREMDPSMLI 415


>gi|194901010|ref|XP_001980048.1| GG20629 [Drosophila erecta]
 gi|190651751|gb|EDV49006.1| GG20629 [Drosophila erecta]
          Length = 708

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 163/321 (50%), Gaps = 65/321 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM++AQ L+   LGR W
Sbjct: 301 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 360

Query: 70  QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++  S+      +  + KI+K F D   +++P+SIH +   G   GK  G+W+GP +V+
Sbjct: 361 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 420

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
            +L+   ++      D+ +I  +VA D T+ +  ++  C+  + A         +  PQ 
Sbjct: 421 YLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQQAKRPQA 480

Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
                     W+ ++++IPLRLG   +NPVY +                +K+L ST +  
Sbjct: 481 ETPKTEQQQHWKSVIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEH-- 523

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
                      LG+IGGKP H+LYF+G+  + +I LDPH  Q +  V     + E     
Sbjct: 524 ----------CLGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 568

Query: 277 TYHCPQASRLHILHMDPSIAV 297
           ++HC    +L    MDPS  +
Sbjct: 569 SFHCKSPRKLKASKMDPSCCI 589


>gi|195501322|ref|XP_002097748.1| GE26385 [Drosophila yakuba]
 gi|194183849|gb|EDW97460.1| GE26385 [Drosophila yakuba]
          Length = 706

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 163/321 (50%), Gaps = 65/321 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM++AQ L+   LGR W
Sbjct: 299 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 358

Query: 70  QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++  S+      +  + KI+K F D   +++P+SIH +   G   GK  G+W+GP +V+
Sbjct: 359 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 418

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
            +L+   ++      D+ +I  +VA D T+ +  ++  C+  + A         +  PQ 
Sbjct: 419 YLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQQAKRPQA 478

Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
                     W+ ++++IPLRLG   +NPVY +                +K+L ST +  
Sbjct: 479 ETPKTEQHQHWKSVIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEH-- 521

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
                      LG+IGGKP H+LYF+G+  + +I LDPH  Q +  V     + E     
Sbjct: 522 ----------CLGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 566

Query: 277 TYHCPQASRLHILHMDPSIAV 297
           ++HC    +L    MDPS  +
Sbjct: 567 SFHCKSPRKLKASKMDPSCCI 587


>gi|302783857|ref|XP_002973701.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
 gi|300158739|gb|EFJ25361.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
          Length = 358

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 91/288 (31%), Positives = 138/288 (47%), Gaps = 43/288 (14%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           D +SR+W TYR+GF  IG+S  T+D GWGCM+R GQM+ AQAL+   LGR W+       
Sbjct: 72  DFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHRLGRGWRRGEQPYA 131

Query: 78  EAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSS 135
             YL+IL  F D  +   P+SIH     G+  G A G W GP  +   +  LA+ D    
Sbjct: 132 REYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALCHAIEALARNDGRGR 191

Query: 136 ------IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
                  V+ V+ D          L   +          P+++++PL LG+  INP Y+ 
Sbjct: 192 QGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC-----PVLILVPLVLGLDKINPRYLP 246

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +                           R  F FPQS+G+ GGKP  ++YF+G   +  
Sbjct: 247 SL---------------------------RATFAFPQSVGIAGGKPAASVYFVGVQDDQA 279

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPH  Q +  V  +  + +    ++YHC    ++ +  +DPS+A+
Sbjct: 280 LYLDPHEVQKVVSVSGESLEFDS---ASYHCSVVRKMPLDAIDPSLAL 324


>gi|431905146|gb|ELK10197.1| Cysteine protease ATG4A [Pteropus alecto]
          Length = 342

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 93/265 (35%), Positives = 135/265 (50%), Gaps = 69/265 (26%)

Query: 35  GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTA 93
           G +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W    ++ + Y +IL+ F DR+  
Sbjct: 67  GGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDC 126

Query: 94  PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKK 153
            YSIHQ+                   +  +L   A           +A +N         
Sbjct: 127 CYSIHQM-----------------EKMCCILPLSAD----------IATENPSGSPNASN 159

Query: 154 LCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
              +   ++  P W+PL+L++PLRLGI  INPVY++  K                     
Sbjct: 160 --HSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFK--------------------- 196

Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKK 273
                        SLG +GGKPN+A YFIG++G+++IFLDPHT Q      D E++    
Sbjct: 197 -------------SLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD 240

Query: 274 LDSTYHCPQ-ASRLHILHMDPSIAV 297
            D T+HC Q   R++IL++DPS+A+
Sbjct: 241 -DQTFHCLQPPQRMNILNLDPSVAL 264


>gi|301764643|ref|XP_002917740.1| PREDICTED: cysteine protease ATG4C-like [Ailuropoda melanoleuca]
 gi|281350282|gb|EFB25866.1| hypothetical protein PANDA_006093 [Ailuropoda melanoleuca]
          Length = 458

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 109/357 (30%), Positives = 160/357 (44%), Gaps = 92/357 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPTVSQKETIRRYSDDHEMQNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK     ++ D  
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEETRHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K CT+   AS N   + ++++IP+RLG +  N  Y++ +K 
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTDDKAVIILIPVRLGGERTNTDYLDFVK- 311

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|302787965|ref|XP_002975752.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
 gi|300156753|gb|EFJ23381.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
          Length = 358

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 91/288 (31%), Positives = 138/288 (47%), Gaps = 43/288 (14%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           D +SR+W TYR+GF  IG+S  T+D GWGCM+R GQM+ AQAL+   LGR W+       
Sbjct: 72  DFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHRLGRGWRRGEQPYA 131

Query: 78  EAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSS 135
             YL+IL  F D  +   P+SIH     G+  G A G W GP  +   +  LA+ D    
Sbjct: 132 REYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALCHAIEALARNDGRGR 191

Query: 136 ------IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
                  V+ V+ D          L   +          P+++++PL LG+  INP Y+ 
Sbjct: 192 EGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC-----PVLILVPLVLGLDKINPRYLP 246

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +                           R  F FPQS+G+ GGKP  ++YF+G   +  
Sbjct: 247 SL---------------------------RATFAFPQSVGIAGGKPAASVYFVGVQDDQA 279

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPH  Q +  V  +  + +    ++YHC    ++ +  +DPS+A+
Sbjct: 280 LYLDPHEVQKVVSVSGESLEFDS---ASYHCSVVRKMLLDAIDPSLAL 324


>gi|73956170|ref|XP_852273.1| PREDICTED: cysteine protease ATG4C isoform 2 [Canis lupus
           familiaris]
 gi|73956176|ref|XP_865426.1| PREDICTED: cysteine protease ATG4C isoform 4 [Canis lupus
           familiaris]
          Length = 458

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 110/357 (30%), Positives = 159/357 (44%), Gaps = 92/357 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S  TTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSAFTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W---------------------------------------NVNSKE-------------E 78
           W                                        V+ KE             E
Sbjct: 135 WPDALNIENSDSDSWTSNTVKKFTASFEASLSGESELKTPTVSQKETIRRHSDDHEMRNE 194

Query: 79  AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
            Y  KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K CT+   AS N   + ++++IP+RLG +  N  Y++ +K 
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTDDKAVIILIPVRLGGERTNTDYLDFVK- 311

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|17862242|gb|AAL39598.1| LD17482p [Drosophila melanogaster]
          Length = 653

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 162/321 (50%), Gaps = 65/321 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM+ AQ L+   LGR W
Sbjct: 246 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFAQGLICHFLGRSW 305

Query: 70  QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++  S+      +  + KI+K F D   +++P+SIH +   G   GK  G+W+GP +V+
Sbjct: 306 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 365

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
            +L+   ++      D+ +I  +VA D T+ +  ++  C+  + A         +  PQ 
Sbjct: 366 YLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQQAKRPQA 425

Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
                     W+ L+++IPLRLG   +NPVY +                +K+L ST +  
Sbjct: 426 ETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEHC- 469

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
                      LG++GGKP H+LYF+G+  + +I LDPH  Q +  V     + E     
Sbjct: 470 -----------LGILGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 513

Query: 277 TYHCPQASRLHILHMDPSIAV 297
           ++HC    +L    MDPS  +
Sbjct: 514 SFHCKSPRKLKASKMDPSCCI 534


>gi|24647125|ref|NP_650452.1| CG6194 [Drosophila melanogaster]
 gi|23171357|gb|AAF55180.2| CG6194 [Drosophila melanogaster]
 gi|261490735|gb|ACX83596.1| RE44406p [Drosophila melanogaster]
          Length = 668

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 162/321 (50%), Gaps = 65/321 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM+ AQ L+   LGR W
Sbjct: 261 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFAQGLICHFLGRSW 320

Query: 70  QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++  S+      +  + KI+K F D   +++P+SIH +   G   GK  G+W+GP +V+
Sbjct: 321 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 380

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
            +L+   ++      D+ +I  +VA D T+ +  ++  C+  + A         +  PQ 
Sbjct: 381 YLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQQAKRPQA 440

Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
                     W+ L+++IPLRLG   +NPVY +                +K+L ST +  
Sbjct: 441 ETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEH-- 483

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
                      LG++GGKP H+LYF+G+  + +I LDPH  Q +  V     + E     
Sbjct: 484 ----------CLGILGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 528

Query: 277 TYHCPQASRLHILHMDPSIAV 297
           ++HC    +L    MDPS  +
Sbjct: 529 SFHCKSPRKLKASKMDPSCCI 549


>gi|50344862|ref|NP_001002103.1| cysteine protease ATG4C [Danio rerio]
 gi|47938047|gb|AAH71514.1| Autophagy-related 4C (yeast) [Danio rerio]
          Length = 463

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 109/350 (31%), Positives = 158/350 (45%), Gaps = 95/350 (27%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++++ R+D TSR+W TYR+ F  +  S  T+D GWGC LR GQM++AQALL   LGRDW+
Sbjct: 74  NVDEFRKDFTSRVWLTYREEFPALPGSSFTSDCGWGCTLRAGQMILAQALLLHILGRDWK 133

Query: 71  WNV-------------------------------------------NSKEEA--YLK--- 82
           W+                                               EEA  YLK   
Sbjct: 134 WSEALSLEPLDTETWTSSAARRLVATLEASIQGERAQASQPLCPVQGEAEEADSYLKETY 193

Query: 83  ---ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWSSI 136
              I+  F D  +A   I+++   G + GK  G+W+GP  VA +LRK    A       I
Sbjct: 194 HRTIVSWFGDGPSAQLGIYKLVELGMTSGKQAGDWYGPAVVAHILRKAVDEAVDAMLKGI 253

Query: 137 VFHVALDNTLVVNQVKKLCTTNKRASSNPQW---------QPLVLVIPLRLGIQDINPVY 187
             +VA D T+    V    +T   + S+PQ          + +V++IP+RLG + INP Y
Sbjct: 254 RVYVAQDCTVYSADVIDSHSTRTESHSDPQGLDSGASPDSRAVVILIPVRLGGEKINPEY 313

Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
           +N +K               ILS  Y              +G+IGGKP  A YF+G+  +
Sbjct: 314 LNFVK--------------SILSLEY-------------CIGIIGGKPKQAYYFVGFQDD 346

Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            +I++DPH  Q+   V      S+  L S +HCP   ++    MDPS  +
Sbjct: 347 SLIYMDPHYCQSFVDV----STSDFPLQS-FHCPSPKKMSFSKMDPSCTI 391


>gi|17544636|ref|NP_502208.1| Protein ATG-4.2 [Caenorhabditis elegans]
 gi|5824904|emb|CAB54515.1| Protein ATG-4.2 [Caenorhabditis elegans]
          Length = 521

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 93/315 (29%), Positives = 156/315 (49%), Gaps = 58/315 (18%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
            E    D  SRLW TYR  F  + D+  TTD GWGCM+R  QM++AQA++    GRDW++
Sbjct: 168 FENFCSDYYSRLWITYRTDFPALLDTDTTTDCGWGCMIRTTQMMVAQAIMVNRFGRDWRF 227

Query: 72  N--------VNSKEEAYLK-------ILKMFEDRRTAPYSIHQ-IALTGASEG-KAVGEW 114
                     +  E+ + +       ILK+FED+ TAP  IH+ + +    +G KAVG W
Sbjct: 228 TRRKRSHVAAHGDEDDFDREKIQEWMILKLFEDKPTAPLGIHKMVGIAAMGKGKKAVGSW 287

Query: 115 FGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
           + P+    +++K A  +  S +  + A    ++++   ++   +    +    + L+LVI
Sbjct: 288 YSPSEAVFIMKK-ALTESSSPLTGNTA----MLLSIDGRVHIRDIEVETKNWMKKLILVI 342

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
            +RLG  ++NP+Y+  + + +A+                              LG+ GG+
Sbjct: 343 VVRLGAAELNPIYVPHLMRLFAM---------------------------ESCLGITGGR 375

Query: 235 PNHALYFIGYVGNDVIFLDPHT---------NQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
           P+H+ +F+GY G+ +I+LDPH          N N   V    + ++K  + +YHC   S+
Sbjct: 376 PDHSSWFVGYYGDQIIYLDPHVAHEYIPIDINPNTNVVDSDSKKAKKCPEKSYHCRLLSK 435

Query: 286 LHILHMDPSIAVVSQ 300
           +H   MDPS A+  Q
Sbjct: 436 MHFFDMDPSCALCFQ 450


>gi|355755452|gb|EHH59199.1| Cysteine protease ATG4D, partial [Macaca fascicularis]
          Length = 427

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 96/323 (29%), Positives = 146/323 (45%), Gaps = 71/323 (21%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 61  DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 120

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   +E  + +I+  F D   AP+
Sbjct: 121 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 180

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 181 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 240

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K+                     
Sbjct: 241 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 278

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
               R E      LG++GGKP H+LYFIG              Q    V   +   E   
Sbjct: 279 ----RCELC----LGIMGGKPRHSLYFIGXXXXXXXXXXXXXCQPTVDVSQADFPLE--- 327

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
             ++HC    ++    MDPS  V
Sbjct: 328 --SFHCTSPRKMAFAKMDPSCTV 348


>gi|452837994|gb|EME39935.1| hypothetical protein DOTSEDRAFT_47435 [Dothistroma septosporum
           NZE10]
          Length = 442

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 98/319 (30%), Positives = 149/319 (46%), Gaps = 64/319 (20%)

Query: 4   ANKLSHQDL---EQIRRDITSRLWFTYRKGFVPIGDS----------------------G 38
           +  +  +DL    +   D+ S++W TYR  F PI  S                      G
Sbjct: 99  SKAMDEEDLGWPSEFLEDVESKIWLTYRNNFPPIPKSSEAAATSAMSFTTKLRNFANKDG 158

Query: 39  LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
            T+D GWGCM+R GQ ++A A+L   LGRDW+     KE  Y  IL +F D   +P SIH
Sbjct: 159 FTSDTGWGCMIRSGQSLLANAILIHRLGRDWR--RGDKEREYKDILSLFADTPESPLSIH 216

Query: 99  QIALTGASE-GKAVGEWFGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCT 156
           +    GA   G   GEWFGPN  A+ +R L  KY +    V+    D+ + V+    L  
Sbjct: 217 KFVEHGAQACGTYPGEWFGPNATARCIRALTEKYHEAGLQVYSRPNDSDVYVD---SLMQ 273

Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
           T  +  ++ ++QP ++V+ +RLGI+ + P Y   +K    L                   
Sbjct: 274 TAAQKDADDKFQPTLIVLGIRLGIEKVTPAYHAALKAALEL------------------- 314

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
                    QS+G+ GG+P+ + YFIG+ G++  +LDPHT + +       +D      +
Sbjct: 315 --------SQSVGIAGGRPSSSHYFIGHQGDNFFYLDPHTTRPMLSPQPLAEDI-----N 361

Query: 277 TYHCPQASRLHILHMDPSI 295
           + H  +  RL I  MDPS+
Sbjct: 362 SCHTRRVRRLGIAEMDPSM 380


>gi|395840680|ref|XP_003793181.1| PREDICTED: cysteine protease ATG4C [Otolemur garnettii]
          Length = 457

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 104/343 (30%), Positives = 153/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPGALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETMRKYSDYHETRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    AK+ D  
Sbjct: 195 IYHRKIVSWFGDSPLAFFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEAKHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+  + V    + + R S   + + +++++P+RLG +  NP Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNSDVIDTQSAS-RTSEGAEDKAVIILVPVRLGGERTNPDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q    V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQPFVDVSVKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|426230580|ref|XP_004009345.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Ovis
           aries]
          Length = 438

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 95/288 (32%), Positives = 145/288 (50%), Gaps = 36/288 (12%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGTLTSDCGWGCMLRSGQMMLAQGLLLHLLPRDWT 166

Query: 71  WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAK 129
           W+  +                  P         G + GK  G+W+GP+ VA +LRK +  
Sbjct: 167 WSQGAGLGPAEPPGLGSPSPGPGPXXXXXXXSWGRAPGKKAGDWYGPSLVAHILRKAVES 226

Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
             + + +V +V+ D T+    V +L     R+    +W+ +V+++P+RLG + +NPVY+ 
Sbjct: 227 CSEVTRLVVYVSQDCTVYKADVARLVA---RSDPTAEWKSVVILVPVRLGGETLNPVYVP 283

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            +K+                         R E      LG++GG P H+LYFIGY  + +
Sbjct: 284 CVKELL-----------------------RSELC----LGIMGGTPRHSLYFIGYQDDFL 316

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++LDPH  Q      D  Q ++  L+S +HC    ++    MDPS  V
Sbjct: 317 LYLDPHYCQP---TVDVSQ-ADFPLES-FHCTSPRKMAFAKMDPSCTV 359


>gi|149709514|ref|XP_001500964.1| PREDICTED: cysteine protease ATG4C [Equus caballus]
          Length = 458

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 108/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D TSR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFTSRIWLTYREEFPQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NVNSKE----------------EAYL----------------------------- 81
           W    N+ + +                EA L                             
Sbjct: 135 WPDALNIENSDFESWTSNTVKKFTASFEASLSEERELKTPTISLKETIGRYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K C +   AS +   + +++++P+RLG +  N  Y++ +K 
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCAS--MASDHADDKAVIILVPVRLGGERTNTDYLDFVK- 311

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|325091702|gb|EGC45012.1| cysteine protease [Ajellomyces capsulatus H88]
          Length = 508

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 97/308 (31%), Positives = 148/308 (48%), Gaps = 58/308 (18%)

Query: 18  DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
           D  S++W TYR  F  +P                     +   G TTD GWGCM+R GQ 
Sbjct: 128 DFESKIWLTYRSNFPLIPKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQS 187

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A AL  L LGRDW+     KEE+  K+L +F D   AP+SIH+    GAS  GK  GE
Sbjct: 188 LLANALAILSLGRDWRRGTKIKEES--KLLSLFADDPKAPFSIHRFVEHGASACGKYPGE 245

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKR-ASSNPQWQPLV 171
           WFGP+  A+ ++ L+   + + +  +V  D + V  ++ + + +     A ++    P +
Sbjct: 246 WFGPSATARCIQALSSECEHAGLNVYVTSDGSDVYEDRFRAIASAGGTGAGTSTDVHPTL 305

Query: 172 LVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVI 231
           +++ +RLGI  + PVY   +K                               +PQS+G+ 
Sbjct: 306 ILLGIRLGIDRVTPVYWEALKAV---------------------------LKYPQSVGIA 338

Query: 232 GGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STYHCPQASRLHIL 289
           GG+P+ + YFIG  G+   +LDPH +     VY    D     +  +TYH  +  RLHI 
Sbjct: 339 GGRPSSSHYFIGAQGSHFFYLDPH-HTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIK 397

Query: 290 HMDPSIAV 297
            MDPS+ +
Sbjct: 398 DMDPSMLI 405


>gi|332232054|ref|XP_003265216.1| PREDICTED: cysteine protease ATG4C isoform 1 [Nomascus leucogenys]
 gi|332232056|ref|XP_003265217.1| PREDICTED: cysteine protease ATG4C isoform 2 [Nomascus leucogenys]
          Length = 458

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 103/343 (30%), Positives = 153/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   AP+ +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 VYHRKIISWFGDSPLAPFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+  + V    + +   S N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|395530478|ref|XP_003767321.1| PREDICTED: cysteine protease ATG4C [Sarcophilus harrisii]
          Length = 458

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 102/342 (29%), Positives = 153/342 (44%), Gaps = 89/342 (26%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ RRD  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 77  NVEEFRRDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 136

Query: 71  W-------NVNSK--------------------------------------------EEA 79
           W       N +S+                                            E  
Sbjct: 137 WPDALDVDNSDSESWTSHTVKKLTASLEASLTGERAAQDPSPIKEPPRRGSDDGGGEESC 196

Query: 80  YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWSSI 136
           + KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D   I
Sbjct: 197 HRKIVSWFADSPLACFGLHQLIEHGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQGI 256

Query: 137 VFHVALDNTLV-VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCY 195
             +VA D T+   + + K C++      N + + +++++P+RLG +  N  Y+  +K   
Sbjct: 257 TIYVAQDCTVYKADVIDKQCSSMD--PENTEDKAVIILVPVRLGGERTNMDYLEFVK--- 311

Query: 196 ALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                       ILS  Y              +G+IGGKP  + YF G+  + +I++DPH
Sbjct: 312 -----------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDPH 347

Query: 256 TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
             Q+   V  K+   E     ++HCP   ++    MDPS  V
Sbjct: 348 YCQSFVDVSIKDFPLE-----SFHCPSPKKMSFRKMDPSCTV 384


>gi|296208133|ref|XP_002750954.1| PREDICTED: cysteine protease ATG4C [Callithrix jacchus]
          Length = 458

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 153/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGKYSDDHEIRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               K++  F D   AP+ +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKVISWFGDSPLAPFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+  + V    + +   S N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|168693565|ref|NP_001108301.1| uncharacterized protein LOC100137698 [Xenopus laevis]
 gi|163915830|gb|AAI57741.1| LOC100137698 protein [Xenopus laevis]
          Length = 468

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 90/330 (27%), Positives = 156/330 (47%), Gaps = 78/330 (23%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++++ ++D  SR+W TYR+ F  +  + LTTD GWGCM+R GQM++AQ LL   L R+W 
Sbjct: 94  EIDRFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLMHLLSREWT 153

Query: 71  W-------------------------------------------NVNSKEEAYLKILKMF 87
           W                                                 + +  I++ F
Sbjct: 154 WPEALYTHFVEMEPIRSSSPSRMPLSSLATSHSASDCWPHAHSSRAPHGNQVHRNIIRWF 213

Query: 88  EDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD-DWSSIVFHVALDNTL 146
            D  +AP+ +H++   G+  GK  G+W+GP+ VA +++K  +   + + +  +V+ D T+
Sbjct: 214 SDHPSAPFGLHRMVALGSIFGKKAGDWYGPSIVAHIIKKAIETSCEVAELSVYVSQDCTV 273

Query: 147 VVNQVKKLCTTN--KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYD 204
               +++L   +     +S    + +++++P RLG +  NPVY + +K+   +       
Sbjct: 274 YKADIEQLFAGDVPHAETSRDAGKAVIILVPARLGGETFNPVYKHCLKEFLRM------- 326

Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
                               P  LG+IGGKP H+LYFIGY  N +++LDPH +Q+    Y
Sbjct: 327 --------------------PSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYSQS----Y 362

Query: 265 DKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
                ++  L+S +HC    ++ I  MDPS
Sbjct: 363 IDTSRNDFPLES-FHCNTPRKISITRMDPS 391


>gi|392572178|gb|EIW65350.1| hypothetical protein TRAVEDRAFT_33890 [Trametes versicolor
           FP-101664 SS1]
          Length = 997

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 92/284 (32%), Positives = 139/284 (48%), Gaps = 73/284 (25%)

Query: 18  DITSRLWFTYRKGFVPI---------------------------------GDSGLTTDKG 44
           D TSR+W TYR  F PI                                 G+ G TTD G
Sbjct: 301 DFTSRIWLTYRSQFFPIRDTTLAALDAELMDNPTGVPSSPPTKKWNWPLGGEKGWTTDAG 360

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA----YLKILKMFEDRRTA--PYSIH 98
           WGCMLR GQ ++A AL+ LHLGRDW+   +    A    Y++I+  F D  +   P+S+H
Sbjct: 361 WGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQIVTWFLDNPSPLCPFSVH 420

Query: 99  QIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV-----KK 153
           ++AL G   GK VG+WFGP+T A  ++ L      +++    A+D TL  + V       
Sbjct: 421 RMALVGKDLGKDVGQWFGPSTAAGAIKTLVHAFPEATLGVANAVDGTLYESDVYAASRSV 480

Query: 154 LCTTNKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
           + +T +   +   W  + ++++I +RLGI+ +NP+Y N IK  Y                
Sbjct: 481 MYSTRRHGHARMDWGDRAVLVLIGIRLGIEGVNPLYYNTIKTLY---------------- 524

Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                      TFPQS+G+ GG+P+ + YF+G   +++ +LDPH
Sbjct: 525 -----------TFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 557


>gi|410921904|ref|XP_003974423.1| PREDICTED: cysteine protease ATG4C-like [Takifugu rubripes]
          Length = 468

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 108/373 (28%), Positives = 164/373 (43%), Gaps = 106/373 (28%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E  RRD  SR+W TYR+ F P+  S LT+D GWGCMLR GQM++AQALL   LGRDW 
Sbjct: 67  NVEDFRRDFGSRIWLTYREEFPPLPGSSLTSDCGWGCMLRAGQMMLAQALLLHFLGRDWT 126

Query: 71  WN---------------------------------------------VNSKEEA------ 79
           W+                                               S EEA      
Sbjct: 127 WSGAMSLQPLDTETWTTSAAKRLVASLESSLQASPGPSDPVVSQRQVAGSGEEAGVHTDG 186

Query: 80  --YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
             +  ++  F D  +A + +H++   G + GK  GEW+GP  VA +L+K    A+    +
Sbjct: 187 GFHRTLVSWFGDSPSAQFGLHRMVRLGLAMGKRAGEWYGPAVVAHILKKAVEEARDPCLA 246

Query: 135 SIVFHVALDNTL----VVNQ-------------VKKLCTTNKRASSNPQWQPLVLVIPLR 177
            I  +V+ D T+    V++              V      ++ AS++P  + +++++P+R
Sbjct: 247 GISSYVSQDCTVYSADVIDSHKASASAAAERPDVTSSSHNSQPASASPDSRAVIILVPVR 306

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG +  NP Y N  K                LS  Y              +G+IGGKP  
Sbjct: 307 LGGEKTNPDYFNLAKS--------------FLSLDY-------------CIGIIGGKPKQ 339

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           A YF+G+  + +I++DPH  Q+   V      S+  L S +HCP   ++    MDPS   
Sbjct: 340 ACYFVGFQDDSLIYMDPHYCQSFVDV----STSDFPLQS-FHCPSPKKMPFTKMDPSCTF 394

Query: 298 -VSQRSYSDYKNV 309
               RS  D++ +
Sbjct: 395 GFYSRSAQDFERI 407


>gi|194764839|ref|XP_001964535.1| GF23235 [Drosophila ananassae]
 gi|190614807|gb|EDV30331.1| GF23235 [Drosophila ananassae]
          Length = 668

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 100/321 (31%), Positives = 163/321 (50%), Gaps = 65/321 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           + +E  RRD  SR+W TYR+ F  +  S  T+D GWGCMLR GQM++AQ L+   +GR W
Sbjct: 260 EGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFMGRTW 319

Query: 70  QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
           +++  S+      +  + KI+K F D   +++P+SIH +   G + GK  G+W+GP +V+
Sbjct: 320 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGENLGKKPGDWYGPASVS 379

Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
            +L+   ++      D+ +I  +VA D T+ +  ++  C+  + A         +  PQ 
Sbjct: 380 YLLKHALEHAAQENADFDNISIYVAKDCTIYLQDIEDQCSVPEPAPKPNVPWQQAKRPQA 439

Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
                     W+ L+++IPLRLG   +N  Y +                +K+L ST +  
Sbjct: 440 EVSKTEHQQHWKALIVLIPLRLGSDKLNLAYAH---------------CLKLLLSTEH-- 482

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
                      LG+IGGKP H+LYF+G+  + +I LDPH  Q +  V     + E    +
Sbjct: 483 ----------CLGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLN 527

Query: 277 TYHCPQASRLHILHMDPSIAV 297
           ++HC    +L    MDPS  +
Sbjct: 528 SFHCKSPRKLKSSKMDPSCCI 548


>gi|335774946|gb|AEH58408.1| cysteine protease ATG4C-like protein, partial [Equus caballus]
          Length = 400

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 107/357 (29%), Positives = 161/357 (45%), Gaps = 92/357 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D TSR+W TYR+ F  I  S LTTD GWGC +R GQM++AQ L+   LGR W 
Sbjct: 17  NVEEFRKDFTSRIWLTYREEFPQIEGSTLTTDCGWGCTVRTGQMLLAQGLILHFLGRAWT 76

Query: 71  W----NVNSKE----------------EAYL----------------------------- 81
           W    N+ + +                EA L                             
Sbjct: 77  WPDALNIENSDFESWTSNTVKKFTASFEASLSEERELKTPTISLKETIGRYSDDHEMRNE 136

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 137 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 196

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K C +   AS +   + +++++P+RLG +  N  Y++ +K 
Sbjct: 197 GITIYVAQDCTVYSSDVIDKQCAS--MASDHADDKAVIILVPVRLGGERTNTDYLDFVK- 253

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 254 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 287

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 288 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 339


>gi|403356037|gb|EJY77606.1| Cysteine protease family C54 putative [Oxytricha trifallax]
 gi|403376523|gb|EJY88241.1| Cysteine protease family C54 putative [Oxytricha trifallax]
          Length = 480

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 94/308 (30%), Positives = 157/308 (50%), Gaps = 50/308 (16%)

Query: 13  EQIRRDITSRLWFTYRKGF-VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD--- 68
           +++ R   S  WFTYR    +PIG S   +D GWGCM+R GQM++ QA++  H+  D   
Sbjct: 148 DKLTRAFKSVTWFTYRNELELPIGSSTYHSDAGWGCMVRTGQMLLFQAMM-RHVFEDNLK 206

Query: 69  --WQWNVNSKEEAYLKILKMFEDR---RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
             +   +    E YL +L++F+D    + +PYSI  IA  G    +  G+W+GP  ++ V
Sbjct: 207 YEYIEKITEYREEYLNLLRLFQDNGEGQFSPYSIQNIAFQGLKIDRKPGDWYGPQAISIV 266

Query: 124 LRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQD 182
           L++L K Y        +V L+  + +N +++         S    Q + +VIPLRLG+  
Sbjct: 267 LKRLTKIYKPVKQFTMYVCLEGNIYLNVIQE--------KSKDWTQSVFIVIPLRLGLNY 318

Query: 183 INPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFI 242
           I P Y++ +KK                            FTFPQ++G+ GG+ N ALYFI
Sbjct: 319 IEPEYLSSVKKV---------------------------FTFPQNVGIAGGRENSALYFI 351

Query: 243 GY--VGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-V 298
           G     N++I+LDPH   +++     +  +   + +S++HC +  ++ +  M  S+A+  
Sbjct: 352 GISDSSNNLIYLDPHLVQKSVPTCNMQTNEQFYQYESSFHCTKFKKMPLNRMCTSVAIGF 411

Query: 299 SQRSYSDY 306
             R Y+D+
Sbjct: 412 YIRDYNDF 419


>gi|348529755|ref|XP_003452378.1| PREDICTED: cysteine protease ATG4C-like [Oreochromis niloticus]
          Length = 478

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 110/381 (28%), Positives = 161/381 (42%), Gaps = 115/381 (30%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D++  RRD  SR+W TYR+ F P+  S LT+D GWGCMLR GQM++AQ L+   LGRDW 
Sbjct: 70  DVDAFRRDFASRVWLTYREEFSPLPGSTLTSDCGWGCMLRAGQMMLAQGLMLHFLGRDWT 129

Query: 71  WN------------------------------------------------VNSKEEAYLK 82
           W+                                                + S EEA   
Sbjct: 130 WSEALTLQPLDTETWTTTAAKRLVASLEASLQGVPGPSVRSSSPQAQALSLGSAEEADAH 189

Query: 83  ILKM--------FEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYD 131
           + +M        F D  + P  +H++   G + GK  G+W+GP  VA +L+K    A   
Sbjct: 190 LKEMYHRTLVSWFGDSPSTPLGLHRLVRLGLTMGKQAGDWYGPAVVAHILKKAVEEAMDP 249

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKR----------------------ASSNPQWQP 169
             + I  +V+ D T+    V   C    R                      AS+ P+ + 
Sbjct: 250 GLACITAYVSQDCTVYSADVVD-CHRAPRAERTSDETPDAPTLPQNDQPAHASTLPESRA 308

Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
           +++++P+RLG +  NP Y +  K               ILS  Y              +G
Sbjct: 309 VIILVPVRLGGEKTNPEYFDFAK--------------SILSLEY-------------CIG 341

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
           +IGGKP  A YF+G+  + +I++DPH  Q+   V      S+  L S YHCP   ++   
Sbjct: 342 IIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDV----STSDFPLQS-YHCPSPKKMPFS 396

Query: 290 HMDPSIAV-VSQRSYSDYKNV 309
            MDPS  V    RS  DY+ +
Sbjct: 397 KMDPSCTVGFYSRSVQDYERI 417


>gi|167526339|ref|XP_001747503.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773949|gb|EDQ87583.1| predicted protein [Monosiga brevicollis MX1]
          Length = 355

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 85/248 (34%), Positives = 125/248 (50%), Gaps = 37/248 (14%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGL-TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           ++IR   ++ LWFTYR     IGDS    TD+GWGC LR GQM++ +AL   H  RD+  
Sbjct: 45  DEIRSRASAFLWFTYRNSEYAIGDSPRHKTDRGWGCTLRVGQMIVGEALQRCHCPRDYDK 104

Query: 72  NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
                E A + ILK FEDR     S+H +A+     GK  G+W  P  VA VLR      
Sbjct: 105 LSYPSEAARMSILKEFEDRPDRVLSVHAMAMQSKFVGKRAGQWHTPTDVAHVLRLAVNEQ 164

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
           +   +  HVA+D+ +V++ ++KL   ++           +L +PLRLGI  +    I  +
Sbjct: 165 EAMGLQVHVAMDSMVVLDDLRKLFRADR---------ATLLFVPLRLGIDIVQAEMIPAV 215

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
           K+                            F  P +LG++GG+P  A YFIGY+ ++++ 
Sbjct: 216 KRF---------------------------FHSPSALGIMGGRPGAAHYFIGYMDHNLLL 248

Query: 252 LDPHTNQN 259
           LDPHT Q+
Sbjct: 249 LDPHTTQD 256


>gi|225685095|gb|EEH23379.1| peptidase family C54 [Paracoccidioides brasiliensis Pb03]
          Length = 508

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 99/305 (32%), Positives = 149/305 (48%), Gaps = 57/305 (18%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S++W TYR GF  I  S                       G TTD GWGCM+R GQ 
Sbjct: 148 DFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQS 207

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A AL  L LGRDW+    + +E+ L  L +F D   AP+SIH+    GAS  GK  GE
Sbjct: 208 LLASALSILSLGRDWRRGTKTDQESNL--LSLFADDPKAPFSIHRFVEYGASACGKYPGE 265

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L+     + +  +V  D + V     +  T     ++     P +++
Sbjct: 266 WFGPSATARCIQALSSECKHAGLNVYVTSDGSDVYED--RFRTIASSGATEAGIHPTLIL 323

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           + +RLGI  + PVY   +K           D++K                +PQS+G+ GG
Sbjct: 324 LGIRLGIDRVTPVYWEALK-----------DVLK----------------YPQSVGIAGG 356

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YFIG  G+   +LDPH  +     +   Q  +E++L+S YH  +  RLHI  MD
Sbjct: 357 RPSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNS-YHTRRLRRLHIKDMD 415

Query: 293 PSIAV 297
           PS+ +
Sbjct: 416 PSMLI 420


>gi|226294409|gb|EEH49829.1| cysteine protease atg4 [Paracoccidioides brasiliensis Pb18]
          Length = 513

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 96/304 (31%), Positives = 146/304 (48%), Gaps = 55/304 (18%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S++W TYR GF  I  S                       G TTD GWGCM+R GQ 
Sbjct: 153 DFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQS 212

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A AL  L LGRDW+    + +E+ L  L +F D   AP+SIH+    GAS  GK  GE
Sbjct: 213 LLASALSILSLGRDWRRGTKTDQESNL--LSLFADDPKAPFSIHRFVEYGASACGKYPGE 270

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L+     + +  +V  D + V     +  T     ++     P +++
Sbjct: 271 WFGPSATARCIQALSSECKHAGLNVYVTSDGSDVYED--RFRTIASSGATEAGIHPTLIL 328

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           + +RLGI  + PVY   +K           D++K                +PQS+G+ GG
Sbjct: 329 LGIRLGIDRVTPVYWEALK-----------DVLK----------------YPQSVGIAGG 361

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YFIG  G+   +LDPH  +     +   Q   ++  ++YH  +  RLHI  MDP
Sbjct: 362 RPSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDP 421

Query: 294 SIAV 297
           S+ +
Sbjct: 422 SMLI 425


>gi|302674653|ref|XP_003027011.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
 gi|300100696|gb|EFI92108.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
          Length = 858

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 88/266 (33%), Positives = 131/266 (49%), Gaps = 69/266 (25%)

Query: 18  DITSRLWFTYRKGFVPI------------------------GDSGLTTDKGWGCMLRCGQ 53
           D  SRLW TYR GF PI                        G  GLT+D GWGCMLR GQ
Sbjct: 160 DFASRLWLTYRSGFAPIRDMALEELEPVRGGALSTLTSALTGRRGLTSDAGWGCMLRTGQ 219

Query: 54  MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAV 111
            ++A AL+   +GR            Y+ ++ +F D  +  AP+S+H++AL G + GK V
Sbjct: 220 SLLANALVVAWMGR-------GALALYIHLISLFLDSPSPSAPFSVHRMALAGRALGKDV 272

Query: 112 GEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW--QP 169
           G+WFGP+T A  ++ L     +      VA+    VV Q ++     +R     +W  QP
Sbjct: 273 GQWFGPSTAAGAIKALVNA--YPDAGLGVAIAEDGVVYQTQRRQKERER-----EWGDQP 325

Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
           +++++ +RLG+  +NP+Y + IK+ Y                           TFPQSLG
Sbjct: 326 VLVLLGIRLGLDGVNPIYYDTIKQLY---------------------------TFPQSLG 358

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPH 255
           + GG+P+ + YF+G    D+ +LDPH
Sbjct: 359 IAGGRPSSSYYFVGAQAGDLFYLDPH 384


>gi|170032510|ref|XP_001844124.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167872594|gb|EDS35977.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 628

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 100/340 (29%), Positives = 157/340 (46%), Gaps = 57/340 (16%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
           +  + +E  +RD  SRLW TYRK F  + DS  T+D GWGCM+R GQM++AQ L+   LG
Sbjct: 186 VEDEGIEAFKRDFISRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLITHFLG 245

Query: 67  RDWQWNVNS------------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
           R W+W+ +             ++  + KI++ F D   RT+P+SIH +   G   GK  G
Sbjct: 246 RGWRWDPSQEGLRLNFDSLQYEDGIHRKIIRWFGDTSSRTSPFSIHTLVALGKEAGKKPG 305

Query: 113 EWFGPNTVAQVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
           +W+GP +VA +LR+  K       D   I  +VA D  + +  +   CT +   S  P W
Sbjct: 306 DWYGPGSVAHLLRQAVKLAAKEITDLDGINVYVAQDCAVYIQDILDECTVSTTPSVAP-W 364

Query: 168 Q------------PLVLVIPLRLGIQDINPVYINGIKKCYALP----------------- 198
           Q            P     P R+G         +     +  P                 
Sbjct: 365 QKKMSSAAACTDSPSQATTP-RVGATASCSSSSSPHATGFVAPSDTADESAPGSHWKSLI 423

Query: 199 -ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN 257
            + P+    + L+  YN    +   +    +G+IGG+P H+L+F+GY  + +I LDPH  
Sbjct: 424 LLVPLRLGTEKLNPIYN-DCLKAMLSLDNCIGIIGGRPKHSLFFVGYQEDKLIHLDPHYC 482

Query: 258 QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           Q++  V     + E    S++HC    ++ +  MDPS  +
Sbjct: 483 QDMVDV-----NQENFPVSSFHCKSPRKMKLSKMDPSCCI 517


>gi|166990662|sp|A7F045.2|ATG4_SCLS1 RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
          Length = 439

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 98/316 (31%), Positives = 147/316 (46%), Gaps = 58/316 (18%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  +++W TYR  F  I  S                       G T+D GWGCM+R GQ 
Sbjct: 106 DFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRSGQS 165

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A ALL L +GR+W+   +S EE   KIL +F D   APYSIH+    GAS  GK  GE
Sbjct: 166 LLANALLTLRMGREWRRGSSSNEER--KILSLFADDPRAPYSIHKFVEHGASACGKHPGE 223

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L      S +  ++  D + V        +  K  S+  ++ P +++
Sbjct: 224 WFGPSAAARCIQALTNSQVESELRVYITGDGSDVYEDT--FMSIAKPNST--KFTPTLIL 279

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLG+  I PVY   +K    +P                           QS+G+ GG
Sbjct: 280 VGTRLGLDKITPVYWEALKSSLQMP---------------------------QSVGIAGG 312

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YFIG   +D  +LDPH  +      D  +D   +   + H  +  RLHI  MDP
Sbjct: 313 RPSSSHYFIGVQESDFFYLDPHQTRPALPFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDP 372

Query: 294 SIAVVSQ-RSYSDYKN 308
           S+ +    R  +D+K+
Sbjct: 373 SMLIAFLIRDENDWKD 388


>gi|119195519|ref|XP_001248363.1| cysteine protease atg4 [Coccidioides immitis RS]
 gi|303321428|ref|XP_003070708.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
           SOWgp]
 gi|121769827|sp|Q1E5M9.1|ATG4_COCIM RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|240110405|gb|EER28563.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
           SOWgp]
 gi|320040173|gb|EFW22106.1| cysteine protease atg4 [Coccidioides posadasii str. Silveira]
 gi|392862420|gb|EAS36938.2| cysteine protease atg4 [Coccidioides immitis RS]
          Length = 432

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 97/305 (31%), Positives = 143/305 (46%), Gaps = 62/305 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S+ WFTYR  F  I  S                       G T D GWGCM+R GQ 
Sbjct: 108 DFESKFWFTYRSNFPAIPKSRDPDTPLALTLSVRLRSQFLDTHGFTADTGWGCMIRSGQS 167

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A AL  L+LGRDW+     KEE   ++L +F D   AP+SIH+    GAS  GK  GE
Sbjct: 168 LLANALSILNLGRDWRRGSKIKEEC--ELLSLFADNPQAPFSIHRFVDYGASACGKHPGE 225

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLV-VNQVKKLCTTNKRASSNPQWQPLVL 172
           WFGP+  A+ +  L+     + +  +V  D + V  +Q +++   +         +P ++
Sbjct: 226 WFGPSATARCIEALSNECKHTDLNVYVMSDGSDVHEDQFRQIAGPDG-------IRPTLI 278

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           ++ +RLGI+ + PVY   ++                               +PQS+G+ G
Sbjct: 279 LLGVRLGIESVTPVYWEALRAI---------------------------IRYPQSVGIAG 311

Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           G+P+ +LYFIG  G    +LDPH  +           S + LD TYH  +  RLHI  MD
Sbjct: 312 GRPSSSLYFIGVQGPYFFYLDPHHTRPAVSWNPDSTLSPENLD-TYHTRRLRRLHIREMD 370

Query: 293 PSIAV 297
           PS+ +
Sbjct: 371 PSMLI 375


>gi|417401291|gb|JAA47536.1| Putative cysteine protease required for autophagy [Desmodus
           rotundus]
          Length = 458

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 105/346 (30%), Positives = 153/346 (44%), Gaps = 95/346 (27%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGRYPDDREMQNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 VYHRKIISWFGDSPVALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRA---SSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
            I  +VA D T+  + V       +RA   S N   + +++++P+RLG +  N  Y+  I
Sbjct: 255 GITIYVAQDCTVYSSDV----IDKQRASMTSDNTDGKAVIILVPVRLGGERTNTDYLEFI 310

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
           K               ILS  Y              +G+IGGKP  + YF G+  + +I+
Sbjct: 311 K--------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIY 343

Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           +DPH  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 344 MDPHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|212545090|ref|XP_002152699.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           marneffei ATCC 18224]
 gi|210065668|gb|EEA19762.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           marneffei ATCC 18224]
          Length = 489

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 101/310 (32%), Positives = 148/310 (47%), Gaps = 69/310 (22%)

Query: 18  DITSRLWFTYRKGFVPIGDS------------------------GLTTDKGWGCMLRCGQ 53
           D  S++W TYR  F PI  S                        G T+D GWGCM+R GQ
Sbjct: 156 DFESKIWMTYRSNFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIRSGQ 215

Query: 54  MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
           M++A AL    LGRDW+   ++ EE   K+L +F D   AP+SIH+    GA   GK  G
Sbjct: 216 MLLANALAISRLGRDWRRVSHTTEEN--KLLSLFADDPAAPFSIHRFVRHGALYCGKHPG 273

Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
           EWFGP+  A  ++ L++    + +  +V+ D+T V     K    N+        +P ++
Sbjct: 274 EWFGPSATATCIQALSEEYKVAGMNVYVSSDSTYVYEDKFKAVAYNQPG----HMRPTLI 329

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           ++  RLGI  I PVY  G++           D++K+                PQSLG+ G
Sbjct: 330 LLGTRLGIDRITPVYRKGLE-----------DLLKL----------------PQSLGIAG 362

Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQ-----NIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           G+P+ + YFIG   +   +LDPH  +      +   Y +EQ     +DS  H  +  R+H
Sbjct: 363 GRPSSSHYFIGVQNSFFFYLDPHHTRPALPHKVDSAYTQEQ-----VDSC-HTRRLRRIH 416

Query: 288 ILHMDPSIAV 297
           I  MDPS+ V
Sbjct: 417 IDDMDPSMLV 426


>gi|348586836|ref|XP_003479174.1| PREDICTED: cysteine protease ATG4C-like [Cavia porcellus]
          Length = 435

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 100/343 (29%), Positives = 156/343 (45%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++++ R+D  SR+W TYR+ F PI  S L+TD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 53  NVDEFRKDFISRIWLTYREEFPPIEGSALSTDCGWGCTLRTGQMLLAQGLVLHFLGRAWI 112

Query: 71  W-------NVNS---------------------------------------------KEE 78
           W       N++S                                             ++E
Sbjct: 113 WPDALNIENLDSESWTSHTVKKFAASFEASLSGERQLGTPALSLKETMEKYPNPHEVRDE 172

Query: 79  AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
            Y  KI+  F D  +A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 173 VYHRKIISWFGDSPSALFGLHQLIECGRRSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 232

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+  + V    + + R + N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 233 GITVYVAQDCTVYNSDVIDKQSAS-RPAGNADDKAVIILVPVRLGGERTNTDYLEFVK-- 289

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        +LS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 290 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 324

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 325 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 362


>gi|327264155|ref|XP_003216881.1| PREDICTED: cysteine protease ATG4D-like [Anolis carolinensis]
          Length = 585

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 95/346 (27%), Positives = 155/346 (44%), Gaps = 90/346 (26%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +D+E  ++D  SR+W TYR+ F  +  +  TTD GWGCMLR GQM++AQ L+   LG+DW
Sbjct: 193 EDVEGFQKDFASRIWLTYRRDFQQLEGTMWTTDCGWGCMLRSGQMLLAQGLIVHFLGKDW 252

Query: 70  Q-------------------------------------------WNVNS----------- 75
                                                       W + +           
Sbjct: 253 TWPDALHTPGLVEMEPMKATHLPYPSTSSSHQGPSIPTDRSRGPWELRAPRHTRSPDELE 312

Query: 76  KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK-YDDWS 134
           KE  + KI+  F DR  A + IH++   G S GK  G+W+GP+  A ++RK      +  
Sbjct: 313 KERYHRKIISWFADRPQAHFGIHRLVSLGHSSGKKAGDWYGPSVAAHIIRKAVDCCSEAG 372

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNK-RASSNPQ--WQPLVLVIPLRLGIQDINPVYINGI 191
           ++V +V+ D T+    V  L   ++ R + +P   W+ +++++P+RLG +  NP Y++ +
Sbjct: 373 NLVVYVSQDCTVYKGDVANLANKSEDRTAWDPGAVWKAVIILVPMRLGGEAFNPAYVDCV 432

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
           K+   L                       EF     +G+IGGKP H+LYF+GY  + +++
Sbjct: 433 KELLKL-----------------------EFC----IGIIGGKPRHSLYFVGYQDDALLY 465

Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           LDPH  Q        +   E     ++HC    +     +DPS  +
Sbjct: 466 LDPHYCQPF-----VDTTKENFPLESFHCNSPRKTAFTKVDPSCTI 506


>gi|291398772|ref|XP_002715996.1| PREDICTED: APG4 autophagy 4 homolog C [Oryctolagus cuniculus]
          Length = 458

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 105/356 (29%), Positives = 157/356 (44%), Gaps = 90/356 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTICLKETIGKCSEDHETENE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 ICHRKIISWFGDSPLAAFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+  + V    + +   S N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITVYVAQDCTVYSSDVIDKQSAS-MTSDNTDDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|298231125|ref|NP_001177213.1| cysteine protease ATG4C [Sus scrofa]
 gi|296874486|gb|ADH81748.1| autophagy related 4-like protein C [Sus scrofa]
          Length = 458

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 105/357 (29%), Positives = 158/357 (44%), Gaps = 92/357 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  +  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQLEGSALTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSNTAKKFTASFEASLSGERELKTPTISLKETIGRYSDDREKQNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D     + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPLTLFGLHQLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K C +   A  N   + +++++P+RLG +  N  Y++ +K 
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCAS--MAPDNTDDKAVIILVPVRLGGERTNADYLDFVK- 311

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|242814606|ref|XP_002486401.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218714740|gb|EED14163.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 454

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 100/306 (32%), Positives = 148/306 (48%), Gaps = 60/306 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS------------------------GLTTDKGWGCMLRCGQ 53
           D   R+W TYR GF PI  S                        G T+D GWGCM+R GQ
Sbjct: 120 DFECRIWMTYRSGFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIRSGQ 179

Query: 54  MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
            ++A AL    LGRDW+   NS EE   ++L +F D   AP+SIH+    GA   GK  G
Sbjct: 180 SLLANALAISRLGRDWRRGSNSTEEN--RLLSLFADDPAAPFSIHKFVRHGALYCGKHPG 237

Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
           EWFGP+  A  ++ L+     + +  +V+ DNT V     K    N+    + + +P ++
Sbjct: 238 EWFGPSATATCIQALSDEYKDAGMNVYVSSDNTYVYEDKFKAVAYNQ----SDRMRPTLI 293

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           ++  RLGI  I PVY  G++           D++K+                PQ+LG+ G
Sbjct: 294 LLGTRLGIDRITPVYRKGLE-----------DLLKL----------------PQALGIAG 326

Query: 233 GKPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
           G+P+ + YFIG   +   +LDP HT   +         +++++DS  H  +  R+HI  M
Sbjct: 327 GRPSASHYFIGVQNSFFFYLDPHHTRPALPYKTGDLAYTQEEIDSC-HTRRLRRIHIDDM 385

Query: 292 DPSIAV 297
           DPS+ V
Sbjct: 386 DPSMLV 391


>gi|295657177|ref|XP_002789160.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226284504|gb|EEH40070.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 601

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 95/304 (31%), Positives = 147/304 (48%), Gaps = 55/304 (18%)

Query: 18  DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
           D  S++W TYR GF  +P                     +   G TTD GWGCM+R GQ 
Sbjct: 239 DFESKIWLTYRSGFPFIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQS 298

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A AL  L LGRDW+    + +E+ L  L +F D   AP+SIH+    GAS  GK  GE
Sbjct: 299 LLASALSILSLGRDWRRGTKTDQESNL--LSLFADDPKAPFSIHRFVEYGASACGKYPGE 356

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L+     + +  +V  D + V     +  T     ++     P +++
Sbjct: 357 WFGPSATARCIQALSSECKHAGLNVYVTSDGSDVYED--RFRTIASGGATEAGIHPTLIL 414

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           + +RLGI  + PVY   +K           D++K                +PQS+G+ GG
Sbjct: 415 LGIRLGIDRVTPVYWEALK-----------DVLK----------------YPQSVGIAGG 447

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YFIG  G+   +LDPH  +     +   Q   ++  ++YH  +  RLHI  MDP
Sbjct: 448 RPSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDP 507

Query: 294 SIAV 297
           S+ +
Sbjct: 508 SMLI 511


>gi|296415785|ref|XP_002837566.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295633439|emb|CAZ81757.1| unnamed protein product [Tuber melanosporum]
          Length = 409

 Score =  150 bits (380), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 93/303 (30%), Positives = 137/303 (45%), Gaps = 57/303 (18%)

Query: 18  DITSRLWFTYRKGFVPI---------------------GDSGLTTDKGWGCMLRCGQMVI 56
           D  S LW TYR  F PI                        G T+D GWGCM+R GQ VI
Sbjct: 89  DFESTLWMTYRSDFKPIPRVADYNDKLTFLTSIRSHLDKAEGFTSDSGWGCMIRSGQAVI 148

Query: 57  AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
           A AL  L LGR W+  +  +EE   ++L +F D   AP+SIH+    G  E GK  GEWF
Sbjct: 149 ANALAHLRLGRGWRRGMKPEEEK--RLLALFADDPRAPFSIHKFVRHGEVECGKNPGEWF 206

Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
           GP+  A  ++ L    + + +  +    N L     +K+   N        ++P +++  
Sbjct: 207 GPSAAAMCIQALTHAYEPAGLRVYQTNSNDLYEEDFRKVAVVNG------VFKPTLVLAG 260

Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
           +RLGI+ I  +Y   +  C  +P                           Q++G+ GG+P
Sbjct: 261 IRLGIERITNIYYEPLAACLRMP---------------------------QTVGIAGGRP 293

Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
           + + YFI   G +  +LDPHT + I    +  QD  ++   T H  +  RLHI  MDPS+
Sbjct: 294 SSSHYFIAVQGENFFYLDPHTCRPILPFKENPQDYTEEEVDTCHTRRIRRLHIREMDPSM 353

Query: 296 AVV 298
            + 
Sbjct: 354 LIA 356


>gi|47222154|emb|CAG11580.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 440

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 107/375 (28%), Positives = 162/375 (43%), Gaps = 108/375 (28%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E  RRD  SR+W TYR+ F P+  S LT+D GWGCMLR GQM++AQALL   +GRDW 
Sbjct: 74  NVEDFRRDFGSRIWLTYREEFPPLPGSTLTSDCGWGCMLRAGQMMLAQALLLHFMGRDWT 133

Query: 71  --------------WNVNSKE--------------------------------------- 77
                         W  ++ +                                       
Sbjct: 134 WSRTMSLQPLDTETWTTSAAKRLVASLESSLQGSPGPSDNRGPQNQAAGSAEEAGAHVEG 193

Query: 78  EAYLKIL-KMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY---DDW 133
           EA+ + L   F D  +A + +H++   G   GK  GEW+GP  VA +L+K  +       
Sbjct: 194 EAFHRTLVSWFGDSPSAQFGLHRMVHLGLEMGKQAGEWYGPAVVAHILKKAVEEALDPSL 253

Query: 134 SSIVFHVALDNTL------------------VVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
           + I  +V+ D T+                    + V  L   N+ AS+ P  + +++++P
Sbjct: 254 AGITAYVSQDCTVYSADVIDGHKASTSASPESSDDVTLLSPNNQAASALPDSRAVIILVP 313

Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
           +RLG +  NP Y N  K               ILS  Y              +G+IGGKP
Sbjct: 314 VRLGGEKTNPDYFNLAKS--------------ILSLDY-------------CIGIIGGKP 346

Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
             A YF+G+  + +I++DPH  Q+   V      S+  L S +HCP   ++    MDPS 
Sbjct: 347 KQACYFVGFQDDSLIYMDPHYCQSFVDV----STSDFPLQS-FHCPSPKKMPFTKMDPSC 401

Query: 296 AV-VSQRSYSDYKNV 309
            +    RS  D++ +
Sbjct: 402 TLGFYSRSAQDFEKI 416


>gi|30410844|ref|NP_116241.2| cysteine protease ATG4C [Homo sapiens]
 gi|30410846|ref|NP_835739.1| cysteine protease ATG4C [Homo sapiens]
 gi|114556947|ref|XP_001159883.1| PREDICTED: cysteine protease ATG4C isoform 4 [Pan troglodytes]
 gi|114556951|ref|XP_001159976.1| PREDICTED: cysteine protease ATG4C isoform 6 [Pan troglodytes]
 gi|61211867|sp|Q96DT6.1|ATG4C_HUMAN RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
           cysteine endopeptidase; AltName: Full=Autophagin-3;
           AltName: Full=Autophagy-related cysteine endopeptidase
           3; AltName: Full=Autophagy-related protein 4 homolog C
 gi|14625875|emb|CAC43939.1| putative autophagy-related cysteine endopeptidase [Homo sapiens]
 gi|21542522|gb|AAH33024.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [Homo sapiens]
 gi|27763973|emb|CAC85556.1| Apg4-C protein [Homo sapiens]
 gi|119626984|gb|EAX06579.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|119626985|gb|EAX06580.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|123983334|gb|ABM83408.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
           construct]
 gi|123998035|gb|ABM86619.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
           construct]
 gi|410220598|gb|JAA07518.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410220600|gb|JAA07519.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410267918|gb|JAA21925.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410291226|gb|JAA24213.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410291228|gb|JAA24214.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410335203|gb|JAA36548.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
 gi|410335205|gb|JAA36549.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
          Length = 458

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 105/356 (29%), Positives = 157/356 (44%), Gaps = 90/356 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 VYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+  + V    + +   S N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|432855098|ref|XP_004068071.1| PREDICTED: cysteine protease ATG4C-like [Oryzias latipes]
          Length = 482

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 111/384 (28%), Positives = 165/384 (42%), Gaps = 113/384 (29%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
            S  ++E  R+D TSR+W TYR+ F P+  S LTTD GWGC+LR GQM++AQAL+   LG
Sbjct: 70  FSMGNVEAFRKDFTSRVWLTYREEFPPLPGSTLTTDCGWGCLLRAGQMMLAQALVLHFLG 129

Query: 67  RDWQWN---------------------VNSKE---------------------------- 77
           RDW W+                     V S E                            
Sbjct: 130 RDWTWSEALTLQPLDTETWTASAAKRLVASLEASLQGSPKNSDRQHSEPQSSSQGSAEEA 189

Query: 78  EAYLK------ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---A 128
           EA+LK      I+  F D  +A   +H++   G + GK  G W+GP  VA +L+K    A
Sbjct: 190 EAHLKEMYHRTIISWFGDTSSALLGLHRLVRLGLTMGKNAGNWYGPAVVAHILKKAVEEA 249

Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKL--CTTNKRASSN--------------------PQ 166
                + I  +V+ D T+    V       + ++AS +                    P 
Sbjct: 250 MDSGLAGITAYVSQDCTVYSADVADCHKPPSARQASVSPPIAGGGPSKEDQPGSASILPD 309

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
            Q ++++IP+RLG + INP Y   +K               ILS  Y             
Sbjct: 310 SQAVIILIPVRLGGEKINPEYFEFVK--------------NILSVEY------------- 342

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
            +G+IGGKP  A YF+G+  + +I++DPH  Q+   V + +   +     ++HCP   ++
Sbjct: 343 CIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSNGDFPLQ-----SFHCPSPKKI 397

Query: 287 HILHMDPSIAV-VSQRSYSDYKNV 309
               MDPS  +    RS  DY  +
Sbjct: 398 PFTRMDPSCTIGFYSRSLQDYDRI 421


>gi|194374239|dbj|BAG57015.1| unnamed protein product [Homo sapiens]
          Length = 259

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 92/252 (36%), Positives = 130/252 (51%), Gaps = 73/252 (28%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
           MLRCGQM++AQAL+  HLGRDW W    ++ + Y +IL+ F DR+   YSIHQ+A  G  
Sbjct: 1   MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60

Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ 166
           EGK++GEWFGPNTVAQVL+KLA +D+W+S+  +V++DNT+V                   
Sbjct: 61  EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVV------------------- 101

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
                        I+DI        K C  LP+S                          
Sbjct: 102 -------------IEDIK-------KMCRVLPLSA------------------------- 116

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SR 285
                G +P  +L      G+++IFLDPHT Q      D E++     D T+HC Q+  R
Sbjct: 117 --DTAGDRPPDSLT-ASNQGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQR 169

Query: 286 LHILHMDPSIAV 297
           ++IL++DPS+A+
Sbjct: 170 MNILNLDPSVAL 181


>gi|14042698|dbj|BAB55356.1| unnamed protein product [Homo sapiens]
          Length = 446

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 105/356 (29%), Positives = 157/356 (44%), Gaps = 90/356 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 VYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+  + V    + +   S N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|255945233|ref|XP_002563384.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|166990617|sp|A7KAL5.1|ATG4_PENCW RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|129561973|gb|ABO31075.1| Atg4p [Penicillium chrysogenum]
 gi|211588119|emb|CAP86190.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 401

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 93/304 (30%), Positives = 144/304 (47%), Gaps = 58/304 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR  F PI  +                       G T+D GWGCM+R GQ 
Sbjct: 74  DFGSRIWITYRSNFTPIPRTKTPEATSSMTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQS 133

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A     L LGRDW+     +EE+  K++ MF D   AP+SIH+    GA S GK  GE
Sbjct: 134 LLANTFSVLLLGRDWRRGEKVEEES--KLISMFADHPEAPFSIHRFVNRGAESCGKYPGE 191

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L+   +   +  ++  D + V          ++      + QP +++
Sbjct: 192 WFGPSATAKCIQLLSTQSEVPQLRVYLTNDTSDVYEDKFAHVAHDESG----RIQPTLIL 247

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           I  RLGI ++ P Y +G+                           R   T+PQS+G+ GG
Sbjct: 248 IGTRLGIDNVTPAYWDGL---------------------------RAALTYPQSVGIAGG 280

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YF+G     + FLDPHT +           ++++LDS Y+  +  R+HI  MDP
Sbjct: 281 RPSASHYFVGAQDCHLFFLDPHTTRPATLYRPDGLYTQEELDS-YYTSRLRRIHIKDMDP 339

Query: 294 SIAV 297
           S+ +
Sbjct: 340 SMLI 343


>gi|397475554|ref|XP_003809200.1| PREDICTED: cysteine protease ATG4C isoform 1 [Pan paniscus]
 gi|397475556|ref|XP_003809201.1| PREDICTED: cysteine protease ATG4C isoform 2 [Pan paniscus]
          Length = 458

 Score =  150 bits (379), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 105/356 (29%), Positives = 157/356 (44%), Gaps = 90/356 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 VYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+  + V    + +   S N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|442757637|gb|JAA70977.1| Putative cysteine protease required for autophagy [Ixodes ricinus]
          Length = 458

 Score =  150 bits (378), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 107/358 (29%), Positives = 157/358 (43%), Gaps = 94/358 (26%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  WNV-----NSKEEA---------------------------------------------- 79
           W       NS  E+                                              
Sbjct: 135 WPYALSIENSDSESRTSHTVKKFTASFEAPLSGARELKSPTVSLKETIGRYPDDHEMRNE 194

Query: 80  --YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
             + KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQV--KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
            I  +VA D T+  + V  K+  +    AS N   + +++++P+RLG +  N  Y+  IK
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQRASM---ASDNTDDKAVIILVPVRLGGERTNTDYLEFIK 311

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
                          ILS  Y              +G+IGGKP  + YF G+  + +I++
Sbjct: 312 --------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYM 344

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           DPH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 345 DPHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNIQDFKRA 397


>gi|403257906|ref|XP_003921531.1| PREDICTED: cysteine protease ATG4C [Saimiri boliviensis
           boliviensis]
          Length = 458

 Score =  150 bits (378), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 152/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 MYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+  + V    + +   S N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|116283594|gb|AAH18678.1| ATG4C protein [Homo sapiens]
          Length = 451

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 105/355 (29%), Positives = 156/355 (43%), Gaps = 90/355 (25%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           +E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W W
Sbjct: 76  VEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTW 135

Query: 72  ----NV-NSKEEAYL--------------------------------------------- 81
               N+ NS  E++                                              
Sbjct: 136 PDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEV 195

Query: 82  ---KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWSS 135
              KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D   
Sbjct: 196 YHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG 255

Query: 136 IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCY 195
           I  +VA D T+  + V    + +   S N   + +++++P+RLG +  N  Y+  +K   
Sbjct: 256 ITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGEERTNTDYLEFVK--- 311

Query: 196 ALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                       ILS  Y              +G+IGGKP  + YF G+  + +I++DPH
Sbjct: 312 -----------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDPH 347

Query: 256 TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
             Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 348 YCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397


>gi|297664749|ref|XP_002810790.1| PREDICTED: cysteine protease ATG4C [Pongo abelii]
          Length = 458

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 152/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 VYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+  + V    + +   S N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|261195783|ref|XP_002624295.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
 gi|239587428|gb|EEQ70071.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
          Length = 494

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 94/305 (30%), Positives = 142/305 (46%), Gaps = 54/305 (17%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S++W TYR  F  I  S                       G TTD GWGCM+R GQ 
Sbjct: 122 DFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMIRSGQS 181

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A AL  L LGR+W+     KEE+ L  L +F D   AP+SIH+    GAS  GK  GE
Sbjct: 182 LLANALAILFLGREWRRGTKVKEESNL--LSLFADDPRAPFSIHRFVEHGASACGKYPGE 239

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L+     + +  +V  D + V     +   +     ++   +P +++
Sbjct: 240 WFGPSATARCIQALSSECKHAGLNVYVTSDGSDVYEDRFRAIASGGGTGTSTDIRPTLIL 299

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           + +RLGI  + PVY   +K                               +PQ++G+ GG
Sbjct: 300 LGIRLGIDRVTPVYWEALKAV---------------------------LKYPQAVGIAGG 332

Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YFIG  G+   +LDP HT   +      +Q    +  +TYH  +  RLHI  MD
Sbjct: 333 RPSSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHTRRLRRLHIKDMD 392

Query: 293 PSIAV 297
           PS+ +
Sbjct: 393 PSMLI 397


>gi|410918329|ref|XP_003972638.1| PREDICTED: cysteine protease ATG4D-like [Takifugu rubripes]
          Length = 499

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 105/369 (28%), Positives = 159/369 (43%), Gaps = 99/369 (26%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S  ++E+ R    SR+W TYRK F  +  S  TTD GWGCMLR GQM++AQ LL   + R
Sbjct: 99  SEDEVERFRLAFVSRIWLTYRKEFPQLEGSTWTTDCGWGCMLRSGQMLLAQGLLVHLMPR 158

Query: 68  DWQW----------------------------------NVNSKEEAYL----------KI 83
            W W                                    ++ E   L          K 
Sbjct: 159 GWTWPDAQPLTDVDLEVFRPRSPARAGGVPIPSFASPRGPSTPERPLLSEQATKCSRKKR 218

Query: 84  LKMFEDRRTAP----------------YSIHQIALTGASEGKAVGEWFGPNTVAQVLRK- 126
           L+  +DR+  P                + IHQ+   G S GK  G+W+GP  VA +LRK 
Sbjct: 219 LESVQDRQAEPTHQKLVFWFGDQPTAPFGIHQLVEIGKSAGKKAGDWYGPAIVAHILRKA 278

Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLC-TTNKRASSNP----QWQPLVLVIPLRLGIQ 181
           +A+     S+V +VA D T+    V  LC  T  +  S+P     W+ +++++P+RLG +
Sbjct: 279 VARASAVHSLVVYVAQDCTVYKEDVMHLCDPTPSQTPSDPLSHQAWKSVIILVPVRLGGE 338

Query: 182 DINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYF 241
            +NP YI  +K    L                              +G+IGGKP H+LYF
Sbjct: 339 CLNPSYIECVKNILKLDC---------------------------CIGIIGGKPKHSLYF 371

Query: 242 IGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQ 300
           +G+    +++LDPH  Q +  V       E     ++HC    ++    MDPS  +    
Sbjct: 372 VGFQDEQLLYLDPHYCQPVVDVSQVNSSLE-----SFHCNAPKKMPFNRMDPSCTIGFYA 426

Query: 301 RSYSDYKNV 309
           +S  D++++
Sbjct: 427 KSKKDFESL 435


>gi|239614382|gb|EEQ91369.1| cysteine protease atg4 [Ajellomyces dermatitidis ER-3]
 gi|327351393|gb|EGE80250.1| cysteine protease atg4 [Ajellomyces dermatitidis ATCC 18188]
          Length = 494

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 95/306 (31%), Positives = 146/306 (47%), Gaps = 56/306 (18%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S++W TYR  F  I  S                       G TTD GWGCM+R GQ 
Sbjct: 122 DFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMIRSGQS 181

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A AL  L LGR+W+     KEE+ L  L +F D   AP+SIH+    GAS  GK  GE
Sbjct: 182 LLANALAILFLGREWRRGTKVKEESNL--LSLFADDPRAPFSIHRFVEHGASACGKYPGE 239

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L+     + +  +V  D + V     +   +     ++   +P +++
Sbjct: 240 WFGPSATARCIQALSSECKHAGLNVYVTSDGSDVYEDRFRAIASGGGTGTSTDIRPTLIL 299

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           + +RLGI  + PVY   +K                               +PQ++G+ GG
Sbjct: 300 LGIRLGIDRVTPVYWEALKAV---------------------------LKYPQAVGIAGG 332

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGC--VYDKEQDSEKKLDSTYHCPQASRLHILHM 291
           +P+ + YFIG  G+   +LDPH  +      V   +Q ++++L+ TYH  +  RLHI  M
Sbjct: 333 RPSSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELN-TYHTRRLRRLHIKDM 391

Query: 292 DPSIAV 297
           DPS+ +
Sbjct: 392 DPSMLI 397


>gi|121704590|ref|XP_001270558.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
 gi|166990611|sp|A1CJ08.1|ATG4_ASPCL RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|119398704|gb|EAW09132.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
          Length = 400

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 97/328 (29%), Positives = 151/328 (46%), Gaps = 61/328 (18%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPI-----------------------GDSGLTTDKG 44
            HQ  E+   D+ SR+W TYR  F PI                          G T+D G
Sbjct: 69  EHQWPEEFLDDVESRIWITYRSNFTPIPKPPNQEANPAMTLTVHLRSQLMDSQGFTSDTG 128

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
           WGCM+R GQ ++A A+L L LGRDW+    + +EA  ++L  F D   AP+SIH+    G
Sbjct: 129 WGCMIRSGQSLLANAMLILLLGRDWRRGTEAGKEA--QLLHQFADHPEAPFSIHRFVQHG 186

Query: 105 ASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASS 163
           A    K  GEWFGP+  A+ ++ L      S +  ++  D+T  + + K         + 
Sbjct: 187 AEFCNKYPGEWFGPSATARCIQALVAQQGSSELRVYIT-DDTADIYEDKFARIAQ---AE 242

Query: 164 NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
           +  + P ++++  RLGI  + P Y + +K+   LP                         
Sbjct: 243 HGDFIPTLILVGTRLGIDHVTPAYWDALKEALQLP------------------------- 277

Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA 283
             QS+G+ GG+P+ + YFIG  G  + +LDPH  +      D       +  +TYH  + 
Sbjct: 278 --QSVGIAGGRPSASHYFIGVHGQYLFYLDPHHTRPASLHQDVNDTLTHEEVNTYHTRRL 335

Query: 284 SRLHILHMDPSI----AVVSQRSYSDYK 307
            R+HI  MDPS+     + S+  ++D+K
Sbjct: 336 RRIHIKDMDPSMLIGFIIRSREDWTDWK 363


>gi|149507363|ref|XP_001514370.1| PREDICTED: cysteine protease ATG4C [Ornithorhynchus anatinus]
          Length = 459

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 101/346 (29%), Positives = 153/346 (44%), Gaps = 95/346 (27%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++++ R+D  SR+W TYR+ F P+G SGLTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVDEFRKDFVSRIWLTYREEFPPMGASGLTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWT 134

Query: 71  WNV-----NSKEEAYL-------------------------------------------- 81
           W       NS  E++                                             
Sbjct: 135 WPAALDMENSDSESWTSHTVKKLTASFEASWVGERDPRPPSASRNAPRGSGSVRDEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D     + +HQ+   G   GK  G+W+GP  VA +LRK     ++ D  
Sbjct: 195 GFHRKIISWFGDSPRTYFGLHQLTEYGKKSGKTAGDWYGPAVVAHILRKAVEEVRHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNP---QWQPLVLVIPLRLGIQDINPVYINGI 191
            +  +VA D T+  + V    T   RAS++      + +++++P+RLG +  N  Y+  +
Sbjct: 255 GLTVYVAQDCTVYNSDV----TDKLRASTDSGKTDDKAVLILVPVRLGGERTNIDYLEFV 310

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
           K               ILS  Y              +G+IGGKP  + YF G+  + +I+
Sbjct: 311 K--------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIY 343

Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           +DPH  Q+   V  K+   E     ++HCP   ++    MDPS  V
Sbjct: 344 MDPHYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFRKMDPSCTV 384


>gi|449551395|gb|EMD42359.1| ATG4-like protein [Ceriporiopsis subvermispora B]
          Length = 988

 Score =  149 bits (376), Expect = 1e-33,   Method: Composition-based stats.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 70/281 (24%)

Query: 18  DITSRLWFTYRKGFVPI--------------------------------GDSGLTTDKGW 45
           D TSR+W TYR  F PI                                G+ G T+D GW
Sbjct: 308 DFTSRIWVTYRSQFQPIRDTTLSALELELGESTAVATSPQPKKWNWPLGGEKGWTSDAGW 367

Query: 46  GCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSIHQ 99
           GCMLR GQ ++A  LL LHLGRDW+   + + + + A Y++IL  F D  +   P+S+H+
Sbjct: 368 GCMLRTGQSLLANTLLHLHLGRDWRRPPYPICTADYATYVQILTWFFDNPSPLCPFSVHR 427

Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN- 158
           +AL G   GK VG+WFGP+T A  ++ L      + +   VA D+ +  + V     +N 
Sbjct: 428 MALVGKELGKEVGQWFGPSTAAGAIKTLVHAFPEAGLGVSVATDSVIYQSDVYTASRSNL 487

Query: 159 --KRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
              R +    W  + +++++ +RLG+  +NP+Y               YD +K L     
Sbjct: 488 GSPRRNGRSGWGDRAVLVLVGIRLGLDGVNPIY---------------YDTIKAL----- 527

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                  +TFPQS+G+ GG+P+ + YF+G   +++ +LDPH
Sbjct: 528 -------YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 561


>gi|402854773|ref|XP_003892029.1| PREDICTED: cysteine protease ATG4C isoform 1 [Papio anubis]
 gi|402854775|ref|XP_003892030.1| PREDICTED: cysteine protease ATG4C isoform 2 [Papio anubis]
          Length = 458

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 151/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPKISLKETIGKYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+    V    + +   S N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNCDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------SILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMAFRKMDPSCTI 384


>gi|345564445|gb|EGX47408.1| hypothetical protein AOL_s00083g501 [Arthrobotrys oligospora ATCC
           24927]
          Length = 444

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 90/298 (30%), Positives = 138/298 (46%), Gaps = 55/298 (18%)

Query: 18  DITSRLWFTYRKGFVPI-------------------GDSGLTTDKGWGCMLRCGQMVIAQ 58
           D  ++ W TYR  F PI                      G T+D GWGCM+R GQ V+A 
Sbjct: 114 DFDAKFWMTYRSAFPPIPLSTTSRNMTLATRIRSLADQEGFTSDTGWGCMIRSGQCVLAN 173

Query: 59  ALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASEGKAVGEWFGP 117
           A+  L LGRDW+   + +EE +  IL +F D   AP+S+H     G AS G   GEWFGP
Sbjct: 174 AISLLKLGRDWRRGKSPQEEQH--ILSLFADDPRAPFSLHNFVKYGEASCGVYPGEWFGP 231

Query: 118 NTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLR 177
           +  A+ ++ LA   D    V+       +  +  +K+  ++        + P ++++ +R
Sbjct: 232 SATARCIQALAAQHDEGLQVYITGDGGDVYEDAFRKIAISDDGV-----FHPTLVLVGIR 286

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LGI+ + PVY   +K    +                           PQS+G+ GG+P+ 
Sbjct: 287 LGIERVTPVYWEALKSSLMM---------------------------PQSVGIAGGRPSA 319

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
           + YFIG  G  + +LDPH  + +   Y K+ D   +     H  +  RLH+  MDPS+
Sbjct: 320 SHYFIGVQGQSLFYLDPHNTRPL-LPYRKDSDYTAEEIEFCHTRKLRRLHLREMDPSM 376


>gi|238506146|ref|XP_002384275.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
           NRRL3357]
 gi|220690389|gb|EED46739.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
           NRRL3357]
          Length = 439

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 59/310 (19%)

Query: 13  EQIRRDITSRLWFTYRKGFVPI-----------------------GDSGLTTDKGWGCML 49
           E    D  S++W TYR  F PI                          G T+D GWGCM+
Sbjct: 110 EAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWGCMI 169

Query: 50  RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEG 108
           R GQ ++A A+L L LGRDW+    ++EEA  ++L +F D   AP SIH+    GA S G
Sbjct: 170 RSGQSLLANAMLTLCLGRDWRRGDKAEEEA--RLLSLFADHPDAPLSIHRFVKYGAESCG 227

Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
           K  GEWFGP+  A+ +  L+      +I   V + N    + V +        S +   Q
Sbjct: 228 KHPGEWFGPSATARCIEALSA--QCGNIAPRVYVTND--TSDVYEDSFLRVARSGSGSIQ 283

Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
           P ++++  RLGI ++ PVY +G+K    L                           PQS+
Sbjct: 284 PTLILLGTRLGIDNVTPVYWDGLKAVLQL---------------------------PQSV 316

Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           G+ GG+P+ + YFIG  G    +LDPHT +  +    D    S+ ++ STYH  +  R+H
Sbjct: 317 GIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEI-STYHTRRLRRIH 375

Query: 288 ILHMDPSIAV 297
           I  MDPS+ +
Sbjct: 376 IQDMDPSMLI 385


>gi|37748391|gb|AAH58981.1| Autophagy-related 4C (yeast) [Mus musculus]
          Length = 458

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 100/343 (29%), Positives = 153/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SRLW TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRLWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W-------NVNS---------------------------------------------KEE 78
           W       N +S                                             + E
Sbjct: 135 WPDALHIENADSDSWTSNTVKKFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNE 194

Query: 79  AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
           AY  KI+  F D   A + +H++   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 AYHRKIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            +  +VA D T+  + V     T+   + + + + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        +LS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|327270876|ref|XP_003220214.1| PREDICTED: cysteine protease ATG4C-like [Anolis carolinensis]
          Length = 459

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 100/345 (28%), Positives = 154/345 (44%), Gaps = 91/345 (26%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGRDW
Sbjct: 74  RNVEEFRKDFISRIWLTYREEFPQIKGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDW 133

Query: 70  QW-----NVNSKEEAYL------------------------------------------- 81
            W     N N + E++                                            
Sbjct: 134 TWPDALVNENPESESWTSHTVKKLTASFEASLIGEKEFKNQSIPPRQIRKRDWGKRESRD 193

Query: 82  -----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDW 133
                KI+  F D   A + +H++   G   GK  G+W+GP  VA +LRK    AK  + 
Sbjct: 194 EHYHRKIVSWFGDSPLANFGLHRLIEYGNKSGKMAGDWYGPAVVAHLLRKAVEEAKDPEL 253

Query: 134 SSIVFHVALDNTLVVNQVKKL-CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
             I  +VA D T+  + V ++ C+   + S  P  + ++++IP+RLG +  N  Y+  +K
Sbjct: 254 QGITVYVAQDCTVYKSDVVEMQCSL--KDSEKPGAKSVIILIPVRLGGERTNMEYLEFVK 311

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
                          ILS  Y              +G++GG+P  + YF G+  + +I++
Sbjct: 312 --------------GILSLEY-------------CIGIVGGRPKQSYYFAGFQDDSLIYM 344

Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           DPH  Q+   V  K    E     ++HCP   ++    MDPS  +
Sbjct: 345 DPHYCQSFVDVSIKNFPLE-----SFHCPSPKKMSFKKMDPSCTI 384


>gi|406862068|gb|EKD15120.1| putative cysteine protease atg4 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 441

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 96/307 (31%), Positives = 144/307 (46%), Gaps = 60/307 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S++W TYR  F  I  S                       G T+D GWGCM+R GQ 
Sbjct: 106 DFESKIWLTYRSQFPAIPKSQDPKALSSMSLSVRLRSQLVDQAGFTSDTGWGCMIRSGQS 165

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A AL+ L +GRDW+   ++ +E    I+ +F D  TAPYSIH     GA+  GK  GE
Sbjct: 166 LLANALVMLRMGRDWRRGSSASQEER-SIISLFADTPTAPYSIHNFVEHGAAACGKHPGE 224

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVL 172
           WFGP+  A+ ++ LA       +  +V  D   V  +   K+   + +A     + P ++
Sbjct: 225 WFGPSATARCIQALANGHQSPELRVYVTGDGLEVYEDSFMKIAKPDGQA-----FIPTLI 279

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           ++  RLG+  I PVY   +K                                PQSLG+ G
Sbjct: 280 LVGTRLGLDKITPVYWEALKSS---------------------------LQIPQSLGIAG 312

Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHILHM 291
           G+P+ + YFIG  G+   +LDPH  +    + D  +D S++ +DS  H  +  R+HI  M
Sbjct: 313 GQPSSSHYFIGVQGHHFFYLDPHQTRPALPLPDNIEDYSQEDIDSC-HTRRLRRIHIKEM 371

Query: 292 DPSIAVV 298
           DPS+ + 
Sbjct: 372 DPSMLIA 378


>gi|383872484|ref|NP_001244816.1| cysteine protease ATG4C [Macaca mulatta]
 gi|355745338|gb|EHH49963.1| hypothetical protein EGM_00712 [Macaca fascicularis]
 gi|380788509|gb|AFE66130.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
 gi|383413101|gb|AFH29764.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
          Length = 458

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 151/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+    V    + +   S N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNCDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|147742949|sp|A2XHJ5.1|ATG4A_ORYSI RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|125544166|gb|EAY90305.1| hypothetical protein OsI_11880 [Oryza sativa Indica Group]
          Length = 473

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 98/301 (32%), Positives = 146/301 (48%), Gaps = 52/301 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
            D +SR+W TYRKGF  I DS  T+D  WGCM+R  QM++AQAL+F HLGR W+  +   
Sbjct: 130 EDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKP 189

Query: 76  KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---- 131
               Y+ IL MF D     +SIH +   G S G A G W GP  + +  + L + +    
Sbjct: 190 YSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVRTNREHH 249

Query: 132 -------DWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                  ++   ++ V+ D          + ++   +LC    +  S   W P++L++PL
Sbjct: 250 EAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 307

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  +NP YI  +K+                            FTFPQSLG++GGKP 
Sbjct: 308 VLGLDKLNPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 340

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y  G   + V++LDPH  Q      D   D+ +   S+YHC     L +  +DPS+A
Sbjct: 341 TSTYVAGVQDDRVLYLDPHEVQ---LAVDIAADNLEADTSSYHCSTVRDLALDLIDPSLA 397

Query: 297 V 297
           +
Sbjct: 398 I 398


>gi|317151014|ref|XP_001824388.2| cysteine protease atg4 [Aspergillus oryzae RIB40]
          Length = 402

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 59/310 (19%)

Query: 13  EQIRRDITSRLWFTYRKGFVPI-----------------------GDSGLTTDKGWGCML 49
           E    D  S++W TYR  F PI                          G T+D GWGCM+
Sbjct: 74  EAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWGCMI 133

Query: 50  RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEG 108
           R GQ ++A A+L L LGRDW+    ++EEA  ++L +F D   AP SIH+    GA S G
Sbjct: 134 RSGQSLLANAMLTLCLGRDWRRGDKAEEEA--RLLSLFADHPDAPLSIHRFVKYGAESCG 191

Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
           K  GEWFGP+  A+ +  L+      +I   V + N    + V +        S +   Q
Sbjct: 192 KHPGEWFGPSATARCIEALSA--QCGNIAPRVYVTND--TSDVYEDSFLRVARSGSGSIQ 247

Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
           P ++++  RLGI ++ PVY +G+K    L                           PQS+
Sbjct: 248 PTLILLGTRLGIDNVTPVYWDGLKAVLQL---------------------------PQSV 280

Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           G+ GG+P+ + YFIG  G    +LDPHT +  +    D    S+ ++ STYH  +  R+H
Sbjct: 281 GIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEI-STYHTRRLRRIH 339

Query: 288 ILHMDPSIAV 297
           I  MDPS+ +
Sbjct: 340 IQDMDPSMLI 349


>gi|166990665|sp|Q2U5B0.2|ATG4_ASPOR RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
          Length = 407

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 59/310 (19%)

Query: 13  EQIRRDITSRLWFTYRKGFVPI-----------------------GDSGLTTDKGWGCML 49
           E    D  S++W TYR  F PI                          G T+D GWGCM+
Sbjct: 79  EAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWGCMI 138

Query: 50  RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEG 108
           R GQ ++A A+L L LGRDW+    ++EEA  ++L +F D   AP SIH+    GA S G
Sbjct: 139 RSGQSLLANAMLTLCLGRDWRRGDKAEEEA--RLLSLFADHPDAPLSIHRFVKYGAESCG 196

Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
           K  GEWFGP+  A+ +  L+      +I   V + N    + V +        S +   Q
Sbjct: 197 KHPGEWFGPSATARCIEALSA--QCGNIAPRVYVTND--TSDVYEDSFLRVARSGSGSIQ 252

Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
           P ++++  RLGI ++ PVY +G+K    L                           PQS+
Sbjct: 253 PTLILLGTRLGIDNVTPVYWDGLKAVLQL---------------------------PQSV 285

Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           G+ GG+P+ + YFIG  G    +LDPHT +  +    D    S+ ++ STYH  +  R+H
Sbjct: 286 GIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEI-STYHTRRLRRIH 344

Query: 288 ILHMDPSIAV 297
           I  MDPS+ +
Sbjct: 345 IQDMDPSMLI 354


>gi|354470829|ref|XP_003497647.1| PREDICTED: cysteine protease ATG4C [Cricetulus griseus]
          Length = 458

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 101/343 (29%), Positives = 152/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW- 69
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 70  -------------QWNVNSKE------EAYL----------------------------- 81
                         W  N+ +      EA L                             
Sbjct: 135 WPDALHIENSDSDSWTSNTVKKFTASFEASLSGERELRTTALSLKETIGKYSDDHAVQNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +H++   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+  + V     TN   S + + + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNSDVIDK-QTNSSTSGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        +LS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|222629790|gb|EEE61922.1| hypothetical protein OsJ_16662 [Oryza sativa Japonica Group]
          Length = 892

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 144/301 (47%), Gaps = 52/301 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
            D +SR+W TYR+GF  I DS  T+D  WGCM+R  QM++AQAL+F HLGR W+  +   
Sbjct: 134 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPLEKP 193

Query: 77  -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
               Y+ IL MF D     +SIH +   G S G A G W GP  + +  + L +      
Sbjct: 194 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 253

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                 + +   ++ V+ D          + ++   +LC    +  S   W P++L++PL
Sbjct: 254 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 311

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  INP YI  +K+                            FTFPQSLG++GGKP 
Sbjct: 312 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 344

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y  G   +  ++LDPH  Q      D   D+ +   S+YHC     L +  +DPS+A
Sbjct: 345 TSTYIAGVQDDRALYLDPHEVQ---MAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLA 401

Query: 297 V 297
           +
Sbjct: 402 I 402


>gi|393247625|gb|EJD55132.1| hypothetical protein AURDEDRAFT_78065 [Auricularia delicata
           TFB-10046 SS5]
          Length = 989

 Score =  148 bits (374), Expect = 3e-33,   Method: Composition-based stats.
 Identities = 96/286 (33%), Positives = 138/286 (48%), Gaps = 75/286 (26%)

Query: 18  DITSRLWFTYRKGFVPIGDSGL-----------------------------TTDKGWGCM 48
           D TSR+W TYR  F PI D  L                             T+D GWGCM
Sbjct: 317 DFTSRVWLTYRSQFSPIHDCPLSACKGKDLESLDANPPKRTFWPGSGEKTWTSDAGWGCM 376

Query: 49  LRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSIHQIAL 102
           LR GQ ++A  L+ LHLGRDW+    N  S E A Y+KIL  F D  +  AP+S+H++A+
Sbjct: 377 LRTGQSLLANTLIHLHLGRDWRRPAINSASPEFATYVKILTWFFDAPSVHAPFSVHRMAM 436

Query: 103 TGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKL-------C 155
           +G   GK VG+WFGP+T A  +R L      + +   +A+D  L    +           
Sbjct: 437 SGKDFGKDVGQWFGPSTAAGAIRTLVHDFPRAQLGVAIAVDGVLYETDIYSASHYPMSSA 496

Query: 156 TTNKRASS---NP-QW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKIL 209
              +RAS    +P +W  + +++++  RLG+  +NP+Y   +K                 
Sbjct: 497 DGARRASGFKRHPGRWGNRAVLVLVATRLGLDGVNPIYYENLKTI--------------- 541

Query: 210 SSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                       FTFPQSLG+ GG+P+ + YF+G  GN + +LDPH
Sbjct: 542 ------------FTFPQSLGIAGGRPSSSYYFVGSQGNSLFYLDPH 575


>gi|359495820|ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
 gi|296086874|emb|CBI33041.3| unnamed protein product [Vitis vinifera]
          Length = 486

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 99/308 (32%), Positives = 145/308 (47%), Gaps = 50/308 (16%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S   L +  +D +SR+  TYRKGF  IGDS LT+D  WGCMLR  QM++AQALL   +GR
Sbjct: 129 SSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGR 188

Query: 68  DWQWNVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
            W+   +   ++ Y++IL  F D + + +SIH I   G + G A G W GP  + +    
Sbjct: 189 SWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWET 248

Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINP- 185
           LA+                            +KR  ++ + Q L + I +  G +D    
Sbjct: 249 LAR----------------------------SKREETDLECQSLPMAIYIVSGDEDGERG 280

Query: 186 ----VYIN-GIKKCYALPISPVYDMVKILSSTYNMQ-----TPRY------EFTFPQSLG 229
               VYI    + C       V D   IL     +       PRY       FTFPQSLG
Sbjct: 281 GAPVVYIEEASRHCLEFSKGQV-DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLG 339

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
           ++GGKP  + Y +G       +LDPH  Q+   V D  +++ +   S+YHC     + + 
Sbjct: 340 ILGGKPGASTYIVGVQDEKAFYLDPHEAQS---VVDIRRENLEADTSSYHCNIIRHICLD 396

Query: 290 HMDPSIAV 297
            +DPS+A+
Sbjct: 397 SIDPSLAI 404


>gi|448112117|ref|XP_004202013.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
 gi|359465002|emb|CCE88707.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
          Length = 480

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 97/342 (28%), Positives = 155/342 (45%), Gaps = 90/342 (26%)

Query: 13  EQIRRDITSRLWFTYRKGFVPI----------------------------------GDSG 38
           E++ +DI SR+WFTYR GF PI                                   +  
Sbjct: 79  EEMDKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALDNIHGLFNNQN 138

Query: 39  LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
            TTD GWGCM+R  QM++A A+  L LGR + +  +S E+ +  I+ MF D   AP+S+H
Sbjct: 139 FTTDVGWGCMIRTSQMLLANAIQLLLLGRGFTY-ADSSEKKHSDIIDMFTDDPKAPFSLH 197

Query: 99  QIALTGASEGKAV--GEWFGPNTVAQVLRKLAK--YDDWSSIVFHVALDNTLVV--NQVK 152
                 +     V  GEWFGPN  +  +++L K  +D+ SS  F V +  +  +  +++ 
Sbjct: 198 NFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQFDESSSPRFRVIISESCDIYDDKIG 257

Query: 153 KLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
           KL   N+ A        +++++P+RLG+  ++P Y N +                     
Sbjct: 258 KLLQENEDAEG-----AILILLPVRLGLNKVSPYYHNSLSSL------------------ 294

Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI--GCVYDKEQDS 270
                    F+ PQ +G+ GGKP+ + YF G    ++++LDPH  Q++    +YD     
Sbjct: 295 ---------FSSPQLVGIAGGKPSSSYYFFGSHNGNLLYLDPHYPQSVKASSIYD----- 340

Query: 271 EKKLDSTYHCPQASRLHILHMDPS----IAVVSQRSYSDYKN 308
                 T+H      L I  MDPS    I + S+  Y  +K+
Sbjct: 341 ------TFHTHNVQSLKIEDMDPSMLIGILIKSKEDYESFKD 376


>gi|358390472|gb|EHK39877.1| hypothetical protein TRIATDRAFT_208244 [Trichoderma atroviride IMI
           206040]
          Length = 452

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 92/306 (30%), Positives = 145/306 (47%), Gaps = 60/306 (19%)

Query: 17  RDITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQ 53
            D++S+ W TYR GF PI  S                       G ++D GWGCM+R GQ
Sbjct: 122 EDMSSKFWMTYRSGFEPIPKSVDPKAASALSFSMRIKSTLSDSAGFSSDSGWGCMIRSGQ 181

Query: 54  MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
            ++A  +  L LGRDW+ + + +EE +L  + MF D   APYSIH     GA+  GK  G
Sbjct: 182 SLLATTIGILRLGRDWRRDQSQEEERHL--ISMFADDPRAPYSIHNFVRHGATACGKYPG 239

Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
           EWFGP+  AQ ++ L      S  ++       +  +   K+  ++ +      + P ++
Sbjct: 240 EWFGPSATAQCIQALTSSSGLSLNIYSPNDGQDVYEDSFMKIAKSDGQT-----FNPTLI 294

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           +I  RLGI  I P+Y + +     +                           PQS+G+ G
Sbjct: 295 LIRTRLGIDKITPIYWDALIAALHM---------------------------PQSVGIAG 327

Query: 233 GKPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
           G+P  + YF+G  G+ + +LDP HT + I    D  + +E+ ++S  H  +  R+HI  M
Sbjct: 328 GRPASSHYFVGSQGSYLFYLDPHHTRKAIPYHDDVTKYTEEDIESC-HTSRLRRIHIKEM 386

Query: 292 DPSIAV 297
           DPS+ +
Sbjct: 387 DPSMLI 392


>gi|355558068|gb|EHH14848.1| hypothetical protein EGK_00836 [Macaca mulatta]
          Length = 458

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 103/347 (29%), Positives = 152/347 (43%), Gaps = 97/347 (27%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALD----NTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
            I  +VA D    N  V+++     T     S N   + +++++P+RLG +  N  Y+  
Sbjct: 255 GITIYVAQDFSVYNCDVIDKQSASMT-----SDNADDKAVIILVPVRLGGERTNTDYLEF 309

Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
           +K               ILS  Y              +G+IGGKP  + YF G+  + +I
Sbjct: 310 VK--------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLI 342

Query: 251 FLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           ++DPH  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 343 YMDPHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|27763971|emb|CAC85555.1| Apg4-C protein [Mus musculus]
 gi|148698944|gb|EDL30891.1| autophagy-related 4C (yeast), isoform CRA_a [Mus musculus]
          Length = 458

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 99/343 (28%), Positives = 153/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W-------NVNS---------------------------------------------KEE 78
           W       N +S                                             + E
Sbjct: 135 WPDALHIENADSDSWTSNTVKKFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNE 194

Query: 79  AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
           AY  KI+  F D   A + +H++   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 AYHRKIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            +  +VA D T+  + V     T+   + + + + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        +LS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|225543220|ref|NP_778194.3| cysteine protease ATG4C [Mus musculus]
 gi|225543224|ref|NP_001139439.1| cysteine protease ATG4C [Mus musculus]
 gi|341940254|sp|Q811C2.2|ATG4C_MOUSE RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
           cysteine endopeptidase; AltName: Full=Autophagin-3;
           AltName: Full=Autophagy-related cysteine endopeptidase
           3; AltName: Full=Autophagy-related protein 4 homolog C
          Length = 458

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 99/343 (28%), Positives = 153/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W-------NVNS---------------------------------------------KEE 78
           W       N +S                                             + E
Sbjct: 135 WPDALHIENADSDSWTSNTVKKFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNE 194

Query: 79  AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
           AY  KI+  F D   A + +H++   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 AYHRKIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            +  +VA D T+  + V     T+   + + + + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        +LS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|148698945|gb|EDL30892.1| autophagy-related 4C (yeast), isoform CRA_b [Mus musculus]
          Length = 466

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 99/343 (28%), Positives = 153/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 83  NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 142

Query: 71  W-------NVNS---------------------------------------------KEE 78
           W       N +S                                             + E
Sbjct: 143 WPDALHIENADSDSWTSNTVKKFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNE 202

Query: 79  AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
           AY  KI+  F D   A + +H++   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 203 AYHRKIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 262

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            +  +VA D T+  + V     T+   + + + + +++++P+RLG +  N  Y+  +K  
Sbjct: 263 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 319

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        +LS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 320 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 354

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 355 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 392


>gi|224117658|ref|XP_002331599.1| predicted protein [Populus trichocarpa]
 gi|222873995|gb|EEF11126.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 97/298 (32%), Positives = 145/298 (48%), Gaps = 50/298 (16%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK- 76
           D +SR+  TYRKGF  I DS LT+D  WGCMLR  QM++AQALLF  LGR W+  ++   
Sbjct: 145 DFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQMLVAQALLFHRLGRSWRKPLDKPL 204

Query: 77  EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK------- 129
           +  Y++IL +F D  ++ +SIH +   G + G A G W GP  V      L +       
Sbjct: 205 DREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAGSWVGPYAVCHSWESLVRSRREETN 264

Query: 130 --YDDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLG 179
             Y   S  V+ V+            L + +  + C+   +   +  W P++L++PL LG
Sbjct: 265 LEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHCSEFSKGQED--WTPILLLVPLVLG 322

Query: 180 IQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHAL 239
           +  INP YI  ++                             FTFPQSLG++GGKP  + 
Sbjct: 323 LDKINPRYIPSLQAT---------------------------FTFPQSLGILGGKPGAST 355

Query: 240 YFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           Y +G    +  +LDPH  Q    V +  +D  +   S+YHC     + +  +DPS+A+
Sbjct: 356 YIVGVQDENAFYLDPHEVQP---VVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAI 410


>gi|115461386|ref|NP_001054293.1| Os04g0682000 [Oryza sativa Japonica Group]
 gi|75143803|sp|Q7XPW8.1|ATG4B_ORYSJ RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B
 gi|32488637|emb|CAE03430.1| OSJNBa0032F06.13 [Oryza sativa Japonica Group]
 gi|82470053|gb|ABB77259.1| autophagy 4 [Oryza sativa Indica Group]
 gi|113565864|dbj|BAF16207.1| Os04g0682000 [Oryza sativa Japonica Group]
 gi|215697216|dbj|BAG91210.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 478

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 144/301 (47%), Gaps = 52/301 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
            D +SR+W TYR+GF  I DS  T+D  WGCM+R  QM++AQAL+F HLGR W+  +   
Sbjct: 134 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPLEKP 193

Query: 77  -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
               Y+ IL MF D     +SIH +   G S G A G W GP  + +  + L +      
Sbjct: 194 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 253

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                 + +   ++ V+ D          + ++   +LC    +  S   W P++L++PL
Sbjct: 254 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 311

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  INP YI  +K+                            FTFPQSLG++GGKP 
Sbjct: 312 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 344

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y  G   +  ++LDPH  Q      D   D+ +   S+YHC     L +  +DPS+A
Sbjct: 345 TSTYIAGVQDDRALYLDPHEVQ---MAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLA 401

Query: 297 V 297
           +
Sbjct: 402 I 402


>gi|126305934|ref|XP_001364974.1| PREDICTED: cysteine protease ATG4C [Monodelphis domestica]
          Length = 460

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 104/352 (29%), Positives = 151/352 (42%), Gaps = 103/352 (29%)

Query: 9   HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
           + ++E+ RRD  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR 
Sbjct: 75  YGNVEEFRRDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRA 134

Query: 69  WQW-----------------------------------------------NVNSKEE--- 78
           W W                                                  S++E   
Sbjct: 135 WTWPDALDIENSDSASWTSHTVKKLTASFEASLTGERTPKVPPSILKEPRRTGSEDEEGR 194

Query: 79  ---AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDD 132
               + KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D
Sbjct: 195 NELCHRKIISWFGDSPLACFGLHQLIEYGKKSGKTAGDWYGPAVVAHILRKAVEEARHPD 254

Query: 133 WSSIVFHVALDNTLVVNQV-------KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINP 185
              I  +VA D T+    V         L TT  +A        ++L++P+RLG +  N 
Sbjct: 255 LQGITIYVAQDCTVYKADVIDKQGISAGLETTEDKA--------IILLVPVRLGGERTNM 306

Query: 186 VYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYV 245
            Y++ +K               ILS  Y              +G+IGGKP  + YF G+ 
Sbjct: 307 DYLDFVK--------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQ 339

Query: 246 GNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            + +I++DPH  Q+   V  K+   E     ++HCP   ++    MDPS  V
Sbjct: 340 DDSLIYMDPHYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFRKMDPSCTV 386


>gi|224994904|gb|ACN76571.1| cysteine proteinase [Triticum aestivum]
          Length = 486

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 99/302 (32%), Positives = 148/302 (49%), Gaps = 54/302 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV-NS 75
            D +SR+W TYRKGF  I DS LT+D  WGCM+R  QM++AQAL+F HLGR W+    N 
Sbjct: 139 EDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPAQNP 198

Query: 76  KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
               Y++IL +F D     +SIH +   G S G A G W GP  + +  + L +      
Sbjct: 199 SNPEYIRILHLFGDSEACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQP 258

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLC-TTNKRASSNPQWQPLVLVIP 175
                 + +   ++ V+ D          + ++   +LC   NK  S+   W P++L++P
Sbjct: 259 EVINRNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKDQSA---WSPILLLVP 315

Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
           L LG+  INP YI  +K+                            FTFPQSLG++GGKP
Sbjct: 316 LVLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKP 348

Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
             + Y  G   +  ++LDPH  Q      +   D+ +   S+YHC     + +  +DPS+
Sbjct: 349 GASTYIAGVQDDRALYLDPHEVQ---LAVNIASDNLEADTSSYHCSTVRDMPLDLIDPSL 405

Query: 296 AV 297
           A+
Sbjct: 406 AI 407


>gi|378731837|gb|EHY58296.1| autophagy-like protein 4 [Exophiala dermatitidis NIH/UT8656]
          Length = 480

 Score =  147 bits (371), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 147/320 (45%), Gaps = 72/320 (22%)

Query: 18  DITSRLWFTYRKGFVPIGDS----------------------GLTTDKGWGCMLRCGQMV 55
           D  SR+W TYR  F PI  S                      G T+D GWGCM+R GQ +
Sbjct: 117 DFESRIWMTYRSNFTPIPRSQEPSRASSMSFSVRLRNLTEREGFTSDTGWGCMIRSGQSL 176

Query: 56  IAQALLFLHLGRDWQWN----------------VNSKEEAYLKILKMFEDRRTAPYSIHQ 99
           +A  L+ LHLGRDW+ +                 ++K EA  +IL +F D   AP+SIH+
Sbjct: 177 LANTLMLLHLGRDWRRDHTHTPTTSDSKPSSSSSSTKREA--EILSLFADSPDAPFSIHR 234

Query: 100 IALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALD-NTLVVNQVKKLCTT 157
               GAS  GK  G+WFGP+  A  +R+L+     + +  +V    + L  ++ + +   
Sbjct: 235 FVQHGASACGKHPGQWFGPSATASCIRELSTECAAAGLRVYVTPSASELYEDRFRSIAAA 294

Query: 158 NKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQT 217
           +    S+P  +P +++  +RLG+  I PVY   +K                         
Sbjct: 295 SP---SDPTIKPTLILFGIRLGLDRITPVYHEALKSS----------------------- 328

Query: 218 PRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDST 277
                T+PQS+G+ GG+P+ + YF+G  G+   +LDPH  +     +    D  ++  +T
Sbjct: 329 ----LTYPQSIGIAGGRPSSSHYFVGCQGDLFFYLDPHETRPALPHHASPADYSEEEIAT 384

Query: 278 YHCPQASRLHILHMDPSIAV 297
            H  +   L I  MDPS+ +
Sbjct: 385 CHTRRLRGLRINEMDPSMLI 404


>gi|358056752|dbj|GAA97415.1| hypothetical protein E5Q_04093 [Mixia osmundae IAM 14324]
          Length = 1202

 Score =  147 bits (371), Expect = 6e-33,   Method: Composition-based stats.
 Identities = 92/295 (31%), Positives = 138/295 (46%), Gaps = 84/295 (28%)

Query: 18  DITSRLWFTYRKGFVPI---------------------------GDSGLTTDKGWGCMLR 50
           D TSR+  TYR GF PI                            + GL+TD GWGCMLR
Sbjct: 548 DFTSRIQLTYRAGFPPIPTTVSNGPATTAFNAVLSSLTGRSPLQANDGLSTDAGWGCMLR 607

Query: 51  CGQMVIAQALLFLHLGRDWQWNVNSKEEA---------------YLKILKMFEDRRT--A 93
            GQ ++A AL F+HLGRDW+   +S +E+               Y ++L  F D  +   
Sbjct: 608 TGQSLLANALAFVHLGRDWRRTCSSSDESPDIPEESRSLEHFETYARLLTWFLDDPSPLC 667

Query: 94  PYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
           P+S+H+ A+ G  + GK +GEWFGP+T A  ++ LA     +++   V++D T+  + V+
Sbjct: 668 PFSVHRFAVVGKEQGGKEIGEWFGPSTAAGAIKHLASNFAPANLGVAVSVDGTVYRSDVQ 727

Query: 153 KLC-------TTNKRASSNP----QWQ-PLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
                      T  R    P     WQ P++++I  RLG+  +NP+Y   IK        
Sbjct: 728 AAANPPFSEPATAGRQDPAPSVRTSWQRPVLILINARLGLDKVNPLYYESIKAA------ 781

Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                                 +FPQS+G+ GG+P+ + YF+G   N V ++DPH
Sbjct: 782 ---------------------LSFPQSVGISGGRPSSSYYFVGVQQNSVYYIDPH 815


>gi|218195841|gb|EEC78268.1| hypothetical protein OsI_17962 [Oryza sativa Indica Group]
          Length = 912

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 144/301 (47%), Gaps = 52/301 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
            D +SR+W TYR+GF  I DS  T+D  WGCM+R  QM++AQAL+F HLGR W+  +   
Sbjct: 134 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKP 193

Query: 76  KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
               Y+ IL MF D     +SIH +   G S G A G W GP  + +  + L +      
Sbjct: 194 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 253

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                 + +   ++ V+ D          + ++   +LC    +  S   W P++L++PL
Sbjct: 254 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 311

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  INP YI  +K+                            FTFPQSLG++GGKP 
Sbjct: 312 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 344

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y  G   +  ++LDPH  Q      D   D+ +   S+YHC     L +  +DPS+A
Sbjct: 345 TSTYIAGVQDDRALYLDPHEVQ---MAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLA 401

Query: 297 V 297
           +
Sbjct: 402 I 402


>gi|147742963|sp|Q2XPP4.2|ATG4B_ORYSI RecName: Full=Cysteine protease ATG4B; AltName:
           Full=Autophagy-related protein 4 homolog B;
           Short=Protein autophagy 4; AltName: Full=OsAtg4
          Length = 478

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 144/301 (47%), Gaps = 52/301 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
            D +SR+W TYR+GF  I DS  T+D  WGCM+R  QM++AQAL+F HLGR W+  +   
Sbjct: 134 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKP 193

Query: 76  KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
               Y+ IL MF D     +SIH +   G S G A G W GP  + +  + L +      
Sbjct: 194 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 253

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                 + +   ++ V+ D          + ++   +LC    +  S   W P++L++PL
Sbjct: 254 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 311

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  INP YI  +K+                            FTFPQSLG++GGKP 
Sbjct: 312 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 344

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y  G   +  ++LDPH  Q      D   D+ +   S+YHC     L +  +DPS+A
Sbjct: 345 TSTYIAGVQDDRALYLDPHEVQ---MAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLA 401

Query: 297 V 297
           +
Sbjct: 402 I 402


>gi|224092798|ref|XP_002309707.1| predicted protein [Populus trichocarpa]
 gi|222852610|gb|EEE90157.1| predicted protein [Populus trichocarpa]
          Length = 481

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 97/304 (31%), Positives = 146/304 (48%), Gaps = 50/304 (16%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           L    RD +SR+  TYRKGF  I DS LT+D  WGCMLR  QM++AQALLF  LGR W+ 
Sbjct: 138 LAAFHRDFSSRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRK 197

Query: 72  NVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
            V+   +  Y++IL +F D   + +SIH +   G + G A G W GP  + +    LA+ 
Sbjct: 198 PVDKPLDRDYVEILHLFGDSEASAFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWESLAR- 256

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN----PV 186
                                      +KR  +N ++Q L + + +  G +D      PV
Sbjct: 257 ---------------------------SKREETNLEYQTLPMAVYVVSGCEDGERGGAPV 289

Query: 187 YI--NGIKKCYALPI-----SPVYDMVKILSSTYNMQTPRY------EFTFPQSLGVIGG 233
               +  + C          +P+  +V ++     +  PRY       FTFPQSLG++GG
Sbjct: 290 LSIEDAARHCSEFSKGREDWTPILLLVPLVLGLDKIN-PRYIPSLQATFTFPQSLGILGG 348

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           KP  + Y +G    +  +LDPH  Q    V +  +D  +   S+YHC     + +  +DP
Sbjct: 349 KPGASTYIVGVQDENAFYLDPHEVQP---VVNFSRDDVEANTSSYHCDVVRHIPLDLIDP 405

Query: 294 SIAV 297
           S+A+
Sbjct: 406 SLAI 409


>gi|147862867|emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 102/311 (32%), Positives = 148/311 (47%), Gaps = 53/311 (17%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S   L +  +D +SR+  TYRKGF  IGDS LT+D  WGCMLR  QM++AQALL   +GR
Sbjct: 129 SSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGR 188

Query: 68  DWQWNVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
            W+   +   ++ Y++IL  F D + + +SIH I   G + G A G W GP  + +    
Sbjct: 189 SWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWET 248

Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINP- 185
           LA+                            +KR  ++ + Q L + I +  G +D    
Sbjct: 249 LAR----------------------------SKREETDLECQSLPMAIYIVSGDEDGERG 280

Query: 186 ----VYIN-GIKKCYALPISPVYDMVKILSSTYNMQ-----TPRY------EFTFPQSLG 229
               VYI    + C       V D   IL     +       PRY       FTFPQSLG
Sbjct: 281 GAPVVYIEEASRHCLEFSKGQV-DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLG 339

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL-HI 288
           ++GGKP  + Y +G       +LDPH  Q+   V D  +++ +   S+YHC  +S + HI
Sbjct: 340 ILGGKPGASTYIVGVQDEKAFYLDPHEAQS---VVDIRRENLEADTSSYHCNCSSIIRHI 396

Query: 289 L--HMDPSIAV 297
               +DPS+A+
Sbjct: 397 CLDSIDPSLAI 407


>gi|297828133|ref|XP_002881949.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
 gi|297327788|gb|EFH58208.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
          Length = 467

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 94/303 (31%), Positives = 144/303 (47%), Gaps = 49/303 (16%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           L   ++D +S++  TYR+GF P  D+  T+D  WGCM+R  QM+ AQALLF  LGR W  
Sbjct: 135 LAAFQQDFSSKILMTYRRGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRSWTK 194

Query: 72  NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA--- 128
                E+ YL+ L+ F D  ++ +SIH + + G+S G A G W GP  + +    LA   
Sbjct: 195 KSELPEQEYLETLEPFGDSESSAFSIHNLIIAGSSYGLAAGSWVGPYAICRAWESLACKK 254

Query: 129 -KYDDWSSIVFHVALD-------------NTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
            K  D  +    +A+                L +    K C    +  S  +W P++L++
Sbjct: 255 RKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQS--EWTPILLLV 312

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
           PL LG+  +NP YI  +                              FTFPQS+G++GGK
Sbjct: 313 PLVLGLDSVNPRYIPSLIA---------------------------TFTFPQSVGILGGK 345

Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
           P  + Y +G   +   +LDPH  Q +  V  +  D +    S+YHC     + +  +DPS
Sbjct: 346 PGASTYIVGVQEDKGFYLDPHEVQQVVTVNKETPDVDT---SSYHCNVIRYVPLESLDPS 402

Query: 295 IAV 297
           +A+
Sbjct: 403 LAL 405


>gi|340518098|gb|EGR48340.1| protease required for autophagy [Trichoderma reesei QM6a]
          Length = 450

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/316 (30%), Positives = 149/316 (47%), Gaps = 61/316 (19%)

Query: 17  RDITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQ 53
            D+ ++ W TYR GF PI  S                       G ++D GWGCM+R GQ
Sbjct: 117 EDMAAKFWMTYRSGFEPIPKSVDPRATSALSFSVRIKSTLTDPTGFSSDSGWGCMIRSGQ 176

Query: 54  MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
            ++A  +  L LGRDW+   N +EE   +++ MF D   AP+SIH     GA+  GK  G
Sbjct: 177 SLLATTIATLQLGRDWRRGKNQQEER--RLISMFADDPRAPFSIHNFVRHGATACGKFPG 234

Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
           EWFGP+  AQ ++ L    D    V+       +  +   K+   + +      + P ++
Sbjct: 235 EWFGPSATAQCIQALTSSSDLDLHVYSPNDGQDVYEDSFMKVAKPDGQ-----DFHPTLI 289

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           +I  RLGI  I P+Y                   + L +T  M         PQS+G+ G
Sbjct: 290 LIRTRLGIDKITPIYW------------------EPLIATLQM---------PQSVGIAG 322

Query: 233 GKPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
           G+P+ + YF+G  G+ + +LDP HT + +    D    +++ +DS  H  +  RLH+  M
Sbjct: 323 GRPSSSHYFVGSQGSYLFYLDPHHTRKALPYHEDVANYTDEDIDSC-HTSRLRRLHVKEM 381

Query: 292 DPSIAV-VSQRSYSDY 306
           DPS+ +    RS SD+
Sbjct: 382 DPSMLIGFLIRSESDW 397


>gi|449676306|ref|XP_002158689.2| PREDICTED: cysteine protease ATG4C-like [Hydra magnipapillata]
          Length = 442

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 92/293 (31%), Positives = 142/293 (48%), Gaps = 47/293 (16%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW---QWNV 73
            D +S ++ +YRK F  + +S LT+D GWGCMLR GQM++A ALL   L   W   +   
Sbjct: 102 EDFSSLIYLSYRKHFSQLANSNLTSDSGWGCMLRTGQMLLANALLIHMLKEGWRISERKY 161

Query: 74  NSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLR---KLA 128
             K   Y  IL+ F D  +  +P+S+H++   G+   K  GEW+GP +VA  L     L 
Sbjct: 162 TEKNYIYRMILRFFNDENSDNSPFSLHELVRIGS---KKPGEWYGPTSVAHTLSAAVNLT 218

Query: 129 KYDDWSSIVFHVALDNTL----VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN 184
            +    +   +VA D T+    V++   K     K+      W+ +++++P+RLG   +N
Sbjct: 219 SHPVLDTFRVYVANDCTVYIKDVISTSTKCKNCTKKTCQEKFWRSMLILVPIRLGSDGLN 278

Query: 185 PVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGY 244
           P+YI  +K   AL                         T    +G+IGG+P H+LYF+G+
Sbjct: 279 PIYIPCLK---AL------------------------LTLDYCVGIIGGRPKHSLYFVGF 311

Query: 245 VGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            G  +I LDPH  Q    +  +E   E     ++ C    ++    MDPS AV
Sbjct: 312 QGKKLINLDPHYLQEYVDMTTQEFPVE-----SFRCHYPKKMAFKKMDPSCAV 359


>gi|452004375|gb|EMD96831.1| hypothetical protein COCHEDRAFT_1123524 [Cochliobolus
           heterostrophus C5]
          Length = 471

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 83/262 (31%), Positives = 129/262 (49%), Gaps = 55/262 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
           D  SR+W TYR GF+ I  S                     G T+D G+GCM+R GQ ++
Sbjct: 99  DFESRIWMTYRSGFMAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCMIRSGQSIL 158

Query: 57  AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
           A AL  L LGRDW++      + + +IL +F D   AP+SIH+    GA+  GK  GEWF
Sbjct: 159 ANALQILRLGRDWRYQDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAVCGKYPGEWF 218

Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVLVI 174
           GP+  A+ ++ LA     + +  +V+ D   V  +++K++   +     + QWQP ++++
Sbjct: 219 GPSAAARCIQDLANKHREAGLRVYVSGDGADVYEDKLKEVAIDD-----DGQWQPTLILV 273

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
             RLGI  I PVY   +K    +                            QS+G+ GG+
Sbjct: 274 GTRLGIDKITPVYWEALKASLQMK---------------------------QSIGIAGGR 306

Query: 235 PNHALYFIGYVGNDVIFLDPHT 256
           P+ + YF+   GN+  +LDPH+
Sbjct: 307 PSASHYFVATQGNNFFYLDPHS 328


>gi|74147895|dbj|BAE22307.1| unnamed protein product [Mus musculus]
          Length = 458

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 98/343 (28%), Positives = 153/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W-------NVNS---------------------------------------------KEE 78
           W       N +S                                             + E
Sbjct: 135 WPDALHIENADSDSWTSNTVKKFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNE 194

Query: 79  AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
           AY  KI+  F +   A + +H++   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 AYHRKIISWFGNSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            +  +VA D T+  + V     T+   + + + + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        +LS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|224994902|gb|ACN76570.1| cysteine proteinase [Triticum aestivum]
          Length = 484

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 96/301 (31%), Positives = 147/301 (48%), Gaps = 52/301 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV-NS 75
            D +SR+W TYRKGF  I DS LT+D  WGCM+R  QM++AQAL+F HLGR W+    N 
Sbjct: 137 EDFSSRVWITYRKGFDVISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPAQNP 196

Query: 76  KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
            +  + +IL +F D     +SIH +   G S G A G W GP  + +  + L +      
Sbjct: 197 SDPEHTRILHLFGDSEVCAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQP 256

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                 + +  +++ V+ D          + ++   +LC    +  S   W P++L++PL
Sbjct: 257 EVINRNESFPMVLYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKGQS--AWSPILLLVPL 314

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  INP YI  +K+                            FTFPQSLG++GGKP 
Sbjct: 315 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 347

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y  G   +  ++LDPH  Q      +   D+ +   S+YHC     + +  +DPS+A
Sbjct: 348 ASTYIAGVQDDRALYLDPHEVQ---LAVNIASDNLEADTSSYHCSTVRDMPLDLIDPSLA 404

Query: 297 V 297
           +
Sbjct: 405 I 405


>gi|342877133|gb|EGU78640.1| hypothetical protein FOXB_10826 [Fusarium oxysporum Fo5176]
          Length = 449

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 94/318 (29%), Positives = 144/318 (45%), Gaps = 62/318 (19%)

Query: 18  DITSRLWFTYRKGFVPI----------------------GD-SGLTTDKGWGCMLRCGQM 54
           D  SR+W TYR  F PI                      GD S  ++D GWGCM+R GQ 
Sbjct: 117 DFESRIWMTYRSEFDPIPRSTNPQATSSLSLSMRLKSQLGDQSPFSSDSGWGCMIRSGQS 176

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A  +  + LGRDW+   + +EE   +ILK F D   APYSIH     GAS  GK  GE
Sbjct: 177 LLANTIALVRLGRDWRQGQSLEEEC--RILKDFADDPRAPYSIHSFVRHGASACGKYPGE 234

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ LA   + S  V+       +  +   K+      A     + P +++
Sbjct: 235 WFGPSATARCIQALANSHEPSIRVYSTGDGPDVYEDDFMKIANPTGEA-----FHPTLVL 289

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLG+  I PVY   +     +P                           QS+G+ GG
Sbjct: 290 VGTRLGLDKITPVYWEALIAALQMP---------------------------QSVGIAGG 322

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YFIG  G+ + +LDPH  +     ++   D   +   + H  +  R+H+  MDP
Sbjct: 323 RPSSSHYFIGSQGSFLFYLDPHHTRPALPYHENPMDYTSEEIESCHTARLRRIHVREMDP 382

Query: 294 SI----AVVSQRSYSDYK 307
           S+     + S+  + D+K
Sbjct: 383 SMLIGFLIRSEEDWQDWK 400


>gi|166990618|sp|A7KAI3.1|ATG4_PICAN RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|129714817|gb|ABO31288.1| Atg4p [Ogataea angusta]
          Length = 509

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 96/323 (29%), Positives = 154/323 (47%), Gaps = 74/323 (22%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDS------------------------GLTTDKGWGCML 49
           +  RD+ SR+W TYR GF  I  +                        G TTD GWGCM+
Sbjct: 73  EFLRDVHSRIWLTYRSGFPLIKRAEDGPSPLSFGSLIRGTVDLATVTKGFTTDAGWGCMI 132

Query: 50  RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-EG 108
           R  Q ++A +LL L LGR W+++   +   + +I+  F D  TAP+SIH     GA+  G
Sbjct: 133 RTSQSLLANSLLQLRLGRGWRYDQTRECAKHAEIVSWFVDIPTAPFSIHNFVEQGANCAG 192

Query: 109 KAVGEWFGPNTVAQVLRKL--AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ 166
           K  GEWFGP+  A+ ++ L  A YD     V+  A    +  +++ +L      A    +
Sbjct: 193 KKPGEWFGPSAAARSIQVLCEANYDKTGLKVYFTA-SGDIYEDELFEL------AQQGAE 245

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
            +P++++  +RLG++++NP+Y + +KK                              +PQ
Sbjct: 246 LRPVLILAGIRLGVKNVNPLYWDFLKKTLG---------------------------WPQ 278

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK------------EQDSEKKL 274
           S+G+ GG+P+ + YF G+ G+ + +LDPH  Q    +  +            E +S   L
Sbjct: 279 SVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHESPDPNHYVEVESGLDL 338

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
           DS  H  +  +LH+  MDPS+ V
Sbjct: 339 DSV-HTNKIRKLHLDQMDPSMLV 360


>gi|216963242|gb|ACJ73913.1| autophagy-related 4a variant 2 [Zea mays]
          Length = 429

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 95/301 (31%), Positives = 145/301 (48%), Gaps = 52/301 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
            D +SR+W TYRKGF  I DS LT+D  WGCM+R  QM++AQAL+F HLGR W+      
Sbjct: 150 EDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPPEKP 209

Query: 77  -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
               Y+ +L +F D     +SIH +   G + G A G W GP  + +  + L +      
Sbjct: 210 YNPDYIGVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIRTNREQA 269

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                 +++   ++ V+ D          + ++   +LC+   +  S   W P++L++PL
Sbjct: 270 DAVDGKENFPMALYVVSGDEDGERGGAPVVCIDVAAQLCSDFNKGPST--WSPILLLVPL 327

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  INP YI  +K+                            F FPQSLG++GGKP 
Sbjct: 328 VLGLDKINPRYIPLLKE---------------------------TFMFPQSLGILGGKPG 360

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y  G   +  ++LDPH  Q      D   D+ +   S+YHC     L +  +DPS+A
Sbjct: 361 TSTYIAGVQDDRALYLDPHEVQ---MTVDIALDNLEADTSSYHCSVVRALALEQIDPSLA 417

Query: 297 V 297
           +
Sbjct: 418 I 418


>gi|221137004|ref|NP_001137488.1| autophagy-related 4 [Zea mays]
 gi|195620628|gb|ACG32144.1| cysteine protease ATG4B [Zea mays]
 gi|216963236|gb|ACJ73912.1| autophagy-related 4 variant 1 [Zea mays]
 gi|219886349|gb|ACL53549.1| unknown [Zea mays]
 gi|414584729|tpg|DAA35300.1| TPA: autophagy 4a variant 2 isoform 1 [Zea mays]
 gi|414584730|tpg|DAA35301.1| TPA: autophagy 4a variant 2 isoform 2 [Zea mays]
          Length = 492

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 95/301 (31%), Positives = 145/301 (48%), Gaps = 52/301 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
            D +SR+W TYRKGF  I DS LT+D  WGCM+R  QM++AQAL+F HLGR W+      
Sbjct: 150 EDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPPEKP 209

Query: 77  -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
               Y+ +L +F D     +SIH +   G + G A G W GP  + +  + L +      
Sbjct: 210 YNPDYIGVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIRTNREQA 269

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                 +++   ++ V+ D          + ++   +LC+   +  S   W P++L++PL
Sbjct: 270 DAVDGKENFPMALYVVSGDEDGERGGAPVVCIDVAAQLCSDFNKGPST--WSPILLLVPL 327

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  INP YI  +K+                            F FPQSLG++GGKP 
Sbjct: 328 VLGLDKINPRYIPLLKE---------------------------TFMFPQSLGILGGKPG 360

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y  G   +  ++LDPH  Q      D   D+ +   S+YHC     L +  +DPS+A
Sbjct: 361 TSTYIAGVQDDRALYLDPHEVQ---MTVDIALDNLEADTSSYHCSVVRALALEQIDPSLA 417

Query: 297 V 297
           +
Sbjct: 418 I 418


>gi|410967384|ref|XP_003990200.1| PREDICTED: cysteine protease ATG4C [Felis catus]
          Length = 459

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 105/357 (29%), Positives = 158/357 (44%), Gaps = 91/357 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPAVSQKETIRRYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V  K CT+   AS N   + ++++IP+RLG +  N  Y++ +K 
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTDDKAVIILIPVRLGGERTNTDYLDFVKG 312

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                I    ++V +L                     +  KP  + YF G+  + +I++D
Sbjct: 313 -----ILRALNIVWVL---------------------LVAKPKQSYYFAGFQDDSLIYMD 346

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           PH  Q+   V  K+   E     T+HCP   ++    MDPS  +    R+  D+K  
Sbjct: 347 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 398


>gi|255576671|ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
 gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B, putative [Ricinus communis]
          Length = 489

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 96/304 (31%), Positives = 146/304 (48%), Gaps = 50/304 (16%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           L +   D +SR+  TYR+GF  IGDS   +D GWGCMLR  QM++AQALLF  LGR W  
Sbjct: 134 LAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQMLVAQALLFHKLGRAWTK 193

Query: 72  NVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK- 129
                 ++AY++IL +F D   AP+SIH +   G +   A G W GP  + +    LA+ 
Sbjct: 194 PFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGSWVGPYAMCRSWESLARS 253

Query: 130 --------YDDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
                   Y      V+ V+ D          + +    + C    R  ++  W P++L+
Sbjct: 254 KREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCLEFSRGQAD--WTPILLL 311

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +PL LG+  +NP YI  ++                             FTF QSLG++GG
Sbjct: 312 VPLVLGLDKVNPRYIPSLQAT---------------------------FTFSQSLGIMGG 344

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           KP  + Y +G   ++  +LDPH  Q+   V +  +D  +   S+YH      + +  +DP
Sbjct: 345 KPGASTYIVGVQDDNAFYLDPHEVQS---VVNIGRDDIEADTSSYHSDIVRHIPLHSIDP 401

Query: 294 SIAV 297
           S+A+
Sbjct: 402 SLAI 405


>gi|353227348|emb|CCA77858.1| hypothetical protein PIIN_00505 [Piriformospora indica DSM 11827]
          Length = 1257

 Score =  144 bits (364), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 92/305 (30%), Positives = 142/305 (46%), Gaps = 94/305 (30%)

Query: 18  DITSRLWFTYRKGFVPI----------------------------------------GDS 37
           D TSR+W TYR  F PI                                        G+ 
Sbjct: 320 DYTSRVWLTYRNTFPPIRDTALSCLEPVASRSTHNNSSSTDISQPLPSPSKPRWPWSGEK 379

Query: 38  GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE----AYLKILKMFEDRRT- 92
           G T+D GWGCMLR GQ ++A AL+ LHL R W+   +         Y++IL  F D  + 
Sbjct: 380 GWTSDAGWGCMLRTGQSLLANALIHLHLSRSWRRPTHPSYSPDYVQYVRILTWFLDNPSP 439

Query: 93  -APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV 151
            AP+ IH++AL G   GK VG WFGP+T A  +++L    + + +   +A+D+ +  + V
Sbjct: 440 LAPFGIHRMALAGKELGKEVGSWFGPSTAAGAIKRLVGEFEDAGLEVALAVDSVVYQSDV 499

Query: 152 -----------------KKLCTTN--KRASSNPQW--QPLVLVIPLRLGIQDINPVYING 190
                            K + T+   K+    P+W  +P+++++ +RLGI  +NP+Y   
Sbjct: 500 YAASAASRNQNGVEGDSKTVGTSKSRKKGQGPPKWGNRPVLILVGIRLGIDGVNPIY--- 556

Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
                       Y+ VK L            FTFPQ++G+ GG+P+ + YF+G  G+ + 
Sbjct: 557 ------------YESVKTL------------FTFPQTVGIAGGRPSSSYYFVGAQGDSLF 592

Query: 251 FLDPH 255
           +LDPH
Sbjct: 593 YLDPH 597


>gi|147905876|ref|NP_001088249.1| cysteine protease ATG4C [Xenopus laevis]
 gi|61211751|sp|Q5XH30.1|ATG4C_XENLA RecName: Full=Cysteine protease ATG4C; AltName:
           Full=Autophagy-related protein 4 homolog C
 gi|54038152|gb|AAH84245.1| LOC495080 protein [Xenopus laevis]
          Length = 450

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 101/340 (29%), Positives = 149/340 (43%), Gaps = 93/340 (27%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++++ R+D  SR+W TYRK F  I  S  TTD GWGC LR GQM++AQ LL   LGRDW 
Sbjct: 76  NVDEFRKDFISRIWLTYRKEFPQIESSSWTTDCGWGCTLRTGQMLLAQGLLVHFLGRDWT 135

Query: 71  WNV-------------------------------------------NSK-----EEAYLK 82
           W                                             NS+     E+ + K
Sbjct: 136 WTEALDIFCSESDFWTANTARKLDPSLEKSSPENEEYVSLGKQPLQNSEKKRYSEDLHRK 195

Query: 83  ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---DWSSIVFH 139
           I+  F D   A + +HQ+   G + GK  G+W+GP  V+ +LRK  +     +   I  +
Sbjct: 196 IISWFADYPLAYFGLHQLVKLGKNSGKVAGDWYGPAVVSHLLRKAIEESSDPELQGITIY 255

Query: 140 VALDNTLVVNQVKKL-CTT-NKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYAL 197
           VA D T+    V  L C   N++A        +V+++P+RLG +  N  Y   +K   +L
Sbjct: 256 VAQDCTIYNADVYDLQCNKGNEKA--------VVILVPVRLGGERTNMEYFEYVKGILSL 307

Query: 198 PISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN 257
                                  EF     +G+IGGKP  + YF+G+  + +I++DPH  
Sbjct: 308 -----------------------EFC----IGIIGGKPKQSYYFVGFQDDSLIYMDPHYC 340

Query: 258 QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           Q+   V  K    E     ++HCP   ++    MDPS  V
Sbjct: 341 QSFVDVSIKNFPLE-----SFHCPSPKKMSFKKMDPSCTV 375


>gi|451855330|gb|EMD68622.1| hypothetical protein COCSADRAFT_79257 [Cochliobolus sativus ND90Pr]
          Length = 473

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 82/262 (31%), Positives = 128/262 (48%), Gaps = 55/262 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
           D  SR+W TYR GF  I  S                     G T+D G+GCM+R GQ ++
Sbjct: 99  DFESRIWMTYRSGFTAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCMIRSGQSIL 158

Query: 57  AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
           A AL  L LGRDW++      + + +IL +F D   AP+SIH+    GA+  GK  GEWF
Sbjct: 159 ANALQILRLGRDWRYQDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAVCGKYPGEWF 218

Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVLVI 174
           GP+  A+ ++ LA     + +  +V+ D   V  +++K++   +     + +WQP ++++
Sbjct: 219 GPSAAARCIQDLANKHREAGLRVYVSGDGADVYEDKLKEVAIDD-----DGEWQPTLILV 273

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
             RLGI  I PVY   +K    +                            QS+G+ GG+
Sbjct: 274 GTRLGIDKITPVYWEALKASLQMK---------------------------QSIGIAGGR 306

Query: 235 PNHALYFIGYVGNDVIFLDPHT 256
           P+ + YF+   GN+  +LDPH+
Sbjct: 307 PSASHYFVATQGNNFFYLDPHS 328


>gi|326925485|ref|XP_003208945.1| PREDICTED: cysteine protease ATG4C-like [Meleagris gallopavo]
          Length = 458

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 98/343 (28%), Positives = 152/343 (44%), Gaps = 90/343 (26%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWV 134

Query: 71  W-------NVNS--------------------------------------------KEEA 79
           W       N +S                                            ++E 
Sbjct: 135 WPDALDIENSDSESWTAHTVKKLTASLEASLTAEREPRILSNHQERIRRNCGDGEMRDEV 194

Query: 80  Y-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWSS 135
           Y  KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A+  +   
Sbjct: 195 YHRKIISWFGDSPLAAFGLHQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG 254

Query: 136 IVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
           +  +VA D T+  + V  + C+      ++   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 VTVYVAQDCTVYSSDVIDRQCSFMDSGETDT--KAVIILVPVRLGGERTNMDYLEFVK-- 310

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        ILS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 311 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 345

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     ++HCP   ++    MDPS  +
Sbjct: 346 HYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFKKMDPSCTI 383


>gi|301104974|ref|XP_002901571.1| cysteine protease family C54, putative [Phytophthora infestans
           T30-4]
 gi|262100575|gb|EEY58627.1| cysteine protease family C54, putative [Phytophthora infestans
           T30-4]
          Length = 392

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 87/269 (32%), Positives = 132/269 (49%), Gaps = 45/269 (16%)

Query: 5   NKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH 64
           + ++  D +  +R   S LWFTYR+ +  +     T+D GWGCMLR  QM++ QAL    
Sbjct: 31  DDVAAVDFDAYKRSFESILWFTYRRDYPAMTPYEHTSDAGWGCMLRSAQMLLGQALQRRL 90

Query: 65  LGRDW------QWNVNSK-EEAYLKILKMFEDRRTAP--YSIHQIALTGASEGKAVGEWF 115
           LGRDW      +  ++++  E Y+++L+ F D       YSIHQ+   G    K  GEW+
Sbjct: 91  LGRDWRLPALFETEIDARLPETYVQLLRWFADSPDVECRYSIHQMVKLGVQYDKLPGEWY 150

Query: 116 GPNTVAQVLRKLA---KYDDWSSIVFHVALDNTLVVNQVKKLCTTN-----KRASSNPQW 167
           GP T AQVLR L    + +    +  +V  +  +  + V KLC  +             W
Sbjct: 151 GPTTAAQVLRDLVNLHRREFGGELSMYVPQEGVVYSDDVAKLCFFDPLLHPPTTEDKSDW 210

Query: 168 Q-PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
              L+++IPLRLG+  +N  Y+  I+K +A                           FPQ
Sbjct: 211 STALLILIPLRLGLDQVNERYVPAIQKSFA---------------------------FPQ 243

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
           S+G+IGGK  H++YF+G   + +  LDPH
Sbjct: 244 SVGIIGGKKGHSVYFVGTQQDQLHLLDPH 272


>gi|357166768|ref|XP_003580841.1| PREDICTED: cysteine protease ATG4B-like [Brachypodium distachyon]
          Length = 493

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 94/301 (31%), Positives = 144/301 (47%), Gaps = 52/301 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
            D +SR+W TYRKGF  I DS  T+D  WGCM+R  QM++AQAL+F HLGR W+      
Sbjct: 146 EDFSSRIWVTYRKGFDAISDSKFTSDVNWGCMVRSSQMLVAQALMFHHLGRSWRKPSQKP 205

Query: 77  -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
               Y++IL +F D     +S+H +   G S G A G W GP  + +  + L +      
Sbjct: 206 CNPEYIRILHLFGDSEVCAFSVHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQP 265

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                 + +   ++ V+ D          + ++   +LC    +  S   W P++L++PL
Sbjct: 266 EVSNGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKDQST--WSPILLLVPL 323

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  INP YI  +K+                            FTFPQSLG++GGKP 
Sbjct: 324 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 356

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y  G   +  ++LDPH  Q    +     D++    S+YHC     + +  +DPS+A
Sbjct: 357 TSTYIAGIQDDRALYLDPHDVQMAVNIASDNLDADT---SSYHCSTVRDMALDLLDPSLA 413

Query: 297 V 297
           +
Sbjct: 414 I 414


>gi|443893810|dbj|GAC71266.1| cysteine protease [Pseudozyma antarctica T-34]
          Length = 1509

 Score =  143 bits (361), Expect = 8e-32,   Method: Composition-based stats.
 Identities = 91/336 (27%), Positives = 151/336 (44%), Gaps = 104/336 (30%)

Query: 37   SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE------------------ 78
            +GLTTD GWGCMLR GQ ++A AL+ +HLGR WQ     K +                  
Sbjct: 774  AGLTTDSGWGCMLRTGQSLLANALINVHLGRSWQRETAPKSQIEFFEELANASLDASAEN 833

Query: 79   --------------AYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
                           Y++IL  F D  +   P+ +H++A  G   GK VGEWFGP+T A 
Sbjct: 834  QSLASWRERRARHATYIRILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAG 893

Query: 123  VLRKLAKYDDWSSIVFHVALDNTLVVNQVKK-------------LCTTNKRASSNPQWQP 169
             +++L      + I   +A D    +++V+              + + N+RA +    +P
Sbjct: 894  AIKQLVFDFPEAGIAVELAHDGVFYLDEVRAAASASTGKSRASGMLSGNRRAETAVWRRP 953

Query: 170  LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
            ++++I +RLG++ +NP+Y   +K                             F+FPQS+G
Sbjct: 954  VLILIGIRLGLETVNPIYYESVKAT---------------------------FSFPQSVG 986

Query: 230  VIGGKPNHALYFIGYVGNDVIFLDPHTNQ--------------------NIGCVY---DK 266
            + GG+P+ + YF+G+ GN + +LDPH  +                    ++   Y   D+
Sbjct: 987  IAGGRPSSSYYFMGHQGNSLFYLDPHNVRPAVPLRYPPTTFPAAAPSRFDVSHRYALEDR 1046

Query: 267  EQDSE-------KKLDSTYHCPQASRLHILHMDPSI 295
            + + E       +   ST+HC +  R+ I  +DPS+
Sbjct: 1047 DDEDEWWSHAYTEAQTSTFHCEKVRRMPIKSLDPSM 1082


>gi|449508713|ref|XP_002198788.2| PREDICTED: cysteine protease ATG4C [Taeniopygia guttata]
          Length = 456

 Score =  143 bits (361), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 97/344 (28%), Positives = 148/344 (43%), Gaps = 94/344 (27%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWV 134

Query: 71  W-------------------------------------------------NVNSKEEAY- 80
           W                                                     + E Y 
Sbjct: 135 WPEALDMESCDWESWTSSTVRKLTASLEASLTAERDPKVLARPPARRDWDGTEKRNEVYH 194

Query: 81  LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWSSIV 137
            KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A+  +   + 
Sbjct: 195 RKIISWFGDSPLAAFGLHQLIEYGKKSGKMAGDWYGPAVVAHILRKAVEEARDPELQGVT 254

Query: 138 FHVALDNTL----VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            +VA D T+    V+++   L  + K  +     + ++++ P+RLG +  N  Y+  +K 
Sbjct: 255 VYVAQDCTVYSSDVIDRQCSLVDSGKAGT-----KAVIILFPVRLGGERTNTDYLEFVK- 308

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 309 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 342

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           PH  Q+   V  K+   E     ++HCP   ++    MDPS  +
Sbjct: 343 PHYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFKKMDPSCTI 381


>gi|157818033|ref|NP_001101418.1| cysteine protease ATG4C [Rattus norvegicus]
 gi|149044549|gb|EDL97808.1| similar to APG4 autophagy 4 homolog C [Rattus norvegicus]
          Length = 458

 Score =  143 bits (361), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 96/343 (27%), Positives = 152/343 (44%), Gaps = 89/343 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W---------------------------------------NVNSKEEA------------ 79
           W                                        V+ KE +            
Sbjct: 135 WPDALHIESSDSDSWTSNTIHKFTASFEASLSGERELRTPAVSLKETSGKHPDDHAVQSE 194

Query: 80  --YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
             + +I+  F D   A + +H++   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 IYHRQIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            +  +VA D T+  + V     T+   + + + + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNIDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
                        +LS  Y              +G+IGGKP  + YF G+  + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346

Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           H  Q+   V  K+   E     T+HCP   ++    MDPS  +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384


>gi|56118282|ref|NP_001007883.1| cysteine protease ATG4C [Xenopus (Silurana) tropicalis]
 gi|61211764|sp|Q68EP9.1|ATG4C_XENTR RecName: Full=Cysteine protease ATG4C; AltName:
           Full=Autophagy-related protein 4 homolog C
 gi|51258902|gb|AAH80152.1| apg4c protein [Xenopus (Silurana) tropicalis]
 gi|89269108|emb|CAJ81923.1| APG4 autophagy 4 homolog C (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
          Length = 450

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 97/338 (28%), Positives = 147/338 (43%), Gaps = 89/338 (26%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++++ R+D  SR+W TYR+ F  I  S  TTD GWGC LR GQM++AQ L+   LGRDW 
Sbjct: 76  NVDEFRKDFISRIWLTYREEFPQIETSSWTTDCGWGCTLRTGQMLLAQGLIVHFLGRDWT 135

Query: 71  W---------------------------------------------NVNSK---EEAYLK 82
           W                                             N + K   E+ + K
Sbjct: 136 WTEALDIFSSESEFWTANTARKLTPSLETSFSENNECVSSNKQPLHNCDKKSNSEDFHQK 195

Query: 83  ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---DWSSIVFH 139
           I+  F D   A + +HQ+   G + GK  G+W+GP  V+ +LRK  +     +   I  +
Sbjct: 196 IISWFADYPLAYFGLHQLVKLGKNSGKVAGDWYGPAVVSHLLRKAIEESSDPELQGITIY 255

Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
           VA D T+    V  L   NK        + +V+++P+RLG +  N  Y   +K   +L  
Sbjct: 256 VAQDCTIYSADVYDL-QCNKGTE-----KAVVILVPVRLGGERTNMEYFEFVKGILSL-- 307

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
                                EF     +G+IGGKP  + YF+G+  + +I++DPH  Q+
Sbjct: 308 ---------------------EFC----IGIIGGKPKQSYYFVGFQDDSLIYMDPHYCQS 342

Query: 260 IGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
              V  K    E     ++HCP   ++    MDPS  +
Sbjct: 343 FVDVSVKNFPLE-----SFHCPSPKKMSFKKMDPSCTI 375


>gi|118094640|ref|XP_422520.2| PREDICTED: cysteine protease ATG4C [Gallus gallus]
          Length = 459

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 99/344 (28%), Positives = 152/344 (44%), Gaps = 91/344 (26%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWV 134

Query: 71  W-------NVNSKE-------------EAYL----------------------------- 81
           W       N +S+              EA L                             
Sbjct: 135 WPDALDIENSDSESWTAHTVKKLTASLEASLTAEREPKILSHHQERTLRRDCGDSEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A+  +  
Sbjct: 195 VYHRKIISWFGDSPLAAFGLHQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQ 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            +  +VA D T+  + V  + C+      ++   + +++++P+RLG +  N  Y+  +K 
Sbjct: 255 GVTVYVAQDCTVYSSDVIDRQCSFMDSGETDT--KAVIILVPVRLGGERTNMDYLEFVK- 311

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           PH  Q+   V  K+   E     ++HCP   ++    MDPS  +
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFKKMDPSCTI 384


>gi|320581937|gb|EFW96156.1| cysteine protease ATG4, putative [Ogataea parapolymorpha DL-1]
          Length = 509

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 97/319 (30%), Positives = 153/319 (47%), Gaps = 74/319 (23%)

Query: 18  DITSRLWFTYRKGFVPI-----GDS-------------------GLTTDKGWGCMLRCGQ 53
           D+ SR+W TYR GF  I     G S                   G TTD GWGCM+R  Q
Sbjct: 77  DVHSRIWLTYRSGFPLIRRAEDGPSPLSFGSLIRGTVDLATVTKGFTTDAGWGCMIRTSQ 136

Query: 54  MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-EGKAVG 112
            ++A  LL L LGR W+++   +   + +I+  F D  TAP+SIH     GA+  GK  G
Sbjct: 137 SLLANGLLQLRLGRGWRYDQTRECAKHAEIVSWFVDIPTAPFSIHNFVEQGANCAGKKPG 196

Query: 113 EWFGPNTVAQVLRKL--AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPL 170
           EWFGP+  A+ ++ L  A YD     V+  A    +  +++ +L      A    + +P+
Sbjct: 197 EWFGPSAAARSIQVLCEANYDKIGLKVYFTA-SGDIYEDELFEL------AQEGAELRPV 249

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           +++  +RLG++++NP+Y + +KK                             ++PQS+G+
Sbjct: 250 LILAGIRLGVKNVNPLYWDFLKKT---------------------------LSWPQSVGI 282

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK------------EQDSEKKLDSTY 278
            GG+P+ + YF G+ G+ + +LDPH  Q    +  +            E +S   LDS  
Sbjct: 283 AGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHESPDPNHYVEVESGLDLDSV- 341

Query: 279 HCPQASRLHILHMDPSIAV 297
           H  +  +LH+  MDPS+ V
Sbjct: 342 HTNKIRKLHLDQMDPSMLV 360


>gi|340383455|ref|XP_003390233.1| PREDICTED: cysteine protease ATG4D-like [Amphimedon queenslandica]
          Length = 437

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 88/293 (30%), Positives = 143/293 (48%), Gaps = 58/293 (19%)

Query: 24  WFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN----------- 72
           W TYR GF PI  S LTTD GWGCM+R GQM++A  L    LGRDW+ +           
Sbjct: 102 WMTYRCGFSPILSSSLTTDCGWGCMVRSGQMLLATVLHLHFLGRDWRLSSSDVTGHKIHR 161

Query: 73  -VNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLR---- 125
            V +     + IL  F D  +   P+SIH++       G   G+WFGP+ V+ ++R    
Sbjct: 162 QVKNWNNYVVLILSWFGDSESELCPFSIHRLMEAAYYHGNKPGDWFGPSQVSILIRDCVR 221

Query: 126 -KLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN 184
             L ++ +   +  +V+ D T+ +  V+ +  ++         Q L++++P+RLG + +N
Sbjct: 222 RALREHINLQKLNIYVSHDCTVYIKDVQDIFESDLD-------QSLLVLVPVRLGSESLN 274

Query: 185 PVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGY 244
           P+YI  +K   AL                             ++G+IGG+P H+++FIG+
Sbjct: 275 PIYIPCVKALLALD---------------------------HTVGIIGGRPKHSVFFIGF 307

Query: 245 VGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
              ++I LDPH +Q    +   + D      S+YHC    ++ +  MDPS  +
Sbjct: 308 QDENLIHLDPHYSQTAVNMTRTDFDV-----SSYHCRSPKKIPVTKMDPSCTL 355


>gi|389750681|gb|EIM91754.1| hypothetical protein STEHIDRAFT_88418 [Stereum hirsutum FP-91666
           SS1]
          Length = 1286

 Score =  143 bits (360), Expect = 1e-31,   Method: Composition-based stats.
 Identities = 88/288 (30%), Positives = 135/288 (46%), Gaps = 77/288 (26%)

Query: 18  DITSRLWFTYRKGFVPIGDS-------------------------------------GLT 40
           D TSR+W TYR  F PI DS                                     G T
Sbjct: 342 DFTSRVWLTYRSHFQPIRDSTLTALESEQANMAHAGPVIMASSPPTKKWGWPGSGEKGWT 401

Query: 41  TDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN----VNSKEEAYLKILKMFEDRRT--AP 94
           +D GWGCMLR GQ ++A AL+ LHLGRDW+        +    Y+++L  F D  T   P
Sbjct: 402 SDAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQMLTWFFDSPTPHCP 461

Query: 95  YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKL 154
           +S+H++AL G   GK VG+WFGP+T A  ++ L      + +   +A D+ +  + V   
Sbjct: 462 FSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVHAFPEAGLGVSIASDSQIFQSDVFAA 521

Query: 155 C-------TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVK 207
                   ++ K+ +S    + ++++I +RLG+  +NP+Y   IK  Y            
Sbjct: 522 SHPPMDSPSSKKKLASTWGGRAVLVLIGIRLGLDGVNPIYYETIKALY------------ 569

Query: 208 ILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                          TFPQS+G+ GG+P+ + YF+G   +++ +LDPH
Sbjct: 570 ---------------TFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 602


>gi|380485578|emb|CCF39271.1| cysteine protease atg4 [Colletotrichum higginsianum]
          Length = 454

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 97/305 (31%), Positives = 138/305 (45%), Gaps = 59/305 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S+ W TYR  F  I  S                       G T+D GWGCM+R GQ 
Sbjct: 121 DFESKFWMTYRSEFQAIAKSTDPRASSTLSFSMRIKSQLVDQNGFTSDSGWGCMIRSGQS 180

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A A+  ++LGRDW+   N ++E   K+L  F D   APYSIHQ    GA + GK  GE
Sbjct: 181 LLANAMAAINLGRDWRRGQNPEDE--RKLLSWFADDPRAPYSIHQFVQHGAVACGKYPGE 238

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ LA   +   +  +   D   V     K     K   S  ++ P +++
Sbjct: 239 WFGPSATARCIQALANAQEQQPLRVYSTGDGPDVYED--KFMEIAKPDGS--RFNPTLIL 294

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  I PVY   +     +                           PQS+G+ GG
Sbjct: 295 VGTRLGIDKITPVYWEALIAALQM---------------------------PQSVGIAGG 327

Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           +P  + YFIG  G+ + +LDP HT   +    D    SE  +D T H  +  RLH+  +D
Sbjct: 328 RPASSHYFIGAQGSYLFYLDPHHTRPALPFHTDPSHYSEADVD-TVHTRRLRRLHVRELD 386

Query: 293 PSIAV 297
           PS+ V
Sbjct: 387 PSMLV 391


>gi|75138024|sp|Q75KP8.1|ATG4A_ORYSJ RecName: Full=Cysteine protease ATG4A; AltName:
           Full=Autophagy-related protein 4 homolog A
 gi|40539014|gb|AAR87271.1| putative autophagy protein (with alternative splicing) [Oryza
           sativa Japonica Group]
 gi|108708571|gb|ABF96366.1| Peptidase family C54 containing protein, expressed [Oryza sativa
           Japonica Group]
 gi|125586519|gb|EAZ27183.1| hypothetical protein OsJ_11120 [Oryza sativa Japonica Group]
 gi|215769128|dbj|BAH01357.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 474

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 98/302 (32%), Positives = 145/302 (48%), Gaps = 54/302 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
            D +SR+W TYRKGF  I DS  T+D  WGCM+R  QM++AQAL+F HLGR W+  +   
Sbjct: 131 EDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKP 190

Query: 76  KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---- 131
               Y+ IL MF D     +SIH +   G S G A G W GP  + +  + L   +    
Sbjct: 191 YSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVCTNREHH 250

Query: 132 -------DWSSIVFHVALDN--------TLVVNQVKKLCTT-NKRASSNPQWQPLVLVIP 175
                  ++   ++ V+ D          + ++   +LC   NK  S+   W P++L++P
Sbjct: 251 EAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKNQST---WSPILLLVP 307

Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
           L LG+  +NP YI  +K+                             TFPQSLG++GGKP
Sbjct: 308 LVLGLDKLNPRYIPLLKE---------------------------TLTFPQSLGILGGKP 340

Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
             + Y  G   +  ++LDPH  Q      D   D+ +   S+YHC     L +  +DPS+
Sbjct: 341 GTSTYIAGVQDDRALYLDPHEVQ---LAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSL 397

Query: 296 AV 297
           A+
Sbjct: 398 AI 399


>gi|40539015|gb|AAR87272.1| putative autophagy protein (with alternative splicing) [Oryza
           sativa Japonica Group]
 gi|108708572|gb|ABF96367.1| Peptidase family C54 containing protein, expressed [Oryza sativa
           Japonica Group]
          Length = 505

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 98/302 (32%), Positives = 145/302 (48%), Gaps = 54/302 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
            D +SR+W TYRKGF  I DS  T+D  WGCM+R  QM++AQAL+F HLGR W+  +   
Sbjct: 131 EDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKP 190

Query: 76  KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---- 131
               Y+ IL MF D     +SIH +   G S G A G W GP  + +  + L   +    
Sbjct: 191 YSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVCTNREHH 250

Query: 132 -------DWSSIVFHVALDN--------TLVVNQVKKLCTT-NKRASSNPQWQPLVLVIP 175
                  ++   ++ V+ D          + ++   +LC   NK  S+   W P++L++P
Sbjct: 251 EAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKNQST---WSPILLLVP 307

Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
           L LG+  +NP YI  +K+                             TFPQSLG++GGKP
Sbjct: 308 LVLGLDKLNPRYIPLLKE---------------------------TLTFPQSLGILGGKP 340

Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
             + Y  G   +  ++LDPH  Q      D   D+ +   S+YHC     L +  +DPS+
Sbjct: 341 GTSTYIAGVQDDRALYLDPHEVQ---LAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSL 397

Query: 296 AV 297
           A+
Sbjct: 398 AI 399


>gi|358381369|gb|EHK19044.1| hypothetical protein TRIVIDRAFT_181799 [Trichoderma virens Gv29-8]
          Length = 451

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 90/306 (29%), Positives = 140/306 (45%), Gaps = 60/306 (19%)

Query: 17  RDITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQ 53
            D+ ++ W TYR GF PI  S                       G ++D GWGCM+R GQ
Sbjct: 119 EDMAAKFWMTYRSGFDPIAKSVDPRATSALSFAVRIKSTLSDPTGFSSDSGWGCMIRSGQ 178

Query: 54  MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
            ++A  +  L LGRDW+     +EE   +++ MF D   APYSIH     GA+  GK  G
Sbjct: 179 SLLATTIGILQLGRDWRRGKCQQEER--QLISMFADDPRAPYSIHNFVRHGATACGKFPG 236

Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
           EWFGP+  AQ ++ L         V+       +  +   K+   + +      + P ++
Sbjct: 237 EWFGPSATAQCIQALTSASGLPLKVYSPNDGQDVYEDSFMKIAKPDGQ-----DFHPTLI 291

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           +I  RLGI  I P+Y   +     +                           PQS+G+ G
Sbjct: 292 LIRTRLGIDKITPIYWEPLLAALQM---------------------------PQSVGIAG 324

Query: 233 GKPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
           G+P+ + YF+G  G+ + +LDP HT + I    D  + +E+ ++S  H  +  RLH+  M
Sbjct: 325 GRPSSSHYFVGSQGSYLFYLDPHHTRKAIPYHADVTKYTEEDIESC-HTSRLRRLHLKEM 383

Query: 292 DPSIAV 297
           DPS+ +
Sbjct: 384 DPSMLI 389


>gi|387015378|gb|AFJ49808.1| Cysteine protease ATG4C-like [Crotalus adamanteus]
          Length = 457

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 101/344 (29%), Positives = 148/344 (43%), Gaps = 90/344 (26%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGRDW
Sbjct: 74  RNVEEFRKDFISRIWLTYREEFPQITGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDW 133

Query: 70  ---------------------------------------------------QWNVNSKEE 78
                                                              Q    S EE
Sbjct: 134 TWANAFVFENPESESWTSQTVKKLTASLETSLIGEREFRSQSTHPKSPIRNQETEESVEE 193

Query: 79  AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
            Y  +I+  F D   A + +H++   G   GK  G+W+GP  VA +LRK    A+  +  
Sbjct: 194 QYHRRIISWFADSPFANFGLHRLIEYGKKSGKIAGDWYGPAVVAHLLRKAVEKARDPELQ 253

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            I  +VA D T+  + V   LC      S     + ++++IP+RLG +  N  Y   +K 
Sbjct: 254 GITIYVAQDCTVYKSDVIDALCPFTD--SEKTSVKSIIILIPVRLGGERTNMEYFEFVK- 310

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 311 -------------GILSLDY-------------CIGIIGGKPKQSYYFAGFQDDSLIYMD 344

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           PH  Q+   V  K+   E     ++HCP   ++    MDPS  +
Sbjct: 345 PHYCQSFVDVSVKDFPLE-----SFHCPSPKKMSFKKMDPSCTI 383


>gi|189194545|ref|XP_001933611.1| peptidase family C54 protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187979175|gb|EDU45801.1| peptidase family C54 protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 470

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 79/266 (29%), Positives = 128/266 (48%), Gaps = 55/266 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
           D  SR+W TYR GF PI  S                     G T+D G+GCM+R GQ ++
Sbjct: 99  DFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCMIRSGQCIL 158

Query: 57  AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
           A AL  L LGRDW++      + +  ++ MF D   AP+SIH+    GA+  GK  GEWF
Sbjct: 159 ANALQILRLGRDWRYQEQPDAKEHCDVVAMFADDPRAPFSIHRFVEHGAAVCGKYPGEWF 218

Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVLVI 174
           GP+  A+ ++ L   +  + +  +V+ D   V  +++K++   +     + +W P ++++
Sbjct: 219 GPSAAARCIQDLVHKNREAGLKVYVSGDGADVYEDKLKEIAVDD-----DGEWHPTLILV 273

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
             RLGI  I PVY   +K    +                            QS+G+ GG+
Sbjct: 274 GTRLGIDKITPVYWEALKASLQMK---------------------------QSIGIAGGR 306

Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNI 260
           P+ + YF+    N+  +LDPH+ + +
Sbjct: 307 PSASHYFVATQANNFFYLDPHSTRPL 332


>gi|449268268|gb|EMC79138.1| Cysteine protease ATG4C [Columba livia]
          Length = 459

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 97/344 (28%), Positives = 152/344 (44%), Gaps = 91/344 (26%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWV 134

Query: 71  W----NVNSKE----------------EAYL----------------------------- 81
           W    +++S +                EA L                             
Sbjct: 135 WPDALDIDSSDSESWTAHTVKKLTASFEASLTAEREPKILSNHHRGTLKRNWDESERRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D     + +HQ+   G   GK  G+W+GP  VA +LRK    A+  +  
Sbjct: 195 VYHRKIISWFGDSPLTAFGLHQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQ 254

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
            +  +VA D T+  + V  + C+      ++   + +++++P+RLG +  N  Y+  +K 
Sbjct: 255 GVTIYVAQDCTVYSSDVIDRQCSFMDSGEADT--KAVIILVPVRLGGERTNMDYLEFVK- 311

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                         ILS  Y              +G+IGGKP  + YF G+  + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           PH  Q+   V  K+   E     ++HCP   ++    MDPS  +
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFKKMDPSCTI 384


>gi|148226916|ref|NP_001087417.1| cysteine protease ATG4D [Xenopus laevis]
 gi|61211765|sp|Q68FJ9.1|ATG4D_XENLA RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
           AltName: Full=Autophagy-related protein 4 homolog D
 gi|51260960|gb|AAH79754.1| MGC84754 protein [Xenopus laevis]
          Length = 469

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 91/334 (27%), Positives = 155/334 (46%), Gaps = 78/334 (23%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ ++D  SR+W TYR+ F  +  + LTTD GWGCM+R GQM++AQ LL   L R+W 
Sbjct: 95  EIERFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSREWT 154

Query: 71  WN-------------------------------------------VNSKEEAYLKILKMF 87
           W+                                               ++ +  I++ F
Sbjct: 155 WSEALYRHFVEMEPIRSSSPPSMPLSSLATGHSAGDYQPHTQCSGAPHGDQVHRNIMRWF 214

Query: 88  EDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTL 146
            D   +P+ +HQ+   G+  GK  G+W+GP+ VA +++K +    +   +  +V+ D T+
Sbjct: 215 SDHPGSPFGLHQLVTLGSIFGKKAGDWYGPSIVAHIIKKAIETSSEVPELSVYVSQDCTV 274

Query: 147 VVNQVKKLCTTN--KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYD 204
               +++L   +     +S    + +++++P+RLG +  NPVY + +K+   +       
Sbjct: 275 YKADIEQLFAGDVPHAETSRGAGKAVIILVPVRLGGETFNPVYKHCLKEFLRM------- 327

Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
                               P  LG+IGGKP H+LYFIGY  N +++LDPH  Q     Y
Sbjct: 328 --------------------PSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQP----Y 363

Query: 265 DKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
                ++  L+S +HC    ++ I  MDPS    
Sbjct: 364 IDTSKNDFPLES-FHCNSPRKISITRMDPSCTFA 396


>gi|328351041|emb|CCA37441.1| autophagy-related protein 4 [Komagataella pastoris CBS 7435]
          Length = 758

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 102/381 (26%), Positives = 168/381 (44%), Gaps = 106/381 (27%)

Query: 1   MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIG---DS------------------GL 39
           ++  NK S    +    D+ S++W TYR GF PI    DS                  G 
Sbjct: 51  IKDGNKKSTTYSQSFIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGF 110

Query: 40  TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA-YLKILKMFEDRRTAPYSIH 98
           T+D GWGCM+R  Q ++A ALLFLHLGRDW +         + +I+  F D    P+SIH
Sbjct: 111 TSDAGWGCMIRTSQSLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIH 170

Query: 99  QIALTG-ASEGKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCT 156
                G     K  GEWFGP+  ++ ++ L K Y      V+  +    +   +V++L  
Sbjct: 171 NFVQQGIKCCDKKPGEWFGPSAASRAIKNLCKEYPPCGLRVYFSSDCGDVYDTEVRELAY 230

Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
            +        + P+++++ +RLG++ +NPVY + +++C +L                   
Sbjct: 231 GDSDT-----FTPILVLLGIRLGVEKVNPVYWDSLRECLSL------------------- 266

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------------- 255
                    QS+G+ GG+P  + YF G+ G+ + +LDPH                     
Sbjct: 267 --------KQSVGIAGGRPCSSHYFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTK 318

Query: 256 -TNQNIGCVY-----DKEQDS------EKKLDS-------------TYHCPQASRLHILH 290
            T++N    Y     D   ++      E KLD+             + H P+ ++LH+ H
Sbjct: 319 KTDENAAGQYPVSNTDSNNETNHDDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSH 378

Query: 291 MDPSI----AVVSQRSYSDYK 307
           MDPS+     + S+  ++D+K
Sbjct: 379 MDPSMLIGFLITSEDDFNDWK 399


>gi|449442361|ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
 gi|449512710|ref|XP_004164121.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
          Length = 483

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 95/300 (31%), Positives = 146/300 (48%), Gaps = 50/300 (16%)

Query: 16  RRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNS 75
            +D +SR+  TYRKGF  I DS  T+D  WGCMLR  QM++AQALLF  LGR W+     
Sbjct: 138 EQDFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPSQK 197

Query: 76  K-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWS 134
             ++ Y++IL +F D  T+ +SIH +   G +   A G W GP  + +    L +    +
Sbjct: 198 PLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGPYAMCRSWETLVRSKRET 257

Query: 135 SI---------VFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLR 177
            I         ++ V+ D          L ++   + C    +   +  W P++L++PL 
Sbjct: 258 PILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSKGQHD--WSPILLLVPLV 315

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG++ INP YI  +                           R  FTFPQSLG++GGKP  
Sbjct: 316 LGLEKINPRYIPSL---------------------------RTTFTFPQSLGILGGKPGA 348

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           + Y +G    +  +LDPH  Q    V + ++D  +   S+YHC     + +  +DPS+A+
Sbjct: 349 STYIVGVQDENAFYLDPHEVQQ---VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAI 405


>gi|407917424|gb|EKG10733.1| Peptidase C54 [Macrophomina phaseolina MS6]
          Length = 437

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 92/317 (29%), Positives = 148/317 (46%), Gaps = 60/317 (18%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  +R+W TYR  F  I  S                       G ++D GWGCM+R GQ 
Sbjct: 108 DFEARVWITYRSNFAAIPKSQDPNATTAMSFSVRFRNQISNQGGFSSDTGWGCMIRSGQS 167

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A AL  L LGR W+   +S+ E   +IL +F D   AP+SIH+    GA + GK  GE
Sbjct: 168 LLANALQVLRLGRAWRRGQDSQGE--RRILSLFADDPKAPFSIHRFVEHGAVACGKHPGE 225

Query: 114 WFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
           WFGP+  A+ ++ L+  Y+D    V+     + +  +   K+        +N  + P ++
Sbjct: 226 WFGPSATARCIQALSNGYEDAGLRVYITGDGSDVYEDSFMKVAK-----DANNTFHPTLV 280

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           ++ +RLGI  + PVY   +K    L                            QS+G+ G
Sbjct: 281 LVGIRLGIDRVTPVYWEALKASLQLS---------------------------QSIGIAG 313

Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           G+P+ + YF+G  G+   +LDPHT +    ++    D  ++   + H  +  RLH+  MD
Sbjct: 314 GRPSASHYFVGTQGSYFFYLDPHTTRPFLPLHSDLSDYTQEDIDSCHTRRLRRLHVKEMD 373

Query: 293 PSIAVVSQ-RSYSDYKN 308
           PS+ +    R  +D++N
Sbjct: 374 PSMLIAFLIRDETDWQN 390


>gi|83773128|dbj|BAE63255.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|325504923|dbj|BAJ83603.1| cysteine protease Atg4 [Aspergillus oryzae]
          Length = 356

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 91/262 (34%), Positives = 135/262 (51%), Gaps = 36/262 (13%)

Query: 38  GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSI 97
           G T+D GWGCM+R GQ ++A A+L L LGRDW+    ++EEA  ++L +F D   AP SI
Sbjct: 76  GFTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKAEEEA--RLLSLFADHPDAPLSI 133

Query: 98  HQIALTGA-SEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
           H+    GA S GK  GEWFGP+  A+ +  L+      +I   V + N    + V +   
Sbjct: 134 HRFVKYGAESCGKHPGEWFGPSATARCIEALSA--QCGNIAPRVYVTND--TSDVYEDSF 189

Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
                S +   QP ++++  RLGI ++ PVY +G+K    L                   
Sbjct: 190 LRVARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQL------------------- 230

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLD 275
                   PQS+G+ GG+P+ + YFIG  G    +LDPHT +  +    D    S+ ++ 
Sbjct: 231 --------PQSVGIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEI- 281

Query: 276 STYHCPQASRLHILHMDPSIAV 297
           STYH  +  R+HI  MDPS+ +
Sbjct: 282 STYHTRRLRRIHIQDMDPSMLI 303


>gi|254567087|ref|XP_002490654.1| Conserved cysteine protease required for autophagy [Komagataella
           pastoris GS115]
 gi|238030450|emb|CAY68374.1| Conserved cysteine protease required for autophagy [Komagataella
           pastoris GS115]
          Length = 531

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 102/381 (26%), Positives = 168/381 (44%), Gaps = 106/381 (27%)

Query: 1   MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIG---DS------------------GL 39
           ++  NK S    +    D+ S++W TYR GF PI    DS                  G 
Sbjct: 51  IKDGNKKSTTYSQSFIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGF 110

Query: 40  TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA-YLKILKMFEDRRTAPYSIH 98
           T+D GWGCM+R  Q ++A ALLFLHLGRDW +         + +I+  F D    P+SIH
Sbjct: 111 TSDAGWGCMIRTSQSLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIH 170

Query: 99  QIALTG-ASEGKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCT 156
                G     K  GEWFGP+  ++ ++ L K Y      V+  +    +   +V++L  
Sbjct: 171 NFVQQGIKCCDKKPGEWFGPSAASRAIKNLCKEYPPCGLRVYFSSDCGDVYDTEVRELAY 230

Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
            +        + P+++++ +RLG++ +NPVY + +++C +L                   
Sbjct: 231 GDSDT-----FTPILVLLGIRLGVEKVNPVYWDSLRECLSL------------------- 266

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------------- 255
                    QS+G+ GG+P  + YF G+ G+ + +LDPH                     
Sbjct: 267 --------KQSVGIAGGRPCSSHYFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTK 318

Query: 256 -TNQNIGCVY-----DKEQDS------EKKLDS-------------TYHCPQASRLHILH 290
            T++N    Y     D   ++      E KLD+             + H P+ ++LH+ H
Sbjct: 319 KTDENAAGQYPVSNTDSNNETNHDDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSH 378

Query: 291 MDPSI----AVVSQRSYSDYK 307
           MDPS+     + S+  ++D+K
Sbjct: 379 MDPSMLIGFLITSEDDFNDWK 399


>gi|330935035|ref|XP_003304808.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
 gi|311318464|gb|EFQ87127.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
          Length = 470

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 80/266 (30%), Positives = 127/266 (47%), Gaps = 55/266 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
           D  SR+W TYR GF PI  S                     G T+D G+GCM+R GQ ++
Sbjct: 99  DFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCMIRSGQCIL 158

Query: 57  AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
           A AL  L LGRDW++      + +  I+ MF D   AP+SIH+    GA+  GK  GEWF
Sbjct: 159 ANALQILRLGRDWRYQEQPDAKEHCDIVAMFADDPRAPFSIHRFVEHGAAVCGKYPGEWF 218

Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVLVI 174
           GP+  A+ ++ L   +    +  +V+ D   V  +++K++   +     + +W P ++++
Sbjct: 219 GPSAAARCIQDLVHKNKEVGLKVYVSGDGADVYEDKLKEIAVDD-----DGEWHPTLILV 273

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
             RLGI  I PVY   +K    +                            QS+G+ GG+
Sbjct: 274 GTRLGIDKITPVYWEALKASLQMK---------------------------QSIGIAGGR 306

Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNI 260
           P+ + YF+    N+  +LDPH+ + +
Sbjct: 307 PSASHYFVATQANNFFYLDPHSTRPL 332


>gi|71022117|ref|XP_761289.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
 gi|46097783|gb|EAK83016.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
          Length = 1541

 Score =  141 bits (356), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 92/332 (27%), Positives = 147/332 (44%), Gaps = 100/332 (30%)

Query: 37   SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW--------------------------- 69
            +GLTTD GWGCMLR GQ ++A ALL +HLGR W                           
Sbjct: 818  AGLTTDSGWGCMLRTGQSLLANALLNVHLGRSWLREAPPMRQMDFLEQLASLSLDSSVEM 877

Query: 70   ----QW-NVNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
                +W    ++  AY+KIL  F D  +   P+ +H++A  G   GK VGEWFGP+T A 
Sbjct: 878  QSLQEWREKRARHAAYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAG 937

Query: 123  VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN--------KRASSNPQWQ-PLVLV 173
             +++L      + I   +A D    +++V+              ++  +   W+ P+V++
Sbjct: 938  AIKQLVTEFPDAGIAVELAHDGVFYLDEVRLAAGARSALQSGKGRQGDAAVTWRRPVVIL 997

Query: 174  IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
            I +RLG+  +NP+Y   +K+                            F+FP S+G+ GG
Sbjct: 998  IGIRLGLDSVNPIYYESVKET---------------------------FSFPHSVGIAGG 1030

Query: 234  KPNHALYFIGYVGNDVIFLDPH-TNQNIGCVY----------------------DKEQDS 270
            +P+ + YF+G+ GN + +LDPH     +   Y                      DK+ + 
Sbjct: 1031 RPSSSYYFMGHQGNSLFYLDPHNVRPAVALRYPPSTFPTAVPHQLDVAHRFALEDKDDEL 1090

Query: 271  E-------KKLDSTYHCPQASRLHILHMDPSI 295
            E       +   ST+HC +  R+ I  +DPS+
Sbjct: 1091 EWWSHAYTEAQTSTFHCEKVRRMPIKSLDPSM 1122


>gi|356568569|ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 485

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 100/301 (33%), Positives = 149/301 (49%), Gaps = 46/301 (15%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           L    +D +S++  TYRKGF  IGD+  T+D  WGCMLR  QM++AQALLF  LGR W+ 
Sbjct: 139 LASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRK 198

Query: 72  NVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA-K 129
            ++   ++ Y+ +L++F D   + +SIH +   G   G AVG W GP  + +    LA K
Sbjct: 199 PIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLARK 258

Query: 130 YDDWSS-----IVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
            +D         ++ V+ D          + +    K C+  + +S    W PL+L++PL
Sbjct: 259 KNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCS--EFSSGLAVWTPLLLLVPL 316

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  +NP YI                   +L ST         F FPQSLG++GGKP 
Sbjct: 317 VLGLDKVNPRYI------------------PLLRST---------FKFPQSLGIMGGKPG 349

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y IG       +LDPH  Q +  +    Q  E    S+YHC     + +  +DPS+A
Sbjct: 350 ASTYIIGVQNEKAFYLDPHDVQQVVNISGDTQ--EPTGTSSYHCNVMRHIPLDSIDPSLA 407

Query: 297 V 297
           +
Sbjct: 408 I 408


>gi|356531828|ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 486

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 101/299 (33%), Positives = 149/299 (49%), Gaps = 42/299 (14%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           L    +D +S++  TYRKGF  IGD+  T+D  WGCMLR  QM++AQALLF  LGR W+ 
Sbjct: 139 LASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQMLVAQALLFHKLGRSWRK 198

Query: 72  NVNS-KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA-K 129
            ++   ++ Y+ +L++F D   + +SIH +   G   G AVG W GP  + +    LA K
Sbjct: 199 PIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLARK 258

Query: 130 YDDWSSI-----VFHVALDNTLVVNQVKKLC--TTNKR----ASSNPQWQPLVLVIPLRL 178
            +D   +     ++ V+ D          +C    +KR    +S    W PL+L++PL L
Sbjct: 259 KNDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCFEFSSGLAAWTPLLLLVPLVL 318

Query: 179 GIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHA 238
           G+  +NP YI                   +L ST         F FPQSLG++GGKP  +
Sbjct: 319 GLDKVNPRYI------------------PLLRST---------FKFPQSLGIMGGKPGAS 351

Query: 239 LYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            Y IG       +LDPH  Q +  +    Q  E    S+YHC     + +  +DPS+A+
Sbjct: 352 TYIIGAQNEKAFYLDPHDVQQVVNISGDTQ--EPTSTSSYHCNIMRHIPLDSIDPSLAI 408


>gi|440478911|gb|ELQ59709.1| cysteine protease atg4 [Magnaporthe oryzae P131]
          Length = 572

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 97/311 (31%), Positives = 140/311 (45%), Gaps = 66/311 (21%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR GF PI  S                       G TTD GWGCM+R GQ 
Sbjct: 235 DFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRTGQS 294

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A +LL   LGR W+      EE   K+L +F D   APYSIH     GA++ GK  GE
Sbjct: 295 LLANSLLTCRLGRSWRRGQAPDEE--RKLLSLFADDPRAPYSIHNFVAHGAAKCGKYPGE 352

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ +  LA   + S  V+       +  +   ++  ++ +      + P +++
Sbjct: 353 WFGPSATARCIHALANATENSFRVYSTGDLPDVYEDSFMEVAKSDGKT-----FHPTLIL 407

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           I  RLGI  IN VY   +     L                           PQS+G+ GG
Sbjct: 408 ISTRLGIDKINQVYWESLTATLQL---------------------------PQSVGIAGG 440

Query: 234 KPNHALYFIGYVGND------VIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
           +P+ + YF+G   +D      + +LDP HT   +    D +  +   +DS  H  +  RL
Sbjct: 441 RPSSSHYFVGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSC-HTRRLRRL 499

Query: 287 HILHMDPSIAV 297
           HI  MDPS+ +
Sbjct: 500 HIREMDPSMLI 510


>gi|444525500|gb|ELV14047.1| Cysteine protease ATG4D [Tupaia chinensis]
          Length = 431

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 78/243 (32%), Positives = 120/243 (49%), Gaps = 42/243 (17%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 84  DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 143

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 144 WAEGMGLGPPELSRSASPSRYHGPAHWRPPRWAQGTPELEQERRHRQIVSWFADHPRAPF 203

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
            +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V+ D T+    V +L
Sbjct: 204 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVVRL 263

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                R     +W+ +V+++P+RLG + +NPVY+  +K    +P  P  D +  L   Y 
Sbjct: 264 VA---RPDPAAEWKSVVILVPVRLGGETLNPVYVPCVK---LMPTPPTDDFLLYLDPHYC 317

Query: 215 MQT 217
             T
Sbjct: 318 QPT 320


>gi|391868733|gb|EIT77943.1| cysteine protease required for autophagy - Apg4p/Aut2p [Aspergillus
           oryzae 3.042]
          Length = 357

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 91/262 (34%), Positives = 135/262 (51%), Gaps = 36/262 (13%)

Query: 38  GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSI 97
           G T+D GWGCM+R GQ ++A A+L L LGRDW+    ++EEA  ++L +F D   AP SI
Sbjct: 76  GFTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKAEEEA--RLLSLFADHPDAPLSI 133

Query: 98  HQIALTGA-SEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
           H+    GA S GK  GEWFGP+  A+ +  L+      +I   V + N    + V +   
Sbjct: 134 HRFVKYGAESCGKHPGEWFGPSATARCIEALSA--QCGNIAPRVYVTND--TSDVYEDSF 189

Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
                S +   QP ++++  RLGI ++ PVY +G+K    L                   
Sbjct: 190 LRVARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQL------------------- 230

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLD 275
                   PQS+G+ GG+P+ + YFIG  G    +LDPHT +  +    D    S+ ++ 
Sbjct: 231 --------PQSVGIAGGRPSASHYFIGTQGPYFFYLDPHTTRPAVPYSIDGRLLSKTEI- 281

Query: 276 STYHCPQASRLHILHMDPSIAV 297
           STYH  +  R+HI  MDPS+ +
Sbjct: 282 STYHTRRLRRIHIQDMDPSMLI 303


>gi|310801857|gb|EFQ36750.1| peptidase family C54 [Glomerella graminicola M1.001]
          Length = 454

 Score =  140 bits (353), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 95/305 (31%), Positives = 139/305 (45%), Gaps = 59/305 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S+ W TYR  F  I  S                       G ++D GWGCM+R GQ 
Sbjct: 121 DFESKFWMTYRSEFPAIAKSTDPRASSALSFSMRIKSQLVDQNGFSSDSGWGCMIRSGQS 180

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A A+  ++LGRDW+   N +EE   K+L +F D   APYSIHQ    GA + GK  GE
Sbjct: 181 LLANAMAVINLGRDWRRGQNQEEER--KLLSLFADDPRAPYSIHQFVQHGAVACGKYPGE 238

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ LA       +  +   D   V     K     K   S  ++ P +++
Sbjct: 239 WFGPSATARCIQALANAQMHQPLRVYSTGDGPDVYED--KFMKIAKPDGS--RFHPTLIL 294

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  I PVY   +     +                           PQS+G+ GG
Sbjct: 295 VGTRLGIDKITPVYWEALIAALQM---------------------------PQSVGIAGG 327

Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YFIG  G+ + +LDP HT   +    +    SE  +D T H  +  RLH+  +D
Sbjct: 328 RPSSSHYFIGAQGSYLFYLDPHHTRPALPFHMNPSLYSEADVD-TVHTRRLRRLHVRELD 386

Query: 293 PSIAV 297
           PS+ +
Sbjct: 387 PSMLI 391


>gi|389637385|ref|XP_003716330.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
 gi|148887340|sp|Q523C3.2|ATG4_MAGO7 RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|351642149|gb|EHA50011.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
          Length = 491

 Score =  140 bits (353), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 97/311 (31%), Positives = 139/311 (44%), Gaps = 66/311 (21%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR GF PI  S                       G TTD GWGCM+R GQ 
Sbjct: 154 DFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRTGQS 213

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A +LL   LGR W+      EE   K+L +F D   APYSIH     GA++ GK  GE
Sbjct: 214 LLANSLLTCRLGRSWRRGQAPDEE--RKLLSLFADDPRAPYSIHNFVAHGAAKCGKYPGE 271

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ +  LA   + S  V+       +  +   ++   + +      + P +++
Sbjct: 272 WFGPSATARCIHALANATENSFRVYSTGDLPDVYEDSFMEVAKPDGKT-----FHPTLIL 326

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           I  RLGI  IN VY   +     L                           PQS+G+ GG
Sbjct: 327 ISTRLGIDKINQVYWESLTATLQL---------------------------PQSVGIAGG 359

Query: 234 KPNHALYFIGYVGND------VIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
           +P+ + YF+G   +D      + +LDP HT   +    D +  +   +DS  H  +  RL
Sbjct: 360 RPSSSHYFVGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSC-HTRRLRRL 418

Query: 287 HILHMDPSIAV 297
           HI  MDPS+ +
Sbjct: 419 HIREMDPSMLI 429


>gi|3212867|gb|AAC23418.1| unknown protein [Arabidopsis thaliana]
          Length = 451

 Score =  140 bits (352), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 88/303 (29%), Positives = 135/303 (44%), Gaps = 65/303 (21%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           L  ++ D +S++  TYRKGF P  D+  T+D  WGCM+R  QM+ AQ             
Sbjct: 135 LAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQL------------ 182

Query: 72  NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA--- 128
                E+ YL+ L+ F D   + +SIH + + GAS G A G W GP  + +    LA   
Sbjct: 183 ----PEQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLACKK 238

Query: 129 -KYDDWSSIVFHVAL-------------DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
            K  D  +    +A+                L +    K C    +  S  +W P++L++
Sbjct: 239 RKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQS--EWTPIILLV 296

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
           PL LG+  +NP YI  +                              FTFPQS+G++GGK
Sbjct: 297 PLVLGLDSVNPRYIPSLVA---------------------------TFTFPQSVGILGGK 329

Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
           P  + Y +G   +   +LDPH  Q +  V  +  D +    S+YHC     + +  +DPS
Sbjct: 330 PGASTYIVGVQEDKGFYLDPHEVQQVVTVNKETPDVDT---SSYHCNVLRYVPLESLDPS 386

Query: 295 IAV 297
           +A+
Sbjct: 387 LAL 389


>gi|440467300|gb|ELQ36530.1| cysteine protease atg4 [Magnaporthe oryzae Y34]
          Length = 572

 Score =  140 bits (352), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 97/311 (31%), Positives = 139/311 (44%), Gaps = 66/311 (21%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR GF PI  S                       G TTD GWGCM+R GQ 
Sbjct: 235 DFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRTGQS 294

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A +LL   LGR W+      EE   K+L +F D   APYSIH     GA++ GK  GE
Sbjct: 295 LLANSLLTCRLGRSWRRGQAPDEE--RKLLSLFADDPRAPYSIHNFVAHGAAKCGKYPGE 352

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ +  LA   + S  V+       +  +   ++   + +      + P +++
Sbjct: 353 WFGPSATARCIHALANATENSFRVYSTGDLPDVYEDSFMEVAKPDGKT-----FHPTLIL 407

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           I  RLGI  IN VY   +     L                           PQS+G+ GG
Sbjct: 408 ISTRLGIDKINQVYWESLTATLQL---------------------------PQSVGIAGG 440

Query: 234 KPNHALYFIGYVGND------VIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
           +P+ + YF+G   +D      + +LDP HT   +    D +  +   +DS  H  +  RL
Sbjct: 441 RPSSSHYFVGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSC-HTRRLRRL 499

Query: 287 HILHMDPSIAV 297
           HI  MDPS+ +
Sbjct: 500 HIREMDPSMLI 510


>gi|402080175|gb|EJT75320.1| cysteine protease ATG4 [Gaeumannomyces graminis var. tritici
           R3-111a-1]
          Length = 468

 Score =  140 bits (352), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 94/310 (30%), Positives = 141/310 (45%), Gaps = 64/310 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W +YR GF PI  S                         TTD GWGCM+R GQ 
Sbjct: 131 DFESRIWVSYRSGFPPIPRSTDPAATSRMSFAMRLKTMTDQQAAFTTDSGWGCMIRTGQS 190

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A  LL   LGR W+    S EE   K+L +F D   APYSIH+    GA++ GK  GE
Sbjct: 191 LLANTLLSHRLGRGWRRGEKSDEE--RKLLSLFADDPRAPYSIHKFVEHGAAKCGKYPGE 248

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ +  LA  ++ +  V+       +  +   ++   + +      + P +++
Sbjct: 249 WFGPSATARCIEALANTNEKTLRVYSTGDLPDVYEDSFMEVARPDGKT-----FHPTLIL 303

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  IN VY                     L++T  M         PQS+G+ GG
Sbjct: 304 VSTRLGIDKINQVYWES------------------LTATLQM---------PQSVGIAGG 336

Query: 234 KPNHALYFIGY------VGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           +P+ + YF+G        G+++ +LDPH  +     +D  Q        + H  +  RLH
Sbjct: 337 RPSSSHYFVGAQRSDEDQGSNLFYLDPHHTRPALPYFDDPQLYTPSDVDSCHTRRLRRLH 396

Query: 288 ILHMDPSIAV 297
           I  MDPS+ +
Sbjct: 397 IREMDPSMLI 406


>gi|388856806|emb|CCF49593.1| related to ATG4-essential for autophagy [Ustilago hordei]
          Length = 1572

 Score =  139 bits (351), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 89/344 (25%), Positives = 148/344 (43%), Gaps = 112/344 (32%)

Query: 37   SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV----------------------- 73
            +GLTTD GWGCMLR GQ ++A AL+ +HLGR WQ +                        
Sbjct: 828  AGLTTDSGWGCMLRTGQSLLANALINVHLGRSWQRDAPPLRQQQFLEELAGLSIADAAEK 887

Query: 74   ---------NSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
                      ++   Y+KIL  F D  +   P+ +H++A  G   GK VGEWFGP+T + 
Sbjct: 888  ESLQEWRQKRARHATYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTASG 947

Query: 123  VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKL--------------------CTTNKRAS 162
             +++L      + I   +A D    +++V+                        +  R  
Sbjct: 948  AIKQLVSEFPQAGIAVELARDGVFYLDEVRAAASASASAASVQSGGKARSSGAASGSRKG 1007

Query: 163  SNPQW-QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                W +P++++I +RLG++ +NP+Y   +K                             
Sbjct: 1008 EGLIWRRPVLILIGIRLGLESVNPIYYESVKA---------------------------T 1040

Query: 222  FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH------------------TNQNIGCV 263
            F+FP S+G+ GG+P+ + YF+G+ GN + +LDPH                    +++G  
Sbjct: 1041 FSFPHSVGIAGGRPSSSYYFMGHQGNSLFYLDPHNVRPAVPLRYPPSTFPDAVPRHLGIA 1100

Query: 264  Y-----DKEQDSE-------KKLDSTYHCPQASRLHILHMDPSI 295
            +     DK+ + E       +   ST+HC +  R+ I  +DPS+
Sbjct: 1101 HRFVLEDKDDEDEWWSHAYSEVQTSTFHCEKVRRMPIKSLDPSM 1144


>gi|348666332|gb|EGZ06159.1| hypothetical protein PHYSODRAFT_532364 [Phytophthora sojae]
          Length = 398

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 84/258 (32%), Positives = 124/258 (48%), Gaps = 45/258 (17%)

Query: 16  RRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNS 75
           +R   + LWFTYR+ F  +     T+D GWGCMLR  QM++ QAL    LGRDW+     
Sbjct: 42  KRSFEAILWFTYRRDFPQMTPYDFTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPALF 101

Query: 76  KEE-------AYLKILKMFEDRR--TAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
           + E        Y+ +L+ F D       YSIH +   G    K  GEW+GP T AQVLR 
Sbjct: 102 EAEIDARLPDKYVTLLRWFADSPDIECRYSIHHMVKLGMQYDKLPGEWYGPTTAAQVLRD 161

Query: 127 LA---KYDDWSSIVFHVALDNTLVVNQVKKLCTTN-----KRASSNPQWQ-PLVLVIPLR 177
           L    + +    +  +V  +  +  + V +LC  +       A  +  W   L+++IPLR
Sbjct: 162 LVNLHRREFGGELAMYVPQEGVVYTDDVTRLCFFDPLLHPPTAEDSSDWSTALLILIPLR 221

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG+  +N  Y+  ++K +A                           FPQS+G+IGGK  H
Sbjct: 222 LGLDQVNERYVPALEKTFA---------------------------FPQSVGIIGGKKGH 254

Query: 238 ALYFIGYVGNDVIFLDPH 255
           ++YF+G   + +  LDPH
Sbjct: 255 SVYFVGTQQDQLHLLDPH 272


>gi|393219109|gb|EJD04597.1| hypothetical protein FOMMEDRAFT_133827 [Fomitiporia mediterranea
           MF3/22]
          Length = 1147

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 95/306 (31%), Positives = 141/306 (46%), Gaps = 99/306 (32%)

Query: 18  DITSRLWFTYRKGFVPI---------------------------------GDSGLTTDKG 44
           D +SR+W TYR  + PI                                 G+ G T+D G
Sbjct: 345 DFSSRVWLTYRSHYPPIRDQTLAQLEAEASGQIPLQPVSASPRKWHILGSGEKGWTSDSG 404

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQ------WNVNSKEEAYLKILKMFEDRRT--APYS 96
           WGCMLR GQ ++A AL+ LHLGRDW+      + V+     Y+KIL  F D      P+S
Sbjct: 405 WGCMLRTGQSLLANALIHLHLGRDWRRPPQPVYTVDYA--TYVKILTWFFDSTDIHCPFS 462

Query: 97  IHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
           +H++AL G   GK VG+WFGP+T A  ++ +      + +   VA D   VV +   L  
Sbjct: 463 VHRMALAGKDLGKDVGQWFGPSTAAGAIKTVVHAFAEAGLGVSVATDG--VVYETDVLAA 520

Query: 157 TN--------KRASSNPQ-----------------W--QPLVLVIPLRLGIQDINPVYIN 189
           +N         R +++                   W  +P+++++ +RLGI  +NPVY  
Sbjct: 521 SNAGPYMYRHSRMATSSPSTRRRRSAQQQQSMMSIWGQRPVLVLVGIRLGIDCVNPVY-- 578

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
                        YD VK L            FTFPQS+G+ GG+P+ + YF+G   +++
Sbjct: 579 -------------YDAVKAL------------FTFPQSVGIAGGRPSSSYYFVGVQTDNL 613

Query: 250 IFLDPH 255
            +LDPH
Sbjct: 614 FYLDPH 619


>gi|346975631|gb|EGY19083.1| peptidase family C54 protein [Verticillium dahliae VdLs.17]
          Length = 449

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 90/305 (29%), Positives = 140/305 (45%), Gaps = 60/305 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S+ W TYR  F PI  S                       G ++D GWGCM+R GQ 
Sbjct: 117 DFESKFWMTYRSDFEPIAKSTDPRAASVLSLSMRIKSQFMDQAGYSSDSGWGCMIRSGQS 176

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A A+  L LGRDW+  V +++E   ++L  F D   APYSIH+    GA + GK  GE
Sbjct: 177 LLANAMAVLDLGRDWRRGVAAEKER--QLLSKFADDPKAPYSIHRFVQHGAVACGKYPGE 234

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L   ++    V+       +  ++   +        S   + P +++
Sbjct: 235 WFGPSATARCIQALVNANEPHLRVYSTGDGPDVYEDRFFDIAK-----PSGETFHPTLIL 289

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  I PVY + +     +                           PQS+G+ GG
Sbjct: 290 VGTRLGIDKITPVYWDALIAALQM---------------------------PQSIGIAGG 322

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVY-DKEQDSEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YFIG  G+ + +LDPH  +     Y D    ++  +DS  H  +  RLH+  MD
Sbjct: 323 RPSSSHYFIGAQGSFLFYLDPHHTRTALPYYQDPTLYAQADVDSV-HTRRLRRLHVREMD 381

Query: 293 PSIAV 297
           PS+ +
Sbjct: 382 PSMLI 386


>gi|392574855|gb|EIW67990.1| hypothetical protein TREMEDRAFT_63874 [Tremella mesenterica DSM
           1558]
          Length = 1159

 Score =  139 bits (349), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 88/255 (34%), Positives = 130/255 (50%), Gaps = 64/255 (25%)

Query: 36  DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE----------------A 79
           + GLTTD GWGCMLR GQ ++A AL+ LHLGRDW+  V S+ +                +
Sbjct: 577 ERGLTTDAGWGCMLRTGQSLLANALIHLHLGRDWR--VPSQPQVPPTSAAHLAELEAYSS 634

Query: 80  YLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
           Y++IL  F D  +   P+S+H+IAL G   GK VGEWFGP+T A  L+ L      S + 
Sbjct: 635 YVRILSWFLDDPSPLCPFSVHRIALIGKELGKEVGEWFGPSTAAGALKTLVNSFPPSGMA 694

Query: 138 FHVALDNTLVVNQV---KKLCTTNKRASSNP------------QW--QPLVLVIPLRLGI 180
              A+D+ +  + V     L +T     S P             W  + ++++I +RLG+
Sbjct: 695 VATAVDSIVYKSDVYSASNLQSTGWSDESAPPRRQSSSSRSSTSWGNRAVLVLIGIRLGL 754

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
             +NP+Y               Y+ +K L            FTFPQS+G+ GG+P+ + Y
Sbjct: 755 DGVNPLY---------------YESIKAL------------FTFPQSVGIAGGRPSSSYY 787

Query: 241 FIGYVGNDVIFLDPH 255
           F+G   N +++LDPH
Sbjct: 788 FVGTQANSLVYLDPH 802


>gi|357507987|ref|XP_003624282.1| Cysteine protease ATG4 [Medicago truncatula]
 gi|147742964|sp|A2Q1V6.1|ATG4_MEDTR RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|124359485|gb|ABN05923.1| Peptidase C54 [Medicago truncatula]
 gi|355499297|gb|AES80500.1| Cysteine protease ATG4 [Medicago truncatula]
          Length = 487

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 95/300 (31%), Positives = 139/300 (46%), Gaps = 48/300 (16%)

Query: 15  IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
             +D  SR+  TYRKGF  I DS  T+D  WGCMLR  QM++AQALLF  LGR W+  V+
Sbjct: 142 FEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRKTVD 201

Query: 75  SK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
              ++ Y+ IL++F D   A +SIH +   G   G AVG W GP  + +    LA+    
Sbjct: 202 KPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLAR---- 257

Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN----PVYI- 188
                                   N+R  +    Q L + I +  G +D      PV   
Sbjct: 258 ------------------------NQREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCI 293

Query: 189 -NGIKKCYALPISPV----YDMVKILSSTYNMQTPRY------EFTFPQSLGVIGGKPNH 237
            +  K+C       V      ++  L    +    RY       F FPQSLG++GGKP  
Sbjct: 294 EDACKRCLEFSRGLVPWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGA 353

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           + Y IG   +   +LDPH    +  V +   D+++   S+YHC  +  + +  +DPS+A+
Sbjct: 354 STYIIGVQNDKAFYLDPH---EVKPVVNITGDTQEPNTSSYHCNISRHMPLDSIDPSLAI 410


>gi|388514549|gb|AFK45336.1| unknown [Lotus japonicus]
          Length = 489

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 139/300 (46%), Gaps = 48/300 (16%)

Query: 15  IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
             +D +S++  TYRKGF  IGDS  T+D  WGCMLR  QM++AQALLF  LGR W+   +
Sbjct: 143 FEQDFSSKILLTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHKLGRMWRKTTD 202

Query: 75  SK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
              ++ YL IL+ F D   + +SIH +   G   G AVG W GP  + +    LA+    
Sbjct: 203 KPLDKEYLDILQHFGDSEASSFSIHNLLQAGKGYGLAVGSWVGPYAMCRSWEVLAR---- 258

Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN----PVYI- 188
                                   N+R +++   QPL + + +  G +D      PV   
Sbjct: 259 ------------------------NQRETNDHGEQPLPMALYVVSGDEDGERGGAPVVCI 294

Query: 189 -NGIKKCY----ALPISPVYDMVKILSSTYNMQTPRY------EFTFPQSLGVIGGKPNH 237
            +  ++C      L       ++  L    +    RY       F FPQSLG++GGKP  
Sbjct: 295 EDASRRCSEFSRGLAAWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGA 354

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           + Y IG       +LDPH  Q +  +    QD      S+YHC    ++ +  +DPS+A+
Sbjct: 355 STYIIGVQNEKAFYLDPHDVQPVVHINGDAQDPNT---SSYHCNIVRQMPLDSIDPSLAI 411


>gi|113931596|ref|NP_001039246.1| autophagy related 4D, cysteine peptidase [Xenopus (Silurana)
           tropicalis]
 gi|89273389|emb|CAJ82151.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
 gi|114108226|gb|AAI22932.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
           tropicalis]
          Length = 470

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 93/333 (27%), Positives = 153/333 (45%), Gaps = 77/333 (23%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++++ ++D  SR+W TYR+ F  +  + LTTD GWGCM+R GQM++AQ LL   L R+W 
Sbjct: 95  EIDRFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSREWT 154

Query: 71  WN------------------------------------------VNSKEEAYLKILKMFE 88
           W+                                               E +  I+  F 
Sbjct: 155 WSEALYTHFVEMEPIRSSSPSSMPLSLATDHSGRHSQPQTHCSRAPYGGEVHQNIVSWFS 214

Query: 89  DRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLV 147
           D  +AP+ +H++   G+  GK  G+W+GP+ VA +++K +    +   +  +V+ D T+ 
Sbjct: 215 DHASAPFGLHRMVALGSIFGKRAGDWYGPSIVAHIIKKAIESSSEVPDLSVYVSQDCTVY 274

Query: 148 VNQVKKLCTTN--KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDM 205
              +++L         +S    + +++++P RLG +  NPVY + +K+   +        
Sbjct: 275 KADIEQLFAGEVPHTDTSRGAGKAVIILVPARLGGETFNPVYKHCLKEFLRM-------- 326

Query: 206 VKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYD 265
                              P  LG+IGGKP H+LYFIGY  N +++LDPH  Q      D
Sbjct: 327 -------------------PSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQPY---ID 364

Query: 266 KEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
             +D+   L+S +HC    +L I  MDPS    
Sbjct: 365 TSRDN-FPLES-FHCNAPRKLSITRMDPSCTFA 395


>gi|145345840|ref|XP_001417407.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577634|gb|ABO95700.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 348

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 92/288 (31%), Positives = 136/288 (47%), Gaps = 40/288 (13%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
           RD  SR W TYR+GF  +G +   TD GWGC LR  QM++A AL     GR W+  V +K
Sbjct: 27  RDWGSRCWMTYRRGFEALGRTKWRTDAGWGCTLRSAQMMVANALSIHTRGRHWRRQVKAK 86

Query: 77  E--EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL--AKYDD 132
           E  E+   +L MF D  +AP+SIH +  T  + G   G WF P+ + +    L  A  D 
Sbjct: 87  EDDESVDHVLSMFIDDASAPFSIHSVCETTTAWGAPPGRWFEPSVMCRAFSALIEANGDL 146

Query: 133 WSSIVFHV--ALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI-QDINPVYIN 189
            + I  HV    +       V  +     RA S    + L+L +PL LG+ ++IN  YI+
Sbjct: 147 RNQIAVHVVGGQNEDDSAGGVPTIDDGELRAKSADVGKALLLFVPLVLGVGRNINTRYIS 206

Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
            ++   A                           F QS+GVIGG+PN +LY +G+  +  
Sbjct: 207 QLRSIIA---------------------------FKQSIGVIGGRPNASLYLVGHSDDVF 239

Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            +LDPHT Q           +E     +Y+C    ++    +DP++A+
Sbjct: 240 FYLDPHTVQPANSF------AEAVDFDSYYCSTPLQMRGELLDPTLAL 281


>gi|194696780|gb|ACF82474.1| unknown [Zea mays]
 gi|413920008|gb|AFW59940.1| autophagy 4b variant 3 [Zea mays]
          Length = 462

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 93/301 (30%), Positives = 145/301 (48%), Gaps = 52/301 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
            D +SR+W TYRKGF  I  S LT+D  WGCM+R  QM++AQAL+F HLGR W+      
Sbjct: 117 EDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKP 176

Query: 77  -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
            +  Y+++L +F D     +SIH +   G + G A G W GP  + +  + L +      
Sbjct: 177 YDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQA 236

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                 +++   ++ V+ D            ++   +LC+   +      W P++L+IPL
Sbjct: 237 DAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCT--WSPILLLIPL 294

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  INP YI  +K+                            F FPQSLG++GGKP 
Sbjct: 295 VLGLDKINPRYIPLLKE---------------------------TFKFPQSLGILGGKPG 327

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y  G   +  ++LDPH   ++    D   D+ +   S+YHC     L +  +DPS+A
Sbjct: 328 TSTYIAGVQEDRALYLDPH---DVQMAVDIAPDNLEADTSSYHCSVVRDLALEQIDPSLA 384

Query: 297 V 297
           +
Sbjct: 385 I 385


>gi|315259988|gb|ADT92194.1| autophagy-related 4b [Zea mays]
          Length = 595

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 93/300 (31%), Positives = 145/300 (48%), Gaps = 52/300 (17%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK- 76
           D +SR+W TYRKGF  I  S LT+D  WGCM+R  QM++AQAL+F HLGR W+       
Sbjct: 148 DFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKPY 207

Query: 77  EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY------ 130
           +  Y+++L +F D     +SIH +   G + G A G W GP  + +  + L +       
Sbjct: 208 DPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQAD 267

Query: 131 -----DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLR 177
                +++   ++ V+ D            ++   +LC+   +      W P++L+IPL 
Sbjct: 268 AVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCT--WSPILLLIPLV 325

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG+  INP YI  +K+                            F FPQSLG++GGKP  
Sbjct: 326 LGLDKINPRYIPLLKE---------------------------TFKFPQSLGILGGKPGT 358

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           + Y  G   +  ++LDPH   ++    D   D+ +   S+YHC     L +  +DPS+A+
Sbjct: 359 STYIAGVQEDRALYLDPH---DVQMAVDIAPDNLEADTSSYHCSVVRDLALEQIDPSLAI 415


>gi|221137006|ref|NP_001137489.1| autophagy-related 4b [Zea mays]
 gi|194701156|gb|ACF84662.1| unknown [Zea mays]
 gi|195657359|gb|ACG48147.1| cysteine protease ATG4B [Zea mays]
 gi|216963250|gb|ACJ73914.1| autophagy-related 4b variant 1 [Zea mays]
 gi|413920007|gb|AFW59939.1| autophagy 4b variant 1Cysteine protease ATG4B [Zea mays]
          Length = 492

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 93/301 (30%), Positives = 145/301 (48%), Gaps = 52/301 (17%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
            D +SR+W TYRKGF  I  S LT+D  WGCM+R  QM++AQAL+F HLGR W+      
Sbjct: 147 EDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKP 206

Query: 77  -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
            +  Y+++L +F D     +SIH +   G + G A G W GP  + +  + L +      
Sbjct: 207 YDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQA 266

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                 +++   ++ V+ D            ++   +LC+   +      W P++L+IPL
Sbjct: 267 DAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCT--WSPILLLIPL 324

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  INP YI  +K+                            F FPQSLG++GGKP 
Sbjct: 325 VLGLDKINPRYIPLLKE---------------------------TFKFPQSLGILGGKPG 357

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
            + Y  G   +  ++LDPH   ++    D   D+ +   S+YHC     L +  +DPS+A
Sbjct: 358 TSTYIAGVQEDRALYLDPH---DVQMAVDIAPDNLEADTSSYHCSVVRDLALEQIDPSLA 414

Query: 297 V 297
           +
Sbjct: 415 I 415


>gi|343428793|emb|CBQ72338.1| related to ATG4-essential for autophagy [Sporisorium reilianum SRZ2]
          Length = 1505

 Score =  137 bits (345), Expect = 7e-30,   Method: Composition-based stats.
 Identities = 89/339 (26%), Positives = 147/339 (43%), Gaps = 107/339 (31%)

Query: 37   SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW--------------------------- 69
            +GLTTD GWGCMLR GQ ++A AL+ +HLGR W                           
Sbjct: 783  AGLTTDSGWGCMLRTGQSLLANALINVHLGRSWMREAPPARQLEFLQELANLSLDTSAEK 842

Query: 70   ----QW-NVNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
                +W    ++   Y+KIL  F D  +   P+ +H++A  G   GK VGEWFGP+T A 
Sbjct: 843  QSLLEWRQKRARHSTYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAG 902

Query: 123  VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC---------------TTNKRASSNPQW 167
             +++L      + +   +A D    +++V+                  T  ++  +   W
Sbjct: 903  AIKQLVSEFPDAGLAVELAHDGVFYLDEVRAAAGASRQLGKGRASATGTNGRKGDTALTW 962

Query: 168  -QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
             +P++++I +RLG+  +NP+Y   +K                             F+FP 
Sbjct: 963  HKPVLILIGIRLGLDSVNPIYYESVKAT---------------------------FSFPH 995

Query: 227  SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ--------------------NIGCVYD- 265
            S+G+ GG+P+ + YF+G+ GN + +LDPH  +                    +I   +  
Sbjct: 996  SVGIAGGRPSSSYYFMGHQGNSLFYLDPHNVRPAVALRFPPSTFPAAVPRQLDIAHRFAF 1055

Query: 266  KEQDSEKKL---------DSTYHCPQASRLHILHMDPSI 295
            +E D E +           ST+HC +  R+ I  +DPS+
Sbjct: 1056 EEHDDEDEWWSHAYTEAQTSTFHCDKVRRMPIKSLDPSM 1094


>gi|210063823|gb|ACJ06587.1| ATG4 protein [Magnaporthe oryzae]
          Length = 491

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 96/311 (30%), Positives = 138/311 (44%), Gaps = 66/311 (21%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR GF  I  S                       G TTD GWGCM+R GQ 
Sbjct: 154 DFESRIWMTYRSGFESIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRTGQS 213

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A +LL   LGR W+      EE   K+L +F D   APYSIH     GA++ GK  GE
Sbjct: 214 LLANSLLTCRLGRSWRRGQAPDEE--RKLLSLFADDPRAPYSIHNFVAHGAAKCGKYPGE 271

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ +  LA   + S  V+       +  +   ++   + +      + P +++
Sbjct: 272 WFGPSATARCIHALANATENSFRVYSTGDLPDVYEDSFMEVAKPDGKT-----FHPTLIL 326

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           I  RLGI  IN VY   +     L                           PQS+G+ GG
Sbjct: 327 ISTRLGIDKINQVYWESLTATLQL---------------------------PQSVGIAGG 359

Query: 234 KPNHALYFIGYVGND------VIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
           +P+ + YF+G   +D      + +LDP HT   +    D +  +   +DS  H  +  RL
Sbjct: 360 RPSSSHYFVGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSC-HTRRLRRL 418

Query: 287 HILHMDPSIAV 297
           HI  MDPS+ +
Sbjct: 419 HIREMDPSMLI 429


>gi|90399070|emb|CAJ86292.1| H0124B04.9 [Oryza sativa Indica Group]
          Length = 1216

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 98/331 (29%), Positives = 144/331 (43%), Gaps = 79/331 (23%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
            D +SR+W TYR+GF  I DS  T+D  WGCM+R  QM++AQAL+F HLGR W+      
Sbjct: 405 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKP 464

Query: 77  -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
               Y+ IL MF D     +SIH +   G S G A G W GP  + +  + L +      
Sbjct: 465 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 524

Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
                 + +   ++ V+ D          + ++   +LC    +  S   W P++L++PL
Sbjct: 525 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 582

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
            LG+  INP YI  +K+                            FTFPQSLG++GGKP 
Sbjct: 583 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 615

Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCV------------------------------YDK 266
            + Y  G   +  ++LDPH  Q    V                               D 
Sbjct: 616 TSTYIAGVQDDRALYLDPHEVQMSATVIIWLFLQYPFYAWNPFCYGSYSGVFSTSQAVDI 675

Query: 267 EQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
             D+ +   S+YHC     L +  +DPS+A+
Sbjct: 676 AADNIEADTSSYHCSTVRDLALDLIDPSLAI 706


>gi|302684483|ref|XP_003031922.1| hypothetical protein SCHCODRAFT_109321 [Schizophyllum commune H4-8]
 gi|300105615|gb|EFI97019.1| hypothetical protein SCHCODRAFT_109321, partial [Schizophyllum
           commune H4-8]
          Length = 602

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 141/281 (50%), Gaps = 66/281 (23%)

Query: 18  DITSRLWFTYRKGFV-------------------------------PIGDSGLTTDKGWG 46
           D  +R+W TYR GF                                P G  G ++D GWG
Sbjct: 132 DFATRIWLTYRSGFELIRDRQLIDLPPPVASLDGHLQGEWATDEAEPPGAYGFSSDSGWG 191

Query: 47  CMLRCGQMVIAQALLFLHLGRDWQW--NVNSKEEA-YLKILKMFED--RRTAPYSIHQIA 101
           CMLR GQ ++A ALL    GRDW+    V + + + Y+ +L +F D    TAP+SIH++A
Sbjct: 192 CMLRTGQSLLANALLTAWFGRDWRRISEVETHQHSLYVHLLSLFLDTPHPTAPFSIHRMA 251

Query: 102 LTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT---N 158
           L G   GK +G+WFGP+T A  ++ L      + I   V +D  L  ++V     +   +
Sbjct: 252 LAGKQLGKDIGQWFGPSTAAGAIKNLVSAYPLAGIGVVVGMDGALSKSEVFTASHSEWSD 311

Query: 159 KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
           + A+ +   +P+++++ LRLG+  +NP+Y               +D +K L         
Sbjct: 312 EEAALDWGDRPVLILLNLRLGLDRVNPIY---------------HDTIKAL--------- 347

Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
              FTFPQS+G+ GG+P  + +F+G  G+D+I+LDPH  +N
Sbjct: 348 ---FTFPQSVGIAGGRPCSSYHFVGAQGSDLIYLDPHHTRN 385


>gi|325184648|emb|CCA19140.1| cysteine protease family C54 putative [Albugo laibachii Nc14]
          Length = 459

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 97/328 (29%), Positives = 141/328 (42%), Gaps = 91/328 (27%)

Query: 21  SRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE--- 77
           S LW+TYR+ F  +     T+D GWGCMLR  QM++++A     LG  W+    S++   
Sbjct: 102 SILWYTYRRDFETMVPYDFTSDAGWGCMLRSAQMLLSEAFKRNMLGIKWKIPARSEDLEL 161

Query: 78  -EAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL----AKY 130
            + Y+K+LK F D       YSIH I   G    K  GEW+GP T AQ LR L    A+ 
Sbjct: 162 PKVYVKLLKWFVDSFDTECKYSIHNITRIGMQYDKLPGEWYGPTTAAQALRDLVNLHAQE 221

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------------KRA 161
               ++V +V  D  +    V +LC ++                              R 
Sbjct: 222 SPECNLVMYVPQDGVVYTKDVNELCISHLDQENTFVNVNEETQSDGTFPDPLLHPPTDRD 281

Query: 162 SSNPQWQ-PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
           +S   WQ  L+++IPLRLG+  INP Y+  I++                           
Sbjct: 282 NSEKMWQKSLLILIPLRLGLDSINPRYLPAIQRV-------------------------- 315

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
            F FPQ++G+IGGK  H++YF+G   + +  LDPH             D     D     
Sbjct: 316 -FEFPQNVGIIGGKKGHSVYFVGTFDSKLQLLDPH-------------DIHPTADLNTAF 361

Query: 281 PQASRLHILH-----------MDPSIAV 297
           P A+ L  +H           +DPS+A+
Sbjct: 362 PTATHLRTVHSRLPLEMSLGSIDPSLAL 389


>gi|189515077|ref|XP_001333093.2| PREDICTED: cysteine protease ATG4D-like [Danio rerio]
          Length = 485

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 160/363 (44%), Gaps = 101/363 (27%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+   S +W TYR+ F  +  S LTTD GWGCMLR GQM++AQ LL   +  DW+
Sbjct: 95  EVERFRQTFVSCVWLTYRREFPQLDGSSLTTDCGWGCMLRSGQMMLAQGLLLHLMPTDWR 154

Query: 71  W-NVNSKEEAYLKILK-------------------------------MFEDRRTAP---- 94
           W + ++  +   ++LK                               + E  R AP    
Sbjct: 155 WSDCHALTDVDFEVLKPRSPSRPAGMSMPSFSSSWSSSIPQINPSPGITEAHRRAPARCP 214

Query: 95  ------------------YSIHQIALTGASE----GKAVG----EWFGPNTVAQVLRK-L 127
                             +  H  A  G  +    GK  G    +W+GP+ VA +LRK +
Sbjct: 215 SASPDPQVDALHRKVVSCFGDHPSAPFGVHQLVELGKESGKRAGDWYGPSVVAHMLRKAV 274

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
           A+  ++  +  +VA D T+    V  LC      SS   W+ +V+++P+RLG + +NP Y
Sbjct: 275 ARAAEFEDLAVYVAQDCTVYKEDVMSLCE-----SSGVGWKSVVILVPVRLGGESLNPSY 329

Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
           I  +K    L                              +G+IGGKP H+L+F+G+   
Sbjct: 330 IECVKNILKLKC---------------------------CIGIIGGKPKHSLFFVGFQDE 362

Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDY 306
            +++LDPH  Q +  V       E     ++HC    +++   MDPS  + +  RS +D+
Sbjct: 363 QLLYLDPHYCQPVVDVTQANFSLE-----SFHCNSPRKMNFSRMDPSCTIGLYARSKTDF 417

Query: 307 KNV 309
           +++
Sbjct: 418 ESL 420


>gi|384253649|gb|EIE27123.1| peptidase C54 [Coccomyxa subellipsoidea C-169]
          Length = 362

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 93/311 (29%), Positives = 146/311 (46%), Gaps = 36/311 (11%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q L  I  D+ SR+W TYR+GF PI  SG+T+D GWGC LR GQM++AQAL++  +GR W
Sbjct: 15  QVLNAILSDLMSRIWMTYRRGFPPICGSGITSDVGWGCTLRSGQMLLAQALVYHLVGRQW 74

Query: 70  QWNVNSK-EEAYLKILKMFEDRRTA--PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
           +  + +   E   ++L+ F D+     P+SIH +  TG + G   G+W GP+ +   L  
Sbjct: 75  RRKLEAAYPEEVAQVLQWFGDQACEQRPFSIHNMCTTGQTHGVKAGDWLGPSGLCHTLAD 134

Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPV 186
           +        +   V             LCT+    +                G  D +  
Sbjct: 135 MVNKVQPGGLQCRVV---ATFGGGAPVLCTSRLATAFE--------------GGADRSGG 177

Query: 187 YINGIKKCYALPISPVYDMVKILSSTYNMQ-TPRY------EFTFPQSLGVIGGKPNHAL 239
            +       + P      ++  L    N +  PRY        T+PQS+G++GG+P+ +L
Sbjct: 178 EVGSSGSEESGPAGQGLLLLIPLMLGLNGKINPRYCAQLQQLLTWPQSVGIVGGRPSSSL 237

Query: 240 YFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-V 298
           YFIG     V++LDPH  Q +         SE     TY C     + + ++DPS+A+  
Sbjct: 238 YFIGLQDQHVLYLDPHEVQEVA--------SEAADLDTYFCSSLRLMPLANIDPSLAIGF 289

Query: 299 SQRSYSDYKNV 309
              S SD++++
Sbjct: 290 YCSSLSDFEDL 300


>gi|367032280|ref|XP_003665423.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
           42464]
 gi|347012694|gb|AEO60178.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
           42464]
          Length = 456

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 98/305 (32%), Positives = 145/305 (47%), Gaps = 60/305 (19%)

Query: 18  DITSRLWFTYRKGF--VP--------------------IGD-SGLTTDKGWGCMLRCGQM 54
           D  SR+W TYR GF  +P                    +GD +G T+D GWGCM+R GQ 
Sbjct: 125 DFESRIWMTYRTGFELIPRSTDPRATSSFSIAMRLKTTLGDQTGFTSDTGWGCMIRSGQS 184

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A ALL   LGRDW+   +   +A   IL +F D   APYS+H     G  + GK  GE
Sbjct: 185 LLANALLISRLGRDWRRMTDP--DAERPILALFADDSRAPYSLHNFVKHGELACGKYPGE 242

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ LA   + S  V+     +   V +   + T      +   + P +++
Sbjct: 243 WFGPSATARCIQALANKHESSLRVYSTG--DLPDVYEDSFMATAKPDGET---FHPTLIL 297

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  IN VY                  V+ L ST  M+         QS+G+ GG
Sbjct: 298 VCTRLGIDKINQVY------------------VEALISTLQME---------QSIGIAGG 330

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHILHMD 292
           +P  + YF+G  G  + +LDPH  +      +   D + ++LDS  H  +  RLH+  MD
Sbjct: 331 RPASSHYFVGVQGQWLFYLDPHHPRPKLPYRENPDDYTSEELDSC-HTRRLRRLHVEDMD 389

Query: 293 PSIAV 297
           PS+ +
Sbjct: 390 PSMLI 394


>gi|260949671|ref|XP_002619132.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
 gi|238846704|gb|EEQ36168.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
          Length = 340

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 104/334 (31%), Positives = 154/334 (46%), Gaps = 85/334 (25%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPI-----GDSGL-------------------------TT 41
           LE+I   I SRLWFTYR GF PI     G S L                         +T
Sbjct: 52  LEEIYPVINSRLWFTYRAGFEPIQKAEDGPSPLAFLKSMIFNVRPSMALGGLFDNQNYST 111

Query: 42  DKGWGCMLRCGQMVIAQALLFLHLGRDWQ--WNVNSKEEAYLKILKMFEDRRTAPYSIHQ 99
           D GWGCM+R  Q ++A AL  L LGRD Q    + S  E   KI+++F D  T P+S+H 
Sbjct: 112 DVGWGCMIRTSQSLLANALQMLILGRDHQSPQAIQSAPEKVEKIIQLFGDDYTCPFSLHN 171

Query: 100 -IALTGASEGKA-VGEWFGPNTVAQVLRKL-AKYD--DWSSIVFHVALDNTLVVNQVKKL 154
            I +  AS  K   GEWFGP+  +  +++L AK++  +  +I   +     L   +++ +
Sbjct: 172 FIKVASASPLKVKPGEWFGPSAASLSIKRLCAKFESNEIPNINVSICESCNLYDEEIRGI 231

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
              ++         PL+++ PLRLGI  IN +Y   + +  AL                 
Sbjct: 232 FEESES--------PLLILFPLRLGIDKINSIYYPSLLQLLALK---------------- 267

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                      QS+G+ GGKP+ + YF G+ G+++++LDPH  Q           +    
Sbjct: 268 -----------QSVGIAGGKPSSSYYFFGFQGSNLLYLDPHNLQ-----------AASSD 305

Query: 275 DSTYHCPQASRLHILHMDPSIAV--VSQRSYSDY 306
             TYH  +   L I ++DP  A   V+Q +Y DY
Sbjct: 306 PGTYHTSKFQTLSISNLDPLNACWSVNQMTYDDY 339


>gi|67526025|ref|XP_661074.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
 gi|40743824|gb|EAA63010.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
          Length = 379

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 98/304 (32%), Positives = 144/304 (47%), Gaps = 58/304 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S++W TYR  F PI                          G T+D GWGCM+R GQ 
Sbjct: 53  DFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMIRSGQS 112

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A ++  L LGRDW+     +EE   K+L +F D   AP+SIH     GA   GK  GE
Sbjct: 113 LLANSMAILLLGRDWRRGERLEEEG--KLLSLFADSPHAPFSIHSFVKHGADFCGKHPGE 170

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP   A+ ++ LA   D S++  ++A DN+  V+Q K +  +     +    +P +++
Sbjct: 171 WFGPTATARCIQGLAARYDQSNLQVYIADDNS-DVHQDKFMSVSRDEKGTV---RPTLIL 226

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           + LRLGI  I  VY NG+K    L                           PQS+G+ GG
Sbjct: 227 LGLRLGIDRITAVYWNGLKAVLQL---------------------------PQSVGIAGG 259

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YF+   G+   +LDPH N      Y +     +   +TYH  +  RL+I  MDP
Sbjct: 260 RPSASHYFVAVQGSHFFYLDPH-NTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDP 318

Query: 294 SIAV 297
           S+ +
Sbjct: 319 SMLI 322


>gi|440638438|gb|ELR08357.1| hypothetical protein GMDG_03152 [Geomyces destructans 20631-21]
          Length = 448

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 90/305 (29%), Positives = 144/305 (47%), Gaps = 59/305 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S+L F+YR GF  I  S                       G ++D GWGCM+R GQ 
Sbjct: 111 DFESKLRFSYRTGFPVIPRSEDPKASSTMSFSVRLRSQLSDQGGFSSDTGWGCMIRSGQS 170

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A +++ L L R W+  V   +E   +I+ +F D   APYSIH+    GA   GK  G+
Sbjct: 171 LLANSMVILRLSRGWRRGVGRDKE--REIVSLFADDPRAPYSIHKFVEHGAEACGKYPGQ 228

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ +++LAK  + + +  ++  D + V        +  K    N  ++P +++
Sbjct: 229 WFGPSATARCIQELAKRHESADVRVYITGDGSDVYKD--GFMSVAKPDGVN--FKPTLIL 284

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  + PVY   +K    +                           PQS+G+ GG
Sbjct: 285 VGTRLGIDKVTPVYWEALKASLQM---------------------------PQSVGIAGG 317

Query: 234 KPNHALYFIGYVGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YF+G  G+   +LDPH T   I    D ++ +  ++DS  H  +  RL I  MD
Sbjct: 318 RPSSSHYFVGVQGSHFFYLDPHQTMAAIPFHTDVDEYTPAEIDSC-HTRRLRRLDIKEMD 376

Query: 293 PSIAV 297
           PS+ +
Sbjct: 377 PSMLI 381


>gi|448114689|ref|XP_004202639.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
 gi|359383507|emb|CCE79423.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
          Length = 480

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 94/342 (27%), Positives = 151/342 (44%), Gaps = 90/342 (26%)

Query: 13  EQIRRDITSRLWFTYRKGFVPI----------------------------------GDSG 38
           E++ +DI SR+WFTYR GF PI                                   +  
Sbjct: 79  EEMEKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALDNIHGLFNNQN 138

Query: 39  LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
            TTD GWGCM+R  QM++A A   L LGRD+ + V+  E+ +  I+ MF D    P+S+H
Sbjct: 139 FTTDVGWGCMIRTSQMLLANAFQLLLLGRDFAY-VDGSEKKHSDIIDMFTDEPKTPFSLH 197

Query: 99  QIALTGASEGKAV--GEWFGPNTVAQVLRKLAK--YDDWSSIVFHVALDNTLVV--NQVK 152
                 +     V  GEWFGPN  +  +++L K  +D   S  F V +  +  +  +++ 
Sbjct: 198 NFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQFDGSVSPSFRVIISESCDIYDDKIG 257

Query: 153 KLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
           KL    + +        +++++P+RLG+  ++P Y               +D +  L   
Sbjct: 258 KLLQEIENSE-----DAILILLPVRLGLNKVSPYY---------------HDSLSSL--- 294

Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI--GCVYDKEQDS 270
                    F   Q +G+ GGKP+ + YF G     +++LDPH  Q++    +YD     
Sbjct: 295 ---------FCSSQLVGIAGGKPSSSYYFFGSHNGHLLYLDPHYPQSMKASSIYD----- 340

Query: 271 EKKLDSTYHCPQASRLHILHMDPS----IAVVSQRSYSDYKN 308
                 T+H  +   L I  MDPS    I + S+  Y  +K+
Sbjct: 341 ------TFHTNKVQSLKIEDMDPSMLIGILIKSKEDYESFKD 376


>gi|432845798|ref|XP_004065858.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
          Length = 497

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 106/370 (28%), Positives = 167/370 (45%), Gaps = 93/370 (25%)

Query: 1   MRHANKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQA 59
           + HA  L+ +D +E+ R D  SR+W TYR+ F  +  S LTTD GWGCMLR GQM++AQ 
Sbjct: 95  LGHAYLLNSEDEVERFRLDFVSRIWLTYRREFPQLEGSTLTTDCGWGCMLRSGQMLLAQG 154

Query: 60  LLFLHLGRDWQW-NVNSKEEAYLKILK--------------------------------- 85
           LL   +  DW W + +   +   +I +                                 
Sbjct: 155 LLLHLMPPDWTWPDAHQLTDVDFEIFRPRSPVRAAGVPIPSFGAPRASTTPEKSCSSSQK 214

Query: 86  ----MFEDRRTAPYSIHQIALTG----------------ASEGKAVGEWFGPNTVAQVLR 125
                  DR+  P     + L G                   GK  G+W+GP+ VA +LR
Sbjct: 215 KKTESSRDRQAEPTHQKLVTLFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGPSVVAHILR 274

Query: 126 K-LAKYDDWSSIVFHVALDNTLVVNQVKKLC--TTNKRAS--SNPQWQPLVLVIPLRLGI 180
           K +AK     S+  +VA D T+    V +LC  + ++R +  S+  W+ +++++P+RLG 
Sbjct: 275 KAVAKTSVGQSLAVYVAQDCTVYKEDVLQLCDPSLSQRVADPSSQAWKSVIILVPVRLGG 334

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
           + +NP YI  +K   +L                              +G+IGGKP H+LY
Sbjct: 335 EALNPSYIECVKNILSLDC---------------------------CIGIIGGKPKHSLY 367

Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VS 299
           FIG+    +++LDPH  Q    V D  Q +   L+S +HC    ++    MDPS  +   
Sbjct: 368 FIGFQDEQLLYLDPHYCQP---VVDFTQ-ANFSLES-FHCSSPKKMPFSRMDPSCTIGFY 422

Query: 300 QRSYSDYKNV 309
            R+  D++++
Sbjct: 423 ARTKEDFESM 432


>gi|357528776|sp|Q5B7L0.2|ATG4_EMENI RecName: Full=Cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|259485537|tpe|CBF82642.1| TPA: Cysteine protease atg4 (EC 3.4.22.-)(Autophagy-related protein
           4) [Source:UniProtKB/Swiss-Prot;Acc:Q5B7L0] [Aspergillus
           nidulans FGSC A4]
          Length = 402

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 98/304 (32%), Positives = 144/304 (47%), Gaps = 58/304 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  S++W TYR  F PI                          G T+D GWGCM+R GQ 
Sbjct: 76  DFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMIRSGQS 135

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A ++  L LGRDW+     +EE   K+L +F D   AP+SIH     GA   GK  GE
Sbjct: 136 LLANSMAILLLGRDWRRGERLEEEG--KLLSLFADSPHAPFSIHSFVKHGADFCGKHPGE 193

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP   A+ ++ LA   D S++  ++A DN+  V+Q K +  +     +    +P +++
Sbjct: 194 WFGPTATARCIQGLAARYDQSNLQVYIADDNS-DVHQDKFMSVSRDEKGTV---RPTLIL 249

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           + LRLGI  I  VY NG+K    L                           PQS+G+ GG
Sbjct: 250 LGLRLGIDRITAVYWNGLKAVLQL---------------------------PQSVGIAGG 282

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YF+   G+   +LDPH N      Y +     +   +TYH  +  RL+I  MDP
Sbjct: 283 RPSASHYFVAVQGSHFFYLDPH-NTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDP 341

Query: 294 SIAV 297
           S+ +
Sbjct: 342 SMLI 345


>gi|405119256|gb|AFR94029.1| peptidase family C54 protein [Cryptococcus neoformans var. grubii
           H99]
          Length = 1185

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 86/267 (32%), Positives = 125/267 (46%), Gaps = 85/267 (31%)

Query: 38  GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW----------QWNVNSKEEA------YL 81
           GLT+D GWGCMLR GQ ++  AL+ +HLGRDW          +   N +  A      Y 
Sbjct: 559 GLTSDAGWGCMLRTGQSLLVNALIHVHLGRDWRVPSTPASFSEATTNQETAALKDYAKYA 618

Query: 82  KILKMFEDRRTA--PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK---------- 129
           ++L  F D  +   P+S+H++AL G   GK VGEWFGP+T A  L+ LA           
Sbjct: 619 QMLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANSFAPCGVAVA 678

Query: 130 -------------------YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW--Q 168
                               DDW+SI        +   N  KK    + +A    +W  +
Sbjct: 679 TATDSIIYKSDVYTASNLPSDDWNSI--------SPTFNSSKKKRGGDNKAKEG-KWGKR 729

Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
            +++++ +RLG+  +NP+Y               YD +K L            FTFPQS+
Sbjct: 730 AVLILVGIRLGLDGVNPIY---------------YDSIKAL------------FTFPQSV 762

Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPH 255
           G+ GG+P+ + YFIG   N + +LDPH
Sbjct: 763 GIAGGRPSSSYYFIGSQANHLFYLDPH 789


>gi|189091768|ref|XP_001929717.1| hypothetical protein [Podospora anserina S mat+]
 gi|188219237|emb|CAP49217.1| unnamed protein product [Podospora anserina S mat+]
          Length = 508

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 94/308 (30%), Positives = 144/308 (46%), Gaps = 66/308 (21%)

Query: 18  DITSRLWFTYRKGF--VP--------------------IGD-SGLTTDKGWGCMLRCGQM 54
           D  SR+W TYR GF  +P                     GD +G ++D GWGCM+R GQ 
Sbjct: 176 DFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRSGQS 235

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A A+L    GR W+   N   E   +I+ +F D   APYSI      GA+  GK  GE
Sbjct: 236 LLANAMLISRAGRAWRRTTNPDIE--REIVCLFADDPRAPYSIQNFVNHGAAACGKYPGE 293

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP---QWQPL 170
           WFGP+  A+ ++ LAK  D S  V+        +   + ++   N  +++NP    + P 
Sbjct: 294 WFGPSATARCIQALAKKHDSSLRVY--------LTRDLPEVYEDNFMSTANPDGNHFHPT 345

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           ++++  RLGI  INP+Y   +     L                           PQ++G+
Sbjct: 346 LILVSTRLGIDKINPIYHEALISTLQL---------------------------PQAIGI 378

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHIL 289
            GG+P+ + YFIG  G  + +LDPH  +      +   D + ++LDS  H  +   LH+ 
Sbjct: 379 AGGRPSSSHYFIGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSC-HTRRLRHLHVE 437

Query: 290 HMDPSIAV 297
            MDPS+ +
Sbjct: 438 DMDPSMLI 445


>gi|355703136|gb|EHH29627.1| Cysteine protease ATG4D [Macaca mulatta]
          Length = 511

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 96/327 (29%), Positives = 141/327 (43%), Gaps = 85/327 (25%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 151 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 210

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   +E  + +I+  F D   AP+
Sbjct: 211 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 270

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
            +H++   G S GK  G+W+GP+ VA +LRK                + T +V  V + C
Sbjct: 271 GLHRLVELGQSSGKKAGDWYGPSLVAHILRK----------AVESCSEVTRLVVYVSQDC 320

Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
           T  + +S                 + D               P S    ++ +L      
Sbjct: 321 TAAEASSP----------------VSDT--------------PASGPLHLLPLLLGVLFQ 350

Query: 216 QTPRYEFTFPQ-----SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDS 270
           Q  R+ F          LG++GGKP H+LYFIGY  + +++LDPH  Q      D  Q +
Sbjct: 351 QRCRWLFVCELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQ-A 406

Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
           +  L+S +HC    ++    MDPS  V
Sbjct: 407 DFPLES-FHCTSPRKMAFAKMDPSCTV 432


>gi|409077121|gb|EKM77488.1| hypothetical protein AGABI1DRAFT_108018 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 1355

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 91/297 (30%), Positives = 136/297 (45%), Gaps = 86/297 (28%)

Query: 18  DITSRLWFTYRKGFV-PIGDSGLT------------------------------------ 40
           D  SR+W TYR  F  PI DS LT                                    
Sbjct: 337 DFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGGEKS 396

Query: 41  --TDKGWGCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT-- 92
             +D GWGCMLR GQ ++A AL+ +HLGRDW+   + V + + A Y++IL  F D  +  
Sbjct: 397 WSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTPSPD 456

Query: 93  APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
           AP+S+H++AL G   G  VG+WFGP+  A  +++L      S +   VA D  L    V 
Sbjct: 457 APFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRLVNEFPRSGVGVSVAKDGVLSQTDVF 516

Query: 153 KLCTTNKRASSNP------------QW--QPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
                +   ++               W  +P+++++ LRLGI  +NP+Y           
Sbjct: 517 LASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVNPIY----------- 565

Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
               Y+ +K L            FT PQS+G+ GG+P  + YF+G   +++ +LDPH
Sbjct: 566 ----YETIKTL------------FTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPH 606


>gi|426191859|gb|EKV41798.1| hypothetical protein AGABI2DRAFT_123279 [Agaricus bisporus var.
           bisporus H97]
          Length = 1261

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 91/297 (30%), Positives = 136/297 (45%), Gaps = 86/297 (28%)

Query: 18  DITSRLWFTYRKGFV-PIGDSGLT------------------------------------ 40
           D  SR+W TYR  F  PI DS LT                                    
Sbjct: 250 DFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGGEKS 309

Query: 41  --TDKGWGCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT-- 92
             +D GWGCMLR GQ ++A AL+ +HLGRDW+   + V + + A Y++IL  F D  +  
Sbjct: 310 WSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTPSPD 369

Query: 93  APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
           AP+S+H++AL G   G  VG+WFGP+  A  +++L      S +   VA D  L    V 
Sbjct: 370 APFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRLVNEFPRSGVGVSVAKDGVLSQTDVF 429

Query: 153 KLCTTNKRASSNP------------QW--QPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
                +   ++               W  +P+++++ LRLGI  +NP+Y           
Sbjct: 430 LASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVNPIY----------- 478

Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
               Y+ +K L            FT PQS+G+ GG+P  + YF+G   +++ +LDPH
Sbjct: 479 ----YETIKTL------------FTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPH 519


>gi|403413274|emb|CCL99974.1| predicted protein [Fibroporia radiculosa]
          Length = 994

 Score =  134 bits (338), Expect = 4e-29,   Method: Composition-based stats.
 Identities = 92/283 (32%), Positives = 142/283 (50%), Gaps = 72/283 (25%)

Query: 18  DITSRLWFTYRKGFVPI--------------------------------GDSGLTTDKGW 45
           D TSR+W TYR  F PI                                G+ G T+D GW
Sbjct: 312 DFTSRIWLTYRSQFEPIRDTSLSALNYDMDERAAPTSSPQPKRWNWGLGGEKGWTSDSGW 371

Query: 46  GCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSIHQ 99
           GCMLR GQ ++A ALL LHLGRDW+   + + + + A Y++I+  F D  +   P+S+H+
Sbjct: 372 GCMLRTGQSLLANALLHLHLGRDWRRPPYPIYTADFATYVQIISWFLDDPSPLCPFSVHR 431

Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT-- 157
           +AL G   GK VG+WFGP+T A  ++ L      + +   VA+D  +  + V  +  +  
Sbjct: 432 MALVGKELGKDVGQWFGPSTAAGAIKTLVHAFPEAGLGVSVAVDGVIYQSDVYAVSRSTM 491

Query: 158 ---NKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
              + R    P W  + ++++I +RLGI  +NP+Y               YD++K L   
Sbjct: 492 GLGSPRKHGRPSWGDRAVLVLIGIRLGIDGVNPIY---------------YDLIKAL--- 533

Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                    +T PQ+LG+ GG+P+ + YF+G   N++ +LDPH
Sbjct: 534 ---------YTLPQTLGIAGGRPSSSYYFVGSQANNLFYLDPH 567


>gi|396482697|ref|XP_003841525.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
 gi|312218100|emb|CBX98046.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
          Length = 462

 Score =  133 bits (335), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 80/263 (30%), Positives = 122/263 (46%), Gaps = 53/263 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
           D  SR+W TYR GF  I  S                     G T+D G+GCM+R GQ ++
Sbjct: 99  DFESRIWMTYRSGFPVIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCMIRSGQCIL 158

Query: 57  AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
           A AL  L LGRDW++  +   + +  IL +F D   AP+SIH+    GA+  GK  GEWF
Sbjct: 159 ANALQTLRLGRDWRYQDDPTAQEHCNILSLFADDPQAPFSIHRFVEHGAAVCGKYPGEWF 218

Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
           GP+  A+ ++ L      + +  +V+ D   V     K     +      +W P ++++ 
Sbjct: 219 GPSAAARCIQDLVHKYKEAGLRVYVSGDGADVYEDKLKQVAVEEDG----EWIPTLILVG 274

Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
            RLGI  I PVY   +K                  ++  M+         QS+G+ GG+P
Sbjct: 275 TRLGIDKITPVYWEALK------------------ASLQMK---------QSMGIAGGRP 307

Query: 236 NHALYFIGYVGNDVIFLDPHTNQ 258
           + + YF+    N   +LDPH+ +
Sbjct: 308 SASHYFVATQANHFFYLDPHSTR 330


>gi|395323681|gb|EJF56143.1| hypothetical protein DICSQDRAFT_113447 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 999

 Score =  133 bits (335), Expect = 9e-29,   Method: Composition-based stats.
 Identities = 92/284 (32%), Positives = 139/284 (48%), Gaps = 73/284 (25%)

Query: 18  DITSRLWFTYRKGFVPI---------------------------------GDSGLTTDKG 44
           D TSR+W TYR  F PI                                 G+ G T+D G
Sbjct: 306 DFTSRIWLTYRSQFFPIRDTTLAALEQEVHDSPTGLPSSPPSKRWNWPIGGEKGWTSDAG 365

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA----YLKILKMFEDRRT--APYSIH 98
           WGCMLR GQ ++A ALL LHLGRDW+   +    A    Y++I+  F D  +   P+S+H
Sbjct: 366 WGCMLRTGQSLLANALLHLHLGRDWRRPPHPVYTADYAMYVQIVTWFLDTPSPLCPFSVH 425

Query: 99  QIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV-----KK 153
           ++AL G   GK VG+WFGP+T A  ++ L      + +   VA D+TL  + V       
Sbjct: 426 RMALVGKDLGKEVGQWFGPSTAAGAIKTLVHSFPDAGLGVAVASDSTLYESDVYAASRSS 485

Query: 154 LCTTNKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
           + +T +      +W  + ++++I +RLGI+ +NP+Y N IK  Y                
Sbjct: 486 VYSTRRHGHPRMEWGDRAVLILIGIRLGIEGVNPLYYNTIKTLY---------------- 529

Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                      TFPQ++G+ GG+P+ + YF+G   +++ +LDPH
Sbjct: 530 -----------TFPQTVGIAGGRPSSSYYFVGSQADNLFYLDPH 562


>gi|367047453|ref|XP_003654106.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
 gi|347001369|gb|AEO67770.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
          Length = 454

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 93/307 (30%), Positives = 141/307 (45%), Gaps = 64/307 (20%)

Query: 18  DITSRLWFTYRKGF--VP--------------------IGD-SGLTTDKGWGCMLRCGQM 54
           D  SR+W TYR GF  +P                     GD +G ++D GWGCM+R GQ 
Sbjct: 121 DFESRIWMTYRTGFELIPRSTDPRANSALSFAMRLKTSFGDQTGFSSDTGWGCMIRSGQS 180

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A AL    LGRDW+   +   +A  +IL +F D   APYS+H     GA+  GK  GE
Sbjct: 181 LLANALQISRLGRDWRRATDP--DAEREILSLFADDPRAPYSLHNFVKHGAAACGKYPGE 238

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP---QWQPL 170
           WFGP+  A+ +  LA   + S  V+            +  +   +  A +NP    + P 
Sbjct: 239 WFGPSATARCIEALANQHESSLRVYS--------TGDLPDVYEDSFMAVANPDGEHFHPT 290

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           ++++  RLGI  IN VY                   + L ST  M+         QS+G+
Sbjct: 291 LILVCTRLGIDKINQVY------------------EEALISTLQME---------QSIGI 323

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILH 290
            GG+P+ + YF+G  G  + +LDPH  +      +  +D   +   + H  +   LH+  
Sbjct: 324 AGGRPSSSHYFVGVQGQWLFYLDPHHPRPALPYREAPEDYTSEELGSCHTRRLRHLHVED 383

Query: 291 MDPSIAV 297
           MDPS+ +
Sbjct: 384 MDPSMLI 390


>gi|294654609|ref|XP_456671.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
 gi|218511938|sp|Q6BYP8.2|ATG4_DEBHA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|199429011|emb|CAG84627.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
          Length = 492

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 93/341 (27%), Positives = 147/341 (43%), Gaps = 91/341 (26%)

Query: 15  IRRDITSRLWFTYRKGFVPIG----------------------------------DSGLT 40
           I +DI S++W TYR GF PI                                   +   T
Sbjct: 85  IEQDIYSKIWLTYRTGFEPIAKCLDGPQPLSFVQSMVFNRNPISSTFNNFHGLLDNDNFT 144

Query: 41  TDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI 100
           TD GWGCM+R  Q ++A     L LGR + +  + +   + +I+ MF D   AP+S+H  
Sbjct: 145 TDVGWGCMIRTSQALLANTYQLLFLGRGFSYGRD-RSPRHDEIIDMFMDEPRAPFSLHNF 203

Query: 101 ALTGASEGKAV--GEWFGPNTVAQVLRKLA----KYDDWSSIVFHVALDNTLVVNQVKKL 154
               +     V  G+WFGPN  +  +++L     + +    +   ++  + L  + + ++
Sbjct: 204 IKVASESPLKVKPGQWFGPNAASLSIKRLCDNVYESNGTGRVKVVISESSNLYDDIITQM 263

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
            TT      NP    +++++P+RLGI  +NP+Y   + +  AL                 
Sbjct: 264 FTT-----LNPVPDAILVLLPVRLGIDKVNPLYHASVLELLALR---------------- 302

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ---NIGCVYDKEQDSE 271
                      QS+G+ GGKP+ + YF GY GND+++LDPH  Q   N   VYD      
Sbjct: 303 -----------QSVGIAGGKPSSSFYFFGYKGNDLLYLDPHYPQFVRNKTSVYD------ 345

Query: 272 KKLDSTYHCPQASRLHILHMDPS----IAVVSQRSYSDYKN 308
                TYH     +L +  MDPS    I +     Y D+K+
Sbjct: 346 -----TYHTNSYQKLSVDDMDPSMMIGILIKDINDYEDFKS 381


>gi|119623099|gb|EAX02694.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_e
           [Homo sapiens]
          Length = 231

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 73/176 (41%), Positives = 100/176 (56%), Gaps = 30/176 (17%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W    ++
Sbjct: 48  DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQ 107

Query: 78  -EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSI 136
            + Y +IL+ F DR+   YSIHQ+                     ++ R L    D    
Sbjct: 108 PKEYQRILQCFLDRKDCCYSIHQM--------------------EKMCRVLPLSAD---T 144

Query: 137 VFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
                 D+    NQ K    T+   S+   W+PL+L++PLRLGI  INPVY++  K
Sbjct: 145 AGDRPPDSLTASNQSK---GTSAYCSA---WKPLLLIVPLRLGINQINPVYVDAFK 194


>gi|392586633|gb|EIW75969.1| hypothetical protein CONPUDRAFT_111807 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 1038

 Score =  132 bits (333), Expect = 1e-28,   Method: Composition-based stats.
 Identities = 95/281 (33%), Positives = 141/281 (50%), Gaps = 70/281 (24%)

Query: 18  DITSRLWFTYRKGFVPIGDS--------------------------------GLTTDKGW 45
           D TSR+W TYR  F PI DS                                G TTD GW
Sbjct: 291 DFTSRIWLTYRNQFAPIRDSTLSTLESDQTREPCTEMSSPSPKSRRWFGGEKGWTTDTGW 350

Query: 46  GCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDR--RTAPYSIHQ 99
           GCMLR GQ ++A ALL LHLGRDW+   + + +++ A Y++I+  F D     AP+S+H+
Sbjct: 351 GCMLRTGQTLLANALLHLHLGRDWRRPPYPLYTEDYATYVQIITWFLDSPLPQAPFSVHR 410

Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNK 159
           +AL G   GK VG+WFGP+T A  +++L +    + +   VA D  L    V      + 
Sbjct: 411 MALAGKDLGKDVGQWFGPSTAAGAIKRLVQAFPDAGLGVAVASDGALYQTDVYSASYVDV 470

Query: 160 RASSNP---QW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
            +  N    +W  + ++++  +RLGI  +NP+Y               YD +K L     
Sbjct: 471 GSPRNVRKLRWGGRAVLVLFGIRLGINGVNPIY---------------YDTIKGL----- 510

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                  F  PQS+G+ GG+P+ + YF+G  G+++I+LDPH
Sbjct: 511 -------FEIPQSVGIAGGRPSSSYYFMGVQGDNLIYLDPH 544


>gi|342321655|gb|EGU13587.1| Cysteine protease ATG4 [Rhodotorula glutinis ATCC 204091]
          Length = 1119

 Score =  132 bits (332), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 73/236 (30%), Positives = 120/236 (50%), Gaps = 47/236 (19%)

Query: 35  GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN------------SKEEAYLK 82
            + GL++D GWGCMLR GQ ++A AL+ +HLGRDW+  +                  Y +
Sbjct: 705 AEGGLSSDTGWGCMLRTGQSLLANALVKVHLGRDWRRPLPLGDFITSSTSPVPSAATYAR 764

Query: 83  ILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHV 140
           IL +F D  +  +P+S+H+ A  G   GK +GEWFGP+T A  ++ L    + + +    
Sbjct: 765 ILSLFLDDPSPISPFSVHRFAQQGKVLGKEIGEWFGPSTAAGAIKTLVNAYEPAGLKVVS 824

Query: 141 ALDNTLVVNQVKKLCTTNKRASSNPQWQ-PLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
            +D T+  ++V    T +       +W+ P++++I +RLGI  +NP+Y   IK  + L  
Sbjct: 825 CVDGTVYESEVVAASTKD-----GEKWKTPVLVLINVRLGIDGVNPIYYEAIKGIFRL-- 877

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                                    PQS+G+ GG+P+ + YF+G   N + ++DPH
Sbjct: 878 -------------------------PQSVGIAGGRPSSSYYFVGAQANSLFYIDPH 908


>gi|340931831|gb|EGS19364.1| cysteine protease-like protein [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 494

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 93/308 (30%), Positives = 138/308 (44%), Gaps = 65/308 (21%)

Query: 18  DITSRLWFTYRKGFVPIGDS------------------------GLTTDKGWGCMLRCGQ 53
           D  SR+W TYR GF  I  S                        G ++D GWGCM+R GQ
Sbjct: 158 DFESRIWMTYRTGFELIPRSTDPRATSALSFSMRLKTSFGADQAGFSSDTGWGCMIRSGQ 217

Query: 54  MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
            ++A ALL   LGR+W+   N K E   +IL +F D   APYS+H     GA   GK  G
Sbjct: 218 SLLANALLISRLGREWRRGQNPKAE--REILSLFADDPRAPYSLHNFVKHGAEACGKFPG 275

Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ---P 169
           EWFGP+  A+ ++ LA          H +         +  +   +  A +NP  Q   P
Sbjct: 276 EWFGPSATARCIQALANK--------HESELRVYSTGDLPDVYEDSFMAIANPDGQHFHP 327

Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
            ++++  RLGI  IN VY                   + L ST  M+         QS+G
Sbjct: 328 TLVLVCTRLGIDKINKVY------------------EQALISTLQME---------QSIG 360

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
           + GG+P+ + YFIG     + +LDPH  + +    +  +D  ++   + H  +   LH+ 
Sbjct: 361 IAGGRPSQSHYFIGVQDQWLFYLDPHYPRPMLPYRENPEDYTQEEVDSCHTRRLRHLHVE 420

Query: 290 HMDPSIAV 297
            +DPS+ +
Sbjct: 421 DLDPSMLI 428


>gi|170109871|ref|XP_001886142.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
 gi|164639072|gb|EDR03346.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
          Length = 1039

 Score =  132 bits (332), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 91/286 (31%), Positives = 144/286 (50%), Gaps = 72/286 (25%)

Query: 18  DITSRLWFTYRKGF-VPIGDSGL-------------------------------TTDKGW 45
           D TSR+W TYR  F  PI D+ L                               ++D GW
Sbjct: 339 DFTSRIWLTYRSHFPTPIKDTRLADLCGDAAPEIANSPTTVKTRPWNWGGEKTWSSDTGW 398

Query: 46  GCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSIHQ 99
           GCMLR GQ ++A AL+ +HLGRDW+   + V + + A Y++I+  F D     AP+S+H+
Sbjct: 399 GCMLRTGQSLLANALVHMHLGRDWRRPPYPVQTADYATYVQIVTWFLDTPAPEAPFSVHR 458

Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV-----KKL 154
           +AL G   G  VG+WFGP+  A  ++ L      S +   VA D TL  + V      ++
Sbjct: 459 MALAGKEFGTDVGQWFGPSVAAGAIKTLVNSFPESGLGVSVATDGTLFQSDVFAVSHGEM 518

Query: 155 CTTNKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
            + + R      W  +P++L++ +RLGI+ +NP+Y               Y+ +K+L   
Sbjct: 519 SSRSPRRIKTTTWGHRPVLLLLGIRLGIEGVNPIY---------------YETIKLL--- 560

Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
                    +TFPQS+G+ GG+P+ + YF+G   +++ +LDPH  +
Sbjct: 561 ---------YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHNTR 597


>gi|358369016|dbj|GAA85631.1| autophagy cysteine endopeptidase Atg4 [Aspergillus kawachii IFO
           4308]
          Length = 378

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 97/306 (31%), Positives = 150/306 (49%), Gaps = 62/306 (20%)

Query: 18  DITSRLWFTYRKGFVPI----GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           D  SR+W TYR  F PI    GD     DK     L     ++A AL  L LGRDW+   
Sbjct: 82  DFESRIWMTYRSNFPPIPRVEGD-----DKSASMTLGS---LLANALSTLVLGRDWRRGA 133

Query: 74  NSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGEWFGPNTVAQVLRKLAKYDD 132
             +EE+  ++L +F D  TAP+S+H+    GA S GK  GEWFGP+  A+ +  L+    
Sbjct: 134 RFEEES--QLLSLFADTPTAPFSVHRFVKHGAESCGKFPGEWFGPSATAKCIEALSSQCG 191

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
             ++  +V+ D +    +V +    N   +S+  +QP ++++  RLGI  I PVY +G+K
Sbjct: 192 SPTLKVYVSNDTS----EVYQDRFMNVARNSSGVFQPTLILLGTRLGIDHITPVYWDGLK 247

Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
               LP                           QS+G+ GG+P+ + YF+G  G+ + +L
Sbjct: 248 ATLQLP---------------------------QSVGIAGGRPSASHYFVGAQGSHLFYL 280

Query: 253 DPH------TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI----AVVSQRS 302
           DPH       ++  G +Y KE+     +D TYH  +  R+H+  MDPS+     +  Q  
Sbjct: 281 DPHYTRPALPDRQGGELYSKEE-----VD-TYHTRRLRRIHVRDMDPSMLIGFLIRDQED 334

Query: 303 YSDYKN 308
           + D+ N
Sbjct: 335 WDDWLN 340


>gi|37360148|dbj|BAC98052.1| mKIAA0943 protein [Mus musculus]
 gi|148707989|gb|EDL39936.1| autophagy-related 4B (yeast), isoform CRA_d [Mus musculus]
          Length = 266

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 78/204 (38%), Positives = 104/204 (50%), Gaps = 56/204 (27%)

Query: 119 TVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN-------------------- 158
           +V    RKLA +D WSS+  H+A+DNT+V+ ++++LC  N                    
Sbjct: 20  SVLAFCRKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRANLPCAGAAALPTDSERHCNGF 79

Query: 159 ---KRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                 ++ P  W+PLVL+IPLRLG+ DIN  Y+  +K C                    
Sbjct: 80  PAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYVETLKHC-------------------- 119

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                  F  PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q       +  DS    
Sbjct: 120 -------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIP 168

Query: 275 DSTYHCPQ-ASRLHILHMDPSIAV 297
           D ++HC    SR+ I  +DPSIAV
Sbjct: 169 DESFHCQHPPSRMGIGELDPSIAV 192


>gi|307190831|gb|EFN74681.1| Cysteine protease ATG4B [Camponotus floridanus]
          Length = 115

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 59/93 (63%), Positives = 71/93 (76%)

Query: 67  RDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
           +DWQW   +K   YLKIL  FED+R A +SIHQIAL GASEGK VG+WFGPNT+AQVL+K
Sbjct: 15  KDWQWMPETKNSTYLKILSRFEDKRAAAFSIHQIALMGASEGKEVGQWFGPNTIAQVLKK 74

Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNK 159
           L  YD+WSSI  HVALDNTL++N + K    +K
Sbjct: 75  LVVYDEWSSITIHVALDNTLIINDICKYAVISK 107


>gi|74665877|sp|Q4U3V5.1|ATG4_CRYPA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|66576169|gb|AAY51673.1| putative cysteine protease Atg4 [Cryphonectria parasitica]
          Length = 459

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 93/312 (29%), Positives = 143/312 (45%), Gaps = 67/312 (21%)

Query: 18  DITSRLWFTYRKGFVPIGDS----------------------GLTTDKGWGCMLRCGQMV 55
           D  SR+W TYR  F PI  S                      G ++D GWGCM+R GQ +
Sbjct: 127 DFESRVWMTYRSEFEPISKSNDPRASAALSFAMRLRTLADQGGFSSDTGWGCMIRSGQSL 186

Query: 56  IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGEW 114
           +A  L+   LGRDW+    +++E   +IL  F D   APYS+H     GA + GK  GEW
Sbjct: 187 LANTLVICQLGRDWRRGKAARQER--EILARFADDPRAPYSLHNFVRHGAVACGKFPGEW 244

Query: 115 FGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
           FGP+  A+ ++ LA  ++ S  V+       +  +    +   +        + P ++++
Sbjct: 245 FGPSATARCIQALANSNESSLRVYSTGDLPDVYEDSFMAVAKPDGET-----FHPTLILV 299

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
             RLGI  IN VY                   + L++T  M         PQS+G+ GG+
Sbjct: 300 GTRLGIDKINQVYW------------------EALTATLQM---------PQSVGIAGGR 332

Query: 235 PNHALYFIGY--------VGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
           P+ + YFIG          G+ + +LDPH T   +    D +Q +   ++ T H  +  R
Sbjct: 333 PSASHYFIGAQRSGDAYEPGSYLFYLDPHCTRPALPFHEDVDQYTSDDIN-TCHTRRLRR 391

Query: 286 LHILHMDPSIAV 297
           LH+  MDPS+ +
Sbjct: 392 LHVRDMDPSMLI 403


>gi|164660504|ref|XP_001731375.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
 gi|159105275|gb|EDP44161.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
          Length = 651

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 90/283 (31%), Positives = 131/283 (46%), Gaps = 56/283 (19%)

Query: 39  LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA-YLKILKMFEDRRT--APY 95
            T+D GWGCMLR  Q ++A AL+ +HLGR W+     K    Y +IL  F D  +   P+
Sbjct: 313 FTSDVGWGCMLRSVQSMLANALIRVHLGRHWRRRAKQKTHPQYARILSWFMDDPSLECPF 372

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
           SIH++   G   G   G+WFGP+T A  L KL +  D   +   V  D  L   QV    
Sbjct: 373 SIHRLVDEGQRLGVQAGDWFGPSTAAFALCKLIQAYDACGLGVVVTNDGMLYKEQVVAAS 432

Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
               R  S+P  +P+++++  RLG+  + P Y   +K+                      
Sbjct: 433 FAPGR--SDPWTRPVLILLVQRLGLDQVPPHYRPALKQ---------------------- 468

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------TNQNIG 261
                 FT PQS+GV+GG+P  +LYF+G     ++ LDPH              T  ++G
Sbjct: 469 -----SFTMPQSVGVVGGRPRSSLYFVGVQREHLLCLDPHHVRPCVPFRSPPRMTRASVG 523

Query: 262 CVYD---------KEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
              D         +E  + ++LDS +H P  S L I  MDPS+
Sbjct: 524 ASTDLASTVSPWFEEAYTAEELDS-FHTPHTSLLPISQMDPSM 565


>gi|409050837|gb|EKM60313.1| hypothetical protein PHACADRAFT_179659 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 1009

 Score =  131 bits (329), Expect = 4e-28,   Method: Composition-based stats.
 Identities = 85/297 (28%), Positives = 137/297 (46%), Gaps = 86/297 (28%)

Query: 18  DITSRLWFTYRKGFVPI-------------------------------GDSGLTTDKGWG 46
           D TSR+W TYR  F+PI                               GD   ++D GWG
Sbjct: 311 DFTSRIWLTYRSQFLPIRDMSLEELNAAPESAALSTGSQAKKWSWSLSGDKCWSSDAGWG 370

Query: 47  CMLRCGQMVIAQALLFLHLGRDWQWNVN----SKEEAYLKILKMFEDRRT--APYSIHQI 100
           CMLR GQ ++A AL+ +HLGRDW+   +    S    Y++I+  F D  +   P+S+H++
Sbjct: 371 CMLRTGQSLLANALIHVHLGRDWRKPPHPVPTSDYATYIQIITWFFDDPSLLCPFSVHRM 430

Query: 101 ALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW----------------SSIVFHVALDN 144
           AL G   G  VG+WFGP+T A  ++ ++ +                   + +  +VA D 
Sbjct: 431 ALVGKQLGVKVGQWFGPSTAAGAIKYVSAHSSMVPNQPARRTLVHAFPEAGLGIYVAADG 490

Query: 145 TLVVNQ----VKKLCTTNKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
             + +            + R  +   W  +P++++I  RLGI  +NP+Y           
Sbjct: 491 GTIYDSEVFAASHSGIGSPRRHTRRVWGDRPVLILIGHRLGIDGVNPIY----------- 539

Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
               YD +K L            +T+PQS+G+ GG+P+ + YF+G   +++ +LDPH
Sbjct: 540 ----YDTLKTL------------YTWPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 580


>gi|336381646|gb|EGO22797.1| cysteine protease required for autophagy [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 992

 Score =  130 bits (328), Expect = 5e-28,   Method: Composition-based stats.
 Identities = 92/283 (32%), Positives = 141/283 (49%), Gaps = 72/283 (25%)

Query: 18  DITSRLWFTYRKGFVPIGDS----------------------------------GLTTDK 43
           D TSR+W TYR  F PI DS                                  G T+D 
Sbjct: 301 DFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPVGGEKGWTSDA 360

Query: 44  GWGCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSI 97
           GWGCMLR GQ ++A ALL LHLGRDW+   + V++ + A Y++I+  F D  +  +P+S+
Sbjct: 361 GWGCMLRTGQSLLANALLHLHLGRDWRRPPYPVHTTDYATYVQIITWFFDTPSPQSPFSV 420

Query: 98  HQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT 157
           H++AL G   GK VG+WFGP+T A  ++ L      + +   VA D  +  + V      
Sbjct: 421 HRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHAFPEAGLGVSVASDGVIFQSDVYAASNA 480

Query: 158 ---NKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
              + R  +   W  + ++++I +RLG+  +NP+Y               YD +K L   
Sbjct: 481 YIGSPRRHAKVSWGGRAVIVLIGIRLGLDGVNPIY---------------YDTIKAL--- 522

Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                    +TFPQS+G+ GG+P+ + YF+G   +++ +LDPH
Sbjct: 523 ---------YTFPQSVGIAGGRPSSSYYFMGSQADNLFYLDPH 556


>gi|85067704|ref|XP_959438.1| hypothetical protein NCU02433 [Neurospora crassa OR74A]
 gi|62899773|sp|Q7S3X7.1|ATG4_NEUCR RecName: Full=Probable cysteine protease atg-4; AltName:
           Full=Autophagy-related protein 4
 gi|28920860|gb|EAA30202.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 506

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 91/305 (29%), Positives = 137/305 (44%), Gaps = 60/305 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR  F  I  S                       G ++D GWGCM+R GQ 
Sbjct: 174 DFESRIWMTYRTDFALIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQS 233

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A A+L   LGR+W+   +   E    I+ +F D   APYS+H     GA+  GK  GE
Sbjct: 234 LLANAILIARLGREWRRGTDLDAEK--DIIALFADDPRAPYSLHNFVKYGATACGKYPGE 291

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ LA        V+       +  +    +   + R      +QP +++
Sbjct: 292 WFGPSATARCIQALADEKQSGLRVYSTGDLPDVYEDSFMAVANPDGRG-----FQPTLIL 346

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  IN VY                   + L ST  +         PQS+G+ GG
Sbjct: 347 VCTRLGIDKINQVY------------------EEALISTLQL---------PQSIGIAGG 379

Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YF+G  G  + +LDP H    +    D    + ++LD T H  +  +LHI  MD
Sbjct: 380 RPSSSHYFVGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELD-TCHTRRLRQLHIGDMD 438

Query: 293 PSIAV 297
           PS+ +
Sbjct: 439 PSMLI 443


>gi|336467357|gb|EGO55521.1| hypothetical protein NEUTE1DRAFT_85886 [Neurospora tetrasperma FGSC
           2508]
 gi|350288001|gb|EGZ69237.1| hypothetical protein NEUTE2DRAFT_94213 [Neurospora tetrasperma FGSC
           2509]
          Length = 506

 Score =  130 bits (326), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 91/305 (29%), Positives = 137/305 (44%), Gaps = 60/305 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR  F  I  S                       G ++D GWGCM+R GQ 
Sbjct: 174 DFESRIWMTYRTDFAFIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQS 233

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A A+L   LGR+W+   +   E    I+ +F D   APYS+H     GA+  GK  GE
Sbjct: 234 LLANAILIARLGREWRRGTDLDAEK--DIIALFADDPRAPYSLHNFVKYGATACGKYPGE 291

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ LA        V+       +  +    +   + R      +QP +++
Sbjct: 292 WFGPSATARCIQALADEKQSGLRVYSTGDLPDVYEDSFMAVANPDGRG-----FQPTLIL 346

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  IN VY                   + L ST  +         PQS+G+ GG
Sbjct: 347 VCTRLGIDKINQVY------------------EEALISTLQL---------PQSIGIAGG 379

Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YF+G  G  + +LDP H    +    D    + ++LD T H  +  +LHI  MD
Sbjct: 380 RPSSSHYFVGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELD-TCHTRRLRQLHIGDMD 438

Query: 293 PSIAV 297
           PS+ +
Sbjct: 439 PSMLI 443


>gi|58260832|ref|XP_567826.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|134117209|ref|XP_772831.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|338817600|sp|P0CQ11.1|ATG4_CRYNB RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|338817601|sp|P0CQ10.1|ATG4_CRYNJ RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|50255449|gb|EAL18184.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57229907|gb|AAW46309.1| conserved hypothetical protein [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 1193

 Score =  130 bits (326), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 86/269 (31%), Positives = 126/269 (46%), Gaps = 85/269 (31%)

Query: 36  DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ---------WNVNSKEEAYLK---- 82
           + GLT+D GWGCMLR GQ ++  AL+ +HLGRDW+             ++E A LK    
Sbjct: 559 ERGLTSDAGWGCMLRTGQSLLVNALIHIHLGRDWRVPSTPASFSEATTTQEIAALKDYAK 618

Query: 83  ---ILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK-------- 129
              +L  F D  +   P+S+H++AL G   GK VGEWFGP+T A  L+ LA         
Sbjct: 619 YAQMLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANSFAPCGVA 678

Query: 130 ---------------------YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW- 167
                                 DDW+SI        +   N  KK    +  A    +W 
Sbjct: 679 VATATDSIIYKSDVYTASNLPSDDWNSI--------SPTFNSSKKKRRGDNEAKEE-KWG 729

Query: 168 -QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
            + +++++ +RLG+  +NP+Y               YD +K L            FTFPQ
Sbjct: 730 KRAVLILVGVRLGLDGVNPIY---------------YDSIKAL------------FTFPQ 762

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
           S+G+ GG+P+ + YF+G   N + +LDPH
Sbjct: 763 SVGIAGGRPSSSYYFVGSQANHLFYLDPH 791


>gi|320169048|gb|EFW45947.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 918

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 98/328 (29%), Positives = 147/328 (44%), Gaps = 82/328 (25%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN----- 72
           D  + + F+YRK F  I  S  TTD GWGC LR  QM++A+AL+    GR W+       
Sbjct: 301 DFQTLVCFSYRKDFERIPGSKHTTDCGWGCTLRSAQMLVAEALVLQIFGRRWRIEDRSCP 360

Query: 73  ---VNSKEEAYLKILKMFED--RRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRK 126
               +SKE+    I+++F+D  R  +P+SIH I   G     K  G+WFGP +V +V   
Sbjct: 361 APLSSSKEDQLRLIIRLFQDQLRLDSPFSIHNIVQHGCQLFDKRAGDWFGPASVVRVFAD 420

Query: 127 L---AKYDDWSSIVFHVALDNTLVVNQVKKLC---------------TTNKRASSNPQWQ 168
           L   A     S    + A+D+ +  + V +LC               +T++  S++    
Sbjct: 421 LINQAYAMHQSPFRAYQAIDHIIYRDLVAELCSGPDAVRDLEFSTPTSTSESVSTDETVT 480

Query: 169 PLVLV-----------------IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
           P                     +PLRLG+ +IN +YI  +K                   
Sbjct: 481 PSASTSQSPPVLPPPFIPLLILMPLRLGLNEINRMYIPCLKAL----------------- 523

Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC--VYDKEQD 269
                         Q +G+IGG+P H+LYF+GY  ++VIF DPH     GC    D +Q 
Sbjct: 524 ----------LMCAQCVGIIGGRPRHSLYFVGYQEDNVIFADPH-----GCKRFVDMQQT 568

Query: 270 SEKKLDSTYHCPQASRLHILHMDPSIAV 297
           S      T+H    +++   HMDPS+A+
Sbjct: 569 SFPT--ETFHSAVPNKIPFTHMDPSMAI 594


>gi|268536436|ref|XP_002633353.1| Hypothetical protein CBG06097 [Caenorhabditis briggsae]
          Length = 411

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 85/286 (29%), Positives = 141/286 (49%), Gaps = 64/286 (22%)

Query: 40  TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW--------NVNSKEEAY-------LKIL 84
           TTD GWGCM+R  QM++AQA++    GR+W++         VN +E  +         IL
Sbjct: 88  TTDCGWGCMIRTTQMMVAQAIMINRFGRNWRFVRRKKSHVTVNGEETEFDTEKMKEWMIL 147

Query: 85  KMFEDRRTAPYSIHQIALTGASE--GKAVGEWFGPNTVAQVLRKLAKYDDWSSI----VF 138
           K+FED+ +AP  IH++    A E   +AVG W+ P+    +++K A  +  S +    V 
Sbjct: 148 KLFEDKPSAPLGIHKMIEIAAREKGKRAVGCWYSPSEAVFIMKK-AITESASPLTGDTVM 206

Query: 139 HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
           ++++D  +    ++ L    K  +       L+LVI +RLG  ++N +Y+  + +     
Sbjct: 207 YLSIDGRV---HIRDLEVETKHWTKT-----LMLVIVVRLGAAELNRIYVPHLMRL---- 254

Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
                                  F+    LG+ GG+P+H+ +F+GY G+ VI+LDPH   
Sbjct: 255 -----------------------FSMDSCLGITGGRPDHSCWFVGYYGDQVIYLDPHVAH 291

Query: 259 N---IGCVYDKEQDSEKK----LDSTYHCPQASRLHILHMDPSIAV 297
               I   ++  Q+  KK     + +YHC   S++H L MDPS A+
Sbjct: 292 EYIPIDMDFNTSQEDPKKPKKCPERSYHCRLLSKMHFLDMDPSCAL 337


>gi|169622773|ref|XP_001804795.1| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
 gi|160704853|gb|EAT78153.2| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
          Length = 357

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/197 (36%), Positives = 101/197 (51%), Gaps = 26/197 (13%)

Query: 18  DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
           D  SR+W TYR GF PI  S                     G T+D G+GCM+R GQ ++
Sbjct: 99  DFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCMIRSGQCIL 158

Query: 57  AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
           A AL  L LGRDW+W  N  ++ + +IL +F D   AP+SIH+    GA+  GK  GEWF
Sbjct: 159 ANALQILRLGRDWRWQENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAVCGKYPGEWF 218

Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
           GP+  A+ ++ LA     + +  +V+ D   V     K    ++       WQP ++++ 
Sbjct: 219 GPSAAARCIQDLANKHREAGLKVYVSGDGADVYEDKLKQVAVDEDG----LWQPTLILVG 274

Query: 176 LRLGIQDINPVYINGIK 192
            RLGI  I PVY   +K
Sbjct: 275 TRLGIDKITPVYWEALK 291


>gi|166990663|sp|Q2HH40.2|ATG4_CHAGB RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
          Length = 448

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 94/305 (30%), Positives = 142/305 (46%), Gaps = 60/305 (19%)

Query: 18  DITSRLWFTYRKGFVPI----------------------GD-SGLTTDKGWGCMLRCGQM 54
           D  SR+W TYR GF PI                      GD +G ++D GWGCM+R GQ 
Sbjct: 116 DFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRSGQS 175

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A ALL   LGRDW+   +   E    I+ +F D   APYS+      GA + GK  GE
Sbjct: 176 LLANALLISQLGRDWRRTTDPGAER--NIVALFADDARAPYSLQNFVKHGAIACGKHPGE 233

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ LA   + S  ++     +   V +   L T      +   + P +++
Sbjct: 234 WFGPSATARCIQALADQHESSLRIYSTG--DLPDVYEDSFLATARPDGET---FHPTLIL 288

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  INPVY                   + L ST  M+         QS+G+ GG
Sbjct: 289 VCTRLGIDKINPVY------------------EEALISTLQME---------QSIGIAGG 321

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YF+G     + +LDPH  +      +   + + ++LDS  H  +   LH+  MD
Sbjct: 322 RPSSSHYFVGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSC-HTRRLRYLHVEDMD 380

Query: 293 PSIAV 297
           PS+ +
Sbjct: 381 PSMLI 385


>gi|321263995|ref|XP_003196715.1| hypothetical protein CGB_K2500C [Cryptococcus gattii WM276]
 gi|317463192|gb|ADV24928.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 1188

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 83/261 (31%), Positives = 122/261 (46%), Gaps = 69/261 (26%)

Query: 36  DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW---------NVNSKEEAYLK---- 82
           + GLT+D GWGCMLR GQ ++  AL+ +HLGRDW+             S+E A LK    
Sbjct: 557 ERGLTSDAGWGCMLRTGQSLLINALIHVHLGRDWRLPSTPATFSEATTSQEIAALKDYAK 616

Query: 83  ---ILKMFEDRRTA--PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
              ++  F D  +   P+S+H++AL G   GK VGEWFGP+T A  L+ LA       I 
Sbjct: 617 YAQMVSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGTLKTLANSFAPCGIA 676

Query: 138 FHVALDNTL---------------------VVNQVKKLCTTNKRASSNPQW--QPLVLVI 174
              A D+ +                       N  +K    N  A    +W  + +++++
Sbjct: 677 VATATDSIIYRSDVYAASNLPSDDWNRISPTFNPSRKKKRHNAEAKEG-KWGERAVLILV 735

Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
            +RLG+  +NP+Y               YD +K L            FTFPQ+ G  GG+
Sbjct: 736 GIRLGLDGVNPIY---------------YDSIKAL------------FTFPQAGGSAGGR 768

Query: 235 PNHALYFIGYVGNDVIFLDPH 255
           P+ + YF+G   N + +LDPH
Sbjct: 769 PSSSYYFVGSQANHLFYLDPH 789


>gi|403296347|ref|XP_003939073.1| PREDICTED: cysteine protease ATG4D [Saimiri boliviensis
           boliviensis]
          Length = 463

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 95/322 (29%), Positives = 135/322 (41%), Gaps = 103/322 (31%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 131 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 190

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   +E  + +I+  F D   AP+
Sbjct: 191 WAEGTGLGPPELSGPASPSRYHGPARWMPPCWAQGAPELEQERRHRQIVSWFADHPQAPF 250

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
            +H++   G S GK  G+W+GP+ VA +   L K  + SS V       T +V  V + C
Sbjct: 251 GLHRLVELGQSSGKKAGDWYGPSLVAHI---LRKAVESSSEV-------TRLVVYVSQDC 300

Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
           T   + +  P  Q L+                                            
Sbjct: 301 T--GKGTCTPSLQELL-------------------------------------------- 314

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
              R E      LG++GGKP H+LYFIGY  + +++LDPH  Q      D  Q +   L+
Sbjct: 315 ---RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQ-ANFPLE 363

Query: 276 STYHCPQASRLHILHMDPSIAV 297
           S +HC    ++    MDPS  V
Sbjct: 364 S-FHCTSPRKMAFAKMDPSCTV 384


>gi|426329870|ref|XP_004025954.1| PREDICTED: cysteine protease ATG4C [Gorilla gorilla gorilla]
          Length = 491

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 100/371 (26%), Positives = 150/371 (40%), Gaps = 112/371 (30%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 75  NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNE 194

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 195 VYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
            I  +VA D T+    V    + +   S N   + +++++P+RLG +  N  Y+  +K  
Sbjct: 255 GITIYVAQDCTVYNYDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI---- 250
                        ILS  Y              +G+IGGKP  + YF G+  N+V     
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQENEVQRSSM 346

Query: 251 -FLDPHTNQNIGCVYDKEQDSEKKLDS-----------------------TYHCPQASRL 286
             L   +++N   +   E+  +    S                       T+HCP   ++
Sbjct: 347 NSLKQKSSKNNLKLEGSEKRPQMGFGSEDEFKNILLDHVQAFGPPSYPRLTFHCPSPKKM 406

Query: 287 HILHMDPSIAV 297
               MDPS  +
Sbjct: 407 SFRKMDPSCTI 417


>gi|62899783|sp|Q86ZL5.1|ATG4_PODAS RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|27802993|emb|CAD60696.1| unnamed protein product [Podospora anserina]
          Length = 500

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 90/308 (29%), Positives = 138/308 (44%), Gaps = 74/308 (24%)

Query: 18  DITSRLWFTYRKGF--VP--------------------IGD-SGLTTDKGWGCMLRCGQM 54
           D  SR+W TYR GF  +P                     GD +G ++D GWGCM+R GQ 
Sbjct: 176 DFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRSGQS 235

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A A+L    GR W+   N   E   +I+ +F D   APYSI      GA+  GK  GE
Sbjct: 236 LLANAMLISRAGRAWRRTTNPDIE--REIVCLFADDPRAPYSIQNFVNHGAAACGKYPGE 293

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP---QWQPL 170
           WFGP+  A+ +  L  Y                +   + ++   N  +++NP    + P 
Sbjct: 294 WFGPSATARCIHSLRVY----------------LTRDLPEVYEDNFMSTANPDGNHFHPT 337

Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
           ++++  RLGI  INP+Y   +     L                           PQ++G+
Sbjct: 338 LILVSTRLGIDKINPIYHEALISTLQL---------------------------PQAIGI 370

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHIL 289
            GG+P+ + YFIG  G  + +LDPH  +      +   D + ++LDS  H  +   LH+ 
Sbjct: 371 AGGRPSSSHYFIGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSC-HTRRLRHLHVE 429

Query: 290 HMDPSIAV 297
            MDPS+ +
Sbjct: 430 DMDPSMLI 437


>gi|67967551|dbj|BAE00258.1| unnamed protein product [Macaca fascicularis]
          Length = 330

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 83/288 (28%), Positives = 130/288 (45%), Gaps = 71/288 (24%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNS-------------------------------- 75
           MLR GQM++AQ LL   L RDW W   +                                
Sbjct: 1   MLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60

Query: 76  ---KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYD 131
              +E  + +I+  F D   AP+ +H++   G S GK  G+W+GP+ VA +LRK +    
Sbjct: 61  ELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCS 120

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
           + + +V +V+ D T+    V +L     R     +W+ +V+++P+RLG + +NPVY+  +
Sbjct: 121 EVTRLVVYVSQDCTVYKADVARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCV 177

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
           K+                         R E      LG++GGKP H+LYFIGY  + +++
Sbjct: 178 KELL-----------------------RCELC----LGIMGGKPRHSLYFIGYQDDFLLY 210

Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVVS 299
           LDPH  Q    V   +   E     ++HC    ++    MDPS  V S
Sbjct: 211 LDPHYCQPTVDVSQADFPLE-----SFHCTSPRKMAFAKMDPSCTVGS 253


>gi|328868883|gb|EGG17261.1| autophagy protein 4 [Dictyostelium fasciculatum]
          Length = 616

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 100/370 (27%), Positives = 151/370 (40%), Gaps = 120/370 (32%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD-- 68
           ++E+   D  S LWF+YRK F  I ++ +TTD GWGCMLR GQM++A+ALL      +  
Sbjct: 193 EVERFLEDFKSILWFSYRKDFPSIENTSITTDIGWGCMLRTGQMILARALLKHFYNNENI 252

Query: 69  ---WQWNVNSKEEAYLKILKMFED--RRTAPYSIHQIA-----LTGASEGK--------- 109
               +   NSK   Y KI+  F D   +   YSIHQI      +T  +  K         
Sbjct: 253 PYGEKIKTNSK---YKKIMSWFCDYPSKENFYSIHQIVHKNKIITKYNNSKLKDFDIDSD 309

Query: 110 ------AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK----------- 152
                  V EWF P  +A VL+ L K    SSI  +V  D  +  ++V            
Sbjct: 310 DQDDWNNVDEWFAPTKIAVVLKLLVKSHHSSSIAMYVPSDGVVYKDRVAKICTIRDDQSA 369

Query: 153 --------------KLCTTNKRASSN--------------------------------PQ 166
                         KL +T   +S N                                  
Sbjct: 370 PARVPLSLSLPAGIKLFSTTSPSSPNLFVPSQSTGNSMEDQSFLVGEEEDNTDNNSNQSN 429

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W+ L++++P++LG+  +N +Y +GIK    +P                            
Sbjct: 430 WKSLIILVPVKLGLDKLNEIYFSGIKAMLQMP---------------------------S 462

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
           S+G+IGGKP  + YF+G+    +I+LDPH       V+D     +    ++YH     ++
Sbjct: 463 SIGLIGGKPKQSFYFVGFQDEHIIYLDPH------FVHDTIHPFDSNFLNSYHDCIPQKM 516

Query: 287 HILHMDPSIA 296
           H   +DPS+A
Sbjct: 517 HFSQIDPSMA 526


>gi|380092671|emb|CCC09424.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 515

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 90/305 (29%), Positives = 137/305 (44%), Gaps = 60/305 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR  F  I  S                       G ++D GWGCM+R GQ 
Sbjct: 183 DFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQS 242

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A A+L   LGR+W+   +   E    I+ +F D   AP+S+H     GA+  GK  GE
Sbjct: 243 LLANAILVARLGREWRRETDLDAEK--DIIALFADDPRAPFSLHNFVKYGATACGKYPGE 300

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP   A+ ++ L    +    V+       +  +    +   + R      +QP +++
Sbjct: 301 WFGPLATARCIQALTDEKESGLRVYSTGDLPDVYEDSFMAVANPDGRG-----FQPTLIL 355

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  IN VY                   + L ST  +         PQS+G+ GG
Sbjct: 356 VCTRLGIDKINQVY------------------EEALISTLQL---------PQSIGIAGG 388

Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YFIG  G  + +LDP H    +    D +  + ++LD T H  +  +LHI  MD
Sbjct: 389 RPSSSHYFIGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELD-TCHTRRLRQLHIDDMD 447

Query: 293 PSIAV 297
           PS+ +
Sbjct: 448 PSMLI 452


>gi|299738612|ref|XP_001834660.2| cysteine protease [Coprinopsis cinerea okayama7#130]
 gi|298403389|gb|EAU87108.2| cysteine protease [Coprinopsis cinerea okayama7#130]
          Length = 1034

 Score =  127 bits (319), Expect = 6e-27,   Method: Composition-based stats.
 Identities = 92/282 (32%), Positives = 138/282 (48%), Gaps = 71/282 (25%)

Query: 18  DITSRLWFTYRKGF-VPIGDSGL-------------------------------TTDKGW 45
           D TSR+W TYR  F  PI D  L                               ++D GW
Sbjct: 305 DFTSRIWLTYRSHFPQPIKDGRLADLCGGPQPEPVASPVTKKSPWHWVGGEKSWSSDSGW 364

Query: 46  GCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSIHQ 99
           GCMLR GQ ++A AL+ +HLGRDW+   + V + + A Y+ IL  F D     AP+S+H+
Sbjct: 365 GCMLRTGQSLLANALIHVHLGRDWRKPPYPVMTADYATYVHILTWFLDTPAPEAPFSVHR 424

Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV------KK 153
           +AL G   G  VG+WFGP+  A  ++ L      + I   VA+D  L    V        
Sbjct: 425 MALAGKELGTDVGQWFGPSVAAGAIKALVNSFPEAGIGVAVAVDGVLYQTDVHAASHGDH 484

Query: 154 LCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
              T +R   +   +P++L++ +RLGI+ +NP+Y               YD +K+L    
Sbjct: 485 FGRTPRRHKRSWGDRPVLLLLGIRLGIEGVNPIY---------------YDTIKML---- 525

Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                   +TFPQS+G+ GG+P+ + YF+G   +++ +LDPH
Sbjct: 526 --------YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 559


>gi|45185039|ref|NP_982756.1| ABL191Wp [Ashbya gossypii ATCC 10895]
 gi|62899767|sp|Q75E61.1|ATG4_ASHGO RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|44980675|gb|AAS50580.1| ABL191Wp [Ashbya gossypii ATCC 10895]
 gi|374105958|gb|AEY94868.1| FABL191Wp [Ashbya gossypii FDAG1]
          Length = 521

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 95/317 (29%), Positives = 154/317 (48%), Gaps = 70/317 (22%)

Query: 13  EQIRRDITSRLWFTYRKGFVPI-----GDSGLT------------------------TDK 43
           E+   D+ +RL FTYR  FVPI     G S ++                        TD 
Sbjct: 114 EEFLADVHTRLHFTYRTRFVPIPRHPNGPSPMSISVMLRDNPLNVIENVLNNPDCFQTDI 173

Query: 44  GWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALT 103
           GWGCM+R GQ ++A AL    LGRD++ + N+  E  L+I+K FED    P+S+H+    
Sbjct: 174 GWGCMIRTGQSLLANALQRACLGRDFRIDDNAANEHELRIIKWFEDDPKYPFSLHKFVQE 233

Query: 104 GAS-EGKAVGEWFGPNTVAQVLRKL-AKYDDWSSIVFHVALDNTLV-VNQVKKLCTTNKR 160
           G S  GK  GEWFGP+  ++ ++ L AK+         ++ D+  V +++V+ L   +  
Sbjct: 234 GFSLSGKKPGEWFGPSATSRSIQALVAKFPACGIAHCVISTDSGDVYMDEVEPLFRADPS 293

Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
           A+       ++L++ +RLG+  +N VY   I+               ILSS +       
Sbjct: 294 AA-------VLLLLCVRLGVDVVNEVYWEHIR--------------HILSSEH------- 325

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
                 S+G+ GG+P+ +LYF GY    + +LDPH  Q     Y ++ D    L  + H 
Sbjct: 326 ------SVGIAGGRPSSSLYFFGYQDEHLFYLDPHKPQLNLASYQQDLD----LFRSVHT 375

Query: 281 PQASRLHILHMDPSIAV 297
            + +++H+  +DPS+ +
Sbjct: 376 QRFNKVHMSDIDPSMLI 392


>gi|16551551|dbj|BAB71121.1| unnamed protein product [Homo sapiens]
          Length = 330

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 82/286 (28%), Positives = 127/286 (44%), Gaps = 71/286 (24%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVN--------------------------------- 74
           MLR GQM++AQ LL   L RDW W                                    
Sbjct: 1   MLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60

Query: 75  --SKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYD 131
              +E  + +I+  F D   AP+ +H++   G S GK  G+W+GP+ VA +LRK +    
Sbjct: 61  ELERERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCS 120

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
           D + +V +V+ D T+    V +L     R     +W+ +V+++P+RLG + +NPVY+  +
Sbjct: 121 DVTRLVVYVSQDCTVYKADVARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCV 177

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
           K+                         R E      LG++GGKP H+LYFIGY  + +++
Sbjct: 178 KELL-----------------------RCELC----LGIMGGKPRHSLYFIGYQDDFLLY 210

Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           LDPH  Q    V   +   E     ++HC    ++     DPS  V
Sbjct: 211 LDPHYCQPTVDVSQADFPLE-----SFHCTSPRKMAFAKTDPSCTV 251


>gi|402219068|gb|EJT99143.1| hypothetical protein DACRYDRAFT_70366 [Dacryopinax sp. DJM-731 SS1]
          Length = 1093

 Score =  125 bits (315), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 92/321 (28%), Positives = 147/321 (45%), Gaps = 78/321 (24%)

Query: 21  SRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL--------------- 65
           +R W  +    VP G   LT+D GWGCMLR GQM++A +L+ LH+               
Sbjct: 422 NRRWLAW----VP-GRGDLTSDAGWGCMLRTGQMLLANSLVALHVPPLPPNPVYINNFPA 476

Query: 66  -GRDWQWNVNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
                      + EAY+KIL  F D  +   P+S+H++AL GA  G+ VG+WFGP+  A 
Sbjct: 477 PSLPPSETDRQRFEAYVKILVWFLDDPSIWCPFSVHRLALAGADMGREVGQWFGPSIAAG 536

Query: 123 VLRKLAKYDDWSSIVFHVALDNTL-------------VVNQVKKLCTTNKRASS------ 163
            ++KL        +   V  D  +             + +    L  T  R +       
Sbjct: 537 SIKKLVSAFPACGLGVVVPPDQIIHETAVFTASHTPTLPSSASSLSNTRDREARERANRM 596

Query: 164 NPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
             +W  + ++++I LRLGI+ + P+Y               YD VK L            
Sbjct: 597 KEEWGDRAVLILIGLRLGIEGVTPIY---------------YDSVKAL------------ 629

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ---NIGCVYDKEQDSEKKLD--- 275
           FTFPQ++G+ GG+P+ + YF+G  G+ + +LDPH+ +    +    D   D+  +     
Sbjct: 630 FTFPQTVGIAGGRPSSSYYFVGTQGDHLFYLDPHSTRPAVPLRVPTDGPYDATGQFTLSE 689

Query: 276 -STYHCPQASRLHILHMDPSI 295
             T+H  +  ++HI  +DPS+
Sbjct: 690 MKTFHSDKVRKMHISGLDPSM 710


>gi|308802424|ref|XP_003078525.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
 gi|116056978|emb|CAL51405.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
          Length = 424

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 90/336 (26%), Positives = 144/336 (42%), Gaps = 81/336 (24%)

Query: 4   ANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
           +N++  ++ E   RD  SR W TYR+GF  +G +   TD GWGC LR  QM++A AL   
Sbjct: 58  SNEVGRREWE---RDWRSRCWMTYRRGFEALGRTKWCTDAGWGCTLRSAQMMLANALSIH 114

Query: 64  HLGRDWQWNVN--------------------------------------SKEEAYLKILK 85
             GR W+  V                                       +  +A   IL+
Sbjct: 115 SRGRHWRREVQLVAVHENETADDGSKSPAVSFLSGVVNKLKIPQSERTRAGSDAQEDILR 174

Query: 86  MFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKYDDWSSIVFHVALDN 144
           +F D   AP+SIH++       G   G WF P+ + +    L A++D  S +  HV    
Sbjct: 175 LFADEVGAPFSIHRVCEKTTEWGAPPGRWFEPSVMCRAFEALVAEHDLGSELTVHVVSGR 234

Query: 145 TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI-QDINPVYINGIKKCYALPISPVY 203
                 V  +     RA S    + L+L +P+ LG+ + IN  Y++ ++   A       
Sbjct: 235 EGEDGGVPTVDEAEVRAKSADVGKALLLFVPVVLGVGRTINARYLSQLRSMMA------- 287

Query: 204 DMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCV 263
                               F QS+G++GG+PN +LY +G+  +   +LDPHT Q    +
Sbjct: 288 --------------------FKQSVGIVGGRPNSSLYLVGHSDDVFFYLDPHTVQVASSM 327

Query: 264 YDKEQDSEKKLDSTYHCPQASRLHIL--HMDPSIAV 297
              + +S       Y+CP  + LH+    +DP++A+
Sbjct: 328 VTMDFES-------YYCP--TPLHVCGGDLDPTLAL 354


>gi|323303340|gb|EGA57136.1| Atg4p [Saccharomyces cerevisiae FostersB]
          Length = 494

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 94/316 (29%), Positives = 136/316 (43%), Gaps = 83/316 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 89  DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G    
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208

Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
            K  GEWFGP   A+ ++ L         DD    +  V+  B +  N+V+K+   N  +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD---CIVSVSSGB-IYENEVEKVFAENPNS 264

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                   ++ ++ ++LGI  +N  Y   I                ILSST         
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
               QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H  
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341

Query: 282 QASRLHILHMDPSIAV 297
           +  +L +  MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357


>gi|322707969|gb|EFY99546.1| ATG4 protein [Metarhizium anisopliae ARSEF 23]
          Length = 430

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 85/302 (28%), Positives = 133/302 (44%), Gaps = 60/302 (19%)

Query: 18  DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
           D  SR W TYR  F  +P                     +  SG T+D GWGCM+R GQ 
Sbjct: 126 DFASRFWMTYRSNFEIIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSGWGCMIRSGQS 185

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A A+  L LGRDW+  +    E   ++L +F D   APYSIH     G     K  GE
Sbjct: 186 LLANAMAVLDLGRDWRRGMLPDRER--RLLALFADDPRAPYSIHNFVRHGEKYCSKYPGE 243

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L         ++       +  +   K+   +        + P +++
Sbjct: 244 WFGPSATARCIQDLVNSRKQELRIYSTGDGPDIYEDNFMKIAKPDGEV-----FHPTLVL 298

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  I PVY                   + L ++  M          QS+G+ GG
Sbjct: 299 VGTRLGIDKITPVYW------------------EALIASVQMS---------QSVGIAGG 331

Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YF+G  G+ + +LDP HT + +    D  + +   +DS  H  +  R+H+  MD
Sbjct: 332 RPSSSHYFVGSQGHFLFYLDPHHTRKALPYYSDVARYTIDDMDSC-HTSRLRRIHVREMD 390

Query: 293 PS 294
           P+
Sbjct: 391 PN 392


>gi|348520913|ref|XP_003447971.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
          Length = 500

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 95/357 (26%), Positives = 146/357 (40%), Gaps = 101/357 (28%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           S  ++E+ R    SR+W TYR+ F  +  S  TTD GWGCMLR GQM++AQ LL   + R
Sbjct: 100 SEDEVERFRLAFVSRIWLTYRREFPQLEGSTWTTDCGWGCMLRSGQMLLAQGLLVHLMPR 159

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGE-------------- 113
           DW W     ++      ++F  R  A      I   G+  G +  E              
Sbjct: 160 DWVW--PESQQLTDVDFEVFRPRSPARAGGVPIPSFGSPRGSSTPEKSLPSSQAPRCSQK 217

Query: 114 --------------------WFG----------------------------PNTVAQVLR 125
                               WFG                            P+ VA +LR
Sbjct: 218 KRVHESTKDRQEHIHSRLVTWFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGPSVVAHILR 277

Query: 126 K-LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN-KRASSNPQ---WQPLVLVIPLRLGI 180
           K + K    +++  +VA D T+    V +LC  +  + SS+P    W+ +++++P+RLG 
Sbjct: 278 KAVDKTSVVTNLAVYVAQDCTVYKEDVVRLCDRSLNQTSSDPSSQDWKSVIILVPVRLGG 337

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
           + +NP YI+ +K    L                              +G+IGGKP H+LY
Sbjct: 338 EALNPSYIDCVKNFLKLDC---------------------------CIGIIGGKPKHSLY 370

Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           FIG+    +++LDPH  Q +  V       E     ++HC    ++    MDPS  +
Sbjct: 371 FIGFQDEQLLYLDPHYCQPVVDVSQINFSLE-----SFHCSSPKKMPFNRMDPSCTI 422


>gi|37362688|ref|NP_014176.2| Atg4p [Saccharomyces cerevisiae S288c]
 gi|61252248|sp|P53867.2|ATG4_YEAST RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|166990654|sp|A6ZRL7.1|ATG4_YEAS7 RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|1173491|gb|AAA86498.1| ORF494 [Saccharomyces cerevisiae]
 gi|151944321|gb|EDN62599.1| cysteine protease [Saccharomyces cerevisiae YJM789]
 gi|190409197|gb|EDV12462.1| anchor protein [Saccharomyces cerevisiae RM11-1a]
 gi|285814439|tpg|DAA10333.1| TPA: Atg4p [Saccharomyces cerevisiae S288c]
 gi|323352870|gb|EGA85172.1| Atg4p [Saccharomyces cerevisiae VL3]
 gi|392297128|gb|EIW08229.1| Atg4p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 494

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 89  DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G    
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208

Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
            K  GEWFGP   A+ ++ L         DD   IV   + D  +  N+V+K+   N  +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                   ++ ++ ++LGI  +N  Y   I                ILSST         
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
               QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H  
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341

Query: 282 QASRLHILHMDPSIAV 297
           +  +L +  MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357


>gi|1183991|emb|CAA93375.1| N1274 [Saccharomyces cerevisiae]
 gi|1302243|emb|CAA96126.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 506

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 101 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 160

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G    
Sbjct: 161 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 220

Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
            K  GEWFGP   A+ ++ L         DD   IV   + D  +  N+V+K+   N  +
Sbjct: 221 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 276

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                   ++ ++ ++LGI  +N  Y   I                ILSST         
Sbjct: 277 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 306

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
               QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H  
Sbjct: 307 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 353

Query: 282 QASRLHILHMDPSIAV 297
           +  +L +  MDPS+ +
Sbjct: 354 KFGKLQLSEMDPSMLI 369


>gi|256272398|gb|EEU07381.1| Atg4p [Saccharomyces cerevisiae JAY291]
          Length = 494

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 89  DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G    
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208

Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
            K  GEWFGP   A+ ++ L         DD   IV   + D  +  N+V+K+   N  +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                   ++ ++ ++LGI  +N  Y   I                ILSST         
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
               QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H  
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341

Query: 282 QASRLHILHMDPSIAV 297
           +  +L +  MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357


>gi|323335883|gb|EGA77161.1| Atg4p [Saccharomyces cerevisiae Vin13]
          Length = 494

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 89  DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G    
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208

Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
            K  GEWFGP   A+ ++ L         DD   IV   + D  +  N+V+K+   N  +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                   ++ ++ ++LGI  +N  Y   I                ILSST         
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
               QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H  
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341

Query: 282 QASRLHILHMDPSIAV 297
           +  +L +  MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357


>gi|323346814|gb|EGA81093.1| Atg4p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 494

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 89  DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G    
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208

Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
            K  GEWFGP   A+ ++ L         DD   IV   + D  +  N+V+K+   N  +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                   ++ ++ ++LGI  +N  Y   I                ILSST         
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
               QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H  
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341

Query: 282 QASRLHILHMDPSIAV 297
           +  +L +  MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357


>gi|259149141|emb|CAY82383.1| Atg4p [Saccharomyces cerevisiae EC1118]
          Length = 506

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 101 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 160

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G    
Sbjct: 161 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 220

Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
            K  GEWFGP   A+ ++ L         DD   IV   + D  +  N+V+K+   N  +
Sbjct: 221 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 276

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                   ++ ++ ++LGI  +N  Y   I                ILSST         
Sbjct: 277 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 306

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
               QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H  
Sbjct: 307 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 353

Query: 282 QASRLHILHMDPSIAV 297
           +  +L +  MDPS+ +
Sbjct: 354 KFGKLQLSEMDPSMLI 369


>gi|323307493|gb|EGA60764.1| Atg4p [Saccharomyces cerevisiae FostersO]
          Length = 494

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 89  DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G    
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208

Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
            K  GEWFGP   A+ ++ L         DD   IV   + D  +  N+V+K+   N  +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                   ++ ++ ++LGI  +N  Y   I                ILSST         
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
               QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H  
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341

Query: 282 QASRLHILHMDPSIAV 297
           +  +L +  MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357


>gi|349580723|dbj|GAA25882.1| K7_Atg4p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 494

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 89  DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G    
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208

Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
            K  GEWFGP   A+ ++ L         DD   IV   + D  +  N+V+K+   N  +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                   ++ ++ ++LGI  +N  Y   I                ILSST         
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
               QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H  
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341

Query: 282 QASRLHILHMDPSIAV 297
           +  +L +  MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357


>gi|365763488|gb|EHN05016.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 494

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 89  DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G    
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208

Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
            K  GEWFGP   A+ ++ L         DD   IV   + D  +  N+V+K+   N  +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                   ++ ++ ++LGI  +N  Y   I                ILSST         
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
               QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H  
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341

Query: 282 QASRLHILHMDPSIAV 297
           +  +L +  MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357


>gi|207341865|gb|EDZ69806.1| YNL223Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 371

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 96/317 (30%), Positives = 136/317 (42%), Gaps = 83/317 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 101 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 160

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G    
Sbjct: 161 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 220

Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
            K  GEWFGP   A+ ++ L         DD   IV   + D  +  N+V+K+   N  +
Sbjct: 221 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 276

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
                   ++ ++ ++LGI  +N  Y   I                ILSST         
Sbjct: 277 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 306

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
               QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H  
Sbjct: 307 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 353

Query: 282 QASRLHILHMDPSIAVV 298
           +  +L +  MDP  ++V
Sbjct: 354 KFGKLQLSEMDPRCSLV 370


>gi|302498547|ref|XP_003011271.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
           benhamiae CBS 112371]
 gi|291174820|gb|EFE30631.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
           benhamiae CBS 112371]
          Length = 437

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 91/310 (29%), Positives = 131/310 (42%), Gaps = 97/310 (31%)

Query: 14  QIRRDITSRLWFTYRKGFVPI--------GDS-----------------GLTTDKGWGCM 48
           Q   D  S+LW TYR  F PI        GDS                 G T+D GWGCM
Sbjct: 145 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSISLGVRLRSQLIDTQGFTSDTGWGCM 204

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE- 107
           +R GQ ++A  LLFL LGRDW+     +EE+ L  + +F D   AP+SIH+    GA+  
Sbjct: 205 IRSGQALLANTLLFLRLGRDWRRGSKVQEESEL--VSLFADHPRAPFSIHRFVHHGATAC 262

Query: 108 GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
           GK  GEWFGP+  +Q ++ L K +    +  ++  D + +  +  K    ++   S    
Sbjct: 263 GKCPGEWFGPSAASQCIQALVKSNPQVGLRVYITSDGSDIYEKQFKEVACDE---SGGGI 319

Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
           QP ++++ +RLGI  + PVY               +D +K L              FPQS
Sbjct: 320 QPTLILLGVRLGIDRVTPVY---------------WDSLKAL------------LRFPQS 352

Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
           +G+ G  P                                        STYH  +  RLH
Sbjct: 353 VGIAG--PEEL-------------------------------------STYHTRRLRRLH 373

Query: 288 ILHMDPSIAV 297
           +  MDPS+ +
Sbjct: 374 VREMDPSMLI 383


>gi|406698456|gb|EKD01693.1| hypothetical protein A1Q2_04064 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 1295

 Score =  123 bits (309), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 87/279 (31%), Positives = 122/279 (43%), Gaps = 69/279 (24%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
           LSH       R      W     G+V  G+ GLT+D GWGCMLR GQ ++A AL+ LHLG
Sbjct: 512 LSHSQTMMPSRQSGGGAW-----GWVKGGERGLTSDAGWGCMLRTGQSMLANALIHLHLG 566

Query: 67  RDWQWNVNSKE---------------EAYLKILKMFEDRRT--APYSIHQIALTGASEGK 109
           R W+                        Y+++L  F D  +   P+S+H+ AL G   GK
Sbjct: 567 RGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFALIGKELGK 626

Query: 110 AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLV---VNQVKKLCT--TNKRASSN 164
            VGEWFGP+T A  L+ LA       +    A D ++    V Q   L T  T     S 
Sbjct: 627 EVGEWFGPSTAAGALKTLANSFPPCGLSVVSAADGSVFRSEVYQASNLPTDWTTGAKPSR 686

Query: 165 PQ------W--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
           P       W  + +++VIP RLG+  +NP+Y + IK                        
Sbjct: 687 PNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK------------------------ 722

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                     S+G+ GG+P+ + YF+    N + +LDPH
Sbjct: 723 ----------SVGIAGGRPSSSYYFVASQANSLFYLDPH 751


>gi|401886473|gb|EJT50506.1| hypothetical protein A1Q1_00204 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 1295

 Score =  123 bits (308), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 87/279 (31%), Positives = 122/279 (43%), Gaps = 69/279 (24%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
           LSH       R      W     G+V  G+ GLT+D GWGCMLR GQ ++A AL+ LHLG
Sbjct: 512 LSHSQTMMPSRQSGGGAW-----GWVKGGERGLTSDAGWGCMLRTGQSMLANALIHLHLG 566

Query: 67  RDWQWNVNSKE---------------EAYLKILKMFEDRRT--APYSIHQIALTGASEGK 109
           R W+                        Y+++L  F D  +   P+S+H+ AL G   GK
Sbjct: 567 RGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFALIGKELGK 626

Query: 110 AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLV---VNQVKKLCT--TNKRASSN 164
            VGEWFGP+T A  L+ LA       +    A D ++    V Q   L T  T     S 
Sbjct: 627 EVGEWFGPSTAAGALKTLANSFPPCGLSVVSAADGSVFRSEVYQASNLPTDWTTGAKPSR 686

Query: 165 PQ------W--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
           P       W  + +++VIP RLG+  +NP+Y + IK                        
Sbjct: 687 PNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK------------------------ 722

Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                     S+G+ GG+P+ + YF+    N + +LDPH
Sbjct: 723 ----------SVGIAGGRPSSSYYFVASQANSLFYLDPH 751


>gi|385305819|gb|EIF49766.1| cysteine protease atg4 [Dekkera bruxellensis AWRI1499]
          Length = 476

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 96/330 (29%), Positives = 155/330 (46%), Gaps = 67/330 (20%)

Query: 18  DITSRLWFTYRKGFV-----PIGDS-------------------GLTTDKGWGCMLRCGQ 53
           D+ +RLWFTYR GF      P G S                   G TTD GWGCM+R  Q
Sbjct: 96  DVATRLWFTYRSGFPVIKRDPDGPSPLSLGSLFRGTLDVKNASIGFTTDSGWGCMIRTSQ 155

Query: 54  MVIAQALLFLHLGRDWQW----NVNS------KEEAYLKILKMFEDRRTAPYSIHQIALT 103
            ++A ALL LH+GR W++    N N       K E   +I+  F D   AP+SI QI   
Sbjct: 156 SLLANALLNLHVGRKWRYIPAENPNGETEYAKKYEKQWQIITWFADFPWAPFSIQQIVRY 215

Query: 104 GASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRAS 162
           G+    K  GEWFGP+  ++ +  L K    +  +     +    + + + L  +    +
Sbjct: 216 GSEHCNKKPGEWFGPSAASRSIVYLCKQSYKACKLNTYLTEGNGDIYEDELLXVSCPEGT 275

Query: 163 SNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEF 222
            N  ++P +++  +RLG+  +NPVY   +KK  ++                         
Sbjct: 276 EN-GFRPTLILSGVRLGVXXVNPVYWAFLKKLLSIH------------------------ 310

Query: 223 TFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STYHC 280
              QS+G+ GG+P+ + YF GY G+++ ++DPHT Q    + D   D++ + +  ++ H 
Sbjct: 311 ---QSVGIAGGRPSSSHYFFGYQGDNLFYMDPHTPQT-ALLADHVDDADYRXEYVASVHT 366

Query: 281 PQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
            +  +L +  MDPS+ + +   S  DYK +
Sbjct: 367 KRIRKLGLCEMDPSMLIGLLVTSLEDYKEL 396


>gi|354544955|emb|CCE41680.1| hypothetical protein CPAR2_802300 [Candida parapsilosis]
          Length = 423

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/336 (27%), Positives = 146/336 (43%), Gaps = 94/336 (27%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSG---------------------------------LT 40
           + +  I S LW +YR GF PI  S                                   T
Sbjct: 83  EAKEYIQSLLWLSYRCGFTPIPKSADGPQPVSFLPSVLFSKSTLTNMSNLRGLFDNDNFT 142

Query: 41  TDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI 100
           +D GWGCM+R  Q ++A ALL L        +    E A L ILK+F+D  T+P+S+H  
Sbjct: 143 SDAGWGCMIRTSQNLLAIALLKL--------SEEHNESAQLDILKLFQDDPTSPFSLHNF 194

Query: 101 ALTGASEGKAV--GEWFGPNTVAQVLRKLA----KYDDWSSIVF-HVALDNTLVVNQVKK 153
               +S    V  G+WFGPN  +  ++KL     K +    I + +++ +  L  ++++ 
Sbjct: 195 IRVASSSPLLVKPGQWFGPNAASLSIKKLTIEAKKLETPGEIPYVYISENADLFDDEIED 254

Query: 154 LCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
           L         N + +PL+L+ P+RLGI  +N  Y   I +  +LP               
Sbjct: 255 LF--------NEEQKPLLLLFPVRLGIDQVNKYYYKSILQLLSLPY-------------- 292

Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG-NDVIFLDPHTNQNIGCVYDKEQDSEK 272
                        S+G+ GGKP+ + YFIGY   N +++ DPH  Q +    +       
Sbjct: 293 -------------SVGIAGGKPSSSFYFIGYENENHLLYFDPHLPQVVEAPINI------ 333

Query: 273 KLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYK 307
              +TYH    ++L I  +DPS+ + V  +S  +YK
Sbjct: 334 ---TTYHTANYNKLDIEMVDPSMMIGVLLKSMDEYK 366


>gi|50543736|ref|XP_500034.1| YALI0A13277p [Yarrowia lipolytica]
 gi|62899740|sp|Q6CH28.1|ATG4_YARLI RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|49645899|emb|CAG83963.1| YALI0A13277p [Yarrowia lipolytica CLIB122]
          Length = 545

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 88/337 (26%), Positives = 139/337 (41%), Gaps = 101/337 (29%)

Query: 18  DITSRLWFTYRKGF--VPIGDS---------------------GLTTDKGWGCMLRCGQM 54
           D+ SR+W +YR GF  +P  D                      G T+D GWGCM+R  Q 
Sbjct: 68  DVQSRIWLSYRTGFPLIPKSDGSGTIHLGKLKNMIRGGGFDPRGYTSDVGWGCMIRTSQS 127

Query: 55  VIAQALLFLHLGRDWQWN------------------------VNSKEEAYLK-------- 82
           ++A ALLF HLGR W+WN                         N ++E  +         
Sbjct: 128 LLANALLFRHLGRGWRWNKGDDFVYLSEGNTESRGGESRNGGANKEQETAVSEETAVSEE 187

Query: 83  -ILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHV 140
            I+  F D   +P+SIH+    G        G+WFGP+     +  L      S +  + 
Sbjct: 188 TIISWFLDSPDSPFSIHKFVRHGEKACSTPAGDWFGPSAAGSSIYALCNEFPDSGLKVYY 247

Query: 141 ALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
             +    V + + L T            PL+++  LRLGI ++NP+Y + +++  +L   
Sbjct: 248 NGNGGGDVYEDELLETGF----------PLLVLCGLRLGIDNVNPIYWDSLRQMLSL--- 294

Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI 260
                                   PQS+G+ GG+P  + YF G+ G  + +LDPH  +  
Sbjct: 295 ------------------------PQSVGIAGGRPFTSHYFFGFQGEQLFYLDPHQPKPA 330

Query: 261 GCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
               DK+        +++H  +  +LH+  MDPS+ V
Sbjct: 331 VKTTDKDT-------TSFHSSRIWKLHLKEMDPSMLV 360


>gi|330840249|ref|XP_003292131.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
 gi|325077656|gb|EGC31355.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
          Length = 603

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 59/143 (41%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 12  LEQIRRDITSR-LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           +++   D T+R LWFTYR+GF  I ++    D GWGCMLR GQM+++  LL   LG DW+
Sbjct: 136 IKEFLEDFTTRVLWFTYRQGFPFIDNTQYDNDCGWGCMLRSGQMLLSNLLLHHALGDDWK 195

Query: 71  WNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-A 128
            + NS   + Y  I+ MF D+ +AP+SIH IAL G + GK +GEWF P+ ++Q ++ L +
Sbjct: 196 KSSNSTHPDVYNNIISMFLDKPSAPFSIHNIALEGQTLGKNIGEWFAPSIISQAIKSLVS 255

Query: 129 KYDDWSSIVFHVALDNTLVVNQV 151
           K  +  +I   ++ D +L ++Q+
Sbjct: 256 KNYEKCNISVFISEDGSLYIDQL 278



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 27/91 (29%)

Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
           PL+++IP+RLG+  +N +Y   + +                            F FPQ+L
Sbjct: 371 PLLILIPMRLGLDGLNSIYYQSLLEI---------------------------FKFPQNL 403

Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
           GV+GGKP  +LYFI    +++ +LDPHT QN
Sbjct: 404 GVVGGKPRASLYFIAVQDDNLFYLDPHTVQN 434


>gi|440790872|gb|ELR12135.1| autophagy protein 4, putative [Acanthamoeba castellanii str. Neff]
          Length = 510

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 62/156 (39%), Positives = 88/156 (56%), Gaps = 4/156 (2%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           D  SR+W TYR  F  IG++ L TD GWGCMLR GQM++AQAL+  +LGRDW+       
Sbjct: 118 DFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAEENM 177

Query: 78  EAYLKILKMFEDRRT--APYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWS 134
             Y ++L+ F D  +  +PYSIH IA  G  +  K +G+WF P T+++ LR L      +
Sbjct: 178 MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLLVTEHSPN 237

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPL 170
            +  +V  D  +   +V +LC   + A    Q  PL
Sbjct: 238 GLKMYVPKDGIIYRKEVYQLCAV-QPADGPAQHSPL 272



 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 47/145 (32%), Positives = 72/145 (49%), Gaps = 37/145 (25%)

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W P+++++P+RLGIQ +NP+YI  +K                             F+FPQ
Sbjct: 342 WHPVIILVPVRLGIQCLNPIYIPTLKAF---------------------------FSFPQ 374

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS-TYHCPQASR 285
            LGVIGGKP+ + YF+GY  N V+++DPH  Q        + D    ++S     PQA  
Sbjct: 375 CLGVIGGKPHSSFYFVGYQDNKVLYMDPHFVQP---TVKMDDDPLFPIESYRMEIPQA-- 429

Query: 286 LHILHMDPSIAV----VSQRSYSDY 306
           +    +DPS+A+     SQ  + D+
Sbjct: 430 MSFDDIDPSLALGFLCSSQAEFDDF 454


>gi|291059129|gb|ADD71908.1| autophagy protein 4 [Acanthamoeba castellanii]
          Length = 373

 Score =  120 bits (300), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 58/141 (41%), Positives = 83/141 (58%), Gaps = 3/141 (2%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           D  SR+W TYR  F  IG++ L TD GWGCMLR GQM++AQAL+  +LGRDW+       
Sbjct: 150 DFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAEENM 209

Query: 78  EAYLKILKMFEDRRT--APYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWS 134
             Y ++L+ F D  +  +PYSIH IA  G  +  K +G+WF P T+++ LR L      +
Sbjct: 210 MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLLVTEHSPN 269

Query: 135 SIVFHVALDNTLVVNQVKKLC 155
            +  +V  D  +   +V +LC
Sbjct: 270 GLKMYVPKDGIIYRKEVYQLC 290


>gi|167393590|ref|XP_001740639.1| cysteine protease atg4 [Entamoeba dispar SAW760]
 gi|165895180|gb|EDR22930.1| cysteine protease atg4, putative [Entamoeba dispar SAW760]
          Length = 332

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 81/294 (27%), Positives = 143/294 (48%), Gaps = 49/294 (16%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           +D+++  R     +W TYRK    I +   TTD GWGCM+R  QMV+AQ  L + LG +W
Sbjct: 29  KDIDEFARHT---IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNW 83

Query: 70  QWN---VNSKEEAY--LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVL 124
           ++    +N++   +    I+ +F D   + +SIH++    ++ G   G+W+GP+  + + 
Sbjct: 84  KYENNCMNTERNIFHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIA 143

Query: 125 RKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN 184
            +            +VA   ++V  ++++L      +     + P ++ +PLRLG     
Sbjct: 144 AEHINEMRVFRTRGYVAKLGSIVGPKIEEL------SKDEVGFNPCIIFVPLRLG----- 192

Query: 185 PVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGY 244
                        P SP  +   +L + +++         PQ +G+IGGKP +A YF  +
Sbjct: 193 -------------PESPENEFRPLLKTIFDI---------PQCMGMIGGKPGYAHYFHTF 230

Query: 245 VGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
            G ++ FLDPHT QN     D + D   +   +Y C     ++   +DPSI++V
Sbjct: 231 DGTNLYFLDPHTTQN---AIDMKGDWSYQ---SYFCKDNKSMNYSKIDPSISLV 278


>gi|388581514|gb|EIM21822.1| hypothetical protein WALSEDRAFT_68740 [Wallemia sebi CBS 633.66]
          Length = 603

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 80/278 (28%), Positives = 133/278 (47%), Gaps = 68/278 (24%)

Query: 18  DITSRLWFTYRKGFVPIG-------DSGLT--------------------TDKGWGCMLR 50
           D  SR+W TYR  F  I        D GL                     TD+GWGCMLR
Sbjct: 68  DFQSRIWCTYRSNFCQISLNDPMMDDLGLAKMQTLSSKPSHWLLRERTFNTDQGWGCMLR 127

Query: 51  CGQMVIAQALLFLHLGRDWQWNV------NSKEEAY---LKILKMFEDRRT--APYSIHQ 99
             Q ++A  L  + LGR W+ N        +K + Y   +K+L +F D  +  +P+S+H+
Sbjct: 128 TSQSLLANTLQIMLLGRQWRRNPFVDLTDYAKRKEYVNLIKLLNLFMDNPSTLSPFSVHR 187

Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNK 159
           +A+ G S GK VGEWFGP+T A  ++ L       ++   VA D+ +  + V +  +   
Sbjct: 188 MAVVGKSLGKEVGEWFGPSTAALAIKHLVNNQTDINLSVSVASDSVIYKSDVYQ-ASGGT 246

Query: 160 RASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQT 217
             +++ +W  +P+++++ +RLG+  I+P Y               Y+ +K       MQ+
Sbjct: 247 STTADSEWGNKPVLILVGVRLGLDGIHPRY---------------YETLKAF---LRMQS 288

Query: 218 PRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                     +G+ GG+P+ + YF GY  + + ++DPH
Sbjct: 289 ---------CVGIAGGRPSSSYYFFGYQSDSLFYVDPH 317


>gi|366995231|ref|XP_003677379.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
 gi|342303248|emb|CCC71026.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
          Length = 495

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/274 (29%), Positives = 133/274 (48%), Gaps = 67/274 (24%)

Query: 17  RDITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGC 47
           +D+ +RL FTYR  F PI     G S L                         TD GWGC
Sbjct: 77  KDVVTRLHFTYRTRFKPIMKSPEGPSPLNFSLVIRENPIDVIENAITNPDCFNTDIGWGC 136

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS- 106
           M+R GQ ++   L  + LGRD++++  +K+ +  +I++ F D    P+S+HQ    G   
Sbjct: 137 MIRTGQSLLGNTLQIVRLGRDFRYDPENKDISENRIIEWFIDAPEKPFSLHQFITEGMEL 196

Query: 107 EGKAVGEWFGPNTVAQVLRKLA-KYDDWSSIVFHVALDN-TLVVNQVKKLCTTNKRASSN 164
            GK  GEWFGP   A+ ++ L  K+ D       V++ +  +  ++VK++   NK+    
Sbjct: 197 SGKNPGEWFGPAATARSIQSLIRKFPDCGIAECLVSVSSGDIYSDEVKQVFADNKKN--- 253

Query: 165 PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
                L++++ ++LG+  +N  Y + I+               ILSS Y           
Sbjct: 254 -----LLILLGVKLGLNAVNECYWDSIR--------------HILSSKY----------- 283

Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
             S+G+ GG+P+ +LYF GY G+++++ DPH+ Q
Sbjct: 284 --SVGISGGRPSSSLYFFGYEGDELLYFDPHSPQ 315


>gi|183230788|ref|XP_001913481.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|169802747|gb|EDS89733.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449704540|gb|EMD44766.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 330

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 78/281 (27%), Positives = 135/281 (48%), Gaps = 46/281 (16%)

Query: 23  LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN---VNSKEEA 79
           +W TYRK    I +   TTD GWGCM+R  QM +AQ  L + LG +W++    +N++   
Sbjct: 39  IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCINTERNI 96

Query: 80  Y--LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
           +    I+ +F D   + +SIH++    ++ G   G+W+GP+  + +  +           
Sbjct: 97  FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEMRVFRTR 156

Query: 138 FHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYAL 197
            +VA   +++ +++++L            + P ++ +PLRLG                  
Sbjct: 157 GYVAKLGSIIGSKIEELI------KDGGGFNPCIIFVPLRLG------------------ 192

Query: 198 PISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN 257
           P SP  +   +L + +++         PQ +G+IGGKP +A YF  + G ++ FLDPHT 
Sbjct: 193 PESPENEFKPLLKTIFDI---------PQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTT 243

Query: 258 QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
           QN     D + D   +   +Y C     +    MDPSI++V
Sbjct: 244 QN---AIDMKGDWSYQ---SYFCKDNKSMLYSKMDPSISLV 278


>gi|145553267|ref|XP_001462308.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124430147|emb|CAK94935.1| unnamed protein product [Paramecium tetraurelia]
          Length = 389

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 85/313 (27%), Positives = 141/313 (45%), Gaps = 59/313 (18%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH----- 64
           Q +E+++      +WF+YR   + +  S LT+D GWGCMLR GQM + Q + + +     
Sbjct: 52  QKMEKLKSLFEGTIWFSYRSKILQLQYSTLTSDTGWGCMLRVGQMAMCQQIKYFYNLSSS 111

Query: 65  --LGRDWQWNVNSKEEAYLKILKMFEDRRT----APYSIHQIALTGASE-GKAVGEWFGP 117
             L    Q   ++ EE   K +   +  +T    +P+SI +I +    E  K+ GEW+ P
Sbjct: 112 QELTELIQQFADNDEEELSKFMDRNDGDQTIQYKSPFSIQKIVVQTKLELQKSPGEWYKP 171

Query: 118 NTVAQVLRKLAKYDDWS-SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQP------- 169
           N +  VL+ L +Y  +  ++  H+  +N  +++ V  L     +   + +W         
Sbjct: 172 NDILFVLKYLFRYSKYQKNLRMHINHENAFILSDVISLMFN--KNGGDEEWLKEQIEKGQ 229

Query: 170 -----LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
                + + I  R+G+   N  Y+                  K+L+            T+
Sbjct: 230 NDEFGVSIFILTRIGLDTCNQEYL------------------KVLNDI---------MTY 262

Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS 284
           PQ  G++GG PN ALY +G VGN  I+LDPH  QN     + E D      S+Y C    
Sbjct: 263 PQFQGILGGFPNKALYILGRVGNYYIYLDPHYVQNAQNYQEMENDR-----SSYTCQSIQ 317

Query: 285 RLHILHMDPSIAV 297
            +    +DPS+A+
Sbjct: 318 LIDSNQLDPSMAI 330


>gi|444726263|gb|ELW66801.1| Cysteine protease ATG4C [Tupaia chinensis]
          Length = 378

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 76/238 (31%), Positives = 109/238 (45%), Gaps = 57/238 (23%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ RRD  SR+W TYR+ F PI  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 47  NVEEFRRDFISRIWLTYREEFPPIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 106

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 107 WPDALNIENSDSESWTSHTVKKFTASVEASLSGERELKTPTISLKETIEKYSDDHEIRNE 166

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ D  
Sbjct: 167 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 226

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
            I  +VA D T+  + V     T   A  N   + +++++P+RLG +  N  Y+  +K
Sbjct: 227 GITIYVAQDCTVYSSDVIDKQRTAMTA-DNADDKAVIILVPVRLGGERTNTDYLEFVK 283


>gi|302657364|ref|XP_003020406.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
           verrucosum HKI 0517]
 gi|291184236|gb|EFE39788.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
           verrucosum HKI 0517]
          Length = 398

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 68/214 (31%), Positives = 104/214 (48%), Gaps = 32/214 (14%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDS--------------------------GLTTDKGWGC 47
           Q   D  S+LW TYR  F PI  +                          G T+D GWGC
Sbjct: 185 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSSISLGVRLRSQLIDTQGFTSDTGWGC 244

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE 107
           M+R GQ ++A  LLFL LGRDW+     +EE+  +++ +F D   AP+SIH+    GA+ 
Sbjct: 245 MIRSGQALLANTLLFLRLGRDWRRGSKVQEES--ELVSLFADHPRAPFSIHRFVHHGATA 302

Query: 108 -GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ 166
            GK  GEWFGP+  +Q ++ L K +    +   +  D + +  +  K    ++       
Sbjct: 303 CGKCPGEWFGPSAASQCIQALVKSNPQVGLRVCITSDGSDIYEKQFKEVACDESGGG--- 359

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
            QP ++++ +RLGI  + PVY + +K     P S
Sbjct: 360 IQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQS 393


>gi|407043540|gb|EKE42005.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 330

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 78/281 (27%), Positives = 135/281 (48%), Gaps = 46/281 (16%)

Query: 23  LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN---VNSKEEA 79
           +W TYRK    I +   TTD GWGCM+R  QM +AQ  L + LG +W++    +N++   
Sbjct: 39  IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCINTERNI 96

Query: 80  Y--LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
           +    I+ +F D   + +SIH++    ++ G   G+W+GP+  + +  +           
Sbjct: 97  FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEMRVFRTR 156

Query: 138 FHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYAL 197
            +VA   +++ +++++L            + P ++ +PLRLG                  
Sbjct: 157 GYVAKLGSIIGSKIEELI------KDGGGFNPCIIFVPLRLG------------------ 192

Query: 198 PISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN 257
           P SP  +   +L + +++         PQ +G+IGGKP +A YF  + G ++ FLDPHT 
Sbjct: 193 PESPENEFRPLLKTIFDI---------PQCMGMIGGKPGYAHYFHTFDGINLYFLDPHTT 243

Query: 258 QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
           QN     D + D   +   +Y C     +    MDPSI++V
Sbjct: 244 QN---AIDMKGDWSYQ---SYFCKDNKSMLYSKMDPSISLV 278


>gi|79597805|ref|NP_850722.3| cysteine protease ATG4b [Arabidopsis thaliana]
 gi|332646467|gb|AEE79988.1| cysteine protease ATG4b [Arabidopsis thaliana]
          Length = 360

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 56/119 (47%), Positives = 74/119 (62%), Gaps = 1/119 (0%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW-Q 70
           L   R+D +S +  TYR+GF PIGD+  T+D  WGCMLR GQM+ AQALLF  LGR W +
Sbjct: 138 LAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRK 197

Query: 71  WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            +    +E YL+IL++F D   + +SIH + L G S G A G W GP  V +    LA+
Sbjct: 198 KDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLAR 256


>gi|365758760|gb|EHN00587.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 485

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 88/312 (28%), Positives = 138/312 (44%), Gaps = 75/312 (24%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  FVPI     G S L+                        TD GWGCM
Sbjct: 80  DVRSRVNFTYRTRFVPIARAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIGWGCM 139

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ + +       +I+  F D   AP+S+H    TG    
Sbjct: 140 IRTGQSLLGNALQILHLGRDFRVDEDDDFRRESRIVNWFNDTPEAPFSLHNFVSTGTELS 199

Query: 108 GKAVGEWFGPNTVAQVLRKLA-KYDDWSSIVFHVALDN-TLVVNQVKKLCTTNKRASSNP 165
            K  GEWFGP   A+ ++ L   + +       V++ +  +  N+V+++   N  +S   
Sbjct: 200 DKRPGEWFGPAATARSIQYLIYGFPECGINACIVSVSSGDIYENEVEEVFVDNPNSS--- 256

Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
               ++ ++ ++LGI  +N  Y   I                IL+S +            
Sbjct: 257 ----ILFLLGVKLGINAVNESYRESI--------------CGILNSAW------------ 286

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
            S+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E    ++ H  +  R
Sbjct: 287 -SVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVNSCHTSKFGR 336

Query: 286 LHILHMDPSIAV 297
           L +  MDPS+ +
Sbjct: 337 LQLSEMDPSMLI 348


>gi|390344344|ref|XP_786847.3| PREDICTED: uncharacterized protein LOC581768 [Strongylocentrotus
           purpuratus]
          Length = 1018

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 69/203 (33%), Positives = 111/203 (54%), Gaps = 24/203 (11%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ- 70
           +E  ++D +SRLW TYR+ F  +  S  T+D GWGCMLR GQM++A +L+   LGR+W  
Sbjct: 376 IEMFKQDFSSRLWMTYRREFPTLAGSNFTSDCGWGCMLRSGQMMLAHSLILHFLGREWNI 435

Query: 71  WNVNSKE--EAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
           +   ++E  + + +I++ F D+    +P+S+H++   G + GK VG+W+GP++VA +L++
Sbjct: 436 YKPQTQEMLQFHRQIVRWFGDQPLDMSPFSVHRLVGIGQNNGKKVGDWYGPSSVAHILKE 495

Query: 127 LAKYDD-----WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQ 181
                         +  +VA D T+    V  LC    R+ S  + QP+   IP     +
Sbjct: 496 AMDSAHELNPLLGEVCIYVAQDCTVYKQDVIDLC----RSKSKKRLQPVYRDIP---SSE 548

Query: 182 DINPVYINGIKKCYALPI-SPVY 203
           D +PV      KC   PI  P Y
Sbjct: 549 DNSPV------KCTTNPIKGPAY 565



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 62/131 (47%), Gaps = 32/131 (24%)

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W  +V++IP+RLG  ++NPVYI  I+                             FT   
Sbjct: 836 WCAVVIMIPVRLGGDEVNPVYIRPIQSL---------------------------FTLES 868

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
            LG+IGGKP H+L+F+G+    +I LDPH  Q +  V  K +D       ++HC    ++
Sbjct: 869 CLGIIGGKPKHSLFFVGFQEEKLIHLDPHYCQQV--VDMKTRDFPLW---SFHCMSPRKM 923

Query: 287 HILHMDPSIAV 297
            I  MDPS  +
Sbjct: 924 SISKMDPSCTI 934


>gi|401624007|gb|EJS42084.1| atg4p [Saccharomyces arboricola H-6]
          Length = 494

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 89/312 (28%), Positives = 133/312 (42%), Gaps = 75/312 (24%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SR+ FTYR  F+PI     G S L+                        TD GWGCM
Sbjct: 89  DVRSRVNFTYRTRFIPIPRAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIGWGCM 148

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL  LHLGRD++ +     +   KI+  F D   AP+SIH    TG    
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVDNEKSLKRESKIVTWFNDTPEAPFSIHNFVSTGTELS 208

Query: 108 GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHV--ALDNTLVVNQVKKLCTTNKRASSNP 165
            K  GEWFGP   A+ ++ L        I   V       +  N+V+K+   N  +    
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGITDCVVSVSSGDIYQNEVEKIYVENPDSI--- 265

Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
               ++ ++ ++LGI  +N  Y   I                IL+S              
Sbjct: 266 ----ILFLLGVKLGINAVNESYRESI--------------CGILNSA------------- 294

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
           +S+G+ GG+P+ +LYF GY GN  ++ DPH  Q            E+    + H  +  +
Sbjct: 295 RSVGIAGGRPSSSLYFFGYQGNQFLYFDPHIPQPA---------VEESFVESCHTSKFGK 345

Query: 286 LHILHMDPSIAV 297
           L +  MDPS+ +
Sbjct: 346 LQLSEMDPSMLI 357


>gi|186511209|ref|NP_001118859.1| cysteine protease ATG4b [Arabidopsis thaliana]
 gi|62318602|dbj|BAD95023.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646469|gb|AEE79990.1| cysteine protease ATG4b [Arabidopsis thaliana]
          Length = 267

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 56/119 (47%), Positives = 74/119 (62%), Gaps = 1/119 (0%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW-Q 70
           L   R+D +S +  TYR+GF PIGD+  T+D  WGCMLR GQM+ AQALLF  LGR W +
Sbjct: 52  LAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRK 111

Query: 71  WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            +    +E YL+IL++F D   + +SIH + L G S G A G W GP  V +    LA+
Sbjct: 112 KDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLAR 170


>gi|390594065|gb|EIN03481.1| hypothetical protein PUNSTDRAFT_56214 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 1093

 Score =  116 bits (290), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 88/287 (30%), Positives = 134/287 (46%), Gaps = 76/287 (26%)

Query: 18  DITSRLWFTYRKGFVPI---------------------------------------GDSG 38
           D TSR+W TYR  F PI                                       G+ G
Sbjct: 372 DFTSRVWVTYRSHFQPIRDTTLSALESDFGEQAQSANTSGNSVVSGSPSSGRRWWGGEKG 431

Query: 39  LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA----YLKILKMFEDRRT-- 92
            T+D GWGCMLR GQ ++A ALL LHLGRDW+     +  A    Y+++L  F D  +  
Sbjct: 432 WTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPSYPQPTAAYASYVQLLTWFFDSPSPL 491

Query: 93  APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
            P+S+H++AL G   GK VG+WFGP+T A  ++ L        +   VA+D  +    V 
Sbjct: 492 CPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVHAFPGGGLGVAVAVDGVVYETDVF 551

Query: 153 KLCTT--NKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKI 208
               +  ++R      W  + ++++I +RLG+  +NP+Y + IK+ Y             
Sbjct: 552 SASHSPDSRRHHRTSTWGDRGVLILIGIRLGLDGVNPIYYDTIKELY------------- 598

Query: 209 LSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
                         T+PQS+G+ GG+P+ + YF+G   + + +LDPH
Sbjct: 599 --------------TWPQSVGIAGGRPSSSYYFVGSQADSLFYLDPH 631


>gi|66810578|ref|XP_638996.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
 gi|60467622|gb|EAL65643.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
          Length = 551

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 56/142 (39%), Positives = 88/142 (61%), Gaps = 5/142 (3%)

Query: 12  LEQIRRDITSR-LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           +++   D T+R LWFTYR+GF  I D+    D GWGCMLR GQM+++  LL   LG +W+
Sbjct: 140 IKEFLNDFTTRVLWFTYRQGFPCIDDTMYDNDCGWGCMLRSGQMLLSNVLLHNILGDEWK 199

Query: 71  WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
               S    +  I+ MF D+ +AP+SIH IA+ G + GK +GEWF P+ ++Q ++ L   
Sbjct: 200 ---RSSSATHPDIISMFLDKPSAPFSIHNIAMEGQNLGKNIGEWFAPSIISQTIKILVSR 256

Query: 131 D-DWSSIVFHVALDNTLVVNQV 151
           + D  +I   ++ D +L ++Q+
Sbjct: 257 NYDQCNISVFISEDGSLYIDQL 278



 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 76/152 (50%), Gaps = 31/152 (20%)

Query: 147 VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMV 206
           + ++ K   + N    ++  W+PL+++IP+RLG+  +N +Y + + +             
Sbjct: 363 IDDESKDEISENNNKDNDETWEPLLILIPMRLGLDGLNSIYHSSLLEI------------ 410

Query: 207 KILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK 266
                          F FPQ+LGV+GGKP  +LYFI    +++ +LDPHT QN    + +
Sbjct: 411 ---------------FKFPQNLGVVGGKPRASLYFIAAQDDNLFYLDPHTVQN----HIE 451

Query: 267 EQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
            ++  K   +T+ C    R H+  +DPS+ V 
Sbjct: 452 VENGSKFPLNTFFCSTTKRTHVSEVDPSLVVA 483


>gi|68485712|ref|XP_713234.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
 gi|71152285|sp|Q59UG3.1|ATG4_CANAL RecName: Full=Cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|46434715|gb|EAK94117.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
          Length = 446

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 92/336 (27%), Positives = 144/336 (42%), Gaps = 102/336 (30%)

Query: 19  ITSRLWFTYRKGFVPIGDS----------------------------------GLTTDKG 44
           I S+LW +YR GF PI  S                                    T+D G
Sbjct: 83  IESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTSDAG 142

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
           WGCM+R  Q ++A  LL L+           K E   +I+K+F+D  ++P+SIH      
Sbjct: 143 WGCMIRTSQNLLANTLLKLY----------PKNEP--EIVKLFQDDTSSPFSIHNFIRVA 190

Query: 105 ASEGKAV--GEWFGPNTVAQVLRKLA-------KYDDWSSIVFHVALDNTLVVNQVKKLC 155
           +     V  GEWFGPN  +  +++LA       + D        ++ ++ L  ++++ + 
Sbjct: 191 SLSPLHVKPGEWFGPNAASLSIKRLASELLQDQEIDGIKIPRVFISENSDLFDDEIRDVF 250

Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
              K AS       ++++ P+RLGI  +N  Y N I                +L+S Y  
Sbjct: 251 AKEKNAS-------VLILFPIRLGIDKVNSYYYNSI--------------FHLLASKY-- 287

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
                      S G+ GGKP+ + YF+GY   D+I+ DPH  Q +        ++   +D
Sbjct: 288 -----------SCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV--------ETPINMD 328

Query: 276 STYHCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
           S YH    +RL+I  +DPS    I V +   Y D+K
Sbjct: 329 S-YHTTNYNRLNISLLDPSMMIGILVTNIDEYIDFK 363


>gi|441628985|ref|XP_004093160.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Nomascus
           leucogenys]
          Length = 441

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 78/290 (26%), Positives = 132/290 (45%), Gaps = 45/290 (15%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGW--GCMLRCGQMVIAQALLFLHLGRDW---QWN 72
           D  SRLW TYR     +    +  D  W  G  L   ++  + +    H    W   +W 
Sbjct: 108 DFVSRLWLTYRXXXHCLTMCSIPPDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRWA 167

Query: 73  VNS----KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-L 127
             +    +E  + +I+  F D   AP+ +H++   G S GK  G+W+GP+ VA +LRK +
Sbjct: 168 QGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGPSLVAHILRKAV 227

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
               + + +V +V+   ++    V +L     R     +W+ +V+++P+RLG + +NPVY
Sbjct: 228 ESCSEVTRLVVYVSQTCSMYKADVARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVY 284

Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
           +  +K+     +                            LG++GGKP H+LYFIGY  +
Sbjct: 285 VPCVKELLRCQL---------------------------CLGIMGGKPRHSLYFIGYQDD 317

Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            +++LDPH  Q    V   +   E     ++HC    ++    MDPS  V
Sbjct: 318 FLLYLDPHYCQPTVDVSQADFPLE-----SFHCTSPRKMAFAKMDPSCTV 362


>gi|365988214|ref|XP_003670938.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
 gi|343769709|emb|CCD25695.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
          Length = 427

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 89/321 (27%), Positives = 135/321 (42%), Gaps = 83/321 (25%)

Query: 18  DITSRLWFTYRKGFVPI-----------------------------GDSGLTTDKGWGCM 48
           DI SRL FTYR  F PI                                   TD GWGCM
Sbjct: 3   DIKSRLNFTYRTRFKPIQRMSDGPSPFHFSFILRENPINTLENVISNPDCFFTDIGWGCM 62

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSK---EEAYLKILKMFEDRRTAPYSIHQIALTGA 105
           +R GQ ++  AL   +LGRDW+++ N+     E   +I   F D    P+S+H+    G 
Sbjct: 63  IRTGQSLLGNALQLRNLGRDWRFDDNTDLKMTEKSNEIASWFMDTPEKPFSLHRFISKGM 122

Query: 106 S-EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN------ 158
              GK  GEWFGP   A+ ++ L              +D  L+      +  T       
Sbjct: 123 QLSGKKPGEWFGPAATARSIQSLVHE------FPECGIDKCLISVSSGDIYKTEVEDVFN 176

Query: 159 ----KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
                 A +  + + +++++ ++LGI+ IN  Y + I+              +ILSS Y 
Sbjct: 177 EGHTGEARNGQKDKTILILLGVKLGIETINRCYWDSIR--------------RILSSEY- 221

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
                       S+G+ GG+P+ +LYF GY G+++++ DPH+ Q     YDK        
Sbjct: 222 ------------SIGIAGGRPSSSLYFFGYQGDELLYFDPHSPQP---SYDKND----LF 262

Query: 275 DSTYHCPQASRLHILHMDPSI 295
             T H     +L +  MDPS+
Sbjct: 263 YETCHTTNFGKLSLADMDPSM 283


>gi|254584596|ref|XP_002497866.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
 gi|238940759|emb|CAR28933.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
          Length = 489

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 89/312 (28%), Positives = 140/312 (44%), Gaps = 71/312 (22%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SRL FTYR  F+PI     G S L+                        TD GWGCM
Sbjct: 81  DVRSRLHFTYRTRFMPIPAVPGGPSPLSFHFLIRENPINAIENAINNPACFNTDVGWGCM 140

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL    LGR ++     K E  + I+  F D   AP+SIH     G    
Sbjct: 141 IRTGQSLLGNALQIARLGRGYRIGSELKPEE-ISIIDWFVDIPDAPFSIHNFVSKGMELS 199

Query: 108 GKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQ-VKKLCTTNKRASSNP 165
            K  GEWFGP   ++ ++ L + +         +++ +  V  + V K+   +K +    
Sbjct: 200 SKRPGEWFGPAATSRSIQSLIRGFKQCGIDDCQISVSSGDVYEEDVMKVFNESKDSR--- 256

Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
               ++L++ ++LGI  +N  Y N IK+              +L S +            
Sbjct: 257 ----ILLLLGVKLGINAVNEFYWNDIKR--------------LLGSKF------------ 286

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
            S+G+ GG+P+ +LYFIGY GN++++LDPHT Q     +      E+    + H     +
Sbjct: 287 -SVGIAGGRPSSSLYFIGYQGNELLYLDPHTAQ----PFLSPSHQERSFYDSCHSSNYGK 341

Query: 286 LHILHMDPSIAV 297
           L I  +DPS+ +
Sbjct: 342 LAIQDLDPSMLI 353


>gi|443917360|gb|ELU38094.1| peptidase family c54 domain-containing protein [Rhizoctonia solani
           AG-1 IA]
          Length = 808

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 83/293 (28%), Positives = 132/293 (45%), Gaps = 94/293 (32%)

Query: 18  DITSRLWFTYRKGFVPIGDSGL------------------------------------TT 41
           D TS +W TYR  + PI D+ L                                    T+
Sbjct: 145 DFTSLIWLTYRSHYTPIRDTSLESLAPLGPCDMEMAPAHLVPASPRRWNWPGSADKSWTS 204

Query: 42  DKGWGCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APY 95
           D GWGCMLR GQ ++A AL+ LHLGR+W+   + + ++E A Y+KIL  F D  +  AP+
Sbjct: 205 DAGWGCMLRTGQSLLANALIHLHLGRNWRRPHYPMFAEEHAVYVKILTWFFDTPSPLAPF 264

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV---- 151
            +H++AL G + GK VG WFGP+T A  ++ LA       +   +A+D T+  + V    
Sbjct: 265 GVHRMALAGKALGKDVGTWFGPSTAAGSIKTLAHAFPECQLSVSLAVDGTVFASDVYAAS 324

Query: 152 -KKLCTT----NKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYD 204
              + TT         S  +W  + +++++ +RLG+ ++NP+Y               YD
Sbjct: 325 HMGMVTTSGRSISSRRSASKWGGRAVLILVNIRLGLDNVNPIY---------------YD 369

Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH--ALYFIGYVGNDVIFLDPH 255
            +K+                        G+P    + YF+G   + + +LDPH
Sbjct: 370 ALKV------------------------GRPRQGSSYYFVGSQADSLFYLDPH 398


>gi|241958330|ref|XP_002421884.1| cysteine protease, putative [Candida dubliniensis CD36]
 gi|223645229|emb|CAX39828.1| cysteine protease, putative [Candida dubliniensis CD36]
          Length = 443

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 89/336 (26%), Positives = 140/336 (41%), Gaps = 103/336 (30%)

Query: 19  ITSRLWFTYRKGFVPIGDS----------------------------------GLTTDKG 44
           I S+LW +YR GF PI  S                                    T+D G
Sbjct: 83  IESKLWLSYRCGFEPIPKSIDGPQPIHFFPSIIFNRTTIYSNFANLKSLFDKENFTSDAG 142

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
           WGCM+R  Q ++A  LL L+   + +            I+K+F+D   +P+SIH      
Sbjct: 143 WGCMIRTSQNLLANTLLKLYPKNEQE------------IVKLFQDDTKSPFSIHNFIRVA 190

Query: 105 ASEGKAV--GEWFGPNTVAQVLRKLAKYDDWSSI-------VFHVALDNTLVVNQVKKLC 155
           +S    V  GEWFGPN  +  +++L        I       VF ++ ++ L  ++++ + 
Sbjct: 191 SSSPLHVKPGEWFGPNAASLSIKRLTNELQDQEINGINPPRVF-ISENSDLFDDEIRDVF 249

Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
              K  S       ++++ P+RLGI  +N  Y N I                +LSS Y  
Sbjct: 250 AKEKSNS-------VIILFPIRLGIDKVNSYYYNSI--------------FHLLSSKY-- 286

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
                      S G+ GGKP+ + YF+GY   D+I+ DPH  Q +   ++ +        
Sbjct: 287 -----------SCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQIVETPFNMD-------- 327

Query: 276 STYHCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
            +YH    + L+I  +DPS    I V +   Y D+K
Sbjct: 328 -SYHSTNYNTLNISLLDPSMMIGILVTNIDEYIDFK 362


>gi|156839152|ref|XP_001643270.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
           70294]
 gi|166990653|sp|A7TQN1.1|ATG4_VANPO RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|156113873|gb|EDO15412.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 411

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 84/311 (27%), Positives = 134/311 (43%), Gaps = 73/311 (23%)

Query: 18  DITSRLWFTYRKGFVPIG--DSG---------------------------LTTDKGWGCM 48
           D+ SR+ FTYR  F+PI   D G                             TD GWGCM
Sbjct: 77  DVISRIHFTYRTKFIPIARSDDGPSPLRINFLIGDNPFNAIENAIYNPNCFNTDIGWGCM 136

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++A A+    LGR+++ N     E   KI+  F D    P+S+H     G    
Sbjct: 137 IRTGQSLLANAIQIAILGREFRVNDGDVNEQERKIISWFMDTPDEPFSLHNFVKKGCELS 196

Query: 108 GKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ 166
            K  GEWFGP   ++ ++ L + + D       V++ +  +          NKR S+   
Sbjct: 197 SKKPGEWFGPAATSRSIQSLVEQFPDCGIDRCIVSVSSADIFKDEINDIFKNKRYSN--- 253

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
              ++L++ ++LG+  +N  Y+  I+              KIL S Y             
Sbjct: 254 ---ILLLMGVKLGVDKVNEYYLKDIR--------------KILESRY------------- 283

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
           S+G+ GG+P+ +LYF GY  + +++ DPH  Q           + + L  T H     ++
Sbjct: 284 SVGISGGRPSSSLYFFGYQDDTLLYFDPHKPQ---------PSTIESLLETCHTDNFDKI 334

Query: 287 HILHMDPSIAV 297
           +I  MDPS+ +
Sbjct: 335 NISDMDPSMLI 345


>gi|238879782|gb|EEQ43420.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 446

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/336 (27%), Positives = 143/336 (42%), Gaps = 102/336 (30%)

Query: 19  ITSRLWFTYRKGFVPIGDS----------------------------------GLTTDKG 44
           I S+LW +YR GF PI  S                                    T+D G
Sbjct: 83  IESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTSDAG 142

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
           WGCM+R  Q ++A  LL L+           K E   +I+K+F+D  ++P+SIH      
Sbjct: 143 WGCMIRTSQNLLANTLLKLY----------PKNEP--EIVKLFQDGTSSPFSIHNFIRVA 190

Query: 105 ASEGKAV--GEWFGPNTVAQVLRKLA-------KYDDWSSIVFHVALDNTLVVNQVKKLC 155
           +     V  GEWFGPN  +  +++L        + D        ++ ++ L  ++++ + 
Sbjct: 191 SLSPLHVKSGEWFGPNAASLSIKRLTSELLQDQEIDGIKIPRVFISENSDLFDDEIRDVF 250

Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
              K AS       ++++ P+RLGI  +N  Y N I                +L+S Y  
Sbjct: 251 AKEKNAS-------VLILFPIRLGIDKVNSYYYNSI--------------FHLLASKY-- 287

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
                      S G+ GGKP+ + YF+GY   D+I+ DPH  Q +        ++   +D
Sbjct: 288 -----------SCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV--------ETPINMD 328

Query: 276 STYHCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
           S YH    +RL+I  +DPS    I V +   Y D+K
Sbjct: 329 S-YHTTNYNRLNISLLDPSMMIGILVTNIDEYIDFK 363


>gi|68485607|ref|XP_713286.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
 gi|46434768|gb|EAK94169.1| potential autophagy related protease and anchor protein Atg4
           [Candida albicans SC5314]
          Length = 446

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/336 (27%), Positives = 143/336 (42%), Gaps = 102/336 (30%)

Query: 19  ITSRLWFTYRKGFVPIGDS----------------------------------GLTTDKG 44
           I S+LW +YR GF PI  S                                    T+D G
Sbjct: 83  IESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTSDAG 142

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
           WGCM+R  Q ++A  LL L+           K E   +I+K+F+D  ++P+SIH      
Sbjct: 143 WGCMIRTSQNLLANTLLKLY----------PKNEP--EIVKLFQDGTSSPFSIHNFIRVA 190

Query: 105 ASEGKAV--GEWFGPNTVAQVLRKLA-------KYDDWSSIVFHVALDNTLVVNQVKKLC 155
           +     V  GEWFGPN  +  +++L        + D        ++ ++ L  ++++ + 
Sbjct: 191 SLSPLHVKPGEWFGPNAASLSIKRLTNELLQDQELDGIRIPRVFISENSDLFDDEIRDVF 250

Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
              K AS       ++++ P+RLGI  +N  Y N I                +L+S Y  
Sbjct: 251 AKEKSAS-------VLILFPIRLGIDKVNSYYYNSI--------------FHLLASKY-- 287

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
                      S G+ GGKP+ + YF+GY   D+I+ DPH  Q +        ++   +D
Sbjct: 288 -----------SCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV--------ETPINMD 328

Query: 276 STYHCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
           S YH    +RL+I  +DPS    I V +   Y D+K
Sbjct: 329 S-YHTTNYNRLNISLLDPSMMIGILVTNIDEYIDFK 363


>gi|50307871|ref|XP_453929.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|62899744|sp|Q6CQ60.1|ATG4_KLULA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|49643063|emb|CAH01025.1| KLLA0D19536p [Kluyveromyces lactis]
          Length = 450

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 82/310 (26%), Positives = 142/310 (45%), Gaps = 71/310 (22%)

Query: 18  DITSRLWFTYRKGFVPIG-----------------------DSGLT------TDKGWGCM 48
           D+ SR++FTYR  F PI                        ++ LT      +D GWGCM
Sbjct: 64  DVHSRVFFTYRTQFTPIRRNENGPSPINFTLFFRDNPINTLENALTDPDSFYSDIGWGCM 123

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ-IALTGASE 107
           +R GQ ++A A+  + L R+++ N +  ++  L +++ F+D    P S+H  +       
Sbjct: 124 IRTGQALLANAIQRVKLAREFRINASRIDDNELNLIRWFQDDVKYPLSLHNFVKAEEKIS 183

Query: 108 GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV--NQVKKLCTTNKRASSNP 165
           G   G+WFGP+  A+ ++ L +      I   +    +  +  ++V ++   ++ A+   
Sbjct: 184 GMKPGQWFGPSATARSIKTLIEGFPLCGIKNCIISTQSADIYEDEVTRIFHKDRDAN--- 240

Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
               L+L+  +RLG+  IN +Y                D+ KILSS             P
Sbjct: 241 ----LLLLFAVRLGVDKINSLYWK--------------DIFKILSS-------------P 269

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
            S+G+ GGKP+ +LYF GY   ++ +LDPH  Q    + D     + +   + H  + ++
Sbjct: 270 YSVGIAGGKPSSSLYFFGYQNENLFYLDPHNTQQSSLMMD-----DLEFYRSCHGHKFNK 324

Query: 286 LHILHMDPSI 295
           LHI   DPS+
Sbjct: 325 LHISETDPSM 334


>gi|119623097|gb|EAX02692.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_d
           [Homo sapiens]
          Length = 172

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 48/84 (57%), Positives = 64/84 (76%), Gaps = 1/84 (1%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W    ++
Sbjct: 52  DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQ 111

Query: 78  -EAYLKILKMFEDRRTAPYSIHQI 100
            + Y +IL+ F DR+   YSIHQ+
Sbjct: 112 PKEYQRILQCFLDRKDCCYSIHQM 135


>gi|326430141|gb|EGD75711.1| pyruvate water dikinase [Salpingoeca sp. ATCC 50818]
          Length = 1055

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 80/266 (30%), Positives = 121/266 (45%), Gaps = 59/266 (22%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPI-GDSGLTTDKGWGCMLRCGQMVIAQALLF------ 62
           Q+ + ++  I   +W TYRKG+ PI GD+ LT+D GWGC  R GQM++AQAL+       
Sbjct: 590 QESDDLKAHIRRLVWLTYRKGYDPIHGDAQLTSDTGWGCTYRSGQMLLAQALMSNAEPSA 649

Query: 63  ----LHLGRDWQWNVNSKEEAYLKILKMFEDRR--TAPYSIHQIALTGASEGKAVGEWFG 116
               L   R   W     EE    +L MF+D     A +SI  +A T     K  G+W  
Sbjct: 650 RMQRLEGVRPSTWQ---HEETKRAVLSMFQDSHDPAAFFSIQHMAETSFVVRKKPGQWLS 706

Query: 117 PNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
           P+ VA ++R+L               +  + V  V     + +R  +   W P +L+IPL
Sbjct: 707 PSEVALIIRRLNPP------------ETGMRVRIVNDTLLSTRRILAGEPWMPTLLMIPL 754

Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
           R G+  + P  +            P +                  F +P  +G IGGKP 
Sbjct: 755 RAGLDTLQPESV------------PAFVAF---------------FDWPWCVGAIGGKPG 787

Query: 237 HALYFIGYVGND---VIFLDPHTNQN 259
            A Y++G + +D   V++LDPHT ++
Sbjct: 788 SAYYYVG-IDHDRRRVLYLDPHTTRS 812


>gi|148707987|gb|EDL39934.1| autophagy-related 4B (yeast), isoform CRA_c [Mus musculus]
          Length = 128

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 49/90 (54%), Positives = 65/90 (72%), Gaps = 1/90 (1%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           ++I  D+ SRLWFTYR+ F  IG +G T+D GWGCMLRCGQM+ AQAL+  HLGRDW+W 
Sbjct: 37  DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 96

Query: 73  VNSKE-EAYLKILKMFEDRRTAPYSIHQIA 101
              ++ ++Y  +L  F DR+ + YSIHQI 
Sbjct: 97  QRKRQPDSYFNVLNAFLDRKDSYYSIHQIG 126


>gi|432110194|gb|ELK33968.1| Cysteine protease ATG4A, partial [Myotis davidii]
          Length = 256

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 48/85 (56%), Positives = 64/85 (75%), Gaps = 1/85 (1%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+  HLGRDW W    ++
Sbjct: 46  DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEKQKEQ 105

Query: 78  -EAYLKILKMFEDRRTAPYSIHQIA 101
            + Y +IL+ F DR+   YSIHQ+ 
Sbjct: 106 PKEYQRILQCFLDRKDCCYSIHQMG 130



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 5/61 (8%)

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIA 296
             Y I  +G+++IFLDPHT Q      D E+D     D T+HC Q+  R++IL++DPS+A
Sbjct: 122 CCYSIHQMGDELIFLDPHTTQTF---VDTEEDGTVD-DQTFHCLQSPQRMNILNLDPSVA 177

Query: 297 V 297
           +
Sbjct: 178 L 178


>gi|148693225|gb|EDL25172.1| autophagy-related 4D (yeast), isoform CRA_a [Mus musculus]
          Length = 296

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 69/217 (31%), Positives = 112/217 (51%), Gaps = 36/217 (16%)

Query: 82  KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHV 140
           +I+  F D   AP+ +H++   G S GK  G+W+GP+ VA +LRK +    + S +V +V
Sbjct: 36  RIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGPSVVAHILRKAVESCSEVSRLVVYV 95

Query: 141 ALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
           + D T+    V +L +     +   +W+ +V+++P+RLG + +NPVY+  +K        
Sbjct: 96  SQDCTVYKADVARLLSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK-------- 144

Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI 260
                 ++L S                LG++GGKP H+LYFIGY  + +++LDPH  Q  
Sbjct: 145 ------ELLRSEL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP- 184

Query: 261 GCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
               D  Q S   L+S +HC    ++    MDPS  V
Sbjct: 185 --TVDVSQPS-FPLES-FHCTSPRKMAFAKMDPSCTV 217


>gi|339252578|ref|XP_003371512.1| cysteine protease ATG4B [Trichinella spiralis]
 gi|316968242|gb|EFV52545.1| cysteine protease ATG4B [Trichinella spiralis]
          Length = 414

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 90/346 (26%), Positives = 145/346 (41%), Gaps = 92/346 (26%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           L     D  +R+WFTYR GF  I  +    D GWGC +R GQM++A+ +L  +LGRDW  
Sbjct: 78  LSDFLEDFRTRIWFTYRHGFPCIPGTKFDNDCGWGCTIRSGQMLLAETMLRHYLGRDWLL 137

Query: 72  NVNS--KEEAYL--KILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLR- 125
             +   ++EA +  K++ +F D  T+P+S+H +   G    GK  G W+GP +V Q+L+ 
Sbjct: 138 GQSGLPEDEALMHRKVIGLFCDNLTSPFSLHNLVQVGQQLFGKQAGSWYGPVSVLQILQV 197

Query: 126 ---KLAKYDDWSSIVFHVALDNTLVVNQVKKLCT------------TNKRASSNPQ---- 166
                 +      +  HV  D  L+++ V++L               N  A   P+    
Sbjct: 198 AMNNAIERGLVEGLAVHVIGDGELIIDDVERLGCGLTLAPVPRRGPENDLADRQPKSSSY 257

Query: 167 ----------------------------------WQPLVLV-IPLRLGIQDINPVYINGI 191
                                             W   VLV +PLRLG++  N +Y + +
Sbjct: 258 LDLRRLTSVSNGDLLPSHDGESIGSTEFVDETRSWSRGVLVLLPLRLGVEKFNQLYSDHL 317

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
           K              ++LS+ +              +GVIGG+ +   YF G+  + +I 
Sbjct: 318 K--------------RVLSTKF-------------CVGVIGGRHHKCYYFCGWHTDYLIR 350

Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           LDPH +Q      D  Q        ++HC    +  I  +DP  ++
Sbjct: 351 LDPHYSQP---AVDATQPGVSL--HSFHCKYPKKTLIADIDPWCSI 391


>gi|256071261|ref|XP_002571959.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
 gi|353229490|emb|CCD75661.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
          Length = 376

 Score =  110 bits (276), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 81/284 (28%), Positives = 145/284 (51%), Gaps = 37/284 (13%)

Query: 31  FVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAY-LKILKMFED 89
           ++P+  S  T+D GWGCM RCGQM++AQAL+   LGR+W+   N ++  + L+I+K F D
Sbjct: 60  YIPL--SVQTSDCGWGCMFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFND 117

Query: 90  RRT--APYSIHQIALTGASEGKAVGEWFGPNTV-AQVLRKLAKYDDWSS----IVFHVAL 142
             +  +P S+H+  L   S+ K  GEW GP+++ + +LR +AK     S    +  ++A 
Sbjct: 118 SWSPFSPLSLHR--LVQMSDRKP-GEWCGPSSICSAILRVMAKGSSLDSRLSQVQVYLAR 174

Query: 143 DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY---INGIKKCYALPI 199
           D  +   ++  L    +   ++ Q+QP       ++   D   +Y    +     ++   
Sbjct: 175 DRVIYREEIIDLA---RGLHTSYQYQP-------KIYFTDHTALYRSQSDQTNDSHSFKP 224

Query: 200 SPVYDMVKILSSTYNMQTPRY------EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
           + +  ++ ++    N   PRY       F+ P  +G+IGG+  H+ Y++G   N +I+LD
Sbjct: 225 TAILLLIPLMFGKGNRINPRYIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLD 284

Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           PH  Q       +  +S K    ++HCP    +   +++PS AV
Sbjct: 285 PHFTQPT-----QNLNSPKFSVDSWHCPIPKTMSAANLNPSCAV 323


>gi|363754893|ref|XP_003647662.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356891299|gb|AET40845.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 469

 Score =  110 bits (276), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 93/316 (29%), Positives = 151/316 (47%), Gaps = 72/316 (22%)

Query: 14  QIRRDITSRLWFTYRKGFVPI-----GDSGL------------------------TTDKG 44
           +  +D+ SRL FTYR  F PI     G S +                         TD G
Sbjct: 62  EFLKDVNSRLHFTYRTRFAPIPRHIDGPSPMRISILLRDNPLNVIENVLNNLDCFQTDIG 121

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE--EAYLKILKMFEDRRTAPYSIHQIAL 102
           WGCM+R GQ ++A AL   +LGRD++ + +  +  E  +KI++ FED    P+S+H+   
Sbjct: 122 WGCMIRTGQSLLANALQLANLGRDFRISGSDSDINEVEMKIIRWFEDNPKHPFSLHKFVQ 181

Query: 103 TGAS-EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNT--LVVNQVKKLCTTNK 159
            G    GK  GEWFGP+ +++ +R L      S I   +   ++  + ++++  L   N 
Sbjct: 182 EGYKLSGKKPGEWFGPSAISRSIRSLVMKFPGSGIDHCIISTDSADVYLDEIDPLFRANP 241

Query: 160 RASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPR 219
           +A+        +L++ +RLG+   N  Y + IK               ILSS+       
Sbjct: 242 KANV-------LLLLGVRLGVDFTNEYYWDDIKN--------------ILSSS------- 273

Query: 220 YEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYH 279
                 QS+G+ GG+P+ +LYF GY G+ + +LDPH  Q    +Y  E D E+    + H
Sbjct: 274 ------QSVGISGGRPSSSLYFFGYQGDYLFYLDPHKVQLNLALY--ESDEERF--HSVH 323

Query: 280 CPQASRLHILHMDPSI 295
               +++H+  +DPS+
Sbjct: 324 PQTFNKIHLSAIDPSM 339


>gi|156042330|ref|XP_001587722.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980]
 gi|154695349|gb|EDN95087.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 414

 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 86/316 (27%), Positives = 129/316 (40%), Gaps = 83/316 (26%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  +++W TYR  F  I  S                       G T+D GWGC       
Sbjct: 106 DFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCS------ 159

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
                              +S EE   KIL +F D   APYSIH+    GAS  GK  GE
Sbjct: 160 -------------------SSNEER--KILSLFADDPRAPYSIHKFVEHGASACGKHPGE 198

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L      S +  ++  D + V        +  K  S+  ++ P +++
Sbjct: 199 WFGPSAAARCIQALTNSQVESELRVYITGDGSDVYEDT--FMSIAKPNST--KFTPTLIL 254

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLG+  I PVY   +K    +P                           QS+G+ GG
Sbjct: 255 VGTRLGLDKITPVYWEALKSSLQMP---------------------------QSVGIAGG 287

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
           +P+ + YFIG   +D  +LDPH  +      D  +D   +   + H  +  RLHI  MDP
Sbjct: 288 RPSSSHYFIGVQESDFFYLDPHQTRPALPFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDP 347

Query: 294 SIAVVSQ-RSYSDYKN 308
           S+ +    R  +D+K+
Sbjct: 348 SMLIAFLIRDENDWKD 363


>gi|431896953|gb|ELK06217.1| Cysteine protease ATG4C [Pteropus alecto]
          Length = 378

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 73/239 (30%), Positives = 109/239 (45%), Gaps = 59/239 (24%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++E+ R+D  SR+W TYR+ F  I  S LTTD GWGC LR GQM++AQ L+   LGR W 
Sbjct: 47  NVEEFRKDFISRIWLTYREEFPSIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 106

Query: 71  W----NV-NSKEEAYL-------------------------------------------- 81
           W    N+ NS  E++                                             
Sbjct: 107 WPDALNIDNSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGRYSDDHEMQNE 166

Query: 82  ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
               KI+  F D   A + +HQ+   G   GK  G+W+GP  VA +LRK    A++ +  
Sbjct: 167 IYHRKIISWFGDSPLALFGLHQLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPELQ 226

Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
            I  +VA D T+  + V  K C +   A      + +++++P+RLG +  N  Y+  +K
Sbjct: 227 GITIYVAQDCTVYSSDVIDKQCAS--MAPDITDDKAVIILVPVRLGGERTNIDYLEFVK 283


>gi|407408842|gb|EKF32115.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
          Length = 357

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 75/239 (31%), Positives = 108/239 (45%), Gaps = 40/239 (16%)

Query: 23  LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
           L+F+YR   VP+ + G TTD  WGCM+R GQM++A A +    G          +E   +
Sbjct: 74  LYFSYRNRIVPLMN-GATTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGNERLQELRAR 132

Query: 83  ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKY--DDWSSIVFH 139
              +F D  +AP+ IH I   G   G   GEWFGP  +A+ L  L A Y        V  
Sbjct: 133 TQTLFCDVPSAPFGIHAITSEGTKHGVKCGEWFGPTPIAKTLNALMASYLAAGGEGPVVL 192

Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
              +  + + QVK+L           Q   +VL+IP+ LGI+ I+  Y   +K+C  +  
Sbjct: 193 AFPERQIFLEQVKELLR---------QSMHVVLLIPVMLGIRVISEKYSQLMKRCLEM-- 241

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
                                      S+G++GGK   AL+  G+  +DV FLDPH  Q
Sbjct: 242 -------------------------ESSIGILGGKSRSALFLFGHQDDDVFFLDPHRVQ 275


>gi|149020505|gb|EDL78310.1| rCG31864, isoform CRA_c [Rattus norvegicus]
          Length = 337

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 67/217 (30%), Positives = 112/217 (51%), Gaps = 36/217 (16%)

Query: 82  KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHV 140
           +I+  F D   AP+ +H++   G S GK  G+W+GP+ VA +LRK +    + + +V +V
Sbjct: 77  RIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGPSVVAHILRKAVESCSEVTRLVVYV 136

Query: 141 ALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
           + D T+    V +L +     +   +W+ +V+++P+RLG + +NPVY+  +K        
Sbjct: 137 SQDCTVYKADVARLVSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK-------- 185

Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI 260
                 ++L S                LG++GGKP H+LYFIGY  + +++LDPH  Q  
Sbjct: 186 ------ELLRSEL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP- 225

Query: 261 GCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
               D  Q +   L+S +HC    ++    MDPS  V
Sbjct: 226 --TVDVNQ-ANFPLES-FHCTSPRKMAFAKMDPSCTV 258


>gi|145549650|ref|XP_001460504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428334|emb|CAK93107.1| unnamed protein product [Paramecium tetraurelia]
          Length = 402

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 85/327 (25%), Positives = 141/327 (43%), Gaps = 65/327 (19%)

Query: 1   MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQAL 60
           +R+ + +  Q +E+++R  +S +WF+YRK       S LT+D GWGCM+R  QM +AQ +
Sbjct: 52  VRNPSFILKQRIEKLKRICSSIIWFSYRKKIPQFQISSLTSDTGWGCMIRVAQMALAQVI 111

Query: 61  LFLH----------LGRDWQWNVNSKEEAYLKILKMFEDRRT----APYSIHQIALTGAS 106
              H          L R +   ++  ++  +  +K  +  +     AP+SI +I      
Sbjct: 112 RHYHSFTQPEQLIVLIRHF---LDDDDDELINFIKQDQKNQVQYYHAPFSIQKIVYHAKV 168

Query: 107 E-GKAVGEWFGPNTVAQVLRKLAKYDDWS-----SIVFHVALDNTLVVNQV--------- 151
           E  K  G+W+ PN + + L  L KY  +S      I +  A      + Q+         
Sbjct: 169 EFKKEPGDWYKPNEILETLNYLFKYSQYSLNMQIYINYQCAFILQDAIKQMFNYDKGNQE 228

Query: 152 -KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILS 210
             K C  N     +   + + + +P R+G+Q +N  Y+               +++ IL 
Sbjct: 229 WLKECIKNNNQFISQHDKGIAIFLPARIGLQRVNQDYL---------------EVLNIL- 272

Query: 211 STYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDS 270
                       T P   G+IGG  N A Y +G + + +I+LDPH  QN     D     
Sbjct: 273 -----------MTLPYFQGIIGGVTNRAFYIVGRIQDYLIYLDPHFVQNAQNFEDL---- 317

Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
             K  ++Y C     +H   +DPSI V
Sbjct: 318 -SKTQASYTCQNIQLIHNKSIDPSIVV 343


>gi|145510316|ref|XP_001441091.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408330|emb|CAK73694.1| unnamed protein product [Paramecium tetraurelia]
          Length = 392

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 82/324 (25%), Positives = 142/324 (43%), Gaps = 75/324 (23%)

Query: 9   HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
            Q +E++++  +  +WF+YRK       S LT+D GWGCM+R  QM +AQ + +      
Sbjct: 51  EQRIEKVKQTCSKIIWFSYRKNIPKFQVSSLTSDTGWGCMIRVAQMALAQIIRY------ 104

Query: 69  WQWNVNSKEEAYLKILKMFEDRRT-------------------APYSIHQIALTGASE-G 108
             +N   K E  + +++ F D                      AP+SI +I      E  
Sbjct: 105 --YNYFKKPEQLIVLIRHFIDDDDNELTDFIQQFHKNQNQYYHAPFSIQKIVHYAKVELK 162

Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWS-SIVFHVALDNTLVV-NQVKKLCTTNK------- 159
           K  G+W+  + + Q L  L KY  +S ++  ++  D   ++ + ++++    +       
Sbjct: 163 KEPGDWYKSDEILQTLDYLFKYSQYSLNMEIYINYDCAFILQDAIQQMFNQQEGNEIWLK 222

Query: 160 -RASSNPQW-----QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
            RA +N Q+     + + + +P R+G+Q+IN  Y+  + +  ALP               
Sbjct: 223 ERAKNNNQFDLQDHKGICIFLPTRIGLQNINKDYLEVLNQIIALPYFQ------------ 270

Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKK 273
                          G+IGG    ALYF+G + + +I+LDPH  QN      +  D   K
Sbjct: 271 ---------------GMIGGVSKRALYFVGRIQDYLIYLDPHFVQNA-----QNFDDLSK 310

Query: 274 LDSTYHCPQASRLHILHMDPSIAV 297
             ++Y C     +H   +DPSI V
Sbjct: 311 NQASYTCQNIQLIHNSLIDPSIVV 334


>gi|219129924|ref|XP_002185127.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403306|gb|EEC43259.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 557

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 95/327 (29%), Positives = 141/327 (43%), Gaps = 54/327 (16%)

Query: 15  IRRDITSRL-WFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW---Q 70
           IRRD    L WFTYR  F  I    +T+D GWGCMLR  QM++ QAL      RDW   Q
Sbjct: 167 IRRDDERSLFWFTYRCDFPEIAPYNITSDAGWGCMLRSAQMMLGQALRLHFKSRDWRPPQ 226

Query: 71  WNVNSKEEAYLK-ILKMFEDRRTAP---YSIHQIALTGASE-GKAVGEWFGPNTVAQVLR 125
                +++++++ +L  F D  ++    YS+H +   G S+  K  GEW+GP T   V+R
Sbjct: 227 LLARRRQDSFIRSVLTWFADYPSSSESVYSLHNMVAAGLSKYDKLPGEWYGPGTACYVMR 286

Query: 126 KLAKYDDWSSIVFHVALD-----------NTLVVNQVKKLCTTNKRA----------SSN 164
            L    +    +    LD            T+  + +    TT  R            + 
Sbjct: 287 DLVHIHEKQQALGKTRLDRRIFRVYVAPQGTVYRDTIHAFMTTEARVRIEEKKKVKEQTQ 346

Query: 165 PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
           PQ  PL L     L ++  N V  +            +   +  L+  Y +Q+  + F+ 
Sbjct: 347 PQAHPLDLEWEEEL-MESANTVEWDTALLLLVP----LRLGLTSLNEEY-VQSLAHTFSL 400

Query: 225 PQSLGVIGGKPNHALYFIGYV--GNDVIFLDPHT------------NQNIGCVYDKEQDS 270
           PQS+GV+GG+P  A +F G    G+ +  LDPHT            N     V +   D 
Sbjct: 401 PQSVGVLGGRPRGARWFYGAQKDGSKIFGLDPHTVQTAPGRQTARVNGQASSVVELSDDY 460

Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
            +   +T  CP+        MDPSIA+
Sbjct: 461 LRSCHTT--CPEM--FPFCKMDPSIAL 483


>gi|71415152|ref|XP_809652.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
           Brener]
 gi|70874068|gb|EAN87801.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 357

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 74/239 (30%), Positives = 111/239 (46%), Gaps = 40/239 (16%)

Query: 23  LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
           L+F+YR   VP+ + G TTD  WGCM+R GQM++A A +    G          +E   +
Sbjct: 74  LYFSYRNRIVPLMN-GATTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 132

Query: 83  ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKY--DDWSSIVFH 139
              +F D  +AP+ IH +   G   G   GEWFGP  +A+ L  L A Y        V  
Sbjct: 133 TQTLFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSALMASYLAAGGEGPVVL 192

Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
              +  + + +VK+L     R S++     +VL+IP+ LGI+ I+  Y   +K+C  +  
Sbjct: 193 AFPERQIFLEEVKELL----RQSTH-----VVLLIPVMLGIRVISEKYSQLMKRCLEM-- 241

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
                                      S+G++GGK   AL+  G+  +DV FLDPH  Q
Sbjct: 242 -------------------------ESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQ 275


>gi|111154179|gb|ABH07411.1| autophagin-2 [Trypanosoma cruzi]
          Length = 351

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 74/239 (30%), Positives = 111/239 (46%), Gaps = 40/239 (16%)

Query: 23  LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
           L+F+YR   VP+ + G TTD  WGCM+R GQM++A A +    G          +E   +
Sbjct: 68  LYFSYRNRIVPLMN-GATTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 126

Query: 83  ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKY--DDWSSIVFH 139
              +F D  +AP+ IH +   G   G   GEWFGP  +A+ L  L A Y        V  
Sbjct: 127 TQTLFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSALMASYLAAGGEGPVVL 186

Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
              +  + + +VK+L     R S++     +VL+IP+ LGI+ I+  Y   +K+C  +  
Sbjct: 187 AFPERQIFLEEVKELL----RQSTH-----VVLLIPVMLGIRVISEKYSQLMKRCLEM-- 235

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
                                      S+G++GGK   AL+  G+  +DV FLDPH  Q
Sbjct: 236 -------------------------ESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQ 269


>gi|407848120|gb|EKG03593.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
           Clan CA, family C54, putative [Trypanosoma cruzi]
          Length = 357

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 74/239 (30%), Positives = 111/239 (46%), Gaps = 40/239 (16%)

Query: 23  LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
           L+F+YR   VP+ + G TTD  WGCM+R GQM++A A +    G   +      +E   +
Sbjct: 74  LYFSYRNRIVPLMN-GATTDLFWGCMIRTGQMMLAHAFMRYFNGGGPRIGSERLQELRAR 132

Query: 83  ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKY--DDWSSIVFH 139
              +F D  +AP+ IH +   G   G   GEWFGP  +A+ L  L A Y        V  
Sbjct: 133 TQTLFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSALMASYLATGGEGPVIL 192

Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
              +  + + +VK+L     R S++     +VL+IP+ LGI  I+  Y   +K+C  +  
Sbjct: 193 AFPERQIFLEEVKELL----RQSTH-----VVLLIPVMLGICVISEKYSQLMKRCLEM-- 241

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
                                      S+G++GGK   AL+  G+  +DV FLDPH  Q
Sbjct: 242 -------------------------ESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQ 275


>gi|440297742|gb|ELP90383.1| cysteine protease atg4, putative [Entamoeba invadens IP1]
          Length = 330

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 83/303 (27%), Positives = 142/303 (46%), Gaps = 59/303 (19%)

Query: 10  QDLEQIRRDIT--SR--LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
           Q   ++R DI   SR  +W TYRK    +   G T+D GWGCM+R  QM +AQ+ + L +
Sbjct: 21  QHPRELREDINLYSRHTIWVTYRKNMKEL-PGGRTSDSGWGCMIRSMQMALAQSFVSLVM 79

Query: 66  GRDWQWNVNS----KEEAYLK-ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
           G  W++        + + +L+ I+ +F D   + +SIH +     + G   G+W+GP+  
Sbjct: 80  GNSWKFTKTGFQVERNKFHLRCIINLFGDGPGSLFSIHNLISRSTTRGVGDGKWWGPSFA 139

Query: 121 AQVLRKLAKYDDWSSI-VF----HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
           +++       D  ++I VF    +VA    +V   +  +      +  N    P ++ +P
Sbjct: 140 SEI-----AADHLNTIHVFRTRGYVARLGRIVKPDILDI------SEDNGNILPTIIFVP 188

Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
           LRLG                  P++   D   IL   +++         PQ +G++GGKP
Sbjct: 189 LRLG------------------PVNAEEDFRPILKKVFDI---------PQCVGMVGGKP 221

Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
           N A +F  + GN + +LDPHT QN     D    +E     +Y C     +   ++DPS+
Sbjct: 222 NLAFFFHTFDGNLLYYLDPHTTQN-AVSMDGGWSAE-----SYFCNDVKSMKYKNLDPSV 275

Query: 296 AVV 298
           +++
Sbjct: 276 SLL 278


>gi|406606786|emb|CCH41822.1| putative cysteine protease atg4 [Wickerhamomyces ciferrii]
          Length = 592

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 151/311 (48%), Gaps = 67/311 (21%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGL------------------TTDKGWGCMLRCGQM 54
           D+ +R+W TYR  F PI     G S L                  TTD GWGCM+R  Q 
Sbjct: 107 DVYTRIWLTYRTKFSPIDRDPEGPSPLSLNFFLRGQNYDLDNEHFTTDCGWGCMIRTSQS 166

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A ALL LH+GRDW++      E + +I+  F D  + P+SIH+I   G     K  GE
Sbjct: 167 LLANALLNLHIGRDWRY-TGELNEMHNEIVSWFIDCPSHPFSIHKIVDKGKLLSNKKPGE 225

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ L    D    V+  +    +  N V K+         N  ++P++++
Sbjct: 226 WFGPSAAARSIQSLCNEFDSGVKVYIGSDSGDIYENDVFKVA-----KDENGVFKPILIL 280

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           + LRLGI +INPVY + +K               IL+S              +S+G+ GG
Sbjct: 281 LGLRLGIDNINPVYWDSLK--------------AILNSK-------------ESIGIAGG 313

Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE--------KKLD-STYHCPQAS 284
           +P+ + YF G+ G+ + +LDPH  Q    ++D + D+           LD ++ H  +  
Sbjct: 314 RPSTSHYFFGFQGDHLFYLDPHLPQ-PALLHDDQLDTSVSESTEIVSSLDVNSVHTKKLR 372

Query: 285 RLHILHMDPSI 295
           ++H+  +DPS+
Sbjct: 373 KIHLSEVDPSM 383


>gi|167521501|ref|XP_001745089.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776703|gb|EDQ90322.1| predicted protein [Monosiga brevicollis MX1]
          Length = 392

 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 77/287 (26%), Positives = 131/287 (45%), Gaps = 56/287 (19%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           +Q+  D+ +R+WFTYRK F P+  S  TTD GWGCMLRCGQM++A  L+ +   R     
Sbjct: 118 QQLEDDVATRIWFTYRKDFPPLPSSRRTTDVGWGCMLRCGQMILATTLMAVLQPRVHHLL 177

Query: 73  VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY-D 131
             + E  +LK  +           +HQ+                P+ +AQ    L ++ D
Sbjct: 178 KYTMENHHLKAGRFQGPSSVGSALLHQV----------------PSALAQ----LNQFRD 217

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
           +   +  + A D  ++++Q++             +++P++LV+PLRLGI+ I P Y    
Sbjct: 218 EEVKLRTYFASDTLVILDQLRP-------EEGQAEFEPIMLVLPLRLGIEKIGPQY---- 266

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
                      +  +++L               P  +G IGG    A+Y  GY G+    
Sbjct: 267 -----------HARLQLL------------LRQPWCMGFIGGHDKRAMYIFGYQGHQYFG 303

Query: 252 LDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           LDPH  +  +     + +D   ++  ++H  + S +    +DPS+AV
Sbjct: 304 LDPHRCSAAVAQSTAELRDRWVEVRDSFHTSKLSGIERDDLDPSLAV 350


>gi|258566559|ref|XP_002584024.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237907725|gb|EEP82126.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 377

 Score =  107 bits (266), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 67/208 (32%), Positives = 102/208 (49%), Gaps = 38/208 (18%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR  F  I  S                       G TTD GWGCM+R GQ 
Sbjct: 95  DFESRIWITYRSNFPAIPKSKDPNAQQALTFSVRLRSQLLDTRGFTTDTGWGCMIRSGQS 154

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A ALL   LGRDW+    + +E  + +L +F DR  AP+SIH+    GA+  GK  GE
Sbjct: 155 LLANALLIQKLGRDWRRGSETGKE--IALLSLFADRPQAPFSIHRFVEHGAAACGKHPGE 212

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLV-VNQVKKLCTTNKRASSNPQWQPLVL 172
           WFGP+  A+ + +       + +  +V  D + V  ++ +++   +         +P ++
Sbjct: 213 WFGPSATARCIDECEH----AGLNVYVTSDGSDVHEDKFRQIAGLD-------DIKPTLI 261

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPIS 200
           ++ +RLGI  I PVY + +K     P S
Sbjct: 262 LLGVRLGIDSITPVYWDALKAIIQYPQS 289


>gi|384493397|gb|EIE83888.1| hypothetical protein RO3G_08593 [Rhizopus delemar RA 99-880]
          Length = 194

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 53/112 (47%), Positives = 72/112 (64%), Gaps = 4/112 (3%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           D TSRLW TYR  + PI  S   TD GWGC LR GQ ++A  L+   LGRDW+    ++ 
Sbjct: 83  DFTSRLWMTYRHNYPPIRPSSHKTDIGWGCTLRSGQSLLANTLIIHFLGRDWRRQTQNQA 142

Query: 78  --EAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
             + Y +I+  F D  +  AP+SIH+IAL G   GK +GEWFGP+T++QV++
Sbjct: 143 AWKQYSRIVHWFLDELSPRAPFSIHRIALLGKQLGKNIGEWFGPSTISQVIQ 194


>gi|344229797|gb|EGV61682.1| hypothetical protein CANTEDRAFT_115142 [Candida tenuis ATCC 10573]
          Length = 408

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 73/312 (23%), Positives = 131/312 (41%), Gaps = 82/312 (26%)

Query: 19  ITSRLWFTYRKGFVPI-------------------------------GDSGLTTDKGWGC 47
           + + +W TYR GF PI                                +   TTD GWGC
Sbjct: 77  VEALVWLTYRTGFEPIPKNPNGPHPLAFVQSMVFNKNPLSTNVHSFIDNENFTTDVGWGC 136

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE 107
           M+R  Q ++A           ++  ++   +  +++L  F+D   AP+S+H         
Sbjct: 137 MIRTSQSLLANT---------YKRMISEDAQQEIQLLDQFKDSEAAPFSLHNFIRVANES 187

Query: 108 GKAV--GEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP 165
              V  G+WFGPN  +  +++L    +         L  ++++++   L     +   + 
Sbjct: 188 PLQVKPGQWFGPNAASLSIQRLCNLVNSKENFGLPGL--SVLISENSDLYDDKVQEFLDK 245

Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
           + Q L++++P+RLGI   N  Y + I              +++L+               
Sbjct: 246 KKQSLLILLPIRLGIDKTNEFYYSSI--------------LQLLNCK------------- 278

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
           QS+G+ GGKP+ + YF GY  +++++LDPH  Q     Y+           +YH P+  R
Sbjct: 279 QSVGIAGGKPSSSFYFFGYDNDELLYLDPHYPQGTNAGYN-----------SYHTPRYQR 327

Query: 286 LHILHMDPSIAV 297
           L I  +DPS+ +
Sbjct: 328 LTISQLDPSMMI 339


>gi|150864470|ref|XP_001383296.2| hypothetical protein PICST_30446 [Scheffersomyces stipitis CBS
           6054]
 gi|166990661|sp|A3LQU0.2|ATG4_PICST RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|149385726|gb|ABN65267.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 514

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 91/345 (26%), Positives = 143/345 (41%), Gaps = 99/345 (28%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGF--VP-------------------------------I 34
           S+Q  E+   DI  +L  TYR GF  +P                               I
Sbjct: 94  SYQTTEEAHEDIIKKLCLTYRYGFERIPRAVNGPSPLSFMQSVIFSKSLLYNLQNFNNFI 153

Query: 35  GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAP 94
                TTD GWGCM+R  Q ++A   + L    D Q +          I+ +F D   AP
Sbjct: 154 EKENFTTDVGWGCMIRTSQSLLANTFVRL---LDKQSD----------IIALFNDTYLAP 200

Query: 95  YSIHQIALTGASEGKAV--GEWFGPNTVAQVLRKLAK--YDDWSS------IVFHVALDN 144
           +S+H      +S    V  GEWFGPN  +  +++L    YD+ +S      I   ++   
Sbjct: 201 FSLHNFIRVASSSPLKVKPGEWFGPNAASLSIKRLCDGYYDNSTSETILPRINVLISEST 260

Query: 145 TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYD 204
            L  +Q+ +L   +       + + L++++P+RLGI  IN  Y + +    +L       
Sbjct: 261 DLYDSQIAQLLEPST------ETKGLLVLLPVRLGIDSINSYYFSSLLHLLSLE------ 308

Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
                                QS+G+ GGKP+ + YF GY  N +I++DPH+ Q      
Sbjct: 309 ---------------------QSVGIAGGKPSSSFYFFGYQDNSLIYMDPHSAQIFSSDI 347

Query: 265 DKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKN 308
           D          STY+  +  R+ I  +DPS+ + V  R  + Y+N
Sbjct: 348 DM---------STYYATRYQRVDIGKLDPSMLIGVFIRDLTSYEN 383


>gi|148693227|gb|EDL25174.1| autophagy-related 4D (yeast), isoform CRA_c [Mus musculus]
          Length = 257

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 76/150 (50%), Gaps = 35/150 (23%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 167

Query: 71  WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
           W   +                                   ++  + +I+  F D   AP+
Sbjct: 168 WVEGTGLASSEMPGPASPSRCRGPGRRGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPF 227

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
            +H++   G S GK  G+W+GP+ VA +LR
Sbjct: 228 GLHRLVELGRSSGKKAGDWYGPSVVAHILR 257


>gi|62899792|sp|Q8NJJ3.1|ATG4_PICPA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4; AltName:
           Full=Pexophagy zeocin-resistant mutant protein 8
 gi|21585563|gb|AAL25849.1| Paz8 [Komagataella pastoris]
          Length = 533

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 77/279 (27%), Positives = 127/279 (45%), Gaps = 57/279 (20%)

Query: 1   MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIG---DS------------------GL 39
           ++  NK S    +    D+ S++W TYR GF PI    DS                  G 
Sbjct: 51  IKDGNKKSTTYSQSFIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGF 110

Query: 40  TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA-YLKILKMFEDRRTAPYSIH 98
           T+D GWGCM+R  Q ++A ALLFLHLGRDW +         + +I+  F D    P+SIH
Sbjct: 111 TSDAGWGCMIRTSQSLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIH 170

Query: 99  QIALTG-ASEGKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCT 156
                G     K  GEWFGP+  ++ ++ L K Y      V+  +    +   +V++L  
Sbjct: 171 NFVQQGIKCCDKKPGEWFGPSAASRAIKNLCKEYPPCGLRVYFSSDCGDVYDTEVRELAY 230

Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
            +     +  + P+++++ +RLG++ +N    + +++C +L                   
Sbjct: 231 GD-----SDTFTPILVLLGIRLGVEKVNLYIGDLLRECLSL------------------- 266

Query: 217 TPRYEFTFPQSLGVIGGKPNH-ALYFIGYVGNDVIFLDP 254
                    QS+G+ G K +  AL  IG+ G+ + +L P
Sbjct: 267 --------KQSVGISGRKTSFLALLSIGFQGDYLFYLIP 297


>gi|221046296|dbj|BAH14825.1| unnamed protein product [Homo sapiens]
          Length = 280

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 53/150 (35%), Positives = 74/150 (49%), Gaps = 35/150 (23%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L RDW 
Sbjct: 131 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 190

Query: 71  WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
           W                                       +E  + +I+  F D   AP+
Sbjct: 191 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 250

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
            +H++   G S GK  G+W+GP+ VA +LR
Sbjct: 251 GLHRLVELGQSSGKKAGDWYGPSLVAHILR 280


>gi|353441084|gb|AEQ94126.1| putative cysteine protease [Elaeis guineensis]
          Length = 169

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 49/114 (42%), Positives = 68/114 (59%), Gaps = 1/114 (0%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
            D +SR+W TYRKGF  IGDS  T+D  WGCM+R  QM++AQALLF HLGR W+  +   
Sbjct: 48  EDFSSRIWITYRKGFDAIGDSKFTSDVRWGCMIRSSQMLVAQALLFHHLGRSWRKPSQKP 107

Query: 76  KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
            +  Y++IL +F D     +SIH +   G + G A  EW GP  + +    + +
Sbjct: 108 HDSKYIEILHLFGDSEACAFSIHNLLEAGKAYGLAAREWVGPYAMCRTWETITR 161


>gi|344304092|gb|EGW34341.1| hypothetical protein SPAPADRAFT_59751, partial [Spathaspora
           passalidarum NRRL Y-27907]
          Length = 363

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 81/339 (23%), Positives = 143/339 (42%), Gaps = 99/339 (29%)

Query: 9   HQDLEQIRRDITSRLWFTYRKGFVPIGDS------------------------------G 38
           +  LE     I+++LW +YR GF PI  +                               
Sbjct: 61  YTSLEDAEHSISNKLWLSYRCGFDPITKAPDGPTPISFFPSLVFNKRLFTTVRSLFDSEN 120

Query: 39  LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
             +D GWGCM+R  Q ++A AL+ L            +  A  +++ +F+D   + +S+H
Sbjct: 121 FNSDVGWGCMIRTSQSLLANALMKL------------QPSAEHEVINLFQDNIASAFSLH 168

Query: 99  QIALTGASEGKAV--GEWFGPNTVAQVLRKLAKYDDWSSI-------VFHVALDNTLVVN 149
                 +     V  G+WFGPN  +   +KL       +I       VF ++ ++ L   
Sbjct: 169 NFIRVASESPLEVKPGQWFGPNAASLSTKKLLDGMKGKTIQGVKYPHVF-ISENSDLYDE 227

Query: 150 QVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKIL 209
           ++++L   +           ++++ P+RLGI ++N  Y + I +  A P +         
Sbjct: 228 EIEELLVESS----------VLILFPVRLGIDNVNSYYYDSIFQLLACPFT--------- 268

Query: 210 SSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD 269
                             +G+ GGKP+ + YF+GY   D+++ DPH+ Q    +Y+   +
Sbjct: 269 ------------------VGISGGKPSSSFYFLGYQDQDLLYFDPHSPQ----LYENPIN 306

Query: 270 SEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYK 307
                 +TYH     RLHI  +DPS+ V +  +  S+YK
Sbjct: 307 Y-----TTYHTNNYQRLHIHMLDPSMMVGILVKDKSEYK 340


>gi|71043632|ref|NP_001020882.1| cysteine protease ATG4B [Rattus norvegicus]
 gi|68533688|gb|AAH98833.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Rattus
           norvegicus]
          Length = 224

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 88/181 (48%), Gaps = 56/181 (30%)

Query: 142 LDNTLVVNQVKKLCTTN-----------------------KRASSNP-QWQPLVLVIPLR 177
           +DNT+V+ ++++LC  +                          ++ P  W+PLVL+IPLR
Sbjct: 1   MDNTVVMEEIRRLCRASLPCAGAAALSMESERHCNGLPAGAEVTNRPLAWRPLVLLIPLR 60

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG+ DIN  Y+  +K C                           F  PQSLGVIGGKPN 
Sbjct: 61  LGLTDINEAYVETLKHC---------------------------FMMPQSLGVIGGKPNS 93

Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIA 296
           A YFIGYVG ++I+LDPHT Q       +  DS    D ++HC     R+ I  +DPSIA
Sbjct: 94  AHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPCRMGIGELDPSIA 149

Query: 297 V 297
           V
Sbjct: 150 V 150


>gi|149020503|gb|EDL78308.1| rCG31864, isoform CRA_a [Rattus norvegicus]
          Length = 256

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 55/150 (36%), Positives = 77/150 (51%), Gaps = 36/150 (24%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+  S LT+D GWGCMLR GQM++AQ LL   L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGS-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 166

Query: 71  W----NVNSKE-------------------------------EAYLKILKMFEDRRTAPY 95
           W     + S E                                 + +I+  F D   AP+
Sbjct: 167 WVEGTGLASSEMPGPASPSRYRGPGRRGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPF 226

Query: 96  SIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
            +H++   G S GK  G+W+GP+ VA +LR
Sbjct: 227 GLHRLVELGQSSGKKAGDWYGPSVVAHILR 256


>gi|367014015|ref|XP_003681507.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
 gi|359749168|emb|CCE92296.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
          Length = 460

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 88/317 (27%), Positives = 133/317 (41%), Gaps = 76/317 (23%)

Query: 14  QIRRDITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKG 44
           Q   D+ SRL FTYR  FVPI     G S L+                        TD G
Sbjct: 60  QFLSDVHSRLHFTYRTKFVPIPRVSDGPSPLSFHFLIRENPLTTIENAIYNPDCFNTDIG 119

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
           WGCM+R GQ ++  AL   +LGRD++ N    +E Y KI+  F D   A +SIH     G
Sbjct: 120 WGCMIRTGQSLLGNALQIANLGRDFRVNQGKDQEEY-KIIDWFADTPQAHFSIHNFVSQG 178

Query: 105 AS-EGKAVGEWFGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA- 161
                K  GEWFGP   ++ ++ L  ++ D         +D  L+      +     R  
Sbjct: 179 LKLSNKKPGEWFGPAATSRSIQCLVEQFPD-------CGIDKCLISVSSGDVFEDEVREI 231

Query: 162 -SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
            +  PQ          R+ +     + +N + + Y        D+ K L S +       
Sbjct: 232 FAQKPQ---------SRILLLLGVKLGVNAVNEYYW------DDVKKTLGSKF------- 269

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
                 S+G+ GG+P+ +LYF+G+ GN++I+ DPHT Q           +      T H 
Sbjct: 270 ------SVGIAGGRPSSSLYFMGFQGNELIYFDPHTPQ-------PSLQTSANFYDTCHA 316

Query: 281 PQASRLHILHMDPSIAV 297
               +L +  +DPS+ +
Sbjct: 317 LNFGKLLLSDLDPSMLI 333


>gi|400593108|gb|EJP61110.1| peptidase family C54 [Beauveria bassiana ARSEF 2860]
          Length = 378

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 70/222 (31%), Positives = 101/222 (45%), Gaps = 51/222 (22%)

Query: 5   NKLSHQD--LEQIRRDITSRLWFTYRKGFVPI----------------------GD-SGL 39
           +++ H++   +Q   D  SR W TYR  F PI                      GD  G 
Sbjct: 103 DEMDHENGWPQQFITDFDSRFWMTYRNDFKPIPRSKDPKAASSMSFPMRIKYQLGDQGGF 162

Query: 40  TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ 99
           ++D GWGCM+R GQ ++A A   + LGRDW+      EE  +KI++MF D   APYSIH 
Sbjct: 163 SSDSGWGCMIRSGQSLLANATGIVRLGRDWRRGQQKAEE--IKIMRMFADDPAAPYSIHN 220

Query: 100 IALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN 158
               G+S+ GK  GEWFGP+  +Q +     Y+D  S +     D+              
Sbjct: 221 FVDYGSSKCGKYPGEWFGPSATSQCINPDV-YED--SFMATAKSDHGF------------ 265

Query: 159 KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
                   ++P +++I  RLGI  I  VY   +     +P S
Sbjct: 266 --------FKPTLILISTRLGIDKITQVYWEALISALQMPQS 299


>gi|342186623|emb|CCC96110.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 388

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 111/247 (44%), Gaps = 40/247 (16%)

Query: 15  IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
           +R      L+F+YR+ F P+  +G T+D GWGC +R  QM++A A +    G     + N
Sbjct: 86  VRAAAQKLLYFSYRRQFEPL-RNGATSDVGWGCTIRACQMMLAWAFMRYRNGGSVTMDDN 144

Query: 75  SKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA---KYD 131
             +       ++F D  TAP+ IH +   G   G   G WFGP  +A+V+  L    +  
Sbjct: 145 VVDSLKEFTQRLFYDVPTAPFGIHAMTNEGVRHGVTCGMWFGPTPMAKVIGALNEAYRSS 204

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
                   VA D  +    V+ +    +R+      Q +VL+IP++LG Q ++  Y N +
Sbjct: 205 GGEGPEVLVASDRQI---GVQDVVVRLQRS------QHVVLLIPVKLGPQTVSVTYANAL 255

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
           K+                            F    S+G +GG+ N A +F GY G+ +I 
Sbjct: 256 KR---------------------------FFEMGSSIGAVGGEKNSAYFFFGYQGDKIIH 288

Query: 252 LDPHTNQ 258
           LDPH  Q
Sbjct: 289 LDPHYVQ 295


>gi|358336800|dbj|GAA27956.2| autophagy-related protein 4 [Clonorchis sinensis]
          Length = 507

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 77/297 (25%), Positives = 129/297 (43%), Gaps = 56/297 (18%)

Query: 40  TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNS------KEEAYLKILKMFED--RR 91
           T+D GWGCM+R GQM++AQ L+   LGRDW+    +      ++  + ++++ F D   +
Sbjct: 183 TSDSGWGCMIRSGQMLLAQTLMIHLLGRDWRAFRGTSPIKTPEDHLHRQLIRWFHDCWSQ 242

Query: 92  TAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA-----KYDDWSSIVFHVALDNTL 146
            +P+S+H++     + G+  G WFGP T+   L K+      ++++ + +  +   D  +
Sbjct: 243 ESPFSLHRLV---QASGQLPGSWFGPATLCSALVKVMSDASRRFEELARVHIYWVRDRVI 299

Query: 147 VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMV 206
              ++  L           + QP V   P RL   D +  + +   +  + PI P Y   
Sbjct: 300 YREEIMNLA----------RGQP-VRRKPGRLNFTDFSENFQHCCSQECSPPIPPTYLQD 348

Query: 207 KILSSTYNMQTPRYEFTF--------------------------PQSLGVIGGKPNHALY 240
            I SS      P +                              P  +G+IGG+P H++Y
Sbjct: 349 GIQSSPSTTLFPSHAVILLLPIRLGLDKRIDARYVPMVCRLVRDPCFVGIIGGRPRHSIY 408

Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            +G     +I LDPH  Q    V     DSE+    T+HC     +    +DPS AV
Sbjct: 409 ILGCQNTQLIHLDPHFTQP---VVRNVVDSEEFNVKTWHCLVPRVIEAAKLDPSCAV 462


>gi|340508502|gb|EGR34192.1| hypothetical protein IMG5_021070 [Ichthyophthirius multifiliis]
          Length = 285

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 72/285 (25%), Positives = 126/285 (44%), Gaps = 62/285 (21%)

Query: 19  ITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
             S +W TYR+ F P+        +D GWGCM+R GQM +A+ L    +  D        
Sbjct: 2   FESIIWITYRRKFPPLKAPQYEYISDTGWGCMIRVGQMALAEGLKRFQIKED-------- 53

Query: 77  EEAYLKILKMFEDRRTAPYSIHQIALTGASEGK-AVGEWFGPNTVAQVLRKLAKYDDWSS 135
                +I+ +F+D++ + +SI  I   G  E K   G+WF P  +  +L+ L +   +  
Sbjct: 54  -----EIIDLFQDKKDSLFSIQNICEAGKEEFKLEAGDWFNPIRICYILQILNEKKGFKD 108

Query: 136 I-VFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
           + +  ++ D  L+   ++   ++ K          L+L +  +LG++     Y+      
Sbjct: 109 LKIRTISSDRILIFEDLEMEFSSEKNG--------LILFLVCKLGLEKTEENYLK----- 155

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
            AL I                      F +  S+G+IGGKP  AL+F+G + + +I+LDP
Sbjct: 156 IALKI----------------------FDYKNSIGMIGGKPKKALFFVGRIEDQLIYLDP 193

Query: 255 HTNQNIGCVYDKEQDSEKKLD-STYHCPQASRLHILHMDPSIAVV 298
           H  Q+          ++  +D ++Y C   + L    +D SI  V
Sbjct: 194 HYVQDF---------NQNNVDQNSYFCKNYAVLDQKKIDSSIGNV 229


>gi|448509127|ref|XP_003866066.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
 gi|380350404|emb|CCG20626.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
          Length = 419

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 95/348 (27%), Positives = 154/348 (44%), Gaps = 96/348 (27%)

Query: 4   ANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDS-------------------------- 37
            N   +QD  + R  I S LW +YR GF PI  S                          
Sbjct: 72  GNHFINQD--EARDHIYSLLWLSYRCGFSPIPKSIDGPQPVTFFPSLLFSKSTLTNVGNL 129

Query: 38  -------GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDR 90
                    T+D GWGCM+R  Q ++A A     L    + N N +    L+ILK+F+D 
Sbjct: 130 RSLFDNENFTSDAGWGCMIRTSQNLLANA----LLKLAGEANGNVQ----LEILKLFQDD 181

Query: 91  RTAPYSIHQIALTGASEGKAV--GEWFGPNTVAQVLRKLA--KYDDWSSIV---FHVALD 143
             A +SIH      ++   +V  G+WFGPN  +  +R+L     D  S  V    +++ +
Sbjct: 182 PNAAFSIHNFIRVASASPLSVKPGQWFGPNAASISIRQLTIEMTDQESPTVVPFVYISEN 241

Query: 144 NTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVY 203
             L  +++++     KR        PL+L+ P+RLGI  +N  Y   I            
Sbjct: 242 ADLYDDEIEETFLKEKR--------PLLLLFPVRLGIDHVNKYYYKSI------------ 281

Query: 204 DMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND-VIFLDPHTNQNIGC 262
             +++L+S +             S+G+ GGKP+ + YFIGY  ++ +I+ DPH  Q    
Sbjct: 282 --LQLLASRF-------------SVGIAGGKPSSSFYFIGYENDENLIYFDPHLPQ---- 322

Query: 263 VYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
           V++   +      ++YH    ++L I  +DPS+ + V   S S+Y+ +
Sbjct: 323 VFESPINL-----ASYHTLNYNKLSIEMLDPSMMIGVLLGSMSEYREL 365


>gi|345311182|ref|XP_001519565.2| PREDICTED: cysteine protease ATG4D-like, partial [Ornithorhynchus
           anatinus]
          Length = 147

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 51/147 (34%), Positives = 72/147 (48%), Gaps = 33/147 (22%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+E  +RD  SRLW TYR+ F P+  S  T+D GWGCMLR GQM++AQ L+   L RDW 
Sbjct: 1   DVESFQRDFVSRLWLTYRRDFPPLEGSAWTSDCGWGCMLRSGQMLLAQGLVVHLLSRDWI 60

Query: 71  WN---------------------------------VNSKEEAYLKILKMFEDRRTAPYSI 97
           W                                  +  +E  + +I+  F D   AP+S+
Sbjct: 61  WAEAGPAPKPGEHRLLKSDPGGPSRSPAPPPPAGVLQEQERQHRRIVSWFADHPQAPFSL 120

Query: 98  HQIALTGASEGKAVGEWFGPNTVAQVL 124
           H++   G   GK  G+W+GP+  A +L
Sbjct: 121 HRLVRLGQGSGKRAGDWYGPSLTAHLL 147


>gi|358339268|dbj|GAA47364.1| autophagy-related protein 4 [Clonorchis sinensis]
          Length = 700

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 71/98 (72%)

Query: 54  MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGE 113
           M++A+A+  +HLG+DW+W    ++EAY ++ +MF+D +++ YSI  I + G +  K +G 
Sbjct: 1   MMLAEAITRIHLGKDWRWTPGCQDEAYCRLRRMFQDHKSSLYSIQNITMLGMALDKPIGS 60

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV 151
           WFGPNTVAQV++KL  YD  ++   H+++++ ++V+++
Sbjct: 61  WFGPNTVAQVIKKLCAYDPCTNWYVHISVEDGVIVDEI 98



 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 72/147 (48%), Gaps = 34/147 (23%)

Query: 153 KLCTTNKRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
           +L  +   A+ +P  W+PL+L IPLRLG+   NP Y N IK    +              
Sbjct: 246 RLQASEIEATPSPATWRPLLLFIPLRLGLHQPNPCYFNAIKAILQI-------------- 291

Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN-DVIFLDPHTNQNIGCVYDKEQDS 270
                        P S+G++GG+P+HA++ +G  G+ D++ LDPHT Q        + D 
Sbjct: 292 -------------PHSIGIMGGRPSHAVWIVGTAGDEDLLCLDPHTTQPAS-----QDDL 333

Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
             + D T+HC    RL +  +DPS+ +
Sbjct: 334 TAEDDVTHHCDCPVRLPLERLDPSMVI 360


>gi|323331874|gb|EGA73286.1| Atg4p [Saccharomyces cerevisiae AWRI796]
          Length = 347

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 113/257 (43%), Gaps = 54/257 (21%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS- 106
           M+R GQ ++  AL  LHLGRD++ N N   E   K +  F D   AP+S+H     G   
Sbjct: 1   MIRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTEL 60

Query: 107 EGKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKR 160
             K  GEWFGP   A+ ++ L         DD   IV   + D  +  N+V+K+   N  
Sbjct: 61  SDKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPN 116

Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
           +        ++ ++ ++LGI  +N  Y   I                ILSST        
Sbjct: 117 SR-------ILFLLGVKLGINAVNESYRESI--------------CGILSST-------- 147

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
                QS+G+ GG+P+ +LYF GY GN+ +  DPH  Q            E     + H 
Sbjct: 148 -----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHT 193

Query: 281 PQASRLHILHMDPSIAV 297
            +  +L +  MDPS+ +
Sbjct: 194 SKFGKLQLSEMDPSMLI 210


>gi|429850312|gb|ELA25600.1| cysteine protease atg4 [Colletotrichum gloeosporioides Nara gc5]
          Length = 411

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 76/305 (24%), Positives = 122/305 (40%), Gaps = 95/305 (31%)

Query: 18  DITSRLWFTYRKGFVPIG-----------------------DSGLTTDKGWGCMLRCGQM 54
           D  S+ W TYR  F  I                         SG ++D GWGCM+R GQM
Sbjct: 114 DFESKFWMTYRSEFELIAKSTDPRASSALSLSMRIKSQLVDQSGFSSDSGWGCMIRSGQM 173

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEW 114
           ++A A+   +LGR                                      + GK  GEW
Sbjct: 174 LLANAMAITNLGR-------------------------------------VACGKYPGEW 196

Query: 115 FGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVLV 173
           FGP+  A+ ++ L    +  S+  +   D   V  ++  K+   +       ++ P +++
Sbjct: 197 FGPSATARCIQSLTNAQEQPSLRVYSTGDGPDVYEDKFMKIAKPD-----GTRFHPTLIL 251

Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
           +  RLGI  I PVY + +     +P                           QS+G+ GG
Sbjct: 252 VGTRLGIDKITPVYWDALIAALQMP---------------------------QSVGIAGG 284

Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
           +P+ + YFIG  G+ + +LDP HT   +    D  + ++  +D T H  +  RLH+  MD
Sbjct: 285 RPSASHYFIGAQGSFLFYLDPHHTRPALPYHSDPSRYTDADID-TAHTRRLRRLHVREMD 343

Query: 293 PSIAV 297
           PS+ +
Sbjct: 344 PSMLI 348


>gi|261335715|emb|CBH18709.1| peptidase, putative [Trypanosoma brucei gambiense DAL972]
          Length = 348

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 78/257 (30%), Positives = 115/257 (44%), Gaps = 54/257 (21%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E ++      L+F+YR  F P+  +G TTD GWGC +R GQM++A AL+     R     
Sbjct: 40  EMVKLAACKLLYFSYRCQFEPL-RNGSTTDIGWGCTIRAGQMMLAHALM-----RYKNGG 93

Query: 73  VNSKEEAYLKILK-----MFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
             S E++ +  LK     +F D  +AP+ IH I   G   G   G WFGP  VA V+  L
Sbjct: 94  GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVVMGAL 153

Query: 128 AKYDDWSSIVFH-----VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQD 182
              +D+ S         V  D  ++ ++V+K+   +K              IP+ LG   
Sbjct: 154 --MEDYLSSGGQGPDVLVLRDRQVMEDEVRKILLLSKHVLLL---------IPVMLGPHH 202

Query: 183 INPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFI 242
           I+  Y   +K+C                        R E T    +G +GGK   A +F+
Sbjct: 203 ISEGYAKLLKRCL-----------------------RMEST----VGAVGGKEGSAFFFM 235

Query: 243 GYVGNDVIFLDPHTNQN 259
           GY G ++I LDPH  Q+
Sbjct: 236 GYQGGNLIVLDPHYAQS 252


>gi|340059839|emb|CCC54236.1| putative peptidase [Trypanosoma vivax Y486]
          Length = 354

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 68/248 (27%), Positives = 117/248 (47%), Gaps = 42/248 (16%)

Query: 15  IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQA-LLFLHLGRDWQWNV 73
           + R     L+F+YR GF P+ + G TTD  WGC++R  QM++AQA + F + G  +  + 
Sbjct: 61  VTRATQKLLYFSYRCGFTPLSN-GSTTDVAWGCVVRAAQMLLAQAHMRFFNSGHAFV-DG 118

Query: 74  NSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
           ++ +    K+  +F D  +AP+ IH +       G A G+WFG    A+ +  L +    
Sbjct: 119 SALQILREKVQPLFLDDPSAPFGIHAMTSEAEKYGVACGQWFGMTPAAKTIASLCQQHSL 178

Query: 134 ---SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
              +     V +D  +   +V+ L + +++         +VL+IP  LG+  I+  Y   
Sbjct: 179 RGGNGPAVLVFVDREVSALKVRDLLSHSRQ---------VVLLIPAVLGLDRISVKYSKM 229

Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
           + +C  +                              +GVIGG+ + ALYF+G+  N++I
Sbjct: 230 LIRCLEM---------------------------ESCIGVIGGRKSSALYFVGHQSNNII 262

Query: 251 FLDPHTNQ 258
           +LDPH  Q
Sbjct: 263 YLDPHRAQ 270


>gi|444321667|ref|XP_004181489.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
 gi|387514534|emb|CCH61970.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
          Length = 577

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 83/326 (25%), Positives = 130/326 (39%), Gaps = 78/326 (23%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPI-----------------------------GDSG 38
           +  D  +   D  SRL FTYR  F PI                               + 
Sbjct: 122 ADDDSVEFLEDCKSRLIFTYRTNFSPIERAPDGPSPINVSVLFRDTLFNTVNHVLNNPNS 181

Query: 39  LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-----NVNSKEEAYLKILKMFEDRRTA 93
            TTD GWGCM+R GQ ++  AL  ++LGR+++      N N+K      I++ F D    
Sbjct: 182 FTTDIGWGCMIRTGQSLLGNALQIINLGRNFRINNQSNNPNTKNIKEEDIIEWFYDNPNK 241

Query: 94  PYSIHQIALTGAS-EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
           P+SIH+    G     K  GEWFGP+T    ++ L  Y+          +D  ++     
Sbjct: 242 PFSIHKFVDKGMRISDKKPGEWFGPSTTCTAIQSLI-YE-----FPECGIDECILSVSSG 295

Query: 153 KLCTTNKRASSNPQWQPLVLV-IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
            +               ++L+ + ++LGI  IN  Y N IK               IL+S
Sbjct: 296 DIYEDEINEHFQKNENTIILILLGVKLGIDKINQCYFNDIK--------------DILNS 341

Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
            Y             S G+ GG+P+ +LYF G++   + + DPH  Q             
Sbjct: 342 RY-------------SCGISGGRPSSSLYFFGHMNEYLYYFDPHKPQ---------LQLN 379

Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
           +   ++ H    S++ I  +DPS+ +
Sbjct: 380 EDFKNSCHSTDYSKILISEIDPSMLI 405


>gi|74026240|ref|XP_829686.1| peptidase [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
 gi|70835072|gb|EAN80574.1| peptidase, putative [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 348

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 76/255 (29%), Positives = 113/255 (44%), Gaps = 50/255 (19%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E ++      L+F+YR  F P+  +G TTD GWGC +R GQM++A AL+     R     
Sbjct: 40  EMVKLAACKLLYFSYRCQFEPL-RNGSTTDIGWGCTIRAGQMMLAHALM-----RYKNGG 93

Query: 73  VNSKEEAYLKILK-----MFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
             S E++ +  LK     +F D  +AP+ IH I   G   G   G WFGP  VA V+  L
Sbjct: 94  GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVVMGAL 153

Query: 128 AK---YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN 184
            +    +        V  D  ++ ++V+K+   +K              IP+ LG   I+
Sbjct: 154 MEDYLRNGGQGPDVLVLRDRQVMEDEVRKILLLSKHVLLL---------IPVMLGPHHIS 204

Query: 185 PVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGY 244
             Y   +K+C                        R E T    +G +GGK   A +F+GY
Sbjct: 205 EGYAKLLKRCL-----------------------RMEST----VGAVGGKEGSAFFFMGY 237

Query: 245 VGNDVIFLDPHTNQN 259
            G ++I LDPH  Q+
Sbjct: 238 QGGNLIVLDPHYAQS 252


>gi|425784144|gb|EKV21938.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
           digitatum Pd1]
          Length = 208

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 53/134 (39%), Positives = 69/134 (51%), Gaps = 26/134 (19%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR  F PI  +                       G T+D GWGCM+R GQ 
Sbjct: 71  DFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQS 130

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A A   L LGRDW+     KEE   K++ MF D   AP+SIH+    GA S GK  GE
Sbjct: 131 LLANAFSVLLLGRDWR--RGEKEEEESKLISMFADHPEAPFSIHKFVNRGAESCGKYPGE 188

Query: 114 WFGPNTVAQVLRKL 127
           WFGP+  A+ ++ +
Sbjct: 189 WFGPSATAKCIQSV 202


>gi|307190834|gb|EFN74684.1| Cysteine protease ATG4B [Camponotus floridanus]
          Length = 93

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 43/59 (72%), Positives = 51/59 (86%), Gaps = 2/59 (3%)

Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
          ++L+ IRRDI S LWFTYRKGF+PIG  +S  T+DKGWGCMLRCGQMV+AQAL+ LHLG
Sbjct: 34 KELDMIRRDIRSMLWFTYRKGFIPIGGCNSTFTSDKGWGCMLRCGQMVLAQALITLHLG 92


>gi|336368847|gb|EGN97189.1| cysteine protease required for autophagy [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 873

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 73/221 (33%), Positives = 112/221 (50%), Gaps = 46/221 (20%)

Query: 18  DITSRLWFTYRKGFVPI----------------------------------GDSGLTTDK 43
           D TSR+W TYR  F PI                                  G+ G T+D 
Sbjct: 301 DFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPVGGEKGWTSDA 360

Query: 44  GWGCMLRCGQMVIAQALLFLHLGR-DWQ---WNVNSKEEA-YLKILKMFEDRRT--APYS 96
           GWGCMLR GQ ++A ALL LHLGR DW+   + V++ + A Y++I+  F D  +  +P+S
Sbjct: 361 GWGCMLRTGQSLLANALLHLHLGRADWRRPPYPVHTTDYATYVQIITWFFDTPSPQSPFS 420

Query: 97  IHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
           +H++AL G   GK VG+WFGP+T A  ++ L      + +   VA D  +  + V     
Sbjct: 421 VHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHAFPEAGLGVSVASDGVIFQSDVYAASN 480

Query: 157 T---NKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIK 192
               + R  +   W  + ++++I +RLG+  +NP+Y + IK
Sbjct: 481 AYIGSPRRHAKVSWGGRAVIVLIGIRLGLDGVNPIYYDTIK 521


>gi|225554849|gb|EEH03143.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 425

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 53/131 (40%), Positives = 69/131 (52%), Gaps = 26/131 (19%)

Query: 18  DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
           D  S++W TYR  F  +P                     +   G TTD GWGCM+R GQ 
Sbjct: 127 DFESKIWLTYRSNFPLIPKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQS 186

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A AL  L LGRDW+     KEE+  K+L +F D   AP+SIH+    GAS  GK  GE
Sbjct: 187 LLANALAILSLGRDWRRGTKIKEES--KLLSLFADDPKAPFSIHRFVEHGASACGKYPGE 244

Query: 114 WFGPNTVAQVL 124
           WFGP+  A+ +
Sbjct: 245 WFGPSATARCI 255



 Score = 45.4 bits (106), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 37/69 (53%), Gaps = 3/69 (4%)

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STYHCPQASRLHI 288
           I G+P+ + YFIG  G+   +LDPH +     VY    D     +  +TYH  +  RLHI
Sbjct: 255 IHGRPSSSHYFIGAQGSHFFYLDPH-HTRPALVYRDAGDRPYTTEELNTYHTRRLRRLHI 313

Query: 289 LHMDPSIAV 297
             MDPS+ +
Sbjct: 314 KDMDPSMLI 322


>gi|223590151|sp|A5DEF7.2|ATG4_PICGU RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|190345638|gb|EDK37561.2| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 402

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 82/333 (24%), Positives = 131/333 (39%), Gaps = 103/333 (30%)

Query: 14  QIRRDITSRLWFTYRKGFVPI---------------------------------GDSGLT 40
           ++++ +  R W +YR GF PI                                  +   T
Sbjct: 78  EVQKQVKKRYWMSYRSGFEPIKKHEDGPSPLSFVQSMIFNKNVGNTFANIHSLVDNDNFT 137

Query: 41  TDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI 100
           TD GWGCM+R  Q V+A A+                   Y   +++F D  +A +S+H  
Sbjct: 138 TDVGWGCMIRTSQSVLANAI---------------DRAGYEVDVELFADTSSAAFSLHNF 182

Query: 101 ALTGASEGKAV--GEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN 158
               +     V  G+WFGP+  +  +++L +  + S+ V    L           +C + 
Sbjct: 183 VKVASDSPLRVRPGQWFGPSAASLSIKRLCEARNSSTNVPLSVL-----------VCESG 231

Query: 159 KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
                  Q  P++L++PLRLGI  +N VY + + +   +P                    
Sbjct: 232 DIYDDQIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEVP-------------------- 271

Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTY 278
                  QS G+ GGKP+ +LYF GY G  +++LDPH  QN+                +Y
Sbjct: 272 -------QSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNVSAGV-----------GSY 313

Query: 279 HCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
           H     +L I  MDPS    I + +   Y+D K
Sbjct: 314 HSSSYQKLDISDMDPSMMAGIVLKNNEDYTDLK 346


>gi|145481079|ref|XP_001426562.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124393637|emb|CAK59164.1| unnamed protein product [Paramecium tetraurelia]
          Length = 391

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 94/332 (28%), Positives = 146/332 (43%), Gaps = 91/332 (27%)

Query: 13  EQIRRDITSR-LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           ++I + I SR +WFTYRK F  I +S  T+D GWGCMLR GQM+ AQ +L +H+ +  Q 
Sbjct: 46  KEIIQQIYSRTIWFTYRKNFPQILNSQQTSDAGWGCMLRSGQMIWAQ-ILRVHIRQKKQ- 103

Query: 72  NVNSKEEAYLKIL------------KMFEDRRT---APYSIHQI-ALTGASEGKAVGEWF 115
             +SK+  Y K+L            KMF D      +PYSI +I A++         +W+
Sbjct: 104 --HSKDYQY-KLLCAFSDDDDDEHKKMFTDNFKLCLSPYSIQKIEAISQIKFSMKPCQWY 160

Query: 116 GPNTVAQVLRKLAK-------------------YDDWSSIVFHVALDNTLVVNQVK---- 152
            P+ +   L  L +                   YD   S ++ + +D   +VN++K    
Sbjct: 161 RPDQILNALSLLHQQKQLEGSEDLEITISDSLLYDRLYSEMYGLKMDCEHIVNEIKQDKN 220

Query: 153 ----KLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKI 208
               K+C   ++       + L +    R+G+ +IN  Y         LP   + D++ +
Sbjct: 221 KEISKICNICQKKDP----KALAIFFITRIGLDEINKEY---------LPF--LNDLIDL 265

Query: 209 LSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ---NIGCVYD 265
                           PQ  G+IGG+ + A Y +G V   +I+LDPH  Q   N G V  
Sbjct: 266 ----------------PQFQGIIGGRDDKAYYILGRVNKRLIYLDPHYIQEHINRGNVV- 308

Query: 266 KEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
                   L  T+ C     ++   M PSIA+
Sbjct: 309 -------MLKDTFFCKDVKYINEEQMSPSIAL 333


>gi|118390095|ref|XP_001028038.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89309808|gb|EAS07796.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 1216

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 131/301 (43%), Gaps = 80/301 (26%)

Query: 23  LWFTYRKGFVP-----IGD--SGLTTDKGWGCMLRCGQMVIAQA---------------L 60
           + FTYRK F P     I D     T+D GWGCM+R GQM+ AQ                L
Sbjct: 263 ILFTYRKNFYPLLKDKINDPQKNQTSDAGWGCMIRAGQMIFAQTIKRHLKKTDYIEQHQL 322

Query: 61  LFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAV-GEWFGPNT 119
           + + +G   +  V    + Y+   + +   R  PYSIHQI      + K   G+W+ PN 
Sbjct: 323 INIIIGFLEEEEVQEGGKGYIFNQQSYIQDRIRPYSIHQITNRAFCKYKIQPGQWYTPNQ 382

Query: 120 VAQVLRKL-----------AKYDDWSS---IVFHVALDNTLVVNQ--VKKLCTTNKRASS 163
           +A +L++L            K D  SS   I+F   L  TL+  Q  +   C    + S 
Sbjct: 383 IAIILKELHKKNKIKGTENLKIDVHSSDKPIIFEKIL-QTLLGRQGKINLNCNHENQQSR 441

Query: 164 NPQWQPL-----VLVIPLRLGIQDINPVYING---------IKKCYA--------LP--- 198
           N   Q        ++ P +  I++ +  Y             K C+         LP   
Sbjct: 442 NSINQDQDDSFEKIMPPNQQEIEEFSSQYEESKEDQTDNLCCKDCFKTDNKLFLLLPCRL 501

Query: 199 ----ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
               ISP++  ++IL             +  QS+G+IGGKPN A YF+G+VG+D+++LDP
Sbjct: 502 GLDEISPIH--IEILKKL---------LSLKQSVGMIGGKPNKAHYFLGFVGDDLLYLDP 550

Query: 255 H 255
           H
Sbjct: 551 H 551


>gi|255722127|ref|XP_002545998.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240136487|gb|EER36040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 444

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 146/332 (43%), Gaps = 100/332 (30%)

Query: 19  ITSRLWFTYRKGFVPIGDS----------------------------------GLTTDKG 44
           I SRLW +YR GF PI  +                                    T+D G
Sbjct: 82  IESRLWLSYRCGFDPIPKAEDGPQPIQFFPSIIFNKTTIYSNFANLKSLFDKENFTSDAG 141

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ-IALT 103
           WGCM+R  Q ++A  LL L          +SK++    ++ +F+D +++P+SIH  I + 
Sbjct: 142 WGCMIRTSQNLLANTLLQLLPP-------DSKQD----VIGLFQDNQSSPFSIHNFIKVA 190

Query: 104 GASEGKA-VGEWFGPNTVAQVLRKLAKYDDWSSI-------VFHVALDNTLVVNQVKKLC 155
           G S  +   G+WFGPN  +  +++L        I       VF ++ ++ L   ++ ++ 
Sbjct: 191 GESPLQVKPGQWFGPNAASLSIKRLTDTLQDKEIKGVKYPKVF-ISENSDLYDGEINEIL 249

Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
           +   R+        ++++ P+RLGI  +N  Y + I               ++L S +  
Sbjct: 250 SEEGRS--------VLVLFPIRLGIDKVNSYYYDSI--------------FQVLKSKF-- 285

Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
                      S G+ GGKP+ + YF+GY  +D+I+ DPH  Q +    + E        
Sbjct: 286 -----------SCGISGGKPSSSFYFLGYDNSDLIYFDPHLPQLVENPINIE-------- 326

Query: 276 STYHCPQASRLHILHMDPSIAV-VSQRSYSDY 306
            +YH    +RL+I  +DPS+ + +  RS  DY
Sbjct: 327 -SYHTRNYNRLNISLLDPSMMIGILLRSMDDY 357


>gi|410075557|ref|XP_003955361.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
 gi|372461943|emb|CCF56226.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
          Length = 463

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 126/314 (40%), Gaps = 79/314 (25%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           D+ SRL FTYR  F PI     G S L                         TD GWGCM
Sbjct: 64  DVNSRLAFTYRTKFQPILRSSEGPSPLNFRMIFRDNPINTLENVINNPDCFNTDIGWGCM 123

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
           +R GQ ++  AL    LGR ++ + N       +I+  F D    P+SIH+    G    
Sbjct: 124 IRTGQSLLGNALQLAKLGRHFRLD-NKMGIKDDEIISWFRDTTQEPFSIHKFVEKGNKLA 182

Query: 108 GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
            K  GEWFGP   +  ++ L +            +D  LV      +   + R       
Sbjct: 183 NKKPGEWFGPAATSISIQSLIEE------FPECGIDKCLVSVSSGDIFEDDVREIFEENM 236

Query: 168 QPLVL-VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
              +L ++ ++LG+  +N  Y                D++ IL S +             
Sbjct: 237 DSKILFLMGVKLGLDAVNSFYWE--------------DILNILDSKF------------- 269

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI---GCVYDKEQDSEKKLDSTYHCPQA 283
           S+G+ GG+P+ +LYF G+ GN++++ DPH  Q       VY+           T H    
Sbjct: 270 SVGIAGGRPSSSLYFFGHQGNELLYFDPHRPQPSLVDPSVYE-----------TCHTTNF 318

Query: 284 SRLHILHMDPSIAV 297
            +L I  MDPS+ +
Sbjct: 319 GKLDIKDMDPSMLI 332


>gi|146420060|ref|XP_001485988.1| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 402

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 86/345 (24%), Positives = 136/345 (39%), Gaps = 104/345 (30%)

Query: 2   RHANKLSHQDLEQIRRDITSRLWFTYRKGFVPI--------------------------- 34
           R  ++    DLE +++ +  R W +YR GF PI                           
Sbjct: 67  REGDRDREGDLE-VQKQVKKRYWMSYRLGFEPIKKHEDGPLPLSFVQSMIFNKNVGNTFA 125

Query: 35  ------GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFE 88
                  +   TTD GWGCM+R  Q V+A A+                   Y   +++F 
Sbjct: 126 NIHSLVDNDNFTTDVGWGCMIRTSQSVLANAI---------------DRAGYEVDVELFA 170

Query: 89  DRRTAPYSIHQIALTGASEGKAV--GEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTL 146
           D  +A +S+H      +     V  G+WFGP+  +  +++L +  + S+ V    L    
Sbjct: 171 DTSSAAFSLHNFVKVASDLPLRVRPGQWFGPSAASLSIKRLCEARNSSTNVPLSVL---- 226

Query: 147 VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMV 206
                  +C +        Q  P++L++PLRLGI  +N VY + + +   +P        
Sbjct: 227 -------VCESGDIYDDQIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEVP-------- 271

Query: 207 KILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK 266
                              QS G+ GGKP+ +LYF GY G  +++LDPH  QN+      
Sbjct: 272 -------------------QSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNVSAGV-- 310

Query: 267 EQDSEKKLDSTYHCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
                     +YH     +L I  MDPS    I + +   Y+D K
Sbjct: 311 ---------GSYHSSLYQKLDISDMDPSMMAGIVLKNNEDYTDLK 346


>gi|241729578|ref|XP_002404604.1| cysteine protease, putative [Ixodes scapularis]
 gi|215505492|gb|EEC14986.1| cysteine protease, putative [Ixodes scapularis]
          Length = 433

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 56/182 (30%), Positives = 86/182 (47%), Gaps = 41/182 (22%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           D +SRLWFTYR+ F  I  + + TD GWGCMLR  QM++AQA +   LGR W+W     E
Sbjct: 69  DFSSRLWFTYRREFPAIPGTDIRTDCGWGCMLRSSQMILAQAFVMHLLGRQWRWQQVHTE 128

Query: 78  EAYLKI----------------------------------LKMFEDRRTA--PYSIHQIA 101
              +++                                   + F D+  A  P+S+H + 
Sbjct: 129 AGEVRLPRHALWPLREGFRCTGGDGTAVLVRCSPKPVNDPPRWFGDKADASTPFSLHNLV 188

Query: 102 LTGASEGKAVGEWFGPNTVAQVLRKL---AKYDD--WSSIVFHVALDNTLVVNQVKKLCT 156
             G   GK  G+W+GP++VA +L+     A + D   + +  +VA D T+ ++ V  LC+
Sbjct: 189 QRGRESGKKAGDWYGPSSVAYILKDALEDAAHRDQRLAQLCIYVAQDCTIYMDDVTALCS 248

Query: 157 TN 158
             
Sbjct: 249 AG 250


>gi|19115683|ref|NP_594771.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe
           972h-]
 gi|62899818|sp|Q9P373.1|ATG4_SCHPO RecName: Full=Probable cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|9588465|emb|CAC00556.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe]
          Length = 320

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 73/285 (25%), Positives = 117/285 (41%), Gaps = 60/285 (21%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+   D  S +  TYR G    G   +T+D GWGCM+R  Q ++A  L            
Sbjct: 42  EKFLYDSFSLITITYRSGIE--GLENMTSDTGWGCMIRSTQTLLANCL-----------R 88

Query: 73  VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKA-VGEWFGPNTVAQVLRKLAKYD 131
           +   E+   +IL +F D  +AP+SIHQ    G +      G+WFGP T    + +L+  +
Sbjct: 89  ICYPEKQLKEILALFADEPSAPFSIHQFVTMGKTLCDINPGQWFGPTTSCSCVARLSDQN 148

Query: 132 DWSSIVFHVALD-NTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
               +  +VA + N +  +Q+ K+              P++L+IP RLGI  IN  Y + 
Sbjct: 149 PDVPLHVYVARNGNAIYRDQLSKVSF------------PVLLLIPTRLGIDSINESYYDQ 196

Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
           + +                            F     +G+ GG+P  A YF         
Sbjct: 197 LLQV---------------------------FEIRSFVGITGGRPRSAHYFYARQNQYFF 229

Query: 251 FLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
           +LDPH      C +     ++   + T+H     R+ I  +DP +
Sbjct: 230 YLDPH------CTHFAHTTTQPASEETFHSATLRRVAIQDLDPCM 268


>gi|116179672|ref|XP_001219685.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
 gi|88184761|gb|EAQ92229.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
          Length = 425

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/211 (30%), Positives = 98/211 (46%), Gaps = 34/211 (16%)

Query: 18  DITSRLWFTYRKGFVPI----------------------GD-SGLTTDKGWGCMLRCGQM 54
           D  SR+W TYR GF PI                      GD +G ++D GWGCM+R GQ 
Sbjct: 116 DFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRSGQS 175

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
           ++A ALL   LGRDW+   +   E    I+ +F D   APYS+      GA + GK  GE
Sbjct: 176 LLANALLISQLGRDWRRTTDPGAER--NIVALFADDARAPYSLQNFVKHGAIACGKHPGE 233

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
           WFGP+  A+ ++ LA   + S  ++     +   V +   L T      +   + P +++
Sbjct: 234 WFGPSATARCIQALADQHESSLRIYSTG--DLPDVYEDSFLATARPDGET---FHPTLIL 288

Query: 174 IPLRLGIQDINPV---YINGIKKCYALPISP 201
           +   +GI    P    Y  G+++ +   + P
Sbjct: 289 MEQSIGIAGGRPSSSHYFVGVQRQWLFYLDP 319



 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 41/73 (56%), Gaps = 2/73 (2%)

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQAS 284
           QS+G+ GG+P+ + YF+G     + +LDPH  +      +   + + ++LDS  H  +  
Sbjct: 291 QSIGIAGGRPSSSHYFVGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSC-HTRRLR 349

Query: 285 RLHILHMDPSIAV 297
            LH+  MDPS+ +
Sbjct: 350 YLHVEDMDPSMLI 362


>gi|413917967|gb|AFW57899.1| hypothetical protein ZEAMMB73_419246 [Zea mays]
          Length = 290

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 71/251 (28%), Positives = 113/251 (45%), Gaps = 49/251 (19%)

Query: 31  FVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK-EEAYLKILKMFED 89
           F  I   G  +  G GCM+R  QM++AQAL+F HLGR W+          Y+ +L++F D
Sbjct: 4   FWKISLPGYGSLTGKGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLRLFGD 63

Query: 90  RRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY-----------DDWSSIVF 138
                +SIH +     + G A G W GP  + +  + L +            +++   ++
Sbjct: 64  SEACAFSIHNLLQARRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALY 123

Query: 139 HVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
            V+ D          + ++   +LC+   +  S   W P++L++PL LG+  INP YI  
Sbjct: 124 VVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPST--WSPILLLVPLVLGLDKINPRYIPL 181

Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
           +K+                            F FPQSLG++GGKP  + Y  G   +  +
Sbjct: 182 LKE---------------------------TFMFPQSLGILGGKPGTSTYIAGVQDDRAL 214

Query: 251 FLDPHTNQNIG 261
           +LDPH  Q  G
Sbjct: 215 YLDPHEVQMFG 225


>gi|413941968|gb|AFW74617.1| hypothetical protein ZEAMMB73_836919 [Zea mays]
          Length = 416

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 73/253 (28%), Positives = 115/253 (45%), Gaps = 68/253 (26%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           L+    D +SR+W TYRKGF  I D  LT+D  WGCM+R  QM++AQAL+F HLGR W+ 
Sbjct: 29  LQVFLEDFSSRIWITYRKGFDAISDFKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWR- 87

Query: 72  NVNSKEEAYLKILKMFED----RRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
                E+  ++  +   D    +   P  ++   ++G  +G+  G               
Sbjct: 88  --KPPEKTLIRTNREQADAVDGKENFPMELY--VVSGDEDGERGG--------------- 128

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
                 + +V+         ++   +LC+   +  S   W P++L++PL LG+  INP Y
Sbjct: 129 ------APVVY---------IDVAAQLCSDFNKGPST--WSPILLLVPLVLGLDKINPRY 171

Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
           I  +K+                            F FPQSLG++G KP  + Y  G   +
Sbjct: 172 IPLLKE---------------------------TFMFPQSLGILGVKPGTSTYIAGVQDD 204

Query: 248 DVIFLDPHTNQNI 260
             ++LDPH  Q +
Sbjct: 205 RALYLDPHEVQMV 217


>gi|403216261|emb|CCK70758.1| hypothetical protein KNAG_0F00890 [Kazachstania naganishii CBS
           8797]
          Length = 448

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 76/316 (24%), Positives = 133/316 (42%), Gaps = 77/316 (24%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSG-----------------------------LTTDKG 44
           Q  RD+ +RL FTYR  FVPI  S                                TD G
Sbjct: 42  QFYRDVCTRLNFTYRTKFVPISRSPDGPSPISFQLMIRDGPLSVIENALLHPDCFNTDIG 101

Query: 45  WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
           WGCM+R GQ ++  AL  L  GR+++   ++ ++    I++ F+D   AP+S+H     G
Sbjct: 102 WGCMIRTGQSLLGNALQRLRHGREFRVTESTHDD---DIIQWFKDTPDAPFSLHNFVKKG 158

Query: 105 ASEGK-AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA-- 161
                   G+WFGP   ++ ++ L              +D+ +V      +   +     
Sbjct: 159 VELADMKPGQWFGPAATSRSIQSLI------CNFPQCGIDHCIVSVSSADIYKQDVEDMF 212

Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
            ++P    L+++  ++LG+  +N  Y   I+              ++L+S +        
Sbjct: 213 DADPDSN-LLILFGVKLGVSAVNASYWEDIR--------------RLLNSKF-------- 249

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
                S+G+ GG+P+ +LYF GY   ++++ DPHT Q            +    +T H  
Sbjct: 250 -----SVGIAGGRPSSSLYFFGYQNQELLYFDPHTPQ--------PSLIDDAAFNTCHSI 296

Query: 282 QASRLHILHMDPSIAV 297
           +  +L +  MDPS+ +
Sbjct: 297 EFGKLELRDMDPSMLI 312


>gi|256071263|ref|XP_002571960.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
 gi|353229491|emb|CCD75662.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
          Length = 302

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 73/267 (27%), Positives = 133/267 (49%), Gaps = 35/267 (13%)

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAY-LKILKMFEDRRT--APYSIHQIALTG 104
           M RCGQM++AQAL+   LGR+W+   N ++  + L+I+K F D  +  +P S+H+  L  
Sbjct: 1   MFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLSLHR--LVQ 58

Query: 105 ASEGKAVGEWFGPNTV-AQVLRKLAKYDDWSS----IVFHVALDNTLVVNQVKKLCTTNK 159
            S+ K  GEW GP+++ + +LR +AK     S    +  ++A D  +   ++  L    +
Sbjct: 59  MSDRKP-GEWCGPSSICSAILRVMAKGSSLDSRLSQVQVYLARDRVIYREEIIDLA---R 114

Query: 160 RASSNPQWQPLVLVIPLRLGIQDINPVY---INGIKKCYALPISPVYDMVKILSSTYNMQ 216
              ++ Q+QP       ++   D   +Y    +     ++   + +  ++ ++    N  
Sbjct: 115 GLHTSYQYQP-------KIYFTDHTALYRSQSDQTNDSHSFKPTAILLLIPLMFGKGNRI 167

Query: 217 TPRY------EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDS 270
            PRY       F+ P  +G+IGG+  H+ Y++G   N +I+LDPH  Q       +  +S
Sbjct: 168 NPRYIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPT-----QNLNS 222

Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
            K    ++HCP    +   +++PS AV
Sbjct: 223 PKFSVDSWHCPIPKTMSAANLNPSCAV 249


>gi|414869447|tpg|DAA48004.1| TPA: hypothetical protein ZEAMMB73_510335 [Zea mays]
 gi|414869466|tpg|DAA48023.1| TPA: hypothetical protein ZEAMMB73_786179 [Zea mays]
          Length = 472

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 69/237 (29%), Positives = 107/237 (45%), Gaps = 49/237 (20%)

Query: 31  FVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK-EEAYLKILKMFED 89
           F  I DS LT+D  WGCM+R  QM++AQAL+F HLGR  +          Y+ +L +F D
Sbjct: 34  FDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSCRKPPEKPYNPDYIGVLHLFGD 93

Query: 90  RRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY-----------DDWSSIVF 138
                +SIH +   G + G A G W GP  + +  + L              +++   ++
Sbjct: 94  SEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIHTNREQADAVDGKENFPMALY 153

Query: 139 HVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
            V+ D          + ++   +LC+   +  S   W P++L++PL LG+  INP YI  
Sbjct: 154 VVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPST--WSPILLLVPLVLGLDKINPRYIPL 211

Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
           +K+                            F FPQSL ++GGKP  + Y  G + N
Sbjct: 212 LKE---------------------------TFMFPQSLCILGGKPGTSTYIAGVLAN 241


>gi|159465677|ref|XP_001691049.1| autophagy protein [Chlamydomonas reinhardtii]
 gi|158279735|gb|EDP05495.1| autophagy protein [Chlamydomonas reinhardtii]
          Length = 484

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 53/135 (39%), Positives = 72/135 (53%), Gaps = 10/135 (7%)

Query: 1   MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQ-- 58
           +R  ++L H  LE +  D  SR+W TYRK F  +G S LT+D GWGC LR GQM++A+  
Sbjct: 33  LRKLSELMHA-LEAMLGDFRSRMWCTYRKDFPALGPSLLTSDVGWGCTLRSGQMLLAEVR 91

Query: 59  ------ALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVG 112
                 A++ + LGRDWQ   +   EA   ++    D   AP SIH+I   G   G   G
Sbjct: 92  HGWRAGAMMRVALGRDWQ-RCSDNLEAVRPVVAALLDCAEAPLSIHRICDAGGPAGIVPG 150

Query: 113 EWFGPNTVAQVLRKL 127
            W GP  + + L  L
Sbjct: 151 RWLGPWMLCKGLEAL 165



 Score = 46.2 bits (108), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 49/120 (40%), Gaps = 43/120 (35%)

Query: 179 GIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHA 238
           G+  INPVYI  +++                             ++PQS+G++GG+P+ +
Sbjct: 339 GMDKINPVYIPQLQQV---------------------------LSWPQSVGIVGGRPSAS 371

Query: 239 LYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           LY  G      I+LDPH  Q  +G               TY C     L    +DPS+A+
Sbjct: 372 LYVCGVQDASFIYLDPHEAQLALG---------------TYFCDVVRVLPSAQLDPSLAI 416


>gi|307201261|gb|EFN81130.1| Cysteine protease ATG4B [Harpegnathos saltator]
          Length = 98

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 43/59 (72%), Positives = 51/59 (86%), Gaps = 2/59 (3%)

Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
          ++L+ IRRDI S LWFTYRKGFVPIG  +S  T+DKGWGCMLRCGQMV+A+AL+ LHLG
Sbjct: 34 KELDIIRRDIRSILWFTYRKGFVPIGGCNSTFTSDKGWGCMLRCGQMVLARALITLHLG 92


>gi|298712912|emb|CBJ33424.1| Autophagy-related protein 4 [Ectocarpus siliculosus]
          Length = 546

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 48/123 (39%), Positives = 66/123 (53%), Gaps = 6/123 (4%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E+ R D+ S +W TYR GF  +   G T D GWGCMLR  QM++ QAL    LGR W+  
Sbjct: 50  EERRLDVFSTMWITYRSGFPKMEPYGYTDDSGWGCMLRSAQMLMTQALQRHTLGRSWRVP 109

Query: 73  VNSKEE----AYLKILKMFEDRRTAP--YSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
              +E      Y  ++++F D       +SIH +   G    K  GEW+GP T A VLR 
Sbjct: 110 RTLEERLRVPEYRTLVRLFADHPGEANLFSIHNMCQVGIRYDKLPGEWYGPTTAACVLRD 169

Query: 127 LAK 129
           +++
Sbjct: 170 ISE 172



 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 65/129 (50%), Gaps = 29/129 (22%)

Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
           +VL++PLRLG+ +++  YI  +                       ++T R     PQSLG
Sbjct: 380 VVLLVPLRLGLDELSTGYIPSL-----------------------LETLR----VPQSLG 412

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
            +GG+PNHA++FIG  GN +  LDPHT Q    +   E    ++   + HC  A  + + 
Sbjct: 413 FLGGRPNHAIFFIGAQGNTLTGLDPHTTQPAADM--GEGFPSERYVHSLHCQSAVSMDVH 470

Query: 290 HMDPSIAVV 298
            +DPS+A+ 
Sbjct: 471 RIDPSLALA 479


>gi|145526665|ref|XP_001449138.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416715|emb|CAK81741.1| unnamed protein product [Paramecium tetraurelia]
          Length = 406

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 82/327 (25%), Positives = 135/327 (41%), Gaps = 78/327 (23%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           + +I++ +   +W TYR+ F P+  S   +D GWGCMLR GQM +AQ +L  HL      
Sbjct: 56  INKIKQLVQDTIWITYRRNFPPLYQSNYISDTGWGCMLRVGQMAMAQ-MLKKHLKN---- 110

Query: 72  NVNSKEEAYLKILKMFEDRRT----------------------APYSIHQIALTGASE-G 108
           + + ++E Y  IL  F D  +                       P+SI +IA     E  
Sbjct: 111 HGDKRDEDYDNILLAFADNDSQECKEFIEFQNKKEKQKVHNFICPFSIQKIAYLAKKEFN 170

Query: 109 KAVGEWFGPNTVAQVLR--------------KLAKYDDWSSIVFHVALDNTL--VVNQVK 152
              GEW+ PN +  +L               KL+ ++D  S +F   L N +  +  +  
Sbjct: 171 LDPGEWYKPNYILFLLEELHNTIPIRASENLKLSVFND--SCLFLDQLMNRMFDIKFETD 228

Query: 153 KLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
           K        +       L + +  R+G+ + N  Y+                  K+L   
Sbjct: 229 KDLEEQLEKTQLKSKNSLAIFVLTRIGLDEPNQKYL------------------KVLDEL 270

Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEK 272
             M+ P ++       G++GG P  A Y +G + +  I+LDPH  Q      +K Q  E 
Sbjct: 271 --MELPYFQ-------GIVGGTPKRAFYILGRINDHYIYLDPHYVQE---AENKGQIIEN 318

Query: 273 KL--DSTYHCPQASRLHILHMDPSIAV 297
           K+   ++Y C     L+  H+D S+ +
Sbjct: 319 KMFNRTSYSCKYIHLLNQKHVDTSMGL 345


>gi|146161894|ref|XP_001008187.2| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|146146576|gb|EAR87942.2| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 516

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 85/355 (23%), Positives = 151/355 (42%), Gaps = 66/355 (18%)

Query: 1   MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGD------------SGLTTDKGWGCM 48
           M    +   ++ +++  +  + +W TYRK F  + +            S   +D GWGCM
Sbjct: 55  MNENKETYEKNYKEVLENFYNIIWITYRKNFPALLNMIDKANLKNQKMSEYISDTGWGCM 114

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRT----APYSIHQIALTG 104
           +R GQM  A+ L   HL  + +  V  KE+  + I    +D +     APYSI +I+   
Sbjct: 115 VRVGQMAFAEGLR-RHLVENKKLVVKKKEDLRVIIEGFLDDDQKCIDFAPYSIQKISKIA 173

Query: 105 ASEGKAV-GEWFGPNTVAQVL------RKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT 157
            S+   + GEW+ P  +  +L      RK  K  +   +    +    +  + ++++C  
Sbjct: 174 LSDFNLLPGEWYTPIRICYILGLLHNERKAIKGTEDLKVAVFSSSRPIVFQDFLERMCKV 233

Query: 158 NKRASSNPQWQPLVLVI---------------PLRLGIQDINPVYINGIKKCYALP-ISP 201
           + +   + Q  P    I                ++L  Q+ N   +   ++   L  + P
Sbjct: 234 DPQRGKHAQICPNQCRIIKQDQKSKVDHDHHKDIKLEKQNSNSEILVVSEETPKLRLVCP 293

Query: 202 V-----YDMVKILSSTYNMQTPRYEF--------TFPQSLGVIGGKPNHALYFIGYVGND 248
           +     Y M+  +     + TP+ E+         F  SLG+IGGKP  ALYF+G + ++
Sbjct: 294 IHHELQYSMIVYIVCLIGLDTPQPEYLELAKKMMDFKYSLGLIGGKPKKALYFVGRIEDE 353

Query: 249 VIFLDPHTNQNIGCVYDKEQDSEKKLDS-----TYHCPQASRLHILHMDPSIAVV 298
            I+LDPH        Y +E  +EK   S     TY C +       ++D S +++
Sbjct: 354 FIYLDPH--------YVQEFSNEKNFQSSSQLETYFCKKFQTYPSKNIDSSFSLM 400


>gi|145500634|ref|XP_001436300.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124403439|emb|CAK68903.1| unnamed protein product [Paramecium tetraurelia]
          Length = 406

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 140/321 (43%), Gaps = 66/321 (20%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL-----G 66
           + +I++ +   +W TYR+ + P+  S   +D GWGCMLR GQM +AQ +L  HL      
Sbjct: 56  INKIKQLVQETIWITYRRNYPPLYQSNYISDTGWGCMLRVGQMAMAQ-MLKKHLKNHGDK 114

Query: 67  RDWQW--------NVNSKEEAYLKILKMFEDRRTA-----PYSIHQIALTGASE-GKAVG 112
           RD  +        + +S+E       +  +D++ A     P+SI +IA     E     G
Sbjct: 115 RDEDYDNIILAFADNDSQENKEFIEFQNSKDKQKAHNFICPFSIQKIAYLAKKEFNLDPG 174

Query: 113 EWFGPNTV---AQVLRKLAKYDDWSSIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQ 168
           EW+ PN +    ++L          ++   V  D+ L ++Q+  ++         + + Q
Sbjct: 175 EWYRPNYILFLLELLHNTIPIRASENLKLSVFNDSCLFLDQLMNRMFEAKFETDKDLEEQ 234

Query: 169 ----------PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
                      L + +  R+G+ + N  Y+                  KIL     M+ P
Sbjct: 235 LEKTQLIGKNSLAIFVLTRIGLDEPNQKYL------------------KILDEI--MELP 274

Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL--DS 276
            ++       G++GG P  A Y +G + +  ++LDPH  Q      +K+Q +E K+   +
Sbjct: 275 YFQ-------GIVGGTPKRAFYILGKINDHYLYLDPHYVQE---AENKDQINENKMFNRT 324

Query: 277 TYHCPQASRLHILHMDPSIAV 297
           +Y C     L+  H+D S+ +
Sbjct: 325 SYSCKNIHLLNQKHVDTSMGL 345


>gi|367008068|ref|XP_003688763.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
 gi|357527073|emb|CCE66329.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
          Length = 356

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 68/265 (25%), Positives = 119/265 (44%), Gaps = 54/265 (20%)

Query: 39  LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
            T+D GWGCM+R GQ ++A AL   + G               +I+++F D    P+SIH
Sbjct: 84  FTSDIGWGCMIRTGQTLLANALQRTNKGTPCS-----------EIIELFVDETKNPFSIH 132

Query: 99  QIALTGASEGKA-VGEWFGPNTVAQVLRKLAKYDDWSSI---VFHVALDNTLVVNQVKKL 154
                G       VGEWF P+   Q++ KL + ++   I   +  ++  +    + + +L
Sbjct: 133 NFITVGKDLNLVKVGEWFSPSITIQIIEKLIENNNDHGIKKCIVSISSGDIYEQDVLDEL 192

Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
              +    +N + Q ++L+  ++LGI  IN      I+K Y        D+  I ++ Y 
Sbjct: 193 --DDSEPPANTKQQHILLLFGIKLGINTIN------IEK-YG------QDIKDITNNKY- 236

Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGY--VGNDVIFLDPHTNQNIGCVYDKEQDSEK 272
                       + G+ GG+P  +L+F GY    + +++ DPH   N     D       
Sbjct: 237 ------------TCGISGGQPKSSLFFFGYNNTHDRILYFDPHKPNNFTTDNDY------ 278

Query: 273 KLDSTYHCPQASRLHILHMDPSIAV 297
              STYH  + + L + ++DPS+ +
Sbjct: 279 ---STYHSTEFNELEMFNLDPSMII 300


>gi|402593880|gb|EJW87807.1| hypothetical protein WUBG_01286, partial [Wuchereria bancrofti]
          Length = 216

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 50/143 (34%), Positives = 72/143 (50%), Gaps = 38/143 (26%)

Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
           +W+PL+++IPLRLG+  IN  Y   I+  + LP                           
Sbjct: 28  EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELP--------------------------- 60

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI------GCVYDKEQD-----SEKKL 274
           Q +G+IGG+PNHALYF G V N++++LDPH  QN           D+  D     +++  
Sbjct: 61  QCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQNFVDLDETTTTRDERDDYVEIKNDEFK 120

Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
           DSTYHCP      I  +DPS+A+
Sbjct: 121 DSTYHCPFILSTKIDKVDPSLAL 143


>gi|281210274|gb|EFA84441.1| autophagy protein 4 [Polysphondylium pallidum PN500]
          Length = 734

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 92/183 (50%), Gaps = 25/183 (13%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           ++Q++++   D  + LWF+YRK F PI ++ +TTD GWGCM+R GQM++A+ALL      
Sbjct: 269 ANQEIDRFIADFKNILWFSYRKDFAPIENTNITTDIGWGCMVRTGQMLLARALLRHLYQN 328

Query: 68  DWQWNVNSKEEA--YLKILKMFED--RRTAPYSIHQIALTGASEGK-------------- 109
           +    V+    +  Y K++  F D   R   YSIHQI        K              
Sbjct: 329 ENIPEVDRTRPSSKYRKVMNWFCDLPTREHYYSIHQIVHKNKIIAKYHNSKLKDFDIETD 388

Query: 110 ------AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASS 163
                  V EWF P  ++ VL+ L K    S I  +V  D  +  + V+KLC T++R S 
Sbjct: 389 ENIDLLNVDEWFAPTKISVVLKHLLKSHGLSDITMYVPSDGVVYKDYVRKLC-TDERLSF 447

Query: 164 NPQ 166
           +P+
Sbjct: 448 DPE 450



 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 68/144 (47%), Gaps = 34/144 (23%)

Query: 155 CTTNKRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
           C+    +S  PQ W+ +++++P++LG+  +N VY   IK    LP               
Sbjct: 527 CSDFFSSSCIPQRWKSIIILVPIKLGLDKLNEVYFREIKSMLELP--------------- 571

Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKK 273
                       QS+G+IGGKP  + YF+GY    +I+LDPH       V+D    ++  
Sbjct: 572 ------------QSIGLIGGKPKQSFYFVGYQDEHIIYLDPHF------VHDTVSPNDIN 613

Query: 274 LDSTYHCPQASRLHILHMDPSIAV 297
              +YH     ++ I  +DPS+A+
Sbjct: 614 FSDSYHHCVPQKMLISQLDPSMAI 637


>gi|322701885|gb|EFY93633.1| cysteine protease atg4 [Metarhizium acridum CQMa 102]
          Length = 255

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 48/132 (36%), Positives = 64/132 (48%), Gaps = 26/132 (19%)

Query: 18  DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
           D  SR W TYR  F  +P                     +  SG T+D GWGCM+R GQ 
Sbjct: 126 DFASRFWMTYRSNFELIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSGWGCMIRSGQS 185

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A AL  L LGRDW+  +    E   ++L +F D   APYS+H     G     K  GE
Sbjct: 186 LLANALAVLDLGRDWRRGMLPDRER--RLLALFADDPRAPYSVHNFVRHGEKYCSKYPGE 243

Query: 114 WFGPNTVAQVLR 125
           WFGP+  A+ ++
Sbjct: 244 WFGPSATARCIQ 255


>gi|118349810|ref|XP_001008186.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89289953|gb|EAR87941.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 343

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 83/261 (31%), Positives = 122/261 (46%), Gaps = 35/261 (13%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD--- 68
           L QI+    + ++F+YR GF     + + +D GWGCMLR GQM+ A  LL  HL  +   
Sbjct: 16  LSQIKEAQHNLIYFSYRSGFSHQFQNHIFSDSGWGCMLRSGQMIFANGLL-RHLKENPQI 74

Query: 69  -WQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGK-AVGEWFGPNTVAQVLRK 126
             Q  + +  +  L I+K F + +  P+SI QIA     E K  +G W+ PN +A  L+K
Sbjct: 75  QNQLKIQNINDILLFIIKFFIENKDQPFSIQQIAAVALEEFKLEMGFWYSPNRIAYSLKK 134

Query: 127 LA-KYDDWS--SIVFHVAL-DNTLVVNQVKKLCTTNKRASSNPQ--WQPLVLVIPLRLGI 180
           L   +  +S  +IV  V   D  L  +Q     T  K  S+ P+   Q L+  I  ++ I
Sbjct: 135 LLNNFQTFSEMNIVSEVMYSDRPLYFSQCVTAMTGQKIDSTLPKQLLQILINNIEKQIKI 194

Query: 181 --QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHA 238
             Q+ N   IN  K+ Y + I   Y   K L     +      FT   S+G+IG      
Sbjct: 195 MKQNSNKYQIN--KQNYKILIGLDYPEEKYLDILIKL------FTHRLSIGMIG------ 240

Query: 239 LYFIGYVGND-VIFLDPHTNQ 258
                 + ND + +LDPH  Q
Sbjct: 241 ------LNNDKLTYLDPHIVQ 255


>gi|238594668|ref|XP_002393548.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
 gi|215461192|gb|EEB94478.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
          Length = 142

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 68/146 (46%), Gaps = 51/146 (34%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDS--------------------------GLTTDKGWGC 47
           +   D TS++W TYR  F PI D+                          G T+D GWGC
Sbjct: 16  EFTADFTSKVWLTYRSHFTPIRDTNLADLPLPSIFWKKWGWGLPGLGGERGWTSDSGWGC 75

Query: 48  MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPY------SIHQIA 101
           MLR GQ ++A AL+F+ LGR+W+                   R  AP       S+H++A
Sbjct: 76  MLRTGQSLLANALVFMWLGREWR-------------------RPPAPMPTESYASVHRMA 116

Query: 102 LTGASEGKAVGEWFGPNTVAQVLRKL 127
           L G   GK VG+WFGP+T A  ++ L
Sbjct: 117 LAGKELGKDVGQWFGPSTAAGAIKTL 142


>gi|403354729|gb|EJY76927.1| hypothetical protein OXYTRI_01553 [Oxytricha trifallax]
          Length = 564

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 85/330 (25%), Positives = 133/330 (40%), Gaps = 109/330 (33%)

Query: 38  GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDR----RTA 93
            LTTD  WGC +R  QM+IA AL        + + VNS       ILK+F+D       +
Sbjct: 213 NLTTDCNWGCTIRSAQMMIANAL----QQSTFMYPVNS-------ILKLFDDNIRECTES 261

Query: 94  PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKYDDWSS----------IVFHVAL 142
            +SI  IA+ G   G+  G+W+G +++  +L+ L   Y  +S           IVF   +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGFIVFETIM 321

Query: 143 ----------------DNTLVVNQVKKL---------------------CTT-------- 157
                            +++V+NQ  +                      C          
Sbjct: 322 KKGCQLVNEKQDQQLQKDSIVLNQKDQSEYDPQNRENYDDLTFSQMGLGCDRRINYDKLP 381

Query: 158 NKRASSNP----QWQPLVLVI-PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
           N     NP    +W+  VLVI  +RLG+Q I+P+Y   I K                   
Sbjct: 382 NMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKY------------------ 423

Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND------VIFLDPHTNQNIGCVYDK 266
             MQ P++       +G++GGKPN A YF G++ +       ++FLDPH  Q+     + 
Sbjct: 424 --MQMPQF-------VGLVGGKPNKAFYFFGHIIDQDTNKVKLMFLDPHKVQDYTYNVET 474

Query: 267 EQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
             D + K  + +H  +A  L I  +D  + 
Sbjct: 475 SYDLDVKEQAKFHTTEARLLKIKELDTCLG 504


>gi|213403524|ref|XP_002172534.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
 gi|212000581|gb|EEB06241.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
          Length = 314

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 79/300 (26%), Positives = 118/300 (39%), Gaps = 62/300 (20%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E   +D    L  TYRK     G    ++D GWGCM+R  Q ++A  L      R  Q +
Sbjct: 42  EAFVQDTYDLLSLTYRKCIA--GMECFSSDAGWGCMIRSMQTMLANCL------RRVQPS 93

Query: 73  VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAV-GEWFGPNTVAQVLRKLAKYD 131
           +        KIL  F D   A  S+HQ    G +      G WFGP TV+     L    
Sbjct: 94  LPVH-----KILHYFADEANAYLSLHQFVDAGHTLCNITPGNWFGPATVSHCAAHLCSTH 148

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
               +   V+ D  ++            +  + P   P +L+  LRLGI  I+  Y   +
Sbjct: 149 PQVGLNVCVSHDGAIMYR---------DQLRNTPY--PRLLLFTLRLGIDTIHTSYYEQL 197

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
             C+ L                         T PQ++G++GG+P  A YF         +
Sbjct: 198 --CHVL-------------------------TIPQAIGIVGGRPRAAHYFYACQSQWFFY 230

Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI----AVVSQRSYSDYK 307
           LDPHT Q     +D         +S++H     RL I  +DP +    A+ S+   +D++
Sbjct: 231 LDPHTTQTAH-TFDNPAP-----NSSFHVTTLRRLRINELDPCMVLGFAITSEECQTDFE 284


>gi|403370248|gb|EJY84987.1| hypothetical protein OXYTRI_17161 [Oxytricha trifallax]
          Length = 564

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 85/330 (25%), Positives = 133/330 (40%), Gaps = 109/330 (33%)

Query: 38  GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDR----RTA 93
            LTTD  WGC +R  QM+IA AL        + + VNS       ILK+F+D       +
Sbjct: 213 NLTTDCNWGCTIRSAQMMIANAL----QQSTFMYPVNS-------ILKLFDDNIRECTES 261

Query: 94  PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKYDDWSS----------IVFHVAL 142
            +SI  IA+ G   G+  G+W+G +++  +L+ L   Y  +S           IVF   +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGYIVFETIM 321

Query: 143 ----------------DNTLVVNQVKKL---------------------CTT-------- 157
                            +++V+NQ  +                      C          
Sbjct: 322 KKGCQLVNEKQDQQLQKDSIVLNQKDQSEYDPQNRENYDDLTFSQMGLGCDRRINYDKLP 381

Query: 158 NKRASSNP----QWQPLVLVI-PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
           N     NP    +W+  VLVI  +RLG+Q I+P+Y   I K                   
Sbjct: 382 NMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKY------------------ 423

Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN------DVIFLDPHTNQNIGCVYDK 266
             MQ P++       +G++GGKPN A YF G++ +       ++FLDPH  Q+     + 
Sbjct: 424 --MQMPQF-------VGLVGGKPNKAFYFFGHIIDLDTNKVKLMFLDPHKVQDYTYDVET 474

Query: 267 EQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
             D + K  + +H  +A  L I  +D  + 
Sbjct: 475 SYDLDVKEQAKFHTTEARLLKIKELDTCLG 504


>gi|123397031|ref|XP_001301012.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121882136|gb|EAX88082.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 297

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 65/249 (26%), Positives = 111/249 (44%), Gaps = 46/249 (18%)

Query: 9   HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH-LGR 67
             D E++++ + +   FTY KGF P+   G TTDK WGC +R GQ ++ Q +  L+ L  
Sbjct: 11  QSDTEKLKKVVDTIPRFTYHKGFSPLA-GGYTTDKNWGCCIRSGQGLLMQFVSKLYQLYG 69

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           D   N+      +    ++F D   AP+ IH I     + G   GEW  P+ +A V + L
Sbjct: 70  DKIKNIFPNGSKF----ELFFDHPQAPFGIHCICRELETFGVKAGEWVKPSMLAPVFKDL 125

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW-QPLVLVIPLRLGIQDINPV 186
             +        HV +         +  C + +       +  P++L+  L LG +D +  
Sbjct: 126 LSF-----FGIHVVI--------AENGCLSRESLREALSYGHPVLLLFTLMLGYKDFDLK 172

Query: 187 YINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG 246
           Y+  ++    L +S +Y                      QS+GV+GG+   A Y +G+  
Sbjct: 173 YLPFLR----LTLSLIY----------------------QSVGVVGGQQGKAYYLVGHQK 206

Query: 247 NDVIFLDPH 255
            ++++ DPH
Sbjct: 207 ENLLYFDPH 215


>gi|302915349|ref|XP_003051485.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256732424|gb|EEU45772.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 355

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 59/193 (30%), Positives = 87/193 (45%), Gaps = 13/193 (6%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSG---LTTDKGWGCMLRC--GQMVIAQALLFLHLGRD 68
           Q   D  SR+W TYR  F  I  S     T+       L+   G      +   + LGRD
Sbjct: 113 QFMADFESRIWMTYRSEFEAIPRSTNPQATSSLSLSMRLKSQLGDQSPFSSDSMIRLGRD 172

Query: 69  WQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKL 127
           W+   +  EE   +I+K+F D   APYS+H     GAS  GK  GEWFGP+  A+ ++ L
Sbjct: 173 WRRGQSPHEE--REIIKLFADHPNAPYSLHSFVRHGASACGKYPGEWFGPSATARCIQAL 230

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
           A   + S  V+       +  ++  K+      A     + P ++++  RLGI  I PVY
Sbjct: 231 ANSHESSLRVYSTGDGPDVYEDEFMKIAKPEGEA-----FHPTLILVGTRLGIDKITPVY 285

Query: 188 INGIKKCYALPIS 200
              +     +P S
Sbjct: 286 WEALIASLQMPQS 298


>gi|213514936|ref|NP_001135074.1| Cysteine protease ATG4A [Salmo salar]
 gi|209738482|gb|ACI70110.1| Cysteine protease ATG4A [Salmo salar]
          Length = 102

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/57 (61%), Positives = 45/57 (78%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           ++  D+ SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+   LGR W+
Sbjct: 44  ELLSDVHSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMILAQALVCSQLGRAWR 100


>gi|255711728|ref|XP_002552147.1| KLTH0B08272p [Lachancea thermotolerans]
 gi|238933525|emb|CAR21709.1| KLTH0B08272p [Lachancea thermotolerans CBS 6340]
          Length = 483

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 132/313 (42%), Gaps = 83/313 (26%)

Query: 18  DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
           ++ S L FTYR  F PI     G S +                         +D GWGCM
Sbjct: 94  EVHSLLHFTYRTKFEPIPKDPNGPSPMNFGTLFRDNPLNSFESAINHPDCFCSDIGWGCM 153

Query: 49  LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASE 107
           +R GQ ++  AL  L          +  EE   +++  FEDR +AP+S+H     G A  
Sbjct: 154 IRTGQALLGNALARLR---------SPPEEK--QLIGWFEDRSSAPFSLHNFVREGNALS 202

Query: 108 GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
            K  GEWFGP+  ++ ++ L              L++ ++        +T+         
Sbjct: 203 RKPPGEWFGPSATSRSIQSLVH------AFPQCGLNHCII--------STDSGDVYEEDV 248

Query: 168 QPLVLVIP-LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
            P++   P   + +     + +N +   Y        D+  IL S++             
Sbjct: 249 GPILEREPQATILLLLGVKLGLNNVNSRYWP------DVKHILGSSF------------- 289

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ--NIGCVYDKEQDSEKKLDSTYHCPQAS 284
           S+G+ GG+P+ +LYF GY G+ + +LDPHT+Q     C  D E     K +S  H  + +
Sbjct: 290 SVGIAGGRPSSSLYFFGYQGDYLFYLDPHTSQLDLASCATDNE-----KYESV-HSARFN 343

Query: 285 RLHILHMDPSIAV 297
           ++H   +DPS+ +
Sbjct: 344 KVHFSELDPSMLI 356


>gi|281208441|gb|EFA82617.1| hypothetical protein PPL_04309 [Polysphondylium pallidum PN500]
          Length = 646

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 46/142 (32%), Positives = 73/142 (51%), Gaps = 22/142 (15%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           + +   D ++++W +YR+GF  IGD+    D GWG   + GQ                  
Sbjct: 409 INEFLEDFSNKIWMSYRQGFPYIGDTMFENDCGWGYWKKSGQ------------------ 450

Query: 72  NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLA-K 129
             N   E    I++MF D+ TAP+SIH IAL G +  GK VGEWF P+ +   ++ L  K
Sbjct: 451 --NEYPELLYNIVRMFLDKPTAPFSIHNIALHGQNHLGKNVGEWFAPSNITHAIKSLVNK 508

Query: 130 YDDWSSIVFHVALDNTLVVNQV 151
           ++   +I   ++ D +L V+Q+
Sbjct: 509 FNLQCNISVVISEDGSLYVDQM 530


>gi|123407417|ref|XP_001303004.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121884346|gb|EAX90074.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 298

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 70/256 (27%), Positives = 117/256 (45%), Gaps = 53/256 (20%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQAL--LFLHLGRD 68
           D E+ R+ + +   FTY K F P+   G TTDK WGC +R  Q +I Q +  L+ HLG D
Sbjct: 13  DTEKQRKLLETIPRFTYHKNFAPL-QGGFTTDKNWGCCIRSAQGLIMQFITKLYKHLGDD 71

Query: 69  WQ--WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
            +  +  NSK E       +F D   +P+ +  I     S G   GEW  P+ +A V+++
Sbjct: 72  IRNIFPTNSKYE-------LFYDLPHSPFGLPHICAELQSYGVMPGEWVKPSLLAPVIKE 124

Query: 127 LAKYDDWSSIVF-HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINP 185
           +  +     ++  H  L   ++          N+  S N    P++L+  L LG ++   
Sbjct: 125 IMNFFRIPVVIAEHGCLSREVL----------NEALSHN---IPVLLLFTLMLGYENFEL 171

Query: 186 VYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYV 245
            Y+  +K    L +S +Y                      QS+GV+GG+   A + +G+ 
Sbjct: 172 KYLPFLK----LTLSLIY----------------------QSVGVVGGQQGKAYFIVGHQ 205

Query: 246 GNDVIFLDPH-TNQNI 260
              +++ DPH  N++I
Sbjct: 206 KEKLLYFDPHDVNESI 221


>gi|28395487|gb|AAO39081.1| autophagy protein 4 [Dictyostelium discoideum]
          Length = 745

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/182 (28%), Positives = 78/182 (42%), Gaps = 42/182 (23%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL-FLHLGRDWQWNVNSK 76
           D+ S +WF+YRK F PI ++ +TTD GWGCMLR GQM++A+AL+  L+   D    +  K
Sbjct: 233 DVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEIERK 292

Query: 77  E--EAYLKILKMFED--RRTAPYSIHQIALTGASEGK----------------------- 109
           +    Y ++L  F D   +   Y IHQI     +  K                       
Sbjct: 293 KPHSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQAMEKNNRKQQILREQVISLNRGGGGSS 352

Query: 110 --------------AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
                          V EW  P  ++ +LR+L K+     +  +V  D  +  + +  LC
Sbjct: 353 KGKKKKEKEEEINDNVEEWLAPTRISNILRQLIKFQHLEDLEMYVPTDGVIYKDYINNLC 412

Query: 156 TT 157
             
Sbjct: 413 NN 414



 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 63/139 (45%), Gaps = 37/139 (26%)

Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
           +S  P+W+ L+++IPL+LG   +N  YI                           +  + 
Sbjct: 497 SSIPPKWKSLIIMIPLKLGADKLNSTYI---------------------------EKLKL 529

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STY 278
               PQSLG IGGKP  + YFIG+  + VI+LDPH        + +E  +    D  +TY
Sbjct: 530 LLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLDPH--------FVQESVNPNSFDYSNTY 581

Query: 279 HCPQASRLHILHMDPSIAV 297
                 ++    +DPS+++
Sbjct: 582 SGCIPQKMPFTQLDPSLSI 600


>gi|66822477|ref|XP_644593.1| autophagy protein 4 [Dictyostelium discoideum AX4]
 gi|66822607|ref|XP_644658.1| autophagy protein 4 [Dictyostelium discoideum AX4]
 gi|74857708|sp|Q557H7.1|ATG4_DICDI RecName: Full=Cysteine protease atg4; AltName:
           Full=Autophagy-related protein 4
 gi|60472726|gb|EAL70676.1| autophagy protein 4 [Dictyostelium discoideum AX4]
 gi|60472781|gb|EAL70731.1| autophagy protein 4 [Dictyostelium discoideum AX4]
          Length = 745

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/183 (27%), Positives = 79/183 (43%), Gaps = 42/183 (22%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL-FLHLGRDWQWNVNSK 76
           D+ S +WF+YRK F PI ++ +TTD GWGCMLR GQM++A+AL+  L+   D    +  K
Sbjct: 233 DVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEIERK 292

Query: 77  E--EAYLKILKMFED--RRTAPYSIHQIALTGASEGK----------------------- 109
           +    Y ++L  F D   +   Y IHQI     +  K                       
Sbjct: 293 KPHSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQAMEKNNRKQQILREQVISLNRGGGGSS 352

Query: 110 --------------AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
                          V EW  P  ++ +LR+L K+     +  +V  D  +  + +  LC
Sbjct: 353 KGKKKKEKEEEINDNVEEWLAPTRISNILRQLIKFQHLEDLEMYVPTDGVIYKDYINNLC 412

Query: 156 TTN 158
             +
Sbjct: 413 NNS 415



 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 63/139 (45%), Gaps = 37/139 (26%)

Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
           +S  P+W+ L+++IPL+LG   +N  YI                           +  + 
Sbjct: 497 SSIPPKWKSLIIMIPLKLGADKLNSTYI---------------------------EKLKL 529

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STY 278
               PQSLG IGGKP  + YFIG+  + VI+LDPH        + +E  +    D  +TY
Sbjct: 530 LLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLDPH--------FVQESVNPNSFDYSNTY 581

Query: 279 HCPQASRLHILHMDPSIAV 297
                 ++    +DPS+++
Sbjct: 582 SGCIPQKMPFTQLDPSLSI 600


>gi|395750455|ref|XP_002828707.2| PREDICTED: cysteine protease ATG4D [Pongo abelii]
          Length = 296

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 62/222 (27%), Positives = 93/222 (41%), Gaps = 54/222 (24%)

Query: 76  KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSS 135
           +E  + +I+  F D   AP+ +H++   G S GK  G+W+GP+ VA +LRK         
Sbjct: 50  QERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGPSLVAHILRK--------- 100

Query: 136 IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCY 195
                  + T +V  V + CT                V+ +R                  
Sbjct: 101 -AVESCSEVTRLVVYVSQDCT----------------VLHMR------------------ 125

Query: 196 ALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
           +L I P  D    L S+   +  R E      LG++GGKP H+LYFIGY  + +++LDPH
Sbjct: 126 SLAIDPSKDRSTCLPSSLQ-ELLRCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPH 180

Query: 256 TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
             Q    V       E     ++HC    ++    MDPS  V
Sbjct: 181 YCQPTVDVSQANFPLE-----SFHCTSPRKMAFAKMDPSCTV 217


>gi|37991904|gb|AAR06350.1| putative autophagy, 3'-partial [Oryza sativa Japonica Group]
          Length = 207

 Score = 80.5 bits (197), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 38/74 (51%), Positives = 48/74 (64%), Gaps = 1/74 (1%)

Query: 17  RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
            D +SR+W TYRKGF  I DS  T+D  WGCM+R  QM++AQAL+F HLGR W+  +   
Sbjct: 131 EDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKP 190

Query: 76  KEEAYLKILKMFED 89
               Y+ IL MF D
Sbjct: 191 YSPEYIGILHMFGD 204


>gi|149246610|ref|XP_001527730.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|166990616|sp|A5DSB4.1|ATG4_LODEL RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|146447684|gb|EDK42072.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 523

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/332 (25%), Positives = 132/332 (39%), Gaps = 76/332 (22%)

Query: 23  LWFTYRKGFVPI--GDSG-------------------------------LTTDKGWGCML 49
           LW +YR GF PI   D G                                T+D GWGCM+
Sbjct: 130 LWLSYRCGFEPIPKSDDGPQPITFFPSIVFNRLTLVNLSNLRSLLDKDHFTSDAGWGCMI 189

Query: 50  RCGQ--MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI--ALTGA 105
           R  Q  +  A   LF   G   Q    +K EA   ++++F+D  +AP+S+H    A    
Sbjct: 190 RTSQNLLANALLRLFHTTGGQPQNFAVTKTEA--DVIELFQDTLSAPFSLHNFIKAANSL 247

Query: 106 SEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP 165
           S     G+WFGP+  +  ++KL   +D++ I      +     +   K+ T N +  S  
Sbjct: 248 SLNIKPGQWFGPSAASLSIKKLV--NDYNLIQQERRSERDSGRDSGHKVPTPNLKLHSKS 305

Query: 166 QWQPLVLVIPLRLGIQDINPVYI--------NGIKKCYALPISPVYDMVKI--------- 208
                            I  VY+        + I   + L   P+  +  I         
Sbjct: 306 ADSDSDSDSDAISKRNSIPYVYVSENCDLYDDEINAIFELEQRPILFLFPIRLGIEQVNK 365

Query: 209 --LSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG-NDVIFLDPHTNQNIGCVYD 265
              SS   +   ++      S+G+ GGKP+ + YFIGY G +D+I+ DPH  Q +    +
Sbjct: 366 YYYSSILQILASKF------SVGIAGGKPSSSFYFIGYEGEDDLIYFDPHLPQIVQTPVN 419

Query: 266 KEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            E         +YH  + S+L I  +DPS+ +
Sbjct: 420 LE---------SYHTSEYSKLKIDQLDPSMMI 442


>gi|145521674|ref|XP_001446691.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414171|emb|CAK79294.1| unnamed protein product [Paramecium tetraurelia]
          Length = 473

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 80/303 (26%), Positives = 129/303 (42%), Gaps = 60/303 (19%)

Query: 13  EQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           E I   +   + FTYR+GF      DS LTTD GWGC++R GQM++A+ LL  HL   ++
Sbjct: 42  EDILDVVVHTIRFTYRQGFQAYQCQDSALTTDSGWGCVIRVGQMMMAE-LLKRHLKCFYK 100

Query: 71  WNVNSKEEAYLKILKMFEDRRTAP------------YSIHQIALTGASE-GKAVGEWFGP 117
            ++ S       +L+MF+D                 +SI +I      E GK  GEW+ P
Sbjct: 101 VDLFSFPPLLQDVLQMFKDDDDMESQKGFSKPSKYGFSIQKIMRVAYKEWGKKPGEWYSP 160

Query: 118 NTVAQVLRKLAK-------YDDWSSIVFHVALDNTLVVNQVKKL--CTTNKRASSNPQW- 167
           N + Q + K+ +       Y       +   +D   +  ++  +  C   K+  S  Q+ 
Sbjct: 161 NQIVQAIYKILQEINIPYCYGLGFVPFYESQIDLRAIFQEMCMMEDCVCQKKVFSIEQFL 220

Query: 168 ----------QPLVLVIPLRLGIQDI--------NPVYINGI------KKCYALPISPVY 203
                     + +V V+     I D+        N   I  +      +KC+ +P+  V 
Sbjct: 221 KSLEKLEIGKEEMVQVMHGNDSISDVCCEDQSEQNKKEIGNLLKKYICQKCF-VPVRAV- 278

Query: 204 DMVKILSST-YNMQTPRYEFTFPQSL------GVIGGKPNHALYFIGYVGNDVIFLDPHT 256
             V +LS    +   P Y     Q +      G++GG+P  A + +G+V N  + LDPH 
Sbjct: 279 -AVCLLSRIGCDEPNPDYLQAIRQFMKKKYFAGMLGGRPKEANFIVGFVDNKFVVLDPHL 337

Query: 257 NQN 259
            Q 
Sbjct: 338 VQE 340


>gi|154419947|ref|XP_001582989.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121917228|gb|EAY22003.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 284

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 65/262 (24%), Positives = 107/262 (40%), Gaps = 46/262 (17%)

Query: 14  QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           +I+  I     F YR     + +S LTTDKGWGC  R  Q ++ Q +L LH  R ++   
Sbjct: 12  EIKDVIADIPRFCYRYNLSDLANSLLTTDKGWGCCFRSTQGLLCQYILKLH--RKFRSLY 69

Query: 74  NSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
           +      +  L +F D  +AP+ I  +     + G  VGEW  P+ +A  ++ +    + 
Sbjct: 70  DQVFGQNVNPLDLFLDIPSAPFGIQNLTKNAFAIGLPVGEWAKPSIMAATIKLIFDTLNL 129

Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
           S I   ++ D TL  N +K                P +++IP   G+  ++  Y++ +  
Sbjct: 130 SCI---ISQDLTLDSNDIKH------------TKYPALILIPSLFGLSKMDDSYLSFLLL 174

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
           C  +                             SLG + G+   A YF+G+   D  + D
Sbjct: 175 CLCI---------------------------ESSLGFVSGQNASAYYFVGFDLEDFYYFD 207

Query: 254 PHTNQN--IGCVYDKEQDSEKK 273
           PH  +   +   YD   D E K
Sbjct: 208 PHVTKEAVVSPPYDSFFDLELK 229


>gi|330840629|ref|XP_003292315.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
 gi|325077457|gb|EGC31168.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
          Length = 465

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 83/350 (23%), Positives = 127/350 (36%), Gaps = 136/350 (38%)

Query: 48  MLRCGQMVIAQALL------FLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAP--YSIHQ 99
           MLR GQM++A+ALL       + +    +  +NSK   Y ++LK F D ++    Y IHQ
Sbjct: 1   MLRTGQMILARALLKHVYPDNVIINHQERIRINSK---YNQVLKWFSDYQSKEHLYGIHQ 57

Query: 100 IA-LTGASEGKA------------------------------------------------ 110
           I  +  A E K                                                 
Sbjct: 58  IVHMKKAMEKKIRQKALENYARRKQQLQQQQQQRYGKNSVRVRIDNYSDSSSDSEDEWDN 117

Query: 111 VGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC--------------- 155
           V EW  P  ++ VLR+L K  +   +  +V  D  +    + +LC               
Sbjct: 118 VEEWLAPTKISNVLRQLVKNQNLDDLEMYVPNDGVIYREYINQLCNPYYFNNYKNNDQNN 177

Query: 156 ----TTNKRASS-----------------------NP-QWQPLVLVIPLRLGIQDINPVY 187
               + N+   S                       NP +W+ L+++IPL+LG+  IN  Y
Sbjct: 178 QNNLSMNQSPPSRVPSEVFNHPLSVNDDDQDYYHFNPNKWKSLIIMIPLKLGVDRINTSY 237

Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
           I  +K   ++                           PQSLG IGGKP  + YFIG+  +
Sbjct: 238 IRKLKSILSI---------------------------PQSLGFIGGKPKQSFYFIGFQDD 270

Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            VI+LDPH       V D    S      T+      ++   ++DPS++V
Sbjct: 271 QVIYLDPH------FVQDTVDPSSNNYSETFCGCIPQKMSFSNIDPSLSV 314


>gi|167394648|ref|XP_001741038.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165894548|gb|EDR22516.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 200

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 48/177 (27%), Positives = 91/177 (51%), Gaps = 16/177 (9%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           + +D+++  R     +W TYRK    I +   TTD GWGCM+R  QMV+AQ  L + LG 
Sbjct: 27  NKKDIDEFARHT---IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGN 81

Query: 68  DWQWN---VNSKEEAY--LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
           +W++    +N++   +    I+ +F D   + +SIH++    ++ G   G+W+GP+  + 
Sbjct: 82  NWKYENNCMNTERNIFHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASD 141

Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLG 179
           +  +            +VA   ++V  ++++L      +     + P ++ +PLRLG
Sbjct: 142 IAAEHINEMRVFRTRGYVAKLGSIVGPKIEEL------SKDEVGFNPCIIFVPLRLG 192


>gi|123479730|ref|XP_001323022.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121905878|gb|EAY10799.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 284

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 66/251 (26%), Positives = 105/251 (41%), Gaps = 47/251 (18%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
           ++ D  QI  +I     F YR  F  I +S L+ D GWGC  R  Q ++ Q +L LH  +
Sbjct: 9   TNVDANQILAEIPR---FCYRNNFQAIENSTLSCDSGWGCCFRSSQGLVCQYILRLH--K 63

Query: 68  DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
           ++    NS        L +F D   AP+ I  I     S G  +G W  P+ +A   + +
Sbjct: 64  NFPDLYNSTFGIDKNPLDLFLDIPEAPFGIQNIVTHANSLGLPIGNWAKPSIIASAYKSI 123

Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
            +    + I   V  D+T +  +++   +TN          P++++IP   G++ I   Y
Sbjct: 124 FQSLHLNCI---VPQDSTFIYEELE---STN---------YPVLILIPGLFGLEKIEKPY 168

Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
           I+ I                 LS   N            SLG + G  + A YFIG+  +
Sbjct: 169 ISFI----------------FLSLCMN-----------SSLGFVSGHNDSAFYFIGFDSD 201

Query: 248 DVIFLDPHTNQ 258
              + DPH  +
Sbjct: 202 YFYYFDPHVTK 212


>gi|149422017|ref|XP_001518728.1| PREDICTED: cysteine protease ATG4D-like [Ornithorhynchus anatinus]
          Length = 286

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 89/186 (47%), Gaps = 39/186 (20%)

Query: 112 GEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLV 171
           GEW  P   A     +    + S +V +VA D T+    + +L    +      +W+ ++
Sbjct: 64  GEWTRPPGKA-----VEGSSEVSGMVVYVAQDCTVYKADMARL--AGQPGDPEAEWKSII 116

Query: 172 LVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVI 231
           +++P+RLG + +NP Y+  IK+   L + P                          LG+I
Sbjct: 117 ILVPVRLGGETLNPAYMPCIKE--LLRMEPC-------------------------LGII 149

Query: 232 GGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
           GGKP H+LYFIGY  + +++LDPH  Q   CV D  +DS   L+S +HC    +L    M
Sbjct: 150 GGKPKHSLYFIGYQDDFLLYLDPHYCQP--CV-DTMKDS-FPLES-FHCTAPRKLPFAKM 204

Query: 292 DPSIAV 297
           DPS  V
Sbjct: 205 DPSCTV 210


>gi|291238482|ref|XP_002739158.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
           kowalevskii]
          Length = 338

 Score = 77.0 bits (188), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 71/137 (51%), Gaps = 32/137 (23%)

Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
           + S+  W  ++++IP+RLG +++NPVYI+ IK                            
Sbjct: 156 SRSSQLWCSVIILIPVRLGGEELNPVYISCIKSL-------------------------- 189

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
            FT    +G+IGGKP H+LYFIG+  + +I LDPH  Q++  V  + +D   +   ++HC
Sbjct: 190 -FTLKHCIGIIGGKPKHSLYFIGFQEDKLIHLDPHLCQDV--VDMRSRDFPLQ---SFHC 243

Query: 281 PQASRLHILHMDPSIAV 297
               ++ ++ MDPS  +
Sbjct: 244 MSPRKMSLMKMDPSCTI 260



 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/62 (53%), Positives = 45/62 (72%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           +E+ +RD TSRLW TYR+ F  +  + LTTD GWGCMLR GQM++AQ+ L   LGR ++ 
Sbjct: 81  MERFKRDFTSRLWLTYRREFQQLAGTSLTTDCGWGCMLRSGQMMLAQSFLTHFLGRVYKQ 140

Query: 72  NV 73
           +V
Sbjct: 141 DV 142


>gi|324519641|gb|ADY47439.1| Cysteine protease ATG4C, partial [Ascaris suum]
          Length = 282

 Score = 77.0 bits (188), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 42/116 (36%), Positives = 62/116 (53%), Gaps = 8/116 (6%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           D  SR+W TYR    P+  S  TTD GWGC LR  QM++AQAL+ LHLGR+W++  + + 
Sbjct: 142 DYYSRIWLTYRTELSPLPGSSKTTDCGWGCTLRTCQMMLAQALVVLHLGREWRFWGDEEA 201

Query: 78  EAY------LKILKMFEDRRTAPYSIHQIALTGA--SEGKAVGEWFGPNTVAQVLR 125
             Y        I+ +F D   A   ++++       +E  AVG W+   T   ++R
Sbjct: 202 NRYRCGFGHYDIVSLFGDHLDADLGLYRLMKIAKERNEHDAVGNWYSACTAFGLIR 257


>gi|260823874|ref|XP_002606893.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
 gi|229292238|gb|EEN62903.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
          Length = 384

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 71/155 (45%), Gaps = 40/155 (25%)

Query: 151 VKKLCTTNKRASSNP--------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
           V  LCT  +R SSN          W  ++++IP+RLG + +NP+Y   IK          
Sbjct: 180 VVSLCTKRRRLSSNAADRDGSTENWCSVIILIPVRLGGESLNPIYEPCIKGL-------- 231

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
                              FT    LGVIGG+P H+LYF+G+  + +I LDPH  Q +  
Sbjct: 232 -------------------FTMDHCLGVIGGRPKHSLYFVGFQEDKLIHLDPHFCQEVVD 272

Query: 263 VYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
           +  ++   E     ++HC    ++ I  MDPS  +
Sbjct: 273 MTPRDFPLE-----SFHCMNPRKMSIARMDPSCTI 302



 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 12  LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
           +E  +RD  S++W TYR+ F  +  S  TTD GWGCMLR GQM++A  L+   LGR ++ 
Sbjct: 119 MELFKRDFASKVWLTYRREFPQLAGSMFTTDCGWGCMLRSGQMLLAGGLVMHFLGRVYKQ 178

Query: 72  NVNS 75
           +V S
Sbjct: 179 DVVS 182


>gi|216963257|gb|ACJ73915.1| autophagy-related 4b variant 3 [Zea mays]
          Length = 178

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 32/53 (60%), Positives = 40/53 (75%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D +SR+W TYRKGF  I  S LT+D  WGCM+R  QM++AQAL+F HLGR W+
Sbjct: 118 DFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWR 170


>gi|440291586|gb|ELP84849.1| hypothetical protein EIN_284050 [Entamoeba invadens IP1]
          Length = 352

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 65/258 (25%), Positives = 112/258 (43%), Gaps = 60/258 (23%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
           +I +  +F YR  F P+ ++ LT+D GWGC +R  QM++A A     +G+ +  + ++ E
Sbjct: 83  EIINLFYFVYRNNFQPLPNTTLTSDSGWGCTIRSTQMLVANA-----IGKLFTNDFDTGE 137

Query: 78  EAYLKILKMFEDRRTA--PYSIHQIALTGAS-EGKAVGEWF-GPNTVAQVLRKLAKYDDW 133
                ++K F D  +   P+SIH + LT A  +G   G  F  P+ VA    ++ K    
Sbjct: 138 VTDKMVIKFFLDFFSVECPFSIHNLFLTKAILQGNINGNSFLPPSAVAAAFVEINK--KL 195

Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
           ++  F + +  T    +V                QP +++IP+ +      P   N    
Sbjct: 196 ANPKFGMEILTTTFTFRVYT--------------QPTIVLIPISI------PDSFN---- 231

Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
                     D + ++ S Y               G++GG    A YF G   + ++FLD
Sbjct: 232 ----------DKIAVIFSFYLFS------------GMVGGSGRKAFYFFGIHHDQLLFLD 269

Query: 254 PHTNQNI---GCVYDKEQ 268
           PHT +N     C +D ++
Sbjct: 270 PHTVRNTVINSCSFDPQE 287


>gi|151556001|gb|AAI49850.1| ATG4D protein [Bos taurus]
          Length = 359

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 32/63 (50%), Positives = 44/63 (69%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L R ++
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRVYK 166

Query: 71  WNV 73
            +V
Sbjct: 167 ADV 169



 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 65/138 (47%), Gaps = 32/138 (23%)

Query: 160 RASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPR 219
           R     +W+ +V+++P+RLG + +NPVY+  +K+                         R
Sbjct: 175 RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL-----------------------R 211

Query: 220 YEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYH 279
            E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E     ++H
Sbjct: 212 SELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE-----SFH 262

Query: 280 CPQASRLHILHMDPSIAV 297
           C    ++    MDPS  V
Sbjct: 263 CTSPRKMAFAKMDPSCTV 280


>gi|46136685|ref|XP_390034.1| hypothetical protein FG09858.1 [Gibberella zeae PH-1]
          Length = 360

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 58/206 (28%), Positives = 92/206 (44%), Gaps = 14/206 (6%)

Query: 18  DITSRLWFTYRKGFVPIGDSG---LTTDKGWGCMLRC--GQMVIAQALLFLHLGR-DWQW 71
           D  S++W TYR  F PI  S     T+       L+   G      +   + LGR DW+ 
Sbjct: 120 DFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQLGDQSPFSSDTMVRLGRGDWRR 179

Query: 72  NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAKY 130
             + +EE   ++LK F D   APYSIH     GAS  GK  GEWFGP+  A+ ++ L   
Sbjct: 180 GESVEEEC--RLLKDFADDPRAPYSIHSFVRHGASACGKYPGEWFGPSATARCIQALTNS 237

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
            + S  V+       +  ++  ++            + P ++++  RLGI  I PVY   
Sbjct: 238 HESSIRVYSTGDGPDVYEDEFMQIAKPPGE-----DFHPTLVLVGTRLGIDKITPVYWEA 292

Query: 191 IKKCYALPISPVYDMVKILSSTYNMQ 216
           +     +P S   D  ++  +  ++Q
Sbjct: 293 LIAALQMPQSNEVDWQELKRNVKHVQ 318


>gi|145500036|ref|XP_001436002.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124403139|emb|CAK68605.1| unnamed protein product [Paramecium tetraurelia]
          Length = 469

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 81/302 (26%), Positives = 132/302 (43%), Gaps = 60/302 (19%)

Query: 13  EQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           E I   +   + FTYR+GF      +S LTTD GWGC++R GQM++A+ LL  HL   + 
Sbjct: 42  EDILDVVIHTIRFTYREGFQAYQCQNSTLTTDSGWGCVIRVGQMMMAE-LLKRHLKCFYN 100

Query: 71  WNVNSKEEAYLKILKMFEDRRTAP------------YSIHQIALTGASE-GKAVGEWFGP 117
            N+        ++L++F+D                 +SI +I      E GK  GEW+ P
Sbjct: 101 VNLFQFPPLMQEVLQLFKDDDEMESLKVQGKPSKYGFSIQKIMRIAYEEWGKKPGEWYSP 160

Query: 118 NTVAQVLRKLAKYDD------WSSIVFHVALDNTLVVNQ---VKKLCTTNKRASSNPQW- 167
           N + Q + K+   ++       S + F+ +  +  V+ Q   V + C   +R     ++ 
Sbjct: 161 NQIVQAIYKILSDNNIIYSCGLSLLPFYESQIDLKVILQEMCVMENCICEQRVFFIEKFL 220

Query: 168 QPLVL-------VIPLRLGIQDINPVYINGI-----------------KKCYALPISPVY 203
           Q LV        VI +  G   I+ VY   +                 +KC+ +PI  V 
Sbjct: 221 QDLVRLEINKEEVIQVIHGNDSISDVYYEDLSQQNKQEIGMLLKKYVCQKCF-VPIRAV- 278

Query: 204 DMVKILSST-YNMQTPRYEFTFPQSL------GVIGGKPNHALYFIGYVGNDVIFLDPHT 256
             + +LS    +   P Y     Q +      G++GG+P  A + +G+V +  + LDPH 
Sbjct: 279 -AICLLSRIGCDEPNPDYIQAIRQFMKKKYFAGLLGGRPREANFIVGFVDDKFVVLDPHL 337

Query: 257 NQ 258
            Q
Sbjct: 338 VQ 339


>gi|216963264|gb|ACJ73916.1| autophagy-related 4b variant 4 [Zea mays]
          Length = 208

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 32/53 (60%), Positives = 40/53 (75%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D +SR+W TYRKGF  I  S LT+D  WGCM+R  QM++AQAL+F HLGR W+
Sbjct: 148 DFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWR 200


>gi|440789707|gb|ELR11008.1| cysteine protease atg4a, putative, partial [Acanthamoeba
           castellanii str. Neff]
          Length = 180

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 47/145 (32%), Positives = 71/145 (48%), Gaps = 37/145 (25%)

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           W P+++++P+RLGIQ +NP+YI  +K                             F+FPQ
Sbjct: 11  WHPVIILVPVRLGIQCLNPIYIPTLKAF---------------------------FSFPQ 43

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS-TYHCPQASR 285
            LGVIGGKP+ + YF+GY  N V+++DPH  Q        + D    ++S     PQA  
Sbjct: 44  CLGVIGGKPHSSFYFVGYQDNKVLYMDPHFVQP---TVKMDDDPLFPIESYRMEIPQAMS 100

Query: 286 LHILHMDPSIAV----VSQRSYSDY 306
                +DPS+A+     SQ  + D+
Sbjct: 101 FD--DIDPSLALGFLCSSQAEFDDF 123


>gi|216963276|gb|ACJ73918.1| autophagy-related 4b variant 6 [Zea mays]
          Length = 271

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 32/53 (60%), Positives = 40/53 (75%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D +SR+W TYRKGF  I  S LT+D  WGCM+R  QM++AQAL+F HLGR W+
Sbjct: 148 DFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWR 200


>gi|403345460|gb|EJY72096.1| Cysteine protease family C54 putative [Oxytricha trifallax]
          Length = 823

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 72/139 (51%), Gaps = 30/139 (21%)

Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
           ++++IP RLG+  +N  Y + IK                           Y F    ++G
Sbjct: 611 ILVIIPTRLGLNKVNKEYYSSIK---------------------------YVFQCRLNVG 643

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
           ++GG+PN ALYF+G    D+I LDPH  Q+   V ++E+ S  +L+ TYHC QA +L + 
Sbjct: 644 IMGGRPNQALYFVGTQKTDLICLDPHLVQD--TVLNQEELSNVELNQTYHCDQAKKLSMT 701

Query: 290 HMDPSIAV-VSQRSYSDYK 307
            +D S+A     + Y+D++
Sbjct: 702 KLDTSLAFGFYLKDYNDFE 720



 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/130 (33%), Positives = 69/130 (53%), Gaps = 8/130 (6%)

Query: 40  TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ-WNVNSKEEA---YLKILKMFEDR---RT 92
           TTD GWGC +R GQM+I QAL+   +G D    N++S E+    Y KI+++  D    +T
Sbjct: 394 TTDVGWGCTIRVGQMMICQALMRHLIGLDHSVKNLSSTEQKRLNYAKIIQLIHDNDCSQT 453

Query: 93  APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQV 151
             +SI  IA  G    K  GEW+GP+ +  +LR L + Y    +    +  D  +  +++
Sbjct: 454 GAFSIQNIAKMGFCHDKLPGEWYGPHALTIMLRDLNRIYQPVENFQVCMFRDGNVYYDKI 513

Query: 152 KKLCTTNKRA 161
            K   T+ +A
Sbjct: 514 MKTAITDGKA 523


>gi|216963270|gb|ACJ73917.1| autophagy-related 4b variant 5 [Zea mays]
          Length = 292

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 32/53 (60%), Positives = 40/53 (75%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D +SR+W TYRKGF  I  S LT+D  WGCM+R  QM++AQAL+F HLGR W+
Sbjct: 148 DFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWR 200


>gi|119604523|gb|EAW84117.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_b
           [Homo sapiens]
          Length = 228

 Score = 73.9 bits (180), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 33/62 (53%), Positives = 44/62 (70%), Gaps = 2/62 (3%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL--FLHLGRD 68
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL  FL  G+ 
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGKP 167

Query: 69  WQ 70
           W+
Sbjct: 168 WR 169


>gi|119604525|gb|EAW84119.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_d
           [Homo sapiens]
          Length = 360

 Score = 73.9 bits (180), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 32/63 (50%), Positives = 44/63 (69%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L R ++
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRVYK 167

Query: 71  WNV 73
            +V
Sbjct: 168 ADV 170



 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 65/138 (47%), Gaps = 32/138 (23%)

Query: 160 RASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPR 219
           R     +W+ +V+++P+RLG + +NPVY+  +K+                         R
Sbjct: 176 RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL-----------------------R 212

Query: 220 YEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYH 279
            E      LG++GGKP H+LYFIGY  + +++LDPH  Q    V   +   E     ++H
Sbjct: 213 CELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE-----SFH 263

Query: 280 CPQASRLHILHMDPSIAV 297
           C    ++    MDPS  V
Sbjct: 264 CTSPRKMAFAKMDPSCTV 281


>gi|320588376|gb|EFX00845.1| cysteine protease atg4 [Grosmannia clavigera kw1407]
          Length = 348

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 53/110 (48%), Gaps = 24/110 (21%)

Query: 18  DITSRLWFTYRKGFVPI---------------------GD-SGLTTDKGWGCMLRCGQMV 55
           D  SR W TYR GF PI                     GD S  ++D GWGCM+R GQ +
Sbjct: 123 DFESRFWMTYRSGFEPIARSVDPKAPATLSFTMKLKALGDQSDFSSDSGWGCMIRSGQSL 182

Query: 56  IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA 105
           +A A+    LGR W+ +     E   +I+ +F D   APYSIH+    GA
Sbjct: 183 LANAMAMYELGRGWRLSDGGIAEK--EIISLFADDPRAPYSIHRFVGHGA 230


>gi|426387285|ref|XP_004060104.1| PREDICTED: cysteine protease ATG4D [Gorilla gorilla gorilla]
          Length = 362

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 32/63 (50%), Positives = 44/63 (69%)

Query: 11  DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D+++ +RD  SRLW TYR+ F P+    LT+D GWGCMLR GQM++AQ LL   L R ++
Sbjct: 110 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRVYK 169

Query: 71  WNV 73
            +V
Sbjct: 170 ADV 172



 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 68/138 (49%), Gaps = 32/138 (23%)

Query: 160 RASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPR 219
           R     +W+ +V+++P+RLG + +NPVY+  +K+                         R
Sbjct: 178 RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL-----------------------R 214

Query: 220 YEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYH 279
            E      LG++GGKP H+LYFIGY  + +++LDPH  Q      D  Q ++  L+S +H
Sbjct: 215 CELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQ-ADFPLES-FH 265

Query: 280 CPQASRLHILHMDPSIAV 297
           C    ++    MDPS  V
Sbjct: 266 CTSPRKMAFAKMDPSCTV 283


>gi|240274226|gb|EER37743.1| cysteine protease atg4 [Ajellomyces capsulatus H143]
          Length = 454

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 71/307 (23%), Positives = 115/307 (37%), Gaps = 110/307 (35%)

Query: 18  DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
           D  S++W TYR  F  +P                     +   G TTD GWGCM+R G  
Sbjct: 128 DFESKIWLTYRSNFPLIPKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSG-- 185

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEW 114
              Q+LL            N+     L IL                              
Sbjct: 186 ---QSLL-----------ANA-----LAIL------------------------------ 196

Query: 115 FGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKR-ASSNPQWQPLVL 172
               ++ +  R L+   + + +  +V  D + V  ++ + + +     A ++    P ++
Sbjct: 197 ----SLGRACRALSSECEHAGLNVYVTSDGSDVYEDRFRAIASAGGTGAGTSTDVHPTLI 252

Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
           ++ +RLGI  + PVY   +K                               +PQS+G+ G
Sbjct: 253 LLGIRLGIDRVTPVYWEALKAV---------------------------LKYPQSVGIAG 285

Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STYHCPQASRLHILH 290
           G+P+ + YFIG  G+   +LDPH +     VY    D     +  +TYH  +  RLHI  
Sbjct: 286 GRPSSSHYFIGAQGSHFFYLDPH-HTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKD 344

Query: 291 MDPSIAV 297
           MDPS+ +
Sbjct: 345 MDPSMLI 351


>gi|323450755|gb|EGB06635.1| hypothetical protein AURANDRAFT_65498 [Aureococcus anophagefferens]
          Length = 426

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 41/113 (36%), Positives = 50/113 (44%), Gaps = 17/113 (15%)

Query: 23  LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
           LWFTYR GF  +   G T D GWGCMLR  QM++  AL              +     L 
Sbjct: 28  LWFTYRCGFEELAPYGFTDDAGWGCMLRSAQMLLGNAL------------TRNGAAPRLA 75

Query: 83  ILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
              +F D    +AP+ +H  A  G       GEW+GP     VLR L    DW
Sbjct: 76  TAALFADAPGDSAPFGLHNFAKCGLRYDVLPGEWYGPGVACHVLRDLV---DW 125


>gi|428184439|gb|EKX53294.1| hypothetical protein GUITHDRAFT_133035 [Guillardia theta CCMP2712]
          Length = 567

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 53/216 (24%), Positives = 96/216 (44%), Gaps = 58/216 (26%)

Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD--W----SSIVFHVALDNTLVVNQVKK 153
           + L GA        WFGP+T+ +VLR +   ++  W    + ++F    D+ +  +  + 
Sbjct: 337 LMLHGAISQPLCCRWFGPDTICRVLRHIWNMNEGVWPCHTAGMLF--VEDHCIYRDLAES 394

Query: 154 L-----------CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
           +           C+   +A     W+PL++V+P+RLG +  +  +++ I K         
Sbjct: 395 VACSRQAYSGTNCSRMAQAREPCSWRPLIVVVPVRLGARSEDQ-HLSRIDKHL------- 446

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
                                  QSLG IGG+P H+ YF+G  G +  +LDPH  Q    
Sbjct: 447 -----------------------QSLGFIGGRPRHSYYFVGVRGYNAYYLDPHITQPY-- 481

Query: 263 VYDKEQDSEKKLD-STYHCPQASRLHILHMDPSIAV 297
                Q   K ++ +++HC    ++ + H+DPS+A+
Sbjct: 482 -----QSIRKNINVASFHCAHPGKMSLAHIDPSLAL 512


>gi|154343631|ref|XP_001567761.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134065093|emb|CAM43207.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 398

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 66/267 (24%), Positives = 106/267 (39%), Gaps = 54/267 (20%)

Query: 23  LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
           L+F+YR  F P+ + G TTD  WGC+LR  QM+I   LL  H    +        E    
Sbjct: 74  LYFSYRSCFPPLPN-GSTTDTRWGCLLRTTQMLIGTCLLRYHCKGAYVLPEADNAELKAN 132

Query: 83  ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK--LAKYDDWSSIVFHV 140
           I ++F D  +AP  IH+                 P      +    +A + +   + F  
Sbjct: 133 ISRLFMDVPSAPLGIHRAEDEAHKNCVKYASMLSPTEAGMAMAAALIACHAEGGDVPFTF 192

Query: 141 ALDNTLVVNQ--VKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
           + +N  +     V KL           + Q ++L+IP+ LG+                 P
Sbjct: 193 SCENRNIDEPAVVAKLL----------EGQHVILIIPVVLGLA----------------P 226

Query: 199 ISPVYD--MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH- 255
           +S  Y+  M+KIL    +M+            G+ GG    + Y  G+ G  V F+DPH 
Sbjct: 227 LSDKYESMMLKIL----DMKA---------CCGIAGGFKQASFYMFGHQGRKVFFMDPHY 273

Query: 256 ------TNQNIGCVYDKEQD-SEKKLD 275
                 +++  G +Y    D + +K D
Sbjct: 274 IQKAYTSDKTAGTLYGARGDLTARKFD 300


>gi|340054025|emb|CCC48319.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma vivax Y486]
          Length = 326

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 60/252 (23%), Positives = 96/252 (38%), Gaps = 47/252 (18%)

Query: 20  TSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA 79
           TS    TYR  F P+  S LT+D+GWGC+ R  QM++A  L             ++  E 
Sbjct: 41  TSFYLLTYRMNFEPLPCSTLTSDRGWGCLARASQMLLAHVL-----------RRHAASEC 89

Query: 80  YLKILKMFEDRRTAPYSIHQIALTGASEGKAV-GEWFGPNTVAQVLRKLAKYDDWSSIVF 138
           +LK      D   AP+S+H +       G     +++ P+   + +R   +     S V 
Sbjct: 90  HLKFFCDMNDEHLAPFSLHCMTRAVIKHGTEFRADYWAPSQGCEAIRSCVE-----SAVR 144

Query: 139 HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV-IPLRLGIQDINPVYINGIKKCYAL 197
              L   L V          +    + +    VLV +P+R G                  
Sbjct: 145 QGLLTQKLSVVVSSSGTIPEREIHEHLRGDGSVLVLVPVRCGTS---------------- 188

Query: 198 PISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT- 256
                    + ++ T       +    P  +GV+GG PN   Y +G  G+ +++LDPH  
Sbjct: 189 ---------RRMTQTMFFAL-EHLLHIPSCMGVVGGVPNRGYYIVGTSGHRLLYLDPHCM 238

Query: 257 --NQNIGCVYDK 266
             N  + C   K
Sbjct: 239 TQNAMVSCELGK 250


>gi|67482849|ref|XP_656724.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|56473943|gb|EAL51338.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449705841|gb|EMD45804.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 348

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 66/241 (27%), Positives = 102/241 (42%), Gaps = 61/241 (25%)

Query: 19  ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE 78
           +TS ++F YR  F  + ++ LT+D GWGC +R  QM++A A++ L  G D   N+N K  
Sbjct: 84  LTSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANAIIKL-FGSD---NINRK-- 137

Query: 79  AYLKILKMFED--RRTAPYSIHQIALTG-ASEGKAVGEWFGP-NTVAQVLRKLAKYDDWS 134
               ++  F D      PYSIH +  T     G   G  F P ++V   L +L   D   
Sbjct: 138 ---TVIHWFLDFYNSECPYSIHSLFTTQIIVSGNPNGSSFLPFSSVIYALTELVNKD--- 191

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
              F+ A +  ++ N+   L + NK         P ++ IP                   
Sbjct: 192 ---FNRAFECHVITNKF-LLKSINK---------PTIVFIP------------------- 219

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
           + +P      ++ I             F+F    G++GG    A YF G   N ++FLDP
Sbjct: 220 FTIPDKFDQRLITI-------------FSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDP 266

Query: 255 H 255
           H
Sbjct: 267 H 267


>gi|47213810|emb|CAF92583.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 265

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 33/67 (49%), Positives = 45/67 (67%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
           L+  ++E+ R    SR+W TYRK F P+  S LTTD GWGCMLR GQM++AQ LL   + 
Sbjct: 69  LNLDEVERFRLAFVSRIWLTYRKDFPPLEGSTLTTDCGWGCMLRSGQMLLAQGLLVHLMH 128

Query: 67  RDWQWNV 73
           R ++ +V
Sbjct: 129 RVYKEDV 135



 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/131 (28%), Positives = 59/131 (45%), Gaps = 32/131 (24%)

Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
           WQ +++++P+RLG + +NP YI  +K    L                             
Sbjct: 155 WQSVIILVPVRLGGESLNPSYIECVKNILKLDCC-------------------------- 188

Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
            +G+IGGKP H+LYFIG+    +++LDPH  Q +  V       E     ++HC    ++
Sbjct: 189 -IGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDVSQVNFSLE-----SFHCNSPKKM 242

Query: 287 HILHMDPSIAV 297
               MDPS  +
Sbjct: 243 PFSRMDPSCTI 253


>gi|336259147|ref|XP_003344378.1| hypothetical protein SMAC_08321 [Sordaria macrospora k-hell]
          Length = 429

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 59/126 (46%), Gaps = 26/126 (20%)

Query: 18  DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
           D  SR+W TYR  F  I  S                       G ++D GWGCM+R GQ 
Sbjct: 183 DFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQS 242

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
           ++A A+L   LGR+W+   +   E    I+ +F D   AP+S+H     GA+  GK  GE
Sbjct: 243 LLANAILVARLGREWRRETDLDAEK--DIIALFADDPRAPFSLHNFVKYGATACGKYPGE 300

Query: 114 WFGPNT 119
              P++
Sbjct: 301 CGRPSS 306



 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 37/66 (56%), Gaps = 2/66 (3%)

Query: 233 GKPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
           G+P+ + YFIG  G  + +LDP H    +    D +  + ++LD T H  +  +LHI  M
Sbjct: 302 GRPSSSHYFIGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELD-TCHTRRLRQLHIDDM 360

Query: 292 DPSIAV 297
           DPS+ +
Sbjct: 361 DPSMLI 366


>gi|407037690|gb|EKE38747.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 348

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 63/241 (26%), Positives = 99/241 (41%), Gaps = 61/241 (25%)

Query: 19  ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE 78
           +TS ++F YR  F  + ++ LT+D GWGC +R  QM++A +++ L  G D   N+N K  
Sbjct: 84  LTSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANSIIKL-FGSD---NINRK-- 137

Query: 79  AYLKILKMFED--RRTAPYSIHQIALTGASEGK-AVGEWFGP-NTVAQVLRKLAKYDDWS 134
               ++  F D      PYSIH +  T     K   G  F P + V   L +L   D   
Sbjct: 138 ---TVIHWFLDFYNSECPYSIHSLFTTQIIVSKNPNGSSFLPFSVVIYALTELVNKDFNR 194

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
           +   H+ + N  ++N + K               P ++ IP                   
Sbjct: 195 AFECHI-ITNKFLLNSINK---------------PTIVFIP------------------- 219

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
           + +P      ++ I             F+F    G++GG    A YF G   N ++FLDP
Sbjct: 220 FTIPDEFEQRLITI-------------FSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDP 266

Query: 255 H 255
           H
Sbjct: 267 H 267


>gi|255082892|ref|XP_002504432.1| predicted protein [Micromonas sp. RCC299]
 gi|226519700|gb|ACO65690.1| predicted protein [Micromonas sp. RCC299]
          Length = 196

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 64/139 (46%), Gaps = 35/139 (25%)

Query: 167 WQPLVLVIPLRLGI-QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
           W PLV+++PL LG+ + +NP Y+ GI +   LP                           
Sbjct: 75  WAPLVILVPLVLGLDRCVNPRYVPGIVRMLGLP--------------------------- 107

Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHT-------NQNIGCVYDKEQDSEKKLDSTY 278
           QS+G++GGKP  +LYF+G    ++ +LDPHT        Q  GC      +S      TY
Sbjct: 108 QSVGILGGKPCASLYFVGAQDEELFYLDPHTVQLAVPLEQIWGCAQTGSPESGPFPTETY 167

Query: 279 HCPQASRLHILHMDPSIAV 297
           HC     ++   +DPS+ +
Sbjct: 168 HCRSVLHMNARELDPSMVL 186



 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 25/40 (62%), Positives = 29/40 (72%)

Query: 21 SRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQAL 60
          SR+W TYR+GF  IG    TTD GWGC LR GQM++A AL
Sbjct: 3  SRVWITYRRGFPQIGGGTYTTDAGWGCTLRSGQMLLANAL 42


>gi|146097214|ref|XP_001468076.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
 gi|134072442|emb|CAM71152.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
          Length = 388

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 61/251 (24%), Positives = 100/251 (39%), Gaps = 42/251 (16%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E ++      L+F+YR  F P+  +G TTD  WGC++R  QM++   LL  H    +   
Sbjct: 58  EFVKAATKKLLYFSYRNCFPPL-PNGSTTDTRWGCLVRTTQMLVGTCLLRYHCQGTYVLP 116

Query: 73  VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK--LAKY 130
                E   +I ++F D  +AP  IH+                 P      +    +A +
Sbjct: 117 EADNAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSPTEAGMAIAAALIAFH 176

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
                + F    ++      + +     K +    + Q ++L+IP+ LGI          
Sbjct: 177 AQGGDVPFTFCCES----RNIDEPAVMAKLS----EGQHVILIIPVVLGIA--------- 219

Query: 191 IKKCYALPISPVYD--MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
                  P+S  Y+  M+KIL    +M+            G+ GG    +LY  G+ G  
Sbjct: 220 -------PMSDQYERMMLKIL----DMKA---------CCGIAGGLKRASLYMFGHQGRS 259

Query: 249 VIFLDPHTNQN 259
           V F+DPH  QN
Sbjct: 260 VFFMDPHYIQN 270


>gi|384496645|gb|EIE87136.1| hypothetical protein RO3G_11847 [Rhizopus delemar RA 99-880]
          Length = 224

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 30/53 (56%), Positives = 35/53 (66%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
           D TSRLW TYR  + PI  S   TD GWGCMLR GQ ++A  L+   LGRDW+
Sbjct: 142 DFTSRLWMTYRHNYPPIRPSNHKTDIGWGCMLRSGQSLLANTLIIHFLGRDWR 194


>gi|71407017|ref|XP_806004.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
           Brener]
 gi|70869620|gb|EAN84153.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
 gi|111154177|gb|ABH07410.1| autophagin-1 [Trypanosoma cruzi]
          Length = 328

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 52/237 (21%), Positives = 100/237 (42%), Gaps = 52/237 (21%)

Query: 25  FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
            TYR  F P+  S +T+DKGWGC++R  QM++A AL        W+++ N   +  L   
Sbjct: 46  LTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSAN---DCRLDHF 94

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVG-EWFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
           +  +   + P+S+H++      +      E++ P+   + +R             + A+D
Sbjct: 95  RDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWTPSQGCEAIR----------CCVNNAVD 144

Query: 144 NTLV----VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
             L+    V    + C   +   SN ++  ++++ P+R G                 +  
Sbjct: 145 RKLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCGAS-------------RRMTQ 191

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
              + +  +L S+               +GV+GG P  + Y +G  G  +++LDPH 
Sbjct: 192 MMFFSLEHLLHSS-------------ACIGVVGGVPQRSYYILGTSGQRLLYLDPHC 235


>gi|398021304|ref|XP_003863815.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
 gi|322502048|emb|CBZ37132.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
          Length = 388

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 61/251 (24%), Positives = 100/251 (39%), Gaps = 42/251 (16%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E ++      L+F+YR  F P+  +G TTD  WGC++R  QM++   LL  H    +   
Sbjct: 58  EFVKAATKKLLYFSYRNCFPPL-PNGSTTDTRWGCLVRTTQMLVGTCLLRYHCQGAYVLP 116

Query: 73  VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK--LAKY 130
                E   +I ++F D  +AP  IH+                 P      +    +A +
Sbjct: 117 EADNAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSPTEAGMAIAAALIAFH 176

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
                + F    ++      + +     K +    + Q ++L+IP+ LGI          
Sbjct: 177 AQGGDVPFTFCCES----RNIDEPAVMAKLS----EGQHVILIIPVVLGIA--------- 219

Query: 191 IKKCYALPISPVYD--MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
                  P+S  Y+  M+KIL    +M+            G+ GG    +LY  G+ G  
Sbjct: 220 -------PMSDQYERMMLKIL----DMKA---------CCGIAGGLKRASLYMFGHQGRS 259

Query: 249 VIFLDPHTNQN 259
           V F+DPH  QN
Sbjct: 260 VFFMDPHYIQN 270


>gi|401427503|ref|XP_003878235.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
 gi|322494482|emb|CBZ29784.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
          Length = 388

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 58/252 (23%), Positives = 96/252 (38%), Gaps = 46/252 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
           E ++      L+F+YR  F P+ +   TTD  WGC++R  QM++   LL  H    +   
Sbjct: 58  EFVKAAAKKLLYFSYRNCFPPLPNRS-TTDTRWGCLVRTTQMLVGSCLLRYHCKGAYVLP 116

Query: 73  VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
                E   +I ++F D  +AP  IH++                P      +        
Sbjct: 117 ERDNAELKERISRLFMDVPSAPLGIHKVEDEAHKNSVKYASMLSPTEAGMAIAA------ 170

Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP------QWQPLVLVIPLRLGIQDINPV 186
            + I FH    +          C  N+    +       + Q ++L+IP+ LGI      
Sbjct: 171 -ALIAFHAQGGDAPFT-----FCCENRNIDESAVMAKLSEGQHVILIIPVVLGIA----- 219

Query: 187 YINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG 246
                      P+S  Y+  ++L    +M+            G+ GG    +LY  G+ G
Sbjct: 220 -----------PMSGQYE--RMLLKILDMKA---------CCGIAGGFKQASLYMFGHQG 257

Query: 247 NDVIFLDPHTNQ 258
            +V F+DPH  Q
Sbjct: 258 RNVFFMDPHYVQ 269


>gi|407417199|gb|EKF38000.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
          Length = 328

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 52/237 (21%), Positives = 100/237 (42%), Gaps = 52/237 (21%)

Query: 25  FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
            TYR  F P+  S +T+DKGWGC++R  QM++A AL        W+++ N   +  L   
Sbjct: 46  LTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSAN---DCRLDHF 94

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVG-EWFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
           +  +   + P+S+H++      +      E++ P+   + +R             + A+D
Sbjct: 95  RDIDTEDSTPFSLHKMVRAVMKKADVFRPEYWTPSQGCEAIR----------CCVNNAVD 144

Query: 144 NTLV----VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
             L+    V    + C   +   SN ++  ++++ P+R G                 +  
Sbjct: 145 RRLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCGAS-------------RRMTQ 191

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
              + +  +L S+               +GV+GG P  + Y +G  G  +++LDPH 
Sbjct: 192 MMFFSLEHLLHSS-------------ACIGVVGGVPQRSYYILGTSGQRLLYLDPHC 235


>gi|119493442|ref|XP_001263911.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
 gi|119412071|gb|EAW22014.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
          Length = 179

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 73/161 (45%), Gaps = 42/161 (26%)

Query: 143 DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
           D+T  V++ K L   N    S   ++P +++I  RLGI  I PVY + +K    LP    
Sbjct: 3   DDTGDVHEDKFLDAANDERGS---FRPTLILIGTRLGIDRITPVYWDAVKTTLQLP---- 55

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN----- 257
                                  QS+G+ GG+P+ + YF+G  G+ + +LDPH       
Sbjct: 56  -----------------------QSVGIAGGRPSASHYFVGVQGSHLFYLDPHQTRPALP 92

Query: 258 -QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            +NI   Y  E+        TYH  +  R+HI  MDPS+ +
Sbjct: 93  QRNIDERYTDEE------IETYHTRRLRRIHIRDMDPSMLI 127


>gi|407852207|gb|EKG05835.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
           Clan CA, family C54, putative [Trypanosoma cruzi]
          Length = 328

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 53/237 (22%), Positives = 99/237 (41%), Gaps = 52/237 (21%)

Query: 25  FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
            TYR  F P+  S +T+DKGWGC++R  QM++A AL        W+++ N   +  L   
Sbjct: 46  LTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSAN---DCRLDHF 94

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVG-EWFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
              +   + P+S+H++      +      E++ P+   + +R             + A+D
Sbjct: 95  CDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWTPSQGCEAIR----------CCVNNAVD 144

Query: 144 NTLV----VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
             L+    V    + C   +   SN ++  ++++ P+R G                    
Sbjct: 145 RKLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCG-------------------A 185

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
           S     +K  S  + + +          +GV+GG P  + Y +G  G  +++LDPH 
Sbjct: 186 SRRMTQMKFFSLEHLLHS-------STCIGVVGGVPQRSYYILGTSGQRLLYLDPHC 235


>gi|157874465|ref|XP_001685715.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
 gi|68128787|emb|CAJ08920.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
          Length = 388

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 61/241 (25%), Positives = 92/241 (38%), Gaps = 42/241 (17%)

Query: 23  LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
           L+F+YR  F P+  SG TTD  WGC++R  QM++   LL  H    +        E   +
Sbjct: 68  LYFSYRNCFPPL-PSGSTTDTHWGCLVRTTQMLVGTCLLRYHCKGAYVLPEADNAELKER 126

Query: 83  ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK--LAKYDDWSSIVFHV 140
           I ++F D  +AP  IH+                 P      +    +A       + F  
Sbjct: 127 ISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSPTEAGMAIAAALIAFRAQGGDVPFTF 186

Query: 141 ALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
             ++      + +     K      + Q +VL+IP+ LGI                 P+S
Sbjct: 187 CCES----RHIDEPAVMAKLL----EGQHVVLIIPVVLGIA----------------PMS 222

Query: 201 PVYD--MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
             Y+  M+KIL                   G+ GG    +LY  G+ G  V F+DPH  Q
Sbjct: 223 DQYELVMLKILD-------------VKACCGIAGGFKQASLYMFGHQGRSVFFMDPHYVQ 269

Query: 259 N 259
           N
Sbjct: 270 N 270


>gi|145507452|ref|XP_001439681.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406876|emb|CAK72284.1| unnamed protein product [Paramecium tetraurelia]
          Length = 312

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 60/251 (23%), Positives = 114/251 (45%), Gaps = 51/251 (20%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
           Q+ E   +   + +WF YR      G +   +D+GWGC++R GQM++A AL+     R+ 
Sbjct: 44  QNAEAFNQKKDTLIWFCYRANIQFEGKA--ISDQGWGCLVRVGQMMLANALM-----REC 96

Query: 70  QWNVNSKEEAYLKILKMFEDRR----TAPYSIHQIALTGA-SEGKAVGEWFGPNTVAQVL 124
           +    +K +A   I+ +F+D +     AP+SI QI    + +    +G+W+    +  V+
Sbjct: 97  KILAINKTKAM--IIHLFDDNQEYSTIAPFSIQQIIKRASINLNMKIGDWYTGPKIMSVI 154

Query: 125 RKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN 184
             L K +        + +    +VN +++ C    +   + + +P +L+I   +G + + 
Sbjct: 155 EDLNKNN--------MNIKQINLVNFLEQ-CVLESQIDLSFK-KPHLLIIHAIIGDKSLG 204

Query: 185 PVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGY 244
            + I  ++                     +MQ  ++        G I GK N A + IG+
Sbjct: 205 QLEIQNLQS--------------------HMQISQFA-------GAIIGKNNKAFFLIGF 237

Query: 245 VGNDVIFLDPH 255
             N+ IF+DPH
Sbjct: 238 QKNNAIFMDPH 248


>gi|124088531|ref|XP_001347134.1| Cysteine protease required for autophagy-like [Paramecium
           tetraurelia strain d4-2]
 gi|145474259|ref|XP_001423152.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|50057523|emb|CAH03507.1| Cysteine protease required for autophagy-like [Paramecium
           tetraurelia]
 gi|124390212|emb|CAK55754.1| unnamed protein product [Paramecium tetraurelia]
          Length = 277

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 59/242 (24%), Positives = 108/242 (44%), Gaps = 53/242 (21%)

Query: 23  LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK-EEAYL 81
           +WF+YR      G +   +D+GWGC++R GQM++A +L+        + + NSK  +   
Sbjct: 22  IWFSYRANIQYEGRA--ISDQGWGCLIRVGQMIVANSLI--------RESTNSKPNDLKT 71

Query: 82  KILKMFEDRRT----APYSIHQ-IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSI 136
           KI+ +F+D +     AP+SI Q I          +G+W+    +  +L  L +       
Sbjct: 72  KIICLFDDNQCFSTLAPFSIQQIIKRADLVYNIKIGDWYTGPKIMCLLEDLLQSAK---- 127

Query: 137 VFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYA 196
                +    ++N +++ C   K+     + QP +L+I   +G ++++  ++  ++K   
Sbjct: 128 ----TIKQLKIINFLEQ-CVIEKQIDLQFK-QPQLLIIHAIIGNKELDQYFVAELQK--- 178

Query: 197 LPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
                            +MQ P++        G I GK   A + IGY  N  I +DPH 
Sbjct: 179 -----------------HMQIPQFA-------GAIVGKSKKAYFLIGYQNNQGIVMDPHY 214

Query: 257 NQ 258
            Q
Sbjct: 215 VQ 216


>gi|71425372|ref|XP_813094.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
           Brener]
 gi|70877946|gb|EAN91243.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 328

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 52/237 (21%), Positives = 99/237 (41%), Gaps = 52/237 (21%)

Query: 25  FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
            TYR  F P+  S +T+DKGWGC++R  QM++A AL        W+++ N   +  L   
Sbjct: 46  LTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSAN---DCRLDHF 94

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVG-EWFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
              +   + P+S+H++      +      E++ P+   + +R             + A+D
Sbjct: 95  CDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWTPSQGCEAIR----------CCVNNAVD 144

Query: 144 NTLV----VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
             L+    V    + C   +   SN ++  ++++ P+R G                 +  
Sbjct: 145 RKLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCGAS-------------RRMTQ 191

Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
              + +  +L S+               +GV+GG P  + Y +G  G  +++LDPH 
Sbjct: 192 MMFFSLEHLLHSS-------------ACIGVVGGVPQRSYYILGTSGQRLLYLDPHC 235


>gi|330846267|ref|XP_003294964.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
 gi|325074459|gb|EGC28510.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
          Length = 266

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 27/44 (61%), Positives = 36/44 (81%)

Query: 18  DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL 61
           D+ S +WF+YRK F PI ++ +TTD GWGCMLR GQM++A+ALL
Sbjct: 216 DVRSLIWFSYRKDFPPIENTTITTDIGWGCMLRTGQMILARALL 259


>gi|403364614|gb|EJY82073.1| hypothetical protein OXYTRI_20407 [Oxytricha trifallax]
          Length = 806

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 65/129 (50%), Gaps = 30/129 (23%)

Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
           L++++ +RLG+++I   Y   +K C++L                            Q +G
Sbjct: 641 LMIIMTIRLGLENIEQDYHKALKACFSLR---------------------------QCVG 673

Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHI 288
           ++GGKPN ALYF+GY  + +IFLDPH  Q    +   EQ  +++L  TY   + A ++ +
Sbjct: 674 ILGGKPNFALYFVGYQQDHMIFLDPHYVQQ--ALTSDEQLKDQELKDTYQSQRSAKKIKM 731

Query: 289 LHMDPSIAV 297
             +DP I V
Sbjct: 732 ESLDPCIGV 740



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 6/67 (8%)

Query: 39  LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSI 97
           + +D GWGCM+RC QM++A + L L      Q N N  +   +  IL M  D+  AP+ I
Sbjct: 393 INSDCGWGCMIRCQQMMLANSFLKL-----LQQNHNFHDILTHDSILSMILDQLDAPFGI 447

Query: 98  HQIALTG 104
           HQI   G
Sbjct: 448 HQITEEG 454


>gi|401425377|ref|XP_003877173.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
 gi|322493418|emb|CBZ28705.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
           mexicana MHOM/GT/2001/U1103]
          Length = 394

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 64/255 (25%), Positives = 105/255 (41%), Gaps = 54/255 (21%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
           S ++LE+   D  + L FTYR GF  +P     + TD+GWGC+LR  QM++A   L++H 
Sbjct: 33  SREELEKALAD--TFLIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAH-FLWVH- 88

Query: 66  GRDWQWNVNSKEEAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
           GR              + L +F D    TAP+SIH +  +  +      E++ P+   + 
Sbjct: 89  GRPAD-----------RKLSLFFDHSAETAPFSIHNMIRSVWNRRAFKAEYWSPSQGCEA 137

Query: 124 LRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI--- 180
           +++                              T + A    Q Q  V+V+    G    
Sbjct: 138 IKR------------------------------TMQGAVKTEQLQTRVMVVTSTNGCIYA 167

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
            ++   +  G      L    V    ++   +Y +Q  +     PQ LGV+GG P  + Y
Sbjct: 168 DEVQHTFKQGADVVLVLASVRVSAAAQLTQESY-LQIEKL-MEQPQCLGVVGGVPGRSYY 225

Query: 241 FIGYVGNDVIFLDPH 255
           F  +    + +LDPH
Sbjct: 226 FFAHNQTQLFYLDPH 240


>gi|159128081|gb|EDP53196.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
           fumigatus A1163]
          Length = 226

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 72/161 (44%), Gaps = 42/161 (26%)

Query: 143 DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
           D+T  V + K L   N    S   ++P +++I  RLGI  I PVY + +K    LP    
Sbjct: 3   DDTADVYEDKFLDAANDGRGS---FRPTLILIGTRLGIDRITPVYWDAVKTTLQLP---- 55

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN----- 257
                                  QS+G+ GG+P+ + YF+G  G+ + +LDPH       
Sbjct: 56  -----------------------QSVGIAGGRPSASHYFVGVQGSHLFYLDPHQTRPALP 92

Query: 258 -QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            +NI   Y  E+        TYH  +  R+HI  MDPS+ +
Sbjct: 93  QRNIDDPYTDEE------IETYHTRRLRRIHIRDMDPSMLI 127


>gi|167385012|ref|XP_001737178.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165900129|gb|EDR26546.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 348

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 63/241 (26%), Positives = 97/241 (40%), Gaps = 61/241 (25%)

Query: 19  ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE 78
           +TS ++F YR  F  + ++ L +D GWGC +R  QM++A A++ L  G D   N+N K  
Sbjct: 84  LTSLIYFVYRSNFSALPNTSLKSDGGWGCTIRACQMLLANAIIKL-FGSD---NINRK-- 137

Query: 79  AYLKILKMFED--RRTAPYSIHQIALTG-ASEGKAVGEWFGP-NTVAQVLRKLAKYDDWS 134
               ++  F D      PYSIH +  T     G   G  F P + V   L +L   D   
Sbjct: 138 ---TVIHWFLDFYNVECPYSIHSLFTTQIIVSGNPNGSSFLPLSVVTYALTELVNKDLNR 194

Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
               HV + N  ++N + K               P ++ IP                   
Sbjct: 195 IFECHV-ITNKFLLNSINK---------------PTIIFIP------------------- 219

Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
           + +P      ++ I             F+F    G++GG    A YF G   + ++FLDP
Sbjct: 220 FTIPDEFNQRLISI-------------FSFNLFAGMVGGCKQKAFYFFGIHHDQLLFLDP 266

Query: 255 H 255
           H
Sbjct: 267 H 267


>gi|146093458|ref|XP_001466840.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
 gi|134071204|emb|CAM69889.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
           JPCM5]
          Length = 394

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 64/255 (25%), Positives = 105/255 (41%), Gaps = 54/255 (21%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
           S ++LE+   D  + L FTYR GF  +P     + TD+GWGC+LR  QM++A   L++H 
Sbjct: 33  SREELEKALTD--TFLIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAH-FLWVH- 88

Query: 66  GRDWQWNVNSKEEAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
           GR              + L +F D    TAP+SIH +  +  +      E++ P+   + 
Sbjct: 89  GRPAD-----------RRLSLFFDHSAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGCEA 137

Query: 124 LRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI--- 180
           +++                              T + A    Q Q  V+V+    G    
Sbjct: 138 IKR------------------------------TVQGAVKTEQLQTRVMVVTSTNGCIYA 167

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
            ++   +  G      L    V    ++   +Y +Q  +     PQ LGV+GG P  + Y
Sbjct: 168 DEVQHTFKQGADVVLVLASVRVSAAAQLTQESY-LQIEKL-MEQPQCLGVVGGVPGRSYY 225

Query: 241 FIGYVGNDVIFLDPH 255
           F  +    + +LDPH
Sbjct: 226 FFAHNQTQLFYLDPH 240


>gi|71000771|ref|XP_755067.1| autophagy cysteine endopeptidase Atg4 [Aspergillus fumigatus Af293]
 gi|66852704|gb|EAL93029.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
           fumigatus Af293]
          Length = 226

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 72/161 (44%), Gaps = 42/161 (26%)

Query: 143 DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
           D+T  V + K L   N    S   ++P +++I  RLGI  I PVY + +K    LP    
Sbjct: 3   DDTADVYEDKFLDAANDGRGS---FRPTLILIGTRLGIDRITPVYWDAVKTTLQLP---- 55

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN----- 257
                                  QS+G+ GG+P+ + YF+G  G+ + +LDPH       
Sbjct: 56  -----------------------QSVGIAGGRPSASHYFVGVQGSHLFYLDPHQTRPALP 92

Query: 258 -QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
            +NI   Y  E+        TYH  +  R+HI  MDPS+ +
Sbjct: 93  QRNIDDPYTDEE------IETYHTRRLRRIHIRDMDPSMLI 127


>gi|72389991|ref|XP_845290.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359288|gb|AAX79730.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei]
 gi|70801825|gb|AAZ11731.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
           brucei strain 927/4 GUTat10.1]
          Length = 327

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 102/236 (43%), Gaps = 45/236 (19%)

Query: 25  FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
           FTYR+ F P+  S LT+DKGWGC+ R  QM++A +L             +S ++  L+  
Sbjct: 46  FTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL-----------RRHSAQDCKLQYF 94

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVGE-WFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
              +D + AP+S+H +      +G+++   ++ P+   + +    K      I     L 
Sbjct: 95  ADLDDEQVAPFSLHCMVRHILKQGESLRPVYWAPSQGCEAISGCVKRATERGI-----LS 149

Query: 144 NTL-VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
           + L VV  V       + +    + + ++++ PLR G                       
Sbjct: 150 SPLSVVITVAGAVPAEEVSCHLKESRNVLILAPLRCGASR-------------------- 189

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND-VIFLDPHTN 257
           Y   K+  S  ++         P+S+G++GG PN   Y IG    + +++LDPH  
Sbjct: 190 YMSQKMFLSLEHL------LLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCK 239


>gi|398019156|ref|XP_003862742.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
 gi|322500973|emb|CBZ36050.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
          Length = 394

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 64/255 (25%), Positives = 105/255 (41%), Gaps = 54/255 (21%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
           S ++LE+   D  + L FTYR GF  +P     + TD+GWGC+LR  QM++A   L++H 
Sbjct: 33  SREELEKALTD--TFLIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAH-FLWVH- 88

Query: 66  GRDWQWNVNSKEEAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
           GR              + L +F D    TAP+SIH +  +  +      E++ P+   + 
Sbjct: 89  GRPAD-----------RRLSLFFDHSAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGCEA 137

Query: 124 LRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI--- 180
           +++                              T + A    Q Q  V+V+    G    
Sbjct: 138 IKR------------------------------TVQGAVKTEQLQTRVMVVTSANGCIYA 167

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
            ++   +  G      L    V    ++   +Y +Q  +     PQ LGV+GG P  + Y
Sbjct: 168 DEVQHTFKQGADVVLVLASVRVSAAAQLTQESY-LQIEKL-MEQPQCLGVVGGVPGRSYY 225

Query: 241 FIGYVGNDVIFLDPH 255
           F  +    + +LDPH
Sbjct: 226 FFAHNQTQLFYLDPH 240


>gi|326665689|ref|XP_002661113.2| PREDICTED: cysteine protease ATG4D-like, partial [Danio rerio]
          Length = 149

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 29/53 (54%), Positives = 38/53 (71%), Gaps = 1/53 (1%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
           E  RR  +S LW +YR+GF P+  S L++D GWGCMLR  QM++AQ LL LH+
Sbjct: 95  ESFRRVFSSLLWMSYRRGFRPLDGSTLSSDAGWGCMLRSAQMLLAQGLL-LHI 146


>gi|261328682|emb|CBH11660.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
           gambiense DAL972]
          Length = 327

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 104/236 (44%), Gaps = 45/236 (19%)

Query: 25  FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
           FTYR+ F P+  S LT+DKGWGC+ R  QM++A +L             +S ++  L+  
Sbjct: 46  FTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL-----------RRHSAQDCKLQYF 94

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVGE-WFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
              +D + AP+S+H +      +G+++   ++ P+   + +    K      I     L 
Sbjct: 95  ADLDDEQVAPFSLHCMVRHILKQGESLRPVYWAPSQGCEAISGCVKRATERGI-----LS 149

Query: 144 NTL-VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
           + L VV  V       + +    + + ++++ PLR            G  +C +      
Sbjct: 150 SPLSVVITVAGAVPAEEVSCHLKESRNVLILAPLRC-----------GASRCMS------ 192

Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND-VIFLDPHTN 257
               K+  S  ++         P+S+G++GG PN   Y IG    + +++LDPH  
Sbjct: 193 ---QKMFLSLEHL------LLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCK 239


>gi|157872135|ref|XP_001684616.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
 gi|68127686|emb|CAJ05824.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
           Friedlin]
          Length = 394

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 64/255 (25%), Positives = 105/255 (41%), Gaps = 54/255 (21%)

Query: 8   SHQDLEQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
           S ++LE+   D  + L FTYR GF  +P     + TD+GWGC+LR  QM++A   L++H 
Sbjct: 33  SREELEKALTD--TFLIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAH-FLWVH- 88

Query: 66  GRDWQWNVNSKEEAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
           GR              + L +F D    TAP+SIH +  +  +      E++ P+   + 
Sbjct: 89  GRPAD-----------RKLSLFFDHSAETAPFSIHNMIRSVWNRRVFKAEYWSPSQGCEA 137

Query: 124 LRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI--- 180
           +++                              T + A    Q Q  V+V+    G    
Sbjct: 138 IKR------------------------------TVQGAVKTEQLQTRVMVVTSTNGCIYA 167

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
            ++   +  G      L    V    ++   +Y +Q  +     PQ LGV+GG P  + Y
Sbjct: 168 DEVQHTFKQGADVVLVLASVRVSAAAQLTQESY-LQIEKL-MEQPQCLGVVGGVPGRSYY 225

Query: 241 FIGYVGNDVIFLDPH 255
           F  +    + +LDPH
Sbjct: 226 FFAHNQTQLFYLDPH 240


>gi|123497568|ref|XP_001327207.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
 gi|121910133|gb|EAY14984.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
           vaginalis G3]
          Length = 296

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 57/247 (23%), Positives = 101/247 (40%), Gaps = 49/247 (19%)

Query: 25  FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL-FLHLGRDWQWNVNSKEEAYLKI 83
           FTYR  F  I    +T+D GWGC  R  Q +IA   L +  +  ++ + V ++    + +
Sbjct: 30  FTYRCNFQAIQPGNITSDSGWGCCYRSAQGLIASYFLNYAPVDAEYFFTVFNE----IPM 85

Query: 84  LKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
             +FEDR   P+SI  +       G   G W  P+ +A  +  + K    S +   ++ D
Sbjct: 86  FSLFEDRVEMPFSIQNLVYRSELFGVKPGTWAKPSQLAATIESIFKDLKLSVL---ISKD 142

Query: 144 NTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVY 203
           + ++   VK +     RA              + LG++D+   +I  IK           
Sbjct: 143 SNIIPEDVKTM-----RAPFLLLIP-------ILLGMKDVEQKFIPFIK----------- 179

Query: 204 DMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN-DVIFLDPH-TNQNIG 261
                           Y F  P+ LG + G  + + + +G   + +V++ DPH T Q + 
Sbjct: 180 ----------------YTFQRPEFLGAVSGSSDFSYFLVGLSEDQNVVYFDPHVTKQAVA 223

Query: 262 CVYDKEQ 268
             +D  +
Sbjct: 224 SSFDHSE 230


>gi|389602150|ref|XP_001566661.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|322505338|emb|CAM40177.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 398

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 58/250 (23%), Positives = 99/250 (39%), Gaps = 52/250 (20%)

Query: 23  LWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAY 80
           L FTYR GF  +P     + TD+GWGC+LR  QM++A  L        W +   +     
Sbjct: 46  LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFL--------WAYGRPAD---- 93

Query: 81  LKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVF 138
            + L +F D    TAP+SIH +  +  ++     E++ P+   + +++            
Sbjct: 94  -RRLALFFDHSAETAPFSIHNLIRSVWNQRAFKAEYWSPSQGCEAIKR------------ 140

Query: 139 HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLG---IQDINPVYINGIKKCY 195
                             T + A    Q Q  V V+    G     +++  +  G +   
Sbjct: 141 ------------------TMQDAIKTEQLQTRVTVVTSTNGCVYADEVHHTFKQGAEVVL 182

Query: 196 ALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
            L    V    ++   +Y +Q  +     PQ LG++GG P  + YF  +    + +LDPH
Sbjct: 183 VLASVRVSAAAQLTQESY-LQIEKL-MEQPQCLGIVGGVPGRSYYFFAHNQTQLFYLDPH 240

Query: 256 TNQNIGCVYD 265
                  + D
Sbjct: 241 QRTTAALLSD 250


>gi|440301471|gb|ELP93857.1| hypothetical protein EIN_176840 [Entamoeba invadens IP1]
          Length = 362

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 59/247 (23%), Positives = 102/247 (41%), Gaps = 50/247 (20%)

Query: 19  ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN---S 75
           +++ LW TYR G+  + +S L TD GWGC +R  QM+I+ A+  L    D   +      
Sbjct: 75  MSNLLWMTYRSGYEKLPNSSLNTDVGWGCTIRAVQMMISNAMQTLVYKHDLTSSTTPYIP 134

Query: 76  KEEAYLKILKMFED--RRTAPYSIHQIALTG--ASEGKAVGEWFGPNTVAQVLRKLAKYD 131
           K+   L ++  F D   +T P SIH +  +     + K+   +  P  VA+    L   +
Sbjct: 135 KQNEILNVVIPFVDFFEQTTPLSIHHVYESRFVVEQNKSGVNYLAPTIVAKAYSDLV--N 192

Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
            W       AL   +  N    LC   K       ++P ++ +P+ +     + +  + +
Sbjct: 193 SWK----MCALRCVMASNTSIPLCDIKKEP-----FKPTLVFLPIIM-----DQLVKSRL 238

Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
           ++ Y                 +NM             G++ G  + A+Y  G+     +F
Sbjct: 239 QQIYK----------------FNMFA-----------GIVSGIGDRAVYIFGFHVMRCLF 271

Query: 252 LDPHTNQ 258
           LDPHT Q
Sbjct: 272 LDPHTVQ 278


>gi|118378680|ref|XP_001022514.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89304281|gb|EAS02269.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 371

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 56/259 (21%), Positives = 110/259 (42%), Gaps = 47/259 (18%)

Query: 13  EQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQAL---LFLHLGR 67
           E ++ +++S ++ +Y+K         + +TTD GWGC LR  QM++AQ L   L+    +
Sbjct: 48  ELLQEELSSLVFLSYKKNMKEFQYLSTTITTDNGWGCSLRTSQMMLAQGLKRHLYEKRVQ 107

Query: 68  DWQWNVNSKEEAYLKILKMF------EDRRTAPYSIHQIALTGASEGKA-VGEWFGPNTV 120
            + +N  +K + +  ++ MF      E+   +P+  H +     +  +  + + + P   
Sbjct: 108 SFIYNDKTKLD-FQHLIMMFAESNSLENMDQSPFGFHSLLTQAINLFQVPLKQQYTPVQG 166

Query: 121 AQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI 180
            + L++  K       +  +   +T V+ Q       + R       + L+L++  +LG 
Sbjct: 167 IKALKQQFKQQKLVKSL-KIVTSSTGVIFQ------EDIRQKMKNWEKSLLLILHFKLGT 219

Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
             +N +Y+  IK    L                              +G IGG  N +L+
Sbjct: 220 GKLNQIYVEQIKSLMDLEY---------------------------FVGAIGGIKNKSLF 252

Query: 241 FIGYVGNDVIFLDPHTNQN 259
            +GY+ +  + LDPH  QN
Sbjct: 253 MVGYMNDQFLSLDPHVQQN 271


>gi|340508254|gb|EGR34000.1| peptidase family c54 protein, putative [Ichthyophthirius
           multifiliis]
          Length = 209

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 63/126 (50%), Gaps = 11/126 (8%)

Query: 10  QDLEQIRRDITSRLWFTYRKGFVPIGDS-------GLTTDKGWGCMLRCGQMVIAQALLF 62
           ++ +++  +  + +W TYR+ F P+  +          +D GWGCM+R GQM +A+ L  
Sbjct: 17  KNCKKLIENFKNIIWMTYRRNFFPLLHNTKDHKIQNYISDTGWGCMVRVGQMALAEGLRH 76

Query: 63  LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAV-GEWFGPNTVA 121
            HL +   ++     +A+L     F D   APYSI +I      E + V G+W+ P  + 
Sbjct: 77  -HLQQKGIYDNKRIIQAFLD--NDFGDDNIAPYSIQKICKIAYKEFQLVPGQWYSPVRIC 133

Query: 122 QVLRKL 127
            VL  L
Sbjct: 134 HVLSLL 139


>gi|294953189|ref|XP_002787639.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239902663|gb|EER19435.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 341

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/109 (38%), Positives = 56/109 (51%), Gaps = 15/109 (13%)

Query: 25  FTYRKGFVPI-GDSGLTT--DKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYL 81
           FTYR  F PI G  G T+  DKGWGC +R  QM++AQA+     G+D   +V        
Sbjct: 69  FTYRCAFEPIEGCVGPTSVSDKGWGCAIRATQMLLAQAVKM--AGKDADDSV-------- 118

Query: 82  KILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAK 129
            +L +F D   AP S+H++   G     K  G WFGP +   V  +L K
Sbjct: 119 -VLSLFLDSPQAPLSLHRMVKMGQEVLAKRPGTWFGPTSGGMVASRLVK 166


>gi|154281231|ref|XP_001541428.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150411607|gb|EDN06995.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 463

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/138 (26%), Positives = 63/138 (45%), Gaps = 28/138 (20%)

Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
           A ++    P ++++ +RLGI  + PVY   +K                            
Sbjct: 249 AGTSTDVHPTLILLGIRLGIDRVTPVYWEALKAV-------------------------- 282

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK-EQDSEKKLDSTYH 279
              +PQS+G+ GG+P+ + YFIG   +   +LDPH  +     +D  ++    +  +TYH
Sbjct: 283 -LKYPQSVGIAGGRPSSSHYFIGAQASHFFYLDPHHTRPALAYHDAGDRPYTTEELNTYH 341

Query: 280 CPQASRLHILHMDPSIAV 297
             +  RLHI  MDPS+ +
Sbjct: 342 TRRLRRLHIKDMDPSMLI 359


>gi|342181415|emb|CCC90894.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma congolense
           IL3000]
          Length = 327

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 57/239 (23%), Positives = 101/239 (42%), Gaps = 53/239 (22%)

Query: 25  FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
           FTYRK F P+  S +TTDKGWGC+ R  QM++A AL   H+  D+ +             
Sbjct: 46  FTYRKDFEPLPRSVITTDKGWGCLARASQMLLACALR-RHMALDFSFQYFCD-------- 96

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVGEWF-----GPNTVAQVLRK-LAKYDDWSSIVF 138
              +D R AP+S+H +  +    G+ +   +     G   ++  +R+ + +    S +  
Sbjct: 97  --IDDERIAPFSLHCMVRSVLRPGEDLRPVYWTPSQGCEAISGCVRRAIHRGALHSQLRV 154

Query: 139 HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
            V     +  ++V +    +  A         ++++P+R G            +K +   
Sbjct: 155 VVGAAGAIPKHEVNRHLEDSGNA---------LILVPVRCGTTR------RMTQKMF--- 196

Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND-VIFLDPHT 256
                     LS  + + T       P  +G++GG P    Y IG  G + +++LDPH 
Sbjct: 197 ----------LSLEHLLLT-------PMCVGMVGGVPGRCYYIIGTGGQELLLYLDPHC 238


>gi|195350255|ref|XP_002041656.1| GM16787 [Drosophila sechellia]
 gi|194123429|gb|EDW45472.1| GM16787 [Drosophila sechellia]
          Length = 135

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 27/37 (72%), Positives = 30/37 (81%)

Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWG 46
          Q+LE IRRDI SRLW TYR GF P+G+  LTTDKGWG
Sbjct: 61 QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWG 97


>gi|167381603|ref|XP_001735783.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165902089|gb|EDR28003.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 359

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 66/285 (23%), Positives = 110/285 (38%), Gaps = 60/285 (21%)

Query: 19  ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL----HLGRDWQWNVN 74
           I++  W TYR G+  + +S LTTD GWGC +R  QM+IA A+  +     L       + 
Sbjct: 75  ISNLFWMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIP 134

Query: 75  SKEEAYLKILKMFED--RRTAPYSIHQIALTG--ASEGKAVGEWFGPNTVAQVLRKLAKY 130
           +KEE  + +L  F D    T P SIH +  +     + K+   +  P+ VA+    L   
Sbjct: 135 TKEEI-MNVLVPFIDSPNSTTPLSIHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLV-- 191

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
           + W                   KLC       SN       + IP               
Sbjct: 192 NSW-------------------KLCPIRCVMCSN-------VSIPTH------------- 212

Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
             +   LP  P    + I+ +       +  +      G++GG  + A++  G+     +
Sbjct: 213 --ELSKLPFKPTLVFLPIVLNHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGFHALQFL 270

Query: 251 FLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP-QASRLHILHMDPS 294
           +LDPH  Q           S  ++D+  + P   +R  +  +DP+
Sbjct: 271 YLDPHIVQ-------PSFKSFTEIDTKSYSPISTNRFSVHTIDPT 308


>gi|118378678|ref|XP_001022513.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
 gi|89304280|gb|EAS02268.1| Peptidase family C54 containing protein [Tetrahymena thermophila
           SB210]
          Length = 649

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 73/321 (22%), Positives = 123/321 (38%), Gaps = 83/321 (25%)

Query: 23  LWFTYRKGFVPIGD-----SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD----WQWNV 73
           +WF+YR  F  I D       ++ D GWGCM+RC QM++A+AL   +L        Q + 
Sbjct: 145 IWFSYRNNFPLIRDVADDNQSVSNDYGWGCMIRCSQMLLAEALKRHYLNDQNIQIEQLSQ 204

Query: 74  NSKEEAYLKILKMF------EDRRTAPYSIHQIA---LTGASEGKAVGEWFGPNTVAQ-- 122
           + ++  Y  I+K+F       D    P S   I    L        +   FG   + Q  
Sbjct: 205 DDEKHFYSNIIKLFLDCTSESDVLNQPGSYQDIQSKMLLNEQNLNNIYSLFGIQNICQSA 264

Query: 123 VLRKLAK-YDDW-SSIVFHVALDNTLVVNQ--------------------VKKLCTTNKR 160
           +LR+  +   +W +SI   V L   L  +Q                    +K+L   +++
Sbjct: 265 ILRQYQQNVKNWYTSIQVSVILQEILEESQSKLNSKLGFHILNFTDQIIFLKELEEASRK 324

Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
              N +   +++++ L+ GI      +     K Y + +  + + V              
Sbjct: 325 Q--NDRLNNILVMVHLKFGINKFEMQH-----KDYFIELLKIKNFV-------------- 363

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTY-- 278
                   G + G     +Y IG+  + +I LDPH  Q       K  + E+ LD  Y  
Sbjct: 364 --------GALSGTETKGMYIIGFQEDRLIVLDPHFIQ-------KSTEGEQGLDKDYCT 408

Query: 279 ---HCPQASRLHILHMDPSIA 296
                P++  L  L  D S+ 
Sbjct: 409 YFNKTPRSISLECLSSDISLG 429


>gi|209880175|ref|XP_002141527.1| peptidase family C54 [Cryptosporidium muris RN66]
 gi|209557133|gb|EEA07178.1| peptidase family C54, putative [Cryptosporidium muris RN66]
          Length = 353

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/136 (29%), Positives = 60/136 (44%), Gaps = 17/136 (12%)

Query: 4   ANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGD---------SGLTTDKGWGCMLRCGQM 54
            NK      +   +     + F+YR  F  I           S +TTD GWGCMLR  QM
Sbjct: 36  GNKYDADKYDSFLKSFHEIILFSYRYNFPTIRSEWDFSIETGSSVTTDLGWGCMLRVIQM 95

Query: 55  VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-EGKAVGE 113
            +A  LL     + + ++++        IL+ F+D   + +SIHQ    G S   K   +
Sbjct: 96  SLALGLLRYCKMKKYTYSLD-------YILQNFQDLEESLFSIHQFVKVGCSIFNKKPKD 148

Query: 114 WFGPNTVAQVLRKLAK 129
           WFGP + + +   L K
Sbjct: 149 WFGPTSASTIADYLVK 164


>gi|343472883|emb|CCD15086.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 327

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 60/235 (25%), Positives = 98/235 (41%), Gaps = 47/235 (20%)

Query: 25  FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
           FTYRK F P+  S +TTDKGWGC+ R  QM++A AL   H+  D+ +             
Sbjct: 46  FTYRKDFEPLPRSVITTDKGWGCLARASQMLLACALR-RHMTLDFSFQYFCD-------- 96

Query: 85  KMFEDRRTAPYSIHQIALTGASEGKAVGE-WFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
              +D R AP+S+H +  +    G+ +   ++ P+   + +    +     S +   AL 
Sbjct: 97  --IDDERIAPFSLHCMVRSVLRPGEDLRPVYWTPSQGCEAISGCVR-----SAIHRGALH 149

Query: 144 NTL--VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISP 201
           + L  VV     +               L+LV P+R G            +K +      
Sbjct: 150 SQLRVVVGAAGAIPKHEVNRHLEDSGNALILV-PVRCGTTR------RMTQKMF------ 196

Query: 202 VYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND-VIFLDPH 255
                  LS  + + T       P  +G++GG P    Y +G  G + +++LDPH
Sbjct: 197 -------LSLEHLLLT-------PMCVGMVGGVPGRCYYIVGTGGQELLLYLDPH 237


>gi|66359342|ref|XP_626849.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
 gi|46228139|gb|EAK89038.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
          Length = 348

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 68/151 (45%), Gaps = 22/151 (14%)

Query: 5   NKLSHQDLEQIRRDITSRLWFTYRKGFVPI------------GDSGLTTDKGWGCMLRCG 52
           NK + ++ +   ++    + FTYR  F  I                + +D GWGCM R  
Sbjct: 34  NKFAPEEKKYFLKEFHDIILFTYRNEFKNIIITRNTVQLTKNYSKNINSDVGWGCMYRVT 93

Query: 53  QMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAV 111
           QM IA  +      + +  N+N +     KIL  F+D  +A +SIH +   G SE G   
Sbjct: 94  QMSIAHGI--CQFMKRFLGNLNIE-----KILNNFQDNESAKFSIHNMVNIGLSEFGIDP 146

Query: 112 GEWFGPNTVAQVLRKLAKYDDWSSIVFHVAL 142
             W GP T + +  KL   +D  SI+ ++ +
Sbjct: 147 TSWIGPTTSSMIANKLI--NDNRSIISNIQI 175


>gi|457866467|dbj|BAM93578.1| autophagy related protein 4 [Vigna unguiculata]
          Length = 219

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 44/86 (51%), Gaps = 8/86 (9%)

Query: 218 PRY------EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
           PRY       F FPQSLG++GGKP  + Y IG       +LDPH  Q +  +    Q  E
Sbjct: 57  PRYIPLLHSTFKFPQSLGIMGGKPGASTYIIGVQSEKAFYLDPHDVQTVVNISGDTQ--E 114

Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
               S+YHC     + +  +DPS+A+
Sbjct: 115 PNSTSSYHCNVMRHIPLDSIDPSLAI 140


>gi|183230042|ref|XP_653798.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|169803042|gb|EAL48412.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449708555|gb|EMD47997.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 359

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 65/285 (22%), Positives = 110/285 (38%), Gaps = 60/285 (21%)

Query: 19  ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL----HLGRDWQWNVN 74
           I++  W TYR G+  + +S LTTD GWGC +R  QM+IA A+  +     L       + 
Sbjct: 75  ISNLFWMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIP 134

Query: 75  SKEEAYLKILKMFED--RRTAPYSIHQIALTG--ASEGKAVGEWFGPNTVAQVLRKLAKY 130
           +K+E  + +L  F D    T P SIH +  +     + K+   +  P+ VA+    L   
Sbjct: 135 TKQEV-MNVLIPFIDSPNSTTPLSIHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLV-- 191

Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
           + W                   KLC       SN       + IP               
Sbjct: 192 NSW-------------------KLCPIRCVMCSN-------VSIPTH------------- 212

Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
             +   LP  P    + I+ +       +  +      G++GG  + A++  G+     +
Sbjct: 213 --ELSKLPFKPTLVFLPIVLNHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGFHALQFL 270

Query: 251 FLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RLHILHMDPS 294
           +LDPH  Q           S  ++D+  + P  + R  +  +DP+
Sbjct: 271 YLDPHIVQ-------PSFKSFTEIDTKSYSPIGTNRFSVHTIDPT 308


>gi|50291183|ref|XP_448024.1| hypothetical protein [Candida glabrata CBS 138]
 gi|62899752|sp|Q6FP20.1|ATG4_CANGA RecName: Full=Probable cysteine protease ATG4; AltName:
           Full=Autophagy-related protein 4
 gi|49527335|emb|CAG60975.1| unnamed protein product [Candida glabrata]
          Length = 483

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 62/147 (42%), Gaps = 31/147 (21%)

Query: 13  EQIRRDITSRLWFTYRKGFVPI-----GDSGLT------------------------TDK 43
           E+I   I S+L FTYR  F PI     G S +                         TD 
Sbjct: 78  EEILNAIRSKLNFTYRTNFEPIERAPDGPSPINPLIMLRINPIDAIENVFNNRECFFTDV 137

Query: 44  GWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALT 103
           GWGCM+R GQ ++  AL  +      Q  +   ++   +I  +F+D   + +S+      
Sbjct: 138 GWGCMIRTGQSLLGNALQRVKSTVKDQPYIYEMDDTK-EITDLFKDNTKSAFSLQNFVKC 196

Query: 104 GASEGK-AVGEWFGPNTVAQVLRKLAK 129
           G    K A GEWFGP T A  +R L +
Sbjct: 197 GRIYNKIAPGEWFGPATTATCIRYLIQ 223



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 9/71 (12%)

Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS 284
           P S+G+ GG+P+ +LYF GY  + ++F DPH +Q    + D         D + H     
Sbjct: 287 PFSVGIAGGRPSSSLYFFGYFDDTLLFFDPHNSQ-TALIDD--------FDESCHTENFG 337

Query: 285 RLHILHMDPSI 295
           +L+   +DPS+
Sbjct: 338 KLNFSDLDPSM 348


>gi|312381461|gb|EFR27207.1| hypothetical protein AND_06241 [Anopheles darlingi]
          Length = 307

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 30/46 (65%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCG 52
           +  +D+E  RRD  SR+W TYR+ F  + DS  T+D GWGCM+  G
Sbjct: 190 VEEEDIEAFRRDFVSRIWMTYRREFQTMDDSNYTSDCGWGCMIPAG 235


>gi|440300801|gb|ELP93248.1| hypothetical protein EIN_056230 [Entamoeba invadens IP1]
          Length = 321

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/238 (23%), Positives = 94/238 (39%), Gaps = 55/238 (23%)

Query: 26  TYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK--I 83
           TYR+ +  +G + LT+D GWGC +R  QM++  +++ ++L + +     S +   +K   
Sbjct: 67  TYRQKYATLGHTYLTSDAGWGCAIRSVQMLLVNSIV-VYLDKSFHPEYTSHDHIAIKNNA 125

Query: 84  LKMFEDRRTAPYSIHQIALTGA--SEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVA 141
            ++  D+ ++  SIH I +  A          +  P+T A  +  L  Y+ W    F V 
Sbjct: 126 KQLVFDKESSVLSIHNIYIQDAIIKHNPTGTNFLPPSTCATAVADL--YNFWEKRTFDVL 183

Query: 142 LDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISP 201
                       +CT      +    QP +L IP  +   + N +               
Sbjct: 184 ------------MCTEYIPEVT----QPTLLFIPRIVTKSERNFI--------------- 212

Query: 202 VYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
                         QT  +    PQS G + G  + A+Y  G     V FLDPH  Q+
Sbjct: 213 --------------QTTSF---LPQSRGFVAGIGDAAIYCFGVQEKRVFFLDPHFVQD 253


>gi|407037201|gb|EKE38550.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 193

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 57/112 (50%), Gaps = 9/112 (8%)

Query: 19  ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL----HLGRDWQWNVN 74
           I++  W TYR G+  + +S LTTD GWGC +R  QM+IA A+  +     L       + 
Sbjct: 75  ISNLFWMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIP 134

Query: 75  SKEEAYLKILKMFED--RRTAPYSIHQIALTG--ASEGKAVGEWFGPNTVAQ 122
           +K+E  + +L  F D    T P SIH +  +     + K+   +  P+ VA+
Sbjct: 135 TKQEV-MNVLIPFIDSPNSTTPLSIHHVYESRFVVEKNKSGVNYLAPSVVAK 185


>gi|403222100|dbj|BAM40232.1| autophagy-related peptidase [Theileria orientalis strain Shintoku]
          Length = 351

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 65/232 (28%), Positives = 98/232 (42%), Gaps = 40/232 (17%)

Query: 41  TDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN---VNSKEEAYLKILKMFEDRRTAPYSI 97
           TDKGWGC++R  QM +AQAL+ L +G ++      VN+K              RT     
Sbjct: 65  TDKGWGCVVRSTQMALAQALINLIIGPEFSMEDLLVNNKSP------------RTGHLDA 112

Query: 98  HQIALTGASEGKAVGEWFGPNTVAQVLRKLA-----KYDDWSSIVFHVALDNTLVVNQVK 152
             ++L G      + +     + A  L KL+      YDD SS+    +L N +V + V 
Sbjct: 113 KLLSLDG------LQQLLTEESHADELTKLSIILSQFYDDKSSL---FSLYNFIVADLVL 163

Query: 153 KLCTTNKRASSNPQWQPLVL---VIPLRLGIQDIN----PVYINGIKKCYAL-PISPVYD 204
           K CT  K  S  P    + +   +    + I  I+      YIN +K  +       V+ 
Sbjct: 164 KTCT--KFLSFGPTSTAVCISKVINDANIAISSISFPDGVFYINKVKDLFEKNKYLLVWV 221

Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFI-GYVGNDVIFLDPH 255
            +K     Y  +T R  F   Q  G++GG   H  Y+I G     + + DPH
Sbjct: 222 SMKKKLDKYEKETVRSLFKLKQFNGIVGGNLLHRSYYIFGTSSKRLYYNDPH 273


>gi|238595999|ref|XP_002393933.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
 gi|215462138|gb|EEB94863.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
          Length = 158

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 41/78 (52%), Gaps = 27/78 (34%)

Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
           LG+  +NP+Y               YD +KIL            +TFPQS+G+ GG+P+ 
Sbjct: 1   LGLDGVNPIY---------------YDTIKIL------------YTFPQSVGIAGGRPSS 33

Query: 238 ALYFIGYVGNDVIFLDPH 255
           + YF+G   +++ +LDPH
Sbjct: 34  SYYFVGSQADNLFYLDPH 51


>gi|156085180|ref|XP_001610073.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797325|gb|EDO06505.1| hypothetical protein BBOV_II005540 [Babesia bovis]
          Length = 206

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 6/78 (7%)

Query: 37  SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYS 96
           + + TD+GWGC LR  QM +A+AL      RD    +++ +E   +IL++F D   AP+S
Sbjct: 69  TDIKTDRGWGCALRATQMALAEAL------RDVLSPLDNVQEQRSRILQLFYDTTEAPFS 122

Query: 97  IHQIALTGASEGKAVGEW 114
           +  + +     G  V  W
Sbjct: 123 LENLVMADVEHGANVVAW 140



 Score = 40.4 bits (93), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 3/51 (5%)

Query: 207 KILSSTYNMQTPRYEFTFPQSLGVIGGKPN--HALYFIGYVGNDVIFLDPH 255
           K LS + N +  +Y FT P   G++G K +   A YF+G+ GN  ++LDPH
Sbjct: 145 KELSESQN-ECLKYLFTLPWFKGMVGAKKDKQRAYYFVGHHGNQALYLDPH 194


>gi|426336111|ref|XP_004029547.1| PREDICTED: uncharacterized protein LOC101129491 [Gorilla gorilla
           gorilla]
          Length = 351

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 42/71 (59%), Gaps = 1/71 (1%)

Query: 76  KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWS 134
           +E  + +I+  F D   AP+ +H++   G S GK  G+W+GP+ VA +LRK +    + +
Sbjct: 50  QERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVT 109

Query: 135 SIVFHVALDNT 145
            +V +V+ D T
Sbjct: 110 RLVVYVSQDCT 120


>gi|84994978|ref|XP_952211.1| autophagy-related peptidase [Theileria annulata strain Ankara]
 gi|65302372|emb|CAI74479.1| autophagy-related peptidase, putative [Theileria annulata]
          Length = 350

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 64/241 (26%), Positives = 98/241 (40%), Gaps = 46/241 (19%)

Query: 35  GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAP 94
           G   + +DKGWGC+LR  QM I+QALL L LG ++              ++  E R   P
Sbjct: 58  GIVTIDSDKGWGCVLRSTQMAISQALLNLVLGPEFS-------------VEQLEIRNRTP 104

Query: 95  YS--IHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
            +  I Q  L   +  K +      + V+ V   LA++ D  + VF +   N ++ + V 
Sbjct: 105 RNRKIDQSLLNIDTFEKLLNGLLDLDGVSAVSVILAQFYDDLNAVFSIY--NFVIADYVL 162

Query: 153 KLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALP--ISPVYDMVKILS 210
           K CT          + P    +     I D+N + IN I    A P  +  + D+ +IL 
Sbjct: 163 KTCT------KFLHFGPTSAALCASKIINDLN-LPINSI----AFPDGVFHISDVREILE 211

Query: 211 STYNM---------------QTPRYEFTFPQSLGVIGGKP-NHALYFIGYVGNDVIFLDP 254
              N+               +  R  F   Q  G+IGG   N + Y  G     + + DP
Sbjct: 212 EKRNLLVWVSNKKKLDRIERECVRSMFRLSQFNGIIGGNLFNKSYYIFGTTNKRLYYNDP 271

Query: 255 H 255
           H
Sbjct: 272 H 272


>gi|224010768|ref|XP_002294341.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969836|gb|EED88175.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 658

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 52/215 (24%), Positives = 82/215 (38%), Gaps = 74/215 (34%)

Query: 95  YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY------------------------ 130
           YS+HQ+   G   G   GEW+GP T   VLR+L +                         
Sbjct: 243 YSLHQMVAAGLGLGVLPGEWYGPTTACHVLRELNEIHCGCRERVAEVLKRRRKGGDKGDI 302

Query: 131 -------DD--WSSIVF--HVALDNTLVVNQVKKLCTTNKRA----------SSNPQWQP 169
                  DD  ++  VF  H+A +  + ++ + KL T++ ++            N     
Sbjct: 303 DEHNHVGDDSQYTCDVFRVHIATEGCIYLDAISKLMTSSNQSLQTESNDAPIQHNTDSAA 362

Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ------------- 216
            V+  PL L  +  +P+         A   +   D  +IL+  ++               
Sbjct: 363 NVIDHPLSLPEEVFDPLR--------AQVTTQSSDKEQILNQQWDTSLLLLLPLRLGIQS 414

Query: 217 --TPRYEFT------FPQSLGVIGGKPNHALYFIG 243
             TP Y  T      FPQS+G++GG P HAL+F G
Sbjct: 415 IPTPTYGSTLAKLLSFPQSVGMLGGTPRHALWFYG 449



 Score = 44.3 bits (103), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 5/42 (11%)

Query: 24  WFTYRKGF-VPI----GDSGLTTDKGWGCMLRCGQMVIAQAL 60
           W TYR    VP+    G  GL +D GWGCMLR  QM++AQ +
Sbjct: 113 WLTYRSDLTVPLRPYNGGVGLKSDAGWGCMLRSAQMMMAQTV 154


>gi|14043289|gb|AAH07639.1| ATG4D protein [Homo sapiens]
 gi|16877152|gb|AAH16845.1| ATG4D protein [Homo sapiens]
 gi|119604522|gb|EAW84116.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_a
           [Homo sapiens]
 gi|325464017|gb|ADZ15779.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
           construct]
          Length = 141

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 5/67 (7%)

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILH 290
           +GGKP H+LYFIGY  + +++LDPH  Q      D  Q ++  L+S +HC    ++    
Sbjct: 1   MGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQ-ADFPLES-FHCTSPRKMAFAK 55

Query: 291 MDPSIAV 297
           MDPS  V
Sbjct: 56  MDPSCTV 62


>gi|408392897|gb|EKJ72185.1| hypothetical protein FPSE_07642 [Fusarium pseudograminearum CS3096]
          Length = 389

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 34/73 (46%), Gaps = 23/73 (31%)

Query: 18  DITSRLWFTYRKGFVPI----------------------GD-SGLTTDKGWGCMLRCGQM 54
           D  S++W TYR  F PI                      GD S  ++D GWGCM+R GQ 
Sbjct: 120 DFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQLGDQSPFSSDSGWGCMIRSGQS 179

Query: 55  VIAQALLFLHLGR 67
           ++A  +  + LGR
Sbjct: 180 MLANTIAMVRLGR 192



 Score = 41.6 bits (96), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 40/68 (58%), Gaps = 6/68 (8%)

Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK---EQDSEKKLDSTYHCPQASRLHIL 289
           G+P+ + YFIG  G+ + +LDPH +  +   Y +   E  SE+   ++ H P+  R+H+ 
Sbjct: 262 GRPSSSHYFIGAQGSFLFYLDPH-HTRVALPYREDPIEYTSEEI--ASCHTPRLRRIHVR 318

Query: 290 HMDPSIAV 297
            MDPS+ +
Sbjct: 319 EMDPSMLI 326


>gi|395756856|ref|XP_002834509.2| PREDICTED: cysteine protease ATG4D-like [Pongo abelii]
          Length = 141

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 35/67 (52%), Gaps = 5/67 (7%)

Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILH 290
           +GGKP H+LYFIGY  + +++LDPH  Q    V       E     ++HC    ++    
Sbjct: 1   MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE-----SFHCTSPRKMAFAK 55

Query: 291 MDPSIAV 297
           MDPS  V
Sbjct: 56  MDPSCTV 62


>gi|302833489|ref|XP_002948308.1| autophagy protein [Volvox carteri f. nagariensis]
 gi|300266528|gb|EFJ50715.1| autophagy protein [Volvox carteri f. nagariensis]
          Length = 391

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI---GCVYDKEQDSEKKLD--- 275
            T+PQS+G++GG+P+ +LY  G   +  +FLDPH  Q     G   D     E       
Sbjct: 232 LTWPQSVGIVGGRPSASLYLCGVQDSSFLFLDPHEAQPTVRWGIAGDAGHTKEAGNGGSA 291

Query: 276 --------STYHCPQASRLHILHMDPSIAV 297
                   +TY C     +    +DPS+A+
Sbjct: 292 VVLPASSLATYFCDTVRLMPATALDPSMAI 321


>gi|407043625|gb|EKE42056.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 183

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/96 (26%), Positives = 48/96 (50%), Gaps = 2/96 (2%)

Query: 7   LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL--H 64
           L+H         I   +  TYR+ +  +G++ L++D GWGC +R  QM++  AL+     
Sbjct: 51  LNHLTFNDANLKIHDLIVATYRQKYSYLGNTYLSSDAGWGCAIRATQMMVVNALVIFKDQ 110

Query: 65  LGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI 100
           + +   +N    ++   +  ++  DR ++  SIH I
Sbjct: 111 MQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHNI 146


>gi|340500608|gb|EGR27474.1| peptidase family c54 protein, putative [Ichthyophthirius
           multifiliis]
          Length = 384

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 86/214 (40%), Gaps = 43/214 (20%)

Query: 114 WFGPNTVAQVLRKL-------------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKR 160
           W+ PN +  +L KL              KY     ++F   L   +V +Q  KLC  N  
Sbjct: 86  WYDPNRICFILEKLYNFSSIKGTENLKFKYFSNHKLIFFEDLIKLMVDSQA-KLCNQNIH 144

Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYING----------IKKCYALPISPVYDMVKILS 210
              N Q Q L L       I+D   V               KKC+    S +   +  L+
Sbjct: 145 ---NEQQQNLDLNNNSSQLIEDSFEVITKSSKQNTLDNLICKKCHQSDKSLLI-FISCLT 200

Query: 211 STYNMQTPRYEFTFPQ------SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
           +T  +   + +           S+G+IGG P  A YF+G + ND I+LDPH  Q      
Sbjct: 201 NTNKISNKKQQEVVISLLKNQFSIGMIGGVPGKAYYFLGIIDNDFIYLDPHYIQ------ 254

Query: 265 DKEQDSEKKLDS--TYHCPQASRLHILHMDPSIA 296
            +   +EK + +  TY C   +R+    ++ S+A
Sbjct: 255 -EAHQNEKTVQNIDTYFCKFINRVSQKKLESSLA 287


>gi|167391747|ref|XP_001739914.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165896205|gb|EDR23684.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 325

 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 25/99 (25%), Positives = 49/99 (49%), Gaps = 2/99 (2%)

Query: 6   KLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL-- 63
            L+H         I   +  TYR+ +  +G++ L++D GWGC +R  QM+I   L+    
Sbjct: 50  NLNHLTFNDANIKIHDLIVATYRQKYSCLGNTYLSSDAGWGCAIRATQMMIVNTLVIFKD 109

Query: 64  HLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIAL 102
            + +   +N    ++  L+  ++  D+ ++  SIH I +
Sbjct: 110 QMQQIIDYNSFEHQQNKLQAKELIYDKISSLLSIHNIYI 148



 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 22/41 (53%)

Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIG 261
             T  QS G +GG    A++  GY G  + FLDPH  QN G
Sbjct: 219 SLTLSQSRGFVGGIGESAIFVFGYQGTTLFFLDPHYVQNAG 259


>gi|71030858|ref|XP_765071.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68352027|gb|EAN32788.1| hypothetical protein TP02_0505 [Theileria parva]
          Length = 215

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 58/120 (48%), Gaps = 17/120 (14%)

Query: 39  LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYS-- 96
           + TDKGWGC+LR  QM I+QAL+ L LG ++              ++  E R  +P +  
Sbjct: 62  IDTDKGWGCVLRSTQMAISQALMNLVLGPEFS-------------VEQLEIRNRSPRNKK 108

Query: 97  IHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
           I +  L   +  K +      + ++ V   LA++ D  + VF +   N ++ + V K CT
Sbjct: 109 IDESILNLDTFEKLINGVVDLDEISAVSVILAQFYDDLNAVFSIY--NFIIADYVLKTCT 166


>gi|67470848|ref|XP_651386.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|56468115|gb|EAL46000.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
          Length = 325

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 2/97 (2%)

Query: 6   KLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL-- 63
            L+H         I   +  TYR+ +  +G++ L++D GWGC +R  QM+I  AL+    
Sbjct: 50  NLNHLTFNDANIKIHDLIVATYRQKYSYLGNTYLSSDAGWGCAIRATQMMIVNALVIFKD 109

Query: 64  HLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI 100
            + +   +N    ++   +  ++  DR ++  SIH I
Sbjct: 110 QMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHNI 146


>gi|297601024|ref|NP_001050279.2| Os03g0391000 [Oryza sativa Japonica Group]
 gi|255674556|dbj|BAF12193.2| Os03g0391000, partial [Oryza sativa Japonica Group]
          Length = 81

 Score = 45.8 bits (107), Expect = 0.026,   Method: Composition-based stats.
 Identities = 19/43 (44%), Positives = 26/43 (60%)

Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
            TFPQSLG++GGKP  + Y  G   +  ++LDPH  Q +   Y
Sbjct: 19  LTFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLVKLYY 61


>gi|167386236|ref|XP_001737678.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165899448|gb|EDR26037.1| hypothetical protein EDI_014170 [Entamoeba dispar SAW760]
          Length = 346

 Score = 44.7 bits (104), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 41/153 (26%), Positives = 62/153 (40%), Gaps = 29/153 (18%)

Query: 15  IRRDITSRLWFTYRKGFV-PIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           I + +++    TYR GF   +    LTTD GWGC LR  QM+   +L+ L      + N 
Sbjct: 62  IAKHLSTMFRVTYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLIRLQ-----EPNP 116

Query: 74  NSKEEAYLKILKMF-----EDRRT---------------APYSIHQIALTGASEGKAVGE 113
              E+A  K+ K F     E+RR                + Y +  + +   +  K    
Sbjct: 117 GFGEDAAEKVQKNFIIHSMEERREYVQLIEDTPKQEAVLSLYKMFNLKIVRQNNQKGTN- 175

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTL 146
           +  P+T A  L +L +   W     HV   NT 
Sbjct: 176 YLSPSTCAIALSQLVEI--WDQRPCHVIYSNTF 206


>gi|429327650|gb|AFZ79410.1| hypothetical protein BEWA_022580 [Babesia equi]
          Length = 385

 Score = 44.7 bits (104), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 71/290 (24%), Positives = 109/290 (37%), Gaps = 62/290 (21%)

Query: 11  DLEQIRRDITSRLWFTYR-----KGFVPIG--------------------DSGLTTDKGW 45
           +  + +R IT  L FTYR     K   PIG                       + TDKGW
Sbjct: 46  EYRKFKRKITGILLFTYRSDLNYKVAKPIGLIKREHVIGIFKPFNVCLPSIQTIDTDKGW 105

Query: 46  GCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA 105
           GC++R  QM +AQ L+ L LG ++      KE           D      S  +   +G 
Sbjct: 106 GCVIRATQMALAQTLISLILGDNFDIYSILKENT-------LPDSSGGAPSHRRSDRSGK 158

Query: 106 SEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFH--------VALDNTLVVNQVKKLCTT 157
           SE          N +    +   KYD + SI+           +L   ++ + V K CT 
Sbjct: 159 SEN-------FDNIITDGYQN--KYDAFCSILSQFYDSRESKFSLYKFIIADSVLKTCT- 208

Query: 158 NKRASSNPQWQPLVL-------VIPLRLGIQDINPVYINGIKKCYALPISPV--YDMVKI 208
            K  S  P    + +        IPL+         YIN + K +    + +    + K 
Sbjct: 209 -KFLSFGPTSSAICVNKMINDANIPLKSIAFPDGVFYINEVYKGFNKNRNVIVWLSLNKK 267

Query: 209 LSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFI-GYVGNDVIFLDPHTN 257
           L     +   R  F   Q  G++GG  N+  Y+I G   + + ++DPH +
Sbjct: 268 LDKNEKVAV-RSLFLLKQFNGIVGGNMNNRAYYICGCSSSRLYYVDPHVS 316


>gi|407038566|gb|EKE39191.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
          Length = 346

 Score = 43.5 bits (101), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 39/153 (25%), Positives = 62/153 (40%), Gaps = 29/153 (18%)

Query: 15  IRRDITSRLWFTYRKGFV-PIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
           + + +++    TYR GF   +    LTTD GWGC LR  QM+   +L+ L      + N 
Sbjct: 62  VAKHLSTMFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLIRLQ-----EPNP 116

Query: 74  NSKEEAYLKILKMF-----EDRRT---------------APYSIHQIALTGASEGKAVGE 113
              E+A  K+ + F     E+RR                + Y +  + +   +  K    
Sbjct: 117 GFGEDAAEKVQRNFIIHSMEERREYVQLIEDTPKQEAVLSLYKMFNLKIVRQNNQKGTN- 175

Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTL 146
           +  P+T A  L +L +   W     HV   NT 
Sbjct: 176 YLSPSTCAIALSQLVEM--WDQRPCHVIYSNTF 206


>gi|294877403|ref|XP_002767983.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
 gi|239870083|gb|EER00701.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
          Length = 133

 Score = 42.7 bits (99), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 28/41 (68%), Gaps = 3/41 (7%)

Query: 23  LWFTYRKGFVPI-GDSGLTT--DKGWGCMLRCGQMVIAQAL 60
           + FTYR  F PI G  G T+  DKGWGC +R  QM++AQA+
Sbjct: 67  ILFTYRCAFEPIEGCVGPTSVSDKGWGCAIRATQMLLAQAV 107


>gi|183234005|ref|XP_652043.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|169801304|gb|EAL46674.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
 gi|449707706|gb|EMD47317.1| peptidase C54 family protein [Entamoeba histolytica KU27]
          Length = 346

 Score = 42.7 bits (99), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 67/148 (45%), Gaps = 19/148 (12%)

Query: 15  IRRDITSRLWFTYRKGFV-PIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH-----LGRD 68
           I + +++    TYR GF   +    LTTD GWGC LR  QM+   +L+ L       G D
Sbjct: 62  IAKHLSTLFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLIRLQEPNPGFGDD 121

Query: 69  ------WQWNVNSKEEAYLKILKMFED--RRTAPYSIHQI-ALTGASEGKAVG-EWFGPN 118
                   + ++S EE   + +++ ED  ++ A  S++++  L    +    G  +  P+
Sbjct: 122 AAEKVQQNFIIHSMEERR-EYVQLIEDTPKQEAVLSLYKMFNLKIVRQNNQKGTNYLSPS 180

Query: 119 TVAQVLRKLAKYDDWSSIVFHVALDNTL 146
           T A  L +L +   W     HV   NT 
Sbjct: 181 TCAIALSQLVEM--WDQRPCHVIYSNTF 206


>gi|440292697|gb|ELP85881.1| hypothetical protein EIN_133850 [Entamoeba invadens IP1]
          Length = 348

 Score = 42.7 bits (99), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 64/151 (42%), Gaps = 17/151 (11%)

Query: 11  DLEQIRRDITSRLWFTYRKGFV-PIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH----- 64
           D  QI + +++    TYR GF   +    LTTD GWGC +R  QM+   +L+ +      
Sbjct: 60  DGSQIAKHLSTLFKVTYRNGFTYHLPHCSLTTDAGWGCTIRSVQMLFLNSLIRIQEPDPG 119

Query: 65  LGRDWQWNVNS-----KEEAYLKILKMFED--RRTAPYSIHQI-ALTGASEGKAVG-EWF 115
             +D Q  +         +   + +++ ED  R+ A  SIH++  L    +    G  + 
Sbjct: 120 FDKDSQTKMKKGFLVHPMDVRREYVQLIEDTPRKEAVLSIHKMFDLEVVRKNNQKGTNYL 179

Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTL 146
            P+T A  +  L   + W     HV    T 
Sbjct: 180 SPSTCATAISVLM--EQWDERPCHVMFVQTF 208


>gi|422293936|gb|EKU21236.1| cysteine protease family, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 91

 Score = 41.2 bits (95), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 30/54 (55%), Gaps = 8/54 (14%)

Query: 80  YLKILKMFEDRRTAP-----YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
           Y ++L  F D   AP     +S+H +   G S  K  GEW+GP TVA +LR LA
Sbjct: 23  YCQLLDSFVD---APGPNHVFSVHNMVQIGMSYDKLPGEWYGPTTVAYILRDLA 73


>gi|422295376|gb|EKU22675.1| cysteine protease family, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 96

 Score = 40.8 bits (94), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 30/54 (55%), Gaps = 8/54 (14%)

Query: 80  YLKILKMFEDRRTAP-----YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
           Y ++L  F D   AP     +S+H +   G S  K  GEW+GP TVA +LR LA
Sbjct: 23  YCQLLDSFVD---APGPNHVFSVHNMVQIGMSYDKLPGEWYGPTTVAYILRDLA 73


>gi|401403014|ref|XP_003881388.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325115800|emb|CBZ51355.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 3465

 Score = 40.4 bits (93), Expect = 1.00,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 34/71 (47%), Gaps = 17/71 (23%)

Query: 13   EQIRRDITSRLWFTYRKGFVPI----GDS-------------GLTTDKGWGCMLRCGQMV 55
            +Q+ + + S   FTYR GF P+    G+               + +D GWGC +R  QM+
Sbjct: 941  QQLSQTVGSIARFTYRSGFSPMYKCCGEKKRRAGGGFEREWIAINSDVGWGCTVRAAQML 1000

Query: 56   IAQALLFLHLG 66
            + QAL    LG
Sbjct: 1001 LMQALRRHFLG 1011


>gi|148682816|gb|EDL14763.1| mCG116861, isoform CRA_a [Mus musculus]
          Length = 127

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 34/52 (65%), Gaps = 5/52 (9%)

Query: 247 NDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           +++IFLDPHT Q    +    ++S    D T+HC Q+  R+ IL++DPS+A+
Sbjct: 1   DELIFLDPHTTQTFVDI----EESGLVDDQTFHCLQSPQRMSILNLDPSVAL 48


>gi|392343434|ref|XP_003754884.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
           norvegicus]
 gi|392355909|ref|XP_003752169.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
           norvegicus]
          Length = 126

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 34/52 (65%), Gaps = 5/52 (9%)

Query: 247 NDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
           +++IFLDPHT Q     +   ++S    D T+HC Q+  R+ IL++DPS+A+
Sbjct: 1   DELIFLDPHTTQ----TFVDTEESGLVDDHTFHCLQSPQRMSILNLDPSVAL 48


>gi|171318466|ref|ZP_02907620.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
           ambifaria MEX-5]
 gi|171096332|gb|EDT41235.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
           ambifaria MEX-5]
          Length = 189

 Score = 38.1 bits (87), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 41/96 (42%), Gaps = 23/96 (23%)

Query: 46  GCMLRCGQMVIAQALLF-----LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ- 99
           G   +CG  V+   LLF     LHLG +W  +V  KE          E  RT PY + + 
Sbjct: 71  GLQAQCGLAVLVAGLLFSVWARLHLGTNWSVSVTLKEN--------HELVRTGPYGLVRH 122

Query: 100 -------IALTGASEGKAVGEWFGPNTVAQVLRKLA 128
                  IAL GA+     GEW G   VA V   LA
Sbjct: 123 PIYTGCLIALAGAA--LIGGEWRGALGVALVFASLA 156


>gi|115359254|ref|YP_776392.1| isoprenylcysteine carboxyl methyltransferase [Burkholderia
           ambifaria AMMD]
 gi|115284542|gb|ABI90058.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
           ambifaria AMMD]
          Length = 189

 Score = 38.1 bits (87), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 41/96 (42%), Gaps = 23/96 (23%)

Query: 46  GCMLRCGQMVIAQALLF-----LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ- 99
           G   +CG  V+   LLF     LHLG +W  +V  KE          E  RT PY + + 
Sbjct: 71  GLQAQCGLAVLIAGLLFSVWARLHLGTNWSVSVTLKEN--------HELVRTGPYGLVRH 122

Query: 100 -------IALTGASEGKAVGEWFGPNTVAQVLRKLA 128
                  IAL GA+     GEW G   VA V   LA
Sbjct: 123 PIYTGCLIALVGAA--LIGGEWRGAFGVALVFASLA 156


>gi|170700470|ref|ZP_02891476.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
           ambifaria IOP40-10]
 gi|170134635|gb|EDT02957.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
           ambifaria IOP40-10]
          Length = 189

 Score = 37.7 bits (86), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 41/96 (42%), Gaps = 23/96 (23%)

Query: 46  GCMLRCGQMVIAQALLF-----LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ- 99
           G   +CG  V+   LLF     LHLG +W  +V  KE          E  RT PY + + 
Sbjct: 71  GLQAQCGLAVLIAGLLFSVWARLHLGTNWSVSVTLKEN--------HELVRTGPYGLVRH 122

Query: 100 -------IALTGASEGKAVGEWFGPNTVAQVLRKLA 128
                  IAL GA+     GEW G   VA V   LA
Sbjct: 123 PIYTGCLIALVGAA--LIGGEWRGALGVALVFASLA 156


>gi|307108757|gb|EFN56996.1| hypothetical protein CHLNCDRAFT_143632 [Chlorella variabilis]
          Length = 538

 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 19/90 (21%)

Query: 54  MVIAQALLFLHLGRDWQWNVNSKEEAYL-----------------KILKMFEDRRTA--P 94
           M++AQ L+   LGR+W+W   ++++                    ++L++F D      P
Sbjct: 1   MILAQGLVRHVLGREWRWPEAARQQQAAAAPALAAAPAEAPPRLARLLELFWDTPAERNP 60

Query: 95  YSIHQIALTGASEGKAVGEWFGPNTVAQVL 124
           +S+H +   G + G   G W GP  + + L
Sbjct: 61  FSLHSLCRAGQACGVVAGRWLGPWVMCKTL 90



 Score = 37.0 bits (84), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 11/23 (47%), Positives = 19/23 (82%)

Query: 222 FTFPQSLGVIGGKPNHALYFIGY 244
              PQS+G++GG+P+ +LYF+G+
Sbjct: 228 LAMPQSIGIVGGRPSSSLYFVGF 250


>gi|107025629|ref|YP_623140.1| isoprenylcysteine carboxyl methyltransferase [Burkholderia
           cenocepacia AU 1054]
 gi|116693189|ref|YP_838722.1| isoprenylcysteine carboxyl methyltransferase [Burkholderia
           cenocepacia HI2424]
 gi|105895003|gb|ABF78167.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
           cenocepacia AU 1054]
 gi|116651189|gb|ABK11829.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
           cenocepacia HI2424]
          Length = 189

 Score = 37.4 bits (85), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 42/96 (43%), Gaps = 23/96 (23%)

Query: 46  GCMLRCGQMVIAQALLF-----LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ- 99
           G   +CG  V+   LLF     LHLG +W  +V  KE+         E  RT PY++ + 
Sbjct: 71  GLQAQCGLAVLVAGLLFSVWARLHLGTNWSVSVTLKED--------HELVRTGPYALVRH 122

Query: 100 -------IALTGASEGKAVGEWFGPNTVAQVLRKLA 128
                  IAL GA+     GEW G   V  V   LA
Sbjct: 123 PIYTGCLIALVGAA--LIGGEWRGAIGVLLVFASLA 156


>gi|170737545|ref|YP_001778805.1| isoprenylcysteine carboxyl methyltransferase [Burkholderia
           cenocepacia MC0-3]
 gi|169819733|gb|ACA94315.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
           cenocepacia MC0-3]
          Length = 189

 Score = 37.4 bits (85), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 42/96 (43%), Gaps = 23/96 (23%)

Query: 46  GCMLRCGQMVIAQALLF-----LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ- 99
           G   +CG  V+   LLF     LHLG +W  +V  KE+         E  RT PY++ + 
Sbjct: 71  GLQAQCGLAVLVAGLLFSVWARLHLGTNWSVSVTLKED--------HELVRTGPYALVRH 122

Query: 100 -------IALTGASEGKAVGEWFGPNTVAQVLRKLA 128
                  IAL GA+     GEW G   V  V   LA
Sbjct: 123 PIYTGCLIALVGAA--LIGGEWRGAIGVLLVFASLA 156


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.137    0.428 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,909,479,630
Number of Sequences: 23463169
Number of extensions: 201377412
Number of successful extensions: 395909
Number of sequences better than 100.0: 779
Number of HSP's better than 100.0 without gapping: 735
Number of HSP's successfully gapped in prelim test: 44
Number of HSP's that attempted gapping in prelim test: 392486
Number of HSP's gapped (non-prelim): 1251
length of query: 309
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 167
effective length of database: 9,027,425,369
effective search space: 1507580036623
effective search space used: 1507580036623
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)