BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10465
(309 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|328707620|ref|XP_001947296.2| PREDICTED: cysteine protease ATG4B-like isoform 1 [Acyrthosiphon
pisum]
gi|328707622|ref|XP_003243448.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Acyrthosiphon
pisum]
gi|328707624|ref|XP_003243449.1| PREDICTED: cysteine protease ATG4B-like isoform 3 [Acyrthosiphon
pisum]
gi|328707626|ref|XP_003243450.1| PREDICTED: cysteine protease ATG4B-like isoform 4 [Acyrthosiphon
pisum]
Length = 402
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 184/287 (64%), Positives = 223/287 (77%), Gaps = 28/287 (9%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
DL+QIR DI SRLWFTYRKGFV IG++ T+D+GWGCMLRCGQMVI QAL+FLHLGRDW+
Sbjct: 59 DLQQIRNDIQSRLWFTYRKGFVQIGNTNFTSDRGWGCMLRCGQMVIGQALIFLHLGRDWR 118
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
W+ + ++ YLKIL+MFED+R+APYSIHQIAL G S GK VGEWFGPNT+AQVL+KLA
Sbjct: 119 WDPDKRDIDYLKILRMFEDKRSAPYSIHQIALMGVSHGKQVGEWFGPNTIAQVLKKLATM 178
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ-WQPLVLVIPLRLGIQDINPVYIN 189
D+ SS+VFHVALDNTLV+N+VKKLCT ++ +S+ Q W+PLVLVIPLRLGI INP Y+
Sbjct: 179 DELSSLVFHVALDNTLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQ 238
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
G+K C FTFPQSLGVIGG+PNHALYFIG+VGNDV
Sbjct: 239 GVKMC---------------------------FTFPQSLGVIGGRPNHALYFIGFVGNDV 271
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
IFLDPHT Q IG + +K+ ++E K+D +YHC Q +RL IL+MDPS+A
Sbjct: 272 IFLDPHTTQQIGMLPNKDIETEHKIDHSYHCQQINRLPILNMDPSLA 318
>gi|91083193|ref|XP_972923.1| PREDICTED: similar to Autophagy-specific protein, putative
[Tribolium castaneum]
gi|270006970|gb|EFA03418.1| hypothetical protein TcasGA2_TC013405 [Tribolium castaneum]
Length = 366
Score = 352 bits (903), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 171/290 (58%), Positives = 214/290 (73%), Gaps = 29/290 (10%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG-DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
Q+L+ IR+DI S++WFTYRK FVPIG D GLTTDKGWGCMLRCGQMV+AQAL+ LHLGRD
Sbjct: 36 QELDTIRQDILSKIWFTYRKNFVPIGGDEGLTTDKGWGCMLRCGQMVLAQALVTLHLGRD 95
Query: 69 WQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
W W +K+ YLKIL F D+R AP+SIHQIA+ G SE K VG+WFGPNTVAQVL+KL
Sbjct: 96 WVWEPETKDSTYLKILSKFVDKRQAPFSIHQIAMMGVSENKEVGQWFGPNTVAQVLKKLV 155
Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLC-TTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
KYD+WS+I H+ALDNTL+++ +++LC + S+ W+PL+L++PLRLG+Q+INP+Y
Sbjct: 156 KYDEWSAIEMHIALDNTLIISDIRELCLSQGSDGCSSGDWKPLLLIVPLRLGLQEINPIY 215
Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
+G+KKC F F QSLGVIGGKPN ALYFIG+VG+
Sbjct: 216 ASGLKKC---------------------------FQFKQSLGVIGGKPNLALYFIGHVGD 248
Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+VI+LDPHT Q G V KE + E +LDSTYHC ASR++IL MDPS+AV
Sbjct: 249 EVIYLDPHTTQKSGSVESKETEEEIELDSTYHCKYASRINILSMDPSVAV 298
>gi|242007959|ref|XP_002424782.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
gi|212508305|gb|EEB12044.1| Cysteine protease ATG4A, putative [Pediculus humanus corporis]
Length = 388
Score = 346 bits (888), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 183/306 (59%), Positives = 216/306 (70%), Gaps = 45/306 (14%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+D+ IR DI S+LWFTYRKGFVPIGDSGLT+DKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 39 RDVTAIRSDIKSKLWFTYRKGFVPIGDSGLTSDKGWGCMLRCGQMVLAQALVCLHLGRDW 98
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
+W +SKE YL+ILKMFED +TA YSIHQIAL G SEGK VG+WFGPNTV QVL+KL+
Sbjct: 99 RWKKDSKEPEYLRILKMFEDTKTATYSIHQIALMGVSEGKDVGQWFGPNTVTQVLKKLSV 158
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA------------------SSNPQWQPLV 171
YD WSSIV HVALDNT++VN +K LC N+++ +S +W+PL+
Sbjct: 159 YDKWSSIVIHVALDNTIIVNDIKSLCQRNEQSVIDSSAQKHSPLNEPVYFNSARKWKPLL 218
Query: 172 LVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVI 231
LV+PLRLG+ +INPVY+NG+K C FTF QSLGVI
Sbjct: 219 LVVPLRLGLSEINPVYLNGLKTC---------------------------FTFRQSLGVI 251
Query: 232 GGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
GGKPNHALYFIG VG VI+LDPHT Q + V KE EK D +YHCP+ASR IL M
Sbjct: 252 GGKPNHALYFIGCVGEHVIYLDPHTTQPVSIVDGKELSYEKTADLSYHCPRASRSRILDM 311
Query: 292 DPSIAV 297
DPS+AV
Sbjct: 312 DPSVAV 317
>gi|332375955|gb|AEE63118.1| unknown [Dendroctonus ponderosae]
Length = 370
Score = 346 bits (887), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 162/302 (53%), Positives = 212/302 (70%), Gaps = 30/302 (9%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDS-GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+L IR+DI S+LWFTYRK FVPIG S G T+DKGWGCMLRCGQMV+ QAL+ +HLGRDW
Sbjct: 45 ELNTIRQDIVSKLWFTYRKDFVPIGGSDGKTSDKGWGCMLRCGQMVLGQALMSIHLGRDW 104
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
QWN +++ YL ILK FED R AP+SIHQIA G SEGK VG+WFGPNTVAQVL+KL K
Sbjct: 105 QWNPTTRDATYLSILKKFEDSRKAPFSIHQIASMGISEGKEVGQWFGPNTVAQVLKKLVK 164
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRAS-SNPQWQPLVLVIPLRLGIQDINPVYI 188
+D+ + + HVALDN +++++++ LC + + A S P W+PL+L++PLRLG+ +N +Y+
Sbjct: 165 FDEGNDVAIHVALDNVVIISEIRDLCLSKETADVSTPHWKPLLLIVPLRLGLTQMNSIYL 224
Query: 189 NGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
G+K+C F F QSLG+IGGKPN ALYFIGYVGN+
Sbjct: 225 GGLKQC---------------------------FQFKQSLGIIGGKPNSALYFIGYVGNE 257
Query: 249 VIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVVSQ-RSYSDYK 307
VI+ DPHT Q G V +K+ EK +D +YHC ASR+ +L MDPS+AV RS +D+
Sbjct: 258 VIYFDPHTTQKAGSVGNKDTSEEKDVDLSYHCKHASRMSMLGMDPSVAVCFLCRSEADFN 317
Query: 308 NV 309
++
Sbjct: 318 DL 319
>gi|209969827|ref|NP_001123274.2| autophagy-specific gene 4 [Nasonia vitripennis]
Length = 405
Score = 343 bits (879), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 179/302 (59%), Positives = 213/302 (70%), Gaps = 39/302 (12%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSG--LTTDKGWGCMLRCGQMVIAQALLFLHL 65
+ +D++ IRRDI SRLWFTYRKGFVPIG G T+DKGWGCMLRCGQMV+ QAL+ LHL
Sbjct: 62 AKKDIDAIRRDIRSRLWFTYRKGFVPIGGFGSTFTSDKGWGCMLRCGQMVLGQALISLHL 121
Query: 66 GRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
GRDW+W ++ YL IL+ FEDRR APYSIHQIAL GASEGK VG+WFGPNT+AQVL+
Sbjct: 122 GRDWRWTPETRSSTYLNILRRFEDRRAAPYSIHQIALMGASEGKDVGQWFGPNTIAQVLK 181
Query: 126 KLAKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIP 175
KL YDDWSSI HVALDNTLVVN V + C TT + P QW+PL+L+IP
Sbjct: 182 KLVVYDDWSSITIHVALDNTLVVNDVVQQCRVEGATTAEVDGEKPLKAPSQWKPLLLLIP 241
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
LRLG+ +INP+YING+K F FPQSLG+IGGKP
Sbjct: 242 LRLGLNEINPIYINGLKT---------------------------SFQFPQSLGLIGGKP 274
Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
+HALYFIGYVG++VIFLDPHT Q G V K D+E ++D+TYHC ASR+ I MDPS+
Sbjct: 275 SHALYFIGYVGDEVIFLDPHTTQRAGSVDQKSDDNEAEVDATYHCKIASRIPITGMDPSV 334
Query: 296 AV 297
A+
Sbjct: 335 AL 336
>gi|357620505|gb|EHJ72670.1| putative Autophagy-specific protein [Danaus plexippus]
Length = 383
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 167/310 (53%), Positives = 208/310 (67%), Gaps = 37/310 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
QDLE+IRRDITS +W TYRKGFVPIGD GLT+DKGWGCMLRCGQMV+ AL+ +HL DW
Sbjct: 40 QDLERIRRDITSVIWCTYRKGFVPIGDEGLTSDKGWGCMLRCGQMVLGVALIKVHLSADW 99
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W +++ YLKI++ E+R+ APYSIHQ+AL GA EGK VG+WFGPNTVAQVL+KL
Sbjct: 100 VWTPETRDPTYLKIVQRLEERKQAPYSIHQVALMGACEGKEVGQWFGPNTVAQVLKKLVV 159
Query: 130 YDDWSSIVFHVALDNTLV---------VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI 180
YD WSS+V HVALDNT+V VN + C+ N W PL+L++PLRLG+
Sbjct: 160 YDKWSSLVIHVALDNTVVKEDILQQCIVNNDRGDCSENVDGFVVSDWMPLLLIVPLRLGL 219
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
+INP+Y+ G+K C F PQS+GVIGGKPN ALY
Sbjct: 220 SEINPIYMEGLKIC---------------------------FQSPQSIGVIGGKPNQALY 252
Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVVSQ 300
IG VG++VI+LDPHT Q G V +K D +K++D TYHC ASR+ IL MDPS+AV
Sbjct: 253 LIGCVGDEVIYLDPHTTQKSGLVENKLTDEQKEMDCTYHCKYASRIPILSMDPSVAVCFL 312
Query: 301 -RSYSDYKNV 309
R+ SD+ +
Sbjct: 313 CRTRSDFDEL 322
>gi|406042044|gb|AFS31124.1| autophagy related protein Atg4-like protein, partial [Spodoptera
litura]
Length = 365
Score = 338 bits (866), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 162/296 (54%), Positives = 205/296 (69%), Gaps = 35/296 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
QDL++IRRDITS +W TYRKGF+PIGD GLT+DKGWGCMLRCGQMV+ AL+ +HL DW
Sbjct: 23 QDLDRIRRDITSIIWCTYRKGFIPIGDEGLTSDKGWGCMLRCGQMVLGVALVRVHLSADW 82
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W +++ YLKI++ FE+R+ APYSIHQ+AL GASEGK VG+WFGPNTVAQVL+KL
Sbjct: 83 VWTPETRDPTYLKIIQRFEERKQAPYSIHQVALMGASEGKQVGQWFGPNTVAQVLKKLTV 142
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNK---RASSNP-----QWQPLVLVIPLRLGIQ 181
YD WSS+V HVALDNT+V + + C N S+ P W PL+L++PLRLG+
Sbjct: 143 YDKWSSLVIHVALDNTVVKEDILQQCVVNNDRGDCSAAPDSLVTDWMPLLLIVPLRLGLS 202
Query: 182 DINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYF 241
+INP+YI+G+K C F PQS+GVIGGKPN ALY
Sbjct: 203 EINPIYIDGLKIC---------------------------FQCPQSIGVIGGKPNQALYL 235
Query: 242 IGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+G VG++VI+LDPHT Q G V K D +K++D +YHC ASR+ +L MDPS+AV
Sbjct: 236 VGCVGDEVIYLDPHTTQRSGLVETKTTDEQKEMDWSYHCKYASRIPMLAMDPSVAV 291
>gi|380023311|ref|XP_003695467.1| PREDICTED: cysteine protease ATG4B-like [Apis florea]
Length = 382
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 171/300 (57%), Positives = 212/300 (70%), Gaps = 39/300 (13%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
++L+ IRRDI S+LWFTYRK FVPIG +S T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 34 RELDAIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 93
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
DWQWN+ ++ YLKIL+ FED+R AP+SIHQIAL GASEGK VG+WFGPNTVAQVL+KL
Sbjct: 94 DWQWNLETRNSTYLKILERFEDKRNAPFSIHQIALMGASEGKEVGQWFGPNTVAQVLKKL 153
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIPLR 177
+D+WSSI HVALDNTL+VN + K C TT + P QW+PL+L+IPLR
Sbjct: 154 VVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGDAPLKAPSQWKPLLLLIPLR 213
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG+ +INP+YING+K F PQSLGVIGGKP H
Sbjct: 214 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPTH 246
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
ALYFIG VGN+VI+LDPHT Q G V K ++ E ++D+TYHC + R+ I+ +DPS+A+
Sbjct: 247 ALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMDATYHCKFSGRIPIIEIDPSVAL 306
>gi|383861144|ref|XP_003706046.1| PREDICTED: cysteine protease ATG4B-like [Megachile rotundata]
Length = 384
Score = 330 bits (845), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 174/300 (58%), Positives = 209/300 (69%), Gaps = 39/300 (13%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGD--SGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
++L+ IRRDI S+LWFTYRKGFVPIG S T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 34 KELDAIRRDIRSKLWFTYRKGFVPIGGYTSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 93
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
DWQW ++ YLKIL+ FEDRRTAP+SIHQIA GASEGK VG+WFGPNT+AQVL+KL
Sbjct: 94 DWQWTPETRNSTYLKILERFEDRRTAPFSIHQIASMGASEGKEVGQWFGPNTIAQVLKKL 153
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCT----TNKRASSN------PQWQPLVLVIPLR 177
YDDWSSI HVALDNTL+VN + + C T A N QW+PL+L+IPLR
Sbjct: 154 VVYDDWSSITIHVALDNTLIVNDILRQCRVEGGTTAEADGNIPLKAPSQWKPLLLLIPLR 213
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG+ +INP+YING+K F PQSLGVIGGKPN
Sbjct: 214 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPNL 246
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
ALYFIG VGN+VI+LDPHT Q G V K ++ E ++D+TYHC ASR+ I +DPS+A+
Sbjct: 247 ALYFIGCVGNEVIYLDPHTTQRSGSVDKKLEEEEIEMDATYHCKFASRIPITGIDPSVAL 306
>gi|66529516|ref|XP_624577.1| PREDICTED: cysteine protease ATG4B [Apis mellifera]
Length = 382
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 170/300 (56%), Positives = 212/300 (70%), Gaps = 39/300 (13%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
++L+ IRRDI S+LWFTYRK FVPIG +S T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 34 RELDAIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 93
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
DWQW++ ++ YLKIL+ FED+R AP+SIHQIAL GASEGK VG+WFGPNTVAQVL+KL
Sbjct: 94 DWQWSLETRNSTYLKILERFEDKRNAPFSIHQIALMGASEGKEVGQWFGPNTVAQVLKKL 153
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIPLR 177
+D+WSSI HVALDNTL+VN + K C TT + P QW+PL+L+IPLR
Sbjct: 154 VVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGDAPLKAPSQWKPLLLLIPLR 213
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG+ +INP+YING+K F PQSLGVIGGKP H
Sbjct: 214 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPTH 246
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
ALYFIG VGN+VI+LDPHT Q G V K ++ E ++D+TYHC + R+ I+ +DPS+A+
Sbjct: 247 ALYFIGCVGNEVIYLDPHTTQKSGSVAKKLEEEEIEMDATYHCKFSGRIPIIEIDPSVAL 306
>gi|322795203|gb|EFZ18025.1| hypothetical protein SINV_08608 [Solenopsis invicta]
Length = 403
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 176/308 (57%), Positives = 215/308 (69%), Gaps = 35/308 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
++L+ IRRDI S+LWFTYRKGF+PIG +S T+DKGWGCMLRCGQMV+AQAL+ LHLG+
Sbjct: 34 KELDAIRRDIRSKLWFTYRKGFIPIGGCNSTFTSDKGWGCMLRCGQMVLAQALITLHLGK 93
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
DWQW +K YLKIL FED+R A +SIHQIALTGASEGK VG+WFGPNT+AQVL+KL
Sbjct: 94 DWQWMPETKNNTYLKILSRFEDKRAAAFSIHQIALTGASEGKEVGQWFGPNTIAQVLKKL 153
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------QWQPLVLVIPLR 177
YD+WSS+ HVALDNTL+VN + K C ++ QW+PL+L+IPLR
Sbjct: 154 IVYDEWSSLTIHVALDNTLIVNDILKQCRIEGGETAEADGEVPLKAPSQWKPLLLLIPLR 213
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY--------EFTFPQSLG 229
LG+ +INPVYING+K + KIL MQ +Y F QSLG
Sbjct: 214 LGLSEINPVYINGLKVKF-----------KILC----MQKKKYICIQFFQTSFKISQSLG 258
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
VIGGKPN ALYFIG VG++VI+LDPHT Q G V DK + E ++D TYHC ASR+ I
Sbjct: 259 VIGGKPNLALYFIGCVGDEVIYLDPHTTQRSGSVEDKISEEEIEMDITYHCKSASRIPIT 318
Query: 290 HMDPSIAV 297
MDPS+A+
Sbjct: 319 GMDPSVAL 326
>gi|332029697|gb|EGI69576.1| Cysteine protease ATG4B [Acromyrmex echinatior]
Length = 383
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 167/300 (55%), Positives = 207/300 (69%), Gaps = 39/300 (13%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
++L+ IRRDI S+LWFTYRKGFVPIG +S T+DKGWGCMLRCGQMV+AQAL+ LHLG+
Sbjct: 34 KELDAIRRDIRSKLWFTYRKGFVPIGGCNSTFTSDKGWGCMLRCGQMVLAQALITLHLGK 93
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
DWQW +K YLKIL+ FED+R A +SIHQIAL GASEGK VG+WFGPNT+AQVL+KL
Sbjct: 94 DWQWMPETKNNTYLKILRRFEDKRAAAFSIHQIALMGASEGKEVGQWFGPNTIAQVLKKL 153
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------QWQPLVLVIPLR 177
YD+WSS+ HVALDNTL+VN + + C ++ QW+PL+L+IPLR
Sbjct: 154 IVYDEWSSLTIHVALDNTLIVNDILRQCRVEGGVTAEADGEIPLRAPSQWKPLLLLIPLR 213
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG+ +INPVYING+K F QSLGVIGGKPN
Sbjct: 214 LGLSEINPVYINGLKT---------------------------SFKISQSLGVIGGKPNL 246
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
ALYFIG VG++VI+LDPHT Q G + DK + E ++D +YHC ASR+ I MDPS+A+
Sbjct: 247 ALYFIGCVGDEVIYLDPHTTQKSGSIEDKISEEEIEMDISYHCKSASRIPITGMDPSVAL 306
>gi|195401363|ref|XP_002059283.1| GJ16311 [Drosophila virilis]
gi|194156157|gb|EDW71341.1| GJ16311 [Drosophila virilis]
Length = 397
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 154/288 (53%), Positives = 198/288 (68%), Gaps = 31/288 (10%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+LE IRRDI SRLW TYR GFVP+G+ LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 55 QELELIRRDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIELHLGRDW 114
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W + ++ YLKI+ FED R + YSIHQIALTG S+ KAVGEW GPNTVAQ+L+ L +
Sbjct: 115 FWTPDCRDATYLKIVNRFEDTRKSFYSIHQIALTGESQNKAVGEWLGPNTVAQILKILVR 174
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDWSS+V HVA+D+T+V++++ C + S W+PL+L++PLRLGI DINP+YI
Sbjct: 175 FDDWSSLVVHVAMDSTVVLDEIYTRC----QEVSASTWKPLLLIVPLRLGISDINPMYIP 230
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C L S G+IGG+PN ALYF+GYV ++V
Sbjct: 231 ALKRCLEL---------------------------SSSCGMIGGRPNQALYFLGYVDDEV 263
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q G V K +E++LD +YH A+RL MDPS+AV
Sbjct: 264 LYLDPHTTQRAGSVAQKTTAAEQELDESYHQKYAARLSFGAMDPSLAV 311
>gi|340709295|ref|XP_003393246.1| PREDICTED: cysteine protease ATG4B-like isoform 1 [Bombus
terrestris]
Length = 383
Score = 320 bits (820), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 168/300 (56%), Positives = 209/300 (69%), Gaps = 39/300 (13%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
++L+ IRRDI S+LWFTYRK FVPIG +S T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 34 RELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 93
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
DWQW ++ YLKIL+ FED+RTA +SIHQIA GASEGK VG+WFGPNT+AQVL+KL
Sbjct: 94 DWQWTAETRNSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQVLKKL 153
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIPLR 177
+D+WSSI HVALDNTL+VN + K C TT + + P QW+PL+L+IPLR
Sbjct: 154 VVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGAVPLKAPSQWKPLLLLIPLR 213
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG+ +INP+YING+K F PQSLGVIGGKPN
Sbjct: 214 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPNL 246
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
ALYFIG V N+VI+LDPHT Q G V K ++ E ++D+TYHC +SR+ I +DPS+A+
Sbjct: 247 ALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMDATYHCKSSSRIPITGIDPSVAL 306
>gi|350425106|ref|XP_003494013.1| PREDICTED: cysteine protease ATG4B-like [Bombus impatiens]
Length = 383
Score = 320 bits (820), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 168/300 (56%), Positives = 209/300 (69%), Gaps = 39/300 (13%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
++L+ IRRDI S+LWFTYRK FVPIG +S T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 34 RELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 93
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
DWQW ++ YLKIL+ FED+RTA +SIHQIA GASEGK VG+WFGPNT+AQVL+KL
Sbjct: 94 DWQWTAETRNSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQVLKKL 153
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIPLR 177
+D+WSSI HVALDNTL+VN + K C TT + + P QW+PL+L+IPLR
Sbjct: 154 VVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGAVPLKAPSQWKPLLLLIPLR 213
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG+ +INP+YING+K F PQSLGVIGGKPN
Sbjct: 214 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPNL 246
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
ALYFIG V N+VI+LDPHT Q G V K ++ E ++D+TYHC +SR+ I +DPS+A+
Sbjct: 247 ALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMDATYHCKSSSRIPITGIDPSVAL 306
>gi|195118032|ref|XP_002003544.1| GI17971 [Drosophila mojavensis]
gi|193914119|gb|EDW12986.1| GI17971 [Drosophila mojavensis]
Length = 382
Score = 320 bits (819), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 153/288 (53%), Positives = 196/288 (68%), Gaps = 31/288 (10%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+LE IRRDI SRLW TYR GFVP+G+ LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 55 QELEPIRRDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 114
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W + ++ YLKI+ FED R + YSIHQIAL G S+ KAVGEW GPNTVAQ+L+ L +
Sbjct: 115 FWTPDCRDATYLKIVNRFEDTRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKILVR 174
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDWSS+ HVA+D+T+V++ + C + SS W+PL+L++PLRLGI DINP+YI
Sbjct: 175 FDDWSSLAVHVAMDSTVVLDDIYTCC----QESSESSWKPLLLIVPLRLGITDINPIYIP 230
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C L S G+IGG+PN ALYF+GYV ++V
Sbjct: 231 ALKRCLEL---------------------------SSSCGMIGGRPNQALYFLGYVDDEV 263
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q G V K +E++LD +YH A+RL MDPS+AV
Sbjct: 264 LYLDPHTTQRAGAVAQKTTAAERELDESYHQKYAARLSFGAMDPSLAV 311
>gi|340709297|ref|XP_003393247.1| PREDICTED: cysteine protease ATG4B-like isoform 2 [Bombus
terrestris]
Length = 386
Score = 319 bits (818), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 168/300 (56%), Positives = 209/300 (69%), Gaps = 39/300 (13%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
++L+ IRRDI S+LWFTYRK FVPIG +S T+DKGWGCMLRCGQMV+ QAL+ LHLGR
Sbjct: 37 RELDIIRRDIRSKLWFTYRKNFVPIGGYNSTFTSDKGWGCMLRCGQMVLGQALIILHLGR 96
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
DWQW ++ YLKIL+ FED+RTA +SIHQIA GASEGK VG+WFGPNT+AQVL+KL
Sbjct: 97 DWQWTAETRNSTYLKILERFEDKRTAAFSIHQIASMGASEGKEVGQWFGPNTIAQVLKKL 156
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC-----TTNKRASSNP-----QWQPLVLVIPLR 177
+D+WSSI HVALDNTL+VN + K C TT + + P QW+PL+L+IPLR
Sbjct: 157 VVFDEWSSITIHVALDNTLIVNDILKQCRVEGGTTVEADGAVPLKAPSQWKPLLLLIPLR 216
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG+ +INP+YING+K F PQSLGVIGGKPN
Sbjct: 217 LGLSEINPIYINGLKT---------------------------SFKIPQSLGVIGGKPNL 249
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
ALYFIG V N+VI+LDPHT Q G V K ++ E ++D+TYHC +SR+ I +DPS+A+
Sbjct: 250 ALYFIGCVENEVIYLDPHTTQRSGSVGKKLEEEEIEMDATYHCKSSSRIPITGIDPSVAL 309
>gi|195051960|ref|XP_001993206.1| GH13687 [Drosophila grimshawi]
gi|193900265|gb|EDV99131.1| GH13687 [Drosophila grimshawi]
Length = 393
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 152/288 (52%), Positives = 196/288 (68%), Gaps = 31/288 (10%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+LE IRRDI SRLW TYR GFVP+G+ LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61 QELEVIRRDIQSRLWCTYRHGFVPLGEVQLTTDKGWGCMLRCGQMVLAQALIELHLGRDW 120
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W + ++ YLKI+ FED R + YSIHQIAL G S+ KAVGEW GPNTVAQ+L+ L +
Sbjct: 121 FWTPDCRDTTYLKIVNRFEDTRKSFYSIHQIALMGESQNKAVGEWLGPNTVAQILKILVR 180
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDWSS+ HVA+D+T+V++ + LC + S W+PL+L++PLRLGI DINP+Y+
Sbjct: 181 FDDWSSLNVHVAMDSTVVLDDIFTLC----QEPSESAWKPLLLIVPLRLGISDINPIYVP 236
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C L S G+IGG+PN ALYF+GYV ++V
Sbjct: 237 ALKRCLEL---------------------------NSSCGMIGGRPNQALYFLGYVDDEV 269
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q G V K +E++LD +YH A+RL MDPS+AV
Sbjct: 270 LYLDPHTTQRAGAVAQKTTAAEQELDESYHQKYAARLSFAAMDPSLAV 317
>gi|194759168|ref|XP_001961821.1| GF15159 [Drosophila ananassae]
gi|190615518|gb|EDV31042.1| GF15159 [Drosophila ananassae]
Length = 402
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 153/296 (51%), Positives = 193/296 (65%), Gaps = 33/296 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+LE IRRDI SRLW TYR GF P+G+ LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61 QELELIRRDIQSRLWCTYRCGFAPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W ++ YLKI+ FED + + YSIHQIAL G S+ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTPECRDATYLKIVNRFEDVKNSCYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVR 180
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDW S+ HVA+D+T+V++ + LC W+PL+LVIPLRLGI DINP+Y+
Sbjct: 181 FDDWCSLAVHVAMDSTVVLDDIYSLCREGD------SWKPLLLVIPLRLGITDINPMYVP 234
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C L S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVDDEV 267
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVVSQRSYSD 305
++LDPHT Q G V K E++ D TYH A+RL+ MDPS+AV SD
Sbjct: 268 LYLDPHTTQRTGTVGQKTGVGEQEYDETYHQKHAARLNFSAMDPSLAVCFLCKTSD 323
>gi|157115549|ref|XP_001658259.1| Autophagy-specific protein, putative [Aedes aegypti]
gi|108876876|gb|EAT41101.1| AAEL007228-PA [Aedes aegypti]
Length = 389
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 157/292 (53%), Positives = 202/292 (69%), Gaps = 33/292 (11%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
+ +DL+ IRRD+ +RLW TYR+GFVPIG S LTTDKGWGCMLRCGQMV+AQAL LHLGR
Sbjct: 37 ATEDLDLIRRDVQTRLWCTYRRGFVPIGGSQLTTDKGWGCMLRCGQMVLAQALTQLHLGR 96
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASEGKAVGEWFGPNTVAQVLRK 126
DW W + E YLKI+ FED + AP+S+HQIALTG +SE K VGEWFGPNTVAQVL+K
Sbjct: 97 DWSWTPETTNETYLKIVNRFEDSKAAPFSLHQIALTGESSEEKRVGEWFGPNTVAQVLKK 156
Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ-WQPLVLVIPLRLGIQDINP 185
L K+DDW S+V HVALDNTL ++V +LC SNP W+PL+L+IPLRLG+ +INP
Sbjct: 157 LVKFDDWCSLVIHVALDNTLATDEVLELCVDR----SNPDSWKPLLLIIPLRLGLSEINP 212
Query: 186 VYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYV 245
+Y++G+KKC+ L + G++GG+PN ALYFIGYV
Sbjct: 213 IYVDGLKKCFEL---------------------------AGNCGMVGGRPNQALYFIGYV 245
Query: 246 GNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++ ++LDPHT Q G + K E++LD T+H A R++ MDPS+A+
Sbjct: 246 ADEALYLDPHTVQRSGTIGSKRDPDERELDETFHQKYARRINFKGMDPSLAL 297
>gi|427783027|gb|JAA56965.1| Putative cysteine protease required for autophagy [Rhipicephalus
pulchellus]
Length = 390
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 144/308 (46%), Positives = 197/308 (63%), Gaps = 50/308 (16%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
+ +L+ +R +ITS++W TYRK F I + T+D GWGCMLRCGQMV+A+A++ HLG+
Sbjct: 37 TFHELDDLRSNITSKIWLTYRKNFPAISGTDYTSDTGWGCMLRCGQMVVAEAVMRRHLGK 96
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
DWQW+ +K+E YL++L+MF+D++ YSIHQIA G SEGK VG+WFGPNT+A VLRKL
Sbjct: 97 DWQWSPGTKDEKYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKEVGQWFGPNTIAHVLRKL 156
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC----TTNKR--------------ASSNPQWQP 169
+ +D WSS+ HVA+DN +V++ ++K+C TT+ A+ W+P
Sbjct: 157 STFDKWSSLAMHVAMDNVVVMDDIRKICRVETTTDVEDGIRNRTQSHGGPAAAGARSWKP 216
Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
LVL IPLRLG+ +INP+Y G+K+ +AL QSLG
Sbjct: 217 LVLFIPLRLGLSEINPIYYCGLKRTFAL---------------------------KQSLG 249
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
+IGGKPNHALY IG VG+D++FLDPHT Q + D E D +YHC ASR+ I
Sbjct: 250 IIGGKPNHALYIIGVVGDDLVFLDPHTTQ-----LAVDLDVECPEDESYHCAHASRMDIG 304
Query: 290 HMDPSIAV 297
+DPSIA+
Sbjct: 305 QLDPSIAL 312
>gi|260795879|ref|XP_002592932.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
gi|229278156|gb|EEN48943.1| hypothetical protein BRAFLDRAFT_275700 [Branchiostoma floridae]
Length = 380
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 199/313 (63%), Gaps = 57/313 (18%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+D ++++ DI+SRLWFTYRK F PIG +G +D+GWGCMLRCGQM++ QAL+ HLGRDW
Sbjct: 42 KDRQELQNDISSRLWFTYRKNFTPIGGTGPMSDQGWGCMLRCGQMMLGQALICRHLGRDW 101
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
+W + Y KIL++F D++ + YSIHQIA G SEGK+VG+WFGPNTVAQVL+KLA
Sbjct: 102 RWKSAVYDNDYTKILQLFLDKKDSCYSIHQIAQMGVSEGKSVGQWFGPNTVAQVLKKLAL 161
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLC------------------------TTNKRASSNP 165
++DWSS+ HVA+DNT++++ +KKLC T+ + S
Sbjct: 162 FEDWSSLAIHVAMDNTVIIDDIKKLCRSARQPTPSQVTNSFLCNGVSAEQTSARSRSPAL 221
Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
WQPL+L+IPLRLG+ ++NPVY + +K C FT
Sbjct: 222 PWQPLMLIIPLRLGLSELNPVYTDCLKAC---------------------------FTLR 254
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL-DSTYHCPQAS 284
QSLG+IGGKPNHA YFIGYVGN +++LDPHT Q E + + DS++HC S
Sbjct: 255 QSLGMIGGKPNHAHYFIGYVGNSLVYLDPHTTQPA-----VELEGNVPIPDSSFHCTHPS 309
Query: 285 RLHILHMDPSIAV 297
R++I +DPSIA+
Sbjct: 310 RMNIQDLDPSIAL 322
>gi|291226947|ref|XP_002733451.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
kowalevskii]
Length = 356
Score = 296 bits (759), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 151/303 (49%), Positives = 197/303 (65%), Gaps = 46/303 (15%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+D ++ DI SR+W TYRK F IG +G T+D GWGCMLRCGQM++AQALL HLGR+W
Sbjct: 37 RDRSELLADIKSRIWITYRKNFSAIGGTGPTSDNGWGCMLRCGQMILAQALLCKHLGREW 96
Query: 70 QWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
+W + E Y KILK+F DR+ + YSIHQIA G EGK++G+WFGPNTVAQVLRKL
Sbjct: 97 RWESREHQNETYCKILKLFLDRKDSCYSIHQIAQMGVGEGKSIGQWFGPNTVAQVLRKLT 156
Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLCTT--------NKRASSNPQ------WQPLVLVI 174
+DDWSSI H+++DNT+VV ++KLC T K AS++ + W+PLVL I
Sbjct: 157 LFDDWSSIAVHISMDNTIVVEDIRKLCRTPLFTECASPKAASASLENGGTTYWKPLVLFI 216
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
PLRLG+ +INP+Y++ +KKC FT QSLG+IGGK
Sbjct: 217 PLRLGLTEINPLYLDVLKKC---------------------------FTLKQSLGMIGGK 249
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
PNHA YFIG+ G +++LDPHT Q V D + + D TYHC SR++I+H+DPS
Sbjct: 250 PNHAHYFIGFYGKTLVYLDPHTTQP---VVDINKWASIP-DDTYHCKHPSRMNIMHLDPS 305
Query: 295 IAV 297
IA+
Sbjct: 306 IAL 308
>gi|241999098|ref|XP_002434192.1| cystein protease, putative [Ixodes scapularis]
gi|215495951|gb|EEC05592.1| cystein protease, putative [Ixodes scapularis]
Length = 382
Score = 296 bits (758), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 145/305 (47%), Positives = 196/305 (64%), Gaps = 49/305 (16%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+L+ +R D+TS++W TYRK F IG +G T+D GWGCMLRCGQMV+AQAL+ HLGR+W
Sbjct: 33 HELDDLRSDVTSKIWLTYRKNFPAIGGTGPTSDSGWGCMLRCGQMVLAQALMRRHLGREW 92
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
+W +K + YL IL+MF+D++ +SIHQIA G SEGK VGEWFGPNTVA VLRKLA
Sbjct: 93 RWEPGTKNKDYLYILRMFQDKKNCTFSIHQIAQMGVSEGKTVGEWFGPNTVAHVLRKLAI 152
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKL-----------------CTTNKRASSNPQWQPLVL 172
+D WSS+ HVA+DNT+++N++ K ++ A+S W+PL+L
Sbjct: 153 FDKWSSLAIHVAMDNTVIINEISKFRCHIWAAADGLVRNRTNSEPSRPANSEGSWKPLLL 212
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
IPLRLG+ +IN +Y G+K+ +AL QSLG+IG
Sbjct: 213 FIPLRLGLSEINRIYAFGLKRTFAL---------------------------KQSLGMIG 245
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
GKPNHALYFIG V +++IFLDPHT Q + C + D + D +YHC ASR++I +D
Sbjct: 246 GKPNHALYFIGVVEDELIFLDPHTTQ-LAC----DLDVDSPDDQSYHCAHASRMNISELD 300
Query: 293 PSIAV 297
PS+A+
Sbjct: 301 PSVAL 305
>gi|291202714|dbj|BAI82576.1| autophagy-related 4 [Haemaphysalis longicornis]
Length = 387
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 140/303 (46%), Positives = 191/303 (63%), Gaps = 47/303 (15%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+L+ +R D+TS++W TYR+ F I + T+D GWGCMLRCGQM +A+AL+ HL R W
Sbjct: 39 HELDDLRSDVTSKIWLTYRRNFPAISGTDYTSDTGWGCMLRCGQMAVAEALMRRHLRRGW 98
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
QW ++E+YL++L+MF+D++ +SIHQIA G SEGKAVG+WFGPNTVA VLRKLA
Sbjct: 99 QWAPGIRDESYLRVLRMFQDKKNCTFSIHQIAQMGVSEGKAVGQWFGPNTVAHVLRKLAA 158
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSN---------------PQWQPLVLVI 174
+D WSS+ HVA+DN ++++ ++K+C A S W+PL+L I
Sbjct: 159 FDKWSSLAIHVAMDNVVIMDDIRKVCRLEATAESGVRNRAEPAGLAAAAAESWKPLLLFI 218
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
PLRLG+ +INP+Y G+K+ +AL QSLG+IGGK
Sbjct: 219 PLRLGLSEINPIYYCGLKRTFAL---------------------------KQSLGIIGGK 251
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
PNHALY IG VG+D++FLDPHT Q + D+E D +YHC ASR+ I +DPS
Sbjct: 252 PNHALYIIGVVGDDLVFLDPHTTQ-----LAVDLDTEFPDDESYHCAHASRMDIGQLDPS 306
Query: 295 IAV 297
IA+
Sbjct: 307 IAL 309
>gi|346466653|gb|AEO33171.1| hypothetical protein [Amblyomma maculatum]
Length = 401
Score = 293 bits (749), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 192/304 (63%), Gaps = 48/304 (15%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+L+ +R D+TS++W TYRK F I + T+D GWGCMLRCGQMVIA+AL+ HLG+ W
Sbjct: 52 HELDDLRNDVTSKIWLTYRKNFPAISGTDHTSDTGWGCMLRCGQMVIAEALMRRHLGKGW 111
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
QW ++E YL++L+MF+D++ YSIHQIA G SEGKAVG+WFGPNT+A VLRKL+
Sbjct: 112 QWAPGIRDENYLRVLRMFQDKKNCTYSIHQIAQMGVSEGKAVGQWFGPNTIAHVLRKLSA 171
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA----------------SSNPQWQPLVLV 173
+D WSS+ HVA+DN +V++ ++K+C A +S W+PL+L
Sbjct: 172 FDKWSSLAVHVAMDNVVVMDDIRKICRVETPAVDDGVRHRTQSHGLACASAVSWKPLLLF 231
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
IPLRLG+ +INPVY G+K+ +AL QS+G+IGG
Sbjct: 232 IPLRLGLNEINPVYYCGLKRTFAL---------------------------KQSVGIIGG 264
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
KPNHAL+ IG VG+D++FLDPHT Q + D E D +YHC ASR+ I +DP
Sbjct: 265 KPNHALFIIGVVGDDLVFLDPHTTQ-----LAVDLDVEFPEDESYHCAHASRMDIGQLDP 319
Query: 294 SIAV 297
SIA+
Sbjct: 320 SIAL 323
>gi|443684303|gb|ELT88258.1| hypothetical protein CAPTEDRAFT_225251 [Capitella teleta]
Length = 410
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 201/318 (63%), Gaps = 63/318 (19%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
DL ++++D+ SRLW TYRKGF PIG SG T+D+GWGCMLRCGQM++AQ+L+ HLGRDW+
Sbjct: 46 DLAELKKDVKSRLWLTYRKGFDPIGGSGPTSDQGWGCMLRCGQMMLAQSLICRHLGRDWR 105
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
W + + Y +IL+MF+D+R+A YS+ IA G SEGKA+GEWFGPNT++QVLRKL
Sbjct: 106 WTKDKYDPKYFEILRMFQDKRSAKYSLQVIASMGTSEGKAIGEWFGPNTISQVLRKLCVS 165
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP------------------------- 165
D+WS++V HVALDNT++++ V LC ++K+ S+ P
Sbjct: 166 DEWSNLVVHVALDNTVIIDDVFCLCKSSKKESNEPIPGVHAACASALLFNGHDPTAEGHD 225
Query: 166 ------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPR 219
W+PL+L++PLRLG+ +INPVYI +K C
Sbjct: 226 PSGEDDSWRPLLLIVPLRLGLSEINPVYIPFLKTC------------------------- 260
Query: 220 YEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYH 279
TF QS+G+IGGKPNHA +FIG++ ++++++DPHT Q D Q E D++YH
Sbjct: 261 --LTFKQSVGIIGGKPNHAHWFIGFLEDELVYMDPHTTQPF---VDVTQPGES--DASYH 313
Query: 280 CPQASRLHILHMDPSIAV 297
C + R+ + ++DPS+AV
Sbjct: 314 CSYSCRMPVSYLDPSVAV 331
>gi|125986465|ref|XP_001356996.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
gi|54645322|gb|EAL34062.1| GA18177 [Drosophila pseudoobscura pseudoobscura]
Length = 409
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 153/288 (53%), Positives = 191/288 (66%), Gaps = 33/288 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+LE IRRDI SRLW TYR GF+P+G+ LTTD+GWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61 QELEVIRRDIQSRLWCTYRHGFMPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLGRDW 120
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W ++ YLKI+ FED R + YSIHQIAL G S+ KAVGEW GPNTVAQ+L+KL
Sbjct: 121 FWTPECQDATYLKIVNRFEDVRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVL 180
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDW S+V HVA+D+T+V++ V LC W+PL+L+IPLRLGI DINP+YI
Sbjct: 181 FDDWCSLVVHVAMDSTVVLDDVYSLCLEGD------AWKPLLLIIPLRLGISDINPIYIP 234
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C L S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVEDEV 267
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q G V K E++ D TYH A+RL MDPS+AV
Sbjct: 268 LYLDPHTTQRTGVVGQKTSSGEQEHDETYHQKHAARLSFSAMDPSLAV 315
>gi|195159572|ref|XP_002020652.1| GL15485 [Drosophila persimilis]
gi|194117602|gb|EDW39645.1| GL15485 [Drosophila persimilis]
Length = 409
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 153/288 (53%), Positives = 191/288 (66%), Gaps = 33/288 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+LE IRRDI SRLW TYR GF+P+G+ LTTD+GWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61 QELEVIRRDIQSRLWCTYRHGFMPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLGRDW 120
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W ++ YLKI+ FED R + YSIHQIAL G S+ KAVGEW GPNTVAQ+L+KL
Sbjct: 121 FWTPECQDATYLKIVNRFEDVRKSYYSIHQIALMGESQNKAVGEWLGPNTVAQILKKLVL 180
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDW S+V HVA+D+T+V++ V LC W+PL+L+IPLRLGI DINP+YI
Sbjct: 181 FDDWCSLVVHVAMDSTVVLDDVYSLCLEGD------AWKPLLLIIPLRLGISDINPIYIP 234
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C L S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVEDEV 267
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q G V K E++ D TYH A+RL MDPS+AV
Sbjct: 268 LYLDPHTTQKTGVVGQKTSSGEQEHDETYHQKHAARLSFSAMDPSLAV 315
>gi|158296556|ref|XP_316946.4| AGAP008497-PA [Anopheles gambiae str. PEST]
gi|157014766|gb|EAA12240.4| AGAP008497-PA [Anopheles gambiae str. PEST]
Length = 389
Score = 292 bits (747), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 156/291 (53%), Positives = 204/291 (70%), Gaps = 31/291 (10%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
+ DLE IR+D+ SRLW TYR+GFVPIG++ LTTDKGWGCMLRCGQMV+AQALL LHLGR
Sbjct: 37 ASDDLEAIRQDVQSRLWCTYRRGFVPIGNTQLTTDKGWGCMLRCGQMVLAQALLQLHLGR 96
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASEGKAVGEWFGPNTVAQVLRK 126
DW W ++++ YL I+ FED + AP+S+HQIAL G +SE K +GEWFGPNTVAQVL+K
Sbjct: 97 DWVWEAETRDDIYLNIVNRFEDSKQAPFSLHQIALMGDSSEEKRIGEWFGPNTVAQVLKK 156
Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPV 186
L K+DDW +V HVALDNT+ +++ +LC K + W+PL+L+IPLRLG+ ++NP+
Sbjct: 157 LVKFDDWCRLVIHVALDNTVATDEIVELCVDKKEPEA---WKPLLLIIPLRLGLSEVNPI 213
Query: 187 YINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG 246
YI G+KKC+ L P S G+IGG+PN ALYFIGYVG
Sbjct: 214 YIEGLKKCFQL---------------------------PGSCGMIGGRPNQALYFIGYVG 246
Query: 247 NDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+ ++LDPHT Q +G V K+ +E++LD T+H ASR+ MDPS+AV
Sbjct: 247 GEALYLDPHTVQRVGTVGSKQDPAEQELDETFHQRYASRISFTSMDPSLAV 297
>gi|170036509|ref|XP_001846106.1| Autophagy-specific protein [Culex quinquefasciatus]
gi|167879174|gb|EDS42557.1| Autophagy-specific protein [Culex quinquefasciatus]
Length = 379
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 157/296 (53%), Positives = 208/296 (70%), Gaps = 30/296 (10%)
Query: 4 ANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
+N L DL+QIRRD+ SRLW TYR+GFVPIG S T+DKGWGCMLRCGQMV+AQALL L
Sbjct: 22 SNALETLDLDQIRRDVQSRLWCTYRRGFVPIGGSQHTSDKGWGCMLRCGQMVLAQALLQL 81
Query: 64 HLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASEGKAVGEWFGPNTVAQ 122
HLGRDW+W +++E YL+I+ FED + AP+S+HQIALTG +SE K VGEWFGPNTVAQ
Sbjct: 82 HLGRDWEWTAETRDETYLRIVNRFEDNKAAPFSLHQIALTGESSEEKRVGEWFGPNTVAQ 141
Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQD 182
VL+KL K+DDW S+V HVALD+TL ++V +LC ++ + W+PL+L+IPLRLG+ +
Sbjct: 142 VLKKLVKFDDWCSVVVHVALDSTLATDEVVELC--EDKSDAGTSWKPLLLIIPLRLGLSE 199
Query: 183 INPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFI 242
INP+Y+ G+KKC+ L + G+IGG+PN ALYFI
Sbjct: 200 INPIYVAGLKKCFELA---------------------------GNCGMIGGRPNQALYFI 232
Query: 243 GYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
GYVG++ +FLDPHT Q G + DK E+++D ++H A R++ MDPS+A+
Sbjct: 233 GYVGDEALFLDPHTVQRSGNIGDKTGLDEREMDESFHQRYARRINFKAMDPSLALC 288
>gi|390365223|ref|XP_785967.3| PREDICTED: cysteine protease ATG4B-like [Strongylocentrotus
purpuratus]
Length = 390
Score = 291 bits (744), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 154/345 (44%), Positives = 202/345 (58%), Gaps = 82/345 (23%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
LS LE R D+ SRLWFTYRKGF IG +G TTD+GWGCMLRCGQM++AQAL++ HLG
Sbjct: 57 LSQHQLEA-RLDVLSRLWFTYRKGFSNIGGTGPTTDQGWGCMLRCGQMMLAQALVYKHLG 115
Query: 67 RDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
RDW+W ++E YLKIL++F D++ + +SIHQIA G EGK VG+WFGPNTV QV+RK
Sbjct: 116 RDWRWRPQEQDETYLKILQLFLDKKDSCFSIHQIAQMGVGEGKKVGDWFGPNTVGQVIRK 175
Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN------------------KRASSNP--- 165
L+ +D WS + HVALDNT+V+ ++KLCT N KR SS+
Sbjct: 176 LSPFDSWSDLAVHVALDNTVVIEDIRKLCTVNSTTEETSSEGSKTGSERRKRTSSSENIR 235
Query: 166 -----------------------------QWQPLVLVIPLRLGIQDINPVYINGIKKCYA 196
W+ L L+IPLRLG+ +IN VY+ +K+C
Sbjct: 236 HKMQLSPENTNIQLPNGLMEGACVSPGGVSWRSLFLIIPLRLGLNEINTVYMQRLKRC-- 293
Query: 197 LPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
FT PQSLGVIGGKPNHA YFIG +G+++++LDPHT
Sbjct: 294 -------------------------FTLPQSLGVIGGKPNHAHYFIGVLGDEMVYLDPHT 328
Query: 257 NQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVVSQR 301
Q + DK + D ++HC ASR+ I ++DPSI +VS +
Sbjct: 329 TQPAADI-DKWAFLQ---DESFHCEHASRMPIKNLDPSIGLVSTK 369
>gi|148233205|ref|NP_001088025.1| cysteine protease ATG4B [Xenopus laevis]
gi|61211762|sp|Q640G7.1|ATG4B_XENLA RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|52221191|gb|AAH82660.1| LOC494717 protein [Xenopus laevis]
Length = 384
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 148/315 (46%), Positives = 193/315 (61%), Gaps = 52/315 (16%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
EQ+ DITSRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ H+GRDW+W+
Sbjct: 40 EQLLNDITSRLWFTYRRNFQAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHVGRDWRWD 99
Query: 73 VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
+ YL IL F D++ + YSIHQIA G EGK +G+W+GPNTVAQVLRKLA +D
Sbjct: 100 KQKPKGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKYIGQWYGPNTVAQVLRKLAVFDQ 159
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSN----------------PQWQPLVLVIPL 176
WSSI H+A+DNT+VV+++++LC SS+ QW+PLVL+IPL
Sbjct: 160 WSSIAVHIAMDNTVVVDEIRRLCRAGSGESSDAGALSNGYTGDSDPSCAQWKPLVLLIPL 219
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
RLG+ +IN YI +K C F PQSLGVIGG+PN
Sbjct: 220 RLGLSEINEAYIETLKHC---------------------------FMVPQSLGVIGGRPN 252
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSI 295
A YFIGYVG+++I+LDPHT Q + + D D ++HC R+H+ +DPSI
Sbjct: 253 SAHYFIGYVGDELIYLDPHTTQ----LSVEPSDCSFIEDESFHCQHPPCRMHVSEIDPSI 308
Query: 296 AV----VSQRSYSDY 306
AV SQ + D+
Sbjct: 309 AVGFFCSSQEDFEDW 323
>gi|195575679|ref|XP_002077704.1| GD23066 [Drosophila simulans]
gi|194189713|gb|EDX03289.1| GD23066 [Drosophila simulans]
Length = 411
Score = 290 bits (743), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 151/288 (52%), Positives = 191/288 (66%), Gaps = 33/288 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+LE IRRDI SRLW TYR GF P+G+ LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61 QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W + ++ YLKI+ FED R + YSIHQIA G S+ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTPDCRDATYLKIVNRFEDVRNSFYSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVR 180
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDWSS+ HVA+D+T+V++ V C W+PL+L+IPLRLGI DINP+Y+
Sbjct: 181 FDDWSSLAIHVAMDSTVVLDDVYASCREGG------SWKPLLLIIPLRLGITDINPLYVP 234
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C L S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVDDEV 267
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q G V K +E+ D TYH A+RL+ MDPS+AV
Sbjct: 268 LYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFSAMDPSLAV 315
>gi|19920488|ref|NP_608563.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
gi|7296129|gb|AAF51423.1| Autophagy-specific gene 4, isoform A [Drosophila melanogaster]
gi|16198037|gb|AAL13802.1| LD26292p [Drosophila melanogaster]
gi|220945806|gb|ACL85446.1| Atg4-PA [synthetic construct]
gi|220955642|gb|ACL90364.1| Atg4-PA [synthetic construct]
Length = 411
Score = 290 bits (743), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 151/288 (52%), Positives = 191/288 (66%), Gaps = 33/288 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+LE IRRDI SRLW TYR GF P+G+ LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61 QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W + ++ YLKI+ FED R + YSIHQIA G S+ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTPDCRDATYLKIVNRFEDVRNSFYSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVR 180
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDWSS+ HVA+D+T+V++ V C W+PL+L+IPLRLGI DINP+Y+
Sbjct: 181 FDDWSSLAIHVAMDSTVVLDDVYASC------REGGSWKPLLLIIPLRLGITDINPLYVP 234
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C L S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVDDEV 267
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q G V K +E+ D TYH A+RL+ MDPS+AV
Sbjct: 268 LYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFSAMDPSLAV 315
>gi|442625102|ref|NP_001259852.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
gi|440213106|gb|AGB92389.1| Autophagy-specific gene 4, isoform B [Drosophila melanogaster]
Length = 410
Score = 290 bits (743), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 151/288 (52%), Positives = 191/288 (66%), Gaps = 33/288 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+LE IRRDI SRLW TYR GF P+G+ LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61 QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W + ++ YLKI+ FED R + YSIHQIA G S+ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTPDCRDATYLKIVNRFEDVRNSFYSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVR 180
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDWSS+ HVA+D+T+V++ V C W+PL+L+IPLRLGI DINP+Y+
Sbjct: 181 FDDWSSLAIHVAMDSTVVLDDVYASC------REGGSWKPLLLIIPLRLGITDINPLYVP 234
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C L S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVDDEV 267
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q G V K +E+ D TYH A+RL+ MDPS+AV
Sbjct: 268 LYLDPHTTQRTGAVAQKTAAAEQDYDETYHQKHAARLNFSAMDPSLAV 315
>gi|118404310|ref|NP_001072464.1| autophagy related 4B, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|115291929|gb|AAI21871.1| cysteine endopeptidase AUT-like (1O128) [Xenopus (Silurana)
tropicalis]
Length = 384
Score = 290 bits (743), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 192/314 (61%), Gaps = 49/314 (15%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
EQ+ DITSRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQALL H+GRDW+W+
Sbjct: 40 EQLLNDITSRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHIGRDWRWD 99
Query: 73 VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
+ YL IL F D++ + YSIHQIA G EGK +G+W+GPNTVAQVLRKLA +D
Sbjct: 100 KQKSQGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKCIGQWYGPNTVAQVLRKLAVFDQ 159
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ----------------WQPLVLVIPL 176
WSSI H+A+DNT+V++++++LC SS W+PLVL+IPL
Sbjct: 160 WSSIAVHIAMDNTVVMDEIRRLCRAGTNESSEAGALCNGYTGVSDPSCSLWKPLVLLIPL 219
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
RLG+ DIN YI +K C F PQSLGVIGG+PN
Sbjct: 220 RLGLSDINEAYIETLKHC---------------------------FMVPQSLGVIGGRPN 252
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSI 295
A YFIGYVG+++I+LDPHT Q + + D D ++HC R+H+ +DPSI
Sbjct: 253 SAHYFIGYVGDELIYLDPHTTQ----LAVEPSDCCFVEDESFHCQHPPCRMHVSEIDPSI 308
Query: 296 AV-VSQRSYSDYKN 308
AV RS D+++
Sbjct: 309 AVGFFCRSQEDFED 322
>gi|194853882|ref|XP_001968241.1| GG24763 [Drosophila erecta]
gi|190660108|gb|EDV57300.1| GG24763 [Drosophila erecta]
Length = 411
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 151/288 (52%), Positives = 190/288 (65%), Gaps = 33/288 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+LE IRRDI SRLW TYR GF P+G+ LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61 QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W + ++ YLKI+ FED R + YSIHQIA G S+ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTADCRDATYLKIVNRFEDVRNSFYSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVR 180
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDWSS+ HVA+D+T+V++ V C W+PL+L+IPLRLGI DINP+Y+
Sbjct: 181 FDDWSSLAIHVAMDSTVVLDDVYSSC------REGGSWKPLLLIIPLRLGITDINPLYVP 234
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C L S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------DSSCGMIGGRPNQALYFLGYVDDEV 267
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q G V K +E+ D TYH A+RL MDPS+AV
Sbjct: 268 LYLDPHTTQRTGVVGQKTAVAEQDYDETYHQKHAARLSFSAMDPSLAV 315
>gi|195437827|ref|XP_002066841.1| GK24338 [Drosophila willistoni]
gi|194162926|gb|EDW77827.1| GK24338 [Drosophila willistoni]
Length = 400
Score = 290 bits (741), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 147/288 (51%), Positives = 192/288 (66%), Gaps = 33/288 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+L+ IRRDI SRLW TYR FVP+G+ LTTD+GWGCMLRCGQMV+AQAL+ LHLGR+W
Sbjct: 55 QELDLIRRDIQSRLWCTYRHSFVPLGEVQLTTDRGWGCMLRCGQMVLAQALIDLHLGREW 114
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W ++ YLKI+ FED R + YS+HQIAL G S+ K VGEW GPNTVAQ+L+KL
Sbjct: 115 YWTSECRDATYLKIVNRFEDARKSYYSLHQIALMGESQNKMVGEWLGPNTVAQILKKLVC 174
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDW S+V HVA+D+T+V++ + L + W+PL+L+IPLRLGI DINP+Y+
Sbjct: 175 FDDWCSLVIHVAMDSTVVLDDIYSL------SQDGESWKPLLLIIPLRLGITDINPIYVP 228
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C F S G+IGG+PN ALYF+GYV ++V
Sbjct: 229 ALKRC---------------------------FELESSCGMIGGRPNQALYFVGYVDDEV 261
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q G V K +E++LD TYH A+RL+ MDPS+AV
Sbjct: 262 LYLDPHTTQRTGAVGQKTTTAEQELDETYHQKYAARLNFSAMDPSLAV 309
>gi|195470405|ref|XP_002087497.1| GE17286 [Drosophila yakuba]
gi|194173598|gb|EDW87209.1| GE17286 [Drosophila yakuba]
Length = 411
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 150/288 (52%), Positives = 190/288 (65%), Gaps = 33/288 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+LE IRRDI SRLW TYR GF P+G+ LTTDKGWGCMLRCGQMV+AQAL+ LHLGRDW
Sbjct: 61 QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWGCMLRCGQMVLAQALIDLHLGRDW 120
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W + ++ YLKI+ FED R + YSIHQIA G ++ KAVGEW GPNTVAQ+L+KL +
Sbjct: 121 FWTSDCRDATYLKIVNRFEDVRNSYYSIHQIAQMGETQNKAVGEWLGPNTVAQILKKLVR 180
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDWSS+ HVA+D+T+V++ V C W+PL+L+IPLRLGI DINP+Y+
Sbjct: 181 FDDWSSLAIHVAMDSTVVLDDVYSSC------REGGSWKPLLLIIPLRLGITDINPLYVP 234
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C L S G+IGG+PN ALYF+GYV ++V
Sbjct: 235 ALKRCLEL---------------------------ESSCGMIGGRPNQALYFLGYVDDEV 267
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q G V K +E+ D TYH A+RL MDPS+AV
Sbjct: 268 LYLDPHTTQRTGVVGQKTAAAEQDYDETYHQKHAARLSFSAMDPSLAV 315
>gi|405972565|gb|EKC37327.1| Cysteine protease ATG4B [Crassostrea gigas]
Length = 405
Score = 289 bits (740), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 140/324 (43%), Positives = 196/324 (60%), Gaps = 69/324 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D ++++ D S++W TYRK F IG +G T D GWGCMLRCGQM++AQAL+ HLGRDW+
Sbjct: 43 DRDELKGDFLSKIWCTYRKNFPAIGGTGPTCDGGWGCMLRCGQMMLAQALVVRHLGRDWK 102
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
WN N +++ Y +IL+MF D+++A YSI QIA G SEGK VG WFGPNTVAQVL+KLA Y
Sbjct: 103 WNKNCQDQTYKRILQMFADKKSANYSIQQIASMGVSEGKPVGSWFGPNTVAQVLKKLAVY 162
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSN-------------------------- 164
D+WSSIV H+A+DNT++ N +K +C + +++ +
Sbjct: 163 DEWSSIVIHIAMDNTVIENDIKSVCKDDGKSTCDIIGVRQLKHESAATGRSKKSSQDSSK 222
Query: 165 -----------PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
W+PL+LVIPLRLG+ +IN VY+ +K C
Sbjct: 223 QDKNKQNAVDVKSWKPLLLVIPLRLGLTEINSVYVQSLKAC------------------- 263
Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKK 273
+FPQS+G+IGGKPNHA +F+GY+ + +I+LDPHT Q ++ DS
Sbjct: 264 --------LSFPQSVGIIGGKPNHAHWFVGYMSDKLIYLDPHTTQLC-----EDLDSPNF 310
Query: 274 LDSTYHCPQASRLHILHMDPSIAV 297
D +YHCP S ++++ +DPSIA+
Sbjct: 311 SDESYHCPYPSTMNVMELDPSIAL 334
>gi|355757609|gb|EHH61134.1| Cysteine protease ATG4A, partial [Macaca fascicularis]
Length = 396
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/315 (45%), Positives = 201/315 (63%), Gaps = 55/315 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 42 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 101
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 102 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 161
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T R +SN P W+PL
Sbjct: 162 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPLDYLTASNQSKGTSAHCPAWKPL 221
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 222 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 254
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 255 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 310
Query: 290 HMDPSIAVVSQRSYS 304
++DPS+A+V R S
Sbjct: 311 NLDPSVALVGIRRLS 325
>gi|395854618|ref|XP_003799779.1| PREDICTED: cysteine protease ATG4A isoform 1 [Otolemur garnettii]
Length = 398
Score = 287 bits (734), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 196/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCT----------------------TNKRASSNPQWQPL 170
W+S+ +V++DNT+V+ +KK+C + ++ P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTAGESPPGSLTALNQSKGTSACRPAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++GN++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGNELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|338729393|ref|XP_001490718.3| PREDICTED: cysteine protease ATG4A [Equus caballus]
Length = 398
Score = 286 bits (732), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 199/308 (64%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC-------------------TTNKRASSN---PQWQPL 170
W+S+ +V++DNT+V+ +KK+C +N+ S++ P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCILPLSADTAGESPPSSLNASNRSKSTSAGWPAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|291407754|ref|XP_002720229.1| PREDICTED: autophagy-related cysteine endopeptidase 2 [Oryctolagus
cuniculus]
Length = 405
Score = 286 bits (732), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 198/308 (64%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 51 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 110
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 111 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 170
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR--------------ASSNPQWQPL 170
W+S+ +V++DNT+V+ +KK+C T +R ++ P W+PL
Sbjct: 171 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSANTPGERLHDSLTASNQSKGTSACCPAWKPL 230
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 231 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 263
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++GN++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 264 LGGKPNNAYYFIGFLGNELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 319
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 320 NLDPSVAL 327
>gi|328722655|ref|XP_003247627.1| PREDICTED: cysteine protease ATG4B-like [Acyrthosiphon pisum]
Length = 252
Score = 286 bits (731), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 130/183 (71%), Positives = 157/183 (85%), Gaps = 1/183 (0%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
DL+QIR DI SRLWFTYRKGFV IG++ T+D+GWGCMLRCGQMVI QAL+FLHLGRDW+
Sbjct: 59 DLQQIRNDIQSRLWFTYRKGFVQIGNTNFTSDRGWGCMLRCGQMVIGQALIFLHLGRDWR 118
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
W+ + ++ YLKIL+MFED+R+APYSIHQIAL G S GK VGEWFGPNT+AQVL+KLA
Sbjct: 119 WDPDKRDIDYLKILRMFEDKRSAPYSIHQIALMGVSHGKQVGEWFGPNTIAQVLKKLATM 178
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ-WQPLVLVIPLRLGIQDINPVYIN 189
D+ SS+VFHVALDNTLV+N+VKKLCT ++ +S+ Q W+PLVLVIPLRLGI INP Y+
Sbjct: 179 DELSSLVFHVALDNTLVINEVKKLCTVMEQTNSSKQIWKPLVLVIPLRLGISAINPAYVQ 238
Query: 190 GIK 192
G+K
Sbjct: 239 GVK 241
>gi|348513452|ref|XP_003444256.1| PREDICTED: cysteine protease ATG4B-like [Oreochromis niloticus]
Length = 391
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 148/320 (46%), Positives = 194/320 (60%), Gaps = 57/320 (17%)
Query: 2 RHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL 61
+ N L+ +D +I D+TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ +AL+
Sbjct: 31 KEYNALTEKD--EILSDVTSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALV 88
Query: 62 FLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
HLGRDW+W K+ + Y+ +L F D++ + YSIHQIA G EGK +G+W+GPNTV
Sbjct: 89 CRHLGRDWRWAKGQKQRDEYISLLNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTV 148
Query: 121 AQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ-------------- 166
AQVL+KLA +D WS +V HVA+DNT+V+ ++K+LC A +
Sbjct: 149 AQVLKKLAVFDTWSKVVVHVAMDNTVVIEEIKRLCMPWLDACGELEGVGELNGCLEGACA 208
Query: 167 --------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
W+PLVL+IPLRLG+ DIN YI +K+C+ LP
Sbjct: 209 MAEEETALWRPLVLLIPLRLGLSDINDAYIETLKQCFMLP-------------------- 248
Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTY 278
QSLGVIGGKPN A YFIGYVG ++I+LDPHT Q + +DS+ D TY
Sbjct: 249 -------QSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQP---AVEPSEDSQVP-DETY 297
Query: 279 HCPQ-ASRLHILHMDPSIAV 297
HC R+HI +DPSIA
Sbjct: 298 HCQHPPCRMHICELDPSIAA 317
>gi|47564112|ref|NP_001001171.1| cysteine protease ATG4A [Bos taurus]
gi|61211781|sp|Q6PZ05.1|ATG4A_BOVIN RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related cysteine endopeptidase 2A;
Short=Autophagin-2A; AltName: Full=Autophagy-related
protein 4 homolog A; AltName: Full=bAut2A
gi|45861656|gb|AAS78581.1| Aut2a [Bos taurus]
Length = 398
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
W+S+ +V++DNT+V+ +KK+C T ++ P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPAERPLESLTASNQSKGPSACCTAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTAD-DQTFHCLQPPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|403289551|ref|XP_003935915.1| PREDICTED: cysteine protease ATG4A isoform 1 [Saimiri boliviensis
boliviensis]
Length = 422
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 68 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 127
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 128 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 187
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------------------TTNKRASSN--PQWQPL 170
W+S+ +V++DNT+V+ +KK+C + R +S P W+PL
Sbjct: 188 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTPGDRPPDSLTASNESRGTSAYCPAWKPL 247
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 248 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 280
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 281 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 336
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 337 NLDPSVAL 344
>gi|348563665|ref|XP_003467627.1| PREDICTED: cysteine protease ATG4A-like [Cavia porcellus]
Length = 398
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 199/308 (64%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC-------------------TTNKRASSN---PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T+N+ ++ P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPFSADTADKSSPDSFITSNQSKDTSAFCPAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ +++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQQMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|296470926|tpg|DAA13041.1| TPA: cysteine protease ATG4A [Bos taurus]
Length = 396
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
W+S+ +V++DNT+V+ +KK+C T ++ P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPAERPLESLTASNQSKGPSACCTAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTAD-DQTFHCLQPPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|187282046|ref|NP_001119770.1| uncharacterized protein LOC678769 [Rattus norvegicus]
gi|169642267|gb|AAI60890.1| LOC678769 protein [Rattus norvegicus]
Length = 406
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 200/318 (62%), Gaps = 56/318 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQV++KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC---------------------TTNKRASSNP--QWQP 169
W+S+ +V++DNT+V+ +KK+C ++ + +S P W+P
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKP 223
Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
L+L++PLRLGI INPVYI K+C F PQSLG
Sbjct: 224 LLLIVPLRLGINQINPVYIEAFKEC---------------------------FKMPQSLG 256
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHI 288
+GGKPN+A YFIG +G+++IFLDPHT Q + ++S D T+HC Q+ R+ I
Sbjct: 257 ALGGKPNNAYYFIGSLGDELIFLDPHTTQT----FVDTEESGLVDDHTFHCLQSPQRMSI 312
Query: 289 LHMDPSIAVVSQRSYSDY 306
L++DPS+A+V Q ++ +
Sbjct: 313 LNLDPSVALVGQGAFMGF 330
>gi|345807894|ref|XP_538136.3| PREDICTED: cysteine protease ATG4A [Canis lupus familiaris]
Length = 398
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 194/308 (62%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI +RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDIRARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPREYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCT----------------------TNKRASSNPQWQPL 170
W+S+ +V++DNT+V+ +KK+C + +S P W+PL
Sbjct: 164 WNSLAIYVSMDNTVVIEDIKKMCCVLPLSADTIGESPLNTLNASNQSKSAPASCPAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|426257739|ref|XP_004022480.1| PREDICTED: cysteine protease ATG4A [Ovis aries]
Length = 398
Score = 285 bits (728), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
W+S+ +V++DNT+V+ +KK+C T ++ P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPAERPLESLTASTQSKGPSACCTAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQPPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|151554833|gb|AAI47963.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Bos taurus]
Length = 398
Score = 285 bits (728), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
W+S+ +V++DNT+V+ +KK+C T ++ P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPAERPLESLTASNQSKGPSACCTAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQPPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|387762879|ref|NP_001248420.1| cysteine protease ATG4A [Macaca mulatta]
gi|380809390|gb|AFE76570.1| cysteine protease ATG4A isoform a [Macaca mulatta]
gi|383413573|gb|AFH30000.1| cysteine protease ATG4A isoform a [Macaca mulatta]
Length = 398
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 141/308 (45%), Positives = 198/308 (64%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T R +SN P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPLDYLTASNQSKGTSAHCPAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|355705060|gb|EHH30985.1| Cysteine protease ATG4A, partial [Macaca mulatta]
Length = 396
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 141/308 (45%), Positives = 198/308 (64%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 42 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 101
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 102 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 161
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T R +SN P W+PL
Sbjct: 162 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPLDYLTASNQSKGTSAHCPAWKPL 221
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 222 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 254
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 255 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 310
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 311 NLDPSVAL 318
>gi|410920724|ref|XP_003973833.1| PREDICTED: cysteine protease ATG4B-like [Takifugu rubripes]
Length = 394
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 149/312 (47%), Positives = 187/312 (59%), Gaps = 58/312 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+I D+TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ QAL+ HLGRDW+W
Sbjct: 40 EEILSDVTSRLWFTYRKSFPPIGGTGPTSDTGWGCMLRCGQMILGQALMCRHLGRDWRWV 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
K+ + Y+ IL F D++ + YSIHQIA G EGK +G+W+GPNTVAQVL+KLA +D
Sbjct: 100 RGQKQRQEYISILNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCT---TNKRASSNPQ---------------------- 166
WS +V HVA+DNT+V+ ++K+LC P+
Sbjct: 160 TWSRLVVHVAMDNTVVIEEIKRLCMPWLDKAEVFGEPERVGELNGCLEGACALSEEEVAL 219
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W+PLVL+IPLRLG+ DIN YI +KKC+ LP Q
Sbjct: 220 WKPLVLLIPLRLGLSDINGAYIETLKKCFMLP---------------------------Q 252
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
SLGVIGGKPN A YFIGYVG ++I+LDPHT Q + Q D TYHC R
Sbjct: 253 SLGVIGGKPNSAHYFIGYVGGELIYLDPHTTQTAVEPCEHGQFP----DDTYHCQHPPCR 308
Query: 286 LHILHMDPSIAV 297
+HI +DPSIAV
Sbjct: 309 MHICELDPSIAV 320
>gi|344286328|ref|XP_003414911.1| PREDICTED: cysteine protease ATG4A [Loxodonta africana]
Length = 411
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 196/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 57 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 116
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 117 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 176
Query: 133 WSSIVFHVALDNTLVVNQVKKLCT----------------------TNKRASSNPQWQPL 170
W+S+ +V++DNT+V+ +KK+C + ++ P W+PL
Sbjct: 177 WNSLAVYVSMDNTVVIEDIKKMCCVFPLSAGAAGESPPAFPSASSQSKGTSACCPAWKPL 236
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 237 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 269
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 270 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGMVD-DQTFHCLQSPQRMNIL 325
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 326 NLDPSVAL 333
>gi|355669953|gb|AER94691.1| ATG4 autophagy related 4-like protein A [Mustela putorius furo]
Length = 408
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 196/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 54 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 113
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 114 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 173
Query: 133 WSSIVFHVALDNTLVVNQVKKLC----------------TTNKRASSN------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T N S P W+PL
Sbjct: 174 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTVGESPPDTLNASNQSKGTPAGRPAWKPL 233
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 234 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 266
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 267 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 322
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 323 NLDPSVAL 330
>gi|402911087|ref|XP_003918174.1| PREDICTED: cysteine protease ATG4A isoform 1 [Papio anubis]
Length = 398
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 141/308 (45%), Positives = 198/308 (64%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T R +SN P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPLDYLTASNQSKGTSAHCPAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGMVN-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|417410362|gb|JAA51656.1| Putative cysteine protease required for autophagy, partial
[Desmodus rotundus]
Length = 396
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 42 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 101
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 102 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 161
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------------------TTNKRASSN--PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T + +A S P W+PL
Sbjct: 162 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSADMPSESSHDPLNATNHNKAISACCPAWKPL 221
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 222 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 254
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R+ IL
Sbjct: 255 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGMVD-DQTFHCLQSPQRMSIL 310
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 311 NLDPSVAL 318
>gi|410989157|ref|XP_004000831.1| PREDICTED: cysteine protease ATG4A isoform 1 [Felis catus]
Length = 398
Score = 283 bits (725), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 197/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC-------------TTNKRASSN---------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T +SN P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPSSADTVGESTPGTLNASNQSRGTFACCPAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q + +++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQ----TFVNTEENGTVDDQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|345329187|ref|XP_003431344.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4A-like
[Ornithorhynchus anatinus]
Length = 436
Score = 283 bits (725), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 196/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 83 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWCWEK 142
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
+ K+ E Y KIL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 143 HKKQPEEYHKILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 202
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ----------------------WQPL 170
W+S+ +V++DNT+V+ +KK+C + S Q W+PL
Sbjct: 203 WNSLAVYVSMDNTVVIEDIKKMCRLLPQGSGMAQDGPPLHLSALGRSKNASGYCAIWKPL 262
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INP+YI+ K+C F PQSLG
Sbjct: 263 LLIVPLRLGINHINPIYIDAFKEC---------------------------FKTPQSLGA 295
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++GN++I+LDPHT Q D E++ + D ++HC QA R+ I+
Sbjct: 296 LGGKPNNAYYFIGFLGNELIYLDPHTTQTF---VDTEENGQVD-DHSFHCQQAPQRMKIM 351
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 352 NLDPSVAL 359
>gi|301780424|ref|XP_002925628.1| PREDICTED: cysteine protease ATG4A-like [Ailuropoda melanoleuca]
Length = 429
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 195/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 75 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 134
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 135 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 194
Query: 133 WSSIVFHVALDNTLVVNQVKKLC----------------TTNKRASSN------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T N S P W+PL
Sbjct: 195 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPL 254
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 255 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 287
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R+ IL
Sbjct: 288 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMSIL 343
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 344 NLDPSVAL 351
>gi|281342750|gb|EFB18334.1| hypothetical protein PANDA_015152 [Ailuropoda melanoleuca]
Length = 373
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 195/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 42 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 101
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 102 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 161
Query: 133 WSSIVFHVALDNTLVVNQVKKLC----------------TTNKRASSN------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T N S P W+PL
Sbjct: 162 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSAATVGESPPDTLNASNQSKGTPAGCPAWKPL 221
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 222 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 254
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R+ IL
Sbjct: 255 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMSIL 310
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 311 NLDPSVAL 318
>gi|397497900|ref|XP_003819741.1| PREDICTED: cysteine protease ATG4A isoform 1 [Pan paniscus]
Length = 398
Score = 283 bits (724), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
W+S+ +V++DNT+V+ +KK+C ++ P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTPGDRPPDSLTASNQSKGTSAYCSAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|354500801|ref|XP_003512485.1| PREDICTED: cysteine protease ATG4A-like [Cricetulus griseus]
gi|344251116|gb|EGW07220.1| Cysteine protease ATG4A [Cricetulus griseus]
Length = 398
Score = 283 bits (723), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 194/308 (62%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTT----------------------NKRASSNPQWQPL 170
W+S+ +V++DNT+V+ +KK+C +++ P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAHTADESPPDSLPASSQGKGPSATCPAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVYI K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYIEAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q + ++S D T+HC Q+ R+ IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQT----FVDTEESGIVDDETFHCLQSPQRMSIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|62860068|ref|NP_001016619.1| autophagy related 4A, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|89269917|emb|CAJ81691.1| APG4 autophagy 4 homolog A (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
gi|171846953|gb|AAI61565.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
gi|213625518|gb|AAI70776.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
gi|213627145|gb|AAI70802.1| ATG4 autophagy related 4 homolog A [Xenopus (Silurana) tropicalis]
Length = 395
Score = 282 bits (721), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 137/305 (44%), Positives = 189/305 (61%), Gaps = 57/305 (18%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
++ DI SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW+W +
Sbjct: 49 LQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWRWEKH 108
Query: 75 SKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
+ E Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+W
Sbjct: 109 KEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 168
Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ--------------------WQPLVLV 173
+S+ +V++DNT+V+ +K +C + S Q W+PL+L+
Sbjct: 169 NSLAVYVSMDNTVVIEDIKTMCKYQPHSCSMAQAASYQSTWSRCRDASGQSSGWRPLLLI 228
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PLRLGI INPVY++ K C F PQSLG +GG
Sbjct: 229 VPLRLGINHINPVYVDAFKAC---------------------------FKMPQSLGALGG 261
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
KPNHA YFIG+ G+++I+LDPHT Q D++Q TYHC + + + +L++D
Sbjct: 262 KPNHAYYFIGFSGDEIIYLDPHTTQTFVDTEDQDQ--------TYHCQKGPNSMKVLNLD 313
Query: 293 PSIAV 297
PS+A+
Sbjct: 314 PSVAL 318
>gi|432853687|ref|XP_004067831.1| PREDICTED: cysteine protease ATG4B-like [Oryzias latipes]
Length = 390
Score = 282 bits (721), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 144/308 (46%), Positives = 188/308 (61%), Gaps = 54/308 (17%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
EQ+ D+ SRLWFTYRK F PIG +G T+D GWGCMLRCGQM++A+AL+ HLGRDW+W
Sbjct: 40 EQLLSDVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILAEALMCRHLGRDWRWA 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ E Y+ IL F D++ + YSIHQIA G EGK +G+W+GPNTVAQVL+KLA +D
Sbjct: 100 RGRRQREEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKPIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCT----TNKRASS-----------------NPQWQPL 170
WS + HVA+DNT+++ ++K+LC R + W+PL
Sbjct: 160 TWSRLAVHVAMDNTVIIEEIKRLCMPWLDIGDREEAGELNGCLEGACALVEEETALWKPL 219
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
VL+IPLRLG+ DIN YI+ +K+C+ L PQSLGV
Sbjct: 220 VLLIPLRLGLSDINEAYIDTLKQCFML---------------------------PQSLGV 252
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
IGGKPN A YFIGYVG ++I+LDPHT Q + +D + D TYHC R+HI
Sbjct: 253 IGGKPNSAHYFIGYVGEELIYLDPHTTQP---AVEPSEDGQVP-DETYHCQHPPCRMHIC 308
Query: 290 HMDPSIAV 297
+DPSIA
Sbjct: 309 ELDPSIAA 316
>gi|332226092|ref|XP_003262223.1| PREDICTED: cysteine protease ATG4A isoform 1 [Nomascus leucogenys]
Length = 398
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCT----TNKRASSNP------------------QWQPL 170
W+S+ +V++DNT+V+ +KK+C + A P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|30795252|ref|NP_443168.2| cysteine protease ATG4A isoform a [Homo sapiens]
gi|426397036|ref|XP_004064734.1| PREDICTED: cysteine protease ATG4A isoform 1 [Gorilla gorilla
gorilla]
gi|61211859|sp|Q8WYN0.1|ATG4A_HUMAN RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
cysteine endopeptidase; AltName: Full=Autophagin-2;
AltName: Full=Autophagy-related cysteine endopeptidase
2; AltName: Full=Autophagy-related protein 4 homolog A;
Short=hAPG4A
gi|18181956|dbj|BAB83889.1| Apg4A [Homo sapiens]
gi|27763979|emb|CAD43218.1| autophagin-2 [Homo sapiens]
gi|38197608|gb|AAH61696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
gi|119623094|gb|EAX02689.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|189069378|dbj|BAG37044.1| unnamed protein product [Homo sapiens]
gi|312151352|gb|ADQ32188.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [synthetic
construct]
Length = 398
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T R +SN W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|350537069|ref|NP_001233457.1| cysteine protease ATG4A [Pan troglodytes]
gi|343958112|dbj|BAK62911.1| cysteine protease ATG4A [Pan troglodytes]
gi|410207960|gb|JAA01199.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410248796|gb|JAA12365.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410290856|gb|JAA24028.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
gi|410329967|gb|JAA33930.1| ATG4 autophagy related 4 homolog A [Pan troglodytes]
Length = 398
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 194/308 (62%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
W+S+ +V++DNT+V+ +KK+C + P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSIDTPGDRPPDSLTASNQSKGTSAYCSAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|119623100|gb|EAX02695.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_f
[Homo sapiens]
Length = 402
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 48 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 107
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 108 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 167
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T R +SN W+PL
Sbjct: 168 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPL 227
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 228 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 260
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 261 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 316
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 317 NLDPSVAL 324
>gi|15487240|emb|CAC69076.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
Length = 398
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSNPQ---------WQPL 170
W+S+ +V++DNT+V+ +KK+C T R +SN W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCTAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|163914473|ref|NP_001106295.1| APG4A protein [Xenopus laevis]
gi|161611704|gb|AAI55873.1| APG4A protein [Xenopus laevis]
Length = 395
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 137/305 (44%), Positives = 193/305 (63%), Gaps = 53/305 (17%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
++ DI SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDWQW +
Sbjct: 45 LQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWEKH 104
Query: 75 SKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
+ E Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+W
Sbjct: 105 KEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 164
Query: 134 SSIVFHVALDNTLVVNQVKKLC------------TTNKRASSNPQ--------WQPLVLV 173
+S+ +V++DNT+V+ +K +C +++R S + W+PL+L+
Sbjct: 165 NSLAVYVSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLEQSSGWRPLLLI 224
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PLRLGI INPVY++ K C F PQSLG +GG
Sbjct: 225 VPLRLGINHINPVYVDAFKAC---------------------------FKMPQSLGALGG 257
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
KPNHA YFIG+ G+++I+LDPHT Q + + +++ D TYHC + + + +L +D
Sbjct: 258 KPNHAYYFIGFSGDEIIYLDPHTTQ----TFVETEEAGTVQDQTYHCQKGPNSMKVLKLD 313
Query: 293 PSIAV 297
PS+A+
Sbjct: 314 PSVAL 318
>gi|50417810|gb|AAH78135.1| APG4A protein, partial [Xenopus laevis]
Length = 392
Score = 281 bits (719), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 137/305 (44%), Positives = 193/305 (63%), Gaps = 53/305 (17%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
++ DI SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDWQW +
Sbjct: 42 LQSDIVSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICQHLGRDWQWEKH 101
Query: 75 SKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
+ E Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+W
Sbjct: 102 KEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 161
Query: 134 SSIVFHVALDNTLVVNQVKKLC------------TTNKRASSNPQ--------WQPLVLV 173
+S+ +V++DNT+V+ +K +C +++R S + W+PL+L+
Sbjct: 162 NSLAVYVSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLEQSSGWRPLLLI 221
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PLRLGI INPVY++ K C F PQSLG +GG
Sbjct: 222 VPLRLGINHINPVYVDAFKAC---------------------------FKMPQSLGALGG 254
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
KPNHA YFIG+ G+++I+LDPHT Q + + +++ D TYHC + + + +L +D
Sbjct: 255 KPNHAYYFIGFSGDEIIYLDPHTTQ----TFVETEEAGTVQDQTYHCQKGPNSMKVLKLD 310
Query: 293 PSIAV 297
PS+A+
Sbjct: 311 PSVAL 315
>gi|74136555|ref|NP_777364.3| cysteine protease ATG4A [Mus musculus]
gi|61211821|sp|Q8C9S8.2|ATG4A_MOUSE RecName: Full=Cysteine protease ATG4A; AltName: Full=AUT-like 2
cysteine endopeptidase; AltName: Full=Autophagin-2;
AltName: Full=Autophagy-related cysteine endopeptidase
2; AltName: Full=Autophagy-related protein 4 homolog A
gi|59809037|gb|AAH89500.1| Atg4a protein [Mus musculus]
gi|74193939|dbj|BAE36898.1| unnamed protein product [Mus musculus]
Length = 396
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 135/305 (44%), Positives = 194/305 (63%), Gaps = 52/305 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQV++KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP-------------------QWQPLVLV 173
W+S+ +V++DNT+V+ +KK+C +++P W+PL+L+
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLI 223
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PLRLGI INPVY+ K+C F PQSLG +GG
Sbjct: 224 VPLRLGINQINPVYVEAFKEC---------------------------FKMPQSLGALGG 256
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
KPN+A YFIG++G+++IFLDPHT Q + ++S D T+HC Q+ R+ IL++D
Sbjct: 257 KPNNAYYFIGFLGDELIFLDPHTTQT----FVDIEESGLVDDQTFHCLQSPQRMSILNLD 312
Query: 293 PSIAV 297
PS+A+
Sbjct: 313 PSVAL 317
>gi|349605276|gb|AEQ00569.1| Cysteine protease ATG4A-like protein, partial [Equus caballus]
Length = 369
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 198/308 (64%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRD W
Sbjct: 15 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDLNWEK 74
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 75 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 134
Query: 133 WSSIVFHVALDNTLVVNQVKKLC-------------------TTNKRASSN---PQWQPL 170
W+S+ +V++DNT+V+ +KK+C +N+ S++ P W+PL
Sbjct: 135 WNSLAVYVSMDNTVVIEDIKKMCCILPLSADTAGESPPSSLNASNRSKSTSAGWPAWKPL 194
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 195 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 227
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 228 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNIL 283
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 284 NLDPSVAL 291
>gi|440891575|gb|ELR45180.1| Cysteine protease ATG4A, partial [Bos grunniens mutus]
Length = 408
Score = 281 bits (718), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 137/305 (44%), Positives = 192/305 (62%), Gaps = 55/305 (18%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 42 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 101
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 102 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 161
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
W+S+ +V++DNT+V+ +KK+C T ++ P W+PL
Sbjct: 162 WNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPAERPLESLTASNQSKGPSACCTAWKPL 221
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 222 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 254
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q R++IL
Sbjct: 255 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD-DQTFHCLQPPQRMNIL 310
Query: 290 HMDPS 294
++DPS
Sbjct: 311 NLDPS 315
>gi|146387686|pdb|2P82|A Chain A, Cysteine Protease Atg4a
gi|146387687|pdb|2P82|B Chain B, Cysteine Protease Atg4a
gi|146387688|pdb|2P82|C Chain C, Cysteine Protease Atg4a
gi|146387689|pdb|2P82|D Chain D, Cysteine Protease Atg4a
Length = 355
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 40 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 99
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 100 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 159
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T R +SN W+PL
Sbjct: 160 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPL 219
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 220 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 252
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 253 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 308
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 309 NLDPSVAL 316
>gi|213626921|gb|AAI70397.1| APG4A protein [Xenopus laevis]
Length = 395
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 139/305 (45%), Positives = 192/305 (62%), Gaps = 53/305 (17%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
++ DI SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDWQW +
Sbjct: 45 LQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALICQHLGRDWQWEKH 104
Query: 75 SKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
+ E Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+W
Sbjct: 105 KEHPEEYRQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 164
Query: 134 SSIVFHVALDNTLVVNQVKKLC------------TTNKRASSNPQ--------WQPLVLV 173
+S+ +V++DNT+V+ +K +C +++R S + W+PL+L+
Sbjct: 165 NSLAVYVSMDNTVVIEDIKTMCKYQPHNHSMAHAASHQRTWSRCRDTLEQSSGWRPLLLI 224
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PLRLGI INPVY++ K C F PQSLG +GG
Sbjct: 225 VPLRLGINHINPVYVDAFKAC---------------------------FKMPQSLGALGG 257
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
KPNHA YFIG+ G+++I+LDPHT Q D E+ + D TYHC + + + +L +D
Sbjct: 258 KPNHAYYFIGFSGDEIIYLDPHTTQTF---VDTEEAGTVQ-DQTYHCQKGPNSMKVLKLD 313
Query: 293 PSIAV 297
PS+A+
Sbjct: 314 PSVAL 318
>gi|321472665|gb|EFX83634.1| hypothetical protein DAPPUDRAFT_194862 [Daphnia pulex]
Length = 389
Score = 279 bits (714), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 147/302 (48%), Positives = 204/302 (67%), Gaps = 36/302 (11%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L+++ D+ SRL TYR+ F PIGDSG+T+D+GWGCMLRCGQMV+AQAL+ HLGR W
Sbjct: 66 LDELNSDVKSRLLLTYRRNFPPIGDSGMTSDRGWGCMLRCGQMVVAQALINQHLGRQPFW 125
Query: 72 NVNSKE---EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
V + E+Y KILK+FED++TA YSIHQ+A G SEGK +G+WFGPNTVAQVL+KL+
Sbjct: 126 PVGDDQRTTESYKKILKLFEDKKTAVYSIHQLAQMGVSEGKEIGQWFGPNTVAQVLKKLS 185
Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYI 188
+YD+WS++ HVA+DN +V+ ++++LC + W PL+LV+PLRLG+ +INP+YI
Sbjct: 186 EYDEWSALKIHVAMDNAVVIEEIEQLCHKKITPTETSTWSPLLLVVPLRLGLLNINPIYI 245
Query: 189 NGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
+ +K C + PQS+G+IGGKP+ ALYFIGYVG+D
Sbjct: 246 DSLKACLQM---------------------------PQSIGMIGGKPSQALYFIGYVGDD 278
Query: 249 VIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV-SQRSYSDYK 307
V+FLDPH QN + + E D DS+YH +R+ MDPS+AV S ++S++K
Sbjct: 279 VVFLDPHLTQNAIDLDEDEFD-----DSSYHPATCARISFQSMDPSLAVCFSCTTHSEWK 333
Query: 308 NV 309
++
Sbjct: 334 DL 335
>gi|395851538|ref|XP_003798310.1| PREDICTED: cysteine protease ATG4B [Otolemur garnettii]
Length = 393
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E++ D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 EELLSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCQHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ------------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ + P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSLPCGTAPASSAAPDQHCNGFPAGAEVTTRLSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K+C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINAAYVETLKRC---------------------------FRMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EATDSCLVPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIGELDPSIAV 319
>gi|27763985|emb|CAD43221.1| autophagin-2 [Mus musculus]
gi|148675648|gb|EDL07595.1| mCG64870 [Mus musculus]
Length = 396
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 134/305 (43%), Positives = 193/305 (63%), Gaps = 52/305 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQV++KL +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLTLFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP-------------------QWQPLVLV 173
W+S+ +V++DNT+V+ +KK+C +++P W+PL+L+
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAADPAGDFLTVSNQSRDTSVPCSAWKPLLLI 223
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PLRLGI INPVY+ K+C F PQSLG +GG
Sbjct: 224 VPLRLGINQINPVYVEAFKEC---------------------------FKMPQSLGALGG 256
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
KPN+A YFIG++G+++IFLDPHT Q + ++S D T+HC Q+ R+ IL++D
Sbjct: 257 KPNNAYYFIGFLGDELIFLDPHTTQT----FVDIEESGLVDDQTFHCLQSPQRMSILNLD 312
Query: 293 PSIAV 297
PS+A+
Sbjct: 313 PSVAL 317
>gi|197100863|ref|NP_001126588.1| cysteine protease ATG4A [Pongo abelii]
gi|61211744|sp|Q5R699.1|ATG4A_PONAB RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|55732020|emb|CAH92717.1| hypothetical protein [Pongo abelii]
Length = 398
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 196/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T R +SN W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLGADTAGDRPPDSLTASNLSKGTSAYCSAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D ++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTGENGTVN-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>gi|345307034|ref|XP_001513122.2| PREDICTED: cysteine protease ATG4B-like [Ornithorhynchus anatinus]
Length = 461
Score = 278 bits (711), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 145/310 (46%), Positives = 188/310 (60%), Gaps = 56/310 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E I D+TSRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQALL HLGRDW+W
Sbjct: 109 EDILSDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHLGRDWRWK 168
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ + Y +L F D++ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 169 KGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAAFD 228
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN--KRASSNP---------------------QWQ 168
WSS+ H+A+DNT+V+ ++++LC N AS+ P QW+
Sbjct: 229 TWSSLAVHIAMDNTVVIEEIRRLCKPNFPAGASAFPTDSEFLLNGFPSGAEVTNRPTQWK 288
Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
PLVL+IPLRLG+ +IN YI +K C F PQSL
Sbjct: 289 PLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQSL 321
Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLH 287
GVIGGKPN A YFIGYVG ++I+LDPHT Q + S D ++HC R++
Sbjct: 322 GVIGGKPNSAHYFIGYVGGELIYLDPHTTQPAVEI----SGSCFIPDESFHCQHPPCRMN 377
Query: 288 ILHMDPSIAV 297
I+ +DPSIAV
Sbjct: 378 IVELDPSIAV 387
>gi|26334447|dbj|BAC30924.1| unnamed protein product [Mus musculus]
Length = 396
Score = 278 bits (711), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 134/305 (43%), Positives = 193/305 (63%), Gaps = 52/305 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQV++KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP-------------------QWQPLVLV 173
W+S+ + ++DNT+V+ +KK+C +++P W+PL+L+
Sbjct: 164 WNSLAVYDSMDNTVVIEDIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLI 223
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PLRLGI INPVY+ K+C F PQSLG +GG
Sbjct: 224 VPLRLGINQINPVYVEAFKEC---------------------------FKMPQSLGALGG 256
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
KPN+A YFIG++G+++IFLDPHT Q + ++S D T+HC Q+ R+ IL++D
Sbjct: 257 KPNNAYYFIGFLGDELIFLDPHTTQTFVDI----EESGLVDDQTFHCLQSPQRMSILNLD 312
Query: 293 PSIAV 297
PS+A+
Sbjct: 313 PSVAL 317
>gi|148237097|ref|NP_001082821.1| cysteine protease ATG4B [Danio rerio]
gi|141795460|gb|AAI34887.1| Atg4b protein [Danio rerio]
Length = 394
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 195/324 (60%), Gaps = 61/324 (18%)
Query: 2 RHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL 61
R + L+ +D I D+TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ +AL+
Sbjct: 31 RQFSALTEKD--DILADVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALI 88
Query: 62 FLHLGRDWQWNVNSKEE-AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
HLGRDW+W+ ++ Y+ IL F D++ + YSIHQIA G EGK++G+W+GPNTV
Sbjct: 89 CRHLGRDWKWSPGQRQRPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTV 148
Query: 121 AQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT----NKRA---SSNPQ------- 166
AQVL+KLA +D WS + HVA+DNT+V+ ++K+LC ++ A S P+
Sbjct: 149 AQVLKKLAVFDSWSRLAVHVAMDNTVVIEEIKRLCMPWLDFDRGACAVSEEPREMNGDLE 208
Query: 167 ------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
W+PLVL+IPLRLG+ DIN YI +K+C
Sbjct: 209 GACALAEEETALWKPLVLLIPLRLGLSDINEAYIEPLKQC-------------------- 248
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
F PQSLGVIGGKPN A YFIG+VG+++I+LDPHT Q D +D
Sbjct: 249 -------FMMPQSLGVIGGKPNSAHYFIGFVGDELIYLDPHTTQP---AVDPSEDGHFP- 297
Query: 275 DSTYHCPQ-ASRLHILHMDPSIAV 297
D +YHC R+HI +DPSIA
Sbjct: 298 DDSYHCQHPPCRMHICELDPSIAA 321
>gi|53132082|emb|CAG31871.1| hypothetical protein RCJMB04_12m14 [Gallus gallus]
Length = 343
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 189/313 (60%), Gaps = 58/313 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+I D+TSRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 39 EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWI 98
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ + Y +L F D++ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99 KGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFD 158
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------- 166
WSS+ H+A+DNT+V+ ++++LC +N A++ P
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPAVEADVLYNGYPEEAGVRDKLSL 218
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W+PLVL+IPLRLG+ +IN YI +K C F PQ
Sbjct: 219 WKPLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQ 251
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
SLGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPSDSGCLPDESFHCQHPPCR 307
Query: 286 LHILHMDPSIAVV 298
+ I +DPSIAVV
Sbjct: 308 MSIAELDPSIAVV 320
>gi|66773074|ref|NP_001019605.1| cysteine protease ATG4A [Danio rerio]
gi|66267494|gb|AAH95617.1| Zgc:111958 [Danio rerio]
Length = 375
Score = 277 bits (709), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 138/289 (47%), Positives = 188/289 (65%), Gaps = 36/289 (12%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ D+ SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW+W+
Sbjct: 41 ELLSDVRSRLWFTYRKKFSPIGGTGPSSDAGWGCMLRCGQMILAQALICSHLGRDWRWDP 100
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
+ + Y +IL F D++ + YSIHQ+A G EGK+VGEW+GPNTVAQVL+KLA +DD
Sbjct: 101 EKHQPKEYQRILDCFLDKKDSCYSIHQMAQMGVGEGKSVGEWYGPNTVAQVLKKLALFDD 160
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTN--KRASSNP-QWQPLVLVIPLRLGIQDINPVYIN 189
W+S+ +V++DNT+V+ +KKLC + S P W+PL+LVIPLR+GI INPVYI
Sbjct: 161 WNSLSVYVSMDNTVVIEDIKKLCVRADLQLQSQQPLDWRPLLLVIPLRMGINSINPVYIQ 220
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+C F PQS GV+GGKPN A YFIG++ +++
Sbjct: 221 ALKEC---------------------------FKMPQSCGVLGGKPNLAYYFIGFIDDEL 253
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RLHILHMDPSIAV 297
I+LDPHT Q D E S D ++HC + R+ I +DPS+A+
Sbjct: 254 IYLDPHTTQQ---AVDTESGSAVD-DQSFHCQRTPHRMKITSLDPSVAL 298
>gi|50369556|gb|AAH76463.1| Atg4b protein, partial [Danio rerio]
Length = 393
Score = 276 bits (707), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 190/313 (60%), Gaps = 59/313 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
+ I D+TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ +AL+ HLGRDW+W+
Sbjct: 39 DDILADVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWS 98
Query: 73 VNSKEE-AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ Y+ IL F D++ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99 PGQRQRPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 158
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTT----NKRA---SSNPQ------------------ 166
WS + HVA+DNT+V+ ++K+LC ++ A S P+
Sbjct: 159 SWSRLAVHVAMDNTVVIEEIKRLCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETA 218
Query: 167 -WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
W+PLVL+IPLRLG+ DIN YI +K+C F P
Sbjct: 219 LWKPLVLLIPLRLGLSDINEAYIEPLKQC---------------------------FMMP 251
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-AS 284
QSLGVIGGKPN A YFIG+VG+++I+LDPHT Q D +D D +YHC
Sbjct: 252 QSLGVIGGKPNSAHYFIGFVGDELIYLDPHTTQP---AVDPSEDGHFP-DDSYHCQHPPC 307
Query: 285 RLHILHMDPSIAV 297
R+HI +DPSIA
Sbjct: 308 RMHICELDPSIAA 320
>gi|61211768|sp|Q6DG88.2|ATG4B_DANRE RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
Length = 394
Score = 276 bits (707), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 190/313 (60%), Gaps = 59/313 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
+ I D+TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ +AL+ HLGRDW+W+
Sbjct: 40 DDILADVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWS 99
Query: 73 VNSKEE-AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ Y+ IL F D++ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 PGQRQRPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTT----NKRA---SSNPQ------------------ 166
WS + HVA+DNT+V+ ++K+LC ++ A S P+
Sbjct: 160 SWSRLAVHVAMDNTVVIEEIKRLCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETA 219
Query: 167 -WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
W+PLVL+IPLRLG+ DIN YI +K+C F P
Sbjct: 220 LWKPLVLLIPLRLGLSDINEAYIEPLKQC---------------------------FMMP 252
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-AS 284
QSLGVIGGKPN A YFIG+VG+++I+LDPHT Q D +D D +YHC
Sbjct: 253 QSLGVIGGKPNSAHYFIGFVGDELIYLDPHTTQP---AVDPSEDGHFP-DDSYHCQHPPC 308
Query: 285 RLHILHMDPSIAV 297
R+HI +DPSIA
Sbjct: 309 RMHICELDPSIAA 321
>gi|301775535|ref|XP_002923195.1| PREDICTED: cysteine protease ATG4B-like [Ailuropoda melanoleuca]
Length = 405
Score = 276 bits (706), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 189/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F PIG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 52 DEILSDVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 111
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 112 QRKRQPDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 171
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCT--------------TNKRASSNPQ----------W 167
WS++ H+A+DNT+V+ +++LC+ +++ + P W
Sbjct: 172 TWSALAVHIAMDNTVVMEDIRRLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPW 231
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K+C F PQS
Sbjct: 232 RPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FMMPQS 264
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGY G ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 265 LGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPAV----ELTDSCFIADESFHCRHPPSRM 320
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 321 SIGELDPSIAV 331
>gi|332266032|ref|XP_003282019.1| PREDICTED: cysteine protease ATG4B [Nomascus leucogenys]
Length = 518
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 188/312 (60%), Gaps = 57/312 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 163 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 222
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 223 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 282
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 283 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 342
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 343 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 375
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 376 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 431
Query: 287 HILHMDPSIAVV 298
I +DPSIAVV
Sbjct: 432 SIAELDPSIAVV 443
>gi|47087191|ref|NP_998738.1| cysteine protease ATG4B [Gallus gallus]
gi|61211779|sp|Q6PZ02.1|ATG4B_CHICK RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related cysteine endopeptidase 2B;
Short=Autophagin-2B; Short=cAut2B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|45861662|gb|AAS78584.1| AUT2B [Gallus gallus]
Length = 393
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 188/312 (60%), Gaps = 58/312 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+I D+TSRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 39 EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWI 98
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ + Y +L F D++ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99 KGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFD 158
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------- 166
WSS+ H+A+DNT+V+ ++++LC +N A++ P
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPAVEADVLYNGYPEEAGVRDKLSL 218
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W+PLVL+IPLRLG+ +IN YI +K C F PQ
Sbjct: 219 WKPLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQ 251
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
SLGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPSDSGCLPDESFHCQHPPCR 307
Query: 286 LHILHMDPSIAV 297
+ I +DPSIAV
Sbjct: 308 MSIAELDPSIAV 319
>gi|156395764|ref|XP_001637280.1| predicted protein [Nematostella vectensis]
gi|156224391|gb|EDO45217.1| predicted protein [Nematostella vectensis]
Length = 368
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 136/288 (47%), Positives = 181/288 (62%), Gaps = 45/288 (15%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+ + D+ SR+W TYRK F IG +G TTD GWGCMLRCGQM++AQAL+ HLGRDWQ
Sbjct: 43 DMGYLNTDVRSRIWLTYRKNFPKIGGTGPTTDSGWGCMLRCGQMMLAQALVCRHLGRDWQ 102
Query: 71 WN-VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W+ N+ Y++IL+ F D++ + YSIHQIA G SEGKAVG WFGPNTVAQVL+KL+
Sbjct: 103 WDPENNTTPEYMQILEAFLDKKDSLYSIHQIAQMGVSEGKAVGSWFGPNTVAQVLKKLSA 162
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+DDWSS+ HVA+DNT+++ + W+PLVL IPLRLG+ ++N VY
Sbjct: 163 FDDWSSLCLHVAMDNTVIIEDISN-------------WRPLVLFIPLRLGLTEMNVVYNE 209
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K C FTF QSLG+IGG+PNHA YFIGY GN++
Sbjct: 210 PLKAC---------------------------FTFKQSLGIIGGRPNHATYFIGYFGNNL 242
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPHT Q + + D ++HC R++I +DPS+A+
Sbjct: 243 VYLDPHTTQQTV----NPDELSRIPDGSFHCVYPCRMNIADVDPSVAL 286
>gi|326925776|ref|XP_003209085.1| PREDICTED: cysteine protease ATG4B-like [Meleagris gallopavo]
Length = 393
Score = 275 bits (704), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 188/312 (60%), Gaps = 58/312 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+I D+TSRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 39 EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWI 98
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ + Y +L F D++ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99 KGKRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFD 158
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------- 166
WSS+ H+A+DNT+V+ ++++LC +N A++ P
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPTVEADVLYNGYPEEAGVRDKLSL 218
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W+PLVL+IPLRLG+ +IN YI +K C F PQ
Sbjct: 219 WKPLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQ 251
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
SLGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPSDSGCLPDESFHCQHPPCR 307
Query: 286 LHILHMDPSIAV 297
+ I +DPSIAV
Sbjct: 308 MSIAELDPSIAV 319
>gi|449266947|gb|EMC77925.1| Cysteine protease ATG4B, partial [Columba livia]
Length = 393
Score = 275 bits (704), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 187/312 (59%), Gaps = 58/312 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+I D+TSRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 39 EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWI 98
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ + Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99 KGKRQVDNYFSVLNAFVDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFD 158
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------------SSNPQ---------- 166
WSS+ H+A+DNT+V+ ++++LC +N + P+
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCQSNAPCAGAAACPAVESDGLYNGCPEEAGVRDRRSL 218
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W+PLVL+IPLRLG+ +IN YI +K C F PQ
Sbjct: 219 WKPLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQ 251
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
SLGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EHNDSGCLPDESFHCQHPPCR 307
Query: 286 LHILHMDPSIAV 297
+ I +DPSIAV
Sbjct: 308 MSIAELDPSIAV 319
>gi|417410350|gb|JAA51650.1| Putative cysteine protease required for autophagy, partial
[Desmodus rotundus]
Length = 394
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 188/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQALL HLGRDW+W
Sbjct: 41 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHLGRDWRWT 100
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 101 QRKRQPDSYFHVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 160
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WS++ HVA+DNT+V+ +++LC ++ AS+ P W
Sbjct: 161 TWSALAVHVAMDNTVVMEDIRRLCRSSLPCAGASAFPADSEGHCNGFPARAEVTNRPSPW 220
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 221 RPLVLLIPLRLGLTDINEAYVETLKGC---------------------------FMMPQS 253
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 254 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EFTDSCSIPDESFHCQHPPSRM 309
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 310 SIGELDPSIAV 320
>gi|281340990|gb|EFB16574.1| hypothetical protein PANDA_012287 [Ailuropoda melanoleuca]
Length = 369
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 189/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F PIG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 37 DEILSDVASRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 96
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 97 QRKRQPDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 156
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCT--------------TNKRASSNPQ----------W 167
WS++ H+A+DNT+V+ +++LC+ +++ + P W
Sbjct: 157 TWSALAVHIAMDNTVVMEDIRRLCSGSLPCAGAASLSADSSRHCNGFPAGAEVTDRPAPW 216
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K+C F PQS
Sbjct: 217 RPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FMMPQS 249
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGY G ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 250 LGVIGGKPNSAHYFIGYAGEELIYLDPHTTQPA----VELTDSCFIADESFHCRHPPSRM 305
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 306 SIGELDPSIAV 316
>gi|119591686|gb|EAW71280.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_c
[Homo sapiens]
Length = 354
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 188/312 (60%), Gaps = 57/312 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAVV 298
I +DPSIAVV
Sbjct: 309 SIAELDPSIAVV 320
>gi|20071131|gb|AAH27184.1| Autophagy-related 4B (yeast) [Mus musculus]
gi|26353914|dbj|BAC40587.1| unnamed protein product [Mus musculus]
gi|74188242|dbj|BAE25791.1| unnamed protein product [Mus musculus]
Length = 393
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------KRASSNPQ-W 167
WSS+ H+A+DNT+V+ ++++LC N ++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPSRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 GIGELDPSIAV 319
>gi|27883848|ref|NP_777363.1| cysteine protease ATG4B [Mus musculus]
gi|26324650|dbj|BAC26079.1| unnamed protein product [Mus musculus]
gi|26327423|dbj|BAC27455.1| unnamed protein product [Mus musculus]
gi|26344632|dbj|BAC35965.1| unnamed protein product [Mus musculus]
gi|27763983|emb|CAD43220.1| autophagin-1 [Mus musculus]
Length = 393
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------KRASSNPQ-W 167
WSS+ H+A+DNT+V+ ++++LC N ++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRANLPCVGAAALPTDSERHCNGFPAGAEVTNRPSAW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPSRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 GIGELDPSIAV 319
>gi|61211813|sp|Q8BGE6.2|ATG4B_MOUSE RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
cysteine endopeptidase; AltName: Full=Autophagin-1;
AltName: Full=Autophagy-related cysteine endopeptidase
1; AltName: Full=Autophagy-related protein 4 homolog B
Length = 393
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------KRASSNPQ-W 167
WSS+ H+A+DNT+V+ ++++LC N ++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPSRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 GIGELDPSIAV 319
>gi|410036440|ref|XP_003309622.2| PREDICTED: cysteine protease ATG4B isoform 5 [Pan troglodytes]
Length = 509
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 142/316 (44%), Positives = 187/316 (59%), Gaps = 67/316 (21%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 156 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 215
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 216 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 275
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 276 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 335
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 336 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 368
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-----GCVYDKEQDSEKKLDSTYHCPQ 282
LGVIGGKPN A YFIGYVG ++I+LDPHT Q GC D ++HC
Sbjct: 369 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP---------DESFHCQH 419
Query: 283 -ASRLHILHMDPSIAV 297
R+ I +DPSIAV
Sbjct: 420 PPCRMSIAELDPSIAV 435
>gi|119591684|gb|EAW71278.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_a
[Homo sapiens]
Length = 415
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|14042685|dbj|BAB55353.1| unnamed protein product [Homo sapiens]
Length = 380
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K CY + PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHCYMM---------------------------PQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|410036442|ref|XP_003950065.1| PREDICTED: cysteine protease ATG4B [Pan troglodytes]
Length = 521
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 142/316 (44%), Positives = 187/316 (59%), Gaps = 67/316 (21%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 156 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 215
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 216 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 275
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 276 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 335
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 336 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 368
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-----GCVYDKEQDSEKKLDSTYHCPQ 282
LGVIGGKPN A YFIGYVG ++I+LDPHT Q GC D ++HC
Sbjct: 369 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP---------DESFHCQH 419
Query: 283 -ASRLHILHMDPSIAV 297
R+ I +DPSIAV
Sbjct: 420 PPCRMSIAELDPSIAV 435
>gi|148707985|gb|EDL39932.1| autophagy-related 4B (yeast), isoform CRA_a [Mus musculus]
Length = 390
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 37 DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 96
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 97 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 156
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------KRASSNPQ-W 167
WSS+ H+A+DNT+V+ ++++LC N ++ P W
Sbjct: 157 TWSSLAVHIAMDNTVVMEEIRRLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAW 216
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 217 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 249
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 250 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPSRM 305
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 306 GIGELDPSIAV 316
>gi|194381088|dbj|BAG64112.1| unnamed protein product [Homo sapiens]
Length = 510
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 142/316 (44%), Positives = 187/316 (59%), Gaps = 67/316 (21%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 157 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 216
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 217 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 276
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 277 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 336
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 337 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 369
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-----GCVYDKEQDSEKKLDSTYHCPQ 282
LGVIGGKPN A YFIGYVG ++I+LDPHT Q GC D ++HC
Sbjct: 370 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP---------DESFHCQH 420
Query: 283 -ASRLHILHMDPSIAV 297
R+ I +DPSIAV
Sbjct: 421 PPCRMSIAELDPSIAV 436
>gi|332815902|ref|XP_001162556.2| PREDICTED: cysteine protease ATG4B isoform 1 [Pan troglodytes]
Length = 496
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 156 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 215
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 216 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 275
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 276 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 335
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 336 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 368
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 369 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 424
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 425 SIAELDPSIAV 435
>gi|343961553|dbj|BAK62366.1| cysteine protease ATG4B [Pan troglodytes]
Length = 393
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|410206608|gb|JAA00523.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410247746|gb|JAA11840.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410295834|gb|JAA26517.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
gi|410352839|gb|JAA43023.1| ATG4 autophagy related 4 homolog B [Pan troglodytes]
Length = 393
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|432107261|gb|ELK32675.1| Cysteine protease ATG4B [Myotis davidii]
Length = 394
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 142/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQALL HLGRDW+W
Sbjct: 41 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHLGRDWRWT 100
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 101 QRKRQPDSYFHVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAIFD 160
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WS++ H+A+DNT+V+ +++LC ++ A++ P W
Sbjct: 161 TWSALAVHIAMDNTVVMEDIRRLCRSSLPCAEATAFPADSEGHCNGLPAGAEVTNRPSLW 220
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 221 RPLVLLIPLRLGLTDINEAYVETLKGC---------------------------FMMPQS 253
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 254 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EFTDSFLIPDESFHCQHPPSRM 309
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 310 SIGELDPSIAV 320
>gi|344299096|ref|XP_003421224.1| PREDICTED: cysteine protease ATG4B [Loxodonta africana]
Length = 420
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 186/311 (59%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQALL HLGRDW+W
Sbjct: 67 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHLGRDWRWA 126
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 127 QRRRQPDSYFSVLHAFIDRKDSHYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 186
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------KRASSNPQ-W 167
WSS+ H+A+DNT+V+ ++++LC ++ A+ P W
Sbjct: 187 TWSSLAVHIAMDNTVVMEEIRRLCKSSTPCAGAAACPADPSQHCNGLPAGAEAAGRPSTW 246
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 247 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FRMPQS 279
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D T+HC R+
Sbjct: 280 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VELAGGFSIPDETFHCQHPPCRM 335
Query: 287 HILHMDPSIAV 297
+I +DPSIAV
Sbjct: 336 NIAELDPSIAV 346
>gi|47132611|ref|NP_037457.3| cysteine protease ATG4B isoform a [Homo sapiens]
gi|296434400|sp|Q9Y4P1.2|ATG4B_HUMAN RecName: Full=Cysteine protease ATG4B; AltName: Full=AUT-like 1
cysteine endopeptidase; AltName: Full=Autophagin-1;
AltName: Full=Autophagy-related cysteine endopeptidase
1; AltName: Full=Autophagy-related protein 4 homolog B;
Short=hAPG4B
gi|62822370|gb|AAY14919.1| unknown [Homo sapiens]
Length = 393
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|397483831|ref|XP_003813094.1| PREDICTED: cysteine protease ATG4B isoform 1 [Pan paniscus]
Length = 481
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 142/316 (44%), Positives = 187/316 (59%), Gaps = 67/316 (21%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 128 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 187
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 188 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 247
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 248 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 307
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 308 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 340
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-----GCVYDKEQDSEKKLDSTYHCPQ 282
LGVIGGKPN A YFIGYVG ++I+LDPHT Q GC D ++HC
Sbjct: 341 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP---------DESFHCQH 391
Query: 283 -ASRLHILHMDPSIAV 297
R+ I +DPSIAV
Sbjct: 392 PPCRMSIAELDPSIAV 407
>gi|5262636|emb|CAB45756.1| hypothetical protein [Homo sapiens]
gi|12653857|gb|AAH00719.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Homo sapiens]
gi|27763981|emb|CAD43219.1| autophagin-1 [Homo sapiens]
gi|117646318|emb|CAL38626.1| hypothetical protein [synthetic construct]
gi|119591687|gb|EAW71281.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_d
[Homo sapiens]
gi|123981932|gb|ABM82795.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [synthetic
construct]
gi|168273130|dbj|BAG10404.1| ATG4 autophagy related 4 homolog B [synthetic construct]
Length = 393
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|71891691|dbj|BAA76787.2| KIAA0943 protein [Homo sapiens]
Length = 396
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 43 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 102
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 103 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 162
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 163 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 222
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 223 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 255
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 256 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 311
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 312 SIAELDPSIAV 322
>gi|397483835|ref|XP_003813096.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pan paniscus]
Length = 405
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|88192732|pdb|2D1I|A Chain A, Structure Of Human Atg4b
gi|88192733|pdb|2D1I|B Chain B, Structure Of Human Atg4b
Length = 398
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 45 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 104
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 105 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 164
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 165 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 224
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 225 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 257
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 258 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 313
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 314 SIAELDPSIAV 324
>gi|78101773|pdb|2CY7|A Chain A, The Crystal Structure Of Human Atg4b
Length = 396
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 43 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 102
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 103 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 162
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 163 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 222
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 223 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 255
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 256 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 311
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 312 SIAELDPSIAV 322
>gi|397483833|ref|XP_003813095.1| PREDICTED: cysteine protease ATG4B isoform 2 [Pan paniscus]
Length = 468
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 128 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 187
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 188 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 247
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 248 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPW 307
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 308 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 340
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 341 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 396
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 397 SIAELDPSIAV 407
>gi|149711769|ref|XP_001497815.1| PREDICTED: cysteine protease ATG4B [Equus caballus]
Length = 393
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 189/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WS++ H+A+DNT+V+ +++LC ++ A++ P W
Sbjct: 160 TWSALAVHIAMDNTVVMEDIRRLCRSSLPCAGAAAFPADSDRHCNGFPAGAEVTNRPAPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K+C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EFTDSCFIPDESFHCQHPPSRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIGELDPSIAV 319
>gi|34531319|dbj|BAC86110.1| unnamed protein product [Homo sapiens]
Length = 468
Score = 273 bits (698), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 128 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 187
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 188 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 247
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 248 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 307
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 308 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 340
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 341 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 396
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 397 SIAELDPSIAV 407
>gi|30410798|ref|NP_847896.1| cysteine protease ATG4B isoform b [Homo sapiens]
Length = 380
Score = 273 bits (698), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|402889930|ref|XP_003908250.1| PREDICTED: cysteine protease ATG4B [Papio anubis]
Length = 508
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 155 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 214
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 215 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 274
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 275 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 334
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 335 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 367
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + S D ++HC R+
Sbjct: 368 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTGSCFIPDESFHCQHPPCRM 423
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 424 SIAELDPSIAV 434
>gi|355565356|gb|EHH21845.1| hypothetical protein EGK_04999, partial [Macaca mulatta]
Length = 393
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + S D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTGSCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|90077212|dbj|BAE88286.1| unnamed protein product [Macaca fascicularis]
Length = 393
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + S D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTGSCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|380808290|gb|AFE76020.1| cysteine protease ATG4B isoform a [Macaca mulatta]
gi|383416899|gb|AFH31663.1| cysteine protease ATG4B isoform a [Macaca mulatta]
gi|384941198|gb|AFI34204.1| cysteine protease ATG4B isoform a [Macaca mulatta]
Length = 393
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + S D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTGSCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|73994337|ref|XP_851977.1| PREDICTED: cysteine protease ATG4B [Canis lupus familiaris]
Length = 394
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 41 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 100
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 101 QRKRQPDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 160
Query: 132 DWSSIVFHVALDNTLVVNQVKKLC--------------TTNKRASSNPQ----------W 167
WS++ H+A+DNT+V+ +++LC +++ + P W
Sbjct: 161 TWSALAVHIAMDNTVVMEDIRRLCRGSLPCAGAAALPADSSRHCNGFPAGAEVTNRLAPW 220
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K+C F PQS
Sbjct: 221 RPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FMMPQS 253
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 254 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EFTDSCFIPDESFHCQHPPSRM 309
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 310 SIGELDPSIAV 320
>gi|355669955|gb|AER94692.1| ATG4 autophagy related 4-like protein B [Mustela putorius furo]
Length = 390
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 37 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 96
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 97 QRKRQSDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 156
Query: 132 DWSSIVFHVALDNTLVVNQVKKLC--------------TTNKRASSNPQ----------W 167
WS++ H+A+DNT+V+ +++LC +++ + P W
Sbjct: 157 TWSALAVHIAMDNTVVMEDIRRLCRGSLPCAGATALPTDSSRHCNGFPAGAEVTNRPAPW 216
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K+C F PQS
Sbjct: 217 RPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FMMPQS 249
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 250 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCRHPPSRM 305
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 306 GISELDPSIAV 316
>gi|395528686|ref|XP_003766458.1| PREDICTED: cysteine protease ATG4B [Sarcophilus harrisii]
Length = 393
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 143/313 (45%), Positives = 187/313 (59%), Gaps = 61/313 (19%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+I D+TSRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 EEILSDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWK 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ + Y +L F D++ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCT------------TNKRASSN------------PQW 167
WSS+ H+A+DNT+V+ ++++LC T+ SN W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCKAGFPCADGAAFPTDSELLSNGYPPAAEVTDRASPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYTETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL--DSTYHCPQ-AS 284
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + +E + D T+HC
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQ------PAVESTEGGVFPDETFHCQHPPC 306
Query: 285 RLHILHMDPSIAV 297
R++I +DPSIAV
Sbjct: 307 RMNIGELDPSIAV 319
>gi|126338580|ref|XP_001366892.1| PREDICTED: cysteine protease ATG4B-like [Monodelphis domestica]
Length = 396
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 145/330 (43%), Positives = 194/330 (58%), Gaps = 73/330 (22%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+TSRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 43 DEILSDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWK 102
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ + Y +L F D++ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 103 QGRRQTDNYFNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 162
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNP---------------------QW 167
WSS+ H+A+DNT+V+ +++LC N A++ P QW
Sbjct: 163 TWSSLAVHIAMDNTVVMEDIRRLCKANFSHTDAAALPPDSDLLSNGYPPGAEVTDRLSQW 222
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y +K C F PQS
Sbjct: 223 RPLVLLIPLRLGLTDINEAYTETLKHC---------------------------FMMPQS 255
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ------NIGCVYDKEQDSEKKLDSTYHCP 281
LGVIGGKPN A YFIGYVG ++I+LDPHT Q N G + D+ ++HC
Sbjct: 256 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQAAVELSNGGVIPDE----------SFHCQ 305
Query: 282 Q-ASRLHILHMDPSIAV----VSQRSYSDY 306
R++I +DPSIAV S+ ++D+
Sbjct: 306 HPPCRMNIGELDPSIAVGFFCKSEEDFNDW 335
>gi|354474222|ref|XP_003499330.1| PREDICTED: cysteine protease ATG4B-like [Cricetulus griseus]
Length = 479
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 126 DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 185
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 186 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 245
Query: 132 DWSSIVFHVALDNTLVVNQVKKLC-------------TTNKR-----------ASSNPQW 167
WSS+ H+A+DNT+V+ ++++LC T ++R A+ W
Sbjct: 246 TWSSLAVHIAMDNTVVMEEIRRLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAW 305
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 306 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 338
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC R+
Sbjct: 339 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VELTDSCFIPDESFHCQHPPCRM 394
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 395 GIGELDPSIAV 405
>gi|298231123|ref|NP_001177212.1| cysteine protease ATG4B [Sus scrofa]
gi|296874484|gb|ADH81747.1| autophagy related 4-like protein B [Sus scrofa]
Length = 393
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 142/311 (45%), Positives = 185/311 (59%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQALL HLGR W+W
Sbjct: 40 DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALLCRHLGRGWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QWERQPDSYFSVLHAFMDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAAFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---------------------KRASSNPQ---W 167
WS++ HVA+DNT+V+ ++++LC ++ A P+ W
Sbjct: 160 TWSALAVHVAMDNTVVMEEIRRLCRSSLPRAGAAAFPADSDRHCNGFPAEAEVGPRPVPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINAAYTETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q V DS D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVQV----TDSCLIPDESFHCQHPPHRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|291415044|ref|XP_002723769.1| PREDICTED: APG4 autophagy 4 homolog B [Oryctolagus cuniculus]
Length = 473
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 140/309 (45%), Positives = 185/309 (59%), Gaps = 56/309 (18%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
+I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 122 EILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQ 181
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ ++YL +L F DR+ + YSIHQIA G EGK+VG+W+GPNTVAQVL+KLA +D
Sbjct: 182 QKRQPDSYLSVLHAFMDRKDSYYSIHQIAQMGVGEGKSVGQWYGPNTVAQVLKKLAVFDT 241
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTT-----------------------NKRASSNPQWQP 169
WSS+ H+A+DNT+V+ ++++LC + + ++ W+P
Sbjct: 242 WSSLAVHIAMDNTVVMEEIRRLCRSSHPCAGAATPPAGADWHCNGFPASTEVTNRSPWRP 301
Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
LVL+IPLRLG+ DIN Y+ +K C F PQSLG
Sbjct: 302 LVLLIPLRLGLTDINEAYVETLKLC---------------------------FRMPQSLG 334
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHI 288
VIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+ I
Sbjct: 335 VIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VELTDLCFIPDESFHCQHPPCRMSI 390
Query: 289 LHMDPSIAV 297
+DPSIAV
Sbjct: 391 GELDPSIAV 399
>gi|344239232|gb|EGV95335.1| Cysteine protease ATG4B [Cricetulus griseus]
Length = 394
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 188/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 41 DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 100
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 101 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 160
Query: 132 DWSSIVFHVALDNTLVVNQVKKLC-------------TTNKR-----------ASSNPQW 167
WSS+ H+A+DNT+V+ ++++LC T ++R A+ W
Sbjct: 161 TWSSLAVHIAMDNTVVMEEIRRLCRASLPCAGAAAFPTDSERHCNGFPAGAEVANRPLAW 220
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 221 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 253
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC R+
Sbjct: 254 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPCRM 309
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 310 GIGELDPSIAV 320
>gi|45861658|gb|AAS78582.1| Aut2B1 [Bos taurus]
Length = 342
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 186/312 (59%), Gaps = 57/312 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L+ F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYCSVLQAFLDRKDSCYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WS++ HVA+DNT+V+ +++LC ++ A + P W
Sbjct: 160 TWSALAVHVAMDNTVVMADIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ D+N Y +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLADVNAAYAGTLKHC---------------------------FRMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q D+ D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVP----DESFHCQHPPGRM 308
Query: 287 HILHMDPSIAVV 298
I +DPSIAVV
Sbjct: 309 SIAELDPSIAVV 320
>gi|410969807|ref|XP_003991383.1| PREDICTED: cysteine protease ATG4B [Felis catus]
Length = 445
Score = 271 bits (692), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 186/311 (59%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 92 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 151
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 152 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 211
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WS++ HVA+DNT+V+ +++LC A++ P W
Sbjct: 212 TWSALAVHVAMDNTVVMEDIRRLCRAGLPCAGAAALPADPGRHCNGFPAGAEVSNRLAPW 271
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 272 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 304
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 305 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EFADSCFIPDESFHCQHPPSRM 360
Query: 287 HILHMDPSIAV 297
+ +DPSIAV
Sbjct: 361 GVRELDPSIAV 371
>gi|440901286|gb|ELR52261.1| Cysteine protease ATG4B, partial [Bos grunniens mutus]
Length = 393
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 185/311 (59%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L+ F DR+ + YSIHQIA G EGK+VG+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYCSVLQAFLDRKDSCYSIHQIAQMGVGEGKSVGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WS++ HVA+DNT+V+ +++LC ++ A + P W
Sbjct: 160 TWSALAVHVAMDNTVVMADIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ D+N Y +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLADVNAAYAGTLKHC---------------------------FRMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q D+ D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADR----CPVPDESFHCQHPPGRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|224059752|ref|XP_002193231.1| PREDICTED: cysteine protease ATG4B [Taeniopygia guttata]
Length = 393
Score = 270 bits (690), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 141/312 (45%), Positives = 187/312 (59%), Gaps = 58/312 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+I D+TSRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 39 EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWI 98
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ + Y +L F D++ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99 KGKRQMDNYFNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFD 158
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------------SSNPQ---------- 166
WSS+ H+A+DNT+V+ ++++LC ++ + P+
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCQSHVPCAGAAACPALESDVLYNGCPEDVGLRERLAL 218
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W+PLVL+IPLRLG+ +IN YI +K C F PQ
Sbjct: 219 WKPLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQ 251
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
SLGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPGDSGCLPDESFHCQHPPCR 307
Query: 286 LHILHMDPSIAV 297
+ I +DPSIAV
Sbjct: 308 MSIAELDPSIAV 319
>gi|47564102|ref|NP_001001170.1| cysteine protease ATG4B [Bos taurus]
gi|61211780|sp|Q6PZ03.1|ATG4B_BOVIN RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related cysteine endopeptidase 2B;
Short=Autophagin-2B; AltName: Full=Autophagy-related
protein 4 homolog B; AltName: Full=bAut2B
gi|45861660|gb|AAS78583.1| Aut2b2 [Bos taurus]
Length = 393
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 185/311 (59%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L+ F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYCSVLQAFLDRKDSCYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WS++ HVA+DNT+V+ +++LC ++ A + P W
Sbjct: 160 TWSALAVHVAMDNTVVMADIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ D+N Y +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLADVNAAYAGTLKHC---------------------------FRMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q D+ D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADR----CPVPDESFHCQHPPGRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|296488734|tpg|DAA30847.1| TPA: cysteine protease ATG4B [Bos taurus]
Length = 390
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 185/311 (59%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L+ F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYCSVLQAFLDRKDSCYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WS++ HVA+DNT+V+ +++LC ++ A + P W
Sbjct: 160 TWSALAVHVAMDNTVVMADIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ D+N Y +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLADVNAAYAGTLKHC---------------------------FRMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q D+ D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADR----CPVPDESFHCQHPPGRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>gi|348577273|ref|XP_003474409.1| PREDICTED: cysteine protease ATG4B [Cavia porcellus]
Length = 412
Score = 270 bits (689), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 186/320 (58%), Gaps = 73/320 (22%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
+ I D+ SRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 57 DDILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWA 116
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 117 QRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 176
Query: 132 DWSSIVFHVALDNTLVVNQVKKLC----------------------------TTNKRASS 163
WSS+ H+A+DNT+V+ ++++LC TN+++ S
Sbjct: 177 TWSSLAVHIAMDNTVVMEEIRRLCRTGLPCAGAAALPTDADRHCNGFPTQTEVTNRQSPS 236
Query: 164 NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
W+PLVL+IPLRLG+ DIN Y+ +K C F
Sbjct: 237 --LWRPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FM 267
Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-----GCVYDKEQDSEKKLDSTY 278
PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q GC D T+
Sbjct: 268 MPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVELTDGCFIP---------DETF 318
Query: 279 HCPQ-ASRLHILHMDPSIAV 297
HC R+ I +DPSIAV
Sbjct: 319 HCQHPPCRMGIGELDPSIAV 338
>gi|149244060|pdb|2Z0D|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-120) Complex
gi|149244062|pdb|2Z0E|A Chain A, The Crystal Structure Of Human Atg4b- Lc3(1-124) Complex
Length = 357
Score = 270 bits (689), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 186/311 (59%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 43 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 102
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 103 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 162
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 163 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 222
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 223 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 255
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDP T Q + D D ++HC R+
Sbjct: 256 LGVIGGKPNSAHYFIGYVGEELIYLDPATTQPA----VEPTDGCFIPDESFHCQHPPCRM 311
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 312 SIAELDPSIAV 322
>gi|327267215|ref|XP_003218398.1| PREDICTED: cysteine protease ATG4B-like [Anolis carolinensis]
Length = 393
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 186/312 (59%), Gaps = 58/312 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+I D+TSRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W+
Sbjct: 39 EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHLGRDWRWS 98
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
K+ ++Y +L F D++ + YSIHQIA G EGK++G+W+GPNTVAQVLRKLA +D
Sbjct: 99 KGKKQTDSYYNVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLRKLASFD 158
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQWQPLVL---------------- 172
WSS+ H+A+DNT+V+ ++++LC + AS+ P +P L
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCKPSCPCPGASAFPAAEPDFLSNGYPEGAECTDRLLL 218
Query: 173 ------VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
+IPLRLG+ DIN YI +K C F PQ
Sbjct: 219 WKPLVLLIPLRLGLTDINEAYIETLKHC---------------------------FMMPQ 251
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
SLGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPMDSCYIPDESFHCQHPPCR 307
Query: 286 LHILHMDPSIAV 297
+ I +DPSIAV
Sbjct: 308 MSIAELDPSIAV 319
>gi|224510547|pdb|2ZZP|A Chain A, The Crystal Structure Of Human Atg4b(C74s)- Lc3(1-124)
Complex
Length = 357
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 186/311 (59%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWG MLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 43 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGSMLRCGQMIFAQALVCRHLGRDWRWT 102
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 103 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 162
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 163 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 222
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 223 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 255
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 256 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRM 311
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 312 SIAELDPSIAV 322
>gi|449498615|ref|XP_002197397.2| PREDICTED: cysteine protease ATG4A [Taeniopygia guttata]
Length = 412
Score = 268 bits (685), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 145/311 (46%), Positives = 200/311 (64%), Gaps = 55/311 (17%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D ++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDWQ
Sbjct: 41 DKSKLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWQ 100
Query: 71 WNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W + K+ E Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA
Sbjct: 101 WEKHKKQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLAL 160
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTT---------------------NKR-ASSNPQW 167
+D+W+S+ +V++DNT+V+ +KK+C + NK A P W
Sbjct: 161 FDEWNSLAVYVSMDNTVVIEDIKKMCWSPAQSSSVAHSSAHVHRSALGQNKNTAGLCPGW 220
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PL+L+IPLRLGI INPVYI+ K+C F PQS
Sbjct: 221 KPLLLIIPLRLGINHINPVYIDAFKEC---------------------------FKMPQS 253
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RL 286
LG +GGKPN+A YFIG++GN++I+LDPHT Q+ D E++ D ++HC QA R+
Sbjct: 254 LGALGGKPNNAYYFIGFLGNELIYLDPHTTQSF---VDSEENGTVD-DKSFHCQQAPHRM 309
Query: 287 HILHMDPSIAV 297
I+++DPS+A+
Sbjct: 310 KIMNLDPSVAL 320
>gi|18181958|dbj|BAB83890.1| Apg4B [Homo sapiens]
Length = 392
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 187/311 (60%), Gaps = 58/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G E K++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGE-KSIGQWYGPNTVAQVLKKLAVFD 158
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 218
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 219 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 251
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 252 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 307
Query: 287 HILHMDPSIAV 297
I ++DPSIAV
Sbjct: 308 SIANLDPSIAV 318
>gi|334350077|ref|XP_001376474.2| PREDICTED: cysteine protease ATG4A-like [Monodelphis domestica]
Length = 417
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 141/306 (46%), Positives = 194/306 (63%), Gaps = 53/306 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W +
Sbjct: 66 KLLSDISARLWFTYRRKFSPIGGTGPSSDSGWGCMLRCGQMMLAQALICKHLGRDWCWEM 125
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ E Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 126 QQEQPEEYHRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 185
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------------------TTNKRASSNPQWQPLVL 172
W+S+ +V++DNT+V+ +KKLC + +P W+PL+L
Sbjct: 186 WNSLAVYVSMDNTVVIEDIKKLCHMCPSHLTHDSSPSPGNGLDQSTHLPEPSPGWKPLLL 245
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
+IPLRLGI INPVYI+ K+C F PQSLG +G
Sbjct: 246 IIPLRLGINQINPVYIDAFKEC---------------------------FKMPQSLGALG 278
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RLHILHM 291
GKPN A YFIG++GN++I+LDPHT Q D E+D D ++HC Q+ R+ IL++
Sbjct: 279 GKPNSAYYFIGFLGNELIYLDPHTTQTF---VDSEEDGTVD-DQSFHCQQSPHRMQILNL 334
Query: 292 DPSIAV 297
DPS+A+
Sbjct: 335 DPSVAL 340
>gi|449273759|gb|EMC83168.1| Cysteine protease ATG4A, partial [Columba livia]
Length = 395
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 142/311 (45%), Positives = 197/311 (63%), Gaps = 55/311 (17%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D ++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDWQ
Sbjct: 39 DKSKLLLDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWQ 98
Query: 71 WNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
W + ++ E Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA
Sbjct: 99 WEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLAL 158
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLC----------------------TTNKRASSNPQW 167
+D+W+S+ +V++DNT+V+ +KK+C T A W
Sbjct: 159 FDEWNSLAVYVSMDNTVVIEDIKKMCWSPPQGSGAAHSSAHLHRSALGRTKNAAGFCTGW 218
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PL+L+IPLRLGI INPVYI+ K+C F PQS
Sbjct: 219 KPLLLIIPLRLGINHINPVYIDAFKEC---------------------------FKMPQS 251
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRL 286
LG +GGKPN+A YFIG++GN++I+LDPHT Q+ D E++ D ++HC QA R+
Sbjct: 252 LGALGGKPNNAYYFIGFLGNELIYLDPHTTQSF---VDSEENGTVD-DESFHCQQAPHRM 307
Query: 287 HILHMDPSIAV 297
I+++DPS+A+
Sbjct: 308 KIMNLDPSVAL 318
>gi|431822415|ref|NP_001258915.1| cysteine protease ATG4A isoform 1 [Gallus gallus]
Length = 397
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 201/313 (64%), Gaps = 55/313 (17%)
Query: 9 HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
++D ++ D+++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRD
Sbjct: 39 NEDKSKLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRD 98
Query: 69 WQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
WQW + K+ E Y +IL F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KL
Sbjct: 99 WQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 158
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTT---------------------NKRASS-NP 165
A +D+W+S+ +V++DNT+V+ +KK+C + N+ A+
Sbjct: 159 ALFDEWNSLAVYVSMDNTVVIEDIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCT 218
Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
W+PL+L+IPLRLGI INPVYI+ K+C F P
Sbjct: 219 GWKPLLLIIPLRLGINHINPVYIDAFKEC---------------------------FKMP 251
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-S 284
QSLG +GGKPN+A YFIG++GN++I+LDPHT Q+ D E++ D ++HC QA
Sbjct: 252 QSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSF---VDSEENGTVD-DQSFHCQQAPH 307
Query: 285 RLHILHMDPSIAV 297
R+ I+++DPS+A+
Sbjct: 308 RMKIMNLDPSVAL 320
>gi|431822417|ref|NP_001258916.1| cysteine protease ATG4A isoform 2 [Gallus gallus]
gi|61211756|sp|Q5ZIW7.1|ATG4A_CHICK RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|53134379|emb|CAG32326.1| hypothetical protein RCJMB04_23b20 [Gallus gallus]
Length = 380
Score = 265 bits (676), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 201/313 (64%), Gaps = 55/313 (17%)
Query: 9 HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
++D ++ D+++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRD
Sbjct: 22 NEDKSKLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRD 81
Query: 69 WQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
WQW + K+ E Y +IL F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KL
Sbjct: 82 WQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 141
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTT---------------------NKRASS-NP 165
A +D+W+S+ +V++DNT+V+ +KK+C + N+ A+
Sbjct: 142 ALFDEWNSLAVYVSMDNTVVIEDIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCT 201
Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
W+PL+L+IPLRLGI INPVYI+ K+C F P
Sbjct: 202 GWKPLLLIIPLRLGINHINPVYIDAFKEC---------------------------FKMP 234
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-S 284
QSLG +GGKPN+A YFIG++GN++I+LDPHT Q+ D E++ D ++HC QA
Sbjct: 235 QSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSF---VDSEENGTVD-DQSFHCQQAPH 290
Query: 285 RLHILHMDPSIAV 297
R+ I+++DPS+A+
Sbjct: 291 RMKIMNLDPSVAL 303
>gi|391335597|ref|XP_003742176.1| PREDICTED: cysteine protease ATG4B-like [Metaseiulus occidentalis]
Length = 393
Score = 265 bits (676), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 140/290 (48%), Positives = 187/290 (64%), Gaps = 39/290 (13%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D E +R+ +S LWFTYRK F IG G T+D GWGCMLR GQM++ QAL+ HLGR W
Sbjct: 73 DFEYVRKSFSSMLWFTYRKNFAAIGGDGPTSDTGWGCMLRAGQMMLGQALIRKHLGRSWM 132
Query: 71 WNVNSK---EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
W + + E YL+IL+MF+D+++A +SIHQI+L G SEGKAVGEWFGPNTVAQ L+KL
Sbjct: 133 WTSDDRLPDRENYLRILRMFQDKKSATFSIHQISLMGLSEGKAVGEWFGPNTVAQALKKL 192
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
+YD WS + HVA+DN ++++ +K LC A + +W+PL+LV+PLRLG+ +IN +Y
Sbjct: 193 VQYDHWSEMKLHVAMDNIIILSDIKSLCC----AKESNKWRPLLLVVPLRLGLSEINDIY 248
Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
N +L+S F SLG+IGG+P+HALYFIG
Sbjct: 249 TNA-----------------VLNS----------FKMKHSLGIIGGRPSHALYFIGIQRE 281
Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+++FLDPHT N + D E DSTYHC +A R+ I +MDPSIA+
Sbjct: 282 ELVFLDPHTTHNY-----VDLDEEPYNDSTYHCQRAQRMKISNMDPSIAM 326
>gi|339249735|ref|XP_003373855.1| cysteine protease ATG4B [Trichinella spiralis]
gi|316969943|gb|EFV53966.1| cysteine protease ATG4B [Trichinella spiralis]
Length = 410
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 192/321 (59%), Gaps = 62/321 (19%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
QD + I+++I SR+WFTYRK F PIG +G +D GWGCMLRCGQM++AQAL+ HLGR+W
Sbjct: 45 QDFDDIKKEIRSRMWFTYRKSFSPIGGTGPISDSGWGCMLRCGQMLLAQALICRHLGREW 104
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
QW+ + ++EAY++IL+MF+D++ YSIH IA G SEGK +G+WFGP+T+A V++KLA
Sbjct: 105 QWSPSCRDEAYVRILRMFQDKKNELYSIHMIAKMGESEGKEIGKWFGPSTIAHVIKKLAI 164
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCT-----------------------------TNKR 160
YDDWSS+ HVA+DN +V VKKLC+ NK+
Sbjct: 165 YDDWSSLAVHVAMDNVIVQEDVKKLCSREVFDALRKRLLQEEPSEIVADWFEDARKDNKK 224
Query: 161 ---ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQT 217
A+ + W+PL+L++P+RLG+ ++NP YI +K+ +A YN
Sbjct: 225 VDCANLSSPWKPLLLILPMRLGLSELNPCYIPALKEFFA--------------CKYN--- 267
Query: 218 PRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDST 277
+G+IGGKPNHALYFIG + +++LDPH Q D + + DS+
Sbjct: 268 ----------IGMIGGKPNHALYFIGAYKDRLVYLDPHWCQTF---VDLDVSMDLFDDSS 314
Query: 278 YHCPQASRLHILHMDPSIAVV 298
YH + +DPS+A+
Sbjct: 315 YHSAFILDISFNEIDPSLAIA 335
>gi|324506823|gb|ADY42901.1| Cysteine protease ATG4B [Ascaris suum]
Length = 433
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 189/349 (54%), Gaps = 89/349 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D++ I+ +TSRLWFTYRK F+PIG +G T+D+GWGCMLRCGQM++AQAL+ HLG +W
Sbjct: 39 DMDSIKEYVTSRLWFTYRKNFMPIGGTGPTSDQGWGCMLRCGQMLLAQALIVRHLGTEWM 98
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
W+ ++KEE Y +IL+MF+D++ P+S+HQIA G SE K +GEWFGPNT AQVL+KL Y
Sbjct: 99 WDRDNKEEDYKRILRMFQDKKCCPFSLHQIAQMGVSERKQIGEWFGPNTAAQVLKKLVVY 158
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTN-------------------------------- 158
DDWS + HVALDN L+ + V+ + T
Sbjct: 159 DDWSRLAVHVALDNLLIASDVRTMAHTRPPSRLSSRHTTENEQSEESGNASGGNSLCSFG 218
Query: 159 ------------KRASSNP-----QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISP 201
K NP QW+PL++++PLRLG+ IN Y+ I+ + L
Sbjct: 219 SVKMCMLQSALMKECDENPVEDEEQWRPLLIIVPLRLGLTSINRCYLPAIEAFFQL---- 274
Query: 202 VYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ--- 258
PQ G+IGG+PNHALYFIG G +I+LDPH Q
Sbjct: 275 -----------------------PQCTGIIGGRPNHALYFIGIAGEQLIYLDPHVCQAAI 311
Query: 259 --NIGCVYDKEQDSEKKL--------DSTYHCPQASRLHILHMDPSIAV 297
+ C ++QD ++ DS+YHCP + DPS+A+
Sbjct: 312 DLDERCASLQQQDGFVEVVKSTDIFDDSSYHCPFLLHIAYDSADPSLAL 360
>gi|326924562|ref|XP_003208495.1| PREDICTED: cysteine protease ATG4A-like, partial [Meleagris
gallopavo]
Length = 421
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 198/313 (63%), Gaps = 55/313 (17%)
Query: 9 HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
++D ++ D+++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRD
Sbjct: 63 NEDKSKLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRD 122
Query: 69 WQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
WQW + ++ E Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KL
Sbjct: 123 WQWEKHKEQPEEYHRILRCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 182
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLC----------------------TTNKRASSNP 165
A +D+W+S+ +V++DNT+V+ +KK+C A
Sbjct: 183 ALFDEWNSLAVYVSMDNTVVIEDIKKMCWSPPQSSSTAHSSAHLHRSALGRNRNTAGLCT 242
Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
W+PL+L+IPLRLGI INPVYI+ K+C F P
Sbjct: 243 GWKPLLLIIPLRLGINHINPVYIDAFKEC---------------------------FKMP 275
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS- 284
QSLG +GGKPN+A YFIG++GN++I+LDPHT Q+ D E++ D ++HC QA
Sbjct: 276 QSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSF---VDSEENGTVD-DQSFHCQQAPH 331
Query: 285 RLHILHMDPSIAV 297
R+ I+++DPS+A+
Sbjct: 332 RMKIMNLDPSVAL 344
>gi|195995623|ref|XP_002107680.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
gi|190588456|gb|EDV28478.1| hypothetical protein TRIADDRAFT_20340 [Trichoplax adhaerens]
Length = 385
Score = 263 bits (671), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 136/305 (44%), Positives = 187/305 (61%), Gaps = 48/305 (15%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
++ +++ DI S+ WFTYRK + PIG G T+DKGWGCMLRCGQM++ QAL+ HLGRDW
Sbjct: 36 EEFDKLIADINSKFWFTYRKNYPPIGGIGPTSDKGWGCMLRCGQMILGQALVMRHLGRDW 95
Query: 70 QWNVNSKEEA-YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
+W N ++ A Y KILK+F D + + YSIHQIA G SEGK + +WFGPNT AQVL+KL
Sbjct: 96 RWFKNKEQLANYWKILKLFLDSKDSLYSIHQIAQMGVSEGKKISQWFGPNTAAQVLKKLI 155
Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLC-------------TTNKRASSNPQ---WQPLVL 172
+D+WS + +VA+DN +V++ +KK+C ++ + SSN Q W+PL+L
Sbjct: 156 MFDEWSQMGVYVAMDNIVVIDDIKKICHNHITRTSQGNAANSDAQGSSNEQSNAWKPLLL 215
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
IPLRLG+ D+NP+Y + + KC F +LG+IG
Sbjct: 216 FIPLRLGLTDLNPIYKDKLNKC---------------------------FRIKNTLGIIG 248
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
GKPN A YFIG G+ +++LDPHT Q V S+K TYH +RLH +MD
Sbjct: 249 GKPNSAHYFIGIQGDYLLYLDPHTVQETVKVKPNCPFSDK----TYHQKGTNRLHFSYMD 304
Query: 293 PSIAV 297
PS+A+
Sbjct: 305 PSVAL 309
>gi|351713264|gb|EHB16183.1| Cysteine protease ATG4B [Heterocephalus glaber]
Length = 475
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 139/320 (43%), Positives = 182/320 (56%), Gaps = 71/320 (22%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 118 DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWA 177
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 178 QRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 237
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQP---------------------- 169
WSS+ HVA+DNT+V+ ++++LC ++ S
Sbjct: 238 TWSSLAVHVAMDNTVVMEEIRRLCRSSLPCSGAAALPADADRHCNGFPAPMEVTSRPSPS 297
Query: 170 ------LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
LVL+IPLRLG+ DIN Y+ +K+C F
Sbjct: 298 PSPWRPLVLLIPLRLGLTDINEAYVETLKRC---------------------------FM 330
Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-----NIGCVYDKEQDSEKKLDSTY 278
PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q GC D T+
Sbjct: 331 MPQSLGVIGGKPNSAHYFIGYVGKELIYLDPHTTQPAVELTDGCFIP---------DETF 381
Query: 279 HCPQ-ASRLHILHMDPSIAV 297
HC R+ I +DPSIAV
Sbjct: 382 HCQHPPCRMGIGELDPSIAV 401
>gi|225709006|gb|ACO10349.1| Cysteine protease ATG4B [Caligus rogercresseyi]
Length = 381
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 136/302 (45%), Positives = 187/302 (61%), Gaps = 40/302 (13%)
Query: 4 ANKLS-HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLF 62
N+LS D+E++ ++ SR FTYRK F+ I DSG T+D GWGCMLRCGQMV+A+AL
Sbjct: 36 GNELSARDDVEELNSEVLSRFLFTYRKEFLEIEDSGYTSDSGWGCMLRCGQMVLAEALQR 95
Query: 63 LHLGRDWQWNV-----NSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS--EGKAVGEWF 115
+ LGR+W+W+ N + + YL+ILK+F+D + APYS+HQIAL G S K VG WF
Sbjct: 96 VSLGREWKWSSQETLDNDQSQKYLQILKLFQDSKAAPYSLHQIALMGESIQSKKPVGTWF 155
Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
GPNT+AQVLRKL+ + + I HVA+DNT++V+++K+ C S Q +PL+L IP
Sbjct: 156 GPNTIAQVLRKLSVSETTNPIRVHVAMDNTVIVDEIKESCGFIGDPS---QGKPLLLFIP 212
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
LRLG+ +INP+Y +K+C F FPQ LGVIGG+P
Sbjct: 213 LRLGLTEINPIYFQDLKEC---------------------------FEFPQILGVIGGRP 245
Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
NHALYFIGY+ N++I+LDPH + D TYH +A R+ +DPS+
Sbjct: 246 NHALYFIGYMDNELIYLDPHVATQTSTPQVVTLGGSE--DKTYHTDRAYRMDFKDLDPSL 303
Query: 296 AV 297
++
Sbjct: 304 SL 305
>gi|327277326|ref|XP_003223416.1| PREDICTED: cysteine protease ATG4A-like [Anolis carolinensis]
Length = 385
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/305 (46%), Positives = 195/305 (63%), Gaps = 56/305 (18%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W + K+
Sbjct: 35 DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWHWEEHKKQ 94
Query: 78 -EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSI 136
E Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+W+S+
Sbjct: 95 PEEYHRILRCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL 154
Query: 137 VFHVALDNTLVVNQVKKLCTTNKR-----ASSNP------------------QWQPLVLV 173
+V++DNT+V+ +KK+C + A +P W+PL+L+
Sbjct: 155 AVYVSMDNTVVIEDIKKMCRLPNQNCPPVAHCSPLSHQSLLSRNRSPGGFCCGWKPLLLI 214
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
IPLRLGI INPVY++ K+C F PQSLG +GG
Sbjct: 215 IPLRLGINHINPVYVDAFKEC---------------------------FKMPQSLGALGG 247
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RLHILHMD 292
KPN+A YFIG++GN++I+LDPHT Q D E++S D ++HC QA R+ I+++D
Sbjct: 248 KPNNAYYFIGFLGNELIYLDPHTTQ---LFVDSEENSTVD-DRSFHCQQAPHRMKIMNLD 303
Query: 293 PSIAV 297
PS+A+
Sbjct: 304 PSVAL 308
>gi|148228573|ref|NP_001085611.1| cysteine protease ATG4A [Xenopus laevis]
gi|61211771|sp|Q6GPU1.1|ATG4A_XENLA RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|49115669|gb|AAH73017.1| MGC82614 protein [Xenopus laevis]
Length = 397
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 190/305 (62%), Gaps = 53/305 (17%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
++ DI SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW+W +
Sbjct: 47 LQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALVCQHLGRDWRWEKH 106
Query: 75 SKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
E Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+W
Sbjct: 107 KNHPEEYQQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 166
Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ--------------------WQPLVLV 173
+S+ +V++DNT+VV +K +C ++ S Q W+PL+LV
Sbjct: 167 NSLAVYVSMDNTVVVEDIKTMCKYQPQSCSMAQAASHQSTWSRCRDTSGHCSGWRPLLLV 226
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PLRLGI INPVY++ K C F PQSLG +GG
Sbjct: 227 VPLRLGINHINPVYVDAFKAC---------------------------FKMPQSLGALGG 259
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
KPNHA YFIG+ G+++I+LDPHT Q D E+ + D TYHC + + + +L++D
Sbjct: 260 KPNHAYYFIGFSGDEIIYLDPHTTQTF---VDTEEAGTVQ-DQTYHCQKGPNSMKVLNLD 315
Query: 293 PSIAV 297
PS+A+
Sbjct: 316 PSVAL 320
>gi|225718596|gb|ACO15144.1| Cysteine protease ATG4B [Caligus clemensi]
Length = 390
Score = 253 bits (647), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 135/313 (43%), Positives = 189/313 (60%), Gaps = 45/313 (14%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
+ D+ ++ ++ SRL FTYRK F I SG T+D GWGCMLRCGQMV+ +AL + LGR
Sbjct: 41 ARNDIAELNSEVLSRLLFTYRKEFSEIDGSGYTSDSGWGCMLRCGQMVLGEALQRISLGR 100
Query: 68 DWQWNVNSKEEA-------YLKILKMFEDRRTAPYSIHQIALTGAS--EGKAVGEWFGPN 118
DW+W+ E YLKIL +F+D + APYSIHQIAL G S K VG WFGPN
Sbjct: 101 DWKWDHKVDNEVDEDLKGKYLKILNLFQDSKVAPYSIHQIALMGESIQSKKPVGTWFGPN 160
Query: 119 TVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRL 178
TVAQVL+KL+ ++ I HVA+DNT++++++K+ C S +PL+L IPLRL
Sbjct: 161 TVAQVLKKLSFFEKTVPIRLHVAMDNTVIIDEIKESCGFVGGDSE----KPLLLFIPLRL 216
Query: 179 GIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHA 238
G+ +INP+Y +K+C F FPQ LGVIGG+PNHA
Sbjct: 217 GLTEINPIYFQDLKEC---------------------------FEFPQILGVIGGRPNHA 249
Query: 239 LYFIGYVGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
LYFIGYV N++I+LDPH + Q+ D + D T+H +A R+ +DPS+++
Sbjct: 250 LYFIGYVDNELIYLDPHISTQSASSTVDTFGGPQ---DQTHHTERAYRMDFKDLDPSLSL 306
Query: 298 VSQ-RSYSDYKNV 309
R+ S+++++
Sbjct: 307 CFLCRNESEFEDM 319
>gi|213513159|ref|NP_001133247.1| cysteine protease ATG4B [Salmo salar]
gi|209147572|gb|ACI32896.1| Cysteine protease ATG4B [Salmo salar]
gi|223647372|gb|ACN10444.1| Cysteine protease ATG4B [Salmo salar]
Length = 397
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 136/305 (44%), Positives = 186/305 (60%), Gaps = 22/305 (7%)
Query: 2 RHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL 61
+ N L+ + E I +TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ +AL+
Sbjct: 31 KEFNALTEK--EDILSHVTSRLWFTYRKNFPPIGGTGPTSDTGWGCMLRCGQMILGEALV 88
Query: 62 FLHLGRDWQW-NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
HLGRDW+W S+ E Y+ IL F D++ YS+HQIA G EGK++G+W+GPNTV
Sbjct: 89 RRHLGRDWRWVRSQSQREDYISILNAFLDKKDGYYSLHQIAQMGVGEGKSIGQWYGPNTV 148
Query: 121 AQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI 180
AQVL+KLA +D WS + HVA+DNT+V+ ++K+LC + L+ G+
Sbjct: 149 AQVLKKLAVFDSWSRLTVHVAMDNTVVIEEIKRLCMPWLDYGG-------AACVDLQGGM 201
Query: 181 QDINPVYINGI----KKCYALPISPVYDMVKILSSTYN---MQTPRYEFTFPQSLGVIGG 233
+ N ++ + +++ S N ++T + F PQSLGVIGG
Sbjct: 202 PEPNGCLEGACALAEEETALWKPLLLLIPLRLGLSDINEAYIETLKQCFQLPQSLGVIGG 261
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMD 292
KPNHA YFIGYVG ++I+LDPHT Q + +DS+ D TYHC R+HI +D
Sbjct: 262 KPNHAHYFIGYVGEELIYLDPHTTQP---AVEPCEDSQVP-DDTYHCQHPPCRMHICEID 317
Query: 293 PSIAV 297
PSIAV
Sbjct: 318 PSIAV 322
>gi|312378951|gb|EFR25375.1| hypothetical protein AND_09326 [Anopheles darlingi]
Length = 350
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 169/293 (57%), Gaps = 84/293 (28%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
+SH D+E IR+D+ SRLW TYR+GFVPIG++ LTTDKGWGCMLRCGQMV+A+AL LHLG
Sbjct: 62 ISHADIEAIRQDVQSRLWCTYRRGFVPIGNTQLTTDKGWGCMLRCGQMVLAEALTELHLG 121
Query: 67 RDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASEGKAVGEWFGPNTVAQVLR 125
RDWQW+ +++ YLKI+ FED + AP+S+HQIAL G +SE K +GEWFGPNTVAQVL
Sbjct: 122 RDWQWSEETRDATYLKIVNRFEDNKQAPFSLHQIALMGDSSEEKRIGEWFGPNTVAQVLN 181
Query: 126 KLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINP 185
++ NP
Sbjct: 182 EV------------------------------------NP-------------------- 185
Query: 186 VYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYV 245
+YI G+KKC+ L P S G+IGG+PN ALYFIGYV
Sbjct: 186 IYIEGLKKCFQL---------------------------PGSCGMIGGRPNQALYFIGYV 218
Query: 246 GNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
G + ++LDPHT Q +GC+ +K++ E++ D+T+H ASR+ MDPS+AV
Sbjct: 219 GEEALYLDPHTVQRVGCIGEKQESVEQEQDATFHQRHASRIAFASMDPSLAVC 271
>gi|453230621|ref|NP_001263575.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
gi|412974713|emb|CCO25637.1| Protein ATG-4.1, isoform b [Caenorhabditis elegans]
Length = 481
Score = 249 bits (635), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 134/347 (38%), Positives = 188/347 (54%), Gaps = 79/347 (22%)
Query: 4 ANKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLF 62
++S +D +E +++ +TSR WFTYR+ F PIG +G +TD+GWGCMLRC QM++ + LL
Sbjct: 68 GKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLR 127
Query: 63 LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
H+GR ++W++ E Y KIL+MF D + A YSIHQIA G +EGK V +WFGPNT AQ
Sbjct: 128 RHIGRHFEWDIEKTSEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNTAAQ 187
Query: 123 VLRKLAKYDDWSSIVFHVALDNTLV------------VNQVKKLCTTNKRASSN------ 164
V++KL +DDWS+I HVALDN LV KL N N
Sbjct: 188 VMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENGLVDKNRLSLSP 247
Query: 165 ----PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
P+W+PL+L+IPLRLG+ INP Y++ I++
Sbjct: 248 GNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEF-------------------------- 281
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------------TNQNI 260
F PQ +G+IGG+PNHALYF+G G+ + +LDPH T ++
Sbjct: 282 -FKIPQCVGIIGGRPNHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDV 340
Query: 261 GCVYDKE--------QDSEKKL-DSTYHCPQASRLHILHMDPSIAVV 298
G + +E D K+ DSTYHC + ++DPS+A+
Sbjct: 341 GFSHLEELVPLPSQTADVYTKMDDSTYHCQMMLWIEYENVDPSLALA 387
>gi|212645205|ref|NP_493375.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
gi|193247781|emb|CAB54483.2| Protein ATG-4.1, isoform a [Caenorhabditis elegans]
Length = 454
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 133/347 (38%), Positives = 189/347 (54%), Gaps = 79/347 (22%)
Query: 4 ANKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLF 62
++S +D +E +++ +TSR WFTYR+ F PIG +G +TD+GWGCMLRC QM++ + LL
Sbjct: 41 GKEISKEDGIEAMKKYVTSRFWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLR 100
Query: 63 LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
H+GR ++W++ E Y KIL+MF D + A YSIHQIA G +EGK V +WFGPNT AQ
Sbjct: 101 RHIGRHFEWDIEKTSEIYEKILQMFFDEKDALYSIHQIAQMGVTEGKEVSKWFGPNTAAQ 160
Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT------------NKRASSN------ 164
V++KL +DDWS+I HVALDN LV + T+ N N
Sbjct: 161 VMKKLTIFDDWSNIAVHVALDNILVKEDAITMATSYPSEDAVKLIMENGLVDKNRLSLSP 220
Query: 165 ----PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
P+W+PL+L+IPLRLG+ INP Y++ I++
Sbjct: 221 GNIIPEWRPLLLMIPLRLGLTTINPCYLSAIQEF-------------------------- 254
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------------TNQNI 260
F PQ +G+IGG+PNHALYF+G G+ + +LDPH T ++
Sbjct: 255 -FKIPQCVGIIGGRPNHALYFVGMSGSKLFYLDPHYCRPKTESTAKMYAEKDSTATTDDV 313
Query: 261 GCVYDKE--------QDSEKKL-DSTYHCPQASRLHILHMDPSIAVV 298
G + +E D K+ DSTYHC + ++DPS+A+
Sbjct: 314 GFSHLEELVPLPSQTADVYTKMDDSTYHCQMMLWIEYENVDPSLALA 360
>gi|312073335|ref|XP_003139474.1| hypothetical protein LOAG_03889 [Loa loa]
gi|307765357|gb|EFO24591.1| hypothetical protein LOAG_03889 [Loa loa]
Length = 458
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 134/348 (38%), Positives = 187/348 (53%), Gaps = 85/348 (24%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S +D+E+I+ + S LWFTYRK F PIG G TTD+GWGCMLRCGQM++A+ L+ HLGR
Sbjct: 65 SRRDMERIKDIMASLLWFTYRKNFQPIGGIGPTTDQGWGCMLRCGQMLLARVLIVRHLGR 124
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
+W W+ + K Y +IL+MF+D++ + +SIHQIA G SEGK +GEWFGPNT AQVL+KL
Sbjct: 125 NWLWDRDIKLAEYKRILRMFQDKKNSLFSIHQIAHMGVSEGKNIGEWFGPNTTAQVLKKL 184
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTT------------------NKRASSNP---- 165
YD WS + HVALDN L+ + ++ + T + + NP
Sbjct: 185 VIYDQWSRLAVHVALDNVLITSDIRTMAFTRPPYRKSGSRRETGSDYNDNHDAVNPAEAE 244
Query: 166 -------------------------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
+W+PL+++IPLRLG+ IN Y I+ + L
Sbjct: 245 IFPESTRSPTRSETSSISSYGGNSEEWRPLLIIIPLRLGLSTINRCYFPAIQAFFQL--- 301
Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI 260
PQ +G+IGG+PNHALYF G V N++++LDPH Q+
Sbjct: 302 ------------------------PQCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQDF 337
Query: 261 -----GCVYDKEQD------SEKKLDSTYHCPQASRLHILHMDPSIAV 297
E+D +++ DSTYHCP I +DPS+A+
Sbjct: 338 VDLDETTATRDERDGYVEIKNDEFRDSTYHCPFILTTKIDKVDPSLAL 385
>gi|194389756|dbj|BAG60394.1| unnamed protein product [Homo sapiens]
Length = 379
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 130/300 (43%), Positives = 176/300 (58%), Gaps = 57/300 (19%)
Query: 26 TYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKIL 84
++R+ G +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L
Sbjct: 39 SHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVL 98
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDN 144
F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DN
Sbjct: 99 NAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDN 158
Query: 145 TLVVNQVKKLCTTN---KRASSNPQ---------------------WQPLVLVIPLRLGI 180
T+V+ ++++LC T+ A++ P W+PLVL+IPLRLG+
Sbjct: 159 TVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGL 218
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
DIN Y+ +K C F PQSLGVIGGKPN A Y
Sbjct: 219 TDINEAYVETLKHC---------------------------FMMPQSLGVIGGKPNSAHY 251
Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAVVS 299
FIGYVG ++I+LDPHT Q + D D ++HC R+ I +DPSIAV S
Sbjct: 252 FIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGS 307
>gi|397483837|ref|XP_003813097.1| PREDICTED: cysteine protease ATG4B isoform 4 [Pan paniscus]
Length = 379
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 129/298 (43%), Positives = 175/298 (58%), Gaps = 57/298 (19%)
Query: 26 TYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKIL 84
++R+ G +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L
Sbjct: 39 SHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVL 98
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDN 144
F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DN
Sbjct: 99 NAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDN 158
Query: 145 TLVVNQVKKLCTTN---KRASSNPQ---------------------WQPLVLVIPLRLGI 180
T+V+ ++++LC T+ A++ P W+PLVL+IPLRLG+
Sbjct: 159 TVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRLSPWRPLVLLIPLRLGL 218
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
DIN Y+ +K C F PQSLGVIGGKPN A Y
Sbjct: 219 TDINEAYVETLKHC---------------------------FMMPQSLGVIGGKPNSAHY 251
Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
FIGYVG ++I+LDPHT Q + D D ++HC R+ I +DPSIAV
Sbjct: 252 FIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 305
>gi|426339171|ref|XP_004033533.1| PREDICTED: cysteine protease ATG4B isoform 3 [Gorilla gorilla
gorilla]
Length = 379
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 129/298 (43%), Positives = 175/298 (58%), Gaps = 57/298 (19%)
Query: 26 TYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKIL 84
++R+ G +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L
Sbjct: 39 SHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVL 98
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDN 144
F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DN
Sbjct: 99 NAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDN 158
Query: 145 TLVVNQVKKLCTTN---KRASSNPQ---------------------WQPLVLVIPLRLGI 180
T+V+ ++++LC T+ A++ P W+PLVL+IPLRLG+
Sbjct: 159 TVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGL 218
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
DIN Y+ +K C F PQSLGVIGGKPN A Y
Sbjct: 219 TDINEAYVETLKHC---------------------------FMMPQSLGVIGGKPNSAHY 251
Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
FIGYVG ++I+LDPHT Q + D D ++HC R+ I +DPSIAV
Sbjct: 252 FIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 305
>gi|449666316|ref|XP_002168183.2| PREDICTED: cysteine protease ATG4B-like [Hydra magnipapillata]
Length = 436
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 134/336 (39%), Positives = 182/336 (54%), Gaps = 79/336 (23%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+D+E+ +I ++ WFTYR+ F PIG +G +D GWGCMLRCGQM++AQALL HLGRDW
Sbjct: 42 EDMEKFNAEILTKFWFTYRRNFHPIGGTGPMSDTGWGCMLRCGQMMLAQALLCRHLGRDW 101
Query: 70 QWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
W K+ E Y+ IL F D++ + YSIHQIA G EGK +G+WFGPNTVAQV++KL
Sbjct: 102 DWRSGRKDNEIYMMILHSFLDKKDSLYSIHQIAQMGVGEGKQIGQWFGPNTVAQVIKKLV 161
Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLC----------------------TTNKRASSNPQ 166
+DD + + HVA+DNT+V+ +KKLC T N+ S P
Sbjct: 162 LFDDNADMAVHVAMDNTVVIEDIKKLCKSSINAWGCYGECSYIHDRSSLTGNQSVSKPPH 221
Query: 167 -------------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISP 201
W+PL+L IPLRLG+ +IN Y
Sbjct: 222 CSCESSQKLKSNRKLKSFNSEELQSWRPLLLFIPLRLGLSEINSDY-------------- 267
Query: 202 VYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIG 261
Y+ +KI+ FT QSLGVIGGKPNHA YFIG+ G+ +++LDPHT Q
Sbjct: 268 -YNSLKIM------------FTLRQSLGVIGGKPNHAHYFIGFNGDRLLYLDPHTTQQT- 313
Query: 262 CVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+ + D ++HC + +DPS+A+
Sbjct: 314 ---IEPERFNVIPDESFHCVYPCFMSFQSLDPSVAL 346
>gi|148691993|gb|EDL23940.1| mCG3720 [Mus musculus]
Length = 318
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 114/254 (44%), Positives = 160/254 (62%), Gaps = 47/254 (18%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 92 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 151
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQV++KLA +D+
Sbjct: 152 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLALFDE 211
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP-------------------QWQPLVLV 173
W+S+ +V++DNT+V+ +KK+C +++P W+PL+L+
Sbjct: 212 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLI 271
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PLRLGI INPVY+ K+C F PQSLG +GG
Sbjct: 272 VPLRLGINQINPVYVEAFKEC---------------------------FKMPQSLGALGG 304
Query: 234 KPNHALYFIGYVGN 247
KPN+A YFIG++G
Sbjct: 305 KPNNAYYFIGFLGK 318
>gi|297669945|ref|XP_002813144.1| PREDICTED: cysteine protease ATG4B isoform 3 [Pongo abelii]
Length = 378
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 127/298 (42%), Positives = 174/298 (58%), Gaps = 57/298 (19%)
Query: 26 TYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKIL 84
++R+ G +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L
Sbjct: 38 SHRRRTEAGGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVL 97
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDN 144
F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DN
Sbjct: 98 NAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDN 157
Query: 145 TLVVNQVKKLC--------------TTNKRASSNPQ----------WQPLVLVIPLRLGI 180
T+V+ ++++LC +++ + P W+PLVL+IPLRLG+
Sbjct: 158 TVVMEEIRRLCRNSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGL 217
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
DIN Y+ +K C F PQSLGVIGGKPN A Y
Sbjct: 218 TDINEAYVETLKHC---------------------------FMMPQSLGVIGGKPNSAHY 250
Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
FIGYVG ++I+LDPHT Q + D D ++HC R+ I +DPSIAV
Sbjct: 251 FIGYVGEELIYLDPHTTQPA----VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 304
>gi|444518589|gb|ELV12252.1| Cysteine protease ATG4B, partial [Tupaia chinensis]
Length = 324
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 128/289 (44%), Positives = 168/289 (58%), Gaps = 63/289 (21%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 44 EEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWA 103
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
+++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 104 QWTQQPDSYFNVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 163
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
WSS+ H+A+DNT+V ++ +IN Y+ +
Sbjct: 164 TWSSLAVHIAMDNTVVTGEI------------------------------NINEAYVETL 193
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
K C F PQSLGVIGGKPN A YFIGYVG+++I+
Sbjct: 194 KHC---------------------------FMMPQSLGVIGGKPNSAHYFIGYVGDELIY 226
Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAVVS 299
LDPHT Q + DS D ++HC SR+ I +DPSIAV+S
Sbjct: 227 LDPHTTQPAV----ELTDSCLVPDESFHCQHPPSRMSIRELDPSIAVLS 271
>gi|426218487|ref|XP_004003478.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4B [Ovis
aries]
Length = 454
Score = 240 bits (612), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 169/311 (54%), Gaps = 73/311 (23%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 87 DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWA 146
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y ++ G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 147 QRKRQPDSYCRVPPQM----------------GVGEGKSIGQWYGPNTVAQVLKKLAVFD 190
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN------------------------KRASSNPQW 167
WS++ HVA+DNT+V+ V++LC + + W
Sbjct: 191 AWSALAVHVAMDNTVVMADVRRLCRSGLPCAGAEAFPADSERHCNGFPAGAEGGECTAPW 250
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ D+N Y +K C F PQS
Sbjct: 251 RPLVLLIPLRLGLADVNAAYAGTLKHC---------------------------FRMPQS 283
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q D+ D ++HC R+
Sbjct: 284 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADRCPVP----DESFHCQHPPGRM 339
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 340 SITELDPSIAV 350
>gi|355750993|gb|EHH55320.1| hypothetical protein EGM_04504, partial [Macaca fascicularis]
Length = 268
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 119/256 (46%), Positives = 157/256 (61%), Gaps = 52/256 (20%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIG 243
LGVIGGKPN A YFIG
Sbjct: 253 LGVIGGKPNSAHYFIG 268
>gi|340369400|ref|XP_003383236.1| PREDICTED: cysteine protease ATG4A-like [Amphimedon queenslandica]
Length = 394
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 123/289 (42%), Positives = 168/289 (58%), Gaps = 45/289 (15%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+D D+ SR WFTYRK F PIGD+G T+D GWGC LRCGQM++ LL HLGRDW
Sbjct: 56 RDGASFVEDLQSRFWFTYRKNFRPIGDTGYTSDSGWGCTLRCGQMLLGHTLLLRHLGRDW 115
Query: 70 QWNVNSKEE-AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
+W+ +S + Y KIL+MF D R + YSI IAL GA G++VG+WFGPN VAQ +++LA
Sbjct: 116 RWSPSSSNDYKYQKILRMFLDYRDSEYSIQMIALQGADFGRSVGQWFGPNNVAQAIKRLA 175
Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYI 188
+D WS + +VA+D +V++ + +P+++ IPLRLG + N Y
Sbjct: 176 VHDQWSEVAVYVAMDMLVVIDDISNF-------------RPVLVFIPLRLGQERFNMEYK 222
Query: 189 NGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
+K C+A+ QS+G+IGGKP HAL+F GY +
Sbjct: 223 EAVKACFAV---------------------------RQSVGIIGGKPRHALWFTGYHDDY 255
Query: 249 VIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+I+LDPH Q+ CV D+ DSTYH Q RLHI +DPS+A+
Sbjct: 256 LIYLDPHKTQS--CV--TLPDAGIVSDSTYHTTQIERLHISELDPSLAL 300
>gi|341885317|gb|EGT41252.1| hypothetical protein CAEBREN_15768 [Caenorhabditis brenneri]
Length = 457
Score = 237 bits (604), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 131/356 (36%), Positives = 187/356 (52%), Gaps = 77/356 (21%)
Query: 4 ANKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLF 62
+++ +D +E +++ ++SR WFTYRK F PIG +G T+D+GWGCMLRC QM++ + LL
Sbjct: 36 GKEITEEDGIEAMKKYMSSRFWFTYRKDFSPIGGTGPTSDQGWGCMLRCAQMLLGEVLLR 95
Query: 63 LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
H+GR ++W++ + Y KIL+MF D + A YSIHQIA G +EGK + +WFGPNT AQ
Sbjct: 96 RHIGRHFEWDIETTSVVYEKILQMFFDEKDALYSIHQIAQMGVTEGKEISKWFGPNTAAQ 155
Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKR--------------------AS 162
VL+KL +DDWS++ HVALDN LV + TT S
Sbjct: 156 VLKKLTIFDDWSNVAVHVALDNILVKEDALTMATTYPSEDAVKLIMENGQVEKHYATITS 215
Query: 163 SNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEF 222
+W+PL+L+IPLRLG+ IN Y+ I++ + L
Sbjct: 216 KEGEWRPLLLMIPLRLGLTSINTCYLPAIQEFFKL------------------------- 250
Query: 223 TFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT---------------------NQNIG 261
PQ +G+IGGKPN A YF+G G + +LDPH + N
Sbjct: 251 --PQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHYCRPKTSKVFAEKEPSTESEQHDTNFS 308
Query: 262 CVYDKE----QDSE---KKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
+ D E Q S+ K DSTYHC + +DPS+A+ + S D+ N+
Sbjct: 309 ELEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEFESIDPSLALALFCESREDFDNL 364
>gi|308490628|ref|XP_003107506.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
gi|308251874|gb|EFO95826.1| CRE-ATG-4.1 protein [Caenorhabditis remanei]
Length = 478
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 136/366 (37%), Positives = 180/366 (49%), Gaps = 106/366 (28%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
LE +++ +TSRLWFTYR+ F PIG +G +TD+GWGCMLRC QM++ + LL H+GR ++W
Sbjct: 49 LEAMKKYMTSRLWFTYRRDFSPIGGTGPSTDQGWGCMLRCAQMLLGEVLLRRHIGRHFEW 108
Query: 72 NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ E Y KIL+MF D + A YSIHQIA G +EGK V EWFGPNT AQV++KL +D
Sbjct: 109 DIEKTSEVYDKILQMFFDEKDALYSIHQIAQMGVTEGKKVSEWFGPNTAAQVIKKLTIFD 168
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTT---------------------------------- 157
DWS+I HVALDN LV + TT
Sbjct: 169 DWSNIAVHVALDNILVKEDALTMATTYPSDNASYIFAVHNFLKYFTLNLTFPNFAENGQI 228
Query: 158 -NKRASS--NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R SS W+PL+++IPLRLG+ INP Y+ I+K + L
Sbjct: 229 EKPRPSSGCTTDWRPLLVMIPLRLGLTSINPCYLPAIQKFFEL----------------- 271
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------TNQNI 260
PQ +G+IGGKPN A YF+G G + +LDPH TN I
Sbjct: 272 ----------PQCVGIIGGKPNLAHYFVGIAGTKLFYLDPHHCRAKTTKRDAGVTTNTMI 321
Query: 261 GCV-----------------YDKEQDSE-----------KKLDSTYHCPQASRLHILHMD 292
+ + K +D E K DSTYHC + +D
Sbjct: 322 SSITTTDAQLDIQNQIDDSDFHKLEDLEPLPSQTSDVYTKMDDSTYHCQMMQWMEYESID 381
Query: 293 PSIAVV 298
PS+A+
Sbjct: 382 PSLALA 387
>gi|30109219|gb|AAH41862.1| ATG4 autophagy related 4 homolog A (S. cerevisiae) [Homo sapiens]
gi|119623096|gb|EAX02691.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
[Homo sapiens]
gi|119623098|gb|EAX02693.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_c
[Homo sapiens]
Length = 321
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 121/274 (44%), Positives = 170/274 (62%), Gaps = 55/274 (20%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
MLRCGQM++AQAL+ HLGRDW W ++ + Y +IL+ F DR+ YSIHQ+A G
Sbjct: 1 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60
Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC--------TTN 158
EGK++GEWFGPNTVAQVL+KLA +D+W+S+ +V++DNT+V+ +KK+C T
Sbjct: 61 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAG 120
Query: 159 KR-----ASSN---------PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYD 204
R +SN W+PL+L++PLRLGI INPVY++ K+C
Sbjct: 121 DRPPDSLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQINPVYVDAFKEC---------- 170
Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
F PQSLG +GGKPN+A YFIG++G+++IFLDPHT Q
Sbjct: 171 -----------------FKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTF---V 210
Query: 265 DKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
D E++ D T+HC Q+ R++IL++DPS+A+
Sbjct: 211 DTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 243
>gi|296206033|ref|XP_002750034.1| PREDICTED: cysteine protease ATG4B isoform 2 [Callithrix jacchus]
Length = 319
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 121/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
MLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L F DR+ + YSIHQIA G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN-------- 158
EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DNT+V+ ++++LC T+
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATA 120
Query: 159 ---------------KRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
+S P W+PLVL+IPLRLG+ DIN Y+ +K C
Sbjct: 121 FPADADRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
F PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212
Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
+ DS D ++HC R+ I +DPSIAV
Sbjct: 213 ---EPTDSCFIPDESFHCQHPPCRMSIAELDPSIAV 245
>gi|403291503|ref|XP_003936827.1| PREDICTED: cysteine protease ATG4B [Saimiri boliviensis
boliviensis]
Length = 319
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 121/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
MLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L F DR+ + YSIHQIA G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWAQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN-------- 158
EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DNT+V+ ++++LC T+
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATA 120
Query: 159 ---------------KRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
+S P W+PLVL+IPLRLG+ DIN Y+ +K C
Sbjct: 121 FPADANRHCNGFPAGAEVTSRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
F PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212
Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
+ DS D ++HC R+ I +DPSIAV
Sbjct: 213 ---EPTDSCFIPDESFHCQHPPCRMSIAELDPSIAV 245
>gi|14042153|dbj|BAB55127.1| unnamed protein product [Homo sapiens]
Length = 331
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 120/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
MLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L F DR+ + YSIHQIA G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KRASS 163
EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DNT+V+ ++++LC T+ A++
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVLCAGATA 120
Query: 164 NPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
P W+PLVL+IPLRLG+ DIN Y+ +K C
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
F PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212
Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
+ D D ++HC R+ I +DPSIAV
Sbjct: 213 ---EPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 245
>gi|426339167|ref|XP_004033531.1| PREDICTED: cysteine protease ATG4B isoform 1 [Gorilla gorilla
gorilla]
gi|426339169|ref|XP_004033532.1| PREDICTED: cysteine protease ATG4B isoform 2 [Gorilla gorilla
gorilla]
gi|221045722|dbj|BAH14538.1| unnamed protein product [Homo sapiens]
Length = 319
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 120/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
MLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L F DR+ + YSIHQIA G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KRASS 163
EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DNT+V+ ++++LC T+ A++
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATA 120
Query: 164 NPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
P W+PLVL+IPLRLG+ DIN Y+ +K C
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
F PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212
Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
+ D D ++HC R+ I +DPSIAV
Sbjct: 213 ---EPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 245
>gi|119591685|gb|EAW71279.1| ATG4 autophagy related 4 homolog B (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 331
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 120/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
MLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L F DR+ + YSIHQIA G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KRASS 163
EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DNT+V+ ++++LC T+ A++
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATA 120
Query: 164 NPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
P W+PLVL+IPLRLG+ DIN Y+ +K C
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
F PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212
Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
+ D D ++HC R+ I +DPSIAV
Sbjct: 213 ---EPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 245
>gi|14041938|dbj|BAB55042.1| unnamed protein product [Homo sapiens]
Length = 280
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 121/279 (43%), Positives = 161/279 (57%), Gaps = 57/279 (20%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
MLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L F DR+ + YSIHQIA G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KRASS 163
EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DNT+V+ ++++LC T+ A++
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATA 120
Query: 164 NPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
P W+PLVL+IPLRLG+ DIN Y+ +K C
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
F PQSLGVIGGKPN A YFIGYVG +I+LDPHT Q
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEGLIYLDPHTTQPA-- 211
Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAVVSQ 300
+ D D ++HC R+ I +DPSIAV Q
Sbjct: 212 --VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGKQ 248
>gi|149037474|gb|EDL91905.1| autophagy-related 4B (yeast), isoform CRA_b [Rattus norvegicus]
Length = 319
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 120/276 (43%), Positives = 159/276 (57%), Gaps = 57/276 (20%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
MLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L F DR+ + YSIHQIA G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFLDRKDSYYSIHQIAQMGVG 60
Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN-------- 158
EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DNT+V+ ++ KLC +
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEISKLCRASLPCAGAAA 120
Query: 159 ---------------KRASSNP-QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
++ P W+PLVL+IPLRLG+ DIN Y+ +K C
Sbjct: 121 LSMESERHCNGLPAGAEVTNRPLAWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
F PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212
Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
+ DS D ++HC R+ I +DPSIAV
Sbjct: 213 ---ELTDSCFIPDESFHCQHPPCRMGIGELDPSIAV 245
>gi|194384462|dbj|BAG59391.1| unnamed protein product [Homo sapiens]
Length = 319
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 119/276 (43%), Positives = 161/276 (58%), Gaps = 57/276 (20%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
MLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L F DR+ + YSIHQIA G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KRASS 163
EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+D+T+V+ ++++LC T+ A++
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDSTVVMEEIRRLCRTSVPCAGATA 120
Query: 164 NPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
P W+PLVL+IPLRLG+ DIN Y+ +K C
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
F PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212
Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
+ D D ++HC R+ I +DPSIAV
Sbjct: 213 ---EPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 245
>gi|395733089|ref|XP_002813143.2| PREDICTED: cysteine protease ATG4B isoform 2 [Pongo abelii]
Length = 331
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 118/276 (42%), Positives = 160/276 (57%), Gaps = 57/276 (20%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
MLRCGQM+ AQAL+ HLGRDW+W ++ ++Y +L F DR+ + YSIHQIA G
Sbjct: 1 MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVG 60
Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC----------- 155
EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DNT+V+ ++++LC
Sbjct: 61 EGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRNSVPCAGATA 120
Query: 156 ---TTNKRASSNPQ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
+++ + P W+PLVL+IPLRLG+ DIN Y+ +K C
Sbjct: 121 FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC-------- 172
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
F PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q
Sbjct: 173 -------------------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV- 212
Query: 263 VYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
+ D D ++HC R+ I +DPSIAV
Sbjct: 213 ---EPTDGCFIPDESFHCQHPPCRMSIAELDPSIAV 245
>gi|198417051|ref|XP_002128504.1| PREDICTED: similar to autophagy-related cysteine endopeptidase 2
[Ciona intestinalis]
Length = 422
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 125/306 (40%), Positives = 181/306 (59%), Gaps = 55/306 (17%)
Query: 19 ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE 78
I S LWFTYRKG+ PIG +G T+D GWGCMLRCGQM++A+AL L + +DW+W + +
Sbjct: 60 IKSFLWFTYRKGYTPIGGTGPTSDSGWGCMLRCGQMLLARALAELTMDKDWKWTEDKPQP 119
Query: 79 A-YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
Y +IL D R++ YSIHQIA G EGK VG+WFGPNT++QVLR+L+++D + +
Sbjct: 120 PPYKRILHQLSDERSSCYSIHQIAQMGVEEGKEVGQWFGPNTISQVLRRLSQFDQENVLA 179
Query: 138 FHVALDNTLVVNQVKKLCTT--------------------------NKRASSNPQWQPLV 171
HVA+DNT+ + +++LC+T N +S+ W+PL+
Sbjct: 180 IHVAMDNTVCIEDIERLCSTTPTTQYEGACSSTCKPDRTKCNGDSPNVSPTSDDFWRPLL 239
Query: 172 LVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVI 231
L+IPLRLG+ +INPVY +K+C + +S+GVI
Sbjct: 240 LLIPLRLGLSEINPVYFTHLKEC---------------------------LHWKESVGVI 272
Query: 232 GGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
GGKPNHA YF+G + +IFLDPHT Q + D + E+ D+T+HC R+ + ++
Sbjct: 273 GGKPNHAYYFLGCSEDSMIFLDPHTTQPYVKLPDITSN-ERYDDTTFHCDTPGRMLLTNL 331
Query: 292 DPSIAV 297
DPS+A+
Sbjct: 332 DPSLAL 337
>gi|268570274|ref|XP_002640735.1| Hypothetical protein CBG19805 [Caenorhabditis briggsae]
Length = 481
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 122/327 (37%), Positives = 172/327 (52%), Gaps = 80/327 (24%)
Query: 4 ANKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLF 62
++S +D ++ +++ +TSR WFTYR+ F PIG +G +TD+ WGCMLRC QM++ + LL
Sbjct: 36 GKEISKEDGIDAMKKYMTSRFWFTYRRNFSPIGGTGPSTDQYWGCMLRCAQMLLGEVLLR 95
Query: 63 LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
H+GR ++W++ + Y KIL+MF D + A YSIHQIA G SEGK V EWFGPNT AQ
Sbjct: 96 RHIGRHFEWDIEKTSDVYEKILQMFFDEKDALYSIHQIAQMGVSEGKEVSEWFGPNTAAQ 155
Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT-----------------NKRASSN- 164
V++KL +DDWS+I HVALDN LV + TT + R SS+
Sbjct: 156 VIKKLTIFDDWSNIAVHVALDNILVKQDALTMATTYPSEDAVKLIMGEFGFKSDRISSSH 215
Query: 165 --------------------------------PQWQPLVLVIPLRLGIQDINPVYINGIK 192
+W+PL+L+IPLRLG+ IN Y++ I+
Sbjct: 216 IICMNLDYFKKLLNFENGLVEKHYTSTVPANGTEWRPLLLMIPLRLGLTSINSCYLSAIQ 275
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
+ + L PQ +G+IGGKPN A YF+G G + +L
Sbjct: 276 EFFKL---------------------------PQCVGIIGGKPNLAHYFVGIAGTKLFYL 308
Query: 253 DPHTNQNIGCVY--DKEQDSEKKLDST 277
DPH + + +KEQ + DST
Sbjct: 309 DPHHCRPKTSKFFVEKEQQQQSSGDST 335
>gi|297265289|ref|XP_002799164.1| PREDICTED: cysteine protease ATG4B-like [Macaca mulatta]
Length = 358
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 112/255 (43%), Positives = 151/255 (59%), Gaps = 52/255 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
+ + +W RK + G +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W ++
Sbjct: 21 ETSEPVWILGRKYSIFTGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQ 80
Query: 78 -EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSI 136
++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D WSS+
Sbjct: 81 PDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSL 140
Query: 137 VFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------WQPLVL 172
H+A+DNT+V+ ++++LC T+ A++ P W+PLVL
Sbjct: 141 AVHIAMDNTVVMEEIRRLCRTSVPCAGAAAFPADSDRHCNGFPAGAEVTNRPSPWRPLVL 200
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
+IPLRLG+ DIN Y+ +K C F PQSLGVIG
Sbjct: 201 LIPLRLGLTDINEAYVETLKHC---------------------------FMMPQSLGVIG 233
Query: 233 GKPNHALYFIGYVGN 247
GKPN A YFIGYVG
Sbjct: 234 GKPNSAHYFIGYVGE 248
>gi|350595874|ref|XP_003484197.1| PREDICTED: cysteine protease ATG4A [Sus scrofa]
Length = 393
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 118/304 (38%), Positives = 167/304 (54%), Gaps = 88/304 (28%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
DI++RLWFTYR+ F PI DW W ++
Sbjct: 76 DISARLWFTYRRKFSPI---------------------------------DWNWEKQKEQ 102
Query: 78 -EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSI 136
+ Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+W+S+
Sbjct: 103 PKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL 162
Query: 137 VFHVALDNTLVVNQVKKLCTT----------NKRASSN------------PQWQPLVLVI 174
+V++DNT+V+ +KK+C ++R S N P W+PL+L++
Sbjct: 163 AVYVSMDNTVVIEDIKKMCCASALSADAAVESRRDSLNASTQSKGPSACRPAWKPLLLIV 222
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
PLRLGI INPVY++ K+C F PQSLG +GGK
Sbjct: 223 PLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGALGGK 255
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDP 293
PN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q R++IL++DP
Sbjct: 256 PNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGMVD-DQTFHCLQPPQRMNILNLDP 311
Query: 294 SIAV 297
S+A+
Sbjct: 312 SVAL 315
>gi|170572866|ref|XP_001892265.1| Peptidase family C54 containing protein [Brugia malayi]
gi|158602497|gb|EDP38912.1| Peptidase family C54 containing protein [Brugia malayi]
Length = 440
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 125/361 (34%), Positives = 177/361 (49%), Gaps = 103/361 (28%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S D+ +++ + S LWFTYRK F PIG +G TTD+GWGCMLRCGQM++A+ L+ HLGR
Sbjct: 65 SRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLIVRHLGR 124
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
+W W+ + Y +IL G SEGK +GEWFGPNT AQVL+KL
Sbjct: 125 NWLWDRDVMLTEYKRILPNM----------------GVSEGKEIGEWFGPNTAAQVLKKL 168
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTN----------------------------- 158
YD WS + HVALDN L+ + ++ + T
Sbjct: 169 VIYDQWSRLTVHVALDNVLITSDIRTMAFTRPPYRRSRRETESDYNDNLGTIDPTEAEIL 228
Query: 159 KRASSNP----------------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
+++ +P +W+PL+++IPLRLG+ IN Y I+ + L
Sbjct: 229 PKSTRSPTRSETSSISSYSGVSEEWRPLLIIIPLRLGLNTINRCYFPAIQAFFEL----- 283
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI-- 260
PQ +G+IGG+PNHALYF G V N++++LDPH QN
Sbjct: 284 ----------------------PQCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQNFVD 321
Query: 261 ----GCVYDKEQD-----SEKKLDSTYHCPQASRLHILHMDPSIAVV----SQRSYSDYK 307
D+ D +++ DSTYHCP I +DPS+A+ ++ YS+
Sbjct: 322 LDEATTTKDERGDYVEIKNDEFRDSTYHCPFILSTKIDKVDPSLALGFFCHTEDDYSELA 381
Query: 308 N 308
N
Sbjct: 382 N 382
>gi|149022064|gb|EDL78958.1| rCG26842 [Rattus norvegicus]
Length = 246
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 98/203 (48%), Positives = 139/203 (68%), Gaps = 24/203 (11%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQV++KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC---------------------TTNKRASSNP--QWQP 169
W+S+ +V++DNT+V+ +KK+C ++ + +S P W+P
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAADTAGESPPDSLIASSQSKGTSAPCLAWKP 223
Query: 170 LVLVIPLRLGIQDINPVYINGIK 192
L+L++PLRLGI INPVYI K
Sbjct: 224 LLLIVPLRLGINQINPVYIEAFK 246
>gi|431912280|gb|ELK14417.1| Cysteine protease ATG4B [Pteropus alecto]
Length = 431
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 124/334 (37%), Positives = 182/334 (54%), Gaps = 54/334 (16%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 52 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 111
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIA--------------LTGASEGKAVGEWFGP 117
++ ++Y +L+ F DR+ + YSIHQIA + + G + + F
Sbjct: 112 QRKRQPDSYFSVLRAFMDRKDSYYSIHQIAPVHPQSRFWRQSASVRTSVLGPQLCQSFAA 171
Query: 118 NTVAQVLR----------KLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
+++ R KLA +D WS++ H+A+DNT+V+ + + ++ + P
Sbjct: 172 VRLSRRRRWELVTLSSPGKLAVFDTWSALAVHIAMDNTVVMEDI----SADRHCNGVPAG 227
Query: 168 QPLV------------LVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
+ L+IPLRLG+ DIN Y+ +K L V + S+ ++
Sbjct: 228 AEVTHRPPLPPWRPLVLLIPLRLGLTDINEAYVGTLKLASTL--------VGLCSAAASL 279
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
++ F PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q V D+ D
Sbjct: 280 PLRQHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEVADRRSIP----D 335
Query: 276 STYHCPQ-ASRLHILHMDPSIAVVSQRSYSDYKN 308
++HC SR+ I +DPSIA ++ D+ +
Sbjct: 336 ESFHCQHPPSRMRIGELDPSIAGFFCQTEDDFDD 369
>gi|341903727|gb|EGT59662.1| CBN-ATG-4.1 protein [Caenorhabditis brenneri]
Length = 433
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 118/324 (36%), Positives = 164/324 (50%), Gaps = 76/324 (23%)
Query: 35 GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAP 94
G +G T+D+GWGCMLRC QM++ + LL H+GR ++W++ + Y KIL+MF D + A
Sbjct: 44 GGTGPTSDQGWGCMLRCAQMLLGEVLLRRHIGRHFEWDIETTSVVYEKILQMFFDEKDAL 103
Query: 95 YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKL 154
YSIHQIA G +EGK + +WFGPNT AQVL+KL +DDWS++ HVALDN LV +
Sbjct: 104 YSIHQIAQMGVTEGKEISKWFGPNTAAQVLKKLTIFDDWSNVAVHVALDNILVKEDALTM 163
Query: 155 CTTNKR--------------------ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
TT S +W+PL+L+IPLRLG+ IN Y+ I++
Sbjct: 164 ATTYPSEDAVKLIMENGQVEKHYATITSKEGEWRPLLLMIPLRLGLTSINTCYLPAIQEF 223
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+ L PQ +G+IGGKPN A YF+G G + +LDP
Sbjct: 224 FKL---------------------------PQCVGIIGGKPNLAHYFVGIAGTKLFYLDP 256
Query: 255 H---------------------TNQNIGCVYDKE----QDSE---KKLDSTYHCPQASRL 286
H + N + D E Q S+ K DSTYHC +
Sbjct: 257 HYCRPKTSKVFAEKEPSTESEQHDTNFSELEDLEPLPSQTSDVYTKMDDSTYHCQMMQWM 316
Query: 287 HILHMDPSIAV-VSQRSYSDYKNV 309
+DPS+A+ + S D+ N+
Sbjct: 317 EFESIDPSLALALFCESREDFDNL 340
>gi|47212536|emb|CAF90552.1| unnamed protein product [Tetraodon nigroviridis]
Length = 366
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 117/286 (40%), Positives = 154/286 (53%), Gaps = 61/286 (21%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
I D+TSRLWFTYRKGF PIG +G T+D GWGCMLRCGQM++ QAL+ HLGRDW+W V+
Sbjct: 65 ILSDVTSRLWFTYRKGFPPIGGTGPTSDTGWGCMLRCGQMILGQALMCRHLGRDWRW-VS 123
Query: 75 SKEE--AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
+E+ Y+ IL F D++ + YSIHQI + W + + +
Sbjct: 124 GEEQRHEYVNILNAFIDKKDSYYSIHQIE-------RLCMPWLDKAEACAASEGVGELNG 176
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
+ ++ C ++ ++ W+PLVL+IPLRLG+ DIN YI +K
Sbjct: 177 Y-----------------LEGACAFSEEETA--LWKPLVLLIPLRLGLTDINEAYIETLK 217
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
KC+ L PQSLGVIGGKPN A YFIGYVG ++I+L
Sbjct: 218 KCFML---------------------------PQSLGVIGGKPNSAHYFIGYVGEELIYL 250
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
DPHT Q D +D D +YHC R+HI +DPSIA
Sbjct: 251 DPHTTQT---AVDPCEDG-TFTDDSYHCQHPPCRMHICELDPSIAA 292
>gi|313228003|emb|CBY23152.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 117/295 (39%), Positives = 170/295 (57%), Gaps = 41/295 (13%)
Query: 9 HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
+++LE I+ D SRLWFTYR+ F IG SG T+D+GWGCMLR GQM++A+ LL LGR+
Sbjct: 36 YEELEDIQGDFQSRLWFTYRRNFASIGGSGPTSDQGWGCMLRAGQMLVAECLLRQRLGRN 95
Query: 69 WQWNVNS-KEEAYLKILKMFEDRRTAPYSIHQIALTGAS-EGKAVGEWFGPNTVAQVLRK 126
+ W+ +S ++E Y +IL++F D +A S+ QIALTGA+ E +AVGEWFGPNT+AQVL++
Sbjct: 96 YVWSESSIEDERYTEILELFRDTHSAELSLQQIALTGATAEKRAVGEWFGPNTMAQVLKR 155
Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPV 186
+ K V VA+D+ + V V + + PLVL+IPLRLG+ +N +
Sbjct: 156 ITKSRSLGFGV-TVAMDSVVSVEDVSAEIINGGKPT------PLVLMIPLRLGLNSVNEI 208
Query: 187 YINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG 246
Y+N +K L+S Y +G++GGKPN A YF+GY
Sbjct: 209 YVNPLK--------------IFLASKY-------------CVGIMGGKPNQAHYFVGYQE 241
Query: 247 ND----VIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+++LDPHT Q + E + D + H + + L +DPS+AV
Sbjct: 242 TVEDTWLLYLDPHTTQQSPVSVNNNMPFE-QFDKSLHTDKLCWIKALKLDPSLAV 295
>gi|328769729|gb|EGF79772.1| hypothetical protein BATDEDRAFT_35298 [Batrachochytrium
dendrobatidis JAM81]
Length = 441
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 106/285 (37%), Positives = 156/285 (54%), Gaps = 38/285 (13%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
D SRLW TYRKGF I +G T D GWGCMLR GQM++A ALLF LGRDW+ ++
Sbjct: 141 DFHSRLWMTYRKGFAAIKPTGYTCDSGWGCMLRSGQMLVANALLFHELGRDWRLGDSNDR 200
Query: 78 E---AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWS 134
+ Y IL F D T+PYSI +IA G K +GEWFGP+T++QVL+ L D
Sbjct: 201 DTWLTYCSILTKFLDVNTSPYSIQRIATLGIRFDKQIGEWFGPSTISQVLKVLVNDDQRI 260
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
S+ HV+ D + N++ + + + P ++++IPLRLG++ +NPVY G+K C
Sbjct: 261 SLKVHVSNDGVVYKNEINTILSATRDDGKTPA---VLIMIPLRLGVETMNPVYYPGVKHC 317
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+A+ +G+ GG+PN +L+F+G G+ +I+LDP
Sbjct: 318 FAM---------------------------SHCVGIAGGRPNSSLFFLGVDGDHLIYLDP 350
Query: 255 HTNQNIGCVYDKEQDSEKKLDS--TYHCPQASRLHILHMDPSIAV 297
H ++ D + K++ +YHC + L I MDPS+ +
Sbjct: 351 H---HLRPSVDSRDITSYKMEDLLSYHCEKVRLLPIASMDPSLVI 392
>gi|395545675|ref|XP_003774724.1| PREDICTED: cysteine protease ATG4A [Sarcophilus harrisii]
Length = 431
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 105/255 (41%), Positives = 151/255 (59%), Gaps = 53/255 (20%)
Query: 65 LGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
L DW W + ++ E Y +ILK F DR+ YSIHQ+A G EGK++GEWFGPNTVAQV
Sbjct: 131 LQADWGWEKHQEQPEEYQRILKCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQV 190
Query: 124 LRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC-------TTNKRASS------------- 163
L+KLA +D+W+S+ +V++DNT+V+ +KK+C T + +SS
Sbjct: 191 LKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCHMCPSDLTHDSSSSSYNGLDWNTDCPGQ 250
Query: 164 NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
W+PL+L++PLRLGI INP+Y + K+C F
Sbjct: 251 TSGWKPLLLIVPLRLGINQINPIYADAFKEC---------------------------FK 283
Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA 283
PQSLG +GGKPN A YFIG++G+++I+LDPHT Q D E++ D ++HC Q+
Sbjct: 284 MPQSLGALGGKPNSAYYFIGFLGDELIYLDPHTTQTF---VDTEENGTVN-DQSFHCQQS 339
Query: 284 -SRLHILHMDPSIAV 297
R+ IL++DPS+A+
Sbjct: 340 PPRMKILNLDPSVAL 354
>gi|156396522|ref|XP_001637442.1| predicted protein [Nematostella vectensis]
gi|156224554|gb|EDO45379.1| predicted protein [Nematostella vectensis]
Length = 342
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 106/299 (35%), Positives = 163/299 (54%), Gaps = 45/299 (15%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ- 70
LE+ R TS +W TYR+ FV + S LT+D GWGCMLR GQM++A L+F L +DW+
Sbjct: 51 LEEFHRHFTSLIWLTYRRSFVQLNGSNLTSDCGWGCMLRSGQMMLASGLIFHFLKKDWRI 110
Query: 71 ---WNVNSKEEAYLKILKMF---EDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVL 124
+ +E Y IL+ F +D +P+S+H++ G GK G+W+GP +VA +L
Sbjct: 111 SGRCHSREQEHYYRVILQFFGDQDDEERSPFSLHRLVTLGQHTGKQAGDWYGPASVAHIL 170
Query: 125 RKL---AKYDDWSSIVFHVALDNTLVVNQVKKLCT---TNKRASSNPQWQPLVLVIPLRL 178
K A + I +VA D T+ +++VK++CT T++R S+ +W+P+++++P+RL
Sbjct: 171 EKAMISATHPLLHDINIYVAQDCTVYIDEVKRVCTHCRTHQRDCSSGKWRPVIILVPMRL 230
Query: 179 GIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHA 238
G + +NP+YI +K FT Q +G+IGG+P H+
Sbjct: 231 GGEALNPIYIPCVKSL---------------------------FTLDQCIGIIGGRPKHS 263
Query: 239 LYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
LYF+G+ +I LDPH Q V D Q EK ++HCP + MDPS +
Sbjct: 264 LYFVGFQDEKMIHLDPHYCQP---VVDTTQ--EKFPTESFHCPNPRKTSFKKMDPSCTI 317
>gi|403289553|ref|XP_003935916.1| PREDICTED: cysteine protease ATG4A isoform 2 [Saimiri boliviensis
boliviensis]
Length = 360
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 112/286 (39%), Positives = 158/286 (55%), Gaps = 73/286 (25%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 68 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 127
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 128 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 187
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
W+S+ +V++DNT+V I+DI K
Sbjct: 188 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 208
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
C LP+S TP G +P +L +++IFL
Sbjct: 209 MCRVLPLS--------------ADTP-------------GDRPPDSLT-ASNESDELIFL 240
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
DPHT Q D E++ D T+HC Q+ R++IL++DPS+A+
Sbjct: 241 DPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 282
>gi|296236154|ref|XP_002763201.1| PREDICTED: uncharacterized protein LOC100409486 [Callithrix
jacchus]
Length = 360
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 112/286 (39%), Positives = 158/286 (55%), Gaps = 73/286 (25%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 68 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 127
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 128 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 187
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
W+S+ +V++DNT+V I+DI K
Sbjct: 188 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 208
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
C LP+S TP G +P +L +++IFL
Sbjct: 209 MCRVLPLS--------------ADTP-------------GDRPPDSLT-ASNRSDELIFL 240
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
DPHT Q D E++ D T+HC Q+ R++IL++DPS+A+
Sbjct: 241 DPHTTQTF---VDAEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 282
>gi|397497902|ref|XP_003819742.1| PREDICTED: cysteine protease ATG4A isoform 2 [Pan paniscus]
Length = 336
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 112/286 (39%), Positives = 158/286 (55%), Gaps = 73/286 (25%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
W+S+ +V++DNT+V I+DI K
Sbjct: 164 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 184
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
C LP+S TP G +P +L +++IFL
Sbjct: 185 MCRVLPLS--------------ADTP-------------GDRPPDSLT-ASNQSDELIFL 216
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
DPHT Q D E++ D T+HC Q+ R++IL++DPS+A+
Sbjct: 217 DPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 258
>gi|119623101|gb|EAX02696.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_g
[Homo sapiens]
Length = 340
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 110/286 (38%), Positives = 156/286 (54%), Gaps = 73/286 (25%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 48 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 107
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 108 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 167
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
W+S+ +V++DNT+V I+DI K
Sbjct: 168 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 188
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
C LP+S G +P +L +++IFL
Sbjct: 189 MCRVLPLSA---------------------------DTAGDRPPDSLT-ASNQSDELIFL 220
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
DPHT Q D E++ D T+HC Q+ R++IL++DPS+A+
Sbjct: 221 DPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 262
>gi|332226094|ref|XP_003262224.1| PREDICTED: cysteine protease ATG4A isoform 2 [Nomascus leucogenys]
Length = 336
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 110/286 (38%), Positives = 156/286 (54%), Gaps = 73/286 (25%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
W+S+ +V++DNT+V I+DI K
Sbjct: 164 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 184
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
C LP+S G +P +L +++IFL
Sbjct: 185 MCCVLPLSA---------------------------DTAGDRPPDSLT-ASNQSDELIFL 216
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
DPHT Q D E++ D T+HC Q+ R++IL++DPS+A+
Sbjct: 217 DPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 258
>gi|30795248|ref|NP_840054.1| cysteine protease ATG4A isoform b [Homo sapiens]
gi|426397038|ref|XP_004064735.1| PREDICTED: cysteine protease ATG4A isoform 2 [Gorilla gorilla
gorilla]
gi|15487242|emb|CAC69077.1| putative autophagy-related cysteine endopeptidase 2 [Homo sapiens]
gi|119623095|gb|EAX02690.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 336
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 110/286 (38%), Positives = 156/286 (54%), Gaps = 73/286 (25%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
W+S+ +V++DNT+V I+DI K
Sbjct: 164 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 184
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
C LP+S G +P +L +++IFL
Sbjct: 185 MCRVLPLSA---------------------------DTAGDRPPDSLT-ASNQSDELIFL 216
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
DPHT Q D E++ D T+HC Q+ R++IL++DPS+A+
Sbjct: 217 DPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNILNLDPSVAL 258
>gi|410989159|ref|XP_004000832.1| PREDICTED: cysteine protease ATG4A isoform 2 [Felis catus]
Length = 336
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 109/286 (38%), Positives = 156/286 (54%), Gaps = 73/286 (25%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
W+S+ +V++DNT+V I+DI K
Sbjct: 164 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 184
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
C LP S + T TP G + +++IFL
Sbjct: 185 MCCVLPSS---------ADTVGESTP----------GTLNASNQ---------SDELIFL 216
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
DPHT Q + +++ D T+HC Q+ R++IL++DPS+A+
Sbjct: 217 DPHTTQ----TFVNTEENGTVDDQTFHCLQSPQRMNILNLDPSVAL 258
>gi|402911089|ref|XP_003918175.1| PREDICTED: cysteine protease ATG4A isoform 2 [Papio anubis]
Length = 336
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 110/286 (38%), Positives = 155/286 (54%), Gaps = 73/286 (25%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
W+S+ +V++DNT+V I+DI K
Sbjct: 164 WNSLAVYVSMDNTVV--------------------------------IEDIK-------K 184
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
C LP+S G +P L +++IFL
Sbjct: 185 MCRVLPLSA---------------------------DTAGDRPLDYLT-ASNQSDELIFL 216
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
DPHT Q D E++ D T+HC Q+ R++IL++DPS+A+
Sbjct: 217 DPHTTQTF---VDTEENGMVN-DQTFHCLQSPQRMNILNLDPSVAL 258
>gi|395854620|ref|XP_003799780.1| PREDICTED: cysteine protease ATG4A isoform 2 [Otolemur garnettii]
Length = 336
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 110/286 (38%), Positives = 153/286 (53%), Gaps = 73/286 (25%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
W+S+ +V++D N V I IK
Sbjct: 164 WNSLAVYVSMD----------------------------------------NTVVIEDIK 183
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
K M +L P S G P +L + N++IFL
Sbjct: 184 K-----------MCCVL---------------PSSADTAGESPPGSLTALNQ-SNELIFL 216
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
DPHT Q D E++ D T+HC Q+ R++IL++DPS+A+
Sbjct: 217 DPHTTQTF---VDTEENGTVD-DQTFHCLQSPQRMNILNLDPSVAL 258
>gi|256078123|ref|XP_002575347.1| autophagin-1 (C54 family) [Schistosoma mansoni]
gi|360045353|emb|CCD82901.1| autophagin-1 (C54 family) [Schistosoma mansoni]
Length = 556
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 78/153 (50%), Positives = 114/153 (74%), Gaps = 1/153 (0%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGD-SGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
S D E+I R + SRLW TYRKGF PIG +G +D GWGCM RCGQM++A+A+L HLG
Sbjct: 31 SLDDREEIARHLNSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRFHLG 90
Query: 67 RDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
R W+W+ + Y ++L+MF+DRR+A YSI I LTG S GK++G WFGPNTVAQVL+K
Sbjct: 91 RSWKWSPEQESPEYYRLLQMFQDRRSALYSIQTITLTGVSLGKSIGSWFGPNTVAQVLKK 150
Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNK 159
L+ YD W+++ H+++++ ++++++K LC ++
Sbjct: 151 LSVYDRWTNLFIHISVEDGIIIDEIKSLCCQHR 183
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 42/76 (55%), Gaps = 5/76 (6%)
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
F P +G++GG P HA++ +G +DVI LDPHT Q G + + D TYHC
Sbjct: 351 FRLPHCVGILGGSPCHAVWIVGVTDDDVICLDPHTTQPAG-----RGNLKPDYDQTYHCD 405
Query: 282 QASRLHILHMDPSIAV 297
R+ + +DPS+ +
Sbjct: 406 NPIRIPLKRLDPSMVL 421
>gi|307174864|gb|EFN65142.1| Cysteine protease ATG4D [Camponotus floridanus]
Length = 477
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 104/301 (34%), Positives = 158/301 (52%), Gaps = 52/301 (17%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
+E+ +RD SR+W TYR+ F + S TTD GWGCMLR GQM++AQAL+ LGR+W+W
Sbjct: 135 IEEFKRDFVSRIWLTYRREFQILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRW 194
Query: 72 NVNSKEEA---------YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
E + I+K F D+ +P+SIH++ L GAS GK G+W+GP++VA
Sbjct: 195 RPEQPIETLQQRLDDRNHRMIIKWFGDQSESPFSIHRLVLLGASAGKRAGDWYGPSSVAH 254
Query: 123 VLRKLAKY------DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+L + + ++ + +VA D + + V+ +C T + +W+ LVL++PL
Sbjct: 255 LLSQAVECASKQSNSNFDHLAVYVAQDCAVYLQDVENICRT-----PDGKWKALVLLVPL 309
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
RLG +NPVY L+S + T +GVIGG+P
Sbjct: 310 RLGADKLNPVY------------------APCLTSLLTLDT---------CIGVIGGRPR 342
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
H+LYFIGY + +I LDPH Q V+ + +++HC ++ + MDPS
Sbjct: 343 HSLYFIGYQDDKLIHLDPHYCQETVDVWKNDFSL-----TSFHCTSPRKMLLSKMDPSCC 397
Query: 297 V 297
V
Sbjct: 398 V 398
>gi|402581511|gb|EJW75459.1| peptidase family C54 containing protein [Wuchereria bancrofti]
Length = 256
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 80/150 (53%), Positives = 108/150 (72%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S D+ +++ + S LWFTYRK F PIG +G TTD+GWGCMLRCGQM++A+ L+ HLG
Sbjct: 36 SRGDMARVKEFMASLLWFTYRKNFQPIGGTGPTTDQGWGCMLRCGQMLLARVLIVRHLGH 95
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
+W W+ + K Y +IL+MF+D++ +SIHQIA G SEGK +GEWFGPNT AQVL+KL
Sbjct: 96 NWLWDRDVKLTEYKRILRMFQDKKNCLFSIHQIANMGVSEGKEIGEWFGPNTAAQVLKKL 155
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTT 157
YD WS + HVALDN L+ + ++ + T
Sbjct: 156 VIYDQWSRLTVHVALDNVLITSDIRTMAFT 185
>gi|320166566|gb|EFW43465.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 336
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 105/287 (36%), Positives = 153/287 (53%), Gaps = 39/287 (13%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D + + + S W TYR F I DS TD GWGCMLRCGQM++A+A+ HLG++W
Sbjct: 22 DEQALEHAVRSFPWMTYRNHFAQIADSYYNTDAGWGCMLRCGQMLLARAMTVQHLGKNWA 81
Query: 71 -WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
+ + + + L +F D AP+SIH+IA G + GK +G+WFGPNTVAQVL+ L
Sbjct: 82 PTSRKQRHQEMARFLPLFFDTPAAPFSIHRIAERGEALGKTIGQWFGPNTVAQVLKNLVN 141
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI-QDINPVYI 188
SS++ H A+D V+N+ + T A S+ + L++++P+RLG+ Q INPVYI
Sbjct: 142 -SQRSSLIVHCAMDG--VLNRTEA-STQLAAALSDGKKHSLLVLVPIRLGLNQSINPVYI 197
Query: 189 NGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
+K L PQ LG+IGGKPN A +F+G V +
Sbjct: 198 PALKATLEL---------------------------PQCLGIIGGKPNAAHFFVGTVNEN 230
Query: 249 VIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
V++LDPH V D + ++ S++ I +DPS+
Sbjct: 231 VLYLDPHV------VQDAAMELTPDTVESFSVAVLSKMAISDVDPSM 271
>gi|391340875|ref|XP_003744760.1| PREDICTED: cysteine protease ATG4D-like [Metaseiulus occidentalis]
Length = 488
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/326 (30%), Positives = 170/326 (52%), Gaps = 60/326 (18%)
Query: 5 NKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH 64
NK + + D ++RLWFTYR+ F P+ +G T+D GWGCMLR QM++A+A +F
Sbjct: 138 NKNNSASFKNFFADFSTRLWFTYRQDFQPMQSTGHTSDSGWGCMLRSAQMMLAEAFIFHL 197
Query: 65 LGRDWQWNVNSKEE---AYLKILKMFE---DRRTAPYSIHQIALTGASEGKAVGEWFGPN 118
LGR W+W +++ + KI+K F D AP+S+H + A GK G+WFGP+
Sbjct: 198 LGRQWRWCPQQQQQEHGVHRKIIKWFSDDPDTTEAPFSVHNMVRAAAHCGKKAGDWFGPS 257
Query: 119 TVAQVLRKLAKY--------DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPL 170
T A +L++ + + + + +VA D T+ V LCT++ N +W+ +
Sbjct: 258 TAAYLLKRCLEEAAGVADSKEIFEQMAIYVAQDCTIYTQDVLDLCTSDP----NIEWKSV 313
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
VL+IP+RLG + +N YI+ IK+ A + LG+
Sbjct: 314 VLLIPVRLGGERVNVNYIHCIKEILA---------------------------YQNCLGI 346
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD---STYHCPQASRLH 287
IGGKP H+LYF+G+ G +++LDPH Y ++ +L+ +++HC A ++
Sbjct: 347 IGGKPRHSLYFVGFQGKKLVYLDPH--------YLQKTTDTSRLNFSVNSFHCTTARKVS 398
Query: 288 ILHMDPSIAV----VSQRSYSDYKNV 309
+DPS + ++R + ++++
Sbjct: 399 FSKLDPSATIGFYCKTRRDFESFQSI 424
>gi|332026942|gb|EGI67039.1| Cysteine protease ATG4D [Acromyrmex echinatior]
Length = 392
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 107/296 (36%), Positives = 162/296 (54%), Gaps = 46/296 (15%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
+E+ +RD SRLW TYR+ F + S TTD GWGCMLR GQM++AQAL+ LGR+W+W
Sbjct: 55 IEEFKRDFMSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWRW 114
Query: 72 --NVNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
++ E ++ I+K F D+ T +P+SIH++ GAS GK G+W+GP++VA +L +
Sbjct: 115 RPEQSTDESSHRMIIKWFGDQPTPESPFSIHKLVSLGASTGKRAGDWYGPSSVAHLLCQA 174
Query: 128 AK--YDDWSS----IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQ 181
+ +D +S + +VA D + + V+ +C T + L+L++PLRLG
Sbjct: 175 MERASEDPNSKLNQLAVYVAQDCAVYMQDVENVCCTPDGGR-----KALILLVPLRLGAD 229
Query: 182 DINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYF 241
+NPVY L+S + T +GVIGG+P H+LYF
Sbjct: 230 KLNPVY------------------APCLTSLLTLDT---------CIGVIGGRPRHSLYF 262
Query: 242 IGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
IGY + +I LDPH Q V + +EK +++HC ++ + MDPS V
Sbjct: 263 IGYQDDKLIHLDPHYCQETVDV----EGNEKFPLTSFHCTSPRKMLLSKMDPSCCV 314
>gi|76156435|gb|AAX27646.2| SJCHGC05841 protein [Schistosoma japonicum]
Length = 414
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 74/146 (50%), Positives = 110/146 (75%), Gaps = 1/146 (0%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGD-SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
D E+I + SRLW TYRKGF PIG +G +D GWGCM RCGQM++A+A+L +HLGR W
Sbjct: 40 DREEIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSW 99
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
+W+ + Y ++L+MF+DRR+ YSI I LTG S GK++G WFGPNT+AQVL+KL+
Sbjct: 100 RWSPEQESPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSV 159
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLC 155
YD W+++ H+++++ ++++++K LC
Sbjct: 160 YDRWTNLFVHISVEDGIIIDEIKSLC 185
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 31/52 (59%), Gaps = 3/52 (5%)
Query: 144 NTLVVNQVKKLCTTNKRASSNP---QWQPLVLVIPLRLGIQDINPVYINGIK 192
N + +C ++ +S+NP W+PL+L +PLRLG+ + NP Y N IK
Sbjct: 361 NQINSTTAASVCESSSLSSTNPPSSNWRPLLLFVPLRLGLHNPNPCYFNAIK 412
>gi|328786958|ref|XP_393739.4| PREDICTED: cysteine protease ATG4D-like [Apis mellifera]
Length = 525
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 166/310 (53%), Gaps = 54/310 (17%)
Query: 5 NKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
+ +S +D +E+ ++D TSRLW TYR+ F + S TTD GWGCMLR GQM++AQAL+
Sbjct: 173 DAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCH 232
Query: 64 HLGRDWQWNVNS---------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
LGR+W+W + E + I+K F D RT+P+SIH + GA GK G
Sbjct: 233 FLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKRAG 292
Query: 113 EWFGPNTVAQVLRK-----LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
+W+GP++VA +L + + ++ ++++ +VA D + + ++ +C T S+ +W
Sbjct: 293 DWYGPSSVAHLLSQAVENAVERHPAFNNLAVYVAQDCAVYLQDIENVCQT-----SDGKW 347
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+ L+L +PLRLG +NPVY + + + T
Sbjct: 348 KSLILFVPLRLGADKLNPVYTSCLT---------------------------HLLTLDTC 380
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
+GVIGG+P H+LYFIG+ + +I LDPH Q D +D+ +++HC ++
Sbjct: 381 IGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE---TVDVLKDNFSL--TSFHCTSPRKML 435
Query: 288 ILHMDPSIAV 297
I MDPS V
Sbjct: 436 ISKMDPSCCV 445
>gi|257205644|emb|CAX82473.1| autophagy-related cysteine endopeptidase 2 [Schistosoma japonicum]
Length = 632
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 74/146 (50%), Positives = 110/146 (75%), Gaps = 1/146 (0%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGD-SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
D E+I + SRLW TYRKGF PIG +G +D GWGCM RCGQM++A+A+L +HLGR W
Sbjct: 40 DREEIAHHLKSRLWMTYRKGFSPIGSRNGPKSDAGWGCMHRCGQMILAEAMLRVHLGRSW 99
Query: 70 QWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
+W+ + Y ++L+MF+DRR+ YSI I LTG S GK++G WFGPNT+AQVL+KL+
Sbjct: 100 RWSPEQESPEYYRLLQMFQDRRSVLYSIQTITLTGLSVGKSIGSWFGPNTIAQVLKKLSV 159
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLC 155
YD W+++ H+++++ ++++++K LC
Sbjct: 160 YDRWTNLFVHISVEDGIIIDEIKSLC 185
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 74/157 (47%), Gaps = 35/157 (22%)
Query: 144 NTLVVNQVKKLCTTNKRASSNP---QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
N + +C ++ +S+NP W+PL+L +PLRLG+ + NP Y N IK
Sbjct: 361 NQINSTTAASVCESSSLSSTNPPSSNWRPLLLFVPLRLGLHNPNPCYFNAIKAV------ 414
Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI 260
F P +G++GG P HA++ +G G+DVI LDPHT Q
Sbjct: 415 ---------------------FRLPNCIGILGGSPCHAVWIVGVTGDDVICLDPHTTQPA 453
Query: 261 GCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
G + + D TYHC R+ + +DPS+ +
Sbjct: 454 G-----RGNLKPDYDQTYHCENPIRMPLKRLDPSMVL 485
>gi|189233733|ref|XP_971091.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270015047|gb|EFA11495.1| hypothetical protein TcasGA2_TC014208 [Tribolium castaneum]
Length = 453
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 160/304 (52%), Gaps = 53/304 (17%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ E ++D SRLW TYR+ F + S ++D GWGCMLR GQM+IAQAL+ LGRDW
Sbjct: 107 EGFEGFKKDFISRLWLTYRREFPILNGSNYSSDCGWGCMLRSGQMLIAQALVCHILGRDW 166
Query: 70 QWNVN---SKEEAYL------KILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPN 118
+W + + E+++ KI+K F D+ R +P+SIH + G + GK G+W+GP
Sbjct: 167 RWQPDHQPTTRESFIEVVNHRKIIKWFGDKPSRNSPFSIHTLVALGEASGKKAGDWYGPG 226
Query: 119 TVAQVLR---KLAKYDDWS--SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
VA + R K A D++ S+ VA D + + V + CT N +W+ L+L+
Sbjct: 227 FVAHLFRQAFKRASEDNYEFDSLTVCVAQDCAVYIKDVMEECTDK-----NGKWKSLILL 281
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
IP+RLG + N +Y +P + F+ Q +G+IGG
Sbjct: 282 IPVRLGAEKFNSIY------------APCLTTL---------------FSLKQCIGIIGG 314
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P H+LYF+GY + +I LDPH Q + V+ + +++HC ++H+ MDP
Sbjct: 315 RPKHSLYFVGYQDDKLIHLDPHYCQEVVDVWAVDFPL-----TSFHCRSPRKIHLSKMDP 369
Query: 294 SIAV 297
S +
Sbjct: 370 SCCI 373
>gi|427787309|gb|JAA59106.1| Putative peptidase family c54 [Rhipicephalus pulchellus]
Length = 517
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 101/296 (34%), Positives = 161/296 (54%), Gaps = 44/296 (14%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E D +SRLWFTYR+ F PI + +T+D GWGCMLR QM++AQA++ LGR W++
Sbjct: 179 ELFLEDFSSRLWFTYRREFPPIPGTDITSDCGWGCMLRSSQMMLAQAVVTHVLGRQWRYR 238
Query: 73 VNSKEEA----YLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
N++ EA + +++++F DR +P+S+H++ G GK G+W+GP++ A +L++
Sbjct: 239 RNNQTEASDYVHRQVVRLFGDRTASASPFSLHKLVQMGHESGKQAGDWYGPSSAAYILKE 298
Query: 127 LAKYDDWSS-----IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQ 181
+ + + +VA D T+ + V+ LC R++ P W+ +++++P+RLG +
Sbjct: 299 ALEGACQTEQLLLDLRIYVAQDCTIYLEDVRALC-RGTRSNGAPLWRSVIILVPVRLGGE 357
Query: 182 DINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYF 241
+NP YI +K + P +GVIGG+P H+LYF
Sbjct: 358 QLNPTYIPCVKGM---------------------------LSHPNCIGVIGGRPRHSLYF 390
Query: 242 IGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+G+ G VI+LDPH Q V QD LDS YHC ++ MDPS +
Sbjct: 391 LGWQGEKVIYLDPHYVQE--AVDVGPQDF--PLDS-YHCSWPRKMSFYKMDPSCTM 441
>gi|157126425|ref|XP_001660889.1| hypothetical protein AaeL_AAEL010516 [Aedes aegypti]
gi|108873276|gb|EAT37501.1| AAEL010516-PA [Aedes aegypti]
Length = 583
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 101/338 (29%), Positives = 163/338 (48%), Gaps = 81/338 (23%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+D+E +RD +RLW TYRK F + DS T+D GWGCM+R GQM++AQ LL LGR+W
Sbjct: 165 EDIEAFKRDFVTRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLLVHFLGRNW 224
Query: 70 QWN------------VNSKEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWF 115
+W+ +N ++ + KI++ F D RT+P+SIH + G GK G+W+
Sbjct: 225 RWDATAESLRMNYHSLNYEDNVHRKIIRWFGDTSSRTSPFSIHTLVALGKETGKKPGDWY 284
Query: 116 GPNTVAQVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----- 165
GP +VA +LR+ K D + +VA D + + + CT + + P
Sbjct: 285 GPGSVAHLLRQAVKLAAQEISDLDGVNVYVAQDCAVYIQDIIDECTVSAGPTLAPWQKKS 344
Query: 166 --------------------------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
W+ L+L++PLRLG + +NP+Y + +K +L
Sbjct: 345 PGSSSSSTTSTSNSNPTTSSSTDSTDHWKSLILLVPLRLGAEKLNPIYSDCLKAMLSLD- 403
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
+G+IGG+P H+LYF+G+ + +I LDPH Q+
Sbjct: 404 --------------------------NCIGIIGGRPKHSLYFVGFQEDKLIHLDPHYCQD 437
Query: 260 IGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+ V ++E +++HC ++ + MDPS +
Sbjct: 438 MVDVVNQENFPV----ASFHCKSPRKMKLSKMDPSCCI 471
>gi|307205961|gb|EFN84087.1| Cysteine protease ATG4D [Harpegnathos saltator]
Length = 456
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 152/303 (50%), Gaps = 54/303 (17%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
+E+ +RD SRLW TYR+ F + S TTD GWGCMLR GQM++AQAL+ LGR+W+W
Sbjct: 111 IEEFKRDFASRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGREWKW 170
Query: 72 NVNSKEEA---------YLKILKMFEDRR--TAPYSIHQIALTGASEGKAVGEWFGPNTV 120
E + I+K F D+ +P+SIH++ GAS GK G+W+GPN+V
Sbjct: 171 RPEQSIENTQQMRDDSNHRMIIKWFADQSKPESPFSIHRLVSLGASTGKRAGDWYGPNSV 230
Query: 121 AQVLRKLAKY------DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
A +L + + S + +VA D + + V+++C T S+ W+ L+L++
Sbjct: 231 AHLLSQAVERTGELPNSKLSRLAVYVAQDCAVYMQDVEEVCRT-----SDGGWKSLILLV 285
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
PL LG +NPVY + T +GVIGG+
Sbjct: 286 PLMLGTDKLNPVYAPCVTSL---------------------------LTLDACIGVIGGR 318
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
P H+LYFIGY + +I LDPH C + E +++HC ++ + MDPS
Sbjct: 319 PRHSLYFIGYQDDKLIHLDPHY-----CQETVDVSKENFPLTSFHCTSPRKMLLSKMDPS 373
Query: 295 IAV 297
V
Sbjct: 374 CCV 376
>gi|328874598|gb|EGG22963.1| hypothetical protein DFA_05093 [Dictyostelium fasciculatum]
Length = 432
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 168/314 (53%), Gaps = 66/314 (21%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH-LGRDWQ 70
+E+ D +++LW +YR+GF IGDS D GWGCMLR GQM++A LL +G+DW+
Sbjct: 88 IEEFLEDFSNKLWCSYRQGFECIGDSLFENDCGWGCMLRSGQMLLANVLLLNSPIGKDWK 147
Query: 71 WNVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLA 128
N + E + K++++F DR +AP+SIH IAL G + GK++GEWF P+ ++ +R L
Sbjct: 148 KPQNGEYPEDFYKVVRLFLDRPSAPFSIHNIALHGRNHLGKSIGEWFAPSNISNAIRALV 207
Query: 129 -KY------------------------DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASS 163
KY D+ ++ +V+ D +L ++Q+ ++ S
Sbjct: 208 YKYDNHLNGTSEEDSSDEEKEGKKKKGDNQCNLSVYVSDDGSLYIDQLLEIALR-----S 262
Query: 164 NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
+ W PL+++IP +LGI IN +Y P+ D+ +T
Sbjct: 263 DGSWMPLLILIPTKLGIDTINEIYYR-----------PLLDI----------------YT 295
Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA 283
FPQ+LG++GGKP +LYFI +++ +LDPHT QN E DS+ L S+Y C
Sbjct: 296 FPQNLGIVGGKPRASLYFIASQDDNLFYLDPHTVQN-----SIESDSDFSL-SSYFCNIP 349
Query: 284 SRLHILHMDPSIAV 297
+ +I +DPS+ +
Sbjct: 350 KKANISEVDPSLVI 363
>gi|380015613|ref|XP_003691794.1| PREDICTED: cysteine protease ATG4D-like [Apis florea]
Length = 486
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 104/310 (33%), Positives = 164/310 (52%), Gaps = 54/310 (17%)
Query: 5 NKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
+ +S +D +E+ ++D TSRLW TYR+ F + S TTD GWGCMLR GQM++AQAL+
Sbjct: 134 DAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCH 193
Query: 64 HLGRDWQWNVNS---------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
LGR+W+W + E + I+K F D RT+P+SIH + GA GK G
Sbjct: 194 FLGREWRWQPDQPIKTEQQKLDEYNHRLIIKSFGDLPERTSPFSIHTLVSLGALWGKRAG 253
Query: 113 EWFGPNTVAQVLRKLA-----KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
+W+GP++VA +L + ++ ++++ +VA D + + ++ +C T + +W
Sbjct: 254 DWYGPSSVAHLLSQAVENAAERHPAFNNLAVYVAQDCAVYLQDIENVCQT-----PDGKW 308
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+ L+L +PLRLG +NPVY + + + T
Sbjct: 309 KSLILFVPLRLGADKLNPVYTSCLT---------------------------HLLTLDTC 341
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
+GVIGG+P H+LYFIG+ + +I LDPH Q D +D+ +++HC ++
Sbjct: 342 IGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE---TVDVLKDNFSL--TSFHCTSPRKML 396
Query: 288 ILHMDPSIAV 297
I MDPS V
Sbjct: 397 ISKMDPSCCV 406
>gi|194213171|ref|XP_001491090.2| PREDICTED: cysteine protease ATG4D [Equus caballus]
Length = 424
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 104/327 (31%), Positives = 158/327 (48%), Gaps = 72/327 (22%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL +L R
Sbjct: 54 SEGDIQRFQRDFASRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHYLPR 113
Query: 68 DWQWNVN------------------------------------SKEEAYLKILKMFEDRR 91
DW W S+E + +I+ F D
Sbjct: 114 DWTWAEGAGLGPPEPVGLSSPNRYRGPARWMAPTLGPGAPPSWSRERRHRQIVSWFADHP 173
Query: 92 TAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQ 150
AP+ +HQ+ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+
Sbjct: 174 RAPFGLHQLVELGQSSGKKAGDWYGPSLVAHILRKAVESCAEVTRLVVYVSQDCTVYKAD 233
Query: 151 VKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILS 210
V +L R +W+ +V+++P+RLG + +NPVY+ +K ++L
Sbjct: 234 VARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLR 276
Query: 211 STYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDS 270
S LG++GGKP H+LYFIGY + +++LDPH Q V +
Sbjct: 277 SEL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSRADFPL 323
Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
E ++HC ++ MDPS V
Sbjct: 324 E-----SFHCTSPRKMAFTKMDPSCTV 345
>gi|350426238|ref|XP_003494376.1| PREDICTED: cysteine protease ATG4D-like [Bombus impatiens]
Length = 486
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 163/310 (52%), Gaps = 54/310 (17%)
Query: 5 NKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
+ +S +D +E+ ++D TSRLW TYR+ F + S TTD GWGCMLR GQM++AQAL+
Sbjct: 134 DAISFEDGIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCH 193
Query: 64 HLGRDWQWNVNS---------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
LGR+W+W V+ E + I+K F D T+P+SIH + GA GK G
Sbjct: 194 FLGREWRWQVDQPLKTEQQKLDEHNHRLIIKSFGDLPDSTSPFSIHTLVSLGALWGKRAG 253
Query: 113 EWFGPNTVAQVLRKLA-----KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
+W+GP++VA +L + ++ +S++ +VA D + + V+ +C + +W
Sbjct: 254 DWYGPSSVAHLLSQAVEQAAERHPVFSNLAVYVAQDCAVYLQDVENVCQM-----PDGKW 308
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+ L+L +PLRLG +NPVY + + + T
Sbjct: 309 KSLILFVPLRLGADKLNPVYASCLT---------------------------HLLTLNTC 341
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
+GVIGG+P H+LYFIG+ + +I LDPH Q D +D+ +++HC ++
Sbjct: 342 IGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE---TVDVLKDNFPL--TSFHCTSPRKML 396
Query: 288 ILHMDPSIAV 297
I MDPS V
Sbjct: 397 ISKMDPSCCV 406
>gi|417401539|gb|JAA47652.1| Putative cysteine protease required for autophagy [Desmodus
rotundus]
Length = 473
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 105/326 (32%), Positives = 159/326 (48%), Gaps = 71/326 (21%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S D+++ +RD SRLW TYR+ F P LT+D GWGCMLR GQM++AQ LL L R
Sbjct: 104 SEGDIQRFQRDFVSRLWLTYRRDFPPFAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 163
Query: 68 DWQWNVNS-----------------------------------KEEAYLKILKMFEDRRT 92
DW W + +E + +I+ F D
Sbjct: 164 DWTWARGASLSPPEPSGLASSNRYRGPAHCMTPCWAQRAPELEQERRHRQIVSWFADHPQ 223
Query: 93 APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQV 151
AP+ +HQ+ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V
Sbjct: 224 APFGLHQLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADV 283
Query: 152 KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
+L R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 284 ARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRS 326
Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
LG++GGKP H+LYFIGY + +++LDPH Q D Q ++
Sbjct: 327 EL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---AVDVSQ-AD 369
Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
L+S +HC ++ MDPS V
Sbjct: 370 FPLES-FHCTSPRKMAFAKMDPSCTV 394
>gi|410950450|ref|XP_003981918.1| PREDICTED: cysteine protease ATG4D, partial [Felis catus]
Length = 423
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 158/326 (48%), Gaps = 71/326 (21%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L R
Sbjct: 54 SEGDIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 113
Query: 68 DWQWNVNS-----------------------------------KEEAYLKILKMFEDRRT 92
DW W+ S +E + +I+ F D
Sbjct: 114 DWTWSEASGLGPSEPSGLASPNRYRGPARWMPPRWAQGTPELEQERRHRQIVSWFADHPQ 173
Query: 93 APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQV 151
AP+ +H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V
Sbjct: 174 APFGLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADV 233
Query: 152 KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
+L R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 234 ARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRS 276
Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 277 EL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE 323
Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 324 -----SFHCTSPRKMAFAKMDPSCTV 344
>gi|395850895|ref|XP_003798008.1| PREDICTED: cysteine protease ATG4D [Otolemur garnettii]
Length = 471
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 160/321 (49%), Gaps = 69/321 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLWFTYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 107 DIQRFQRDFVSRLWFTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 166
Query: 71 WNVN---------------------------------SKEEAYLKILKMFEDRRTAPYSI 97
W +E + +I+ F D AP+S+
Sbjct: 167 WAEGRGLGPPELLASPSQYRVPARWMPPRWAQGTPELEQEHQHRQIVSWFADHPQAPFSL 226
Query: 98 HQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 227 HRLVELGQSLGKKAGDWYGPSVVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARLVA 286
Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 287 ---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL--- 326
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
LG++GGKP H+LYFIGY + +++LDPH Q D Q ++ L+S
Sbjct: 327 ----------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDISQ-ADFPLES 372
Query: 277 TYHCPQASRLHILHMDPSIAV 297
+HC ++ MDPS V
Sbjct: 373 -FHCTAPRKMAFTKMDPSCTV 392
>gi|178057055|ref|NP_001116551.1| cysteine protease ATG4D [Sus scrofa]
gi|61211337|sp|Q684M2.1|ATG4D_PIG RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related protein 4 homolog D
gi|51870495|emb|CAG15153.1| AUT-like 4, cysteine endopeptidase [Sus scrofa]
Length = 469
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 156/319 (48%), Gaps = 67/319 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 166
Query: 71 WNVN-------------------------------SKEEAYLKILKMFEDRRTAPYSIHQ 99
W+ +E + +I+ F D AP+ +H+
Sbjct: 167 WSQGVGLGPPESSPNRYRGPAHWMPPHWVQAAPELEQERRHRQIVSWFADHPRAPFGLHR 226
Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN 158
+ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 227 LVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVA-- 284
Query: 159 KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 285 -RPDPTAEWKAVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL----- 324
Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTY 278
LG++GGKP H+LYFIGY + +++LDPH Q V + E ++
Sbjct: 325 --------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE-----SF 371
Query: 279 HCPQASRLHILHMDPSIAV 297
HC ++ MDPS V
Sbjct: 372 HCTSPRKMAFTKMDPSCTV 390
>gi|344282757|ref|XP_003413139.1| PREDICTED: cysteine protease ATG4D-like [Loxodonta africana]
Length = 473
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 157/326 (48%), Gaps = 71/326 (21%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L R
Sbjct: 104 SEGDIQRFQRDFMSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMLLAQGLLLHFLPR 163
Query: 68 DWQWNVNS-----------------------------------KEEAYLKILKMFEDRRT 92
DW W S +E + +I+ F D
Sbjct: 164 DWTWAEGSGLGPPELSGSASPSRYRGPARRVPPHWAQCTPELEQEHWHRQIVSWFADHPQ 223
Query: 93 APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQV 151
AP+ +H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V
Sbjct: 224 APFGLHRLVALGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADV 283
Query: 152 KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
+L R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 284 ARLVA---RPDPKAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRS 326
Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 327 EL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPSVDVSQADFSLE 373
Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 374 -----SFHCTSPRKMAFTKMDPSCTV 394
>gi|291414155|ref|XP_002723329.1| PREDICTED: APG4 autophagy 4 homolog D [Oryctolagus cuniculus]
Length = 408
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 160/321 (49%), Gaps = 69/321 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ+LL L RDW
Sbjct: 44 DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQSLLLHFLPRDWT 103
Query: 71 W--NVNSKEEA-------------------------------YLKILKMFEDRRTAPYSI 97
W + S E A + +I+ F D AP+ +
Sbjct: 104 WAEGLGSAEPAGSASPSRYRGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPGAPFGL 163
Query: 98 HQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 164 HRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVA 223
Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
R +W+ +V+++P+RLG + +NPVY+ +K+ L +
Sbjct: 224 ---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELLRLEL----------------- 263
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
LG++GGKP H+LYFIGY + +++LDPH Q D Q ++ L+S
Sbjct: 264 ----------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQ-TDFPLES 309
Query: 277 TYHCPQASRLHILHMDPSIAV 297
+HC ++ MDPS V
Sbjct: 310 -FHCTSPRKMAFAKMDPSCTV 329
>gi|194378178|dbj|BAG57839.1| unnamed protein product [Homo sapiens]
Length = 411
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 154/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 45 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 104
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 105 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 164
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + D + +V +V+ D T+ V +L
Sbjct: 165 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARL 224
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 225 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 262
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 263 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 311
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC R+ MDPS V
Sbjct: 312 --SFHCTSPRRMAFAKMDPSCTV 332
>gi|395512609|ref|XP_003760528.1| PREDICTED: cysteine protease ATG4D [Sarcophilus harrisii]
Length = 453
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 100/323 (30%), Positives = 159/323 (49%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL RDW
Sbjct: 87 DIQRFQRDFVSRLWLTYRRDFPPLEGGSLTSDCGWGCMLRSGQMLLAQGLLLHFFSRDWT 146
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W+ +EE + +I+ F D+ AP+
Sbjct: 147 WSEAVLHPGPREPELLRTMSPSRVGPPGPPAGALSPREFEQEEQHRRIVSWFADQPGAPF 206
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + + +V+ D T+ V +L
Sbjct: 207 GLHRLVELGRSSGKRAGDWYGPSVVAHILRKAVESSSEVAQLEVYVSQDCTVYKADVAQL 266
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
+ + +W+ +V+++P+RLG + +NPVY+ +K+ L +
Sbjct: 267 VA---QPDPSTEWKSIVILVPVRLGGETLNPVYVPCVKELLRLEL--------------- 308
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
+G+IGGKP H+LYFIGY + +++LDPH Q D Q+S L
Sbjct: 309 ------------CIGIIGGKPRHSLYFIGYQDDFLLYLDPHYCQPF---VDTSQES-FPL 352
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
+S +HC ++ MDPS +
Sbjct: 353 ES-FHCTSPRKMAFSRMDPSCTI 374
>gi|281337397|gb|EFB12981.1| hypothetical protein PANDA_010312 [Ailuropoda melanoleuca]
Length = 428
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 156/323 (48%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 62 DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 121
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W S +E + +I+ F D AP+
Sbjct: 122 WAEGSAPSPSEPSGLASPNRYRGPARWMPPRWAQGTPELEQERRHRQIVSWFADHPQAPF 181
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 182 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARL 241
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 242 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 283
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 284 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 328
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 329 --SFHCTSPRKMAFAKMDPSCTV 349
>gi|351695136|gb|EHA98054.1| Cysteine protease ATG4A [Heterocephalus glaber]
Length = 356
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 86/201 (42%), Positives = 120/201 (59%), Gaps = 55/201 (27%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGR
Sbjct: 94 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGR------ 147
Query: 74 NSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
A G EGK+VGEWFGPNTVAQVL+KLA +D+W
Sbjct: 148 ---------------------------AQMGVGEGKSVGEWFGPNTVAQVLKKLALFDEW 180
Query: 134 SSIVFHVALDNTLVVNQVKKLC-------------------TTNKRASSN---PQWQPLV 171
+S+ +V++DNT+V+ +KK+C T+N+ ++ P W+PL+
Sbjct: 181 NSLAVYVSMDNTVVIEDIKKMCRILPFSADTADESPPDSFITSNQSKGTSAFCPAWKPLL 240
Query: 172 LVIPLRLGIQDINPVYINGIK 192
L++PLRLGI INPVY++ K
Sbjct: 241 LIVPLRLGINQINPVYVDAFK 261
>gi|195113543|ref|XP_002001327.1| GI10728 [Drosophila mojavensis]
gi|193917921|gb|EDW16788.1| GI10728 [Drosophila mojavensis]
Length = 682
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 104/319 (32%), Positives = 165/319 (51%), Gaps = 63/319 (19%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM++AQ L+ LGR W
Sbjct: 274 EGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 333
Query: 70 QWNVNS------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ S ++ + KI+K F D + +P+SIH + G GK G+W+GP +V+
Sbjct: 334 RYDAESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGKKPGDWYGPASVS 393
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCT-------------TNKRASS 163
+L+ ++ D+ +I +VA D T+ + +++LC+ KR++S
Sbjct: 394 YLLKHALEHAARENADFDNISVYVAKDCTIYIQDIEELCSIPEPAPKPHVPWQQAKRSTS 453
Query: 164 NP-----QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
+ W+ L+++IPLRLG +NPVY + +K +LS+ Y
Sbjct: 454 DAPKPDQHWKSLIVLIPLRLGSDKLNPVYAHCLK--------------LLLSTEY----- 494
Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTY 278
LG+IGGKP H+LYF+G+ + +I LDPH Q + V + E ++
Sbjct: 495 --------CLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDV-----NQETFPMHSF 541
Query: 279 HCPQASRLHILHMDPSIAV 297
HC +L MDPS +
Sbjct: 542 HCKSPRKLKSSKMDPSCCI 560
>gi|301772016|ref|XP_002921445.1| PREDICTED: cysteine protease ATG4D-like [Ailuropoda melanoleuca]
Length = 445
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 156/323 (48%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 79 DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 138
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W S +E + +I+ F D AP+
Sbjct: 139 WAEGSAPSPSEPSGLASPNRYRGPARWMPPRWAQGTPELEQERRHRQIVSWFADHPQAPF 198
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 199 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARL 258
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 259 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 300
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 301 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 345
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 346 --SFHCTSPRKMAFAKMDPSCTV 366
>gi|432099562|gb|ELK28703.1| Cysteine protease ATG4D, partial [Myotis davidii]
Length = 392
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 156/326 (47%), Gaps = 71/326 (21%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L R
Sbjct: 26 SEGDIQRFQRDFASRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPR 85
Query: 68 DWQWNVNS-----------------------------------KEEAYLKILKMFEDRRT 92
DW W + +E + +I+ F D
Sbjct: 86 DWTWAEGAGLSPPEPSGLASPNRHHGLAHWKPPRWAQGAPELEQEHWHRQIVSWFADHPQ 145
Query: 93 APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQV 151
AP+ +HQ+ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V
Sbjct: 146 APFGLHQLVELGQSWGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADV 205
Query: 152 KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
+L R +W+ +V+++P+RLG + +NPVY+ +K+ +L S
Sbjct: 206 ARLVA---RPDCTAEWKSVVILVPVRLGGETLNPVYVPCVKE--------------LLRS 248
Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
LG++GGKP H+LYFIGY + +++LDPH Q V E
Sbjct: 249 EL-------------CLGIMGGKPRHSLYFIGYQDDSLLYLDPHYCQPTVDVSQAGFPLE 295
Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 296 -----SFHCTSPRKMAFTKMDPSCTV 316
>gi|383860522|ref|XP_003705738.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like
[Megachile rotundata]
Length = 518
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 103/310 (33%), Positives = 162/310 (52%), Gaps = 54/310 (17%)
Query: 5 NKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
+ +S +D +E+ ++D TSRLW TYR+ F + S TTD GWGCMLR GQM++AQAL+
Sbjct: 167 DAISFEDSIEEFKKDFTSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCH 226
Query: 64 HLGRDWQWNVNS---------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
LGR+W+W + E + I++ F D R +P+SIH + GA GK G
Sbjct: 227 FLGREWRWQPDQPIKTEQQKLDESNHRFIIQSFGDLPERISPFSIHTLVSLGALWGKRAG 286
Query: 113 EWFGPNTVAQVLRKLAKYDD-----WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
+W+GP++VA +L + ++ +S++ +VA D + + V+ +C + +W
Sbjct: 287 DWYGPSSVAHLLSQAVEHAAEHLPIFSNLAVYVAQDCAVYLQDVESVCQM-----PDGKW 341
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+ L+L +PLRLG +NPVY + + + T
Sbjct: 342 KSLILFVPLRLGTDKLNPVYTSCLT---------------------------HLLTLDTC 374
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
+GVIGG+P H+LYFIG+ + +I LDPH Q D +D+ +++HC ++
Sbjct: 375 IGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE---TVDVLKDNFPL--TSFHCTSPRKML 429
Query: 288 ILHMDPSIAV 297
I MDPS V
Sbjct: 430 ISKMDPSCCV 439
>gi|149642765|ref|NP_001092616.1| cysteine protease ATG4D [Bos taurus]
gi|148744285|gb|AAI42400.1| ATG4D protein [Bos taurus]
Length = 472
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 101/322 (31%), Positives = 155/322 (48%), Gaps = 70/322 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 166
Query: 71 WNVNS----------------------------------KEEAYLKILKMFEDRRTAPYS 96
W + +E + +I+ F D AP+
Sbjct: 167 WCQGAGLGPSEPPGLGSPSRRRGPARWLPPRWAQAPELEQERRHRQIVSWFADHPRAPFG 226
Query: 97 IHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
+H++ G GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 227 LHRLVELGQGSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLV 286
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 287 A---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL-- 327
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 328 -----------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE---- 372
Query: 276 STYHCPQASRLHILHMDPSIAV 297
++HC R+ MDPS V
Sbjct: 373 -SFHCTSPRRMAFAKMDPSCTV 393
>gi|355669960|gb|AER94694.1| ATG4 autophagy related 4-like protein D [Mustela putorius furo]
Length = 388
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 156/323 (48%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 74 DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 133
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W S +E + +I+ F D AP+
Sbjct: 134 WAEGSGLGPSEPSGLASPNRYRGPARWVPPRWAHGTPELEQERRHRQIVSWFADHPRAPF 193
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 194 GLHRLGGLGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 253
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 254 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 295
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 296 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVTQADFPLE--- 340
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 341 --SFHCTSPRKMAFAKMDPSCTV 361
>gi|27903825|ref|NP_116274.3| cysteine protease ATG4D [Homo sapiens]
gi|61211809|sp|Q86TL0.1|ATG4D_HUMAN RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
cysteine endopeptidase; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related cysteine endopeptidase
4; AltName: Full=Autophagy-related protein 4 homolog D
gi|27763975|emb|CAC85951.1| APG4-D protein [Homo sapiens]
gi|46362497|gb|AAH68992.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [Homo sapiens]
gi|119604524|gb|EAW84118.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_c
[Homo sapiens]
gi|312151144|gb|ADQ32084.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
construct]
Length = 474
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 154/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 168 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + D + +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 288 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395
>gi|57101974|ref|XP_542069.1| PREDICTED: cysteine protease ATG4D isoform 1 [Canis lupus
familiaris]
Length = 473
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 166
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 167 WAEGPGLGPSEPAGLASPNRYRGPARWMPPRWAQGTPELEQERRHRQIVSWFADHPQAPF 226
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 227 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEITRLVVYVSQDCTVYKADVARL 286
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 287 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 328
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 329 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 373
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 374 --SFHCTSPRKMAFAKMDPSCTV 394
>gi|62898327|dbj|BAD97103.1| APG4 autophagy 4 homolog D variant [Homo sapiens]
Length = 474
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 153/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 168 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + D + +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W +V+++P+RLG + +NPVY+ +K+
Sbjct: 288 VA---RPDPTAEWMSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395
>gi|348550913|ref|XP_003461275.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D-like [Cavia
porcellus]
Length = 474
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 103/325 (31%), Positives = 156/325 (48%), Gaps = 75/325 (23%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWM 166
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 167 WAEGPGLGSPELPGTASPSPGRSPARWVPPRWPRGAPELEQELRHRQIVSWFADHPRAPF 226
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + + +V+ D T+ V L
Sbjct: 227 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESSSEVTRLAVYVSQDCTVYKADVAHL 286
Query: 155 CTTNKRASSNP--QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
AS +P +W+ +V+++P+RLG + +NPVY+ G+K+
Sbjct: 287 V-----ASRDPTAEWKSVVILVPVRLGGETLNPVYVPGVKELL----------------- 324
Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEK 272
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 325 ------RSELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE- 373
Query: 273 KLDSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 374 ----SFHCTSPRKMAFAKMDPSCTV 394
>gi|444730159|gb|ELW70550.1| Cysteine protease ATG4A [Tupaia chinensis]
Length = 364
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 104/306 (33%), Positives = 154/306 (50%), Gaps = 74/306 (24%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
D +W ++ + G +G ++D GWGCMLRCGQM++AQAL+ HLGR
Sbjct: 29 DTDELVWILGKQHLLKTGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGR---------- 78
Query: 78 EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
A G EGK++GEWFGPNTVAQVL+KLA +D+W+S+
Sbjct: 79 -----------------------AQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLA 115
Query: 138 FHVALDNTLVVNQVKKLCT-----------------TNKRASSNPQ-----WQPLVLVIP 175
+V++DNT+V+ +KK+C T S P W+PL+L++P
Sbjct: 116 VYVSMDNTVVIEDIKKMCCVLPLSADTDTESPPDSPTASNQSKGPSACGSAWKPLLLIVP 175
Query: 176 LRLGIQDINPVYINGIK---KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
LRLGI INPVY++ K C+ P+ + K + P+ S G
Sbjct: 176 LRLGINQINPVYVDAFKLQASCH-----PILIVTKEGVRRTRILPPK------DSSGARA 224
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHM 291
+ + G+++IFLDPHT Q D E++ D T+HC Q+ R++IL++
Sbjct: 225 SESLKVKHVSFKTGDELIFLDPHTTQTF---VDTEENGMVD-DQTFHCLQSPQRMNILNL 280
Query: 292 DPSIAV 297
DPS+A+
Sbjct: 281 DPSVAL 286
>gi|402904206|ref|XP_003914938.1| PREDICTED: cysteine protease ATG4D isoform 1 [Papio anubis]
Length = 474
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + +E + +I+ F D AP+
Sbjct: 168 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 288 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395
>gi|402904208|ref|XP_003914939.1| PREDICTED: cysteine protease ATG4D isoform 2 [Papio anubis]
Length = 411
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 45 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 104
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + +E + +I+ F D AP+
Sbjct: 105 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 164
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 165 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 224
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 225 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 262
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 263 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 311
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 312 --SFHCTSPRKMAFAKMDPSCTV 332
>gi|22658287|gb|AAH30861.1| Autophagy-related 4D (yeast) [Mus musculus]
gi|74152222|dbj|BAE32395.1| unnamed protein product [Mus musculus]
Length = 474
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 160/323 (49%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D++Q +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW+
Sbjct: 108 DIQQFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 167
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + ++ + +I+ F D AP+
Sbjct: 168 WVEGTGLASSEMPGPASPSRCRGPGRRGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + S +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGRSSGKKAGDWYGPSVVAHILRKAVESCSEVSRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
+ + +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 288 LSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 329
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG++GGKP H+LYFIGY + +++LDPH Q D Q S L
Sbjct: 330 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQPS-FPL 373
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
+S +HC ++ MDPS V
Sbjct: 374 ES-FHCTSPRKMAFAKMDPSCTV 395
>gi|340722130|ref|XP_003399462.1| PREDICTED: cysteine protease ATG4D-like [Bombus terrestris]
Length = 485
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 103/310 (33%), Positives = 162/310 (52%), Gaps = 54/310 (17%)
Query: 5 NKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
+ +S +D +E+ ++D TSRLW TYR+ F + S T+D GWGCMLR GQM++AQAL+
Sbjct: 133 DAISFEDSIEEFKKDFTSRLWLTYRREFPILNGSTFTSDCGWGCMLRSGQMMLAQALVCH 192
Query: 64 HLGRDWQWNVNS---------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
LGR+W+W V+ E + I+K F D T+P+SIH + GA GK G
Sbjct: 193 FLGREWRWQVDQPLKTEQQKLDEYNHRLIIKSFGDLPDSTSPFSIHTLVSLGALSGKRAG 252
Query: 113 EWFGPNTVAQVLRKLA-----KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
+W+GP++VA +L + ++ +S++ +VA D + + V+ +C + +W
Sbjct: 253 DWYGPSSVAHLLSQAVEQAAERHPVFSNLAVYVAQDCAVYLQDVENVCQM-----PDGKW 307
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+ L+L +PLRLG +N VY + + + T
Sbjct: 308 KSLILFVPLRLGADKLNLVYASCLT---------------------------HLLTLNTC 340
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
+GVIGG+P H+LYFIG+ + +I LDPH Q D +D+ +++HC ++
Sbjct: 341 IGVIGGRPRHSLYFIGFQEDKLINLDPHYCQE---TVDVLKDNFPL--TSFHCTSPRKML 395
Query: 288 ILHMDPSIAV 297
I MDPS V
Sbjct: 396 ISKMDPSCCV 405
>gi|431918972|gb|ELK17839.1| Cysteine protease ATG4D [Pteropus alecto]
Length = 442
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 154/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 76 DIQRFQRDFVSRLWLTYRRDFPPLAGGYLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWM 135
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 136 WVKGVGLDPPEPSRLASPYWHHGPACWIPPHWTQGSPELEQERRHRQIVSWFADHPKAPF 195
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+HQ+ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 196 GLHQLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTHLVVYVSQDCTVYKADVARL 255
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 256 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 297
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG++GGKP H+LYFIGY + +++LDPH Q V E
Sbjct: 298 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE--- 342
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 343 --SFHCTSPRKMAFTKMDPSCTV 363
>gi|296485832|tpg|DAA27947.1| TPA: APG4 autophagy 4 homolog D [Bos taurus]
Length = 472
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 100/322 (31%), Positives = 155/322 (48%), Gaps = 70/322 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 166
Query: 71 WNVNS----------------------------------KEEAYLKILKMFEDRRTAPYS 96
W + +E + +I+ F D AP+
Sbjct: 167 WCQGAGLGPSEPPGLGSPSRRRGPARWLPPRWAQAPELEQERRHRQIVSWFADHPRAPFG 226
Query: 97 IHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
+H++ G GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 227 LHRLVELGQGSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLV 286
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 287 A---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL-- 327
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 328 -----------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE---- 372
Query: 276 STYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 373 -SFHCTSPRKMAFAKMDPSCTV 393
>gi|109123366|ref|XP_001101860.1| PREDICTED: cysteine protease ATG4D-like isoform 1 [Macaca mulatta]
Length = 474
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + +E + +I+ F D AP+
Sbjct: 168 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 288 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395
>gi|297276108|ref|XP_002801111.1| PREDICTED: cysteine protease ATG4D-like isoform 2 [Macaca mulatta]
Length = 497
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 131 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 190
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + +E + +I+ F D AP+
Sbjct: 191 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 250
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 251 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 310
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 311 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 348
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 349 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 397
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 398 --SFHCTSPRKMAFAKMDPSCTV 418
>gi|410226434|gb|JAA10436.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410263516|gb|JAA19724.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410294648|gb|JAA25924.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
gi|410328737|gb|JAA33315.1| ATG4 autophagy related 4 homolog D [Pan troglodytes]
Length = 474
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 101/323 (31%), Positives = 154/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 168 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 288 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395
>gi|114675367|ref|XP_512373.2| PREDICTED: cysteine protease ATG4D [Pan troglodytes]
Length = 411
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 101/323 (31%), Positives = 154/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 45 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 104
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 105 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 164
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 165 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 224
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 225 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 262
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 263 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 311
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 312 --SFHCTSPRKMAFAKMDPSCTV 332
>gi|347971093|ref|XP_554420.4| AGAP004023-PA [Anopheles gambiae str. PEST]
gi|333469628|gb|EAL39379.4| AGAP004023-PA [Anopheles gambiae str. PEST]
Length = 606
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 103/339 (30%), Positives = 164/339 (48%), Gaps = 80/339 (23%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
+ + ++ RRD SR+W TYR+ F + DS T+D GWGCM+R GQM++AQ L+ LG
Sbjct: 191 VEEEGIDAFRRDFISRIWMTYRREFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLVAHFLG 250
Query: 67 RDWQWNVNS----KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
R W+W+V+ +E + K+++ F D +T+P+SIH + G GK G+W+GP V
Sbjct: 251 RSWRWDVSMFTAYEESIHRKVIRWFGDTSSKTSPFSIHTLVALGKESGKKPGDWYGPGAV 310
Query: 121 AQVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCT------------------T 157
A +LR+ + D I +VA D + + + CT T
Sbjct: 311 AHLLRQAVRLAAQEITDLDGINVYVAQDCAVYIQDILDECTVPATPAGAPWQRKGAPGGT 370
Query: 158 NKRASS-------------------NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
N +S+ + W+ L+L++PLRLG +NP+Y +K
Sbjct: 371 NSSSSTAHTERSGATSCAEGDEDVQSAHWKSLILLVPLRLGTDKLNPIYNECLK------ 424
Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
+LS Y +G+IGG+P H+LYF+GY + +I LDPH Q
Sbjct: 425 --------AMLSLDY-------------CIGIIGGRPKHSLYFVGYQEDKLIHLDPHYCQ 463
Query: 259 NIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++ D QD+ +++HC ++ + MDPS +
Sbjct: 464 DM---VDVNQDNFPV--ASFHCKSPRKMKLSKMDPSCCI 497
>gi|380796527|gb|AFE70139.1| cysteine protease ATG4D, partial [Macaca mulatta]
Length = 439
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 73 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 132
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + +E + +I+ F D AP+
Sbjct: 133 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 192
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 193 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 252
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 253 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 290
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 291 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 339
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 340 --SFHCTSPRKMAFAKMDPSCTV 360
>gi|351710014|gb|EHB12933.1| Cysteine protease ATG4D [Heterocephalus glaber]
Length = 607
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 157/323 (48%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 240 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWM 299
Query: 71 W-------------NVNS----------------------KEEAYLKILKMFEDRRTAPY 95
W + +S +E + +I+ F D AP
Sbjct: 300 WIEGPGLAHPELPGSASSSQGRGPARWMPPSCPWGALEREQELRHRQIVSWFADHPRAPL 359
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + + +V+ D T+ V L
Sbjct: 360 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESSSELTHLAVYVSQDCTVYKADVAHL 419
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
+ A+ +W+ +V+++P+RLG + +NPVY+ G+K+
Sbjct: 420 VASPDPAA---EWKSVVILVPVRLGGETLNPVYVPGVKELL------------------- 457
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 458 ----RSELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFSLE--- 506
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 507 --SFHCTSPRKMAFAKMDPSCTV 527
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/59 (54%), Positives = 41/59 (69%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L R W
Sbjct: 154 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGW 212
>gi|397476492|ref|XP_003809633.1| PREDICTED: cysteine protease ATG4D isoform 2 [Pan paniscus]
Length = 411
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 100/323 (30%), Positives = 154/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 45 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 104
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 105 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 164
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 165 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 224
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +++++P+RLG + +NPVY+ +K+
Sbjct: 225 VA---RPDPTAEWKSVIILVPVRLGGETLNPVYVPCVKELL------------------- 262
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 263 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 311
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 312 --SFHCTSPRKMAFAKMDPSCTV 332
>gi|440798079|gb|ELR19150.1| cysteine protease, putative [Acanthamoeba castellanii str. Neff]
Length = 434
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 99/291 (34%), Positives = 146/291 (50%), Gaps = 45/291 (15%)
Query: 21 SRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAY 80
S +W TYR F +G T+D GWGCMLR GQMV+AQ L LG +W+ + Y
Sbjct: 116 SVVWCTYRAAFPRLGSDSYTSDMGWGCMLRTGQMVLAQTLTRHLLGTEWRRQSDRSSPLY 175
Query: 81 LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHV 140
K+++ F D P+S+H+IA G GK VGEWFGP+T+AQVL +L K S + +V
Sbjct: 176 AKMVQWFADDPKQPFSLHRIAHAGLKYGKNVGEWFGPSTMAQVLEELLKEFSPSGLRAYV 235
Query: 141 ALDNTLVVNQVKKLCTT-------NKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
D L ++Q+++ T + W P+++++PLRLG+ +N Y +K+
Sbjct: 236 CQDGCLYLDQLRRTATAAHWPLDEDDDEGQGKSWAPMLIMLPLRLGLDQLNEDYAPVLKE 295
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
F PQS+G+ GGKP +LYF+G + V +LD
Sbjct: 296 T---------------------------FRIPQSVGISGGKPRASLYFVGNQDDYVFYLD 328
Query: 254 PHTNQ------NIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
PHT Q +G V + + + T+HC RL I +DPS+ +
Sbjct: 329 PHTVQPAPRFPEVGDV-----PASEDVYDTFHCSAPLRLPIRDIDPSLCLA 374
>gi|397476490|ref|XP_003809632.1| PREDICTED: cysteine protease ATG4D isoform 1 [Pan paniscus]
Length = 474
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 100/323 (30%), Positives = 154/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 168 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +++++P+RLG + +NPVY+ +K+
Sbjct: 288 VA---RPDPTAEWKSVIILVPVRLGGETLNPVYVPCVKELL------------------- 325
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395
>gi|195350257|ref|XP_002041657.1| GM16788 [Drosophila sechellia]
gi|194123430|gb|EDW45473.1| GM16788 [Drosophila sechellia]
Length = 269
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 95/203 (46%), Positives = 125/203 (61%), Gaps = 33/203 (16%)
Query: 95 YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKL 154
YSIHQIA G S+ KAVGEW GPNTVAQ+L+KL ++DDWSS+ HVA+D+T+V++ V
Sbjct: 4 YSIHQIAQMGESQNKAVGEWLGPNTVAQILKKLVRFDDWSSLAIHVAMDSTVVLDDVYAS 63
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
C W+PL+L+IPLRLGI DINP+Y+ +K+C L
Sbjct: 64 CREGG------SWKPLLLIIPLRLGITDINPLYVPALKRCLEL----------------- 100
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
S G+IGG+PN ALYF+GYV ++V++LDPHT Q G V K +E+
Sbjct: 101 ----------DSSCGMIGGRPNQALYFLGYVDDEVLYLDPHTTQRTGAVAQKTAAAEQDY 150
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
D TYH A+RL+ MDPS+AV
Sbjct: 151 DETYHQKHAARLNFSAMDPSLAV 173
>gi|354475125|ref|XP_003499780.1| PREDICTED: cysteine protease ATG4D [Cricetulus griseus]
gi|344240088|gb|EGV96191.1| Cysteine protease ATG4D [Cricetulus griseus]
Length = 474
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 99/323 (30%), Positives = 157/323 (48%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 167
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + ++ + +I+ F D AP+
Sbjct: 168 WVEGTGLAPPEMPGPASPSRYRGPGRHVPPRWTQGTLEMEQDRWHRRIVSWFADHPQAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + K + +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSVVAHILRKAVEKCSEVPRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
+ + +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 288 VSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 326 ----RSELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS +
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTI 395
>gi|322785465|gb|EFZ12136.1| hypothetical protein SINV_15051 [Solenopsis invicta]
Length = 505
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 107/314 (34%), Positives = 168/314 (53%), Gaps = 60/314 (19%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR-DWQ 70
+E+ +RD SRLW TYR+ F + S TTD GWGCMLR GQM++AQAL+ LGR W+
Sbjct: 129 IEEFKRDFMSRLWLTYRREFPILNGSTFTTDCGWGCMLRSGQMMLAQALVCHFLGRGQWR 188
Query: 71 WNVN--SKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
W + E ++ I+K F D+ T +P+SIH++ + GAS GK G+W+GP++VA +L +
Sbjct: 189 WRPEQLTDESSHRMIIKWFGDQLTPESPFSIHKLVVLGASTGKRAGDWYGPSSVAHLLCQ 248
Query: 127 LAKY--DDWSS----IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI 180
+ +D +S + +VA D + + V+ +C T + + + L+L++PLRLG
Sbjct: 249 AMERASEDPNSKLNQLAVYVAQDCAVYMQDVENVCCT-----PDGRRKALILLVPLRLGA 303
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
+NPVY L++ + T +GVIGG+P H+LY
Sbjct: 304 DKLNPVY------------------APCLTALLTLDT---------CIGVIGGRPRHSLY 336
Query: 241 FIGYVGNDVIFLDPHTNQN-------------IGCVYDKE----QDSEKKLDSTYHCPQA 283
FIGY + +I LDPH QN + ++ +E + +EK +++HC
Sbjct: 337 FIGYQDDKLIHLDPHYCQNEFYFRILLSITDSLPYLFIQETVDVEGNEKFPLTSFHCTSP 396
Query: 284 SRLHILHMDPSIAV 297
++ + MDPS V
Sbjct: 397 RKMLLSKMDPSCCV 410
>gi|26349259|dbj|BAC38269.1| unnamed protein product [Mus musculus]
Length = 474
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 160/323 (49%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 167
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + ++ + +I+ F D AP+
Sbjct: 168 WVEGTGLASSEMPGPASPSRCRGPGRRGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + S +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGRSSGKKAGDWYGPSVVAHILRKAVESCSEVSRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
+ + +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 288 LSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 329
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG++GGKP H+LYFIGY + +++LDPH Q D Q S L
Sbjct: 330 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQPS-FPL 373
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
+S +HC ++ MDPS V
Sbjct: 374 ES-FHCTSPRKMAFAKMDPSCTV 395
>gi|29135261|ref|NP_705811.8| cysteine protease ATG4D [Mus musculus]
gi|61211815|sp|Q8BGV9.1|ATG4D_MOUSE RecName: Full=Cysteine protease ATG4D; AltName: Full=AUT-like 4
cysteine endopeptidase; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related cysteine endopeptidase
4; AltName: Full=Autophagy-related protein 4 homolog D
gi|26331508|dbj|BAC29484.1| unnamed protein product [Mus musculus]
gi|26348941|dbj|BAC38110.1| unnamed protein product [Mus musculus]
gi|27763977|emb|CAC85952.1| APG4-D protein [Mus musculus]
gi|47125055|gb|AAH69851.1| Autophagy-related 4D (yeast) [Mus musculus]
gi|148693226|gb|EDL25173.1| autophagy-related 4D (yeast), isoform CRA_b [Mus musculus]
Length = 474
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 160/323 (49%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 167
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + ++ + +I+ F D AP+
Sbjct: 168 WVEGTGLASSEMPGPASPSRCRGPGRRGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + S +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGRSSGKKAGDWYGPSVVAHILRKAVESCSEVSRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
+ + +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 288 LSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 329
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG++GGKP H+LYFIGY + +++LDPH Q D Q S L
Sbjct: 330 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQPS-FPL 373
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
+S +HC ++ MDPS V
Sbjct: 374 ES-FHCTSPRKMAFAKMDPSCTV 395
>gi|296232881|ref|XP_002761778.1| PREDICTED: cysteine protease ATG4D [Callithrix jacchus]
Length = 474
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 100/323 (30%), Positives = 154/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + +E + +I+ F D AP+
Sbjct: 168 WAEGTGLGPPELSGPASPSWYHGPARWMPPCWAQGAPELEQERRHRQIVSWFADHPQAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + ++ +V+ D T+ V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESSSEVTRLLVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R + +W +V+++P+RLG + +NPVY+ +K+
Sbjct: 288 VA---RPDPSAEWNSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V E
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE--- 374
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395
>gi|168010849|ref|XP_001758116.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690572|gb|EDQ76938.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 356
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 111/311 (35%), Positives = 157/311 (50%), Gaps = 52/311 (16%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
LS Q E+ D TSR+W TYRKGF +G S LT+D GWGCMLR GQM++AQAL+ +LG
Sbjct: 44 LSVQAFEEFISDFTSRIWITYRKGFECVGQSKLTSDVGWGCMLRSGQMLLAQALVCHYLG 103
Query: 67 RDWQWNVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
R W+ +AYL+IL+ F D + P+SIH + G G A G W GP + + L
Sbjct: 104 RSWRREPGQPCSQAYLQILQTFGDSESCPFSIHNLLEAGHPFGLAAGSWLGPYALCRTLE 163
Query: 126 KLAKYDDWSS-----------IVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQ 166
LA+ D S V+ V+ + L V V LC+ + + +
Sbjct: 164 ALARADREQSQKKGGKRALPFAVYVVSGEAEGERGGAPVLCVEDVATLCSKWREPTE--E 221
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W PL++++PL LG+ +NP Y+ + R FTFPQ
Sbjct: 222 WTPLLVLVPLVLGLDKVNPRYLPSL---------------------------RATFTFPQ 254
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
SLG+ GGKP + Y IG ++LDPH NQ + V + + + S+YHC RL
Sbjct: 255 SLGIAGGKPGASTYLIGVQDEQAMYLDPHENQQVVPVTPENLELDT---SSYHCSTVRRL 311
Query: 287 HILHMDPSIAV 297
+ +DPS+A+
Sbjct: 312 PLDTIDPSLAI 322
>gi|428170513|gb|EKX39437.1| hypothetical protein GUITHDRAFT_143439 [Guillardia theta CCMP2712]
Length = 332
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 103/266 (38%), Positives = 131/266 (49%), Gaps = 48/266 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
+ + D+ SRLWF+YR F PI + LTTD GWGCM+R GQM+I QAL+ HLGRDW+ +
Sbjct: 102 QNFKLDMWSRLWFSYRYNFHPISGTELTTDTGWGCMIRSGQMLIGQALVHHHLGRDWRLS 161
Query: 73 VNSK----EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
SK Y K+L+MF D AP SIH G GK G WFGPNTV KL
Sbjct: 162 HTSKYNELPSDYRKVLEMFLDHPCAPLSIHSFVRAGQQVGKKAGTWFGPNTVCSAFSKLH 221
Query: 129 KYDDWSSIVFHVAL-------DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQ 181
S L DNT+ ++ +L Q PL +++P RLG+
Sbjct: 222 AGGALGSDNNLQLLAYDGNDGDNTIYKSEALELL----------QAGPLFILLPTRLGVS 271
Query: 182 DINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYF 241
++P YI I F+FPQSLG IGGKP+ A YF
Sbjct: 272 SVDPSYIPKISHV---------------------------FSFPQSLGFIGGKPSSAHYF 304
Query: 242 IGYVGNDVIFLDPHTNQNIGCVYDKE 267
I G V +LDPHT Q + + +KE
Sbjct: 305 IASQGEAVYYLDPHTPQPLINISEKE 330
>gi|198438023|ref|XP_002129793.1| PREDICTED: similar to CG6194 CG6194-PA [Ciona intestinalis]
Length = 517
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 98/310 (31%), Positives = 158/310 (50%), Gaps = 50/310 (16%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN--- 74
D S+LWFTYRKGF + D+ LT+D GWGCMLR QM+IAQ+ + LGR+W+W +
Sbjct: 115 DFHSKLWFTYRKGFPTLNDTNLTSDTGWGCMLRTAQMMIAQSFIVHLLGRNWRWTPSRLS 174
Query: 75 -SKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---- 127
+ + + I+ F D + P+S+HQ+ G S G W+GPNT A +++
Sbjct: 175 MEQSDIHRNIITWFLDEQNIRCPFSLHQLTEIGLSYRCKPGNWYGPNTAAYIMQDALECA 234
Query: 128 -AKYDDWSSIVFHVALDNTLVVNQVKKLCT-------TNKRASSNPQWQPLVLVIPLRLG 179
K + ++I+ ++A D+T+ ++ V ++C + + S+ + ++++IP+RLG
Sbjct: 235 KGKTELLNNIMVYIAQDSTVYIDDVIEMCEWKNTASDADLKTSTTSSNRSVIVLIPVRLG 294
Query: 180 IQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHAL 239
+NP+YI I+ T QS+G++GGKP H+L
Sbjct: 295 EATLNPIYIPCIQSM---------------------------LTLDQSVGIMGGKPKHSL 327
Query: 240 YFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-V 298
YFIG+ + +LDPH Q D + L YHC + +I MDPS +
Sbjct: 328 YFIGFQDEYLFYLDPHYCQQA----DHPAAFKNDLLQNYHCNSPRKTNISKMDPSCCLGF 383
Query: 299 SQRSYSDYKN 308
R Y D+++
Sbjct: 384 YCRDYKDFQS 393
>gi|443730776|gb|ELU16134.1| hypothetical protein CAPTEDRAFT_228011 [Capitella teleta]
Length = 450
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 101/297 (34%), Positives = 158/297 (53%), Gaps = 51/297 (17%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW-- 69
++ +RD +S++WFTYRK F + S LT+D GWGCMLR QM+IAQAL+ +LGRDW
Sbjct: 114 FDRFKRDFSSKIWFTYRKDFPKLYGSPLTSDVGWGCMLRTAQMIIAQALVMHYLGRDWTI 173
Query: 70 -QWNVNSKEEA-YLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
N KE + +I+++F D +P+SI + G GK G+W+GP +VA V+R
Sbjct: 174 HHTQQNRKETMLHRQIIRLFGDFPGNDSPFSIQALVRIGVDHGKRPGDWYGPASVAYVVR 233
Query: 126 K-LAKYDDW----SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI 180
+ + D+ S + +VA D T+ + V LCT + W+ +V+++P+RLG
Sbjct: 234 DAINQVPDFHPLLSQVCVYVAPDCTVYIQDVIDLCTQH--------WKAVVILVPVRLGG 285
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
+ +NP+Y ++ A + LG+IGG+P H+LY
Sbjct: 286 EALNPIYSQCVQSLLAHELC---------------------------LGIIGGRPKHSLY 318
Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
F+G+ +++LDPH Q+ V + +D STYHC +L + MDPS +
Sbjct: 319 FVGWQEEKLLYLDPHFCQDT--VDTRFRDFPT---STYHCLSPRKLALQKMDPSCTL 370
>gi|432871194|ref|XP_004071879.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
Length = 452
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 99/322 (30%), Positives = 153/322 (47%), Gaps = 73/322 (22%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+ RR S LW TYR+GF + S LTTD GWGC+LR GQM++A+ LL + W W+
Sbjct: 91 ERFRRSFASLLWLTYRRGFPQLAGSSLTTDSGWGCVLRTGQMLLARGLLTHLMPPGWMWS 150
Query: 73 V------------------------------------NSKEEAYLKILKMFEDRRTAPYS 96
V E + K++ F D AP+
Sbjct: 151 VWYRAVKDDLDLPHHADCTDCKSNMRCRYQSLGSLYDRPLEAMHRKVVSWFADHPKAPFG 210
Query: 97 IHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
IH++ GAS GK G+W+GP+ VA +L+K +A D ++V +VA D T+ + V+ LC
Sbjct: 211 IHRLVELGASSGKKAGDWYGPSIVAHILQKAVAASVDLPNLVVYVAQDCTIYLQDVRGLC 270
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
S W+ +++++P+RLG QD+NP YI+ +KK L
Sbjct: 271 ERPPPHS----WKSVIILVPVRLGGQDLNPSYISCVKKLLELQC---------------- 310
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
+G+IGG+P H+L+F+G+ + +++LDPH Q V + E
Sbjct: 311 -----------CIGIIGGRPKHSLFFVGFQDDQLLYLDPHYCQLTVNVTKENFPLE---- 355
Query: 276 STYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS +
Sbjct: 356 -SFHCKYPRKMPFSRMDPSCTI 376
>gi|195054945|ref|XP_001994383.1| GH16873 [Drosophila grimshawi]
gi|193892146|gb|EDV91012.1| GH16873 [Drosophila grimshawi]
Length = 673
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 162/323 (50%), Gaps = 67/323 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM+IAQ L+ LGR W
Sbjct: 265 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLIAQGLICHFLGRSW 324
Query: 70 QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ S+ + + KI+K F D + +P+SIH + G GK G+W+GP +V+
Sbjct: 325 RYDPESQLHSTYEDNMHKKIIKWFGDSSSKNSPFSIHALVRLGEQLGKKPGDWYGPASVS 384
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCT--------------TNKRAS 162
+L+ ++ D+ +I +VA D T+ + V++ C+ K S
Sbjct: 385 YLLKHALEHAAQENADFDNISVYVAQDCTIYMQDVEQQCSIPEPAPKQHVPWQHAKKSTS 444
Query: 163 SNPQ--------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
P+ W+ L+++IPLRLG +NPVY + +K+L ST +
Sbjct: 445 DAPKLDQPPQQHWKSLIVLIPLRLGTDKLNPVYAH---------------CLKLLLSTEH 489
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG+IGGKP H+LYF+G+ + +I LDPH Q + V + E
Sbjct: 490 ------------CLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDV-----NQETFS 532
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS +
Sbjct: 533 MHSFHCKSPRKIKSSKMDPSCCI 555
>gi|321472016|gb|EFX82987.1| hypothetical protein DAPPUDRAFT_302128 [Daphnia pulex]
Length = 405
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 104/331 (31%), Positives = 173/331 (52%), Gaps = 59/331 (17%)
Query: 4 ANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
++ L + E ++ D SR+W TYRK F + S T+D GWGCMLR GQM++AQAL+
Sbjct: 39 SSSLPTNNFEALKSDFFSRIWLTYRKEFPVLNGSYYTSDCGWGCMLRSGQMLLAQALVCH 98
Query: 64 HLGRDWQWNVNSKEEA-------YLKILKMFEDRRT--APYSIHQIALTG-ASEGKAVGE 113
LGRDW+WN + +E + I++ F D+ + P SIHQ+ G S GK G+
Sbjct: 99 FLGRDWRWNESGAQEQQTLQESLHRMIVQWFGDKPSPACPLSIHQMVSQGHISAGKRPGD 158
Query: 114 WFGPNTVAQVLRKLAK-----YDDWSSIVFHVALDNTLVVNQVKKLCT--TNKRASS--- 163
W+GP++V+ +++++ + Y + ++ ++A D T+ ++ VK+ C+ N
Sbjct: 159 WYGPSSVSYIIKQILQRATDTYPELDTLRVYIAQDCTVYLDDVKQSCSKICNYECEETDY 218
Query: 164 ---NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
+ QW+ L+L+IPLRLG + +NP Y + +K +L
Sbjct: 219 ELIDDQWKSLILLIPLRLGGERMNPTYDSCLKGLLSLE---------------------- 256
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
Q +G+IGGKP H+ YFIG+ + +I LDPH Q + V + + ++HC
Sbjct: 257 -----QCIGIIGGKPKHSQYFIGWQDDYLIHLDPHNCQEMVDVLIPNFNLK-----SFHC 306
Query: 281 PQASRLHILHMDPSIAV----VSQRSYSDYK 307
+ + + +DPS V SQR + +++
Sbjct: 307 HELRKTALKQVDPSCCVGFYLRSQREFDEFR 337
>gi|334326299|ref|XP_001366933.2| PREDICTED: cysteine protease ATG4D [Monodelphis domestica]
Length = 482
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 96/338 (28%), Positives = 157/338 (46%), Gaps = 86/338 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ ++D SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL RDW
Sbjct: 101 DIQRFQKDFASRLWLTYRRDFPPLDGGSLTSDCGWGCMLRSGQMLLAQGLLLHFFSRDWT 160
Query: 71 WNVN--------------------------------------------------SKEEAY 80
W +EE +
Sbjct: 161 WAEAVLPPSPRESELFRSMSPSRSGASWQRGSSTASGLGRATWSTGGTLSPRQLEQEEQH 220
Query: 81 LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFH 139
+I+ F D+ AP+ +H++ G S GK G+W+GP+ VA +LRK + + + + +
Sbjct: 221 RRIVSWFADQPGAPFGLHRLVELGRSSGKRAGDWYGPSVVAHILRKAVESSSEVAQLEVY 280
Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
V+ D T+ V +L + + +W+ +++++P+RLG + +NPVY+ +K+ L +
Sbjct: 281 VSQDCTVYKADVAQLMA---QPDPSTEWKSVIILVPVRLGGETLNPVYVPCVKELLRLDL 337
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
+G+IGGKP H+LYFIGY + +++LDPH Q
Sbjct: 338 ---------------------------CIGIIGGKPRHSLYFIGYQDDFLLYLDPHYCQP 370
Query: 260 IGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
CV + E+ ++HC ++ MDPS +
Sbjct: 371 --CV---DTSQERFPLESFHCTSPRKMAFSRMDPSCTI 403
>gi|210032083|ref|NP_001094483.2| autophagy-related 4D [Rattus norvegicus]
gi|149020504|gb|EDL78309.1| rCG31864, isoform CRA_b [Rattus norvegicus]
Length = 473
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 161/323 (49%), Gaps = 72/323 (22%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ S LT+D GWGCMLR GQM++AQ LL L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGS-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 166
Query: 71 W----NVNSKEEA-------------------------------YLKILKMFEDRRTAPY 95
W + S E + +I+ F D AP+
Sbjct: 167 WVEGTGLASSEMPGPASPSRYRGPGRRGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPF 226
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 227 GLHRLVELGQSSGKKAGDWYGPSVVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 286
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
+ + +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 287 VSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 328
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG++GGKP H+LYFIGY + +++LDPH Q D Q + L
Sbjct: 329 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVNQ-ANFPL 372
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
+S +HC ++ MDPS V
Sbjct: 373 ES-FHCTSPRKMAFAKMDPSCTV 394
>gi|195539710|gb|AAI68141.1| Atg4d protein [Rattus norvegicus]
Length = 442
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 161/323 (49%), Gaps = 72/323 (22%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ S LT+D GWGCMLR GQM++AQ LL L RDW+
Sbjct: 77 DIQRFQRDFVSRLWLTYRRDFPPLAGS-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 135
Query: 71 W----NVNSKEEA-------------------------------YLKILKMFEDRRTAPY 95
W + S E + +I+ F D AP+
Sbjct: 136 WVEGTGLASSEMPGPASPSRYRGPGRRGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPF 195
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 196 GLHRLVELGQSSGKKAGDWYGPSVVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 255
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
+ + +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 256 VSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 297
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG++GGKP H+LYFIGY + +++LDPH Q D Q + L
Sbjct: 298 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVNQ-ANFPL 341
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
+S +HC ++ MDPS V
Sbjct: 342 ES-FHCTSPRKMAFAKMDPSCTV 363
>gi|213390042|gb|ACJ46060.1| autophagy related protein Atg4-like protein [Bombyx mori]
Length = 355
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/314 (31%), Positives = 159/314 (50%), Gaps = 61/314 (19%)
Query: 8 SHQDLEQI--------RRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQA 59
+HQ +EQI + D S++W TYR+ F + S TTD GWGCMLR GQM++AQA
Sbjct: 5 AHQPMEQIYGEGIEGFKSDFVSKIWMTYRREFPTMTGSTFTTDCGWGCMLRSGQMMLAQA 64
Query: 60 LLFLHLGRDWQW-------NVNSKEEAYL--KILKMFEDRRT--APYSIHQIALTGASEG 108
L+ LGR W+W N +E L KI+K F D+ + +P SIHQ+ G + G
Sbjct: 65 LVCHFLGRSWRWLPEKPIQNAREFQEDCLHRKIIKWFGDKSSVNSPLSIHQMVSLGEALG 124
Query: 109 KAVGEWFGPNTVAQVLRKLAKYD-----DWSSIVFHVALDNTLVVNQVKKLCTTNKRASS 163
K G+W+GP +VA L+ L ++ + +VA D+T+ + + +C A
Sbjct: 125 KKPGDWYGPASVAHCLKSLIASASKENYEFDHLEVYVAQDSTVYIQDIYSMCQLLHGA-- 182
Query: 164 NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
W+ L+L++P++LG + NP+Y P + +L+ +
Sbjct: 183 ---WKSLILLVPVKLGTEKFNPIY------------GPC--LTSLLTLDF---------- 215
Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA 283
+G+IGG+P H+LYF+GY + +I LDPH Q + V+ + ++HC
Sbjct: 216 ---CIGIIGGRPKHSLYFVGYQDDKLIHLDPHYCQEMVDVWQPNFSLQ-----SFHCRSP 267
Query: 284 SRLHILHMDPSIAV 297
++ + MDPS +
Sbjct: 268 RKMPLAKMDPSCCI 281
>gi|30689628|ref|NP_850412.1| cysteine protease ATG4a [Arabidopsis thaliana]
gi|75160546|sp|Q8S929.1|ATG4A_ARATH RecName: Full=Cysteine protease ATG4a; AltName:
Full=Autophagy-related protein 4 homolog a;
Short=AtAPG4a; Short=Protein autophagy 4a
gi|19912143|dbj|BAB88383.1| autophagy 4a [Arabidopsis thaliana]
gi|110742303|dbj|BAE99076.1| hypothetical protein [Arabidopsis thaliana]
gi|330255286|gb|AEC10380.1| cysteine protease ATG4a [Arabidopsis thaliana]
Length = 467
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 143/303 (47%), Gaps = 49/303 (16%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L ++ D +S++ TYRKGF P D+ T+D WGCM+R QM+ AQALLF LGR W
Sbjct: 135 LAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWTK 194
Query: 72 NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA--- 128
E+ YL+ L+ F D + +SIH + + GAS G A G W GP + + LA
Sbjct: 195 KSELPEQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLACKK 254
Query: 129 -KYDDWSSIVFHVAL-------------DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
K D + +A+ L + K C + S +W P++L++
Sbjct: 255 RKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQS--EWTPIILLV 312
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
PL LG+ +NP YI + FTFPQS+G++GGK
Sbjct: 313 PLVLGLDSVNPRYIPSLVA---------------------------TFTFPQSVGILGGK 345
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
P + Y +G + +LDPH Q + V + D + S+YHC + + +DPS
Sbjct: 346 PGASTYIVGVQEDKGFYLDPHEVQQVVTVNKETPDVDT---SSYHCNVLRYVPLESLDPS 402
Query: 295 IAV 297
+A+
Sbjct: 403 LAL 405
>gi|168036750|ref|XP_001770869.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677928|gb|EDQ64393.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 346
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 107/312 (34%), Positives = 158/312 (50%), Gaps = 54/312 (17%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
+S E+ D +SR+W TYRKGF +G+S LT+D GWGCMLR GQ+++AQAL+ +LG
Sbjct: 28 VSAHAFEEFLNDFSSRIWITYRKGFESLGESKLTSDVGWGCMLRSGQILLAQALVCHYLG 87
Query: 67 RDWQWNVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
R W+ N + + YL+IL+ F D + +SIH + G G A G W GP + + L
Sbjct: 88 RTWRRNACQECLQEYLQILQSFGDSESCSFSIHNLLEAGRPFGLAAGSWLGPYALCRTLE 147
Query: 126 KLAKYDDWSSI-----------VFHVALDN--------TLVVNQVKKLCTTNKRASSNPQ 166
LAK D+ + V+ V+ + V LC+ K + +
Sbjct: 148 ALAKADEDQNAKKGGKRALPFAVYVVSGETEGDRGGAPVRCVEDAAVLCS--KWGEATEE 205
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W PLV+++PL LG+ +NP Y+ + R FT PQ
Sbjct: 206 WSPLVVLVPLVLGLDKLNPRYLPSL---------------------------RATFTLPQ 238
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDST-YHCPQASR 285
SLGV GGKP + + IG G+ ++LDPH NQ + V + + LD++ YHC R
Sbjct: 239 SLGVAGGKPGASTHLIGVQGDQAMYLDPHENQQVFAVTPENLE----LDTSFYHCSVVRR 294
Query: 286 LHILHMDPSIAV 297
L + +DPS+A+
Sbjct: 295 LPLDSIDPSLAI 306
>gi|348511374|ref|XP_003443219.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
Length = 459
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 152/328 (46%), Gaps = 82/328 (25%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
+ RR S LWFTYR+GF P+ S LTTD GWGC+LR QM++AQ LL + W W+
Sbjct: 96 QHFRRCFASLLWFTYRRGFRPLPGSSLTTDSGWGCVLRSSQMLLAQGLLLHLMSPGWTWS 155
Query: 73 VNSK---------------------------------------EEAYLKILKMFEDRRTA 93
N + E +IL+ F D TA
Sbjct: 156 GNQRVVKDDMDLIHSVNDGFSSSERESKRSRHLSWGSILDRPTEGTPRRILRWFADNPTA 215
Query: 94 PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL--AKYDDWSSIVFHVALDNTLVVNQV 151
P+ IH++ G S GK G+W+GP+ A +LRK A D ++V +VA D T+ + V
Sbjct: 216 PFGIHRLVELGKSSGKKAGDWYGPSIAAHILRKAVEASVVDLPNLVAYVAQDCTIYLQDV 275
Query: 152 KKLCTTNKRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILS 210
+KLC PQ W+ +++++P+RLG QD+NP YI +KK L
Sbjct: 276 RKLC-----ERPLPQHWKSVLILVPVRLGGQDLNPSYITSVKKLLMLEC----------- 319
Query: 211 STYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDS 270
+G+IGGKP H+L+F+G+ + +++LDPH Q D
Sbjct: 320 ----------------CIGIIGGKPKHSLFFVGFQDDHLLYLDPHYCQPT-------VDV 356
Query: 271 EKKLD-STYHCPQASRLHILHMDPSIAV 297
K ++HC ++ MDPS +
Sbjct: 357 TKNFPLESFHCKNPRKMPFSRMDPSCTI 384
>gi|195394658|ref|XP_002055959.1| GJ10670 [Drosophila virilis]
gi|194142668|gb|EDW59071.1| GJ10670 [Drosophila virilis]
Length = 672
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 105/323 (32%), Positives = 165/323 (51%), Gaps = 67/323 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM++AQ L+ LGR W
Sbjct: 264 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 323
Query: 70 QWNVNS------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ S ++ + KI+K F D +++P+SIH + G GK G+W+GP +V+
Sbjct: 324 RYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGEQLGKKPGDWYGPASVS 383
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLC---------------TTNKRA 161
+L+ ++ D+ +I +VA D T+ + +++ C T+ K A
Sbjct: 384 YLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEEQCSIPEPAPKPHVPWQMTSKKPA 443
Query: 162 SSNPQ-------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
S P+ W+ L+++IPLRLG +NPVY + +K+L ST +
Sbjct: 444 SDAPKLDQPQQHWKSLIVLIPLRLGTDKLNPVYAH---------------CLKLLLSTEH 488
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG+IGGKP H+LYF+G+ + +I LDPH Q + V + E
Sbjct: 489 ------------CLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDV-----NQETFS 531
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC +L MDPS +
Sbjct: 532 MQSFHCKSPRKLKSSKMDPSCCI 554
>gi|42571227|ref|NP_973687.1| cysteine protease ATG4a [Arabidopsis thaliana]
gi|330255287|gb|AEC10381.1| cysteine protease ATG4a [Arabidopsis thaliana]
Length = 422
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 143/303 (47%), Gaps = 49/303 (16%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L ++ D +S++ TYRKGF P D+ T+D WGCM+R QM+ AQALLF LGR W
Sbjct: 90 LAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWTK 149
Query: 72 NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA--- 128
E+ YL+ L+ F D + +SIH + + GAS G A G W GP + + LA
Sbjct: 150 KSELPEQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLACKK 209
Query: 129 -KYDDWSSIVFHVAL-------------DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
K D + +A+ L + K C + S +W P++L++
Sbjct: 210 RKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQS--EWTPIILLV 267
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
PL LG+ +NP YI + FTFPQS+G++GGK
Sbjct: 268 PLVLGLDSVNPRYIPSLVA---------------------------TFTFPQSVGILGGK 300
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
P + Y +G + +LDPH Q + V + D + S+YHC + + +DPS
Sbjct: 301 PGASTYIVGVQEDKGFYLDPHEVQQVVTVNKETPDVDT---SSYHCNVLRYVPLESLDPS 357
Query: 295 IAV 297
+A+
Sbjct: 358 LAL 360
>gi|357612380|gb|EHJ67950.1| autophagy related protein Atg4-like protein [Danaus plexippus]
Length = 354
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 151/303 (49%), Gaps = 52/303 (17%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E + D S++W TYR+ F + S TTD GWGCMLR GQM++AQAL+ LGR W
Sbjct: 7 EGIEGFKSDFISKIWMTYRREFPTMSGSSFTTDCGWGCMLRSGQMMLAQALVCHFLGRSW 66
Query: 70 QW------NVNSKEEAYLK--ILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNT 119
+W N +E L I+K F D+ + +P SIHQ+ G + GK G+W+GP +
Sbjct: 67 RWSEKPIQNGREFQEDCLHRMIIKWFGDKSSVNSPLSIHQMVTLGEALGKKPGDWYGPAS 126
Query: 120 VAQVLRKL-----AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
VA L+ + + ++ + +VA D+T+ + V C N W+ L+L++
Sbjct: 127 VAHCLKSVMVEASKENYEFDKLEVYVAQDSTIYIQDVYTHCRL-----PNGCWKSLILLV 181
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
P++LG + +NP+Y + L +G+IGG+
Sbjct: 182 PVKLGTERLNPIYGPCLTSLLTLDFC---------------------------IGIIGGR 214
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
P H+LYF+GY + +I LDPH Q + V+ + T+HC ++ I MDPS
Sbjct: 215 PKHSLYFVGYQDDRLIHLDPHYCQEMVDVWQPNFSLQ-----TFHCRSPRKMPISKMDPS 269
Query: 295 IAV 297
+
Sbjct: 270 CCI 272
>gi|315047608|ref|XP_003173179.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
gi|311343565|gb|EFR02768.1| cysteine protease atg4 [Arthroderma gypseum CBS 118893]
Length = 471
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 154/317 (48%), Gaps = 68/317 (21%)
Query: 14 QIRRDITSRLWFTYRKGFVPI------GDS------------------GLTTDKGWGCML 49
Q D SRLW TYR F PI G S G T+D GWGCM+
Sbjct: 136 QFLDDFESRLWITYRSQFPPIPKMPKTGSSDSSMPLGVRLRSQLIDTQGFTSDTGWGCMI 195
Query: 50 RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-G 108
R GQ ++A LLFL LGRDW+ +EE+ +++ +F D AP+SIH+ GA+ G
Sbjct: 196 RSGQALLANTLLFLRLGRDWRRGSKIQEES--ELVSLFADHPRAPFSIHRFVQHGATACG 253
Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
K GEWFGP+ AQ ++ L K + + + +V D + + + + ++ S +
Sbjct: 254 KCPGEWFGPSAAAQCIQALVKSNPQAGLRVYVTNDGSDIYERQFREVACDESGS----IK 309
Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
P ++++ +RLGI + P+Y +D +K L +PQS+
Sbjct: 310 PTLILLGVRLGIDRVTPIY---------------WDSLKAL------------LHYPQSV 342
Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--------STYHC 280
G+ GG+P+ + YFI G+ +LDPH Q C+ + + +E + STYH
Sbjct: 343 GIAGGRPSSSHYFIATQGDSFFYLDPH--QTRPCLAPRSEPTEDEESHPYSPEELSTYHT 400
Query: 281 PQASRLHILHMDPSIAV 297
+ RLH+ MDPS+ +
Sbjct: 401 RRLRRLHVREMDPSMLI 417
>gi|195444549|ref|XP_002069918.1| GK11310 [Drosophila willistoni]
gi|194166003|gb|EDW80904.1| GK11310 [Drosophila willistoni]
Length = 676
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 103/322 (31%), Positives = 162/322 (50%), Gaps = 65/322 (20%)
Query: 9 HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
+ +E RRD SRLW TYR+ F + S T+D GWGCMLR GQM++AQ L+ LGR
Sbjct: 269 EEGIEGFRRDFYSRLWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLIVHFLGRS 328
Query: 69 WQWNVNS------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
W+++ S ++ + KI+K F D +++P+SIH + G + GK G+W+GP +V
Sbjct: 329 WRYDAESQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVSLGTALGKKPGDWYGPASV 388
Query: 121 AQVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTN--------------KRA 161
+ +L+ ++ D+ +I +VA D T+ + ++ C+ KR
Sbjct: 389 SYLLKHALEHATQENADFDNISVYVAKDCTIYIQDIEDQCSIPEPAPKQTHVPWQQMKRP 448
Query: 162 SSNP------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
S N W+ ++++IPLRLG +NP Y + +K+L ST N
Sbjct: 449 SLNEHQPDQQHWKSVIILIPLRLGTDKVNPAYAH---------------CLKLLLSTEN- 492
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
LG+IGGKP H+LYF+G+ + +I LDPH Q + V + E
Sbjct: 493 -----------CLGIIGGKPKHSLYFVGFQEDKLIHLDPHYCQEMVDV-----NQENFSM 536
Query: 276 STYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS +
Sbjct: 537 QSFHCKSPRKIKTSKMDPSCCI 558
>gi|90080692|dbj|BAE89827.1| unnamed protein product [Macaca fascicularis]
Length = 263
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 89/219 (40%), Positives = 121/219 (55%), Gaps = 56/219 (25%)
Query: 104 GASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN---KR 160
G EGK++G+W+GPNTVAQVL+KLA +D WSS+ H+A+DNT+V+ ++++LC T+
Sbjct: 2 GVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAG 61
Query: 161 ASSNPQ---------------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
A++ P W+PLVL+IPLRLG+ DIN Y+ +K C
Sbjct: 62 ATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHC----- 116
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
F PQSLGVIGGKPN A YF+GYVG ++I+LDPHT Q
Sbjct: 117 ----------------------FMMPQSLGVIGGKPNSAHYFVGYVGEELIYLDPHTTQP 154
Query: 260 IGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIAV 297
+ S D ++HC R+ I +DPSIAV
Sbjct: 155 AV----EPTGSCFIPDESFHCQHPPCRMSIAELDPSIAV 189
>gi|327306465|ref|XP_003237924.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
gi|326460922|gb|EGD86375.1| hypothetical protein TERG_02632 [Trichophyton rubrum CBS 118892]
Length = 454
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 152/317 (47%), Gaps = 68/317 (21%)
Query: 14 QIRRDITSRLWFTYRKGFVPI--------GDS----------------GLTTDKGWGCML 49
Q D S+LW TYR F PI GDS G T+D GWGCM+
Sbjct: 119 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSISLGVRLRSQLIDTQGFTSDTGWGCMI 178
Query: 50 RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-G 108
R GQ ++A LLF+ LGRDW+ +EE+ +++ +F D AP+SIH+ GA+ G
Sbjct: 179 RSGQALLANTLLFIRLGRDWRRGSKLQEES--ELVSLFADHPRAPFSIHRFVHHGATACG 236
Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
K GEWFGP+ +Q ++ L K + + ++ D + + + K ++ Q
Sbjct: 237 KCPGEWFGPSAASQCIQALVKSNPQVGLRVYITSDGSDIYEKQFKEVACDESGG----IQ 292
Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
P ++++ +RLGI + PVY +D +K L FPQS+
Sbjct: 293 PTLILLGVRLGIDRVTPVY---------------WDSLKAL------------LRFPQSV 325
Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--------STYHC 280
G+ GG+P+ + YFI G+ +LDPH Q C+ + + + + STYH
Sbjct: 326 GIAGGRPSSSHYFIATQGDSFFYLDPH--QTRPCLTPRAESTGDEESHPYSPEELSTYHT 383
Query: 281 PQASRLHILHMDPSIAV 297
+ RLHI MDPS+ +
Sbjct: 384 RRLRRLHIREMDPSMLI 400
>gi|195158262|ref|XP_002020011.1| GL13755 [Drosophila persimilis]
gi|194116780|gb|EDW38823.1| GL13755 [Drosophila persimilis]
Length = 678
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 167/323 (51%), Gaps = 67/323 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM++AQ L+ LGR W
Sbjct: 263 EGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 322
Query: 70 QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ +S+ + + KI+K F D +++P+SIH + G + GK G+W+GP +V+
Sbjct: 323 RYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGKKPGDWYGPASVS 382
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
+L+ ++ D+ +I +VA D T+ + ++ C+ + A + PQ
Sbjct: 383 YLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKPHVPWQQAKRPQA 442
Query: 167 ------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
W+ L+++IPLRLG +NPVY + +K+L ST +
Sbjct: 443 EAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEH 487
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG+IGGKP H+LYF+G+ + +I LDPH Q + + + E
Sbjct: 488 ------------CLGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDI-----NQEHFS 530
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC A +L + MDPS +
Sbjct: 531 LHSFHCKSARKLKVSKMDPSCCI 553
>gi|390177147|ref|XP_001357920.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
gi|388858923|gb|EAL27056.3| GA19429 [Drosophila pseudoobscura pseudoobscura]
Length = 676
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 167/323 (51%), Gaps = 67/323 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM++AQ L+ LGR W
Sbjct: 263 EGIEGFRRDFYSRVWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 322
Query: 70 QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ +S+ + + KI+K F D +++P+SIH + G + GK G+W+GP +V+
Sbjct: 323 RYDSDSQLHSTYEDNMHKKIIKWFGDSSSKSSPFSIHALVRLGETLGKKPGDWYGPASVS 382
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
+L+ ++ D+ +I +VA D T+ + ++ C+ + A + PQ
Sbjct: 383 YLLKHALEHAAQENADFDNISVYVAKDCTIYMQDIEDQCSIPEPAPKPHVPWQQAKRPQA 442
Query: 167 ------------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
W+ L+++IPLRLG +NPVY + +K+L ST +
Sbjct: 443 EAPPKQEPHQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEH 487
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG+IGGKP H+LYF+G+ + +I LDPH Q + + + E
Sbjct: 488 ------------CLGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDI-----NQEHFS 530
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC A +L + MDPS +
Sbjct: 531 LHSFHCKSARKLKVSKMDPSCCI 553
>gi|452977855|gb|EME77619.1| hypothetical protein MYCFIDRAFT_191078 [Pseudocercospora fijiensis
CIRAD86]
Length = 445
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 98/302 (32%), Positives = 144/302 (47%), Gaps = 61/302 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS----------------------GLTTDKGWGCMLRCGQMV 55
D SR+W TYR F PI S G T+D GWGCM+R GQ +
Sbjct: 114 DFESRVWITYRDAFPPIPKSSHPAAASKMSFTTKLRNFTNQAGFTSDTGWGCMIRSGQSL 173
Query: 56 IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEW 114
+A ++ LGRDW+ KE + IL +F D AP+SIH+ GA G GEW
Sbjct: 174 LANTIVVHRLGRDWR--KGQKEREHKDILSLFADTPDAPFSIHKFVEHGAQACGTYPGEW 231
Query: 115 FGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
FGPN A+ LR L KY V+ D+ + ++ L T + +N ++QP ++V
Sbjct: 232 FGPNATARCLRALTDKYHQAGLRVYARPNDSDVYID---ALTATATQKDANDEFQPTLIV 288
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ +RLGI+ + P Y +K L PQS+G+ GG
Sbjct: 289 LGIRLGIEKVTPAYHAALKAALEL---------------------------PQSMGIAGG 321
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YF+G+ G++ +LDPHT + + Q S + +D T H + RL + MDP
Sbjct: 322 RPSSSHYFVGHQGDNFFYLDPHTTRPMLS----PQPSAEDVD-TCHTRRVRRLSLAEMDP 376
Query: 294 SI 295
S+
Sbjct: 377 SM 378
>gi|405953478|gb|EKC21133.1| Leucine-rich repeat-containing protein 6 [Crassostrea gigas]
Length = 1114
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 103/317 (32%), Positives = 166/317 (52%), Gaps = 64/317 (20%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ ++D +S LWFTYR+ F I + LT+D GWGCMLR GQM++A+AL +LG +
Sbjct: 258 NIEKFKQDFSSLLWFTYRQDFPAIPGTKLTSDCGWGCMLRSGQMMLAKALTLHYLGP--E 315
Query: 71 WNVNS----KEEAYLK-ILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
WNV S ++E Y K I++ F D +P+S+H++ G + GK GEWFGP +VA +
Sbjct: 316 WNVFSDQTREQETYRKQIIRWFGDYLCDESPFSMHRLVEVGKNLGKQPGEWFGPASVAHI 375
Query: 124 LRKLAKYDD-----WSSIVFHVALDNTLVVNQVKKLCTTNKRA----------------- 161
L++ S + +V+ D T+ + +LC T RA
Sbjct: 376 LKETMVKGQKTQTVLSDLCVYVSQDCTVYKQDIYELCCTRPRADTKFTNSTESEHESSQD 435
Query: 162 SSNPQWQ-PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
+S+ W+ +V++IP+RLG + +NPVYI +K +LS
Sbjct: 436 ASSMDWKRAVVILIPVRLGGEQLNPVYIPCVKG--------------LLSQD-------- 473
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
+G+IGGKP H+LYF+G+ + +I+LDPH Q++ ++ + +YHC
Sbjct: 474 -----SCIGIIGGKPKHSLYFVGWQEDKLIYLDPHYCQDVVDTRERHFPIQ-----SYHC 523
Query: 281 PQASRLHILHMDPSIAV 297
++ I +DPS +
Sbjct: 524 MSPRKVSIDKIDPSCTI 540
>gi|297820846|ref|XP_002878306.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
gi|297324144|gb|EFH54565.1| autophagy 4b [Arabidopsis lyrata subsp. lyrata]
Length = 476
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 150/304 (49%), Gaps = 50/304 (16%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW-Q 70
L R+D +S + TYR+GF PIGD+ T+D WGCMLR GQM+ AQALLF LGR W +
Sbjct: 137 LAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRK 196
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
+ E YL+IL++F D + +SIH + L G S G A G W GP V + LA+
Sbjct: 197 KDSEPPNEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARK 256
Query: 131 DDWSSIVFHVALDNT-----------------LVVNQVKKLCTTNKRASSNPQWQPLVLV 173
+ + V H + L + V K C + + + +W P++L+
Sbjct: 257 NKEETDVKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCL--EFSEGDTEWPPILLL 314
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PL LG+ +NP YI + FTFPQSLG++GG
Sbjct: 315 VPLVLGLDKVNPRYIPSLIA---------------------------TFTFPQSLGILGG 347
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
KP + Y +G + +LDPH Q + V + QD + S+YHC + + +DP
Sbjct: 348 KPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVDT---SSYHCNTLRYVPLESLDP 404
Query: 294 SIAV 297
S+A+
Sbjct: 405 SLAL 408
>gi|145245643|ref|XP_001395089.1| cysteine protease atg4 [Aspergillus niger CBS 513.88]
gi|166990612|sp|A2QY50.1|ATG4_ASPNC RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|134079795|emb|CAK40930.1| unnamed protein product [Aspergillus niger]
Length = 404
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 101/305 (33%), Positives = 153/305 (50%), Gaps = 59/305 (19%)
Query: 18 DITSRLWFTYRKGFVPI----GDS-------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F PI GD G T+D GWGCM+R GQ
Sbjct: 81 DFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTGWGCMIRSGQS 140
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A AL L LGRDW+ +EE+ ++L +F D TAP+S+H+ GA S GK GE
Sbjct: 141 LLANALSMLVLGRDWRRGARFEEES--QLLSLFADTPTAPFSVHRFVKHGAESCGKYPGE 198
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ + L+ ++ +V+ D + V K + S +QP +++
Sbjct: 199 WFGPSATAKCIEALSSQCGNPTLKVYVSNDTSEVYQD--KFMDIARNTSG--AFQPTLIL 254
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI +I PVY +G+K FPQS+G+ GG
Sbjct: 255 LGTRLGIDNITPVYWDGLKAA---------------------------LQFPQSVGIAGG 287
Query: 234 KPNHALYFIGYVGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YF+G G+ + +LDPH T + + E S++++D TYH + R+H+ MD
Sbjct: 288 RPSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVD-TYHTRRLRRIHVRDMD 346
Query: 293 PSIAV 297
PS+ +
Sbjct: 347 PSMLI 351
>gi|398389911|ref|XP_003848416.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
gi|339468291|gb|EGP83392.1| hypothetical protein MYCGRDRAFT_49421 [Zymoseptoria tritici IPO323]
Length = 440
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 155/335 (46%), Gaps = 68/335 (20%)
Query: 4 ANKLSHQDL---EQIRRDITSRLWFTYRKGFVPI----------------------GDSG 38
+ + +DL Q D SR+W TYR F PI
Sbjct: 97 SKSMEEEDLGWPSQFLDDFESRVWMTYRNNFPPIQKASDPAATSNMSFATKLRSLANQGN 156
Query: 39 LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
T+D GWGCM+R GQ ++A ++ L LGRDW+ KE+ + +IL MF D AP+SIH
Sbjct: 157 FTSDTGWGCMIRSGQSLLANTVVMLRLGRDWR--RGQKEKQHHEILSMFADTPEAPFSIH 214
Query: 99 QIALTGASE-GKAVGEWFGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCT 156
+ GAS G GEWFGP+ A+ +R L KY D V+ D+ + ++ L
Sbjct: 215 KFVEHGASACGTYPGEWFGPSATARCIRALTEKYHDVGLRVYARPNDSDVYID---TLTA 271
Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
T + S++ + P ++V+ +RLGI+ + P Y +K L
Sbjct: 272 TTTQHSASETFSPTLIVLGVRLGIEKVTPAYHAALKSILEL------------------- 312
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
PQS+G+ GG+P+ + YF+G+ G+ +LDPHT + + +D E
Sbjct: 313 --------PQSVGIAGGRPSSSHYFVGHQGDHFFYLDPHTTRPMLTAQPTAEDVE----- 359
Query: 277 TYHCPQASRLHILHMDPSI----AVVSQRSYSDYK 307
+ H + RL I MDPS+ V + + D+K
Sbjct: 360 SCHTRRIRRLSIAEMDPSMLLGFLVRDKEDFEDWK 394
>gi|350631770|gb|EHA20141.1| hypothetical protein ASPNIDRAFT_178675 [Aspergillus niger ATCC
1015]
Length = 384
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 154/310 (49%), Gaps = 59/310 (19%)
Query: 13 EQIRRDITSRLWFTYRKGFVPI----GDS-------------------GLTTDKGWGCML 49
E D SR+W TYR F PI GD G T+D GWGCM+
Sbjct: 56 ESFLLDFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTGWGCMI 115
Query: 50 RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEG 108
R GQ ++A AL L LGRDW+ +EE+ ++L +F D TAP+S+H+ GA S G
Sbjct: 116 RSGQSLLANALSMLVLGRDWRRGARFEEES--QLLSLFADTPTAPFSVHRFVKHGAESCG 173
Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
K GEWFGP+ A+ + L+ ++ +V+ D + V K + S +Q
Sbjct: 174 KYPGEWFGPSATAKCIEALSSQCGNPTLKVYVSNDTSEVYQD--KFMDIARNTSG--AFQ 229
Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
P ++++ RLGI +I PVY +G+K FPQS+
Sbjct: 230 PTLILLGTRLGIDNITPVYWDGLKAA---------------------------LQFPQSV 262
Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
G+ GG+P+ + YF+G G+ + +LDPH T + + E S++++D TYH + R+H
Sbjct: 263 GIAGGRPSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVD-TYHTRRLRRIH 321
Query: 288 ILHMDPSIAV 297
+ MDPS+ +
Sbjct: 322 VRDMDPSMLI 331
>gi|449303631|gb|EMC99638.1| hypothetical protein BAUCODRAFT_344306 [Baudoinia compniacensis
UAMH 10762]
Length = 446
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 97/302 (32%), Positives = 146/302 (48%), Gaps = 61/302 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS----------------------GLTTDKGWGCMLRCGQMV 55
D+ +R+W TYR F PI S G T+D GWGCM+R GQ +
Sbjct: 117 DMEARIWLTYRNNFPPIAKSSDPSAGSAMSFSTKLRNIGNSGGFTSDAGWGCMIRSGQTL 176
Query: 56 IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEW 114
+A +L L LGRDW+ KE+ Y ++ +F D AP+SIH+ GA GK GEW
Sbjct: 177 LANSLATLKLGRDWR--RGQKEDDYKHLISLFADTPEAPFSIHKFVEHGAQACGKHPGEW 234
Query: 115 FGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
FGP+ A+ ++ L KY D V+ D + V+ L T + +N ++QP ++V
Sbjct: 235 FGPSATARSVQALTEKYRDVGLRVYARPDDGDVYVD---SLFATAGQMDANDEFQPTLIV 291
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ +RLGI I PVY +K + PQS+G+ GG
Sbjct: 292 LGIRLGIDRITPVYHAALKATLEM---------------------------PQSVGIAGG 324
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YF+G+ G++ +LDPHT + Q+ + ++ H + RL I MDP
Sbjct: 325 RPSSSHYFVGHQGDNFFYLDPHTTRQA-----IPQNPSAEDLASCHTRRLRRLKIAEMDP 379
Query: 294 SI 295
S+
Sbjct: 380 SM 381
>gi|15232213|ref|NP_191554.1| cysteine protease ATG4b [Arabidopsis thaliana]
gi|75182325|sp|Q9M1Y0.1|ATG4B_ARATH RecName: Full=Cysteine protease ATG4b; AltName:
Full=Autophagy-related protein 4 homolog b;
Short=AtAPG4b; Short=Protein autophagy 4b
gi|7019689|emb|CAB75814.1| putative protein [Arabidopsis thaliana]
gi|19912145|dbj|BAB88384.1| autophagy 4b [Arabidopsis thaliana]
gi|110742150|dbj|BAE99003.1| hypothetical protein [Arabidopsis thaliana]
gi|332646468|gb|AEE79989.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 477
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 151/304 (49%), Gaps = 50/304 (16%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW-Q 70
L R+D +S + TYR+GF PIGD+ T+D WGCMLR GQM+ AQALLF LGR W +
Sbjct: 138 LAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRK 197
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
+ +E YL+IL++F D + +SIH + L G S G A G W GP V + LA+
Sbjct: 198 KDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARK 257
Query: 131 DDWS--------SIVFHVALDNT---------LVVNQVKKLCTTNKRASSNPQWQPLVLV 173
+ S+ H+ + L + V K C + + +W P++L+
Sbjct: 258 NKEETDDKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCL--EFSEGETEWPPILLL 315
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PL LG+ +NP YI + FTFPQSLG++GG
Sbjct: 316 VPLVLGLDRVNPRYIPSLIA---------------------------TFTFPQSLGILGG 348
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
KP + Y +G + +LDPH Q + V + QD + S+YHC + + +DP
Sbjct: 349 KPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVDT---SSYHCNTLRYVPLESLDP 405
Query: 294 SIAV 297
S+A+
Sbjct: 406 SLAL 409
>gi|308491308|ref|XP_003107845.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
gi|308249792|gb|EFO93744.1| CRE-ATG-4.2 protein [Caenorhabditis remanei]
Length = 518
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/320 (28%), Positives = 157/320 (49%), Gaps = 57/320 (17%)
Query: 2 RHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL 61
+ AN +S E D SRLW TYR F P+ ++ TTD GWGCM+R QM++AQA++
Sbjct: 158 KDANGVS-SGFEDFCSDYYSRLWITYRTDFAPLLNTDTTTDCGWGCMIRTTQMMVAQAIM 216
Query: 62 FLHLGRDWQW--------NVNSKEEAYLK-------ILKMFEDRRTAPYSIHQIALTGAS 106
GR+W++ +N +E + + ILK+FED+ ++P IH++ A
Sbjct: 217 LNRFGREWRFVRRKKSYVTINGEETDFDREKIKEWMILKLFEDKPSSPLGIHRMVEISAK 276
Query: 107 E--GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSN 164
E KAVG W+ P+ +++K + + + D + ++ ++ + +
Sbjct: 277 EKGKKAVGSWYSPSEAVFIMKKAL-----TESISPLTGDTAMYLSIDGRVHIRDIEVETK 331
Query: 165 PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
+ L+LVI +RLG ++NP+Y+ + + F+
Sbjct: 332 NWMKTLILVIVVRLGAAELNPIYVPHLMRL---------------------------FSM 364
Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT-------NQNIGCVYDKEQDSEKKLDST 277
LGV GG+P+H+ +F+G+ G+ +I+LDPH + N + S+K + +
Sbjct: 365 ESCLGVTGGRPDHSCWFVGFYGDQIIYLDPHVAHEYIPIDMNFNVNMTDNKKSKKCPERS 424
Query: 278 YHCPQASRLHILHMDPSIAV 297
YHC S++H L MDPS A+
Sbjct: 425 YHCRLLSKMHFLDMDPSCAL 444
>gi|126723748|ref|NP_001075911.1| cysteine protease ATG4C [Bos taurus]
gi|126010621|gb|AAI33599.1| ATG4C protein [Bos taurus]
Length = 458
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 109/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIAYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLK 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K CT+ AS N + +++++P+RLG + N Y++ +K
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTNDKAVIILVPVRLGGERTNADYLDFVK- 311
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
PH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|296489147|tpg|DAA31260.1| TPA: APG4 autophagy 4 homolog C [Bos taurus]
Length = 458
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 109/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPTISLKEKMERYSDDREMQNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLK 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K CT+ AS N + +++++P+RLG + N Y++ +K
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTNDKAVIILVPVRLGGERTNADYLDFVK- 311
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
PH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|425778592|gb|EKV16710.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
digitatum PHI26]
Length = 401
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 99/318 (31%), Positives = 152/318 (47%), Gaps = 62/318 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F PI + G T+D GWGCM+R GQ
Sbjct: 74 DFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQS 133
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A A L LGRDW+ KEE K++ MF D AP+SIH+ GA S GK GE
Sbjct: 134 LLANAFSVLLLGRDWR--RGEKEEEESKLISMFADHPEAPFSIHKFVNRGAESCGKYPGE 191
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L+ + + +V D + V + ++ QP +++
Sbjct: 192 WFGPSATAKCIQLLSTQSEAHRLRVYVTNDTSDVYEDKFAHVSHDRSGCI----QPTLIL 247
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
I RLGI+++ P Y +G+ R T+PQS+G+ GG
Sbjct: 248 IGTRLGIENVTPAYWDGL---------------------------RAALTYPQSVGIAGG 280
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YF+G + FLDPHT + E ++++LDS Y+ + R+HI MDP
Sbjct: 281 RPSASHYFLGAQDCHLFFLDPHTTRPATPYRPDELYTQEELDS-YYTSRLRRIHIKDMDP 339
Query: 294 SI----AVVSQRSYSDYK 307
S+ + + ++D+K
Sbjct: 340 SMLIGFLIKDEEDWADWK 357
>gi|195328749|ref|XP_002031074.1| GM25780 [Drosophila sechellia]
gi|194120017|gb|EDW42060.1| GM25780 [Drosophila sechellia]
Length = 703
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 103/321 (32%), Positives = 163/321 (50%), Gaps = 65/321 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM+ AQ L+ LGR W
Sbjct: 296 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFAQGLICHFLGRSW 355
Query: 70 QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ S+ + + KI+K F D +++P+SIH + G GK G+W+GP +V+
Sbjct: 356 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 415
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA--------------S 162
+L+ ++ D+ +I +VA D T+ + ++ C+ + A +
Sbjct: 416 YLLKHALEHASQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQKAKRPQA 475
Query: 163 SNPQ------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
NP+ W+ L+++IPLRLG +NPVY + +K+L ST +
Sbjct: 476 ENPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEHC- 519
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
LG+IGGKP H+LYF+G+ + +I LDPH Q + V + E
Sbjct: 520 -----------LGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 563
Query: 277 TYHCPQASRLHILHMDPSIAV 297
++HC +L MDPS +
Sbjct: 564 SFHCKSPRKLKASKMDPSCCI 584
>gi|154300262|ref|XP_001550547.1| hypothetical protein BC1G_11320 [Botryotinia fuckeliana B05.10]
gi|166990615|sp|A6SDQ3.1|ATG4_BOTFB RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|347841273|emb|CCD55845.1| similar to cysteine protease atg4 [Botryotinia fuckeliana]
Length = 439
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 143/305 (46%), Gaps = 57/305 (18%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D +++W TYR F I S G T+D GWGCM+R GQ
Sbjct: 106 DFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRSGQS 165
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A ALL L +GR+W+ V+S EE KIL +F D APYSIH+ GAS GK GE
Sbjct: 166 LLANALLTLRMGREWRRGVSSNEER--KILSLFADDPRAPYSIHKFVEHGASACGKHPGE 223
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L+ S + ++ D + V K + K S+ + P +++
Sbjct: 224 WFGPSATARCIQALSNSQAKSELRVYITGDGSDVYED--KFMSIAKPNHSD--FTPTLIL 279
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLG+ I PVY +K Y PQS+G+ GG
Sbjct: 280 VGTRLGLDKITPVYWEALK---------------------------YSLQMPQSVGIAGG 312
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YFIG +D +LDPH + D +D + + H + RLHI MDP
Sbjct: 313 RPSSSHYFIGVQESDFFYLDPHQTRPALPYKDNVEDYTTEDIDSCHTRRLRRLHIKEMDP 372
Query: 294 SIAVV 298
S+ +
Sbjct: 373 SMLIA 377
>gi|355669957|gb|AER94693.1| ATG4 autophagy related 4-like protein C [Mustela putorius furo]
Length = 396
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 110/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 13 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 72
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 73 WPDALNIENSDSESWTSNTVKKFTASFEASLSGEGELKTPTVSQKEAIRRYSDDHEMRNE 132
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 133 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 192
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K CT+ AS N + ++++IP+RLG + N Y++ +K
Sbjct: 193 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTDDKAVIILIPVRLGGERTNTDYLDFVK- 249
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 250 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 283
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
PH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 284 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 335
>gi|426215654|ref|XP_004002085.1| PREDICTED: cysteine protease ATG4C isoform 1 [Ovis aries]
gi|426215656|ref|XP_004002086.1| PREDICTED: cysteine protease ATG4C isoform 2 [Ovis aries]
Length = 458
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 109/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLK 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K CT+ AS N + +++++P+RLG + N Y++ +K
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTNDKAVIILVPVRLGGERTNADYLDFVK- 311
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
PH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|296804856|ref|XP_002843276.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
gi|238845878|gb|EEQ35540.1| cysteine protease atg4 [Arthroderma otae CBS 113480]
Length = 473
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 100/307 (32%), Positives = 147/307 (47%), Gaps = 57/307 (18%)
Query: 18 DITSRLWFTYRKGFVPI----GDS------------------GLTTDKGWGCMLRCGQMV 55
D SRLW TYR F PI G S G T+D GWGCM+R GQ +
Sbjct: 141 DFESRLWITYRSHFPPIPKTGGSSSSSMPLGVRLRSQLIDTQGFTSDTGWGCMIRSGQSL 200
Query: 56 IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEW 114
+A LLFL LGR W+ +EE+ ++L +F D AP+SIH+ GA+ GK GEW
Sbjct: 201 LANTLLFLRLGRGWRRGSQEQEES--ELLSLFADHPRAPFSIHRFVQHGATACGKCPGEW 258
Query: 115 FGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVN-QVKKL-CTTNKRASSNPQWQPLVL 172
FGP AQ ++ LA + + ++ D + + Q +++ C + +P ++
Sbjct: 259 FGPAAAAQCIQALANGHPQAGLNVYITSDGSDIYERQFREIACRGLGEDGEDDSIKPTLI 318
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
++ +RLGI + PVY +K+ FPQS+G+ G
Sbjct: 319 LLGVRLGIDRVTPVYWESLKEV---------------------------IRFPQSVGIAG 351
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD--SEKKLDSTYHCPQASRLHILH 290
G+P+ + YFI G+ +LDPH + +D S +L STYH + RLHI
Sbjct: 352 GRPSSSHYFIATQGDTFFYLDPHQTRPSLPPRTAGEDVYSPGEL-STYHTRRLRRLHIRE 410
Query: 291 MDPSIAV 297
MDPS+ +
Sbjct: 411 MDPSMLI 417
>gi|453080987|gb|EMF09037.1| putative cysteine protease atg4 [Mycosphaerella populorum SO2202]
Length = 447
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 149/320 (46%), Gaps = 65/320 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS----------------------GLTTDKGWGCMLRCGQMV 55
D SR+W TYR GF PI S G T+D GWGCM+R GQ +
Sbjct: 115 DFESRIWITYRDGFPPIAKSTDPAAGSKMSFTTKLRSLTNQQGFTSDTGWGCMIRSGQSL 174
Query: 56 IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEW 114
+A +L LGRDW+ K+E + IL +F D AP+SIH+ GA G GEW
Sbjct: 175 LANTILLHRLGRDWR--KGQKQEEHKNILSLFADTPEAPFSIHKFVEHGAQACGTYPGEW 232
Query: 115 FGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
FGPN A+ LR L KY V+ D+ + + L T + ++ ++QP ++V
Sbjct: 233 FGPNATARCLRALTDKYHGAGLRVYARPNDSDVYAD---ALIETATQKDADDKFQPTLIV 289
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ +RLGI+ + Y +K L PQS+G+ GG
Sbjct: 290 LGIRLGIEKVTSAYHVALKAALEL---------------------------PQSVGIAGG 322
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YF+G+ G+ +LDPHT +++ +D E T H + +L + MDP
Sbjct: 323 RPSSSHYFLGHQGDSFFYLDPHTTRHMLSPQPSAEDIE-----TCHTRRIRKLPLSEMDP 377
Query: 294 SI----AVVSQRSYSDYKNV 309
S+ V SQ + +++
Sbjct: 378 SMLLGFLVRSQEEFEEWRKA 397
>gi|440902657|gb|ELR53425.1| Cysteine protease ATG4C [Bos grunniens mutus]
Length = 458
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 109/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPTISLKEKIERYSDDREMQNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IHHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLK 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K CT+ AS N + +++++P+RLG + N Y++ +K
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTNDKAVIILVPVRLGGERTNADYLDFVK- 311
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
PH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|195570668|ref|XP_002103326.1| GD20357 [Drosophila simulans]
gi|194199253|gb|EDX12829.1| GD20357 [Drosophila simulans]
Length = 703
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 103/321 (32%), Positives = 162/321 (50%), Gaps = 65/321 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM+ AQ L+ LGR W
Sbjct: 296 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFAQGLICHFLGRSW 355
Query: 70 QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ S+ + + KI+K F D +++P+SIH + G GK G+W+GP +V+
Sbjct: 356 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 415
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
+L+ ++ D+ +I +VA D T+ + ++ C+ + A + PQ
Sbjct: 416 YLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQQAKRPQA 475
Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
W+ L+++IPLRLG +NPVY + +K+L ST +
Sbjct: 476 ETPKTEQQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEHC- 519
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
LG+IGGKP H+LYF+G+ + +I LDPH Q + V + E
Sbjct: 520 -----------LGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 563
Query: 277 TYHCPQASRLHILHMDPSIAV 297
++HC +L MDPS +
Sbjct: 564 SFHCKSPRKLKASKMDPSCCI 584
>gi|326478657|gb|EGE02667.1| cysteine protease atg4 [Trichophyton equinum CBS 127.97]
Length = 454
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 98/321 (30%), Positives = 150/321 (46%), Gaps = 72/321 (22%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDS----------------------------GLTTDKGW 45
Q D S+LW TYR F PI + G T+D GW
Sbjct: 115 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 174
Query: 46 GCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA 105
GCM+R GQ ++A LLFL LGRDW+ +EE+ +++ +F D AP+SIH+ GA
Sbjct: 175 GCMIRSGQALLANTLLFLRLGRDWRRGSKVQEES--ELVSLFADHPRAPFSIHRFVHHGA 232
Query: 106 SE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSN 164
+ GK GEWFGP+ +Q ++ L K + + ++ D + + + K ++
Sbjct: 233 TACGKCPGEWFGPSAASQCIQALVKSNPQVGLRVYITSDGSDIYEKQFKEVACDESGGI- 291
Query: 165 PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
QP ++++ +RLGI + PVY +D +K L F
Sbjct: 292 ---QPTLILLGVRLGIDRVTPVY---------------WDSLKAL------------LRF 321
Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--------S 276
PQS+G+ GG+P+ + YFI G+ +LDPH Q C+ + + + + S
Sbjct: 322 PQSVGIAGGRPSSSHYFIATQGDSFFYLDPH--QTRPCLTPRAESTGDEESHPYSPEELS 379
Query: 277 TYHCPQASRLHILHMDPSIAV 297
TYH + RLHI MDPS+ +
Sbjct: 380 TYHTRRLRRLHIREMDPSMLI 400
>gi|344278625|ref|XP_003411094.1| PREDICTED: cysteine protease ATG4C [Loxodonta africana]
Length = 458
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 155/344 (45%), Gaps = 91/344 (26%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGESELKTPSISLKKTIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIVSWFGDSPLALFGLHQLIEYGKKAGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K C + AS NP + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQCAS--MASDNPDNKAVIILVPVRLGGERTNVDYLEFVKG 312
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 313 --------------ILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
PH Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|121934653|sp|Q0U199.1|ATG4_PHANO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
Length = 467
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/265 (32%), Positives = 128/265 (48%), Gaps = 53/265 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
D SR+W TYR GF PI S G T+D G+GCM+R GQ ++
Sbjct: 99 DFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCMIRSGQCIL 158
Query: 57 AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
A AL L LGRDW+W N ++ + +IL +F D AP+SIH+ GA+ GK GEWF
Sbjct: 159 ANALQILRLGRDWRWQENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAVCGKYPGEWF 218
Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
GP+ A+ ++ LA + + +V+ D V K ++ WQP ++++
Sbjct: 219 GPSAAARCIQDLANKHREAGLKVYVSGDGADVYEDKLKQVAVDEDG----LWQPTLILVG 274
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
RLGI I PVY +K PQS+G+ GG+P
Sbjct: 275 TRLGIDKITPVYWEALKAS---------------------------LQIPQSIGIAGGRP 307
Query: 236 NHALYFIGYVGNDVIFLDPHTNQNI 260
+ + YF+G GN+ +LDPH+ + +
Sbjct: 308 SASHYFVGVQGNNFYYLDPHSTRPL 332
>gi|326470473|gb|EGD94482.1| hypothetical protein TESG_01998 [Trichophyton tonsurans CBS 112818]
Length = 469
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 98/321 (30%), Positives = 150/321 (46%), Gaps = 72/321 (22%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDS----------------------------GLTTDKGW 45
Q D S+LW TYR F PI + G T+D GW
Sbjct: 130 QFLDDFESKLWITYRSQFPPIPKTTKAGSGDSSSSSSISLGVRLRSQLIDTQGFTSDTGW 189
Query: 46 GCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA 105
GCM+R GQ ++A LLFL LGRDW+ +EE+ +++ +F D AP+SIH+ GA
Sbjct: 190 GCMIRSGQALLANTLLFLRLGRDWRRGSKVQEES--ELVSLFADHPRAPFSIHRFVHHGA 247
Query: 106 SE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSN 164
+ GK GEWFGP+ +Q ++ L K + + ++ D + + + K ++
Sbjct: 248 TACGKCPGEWFGPSAASQCIQALVKSNPQVGLRVYITSDGSDIYEKQFKEVACDESGGI- 306
Query: 165 PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
QP ++++ +RLGI + PVY +D +K L F
Sbjct: 307 ---QPTLILLGVRLGIDRVTPVY---------------WDSLKAL------------LRF 336
Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--------S 276
PQS+G+ GG+P+ + YFI G+ +LDPH Q C+ + + + + S
Sbjct: 337 PQSVGIAGGRPSSSHYFIATQGDSFFYLDPH--QTRPCLTPRAESTGDEESHPYSPEELS 394
Query: 277 TYHCPQASRLHILHMDPSIAV 297
TYH + RLHI MDPS+ +
Sbjct: 395 TYHTRRLRRLHIREMDPSMLI 415
>gi|194901010|ref|XP_001980048.1| GG20629 [Drosophila erecta]
gi|190651751|gb|EDV49006.1| GG20629 [Drosophila erecta]
Length = 708
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 163/321 (50%), Gaps = 65/321 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM++AQ L+ LGR W
Sbjct: 301 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 360
Query: 70 QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ S+ + + KI+K F D +++P+SIH + G GK G+W+GP +V+
Sbjct: 361 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 420
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
+L+ ++ D+ +I +VA D T+ + ++ C+ + A + PQ
Sbjct: 421 YLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQQAKRPQA 480
Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
W+ ++++IPLRLG +NPVY + +K+L ST +
Sbjct: 481 ETPKTEQQQHWKSVIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEH-- 523
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
LG+IGGKP H+LYF+G+ + +I LDPH Q + V + E
Sbjct: 524 ----------CLGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 568
Query: 277 TYHCPQASRLHILHMDPSIAV 297
++HC +L MDPS +
Sbjct: 569 SFHCKSPRKLKASKMDPSCCI 589
>gi|195501322|ref|XP_002097748.1| GE26385 [Drosophila yakuba]
gi|194183849|gb|EDW97460.1| GE26385 [Drosophila yakuba]
Length = 706
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 163/321 (50%), Gaps = 65/321 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM++AQ L+ LGR W
Sbjct: 299 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFLGRSW 358
Query: 70 QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ S+ + + KI+K F D +++P+SIH + G GK G+W+GP +V+
Sbjct: 359 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 418
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
+L+ ++ D+ +I +VA D T+ + ++ C+ + A + PQ
Sbjct: 419 YLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQQAKRPQA 478
Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
W+ ++++IPLRLG +NPVY + +K+L ST +
Sbjct: 479 ETPKTEQHQHWKSVIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEH-- 521
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
LG+IGGKP H+LYF+G+ + +I LDPH Q + V + E
Sbjct: 522 ----------CLGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 566
Query: 277 TYHCPQASRLHILHMDPSIAV 297
++HC +L MDPS +
Sbjct: 567 SFHCKSPRKLKASKMDPSCCI 587
>gi|302783857|ref|XP_002973701.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
gi|300158739|gb|EFJ25361.1| hypothetical protein SELMODRAFT_54035 [Selaginella moellendorffii]
Length = 358
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 91/288 (31%), Positives = 138/288 (47%), Gaps = 43/288 (14%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
D +SR+W TYR+GF IG+S T+D GWGCM+R GQM+ AQAL+ LGR W+
Sbjct: 72 DFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHRLGRGWRRGEQPYA 131
Query: 78 EAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSS 135
YL+IL F D + P+SIH G+ G A G W GP + + LA+ D
Sbjct: 132 REYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALCHAIEALARNDGRGR 191
Query: 136 ------IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
V+ V+ D L + P+++++PL LG+ INP Y+
Sbjct: 192 QGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC-----PVLILVPLVLGLDKINPRYLP 246
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+ R F FPQS+G+ GGKP ++YF+G +
Sbjct: 247 SL---------------------------RATFAFPQSVGIAGGKPAASVYFVGVQDDQA 279
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPH Q + V + + + ++YHC ++ + +DPS+A+
Sbjct: 280 LYLDPHEVQKVVSVSGESLEFDS---ASYHCSVVRKMPLDAIDPSLAL 324
>gi|431905146|gb|ELK10197.1| Cysteine protease ATG4A [Pteropus alecto]
Length = 342
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 93/265 (35%), Positives = 135/265 (50%), Gaps = 69/265 (26%)
Query: 35 GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTA 93
G +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W ++ + Y +IL+ F DR+
Sbjct: 67 GGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEKQKEQPKEYQRILQCFLDRKDC 126
Query: 94 PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKK 153
YSIHQ+ + +L A +A +N
Sbjct: 127 CYSIHQM-----------------EKMCCILPLSAD----------IATENPSGSPNASN 159
Query: 154 LCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
+ ++ P W+PL+L++PLRLGI INPVY++ K
Sbjct: 160 --HSKGTSACCPAWKPLLLIVPLRLGINQINPVYVDAFK--------------------- 196
Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKK 273
SLG +GGKPN+A YFIG++G+++IFLDPHT Q D E++
Sbjct: 197 -------------SLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVD 240
Query: 274 LDSTYHCPQ-ASRLHILHMDPSIAV 297
D T+HC Q R++IL++DPS+A+
Sbjct: 241 -DQTFHCLQPPQRMNILNLDPSVAL 264
>gi|301764643|ref|XP_002917740.1| PREDICTED: cysteine protease ATG4C-like [Ailuropoda melanoleuca]
gi|281350282|gb|EFB25866.1| hypothetical protein PANDA_006093 [Ailuropoda melanoleuca]
Length = 458
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 109/357 (30%), Positives = 160/357 (44%), Gaps = 92/357 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPTVSQKETIRRYSDDHEMQNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK ++ D
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEETRHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K CT+ AS N + ++++IP+RLG + N Y++ +K
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTDDKAVIILIPVRLGGERTNTDYLDFVK- 311
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
PH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|302787965|ref|XP_002975752.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
gi|300156753|gb|EFJ23381.1| hypothetical protein SELMODRAFT_54753 [Selaginella moellendorffii]
Length = 358
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/288 (31%), Positives = 138/288 (47%), Gaps = 43/288 (14%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
D +SR+W TYR+GF IG+S T+D GWGCM+R GQM+ AQAL+ LGR W+
Sbjct: 72 DFSSRIWITYRQGFEAIGESKFTSDVGWGCMIRSGQMLFAQALVCHRLGRGWRRGEQPYA 131
Query: 78 EAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSS 135
YL+IL F D + P+SIH G+ G A G W GP + + LA+ D
Sbjct: 132 REYLEILHSFVDSPSPACPFSIHNFIRAGSPYGLAAGSWLGPYALCHAIEALARNDGRGR 191
Query: 136 ------IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
V+ V+ D L + P+++++PL LG+ INP Y+
Sbjct: 192 EGEDHLAVYVVSGDAHGERGGAPVLYNVDVAGKC-----PVLILVPLVLGLDKINPRYLP 246
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+ R F FPQS+G+ GGKP ++YF+G +
Sbjct: 247 SL---------------------------RATFAFPQSVGIAGGKPAASVYFVGVQDDQA 279
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPH Q + V + + + ++YHC ++ + +DPS+A+
Sbjct: 280 LYLDPHEVQKVVSVSGESLEFDS---ASYHCSVVRKMLLDAIDPSLAL 324
>gi|73956170|ref|XP_852273.1| PREDICTED: cysteine protease ATG4C isoform 2 [Canis lupus
familiaris]
gi|73956176|ref|XP_865426.1| PREDICTED: cysteine protease ATG4C isoform 4 [Canis lupus
familiaris]
Length = 458
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/357 (30%), Positives = 159/357 (44%), Gaps = 92/357 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S TTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSAFTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W---------------------------------------NVNSKE-------------E 78
W V+ KE E
Sbjct: 135 WPDALNIENSDSDSWTSNTVKKFTASFEASLSGESELKTPTVSQKETIRRHSDDHEMRNE 194
Query: 79 AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
Y KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K CT+ AS N + ++++IP+RLG + N Y++ +K
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTDDKAVIILIPVRLGGERTNTDYLDFVK- 311
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
PH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|17862242|gb|AAL39598.1| LD17482p [Drosophila melanogaster]
Length = 653
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 162/321 (50%), Gaps = 65/321 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM+ AQ L+ LGR W
Sbjct: 246 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFAQGLICHFLGRSW 305
Query: 70 QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ S+ + + KI+K F D +++P+SIH + G GK G+W+GP +V+
Sbjct: 306 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 365
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
+L+ ++ D+ +I +VA D T+ + ++ C+ + A + PQ
Sbjct: 366 YLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQQAKRPQA 425
Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
W+ L+++IPLRLG +NPVY + +K+L ST +
Sbjct: 426 ETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEHC- 469
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
LG++GGKP H+LYF+G+ + +I LDPH Q + V + E
Sbjct: 470 -----------LGILGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 513
Query: 277 TYHCPQASRLHILHMDPSIAV 297
++HC +L MDPS +
Sbjct: 514 SFHCKSPRKLKASKMDPSCCI 534
>gi|24647125|ref|NP_650452.1| CG6194 [Drosophila melanogaster]
gi|23171357|gb|AAF55180.2| CG6194 [Drosophila melanogaster]
gi|261490735|gb|ACX83596.1| RE44406p [Drosophila melanogaster]
Length = 668
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 162/321 (50%), Gaps = 65/321 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM+ AQ L+ LGR W
Sbjct: 261 EGIEGFRRDFYSRIWMTYRREFPIMNGSNYTSDCGWGCMLRSGQMLFAQGLICHFLGRSW 320
Query: 70 QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ S+ + + KI+K F D +++P+SIH + G GK G+W+GP +V+
Sbjct: 321 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGEHLGKKPGDWYGPASVS 380
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
+L+ ++ D+ +I +VA D T+ + ++ C+ + A + PQ
Sbjct: 381 YLLKHALEHAAQENADFDNISVYVAKDCTIYLQDIEDQCSIPEPAPKPHVPWQQAKRPQA 440
Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
W+ L+++IPLRLG +NPVY + +K+L ST +
Sbjct: 441 ETTKTEQQQHWKSLIVLIPLRLGSDKLNPVYAH---------------CLKLLLSTEH-- 483
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
LG++GGKP H+LYF+G+ + +I LDPH Q + V + E
Sbjct: 484 ----------CLGILGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLH 528
Query: 277 TYHCPQASRLHILHMDPSIAV 297
++HC +L MDPS +
Sbjct: 529 SFHCKSPRKLKASKMDPSCCI 549
>gi|50344862|ref|NP_001002103.1| cysteine protease ATG4C [Danio rerio]
gi|47938047|gb|AAH71514.1| Autophagy-related 4C (yeast) [Danio rerio]
Length = 463
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 109/350 (31%), Positives = 158/350 (45%), Gaps = 95/350 (27%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++++ R+D TSR+W TYR+ F + S T+D GWGC LR GQM++AQALL LGRDW+
Sbjct: 74 NVDEFRKDFTSRVWLTYREEFPALPGSSFTSDCGWGCTLRAGQMILAQALLLHILGRDWK 133
Query: 71 WNV-------------------------------------------NSKEEA--YLK--- 82
W+ EEA YLK
Sbjct: 134 WSEALSLEPLDTETWTSSAARRLVATLEASIQGERAQASQPLCPVQGEAEEADSYLKETY 193
Query: 83 ---ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWSSI 136
I+ F D +A I+++ G + GK G+W+GP VA +LRK A I
Sbjct: 194 HRTIVSWFGDGPSAQLGIYKLVELGMTSGKQAGDWYGPAVVAHILRKAVDEAVDAMLKGI 253
Query: 137 VFHVALDNTLVVNQVKKLCTTNKRASSNPQW---------QPLVLVIPLRLGIQDINPVY 187
+VA D T+ V +T + S+PQ + +V++IP+RLG + INP Y
Sbjct: 254 RVYVAQDCTVYSADVIDSHSTRTESHSDPQGLDSGASPDSRAVVILIPVRLGGEKINPEY 313
Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
+N +K ILS Y +G+IGGKP A YF+G+ +
Sbjct: 314 LNFVK--------------SILSLEY-------------CIGIIGGKPKQAYYFVGFQDD 346
Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+I++DPH Q+ V S+ L S +HCP ++ MDPS +
Sbjct: 347 SLIYMDPHYCQSFVDV----STSDFPLQS-FHCPSPKKMSFSKMDPSCTI 391
>gi|17544636|ref|NP_502208.1| Protein ATG-4.2 [Caenorhabditis elegans]
gi|5824904|emb|CAB54515.1| Protein ATG-4.2 [Caenorhabditis elegans]
Length = 521
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 93/315 (29%), Positives = 156/315 (49%), Gaps = 58/315 (18%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
E D SRLW TYR F + D+ TTD GWGCM+R QM++AQA++ GRDW++
Sbjct: 168 FENFCSDYYSRLWITYRTDFPALLDTDTTTDCGWGCMIRTTQMMVAQAIMVNRFGRDWRF 227
Query: 72 N--------VNSKEEAYLK-------ILKMFEDRRTAPYSIHQ-IALTGASEG-KAVGEW 114
+ E+ + + ILK+FED+ TAP IH+ + + +G KAVG W
Sbjct: 228 TRRKRSHVAAHGDEDDFDREKIQEWMILKLFEDKPTAPLGIHKMVGIAAMGKGKKAVGSW 287
Query: 115 FGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
+ P+ +++K A + S + + A ++++ ++ + + + L+LVI
Sbjct: 288 YSPSEAVFIMKK-ALTESSSPLTGNTA----MLLSIDGRVHIRDIEVETKNWMKKLILVI 342
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
+RLG ++NP+Y+ + + +A+ LG+ GG+
Sbjct: 343 VVRLGAAELNPIYVPHLMRLFAM---------------------------ESCLGITGGR 375
Query: 235 PNHALYFIGYVGNDVIFLDPHT---------NQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
P+H+ +F+GY G+ +I+LDPH N N V + ++K + +YHC S+
Sbjct: 376 PDHSSWFVGYYGDQIIYLDPHVAHEYIPIDINPNTNVVDSDSKKAKKCPEKSYHCRLLSK 435
Query: 286 LHILHMDPSIAVVSQ 300
+H MDPS A+ Q
Sbjct: 436 MHFFDMDPSCALCFQ 450
>gi|355755452|gb|EHH59199.1| Cysteine protease ATG4D, partial [Macaca fascicularis]
Length = 427
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 146/323 (45%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 61 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 120
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + +E + +I+ F D AP+
Sbjct: 121 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 180
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 181 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARL 240
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 241 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 278
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIG Q V + E
Sbjct: 279 ----RCELC----LGIMGGKPRHSLYFIGXXXXXXXXXXXXXCQPTVDVSQADFPLE--- 327
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 328 --SFHCTSPRKMAFAKMDPSCTV 348
>gi|452837994|gb|EME39935.1| hypothetical protein DOTSEDRAFT_47435 [Dothistroma septosporum
NZE10]
Length = 442
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 98/319 (30%), Positives = 149/319 (46%), Gaps = 64/319 (20%)
Query: 4 ANKLSHQDL---EQIRRDITSRLWFTYRKGFVPIGDS----------------------G 38
+ + +DL + D+ S++W TYR F PI S G
Sbjct: 99 SKAMDEEDLGWPSEFLEDVESKIWLTYRNNFPPIPKSSEAAATSAMSFTTKLRNFANKDG 158
Query: 39 LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
T+D GWGCM+R GQ ++A A+L LGRDW+ KE Y IL +F D +P SIH
Sbjct: 159 FTSDTGWGCMIRSGQSLLANAILIHRLGRDWR--RGDKEREYKDILSLFADTPESPLSIH 216
Query: 99 QIALTGASE-GKAVGEWFGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCT 156
+ GA G GEWFGPN A+ +R L KY + V+ D+ + V+ L
Sbjct: 217 KFVEHGAQACGTYPGEWFGPNATARCIRALTEKYHEAGLQVYSRPNDSDVYVD---SLMQ 273
Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
T + ++ ++QP ++V+ +RLGI+ + P Y +K L
Sbjct: 274 TAAQKDADDKFQPTLIVLGIRLGIEKVTPAYHAALKAALEL------------------- 314
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
QS+G+ GG+P+ + YFIG+ G++ +LDPHT + + +D +
Sbjct: 315 --------SQSVGIAGGRPSSSHYFIGHQGDNFFYLDPHTTRPMLSPQPLAEDI-----N 361
Query: 277 TYHCPQASRLHILHMDPSI 295
+ H + RL I MDPS+
Sbjct: 362 SCHTRRVRRLGIAEMDPSM 380
>gi|395840680|ref|XP_003793181.1| PREDICTED: cysteine protease ATG4C [Otolemur garnettii]
Length = 457
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 104/343 (30%), Positives = 153/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPGALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETMRKYSDYHETRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK AK+ D
Sbjct: 195 IYHRKIVSWFGDSPLAFFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEAKHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V + + R S + + +++++P+RLG + NP Y+ +K
Sbjct: 255 GITIYVAQDCTVYNSDVIDTQSAS-RTSEGAEDKAVIILVPVRLGGERTNPDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQPFVDVSVKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|426230580|ref|XP_004009345.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Ovis
aries]
Length = 438
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 95/288 (32%), Positives = 145/288 (50%), Gaps = 36/288 (12%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGTLTSDCGWGCMLRSGQMMLAQGLLLHLLPRDWT 166
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAK 129
W+ + P G + GK G+W+GP+ VA +LRK +
Sbjct: 167 WSQGAGLGPAEPPGLGSPSPGPGPXXXXXXXSWGRAPGKKAGDWYGPSLVAHILRKAVES 226
Query: 130 YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYIN 189
+ + +V +V+ D T+ V +L R+ +W+ +V+++P+RLG + +NPVY+
Sbjct: 227 CSEVTRLVVYVSQDCTVYKADVARLVA---RSDPTAEWKSVVILVPVRLGGETLNPVYVP 283
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
+K+ R E LG++GG P H+LYFIGY + +
Sbjct: 284 CVKELL-----------------------RSELC----LGIMGGTPRHSLYFIGYQDDFL 316
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++LDPH Q D Q ++ L+S +HC ++ MDPS V
Sbjct: 317 LYLDPHYCQP---TVDVSQ-ADFPLES-FHCTSPRKMAFAKMDPSCTV 359
>gi|149709514|ref|XP_001500964.1| PREDICTED: cysteine protease ATG4C [Equus caballus]
Length = 458
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 108/357 (30%), Positives = 161/357 (45%), Gaps = 92/357 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D TSR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFTSRIWLTYREEFPQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NVNSKE----------------EAYL----------------------------- 81
W N+ + + EA L
Sbjct: 135 WPDALNIENSDFESWTSNTVKKFTASFEASLSEERELKTPTISLKETIGRYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K C + AS + + +++++P+RLG + N Y++ +K
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCAS--MASDHADDKAVIILVPVRLGGERTNTDYLDFVK- 311
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
PH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|325091702|gb|EGC45012.1| cysteine protease [Ajellomyces capsulatus H88]
Length = 508
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 97/308 (31%), Positives = 148/308 (48%), Gaps = 58/308 (18%)
Query: 18 DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
D S++W TYR F +P + G TTD GWGCM+R GQ
Sbjct: 128 DFESKIWLTYRSNFPLIPKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQS 187
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL L LGRDW+ KEE+ K+L +F D AP+SIH+ GAS GK GE
Sbjct: 188 LLANALAILSLGRDWRRGTKIKEES--KLLSLFADDPKAPFSIHRFVEHGASACGKYPGE 245
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKR-ASSNPQWQPLV 171
WFGP+ A+ ++ L+ + + + +V D + V ++ + + + A ++ P +
Sbjct: 246 WFGPSATARCIQALSSECEHAGLNVYVTSDGSDVYEDRFRAIASAGGTGAGTSTDVHPTL 305
Query: 172 LVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVI 231
+++ +RLGI + PVY +K +PQS+G+
Sbjct: 306 ILLGIRLGIDRVTPVYWEALKAV---------------------------LKYPQSVGIA 338
Query: 232 GGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STYHCPQASRLHIL 289
GG+P+ + YFIG G+ +LDPH + VY D + +TYH + RLHI
Sbjct: 339 GGRPSSSHYFIGAQGSHFFYLDPH-HTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIK 397
Query: 290 HMDPSIAV 297
MDPS+ +
Sbjct: 398 DMDPSMLI 405
>gi|332232054|ref|XP_003265216.1| PREDICTED: cysteine protease ATG4C isoform 1 [Nomascus leucogenys]
gi|332232056|ref|XP_003265217.1| PREDICTED: cysteine protease ATG4C isoform 2 [Nomascus leucogenys]
Length = 458
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 103/343 (30%), Positives = 153/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D AP+ +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 VYHRKIISWFGDSPLAPFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|395530478|ref|XP_003767321.1| PREDICTED: cysteine protease ATG4C [Sarcophilus harrisii]
Length = 458
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 102/342 (29%), Positives = 153/342 (44%), Gaps = 89/342 (26%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ RRD SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 77 NVEEFRRDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 136
Query: 71 W-------NVNSK--------------------------------------------EEA 79
W N +S+ E
Sbjct: 137 WPDALDVDNSDSESWTSHTVKKLTASLEASLTGERAAQDPSPIKEPPRRGSDDGGGEESC 196
Query: 80 YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWSSI 136
+ KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D I
Sbjct: 197 HRKIVSWFADSPLACFGLHQLIEHGKKSGKTAGDWYGPAVVAHILRKAVEEARHPDLQGI 256
Query: 137 VFHVALDNTLV-VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCY 195
+VA D T+ + + K C++ N + + +++++P+RLG + N Y+ +K
Sbjct: 257 TIYVAQDCTVYKADVIDKQCSSMD--PENTEDKAVIILVPVRLGGERTNMDYLEFVK--- 311
Query: 196 ALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
ILS Y +G+IGGKP + YF G+ + +I++DPH
Sbjct: 312 -----------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDPH 347
Query: 256 TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
Q+ V K+ E ++HCP ++ MDPS V
Sbjct: 348 YCQSFVDVSIKDFPLE-----SFHCPSPKKMSFRKMDPSCTV 384
>gi|296208133|ref|XP_002750954.1| PREDICTED: cysteine protease ATG4C [Callithrix jacchus]
Length = 458
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 153/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGKYSDDHEIRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
K++ F D AP+ +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKVISWFGDSPLAPFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|168693565|ref|NP_001108301.1| uncharacterized protein LOC100137698 [Xenopus laevis]
gi|163915830|gb|AAI57741.1| LOC100137698 protein [Xenopus laevis]
Length = 468
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 90/330 (27%), Positives = 156/330 (47%), Gaps = 78/330 (23%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++++ ++D SR+W TYR+ F + + LTTD GWGCM+R GQM++AQ LL L R+W
Sbjct: 94 EIDRFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLMHLLSREWT 153
Query: 71 W-------------------------------------------NVNSKEEAYLKILKMF 87
W + + I++ F
Sbjct: 154 WPEALYTHFVEMEPIRSSSPSRMPLSSLATSHSASDCWPHAHSSRAPHGNQVHRNIIRWF 213
Query: 88 EDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD-DWSSIVFHVALDNTL 146
D +AP+ +H++ G+ GK G+W+GP+ VA +++K + + + + +V+ D T+
Sbjct: 214 SDHPSAPFGLHRMVALGSIFGKKAGDWYGPSIVAHIIKKAIETSCEVAELSVYVSQDCTV 273
Query: 147 VVNQVKKLCTTN--KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYD 204
+++L + +S + +++++P RLG + NPVY + +K+ +
Sbjct: 274 YKADIEQLFAGDVPHAETSRDAGKAVIILVPARLGGETFNPVYKHCLKEFLRM------- 326
Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
P LG+IGGKP H+LYFIGY N +++LDPH +Q+ Y
Sbjct: 327 --------------------PSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYSQS----Y 362
Query: 265 DKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
++ L+S +HC ++ I MDPS
Sbjct: 363 IDTSRNDFPLES-FHCNTPRKISITRMDPS 391
>gi|392572178|gb|EIW65350.1| hypothetical protein TRAVEDRAFT_33890 [Trametes versicolor
FP-101664 SS1]
Length = 997
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 139/284 (48%), Gaps = 73/284 (25%)
Query: 18 DITSRLWFTYRKGFVPI---------------------------------GDSGLTTDKG 44
D TSR+W TYR F PI G+ G TTD G
Sbjct: 301 DFTSRIWLTYRSQFFPIRDTTLAALDAELMDNPTGVPSSPPTKKWNWPLGGEKGWTTDAG 360
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA----YLKILKMFEDRRTA--PYSIH 98
WGCMLR GQ ++A AL+ LHLGRDW+ + A Y++I+ F D + P+S+H
Sbjct: 361 WGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQIVTWFLDNPSPLCPFSVH 420
Query: 99 QIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV-----KK 153
++AL G GK VG+WFGP+T A ++ L +++ A+D TL + V
Sbjct: 421 RMALVGKDLGKDVGQWFGPSTAAGAIKTLVHAFPEATLGVANAVDGTLYESDVYAASRSV 480
Query: 154 LCTTNKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
+ +T + + W + ++++I +RLGI+ +NP+Y N IK Y
Sbjct: 481 MYSTRRHGHARMDWGDRAVLVLIGIRLGIEGVNPLYYNTIKTLY---------------- 524
Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
TFPQS+G+ GG+P+ + YF+G +++ +LDPH
Sbjct: 525 -----------TFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 557
>gi|410921904|ref|XP_003974423.1| PREDICTED: cysteine protease ATG4C-like [Takifugu rubripes]
Length = 468
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 164/373 (43%), Gaps = 106/373 (28%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E RRD SR+W TYR+ F P+ S LT+D GWGCMLR GQM++AQALL LGRDW
Sbjct: 67 NVEDFRRDFGSRIWLTYREEFPPLPGSSLTSDCGWGCMLRAGQMMLAQALLLHFLGRDWT 126
Query: 71 WN---------------------------------------------VNSKEEA------ 79
W+ S EEA
Sbjct: 127 WSGAMSLQPLDTETWTTSAAKRLVASLESSLQASPGPSDPVVSQRQVAGSGEEAGVHTDG 186
Query: 80 --YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
+ ++ F D +A + +H++ G + GK GEW+GP VA +L+K A+ +
Sbjct: 187 GFHRTLVSWFGDSPSAQFGLHRMVRLGLAMGKRAGEWYGPAVVAHILKKAVEEARDPCLA 246
Query: 135 SIVFHVALDNTL----VVNQ-------------VKKLCTTNKRASSNPQWQPLVLVIPLR 177
I +V+ D T+ V++ V ++ AS++P + +++++P+R
Sbjct: 247 GISSYVSQDCTVYSADVIDSHKASASAAAERPDVTSSSHNSQPASASPDSRAVIILVPVR 306
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG + NP Y N K LS Y +G+IGGKP
Sbjct: 307 LGGEKTNPDYFNLAKS--------------FLSLDY-------------CIGIIGGKPKQ 339
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
A YF+G+ + +I++DPH Q+ V S+ L S +HCP ++ MDPS
Sbjct: 340 ACYFVGFQDDSLIYMDPHYCQSFVDV----STSDFPLQS-FHCPSPKKMPFTKMDPSCTF 394
Query: 298 -VSQRSYSDYKNV 309
RS D++ +
Sbjct: 395 GFYSRSAQDFERI 407
>gi|194764839|ref|XP_001964535.1| GF23235 [Drosophila ananassae]
gi|190614807|gb|EDV30331.1| GF23235 [Drosophila ananassae]
Length = 668
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 100/321 (31%), Positives = 163/321 (50%), Gaps = 65/321 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+ +E RRD SR+W TYR+ F + S T+D GWGCMLR GQM++AQ L+ +GR W
Sbjct: 260 EGIEGFRRDFYSRIWMTYRREFPTMNGSNYTSDCGWGCMLRSGQMLLAQGLICHFMGRTW 319
Query: 70 QWNVNSK------EEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVGEWFGPNTVA 121
+++ S+ + + KI+K F D +++P+SIH + G + GK G+W+GP +V+
Sbjct: 320 RYDSESQLHSTYEDNMHKKIVKWFGDSSSKSSPFSIHALVRLGENLGKKPGDWYGPASVS 379
Query: 122 QVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRA---------SSNPQ- 166
+L+ ++ D+ +I +VA D T+ + ++ C+ + A + PQ
Sbjct: 380 YLLKHALEHAAQENADFDNISIYVAKDCTIYLQDIEDQCSVPEPAPKPNVPWQQAKRPQA 439
Query: 167 ----------WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
W+ L+++IPLRLG +N Y + +K+L ST +
Sbjct: 440 EVSKTEHQQHWKALIVLIPLRLGSDKLNLAYAH---------------CLKLLLSTEH-- 482
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS 276
LG+IGGKP H+LYF+G+ + +I LDPH Q + V + E +
Sbjct: 483 ----------CLGIIGGKPKHSLYFVGFQEDRLIHLDPHYCQEMVDV-----NQENFSLN 527
Query: 277 TYHCPQASRLHILHMDPSIAV 297
++HC +L MDPS +
Sbjct: 528 SFHCKSPRKLKSSKMDPSCCI 548
>gi|335774946|gb|AEH58408.1| cysteine protease ATG4C-like protein, partial [Equus caballus]
Length = 400
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 107/357 (29%), Positives = 161/357 (45%), Gaps = 92/357 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D TSR+W TYR+ F I S LTTD GWGC +R GQM++AQ L+ LGR W
Sbjct: 17 NVEEFRKDFTSRIWLTYREEFPQIEGSTLTTDCGWGCTVRTGQMLLAQGLILHFLGRAWT 76
Query: 71 W----NVNSKE----------------EAYL----------------------------- 81
W N+ + + EA L
Sbjct: 77 WPDALNIENSDFESWTSNTVKKFTASFEASLSEERELKTPTISLKETIGRYSDDHEMRNE 136
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 137 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 196
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K C + AS + + +++++P+RLG + N Y++ +K
Sbjct: 197 GITIYVAQDCTVYSSDVIDKQCAS--MASDHADDKAVIILVPVRLGGERTNTDYLDFVK- 253
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 254 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 287
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
PH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 288 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 339
>gi|403356037|gb|EJY77606.1| Cysteine protease family C54 putative [Oxytricha trifallax]
gi|403376523|gb|EJY88241.1| Cysteine protease family C54 putative [Oxytricha trifallax]
Length = 480
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 94/308 (30%), Positives = 157/308 (50%), Gaps = 50/308 (16%)
Query: 13 EQIRRDITSRLWFTYRKGF-VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD--- 68
+++ R S WFTYR +PIG S +D GWGCM+R GQM++ QA++ H+ D
Sbjct: 148 DKLTRAFKSVTWFTYRNELELPIGSSTYHSDAGWGCMVRTGQMLLFQAMM-RHVFEDNLK 206
Query: 69 --WQWNVNSKEEAYLKILKMFEDR---RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
+ + E YL +L++F+D + +PYSI IA G + G+W+GP ++ V
Sbjct: 207 YEYIEKITEYREEYLNLLRLFQDNGEGQFSPYSIQNIAFQGLKIDRKPGDWYGPQAISIV 266
Query: 124 LRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQD 182
L++L K Y +V L+ + +N +++ S Q + +VIPLRLG+
Sbjct: 267 LKRLTKIYKPVKQFTMYVCLEGNIYLNVIQE--------KSKDWTQSVFIVIPLRLGLNY 318
Query: 183 INPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFI 242
I P Y++ +KK FTFPQ++G+ GG+ N ALYFI
Sbjct: 319 IEPEYLSSVKKV---------------------------FTFPQNVGIAGGRENSALYFI 351
Query: 243 GY--VGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-V 298
G N++I+LDPH +++ + + + +S++HC + ++ + M S+A+
Sbjct: 352 GISDSSNNLIYLDPHLVQKSVPTCNMQTNEQFYQYESSFHCTKFKKMPLNRMCTSVAIGF 411
Query: 299 SQRSYSDY 306
R Y+D+
Sbjct: 412 YIRDYNDF 419
>gi|348529755|ref|XP_003452378.1| PREDICTED: cysteine protease ATG4C-like [Oreochromis niloticus]
Length = 478
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 110/381 (28%), Positives = 161/381 (42%), Gaps = 115/381 (30%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D++ RRD SR+W TYR+ F P+ S LT+D GWGCMLR GQM++AQ L+ LGRDW
Sbjct: 70 DVDAFRRDFASRVWLTYREEFSPLPGSTLTSDCGWGCMLRAGQMMLAQGLMLHFLGRDWT 129
Query: 71 WN------------------------------------------------VNSKEEAYLK 82
W+ + S EEA
Sbjct: 130 WSEALTLQPLDTETWTTTAAKRLVASLEASLQGVPGPSVRSSSPQAQALSLGSAEEADAH 189
Query: 83 ILKM--------FEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYD 131
+ +M F D + P +H++ G + GK G+W+GP VA +L+K A
Sbjct: 190 LKEMYHRTLVSWFGDSPSTPLGLHRLVRLGLTMGKQAGDWYGPAVVAHILKKAVEEAMDP 249
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKR----------------------ASSNPQWQP 169
+ I +V+ D T+ V C R AS+ P+ +
Sbjct: 250 GLACITAYVSQDCTVYSADVVD-CHRAPRAERTSDETPDAPTLPQNDQPAHASTLPESRA 308
Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
+++++P+RLG + NP Y + K ILS Y +G
Sbjct: 309 VIILVPVRLGGEKTNPEYFDFAK--------------SILSLEY-------------CIG 341
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
+IGGKP A YF+G+ + +I++DPH Q+ V S+ L S YHCP ++
Sbjct: 342 IIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDV----STSDFPLQS-YHCPSPKKMPFS 396
Query: 290 HMDPSIAV-VSQRSYSDYKNV 309
MDPS V RS DY+ +
Sbjct: 397 KMDPSCTVGFYSRSVQDYERI 417
>gi|167526339|ref|XP_001747503.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773949|gb|EDQ87583.1| predicted protein [Monosiga brevicollis MX1]
Length = 355
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 85/248 (34%), Positives = 125/248 (50%), Gaps = 37/248 (14%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGL-TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
++IR ++ LWFTYR IGDS TD+GWGC LR GQM++ +AL H RD+
Sbjct: 45 DEIRSRASAFLWFTYRNSEYAIGDSPRHKTDRGWGCTLRVGQMIVGEALQRCHCPRDYDK 104
Query: 72 NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
E A + ILK FEDR S+H +A+ GK G+W P VA VLR
Sbjct: 105 LSYPSEAARMSILKEFEDRPDRVLSVHAMAMQSKFVGKRAGQWHTPTDVAHVLRLAVNEQ 164
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
+ + HVA+D+ +V++ ++KL ++ +L +PLRLGI + I +
Sbjct: 165 EAMGLQVHVAMDSMVVLDDLRKLFRADR---------ATLLFVPLRLGIDIVQAEMIPAV 215
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
K+ F P +LG++GG+P A YFIGY+ ++++
Sbjct: 216 KRF---------------------------FHSPSALGIMGGRPGAAHYFIGYMDHNLLL 248
Query: 252 LDPHTNQN 259
LDPHT Q+
Sbjct: 249 LDPHTTQD 256
>gi|225685095|gb|EEH23379.1| peptidase family C54 [Paracoccidioides brasiliensis Pb03]
Length = 508
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 99/305 (32%), Positives = 149/305 (48%), Gaps = 57/305 (18%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S++W TYR GF I S G TTD GWGCM+R GQ
Sbjct: 148 DFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQS 207
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL L LGRDW+ + +E+ L L +F D AP+SIH+ GAS GK GE
Sbjct: 208 LLASALSILSLGRDWRRGTKTDQESNL--LSLFADDPKAPFSIHRFVEYGASACGKYPGE 265
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L+ + + +V D + V + T ++ P +++
Sbjct: 266 WFGPSATARCIQALSSECKHAGLNVYVTSDGSDVYED--RFRTIASSGATEAGIHPTLIL 323
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ +RLGI + PVY +K D++K +PQS+G+ GG
Sbjct: 324 LGIRLGIDRVTPVYWEALK-----------DVLK----------------YPQSVGIAGG 356
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHILHMD 292
+P+ + YFIG G+ +LDPH + + Q +E++L+S YH + RLHI MD
Sbjct: 357 RPSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNS-YHTRRLRRLHIKDMD 415
Query: 293 PSIAV 297
PS+ +
Sbjct: 416 PSMLI 420
>gi|226294409|gb|EEH49829.1| cysteine protease atg4 [Paracoccidioides brasiliensis Pb18]
Length = 513
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 96/304 (31%), Positives = 146/304 (48%), Gaps = 55/304 (18%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S++W TYR GF I S G TTD GWGCM+R GQ
Sbjct: 153 DFESKIWLTYRSGFPSIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQS 212
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL L LGRDW+ + +E+ L L +F D AP+SIH+ GAS GK GE
Sbjct: 213 LLASALSILSLGRDWRRGTKTDQESNL--LSLFADDPKAPFSIHRFVEYGASACGKYPGE 270
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L+ + + +V D + V + T ++ P +++
Sbjct: 271 WFGPSATARCIQALSSECKHAGLNVYVTSDGSDVYED--RFRTIASSGATEAGIHPTLIL 328
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ +RLGI + PVY +K D++K +PQS+G+ GG
Sbjct: 329 LGIRLGIDRVTPVYWEALK-----------DVLK----------------YPQSVGIAGG 361
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YFIG G+ +LDPH + + Q ++ ++YH + RLHI MDP
Sbjct: 362 RPSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDP 421
Query: 294 SIAV 297
S+ +
Sbjct: 422 SMLI 425
>gi|302674653|ref|XP_003027011.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
gi|300100696|gb|EFI92108.1| hypothetical protein SCHCODRAFT_70973 [Schizophyllum commune H4-8]
Length = 858
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 88/266 (33%), Positives = 131/266 (49%), Gaps = 69/266 (25%)
Query: 18 DITSRLWFTYRKGFVPI------------------------GDSGLTTDKGWGCMLRCGQ 53
D SRLW TYR GF PI G GLT+D GWGCMLR GQ
Sbjct: 160 DFASRLWLTYRSGFAPIRDMALEELEPVRGGALSTLTSALTGRRGLTSDAGWGCMLRTGQ 219
Query: 54 MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAV 111
++A AL+ +GR Y+ ++ +F D + AP+S+H++AL G + GK V
Sbjct: 220 SLLANALVVAWMGR-------GALALYIHLISLFLDSPSPSAPFSVHRMALAGRALGKDV 272
Query: 112 GEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW--QP 169
G+WFGP+T A ++ L + VA+ VV Q ++ +R +W QP
Sbjct: 273 GQWFGPSTAAGAIKALVNA--YPDAGLGVAIAEDGVVYQTQRRQKERER-----EWGDQP 325
Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
+++++ +RLG+ +NP+Y + IK+ Y TFPQSLG
Sbjct: 326 VLVLLGIRLGLDGVNPIYYDTIKQLY---------------------------TFPQSLG 358
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPH 255
+ GG+P+ + YF+G D+ +LDPH
Sbjct: 359 IAGGRPSSSYYFVGAQAGDLFYLDPH 384
>gi|170032510|ref|XP_001844124.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167872594|gb|EDS35977.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 628
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 100/340 (29%), Positives = 157/340 (46%), Gaps = 57/340 (16%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
+ + +E +RD SRLW TYRK F + DS T+D GWGCM+R GQM++AQ L+ LG
Sbjct: 186 VEDEGIEAFKRDFISRLWMTYRKEFQTMDDSNYTSDCGWGCMIRSGQMLLAQGLITHFLG 245
Query: 67 RDWQWNVNS------------KEEAYLKILKMFED--RRTAPYSIHQIALTGASEGKAVG 112
R W+W+ + ++ + KI++ F D RT+P+SIH + G GK G
Sbjct: 246 RGWRWDPSQEGLRLNFDSLQYEDGIHRKIIRWFGDTSSRTSPFSIHTLVALGKEAGKKPG 305
Query: 113 EWFGPNTVAQVLRKLAKY-----DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
+W+GP +VA +LR+ K D I +VA D + + + CT + S P W
Sbjct: 306 DWYGPGSVAHLLRQAVKLAAKEITDLDGINVYVAQDCAVYIQDILDECTVSTTPSVAP-W 364
Query: 168 Q------------PLVLVIPLRLGIQDINPVYINGIKKCYALP----------------- 198
Q P P R+G + + P
Sbjct: 365 QKKMSSAAACTDSPSQATTP-RVGATASCSSSSSPHATGFVAPSDTADESAPGSHWKSLI 423
Query: 199 -ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN 257
+ P+ + L+ YN + + +G+IGG+P H+L+F+GY + +I LDPH
Sbjct: 424 LLVPLRLGTEKLNPIYN-DCLKAMLSLDNCIGIIGGRPKHSLFFVGYQEDKLIHLDPHYC 482
Query: 258 QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
Q++ V + E S++HC ++ + MDPS +
Sbjct: 483 QDMVDV-----NQENFPVSSFHCKSPRKMKLSKMDPSCCI 517
>gi|166990662|sp|A7F045.2|ATG4_SCLS1 RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
Length = 439
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 98/316 (31%), Positives = 147/316 (46%), Gaps = 58/316 (18%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D +++W TYR F I S G T+D GWGCM+R GQ
Sbjct: 106 DFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRSGQS 165
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A ALL L +GR+W+ +S EE KIL +F D APYSIH+ GAS GK GE
Sbjct: 166 LLANALLTLRMGREWRRGSSSNEER--KILSLFADDPRAPYSIHKFVEHGASACGKHPGE 223
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L S + ++ D + V + K S+ ++ P +++
Sbjct: 224 WFGPSAAARCIQALTNSQVESELRVYITGDGSDVYEDT--FMSIAKPNST--KFTPTLIL 279
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLG+ I PVY +K +P QS+G+ GG
Sbjct: 280 VGTRLGLDKITPVYWEALKSSLQMP---------------------------QSVGIAGG 312
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YFIG +D +LDPH + D +D + + H + RLHI MDP
Sbjct: 313 RPSSSHYFIGVQESDFFYLDPHQTRPALPFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDP 372
Query: 294 SIAVVSQ-RSYSDYKN 308
S+ + R +D+K+
Sbjct: 373 SMLIAFLIRDENDWKD 388
>gi|119195519|ref|XP_001248363.1| cysteine protease atg4 [Coccidioides immitis RS]
gi|303321428|ref|XP_003070708.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
SOWgp]
gi|121769827|sp|Q1E5M9.1|ATG4_COCIM RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|240110405|gb|EER28563.1| cysteine protease atg4, putative [Coccidioides posadasii C735 delta
SOWgp]
gi|320040173|gb|EFW22106.1| cysteine protease atg4 [Coccidioides posadasii str. Silveira]
gi|392862420|gb|EAS36938.2| cysteine protease atg4 [Coccidioides immitis RS]
Length = 432
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 97/305 (31%), Positives = 143/305 (46%), Gaps = 62/305 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S+ WFTYR F I S G T D GWGCM+R GQ
Sbjct: 108 DFESKFWFTYRSNFPAIPKSRDPDTPLALTLSVRLRSQFLDTHGFTADTGWGCMIRSGQS 167
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL L+LGRDW+ KEE ++L +F D AP+SIH+ GAS GK GE
Sbjct: 168 LLANALSILNLGRDWRRGSKIKEEC--ELLSLFADNPQAPFSIHRFVDYGASACGKHPGE 225
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLV-VNQVKKLCTTNKRASSNPQWQPLVL 172
WFGP+ A+ + L+ + + +V D + V +Q +++ + +P ++
Sbjct: 226 WFGPSATARCIEALSNECKHTDLNVYVMSDGSDVHEDQFRQIAGPDG-------IRPTLI 278
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
++ +RLGI+ + PVY ++ +PQS+G+ G
Sbjct: 279 LLGVRLGIESVTPVYWEALRAI---------------------------IRYPQSVGIAG 311
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
G+P+ +LYFIG G +LDPH + S + LD TYH + RLHI MD
Sbjct: 312 GRPSSSLYFIGVQGPYFFYLDPHHTRPAVSWNPDSTLSPENLD-TYHTRRLRRLHIREMD 370
Query: 293 PSIAV 297
PS+ +
Sbjct: 371 PSMLI 375
>gi|417401291|gb|JAA47536.1| Putative cysteine protease required for autophagy [Desmodus
rotundus]
Length = 458
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 105/346 (30%), Positives = 153/346 (44%), Gaps = 95/346 (27%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGRYPDDREMQNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 VYHRKIISWFGDSPVALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRA---SSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
I +VA D T+ + V +RA S N + +++++P+RLG + N Y+ I
Sbjct: 255 GITIYVAQDCTVYSSDV----IDKQRASMTSDNTDGKAVIILVPVRLGGERTNTDYLEFI 310
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
K ILS Y +G+IGGKP + YF G+ + +I+
Sbjct: 311 K--------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIY 343
Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+DPH Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 344 MDPHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|212545090|ref|XP_002152699.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
marneffei ATCC 18224]
gi|210065668|gb|EEA19762.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
marneffei ATCC 18224]
Length = 489
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 148/310 (47%), Gaps = 69/310 (22%)
Query: 18 DITSRLWFTYRKGFVPIGDS------------------------GLTTDKGWGCMLRCGQ 53
D S++W TYR F PI S G T+D GWGCM+R GQ
Sbjct: 156 DFESKIWMTYRSNFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIRSGQ 215
Query: 54 MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
M++A AL LGRDW+ ++ EE K+L +F D AP+SIH+ GA GK G
Sbjct: 216 MLLANALAISRLGRDWRRVSHTTEEN--KLLSLFADDPAAPFSIHRFVRHGALYCGKHPG 273
Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
EWFGP+ A ++ L++ + + +V+ D+T V K N+ +P ++
Sbjct: 274 EWFGPSATATCIQALSEEYKVAGMNVYVSSDSTYVYEDKFKAVAYNQPG----HMRPTLI 329
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
++ RLGI I PVY G++ D++K+ PQSLG+ G
Sbjct: 330 LLGTRLGIDRITPVYRKGLE-----------DLLKL----------------PQSLGIAG 362
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQ-----NIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
G+P+ + YFIG + +LDPH + + Y +EQ +DS H + R+H
Sbjct: 363 GRPSSSHYFIGVQNSFFFYLDPHHTRPALPHKVDSAYTQEQ-----VDSC-HTRRLRRIH 416
Query: 288 ILHMDPSIAV 297
I MDPS+ V
Sbjct: 417 IDDMDPSMLV 426
>gi|348586836|ref|XP_003479174.1| PREDICTED: cysteine protease ATG4C-like [Cavia porcellus]
Length = 435
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 100/343 (29%), Positives = 156/343 (45%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++++ R+D SR+W TYR+ F PI S L+TD GWGC LR GQM++AQ L+ LGR W
Sbjct: 53 NVDEFRKDFISRIWLTYREEFPPIEGSALSTDCGWGCTLRTGQMLLAQGLVLHFLGRAWI 112
Query: 71 W-------NVNS---------------------------------------------KEE 78
W N++S ++E
Sbjct: 113 WPDALNIENLDSESWTSHTVKKFAASFEASLSGERQLGTPALSLKETMEKYPNPHEVRDE 172
Query: 79 AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
Y KI+ F D +A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 173 VYHRKIISWFGDSPSALFGLHQLIECGRRSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 232
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V + + R + N + +++++P+RLG + N Y+ +K
Sbjct: 233 GITVYVAQDCTVYNSDVIDKQSAS-RPAGNADDKAVIILVPVRLGGERTNTDYLEFVK-- 289
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+LS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 290 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 324
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 325 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 362
>gi|327264155|ref|XP_003216881.1| PREDICTED: cysteine protease ATG4D-like [Anolis carolinensis]
Length = 585
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 95/346 (27%), Positives = 155/346 (44%), Gaps = 90/346 (26%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+D+E ++D SR+W TYR+ F + + TTD GWGCMLR GQM++AQ L+ LG+DW
Sbjct: 193 EDVEGFQKDFASRIWLTYRRDFQQLEGTMWTTDCGWGCMLRSGQMLLAQGLIVHFLGKDW 252
Query: 70 Q-------------------------------------------WNVNS----------- 75
W + +
Sbjct: 253 TWPDALHTPGLVEMEPMKATHLPYPSTSSSHQGPSIPTDRSRGPWELRAPRHTRSPDELE 312
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK-YDDWS 134
KE + KI+ F DR A + IH++ G S GK G+W+GP+ A ++RK +
Sbjct: 313 KERYHRKIISWFADRPQAHFGIHRLVSLGHSSGKKAGDWYGPSVAAHIIRKAVDCCSEAG 372
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNK-RASSNPQ--WQPLVLVIPLRLGIQDINPVYINGI 191
++V +V+ D T+ V L ++ R + +P W+ +++++P+RLG + NP Y++ +
Sbjct: 373 NLVVYVSQDCTVYKGDVANLANKSEDRTAWDPGAVWKAVIILVPMRLGGEAFNPAYVDCV 432
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
K+ L EF +G+IGGKP H+LYF+GY + +++
Sbjct: 433 KELLKL-----------------------EFC----IGIIGGKPRHSLYFVGYQDDALLY 465
Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
LDPH Q + E ++HC + +DPS +
Sbjct: 466 LDPHYCQPF-----VDTTKENFPLESFHCNSPRKTAFTKVDPSCTI 506
>gi|291398772|ref|XP_002715996.1| PREDICTED: APG4 autophagy 4 homolog C [Oryctolagus cuniculus]
Length = 458
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 105/356 (29%), Positives = 157/356 (44%), Gaps = 90/356 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTICLKETIGKCSEDHETENE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 ICHRKIISWFGDSPLAAFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITVYVAQDCTVYSSDVIDKQSAS-MTSDNTDDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
H Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|298231125|ref|NP_001177213.1| cysteine protease ATG4C [Sus scrofa]
gi|296874486|gb|ADH81748.1| autophagy related 4-like protein C [Sus scrofa]
Length = 458
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 158/357 (44%), Gaps = 92/357 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F + S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQLEGSALTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSNTAKKFTASFEASLSGERELKTPTISLKETIGRYSDDREKQNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPLTLFGLHQLIECGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K C + A N + +++++P+RLG + N Y++ +K
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCAS--MAPDNTDDKAVIILVPVRLGGERTNADYLDFVK- 311
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
PH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|242814606|ref|XP_002486401.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
stipitatus ATCC 10500]
gi|218714740|gb|EED14163.1| autophagy cysteine endopeptidase Atg4, putative [Talaromyces
stipitatus ATCC 10500]
Length = 454
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 100/306 (32%), Positives = 148/306 (48%), Gaps = 60/306 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS------------------------GLTTDKGWGCMLRCGQ 53
D R+W TYR GF PI S G T+D GWGCM+R GQ
Sbjct: 120 DFECRIWMTYRSGFPPIARSEDANAAQAMTLSVRLRSQLTEHHQGFTSDTGWGCMIRSGQ 179
Query: 54 MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
++A AL LGRDW+ NS EE ++L +F D AP+SIH+ GA GK G
Sbjct: 180 SLLANALAISRLGRDWRRGSNSTEEN--RLLSLFADDPAAPFSIHKFVRHGALYCGKHPG 237
Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
EWFGP+ A ++ L+ + + +V+ DNT V K N+ + + +P ++
Sbjct: 238 EWFGPSATATCIQALSDEYKDAGMNVYVSSDNTYVYEDKFKAVAYNQ----SDRMRPTLI 293
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
++ RLGI I PVY G++ D++K+ PQ+LG+ G
Sbjct: 294 LLGTRLGIDRITPVYRKGLE-----------DLLKL----------------PQALGIAG 326
Query: 233 GKPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
G+P+ + YFIG + +LDP HT + +++++DS H + R+HI M
Sbjct: 327 GRPSASHYFIGVQNSFFFYLDPHHTRPALPYKTGDLAYTQEEIDSC-HTRRLRRIHIDDM 385
Query: 292 DPSIAV 297
DPS+ V
Sbjct: 386 DPSMLV 391
>gi|295657177|ref|XP_002789160.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226284504|gb|EEH40070.1| autophagy-related protein 4 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 601
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 95/304 (31%), Positives = 147/304 (48%), Gaps = 55/304 (18%)
Query: 18 DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
D S++W TYR GF +P + G TTD GWGCM+R GQ
Sbjct: 239 DFESKIWLTYRSGFPFIPKSSDPSAASAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQS 298
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL L LGRDW+ + +E+ L L +F D AP+SIH+ GAS GK GE
Sbjct: 299 LLASALSILSLGRDWRRGTKTDQESNL--LSLFADDPKAPFSIHRFVEYGASACGKYPGE 356
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L+ + + +V D + V + T ++ P +++
Sbjct: 357 WFGPSATARCIQALSSECKHAGLNVYVTSDGSDVYED--RFRTIASGGATEAGIHPTLIL 414
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ +RLGI + PVY +K D++K +PQS+G+ GG
Sbjct: 415 LGIRLGIDRVTPVYWEALK-----------DVLK----------------YPQSVGIAGG 447
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YFIG G+ +LDPH + + Q ++ ++YH + RLHI MDP
Sbjct: 448 RPSSSHYFIGAQGSYFFYLDPHHTRPALPYHAPGQVFTEEELNSYHTRRLRRLHIKDMDP 507
Query: 294 SIAV 297
S+ +
Sbjct: 508 SMLI 511
>gi|296415785|ref|XP_002837566.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633439|emb|CAZ81757.1| unnamed protein product [Tuber melanosporum]
Length = 409
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 93/303 (30%), Positives = 137/303 (45%), Gaps = 57/303 (18%)
Query: 18 DITSRLWFTYRKGFVPI---------------------GDSGLTTDKGWGCMLRCGQMVI 56
D S LW TYR F PI G T+D GWGCM+R GQ VI
Sbjct: 89 DFESTLWMTYRSDFKPIPRVADYNDKLTFLTSIRSHLDKAEGFTSDSGWGCMIRSGQAVI 148
Query: 57 AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
A AL L LGR W+ + +EE ++L +F D AP+SIH+ G E GK GEWF
Sbjct: 149 ANALAHLRLGRGWRRGMKPEEEK--RLLALFADDPRAPFSIHKFVRHGEVECGKNPGEWF 206
Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
GP+ A ++ L + + + + N L +K+ N ++P +++
Sbjct: 207 GPSAAAMCIQALTHAYEPAGLRVYQTNSNDLYEEDFRKVAVVNG------VFKPTLVLAG 260
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
+RLGI+ I +Y + C +P Q++G+ GG+P
Sbjct: 261 IRLGIERITNIYYEPLAACLRMP---------------------------QTVGIAGGRP 293
Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
+ + YFI G + +LDPHT + I + QD ++ T H + RLHI MDPS+
Sbjct: 294 SSSHYFIAVQGENFFYLDPHTCRPILPFKENPQDYTEEEVDTCHTRRIRRLHIREMDPSM 353
Query: 296 AVV 298
+
Sbjct: 354 LIA 356
>gi|47222154|emb|CAG11580.1| unnamed protein product [Tetraodon nigroviridis]
Length = 440
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 162/375 (43%), Gaps = 108/375 (28%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E RRD SR+W TYR+ F P+ S LT+D GWGCMLR GQM++AQALL +GRDW
Sbjct: 74 NVEDFRRDFGSRIWLTYREEFPPLPGSTLTSDCGWGCMLRAGQMMLAQALLLHFMGRDWT 133
Query: 71 --------------WNVNSKE--------------------------------------- 77
W ++ +
Sbjct: 134 WSRTMSLQPLDTETWTTSAAKRLVASLESSLQGSPGPSDNRGPQNQAAGSAEEAGAHVEG 193
Query: 78 EAYLKIL-KMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY---DDW 133
EA+ + L F D +A + +H++ G GK GEW+GP VA +L+K +
Sbjct: 194 EAFHRTLVSWFGDSPSAQFGLHRMVHLGLEMGKQAGEWYGPAVVAHILKKAVEEALDPSL 253
Query: 134 SSIVFHVALDNTL------------------VVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
+ I +V+ D T+ + V L N+ AS+ P + +++++P
Sbjct: 254 AGITAYVSQDCTVYSADVIDGHKASTSASPESSDDVTLLSPNNQAASALPDSRAVIILVP 313
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
+RLG + NP Y N K ILS Y +G+IGGKP
Sbjct: 314 VRLGGEKTNPDYFNLAKS--------------ILSLDY-------------CIGIIGGKP 346
Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
A YF+G+ + +I++DPH Q+ V S+ L S +HCP ++ MDPS
Sbjct: 347 KQACYFVGFQDDSLIYMDPHYCQSFVDV----STSDFPLQS-FHCPSPKKMPFTKMDPSC 401
Query: 296 AV-VSQRSYSDYKNV 309
+ RS D++ +
Sbjct: 402 TLGFYSRSAQDFEKI 416
>gi|30410844|ref|NP_116241.2| cysteine protease ATG4C [Homo sapiens]
gi|30410846|ref|NP_835739.1| cysteine protease ATG4C [Homo sapiens]
gi|114556947|ref|XP_001159883.1| PREDICTED: cysteine protease ATG4C isoform 4 [Pan troglodytes]
gi|114556951|ref|XP_001159976.1| PREDICTED: cysteine protease ATG4C isoform 6 [Pan troglodytes]
gi|61211867|sp|Q96DT6.1|ATG4C_HUMAN RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
cysteine endopeptidase; AltName: Full=Autophagin-3;
AltName: Full=Autophagy-related cysteine endopeptidase
3; AltName: Full=Autophagy-related protein 4 homolog C
gi|14625875|emb|CAC43939.1| putative autophagy-related cysteine endopeptidase [Homo sapiens]
gi|21542522|gb|AAH33024.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [Homo sapiens]
gi|27763973|emb|CAC85556.1| Apg4-C protein [Homo sapiens]
gi|119626984|gb|EAX06579.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|119626985|gb|EAX06580.1| ATG4 autophagy related 4 homolog C (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|123983334|gb|ABM83408.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
construct]
gi|123998035|gb|ABM86619.1| ATG4 autophagy related 4 homolog C (S. cerevisiae) [synthetic
construct]
gi|410220598|gb|JAA07518.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410220600|gb|JAA07519.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410267918|gb|JAA21925.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410291226|gb|JAA24213.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410291228|gb|JAA24214.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410335203|gb|JAA36548.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
gi|410335205|gb|JAA36549.1| ATG4 autophagy related 4 homolog C [Pan troglodytes]
Length = 458
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 105/356 (29%), Positives = 157/356 (44%), Gaps = 90/356 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 VYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
H Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|432855098|ref|XP_004068071.1| PREDICTED: cysteine protease ATG4C-like [Oryzias latipes]
Length = 482
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 165/384 (42%), Gaps = 113/384 (29%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
S ++E R+D TSR+W TYR+ F P+ S LTTD GWGC+LR GQM++AQAL+ LG
Sbjct: 70 FSMGNVEAFRKDFTSRVWLTYREEFPPLPGSTLTTDCGWGCLLRAGQMMLAQALVLHFLG 129
Query: 67 RDWQWN---------------------VNSKE---------------------------- 77
RDW W+ V S E
Sbjct: 130 RDWTWSEALTLQPLDTETWTASAAKRLVASLEASLQGSPKNSDRQHSEPQSSSQGSAEEA 189
Query: 78 EAYLK------ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---A 128
EA+LK I+ F D +A +H++ G + GK G W+GP VA +L+K A
Sbjct: 190 EAHLKEMYHRTIISWFGDTSSALLGLHRLVRLGLTMGKNAGNWYGPAVVAHILKKAVEEA 249
Query: 129 KYDDWSSIVFHVALDNTLVVNQVKKL--CTTNKRASSN--------------------PQ 166
+ I +V+ D T+ V + ++AS + P
Sbjct: 250 MDSGLAGITAYVSQDCTVYSADVADCHKPPSARQASVSPPIAGGGPSKEDQPGSASILPD 309
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
Q ++++IP+RLG + INP Y +K ILS Y
Sbjct: 310 SQAVIILIPVRLGGEKINPEYFEFVK--------------NILSVEY------------- 342
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
+G+IGGKP A YF+G+ + +I++DPH Q+ V + + + ++HCP ++
Sbjct: 343 CIGIIGGKPKQACYFVGFQDDSLIYMDPHYCQSFVDVSNGDFPLQ-----SFHCPSPKKI 397
Query: 287 HILHMDPSIAV-VSQRSYSDYKNV 309
MDPS + RS DY +
Sbjct: 398 PFTRMDPSCTIGFYSRSLQDYDRI 421
>gi|194374239|dbj|BAG57015.1| unnamed protein product [Homo sapiens]
Length = 259
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 92/252 (36%), Positives = 130/252 (51%), Gaps = 73/252 (28%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGAS 106
MLRCGQM++AQAL+ HLGRDW W ++ + Y +IL+ F DR+ YSIHQ+A G
Sbjct: 1 MLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVG 60
Query: 107 EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ 166
EGK++GEWFGPNTVAQVL+KLA +D+W+S+ +V++DNT+V
Sbjct: 61 EGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVV------------------- 101
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
I+DI K C LP+S
Sbjct: 102 -------------IEDIK-------KMCRVLPLSA------------------------- 116
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SR 285
G +P +L G+++IFLDPHT Q D E++ D T+HC Q+ R
Sbjct: 117 --DTAGDRPPDSLT-ASNQGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQR 169
Query: 286 LHILHMDPSIAV 297
++IL++DPS+A+
Sbjct: 170 MNILNLDPSVAL 181
>gi|14042698|dbj|BAB55356.1| unnamed protein product [Homo sapiens]
Length = 446
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 105/356 (29%), Positives = 157/356 (44%), Gaps = 90/356 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 VYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
H Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|255945233|ref|XP_002563384.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
gi|166990617|sp|A7KAL5.1|ATG4_PENCW RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|129561973|gb|ABO31075.1| Atg4p [Penicillium chrysogenum]
gi|211588119|emb|CAP86190.1| Pc20g08610 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 401
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 144/304 (47%), Gaps = 58/304 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F PI + G T+D GWGCM+R GQ
Sbjct: 74 DFGSRIWITYRSNFTPIPRTKTPEATSSMTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQS 133
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A L LGRDW+ +EE+ K++ MF D AP+SIH+ GA S GK GE
Sbjct: 134 LLANTFSVLLLGRDWRRGEKVEEES--KLISMFADHPEAPFSIHRFVNRGAESCGKYPGE 191
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L+ + + ++ D + V ++ + QP +++
Sbjct: 192 WFGPSATAKCIQLLSTQSEVPQLRVYLTNDTSDVYEDKFAHVAHDESG----RIQPTLIL 247
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
I RLGI ++ P Y +G+ R T+PQS+G+ GG
Sbjct: 248 IGTRLGIDNVTPAYWDGL---------------------------RAALTYPQSVGIAGG 280
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YF+G + FLDPHT + ++++LDS Y+ + R+HI MDP
Sbjct: 281 RPSASHYFVGAQDCHLFFLDPHTTRPATLYRPDGLYTQEELDS-YYTSRLRRIHIKDMDP 339
Query: 294 SIAV 297
S+ +
Sbjct: 340 SMLI 343
>gi|397475554|ref|XP_003809200.1| PREDICTED: cysteine protease ATG4C isoform 1 [Pan paniscus]
gi|397475556|ref|XP_003809201.1| PREDICTED: cysteine protease ATG4C isoform 2 [Pan paniscus]
Length = 458
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 105/356 (29%), Positives = 157/356 (44%), Gaps = 90/356 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 VYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
H Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|442757637|gb|JAA70977.1| Putative cysteine protease required for autophagy [Ixodes ricinus]
Length = 458
Score = 150 bits (378), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 107/358 (29%), Positives = 157/358 (43%), Gaps = 94/358 (26%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSTLTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 WNV-----NSKEEA---------------------------------------------- 79
W NS E+
Sbjct: 135 WPYALSIENSDSESRTSHTVKKFTASFEAPLSGARELKSPTVSLKETIGRYPDDHEMRNE 194
Query: 80 --YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
+ KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQV--KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
I +VA D T+ + V K+ + AS N + +++++P+RLG + N Y+ IK
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQRASM---ASDNTDDKAVIILVPVRLGGERTNTDYLEFIK 311
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
ILS Y +G+IGGKP + YF G+ + +I++
Sbjct: 312 --------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYM 344
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
DPH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 345 DPHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNIQDFKRA 397
>gi|403257906|ref|XP_003921531.1| PREDICTED: cysteine protease ATG4C [Saimiri boliviensis
boliviensis]
Length = 458
Score = 150 bits (378), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 152/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 MYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|116283594|gb|AAH18678.1| ATG4C protein [Homo sapiens]
Length = 451
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 156/355 (43%), Gaps = 90/355 (25%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
+E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W W
Sbjct: 76 VEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTW 135
Query: 72 ----NV-NSKEEAYL--------------------------------------------- 81
N+ NS E++
Sbjct: 136 PDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEV 195
Query: 82 ---KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWSS 135
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 196 YHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQG 255
Query: 136 IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCY 195
I +VA D T+ + V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 256 ITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGEERTNTDYLEFVK--- 311
Query: 196 ALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
ILS Y +G+IGGKP + YF G+ + +I++DPH
Sbjct: 312 -----------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDPH 347
Query: 256 TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 348 YCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>gi|297664749|ref|XP_002810790.1| PREDICTED: cysteine protease ATG4C [Pongo abelii]
Length = 458
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 152/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 VYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|261195783|ref|XP_002624295.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
gi|239587428|gb|EEQ70071.1| cysteine protease atg4 [Ajellomyces dermatitidis SLH14081]
Length = 494
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 142/305 (46%), Gaps = 54/305 (17%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S++W TYR F I S G TTD GWGCM+R GQ
Sbjct: 122 DFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMIRSGQS 181
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL L LGR+W+ KEE+ L L +F D AP+SIH+ GAS GK GE
Sbjct: 182 LLANALAILFLGREWRRGTKVKEESNL--LSLFADDPRAPFSIHRFVEHGASACGKYPGE 239
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L+ + + +V D + V + + ++ +P +++
Sbjct: 240 WFGPSATARCIQALSSECKHAGLNVYVTSDGSDVYEDRFRAIASGGGTGTSTDIRPTLIL 299
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ +RLGI + PVY +K +PQ++G+ GG
Sbjct: 300 LGIRLGIDRVTPVYWEALKAV---------------------------LKYPQAVGIAGG 332
Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YFIG G+ +LDP HT + +Q + +TYH + RLHI MD
Sbjct: 333 RPSSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELNTYHTRRLRRLHIKDMD 392
Query: 293 PSIAV 297
PS+ +
Sbjct: 393 PSMLI 397
>gi|410918329|ref|XP_003972638.1| PREDICTED: cysteine protease ATG4D-like [Takifugu rubripes]
Length = 499
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 159/369 (43%), Gaps = 99/369 (26%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S ++E+ R SR+W TYRK F + S TTD GWGCMLR GQM++AQ LL + R
Sbjct: 99 SEDEVERFRLAFVSRIWLTYRKEFPQLEGSTWTTDCGWGCMLRSGQMLLAQGLLVHLMPR 158
Query: 68 DWQW----------------------------------NVNSKEEAYL----------KI 83
W W ++ E L K
Sbjct: 159 GWTWPDAQPLTDVDLEVFRPRSPARAGGVPIPSFASPRGPSTPERPLLSEQATKCSRKKR 218
Query: 84 LKMFEDRRTAP----------------YSIHQIALTGASEGKAVGEWFGPNTVAQVLRK- 126
L+ +DR+ P + IHQ+ G S GK G+W+GP VA +LRK
Sbjct: 219 LESVQDRQAEPTHQKLVFWFGDQPTAPFGIHQLVEIGKSAGKKAGDWYGPAIVAHILRKA 278
Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLC-TTNKRASSNP----QWQPLVLVIPLRLGIQ 181
+A+ S+V +VA D T+ V LC T + S+P W+ +++++P+RLG +
Sbjct: 279 VARASAVHSLVVYVAQDCTVYKEDVMHLCDPTPSQTPSDPLSHQAWKSVIILVPVRLGGE 338
Query: 182 DINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYF 241
+NP YI +K L +G+IGGKP H+LYF
Sbjct: 339 CLNPSYIECVKNILKLDC---------------------------CIGIIGGKPKHSLYF 371
Query: 242 IGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQ 300
+G+ +++LDPH Q + V E ++HC ++ MDPS +
Sbjct: 372 VGFQDEQLLYLDPHYCQPVVDVSQVNSSLE-----SFHCNAPKKMPFNRMDPSCTIGFYA 426
Query: 301 RSYSDYKNV 309
+S D++++
Sbjct: 427 KSKKDFESL 435
>gi|239614382|gb|EEQ91369.1| cysteine protease atg4 [Ajellomyces dermatitidis ER-3]
gi|327351393|gb|EGE80250.1| cysteine protease atg4 [Ajellomyces dermatitidis ATCC 18188]
Length = 494
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 146/306 (47%), Gaps = 56/306 (18%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S++W TYR F I S G TTD GWGCM+R GQ
Sbjct: 122 DFESKIWLTYRSSFPLIPKSSDPNAASAMTLGVRLRSQLVDPQGFTTDTGWGCMIRSGQS 181
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL L LGR+W+ KEE+ L L +F D AP+SIH+ GAS GK GE
Sbjct: 182 LLANALAILFLGREWRRGTKVKEESNL--LSLFADDPRAPFSIHRFVEHGASACGKYPGE 239
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L+ + + +V D + V + + ++ +P +++
Sbjct: 240 WFGPSATARCIQALSSECKHAGLNVYVTSDGSDVYEDRFRAIASGGGTGTSTDIRPTLIL 299
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ +RLGI + PVY +K +PQ++G+ GG
Sbjct: 300 LGIRLGIDRVTPVYWEALKAV---------------------------LKYPQAVGIAGG 332
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGC--VYDKEQDSEKKLDSTYHCPQASRLHILHM 291
+P+ + YFIG G+ +LDPH + V +Q ++++L+ TYH + RLHI M
Sbjct: 333 RPSSSHYFIGAQGSHFFYLDPHHTRPALPYHVPVDQQYTDEELN-TYHTRRLRRLHIKDM 391
Query: 292 DPSIAV 297
DPS+ +
Sbjct: 392 DPSMLI 397
>gi|121704590|ref|XP_001270558.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
gi|166990611|sp|A1CJ08.1|ATG4_ASPCL RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|119398704|gb|EAW09132.1| peptidase family C54 protein [Aspergillus clavatus NRRL 1]
Length = 400
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 97/328 (29%), Positives = 151/328 (46%), Gaps = 61/328 (18%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPI-----------------------GDSGLTTDKG 44
HQ E+ D+ SR+W TYR F PI G T+D G
Sbjct: 69 EHQWPEEFLDDVESRIWITYRSNFTPIPKPPNQEANPAMTLTVHLRSQLMDSQGFTSDTG 128
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
WGCM+R GQ ++A A+L L LGRDW+ + +EA ++L F D AP+SIH+ G
Sbjct: 129 WGCMIRSGQSLLANAMLILLLGRDWRRGTEAGKEA--QLLHQFADHPEAPFSIHRFVQHG 186
Query: 105 ASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASS 163
A K GEWFGP+ A+ ++ L S + ++ D+T + + K +
Sbjct: 187 AEFCNKYPGEWFGPSATARCIQALVAQQGSSELRVYIT-DDTADIYEDKFARIAQ---AE 242
Query: 164 NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
+ + P ++++ RLGI + P Y + +K+ LP
Sbjct: 243 HGDFIPTLILVGTRLGIDHVTPAYWDALKEALQLP------------------------- 277
Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA 283
QS+G+ GG+P+ + YFIG G + +LDPH + D + +TYH +
Sbjct: 278 --QSVGIAGGRPSASHYFIGVHGQYLFYLDPHHTRPASLHQDVNDTLTHEEVNTYHTRRL 335
Query: 284 SRLHILHMDPSI----AVVSQRSYSDYK 307
R+HI MDPS+ + S+ ++D+K
Sbjct: 336 RRIHIKDMDPSMLIGFIIRSREDWTDWK 363
>gi|149507363|ref|XP_001514370.1| PREDICTED: cysteine protease ATG4C [Ornithorhynchus anatinus]
Length = 459
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 101/346 (29%), Positives = 153/346 (44%), Gaps = 95/346 (27%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++++ R+D SR+W TYR+ F P+G SGLTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVDEFRKDFVSRIWLTYREEFPPMGASGLTTDCGWGCTLRTGQMLLAQGLVLHFLGRAWT 134
Query: 71 WNV-----NSKEEAYL-------------------------------------------- 81
W NS E++
Sbjct: 135 WPAALDMENSDSESWTSHTVKKLTASFEASWVGERDPRPPSASRNAPRGSGSVRDEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D + +HQ+ G GK G+W+GP VA +LRK ++ D
Sbjct: 195 GFHRKIISWFGDSPRTYFGLHQLTEYGKKSGKTAGDWYGPAVVAHILRKAVEEVRHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNP---QWQPLVLVIPLRLGIQDINPVYINGI 191
+ +VA D T+ + V T RAS++ + +++++P+RLG + N Y+ +
Sbjct: 255 GLTVYVAQDCTVYNSDV----TDKLRASTDSGKTDDKAVLILVPVRLGGERTNIDYLEFV 310
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
K ILS Y +G+IGGKP + YF G+ + +I+
Sbjct: 311 K--------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIY 343
Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+DPH Q+ V K+ E ++HCP ++ MDPS V
Sbjct: 344 MDPHYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFRKMDPSCTV 384
>gi|449551395|gb|EMD42359.1| ATG4-like protein [Ceriporiopsis subvermispora B]
Length = 988
Score = 149 bits (376), Expect = 1e-33, Method: Composition-based stats.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 70/281 (24%)
Query: 18 DITSRLWFTYRKGFVPI--------------------------------GDSGLTTDKGW 45
D TSR+W TYR F PI G+ G T+D GW
Sbjct: 308 DFTSRIWVTYRSQFQPIRDTTLSALELELGESTAVATSPQPKKWNWPLGGEKGWTSDAGW 367
Query: 46 GCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSIHQ 99
GCMLR GQ ++A LL LHLGRDW+ + + + + A Y++IL F D + P+S+H+
Sbjct: 368 GCMLRTGQSLLANTLLHLHLGRDWRRPPYPICTADYATYVQILTWFFDNPSPLCPFSVHR 427
Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN- 158
+AL G GK VG+WFGP+T A ++ L + + VA D+ + + V +N
Sbjct: 428 MALVGKELGKEVGQWFGPSTAAGAIKTLVHAFPEAGLGVSVATDSVIYQSDVYTASRSNL 487
Query: 159 --KRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R + W + +++++ +RLG+ +NP+Y YD +K L
Sbjct: 488 GSPRRNGRSGWGDRAVLVLVGIRLGLDGVNPIY---------------YDTIKAL----- 527
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
+TFPQS+G+ GG+P+ + YF+G +++ +LDPH
Sbjct: 528 -------YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 561
>gi|402854773|ref|XP_003892029.1| PREDICTED: cysteine protease ATG4C isoform 1 [Papio anubis]
gi|402854775|ref|XP_003892030.1| PREDICTED: cysteine protease ATG4C isoform 2 [Papio anubis]
Length = 458
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 151/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTPKISLKETIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNCDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------SILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMAFRKMDPSCTI 384
>gi|345564445|gb|EGX47408.1| hypothetical protein AOL_s00083g501 [Arthrobotrys oligospora ATCC
24927]
Length = 444
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/298 (30%), Positives = 138/298 (46%), Gaps = 55/298 (18%)
Query: 18 DITSRLWFTYRKGFVPI-------------------GDSGLTTDKGWGCMLRCGQMVIAQ 58
D ++ W TYR F PI G T+D GWGCM+R GQ V+A
Sbjct: 114 DFDAKFWMTYRSAFPPIPLSTTSRNMTLATRIRSLADQEGFTSDTGWGCMIRSGQCVLAN 173
Query: 59 ALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASEGKAVGEWFGP 117
A+ L LGRDW+ + +EE + IL +F D AP+S+H G AS G GEWFGP
Sbjct: 174 AISLLKLGRDWRRGKSPQEEQH--ILSLFADDPRAPFSLHNFVKYGEASCGVYPGEWFGP 231
Query: 118 NTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLR 177
+ A+ ++ LA D V+ + + +K+ ++ + P ++++ +R
Sbjct: 232 SATARCIQALAAQHDEGLQVYITGDGGDVYEDAFRKIAISDDGV-----FHPTLVLVGIR 286
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LGI+ + PVY +K + PQS+G+ GG+P+
Sbjct: 287 LGIERVTPVYWEALKSSLMM---------------------------PQSVGIAGGRPSA 319
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
+ YFIG G + +LDPH + + Y K+ D + H + RLH+ MDPS+
Sbjct: 320 SHYFIGVQGQSLFYLDPHNTRPL-LPYRKDSDYTAEEIEFCHTRKLRRLHLREMDPSM 376
>gi|238506146|ref|XP_002384275.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
NRRL3357]
gi|220690389|gb|EED46739.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus flavus
NRRL3357]
Length = 439
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 59/310 (19%)
Query: 13 EQIRRDITSRLWFTYRKGFVPI-----------------------GDSGLTTDKGWGCML 49
E D S++W TYR F PI G T+D GWGCM+
Sbjct: 110 EAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWGCMI 169
Query: 50 RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEG 108
R GQ ++A A+L L LGRDW+ ++EEA ++L +F D AP SIH+ GA S G
Sbjct: 170 RSGQSLLANAMLTLCLGRDWRRGDKAEEEA--RLLSLFADHPDAPLSIHRFVKYGAESCG 227
Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
K GEWFGP+ A+ + L+ +I V + N + V + S + Q
Sbjct: 228 KHPGEWFGPSATARCIEALSA--QCGNIAPRVYVTND--TSDVYEDSFLRVARSGSGSIQ 283
Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
P ++++ RLGI ++ PVY +G+K L PQS+
Sbjct: 284 PTLILLGTRLGIDNVTPVYWDGLKAVLQL---------------------------PQSV 316
Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
G+ GG+P+ + YFIG G +LDPHT + + D S+ ++ STYH + R+H
Sbjct: 317 GIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEI-STYHTRRLRRIH 375
Query: 288 ILHMDPSIAV 297
I MDPS+ +
Sbjct: 376 IQDMDPSMLI 385
>gi|37748391|gb|AAH58981.1| Autophagy-related 4C (yeast) [Mus musculus]
Length = 458
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 100/343 (29%), Positives = 153/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SRLW TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRLWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W-------NVNS---------------------------------------------KEE 78
W N +S + E
Sbjct: 135 WPDALHIENADSDSWTSNTVKKFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNE 194
Query: 79 AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
AY KI+ F D A + +H++ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 AYHRKIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
+ +VA D T+ + V T+ + + + + +++++P+RLG + N Y+ +K
Sbjct: 255 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+LS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|327270876|ref|XP_003220214.1| PREDICTED: cysteine protease ATG4C-like [Anolis carolinensis]
Length = 459
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 100/345 (28%), Positives = 154/345 (44%), Gaps = 91/345 (26%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGRDW
Sbjct: 74 RNVEEFRKDFISRIWLTYREEFPQIKGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDW 133
Query: 70 QW-----NVNSKEEAYL------------------------------------------- 81
W N N + E++
Sbjct: 134 TWPDALVNENPESESWTSHTVKKLTASFEASLIGEKEFKNQSIPPRQIRKRDWGKRESRD 193
Query: 82 -----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDW 133
KI+ F D A + +H++ G GK G+W+GP VA +LRK AK +
Sbjct: 194 EHYHRKIVSWFGDSPLANFGLHRLIEYGNKSGKMAGDWYGPAVVAHLLRKAVEEAKDPEL 253
Query: 134 SSIVFHVALDNTLVVNQVKKL-CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
I +VA D T+ + V ++ C+ + S P + ++++IP+RLG + N Y+ +K
Sbjct: 254 QGITVYVAQDCTVYKSDVVEMQCSL--KDSEKPGAKSVIILIPVRLGGERTNMEYLEFVK 311
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
ILS Y +G++GG+P + YF G+ + +I++
Sbjct: 312 --------------GILSLEY-------------CIGIVGGRPKQSYYFAGFQDDSLIYM 344
Query: 253 DPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
DPH Q+ V K E ++HCP ++ MDPS +
Sbjct: 345 DPHYCQSFVDVSIKNFPLE-----SFHCPSPKKMSFKKMDPSCTI 384
>gi|406862068|gb|EKD15120.1| putative cysteine protease atg4 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 441
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 96/307 (31%), Positives = 144/307 (46%), Gaps = 60/307 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S++W TYR F I S G T+D GWGCM+R GQ
Sbjct: 106 DFESKIWLTYRSQFPAIPKSQDPKALSSMSLSVRLRSQLVDQAGFTSDTGWGCMIRSGQS 165
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL+ L +GRDW+ ++ +E I+ +F D TAPYSIH GA+ GK GE
Sbjct: 166 LLANALVMLRMGRDWRRGSSASQEER-SIISLFADTPTAPYSIHNFVEHGAAACGKHPGE 224
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVL 172
WFGP+ A+ ++ LA + +V D V + K+ + +A + P ++
Sbjct: 225 WFGPSATARCIQALANGHQSPELRVYVTGDGLEVYEDSFMKIAKPDGQA-----FIPTLI 279
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
++ RLG+ I PVY +K PQSLG+ G
Sbjct: 280 LVGTRLGLDKITPVYWEALKSS---------------------------LQIPQSLGIAG 312
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHILHM 291
G+P+ + YFIG G+ +LDPH + + D +D S++ +DS H + R+HI M
Sbjct: 313 GQPSSSHYFIGVQGHHFFYLDPHQTRPALPLPDNIEDYSQEDIDSC-HTRRLRRIHIKEM 371
Query: 292 DPSIAVV 298
DPS+ +
Sbjct: 372 DPSMLIA 378
>gi|383872484|ref|NP_001244816.1| cysteine protease ATG4C [Macaca mulatta]
gi|355745338|gb|EHH49963.1| hypothetical protein EGM_00712 [Macaca fascicularis]
gi|380788509|gb|AFE66130.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
gi|383413101|gb|AFH29764.1| cysteine protease ATG4C isoform 8 [Macaca mulatta]
Length = 458
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 151/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNCDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|147742949|sp|A2XHJ5.1|ATG4A_ORYSI RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|125544166|gb|EAY90305.1| hypothetical protein OsI_11880 [Oryza sativa Indica Group]
Length = 473
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 98/301 (32%), Positives = 146/301 (48%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
D +SR+W TYRKGF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 130 EDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKP 189
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---- 131
Y+ IL MF D +SIH + G S G A G W GP + + + L + +
Sbjct: 190 YSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVRTNREHH 249
Query: 132 -------DWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
++ ++ V+ D + ++ +LC + S W P++L++PL
Sbjct: 250 EAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 307
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ +NP YI +K+ FTFPQSLG++GGKP
Sbjct: 308 VLGLDKLNPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 340
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + V++LDPH Q D D+ + S+YHC L + +DPS+A
Sbjct: 341 TSTYVAGVQDDRVLYLDPHEVQ---LAVDIAADNLEADTSSYHCSTVRDLALDLIDPSLA 397
Query: 297 V 297
+
Sbjct: 398 I 398
>gi|317151014|ref|XP_001824388.2| cysteine protease atg4 [Aspergillus oryzae RIB40]
Length = 402
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 59/310 (19%)
Query: 13 EQIRRDITSRLWFTYRKGFVPI-----------------------GDSGLTTDKGWGCML 49
E D S++W TYR F PI G T+D GWGCM+
Sbjct: 74 EAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWGCMI 133
Query: 50 RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEG 108
R GQ ++A A+L L LGRDW+ ++EEA ++L +F D AP SIH+ GA S G
Sbjct: 134 RSGQSLLANAMLTLCLGRDWRRGDKAEEEA--RLLSLFADHPDAPLSIHRFVKYGAESCG 191
Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
K GEWFGP+ A+ + L+ +I V + N + V + S + Q
Sbjct: 192 KHPGEWFGPSATARCIEALSA--QCGNIAPRVYVTND--TSDVYEDSFLRVARSGSGSIQ 247
Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
P ++++ RLGI ++ PVY +G+K L PQS+
Sbjct: 248 PTLILLGTRLGIDNVTPVYWDGLKAVLQL---------------------------PQSV 280
Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
G+ GG+P+ + YFIG G +LDPHT + + D S+ ++ STYH + R+H
Sbjct: 281 GIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEI-STYHTRRLRRIH 339
Query: 288 ILHMDPSIAV 297
I MDPS+ +
Sbjct: 340 IQDMDPSMLI 349
>gi|166990665|sp|Q2U5B0.2|ATG4_ASPOR RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
Length = 407
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 59/310 (19%)
Query: 13 EQIRRDITSRLWFTYRKGFVPI-----------------------GDSGLTTDKGWGCML 49
E D S++W TYR F PI G T+D GWGCM+
Sbjct: 79 EAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWGCMI 138
Query: 50 RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEG 108
R GQ ++A A+L L LGRDW+ ++EEA ++L +F D AP SIH+ GA S G
Sbjct: 139 RSGQSLLANAMLTLCLGRDWRRGDKAEEEA--RLLSLFADHPDAPLSIHRFVKYGAESCG 196
Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
K GEWFGP+ A+ + L+ +I V + N + V + S + Q
Sbjct: 197 KHPGEWFGPSATARCIEALSA--QCGNIAPRVYVTND--TSDVYEDSFLRVARSGSGSIQ 252
Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
P ++++ RLGI ++ PVY +G+K L PQS+
Sbjct: 253 PTLILLGTRLGIDNVTPVYWDGLKAVLQL---------------------------PQSV 285
Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
G+ GG+P+ + YFIG G +LDPHT + + D S+ ++ STYH + R+H
Sbjct: 286 GIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEI-STYHTRRLRRIH 344
Query: 288 ILHMDPSIAV 297
I MDPS+ +
Sbjct: 345 IQDMDPSMLI 354
>gi|354470829|ref|XP_003497647.1| PREDICTED: cysteine protease ATG4C [Cricetulus griseus]
Length = 458
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 101/343 (29%), Positives = 152/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW- 69
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 70 -------------QWNVNSKE------EAYL----------------------------- 81
W N+ + EA L
Sbjct: 135 WPDALHIENSDSDSWTSNTVKKFTASFEASLSGERELRTTALSLKETIGKYSDDHAVQNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +H++ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V TN S + + + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNSDVIDK-QTNSSTSGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+LS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|222629790|gb|EEE61922.1| hypothetical protein OsJ_16662 [Oryza sativa Japonica Group]
Length = 892
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 144/301 (47%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
D +SR+W TYR+GF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 134 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPLEKP 193
Query: 77 -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
Y+ IL MF D +SIH + G S G A G W GP + + + L +
Sbjct: 194 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 253
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+ + ++ V+ D + ++ +LC + S W P++L++PL
Sbjct: 254 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 311
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ FTFPQSLG++GGKP
Sbjct: 312 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 344
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH Q D D+ + S+YHC L + +DPS+A
Sbjct: 345 TSTYIAGVQDDRALYLDPHEVQ---MAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLA 401
Query: 297 V 297
+
Sbjct: 402 I 402
>gi|393247625|gb|EJD55132.1| hypothetical protein AURDEDRAFT_78065 [Auricularia delicata
TFB-10046 SS5]
Length = 989
Score = 148 bits (374), Expect = 3e-33, Method: Composition-based stats.
Identities = 96/286 (33%), Positives = 138/286 (48%), Gaps = 75/286 (26%)
Query: 18 DITSRLWFTYRKGFVPIGDSGL-----------------------------TTDKGWGCM 48
D TSR+W TYR F PI D L T+D GWGCM
Sbjct: 317 DFTSRVWLTYRSQFSPIHDCPLSACKGKDLESLDANPPKRTFWPGSGEKTWTSDAGWGCM 376
Query: 49 LRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSIHQIAL 102
LR GQ ++A L+ LHLGRDW+ N S E A Y+KIL F D + AP+S+H++A+
Sbjct: 377 LRTGQSLLANTLIHLHLGRDWRRPAINSASPEFATYVKILTWFFDAPSVHAPFSVHRMAM 436
Query: 103 TGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKL-------C 155
+G GK VG+WFGP+T A +R L + + +A+D L +
Sbjct: 437 SGKDFGKDVGQWFGPSTAAGAIRTLVHDFPRAQLGVAIAVDGVLYETDIYSASHYPMSSA 496
Query: 156 TTNKRASS---NP-QW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKIL 209
+RAS +P +W + +++++ RLG+ +NP+Y +K
Sbjct: 497 DGARRASGFKRHPGRWGNRAVLVLVATRLGLDGVNPIYYENLKTI--------------- 541
Query: 210 SSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
FTFPQSLG+ GG+P+ + YF+G GN + +LDPH
Sbjct: 542 ------------FTFPQSLGIAGGRPSSSYYFVGSQGNSLFYLDPH 575
>gi|359495820|ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
gi|296086874|emb|CBI33041.3| unnamed protein product [Vitis vinifera]
Length = 486
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/308 (32%), Positives = 145/308 (47%), Gaps = 50/308 (16%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S L + +D +SR+ TYRKGF IGDS LT+D WGCMLR QM++AQALL +GR
Sbjct: 129 SSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGR 188
Query: 68 DWQWNVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
W+ + ++ Y++IL F D + + +SIH I G + G A G W GP + +
Sbjct: 189 SWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWET 248
Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINP- 185
LA+ +KR ++ + Q L + I + G +D
Sbjct: 249 LAR----------------------------SKREETDLECQSLPMAIYIVSGDEDGERG 280
Query: 186 ----VYIN-GIKKCYALPISPVYDMVKILSSTYNMQ-----TPRY------EFTFPQSLG 229
VYI + C V D IL + PRY FTFPQSLG
Sbjct: 281 GAPVVYIEEASRHCLEFSKGQV-DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLG 339
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
++GGKP + Y +G +LDPH Q+ V D +++ + S+YHC + +
Sbjct: 340 ILGGKPGASTYIVGVQDEKAFYLDPHEAQS---VVDIRRENLEADTSSYHCNIIRHICLD 396
Query: 290 HMDPSIAV 297
+DPS+A+
Sbjct: 397 SIDPSLAI 404
>gi|448112117|ref|XP_004202013.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
gi|359465002|emb|CCE88707.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
Length = 480
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 155/342 (45%), Gaps = 90/342 (26%)
Query: 13 EQIRRDITSRLWFTYRKGFVPI----------------------------------GDSG 38
E++ +DI SR+WFTYR GF PI +
Sbjct: 79 EEMDKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALDNIHGLFNNQN 138
Query: 39 LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
TTD GWGCM+R QM++A A+ L LGR + + +S E+ + I+ MF D AP+S+H
Sbjct: 139 FTTDVGWGCMIRTSQMLLANAIQLLLLGRGFTY-ADSSEKKHSDIIDMFTDDPKAPFSLH 197
Query: 99 QIALTGASEGKAV--GEWFGPNTVAQVLRKLAK--YDDWSSIVFHVALDNTLVV--NQVK 152
+ V GEWFGPN + +++L K +D+ SS F V + + + +++
Sbjct: 198 NFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQFDESSSPRFRVIISESCDIYDDKIG 257
Query: 153 KLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
KL N+ A +++++P+RLG+ ++P Y N +
Sbjct: 258 KLLQENEDAEG-----AILILLPVRLGLNKVSPYYHNSLSSL------------------ 294
Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI--GCVYDKEQDS 270
F+ PQ +G+ GGKP+ + YF G ++++LDPH Q++ +YD
Sbjct: 295 ---------FSSPQLVGIAGGKPSSSYYFFGSHNGNLLYLDPHYPQSVKASSIYD----- 340
Query: 271 EKKLDSTYHCPQASRLHILHMDPS----IAVVSQRSYSDYKN 308
T+H L I MDPS I + S+ Y +K+
Sbjct: 341 ------TFHTHNVQSLKIEDMDPSMLIGILIKSKEDYESFKD 376
>gi|358390472|gb|EHK39877.1| hypothetical protein TRIATDRAFT_208244 [Trichoderma atroviride IMI
206040]
Length = 452
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 92/306 (30%), Positives = 145/306 (47%), Gaps = 60/306 (19%)
Query: 17 RDITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQ 53
D++S+ W TYR GF PI S G ++D GWGCM+R GQ
Sbjct: 122 EDMSSKFWMTYRSGFEPIPKSVDPKAASALSFSMRIKSTLSDSAGFSSDSGWGCMIRSGQ 181
Query: 54 MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
++A + L LGRDW+ + + +EE +L + MF D APYSIH GA+ GK G
Sbjct: 182 SLLATTIGILRLGRDWRRDQSQEEERHL--ISMFADDPRAPYSIHNFVRHGATACGKYPG 239
Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
EWFGP+ AQ ++ L S ++ + + K+ ++ + + P ++
Sbjct: 240 EWFGPSATAQCIQALTSSSGLSLNIYSPNDGQDVYEDSFMKIAKSDGQT-----FNPTLI 294
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
+I RLGI I P+Y + + + PQS+G+ G
Sbjct: 295 LIRTRLGIDKITPIYWDALIAALHM---------------------------PQSVGIAG 327
Query: 233 GKPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
G+P + YF+G G+ + +LDP HT + I D + +E+ ++S H + R+HI M
Sbjct: 328 GRPASSHYFVGSQGSYLFYLDPHHTRKAIPYHDDVTKYTEEDIESC-HTSRLRRIHIKEM 386
Query: 292 DPSIAV 297
DPS+ +
Sbjct: 387 DPSMLI 392
>gi|355558068|gb|EHH14848.1| hypothetical protein EGK_00836 [Macaca mulatta]
Length = 458
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 103/347 (29%), Positives = 152/347 (43%), Gaps = 97/347 (27%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGERELKTSKISLKETIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALD----NTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
I +VA D N V+++ T S N + +++++P+RLG + N Y+
Sbjct: 255 GITIYVAQDFSVYNCDVIDKQSASMT-----SDNADDKAVIILVPVRLGGERTNTDYLEF 309
Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
+K ILS Y +G+IGGKP + YF G+ + +I
Sbjct: 310 VK--------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLI 342
Query: 251 FLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++DPH Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 343 YMDPHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|27763971|emb|CAC85555.1| Apg4-C protein [Mus musculus]
gi|148698944|gb|EDL30891.1| autophagy-related 4C (yeast), isoform CRA_a [Mus musculus]
Length = 458
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/343 (28%), Positives = 153/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W-------NVNS---------------------------------------------KEE 78
W N +S + E
Sbjct: 135 WPDALHIENADSDSWTSNTVKKFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNE 194
Query: 79 AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
AY KI+ F D A + +H++ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 AYHRKIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
+ +VA D T+ + V T+ + + + + +++++P+RLG + N Y+ +K
Sbjct: 255 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+LS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|225543220|ref|NP_778194.3| cysteine protease ATG4C [Mus musculus]
gi|225543224|ref|NP_001139439.1| cysteine protease ATG4C [Mus musculus]
gi|341940254|sp|Q811C2.2|ATG4C_MOUSE RecName: Full=Cysteine protease ATG4C; AltName: Full=AUT-like 3
cysteine endopeptidase; AltName: Full=Autophagin-3;
AltName: Full=Autophagy-related cysteine endopeptidase
3; AltName: Full=Autophagy-related protein 4 homolog C
Length = 458
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/343 (28%), Positives = 153/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W-------NVNS---------------------------------------------KEE 78
W N +S + E
Sbjct: 135 WPDALHIENADSDSWTSNTVKKFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNE 194
Query: 79 AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
AY KI+ F D A + +H++ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 AYHRKIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
+ +VA D T+ + V T+ + + + + +++++P+RLG + N Y+ +K
Sbjct: 255 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+LS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|148698945|gb|EDL30892.1| autophagy-related 4C (yeast), isoform CRA_b [Mus musculus]
Length = 466
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 99/343 (28%), Positives = 153/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 83 NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 142
Query: 71 W-------NVNS---------------------------------------------KEE 78
W N +S + E
Sbjct: 143 WPDALHIENADSDSWTSNTVKKFTASFEASLSGDRELRTPAVSLKETSGKCPDDHTVRNE 202
Query: 79 AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
AY KI+ F D A + +H++ G GK G+W+GP VA +LRK A++ D
Sbjct: 203 AYHRKIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 262
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
+ +VA D T+ + V T+ + + + + +++++P+RLG + N Y+ +K
Sbjct: 263 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 319
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+LS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 320 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 354
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 355 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 392
>gi|224117658|ref|XP_002331599.1| predicted protein [Populus trichocarpa]
gi|222873995|gb|EEF11126.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 145/298 (48%), Gaps = 50/298 (16%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK- 76
D +SR+ TYRKGF I DS LT+D WGCMLR QM++AQALLF LGR W+ ++
Sbjct: 145 DFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQMLVAQALLFHRLGRSWRKPLDKPL 204
Query: 77 EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK------- 129
+ Y++IL +F D ++ +SIH + G + G A G W GP V L +
Sbjct: 205 DREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAGSWVGPYAVCHSWESLVRSRREETN 264
Query: 130 --YDDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLG 179
Y S V+ V+ L + + + C+ + + W P++L++PL LG
Sbjct: 265 LEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHCSEFSKGQED--WTPILLLVPLVLG 322
Query: 180 IQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHAL 239
+ INP YI ++ FTFPQSLG++GGKP +
Sbjct: 323 LDKINPRYIPSLQAT---------------------------FTFPQSLGILGGKPGAST 355
Query: 240 YFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
Y +G + +LDPH Q V + +D + S+YHC + + +DPS+A+
Sbjct: 356 YIVGVQDENAFYLDPHEVQP---VVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAI 410
>gi|115461386|ref|NP_001054293.1| Os04g0682000 [Oryza sativa Japonica Group]
gi|75143803|sp|Q7XPW8.1|ATG4B_ORYSJ RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B
gi|32488637|emb|CAE03430.1| OSJNBa0032F06.13 [Oryza sativa Japonica Group]
gi|82470053|gb|ABB77259.1| autophagy 4 [Oryza sativa Indica Group]
gi|113565864|dbj|BAF16207.1| Os04g0682000 [Oryza sativa Japonica Group]
gi|215697216|dbj|BAG91210.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 478
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 144/301 (47%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
D +SR+W TYR+GF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 134 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPLEKP 193
Query: 77 -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
Y+ IL MF D +SIH + G S G A G W GP + + + L +
Sbjct: 194 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 253
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+ + ++ V+ D + ++ +LC + S W P++L++PL
Sbjct: 254 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 311
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ FTFPQSLG++GGKP
Sbjct: 312 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 344
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH Q D D+ + S+YHC L + +DPS+A
Sbjct: 345 TSTYIAGVQDDRALYLDPHEVQ---MAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLA 401
Query: 297 V 297
+
Sbjct: 402 I 402
>gi|126305934|ref|XP_001364974.1| PREDICTED: cysteine protease ATG4C [Monodelphis domestica]
Length = 460
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 104/352 (29%), Positives = 151/352 (42%), Gaps = 103/352 (29%)
Query: 9 HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
+ ++E+ RRD SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR
Sbjct: 75 YGNVEEFRRDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRA 134
Query: 69 WQW-----------------------------------------------NVNSKEE--- 78
W W S++E
Sbjct: 135 WTWPDALDIENSDSASWTSHTVKKLTASFEASLTGERTPKVPPSILKEPRRTGSEDEEGR 194
Query: 79 ---AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDD 132
+ KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 NELCHRKIISWFGDSPLACFGLHQLIEYGKKSGKTAGDWYGPAVVAHILRKAVEEARHPD 254
Query: 133 WSSIVFHVALDNTLVVNQV-------KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINP 185
I +VA D T+ V L TT +A ++L++P+RLG + N
Sbjct: 255 LQGITIYVAQDCTVYKADVIDKQGISAGLETTEDKA--------IILLVPVRLGGERTNM 306
Query: 186 VYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYV 245
Y++ +K ILS Y +G+IGGKP + YF G+
Sbjct: 307 DYLDFVK--------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQ 339
Query: 246 GNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+ +I++DPH Q+ V K+ E ++HCP ++ MDPS V
Sbjct: 340 DDSLIYMDPHYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFRKMDPSCTV 386
>gi|224994904|gb|ACN76571.1| cysteine proteinase [Triticum aestivum]
Length = 486
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 99/302 (32%), Positives = 148/302 (49%), Gaps = 54/302 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV-NS 75
D +SR+W TYRKGF I DS LT+D WGCM+R QM++AQAL+F HLGR W+ N
Sbjct: 139 EDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPAQNP 198
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
Y++IL +F D +SIH + G S G A G W GP + + + L +
Sbjct: 199 SNPEYIRILHLFGDSEACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQP 258
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLC-TTNKRASSNPQWQPLVLVIP 175
+ + ++ V+ D + ++ +LC NK S+ W P++L++P
Sbjct: 259 EVINRNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKDQSA---WSPILLLVP 315
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
L LG+ INP YI +K+ FTFPQSLG++GGKP
Sbjct: 316 LVLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKP 348
Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
+ Y G + ++LDPH Q + D+ + S+YHC + + +DPS+
Sbjct: 349 GASTYIAGVQDDRALYLDPHEVQ---LAVNIASDNLEADTSSYHCSTVRDMPLDLIDPSL 405
Query: 296 AV 297
A+
Sbjct: 406 AI 407
>gi|378731837|gb|EHY58296.1| autophagy-like protein 4 [Exophiala dermatitidis NIH/UT8656]
Length = 480
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 147/320 (45%), Gaps = 72/320 (22%)
Query: 18 DITSRLWFTYRKGFVPIGDS----------------------GLTTDKGWGCMLRCGQMV 55
D SR+W TYR F PI S G T+D GWGCM+R GQ +
Sbjct: 117 DFESRIWMTYRSNFTPIPRSQEPSRASSMSFSVRLRNLTEREGFTSDTGWGCMIRSGQSL 176
Query: 56 IAQALLFLHLGRDWQWN----------------VNSKEEAYLKILKMFEDRRTAPYSIHQ 99
+A L+ LHLGRDW+ + ++K EA +IL +F D AP+SIH+
Sbjct: 177 LANTLMLLHLGRDWRRDHTHTPTTSDSKPSSSSSSTKREA--EILSLFADSPDAPFSIHR 234
Query: 100 IALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALD-NTLVVNQVKKLCTT 157
GAS GK G+WFGP+ A +R+L+ + + +V + L ++ + +
Sbjct: 235 FVQHGASACGKHPGQWFGPSATASCIRELSTECAAAGLRVYVTPSASELYEDRFRSIAAA 294
Query: 158 NKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQT 217
+ S+P +P +++ +RLG+ I PVY +K
Sbjct: 295 SP---SDPTIKPTLILFGIRLGLDRITPVYHEALKSS----------------------- 328
Query: 218 PRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDST 277
T+PQS+G+ GG+P+ + YF+G G+ +LDPH + + D ++ +T
Sbjct: 329 ----LTYPQSIGIAGGRPSSSHYFVGCQGDLFFYLDPHETRPALPHHASPADYSEEEIAT 384
Query: 278 YHCPQASRLHILHMDPSIAV 297
H + L I MDPS+ +
Sbjct: 385 CHTRRLRGLRINEMDPSMLI 404
>gi|358056752|dbj|GAA97415.1| hypothetical protein E5Q_04093 [Mixia osmundae IAM 14324]
Length = 1202
Score = 147 bits (371), Expect = 6e-33, Method: Composition-based stats.
Identities = 92/295 (31%), Positives = 138/295 (46%), Gaps = 84/295 (28%)
Query: 18 DITSRLWFTYRKGFVPI---------------------------GDSGLTTDKGWGCMLR 50
D TSR+ TYR GF PI + GL+TD GWGCMLR
Sbjct: 548 DFTSRIQLTYRAGFPPIPTTVSNGPATTAFNAVLSSLTGRSPLQANDGLSTDAGWGCMLR 607
Query: 51 CGQMVIAQALLFLHLGRDWQWNVNSKEEA---------------YLKILKMFEDRRT--A 93
GQ ++A AL F+HLGRDW+ +S +E+ Y ++L F D +
Sbjct: 608 TGQSLLANALAFVHLGRDWRRTCSSSDESPDIPEESRSLEHFETYARLLTWFLDDPSPLC 667
Query: 94 PYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
P+S+H+ A+ G + GK +GEWFGP+T A ++ LA +++ V++D T+ + V+
Sbjct: 668 PFSVHRFAVVGKEQGGKEIGEWFGPSTAAGAIKHLASNFAPANLGVAVSVDGTVYRSDVQ 727
Query: 153 KLC-------TTNKRASSNP----QWQ-PLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
T R P WQ P++++I RLG+ +NP+Y IK
Sbjct: 728 AAANPPFSEPATAGRQDPAPSVRTSWQRPVLILINARLGLDKVNPLYYESIKAA------ 781
Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
+FPQS+G+ GG+P+ + YF+G N V ++DPH
Sbjct: 782 ---------------------LSFPQSVGISGGRPSSSYYFVGVQQNSVYYIDPH 815
>gi|218195841|gb|EEC78268.1| hypothetical protein OsI_17962 [Oryza sativa Indica Group]
Length = 912
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 144/301 (47%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
D +SR+W TYR+GF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 134 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKP 193
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
Y+ IL MF D +SIH + G S G A G W GP + + + L +
Sbjct: 194 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 253
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+ + ++ V+ D + ++ +LC + S W P++L++PL
Sbjct: 254 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 311
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ FTFPQSLG++GGKP
Sbjct: 312 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 344
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH Q D D+ + S+YHC L + +DPS+A
Sbjct: 345 TSTYIAGVQDDRALYLDPHEVQ---MAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLA 401
Query: 297 V 297
+
Sbjct: 402 I 402
>gi|147742963|sp|Q2XPP4.2|ATG4B_ORYSI RecName: Full=Cysteine protease ATG4B; AltName:
Full=Autophagy-related protein 4 homolog B;
Short=Protein autophagy 4; AltName: Full=OsAtg4
Length = 478
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 144/301 (47%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
D +SR+W TYR+GF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 134 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKP 193
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
Y+ IL MF D +SIH + G S G A G W GP + + + L +
Sbjct: 194 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 253
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+ + ++ V+ D + ++ +LC + S W P++L++PL
Sbjct: 254 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 311
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ FTFPQSLG++GGKP
Sbjct: 312 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 344
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH Q D D+ + S+YHC L + +DPS+A
Sbjct: 345 TSTYIAGVQDDRALYLDPHEVQ---MAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLA 401
Query: 297 V 297
+
Sbjct: 402 I 402
>gi|224092798|ref|XP_002309707.1| predicted protein [Populus trichocarpa]
gi|222852610|gb|EEE90157.1| predicted protein [Populus trichocarpa]
Length = 481
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 97/304 (31%), Positives = 146/304 (48%), Gaps = 50/304 (16%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L RD +SR+ TYRKGF I DS LT+D WGCMLR QM++AQALLF LGR W+
Sbjct: 138 LAAFHRDFSSRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRK 197
Query: 72 NVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
V+ + Y++IL +F D + +SIH + G + G A G W GP + + LA+
Sbjct: 198 PVDKPLDRDYVEILHLFGDSEASAFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWESLAR- 256
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN----PV 186
+KR +N ++Q L + + + G +D PV
Sbjct: 257 ---------------------------SKREETNLEYQTLPMAVYVVSGCEDGERGGAPV 289
Query: 187 YI--NGIKKCYALPI-----SPVYDMVKILSSTYNMQTPRY------EFTFPQSLGVIGG 233
+ + C +P+ +V ++ + PRY FTFPQSLG++GG
Sbjct: 290 LSIEDAARHCSEFSKGREDWTPILLLVPLVLGLDKIN-PRYIPSLQATFTFPQSLGILGG 348
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
KP + Y +G + +LDPH Q V + +D + S+YHC + + +DP
Sbjct: 349 KPGASTYIVGVQDENAFYLDPHEVQP---VVNFSRDDVEANTSSYHCDVVRHIPLDLIDP 405
Query: 294 SIAV 297
S+A+
Sbjct: 406 SLAI 409
>gi|147862867|emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
Length = 489
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 102/311 (32%), Positives = 148/311 (47%), Gaps = 53/311 (17%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S L + +D +SR+ TYRKGF IGDS LT+D WGCMLR QM++AQALL +GR
Sbjct: 129 SSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGR 188
Query: 68 DWQWNVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
W+ + ++ Y++IL F D + + +SIH I G + G A G W GP + +
Sbjct: 189 SWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWET 248
Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINP- 185
LA+ +KR ++ + Q L + I + G +D
Sbjct: 249 LAR----------------------------SKREETDLECQSLPMAIYIVSGDEDGERG 280
Query: 186 ----VYIN-GIKKCYALPISPVYDMVKILSSTYNMQ-----TPRY------EFTFPQSLG 229
VYI + C V D IL + PRY FTFPQSLG
Sbjct: 281 GAPVVYIEEASRHCLEFSKGQV-DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLG 339
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL-HI 288
++GGKP + Y +G +LDPH Q+ V D +++ + S+YHC +S + HI
Sbjct: 340 ILGGKPGASTYIVGVQDEKAFYLDPHEAQS---VVDIRRENLEADTSSYHCNCSSIIRHI 396
Query: 289 L--HMDPSIAV 297
+DPS+A+
Sbjct: 397 CLDSIDPSLAI 407
>gi|297828133|ref|XP_002881949.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
gi|297327788|gb|EFH58208.1| autophagy 4a [Arabidopsis lyrata subsp. lyrata]
Length = 467
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 94/303 (31%), Positives = 144/303 (47%), Gaps = 49/303 (16%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L ++D +S++ TYR+GF P D+ T+D WGCM+R QM+ AQALLF LGR W
Sbjct: 135 LAAFQQDFSSKILMTYRRGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRSWTK 194
Query: 72 NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA--- 128
E+ YL+ L+ F D ++ +SIH + + G+S G A G W GP + + LA
Sbjct: 195 KSELPEQEYLETLEPFGDSESSAFSIHNLIIAGSSYGLAAGSWVGPYAICRAWESLACKK 254
Query: 129 -KYDDWSSIVFHVALD-------------NTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
K D + +A+ L + K C + S +W P++L++
Sbjct: 255 RKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQS--EWTPILLLV 312
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
PL LG+ +NP YI + FTFPQS+G++GGK
Sbjct: 313 PLVLGLDSVNPRYIPSLIA---------------------------TFTFPQSVGILGGK 345
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
P + Y +G + +LDPH Q + V + D + S+YHC + + +DPS
Sbjct: 346 PGASTYIVGVQEDKGFYLDPHEVQQVVTVNKETPDVDT---SSYHCNVIRYVPLESLDPS 402
Query: 295 IAV 297
+A+
Sbjct: 403 LAL 405
>gi|340518098|gb|EGR48340.1| protease required for autophagy [Trichoderma reesei QM6a]
Length = 450
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 149/316 (47%), Gaps = 61/316 (19%)
Query: 17 RDITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQ 53
D+ ++ W TYR GF PI S G ++D GWGCM+R GQ
Sbjct: 117 EDMAAKFWMTYRSGFEPIPKSVDPRATSALSFSVRIKSTLTDPTGFSSDSGWGCMIRSGQ 176
Query: 54 MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
++A + L LGRDW+ N +EE +++ MF D AP+SIH GA+ GK G
Sbjct: 177 SLLATTIATLQLGRDWRRGKNQQEER--RLISMFADDPRAPFSIHNFVRHGATACGKFPG 234
Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
EWFGP+ AQ ++ L D V+ + + K+ + + + P ++
Sbjct: 235 EWFGPSATAQCIQALTSSSDLDLHVYSPNDGQDVYEDSFMKVAKPDGQ-----DFHPTLI 289
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
+I RLGI I P+Y + L +T M PQS+G+ G
Sbjct: 290 LIRTRLGIDKITPIYW------------------EPLIATLQM---------PQSVGIAG 322
Query: 233 GKPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
G+P+ + YF+G G+ + +LDP HT + + D +++ +DS H + RLH+ M
Sbjct: 323 GRPSSSHYFVGSQGSYLFYLDPHHTRKALPYHEDVANYTDEDIDSC-HTSRLRRLHVKEM 381
Query: 292 DPSIAV-VSQRSYSDY 306
DPS+ + RS SD+
Sbjct: 382 DPSMLIGFLIRSESDW 397
>gi|449676306|ref|XP_002158689.2| PREDICTED: cysteine protease ATG4C-like [Hydra magnipapillata]
Length = 442
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 92/293 (31%), Positives = 142/293 (48%), Gaps = 47/293 (16%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW---QWNV 73
D +S ++ +YRK F + +S LT+D GWGCMLR GQM++A ALL L W +
Sbjct: 102 EDFSSLIYLSYRKHFSQLANSNLTSDSGWGCMLRTGQMLLANALLIHMLKEGWRISERKY 161
Query: 74 NSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLR---KLA 128
K Y IL+ F D + +P+S+H++ G+ K GEW+GP +VA L L
Sbjct: 162 TEKNYIYRMILRFFNDENSDNSPFSLHELVRIGS---KKPGEWYGPTSVAHTLSAAVNLT 218
Query: 129 KYDDWSSIVFHVALDNTL----VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN 184
+ + +VA D T+ V++ K K+ W+ +++++P+RLG +N
Sbjct: 219 SHPVLDTFRVYVANDCTVYIKDVISTSTKCKNCTKKTCQEKFWRSMLILVPIRLGSDGLN 278
Query: 185 PVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGY 244
P+YI +K AL T +G+IGG+P H+LYF+G+
Sbjct: 279 PIYIPCLK---AL------------------------LTLDYCVGIIGGRPKHSLYFVGF 311
Query: 245 VGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
G +I LDPH Q + +E E ++ C ++ MDPS AV
Sbjct: 312 QGKKLINLDPHYLQEYVDMTTQEFPVE-----SFRCHYPKKMAFKKMDPSCAV 359
>gi|452004375|gb|EMD96831.1| hypothetical protein COCHEDRAFT_1123524 [Cochliobolus
heterostrophus C5]
Length = 471
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 129/262 (49%), Gaps = 55/262 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
D SR+W TYR GF+ I S G T+D G+GCM+R GQ ++
Sbjct: 99 DFESRIWMTYRSGFMAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCMIRSGQSIL 158
Query: 57 AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
A AL L LGRDW++ + + +IL +F D AP+SIH+ GA+ GK GEWF
Sbjct: 159 ANALQILRLGRDWRYQDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAVCGKYPGEWF 218
Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVLVI 174
GP+ A+ ++ LA + + +V+ D V +++K++ + + QWQP ++++
Sbjct: 219 GPSAAARCIQDLANKHREAGLRVYVSGDGADVYEDKLKEVAIDD-----DGQWQPTLILV 273
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
RLGI I PVY +K + QS+G+ GG+
Sbjct: 274 GTRLGIDKITPVYWEALKASLQMK---------------------------QSIGIAGGR 306
Query: 235 PNHALYFIGYVGNDVIFLDPHT 256
P+ + YF+ GN+ +LDPH+
Sbjct: 307 PSASHYFVATQGNNFFYLDPHS 328
>gi|74147895|dbj|BAE22307.1| unnamed protein product [Mus musculus]
Length = 458
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 153/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W-------NVNS---------------------------------------------KEE 78
W N +S + E
Sbjct: 135 WPDALHIENADSDSWTSNTVKKFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNE 194
Query: 79 AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
AY KI+ F + A + +H++ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 AYHRKIISWFGNSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
+ +VA D T+ + V T+ + + + + +++++P+RLG + N Y+ +K
Sbjct: 255 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+LS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|224994902|gb|ACN76570.1| cysteine proteinase [Triticum aestivum]
Length = 484
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/301 (31%), Positives = 147/301 (48%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV-NS 75
D +SR+W TYRKGF I DS LT+D WGCM+R QM++AQAL+F HLGR W+ N
Sbjct: 137 EDFSSRVWITYRKGFDVISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPAQNP 196
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
+ + +IL +F D +SIH + G S G A G W GP + + + L +
Sbjct: 197 SDPEHTRILHLFGDSEVCAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQP 256
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+ + +++ V+ D + ++ +LC + S W P++L++PL
Sbjct: 257 EVINRNESFPMVLYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKGQS--AWSPILLLVPL 314
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ FTFPQSLG++GGKP
Sbjct: 315 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 347
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH Q + D+ + S+YHC + + +DPS+A
Sbjct: 348 ASTYIAGVQDDRALYLDPHEVQ---LAVNIASDNLEADTSSYHCSTVRDMPLDLIDPSLA 404
Query: 297 V 297
+
Sbjct: 405 I 405
>gi|342877133|gb|EGU78640.1| hypothetical protein FOXB_10826 [Fusarium oxysporum Fo5176]
Length = 449
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 94/318 (29%), Positives = 144/318 (45%), Gaps = 62/318 (19%)
Query: 18 DITSRLWFTYRKGFVPI----------------------GD-SGLTTDKGWGCMLRCGQM 54
D SR+W TYR F PI GD S ++D GWGCM+R GQ
Sbjct: 117 DFESRIWMTYRSEFDPIPRSTNPQATSSLSLSMRLKSQLGDQSPFSSDSGWGCMIRSGQS 176
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A + + LGRDW+ + +EE +ILK F D APYSIH GAS GK GE
Sbjct: 177 LLANTIALVRLGRDWRQGQSLEEEC--RILKDFADDPRAPYSIHSFVRHGASACGKYPGE 234
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ LA + S V+ + + K+ A + P +++
Sbjct: 235 WFGPSATARCIQALANSHEPSIRVYSTGDGPDVYEDDFMKIANPTGEA-----FHPTLVL 289
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLG+ I PVY + +P QS+G+ GG
Sbjct: 290 VGTRLGLDKITPVYWEALIAALQMP---------------------------QSVGIAGG 322
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YFIG G+ + +LDPH + ++ D + + H + R+H+ MDP
Sbjct: 323 RPSSSHYFIGSQGSFLFYLDPHHTRPALPYHENPMDYTSEEIESCHTARLRRIHVREMDP 382
Query: 294 SI----AVVSQRSYSDYK 307
S+ + S+ + D+K
Sbjct: 383 SMLIGFLIRSEEDWQDWK 400
>gi|166990618|sp|A7KAI3.1|ATG4_PICAN RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|129714817|gb|ABO31288.1| Atg4p [Ogataea angusta]
Length = 509
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 154/323 (47%), Gaps = 74/323 (22%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDS------------------------GLTTDKGWGCML 49
+ RD+ SR+W TYR GF I + G TTD GWGCM+
Sbjct: 73 EFLRDVHSRIWLTYRSGFPLIKRAEDGPSPLSFGSLIRGTVDLATVTKGFTTDAGWGCMI 132
Query: 50 RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-EG 108
R Q ++A +LL L LGR W+++ + + +I+ F D TAP+SIH GA+ G
Sbjct: 133 RTSQSLLANSLLQLRLGRGWRYDQTRECAKHAEIVSWFVDIPTAPFSIHNFVEQGANCAG 192
Query: 109 KAVGEWFGPNTVAQVLRKL--AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ 166
K GEWFGP+ A+ ++ L A YD V+ A + +++ +L A +
Sbjct: 193 KKPGEWFGPSAAARSIQVLCEANYDKTGLKVYFTA-SGDIYEDELFEL------AQQGAE 245
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
+P++++ +RLG++++NP+Y + +KK +PQ
Sbjct: 246 LRPVLILAGIRLGVKNVNPLYWDFLKKTLG---------------------------WPQ 278
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK------------EQDSEKKL 274
S+G+ GG+P+ + YF G+ G+ + +LDPH Q + + E +S L
Sbjct: 279 SVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHESPDPNHYVEVESGLDL 338
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
DS H + +LH+ MDPS+ V
Sbjct: 339 DSV-HTNKIRKLHLDQMDPSMLV 360
>gi|216963242|gb|ACJ73913.1| autophagy-related 4a variant 2 [Zea mays]
Length = 429
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 95/301 (31%), Positives = 145/301 (48%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
D +SR+W TYRKGF I DS LT+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 150 EDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPPEKP 209
Query: 77 -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
Y+ +L +F D +SIH + G + G A G W GP + + + L +
Sbjct: 210 YNPDYIGVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIRTNREQA 269
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+++ ++ V+ D + ++ +LC+ + S W P++L++PL
Sbjct: 270 DAVDGKENFPMALYVVSGDEDGERGGAPVVCIDVAAQLCSDFNKGPST--WSPILLLVPL 327
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ F FPQSLG++GGKP
Sbjct: 328 VLGLDKINPRYIPLLKE---------------------------TFMFPQSLGILGGKPG 360
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH Q D D+ + S+YHC L + +DPS+A
Sbjct: 361 TSTYIAGVQDDRALYLDPHEVQ---MTVDIALDNLEADTSSYHCSVVRALALEQIDPSLA 417
Query: 297 V 297
+
Sbjct: 418 I 418
>gi|221137004|ref|NP_001137488.1| autophagy-related 4 [Zea mays]
gi|195620628|gb|ACG32144.1| cysteine protease ATG4B [Zea mays]
gi|216963236|gb|ACJ73912.1| autophagy-related 4 variant 1 [Zea mays]
gi|219886349|gb|ACL53549.1| unknown [Zea mays]
gi|414584729|tpg|DAA35300.1| TPA: autophagy 4a variant 2 isoform 1 [Zea mays]
gi|414584730|tpg|DAA35301.1| TPA: autophagy 4a variant 2 isoform 2 [Zea mays]
Length = 492
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 95/301 (31%), Positives = 145/301 (48%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
D +SR+W TYRKGF I DS LT+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 150 EDFSSRIWITYRKGFDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPPEKP 209
Query: 77 -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
Y+ +L +F D +SIH + G + G A G W GP + + + L +
Sbjct: 210 YNPDYIGVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIRTNREQA 269
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+++ ++ V+ D + ++ +LC+ + S W P++L++PL
Sbjct: 270 DAVDGKENFPMALYVVSGDEDGERGGAPVVCIDVAAQLCSDFNKGPST--WSPILLLVPL 327
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ F FPQSLG++GGKP
Sbjct: 328 VLGLDKINPRYIPLLKE---------------------------TFMFPQSLGILGGKPG 360
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH Q D D+ + S+YHC L + +DPS+A
Sbjct: 361 TSTYIAGVQDDRALYLDPHEVQ---MTVDIALDNLEADTSSYHCSVVRALALEQIDPSLA 417
Query: 297 V 297
+
Sbjct: 418 I 418
>gi|410967384|ref|XP_003990200.1| PREDICTED: cysteine protease ATG4C [Felis catus]
Length = 459
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 158/357 (44%), Gaps = 91/357 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSNTVKKFTASFEASLSGERELKTPAVSQKETIRRYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V K CT+ AS N + ++++IP+RLG + N Y++ +K
Sbjct: 255 GITIYVAQDCTVYSSDVIDKQCTS--MASDNTDDKAVIILIPVRLGGERTNTDYLDFVKG 312
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
I ++V +L + KP + YF G+ + +I++D
Sbjct: 313 -----ILRALNIVWVL---------------------LVAKPKQSYYFAGFQDDSLIYMD 346
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
PH Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 347 PHYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 398
>gi|255576671|ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B, putative [Ricinus communis]
Length = 489
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 96/304 (31%), Positives = 146/304 (48%), Gaps = 50/304 (16%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L + D +SR+ TYR+GF IGDS +D GWGCMLR QM++AQALLF LGR W
Sbjct: 134 LAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQMLVAQALLFHKLGRAWTK 193
Query: 72 NVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK- 129
++AY++IL +F D AP+SIH + G + A G W GP + + LA+
Sbjct: 194 PFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGSWVGPYAMCRSWESLARS 253
Query: 130 --------YDDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
Y V+ V+ D + + + C R ++ W P++L+
Sbjct: 254 KREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCLEFSRGQAD--WTPILLL 311
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PL LG+ +NP YI ++ FTF QSLG++GG
Sbjct: 312 VPLVLGLDKVNPRYIPSLQAT---------------------------FTFSQSLGIMGG 344
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
KP + Y +G ++ +LDPH Q+ V + +D + S+YH + + +DP
Sbjct: 345 KPGASTYIVGVQDDNAFYLDPHEVQS---VVNIGRDDIEADTSSYHSDIVRHIPLHSIDP 401
Query: 294 SIAV 297
S+A+
Sbjct: 402 SLAI 405
>gi|353227348|emb|CCA77858.1| hypothetical protein PIIN_00505 [Piriformospora indica DSM 11827]
Length = 1257
Score = 144 bits (364), Expect = 4e-32, Method: Composition-based stats.
Identities = 92/305 (30%), Positives = 142/305 (46%), Gaps = 94/305 (30%)
Query: 18 DITSRLWFTYRKGFVPI----------------------------------------GDS 37
D TSR+W TYR F PI G+
Sbjct: 320 DYTSRVWLTYRNTFPPIRDTALSCLEPVASRSTHNNSSSTDISQPLPSPSKPRWPWSGEK 379
Query: 38 GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE----AYLKILKMFEDRRT- 92
G T+D GWGCMLR GQ ++A AL+ LHL R W+ + Y++IL F D +
Sbjct: 380 GWTSDAGWGCMLRTGQSLLANALIHLHLSRSWRRPTHPSYSPDYVQYVRILTWFLDNPSP 439
Query: 93 -APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV 151
AP+ IH++AL G GK VG WFGP+T A +++L + + + +A+D+ + + V
Sbjct: 440 LAPFGIHRMALAGKELGKEVGSWFGPSTAAGAIKRLVGEFEDAGLEVALAVDSVVYQSDV 499
Query: 152 -----------------KKLCTTN--KRASSNPQW--QPLVLVIPLRLGIQDINPVYING 190
K + T+ K+ P+W +P+++++ +RLGI +NP+Y
Sbjct: 500 YAASAASRNQNGVEGDSKTVGTSKSRKKGQGPPKWGNRPVLILVGIRLGIDGVNPIY--- 556
Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
Y+ VK L FTFPQ++G+ GG+P+ + YF+G G+ +
Sbjct: 557 ------------YESVKTL------------FTFPQTVGIAGGRPSSSYYFVGAQGDSLF 592
Query: 251 FLDPH 255
+LDPH
Sbjct: 593 YLDPH 597
>gi|147905876|ref|NP_001088249.1| cysteine protease ATG4C [Xenopus laevis]
gi|61211751|sp|Q5XH30.1|ATG4C_XENLA RecName: Full=Cysteine protease ATG4C; AltName:
Full=Autophagy-related protein 4 homolog C
gi|54038152|gb|AAH84245.1| LOC495080 protein [Xenopus laevis]
Length = 450
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 101/340 (29%), Positives = 149/340 (43%), Gaps = 93/340 (27%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++++ R+D SR+W TYRK F I S TTD GWGC LR GQM++AQ LL LGRDW
Sbjct: 76 NVDEFRKDFISRIWLTYRKEFPQIESSSWTTDCGWGCTLRTGQMLLAQGLLVHFLGRDWT 135
Query: 71 WNV-------------------------------------------NSK-----EEAYLK 82
W NS+ E+ + K
Sbjct: 136 WTEALDIFCSESDFWTANTARKLDPSLEKSSPENEEYVSLGKQPLQNSEKKRYSEDLHRK 195
Query: 83 ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---DWSSIVFH 139
I+ F D A + +HQ+ G + GK G+W+GP V+ +LRK + + I +
Sbjct: 196 IISWFADYPLAYFGLHQLVKLGKNSGKVAGDWYGPAVVSHLLRKAIEESSDPELQGITIY 255
Query: 140 VALDNTLVVNQVKKL-CTT-NKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYAL 197
VA D T+ V L C N++A +V+++P+RLG + N Y +K +L
Sbjct: 256 VAQDCTIYNADVYDLQCNKGNEKA--------VVILVPVRLGGERTNMEYFEYVKGILSL 307
Query: 198 PISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN 257
EF +G+IGGKP + YF+G+ + +I++DPH
Sbjct: 308 -----------------------EFC----IGIIGGKPKQSYYFVGFQDDSLIYMDPHYC 340
Query: 258 QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
Q+ V K E ++HCP ++ MDPS V
Sbjct: 341 QSFVDVSIKNFPLE-----SFHCPSPKKMSFKKMDPSCTV 375
>gi|451855330|gb|EMD68622.1| hypothetical protein COCSADRAFT_79257 [Cochliobolus sativus ND90Pr]
Length = 473
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 82/262 (31%), Positives = 128/262 (48%), Gaps = 55/262 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
D SR+W TYR GF I S G T+D G+GCM+R GQ ++
Sbjct: 99 DFESRIWMTYRSGFTAIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCMIRSGQSIL 158
Query: 57 AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
A AL L LGRDW++ + + +IL +F D AP+SIH+ GA+ GK GEWF
Sbjct: 159 ANALQILRLGRDWRYQDKPTAKEHCEILSLFADDPRAPFSIHRFVEHGAAVCGKYPGEWF 218
Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVLVI 174
GP+ A+ ++ LA + + +V+ D V +++K++ + + +WQP ++++
Sbjct: 219 GPSAAARCIQDLANKHREAGLRVYVSGDGADVYEDKLKEVAIDD-----DGEWQPTLILV 273
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
RLGI I PVY +K + QS+G+ GG+
Sbjct: 274 GTRLGIDKITPVYWEALKASLQMK---------------------------QSIGIAGGR 306
Query: 235 PNHALYFIGYVGNDVIFLDPHT 256
P+ + YF+ GN+ +LDPH+
Sbjct: 307 PSASHYFVATQGNNFFYLDPHS 328
>gi|326925485|ref|XP_003208945.1| PREDICTED: cysteine protease ATG4C-like [Meleagris gallopavo]
Length = 458
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 152/343 (44%), Gaps = 90/343 (26%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWV 134
Query: 71 W-------NVNS--------------------------------------------KEEA 79
W N +S ++E
Sbjct: 135 WPDALDIENSDSESWTAHTVKKLTASLEASLTAEREPRILSNHQERIRRNCGDGEMRDEV 194
Query: 80 Y-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWSS 135
Y KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A+ +
Sbjct: 195 YHRKIISWFGDSPLAAFGLHQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQG 254
Query: 136 IVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
+ +VA D T+ + V + C+ ++ + +++++P+RLG + N Y+ +K
Sbjct: 255 VTVYVAQDCTVYSSDVIDRQCSFMDSGETDT--KAVIILVPVRLGGERTNMDYLEFVK-- 310
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 311 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 345
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E ++HCP ++ MDPS +
Sbjct: 346 HYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFKKMDPSCTI 383
>gi|301104974|ref|XP_002901571.1| cysteine protease family C54, putative [Phytophthora infestans
T30-4]
gi|262100575|gb|EEY58627.1| cysteine protease family C54, putative [Phytophthora infestans
T30-4]
Length = 392
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 87/269 (32%), Positives = 132/269 (49%), Gaps = 45/269 (16%)
Query: 5 NKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH 64
+ ++ D + +R S LWFTYR+ + + T+D GWGCMLR QM++ QAL
Sbjct: 31 DDVAAVDFDAYKRSFESILWFTYRRDYPAMTPYEHTSDAGWGCMLRSAQMLLGQALQRRL 90
Query: 65 LGRDW------QWNVNSK-EEAYLKILKMFEDRRTAP--YSIHQIALTGASEGKAVGEWF 115
LGRDW + ++++ E Y+++L+ F D YSIHQ+ G K GEW+
Sbjct: 91 LGRDWRLPALFETEIDARLPETYVQLLRWFADSPDVECRYSIHQMVKLGVQYDKLPGEWY 150
Query: 116 GPNTVAQVLRKLA---KYDDWSSIVFHVALDNTLVVNQVKKLCTTN-----KRASSNPQW 167
GP T AQVLR L + + + +V + + + V KLC + W
Sbjct: 151 GPTTAAQVLRDLVNLHRREFGGELSMYVPQEGVVYSDDVAKLCFFDPLLHPPTTEDKSDW 210
Query: 168 Q-PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
L+++IPLRLG+ +N Y+ I+K +A FPQ
Sbjct: 211 STALLILIPLRLGLDQVNERYVPAIQKSFA---------------------------FPQ 243
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
S+G+IGGK H++YF+G + + LDPH
Sbjct: 244 SVGIIGGKKGHSVYFVGTQQDQLHLLDPH 272
>gi|357166768|ref|XP_003580841.1| PREDICTED: cysteine protease ATG4B-like [Brachypodium distachyon]
Length = 493
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 94/301 (31%), Positives = 144/301 (47%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
D +SR+W TYRKGF I DS T+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 146 EDFSSRIWVTYRKGFDAISDSKFTSDVNWGCMVRSSQMLVAQALMFHHLGRSWRKPSQKP 205
Query: 77 -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
Y++IL +F D +S+H + G S G A G W GP + + + L +
Sbjct: 206 CNPEYIRILHLFGDSEVCAFSVHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQP 265
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+ + ++ V+ D + ++ +LC + S W P++L++PL
Sbjct: 266 EVSNGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCYDFNKDQST--WSPILLLVPL 323
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ FTFPQSLG++GGKP
Sbjct: 324 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 356
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH Q + D++ S+YHC + + +DPS+A
Sbjct: 357 TSTYIAGIQDDRALYLDPHDVQMAVNIASDNLDADT---SSYHCSTVRDMALDLLDPSLA 413
Query: 297 V 297
+
Sbjct: 414 I 414
>gi|443893810|dbj|GAC71266.1| cysteine protease [Pseudozyma antarctica T-34]
Length = 1509
Score = 143 bits (361), Expect = 8e-32, Method: Composition-based stats.
Identities = 91/336 (27%), Positives = 151/336 (44%), Gaps = 104/336 (30%)
Query: 37 SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE------------------ 78
+GLTTD GWGCMLR GQ ++A AL+ +HLGR WQ K +
Sbjct: 774 AGLTTDSGWGCMLRTGQSLLANALINVHLGRSWQRETAPKSQIEFFEELANASLDASAEN 833
Query: 79 --------------AYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
Y++IL F D + P+ +H++A G GK VGEWFGP+T A
Sbjct: 834 QSLASWRERRARHATYIRILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAG 893
Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKK-------------LCTTNKRASSNPQWQP 169
+++L + I +A D +++V+ + + N+RA + +P
Sbjct: 894 AIKQLVFDFPEAGIAVELAHDGVFYLDEVRAAASASTGKSRASGMLSGNRRAETAVWRRP 953
Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
++++I +RLG++ +NP+Y +K F+FPQS+G
Sbjct: 954 VLILIGIRLGLETVNPIYYESVKAT---------------------------FSFPQSVG 986
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQ--------------------NIGCVY---DK 266
+ GG+P+ + YF+G+ GN + +LDPH + ++ Y D+
Sbjct: 987 IAGGRPSSSYYFMGHQGNSLFYLDPHNVRPAVPLRYPPTTFPAAAPSRFDVSHRYALEDR 1046
Query: 267 EQDSE-------KKLDSTYHCPQASRLHILHMDPSI 295
+ + E + ST+HC + R+ I +DPS+
Sbjct: 1047 DDEDEWWSHAYTEAQTSTFHCEKVRRMPIKSLDPSM 1082
>gi|449508713|ref|XP_002198788.2| PREDICTED: cysteine protease ATG4C [Taeniopygia guttata]
Length = 456
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 97/344 (28%), Positives = 148/344 (43%), Gaps = 94/344 (27%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWV 134
Query: 71 W-------------------------------------------------NVNSKEEAY- 80
W + E Y
Sbjct: 135 WPEALDMESCDWESWTSSTVRKLTASLEASLTAERDPKVLARPPARRDWDGTEKRNEVYH 194
Query: 81 LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWSSIV 137
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A+ + +
Sbjct: 195 RKIISWFGDSPLAAFGLHQLIEYGKKSGKMAGDWYGPAVVAHILRKAVEEARDPELQGVT 254
Query: 138 FHVALDNTL----VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
+VA D T+ V+++ L + K + + ++++ P+RLG + N Y+ +K
Sbjct: 255 VYVAQDCTVYSSDVIDRQCSLVDSGKAGT-----KAVIILFPVRLGGERTNTDYLEFVK- 308
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 309 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 342
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
PH Q+ V K+ E ++HCP ++ MDPS +
Sbjct: 343 PHYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFKKMDPSCTI 381
>gi|157818033|ref|NP_001101418.1| cysteine protease ATG4C [Rattus norvegicus]
gi|149044549|gb|EDL97808.1| similar to APG4 autophagy 4 homolog C [Rattus norvegicus]
Length = 458
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 152/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W---------------------------------------NVNSKEEA------------ 79
W V+ KE +
Sbjct: 135 WPDALHIESSDSDSWTSNTIHKFTASFEASLSGERELRTPAVSLKETSGKHPDDHAVQSE 194
Query: 80 --YLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
+ +I+ F D A + +H++ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 IYHRQIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
+ +VA D T+ + V T+ + + + + +++++P+RLG + N Y+ +K
Sbjct: 255 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNIDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+LS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>gi|56118282|ref|NP_001007883.1| cysteine protease ATG4C [Xenopus (Silurana) tropicalis]
gi|61211764|sp|Q68EP9.1|ATG4C_XENTR RecName: Full=Cysteine protease ATG4C; AltName:
Full=Autophagy-related protein 4 homolog C
gi|51258902|gb|AAH80152.1| apg4c protein [Xenopus (Silurana) tropicalis]
gi|89269108|emb|CAJ81923.1| APG4 autophagy 4 homolog C (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
Length = 450
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 97/338 (28%), Positives = 147/338 (43%), Gaps = 89/338 (26%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++++ R+D SR+W TYR+ F I S TTD GWGC LR GQM++AQ L+ LGRDW
Sbjct: 76 NVDEFRKDFISRIWLTYREEFPQIETSSWTTDCGWGCTLRTGQMLLAQGLIVHFLGRDWT 135
Query: 71 W---------------------------------------------NVNSK---EEAYLK 82
W N + K E+ + K
Sbjct: 136 WTEALDIFSSESEFWTANTARKLTPSLETSFSENNECVSSNKQPLHNCDKKSNSEDFHQK 195
Query: 83 ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---DWSSIVFH 139
I+ F D A + +HQ+ G + GK G+W+GP V+ +LRK + + I +
Sbjct: 196 IISWFADYPLAYFGLHQLVKLGKNSGKVAGDWYGPAVVSHLLRKAIEESSDPELQGITIY 255
Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
VA D T+ V L NK + +V+++P+RLG + N Y +K +L
Sbjct: 256 VAQDCTIYSADVYDL-QCNKGTE-----KAVVILVPVRLGGERTNMEYFEFVKGILSL-- 307
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
EF +G+IGGKP + YF+G+ + +I++DPH Q+
Sbjct: 308 ---------------------EFC----IGIIGGKPKQSYYFVGFQDDSLIYMDPHYCQS 342
Query: 260 IGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
V K E ++HCP ++ MDPS +
Sbjct: 343 FVDVSVKNFPLE-----SFHCPSPKKMSFKKMDPSCTI 375
>gi|118094640|ref|XP_422520.2| PREDICTED: cysteine protease ATG4C [Gallus gallus]
Length = 459
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 99/344 (28%), Positives = 152/344 (44%), Gaps = 91/344 (26%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWV 134
Query: 71 W-------NVNSKE-------------EAYL----------------------------- 81
W N +S+ EA L
Sbjct: 135 WPDALDIENSDSESWTAHTVKKLTASLEASLTAEREPKILSHHQERTLRRDCGDSEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A+ +
Sbjct: 195 VYHRKIISWFGDSPLAAFGLHQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQ 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
+ +VA D T+ + V + C+ ++ + +++++P+RLG + N Y+ +K
Sbjct: 255 GVTVYVAQDCTVYSSDVIDRQCSFMDSGETDT--KAVIILVPVRLGGERTNMDYLEFVK- 311
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
PH Q+ V K+ E ++HCP ++ MDPS +
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFKKMDPSCTI 384
>gi|320581937|gb|EFW96156.1| cysteine protease ATG4, putative [Ogataea parapolymorpha DL-1]
Length = 509
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 97/319 (30%), Positives = 153/319 (47%), Gaps = 74/319 (23%)
Query: 18 DITSRLWFTYRKGFVPI-----GDS-------------------GLTTDKGWGCMLRCGQ 53
D+ SR+W TYR GF I G S G TTD GWGCM+R Q
Sbjct: 77 DVHSRIWLTYRSGFPLIRRAEDGPSPLSFGSLIRGTVDLATVTKGFTTDAGWGCMIRTSQ 136
Query: 54 MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-EGKAVG 112
++A LL L LGR W+++ + + +I+ F D TAP+SIH GA+ GK G
Sbjct: 137 SLLANGLLQLRLGRGWRYDQTRECAKHAEIVSWFVDIPTAPFSIHNFVEQGANCAGKKPG 196
Query: 113 EWFGPNTVAQVLRKL--AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPL 170
EWFGP+ A+ ++ L A YD V+ A + +++ +L A + +P+
Sbjct: 197 EWFGPSAAARSIQVLCEANYDKIGLKVYFTA-SGDIYEDELFEL------AQEGAELRPV 249
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+++ +RLG++++NP+Y + +KK ++PQS+G+
Sbjct: 250 LILAGIRLGVKNVNPLYWDFLKKT---------------------------LSWPQSVGI 282
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK------------EQDSEKKLDSTY 278
GG+P+ + YF G+ G+ + +LDPH Q + + E +S LDS
Sbjct: 283 AGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHESPDPNHYVEVESGLDLDSV- 341
Query: 279 HCPQASRLHILHMDPSIAV 297
H + +LH+ MDPS+ V
Sbjct: 342 HTNKIRKLHLDQMDPSMLV 360
>gi|340383455|ref|XP_003390233.1| PREDICTED: cysteine protease ATG4D-like [Amphimedon queenslandica]
Length = 437
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 88/293 (30%), Positives = 143/293 (48%), Gaps = 58/293 (19%)
Query: 24 WFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN----------- 72
W TYR GF PI S LTTD GWGCM+R GQM++A L LGRDW+ +
Sbjct: 102 WMTYRCGFSPILSSSLTTDCGWGCMVRSGQMLLATVLHLHFLGRDWRLSSSDVTGHKIHR 161
Query: 73 -VNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLR---- 125
V + + IL F D + P+SIH++ G G+WFGP+ V+ ++R
Sbjct: 162 QVKNWNNYVVLILSWFGDSESELCPFSIHRLMEAAYYHGNKPGDWFGPSQVSILIRDCVR 221
Query: 126 -KLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN 184
L ++ + + +V+ D T+ + V+ + ++ Q L++++P+RLG + +N
Sbjct: 222 RALREHINLQKLNIYVSHDCTVYIKDVQDIFESDLD-------QSLLVLVPVRLGSESLN 274
Query: 185 PVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGY 244
P+YI +K AL ++G+IGG+P H+++FIG+
Sbjct: 275 PIYIPCVKALLALD---------------------------HTVGIIGGRPKHSVFFIGF 307
Query: 245 VGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
++I LDPH +Q + + D S+YHC ++ + MDPS +
Sbjct: 308 QDENLIHLDPHYSQTAVNMTRTDFDV-----SSYHCRSPKKIPVTKMDPSCTL 355
>gi|389750681|gb|EIM91754.1| hypothetical protein STEHIDRAFT_88418 [Stereum hirsutum FP-91666
SS1]
Length = 1286
Score = 143 bits (360), Expect = 1e-31, Method: Composition-based stats.
Identities = 88/288 (30%), Positives = 135/288 (46%), Gaps = 77/288 (26%)
Query: 18 DITSRLWFTYRKGFVPIGDS-------------------------------------GLT 40
D TSR+W TYR F PI DS G T
Sbjct: 342 DFTSRVWLTYRSHFQPIRDSTLTALESEQANMAHAGPVIMASSPPTKKWGWPGSGEKGWT 401
Query: 41 TDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN----VNSKEEAYLKILKMFEDRRT--AP 94
+D GWGCMLR GQ ++A AL+ LHLGRDW+ + Y+++L F D T P
Sbjct: 402 SDAGWGCMLRTGQSLLANALVHLHLGRDWRRPPHPVYTADYATYVQMLTWFFDSPTPHCP 461
Query: 95 YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+S+H++AL G GK VG+WFGP+T A ++ L + + +A D+ + + V
Sbjct: 462 FSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVHAFPEAGLGVSIASDSQIFQSDVFAA 521
Query: 155 C-------TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVK 207
++ K+ +S + ++++I +RLG+ +NP+Y IK Y
Sbjct: 522 SHPPMDSPSSKKKLASTWGGRAVLVLIGIRLGLDGVNPIYYETIKALY------------ 569
Query: 208 ILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
TFPQS+G+ GG+P+ + YF+G +++ +LDPH
Sbjct: 570 ---------------TFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 602
>gi|380485578|emb|CCF39271.1| cysteine protease atg4 [Colletotrichum higginsianum]
Length = 454
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 97/305 (31%), Positives = 138/305 (45%), Gaps = 59/305 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S+ W TYR F I S G T+D GWGCM+R GQ
Sbjct: 121 DFESKFWMTYRSEFQAIAKSTDPRASSTLSFSMRIKSQLVDQNGFTSDSGWGCMIRSGQS 180
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A A+ ++LGRDW+ N ++E K+L F D APYSIHQ GA + GK GE
Sbjct: 181 LLANAMAAINLGRDWRRGQNPEDE--RKLLSWFADDPRAPYSIHQFVQHGAVACGKYPGE 238
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ LA + + + D V K K S ++ P +++
Sbjct: 239 WFGPSATARCIQALANAQEQQPLRVYSTGDGPDVYED--KFMEIAKPDGS--RFNPTLIL 294
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI I PVY + + PQS+G+ GG
Sbjct: 295 VGTRLGIDKITPVYWEALIAALQM---------------------------PQSVGIAGG 327
Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P + YFIG G+ + +LDP HT + D SE +D T H + RLH+ +D
Sbjct: 328 RPASSHYFIGAQGSYLFYLDPHHTRPALPFHTDPSHYSEADVD-TVHTRRLRRLHVRELD 386
Query: 293 PSIAV 297
PS+ V
Sbjct: 387 PSMLV 391
>gi|75138024|sp|Q75KP8.1|ATG4A_ORYSJ RecName: Full=Cysteine protease ATG4A; AltName:
Full=Autophagy-related protein 4 homolog A
gi|40539014|gb|AAR87271.1| putative autophagy protein (with alternative splicing) [Oryza
sativa Japonica Group]
gi|108708571|gb|ABF96366.1| Peptidase family C54 containing protein, expressed [Oryza sativa
Japonica Group]
gi|125586519|gb|EAZ27183.1| hypothetical protein OsJ_11120 [Oryza sativa Japonica Group]
gi|215769128|dbj|BAH01357.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 474
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 98/302 (32%), Positives = 145/302 (48%), Gaps = 54/302 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
D +SR+W TYRKGF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 131 EDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKP 190
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---- 131
Y+ IL MF D +SIH + G S G A G W GP + + + L +
Sbjct: 191 YSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVCTNREHH 250
Query: 132 -------DWSSIVFHVALDN--------TLVVNQVKKLCTT-NKRASSNPQWQPLVLVIP 175
++ ++ V+ D + ++ +LC NK S+ W P++L++P
Sbjct: 251 EAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKNQST---WSPILLLVP 307
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
L LG+ +NP YI +K+ TFPQSLG++GGKP
Sbjct: 308 LVLGLDKLNPRYIPLLKE---------------------------TLTFPQSLGILGGKP 340
Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
+ Y G + ++LDPH Q D D+ + S+YHC L + +DPS+
Sbjct: 341 GTSTYIAGVQDDRALYLDPHEVQ---LAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSL 397
Query: 296 AV 297
A+
Sbjct: 398 AI 399
>gi|40539015|gb|AAR87272.1| putative autophagy protein (with alternative splicing) [Oryza
sativa Japonica Group]
gi|108708572|gb|ABF96367.1| Peptidase family C54 containing protein, expressed [Oryza sativa
Japonica Group]
Length = 505
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 98/302 (32%), Positives = 145/302 (48%), Gaps = 54/302 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
D +SR+W TYRKGF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 131 EDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKP 190
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---- 131
Y+ IL MF D +SIH + G S G A G W GP + + + L +
Sbjct: 191 YSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVCTNREHH 250
Query: 132 -------DWSSIVFHVALDN--------TLVVNQVKKLCTT-NKRASSNPQWQPLVLVIP 175
++ ++ V+ D + ++ +LC NK S+ W P++L++P
Sbjct: 251 EAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKNQST---WSPILLLVP 307
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
L LG+ +NP YI +K+ TFPQSLG++GGKP
Sbjct: 308 LVLGLDKLNPRYIPLLKE---------------------------TLTFPQSLGILGGKP 340
Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
+ Y G + ++LDPH Q D D+ + S+YHC L + +DPS+
Sbjct: 341 GTSTYIAGVQDDRALYLDPHEVQ---LAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSL 397
Query: 296 AV 297
A+
Sbjct: 398 AI 399
>gi|358381369|gb|EHK19044.1| hypothetical protein TRIVIDRAFT_181799 [Trichoderma virens Gv29-8]
Length = 451
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 140/306 (45%), Gaps = 60/306 (19%)
Query: 17 RDITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQ 53
D+ ++ W TYR GF PI S G ++D GWGCM+R GQ
Sbjct: 119 EDMAAKFWMTYRSGFDPIAKSVDPRATSALSFAVRIKSTLSDPTGFSSDSGWGCMIRSGQ 178
Query: 54 MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
++A + L LGRDW+ +EE +++ MF D APYSIH GA+ GK G
Sbjct: 179 SLLATTIGILQLGRDWRRGKCQQEER--QLISMFADDPRAPYSIHNFVRHGATACGKFPG 236
Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
EWFGP+ AQ ++ L V+ + + K+ + + + P ++
Sbjct: 237 EWFGPSATAQCIQALTSASGLPLKVYSPNDGQDVYEDSFMKIAKPDGQ-----DFHPTLI 291
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
+I RLGI I P+Y + + PQS+G+ G
Sbjct: 292 LIRTRLGIDKITPIYWEPLLAALQM---------------------------PQSVGIAG 324
Query: 233 GKPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
G+P+ + YF+G G+ + +LDP HT + I D + +E+ ++S H + RLH+ M
Sbjct: 325 GRPSSSHYFVGSQGSYLFYLDPHHTRKAIPYHADVTKYTEEDIESC-HTSRLRRLHLKEM 383
Query: 292 DPSIAV 297
DPS+ +
Sbjct: 384 DPSMLI 389
>gi|387015378|gb|AFJ49808.1| Cysteine protease ATG4C-like [Crotalus adamanteus]
Length = 457
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 101/344 (29%), Positives = 148/344 (43%), Gaps = 90/344 (26%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGRDW
Sbjct: 74 RNVEEFRKDFISRIWLTYREEFPQITGSVLTTDCGWGCTLRTGQMLLAQGLILHFLGRDW 133
Query: 70 ---------------------------------------------------QWNVNSKEE 78
Q S EE
Sbjct: 134 TWANAFVFENPESESWTSQTVKKLTASLETSLIGEREFRSQSTHPKSPIRNQETEESVEE 193
Query: 79 AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
Y +I+ F D A + +H++ G GK G+W+GP VA +LRK A+ +
Sbjct: 194 QYHRRIISWFADSPFANFGLHRLIEYGKKSGKIAGDWYGPAVVAHLLRKAVEKARDPELQ 253
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
I +VA D T+ + V LC S + ++++IP+RLG + N Y +K
Sbjct: 254 GITIYVAQDCTVYKSDVIDALCPFTD--SEKTSVKSIIILIPVRLGGERTNMEYFEFVK- 310
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 311 -------------GILSLDY-------------CIGIIGGKPKQSYYFAGFQDDSLIYMD 344
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
PH Q+ V K+ E ++HCP ++ MDPS +
Sbjct: 345 PHYCQSFVDVSVKDFPLE-----SFHCPSPKKMSFKKMDPSCTI 383
>gi|189194545|ref|XP_001933611.1| peptidase family C54 protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187979175|gb|EDU45801.1| peptidase family C54 protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 470
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 128/266 (48%), Gaps = 55/266 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
D SR+W TYR GF PI S G T+D G+GCM+R GQ ++
Sbjct: 99 DFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCMIRSGQCIL 158
Query: 57 AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
A AL L LGRDW++ + + ++ MF D AP+SIH+ GA+ GK GEWF
Sbjct: 159 ANALQILRLGRDWRYQEQPDAKEHCDVVAMFADDPRAPFSIHRFVEHGAAVCGKYPGEWF 218
Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVLVI 174
GP+ A+ ++ L + + + +V+ D V +++K++ + + +W P ++++
Sbjct: 219 GPSAAARCIQDLVHKNREAGLKVYVSGDGADVYEDKLKEIAVDD-----DGEWHPTLILV 273
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
RLGI I PVY +K + QS+G+ GG+
Sbjct: 274 GTRLGIDKITPVYWEALKASLQMK---------------------------QSIGIAGGR 306
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNI 260
P+ + YF+ N+ +LDPH+ + +
Sbjct: 307 PSASHYFVATQANNFFYLDPHSTRPL 332
>gi|449268268|gb|EMC79138.1| Cysteine protease ATG4C [Columba livia]
Length = 459
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 97/344 (28%), Positives = 152/344 (44%), Gaps = 91/344 (26%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIKGSALTTDCGWGCTLRTGQMLLAQGLMLHFLGRAWV 134
Query: 71 W----NVNSKE----------------EAYL----------------------------- 81
W +++S + EA L
Sbjct: 135 WPDALDIDSSDSESWTAHTVKKLTASFEASLTAEREPKILSNHHRGTLKRNWDESERRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D + +HQ+ G GK G+W+GP VA +LRK A+ +
Sbjct: 195 VYHRKIISWFGDSPLTAFGLHQLIEYGKKSGKIAGDWYGPAVVAHILRKAVEEARDPELQ 254
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
+ +VA D T+ + V + C+ ++ + +++++P+RLG + N Y+ +K
Sbjct: 255 GVTIYVAQDCTVYSSDVIDRQCSFMDSGEADT--KAVIILVPVRLGGERTNMDYLEFVK- 311
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
ILS Y +G+IGGKP + YF G+ + +I++D
Sbjct: 312 -------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMD 345
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
PH Q+ V K+ E ++HCP ++ MDPS +
Sbjct: 346 PHYCQSFVDVSIKDFPLE-----SFHCPSPKKMSFKKMDPSCTI 384
>gi|148226916|ref|NP_001087417.1| cysteine protease ATG4D [Xenopus laevis]
gi|61211765|sp|Q68FJ9.1|ATG4D_XENLA RecName: Full=Cysteine protease ATG4D; AltName: Full=Autophagin-4;
AltName: Full=Autophagy-related protein 4 homolog D
gi|51260960|gb|AAH79754.1| MGC84754 protein [Xenopus laevis]
Length = 469
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 155/334 (46%), Gaps = 78/334 (23%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ ++D SR+W TYR+ F + + LTTD GWGCM+R GQM++AQ LL L R+W
Sbjct: 95 EIERFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSREWT 154
Query: 71 WN-------------------------------------------VNSKEEAYLKILKMF 87
W+ ++ + I++ F
Sbjct: 155 WSEALYRHFVEMEPIRSSSPPSMPLSSLATGHSAGDYQPHTQCSGAPHGDQVHRNIMRWF 214
Query: 88 EDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTL 146
D +P+ +HQ+ G+ GK G+W+GP+ VA +++K + + + +V+ D T+
Sbjct: 215 SDHPGSPFGLHQLVTLGSIFGKKAGDWYGPSIVAHIIKKAIETSSEVPELSVYVSQDCTV 274
Query: 147 VVNQVKKLCTTN--KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYD 204
+++L + +S + +++++P+RLG + NPVY + +K+ +
Sbjct: 275 YKADIEQLFAGDVPHAETSRGAGKAVIILVPVRLGGETFNPVYKHCLKEFLRM------- 327
Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
P LG+IGGKP H+LYFIGY N +++LDPH Q Y
Sbjct: 328 --------------------PSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQP----Y 363
Query: 265 DKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
++ L+S +HC ++ I MDPS
Sbjct: 364 IDTSKNDFPLES-FHCNSPRKISITRMDPSCTFA 396
>gi|328351041|emb|CCA37441.1| autophagy-related protein 4 [Komagataella pastoris CBS 7435]
Length = 758
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 168/381 (44%), Gaps = 106/381 (27%)
Query: 1 MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIG---DS------------------GL 39
++ NK S + D+ S++W TYR GF PI DS G
Sbjct: 51 IKDGNKKSTTYSQSFIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGF 110
Query: 40 TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA-YLKILKMFEDRRTAPYSIH 98
T+D GWGCM+R Q ++A ALLFLHLGRDW + + +I+ F D P+SIH
Sbjct: 111 TSDAGWGCMIRTSQSLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIH 170
Query: 99 QIALTG-ASEGKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCT 156
G K GEWFGP+ ++ ++ L K Y V+ + + +V++L
Sbjct: 171 NFVQQGIKCCDKKPGEWFGPSAASRAIKNLCKEYPPCGLRVYFSSDCGDVYDTEVRELAY 230
Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
+ + P+++++ +RLG++ +NPVY + +++C +L
Sbjct: 231 GDSDT-----FTPILVLLGIRLGVEKVNPVYWDSLRECLSL------------------- 266
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------------- 255
QS+G+ GG+P + YF G+ G+ + +LDPH
Sbjct: 267 --------KQSVGIAGGRPCSSHYFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTK 318
Query: 256 -TNQNIGCVY-----DKEQDS------EKKLDS-------------TYHCPQASRLHILH 290
T++N Y D ++ E KLD+ + H P+ ++LH+ H
Sbjct: 319 KTDENAAGQYPVSNTDSNNETNHDDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSH 378
Query: 291 MDPSI----AVVSQRSYSDYK 307
MDPS+ + S+ ++D+K
Sbjct: 379 MDPSMLIGFLITSEDDFNDWK 399
>gi|449442361|ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
gi|449512710|ref|XP_004164121.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
Length = 483
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 95/300 (31%), Positives = 146/300 (48%), Gaps = 50/300 (16%)
Query: 16 RRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNS 75
+D +SR+ TYRKGF I DS T+D WGCMLR QM++AQALLF LGR W+
Sbjct: 138 EQDFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPSQK 197
Query: 76 K-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWS 134
++ Y++IL +F D T+ +SIH + G + A G W GP + + L + +
Sbjct: 198 PLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGPYAMCRSWETLVRSKRET 257
Query: 135 SI---------VFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLR 177
I ++ V+ D L ++ + C + + W P++L++PL
Sbjct: 258 PILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSKGQHD--WSPILLLVPLV 315
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG++ INP YI + R FTFPQSLG++GGKP
Sbjct: 316 LGLEKINPRYIPSL---------------------------RTTFTFPQSLGILGGKPGA 348
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+ Y +G + +LDPH Q V + ++D + S+YHC + + +DPS+A+
Sbjct: 349 STYIVGVQDENAFYLDPHEVQQ---VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAI 405
>gi|407917424|gb|EKG10733.1| Peptidase C54 [Macrophomina phaseolina MS6]
Length = 437
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 148/317 (46%), Gaps = 60/317 (18%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D +R+W TYR F I S G ++D GWGCM+R GQ
Sbjct: 108 DFEARVWITYRSNFAAIPKSQDPNATTAMSFSVRFRNQISNQGGFSSDTGWGCMIRSGQS 167
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A AL L LGR W+ +S+ E +IL +F D AP+SIH+ GA + GK GE
Sbjct: 168 LLANALQVLRLGRAWRRGQDSQGE--RRILSLFADDPKAPFSIHRFVEHGAVACGKHPGE 225
Query: 114 WFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVL 172
WFGP+ A+ ++ L+ Y+D V+ + + + K+ +N + P ++
Sbjct: 226 WFGPSATARCIQALSNGYEDAGLRVYITGDGSDVYEDSFMKVAK-----DANNTFHPTLV 280
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
++ +RLGI + PVY +K L QS+G+ G
Sbjct: 281 LVGIRLGIDRVTPVYWEALKASLQLS---------------------------QSIGIAG 313
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
G+P+ + YF+G G+ +LDPHT + ++ D ++ + H + RLH+ MD
Sbjct: 314 GRPSASHYFVGTQGSYFFYLDPHTTRPFLPLHSDLSDYTQEDIDSCHTRRLRRLHVKEMD 373
Query: 293 PSIAVVSQ-RSYSDYKN 308
PS+ + R +D++N
Sbjct: 374 PSMLIAFLIRDETDWQN 390
>gi|83773128|dbj|BAE63255.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|325504923|dbj|BAJ83603.1| cysteine protease Atg4 [Aspergillus oryzae]
Length = 356
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 91/262 (34%), Positives = 135/262 (51%), Gaps = 36/262 (13%)
Query: 38 GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSI 97
G T+D GWGCM+R GQ ++A A+L L LGRDW+ ++EEA ++L +F D AP SI
Sbjct: 76 GFTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKAEEEA--RLLSLFADHPDAPLSI 133
Query: 98 HQIALTGA-SEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
H+ GA S GK GEWFGP+ A+ + L+ +I V + N + V +
Sbjct: 134 HRFVKYGAESCGKHPGEWFGPSATARCIEALSA--QCGNIAPRVYVTND--TSDVYEDSF 189
Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
S + QP ++++ RLGI ++ PVY +G+K L
Sbjct: 190 LRVARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQL------------------- 230
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLD 275
PQS+G+ GG+P+ + YFIG G +LDPHT + + D S+ ++
Sbjct: 231 --------PQSVGIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEI- 281
Query: 276 STYHCPQASRLHILHMDPSIAV 297
STYH + R+HI MDPS+ +
Sbjct: 282 STYHTRRLRRIHIQDMDPSMLI 303
>gi|254567087|ref|XP_002490654.1| Conserved cysteine protease required for autophagy [Komagataella
pastoris GS115]
gi|238030450|emb|CAY68374.1| Conserved cysteine protease required for autophagy [Komagataella
pastoris GS115]
Length = 531
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 168/381 (44%), Gaps = 106/381 (27%)
Query: 1 MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIG---DS------------------GL 39
++ NK S + D+ S++W TYR GF PI DS G
Sbjct: 51 IKDGNKKSTTYSQSFIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGF 110
Query: 40 TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA-YLKILKMFEDRRTAPYSIH 98
T+D GWGCM+R Q ++A ALLFLHLGRDW + + +I+ F D P+SIH
Sbjct: 111 TSDAGWGCMIRTSQSLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIH 170
Query: 99 QIALTG-ASEGKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCT 156
G K GEWFGP+ ++ ++ L K Y V+ + + +V++L
Sbjct: 171 NFVQQGIKCCDKKPGEWFGPSAASRAIKNLCKEYPPCGLRVYFSSDCGDVYDTEVRELAY 230
Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
+ + P+++++ +RLG++ +NPVY + +++C +L
Sbjct: 231 GDSDT-----FTPILVLLGIRLGVEKVNPVYWDSLRECLSL------------------- 266
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------------- 255
QS+G+ GG+P + YF G+ G+ + +LDPH
Sbjct: 267 --------KQSVGIAGGRPCSSHYFYGFQGDYLFYLDPHLPQKALTFGSTEKPVHRLQTK 318
Query: 256 -TNQNIGCVY-----DKEQDS------EKKLDS-------------TYHCPQASRLHILH 290
T++N Y D ++ E KLD+ + H P+ ++LH+ H
Sbjct: 319 KTDENAAGQYPVSNTDSNNETNHDDCYESKLDNSKYVEILSCLDVKSVHTPKVTKLHLSH 378
Query: 291 MDPSI----AVVSQRSYSDYK 307
MDPS+ + S+ ++D+K
Sbjct: 379 MDPSMLIGFLITSEDDFNDWK 399
>gi|330935035|ref|XP_003304808.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
gi|311318464|gb|EFQ87127.1| hypothetical protein PTT_17484 [Pyrenophora teres f. teres 0-1]
Length = 470
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 127/266 (47%), Gaps = 55/266 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
D SR+W TYR GF PI S G T+D G+GCM+R GQ ++
Sbjct: 99 DFESRIWMTYRSGFTPIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCMIRSGQCIL 158
Query: 57 AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
A AL L LGRDW++ + + I+ MF D AP+SIH+ GA+ GK GEWF
Sbjct: 159 ANALQILRLGRDWRYQEQPDAKEHCDIVAMFADDPRAPFSIHRFVEHGAAVCGKYPGEWF 218
Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVLVI 174
GP+ A+ ++ L + + +V+ D V +++K++ + + +W P ++++
Sbjct: 219 GPSAAARCIQDLVHKNKEVGLKVYVSGDGADVYEDKLKEIAVDD-----DGEWHPTLILV 273
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
RLGI I PVY +K + QS+G+ GG+
Sbjct: 274 GTRLGIDKITPVYWEALKASLQMK---------------------------QSIGIAGGR 306
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNI 260
P+ + YF+ N+ +LDPH+ + +
Sbjct: 307 PSASHYFVATQANNFFYLDPHSTRPL 332
>gi|71022117|ref|XP_761289.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
gi|46097783|gb|EAK83016.1| hypothetical protein UM05142.1 [Ustilago maydis 521]
Length = 1541
Score = 141 bits (356), Expect = 3e-31, Method: Composition-based stats.
Identities = 92/332 (27%), Positives = 147/332 (44%), Gaps = 100/332 (30%)
Query: 37 SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW--------------------------- 69
+GLTTD GWGCMLR GQ ++A ALL +HLGR W
Sbjct: 818 AGLTTDSGWGCMLRTGQSLLANALLNVHLGRSWLREAPPMRQMDFLEQLASLSLDSSVEM 877
Query: 70 ----QW-NVNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
+W ++ AY+KIL F D + P+ +H++A G GK VGEWFGP+T A
Sbjct: 878 QSLQEWREKRARHAAYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAG 937
Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN--------KRASSNPQWQ-PLVLV 173
+++L + I +A D +++V+ ++ + W+ P+V++
Sbjct: 938 AIKQLVTEFPDAGIAVELAHDGVFYLDEVRLAAGARSALQSGKGRQGDAAVTWRRPVVIL 997
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
I +RLG+ +NP+Y +K+ F+FP S+G+ GG
Sbjct: 998 IGIRLGLDSVNPIYYESVKET---------------------------FSFPHSVGIAGG 1030
Query: 234 KPNHALYFIGYVGNDVIFLDPH-TNQNIGCVY----------------------DKEQDS 270
+P+ + YF+G+ GN + +LDPH + Y DK+ +
Sbjct: 1031 RPSSSYYFMGHQGNSLFYLDPHNVRPAVALRYPPSTFPTAVPHQLDVAHRFALEDKDDEL 1090
Query: 271 E-------KKLDSTYHCPQASRLHILHMDPSI 295
E + ST+HC + R+ I +DPS+
Sbjct: 1091 EWWSHAYTEAQTSTFHCEKVRRMPIKSLDPSM 1122
>gi|356568569|ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
Length = 485
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 100/301 (33%), Positives = 149/301 (49%), Gaps = 46/301 (15%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L +D +S++ TYRKGF IGD+ T+D WGCMLR QM++AQALLF LGR W+
Sbjct: 139 LASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRK 198
Query: 72 NVNSK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA-K 129
++ ++ Y+ +L++F D + +SIH + G G AVG W GP + + LA K
Sbjct: 199 PIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLARK 258
Query: 130 YDDWSS-----IVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+D ++ V+ D + + K C+ + +S W PL+L++PL
Sbjct: 259 KNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCS--EFSSGLAVWTPLLLLVPL 316
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ +NP YI +L ST F FPQSLG++GGKP
Sbjct: 317 VLGLDKVNPRYI------------------PLLRST---------FKFPQSLGIMGGKPG 349
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y IG +LDPH Q + + Q E S+YHC + + +DPS+A
Sbjct: 350 ASTYIIGVQNEKAFYLDPHDVQQVVNISGDTQ--EPTGTSSYHCNVMRHIPLDSIDPSLA 407
Query: 297 V 297
+
Sbjct: 408 I 408
>gi|356531828|ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
Length = 486
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 101/299 (33%), Positives = 149/299 (49%), Gaps = 42/299 (14%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L +D +S++ TYRKGF IGD+ T+D WGCMLR QM++AQALLF LGR W+
Sbjct: 139 LASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQMLVAQALLFHKLGRSWRK 198
Query: 72 NVNS-KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA-K 129
++ ++ Y+ +L++F D + +SIH + G G AVG W GP + + LA K
Sbjct: 199 PIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLARK 258
Query: 130 YDDWSSI-----VFHVALDNTLVVNQVKKLC--TTNKR----ASSNPQWQPLVLVIPLRL 178
+D + ++ V+ D +C +KR +S W PL+L++PL L
Sbjct: 259 KNDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCFEFSSGLAAWTPLLLLVPLVL 318
Query: 179 GIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHA 238
G+ +NP YI +L ST F FPQSLG++GGKP +
Sbjct: 319 GLDKVNPRYI------------------PLLRST---------FKFPQSLGIMGGKPGAS 351
Query: 239 LYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
Y IG +LDPH Q + + Q E S+YHC + + +DPS+A+
Sbjct: 352 TYIIGAQNEKAFYLDPHDVQQVVNISGDTQ--EPTSTSSYHCNIMRHIPLDSIDPSLAI 408
>gi|440478911|gb|ELQ59709.1| cysteine protease atg4 [Magnaporthe oryzae P131]
Length = 572
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 97/311 (31%), Positives = 140/311 (45%), Gaps = 66/311 (21%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR GF PI S G TTD GWGCM+R GQ
Sbjct: 235 DFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRTGQS 294
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A +LL LGR W+ EE K+L +F D APYSIH GA++ GK GE
Sbjct: 295 LLANSLLTCRLGRSWRRGQAPDEE--RKLLSLFADDPRAPYSIHNFVAHGAAKCGKYPGE 352
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ + LA + S V+ + + ++ ++ + + P +++
Sbjct: 353 WFGPSATARCIHALANATENSFRVYSTGDLPDVYEDSFMEVAKSDGKT-----FHPTLIL 407
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
I RLGI IN VY + L PQS+G+ GG
Sbjct: 408 ISTRLGIDKINQVYWESLTATLQL---------------------------PQSVGIAGG 440
Query: 234 KPNHALYFIGYVGND------VIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
+P+ + YF+G +D + +LDP HT + D + + +DS H + RL
Sbjct: 441 RPSSSHYFVGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSC-HTRRLRRL 499
Query: 287 HILHMDPSIAV 297
HI MDPS+ +
Sbjct: 500 HIREMDPSMLI 510
>gi|444525500|gb|ELV14047.1| Cysteine protease ATG4D [Tupaia chinensis]
Length = 431
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 78/243 (32%), Positives = 120/243 (49%), Gaps = 42/243 (17%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 84 DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 143
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 144 WAEGMGLGPPELSRSASPSRYHGPAHWRPPRWAQGTPELEQERRHRQIVSWFADHPRAPF 203
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 204 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVVRL 263
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K +P P D + L Y
Sbjct: 264 VA---RPDPAAEWKSVVILVPVRLGGETLNPVYVPCVK---LMPTPPTDDFLLYLDPHYC 317
Query: 215 MQT 217
T
Sbjct: 318 QPT 320
>gi|391868733|gb|EIT77943.1| cysteine protease required for autophagy - Apg4p/Aut2p [Aspergillus
oryzae 3.042]
Length = 357
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 91/262 (34%), Positives = 135/262 (51%), Gaps = 36/262 (13%)
Query: 38 GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSI 97
G T+D GWGCM+R GQ ++A A+L L LGRDW+ ++EEA ++L +F D AP SI
Sbjct: 76 GFTSDTGWGCMIRSGQSLLANAMLTLCLGRDWRRGDKAEEEA--RLLSLFADHPDAPLSI 133
Query: 98 HQIALTGA-SEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
H+ GA S GK GEWFGP+ A+ + L+ +I V + N + V +
Sbjct: 134 HRFVKYGAESCGKHPGEWFGPSATARCIEALSA--QCGNIAPRVYVTND--TSDVYEDSF 189
Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
S + QP ++++ RLGI ++ PVY +G+K L
Sbjct: 190 LRVARSGSGSIQPTLILLGTRLGIDNVTPVYWDGLKAVLQL------------------- 230
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLD 275
PQS+G+ GG+P+ + YFIG G +LDPHT + + D S+ ++
Sbjct: 231 --------PQSVGIAGGRPSASHYFIGTQGPYFFYLDPHTTRPAVPYSIDGRLLSKTEI- 281
Query: 276 STYHCPQASRLHILHMDPSIAV 297
STYH + R+HI MDPS+ +
Sbjct: 282 STYHTRRLRRIHIQDMDPSMLI 303
>gi|310801857|gb|EFQ36750.1| peptidase family C54 [Glomerella graminicola M1.001]
Length = 454
Score = 140 bits (353), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 95/305 (31%), Positives = 139/305 (45%), Gaps = 59/305 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S+ W TYR F I S G ++D GWGCM+R GQ
Sbjct: 121 DFESKFWMTYRSEFPAIAKSTDPRASSALSFSMRIKSQLVDQNGFSSDSGWGCMIRSGQS 180
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A A+ ++LGRDW+ N +EE K+L +F D APYSIHQ GA + GK GE
Sbjct: 181 LLANAMAVINLGRDWRRGQNQEEER--KLLSLFADDPRAPYSIHQFVQHGAVACGKYPGE 238
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ LA + + D V K K S ++ P +++
Sbjct: 239 WFGPSATARCIQALANAQMHQPLRVYSTGDGPDVYED--KFMKIAKPDGS--RFHPTLIL 294
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI I PVY + + PQS+G+ GG
Sbjct: 295 VGTRLGIDKITPVYWEALIAALQM---------------------------PQSVGIAGG 327
Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YFIG G+ + +LDP HT + + SE +D T H + RLH+ +D
Sbjct: 328 RPSSSHYFIGAQGSYLFYLDPHHTRPALPFHMNPSLYSEADVD-TVHTRRLRRLHVRELD 386
Query: 293 PSIAV 297
PS+ +
Sbjct: 387 PSMLI 391
>gi|389637385|ref|XP_003716330.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
gi|148887340|sp|Q523C3.2|ATG4_MAGO7 RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|351642149|gb|EHA50011.1| cysteine protease ATG4 [Magnaporthe oryzae 70-15]
Length = 491
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 97/311 (31%), Positives = 139/311 (44%), Gaps = 66/311 (21%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR GF PI S G TTD GWGCM+R GQ
Sbjct: 154 DFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRTGQS 213
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A +LL LGR W+ EE K+L +F D APYSIH GA++ GK GE
Sbjct: 214 LLANSLLTCRLGRSWRRGQAPDEE--RKLLSLFADDPRAPYSIHNFVAHGAAKCGKYPGE 271
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ + LA + S V+ + + ++ + + + P +++
Sbjct: 272 WFGPSATARCIHALANATENSFRVYSTGDLPDVYEDSFMEVAKPDGKT-----FHPTLIL 326
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
I RLGI IN VY + L PQS+G+ GG
Sbjct: 327 ISTRLGIDKINQVYWESLTATLQL---------------------------PQSVGIAGG 359
Query: 234 KPNHALYFIGYVGND------VIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
+P+ + YF+G +D + +LDP HT + D + + +DS H + RL
Sbjct: 360 RPSSSHYFVGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSC-HTRRLRRL 418
Query: 287 HILHMDPSIAV 297
HI MDPS+ +
Sbjct: 419 HIREMDPSMLI 429
>gi|3212867|gb|AAC23418.1| unknown protein [Arabidopsis thaliana]
Length = 451
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 88/303 (29%), Positives = 135/303 (44%), Gaps = 65/303 (21%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L ++ D +S++ TYRKGF P D+ T+D WGCM+R QM+ AQ
Sbjct: 135 LAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQL------------ 182
Query: 72 NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA--- 128
E+ YL+ L+ F D + +SIH + + GAS G A G W GP + + LA
Sbjct: 183 ----PEQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLACKK 238
Query: 129 -KYDDWSSIVFHVAL-------------DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
K D + +A+ L + K C + S +W P++L++
Sbjct: 239 RKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQS--EWTPIILLV 296
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
PL LG+ +NP YI + FTFPQS+G++GGK
Sbjct: 297 PLVLGLDSVNPRYIPSLVA---------------------------TFTFPQSVGILGGK 329
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
P + Y +G + +LDPH Q + V + D + S+YHC + + +DPS
Sbjct: 330 PGASTYIVGVQEDKGFYLDPHEVQQVVTVNKETPDVDT---SSYHCNVLRYVPLESLDPS 386
Query: 295 IAV 297
+A+
Sbjct: 387 LAL 389
>gi|440467300|gb|ELQ36530.1| cysteine protease atg4 [Magnaporthe oryzae Y34]
Length = 572
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 97/311 (31%), Positives = 139/311 (44%), Gaps = 66/311 (21%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR GF PI S G TTD GWGCM+R GQ
Sbjct: 235 DFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRTGQS 294
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A +LL LGR W+ EE K+L +F D APYSIH GA++ GK GE
Sbjct: 295 LLANSLLTCRLGRSWRRGQAPDEE--RKLLSLFADDPRAPYSIHNFVAHGAAKCGKYPGE 352
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ + LA + S V+ + + ++ + + + P +++
Sbjct: 353 WFGPSATARCIHALANATENSFRVYSTGDLPDVYEDSFMEVAKPDGKT-----FHPTLIL 407
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
I RLGI IN VY + L PQS+G+ GG
Sbjct: 408 ISTRLGIDKINQVYWESLTATLQL---------------------------PQSVGIAGG 440
Query: 234 KPNHALYFIGYVGND------VIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
+P+ + YF+G +D + +LDP HT + D + + +DS H + RL
Sbjct: 441 RPSSSHYFVGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSC-HTRRLRRL 499
Query: 287 HILHMDPSIAV 297
HI MDPS+ +
Sbjct: 500 HIREMDPSMLI 510
>gi|402080175|gb|EJT75320.1| cysteine protease ATG4 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 468
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 94/310 (30%), Positives = 141/310 (45%), Gaps = 64/310 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W +YR GF PI S TTD GWGCM+R GQ
Sbjct: 131 DFESRIWVSYRSGFPPIPRSTDPAATSRMSFAMRLKTMTDQQAAFTTDSGWGCMIRTGQS 190
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A LL LGR W+ S EE K+L +F D APYSIH+ GA++ GK GE
Sbjct: 191 LLANTLLSHRLGRGWRRGEKSDEE--RKLLSLFADDPRAPYSIHKFVEHGAAKCGKYPGE 248
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ + LA ++ + V+ + + ++ + + + P +++
Sbjct: 249 WFGPSATARCIEALANTNEKTLRVYSTGDLPDVYEDSFMEVARPDGKT-----FHPTLIL 303
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI IN VY L++T M PQS+G+ GG
Sbjct: 304 VSTRLGIDKINQVYWES------------------LTATLQM---------PQSVGIAGG 336
Query: 234 KPNHALYFIGY------VGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
+P+ + YF+G G+++ +LDPH + +D Q + H + RLH
Sbjct: 337 RPSSSHYFVGAQRSDEDQGSNLFYLDPHHTRPALPYFDDPQLYTPSDVDSCHTRRLRRLH 396
Query: 288 ILHMDPSIAV 297
I MDPS+ +
Sbjct: 397 IREMDPSMLI 406
>gi|388856806|emb|CCF49593.1| related to ATG4-essential for autophagy [Ustilago hordei]
Length = 1572
Score = 139 bits (351), Expect = 1e-30, Method: Composition-based stats.
Identities = 89/344 (25%), Positives = 148/344 (43%), Gaps = 112/344 (32%)
Query: 37 SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV----------------------- 73
+GLTTD GWGCMLR GQ ++A AL+ +HLGR WQ +
Sbjct: 828 AGLTTDSGWGCMLRTGQSLLANALINVHLGRSWQRDAPPLRQQQFLEELAGLSIADAAEK 887
Query: 74 ---------NSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
++ Y+KIL F D + P+ +H++A G GK VGEWFGP+T +
Sbjct: 888 ESLQEWRQKRARHATYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTASG 947
Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKL--------------------CTTNKRAS 162
+++L + I +A D +++V+ + R
Sbjct: 948 AIKQLVSEFPQAGIAVELARDGVFYLDEVRAAASASASAASVQSGGKARSSGAASGSRKG 1007
Query: 163 SNPQW-QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
W +P++++I +RLG++ +NP+Y +K
Sbjct: 1008 EGLIWRRPVLILIGIRLGLESVNPIYYESVKA---------------------------T 1040
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH------------------TNQNIGCV 263
F+FP S+G+ GG+P+ + YF+G+ GN + +LDPH +++G
Sbjct: 1041 FSFPHSVGIAGGRPSSSYYFMGHQGNSLFYLDPHNVRPAVPLRYPPSTFPDAVPRHLGIA 1100
Query: 264 Y-----DKEQDSE-------KKLDSTYHCPQASRLHILHMDPSI 295
+ DK+ + E + ST+HC + R+ I +DPS+
Sbjct: 1101 HRFVLEDKDDEDEWWSHAYSEVQTSTFHCEKVRRMPIKSLDPSM 1144
>gi|348666332|gb|EGZ06159.1| hypothetical protein PHYSODRAFT_532364 [Phytophthora sojae]
Length = 398
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/258 (32%), Positives = 124/258 (48%), Gaps = 45/258 (17%)
Query: 16 RRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNS 75
+R + LWFTYR+ F + T+D GWGCMLR QM++ QAL LGRDW+
Sbjct: 42 KRSFEAILWFTYRRDFPQMTPYDFTSDAGWGCMLRSAQMLLGQALQRRLLGRDWRLPALF 101
Query: 76 KEE-------AYLKILKMFEDRR--TAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
+ E Y+ +L+ F D YSIH + G K GEW+GP T AQVLR
Sbjct: 102 EAEIDARLPDKYVTLLRWFADSPDIECRYSIHHMVKLGMQYDKLPGEWYGPTTAAQVLRD 161
Query: 127 LA---KYDDWSSIVFHVALDNTLVVNQVKKLCTTN-----KRASSNPQWQ-PLVLVIPLR 177
L + + + +V + + + V +LC + A + W L+++IPLR
Sbjct: 162 LVNLHRREFGGELAMYVPQEGVVYTDDVTRLCFFDPLLHPPTAEDSSDWSTALLILIPLR 221
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG+ +N Y+ ++K +A FPQS+G+IGGK H
Sbjct: 222 LGLDQVNERYVPALEKTFA---------------------------FPQSVGIIGGKKGH 254
Query: 238 ALYFIGYVGNDVIFLDPH 255
++YF+G + + LDPH
Sbjct: 255 SVYFVGTQQDQLHLLDPH 272
>gi|393219109|gb|EJD04597.1| hypothetical protein FOMMEDRAFT_133827 [Fomitiporia mediterranea
MF3/22]
Length = 1147
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 141/306 (46%), Gaps = 99/306 (32%)
Query: 18 DITSRLWFTYRKGFVPI---------------------------------GDSGLTTDKG 44
D +SR+W TYR + PI G+ G T+D G
Sbjct: 345 DFSSRVWLTYRSHYPPIRDQTLAQLEAEASGQIPLQPVSASPRKWHILGSGEKGWTSDSG 404
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQ------WNVNSKEEAYLKILKMFEDRRT--APYS 96
WGCMLR GQ ++A AL+ LHLGRDW+ + V+ Y+KIL F D P+S
Sbjct: 405 WGCMLRTGQSLLANALIHLHLGRDWRRPPQPVYTVDYA--TYVKILTWFFDSTDIHCPFS 462
Query: 97 IHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
+H++AL G GK VG+WFGP+T A ++ + + + VA D VV + L
Sbjct: 463 VHRMALAGKDLGKDVGQWFGPSTAAGAIKTVVHAFAEAGLGVSVATDG--VVYETDVLAA 520
Query: 157 TN--------KRASSNPQ-----------------W--QPLVLVIPLRLGIQDINPVYIN 189
+N R +++ W +P+++++ +RLGI +NPVY
Sbjct: 521 SNAGPYMYRHSRMATSSPSTRRRRSAQQQQSMMSIWGQRPVLVLVGIRLGIDCVNPVY-- 578
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
YD VK L FTFPQS+G+ GG+P+ + YF+G +++
Sbjct: 579 -------------YDAVKAL------------FTFPQSVGIAGGRPSSSYYFVGVQTDNL 613
Query: 250 IFLDPH 255
+LDPH
Sbjct: 614 FYLDPH 619
>gi|346975631|gb|EGY19083.1| peptidase family C54 protein [Verticillium dahliae VdLs.17]
Length = 449
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 90/305 (29%), Positives = 140/305 (45%), Gaps = 60/305 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S+ W TYR F PI S G ++D GWGCM+R GQ
Sbjct: 117 DFESKFWMTYRSDFEPIAKSTDPRAASVLSLSMRIKSQFMDQAGYSSDSGWGCMIRSGQS 176
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A A+ L LGRDW+ V +++E ++L F D APYSIH+ GA + GK GE
Sbjct: 177 LLANAMAVLDLGRDWRRGVAAEKER--QLLSKFADDPKAPYSIHRFVQHGAVACGKYPGE 234
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L ++ V+ + ++ + S + P +++
Sbjct: 235 WFGPSATARCIQALVNANEPHLRVYSTGDGPDVYEDRFFDIAK-----PSGETFHPTLIL 289
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI I PVY + + + PQS+G+ GG
Sbjct: 290 VGTRLGIDKITPVYWDALIAALQM---------------------------PQSIGIAGG 322
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVY-DKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YFIG G+ + +LDPH + Y D ++ +DS H + RLH+ MD
Sbjct: 323 RPSSSHYFIGAQGSFLFYLDPHHTRTALPYYQDPTLYAQADVDSV-HTRRLRRLHVREMD 381
Query: 293 PSIAV 297
PS+ +
Sbjct: 382 PSMLI 386
>gi|392574855|gb|EIW67990.1| hypothetical protein TREMEDRAFT_63874 [Tremella mesenterica DSM
1558]
Length = 1159
Score = 139 bits (349), Expect = 2e-30, Method: Composition-based stats.
Identities = 88/255 (34%), Positives = 130/255 (50%), Gaps = 64/255 (25%)
Query: 36 DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE----------------A 79
+ GLTTD GWGCMLR GQ ++A AL+ LHLGRDW+ V S+ + +
Sbjct: 577 ERGLTTDAGWGCMLRTGQSLLANALIHLHLGRDWR--VPSQPQVPPTSAAHLAELEAYSS 634
Query: 80 YLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
Y++IL F D + P+S+H+IAL G GK VGEWFGP+T A L+ L S +
Sbjct: 635 YVRILSWFLDDPSPLCPFSVHRIALIGKELGKEVGEWFGPSTAAGALKTLVNSFPPSGMA 694
Query: 138 FHVALDNTLVVNQV---KKLCTTNKRASSNP------------QW--QPLVLVIPLRLGI 180
A+D+ + + V L +T S P W + ++++I +RLG+
Sbjct: 695 VATAVDSIVYKSDVYSASNLQSTGWSDESAPPRRQSSSSRSSTSWGNRAVLVLIGIRLGL 754
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
+NP+Y Y+ +K L FTFPQS+G+ GG+P+ + Y
Sbjct: 755 DGVNPLY---------------YESIKAL------------FTFPQSVGIAGGRPSSSYY 787
Query: 241 FIGYVGNDVIFLDPH 255
F+G N +++LDPH
Sbjct: 788 FVGTQANSLVYLDPH 802
>gi|357507987|ref|XP_003624282.1| Cysteine protease ATG4 [Medicago truncatula]
gi|147742964|sp|A2Q1V6.1|ATG4_MEDTR RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|124359485|gb|ABN05923.1| Peptidase C54 [Medicago truncatula]
gi|355499297|gb|AES80500.1| Cysteine protease ATG4 [Medicago truncatula]
Length = 487
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 95/300 (31%), Positives = 139/300 (46%), Gaps = 48/300 (16%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
+D SR+ TYRKGF I DS T+D WGCMLR QM++AQALLF LGR W+ V+
Sbjct: 142 FEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRKTVD 201
Query: 75 SK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
++ Y+ IL++F D A +SIH + G G AVG W GP + + LA+
Sbjct: 202 KPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLAR---- 257
Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN----PVYI- 188
N+R + Q L + I + G +D PV
Sbjct: 258 ------------------------NQREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCI 293
Query: 189 -NGIKKCYALPISPV----YDMVKILSSTYNMQTPRY------EFTFPQSLGVIGGKPNH 237
+ K+C V ++ L + RY F FPQSLG++GGKP
Sbjct: 294 EDACKRCLEFSRGLVPWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGA 353
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+ Y IG + +LDPH + V + D+++ S+YHC + + + +DPS+A+
Sbjct: 354 STYIIGVQNDKAFYLDPH---EVKPVVNITGDTQEPNTSSYHCNISRHMPLDSIDPSLAI 410
>gi|388514549|gb|AFK45336.1| unknown [Lotus japonicus]
Length = 489
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 139/300 (46%), Gaps = 48/300 (16%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
+D +S++ TYRKGF IGDS T+D WGCMLR QM++AQALLF LGR W+ +
Sbjct: 143 FEQDFSSKILLTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHKLGRMWRKTTD 202
Query: 75 SK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
++ YL IL+ F D + +SIH + G G AVG W GP + + LA+
Sbjct: 203 KPLDKEYLDILQHFGDSEASSFSIHNLLQAGKGYGLAVGSWVGPYAMCRSWEVLAR---- 258
Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN----PVYI- 188
N+R +++ QPL + + + G +D PV
Sbjct: 259 ------------------------NQRETNDHGEQPLPMALYVVSGDEDGERGGAPVVCI 294
Query: 189 -NGIKKCY----ALPISPVYDMVKILSSTYNMQTPRY------EFTFPQSLGVIGGKPNH 237
+ ++C L ++ L + RY F FPQSLG++GGKP
Sbjct: 295 EDASRRCSEFSRGLAAWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGA 354
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+ Y IG +LDPH Q + + QD S+YHC ++ + +DPS+A+
Sbjct: 355 STYIIGVQNEKAFYLDPHDVQPVVHINGDAQDPNT---SSYHCNIVRQMPLDSIDPSLAI 411
>gi|113931596|ref|NP_001039246.1| autophagy related 4D, cysteine peptidase [Xenopus (Silurana)
tropicalis]
gi|89273389|emb|CAJ82151.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
gi|114108226|gb|AAI22932.1| APG4 autophagy 4 homolog D (S. cerevisiae) [Xenopus (Silurana)
tropicalis]
Length = 470
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 93/333 (27%), Positives = 153/333 (45%), Gaps = 77/333 (23%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++++ ++D SR+W TYR+ F + + LTTD GWGCM+R GQM++AQ LL L R+W
Sbjct: 95 EIDRFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSREWT 154
Query: 71 WN------------------------------------------VNSKEEAYLKILKMFE 88
W+ E + I+ F
Sbjct: 155 WSEALYTHFVEMEPIRSSSPSSMPLSLATDHSGRHSQPQTHCSRAPYGGEVHQNIVSWFS 214
Query: 89 DRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLV 147
D +AP+ +H++ G+ GK G+W+GP+ VA +++K + + + +V+ D T+
Sbjct: 215 DHASAPFGLHRMVALGSIFGKRAGDWYGPSIVAHIIKKAIESSSEVPDLSVYVSQDCTVY 274
Query: 148 VNQVKKLCTTN--KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDM 205
+++L +S + +++++P RLG + NPVY + +K+ +
Sbjct: 275 KADIEQLFAGEVPHTDTSRGAGKAVIILVPARLGGETFNPVYKHCLKEFLRM-------- 326
Query: 206 VKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYD 265
P LG+IGGKP H+LYFIGY N +++LDPH Q D
Sbjct: 327 -------------------PSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQPY---ID 364
Query: 266 KEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
+D+ L+S +HC +L I MDPS
Sbjct: 365 TSRDN-FPLES-FHCNAPRKLSITRMDPSCTFA 395
>gi|145345840|ref|XP_001417407.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577634|gb|ABO95700.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 348
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 92/288 (31%), Positives = 136/288 (47%), Gaps = 40/288 (13%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
RD SR W TYR+GF +G + TD GWGC LR QM++A AL GR W+ V +K
Sbjct: 27 RDWGSRCWMTYRRGFEALGRTKWRTDAGWGCTLRSAQMMVANALSIHTRGRHWRRQVKAK 86
Query: 77 E--EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL--AKYDD 132
E E+ +L MF D +AP+SIH + T + G G WF P+ + + L A D
Sbjct: 87 EDDESVDHVLSMFIDDASAPFSIHSVCETTTAWGAPPGRWFEPSVMCRAFSALIEANGDL 146
Query: 133 WSSIVFHV--ALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI-QDINPVYIN 189
+ I HV + V + RA S + L+L +PL LG+ ++IN YI+
Sbjct: 147 RNQIAVHVVGGQNEDDSAGGVPTIDDGELRAKSADVGKALLLFVPLVLGVGRNINTRYIS 206
Query: 190 GIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDV 249
++ A F QS+GVIGG+PN +LY +G+ +
Sbjct: 207 QLRSIIA---------------------------FKQSIGVIGGRPNASLYLVGHSDDVF 239
Query: 250 IFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+LDPHT Q +E +Y+C ++ +DP++A+
Sbjct: 240 FYLDPHTVQPANSF------AEAVDFDSYYCSTPLQMRGELLDPTLAL 281
>gi|194696780|gb|ACF82474.1| unknown [Zea mays]
gi|413920008|gb|AFW59940.1| autophagy 4b variant 3 [Zea mays]
Length = 462
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 145/301 (48%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
D +SR+W TYRKGF I S LT+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 117 EDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKP 176
Query: 77 -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
+ Y+++L +F D +SIH + G + G A G W GP + + + L +
Sbjct: 177 YDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQA 236
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+++ ++ V+ D ++ +LC+ + W P++L+IPL
Sbjct: 237 DAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCT--WSPILLLIPL 294
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ F FPQSLG++GGKP
Sbjct: 295 VLGLDKINPRYIPLLKE---------------------------TFKFPQSLGILGGKPG 327
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH ++ D D+ + S+YHC L + +DPS+A
Sbjct: 328 TSTYIAGVQEDRALYLDPH---DVQMAVDIAPDNLEADTSSYHCSVVRDLALEQIDPSLA 384
Query: 297 V 297
+
Sbjct: 385 I 385
>gi|315259988|gb|ADT92194.1| autophagy-related 4b [Zea mays]
Length = 595
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 145/300 (48%), Gaps = 52/300 (17%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK- 76
D +SR+W TYRKGF I S LT+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 148 DFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKPY 207
Query: 77 EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY------ 130
+ Y+++L +F D +SIH + G + G A G W GP + + + L +
Sbjct: 208 DPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQAD 267
Query: 131 -----DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLR 177
+++ ++ V+ D ++ +LC+ + W P++L+IPL
Sbjct: 268 AVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCT--WSPILLLIPLV 325
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG+ INP YI +K+ F FPQSLG++GGKP
Sbjct: 326 LGLDKINPRYIPLLKE---------------------------TFKFPQSLGILGGKPGT 358
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+ Y G + ++LDPH ++ D D+ + S+YHC L + +DPS+A+
Sbjct: 359 STYIAGVQEDRALYLDPH---DVQMAVDIAPDNLEADTSSYHCSVVRDLALEQIDPSLAI 415
>gi|221137006|ref|NP_001137489.1| autophagy-related 4b [Zea mays]
gi|194701156|gb|ACF84662.1| unknown [Zea mays]
gi|195657359|gb|ACG48147.1| cysteine protease ATG4B [Zea mays]
gi|216963250|gb|ACJ73914.1| autophagy-related 4b variant 1 [Zea mays]
gi|413920007|gb|AFW59939.1| autophagy 4b variant 1Cysteine protease ATG4B [Zea mays]
Length = 492
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 145/301 (48%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
D +SR+W TYRKGF I S LT+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 147 EDFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSEKP 206
Query: 77 -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
+ Y+++L +F D +SIH + G + G A G W GP + + + L +
Sbjct: 207 YDPDYIRVLHLFGDSEACAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWQTLIRTNREQA 266
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+++ ++ V+ D ++ +LC+ + W P++L+IPL
Sbjct: 267 DAVDGKENFPMALYVVSGDEDGERGGAPVFCIDVAAQLCSNFNKGQCT--WSPILLLIPL 324
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ F FPQSLG++GGKP
Sbjct: 325 VLGLDKINPRYIPLLKE---------------------------TFKFPQSLGILGGKPG 357
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH ++ D D+ + S+YHC L + +DPS+A
Sbjct: 358 TSTYIAGVQEDRALYLDPH---DVQMAVDIAPDNLEADTSSYHCSVVRDLALEQIDPSLA 414
Query: 297 V 297
+
Sbjct: 415 I 415
>gi|343428793|emb|CBQ72338.1| related to ATG4-essential for autophagy [Sporisorium reilianum SRZ2]
Length = 1505
Score = 137 bits (345), Expect = 7e-30, Method: Composition-based stats.
Identities = 89/339 (26%), Positives = 147/339 (43%), Gaps = 107/339 (31%)
Query: 37 SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW--------------------------- 69
+GLTTD GWGCMLR GQ ++A AL+ +HLGR W
Sbjct: 783 AGLTTDSGWGCMLRTGQSLLANALINVHLGRSWMREAPPARQLEFLQELANLSLDTSAEK 842
Query: 70 ----QW-NVNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
+W ++ Y+KIL F D + P+ +H++A G GK VGEWFGP+T A
Sbjct: 843 QSLLEWRQKRARHSTYIKILSWFLDDPSPACPFGVHRMAREGKRLGKEVGEWFGPSTAAG 902
Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC---------------TTNKRASSNPQW 167
+++L + + +A D +++V+ T ++ + W
Sbjct: 903 AIKQLVSEFPDAGLAVELAHDGVFYLDEVRAAAGASRQLGKGRASATGTNGRKGDTALTW 962
Query: 168 -QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
+P++++I +RLG+ +NP+Y +K F+FP
Sbjct: 963 HKPVLILIGIRLGLDSVNPIYYESVKAT---------------------------FSFPH 995
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ--------------------NIGCVYD- 265
S+G+ GG+P+ + YF+G+ GN + +LDPH + +I +
Sbjct: 996 SVGIAGGRPSSSYYFMGHQGNSLFYLDPHNVRPAVALRFPPSTFPAAVPRQLDIAHRFAF 1055
Query: 266 KEQDSEKKL---------DSTYHCPQASRLHILHMDPSI 295
+E D E + ST+HC + R+ I +DPS+
Sbjct: 1056 EEHDDEDEWWSHAYTEAQTSTFHCDKVRRMPIKSLDPSM 1094
>gi|210063823|gb|ACJ06587.1| ATG4 protein [Magnaporthe oryzae]
Length = 491
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 96/311 (30%), Positives = 138/311 (44%), Gaps = 66/311 (21%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR GF I S G TTD GWGCM+R GQ
Sbjct: 154 DFESRIWMTYRSGFESIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRTGQS 213
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A +LL LGR W+ EE K+L +F D APYSIH GA++ GK GE
Sbjct: 214 LLANSLLTCRLGRSWRRGQAPDEE--RKLLSLFADDPRAPYSIHNFVAHGAAKCGKYPGE 271
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ + LA + S V+ + + ++ + + + P +++
Sbjct: 272 WFGPSATARCIHALANATENSFRVYSTGDLPDVYEDSFMEVAKPDGKT-----FHPTLIL 326
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
I RLGI IN VY + L PQS+G+ GG
Sbjct: 327 ISTRLGIDKINQVYWESLTATLQL---------------------------PQSVGIAGG 359
Query: 234 KPNHALYFIGYVGND------VIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
+P+ + YF+G +D + +LDP HT + D + + +DS H + RL
Sbjct: 360 RPSSSHYFVGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSC-HTRRLRRL 418
Query: 287 HILHMDPSIAV 297
HI MDPS+ +
Sbjct: 419 HIREMDPSMLI 429
>gi|90399070|emb|CAJ86292.1| H0124B04.9 [Oryza sativa Indica Group]
Length = 1216
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 98/331 (29%), Positives = 144/331 (43%), Gaps = 79/331 (23%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
D +SR+W TYR+GF I DS T+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 405 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKP 464
Query: 77 -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
Y+ IL MF D +SIH + G S G A G W GP + + + L +
Sbjct: 465 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 524
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+ + ++ V+ D + ++ +LC + S W P++L++PL
Sbjct: 525 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 582
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ FTFPQSLG++GGKP
Sbjct: 583 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 615
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCV------------------------------YDK 266
+ Y G + ++LDPH Q V D
Sbjct: 616 TSTYIAGVQDDRALYLDPHEVQMSATVIIWLFLQYPFYAWNPFCYGSYSGVFSTSQAVDI 675
Query: 267 EQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
D+ + S+YHC L + +DPS+A+
Sbjct: 676 AADNIEADTSSYHCSTVRDLALDLIDPSLAI 706
>gi|302684483|ref|XP_003031922.1| hypothetical protein SCHCODRAFT_109321 [Schizophyllum commune H4-8]
gi|300105615|gb|EFI97019.1| hypothetical protein SCHCODRAFT_109321, partial [Schizophyllum
commune H4-8]
Length = 602
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 141/281 (50%), Gaps = 66/281 (23%)
Query: 18 DITSRLWFTYRKGFV-------------------------------PIGDSGLTTDKGWG 46
D +R+W TYR GF P G G ++D GWG
Sbjct: 132 DFATRIWLTYRSGFELIRDRQLIDLPPPVASLDGHLQGEWATDEAEPPGAYGFSSDSGWG 191
Query: 47 CMLRCGQMVIAQALLFLHLGRDWQW--NVNSKEEA-YLKILKMFED--RRTAPYSIHQIA 101
CMLR GQ ++A ALL GRDW+ V + + + Y+ +L +F D TAP+SIH++A
Sbjct: 192 CMLRTGQSLLANALLTAWFGRDWRRISEVETHQHSLYVHLLSLFLDTPHPTAPFSIHRMA 251
Query: 102 LTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT---N 158
L G GK +G+WFGP+T A ++ L + I V +D L ++V + +
Sbjct: 252 LAGKQLGKDIGQWFGPSTAAGAIKNLVSAYPLAGIGVVVGMDGALSKSEVFTASHSEWSD 311
Query: 159 KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
+ A+ + +P+++++ LRLG+ +NP+Y +D +K L
Sbjct: 312 EEAALDWGDRPVLILLNLRLGLDRVNPIY---------------HDTIKAL--------- 347
Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
FTFPQS+G+ GG+P + +F+G G+D+I+LDPH +N
Sbjct: 348 ---FTFPQSVGIAGGRPCSSYHFVGAQGSDLIYLDPHHTRN 385
>gi|325184648|emb|CCA19140.1| cysteine protease family C54 putative [Albugo laibachii Nc14]
Length = 459
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 97/328 (29%), Positives = 141/328 (42%), Gaps = 91/328 (27%)
Query: 21 SRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE--- 77
S LW+TYR+ F + T+D GWGCMLR QM++++A LG W+ S++
Sbjct: 102 SILWYTYRRDFETMVPYDFTSDAGWGCMLRSAQMLLSEAFKRNMLGIKWKIPARSEDLEL 161
Query: 78 -EAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL----AKY 130
+ Y+K+LK F D YSIH I G K GEW+GP T AQ LR L A+
Sbjct: 162 PKVYVKLLKWFVDSFDTECKYSIHNITRIGMQYDKLPGEWYGPTTAAQALRDLVNLHAQE 221
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------------KRA 161
++V +V D + V +LC ++ R
Sbjct: 222 SPECNLVMYVPQDGVVYTKDVNELCISHLDQENTFVNVNEETQSDGTFPDPLLHPPTDRD 281
Query: 162 SSNPQWQ-PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
+S WQ L+++IPLRLG+ INP Y+ I++
Sbjct: 282 NSEKMWQKSLLILIPLRLGLDSINPRYLPAIQRV-------------------------- 315
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
F FPQ++G+IGGK H++YF+G + + LDPH D D
Sbjct: 316 -FEFPQNVGIIGGKKGHSVYFVGTFDSKLQLLDPH-------------DIHPTADLNTAF 361
Query: 281 PQASRLHILH-----------MDPSIAV 297
P A+ L +H +DPS+A+
Sbjct: 362 PTATHLRTVHSRLPLEMSLGSIDPSLAL 389
>gi|189515077|ref|XP_001333093.2| PREDICTED: cysteine protease ATG4D-like [Danio rerio]
Length = 485
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 160/363 (44%), Gaps = 101/363 (27%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+ S +W TYR+ F + S LTTD GWGCMLR GQM++AQ LL + DW+
Sbjct: 95 EVERFRQTFVSCVWLTYRREFPQLDGSSLTTDCGWGCMLRSGQMMLAQGLLLHLMPTDWR 154
Query: 71 W-NVNSKEEAYLKILK-------------------------------MFEDRRTAP---- 94
W + ++ + ++LK + E R AP
Sbjct: 155 WSDCHALTDVDFEVLKPRSPSRPAGMSMPSFSSSWSSSIPQINPSPGITEAHRRAPARCP 214
Query: 95 ------------------YSIHQIALTGASE----GKAVG----EWFGPNTVAQVLRK-L 127
+ H A G + GK G +W+GP+ VA +LRK +
Sbjct: 215 SASPDPQVDALHRKVVSCFGDHPSAPFGVHQLVELGKESGKRAGDWYGPSVVAHMLRKAV 274
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
A+ ++ + +VA D T+ V LC SS W+ +V+++P+RLG + +NP Y
Sbjct: 275 ARAAEFEDLAVYVAQDCTVYKEDVMSLCE-----SSGVGWKSVVILVPVRLGGESLNPSY 329
Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
I +K L +G+IGGKP H+L+F+G+
Sbjct: 330 IECVKNILKLKC---------------------------CIGIIGGKPKHSLFFVGFQDE 362
Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDY 306
+++LDPH Q + V E ++HC +++ MDPS + + RS +D+
Sbjct: 363 QLLYLDPHYCQPVVDVTQANFSLE-----SFHCNSPRKMNFSRMDPSCTIGLYARSKTDF 417
Query: 307 KNV 309
+++
Sbjct: 418 ESL 420
>gi|384253649|gb|EIE27123.1| peptidase C54 [Coccomyxa subellipsoidea C-169]
Length = 362
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 93/311 (29%), Positives = 146/311 (46%), Gaps = 36/311 (11%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q L I D+ SR+W TYR+GF PI SG+T+D GWGC LR GQM++AQAL++ +GR W
Sbjct: 15 QVLNAILSDLMSRIWMTYRRGFPPICGSGITSDVGWGCTLRSGQMLLAQALVYHLVGRQW 74
Query: 70 QWNVNSK-EEAYLKILKMFEDRRTA--PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
+ + + E ++L+ F D+ P+SIH + TG + G G+W GP+ + L
Sbjct: 75 RRKLEAAYPEEVAQVLQWFGDQACEQRPFSIHNMCTTGQTHGVKAGDWLGPSGLCHTLAD 134
Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPV 186
+ + V LCT+ + G D +
Sbjct: 135 MVNKVQPGGLQCRVV---ATFGGGAPVLCTSRLATAFE--------------GGADRSGG 177
Query: 187 YINGIKKCYALPISPVYDMVKILSSTYNMQ-TPRY------EFTFPQSLGVIGGKPNHAL 239
+ + P ++ L N + PRY T+PQS+G++GG+P+ +L
Sbjct: 178 EVGSSGSEESGPAGQGLLLLIPLMLGLNGKINPRYCAQLQQLLTWPQSVGIVGGRPSSSL 237
Query: 240 YFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-V 298
YFIG V++LDPH Q + SE TY C + + ++DPS+A+
Sbjct: 238 YFIGLQDQHVLYLDPHEVQEVA--------SEAADLDTYFCSSLRLMPLANIDPSLAIGF 289
Query: 299 SQRSYSDYKNV 309
S SD++++
Sbjct: 290 YCSSLSDFEDL 300
>gi|367032280|ref|XP_003665423.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
42464]
gi|347012694|gb|AEO60178.1| hypothetical protein MYCTH_2067869 [Myceliophthora thermophila ATCC
42464]
Length = 456
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 145/305 (47%), Gaps = 60/305 (19%)
Query: 18 DITSRLWFTYRKGF--VP--------------------IGD-SGLTTDKGWGCMLRCGQM 54
D SR+W TYR GF +P +GD +G T+D GWGCM+R GQ
Sbjct: 125 DFESRIWMTYRTGFELIPRSTDPRATSSFSIAMRLKTTLGDQTGFTSDTGWGCMIRSGQS 184
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A ALL LGRDW+ + +A IL +F D APYS+H G + GK GE
Sbjct: 185 LLANALLISRLGRDWRRMTDP--DAERPILALFADDSRAPYSLHNFVKHGELACGKYPGE 242
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ LA + S V+ + V + + T + + P +++
Sbjct: 243 WFGPSATARCIQALANKHESSLRVYSTG--DLPDVYEDSFMATAKPDGET---FHPTLIL 297
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI IN VY V+ L ST M+ QS+G+ GG
Sbjct: 298 VCTRLGIDKINQVY------------------VEALISTLQME---------QSIGIAGG 330
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHILHMD 292
+P + YF+G G + +LDPH + + D + ++LDS H + RLH+ MD
Sbjct: 331 RPASSHYFVGVQGQWLFYLDPHHPRPKLPYRENPDDYTSEELDSC-HTRRLRRLHVEDMD 389
Query: 293 PSIAV 297
PS+ +
Sbjct: 390 PSMLI 394
>gi|260949671|ref|XP_002619132.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
gi|238846704|gb|EEQ36168.1| hypothetical protein CLUG_00291 [Clavispora lusitaniae ATCC 42720]
Length = 340
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/334 (31%), Positives = 154/334 (46%), Gaps = 85/334 (25%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPI-----GDSGL-------------------------TT 41
LE+I I SRLWFTYR GF PI G S L +T
Sbjct: 52 LEEIYPVINSRLWFTYRAGFEPIQKAEDGPSPLAFLKSMIFNVRPSMALGGLFDNQNYST 111
Query: 42 DKGWGCMLRCGQMVIAQALLFLHLGRDWQ--WNVNSKEEAYLKILKMFEDRRTAPYSIHQ 99
D GWGCM+R Q ++A AL L LGRD Q + S E KI+++F D T P+S+H
Sbjct: 112 DVGWGCMIRTSQSLLANALQMLILGRDHQSPQAIQSAPEKVEKIIQLFGDDYTCPFSLHN 171
Query: 100 -IALTGASEGKA-VGEWFGPNTVAQVLRKL-AKYD--DWSSIVFHVALDNTLVVNQVKKL 154
I + AS K GEWFGP+ + +++L AK++ + +I + L +++ +
Sbjct: 172 FIKVASASPLKVKPGEWFGPSAASLSIKRLCAKFESNEIPNINVSICESCNLYDEEIRGI 231
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
++ PL+++ PLRLGI IN +Y + + AL
Sbjct: 232 FEESES--------PLLILFPLRLGIDKINSIYYPSLLQLLALK---------------- 267
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
QS+G+ GGKP+ + YF G+ G+++++LDPH Q +
Sbjct: 268 -----------QSVGIAGGKPSSSYYFFGFQGSNLLYLDPHNLQ-----------AASSD 305
Query: 275 DSTYHCPQASRLHILHMDPSIAV--VSQRSYSDY 306
TYH + L I ++DP A V+Q +Y DY
Sbjct: 306 PGTYHTSKFQTLSISNLDPLNACWSVNQMTYDDY 339
>gi|67526025|ref|XP_661074.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
gi|40743824|gb|EAA63010.1| hypothetical protein AN3470.2 [Aspergillus nidulans FGSC A4]
Length = 379
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 98/304 (32%), Positives = 144/304 (47%), Gaps = 58/304 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S++W TYR F PI G T+D GWGCM+R GQ
Sbjct: 53 DFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMIRSGQS 112
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A ++ L LGRDW+ +EE K+L +F D AP+SIH GA GK GE
Sbjct: 113 LLANSMAILLLGRDWRRGERLEEEG--KLLSLFADSPHAPFSIHSFVKHGADFCGKHPGE 170
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP A+ ++ LA D S++ ++A DN+ V+Q K + + + +P +++
Sbjct: 171 WFGPTATARCIQGLAARYDQSNLQVYIADDNS-DVHQDKFMSVSRDEKGTV---RPTLIL 226
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ LRLGI I VY NG+K L PQS+G+ GG
Sbjct: 227 LGLRLGIDRITAVYWNGLKAVLQL---------------------------PQSVGIAGG 259
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YF+ G+ +LDPH N Y + + +TYH + RL+I MDP
Sbjct: 260 RPSASHYFVAVQGSHFFYLDPH-NTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDP 318
Query: 294 SIAV 297
S+ +
Sbjct: 319 SMLI 322
>gi|440638438|gb|ELR08357.1| hypothetical protein GMDG_03152 [Geomyces destructans 20631-21]
Length = 448
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 90/305 (29%), Positives = 144/305 (47%), Gaps = 59/305 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S+L F+YR GF I S G ++D GWGCM+R GQ
Sbjct: 111 DFESKLRFSYRTGFPVIPRSEDPKASSTMSFSVRLRSQLSDQGGFSSDTGWGCMIRSGQS 170
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A +++ L L R W+ V +E +I+ +F D APYSIH+ GA GK G+
Sbjct: 171 LLANSMVILRLSRGWRRGVGRDKE--REIVSLFADDPRAPYSIHKFVEHGAEACGKYPGQ 228
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ +++LAK + + + ++ D + V + K N ++P +++
Sbjct: 229 WFGPSATARCIQELAKRHESADVRVYITGDGSDVYKD--GFMSVAKPDGVN--FKPTLIL 284
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI + PVY +K + PQS+G+ GG
Sbjct: 285 VGTRLGIDKVTPVYWEALKASLQM---------------------------PQSVGIAGG 317
Query: 234 KPNHALYFIGYVGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YF+G G+ +LDPH T I D ++ + ++DS H + RL I MD
Sbjct: 318 RPSSSHYFVGVQGSHFFYLDPHQTMAAIPFHTDVDEYTPAEIDSC-HTRRLRRLDIKEMD 376
Query: 293 PSIAV 297
PS+ +
Sbjct: 377 PSMLI 381
>gi|448114689|ref|XP_004202639.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
gi|359383507|emb|CCE79423.1| Piso0_001485 [Millerozyma farinosa CBS 7064]
Length = 480
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 151/342 (44%), Gaps = 90/342 (26%)
Query: 13 EQIRRDITSRLWFTYRKGFVPI----------------------------------GDSG 38
E++ +DI SR+WFTYR GF PI +
Sbjct: 79 EEMEKDIYSRIWFTYRTGFEPIPKDEDGPQPLSFVHSMIFNKNPIPSALDNIHGLFNNQN 138
Query: 39 LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
TTD GWGCM+R QM++A A L LGRD+ + V+ E+ + I+ MF D P+S+H
Sbjct: 139 FTTDVGWGCMIRTSQMLLANAFQLLLLGRDFAY-VDGSEKKHSDIIDMFTDEPKTPFSLH 197
Query: 99 QIALTGASEGKAV--GEWFGPNTVAQVLRKLAK--YDDWSSIVFHVALDNTLVV--NQVK 152
+ V GEWFGPN + +++L K +D S F V + + + +++
Sbjct: 198 NFIKAASDSPLKVKPGEWFGPNAASISIKRLCKSQFDGSVSPSFRVIISESCDIYDDKIG 257
Query: 153 KLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
KL + + +++++P+RLG+ ++P Y +D + L
Sbjct: 258 KLLQEIENSE-----DAILILLPVRLGLNKVSPYY---------------HDSLSSL--- 294
Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI--GCVYDKEQDS 270
F Q +G+ GGKP+ + YF G +++LDPH Q++ +YD
Sbjct: 295 ---------FCSSQLVGIAGGKPSSSYYFFGSHNGHLLYLDPHYPQSMKASSIYD----- 340
Query: 271 EKKLDSTYHCPQASRLHILHMDPS----IAVVSQRSYSDYKN 308
T+H + L I MDPS I + S+ Y +K+
Sbjct: 341 ------TFHTNKVQSLKIEDMDPSMLIGILIKSKEDYESFKD 376
>gi|432845798|ref|XP_004065858.1| PREDICTED: cysteine protease ATG4D-like [Oryzias latipes]
Length = 497
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 167/370 (45%), Gaps = 93/370 (25%)
Query: 1 MRHANKLSHQD-LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQA 59
+ HA L+ +D +E+ R D SR+W TYR+ F + S LTTD GWGCMLR GQM++AQ
Sbjct: 95 LGHAYLLNSEDEVERFRLDFVSRIWLTYRREFPQLEGSTLTTDCGWGCMLRSGQMLLAQG 154
Query: 60 LLFLHLGRDWQW-NVNSKEEAYLKILK--------------------------------- 85
LL + DW W + + + +I +
Sbjct: 155 LLLHLMPPDWTWPDAHQLTDVDFEIFRPRSPVRAAGVPIPSFGAPRASTTPEKSCSSSQK 214
Query: 86 ----MFEDRRTAPYSIHQIALTG----------------ASEGKAVGEWFGPNTVAQVLR 125
DR+ P + L G GK G+W+GP+ VA +LR
Sbjct: 215 KKTESSRDRQAEPTHQKLVTLFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGPSVVAHILR 274
Query: 126 K-LAKYDDWSSIVFHVALDNTLVVNQVKKLC--TTNKRAS--SNPQWQPLVLVIPLRLGI 180
K +AK S+ +VA D T+ V +LC + ++R + S+ W+ +++++P+RLG
Sbjct: 275 KAVAKTSVGQSLAVYVAQDCTVYKEDVLQLCDPSLSQRVADPSSQAWKSVIILVPVRLGG 334
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
+ +NP YI +K +L +G+IGGKP H+LY
Sbjct: 335 EALNPSYIECVKNILSLDC---------------------------CIGIIGGKPKHSLY 367
Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VS 299
FIG+ +++LDPH Q V D Q + L+S +HC ++ MDPS +
Sbjct: 368 FIGFQDEQLLYLDPHYCQP---VVDFTQ-ANFSLES-FHCSSPKKMPFSRMDPSCTIGFY 422
Query: 300 QRSYSDYKNV 309
R+ D++++
Sbjct: 423 ARTKEDFESM 432
>gi|357528776|sp|Q5B7L0.2|ATG4_EMENI RecName: Full=Cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|259485537|tpe|CBF82642.1| TPA: Cysteine protease atg4 (EC 3.4.22.-)(Autophagy-related protein
4) [Source:UniProtKB/Swiss-Prot;Acc:Q5B7L0] [Aspergillus
nidulans FGSC A4]
Length = 402
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 98/304 (32%), Positives = 144/304 (47%), Gaps = 58/304 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S++W TYR F PI G T+D GWGCM+R GQ
Sbjct: 76 DFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMIRSGQS 135
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A ++ L LGRDW+ +EE K+L +F D AP+SIH GA GK GE
Sbjct: 136 LLANSMAILLLGRDWRRGERLEEEG--KLLSLFADSPHAPFSIHSFVKHGADFCGKHPGE 193
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP A+ ++ LA D S++ ++A DN+ V+Q K + + + +P +++
Sbjct: 194 WFGPTATARCIQGLAARYDQSNLQVYIADDNS-DVHQDKFMSVSRDEKGTV---RPTLIL 249
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ LRLGI I VY NG+K L PQS+G+ GG
Sbjct: 250 LGLRLGIDRITAVYWNGLKAVLQL---------------------------PQSVGIAGG 282
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YF+ G+ +LDPH N Y + + +TYH + RL+I MDP
Sbjct: 283 RPSASHYFVAVQGSHFFYLDPH-NTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDP 341
Query: 294 SIAV 297
S+ +
Sbjct: 342 SMLI 345
>gi|405119256|gb|AFR94029.1| peptidase family C54 protein [Cryptococcus neoformans var. grubii
H99]
Length = 1185
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 86/267 (32%), Positives = 125/267 (46%), Gaps = 85/267 (31%)
Query: 38 GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW----------QWNVNSKEEA------YL 81
GLT+D GWGCMLR GQ ++ AL+ +HLGRDW + N + A Y
Sbjct: 559 GLTSDAGWGCMLRTGQSLLVNALIHVHLGRDWRVPSTPASFSEATTNQETAALKDYAKYA 618
Query: 82 KILKMFEDRRTA--PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK---------- 129
++L F D + P+S+H++AL G GK VGEWFGP+T A L+ LA
Sbjct: 619 QMLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANSFAPCGVAVA 678
Query: 130 -------------------YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW--Q 168
DDW+SI + N KK + +A +W +
Sbjct: 679 TATDSIIYKSDVYTASNLPSDDWNSI--------SPTFNSSKKKRGGDNKAKEG-KWGKR 729
Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
+++++ +RLG+ +NP+Y YD +K L FTFPQS+
Sbjct: 730 AVLILVGIRLGLDGVNPIY---------------YDSIKAL------------FTFPQSV 762
Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPH 255
G+ GG+P+ + YFIG N + +LDPH
Sbjct: 763 GIAGGRPSSSYYFIGSQANHLFYLDPH 789
>gi|189091768|ref|XP_001929717.1| hypothetical protein [Podospora anserina S mat+]
gi|188219237|emb|CAP49217.1| unnamed protein product [Podospora anserina S mat+]
Length = 508
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/308 (30%), Positives = 144/308 (46%), Gaps = 66/308 (21%)
Query: 18 DITSRLWFTYRKGF--VP--------------------IGD-SGLTTDKGWGCMLRCGQM 54
D SR+W TYR GF +P GD +G ++D GWGCM+R GQ
Sbjct: 176 DFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRSGQS 235
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A A+L GR W+ N E +I+ +F D APYSI GA+ GK GE
Sbjct: 236 LLANAMLISRAGRAWRRTTNPDIE--REIVCLFADDPRAPYSIQNFVNHGAAACGKYPGE 293
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP---QWQPL 170
WFGP+ A+ ++ LAK D S V+ + + ++ N +++NP + P
Sbjct: 294 WFGPSATARCIQALAKKHDSSLRVY--------LTRDLPEVYEDNFMSTANPDGNHFHPT 345
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
++++ RLGI INP+Y + L PQ++G+
Sbjct: 346 LILVSTRLGIDKINPIYHEALISTLQL---------------------------PQAIGI 378
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHIL 289
GG+P+ + YFIG G + +LDPH + + D + ++LDS H + LH+
Sbjct: 379 AGGRPSSSHYFIGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSC-HTRRLRHLHVE 437
Query: 290 HMDPSIAV 297
MDPS+ +
Sbjct: 438 DMDPSMLI 445
>gi|355703136|gb|EHH29627.1| Cysteine protease ATG4D [Macaca mulatta]
Length = 511
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 141/327 (43%), Gaps = 85/327 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 151 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 210
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + +E + +I+ F D AP+
Sbjct: 211 WAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 270
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
+H++ G S GK G+W+GP+ VA +LRK + T +V V + C
Sbjct: 271 GLHRLVELGQSSGKKAGDWYGPSLVAHILRK----------AVESCSEVTRLVVYVSQDC 320
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
T + +S + D P S ++ +L
Sbjct: 321 TAAEASSP----------------VSDT--------------PASGPLHLLPLLLGVLFQ 350
Query: 216 QTPRYEFTFPQ-----SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDS 270
Q R+ F LG++GGKP H+LYFIGY + +++LDPH Q D Q +
Sbjct: 351 QRCRWLFVCELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQ-A 406
Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
+ L+S +HC ++ MDPS V
Sbjct: 407 DFPLES-FHCTSPRKMAFAKMDPSCTV 432
>gi|409077121|gb|EKM77488.1| hypothetical protein AGABI1DRAFT_108018 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1355
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 91/297 (30%), Positives = 136/297 (45%), Gaps = 86/297 (28%)
Query: 18 DITSRLWFTYRKGFV-PIGDSGLT------------------------------------ 40
D SR+W TYR F PI DS LT
Sbjct: 337 DFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGGEKS 396
Query: 41 --TDKGWGCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT-- 92
+D GWGCMLR GQ ++A AL+ +HLGRDW+ + V + + A Y++IL F D +
Sbjct: 397 WSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTPSPD 456
Query: 93 APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
AP+S+H++AL G G VG+WFGP+ A +++L S + VA D L V
Sbjct: 457 APFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRLVNEFPRSGVGVSVAKDGVLSQTDVF 516
Query: 153 KLCTTNKRASSNP------------QW--QPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
+ ++ W +P+++++ LRLGI +NP+Y
Sbjct: 517 LASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVNPIY----------- 565
Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
Y+ +K L FT PQS+G+ GG+P + YF+G +++ +LDPH
Sbjct: 566 ----YETIKTL------------FTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPH 606
>gi|426191859|gb|EKV41798.1| hypothetical protein AGABI2DRAFT_123279 [Agaricus bisporus var.
bisporus H97]
Length = 1261
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 91/297 (30%), Positives = 136/297 (45%), Gaps = 86/297 (28%)
Query: 18 DITSRLWFTYRKGFV-PIGDSGLT------------------------------------ 40
D SR+W TYR F PI DS LT
Sbjct: 250 DFVSRIWLTYRSHFSQPIKDSTLTGLCASQPPSAVNDAASTTTTSGSPSKSRWHWGGEKS 309
Query: 41 --TDKGWGCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT-- 92
+D GWGCMLR GQ ++A AL+ +HLGRDW+ + V + + A Y++IL F D +
Sbjct: 310 WSSDTGWGCMLRTGQSLLANALIHVHLGRDWRKPPYPVYTSDYATYVQILTWFFDTPSPD 369
Query: 93 APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
AP+S+H++AL G G VG+WFGP+ A +++L S + VA D L V
Sbjct: 370 APFSVHRMALAGKEFGTDVGQWFGPSVAAGAVKRLVNEFPRSGVGVSVAKDGVLSQTDVF 429
Query: 153 KLCTTNKRASSNP------------QW--QPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
+ ++ W +P+++++ LRLGI +NP+Y
Sbjct: 430 LASHADSSTTTRTHSKSTSSTSQALHWGDRPVLILVGLRLGIDGVNPIY----------- 478
Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
Y+ +K L FT PQS+G+ GG+P + YF+G +++ +LDPH
Sbjct: 479 ----YETIKTL------------FTLPQSVGIAGGRPGSSYYFVGSQADNLFYLDPH 519
>gi|403413274|emb|CCL99974.1| predicted protein [Fibroporia radiculosa]
Length = 994
Score = 134 bits (338), Expect = 4e-29, Method: Composition-based stats.
Identities = 92/283 (32%), Positives = 142/283 (50%), Gaps = 72/283 (25%)
Query: 18 DITSRLWFTYRKGFVPI--------------------------------GDSGLTTDKGW 45
D TSR+W TYR F PI G+ G T+D GW
Sbjct: 312 DFTSRIWLTYRSQFEPIRDTSLSALNYDMDERAAPTSSPQPKRWNWGLGGEKGWTSDSGW 371
Query: 46 GCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSIHQ 99
GCMLR GQ ++A ALL LHLGRDW+ + + + + A Y++I+ F D + P+S+H+
Sbjct: 372 GCMLRTGQSLLANALLHLHLGRDWRRPPYPIYTADFATYVQIISWFLDDPSPLCPFSVHR 431
Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT-- 157
+AL G GK VG+WFGP+T A ++ L + + VA+D + + V + +
Sbjct: 432 MALVGKELGKDVGQWFGPSTAAGAIKTLVHAFPEAGLGVSVAVDGVIYQSDVYAVSRSTM 491
Query: 158 ---NKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
+ R P W + ++++I +RLGI +NP+Y YD++K L
Sbjct: 492 GLGSPRKHGRPSWGDRAVLVLIGIRLGIDGVNPIY---------------YDLIKAL--- 533
Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
+T PQ+LG+ GG+P+ + YF+G N++ +LDPH
Sbjct: 534 ---------YTLPQTLGIAGGRPSSSYYFVGSQANNLFYLDPH 567
>gi|396482697|ref|XP_003841525.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
gi|312218100|emb|CBX98046.1| similar to autophagy-related protein 4 [Leptosphaeria maculans JN3]
Length = 462
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 80/263 (30%), Positives = 122/263 (46%), Gaps = 53/263 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
D SR+W TYR GF I S G T+D G+GCM+R GQ ++
Sbjct: 99 DFESRIWMTYRSGFPVIQKSQDPKATSAMSFRVRMQNLASPGFTSDTGFGCMIRSGQCIL 158
Query: 57 AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
A AL L LGRDW++ + + + IL +F D AP+SIH+ GA+ GK GEWF
Sbjct: 159 ANALQTLRLGRDWRYQDDPTAQEHCNILSLFADDPQAPFSIHRFVEHGAAVCGKYPGEWF 218
Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
GP+ A+ ++ L + + +V+ D V K + +W P ++++
Sbjct: 219 GPSAAARCIQDLVHKYKEAGLRVYVSGDGADVYEDKLKQVAVEEDG----EWIPTLILVG 274
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
RLGI I PVY +K ++ M+ QS+G+ GG+P
Sbjct: 275 TRLGIDKITPVYWEALK------------------ASLQMK---------QSMGIAGGRP 307
Query: 236 NHALYFIGYVGNDVIFLDPHTNQ 258
+ + YF+ N +LDPH+ +
Sbjct: 308 SASHYFVATQANHFFYLDPHSTR 330
>gi|395323681|gb|EJF56143.1| hypothetical protein DICSQDRAFT_113447 [Dichomitus squalens
LYAD-421 SS1]
Length = 999
Score = 133 bits (335), Expect = 9e-29, Method: Composition-based stats.
Identities = 92/284 (32%), Positives = 139/284 (48%), Gaps = 73/284 (25%)
Query: 18 DITSRLWFTYRKGFVPI---------------------------------GDSGLTTDKG 44
D TSR+W TYR F PI G+ G T+D G
Sbjct: 306 DFTSRIWLTYRSQFFPIRDTTLAALEQEVHDSPTGLPSSPPSKRWNWPIGGEKGWTSDAG 365
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA----YLKILKMFEDRRT--APYSIH 98
WGCMLR GQ ++A ALL LHLGRDW+ + A Y++I+ F D + P+S+H
Sbjct: 366 WGCMLRTGQSLLANALLHLHLGRDWRRPPHPVYTADYAMYVQIVTWFLDTPSPLCPFSVH 425
Query: 99 QIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV-----KK 153
++AL G GK VG+WFGP+T A ++ L + + VA D+TL + V
Sbjct: 426 RMALVGKDLGKEVGQWFGPSTAAGAIKTLVHSFPDAGLGVAVASDSTLYESDVYAASRSS 485
Query: 154 LCTTNKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
+ +T + +W + ++++I +RLGI+ +NP+Y N IK Y
Sbjct: 486 VYSTRRHGHPRMEWGDRAVLILIGIRLGIEGVNPLYYNTIKTLY---------------- 529
Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
TFPQ++G+ GG+P+ + YF+G +++ +LDPH
Sbjct: 530 -----------TFPQTVGIAGGRPSSSYYFVGSQADNLFYLDPH 562
>gi|367047453|ref|XP_003654106.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
gi|347001369|gb|AEO67770.1| hypothetical protein THITE_2116815 [Thielavia terrestris NRRL 8126]
Length = 454
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 93/307 (30%), Positives = 141/307 (45%), Gaps = 64/307 (20%)
Query: 18 DITSRLWFTYRKGF--VP--------------------IGD-SGLTTDKGWGCMLRCGQM 54
D SR+W TYR GF +P GD +G ++D GWGCM+R GQ
Sbjct: 121 DFESRIWMTYRTGFELIPRSTDPRANSALSFAMRLKTSFGDQTGFSSDTGWGCMIRSGQS 180
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL LGRDW+ + +A +IL +F D APYS+H GA+ GK GE
Sbjct: 181 LLANALQISRLGRDWRRATDP--DAEREILSLFADDPRAPYSLHNFVKHGAAACGKYPGE 238
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP---QWQPL 170
WFGP+ A+ + LA + S V+ + + + A +NP + P
Sbjct: 239 WFGPSATARCIEALANQHESSLRVYS--------TGDLPDVYEDSFMAVANPDGEHFHPT 290
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
++++ RLGI IN VY + L ST M+ QS+G+
Sbjct: 291 LILVCTRLGIDKINQVY------------------EEALISTLQME---------QSIGI 323
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILH 290
GG+P+ + YF+G G + +LDPH + + +D + + H + LH+
Sbjct: 324 AGGRPSSSHYFVGVQGQWLFYLDPHHPRPALPYREAPEDYTSEELGSCHTRRLRHLHVED 383
Query: 291 MDPSIAV 297
MDPS+ +
Sbjct: 384 MDPSMLI 390
>gi|294654609|ref|XP_456671.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
gi|218511938|sp|Q6BYP8.2|ATG4_DEBHA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|199429011|emb|CAG84627.2| DEHA2A07832p [Debaryomyces hansenii CBS767]
Length = 492
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 93/341 (27%), Positives = 147/341 (43%), Gaps = 91/341 (26%)
Query: 15 IRRDITSRLWFTYRKGFVPIG----------------------------------DSGLT 40
I +DI S++W TYR GF PI + T
Sbjct: 85 IEQDIYSKIWLTYRTGFEPIAKCLDGPQPLSFVQSMVFNRNPISSTFNNFHGLLDNDNFT 144
Query: 41 TDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI 100
TD GWGCM+R Q ++A L LGR + + + + + +I+ MF D AP+S+H
Sbjct: 145 TDVGWGCMIRTSQALLANTYQLLFLGRGFSYGRD-RSPRHDEIIDMFMDEPRAPFSLHNF 203
Query: 101 ALTGASEGKAV--GEWFGPNTVAQVLRKLA----KYDDWSSIVFHVALDNTLVVNQVKKL 154
+ V G+WFGPN + +++L + + + ++ + L + + ++
Sbjct: 204 IKVASESPLKVKPGQWFGPNAASLSIKRLCDNVYESNGTGRVKVVISESSNLYDDIITQM 263
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
TT NP +++++P+RLGI +NP+Y + + AL
Sbjct: 264 FTT-----LNPVPDAILVLLPVRLGIDKVNPLYHASVLELLALR---------------- 302
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ---NIGCVYDKEQDSE 271
QS+G+ GGKP+ + YF GY GND+++LDPH Q N VYD
Sbjct: 303 -----------QSVGIAGGKPSSSFYFFGYKGNDLLYLDPHYPQFVRNKTSVYD------ 345
Query: 272 KKLDSTYHCPQASRLHILHMDPS----IAVVSQRSYSDYKN 308
TYH +L + MDPS I + Y D+K+
Sbjct: 346 -----TYHTNSYQKLSVDDMDPSMMIGILIKDINDYEDFKS 381
>gi|119623099|gb|EAX02694.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_e
[Homo sapiens]
Length = 231
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/176 (41%), Positives = 100/176 (56%), Gaps = 30/176 (17%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W ++
Sbjct: 48 DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQ 107
Query: 78 -EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSI 136
+ Y +IL+ F DR+ YSIHQ+ ++ R L D
Sbjct: 108 PKEYQRILQCFLDRKDCCYSIHQM--------------------EKMCRVLPLSAD---T 144
Query: 137 VFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
D+ NQ K T+ S+ W+PL+L++PLRLGI INPVY++ K
Sbjct: 145 AGDRPPDSLTASNQSK---GTSAYCSA---WKPLLLIVPLRLGINQINPVYVDAFK 194
>gi|392586633|gb|EIW75969.1| hypothetical protein CONPUDRAFT_111807 [Coniophora puteana
RWD-64-598 SS2]
Length = 1038
Score = 132 bits (333), Expect = 1e-28, Method: Composition-based stats.
Identities = 95/281 (33%), Positives = 141/281 (50%), Gaps = 70/281 (24%)
Query: 18 DITSRLWFTYRKGFVPIGDS--------------------------------GLTTDKGW 45
D TSR+W TYR F PI DS G TTD GW
Sbjct: 291 DFTSRIWLTYRNQFAPIRDSTLSTLESDQTREPCTEMSSPSPKSRRWFGGEKGWTTDTGW 350
Query: 46 GCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDR--RTAPYSIHQ 99
GCMLR GQ ++A ALL LHLGRDW+ + + +++ A Y++I+ F D AP+S+H+
Sbjct: 351 GCMLRTGQTLLANALLHLHLGRDWRRPPYPLYTEDYATYVQIITWFLDSPLPQAPFSVHR 410
Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNK 159
+AL G GK VG+WFGP+T A +++L + + + VA D L V +
Sbjct: 411 MALAGKDLGKDVGQWFGPSTAAGAIKRLVQAFPDAGLGVAVASDGALYQTDVYSASYVDV 470
Query: 160 RASSNP---QW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
+ N +W + ++++ +RLGI +NP+Y YD +K L
Sbjct: 471 GSPRNVRKLRWGGRAVLVLFGIRLGINGVNPIY---------------YDTIKGL----- 510
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
F PQS+G+ GG+P+ + YF+G G+++I+LDPH
Sbjct: 511 -------FEIPQSVGIAGGRPSSSYYFMGVQGDNLIYLDPH 544
>gi|342321655|gb|EGU13587.1| Cysteine protease ATG4 [Rhodotorula glutinis ATCC 204091]
Length = 1119
Score = 132 bits (332), Expect = 2e-28, Method: Composition-based stats.
Identities = 73/236 (30%), Positives = 120/236 (50%), Gaps = 47/236 (19%)
Query: 35 GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN------------SKEEAYLK 82
+ GL++D GWGCMLR GQ ++A AL+ +HLGRDW+ + Y +
Sbjct: 705 AEGGLSSDTGWGCMLRTGQSLLANALVKVHLGRDWRRPLPLGDFITSSTSPVPSAATYAR 764
Query: 83 ILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHV 140
IL +F D + +P+S+H+ A G GK +GEWFGP+T A ++ L + + +
Sbjct: 765 ILSLFLDDPSPISPFSVHRFAQQGKVLGKEIGEWFGPSTAAGAIKTLVNAYEPAGLKVVS 824
Query: 141 ALDNTLVVNQVKKLCTTNKRASSNPQWQ-PLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
+D T+ ++V T + +W+ P++++I +RLGI +NP+Y IK + L
Sbjct: 825 CVDGTVYESEVVAASTKD-----GEKWKTPVLVLINVRLGIDGVNPIYYEAIKGIFRL-- 877
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
PQS+G+ GG+P+ + YF+G N + ++DPH
Sbjct: 878 -------------------------PQSVGIAGGRPSSSYYFVGAQANSLFYIDPH 908
>gi|340931831|gb|EGS19364.1| cysteine protease-like protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 494
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 93/308 (30%), Positives = 138/308 (44%), Gaps = 65/308 (21%)
Query: 18 DITSRLWFTYRKGFVPIGDS------------------------GLTTDKGWGCMLRCGQ 53
D SR+W TYR GF I S G ++D GWGCM+R GQ
Sbjct: 158 DFESRIWMTYRTGFELIPRSTDPRATSALSFSMRLKTSFGADQAGFSSDTGWGCMIRSGQ 217
Query: 54 MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVG 112
++A ALL LGR+W+ N K E +IL +F D APYS+H GA GK G
Sbjct: 218 SLLANALLISRLGREWRRGQNPKAE--REILSLFADDPRAPYSLHNFVKHGAEACGKFPG 275
Query: 113 EWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ---P 169
EWFGP+ A+ ++ LA H + + + + A +NP Q P
Sbjct: 276 EWFGPSATARCIQALANK--------HESELRVYSTGDLPDVYEDSFMAIANPDGQHFHP 327
Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
++++ RLGI IN VY + L ST M+ QS+G
Sbjct: 328 TLVLVCTRLGIDKINKVY------------------EQALISTLQME---------QSIG 360
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
+ GG+P+ + YFIG + +LDPH + + + +D ++ + H + LH+
Sbjct: 361 IAGGRPSQSHYFIGVQDQWLFYLDPHYPRPMLPYRENPEDYTQEEVDSCHTRRLRHLHVE 420
Query: 290 HMDPSIAV 297
+DPS+ +
Sbjct: 421 DLDPSMLI 428
>gi|170109871|ref|XP_001886142.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
gi|164639072|gb|EDR03346.1| hypothetical protein LACBIDRAFT_307494 [Laccaria bicolor S238N-H82]
Length = 1039
Score = 132 bits (332), Expect = 2e-28, Method: Composition-based stats.
Identities = 91/286 (31%), Positives = 144/286 (50%), Gaps = 72/286 (25%)
Query: 18 DITSRLWFTYRKGF-VPIGDSGL-------------------------------TTDKGW 45
D TSR+W TYR F PI D+ L ++D GW
Sbjct: 339 DFTSRIWLTYRSHFPTPIKDTRLADLCGDAAPEIANSPTTVKTRPWNWGGEKTWSSDTGW 398
Query: 46 GCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSIHQ 99
GCMLR GQ ++A AL+ +HLGRDW+ + V + + A Y++I+ F D AP+S+H+
Sbjct: 399 GCMLRTGQSLLANALVHMHLGRDWRRPPYPVQTADYATYVQIVTWFLDTPAPEAPFSVHR 458
Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV-----KKL 154
+AL G G VG+WFGP+ A ++ L S + VA D TL + V ++
Sbjct: 459 MALAGKEFGTDVGQWFGPSVAAGAIKTLVNSFPESGLGVSVATDGTLFQSDVFAVSHGEM 518
Query: 155 CTTNKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
+ + R W +P++L++ +RLGI+ +NP+Y Y+ +K+L
Sbjct: 519 SSRSPRRIKTTTWGHRPVLLLLGIRLGIEGVNPIY---------------YETIKLL--- 560
Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
+TFPQS+G+ GG+P+ + YF+G +++ +LDPH +
Sbjct: 561 ---------YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPHNTR 597
>gi|358369016|dbj|GAA85631.1| autophagy cysteine endopeptidase Atg4 [Aspergillus kawachii IFO
4308]
Length = 378
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/306 (31%), Positives = 150/306 (49%), Gaps = 62/306 (20%)
Query: 18 DITSRLWFTYRKGFVPI----GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
D SR+W TYR F PI GD DK L ++A AL L LGRDW+
Sbjct: 82 DFESRIWMTYRSNFPPIPRVEGD-----DKSASMTLGS---LLANALSTLVLGRDWRRGA 133
Query: 74 NSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGEWFGPNTVAQVLRKLAKYDD 132
+EE+ ++L +F D TAP+S+H+ GA S GK GEWFGP+ A+ + L+
Sbjct: 134 RFEEES--QLLSLFADTPTAPFSVHRFVKHGAESCGKFPGEWFGPSATAKCIEALSSQCG 191
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
++ +V+ D + +V + N +S+ +QP ++++ RLGI I PVY +G+K
Sbjct: 192 SPTLKVYVSNDTS----EVYQDRFMNVARNSSGVFQPTLILLGTRLGIDHITPVYWDGLK 247
Query: 193 KCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFL 252
LP QS+G+ GG+P+ + YF+G G+ + +L
Sbjct: 248 ATLQLP---------------------------QSVGIAGGRPSASHYFVGAQGSHLFYL 280
Query: 253 DPH------TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI----AVVSQRS 302
DPH ++ G +Y KE+ +D TYH + R+H+ MDPS+ + Q
Sbjct: 281 DPHYTRPALPDRQGGELYSKEE-----VD-TYHTRRLRRIHVRDMDPSMLIGFLIRDQED 334
Query: 303 YSDYKN 308
+ D+ N
Sbjct: 335 WDDWLN 340
>gi|37360148|dbj|BAC98052.1| mKIAA0943 protein [Mus musculus]
gi|148707989|gb|EDL39936.1| autophagy-related 4B (yeast), isoform CRA_d [Mus musculus]
Length = 266
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 78/204 (38%), Positives = 104/204 (50%), Gaps = 56/204 (27%)
Query: 119 TVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN-------------------- 158
+V RKLA +D WSS+ H+A+DNT+V+ ++++LC N
Sbjct: 20 SVLAFCRKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRANLPCAGAAALPTDSERHCNGF 79
Query: 159 ---KRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
++ P W+PLVL+IPLRLG+ DIN Y+ +K C
Sbjct: 80 PAGAEVTNRPSAWRPLVLLIPLRLGLTDINEAYVETLKHC-------------------- 119
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
F PQSLGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS
Sbjct: 120 -------FMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIP 168
Query: 275 DSTYHCPQ-ASRLHILHMDPSIAV 297
D ++HC SR+ I +DPSIAV
Sbjct: 169 DESFHCQHPPSRMGIGELDPSIAV 192
>gi|307190831|gb|EFN74681.1| Cysteine protease ATG4B [Camponotus floridanus]
Length = 115
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 59/93 (63%), Positives = 71/93 (76%)
Query: 67 RDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
+DWQW +K YLKIL FED+R A +SIHQIAL GASEGK VG+WFGPNT+AQVL+K
Sbjct: 15 KDWQWMPETKNSTYLKILSRFEDKRAAAFSIHQIALMGASEGKEVGQWFGPNTIAQVLKK 74
Query: 127 LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNK 159
L YD+WSSI HVALDNTL++N + K +K
Sbjct: 75 LVVYDEWSSITIHVALDNTLIINDICKYAVISK 107
>gi|74665877|sp|Q4U3V5.1|ATG4_CRYPA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|66576169|gb|AAY51673.1| putative cysteine protease Atg4 [Cryphonectria parasitica]
Length = 459
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 93/312 (29%), Positives = 143/312 (45%), Gaps = 67/312 (21%)
Query: 18 DITSRLWFTYRKGFVPIGDS----------------------GLTTDKGWGCMLRCGQMV 55
D SR+W TYR F PI S G ++D GWGCM+R GQ +
Sbjct: 127 DFESRVWMTYRSEFEPISKSNDPRASAALSFAMRLRTLADQGGFSSDTGWGCMIRSGQSL 186
Query: 56 IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGEW 114
+A L+ LGRDW+ +++E +IL F D APYS+H GA + GK GEW
Sbjct: 187 LANTLVICQLGRDWRRGKAARQER--EILARFADDPRAPYSLHNFVRHGAVACGKFPGEW 244
Query: 115 FGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
FGP+ A+ ++ LA ++ S V+ + + + + + P ++++
Sbjct: 245 FGPSATARCIQALANSNESSLRVYSTGDLPDVYEDSFMAVAKPDGET-----FHPTLILV 299
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
RLGI IN VY + L++T M PQS+G+ GG+
Sbjct: 300 GTRLGIDKINQVYW------------------EALTATLQM---------PQSVGIAGGR 332
Query: 235 PNHALYFIGY--------VGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
P+ + YFIG G+ + +LDPH T + D +Q + ++ T H + R
Sbjct: 333 PSASHYFIGAQRSGDAYEPGSYLFYLDPHCTRPALPFHEDVDQYTSDDIN-TCHTRRLRR 391
Query: 286 LHILHMDPSIAV 297
LH+ MDPS+ +
Sbjct: 392 LHVRDMDPSMLI 403
>gi|164660504|ref|XP_001731375.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
gi|159105275|gb|EDP44161.1| hypothetical protein MGL_1558 [Malassezia globosa CBS 7966]
Length = 651
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 90/283 (31%), Positives = 131/283 (46%), Gaps = 56/283 (19%)
Query: 39 LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA-YLKILKMFEDRRT--APY 95
T+D GWGCMLR Q ++A AL+ +HLGR W+ K Y +IL F D + P+
Sbjct: 313 FTSDVGWGCMLRSVQSMLANALIRVHLGRHWRRRAKQKTHPQYARILSWFMDDPSLECPF 372
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
SIH++ G G G+WFGP+T A L KL + D + V D L QV
Sbjct: 373 SIHRLVDEGQRLGVQAGDWFGPSTAAFALCKLIQAYDACGLGVVVTNDGMLYKEQVVAAS 432
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
R S+P +P+++++ RLG+ + P Y +K+
Sbjct: 433 FAPGR--SDPWTRPVLILLVQRLGLDQVPPHYRPALKQ---------------------- 468
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH--------------TNQNIG 261
FT PQS+GV+GG+P +LYF+G ++ LDPH T ++G
Sbjct: 469 -----SFTMPQSVGVVGGRPRSSLYFVGVQREHLLCLDPHHVRPCVPFRSPPRMTRASVG 523
Query: 262 CVYD---------KEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
D +E + ++LDS +H P S L I MDPS+
Sbjct: 524 ASTDLASTVSPWFEEAYTAEELDS-FHTPHTSLLPISQMDPSM 565
>gi|409050837|gb|EKM60313.1| hypothetical protein PHACADRAFT_179659 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1009
Score = 131 bits (329), Expect = 4e-28, Method: Composition-based stats.
Identities = 85/297 (28%), Positives = 137/297 (46%), Gaps = 86/297 (28%)
Query: 18 DITSRLWFTYRKGFVPI-------------------------------GDSGLTTDKGWG 46
D TSR+W TYR F+PI GD ++D GWG
Sbjct: 311 DFTSRIWLTYRSQFLPIRDMSLEELNAAPESAALSTGSQAKKWSWSLSGDKCWSSDAGWG 370
Query: 47 CMLRCGQMVIAQALLFLHLGRDWQWNVN----SKEEAYLKILKMFEDRRT--APYSIHQI 100
CMLR GQ ++A AL+ +HLGRDW+ + S Y++I+ F D + P+S+H++
Sbjct: 371 CMLRTGQSLLANALIHVHLGRDWRKPPHPVPTSDYATYIQIITWFFDDPSLLCPFSVHRM 430
Query: 101 ALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW----------------SSIVFHVALDN 144
AL G G VG+WFGP+T A ++ ++ + + + +VA D
Sbjct: 431 ALVGKQLGVKVGQWFGPSTAAGAIKYVSAHSSMVPNQPARRTLVHAFPEAGLGIYVAADG 490
Query: 145 TLVVNQ----VKKLCTTNKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
+ + + R + W +P++++I RLGI +NP+Y
Sbjct: 491 GTIYDSEVFAASHSGIGSPRRHTRRVWGDRPVLILIGHRLGIDGVNPIY----------- 539
Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
YD +K L +T+PQS+G+ GG+P+ + YF+G +++ +LDPH
Sbjct: 540 ----YDTLKTL------------YTWPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 580
>gi|336381646|gb|EGO22797.1| cysteine protease required for autophagy [Serpula lacrymans var.
lacrymans S7.9]
Length = 992
Score = 130 bits (328), Expect = 5e-28, Method: Composition-based stats.
Identities = 92/283 (32%), Positives = 141/283 (49%), Gaps = 72/283 (25%)
Query: 18 DITSRLWFTYRKGFVPIGDS----------------------------------GLTTDK 43
D TSR+W TYR F PI DS G T+D
Sbjct: 301 DFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPVGGEKGWTSDA 360
Query: 44 GWGCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSI 97
GWGCMLR GQ ++A ALL LHLGRDW+ + V++ + A Y++I+ F D + +P+S+
Sbjct: 361 GWGCMLRTGQSLLANALLHLHLGRDWRRPPYPVHTTDYATYVQIITWFFDTPSPQSPFSV 420
Query: 98 HQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT 157
H++AL G GK VG+WFGP+T A ++ L + + VA D + + V
Sbjct: 421 HRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHAFPEAGLGVSVASDGVIFQSDVYAASNA 480
Query: 158 ---NKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
+ R + W + ++++I +RLG+ +NP+Y YD +K L
Sbjct: 481 YIGSPRRHAKVSWGGRAVIVLIGIRLGLDGVNPIY---------------YDTIKAL--- 522
Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
+TFPQS+G+ GG+P+ + YF+G +++ +LDPH
Sbjct: 523 ---------YTFPQSVGIAGGRPSSSYYFMGSQADNLFYLDPH 556
>gi|85067704|ref|XP_959438.1| hypothetical protein NCU02433 [Neurospora crassa OR74A]
gi|62899773|sp|Q7S3X7.1|ATG4_NEUCR RecName: Full=Probable cysteine protease atg-4; AltName:
Full=Autophagy-related protein 4
gi|28920860|gb|EAA30202.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 506
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 137/305 (44%), Gaps = 60/305 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F I S G ++D GWGCM+R GQ
Sbjct: 174 DFESRIWMTYRTDFALIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQS 233
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A A+L LGR+W+ + E I+ +F D APYS+H GA+ GK GE
Sbjct: 234 LLANAILIARLGREWRRGTDLDAEK--DIIALFADDPRAPYSLHNFVKYGATACGKYPGE 291
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ LA V+ + + + + R +QP +++
Sbjct: 292 WFGPSATARCIQALADEKQSGLRVYSTGDLPDVYEDSFMAVANPDGRG-----FQPTLIL 346
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI IN VY + L ST + PQS+G+ GG
Sbjct: 347 VCTRLGIDKINQVY------------------EEALISTLQL---------PQSIGIAGG 379
Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YF+G G + +LDP H + D + ++LD T H + +LHI MD
Sbjct: 380 RPSSSHYFVGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELD-TCHTRRLRQLHIGDMD 438
Query: 293 PSIAV 297
PS+ +
Sbjct: 439 PSMLI 443
>gi|336467357|gb|EGO55521.1| hypothetical protein NEUTE1DRAFT_85886 [Neurospora tetrasperma FGSC
2508]
gi|350288001|gb|EGZ69237.1| hypothetical protein NEUTE2DRAFT_94213 [Neurospora tetrasperma FGSC
2509]
Length = 506
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 137/305 (44%), Gaps = 60/305 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F I S G ++D GWGCM+R GQ
Sbjct: 174 DFESRIWMTYRTDFAFIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQS 233
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A A+L LGR+W+ + E I+ +F D APYS+H GA+ GK GE
Sbjct: 234 LLANAILIARLGREWRRGTDLDAEK--DIIALFADDPRAPYSLHNFVKYGATACGKYPGE 291
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ LA V+ + + + + R +QP +++
Sbjct: 292 WFGPSATARCIQALADEKQSGLRVYSTGDLPDVYEDSFMAVANPDGRG-----FQPTLIL 346
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI IN VY + L ST + PQS+G+ GG
Sbjct: 347 VCTRLGIDKINQVY------------------EEALISTLQL---------PQSIGIAGG 379
Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YF+G G + +LDP H + D + ++LD T H + +LHI MD
Sbjct: 380 RPSSSHYFVGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELD-TCHTRRLRQLHIGDMD 438
Query: 293 PSIAV 297
PS+ +
Sbjct: 439 PSMLI 443
>gi|58260832|ref|XP_567826.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134117209|ref|XP_772831.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338817600|sp|P0CQ11.1|ATG4_CRYNB RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|338817601|sp|P0CQ10.1|ATG4_CRYNJ RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|50255449|gb|EAL18184.1| hypothetical protein CNBK2020 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229907|gb|AAW46309.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1193
Score = 130 bits (326), Expect = 1e-27, Method: Composition-based stats.
Identities = 86/269 (31%), Positives = 126/269 (46%), Gaps = 85/269 (31%)
Query: 36 DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ---------WNVNSKEEAYLK---- 82
+ GLT+D GWGCMLR GQ ++ AL+ +HLGRDW+ ++E A LK
Sbjct: 559 ERGLTSDAGWGCMLRTGQSLLVNALIHIHLGRDWRVPSTPASFSEATTTQEIAALKDYAK 618
Query: 83 ---ILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK-------- 129
+L F D + P+S+H++AL G GK VGEWFGP+T A L+ LA
Sbjct: 619 YAQMLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANSFAPCGVA 678
Query: 130 ---------------------YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW- 167
DDW+SI + N KK + A +W
Sbjct: 679 VATATDSIIYKSDVYTASNLPSDDWNSI--------SPTFNSSKKKRRGDNEAKEE-KWG 729
Query: 168 -QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
+ +++++ +RLG+ +NP+Y YD +K L FTFPQ
Sbjct: 730 KRAVLILVGVRLGLDGVNPIY---------------YDSIKAL------------FTFPQ 762
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
S+G+ GG+P+ + YF+G N + +LDPH
Sbjct: 763 SVGIAGGRPSSSYYFVGSQANHLFYLDPH 791
>gi|320169048|gb|EFW45947.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 918
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/328 (29%), Positives = 147/328 (44%), Gaps = 82/328 (25%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN----- 72
D + + F+YRK F I S TTD GWGC LR QM++A+AL+ GR W+
Sbjct: 301 DFQTLVCFSYRKDFERIPGSKHTTDCGWGCTLRSAQMLVAEALVLQIFGRRWRIEDRSCP 360
Query: 73 ---VNSKEEAYLKILKMFED--RRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRK 126
+SKE+ I+++F+D R +P+SIH I G K G+WFGP +V +V
Sbjct: 361 APLSSSKEDQLRLIIRLFQDQLRLDSPFSIHNIVQHGCQLFDKRAGDWFGPASVVRVFAD 420
Query: 127 L---AKYDDWSSIVFHVALDNTLVVNQVKKLC---------------TTNKRASSNPQWQ 168
L A S + A+D+ + + V +LC +T++ S++
Sbjct: 421 LINQAYAMHQSPFRAYQAIDHIIYRDLVAELCSGPDAVRDLEFSTPTSTSESVSTDETVT 480
Query: 169 PLVLV-----------------IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
P +PLRLG+ +IN +YI +K
Sbjct: 481 PSASTSQSPPVLPPPFIPLLILMPLRLGLNEINRMYIPCLKAL----------------- 523
Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC--VYDKEQD 269
Q +G+IGG+P H+LYF+GY ++VIF DPH GC D +Q
Sbjct: 524 ----------LMCAQCVGIIGGRPRHSLYFVGYQEDNVIFADPH-----GCKRFVDMQQT 568
Query: 270 SEKKLDSTYHCPQASRLHILHMDPSIAV 297
S T+H +++ HMDPS+A+
Sbjct: 569 SFPT--ETFHSAVPNKIPFTHMDPSMAI 594
>gi|268536436|ref|XP_002633353.1| Hypothetical protein CBG06097 [Caenorhabditis briggsae]
Length = 411
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 141/286 (49%), Gaps = 64/286 (22%)
Query: 40 TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW--------NVNSKEEAY-------LKIL 84
TTD GWGCM+R QM++AQA++ GR+W++ VN +E + IL
Sbjct: 88 TTDCGWGCMIRTTQMMVAQAIMINRFGRNWRFVRRKKSHVTVNGEETEFDTEKMKEWMIL 147
Query: 85 KMFEDRRTAPYSIHQIALTGASE--GKAVGEWFGPNTVAQVLRKLAKYDDWSSI----VF 138
K+FED+ +AP IH++ A E +AVG W+ P+ +++K A + S + V
Sbjct: 148 KLFEDKPSAPLGIHKMIEIAAREKGKRAVGCWYSPSEAVFIMKK-AITESASPLTGDTVM 206
Query: 139 HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
++++D + ++ L K + L+LVI +RLG ++N +Y+ + +
Sbjct: 207 YLSIDGRV---HIRDLEVETKHWTKT-----LMLVIVVRLGAAELNRIYVPHLMRL---- 254
Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
F+ LG+ GG+P+H+ +F+GY G+ VI+LDPH
Sbjct: 255 -----------------------FSMDSCLGITGGRPDHSCWFVGYYGDQVIYLDPHVAH 291
Query: 259 N---IGCVYDKEQDSEKK----LDSTYHCPQASRLHILHMDPSIAV 297
I ++ Q+ KK + +YHC S++H L MDPS A+
Sbjct: 292 EYIPIDMDFNTSQEDPKKPKKCPERSYHCRLLSKMHFLDMDPSCAL 337
>gi|169622773|ref|XP_001804795.1| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
gi|160704853|gb|EAT78153.2| hypothetical protein SNOG_14613 [Phaeosphaeria nodorum SN15]
Length = 357
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/197 (36%), Positives = 101/197 (51%), Gaps = 26/197 (13%)
Query: 18 DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
D SR+W TYR GF PI S G T+D G+GCM+R GQ ++
Sbjct: 99 DFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCMIRSGQCIL 158
Query: 57 AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
A AL L LGRDW+W N ++ + +IL +F D AP+SIH+ GA+ GK GEWF
Sbjct: 159 ANALQILRLGRDWRWQENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAVCGKYPGEWF 218
Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
GP+ A+ ++ LA + + +V+ D V K ++ WQP ++++
Sbjct: 219 GPSAAARCIQDLANKHREAGLKVYVSGDGADVYEDKLKQVAVDEDG----LWQPTLILVG 274
Query: 176 LRLGIQDINPVYINGIK 192
RLGI I PVY +K
Sbjct: 275 TRLGIDKITPVYWEALK 291
>gi|166990663|sp|Q2HH40.2|ATG4_CHAGB RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
Length = 448
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 142/305 (46%), Gaps = 60/305 (19%)
Query: 18 DITSRLWFTYRKGFVPI----------------------GD-SGLTTDKGWGCMLRCGQM 54
D SR+W TYR GF PI GD +G ++D GWGCM+R GQ
Sbjct: 116 DFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRSGQS 175
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A ALL LGRDW+ + E I+ +F D APYS+ GA + GK GE
Sbjct: 176 LLANALLISQLGRDWRRTTDPGAER--NIVALFADDARAPYSLQNFVKHGAIACGKHPGE 233
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ LA + S ++ + V + L T + + P +++
Sbjct: 234 WFGPSATARCIQALADQHESSLRIYSTG--DLPDVYEDSFLATARPDGET---FHPTLIL 288
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI INPVY + L ST M+ QS+G+ GG
Sbjct: 289 VCTRLGIDKINPVY------------------EEALISTLQME---------QSIGIAGG 321
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHILHMD 292
+P+ + YF+G + +LDPH + + + + ++LDS H + LH+ MD
Sbjct: 322 RPSSSHYFVGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSC-HTRRLRYLHVEDMD 380
Query: 293 PSIAV 297
PS+ +
Sbjct: 381 PSMLI 385
>gi|321263995|ref|XP_003196715.1| hypothetical protein CGB_K2500C [Cryptococcus gattii WM276]
gi|317463192|gb|ADV24928.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1188
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 83/261 (31%), Positives = 122/261 (46%), Gaps = 69/261 (26%)
Query: 36 DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW---------NVNSKEEAYLK---- 82
+ GLT+D GWGCMLR GQ ++ AL+ +HLGRDW+ S+E A LK
Sbjct: 557 ERGLTSDAGWGCMLRTGQSLLINALIHVHLGRDWRLPSTPATFSEATTSQEIAALKDYAK 616
Query: 83 ---ILKMFEDRRTA--PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
++ F D + P+S+H++AL G GK VGEWFGP+T A L+ LA I
Sbjct: 617 YAQMVSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGTLKTLANSFAPCGIA 676
Query: 138 FHVALDNTL---------------------VVNQVKKLCTTNKRASSNPQW--QPLVLVI 174
A D+ + N +K N A +W + +++++
Sbjct: 677 VATATDSIIYRSDVYAASNLPSDDWNRISPTFNPSRKKKRHNAEAKEG-KWGERAVLILV 735
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
+RLG+ +NP+Y YD +K L FTFPQ+ G GG+
Sbjct: 736 GIRLGLDGVNPIY---------------YDSIKAL------------FTFPQAGGSAGGR 768
Query: 235 PNHALYFIGYVGNDVIFLDPH 255
P+ + YF+G N + +LDPH
Sbjct: 769 PSSSYYFVGSQANHLFYLDPH 789
>gi|403296347|ref|XP_003939073.1| PREDICTED: cysteine protease ATG4D [Saimiri boliviensis
boliviensis]
Length = 463
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 95/322 (29%), Positives = 135/322 (41%), Gaps = 103/322 (31%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 131 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 190
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + +E + +I+ F D AP+
Sbjct: 191 WAEGTGLGPPELSGPASPSRYHGPARWMPPCWAQGAPELEQERRHRQIVSWFADHPQAPF 250
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
+H++ G S GK G+W+GP+ VA + L K + SS V T +V V + C
Sbjct: 251 GLHRLVELGQSSGKKAGDWYGPSLVAHI---LRKAVESSSEV-------TRLVVYVSQDC 300
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
T + + P Q L+
Sbjct: 301 T--GKGTCTPSLQELL-------------------------------------------- 314
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
R E LG++GGKP H+LYFIGY + +++LDPH Q D Q + L+
Sbjct: 315 ---RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQ-ANFPLE 363
Query: 276 STYHCPQASRLHILHMDPSIAV 297
S +HC ++ MDPS V
Sbjct: 364 S-FHCTSPRKMAFAKMDPSCTV 384
>gi|426329870|ref|XP_004025954.1| PREDICTED: cysteine protease ATG4C [Gorilla gorilla gorilla]
Length = 491
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 150/371 (40%), Gaps = 112/371 (30%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 VYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNYDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI---- 250
ILS Y +G+IGGKP + YF G+ N+V
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQENEVQRSSM 346
Query: 251 -FLDPHTNQNIGCVYDKEQDSEKKLDS-----------------------TYHCPQASRL 286
L +++N + E+ + S T+HCP ++
Sbjct: 347 NSLKQKSSKNNLKLEGSEKRPQMGFGSEDEFKNILLDHVQAFGPPSYPRLTFHCPSPKKM 406
Query: 287 HILHMDPSIAV 297
MDPS +
Sbjct: 407 SFRKMDPSCTI 417
>gi|62899783|sp|Q86ZL5.1|ATG4_PODAS RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|27802993|emb|CAD60696.1| unnamed protein product [Podospora anserina]
Length = 500
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 138/308 (44%), Gaps = 74/308 (24%)
Query: 18 DITSRLWFTYRKGF--VP--------------------IGD-SGLTTDKGWGCMLRCGQM 54
D SR+W TYR GF +P GD +G ++D GWGCM+R GQ
Sbjct: 176 DFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRSGQS 235
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A A+L GR W+ N E +I+ +F D APYSI GA+ GK GE
Sbjct: 236 LLANAMLISRAGRAWRRTTNPDIE--REIVCLFADDPRAPYSIQNFVNHGAAACGKYPGE 293
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP---QWQPL 170
WFGP+ A+ + L Y + + ++ N +++NP + P
Sbjct: 294 WFGPSATARCIHSLRVY----------------LTRDLPEVYEDNFMSTANPDGNHFHPT 337
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
++++ RLGI INP+Y + L PQ++G+
Sbjct: 338 LILVSTRLGIDKINPIYHEALISTLQL---------------------------PQAIGI 370
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHIL 289
GG+P+ + YFIG G + +LDPH + + D + ++LDS H + LH+
Sbjct: 371 AGGRPSSSHYFIGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSC-HTRRLRHLHVE 429
Query: 290 HMDPSIAV 297
MDPS+ +
Sbjct: 430 DMDPSMLI 437
>gi|67967551|dbj|BAE00258.1| unnamed protein product [Macaca fascicularis]
Length = 330
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 130/288 (45%), Gaps = 71/288 (24%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNS-------------------------------- 75
MLR GQM++AQ LL L RDW W +
Sbjct: 1 MLRSGQMMLAQGLLLHFLPRDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60
Query: 76 ---KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYD 131
+E + +I+ F D AP+ +H++ G S GK G+W+GP+ VA +LRK +
Sbjct: 61 ELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCS 120
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
+ + +V +V+ D T+ V +L R +W+ +V+++P+RLG + +NPVY+ +
Sbjct: 121 EVTRLVVYVSQDCTVYKADVARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCV 177
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
K+ R E LG++GGKP H+LYFIGY + +++
Sbjct: 178 KELL-----------------------RCELC----LGIMGGKPRHSLYFIGYQDDFLLY 210
Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVVS 299
LDPH Q V + E ++HC ++ MDPS V S
Sbjct: 211 LDPHYCQPTVDVSQADFPLE-----SFHCTSPRKMAFAKMDPSCTVGS 253
>gi|328868883|gb|EGG17261.1| autophagy protein 4 [Dictyostelium fasciculatum]
Length = 616
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 151/370 (40%), Gaps = 120/370 (32%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD-- 68
++E+ D S LWF+YRK F I ++ +TTD GWGCMLR GQM++A+ALL +
Sbjct: 193 EVERFLEDFKSILWFSYRKDFPSIENTSITTDIGWGCMLRTGQMILARALLKHFYNNENI 252
Query: 69 ---WQWNVNSKEEAYLKILKMFED--RRTAPYSIHQIA-----LTGASEGK--------- 109
+ NSK Y KI+ F D + YSIHQI +T + K
Sbjct: 253 PYGEKIKTNSK---YKKIMSWFCDYPSKENFYSIHQIVHKNKIITKYNNSKLKDFDIDSD 309
Query: 110 ------AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK----------- 152
V EWF P +A VL+ L K SSI +V D + ++V
Sbjct: 310 DQDDWNNVDEWFAPTKIAVVLKLLVKSHHSSSIAMYVPSDGVVYKDRVAKICTIRDDQSA 369
Query: 153 --------------KLCTTNKRASSN--------------------------------PQ 166
KL +T +S N
Sbjct: 370 PARVPLSLSLPAGIKLFSTTSPSSPNLFVPSQSTGNSMEDQSFLVGEEEDNTDNNSNQSN 429
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W+ L++++P++LG+ +N +Y +GIK +P
Sbjct: 430 WKSLIILVPVKLGLDKLNEIYFSGIKAMLQMP---------------------------S 462
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
S+G+IGGKP + YF+G+ +I+LDPH V+D + ++YH ++
Sbjct: 463 SIGLIGGKPKQSFYFVGFQDEHIIYLDPH------FVHDTIHPFDSNFLNSYHDCIPQKM 516
Query: 287 HILHMDPSIA 296
H +DPS+A
Sbjct: 517 HFSQIDPSMA 526
>gi|380092671|emb|CCC09424.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 515
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 90/305 (29%), Positives = 137/305 (44%), Gaps = 60/305 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F I S G ++D GWGCM+R GQ
Sbjct: 183 DFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQS 242
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A A+L LGR+W+ + E I+ +F D AP+S+H GA+ GK GE
Sbjct: 243 LLANAILVARLGREWRRETDLDAEK--DIIALFADDPRAPFSLHNFVKYGATACGKYPGE 300
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP A+ ++ L + V+ + + + + R +QP +++
Sbjct: 301 WFGPLATARCIQALTDEKESGLRVYSTGDLPDVYEDSFMAVANPDGRG-----FQPTLIL 355
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI IN VY + L ST + PQS+G+ GG
Sbjct: 356 VCTRLGIDKINQVY------------------EEALISTLQL---------PQSIGIAGG 388
Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YFIG G + +LDP H + D + + ++LD T H + +LHI MD
Sbjct: 389 RPSSSHYFIGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELD-TCHTRRLRQLHIDDMD 447
Query: 293 PSIAV 297
PS+ +
Sbjct: 448 PSMLI 452
>gi|299738612|ref|XP_001834660.2| cysteine protease [Coprinopsis cinerea okayama7#130]
gi|298403389|gb|EAU87108.2| cysteine protease [Coprinopsis cinerea okayama7#130]
Length = 1034
Score = 127 bits (319), Expect = 6e-27, Method: Composition-based stats.
Identities = 92/282 (32%), Positives = 138/282 (48%), Gaps = 71/282 (25%)
Query: 18 DITSRLWFTYRKGF-VPIGDSGL-------------------------------TTDKGW 45
D TSR+W TYR F PI D L ++D GW
Sbjct: 305 DFTSRIWLTYRSHFPQPIKDGRLADLCGGPQPEPVASPVTKKSPWHWVGGEKSWSSDSGW 364
Query: 46 GCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APYSIHQ 99
GCMLR GQ ++A AL+ +HLGRDW+ + V + + A Y+ IL F D AP+S+H+
Sbjct: 365 GCMLRTGQSLLANALIHVHLGRDWRKPPYPVMTADYATYVHILTWFLDTPAPEAPFSVHR 424
Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV------KK 153
+AL G G VG+WFGP+ A ++ L + I VA+D L V
Sbjct: 425 MALAGKELGTDVGQWFGPSVAAGAIKALVNSFPEAGIGVAVAVDGVLYQTDVHAASHGDH 484
Query: 154 LCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
T +R + +P++L++ +RLGI+ +NP+Y YD +K+L
Sbjct: 485 FGRTPRRHKRSWGDRPVLLLLGIRLGIEGVNPIY---------------YDTIKML---- 525
Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
+TFPQS+G+ GG+P+ + YF+G +++ +LDPH
Sbjct: 526 --------YTFPQSVGIAGGRPSSSYYFVGSQADNLFYLDPH 559
>gi|45185039|ref|NP_982756.1| ABL191Wp [Ashbya gossypii ATCC 10895]
gi|62899767|sp|Q75E61.1|ATG4_ASHGO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|44980675|gb|AAS50580.1| ABL191Wp [Ashbya gossypii ATCC 10895]
gi|374105958|gb|AEY94868.1| FABL191Wp [Ashbya gossypii FDAG1]
Length = 521
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 95/317 (29%), Positives = 154/317 (48%), Gaps = 70/317 (22%)
Query: 13 EQIRRDITSRLWFTYRKGFVPI-----GDSGLT------------------------TDK 43
E+ D+ +RL FTYR FVPI G S ++ TD
Sbjct: 114 EEFLADVHTRLHFTYRTRFVPIPRHPNGPSPMSISVMLRDNPLNVIENVLNNPDCFQTDI 173
Query: 44 GWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALT 103
GWGCM+R GQ ++A AL LGRD++ + N+ E L+I+K FED P+S+H+
Sbjct: 174 GWGCMIRTGQSLLANALQRACLGRDFRIDDNAANEHELRIIKWFEDDPKYPFSLHKFVQE 233
Query: 104 GAS-EGKAVGEWFGPNTVAQVLRKL-AKYDDWSSIVFHVALDNTLV-VNQVKKLCTTNKR 160
G S GK GEWFGP+ ++ ++ L AK+ ++ D+ V +++V+ L +
Sbjct: 234 GFSLSGKKPGEWFGPSATSRSIQALVAKFPACGIAHCVISTDSGDVYMDEVEPLFRADPS 293
Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
A+ ++L++ +RLG+ +N VY I+ ILSS +
Sbjct: 294 AA-------VLLLLCVRLGVDVVNEVYWEHIR--------------HILSSEH------- 325
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
S+G+ GG+P+ +LYF GY + +LDPH Q Y ++ D L + H
Sbjct: 326 ------SVGIAGGRPSSSLYFFGYQDEHLFYLDPHKPQLNLASYQQDLD----LFRSVHT 375
Query: 281 PQASRLHILHMDPSIAV 297
+ +++H+ +DPS+ +
Sbjct: 376 QRFNKVHMSDIDPSMLI 392
>gi|16551551|dbj|BAB71121.1| unnamed protein product [Homo sapiens]
Length = 330
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 82/286 (28%), Positives = 127/286 (44%), Gaps = 71/286 (24%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVN--------------------------------- 74
MLR GQM++AQ LL L RDW W
Sbjct: 1 MLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAP 60
Query: 75 --SKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYD 131
+E + +I+ F D AP+ +H++ G S GK G+W+GP+ VA +LRK +
Sbjct: 61 ELERERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCS 120
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
D + +V +V+ D T+ V +L R +W+ +V+++P+RLG + +NPVY+ +
Sbjct: 121 DVTRLVVYVSQDCTVYKADVARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCV 177
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
K+ R E LG++GGKP H+LYFIGY + +++
Sbjct: 178 KELL-----------------------RCELC----LGIMGGKPRHSLYFIGYQDDFLLY 210
Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
LDPH Q V + E ++HC ++ DPS V
Sbjct: 211 LDPHYCQPTVDVSQADFPLE-----SFHCTSPRKMAFAKTDPSCTV 251
>gi|402219068|gb|EJT99143.1| hypothetical protein DACRYDRAFT_70366 [Dacryopinax sp. DJM-731 SS1]
Length = 1093
Score = 125 bits (315), Expect = 2e-26, Method: Composition-based stats.
Identities = 92/321 (28%), Positives = 147/321 (45%), Gaps = 78/321 (24%)
Query: 21 SRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL--------------- 65
+R W + VP G LT+D GWGCMLR GQM++A +L+ LH+
Sbjct: 422 NRRWLAW----VP-GRGDLTSDAGWGCMLRTGQMLLANSLVALHVPPLPPNPVYINNFPA 476
Query: 66 -GRDWQWNVNSKEEAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
+ EAY+KIL F D + P+S+H++AL GA G+ VG+WFGP+ A
Sbjct: 477 PSLPPSETDRQRFEAYVKILVWFLDDPSIWCPFSVHRLALAGADMGREVGQWFGPSIAAG 536
Query: 123 VLRKLAKYDDWSSIVFHVALDNTL-------------VVNQVKKLCTTNKRASS------ 163
++KL + V D + + + L T R +
Sbjct: 537 SIKKLVSAFPACGLGVVVPPDQIIHETAVFTASHTPTLPSSASSLSNTRDREARERANRM 596
Query: 164 NPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
+W + ++++I LRLGI+ + P+Y YD VK L
Sbjct: 597 KEEWGDRAVLILIGLRLGIEGVTPIY---------------YDSVKAL------------ 629
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ---NIGCVYDKEQDSEKKLD--- 275
FTFPQ++G+ GG+P+ + YF+G G+ + +LDPH+ + + D D+ +
Sbjct: 630 FTFPQTVGIAGGRPSSSYYFVGTQGDHLFYLDPHSTRPAVPLRVPTDGPYDATGQFTLSE 689
Query: 276 -STYHCPQASRLHILHMDPSI 295
T+H + ++HI +DPS+
Sbjct: 690 MKTFHSDKVRKMHISGLDPSM 710
>gi|308802424|ref|XP_003078525.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
gi|116056978|emb|CAL51405.1| APG4C_XENLA Cysteine protease APG4C (ISS) [Ostreococcus tauri]
Length = 424
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/336 (26%), Positives = 144/336 (42%), Gaps = 81/336 (24%)
Query: 4 ANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL 63
+N++ ++ E RD SR W TYR+GF +G + TD GWGC LR QM++A AL
Sbjct: 58 SNEVGRREWE---RDWRSRCWMTYRRGFEALGRTKWCTDAGWGCTLRSAQMMLANALSIH 114
Query: 64 HLGRDWQWNVN--------------------------------------SKEEAYLKILK 85
GR W+ V + +A IL+
Sbjct: 115 SRGRHWRREVQLVAVHENETADDGSKSPAVSFLSGVVNKLKIPQSERTRAGSDAQEDILR 174
Query: 86 MFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKYDDWSSIVFHVALDN 144
+F D AP+SIH++ G G WF P+ + + L A++D S + HV
Sbjct: 175 LFADEVGAPFSIHRVCEKTTEWGAPPGRWFEPSVMCRAFEALVAEHDLGSELTVHVVSGR 234
Query: 145 TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI-QDINPVYINGIKKCYALPISPVY 203
V + RA S + L+L +P+ LG+ + IN Y++ ++ A
Sbjct: 235 EGEDGGVPTVDEAEVRAKSADVGKALLLFVPVVLGVGRTINARYLSQLRSMMA------- 287
Query: 204 DMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCV 263
F QS+G++GG+PN +LY +G+ + +LDPHT Q +
Sbjct: 288 --------------------FKQSVGIVGGRPNSSLYLVGHSDDVFFYLDPHTVQVASSM 327
Query: 264 YDKEQDSEKKLDSTYHCPQASRLHIL--HMDPSIAV 297
+ +S Y+CP + LH+ +DP++A+
Sbjct: 328 VTMDFES-------YYCP--TPLHVCGGDLDPTLAL 354
>gi|323303340|gb|EGA57136.1| Atg4p [Saccharomyces cerevisiae FostersB]
Length = 494
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 89 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD + V+ B + N+V+K+ N +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD---CIVSVSSGB-IYENEVEKVFAENPNS 264
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357
>gi|322707969|gb|EFY99546.1| ATG4 protein [Metarhizium anisopliae ARSEF 23]
Length = 430
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 85/302 (28%), Positives = 133/302 (44%), Gaps = 60/302 (19%)
Query: 18 DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
D SR W TYR F +P + SG T+D GWGCM+R GQ
Sbjct: 126 DFASRFWMTYRSNFEIIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSGWGCMIRSGQS 185
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A A+ L LGRDW+ + E ++L +F D APYSIH G K GE
Sbjct: 186 LLANAMAVLDLGRDWRRGMLPDRER--RLLALFADDPRAPYSIHNFVRHGEKYCSKYPGE 243
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L ++ + + K+ + + P +++
Sbjct: 244 WFGPSATARCIQDLVNSRKQELRIYSTGDGPDIYEDNFMKIAKPDGEV-----FHPTLVL 298
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI I PVY + L ++ M QS+G+ GG
Sbjct: 299 VGTRLGIDKITPVYW------------------EALIASVQMS---------QSVGIAGG 331
Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YF+G G+ + +LDP HT + + D + + +DS H + R+H+ MD
Sbjct: 332 RPSSSHYFVGSQGHFLFYLDPHHTRKALPYYSDVARYTIDDMDSC-HTSRLRRIHVREMD 390
Query: 293 PS 294
P+
Sbjct: 391 PN 392
>gi|348520913|ref|XP_003447971.1| PREDICTED: cysteine protease ATG4D-like [Oreochromis niloticus]
Length = 500
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 146/357 (40%), Gaps = 101/357 (28%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
S ++E+ R SR+W TYR+ F + S TTD GWGCMLR GQM++AQ LL + R
Sbjct: 100 SEDEVERFRLAFVSRIWLTYRREFPQLEGSTWTTDCGWGCMLRSGQMLLAQGLLVHLMPR 159
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGE-------------- 113
DW W ++ ++F R A I G+ G + E
Sbjct: 160 DWVW--PESQQLTDVDFEVFRPRSPARAGGVPIPSFGSPRGSSTPEKSLPSSQAPRCSQK 217
Query: 114 --------------------WFG----------------------------PNTVAQVLR 125
WFG P+ VA +LR
Sbjct: 218 KRVHESTKDRQEHIHSRLVTWFGDQPPAPFGVHQLVDIGKGSGKKAGDWYGPSVVAHILR 277
Query: 126 K-LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN-KRASSNPQ---WQPLVLVIPLRLGI 180
K + K +++ +VA D T+ V +LC + + SS+P W+ +++++P+RLG
Sbjct: 278 KAVDKTSVVTNLAVYVAQDCTVYKEDVVRLCDRSLNQTSSDPSSQDWKSVIILVPVRLGG 337
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
+ +NP YI+ +K L +G+IGGKP H+LY
Sbjct: 338 EALNPSYIDCVKNFLKLDC---------------------------CIGIIGGKPKHSLY 370
Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
FIG+ +++LDPH Q + V E ++HC ++ MDPS +
Sbjct: 371 FIGFQDEQLLYLDPHYCQPVVDVSQINFSLE-----SFHCSSPKKMPFNRMDPSCTI 422
>gi|37362688|ref|NP_014176.2| Atg4p [Saccharomyces cerevisiae S288c]
gi|61252248|sp|P53867.2|ATG4_YEAST RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|166990654|sp|A6ZRL7.1|ATG4_YEAS7 RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|1173491|gb|AAA86498.1| ORF494 [Saccharomyces cerevisiae]
gi|151944321|gb|EDN62599.1| cysteine protease [Saccharomyces cerevisiae YJM789]
gi|190409197|gb|EDV12462.1| anchor protein [Saccharomyces cerevisiae RM11-1a]
gi|285814439|tpg|DAA10333.1| TPA: Atg4p [Saccharomyces cerevisiae S288c]
gi|323352870|gb|EGA85172.1| Atg4p [Saccharomyces cerevisiae VL3]
gi|392297128|gb|EIW08229.1| Atg4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 89 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357
>gi|1183991|emb|CAA93375.1| N1274 [Saccharomyces cerevisiae]
gi|1302243|emb|CAA96126.1| unnamed protein product [Saccharomyces cerevisiae]
Length = 506
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 101 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 160
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 161 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 220
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 221 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 276
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 277 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 306
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 307 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 353
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 354 KFGKLQLSEMDPSMLI 369
>gi|256272398|gb|EEU07381.1| Atg4p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 89 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357
>gi|323335883|gb|EGA77161.1| Atg4p [Saccharomyces cerevisiae Vin13]
Length = 494
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 89 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357
>gi|323346814|gb|EGA81093.1| Atg4p [Saccharomyces cerevisiae Lalvin QA23]
Length = 494
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 89 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357
>gi|259149141|emb|CAY82383.1| Atg4p [Saccharomyces cerevisiae EC1118]
Length = 506
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 101 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 160
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 161 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 220
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 221 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 276
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 277 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 306
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 307 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 353
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 354 KFGKLQLSEMDPSMLI 369
>gi|323307493|gb|EGA60764.1| Atg4p [Saccharomyces cerevisiae FostersO]
Length = 494
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 89 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357
>gi|349580723|dbj|GAA25882.1| K7_Atg4p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 89 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357
>gi|365763488|gb|EHN05016.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 494
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 89 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357
>gi|207341865|gb|EDZ69806.1| YNL223Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 371
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 136/317 (42%), Gaps = 83/317 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 101 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 160
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 161 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 220
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 221 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 276
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 277 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 306
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 307 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 353
Query: 282 QASRLHILHMDPSIAVV 298
+ +L + MDP ++V
Sbjct: 354 KFGKLQLSEMDPRCSLV 370
>gi|302498547|ref|XP_003011271.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
benhamiae CBS 112371]
gi|291174820|gb|EFE30631.1| autophagy cysteine endopeptidase Atg4, putative [Arthroderma
benhamiae CBS 112371]
Length = 437
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 91/310 (29%), Positives = 131/310 (42%), Gaps = 97/310 (31%)
Query: 14 QIRRDITSRLWFTYRKGFVPI--------GDS-----------------GLTTDKGWGCM 48
Q D S+LW TYR F PI GDS G T+D GWGCM
Sbjct: 145 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSISLGVRLRSQLIDTQGFTSDTGWGCM 204
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE- 107
+R GQ ++A LLFL LGRDW+ +EE+ L + +F D AP+SIH+ GA+
Sbjct: 205 IRSGQALLANTLLFLRLGRDWRRGSKVQEESEL--VSLFADHPRAPFSIHRFVHHGATAC 262
Query: 108 GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
GK GEWFGP+ +Q ++ L K + + ++ D + + + K ++ S
Sbjct: 263 GKCPGEWFGPSAASQCIQALVKSNPQVGLRVYITSDGSDIYEKQFKEVACDE---SGGGI 319
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
QP ++++ +RLGI + PVY +D +K L FPQS
Sbjct: 320 QPTLILLGVRLGIDRVTPVY---------------WDSLKAL------------LRFPQS 352
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
+G+ G P STYH + RLH
Sbjct: 353 VGIAG--PEEL-------------------------------------STYHTRRLRRLH 373
Query: 288 ILHMDPSIAV 297
+ MDPS+ +
Sbjct: 374 VREMDPSMLI 383
>gi|406698456|gb|EKD01693.1| hypothetical protein A1Q2_04064 [Trichosporon asahii var. asahii
CBS 8904]
Length = 1295
Score = 123 bits (309), Expect = 1e-25, Method: Composition-based stats.
Identities = 87/279 (31%), Positives = 122/279 (43%), Gaps = 69/279 (24%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
LSH R W G+V G+ GLT+D GWGCMLR GQ ++A AL+ LHLG
Sbjct: 512 LSHSQTMMPSRQSGGGAW-----GWVKGGERGLTSDAGWGCMLRTGQSMLANALIHLHLG 566
Query: 67 RDWQWNVNSKE---------------EAYLKILKMFEDRRT--APYSIHQIALTGASEGK 109
R W+ Y+++L F D + P+S+H+ AL G GK
Sbjct: 567 RGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFALIGKELGK 626
Query: 110 AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLV---VNQVKKLCT--TNKRASSN 164
VGEWFGP+T A L+ LA + A D ++ V Q L T T S
Sbjct: 627 EVGEWFGPSTAAGALKTLANSFPPCGLSVVSAADGSVFRSEVYQASNLPTDWTTGAKPSR 686
Query: 165 PQ------W--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
P W + +++VIP RLG+ +NP+Y + IK
Sbjct: 687 PNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK------------------------ 722
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
S+G+ GG+P+ + YF+ N + +LDPH
Sbjct: 723 ----------SVGIAGGRPSSSYYFVASQANSLFYLDPH 751
>gi|401886473|gb|EJT50506.1| hypothetical protein A1Q1_00204 [Trichosporon asahii var. asahii
CBS 2479]
Length = 1295
Score = 123 bits (308), Expect = 1e-25, Method: Composition-based stats.
Identities = 87/279 (31%), Positives = 122/279 (43%), Gaps = 69/279 (24%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
LSH R W G+V G+ GLT+D GWGCMLR GQ ++A AL+ LHLG
Sbjct: 512 LSHSQTMMPSRQSGGGAW-----GWVKGGERGLTSDAGWGCMLRTGQSMLANALIHLHLG 566
Query: 67 RDWQWNVNSKE---------------EAYLKILKMFEDRRT--APYSIHQIALTGASEGK 109
R W+ Y+++L F D + P+S+H+ AL G GK
Sbjct: 567 RGWRVPTQKPSVHPRTPLELAELEAYSTYVRVLSWFMDDPSPLCPFSVHRFALIGKELGK 626
Query: 110 AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLV---VNQVKKLCT--TNKRASSN 164
VGEWFGP+T A L+ LA + A D ++ V Q L T T S
Sbjct: 627 EVGEWFGPSTAAGALKTLANSFPPCGLSVVSAADGSVFRSEVYQASNLPTDWTTGAKPSR 686
Query: 165 PQ------W--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
P W + +++VIP RLG+ +NP+Y + IK
Sbjct: 687 PNSYHRMSWGGKAVLIVIPTRLGLDGVNPMYYDDIK------------------------ 722
Query: 217 TPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
S+G+ GG+P+ + YF+ N + +LDPH
Sbjct: 723 ----------SVGIAGGRPSSSYYFVASQANSLFYLDPH 751
>gi|385305819|gb|EIF49766.1| cysteine protease atg4 [Dekkera bruxellensis AWRI1499]
Length = 476
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/330 (29%), Positives = 155/330 (46%), Gaps = 67/330 (20%)
Query: 18 DITSRLWFTYRKGFV-----PIGDS-------------------GLTTDKGWGCMLRCGQ 53
D+ +RLWFTYR GF P G S G TTD GWGCM+R Q
Sbjct: 96 DVATRLWFTYRSGFPVIKRDPDGPSPLSLGSLFRGTLDVKNASIGFTTDSGWGCMIRTSQ 155
Query: 54 MVIAQALLFLHLGRDWQW----NVNS------KEEAYLKILKMFEDRRTAPYSIHQIALT 103
++A ALL LH+GR W++ N N K E +I+ F D AP+SI QI
Sbjct: 156 SLLANALLNLHVGRKWRYIPAENPNGETEYAKKYEKQWQIITWFADFPWAPFSIQQIVRY 215
Query: 104 GASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRAS 162
G+ K GEWFGP+ ++ + L K + + + + + + L + +
Sbjct: 216 GSEHCNKKPGEWFGPSAASRSIVYLCKQSYKACKLNTYLTEGNGDIYEDELLXVSCPEGT 275
Query: 163 SNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEF 222
N ++P +++ +RLG+ +NPVY +KK ++
Sbjct: 276 EN-GFRPTLILSGVRLGVXXVNPVYWAFLKKLLSIH------------------------ 310
Query: 223 TFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STYHC 280
QS+G+ GG+P+ + YF GY G+++ ++DPHT Q + D D++ + + ++ H
Sbjct: 311 ---QSVGIAGGRPSSSHYFFGYQGDNLFYMDPHTPQT-ALLADHVDDADYRXEYVASVHT 366
Query: 281 PQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
+ +L + MDPS+ + + S DYK +
Sbjct: 367 KRIRKLGLCEMDPSMLIGLLVTSLEDYKEL 396
>gi|354544955|emb|CCE41680.1| hypothetical protein CPAR2_802300 [Candida parapsilosis]
Length = 423
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/336 (27%), Positives = 146/336 (43%), Gaps = 94/336 (27%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSG---------------------------------LT 40
+ + I S LW +YR GF PI S T
Sbjct: 83 EAKEYIQSLLWLSYRCGFTPIPKSADGPQPVSFLPSVLFSKSTLTNMSNLRGLFDNDNFT 142
Query: 41 TDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI 100
+D GWGCM+R Q ++A ALL L + E A L ILK+F+D T+P+S+H
Sbjct: 143 SDAGWGCMIRTSQNLLAIALLKL--------SEEHNESAQLDILKLFQDDPTSPFSLHNF 194
Query: 101 ALTGASEGKAV--GEWFGPNTVAQVLRKLA----KYDDWSSIVF-HVALDNTLVVNQVKK 153
+S V G+WFGPN + ++KL K + I + +++ + L ++++
Sbjct: 195 IRVASSSPLLVKPGQWFGPNAASLSIKKLTIEAKKLETPGEIPYVYISENADLFDDEIED 254
Query: 154 LCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
L N + +PL+L+ P+RLGI +N Y I + +LP
Sbjct: 255 LF--------NEEQKPLLLLFPVRLGIDQVNKYYYKSILQLLSLPY-------------- 292
Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG-NDVIFLDPHTNQNIGCVYDKEQDSEK 272
S+G+ GGKP+ + YFIGY N +++ DPH Q + +
Sbjct: 293 -------------SVGIAGGKPSSSFYFIGYENENHLLYFDPHLPQVVEAPINI------ 333
Query: 273 KLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYK 307
+TYH ++L I +DPS+ + V +S +YK
Sbjct: 334 ---TTYHTANYNKLDIEMVDPSMMIGVLLKSMDEYK 366
>gi|50543736|ref|XP_500034.1| YALI0A13277p [Yarrowia lipolytica]
gi|62899740|sp|Q6CH28.1|ATG4_YARLI RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49645899|emb|CAG83963.1| YALI0A13277p [Yarrowia lipolytica CLIB122]
Length = 545
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 88/337 (26%), Positives = 139/337 (41%), Gaps = 101/337 (29%)
Query: 18 DITSRLWFTYRKGF--VPIGDS---------------------GLTTDKGWGCMLRCGQM 54
D+ SR+W +YR GF +P D G T+D GWGCM+R Q
Sbjct: 68 DVQSRIWLSYRTGFPLIPKSDGSGTIHLGKLKNMIRGGGFDPRGYTSDVGWGCMIRTSQS 127
Query: 55 VIAQALLFLHLGRDWQWN------------------------VNSKEEAYLK-------- 82
++A ALLF HLGR W+WN N ++E +
Sbjct: 128 LLANALLFRHLGRGWRWNKGDDFVYLSEGNTESRGGESRNGGANKEQETAVSEETAVSEE 187
Query: 83 -ILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHV 140
I+ F D +P+SIH+ G G+WFGP+ + L S + +
Sbjct: 188 TIISWFLDSPDSPFSIHKFVRHGEKACSTPAGDWFGPSAAGSSIYALCNEFPDSGLKVYY 247
Query: 141 ALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
+ V + + L T PL+++ LRLGI ++NP+Y + +++ +L
Sbjct: 248 NGNGGGDVYEDELLETGF----------PLLVLCGLRLGIDNVNPIYWDSLRQMLSL--- 294
Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI 260
PQS+G+ GG+P + YF G+ G + +LDPH +
Sbjct: 295 ------------------------PQSVGIAGGRPFTSHYFFGFQGEQLFYLDPHQPKPA 330
Query: 261 GCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
DK+ +++H + +LH+ MDPS+ V
Sbjct: 331 VKTTDKDT-------TSFHSSRIWKLHLKEMDPSMLV 360
>gi|330840249|ref|XP_003292131.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
gi|325077656|gb|EGC31355.1| hypothetical protein DICPUDRAFT_99239 [Dictyostelium purpureum]
Length = 603
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 12 LEQIRRDITSR-LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
+++ D T+R LWFTYR+GF I ++ D GWGCMLR GQM+++ LL LG DW+
Sbjct: 136 IKEFLEDFTTRVLWFTYRQGFPFIDNTQYDNDCGWGCMLRSGQMLLSNLLLHHALGDDWK 195
Query: 71 WNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-A 128
+ NS + Y I+ MF D+ +AP+SIH IAL G + GK +GEWF P+ ++Q ++ L +
Sbjct: 196 KSSNSTHPDVYNNIISMFLDKPSAPFSIHNIALEGQTLGKNIGEWFAPSIISQAIKSLVS 255
Query: 129 KYDDWSSIVFHVALDNTLVVNQV 151
K + +I ++ D +L ++Q+
Sbjct: 256 KNYEKCNISVFISEDGSLYIDQL 278
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 27/91 (29%)
Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
PL+++IP+RLG+ +N +Y + + F FPQ+L
Sbjct: 371 PLLILIPMRLGLDGLNSIYYQSLLEI---------------------------FKFPQNL 403
Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
GV+GGKP +LYFI +++ +LDPHT QN
Sbjct: 404 GVVGGKPRASLYFIAVQDDNLFYLDPHTVQN 434
>gi|440790872|gb|ELR12135.1| autophagy protein 4, putative [Acanthamoeba castellanii str. Neff]
Length = 510
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 62/156 (39%), Positives = 88/156 (56%), Gaps = 4/156 (2%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
D SR+W TYR F IG++ L TD GWGCMLR GQM++AQAL+ +LGRDW+
Sbjct: 118 DFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAEENM 177
Query: 78 EAYLKILKMFEDRRT--APYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWS 134
Y ++L+ F D + +PYSIH IA G + K +G+WF P T+++ LR L +
Sbjct: 178 MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLLVTEHSPN 237
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPL 170
+ +V D + +V +LC + A Q PL
Sbjct: 238 GLKMYVPKDGIIYRKEVYQLCAV-QPADGPAQHSPL 272
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 72/145 (49%), Gaps = 37/145 (25%)
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W P+++++P+RLGIQ +NP+YI +K F+FPQ
Sbjct: 342 WHPVIILVPVRLGIQCLNPIYIPTLKAF---------------------------FSFPQ 374
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS-TYHCPQASR 285
LGVIGGKP+ + YF+GY N V+++DPH Q + D ++S PQA
Sbjct: 375 CLGVIGGKPHSSFYFVGYQDNKVLYMDPHFVQP---TVKMDDDPLFPIESYRMEIPQA-- 429
Query: 286 LHILHMDPSIAV----VSQRSYSDY 306
+ +DPS+A+ SQ + D+
Sbjct: 430 MSFDDIDPSLALGFLCSSQAEFDDF 454
>gi|291059129|gb|ADD71908.1| autophagy protein 4 [Acanthamoeba castellanii]
Length = 373
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 83/141 (58%), Gaps = 3/141 (2%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
D SR+W TYR F IG++ L TD GWGCMLR GQM++AQAL+ +LGRDW+
Sbjct: 150 DFRSRMWLTYRSNFPAIGETNLVTDMGWGCMLRTGQMLLAQALITHYLGRDWRIQAEENM 209
Query: 78 EAYLKILKMFEDRRT--APYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWS 134
Y ++L+ F D + +PYSIH IA G + K +G+WF P T+++ LR L +
Sbjct: 210 MTYRELLRWFADEPSSRSPYSIHAIARIGLRKFNKQIGDWFEPTTISEALRLLVTEHSPN 269
Query: 135 SIVFHVALDNTLVVNQVKKLC 155
+ +V D + +V +LC
Sbjct: 270 GLKMYVPKDGIIYRKEVYQLC 290
>gi|167393590|ref|XP_001740639.1| cysteine protease atg4 [Entamoeba dispar SAW760]
gi|165895180|gb|EDR22930.1| cysteine protease atg4, putative [Entamoeba dispar SAW760]
Length = 332
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 143/294 (48%), Gaps = 49/294 (16%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
+D+++ R +W TYRK I + TTD GWGCM+R QMV+AQ L + LG +W
Sbjct: 29 KDIDEFARHT---IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGNNW 83
Query: 70 QWN---VNSKEEAY--LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVL 124
++ +N++ + I+ +F D + +SIH++ ++ G G+W+GP+ + +
Sbjct: 84 KYENNCMNTERNIFHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIA 143
Query: 125 RKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN 184
+ +VA ++V ++++L + + P ++ +PLRLG
Sbjct: 144 AEHINEMRVFRTRGYVAKLGSIVGPKIEEL------SKDEVGFNPCIIFVPLRLG----- 192
Query: 185 PVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGY 244
P SP + +L + +++ PQ +G+IGGKP +A YF +
Sbjct: 193 -------------PESPENEFRPLLKTIFDI---------PQCMGMIGGKPGYAHYFHTF 230
Query: 245 VGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
G ++ FLDPHT QN D + D + +Y C ++ +DPSI++V
Sbjct: 231 DGTNLYFLDPHTTQN---AIDMKGDWSYQ---SYFCKDNKSMNYSKIDPSISLV 278
>gi|388581514|gb|EIM21822.1| hypothetical protein WALSEDRAFT_68740 [Wallemia sebi CBS 633.66]
Length = 603
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 133/278 (47%), Gaps = 68/278 (24%)
Query: 18 DITSRLWFTYRKGFVPIG-------DSGLT--------------------TDKGWGCMLR 50
D SR+W TYR F I D GL TD+GWGCMLR
Sbjct: 68 DFQSRIWCTYRSNFCQISLNDPMMDDLGLAKMQTLSSKPSHWLLRERTFNTDQGWGCMLR 127
Query: 51 CGQMVIAQALLFLHLGRDWQWNV------NSKEEAY---LKILKMFEDRRT--APYSIHQ 99
Q ++A L + LGR W+ N +K + Y +K+L +F D + +P+S+H+
Sbjct: 128 TSQSLLANTLQIMLLGRQWRRNPFVDLTDYAKRKEYVNLIKLLNLFMDNPSTLSPFSVHR 187
Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNK 159
+A+ G S GK VGEWFGP+T A ++ L ++ VA D+ + + V + +
Sbjct: 188 MAVVGKSLGKEVGEWFGPSTAALAIKHLVNNQTDINLSVSVASDSVIYKSDVYQ-ASGGT 246
Query: 160 RASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQT 217
+++ +W +P+++++ +RLG+ I+P Y Y+ +K MQ+
Sbjct: 247 STTADSEWGNKPVLILVGVRLGLDGIHPRY---------------YETLKAF---LRMQS 288
Query: 218 PRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
+G+ GG+P+ + YF GY + + ++DPH
Sbjct: 289 ---------CVGIAGGRPSSSYYFFGYQSDSLFYVDPH 317
>gi|366995231|ref|XP_003677379.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
gi|342303248|emb|CCC71026.1| hypothetical protein NCAS_0G01390 [Naumovozyma castellii CBS 4309]
Length = 495
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/274 (29%), Positives = 133/274 (48%), Gaps = 67/274 (24%)
Query: 17 RDITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGC 47
+D+ +RL FTYR F PI G S L TD GWGC
Sbjct: 77 KDVVTRLHFTYRTRFKPIMKSPEGPSPLNFSLVIRENPIDVIENAITNPDCFNTDIGWGC 136
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS- 106
M+R GQ ++ L + LGRD++++ +K+ + +I++ F D P+S+HQ G
Sbjct: 137 MIRTGQSLLGNTLQIVRLGRDFRYDPENKDISENRIIEWFIDAPEKPFSLHQFITEGMEL 196
Query: 107 EGKAVGEWFGPNTVAQVLRKLA-KYDDWSSIVFHVALDN-TLVVNQVKKLCTTNKRASSN 164
GK GEWFGP A+ ++ L K+ D V++ + + ++VK++ NK+
Sbjct: 197 SGKNPGEWFGPAATARSIQSLIRKFPDCGIAECLVSVSSGDIYSDEVKQVFADNKKN--- 253
Query: 165 PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
L++++ ++LG+ +N Y + I+ ILSS Y
Sbjct: 254 -----LLILLGVKLGLNAVNECYWDSIR--------------HILSSKY----------- 283
Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
S+G+ GG+P+ +LYF GY G+++++ DPH+ Q
Sbjct: 284 --SVGISGGRPSSSLYFFGYEGDELLYFDPHSPQ 315
>gi|183230788|ref|XP_001913481.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169802747|gb|EDS89733.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449704540|gb|EMD44766.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 330
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 78/281 (27%), Positives = 135/281 (48%), Gaps = 46/281 (16%)
Query: 23 LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN---VNSKEEA 79
+W TYRK I + TTD GWGCM+R QM +AQ L + LG +W++ +N++
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCINTERNI 96
Query: 80 Y--LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
+ I+ +F D + +SIH++ ++ G G+W+GP+ + + +
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEMRVFRTR 156
Query: 138 FHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYAL 197
+VA +++ +++++L + P ++ +PLRLG
Sbjct: 157 GYVAKLGSIIGSKIEELI------KDGGGFNPCIIFVPLRLG------------------ 192
Query: 198 PISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN 257
P SP + +L + +++ PQ +G+IGGKP +A YF + G ++ FLDPHT
Sbjct: 193 PESPENEFKPLLKTIFDI---------PQCMGMIGGKPGYAHYFHTFDGTNLYFLDPHTT 243
Query: 258 QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
QN D + D + +Y C + MDPSI++V
Sbjct: 244 QN---AIDMKGDWSYQ---SYFCKDNKSMLYSKMDPSISLV 278
>gi|145553267|ref|XP_001462308.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430147|emb|CAK94935.1| unnamed protein product [Paramecium tetraurelia]
Length = 389
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 141/313 (45%), Gaps = 59/313 (18%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH----- 64
Q +E+++ +WF+YR + + S LT+D GWGCMLR GQM + Q + + +
Sbjct: 52 QKMEKLKSLFEGTIWFSYRSKILQLQYSTLTSDTGWGCMLRVGQMAMCQQIKYFYNLSSS 111
Query: 65 --LGRDWQWNVNSKEEAYLKILKMFEDRRT----APYSIHQIALTGASE-GKAVGEWFGP 117
L Q ++ EE K + + +T +P+SI +I + E K+ GEW+ P
Sbjct: 112 QELTELIQQFADNDEEELSKFMDRNDGDQTIQYKSPFSIQKIVVQTKLELQKSPGEWYKP 171
Query: 118 NTVAQVLRKLAKYDDWS-SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQP------- 169
N + VL+ L +Y + ++ H+ +N +++ V L + + +W
Sbjct: 172 NDILFVLKYLFRYSKYQKNLRMHINHENAFILSDVISLMFN--KNGGDEEWLKEQIEKGQ 229
Query: 170 -----LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
+ + I R+G+ N Y+ K+L+ T+
Sbjct: 230 NDEFGVSIFILTRIGLDTCNQEYL------------------KVLNDI---------MTY 262
Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS 284
PQ G++GG PN ALY +G VGN I+LDPH QN + E D S+Y C
Sbjct: 263 PQFQGILGGFPNKALYILGRVGNYYIYLDPHYVQNAQNYQEMENDR-----SSYTCQSIQ 317
Query: 285 RLHILHMDPSIAV 297
+ +DPS+A+
Sbjct: 318 LIDSNQLDPSMAI 330
>gi|444726263|gb|ELW66801.1| Cysteine protease ATG4C [Tupaia chinensis]
Length = 378
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 76/238 (31%), Positives = 109/238 (45%), Gaps = 57/238 (23%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ RRD SR+W TYR+ F PI S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 47 NVEEFRRDFISRIWLTYREEFPPIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 106
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 107 WPDALNIENSDSESWTSHTVKKFTASVEASLSGERELKTPTISLKETIEKYSDDHEIRNE 166
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 167 IYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 226
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
I +VA D T+ + V T A N + +++++P+RLG + N Y+ +K
Sbjct: 227 GITIYVAQDCTVYSSDVIDKQRTAMTA-DNADDKAVIILVPVRLGGERTNTDYLEFVK 283
>gi|302657364|ref|XP_003020406.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
verrucosum HKI 0517]
gi|291184236|gb|EFE39788.1| autophagy cysteine endopeptidase Atg4, putative [Trichophyton
verrucosum HKI 0517]
Length = 398
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 104/214 (48%), Gaps = 32/214 (14%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDS--------------------------GLTTDKGWGC 47
Q D S+LW TYR F PI + G T+D GWGC
Sbjct: 185 QFLDDFESKLWITYRSQFPPIPKTPKTGSGDSSSSISLGVRLRSQLIDTQGFTSDTGWGC 244
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE 107
M+R GQ ++A LLFL LGRDW+ +EE+ +++ +F D AP+SIH+ GA+
Sbjct: 245 MIRSGQALLANTLLFLRLGRDWRRGSKVQEES--ELVSLFADHPRAPFSIHRFVHHGATA 302
Query: 108 -GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ 166
GK GEWFGP+ +Q ++ L K + + + D + + + K ++
Sbjct: 303 CGKCPGEWFGPSAASQCIQALVKSNPQVGLRVCITSDGSDIYEKQFKEVACDESGGG--- 359
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
QP ++++ +RLGI + PVY + +K P S
Sbjct: 360 IQPTLILLGVRLGIDRVTPVYWDSLKALLRFPQS 393
>gi|407043540|gb|EKE42005.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 330
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 78/281 (27%), Positives = 135/281 (48%), Gaps = 46/281 (16%)
Query: 23 LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN---VNSKEEA 79
+W TYRK I + TTD GWGCM+R QM +AQ L + LG +W++ +N++
Sbjct: 39 IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMALAQTFLSIVLGNNWKYEDNCINTERNI 96
Query: 80 Y--LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIV 137
+ I+ +F D + +SIH++ ++ G G+W+GP+ + + +
Sbjct: 97 FHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASDIAAEHINEMRVFRTR 156
Query: 138 FHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYAL 197
+VA +++ +++++L + P ++ +PLRLG
Sbjct: 157 GYVAKLGSIIGSKIEELI------KDGGGFNPCIIFVPLRLG------------------ 192
Query: 198 PISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN 257
P SP + +L + +++ PQ +G+IGGKP +A YF + G ++ FLDPHT
Sbjct: 193 PESPENEFRPLLKTIFDI---------PQCMGMIGGKPGYAHYFHTFDGINLYFLDPHTT 243
Query: 258 QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
QN D + D + +Y C + MDPSI++V
Sbjct: 244 QN---AIDMKGDWSYQ---SYFCKDNKSMLYSKMDPSISLV 278
>gi|79597805|ref|NP_850722.3| cysteine protease ATG4b [Arabidopsis thaliana]
gi|332646467|gb|AEE79988.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 360
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 56/119 (47%), Positives = 74/119 (62%), Gaps = 1/119 (0%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW-Q 70
L R+D +S + TYR+GF PIGD+ T+D WGCMLR GQM+ AQALLF LGR W +
Sbjct: 138 LAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRK 197
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
+ +E YL+IL++F D + +SIH + L G S G A G W GP V + LA+
Sbjct: 198 KDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLAR 256
>gi|365758760|gb|EHN00587.1| Atg4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 485
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 88/312 (28%), Positives = 138/312 (44%), Gaps = 75/312 (24%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 80 DVRSRVNFTYRTRFVPIARAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIGWGCM 139
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ + + +I+ F D AP+S+H TG
Sbjct: 140 IRTGQSLLGNALQILHLGRDFRVDEDDDFRRESRIVNWFNDTPEAPFSLHNFVSTGTELS 199
Query: 108 GKAVGEWFGPNTVAQVLRKLA-KYDDWSSIVFHVALDN-TLVVNQVKKLCTTNKRASSNP 165
K GEWFGP A+ ++ L + + V++ + + N+V+++ N +S
Sbjct: 200 DKRPGEWFGPAATARSIQYLIYGFPECGINACIVSVSSGDIYENEVEEVFVDNPNSS--- 256
Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
++ ++ ++LGI +N Y I IL+S +
Sbjct: 257 ----ILFLLGVKLGINAVNESYRESI--------------CGILNSAW------------ 286
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
S+G+ GG+P+ +LYF GY GN+ + DPH Q E ++ H + R
Sbjct: 287 -SVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVNSCHTSKFGR 336
Query: 286 LHILHMDPSIAV 297
L + MDPS+ +
Sbjct: 337 LQLSEMDPSMLI 348
>gi|390344344|ref|XP_786847.3| PREDICTED: uncharacterized protein LOC581768 [Strongylocentrotus
purpuratus]
Length = 1018
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 111/203 (54%), Gaps = 24/203 (11%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ- 70
+E ++D +SRLW TYR+ F + S T+D GWGCMLR GQM++A +L+ LGR+W
Sbjct: 376 IEMFKQDFSSRLWMTYRREFPTLAGSNFTSDCGWGCMLRSGQMMLAHSLILHFLGREWNI 435
Query: 71 WNVNSKE--EAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
+ ++E + + +I++ F D+ +P+S+H++ G + GK VG+W+GP++VA +L++
Sbjct: 436 YKPQTQEMLQFHRQIVRWFGDQPLDMSPFSVHRLVGIGQNNGKKVGDWYGPSSVAHILKE 495
Query: 127 LAKYDD-----WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQ 181
+ +VA D T+ V LC R+ S + QP+ IP +
Sbjct: 496 AMDSAHELNPLLGEVCIYVAQDCTVYKQDVIDLC----RSKSKKRLQPVYRDIP---SSE 548
Query: 182 DINPVYINGIKKCYALPI-SPVY 203
D +PV KC PI P Y
Sbjct: 549 DNSPV------KCTTNPIKGPAY 565
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 62/131 (47%), Gaps = 32/131 (24%)
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W +V++IP+RLG ++NPVYI I+ FT
Sbjct: 836 WCAVVIMIPVRLGGDEVNPVYIRPIQSL---------------------------FTLES 868
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
LG+IGGKP H+L+F+G+ +I LDPH Q + V K +D ++HC ++
Sbjct: 869 CLGIIGGKPKHSLFFVGFQEEKLIHLDPHYCQQV--VDMKTRDFPLW---SFHCMSPRKM 923
Query: 287 HILHMDPSIAV 297
I MDPS +
Sbjct: 924 SISKMDPSCTI 934
>gi|401624007|gb|EJS42084.1| atg4p [Saccharomyces arboricola H-6]
Length = 494
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/312 (28%), Positives = 133/312 (42%), Gaps = 75/312 (24%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR F+PI G S L+ TD GWGCM
Sbjct: 89 DVRSRVNFTYRTRFIPIPRAPDGPSPLSLNVLVRTNPINTIENYIANPDCFNTDIGWGCM 148
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ + + KI+ F D AP+SIH TG
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVDNEKSLKRESKIVTWFNDTPEAPFSIHNFVSTGTELS 208
Query: 108 GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHV--ALDNTLVVNQVKKLCTTNKRASSNP 165
K GEWFGP A+ ++ L I V + N+V+K+ N +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGITDCVVSVSSGDIYQNEVEKIYVENPDSI--- 265
Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
++ ++ ++LGI +N Y I IL+S
Sbjct: 266 ----ILFLLGVKLGINAVNESYRESI--------------CGILNSA------------- 294
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
+S+G+ GG+P+ +LYF GY GN ++ DPH Q E+ + H + +
Sbjct: 295 RSVGIAGGRPSSSLYFFGYQGNQFLYFDPHIPQPA---------VEESFVESCHTSKFGK 345
Query: 286 LHILHMDPSIAV 297
L + MDPS+ +
Sbjct: 346 LQLSEMDPSMLI 357
>gi|186511209|ref|NP_001118859.1| cysteine protease ATG4b [Arabidopsis thaliana]
gi|62318602|dbj|BAD95023.1| hypothetical protein [Arabidopsis thaliana]
gi|332646469|gb|AEE79990.1| cysteine protease ATG4b [Arabidopsis thaliana]
Length = 267
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 56/119 (47%), Positives = 74/119 (62%), Gaps = 1/119 (0%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW-Q 70
L R+D +S + TYR+GF PIGD+ T+D WGCMLR GQM+ AQALLF LGR W +
Sbjct: 52 LAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRK 111
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
+ +E YL+IL++F D + +SIH + L G S G A G W GP V + LA+
Sbjct: 112 KDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLAR 170
>gi|390594065|gb|EIN03481.1| hypothetical protein PUNSTDRAFT_56214 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1093
Score = 116 bits (290), Expect = 2e-23, Method: Composition-based stats.
Identities = 88/287 (30%), Positives = 134/287 (46%), Gaps = 76/287 (26%)
Query: 18 DITSRLWFTYRKGFVPI---------------------------------------GDSG 38
D TSR+W TYR F PI G+ G
Sbjct: 372 DFTSRVWVTYRSHFQPIRDTTLSALESDFGEQAQSANTSGNSVVSGSPSSGRRWWGGEKG 431
Query: 39 LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA----YLKILKMFEDRRT-- 92
T+D GWGCMLR GQ ++A ALL LHLGRDW+ + A Y+++L F D +
Sbjct: 432 WTSDAGWGCMLRTGQSLLANALLHLHLGRDWRRPSYPQPTAAYASYVQLLTWFFDSPSPL 491
Query: 93 APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
P+S+H++AL G GK VG+WFGP+T A ++ L + VA+D + V
Sbjct: 492 CPFSVHRMALAGKELGKDVGQWFGPSTAAGAIKTLVHAFPGGGLGVAVAVDGVVYETDVF 551
Query: 153 KLCTT--NKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKI 208
+ ++R W + ++++I +RLG+ +NP+Y + IK+ Y
Sbjct: 552 SASHSPDSRRHHRTSTWGDRGVLILIGIRLGLDGVNPIYYDTIKELY------------- 598
Query: 209 LSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
T+PQS+G+ GG+P+ + YF+G + + +LDPH
Sbjct: 599 --------------TWPQSVGIAGGRPSSSYYFVGSQADSLFYLDPH 631
>gi|66810578|ref|XP_638996.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
gi|60467622|gb|EAL65643.1| hypothetical protein DDB_G0283753 [Dictyostelium discoideum AX4]
Length = 551
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 88/142 (61%), Gaps = 5/142 (3%)
Query: 12 LEQIRRDITSR-LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
+++ D T+R LWFTYR+GF I D+ D GWGCMLR GQM+++ LL LG +W+
Sbjct: 140 IKEFLNDFTTRVLWFTYRQGFPCIDDTMYDNDCGWGCMLRSGQMLLSNVLLHNILGDEWK 199
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
S + I+ MF D+ +AP+SIH IA+ G + GK +GEWF P+ ++Q ++ L
Sbjct: 200 ---RSSSATHPDIISMFLDKPSAPFSIHNIAMEGQNLGKNIGEWFAPSIISQTIKILVSR 256
Query: 131 D-DWSSIVFHVALDNTLVVNQV 151
+ D +I ++ D +L ++Q+
Sbjct: 257 NYDQCNISVFISEDGSLYIDQL 278
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 76/152 (50%), Gaps = 31/152 (20%)
Query: 147 VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMV 206
+ ++ K + N ++ W+PL+++IP+RLG+ +N +Y + + +
Sbjct: 363 IDDESKDEISENNNKDNDETWEPLLILIPMRLGLDGLNSIYHSSLLEI------------ 410
Query: 207 KILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK 266
F FPQ+LGV+GGKP +LYFI +++ +LDPHT QN + +
Sbjct: 411 ---------------FKFPQNLGVVGGKPRASLYFIAAQDDNLFYLDPHTVQN----HIE 451
Query: 267 EQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
++ K +T+ C R H+ +DPS+ V
Sbjct: 452 VENGSKFPLNTFFCSTTKRTHVSEVDPSLVVA 483
>gi|68485712|ref|XP_713234.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
gi|71152285|sp|Q59UG3.1|ATG4_CANAL RecName: Full=Cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|46434715|gb|EAK94117.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
Length = 446
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 92/336 (27%), Positives = 144/336 (42%), Gaps = 102/336 (30%)
Query: 19 ITSRLWFTYRKGFVPIGDS----------------------------------GLTTDKG 44
I S+LW +YR GF PI S T+D G
Sbjct: 83 IESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTSDAG 142
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
WGCM+R Q ++A LL L+ K E +I+K+F+D ++P+SIH
Sbjct: 143 WGCMIRTSQNLLANTLLKLY----------PKNEP--EIVKLFQDDTSSPFSIHNFIRVA 190
Query: 105 ASEGKAV--GEWFGPNTVAQVLRKLA-------KYDDWSSIVFHVALDNTLVVNQVKKLC 155
+ V GEWFGPN + +++LA + D ++ ++ L ++++ +
Sbjct: 191 SLSPLHVKPGEWFGPNAASLSIKRLASELLQDQEIDGIKIPRVFISENSDLFDDEIRDVF 250
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
K AS ++++ P+RLGI +N Y N I +L+S Y
Sbjct: 251 AKEKNAS-------VLILFPIRLGIDKVNSYYYNSI--------------FHLLASKY-- 287
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
S G+ GGKP+ + YF+GY D+I+ DPH Q + ++ +D
Sbjct: 288 -----------SCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV--------ETPINMD 328
Query: 276 STYHCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
S YH +RL+I +DPS I V + Y D+K
Sbjct: 329 S-YHTTNYNRLNISLLDPSMMIGILVTNIDEYIDFK 363
>gi|441628985|ref|XP_004093160.1| PREDICTED: LOW QUALITY PROTEIN: cysteine protease ATG4D [Nomascus
leucogenys]
Length = 441
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 78/290 (26%), Positives = 132/290 (45%), Gaps = 45/290 (15%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGW--GCMLRCGQMVIAQALLFLHLGRDW---QWN 72
D SRLW TYR + + D W G L ++ + + H W +W
Sbjct: 108 DFVSRLWLTYRXXXHCLTMCSIPPDWTWAEGTGLGPPELSGSASPSRYHGPARWMPPRWA 167
Query: 73 VNS----KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-L 127
+ +E + +I+ F D AP+ +H++ G S GK G+W+GP+ VA +LRK +
Sbjct: 168 QGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGPSLVAHILRKAV 227
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
+ + +V +V+ ++ V +L R +W+ +V+++P+RLG + +NPVY
Sbjct: 228 ESCSEVTRLVVYVSQTCSMYKADVARLVA---RPDPTAEWKSVVILVPVRLGGETLNPVY 284
Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
+ +K+ + LG++GGKP H+LYFIGY +
Sbjct: 285 VPCVKELLRCQL---------------------------CLGIMGGKPRHSLYFIGYQDD 317
Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+++LDPH Q V + E ++HC ++ MDPS V
Sbjct: 318 FLLYLDPHYCQPTVDVSQADFPLE-----SFHCTSPRKMAFAKMDPSCTV 362
>gi|365988214|ref|XP_003670938.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
gi|343769709|emb|CCD25695.1| hypothetical protein NDAI_0F03770 [Naumovozyma dairenensis CBS 421]
Length = 427
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 89/321 (27%), Positives = 135/321 (42%), Gaps = 83/321 (25%)
Query: 18 DITSRLWFTYRKGFVPI-----------------------------GDSGLTTDKGWGCM 48
DI SRL FTYR F PI TD GWGCM
Sbjct: 3 DIKSRLNFTYRTRFKPIQRMSDGPSPFHFSFILRENPINTLENVISNPDCFFTDIGWGCM 62
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSK---EEAYLKILKMFEDRRTAPYSIHQIALTGA 105
+R GQ ++ AL +LGRDW+++ N+ E +I F D P+S+H+ G
Sbjct: 63 IRTGQSLLGNALQLRNLGRDWRFDDNTDLKMTEKSNEIASWFMDTPEKPFSLHRFISKGM 122
Query: 106 S-EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN------ 158
GK GEWFGP A+ ++ L +D L+ + T
Sbjct: 123 QLSGKKPGEWFGPAATARSIQSLVHE------FPECGIDKCLISVSSGDIYKTEVEDVFN 176
Query: 159 ----KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
A + + + +++++ ++LGI+ IN Y + I+ +ILSS Y
Sbjct: 177 EGHTGEARNGQKDKTILILLGVKLGIETINRCYWDSIR--------------RILSSEY- 221
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
S+G+ GG+P+ +LYF GY G+++++ DPH+ Q YDK
Sbjct: 222 ------------SIGIAGGRPSSSLYFFGYQGDELLYFDPHSPQP---SYDKND----LF 262
Query: 275 DSTYHCPQASRLHILHMDPSI 295
T H +L + MDPS+
Sbjct: 263 YETCHTTNFGKLSLADMDPSM 283
>gi|254584596|ref|XP_002497866.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
gi|238940759|emb|CAR28933.1| ZYRO0F15334p [Zygosaccharomyces rouxii]
Length = 489
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 89/312 (28%), Positives = 140/312 (44%), Gaps = 71/312 (22%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SRL FTYR F+PI G S L+ TD GWGCM
Sbjct: 81 DVRSRLHFTYRTRFMPIPAVPGGPSPLSFHFLIRENPINAIENAINNPACFNTDVGWGCM 140
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LGR ++ K E + I+ F D AP+SIH G
Sbjct: 141 IRTGQSLLGNALQIARLGRGYRIGSELKPEE-ISIIDWFVDIPDAPFSIHNFVSKGMELS 199
Query: 108 GKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQ-VKKLCTTNKRASSNP 165
K GEWFGP ++ ++ L + + +++ + V + V K+ +K +
Sbjct: 200 SKRPGEWFGPAATSRSIQSLIRGFKQCGIDDCQISVSSGDVYEEDVMKVFNESKDSR--- 256
Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
++L++ ++LGI +N Y N IK+ +L S +
Sbjct: 257 ----ILLLLGVKLGINAVNEFYWNDIKR--------------LLGSKF------------ 286
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
S+G+ GG+P+ +LYFIGY GN++++LDPHT Q + E+ + H +
Sbjct: 287 -SVGIAGGRPSSSLYFIGYQGNELLYLDPHTAQ----PFLSPSHQERSFYDSCHSSNYGK 341
Query: 286 LHILHMDPSIAV 297
L I +DPS+ +
Sbjct: 342 LAIQDLDPSMLI 353
>gi|443917360|gb|ELU38094.1| peptidase family c54 domain-containing protein [Rhizoctonia solani
AG-1 IA]
Length = 808
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 132/293 (45%), Gaps = 94/293 (32%)
Query: 18 DITSRLWFTYRKGFVPIGDSGL------------------------------------TT 41
D TS +W TYR + PI D+ L T+
Sbjct: 145 DFTSLIWLTYRSHYTPIRDTSLESLAPLGPCDMEMAPAHLVPASPRRWNWPGSADKSWTS 204
Query: 42 DKGWGCMLRCGQMVIAQALLFLHLGRDWQ---WNVNSKEEA-YLKILKMFEDRRT--APY 95
D GWGCMLR GQ ++A AL+ LHLGR+W+ + + ++E A Y+KIL F D + AP+
Sbjct: 205 DAGWGCMLRTGQSLLANALIHLHLGRNWRRPHYPMFAEEHAVYVKILTWFFDTPSPLAPF 264
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV---- 151
+H++AL G + GK VG WFGP+T A ++ LA + +A+D T+ + V
Sbjct: 265 GVHRMALAGKALGKDVGTWFGPSTAAGSIKTLAHAFPECQLSVSLAVDGTVFASDVYAAS 324
Query: 152 -KKLCTT----NKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYD 204
+ TT S +W + +++++ +RLG+ ++NP+Y YD
Sbjct: 325 HMGMVTTSGRSISSRRSASKWGGRAVLILVNIRLGLDNVNPIY---------------YD 369
Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH--ALYFIGYVGNDVIFLDPH 255
+K+ G+P + YF+G + + +LDPH
Sbjct: 370 ALKV------------------------GRPRQGSSYYFVGSQADSLFYLDPH 398
>gi|241958330|ref|XP_002421884.1| cysteine protease, putative [Candida dubliniensis CD36]
gi|223645229|emb|CAX39828.1| cysteine protease, putative [Candida dubliniensis CD36]
Length = 443
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 89/336 (26%), Positives = 140/336 (41%), Gaps = 103/336 (30%)
Query: 19 ITSRLWFTYRKGFVPIGDS----------------------------------GLTTDKG 44
I S+LW +YR GF PI S T+D G
Sbjct: 83 IESKLWLSYRCGFEPIPKSIDGPQPIHFFPSIIFNRTTIYSNFANLKSLFDKENFTSDAG 142
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
WGCM+R Q ++A LL L+ + + I+K+F+D +P+SIH
Sbjct: 143 WGCMIRTSQNLLANTLLKLYPKNEQE------------IVKLFQDDTKSPFSIHNFIRVA 190
Query: 105 ASEGKAV--GEWFGPNTVAQVLRKLAKYDDWSSI-------VFHVALDNTLVVNQVKKLC 155
+S V GEWFGPN + +++L I VF ++ ++ L ++++ +
Sbjct: 191 SSSPLHVKPGEWFGPNAASLSIKRLTNELQDQEINGINPPRVF-ISENSDLFDDEIRDVF 249
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
K S ++++ P+RLGI +N Y N I +LSS Y
Sbjct: 250 AKEKSNS-------VIILFPIRLGIDKVNSYYYNSI--------------FHLLSSKY-- 286
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
S G+ GGKP+ + YF+GY D+I+ DPH Q + ++ +
Sbjct: 287 -----------SCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQIVETPFNMD-------- 327
Query: 276 STYHCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
+YH + L+I +DPS I V + Y D+K
Sbjct: 328 -SYHSTNYNTLNISLLDPSMMIGILVTNIDEYIDFK 362
>gi|156839152|ref|XP_001643270.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
70294]
gi|166990653|sp|A7TQN1.1|ATG4_VANPO RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|156113873|gb|EDO15412.1| hypothetical protein Kpol_1015p1 [Vanderwaltozyma polyspora DSM
70294]
Length = 411
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 84/311 (27%), Positives = 134/311 (43%), Gaps = 73/311 (23%)
Query: 18 DITSRLWFTYRKGFVPIG--DSG---------------------------LTTDKGWGCM 48
D+ SR+ FTYR F+PI D G TD GWGCM
Sbjct: 77 DVISRIHFTYRTKFIPIARSDDGPSPLRINFLIGDNPFNAIENAIYNPNCFNTDIGWGCM 136
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++A A+ LGR+++ N E KI+ F D P+S+H G
Sbjct: 137 IRTGQSLLANAIQIAILGREFRVNDGDVNEQERKIISWFMDTPDEPFSLHNFVKKGCELS 196
Query: 108 GKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ 166
K GEWFGP ++ ++ L + + D V++ + + NKR S+
Sbjct: 197 SKKPGEWFGPAATSRSIQSLVEQFPDCGIDRCIVSVSSADIFKDEINDIFKNKRYSN--- 253
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
++L++ ++LG+ +N Y+ I+ KIL S Y
Sbjct: 254 ---ILLLMGVKLGVDKVNEYYLKDIR--------------KILESRY------------- 283
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
S+G+ GG+P+ +LYF GY + +++ DPH Q + + L T H ++
Sbjct: 284 SVGISGGRPSSSLYFFGYQDDTLLYFDPHKPQ---------PSTIESLLETCHTDNFDKI 334
Query: 287 HILHMDPSIAV 297
+I MDPS+ +
Sbjct: 335 NISDMDPSMLI 345
>gi|238879782|gb|EEQ43420.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 446
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/336 (27%), Positives = 143/336 (42%), Gaps = 102/336 (30%)
Query: 19 ITSRLWFTYRKGFVPIGDS----------------------------------GLTTDKG 44
I S+LW +YR GF PI S T+D G
Sbjct: 83 IESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTSDAG 142
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
WGCM+R Q ++A LL L+ K E +I+K+F+D ++P+SIH
Sbjct: 143 WGCMIRTSQNLLANTLLKLY----------PKNEP--EIVKLFQDGTSSPFSIHNFIRVA 190
Query: 105 ASEGKAV--GEWFGPNTVAQVLRKLA-------KYDDWSSIVFHVALDNTLVVNQVKKLC 155
+ V GEWFGPN + +++L + D ++ ++ L ++++ +
Sbjct: 191 SLSPLHVKSGEWFGPNAASLSIKRLTSELLQDQEIDGIKIPRVFISENSDLFDDEIRDVF 250
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
K AS ++++ P+RLGI +N Y N I +L+S Y
Sbjct: 251 AKEKNAS-------VLILFPIRLGIDKVNSYYYNSI--------------FHLLASKY-- 287
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
S G+ GGKP+ + YF+GY D+I+ DPH Q + ++ +D
Sbjct: 288 -----------SCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV--------ETPINMD 328
Query: 276 STYHCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
S YH +RL+I +DPS I V + Y D+K
Sbjct: 329 S-YHTTNYNRLNISLLDPSMMIGILVTNIDEYIDFK 363
>gi|68485607|ref|XP_713286.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
gi|46434768|gb|EAK94169.1| potential autophagy related protease and anchor protein Atg4
[Candida albicans SC5314]
Length = 446
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/336 (27%), Positives = 143/336 (42%), Gaps = 102/336 (30%)
Query: 19 ITSRLWFTYRKGFVPIGDS----------------------------------GLTTDKG 44
I S+LW +YR GF PI S T+D G
Sbjct: 83 IESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTSDAG 142
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
WGCM+R Q ++A LL L+ K E +I+K+F+D ++P+SIH
Sbjct: 143 WGCMIRTSQNLLANTLLKLY----------PKNEP--EIVKLFQDGTSSPFSIHNFIRVA 190
Query: 105 ASEGKAV--GEWFGPNTVAQVLRKLA-------KYDDWSSIVFHVALDNTLVVNQVKKLC 155
+ V GEWFGPN + +++L + D ++ ++ L ++++ +
Sbjct: 191 SLSPLHVKPGEWFGPNAASLSIKRLTNELLQDQELDGIRIPRVFISENSDLFDDEIRDVF 250
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
K AS ++++ P+RLGI +N Y N I +L+S Y
Sbjct: 251 AKEKSAS-------VLILFPIRLGIDKVNSYYYNSI--------------FHLLASKY-- 287
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
S G+ GGKP+ + YF+GY D+I+ DPH Q + ++ +D
Sbjct: 288 -----------SCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV--------ETPINMD 328
Query: 276 STYHCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
S YH +RL+I +DPS I V + Y D+K
Sbjct: 329 S-YHTTNYNRLNISLLDPSMMIGILVTNIDEYIDFK 363
>gi|50307871|ref|XP_453929.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|62899744|sp|Q6CQ60.1|ATG4_KLULA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49643063|emb|CAH01025.1| KLLA0D19536p [Kluyveromyces lactis]
Length = 450
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 82/310 (26%), Positives = 142/310 (45%), Gaps = 71/310 (22%)
Query: 18 DITSRLWFTYRKGFVPIG-----------------------DSGLT------TDKGWGCM 48
D+ SR++FTYR F PI ++ LT +D GWGCM
Sbjct: 64 DVHSRVFFTYRTQFTPIRRNENGPSPINFTLFFRDNPINTLENALTDPDSFYSDIGWGCM 123
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ-IALTGASE 107
+R GQ ++A A+ + L R+++ N + ++ L +++ F+D P S+H +
Sbjct: 124 IRTGQALLANAIQRVKLAREFRINASRIDDNELNLIRWFQDDVKYPLSLHNFVKAEEKIS 183
Query: 108 GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV--NQVKKLCTTNKRASSNP 165
G G+WFGP+ A+ ++ L + I + + + ++V ++ ++ A+
Sbjct: 184 GMKPGQWFGPSATARSIKTLIEGFPLCGIKNCIISTQSADIYEDEVTRIFHKDRDAN--- 240
Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
L+L+ +RLG+ IN +Y D+ KILSS P
Sbjct: 241 ----LLLLFAVRLGVDKINSLYWK--------------DIFKILSS-------------P 269
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
S+G+ GGKP+ +LYF GY ++ +LDPH Q + D + + + H + ++
Sbjct: 270 YSVGIAGGKPSSSLYFFGYQNENLFYLDPHNTQQSSLMMD-----DLEFYRSCHGHKFNK 324
Query: 286 LHILHMDPSI 295
LHI DPS+
Sbjct: 325 LHISETDPSM 334
>gi|119623097|gb|EAX02692.1| ATG4 autophagy related 4 homolog A (S. cerevisiae), isoform CRA_d
[Homo sapiens]
Length = 172
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 48/84 (57%), Positives = 64/84 (76%), Gaps = 1/84 (1%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W ++
Sbjct: 52 DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQ 111
Query: 78 -EAYLKILKMFEDRRTAPYSIHQI 100
+ Y +IL+ F DR+ YSIHQ+
Sbjct: 112 PKEYQRILQCFLDRKDCCYSIHQM 135
>gi|326430141|gb|EGD75711.1| pyruvate water dikinase [Salpingoeca sp. ATCC 50818]
Length = 1055
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 121/266 (45%), Gaps = 59/266 (22%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPI-GDSGLTTDKGWGCMLRCGQMVIAQALLF------ 62
Q+ + ++ I +W TYRKG+ PI GD+ LT+D GWGC R GQM++AQAL+
Sbjct: 590 QESDDLKAHIRRLVWLTYRKGYDPIHGDAQLTSDTGWGCTYRSGQMLLAQALMSNAEPSA 649
Query: 63 ----LHLGRDWQWNVNSKEEAYLKILKMFEDRR--TAPYSIHQIALTGASEGKAVGEWFG 116
L R W EE +L MF+D A +SI +A T K G+W
Sbjct: 650 RMQRLEGVRPSTWQ---HEETKRAVLSMFQDSHDPAAFFSIQHMAETSFVVRKKPGQWLS 706
Query: 117 PNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
P+ VA ++R+L + + V V + +R + W P +L+IPL
Sbjct: 707 PSEVALIIRRLNPP------------ETGMRVRIVNDTLLSTRRILAGEPWMPTLLMIPL 754
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
R G+ + P + P + F +P +G IGGKP
Sbjct: 755 RAGLDTLQPESV------------PAFVAF---------------FDWPWCVGAIGGKPG 787
Query: 237 HALYFIGYVGND---VIFLDPHTNQN 259
A Y++G + +D V++LDPHT ++
Sbjct: 788 SAYYYVG-IDHDRRRVLYLDPHTTRS 812
>gi|148707987|gb|EDL39934.1| autophagy-related 4B (yeast), isoform CRA_c [Mus musculus]
Length = 128
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 49/90 (54%), Positives = 65/90 (72%), Gaps = 1/90 (1%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 37 DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 96
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIA 101
++ ++Y +L F DR+ + YSIHQI
Sbjct: 97 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIG 126
>gi|432110194|gb|ELK33968.1| Cysteine protease ATG4A, partial [Myotis davidii]
Length = 256
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/85 (56%), Positives = 64/85 (75%), Gaps = 1/85 (1%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W ++
Sbjct: 46 DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEKQKEQ 105
Query: 78 -EAYLKILKMFEDRRTAPYSIHQIA 101
+ Y +IL+ F DR+ YSIHQ+
Sbjct: 106 PKEYQRILQCFLDRKDCCYSIHQMG 130
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 5/61 (8%)
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIA 296
Y I +G+++IFLDPHT Q D E+D D T+HC Q+ R++IL++DPS+A
Sbjct: 122 CCYSIHQMGDELIFLDPHTTQTF---VDTEEDGTVD-DQTFHCLQSPQRMNILNLDPSVA 177
Query: 297 V 297
+
Sbjct: 178 L 178
>gi|148693225|gb|EDL25172.1| autophagy-related 4D (yeast), isoform CRA_a [Mus musculus]
Length = 296
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 69/217 (31%), Positives = 112/217 (51%), Gaps = 36/217 (16%)
Query: 82 KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHV 140
+I+ F D AP+ +H++ G S GK G+W+GP+ VA +LRK + + S +V +V
Sbjct: 36 RIVSWFADHPRAPFGLHRLVELGRSSGKKAGDWYGPSVVAHILRKAVESCSEVSRLVVYV 95
Query: 141 ALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
+ D T+ V +L + + +W+ +V+++P+RLG + +NPVY+ +K
Sbjct: 96 SQDCTVYKADVARLLSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK-------- 144
Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI 260
++L S LG++GGKP H+LYFIGY + +++LDPH Q
Sbjct: 145 ------ELLRSEL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP- 184
Query: 261 GCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
D Q S L+S +HC ++ MDPS V
Sbjct: 185 --TVDVSQPS-FPLES-FHCTSPRKMAFAKMDPSCTV 217
>gi|339252578|ref|XP_003371512.1| cysteine protease ATG4B [Trichinella spiralis]
gi|316968242|gb|EFV52545.1| cysteine protease ATG4B [Trichinella spiralis]
Length = 414
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 90/346 (26%), Positives = 145/346 (41%), Gaps = 92/346 (26%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L D +R+WFTYR GF I + D GWGC +R GQM++A+ +L +LGRDW
Sbjct: 78 LSDFLEDFRTRIWFTYRHGFPCIPGTKFDNDCGWGCTIRSGQMLLAETMLRHYLGRDWLL 137
Query: 72 NVNS--KEEAYL--KILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLR- 125
+ ++EA + K++ +F D T+P+S+H + G GK G W+GP +V Q+L+
Sbjct: 138 GQSGLPEDEALMHRKVIGLFCDNLTSPFSLHNLVQVGQQLFGKQAGSWYGPVSVLQILQV 197
Query: 126 ---KLAKYDDWSSIVFHVALDNTLVVNQVKKLCT------------TNKRASSNPQ---- 166
+ + HV D L+++ V++L N A P+
Sbjct: 198 AMNNAIERGLVEGLAVHVIGDGELIIDDVERLGCGLTLAPVPRRGPENDLADRQPKSSSY 257
Query: 167 ----------------------------------WQPLVLV-IPLRLGIQDINPVYINGI 191
W VLV +PLRLG++ N +Y + +
Sbjct: 258 LDLRRLTSVSNGDLLPSHDGESIGSTEFVDETRSWSRGVLVLLPLRLGVEKFNQLYSDHL 317
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
K ++LS+ + +GVIGG+ + YF G+ + +I
Sbjct: 318 K--------------RVLSTKF-------------CVGVIGGRHHKCYYFCGWHTDYLIR 350
Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
LDPH +Q D Q ++HC + I +DP ++
Sbjct: 351 LDPHYSQP---AVDATQPGVSL--HSFHCKYPKKTLIADIDPWCSI 391
>gi|256071261|ref|XP_002571959.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
gi|353229490|emb|CCD75661.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
Length = 376
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 145/284 (51%), Gaps = 37/284 (13%)
Query: 31 FVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAY-LKILKMFED 89
++P+ S T+D GWGCM RCGQM++AQAL+ LGR+W+ N ++ + L+I+K F D
Sbjct: 60 YIPL--SVQTSDCGWGCMFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFND 117
Query: 90 RRT--APYSIHQIALTGASEGKAVGEWFGPNTV-AQVLRKLAKYDDWSS----IVFHVAL 142
+ +P S+H+ L S+ K GEW GP+++ + +LR +AK S + ++A
Sbjct: 118 SWSPFSPLSLHR--LVQMSDRKP-GEWCGPSSICSAILRVMAKGSSLDSRLSQVQVYLAR 174
Query: 143 DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY---INGIKKCYALPI 199
D + ++ L + ++ Q+QP ++ D +Y + ++
Sbjct: 175 DRVIYREEIIDLA---RGLHTSYQYQP-------KIYFTDHTALYRSQSDQTNDSHSFKP 224
Query: 200 SPVYDMVKILSSTYNMQTPRY------EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
+ + ++ ++ N PRY F+ P +G+IGG+ H+ Y++G N +I+LD
Sbjct: 225 TAILLLIPLMFGKGNRINPRYIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLD 284
Query: 254 PHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
PH Q + +S K ++HCP + +++PS AV
Sbjct: 285 PHFTQPT-----QNLNSPKFSVDSWHCPIPKTMSAANLNPSCAV 323
>gi|363754893|ref|XP_003647662.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891299|gb|AET40845.1| hypothetical protein Ecym_6474 [Eremothecium cymbalariae
DBVPG#7215]
Length = 469
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 93/316 (29%), Positives = 151/316 (47%), Gaps = 72/316 (22%)
Query: 14 QIRRDITSRLWFTYRKGFVPI-----GDSGL------------------------TTDKG 44
+ +D+ SRL FTYR F PI G S + TD G
Sbjct: 62 EFLKDVNSRLHFTYRTRFAPIPRHIDGPSPMRISILLRDNPLNVIENVLNNLDCFQTDIG 121
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE--EAYLKILKMFEDRRTAPYSIHQIAL 102
WGCM+R GQ ++A AL +LGRD++ + + + E +KI++ FED P+S+H+
Sbjct: 122 WGCMIRTGQSLLANALQLANLGRDFRISGSDSDINEVEMKIIRWFEDNPKHPFSLHKFVQ 181
Query: 103 TGAS-EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNT--LVVNQVKKLCTTNK 159
G GK GEWFGP+ +++ +R L S I + ++ + ++++ L N
Sbjct: 182 EGYKLSGKKPGEWFGPSAISRSIRSLVMKFPGSGIDHCIISTDSADVYLDEIDPLFRANP 241
Query: 160 RASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPR 219
+A+ +L++ +RLG+ N Y + IK ILSS+
Sbjct: 242 KANV-------LLLLGVRLGVDFTNEYYWDDIKN--------------ILSSS------- 273
Query: 220 YEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYH 279
QS+G+ GG+P+ +LYF GY G+ + +LDPH Q +Y E D E+ + H
Sbjct: 274 ------QSVGISGGRPSSSLYFFGYQGDYLFYLDPHKVQLNLALY--ESDEERF--HSVH 323
Query: 280 CPQASRLHILHMDPSI 295
+++H+ +DPS+
Sbjct: 324 PQTFNKIHLSAIDPSM 339
>gi|156042330|ref|XP_001587722.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980]
gi|154695349|gb|EDN95087.1| hypothetical protein SS1G_10962 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 414
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 129/316 (40%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D +++W TYR F I S G T+D GWGC
Sbjct: 106 DFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCS------ 159
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
+S EE KIL +F D APYSIH+ GAS GK GE
Sbjct: 160 -------------------SSNEER--KILSLFADDPRAPYSIHKFVEHGASACGKHPGE 198
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L S + ++ D + V + K S+ ++ P +++
Sbjct: 199 WFGPSAAARCIQALTNSQVESELRVYITGDGSDVYEDT--FMSIAKPNST--KFTPTLIL 254
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLG+ I PVY +K +P QS+G+ GG
Sbjct: 255 VGTRLGLDKITPVYWEALKSSLQMP---------------------------QSVGIAGG 287
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YFIG +D +LDPH + D +D + + H + RLHI MDP
Sbjct: 288 RPSSSHYFIGVQESDFFYLDPHQTRPALPFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDP 347
Query: 294 SIAVVSQ-RSYSDYKN 308
S+ + R +D+K+
Sbjct: 348 SMLIAFLIRDENDWKD 363
>gi|431896953|gb|ELK06217.1| Cysteine protease ATG4C [Pteropus alecto]
Length = 378
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 73/239 (30%), Positives = 109/239 (45%), Gaps = 59/239 (24%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 47 NVEEFRKDFISRIWLTYREEFPSIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 106
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 107 WPDALNIDNSDSESWTSHTVKKFTASFEASLSGERELKTPTISLKETIGRYSDDHEMQNE 166
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ +
Sbjct: 167 IYHRKIISWFGDSPLALFGLHQLIKYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPELQ 226
Query: 135 SIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIK 192
I +VA D T+ + V K C + A + +++++P+RLG + N Y+ +K
Sbjct: 227 GITIYVAQDCTVYSSDVIDKQCAS--MAPDITDDKAVIILVPVRLGGERTNIDYLEFVK 283
>gi|407408842|gb|EKF32115.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
Length = 357
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 75/239 (31%), Positives = 108/239 (45%), Gaps = 40/239 (16%)
Query: 23 LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
L+F+YR VP+ + G TTD WGCM+R GQM++A A + G +E +
Sbjct: 74 LYFSYRNRIVPLMN-GATTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGNERLQELRAR 132
Query: 83 ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKY--DDWSSIVFH 139
+F D +AP+ IH I G G GEWFGP +A+ L L A Y V
Sbjct: 133 TQTLFCDVPSAPFGIHAITSEGTKHGVKCGEWFGPTPIAKTLNALMASYLAAGGEGPVVL 192
Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
+ + + QVK+L Q +VL+IP+ LGI+ I+ Y +K+C +
Sbjct: 193 AFPERQIFLEQVKELLR---------QSMHVVLLIPVMLGIRVISEKYSQLMKRCLEM-- 241
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
S+G++GGK AL+ G+ +DV FLDPH Q
Sbjct: 242 -------------------------ESSIGILGGKSRSALFLFGHQDDDVFFLDPHRVQ 275
>gi|149020505|gb|EDL78310.1| rCG31864, isoform CRA_c [Rattus norvegicus]
Length = 337
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 112/217 (51%), Gaps = 36/217 (16%)
Query: 82 KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHV 140
+I+ F D AP+ +H++ G S GK G+W+GP+ VA +LRK + + + +V +V
Sbjct: 77 RIVSWFADHPQAPFGLHRLVELGQSSGKKAGDWYGPSVVAHILRKAVESCSEVTRLVVYV 136
Query: 141 ALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
+ D T+ V +L + + +W+ +V+++P+RLG + +NPVY+ +K
Sbjct: 137 SQDCTVYKADVARLVSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK-------- 185
Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI 260
++L S LG++GGKP H+LYFIGY + +++LDPH Q
Sbjct: 186 ------ELLRSEL-------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP- 225
Query: 261 GCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
D Q + L+S +HC ++ MDPS V
Sbjct: 226 --TVDVNQ-ANFPLES-FHCTSPRKMAFAKMDPSCTV 258
>gi|145549650|ref|XP_001460504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428334|emb|CAK93107.1| unnamed protein product [Paramecium tetraurelia]
Length = 402
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/327 (25%), Positives = 141/327 (43%), Gaps = 65/327 (19%)
Query: 1 MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQAL 60
+R+ + + Q +E+++R +S +WF+YRK S LT+D GWGCM+R QM +AQ +
Sbjct: 52 VRNPSFILKQRIEKLKRICSSIIWFSYRKKIPQFQISSLTSDTGWGCMIRVAQMALAQVI 111
Query: 61 LFLH----------LGRDWQWNVNSKEEAYLKILKMFEDRRT----APYSIHQIALTGAS 106
H L R + ++ ++ + +K + + AP+SI +I
Sbjct: 112 RHYHSFTQPEQLIVLIRHF---LDDDDDELINFIKQDQKNQVQYYHAPFSIQKIVYHAKV 168
Query: 107 E-GKAVGEWFGPNTVAQVLRKLAKYDDWS-----SIVFHVALDNTLVVNQV--------- 151
E K G+W+ PN + + L L KY +S I + A + Q+
Sbjct: 169 EFKKEPGDWYKPNEILETLNYLFKYSQYSLNMQIYINYQCAFILQDAIKQMFNYDKGNQE 228
Query: 152 -KKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILS 210
K C N + + + + +P R+G+Q +N Y+ +++ IL
Sbjct: 229 WLKECIKNNNQFISQHDKGIAIFLPARIGLQRVNQDYL---------------EVLNIL- 272
Query: 211 STYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDS 270
T P G+IGG N A Y +G + + +I+LDPH QN D
Sbjct: 273 -----------MTLPYFQGIIGGVTNRAFYIVGRIQDYLIYLDPHFVQNAQNFEDL---- 317
Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
K ++Y C +H +DPSI V
Sbjct: 318 -SKTQASYTCQNIQLIHNKSIDPSIVV 343
>gi|145510316|ref|XP_001441091.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408330|emb|CAK73694.1| unnamed protein product [Paramecium tetraurelia]
Length = 392
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 82/324 (25%), Positives = 142/324 (43%), Gaps = 75/324 (23%)
Query: 9 HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
Q +E++++ + +WF+YRK S LT+D GWGCM+R QM +AQ + +
Sbjct: 51 EQRIEKVKQTCSKIIWFSYRKNIPKFQVSSLTSDTGWGCMIRVAQMALAQIIRY------ 104
Query: 69 WQWNVNSKEEAYLKILKMFEDRRT-------------------APYSIHQIALTGASE-G 108
+N K E + +++ F D AP+SI +I E
Sbjct: 105 --YNYFKKPEQLIVLIRHFIDDDDNELTDFIQQFHKNQNQYYHAPFSIQKIVHYAKVELK 162
Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWS-SIVFHVALDNTLVV-NQVKKLCTTNK------- 159
K G+W+ + + Q L L KY +S ++ ++ D ++ + ++++ +
Sbjct: 163 KEPGDWYKSDEILQTLDYLFKYSQYSLNMEIYINYDCAFILQDAIQQMFNQQEGNEIWLK 222
Query: 160 -RASSNPQW-----QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
RA +N Q+ + + + +P R+G+Q+IN Y+ + + ALP
Sbjct: 223 ERAKNNNQFDLQDHKGICIFLPTRIGLQNINKDYLEVLNQIIALPYFQ------------ 270
Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKK 273
G+IGG ALYF+G + + +I+LDPH QN + D K
Sbjct: 271 ---------------GMIGGVSKRALYFVGRIQDYLIYLDPHFVQNA-----QNFDDLSK 310
Query: 274 LDSTYHCPQASRLHILHMDPSIAV 297
++Y C +H +DPSI V
Sbjct: 311 NQASYTCQNIQLIHNSLIDPSIVV 334
>gi|219129924|ref|XP_002185127.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403306|gb|EEC43259.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 557
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 95/327 (29%), Positives = 141/327 (43%), Gaps = 54/327 (16%)
Query: 15 IRRDITSRL-WFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW---Q 70
IRRD L WFTYR F I +T+D GWGCMLR QM++ QAL RDW Q
Sbjct: 167 IRRDDERSLFWFTYRCDFPEIAPYNITSDAGWGCMLRSAQMMLGQALRLHFKSRDWRPPQ 226
Query: 71 WNVNSKEEAYLK-ILKMFEDRRTAP---YSIHQIALTGASE-GKAVGEWFGPNTVAQVLR 125
+++++++ +L F D ++ YS+H + G S+ K GEW+GP T V+R
Sbjct: 227 LLARRRQDSFIRSVLTWFADYPSSSESVYSLHNMVAAGLSKYDKLPGEWYGPGTACYVMR 286
Query: 126 KLAKYDDWSSIVFHVALD-----------NTLVVNQVKKLCTTNKRA----------SSN 164
L + + LD T+ + + TT R +
Sbjct: 287 DLVHIHEKQQALGKTRLDRRIFRVYVAPQGTVYRDTIHAFMTTEARVRIEEKKKVKEQTQ 346
Query: 165 PQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTF 224
PQ PL L L ++ N V + + + L+ Y +Q+ + F+
Sbjct: 347 PQAHPLDLEWEEEL-MESANTVEWDTALLLLVP----LRLGLTSLNEEY-VQSLAHTFSL 400
Query: 225 PQSLGVIGGKPNHALYFIGYV--GNDVIFLDPHT------------NQNIGCVYDKEQDS 270
PQS+GV+GG+P A +F G G+ + LDPHT N V + D
Sbjct: 401 PQSVGVLGGRPRGARWFYGAQKDGSKIFGLDPHTVQTAPGRQTARVNGQASSVVELSDDY 460
Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
+ +T CP+ MDPSIA+
Sbjct: 461 LRSCHTT--CPEM--FPFCKMDPSIAL 483
>gi|71415152|ref|XP_809652.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70874068|gb|EAN87801.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
Length = 357
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 111/239 (46%), Gaps = 40/239 (16%)
Query: 23 LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
L+F+YR VP+ + G TTD WGCM+R GQM++A A + G +E +
Sbjct: 74 LYFSYRNRIVPLMN-GATTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 132
Query: 83 ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKY--DDWSSIVFH 139
+F D +AP+ IH + G G GEWFGP +A+ L L A Y V
Sbjct: 133 TQTLFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSALMASYLAAGGEGPVVL 192
Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
+ + + +VK+L R S++ +VL+IP+ LGI+ I+ Y +K+C +
Sbjct: 193 AFPERQIFLEEVKELL----RQSTH-----VVLLIPVMLGIRVISEKYSQLMKRCLEM-- 241
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
S+G++GGK AL+ G+ +DV FLDPH Q
Sbjct: 242 -------------------------ESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQ 275
>gi|111154179|gb|ABH07411.1| autophagin-2 [Trypanosoma cruzi]
Length = 351
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 111/239 (46%), Gaps = 40/239 (16%)
Query: 23 LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
L+F+YR VP+ + G TTD WGCM+R GQM++A A + G +E +
Sbjct: 68 LYFSYRNRIVPLMN-GATTDLFWGCMIRTGQMMLAHAFMRYFNGGGPHIGSERLQELRAR 126
Query: 83 ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKY--DDWSSIVFH 139
+F D +AP+ IH + G G GEWFGP +A+ L L A Y V
Sbjct: 127 TQTLFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSALMASYLAAGGEGPVVL 186
Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
+ + + +VK+L R S++ +VL+IP+ LGI+ I+ Y +K+C +
Sbjct: 187 AFPERQIFLEEVKELL----RQSTH-----VVLLIPVMLGIRVISEKYSQLMKRCLEM-- 235
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
S+G++GGK AL+ G+ +DV FLDPH Q
Sbjct: 236 -------------------------ESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQ 269
>gi|407848120|gb|EKG03593.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
Clan CA, family C54, putative [Trypanosoma cruzi]
Length = 357
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 111/239 (46%), Gaps = 40/239 (16%)
Query: 23 LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
L+F+YR VP+ + G TTD WGCM+R GQM++A A + G + +E +
Sbjct: 74 LYFSYRNRIVPLMN-GATTDLFWGCMIRTGQMMLAHAFMRYFNGGGPRIGSERLQELRAR 132
Query: 83 ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKY--DDWSSIVFH 139
+F D +AP+ IH + G G GEWFGP +A+ L L A Y V
Sbjct: 133 TQTLFCDVPSAPFGIHAVTSEGTKHGVNCGEWFGPTPIAKTLSALMASYLATGGEGPVIL 192
Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
+ + + +VK+L R S++ +VL+IP+ LGI I+ Y +K+C +
Sbjct: 193 AFPERQIFLEEVKELL----RQSTH-----VVLLIPVMLGICVISEKYSQLMKRCLEM-- 241
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
S+G++GGK AL+ G+ +DV FLDPH Q
Sbjct: 242 -------------------------ESSIGILGGKSRSALFLFGHQDDDVFFLDPHCVQ 275
>gi|440297742|gb|ELP90383.1| cysteine protease atg4, putative [Entamoeba invadens IP1]
Length = 330
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 83/303 (27%), Positives = 142/303 (46%), Gaps = 59/303 (19%)
Query: 10 QDLEQIRRDIT--SR--LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
Q ++R DI SR +W TYRK + G T+D GWGCM+R QM +AQ+ + L +
Sbjct: 21 QHPRELREDINLYSRHTIWVTYRKNMKEL-PGGRTSDSGWGCMIRSMQMALAQSFVSLVM 79
Query: 66 GRDWQWNVNS----KEEAYLK-ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTV 120
G W++ + + +L+ I+ +F D + +SIH + + G G+W+GP+
Sbjct: 80 GNSWKFTKTGFQVERNKFHLRCIINLFGDGPGSLFSIHNLISRSTTRGVGDGKWWGPSFA 139
Query: 121 AQVLRKLAKYDDWSSI-VF----HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
+++ D ++I VF +VA +V + + + N P ++ +P
Sbjct: 140 SEI-----AADHLNTIHVFRTRGYVARLGRIVKPDILDI------SEDNGNILPTIIFVP 188
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
LRLG P++ D IL +++ PQ +G++GGKP
Sbjct: 189 LRLG------------------PVNAEEDFRPILKKVFDI---------PQCVGMVGGKP 221
Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
N A +F + GN + +LDPHT QN D +E +Y C + ++DPS+
Sbjct: 222 NLAFFFHTFDGNLLYYLDPHTTQN-AVSMDGGWSAE-----SYFCNDVKSMKYKNLDPSV 275
Query: 296 AVV 298
+++
Sbjct: 276 SLL 278
>gi|406606786|emb|CCH41822.1| putative cysteine protease atg4 [Wickerhamomyces ciferrii]
Length = 592
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 95/311 (30%), Positives = 151/311 (48%), Gaps = 67/311 (21%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGL------------------TTDKGWGCMLRCGQM 54
D+ +R+W TYR F PI G S L TTD GWGCM+R Q
Sbjct: 107 DVYTRIWLTYRTKFSPIDRDPEGPSPLSLNFFLRGQNYDLDNEHFTTDCGWGCMIRTSQS 166
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A ALL LH+GRDW++ E + +I+ F D + P+SIH+I G K GE
Sbjct: 167 LLANALLNLHIGRDWRY-TGELNEMHNEIVSWFIDCPSHPFSIHKIVDKGKLLSNKKPGE 225
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L D V+ + + N V K+ N ++P++++
Sbjct: 226 WFGPSAAARSIQSLCNEFDSGVKVYIGSDSGDIYENDVFKVA-----KDENGVFKPILIL 280
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ LRLGI +INPVY + +K IL+S +S+G+ GG
Sbjct: 281 LGLRLGIDNINPVYWDSLK--------------AILNSK-------------ESIGIAGG 313
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE--------KKLD-STYHCPQAS 284
+P+ + YF G+ G+ + +LDPH Q ++D + D+ LD ++ H +
Sbjct: 314 RPSTSHYFFGFQGDHLFYLDPHLPQ-PALLHDDQLDTSVSESTEIVSSLDVNSVHTKKLR 372
Query: 285 RLHILHMDPSI 295
++H+ +DPS+
Sbjct: 373 KIHLSEVDPSM 383
>gi|167521501|ref|XP_001745089.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776703|gb|EDQ90322.1| predicted protein [Monosiga brevicollis MX1]
Length = 392
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 131/287 (45%), Gaps = 56/287 (19%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
+Q+ D+ +R+WFTYRK F P+ S TTD GWGCMLRCGQM++A L+ + R
Sbjct: 118 QQLEDDVATRIWFTYRKDFPPLPSSRRTTDVGWGCMLRCGQMILATTLMAVLQPRVHHLL 177
Query: 73 VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY-D 131
+ E +LK + +HQ+ P+ +AQ L ++ D
Sbjct: 178 KYTMENHHLKAGRFQGPSSVGSALLHQV----------------PSALAQ----LNQFRD 217
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
+ + + A D ++++Q++ +++P++LV+PLRLGI+ I P Y
Sbjct: 218 EEVKLRTYFASDTLVILDQLRP-------EEGQAEFEPIMLVLPLRLGIEKIGPQY---- 266
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
+ +++L P +G IGG A+Y GY G+
Sbjct: 267 -----------HARLQLL------------LRQPWCMGFIGGHDKRAMYIFGYQGHQYFG 303
Query: 252 LDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
LDPH + + + +D ++ ++H + S + +DPS+AV
Sbjct: 304 LDPHRCSAAVAQSTAELRDRWVEVRDSFHTSKLSGIERDDLDPSLAV 350
>gi|258566559|ref|XP_002584024.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237907725|gb|EEP82126.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 377
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 67/208 (32%), Positives = 102/208 (49%), Gaps = 38/208 (18%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F I S G TTD GWGCM+R GQ
Sbjct: 95 DFESRIWITYRSNFPAIPKSKDPNAQQALTFSVRLRSQLLDTRGFTTDTGWGCMIRSGQS 154
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A ALL LGRDW+ + +E + +L +F DR AP+SIH+ GA+ GK GE
Sbjct: 155 LLANALLIQKLGRDWRRGSETGKE--IALLSLFADRPQAPFSIHRFVEHGAAACGKHPGE 212
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLV-VNQVKKLCTTNKRASSNPQWQPLVL 172
WFGP+ A+ + + + + +V D + V ++ +++ + +P ++
Sbjct: 213 WFGPSATARCIDECEH----AGLNVYVTSDGSDVHEDKFRQIAGLD-------DIKPTLI 261
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPIS 200
++ +RLGI I PVY + +K P S
Sbjct: 262 LLGVRLGIDSITPVYWDALKAIIQYPQS 289
>gi|384493397|gb|EIE83888.1| hypothetical protein RO3G_08593 [Rhizopus delemar RA 99-880]
Length = 194
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/112 (47%), Positives = 72/112 (64%), Gaps = 4/112 (3%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
D TSRLW TYR + PI S TD GWGC LR GQ ++A L+ LGRDW+ ++
Sbjct: 83 DFTSRLWMTYRHNYPPIRPSSHKTDIGWGCTLRSGQSLLANTLIIHFLGRDWRRQTQNQA 142
Query: 78 --EAYLKILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
+ Y +I+ F D + AP+SIH+IAL G GK +GEWFGP+T++QV++
Sbjct: 143 AWKQYSRIVHWFLDELSPRAPFSIHRIALLGKQLGKNIGEWFGPSTISQVIQ 194
>gi|344229797|gb|EGV61682.1| hypothetical protein CANTEDRAFT_115142 [Candida tenuis ATCC 10573]
Length = 408
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 73/312 (23%), Positives = 131/312 (41%), Gaps = 82/312 (26%)
Query: 19 ITSRLWFTYRKGFVPI-------------------------------GDSGLTTDKGWGC 47
+ + +W TYR GF PI + TTD GWGC
Sbjct: 77 VEALVWLTYRTGFEPIPKNPNGPHPLAFVQSMVFNKNPLSTNVHSFIDNENFTTDVGWGC 136
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE 107
M+R Q ++A ++ ++ + +++L F+D AP+S+H
Sbjct: 137 MIRTSQSLLANT---------YKRMISEDAQQEIQLLDQFKDSEAAPFSLHNFIRVANES 187
Query: 108 GKAV--GEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP 165
V G+WFGPN + +++L + L ++++++ L + +
Sbjct: 188 PLQVKPGQWFGPNAASLSIQRLCNLVNSKENFGLPGL--SVLISENSDLYDDKVQEFLDK 245
Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
+ Q L++++P+RLGI N Y + I +++L+
Sbjct: 246 KKQSLLILLPIRLGIDKTNEFYYSSI--------------LQLLNCK------------- 278
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
QS+G+ GGKP+ + YF GY +++++LDPH Q Y+ +YH P+ R
Sbjct: 279 QSVGIAGGKPSSSFYFFGYDNDELLYLDPHYPQGTNAGYN-----------SYHTPRYQR 327
Query: 286 LHILHMDPSIAV 297
L I +DPS+ +
Sbjct: 328 LTISQLDPSMMI 339
>gi|150864470|ref|XP_001383296.2| hypothetical protein PICST_30446 [Scheffersomyces stipitis CBS
6054]
gi|166990661|sp|A3LQU0.2|ATG4_PICST RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|149385726|gb|ABN65267.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 514
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 143/345 (41%), Gaps = 99/345 (28%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGF--VP-------------------------------I 34
S+Q E+ DI +L TYR GF +P I
Sbjct: 94 SYQTTEEAHEDIIKKLCLTYRYGFERIPRAVNGPSPLSFMQSVIFSKSLLYNLQNFNNFI 153
Query: 35 GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAP 94
TTD GWGCM+R Q ++A + L D Q + I+ +F D AP
Sbjct: 154 EKENFTTDVGWGCMIRTSQSLLANTFVRL---LDKQSD----------IIALFNDTYLAP 200
Query: 95 YSIHQIALTGASEGKAV--GEWFGPNTVAQVLRKLAK--YDDWSS------IVFHVALDN 144
+S+H +S V GEWFGPN + +++L YD+ +S I ++
Sbjct: 201 FSLHNFIRVASSSPLKVKPGEWFGPNAASLSIKRLCDGYYDNSTSETILPRINVLISEST 260
Query: 145 TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYD 204
L +Q+ +L + + + L++++P+RLGI IN Y + + +L
Sbjct: 261 DLYDSQIAQLLEPST------ETKGLLVLLPVRLGIDSINSYYFSSLLHLLSLE------ 308
Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
QS+G+ GGKP+ + YF GY N +I++DPH+ Q
Sbjct: 309 ---------------------QSVGIAGGKPSSSFYFFGYQDNSLIYMDPHSAQIFSSDI 347
Query: 265 DKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKN 308
D STY+ + R+ I +DPS+ + V R + Y+N
Sbjct: 348 DM---------STYYATRYQRVDIGKLDPSMLIGVFIRDLTSYEN 383
>gi|148693227|gb|EDL25174.1| autophagy-related 4D (yeast), isoform CRA_c [Mus musculus]
Length = 257
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 76/150 (50%), Gaps = 35/150 (23%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 167
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + ++ + +I+ F D AP+
Sbjct: 168 WVEGTGLASSEMPGPASPSRCRGPGRRGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
+H++ G S GK G+W+GP+ VA +LR
Sbjct: 228 GLHRLVELGRSSGKKAGDWYGPSVVAHILR 257
>gi|62899792|sp|Q8NJJ3.1|ATG4_PICPA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4; AltName:
Full=Pexophagy zeocin-resistant mutant protein 8
gi|21585563|gb|AAL25849.1| Paz8 [Komagataella pastoris]
Length = 533
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 77/279 (27%), Positives = 127/279 (45%), Gaps = 57/279 (20%)
Query: 1 MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIG---DS------------------GL 39
++ NK S + D+ S++W TYR GF PI DS G
Sbjct: 51 IKDGNKKSTTYSQSFIDDVYSKIWLTYRAGFPPIARDKDSPTFTLGALLRGQFDFNEIGF 110
Query: 40 TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA-YLKILKMFEDRRTAPYSIH 98
T+D GWGCM+R Q ++A ALLFLHLGRDW + + +I+ F D P+SIH
Sbjct: 111 TSDAGWGCMIRTSQSLLANALLFLHLGRDWVFKAKDPANVEHDRIISWFVDIPDEPFSIH 170
Query: 99 QIALTG-ASEGKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQVKKLCT 156
G K GEWFGP+ ++ ++ L K Y V+ + + +V++L
Sbjct: 171 NFVQQGIKCCDKKPGEWFGPSAASRAIKNLCKEYPPCGLRVYFSSDCGDVYDTEVRELAY 230
Query: 157 TNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ 216
+ + + P+++++ +RLG++ +N + +++C +L
Sbjct: 231 GD-----SDTFTPILVLLGIRLGVEKVNLYIGDLLRECLSL------------------- 266
Query: 217 TPRYEFTFPQSLGVIGGKPNH-ALYFIGYVGNDVIFLDP 254
QS+G+ G K + AL IG+ G+ + +L P
Sbjct: 267 --------KQSVGISGRKTSFLALLSIGFQGDYLFYLIP 297
>gi|221046296|dbj|BAH14825.1| unnamed protein product [Homo sapiens]
Length = 280
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 74/150 (49%), Gaps = 35/150 (23%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 131 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 190
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 191 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 250
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
+H++ G S GK G+W+GP+ VA +LR
Sbjct: 251 GLHRLVELGQSSGKKAGDWYGPSLVAHILR 280
>gi|353441084|gb|AEQ94126.1| putative cysteine protease [Elaeis guineensis]
Length = 169
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 68/114 (59%), Gaps = 1/114 (0%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
D +SR+W TYRKGF IGDS T+D WGCM+R QM++AQALLF HLGR W+ +
Sbjct: 48 EDFSSRIWITYRKGFDAIGDSKFTSDVRWGCMIRSSQMLVAQALLFHHLGRSWRKPSQKP 107
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK 129
+ Y++IL +F D +SIH + G + G A EW GP + + + +
Sbjct: 108 HDSKYIEILHLFGDSEACAFSIHNLLEAGKAYGLAAREWVGPYAMCRTWETITR 161
>gi|344304092|gb|EGW34341.1| hypothetical protein SPAPADRAFT_59751, partial [Spathaspora
passalidarum NRRL Y-27907]
Length = 363
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 81/339 (23%), Positives = 143/339 (42%), Gaps = 99/339 (29%)
Query: 9 HQDLEQIRRDITSRLWFTYRKGFVPIGDS------------------------------G 38
+ LE I+++LW +YR GF PI +
Sbjct: 61 YTSLEDAEHSISNKLWLSYRCGFDPITKAPDGPTPISFFPSLVFNKRLFTTVRSLFDSEN 120
Query: 39 LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
+D GWGCM+R Q ++A AL+ L + A +++ +F+D + +S+H
Sbjct: 121 FNSDVGWGCMIRTSQSLLANALMKL------------QPSAEHEVINLFQDNIASAFSLH 168
Query: 99 QIALTGASEGKAV--GEWFGPNTVAQVLRKLAKYDDWSSI-------VFHVALDNTLVVN 149
+ V G+WFGPN + +KL +I VF ++ ++ L
Sbjct: 169 NFIRVASESPLEVKPGQWFGPNAASLSTKKLLDGMKGKTIQGVKYPHVF-ISENSDLYDE 227
Query: 150 QVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKIL 209
++++L + ++++ P+RLGI ++N Y + I + A P +
Sbjct: 228 EIEELLVESS----------VLILFPVRLGIDNVNSYYYDSIFQLLACPFT--------- 268
Query: 210 SSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD 269
+G+ GGKP+ + YF+GY D+++ DPH+ Q +Y+ +
Sbjct: 269 ------------------VGISGGKPSSSFYFLGYQDQDLLYFDPHSPQ----LYENPIN 306
Query: 270 SEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYK 307
+TYH RLHI +DPS+ V + + S+YK
Sbjct: 307 Y-----TTYHTNNYQRLHIHMLDPSMMVGILVKDKSEYK 340
>gi|71043632|ref|NP_001020882.1| cysteine protease ATG4B [Rattus norvegicus]
gi|68533688|gb|AAH98833.1| ATG4 autophagy related 4 homolog B (S. cerevisiae) [Rattus
norvegicus]
Length = 224
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 88/181 (48%), Gaps = 56/181 (30%)
Query: 142 LDNTLVVNQVKKLCTTN-----------------------KRASSNP-QWQPLVLVIPLR 177
+DNT+V+ ++++LC + ++ P W+PLVL+IPLR
Sbjct: 1 MDNTVVMEEIRRLCRASLPCAGAAALSMESERHCNGLPAGAEVTNRPLAWRPLVLLIPLR 60
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG+ DIN Y+ +K C F PQSLGVIGGKPN
Sbjct: 61 LGLTDINEAYVETLKHC---------------------------FMMPQSLGVIGGKPNS 93
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSIA 296
A YFIGYVG ++I+LDPHT Q + DS D ++HC R+ I +DPSIA
Sbjct: 94 AHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPCRMGIGELDPSIA 149
Query: 297 V 297
V
Sbjct: 150 V 150
>gi|149020503|gb|EDL78308.1| rCG31864, isoform CRA_a [Rattus norvegicus]
Length = 256
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 55/150 (36%), Positives = 77/150 (51%), Gaps = 36/150 (24%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ S LT+D GWGCMLR GQM++AQ LL L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGS-LTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 166
Query: 71 W----NVNSKE-------------------------------EAYLKILKMFEDRRTAPY 95
W + S E + +I+ F D AP+
Sbjct: 167 WVEGTGLASSEMPGPASPSRYRGPGRRGPLRCAQGALEMEPDRWHRRIVSWFADHPQAPF 226
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLR 125
+H++ G S GK G+W+GP+ VA +LR
Sbjct: 227 GLHRLVELGQSSGKKAGDWYGPSVVAHILR 256
>gi|367014015|ref|XP_003681507.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
gi|359749168|emb|CCE92296.1| hypothetical protein TDEL_0E00530 [Torulaspora delbrueckii]
Length = 460
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 133/317 (41%), Gaps = 76/317 (23%)
Query: 14 QIRRDITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKG 44
Q D+ SRL FTYR FVPI G S L+ TD G
Sbjct: 60 QFLSDVHSRLHFTYRTKFVPIPRVSDGPSPLSFHFLIRENPLTTIENAIYNPDCFNTDIG 119
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
WGCM+R GQ ++ AL +LGRD++ N +E Y KI+ F D A +SIH G
Sbjct: 120 WGCMIRTGQSLLGNALQIANLGRDFRVNQGKDQEEY-KIIDWFADTPQAHFSIHNFVSQG 178
Query: 105 AS-EGKAVGEWFGPNTVAQVLRKLA-KYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA- 161
K GEWFGP ++ ++ L ++ D +D L+ + R
Sbjct: 179 LKLSNKKPGEWFGPAATSRSIQCLVEQFPD-------CGIDKCLISVSSGDVFEDEVREI 231
Query: 162 -SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
+ PQ R+ + + +N + + Y D+ K L S +
Sbjct: 232 FAQKPQ---------SRILLLLGVKLGVNAVNEYYW------DDVKKTLGSKF------- 269
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
S+G+ GG+P+ +LYF+G+ GN++I+ DPHT Q + T H
Sbjct: 270 ------SVGIAGGRPSSSLYFMGFQGNELIYFDPHTPQ-------PSLQTSANFYDTCHA 316
Query: 281 PQASRLHILHMDPSIAV 297
+L + +DPS+ +
Sbjct: 317 LNFGKLLLSDLDPSMLI 333
>gi|400593108|gb|EJP61110.1| peptidase family C54 [Beauveria bassiana ARSEF 2860]
Length = 378
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/222 (31%), Positives = 101/222 (45%), Gaps = 51/222 (22%)
Query: 5 NKLSHQD--LEQIRRDITSRLWFTYRKGFVPI----------------------GD-SGL 39
+++ H++ +Q D SR W TYR F PI GD G
Sbjct: 103 DEMDHENGWPQQFITDFDSRFWMTYRNDFKPIPRSKDPKAASSMSFPMRIKYQLGDQGGF 162
Query: 40 TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ 99
++D GWGCM+R GQ ++A A + LGRDW+ EE +KI++MF D APYSIH
Sbjct: 163 SSDSGWGCMIRSGQSLLANATGIVRLGRDWRRGQQKAEE--IKIMRMFADDPAAPYSIHN 220
Query: 100 IALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN 158
G+S+ GK GEWFGP+ +Q + Y+D S + D+
Sbjct: 221 FVDYGSSKCGKYPGEWFGPSATSQCINPDV-YED--SFMATAKSDHGF------------ 265
Query: 159 KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
++P +++I RLGI I VY + +P S
Sbjct: 266 --------FKPTLILISTRLGIDKITQVYWEALISALQMPQS 299
>gi|342186623|emb|CCC96110.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 388
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 111/247 (44%), Gaps = 40/247 (16%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
+R L+F+YR+ F P+ +G T+D GWGC +R QM++A A + G + N
Sbjct: 86 VRAAAQKLLYFSYRRQFEPL-RNGATSDVGWGCTIRACQMMLAWAFMRYRNGGSVTMDDN 144
Query: 75 SKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA---KYD 131
+ ++F D TAP+ IH + G G G WFGP +A+V+ L +
Sbjct: 145 VVDSLKEFTQRLFYDVPTAPFGIHAMTNEGVRHGVTCGMWFGPTPMAKVIGALNEAYRSS 204
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
VA D + V+ + +R+ Q +VL+IP++LG Q ++ Y N +
Sbjct: 205 GGEGPEVLVASDRQI---GVQDVVVRLQRS------QHVVLLIPVKLGPQTVSVTYANAL 255
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
K+ F S+G +GG+ N A +F GY G+ +I
Sbjct: 256 KR---------------------------FFEMGSSIGAVGGEKNSAYFFFGYQGDKIIH 288
Query: 252 LDPHTNQ 258
LDPH Q
Sbjct: 289 LDPHYVQ 295
>gi|358336800|dbj|GAA27956.2| autophagy-related protein 4 [Clonorchis sinensis]
Length = 507
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 129/297 (43%), Gaps = 56/297 (18%)
Query: 40 TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNS------KEEAYLKILKMFED--RR 91
T+D GWGCM+R GQM++AQ L+ LGRDW+ + ++ + ++++ F D +
Sbjct: 183 TSDSGWGCMIRSGQMLLAQTLMIHLLGRDWRAFRGTSPIKTPEDHLHRQLIRWFHDCWSQ 242
Query: 92 TAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA-----KYDDWSSIVFHVALDNTL 146
+P+S+H++ + G+ G WFGP T+ L K+ ++++ + + + D +
Sbjct: 243 ESPFSLHRLV---QASGQLPGSWFGPATLCSALVKVMSDASRRFEELARVHIYWVRDRVI 299
Query: 147 VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMV 206
++ L + QP V P RL D + + + + + PI P Y
Sbjct: 300 YREEIMNLA----------RGQP-VRRKPGRLNFTDFSENFQHCCSQECSPPIPPTYLQD 348
Query: 207 KILSSTYNMQTPRYEFTF--------------------------PQSLGVIGGKPNHALY 240
I SS P + P +G+IGG+P H++Y
Sbjct: 349 GIQSSPSTTLFPSHAVILLLPIRLGLDKRIDARYVPMVCRLVRDPCFVGIIGGRPRHSIY 408
Query: 241 FIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+G +I LDPH Q V DSE+ T+HC + +DPS AV
Sbjct: 409 ILGCQNTQLIHLDPHFTQP---VVRNVVDSEEFNVKTWHCLVPRVIEAAKLDPSCAV 462
>gi|340508502|gb|EGR34192.1| hypothetical protein IMG5_021070 [Ichthyophthirius multifiliis]
Length = 285
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 72/285 (25%), Positives = 126/285 (44%), Gaps = 62/285 (21%)
Query: 19 ITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
S +W TYR+ F P+ +D GWGCM+R GQM +A+ L + D
Sbjct: 2 FESIIWITYRRKFPPLKAPQYEYISDTGWGCMIRVGQMALAEGLKRFQIKED-------- 53
Query: 77 EEAYLKILKMFEDRRTAPYSIHQIALTGASEGK-AVGEWFGPNTVAQVLRKLAKYDDWSS 135
+I+ +F+D++ + +SI I G E K G+WF P + +L+ L + +
Sbjct: 54 -----EIIDLFQDKKDSLFSIQNICEAGKEEFKLEAGDWFNPIRICYILQILNEKKGFKD 108
Query: 136 I-VFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
+ + ++ D L+ ++ ++ K L+L + +LG++ Y+
Sbjct: 109 LKIRTISSDRILIFEDLEMEFSSEKNG--------LILFLVCKLGLEKTEENYLK----- 155
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
AL I F + S+G+IGGKP AL+F+G + + +I+LDP
Sbjct: 156 IALKI----------------------FDYKNSIGMIGGKPKKALFFVGRIEDQLIYLDP 193
Query: 255 HTNQNIGCVYDKEQDSEKKLD-STYHCPQASRLHILHMDPSIAVV 298
H Q+ ++ +D ++Y C + L +D SI V
Sbjct: 194 HYVQDF---------NQNNVDQNSYFCKNYAVLDQKKIDSSIGNV 229
>gi|448509127|ref|XP_003866066.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
gi|380350404|emb|CCG20626.1| hypothetical protein CORT_0A02350 [Candida orthopsilosis Co 90-125]
Length = 419
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 95/348 (27%), Positives = 154/348 (44%), Gaps = 96/348 (27%)
Query: 4 ANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDS-------------------------- 37
N +QD + R I S LW +YR GF PI S
Sbjct: 72 GNHFINQD--EARDHIYSLLWLSYRCGFSPIPKSIDGPQPVTFFPSLLFSKSTLTNVGNL 129
Query: 38 -------GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDR 90
T+D GWGCM+R Q ++A A L + N N + L+ILK+F+D
Sbjct: 130 RSLFDNENFTSDAGWGCMIRTSQNLLANA----LLKLAGEANGNVQ----LEILKLFQDD 181
Query: 91 RTAPYSIHQIALTGASEGKAV--GEWFGPNTVAQVLRKLA--KYDDWSSIV---FHVALD 143
A +SIH ++ +V G+WFGPN + +R+L D S V +++ +
Sbjct: 182 PNAAFSIHNFIRVASASPLSVKPGQWFGPNAASISIRQLTIEMTDQESPTVVPFVYISEN 241
Query: 144 NTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVY 203
L +++++ KR PL+L+ P+RLGI +N Y I
Sbjct: 242 ADLYDDEIEETFLKEKR--------PLLLLFPVRLGIDHVNKYYYKSI------------ 281
Query: 204 DMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND-VIFLDPHTNQNIGC 262
+++L+S + S+G+ GGKP+ + YFIGY ++ +I+ DPH Q
Sbjct: 282 --LQLLASRF-------------SVGIAGGKPSSSFYFIGYENDENLIYFDPHLPQ---- 322
Query: 263 VYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
V++ + ++YH ++L I +DPS+ + V S S+Y+ +
Sbjct: 323 VFESPINL-----ASYHTLNYNKLSIEMLDPSMMIGVLLGSMSEYREL 365
>gi|345311182|ref|XP_001519565.2| PREDICTED: cysteine protease ATG4D-like, partial [Ornithorhynchus
anatinus]
Length = 147
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 72/147 (48%), Gaps = 33/147 (22%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+E +RD SRLW TYR+ F P+ S T+D GWGCMLR GQM++AQ L+ L RDW
Sbjct: 1 DVESFQRDFVSRLWLTYRRDFPPLEGSAWTSDCGWGCMLRSGQMLLAQGLVVHLLSRDWI 60
Query: 71 WN---------------------------------VNSKEEAYLKILKMFEDRRTAPYSI 97
W + +E + +I+ F D AP+S+
Sbjct: 61 WAEAGPAPKPGEHRLLKSDPGGPSRSPAPPPPAGVLQEQERQHRRIVSWFADHPQAPFSL 120
Query: 98 HQIALTGASEGKAVGEWFGPNTVAQVL 124
H++ G GK G+W+GP+ A +L
Sbjct: 121 HRLVRLGQGSGKRAGDWYGPSLTAHLL 147
>gi|358339268|dbj|GAA47364.1| autophagy-related protein 4 [Clonorchis sinensis]
Length = 700
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 71/98 (72%)
Query: 54 MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGE 113
M++A+A+ +HLG+DW+W ++EAY ++ +MF+D +++ YSI I + G + K +G
Sbjct: 1 MMLAEAITRIHLGKDWRWTPGCQDEAYCRLRRMFQDHKSSLYSIQNITMLGMALDKPIGS 60
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQV 151
WFGPNTVAQV++KL YD ++ H+++++ ++V+++
Sbjct: 61 WFGPNTVAQVIKKLCAYDPCTNWYVHISVEDGVIVDEI 98
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 72/147 (48%), Gaps = 34/147 (23%)
Query: 153 KLCTTNKRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
+L + A+ +P W+PL+L IPLRLG+ NP Y N IK +
Sbjct: 246 RLQASEIEATPSPATWRPLLLFIPLRLGLHQPNPCYFNAIKAILQI-------------- 291
Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN-DVIFLDPHTNQNIGCVYDKEQDS 270
P S+G++GG+P+HA++ +G G+ D++ LDPHT Q + D
Sbjct: 292 -------------PHSIGIMGGRPSHAVWIVGTAGDEDLLCLDPHTTQPAS-----QDDL 333
Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
+ D T+HC RL + +DPS+ +
Sbjct: 334 TAEDDVTHHCDCPVRLPLERLDPSMVI 360
>gi|323331874|gb|EGA73286.1| Atg4p [Saccharomyces cerevisiae AWRI796]
Length = 347
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 113/257 (43%), Gaps = 54/257 (21%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS- 106
M+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 1 MIRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTEL 60
Query: 107 EGKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKR 160
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N
Sbjct: 61 SDKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPN 116
Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
+ ++ ++ ++LGI +N Y I ILSST
Sbjct: 117 SR-------ILFLLGVKLGINAVNESYRESI--------------CGILSST-------- 147
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 148 -----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHT 193
Query: 281 PQASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 194 SKFGKLQLSEMDPSMLI 210
>gi|429850312|gb|ELA25600.1| cysteine protease atg4 [Colletotrichum gloeosporioides Nara gc5]
Length = 411
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 76/305 (24%), Positives = 122/305 (40%), Gaps = 95/305 (31%)
Query: 18 DITSRLWFTYRKGFVPIG-----------------------DSGLTTDKGWGCMLRCGQM 54
D S+ W TYR F I SG ++D GWGCM+R GQM
Sbjct: 114 DFESKFWMTYRSEFELIAKSTDPRASSALSLSMRIKSQLVDQSGFSSDSGWGCMIRSGQM 173
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEW 114
++A A+ +LGR + GK GEW
Sbjct: 174 LLANAMAITNLGR-------------------------------------VACGKYPGEW 196
Query: 115 FGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKRASSNPQWQPLVLV 173
FGP+ A+ ++ L + S+ + D V ++ K+ + ++ P +++
Sbjct: 197 FGPSATARCIQSLTNAQEQPSLRVYSTGDGPDVYEDKFMKIAKPD-----GTRFHPTLIL 251
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI I PVY + + +P QS+G+ GG
Sbjct: 252 VGTRLGIDKITPVYWDALIAALQMP---------------------------QSVGIAGG 284
Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YFIG G+ + +LDP HT + D + ++ +D T H + RLH+ MD
Sbjct: 285 RPSASHYFIGAQGSFLFYLDPHHTRPALPYHSDPSRYTDADID-TAHTRRLRRLHVREMD 343
Query: 293 PSIAV 297
PS+ +
Sbjct: 344 PSMLI 348
>gi|261335715|emb|CBH18709.1| peptidase, putative [Trypanosoma brucei gambiense DAL972]
Length = 348
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 78/257 (30%), Positives = 115/257 (44%), Gaps = 54/257 (21%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E ++ L+F+YR F P+ +G TTD GWGC +R GQM++A AL+ R
Sbjct: 40 EMVKLAACKLLYFSYRCQFEPL-RNGSTTDIGWGCTIRAGQMMLAHALM-----RYKNGG 93
Query: 73 VNSKEEAYLKILK-----MFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
S E++ + LK +F D +AP+ IH I G G G WFGP VA V+ L
Sbjct: 94 GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVVMGAL 153
Query: 128 AKYDDWSSIVFH-----VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQD 182
+D+ S V D ++ ++V+K+ +K IP+ LG
Sbjct: 154 --MEDYLSSGGQGPDVLVLRDRQVMEDEVRKILLLSKHVLLL---------IPVMLGPHH 202
Query: 183 INPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFI 242
I+ Y +K+C R E T +G +GGK A +F+
Sbjct: 203 ISEGYAKLLKRCL-----------------------RMEST----VGAVGGKEGSAFFFM 235
Query: 243 GYVGNDVIFLDPHTNQN 259
GY G ++I LDPH Q+
Sbjct: 236 GYQGGNLIVLDPHYAQS 252
>gi|340059839|emb|CCC54236.1| putative peptidase [Trypanosoma vivax Y486]
Length = 354
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 68/248 (27%), Positives = 117/248 (47%), Gaps = 42/248 (16%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQA-LLFLHLGRDWQWNV 73
+ R L+F+YR GF P+ + G TTD WGC++R QM++AQA + F + G + +
Sbjct: 61 VTRATQKLLYFSYRCGFTPLSN-GSTTDVAWGCVVRAAQMLLAQAHMRFFNSGHAFV-DG 118
Query: 74 NSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
++ + K+ +F D +AP+ IH + G A G+WFG A+ + L +
Sbjct: 119 SALQILREKVQPLFLDDPSAPFGIHAMTSEAEKYGVACGQWFGMTPAAKTIASLCQQHSL 178
Query: 134 ---SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
+ V +D + +V+ L + +++ +VL+IP LG+ I+ Y
Sbjct: 179 RGGNGPAVLVFVDREVSALKVRDLLSHSRQ---------VVLLIPAVLGLDRISVKYSKM 229
Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
+ +C + +GVIGG+ + ALYF+G+ N++I
Sbjct: 230 LIRCLEM---------------------------ESCIGVIGGRKSSALYFVGHQSNNII 262
Query: 251 FLDPHTNQ 258
+LDPH Q
Sbjct: 263 YLDPHRAQ 270
>gi|444321667|ref|XP_004181489.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
gi|387514534|emb|CCH61970.1| hypothetical protein TBLA_0G00200 [Tetrapisispora blattae CBS 6284]
Length = 577
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 83/326 (25%), Positives = 130/326 (39%), Gaps = 78/326 (23%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPI-----------------------------GDSG 38
+ D + D SRL FTYR F PI +
Sbjct: 122 ADDDSVEFLEDCKSRLIFTYRTNFSPIERAPDGPSPINVSVLFRDTLFNTVNHVLNNPNS 181
Query: 39 LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-----NVNSKEEAYLKILKMFEDRRTA 93
TTD GWGCM+R GQ ++ AL ++LGR+++ N N+K I++ F D
Sbjct: 182 FTTDIGWGCMIRTGQSLLGNALQIINLGRNFRINNQSNNPNTKNIKEEDIIEWFYDNPNK 241
Query: 94 PYSIHQIALTGAS-EGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
P+SIH+ G K GEWFGP+T ++ L Y+ +D ++
Sbjct: 242 PFSIHKFVDKGMRISDKKPGEWFGPSTTCTAIQSLI-YE-----FPECGIDECILSVSSG 295
Query: 153 KLCTTNKRASSNPQWQPLVLV-IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSS 211
+ ++L+ + ++LGI IN Y N IK IL+S
Sbjct: 296 DIYEDEINEHFQKNENTIILILLGVKLGIDKINQCYFNDIK--------------DILNS 341
Query: 212 TYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
Y S G+ GG+P+ +LYF G++ + + DPH Q
Sbjct: 342 RY-------------SCGISGGRPSSSLYFFGHMNEYLYYFDPHKPQ---------LQLN 379
Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
+ ++ H S++ I +DPS+ +
Sbjct: 380 EDFKNSCHSTDYSKILISEIDPSMLI 405
>gi|74026240|ref|XP_829686.1| peptidase [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|70835072|gb|EAN80574.1| peptidase, putative [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 348
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 113/255 (44%), Gaps = 50/255 (19%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E ++ L+F+YR F P+ +G TTD GWGC +R GQM++A AL+ R
Sbjct: 40 EMVKLAACKLLYFSYRCQFEPL-RNGSTTDIGWGCTIRAGQMMLAHALM-----RYKNGG 93
Query: 73 VNSKEEAYLKILK-----MFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
S E++ + LK +F D +AP+ IH I G G G WFGP VA V+ L
Sbjct: 94 GASFEDSIVPSLKQATQHLFHDDPSAPFGIHAITNKGVQHGAPCGSWFGPTHVAVVMGAL 153
Query: 128 AK---YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN 184
+ + V D ++ ++V+K+ +K IP+ LG I+
Sbjct: 154 MEDYLRNGGQGPDVLVLRDRQVMEDEVRKILLLSKHVLLL---------IPVMLGPHHIS 204
Query: 185 PVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGY 244
Y +K+C R E T +G +GGK A +F+GY
Sbjct: 205 EGYAKLLKRCL-----------------------RMEST----VGAVGGKEGSAFFFMGY 237
Query: 245 VGNDVIFLDPHTNQN 259
G ++I LDPH Q+
Sbjct: 238 QGGNLIVLDPHYAQS 252
>gi|425784144|gb|EKV21938.1| Autophagy cysteine endopeptidase Atg4, putative [Penicillium
digitatum Pd1]
Length = 208
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 69/134 (51%), Gaps = 26/134 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F PI + G T+D GWGCM+R GQ
Sbjct: 71 DFESRIWITYRSNFTPIPRTKSPEAISSLTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQS 130
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A A L LGRDW+ KEE K++ MF D AP+SIH+ GA S GK GE
Sbjct: 131 LLANAFSVLLLGRDWR--RGEKEEEESKLISMFADHPEAPFSIHKFVNRGAESCGKYPGE 188
Query: 114 WFGPNTVAQVLRKL 127
WFGP+ A+ ++ +
Sbjct: 189 WFGPSATAKCIQSV 202
>gi|307190834|gb|EFN74684.1| Cysteine protease ATG4B [Camponotus floridanus]
Length = 93
Score = 99.0 bits (245), Expect = 2e-18, Method: Composition-based stats.
Identities = 43/59 (72%), Positives = 51/59 (86%), Gaps = 2/59 (3%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
++L+ IRRDI S LWFTYRKGF+PIG +S T+DKGWGCMLRCGQMV+AQAL+ LHLG
Sbjct: 34 KELDMIRRDIRSMLWFTYRKGFIPIGGCNSTFTSDKGWGCMLRCGQMVLAQALITLHLG 92
>gi|336368847|gb|EGN97189.1| cysteine protease required for autophagy [Serpula lacrymans var.
lacrymans S7.3]
Length = 873
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 112/221 (50%), Gaps = 46/221 (20%)
Query: 18 DITSRLWFTYRKGFVPI----------------------------------GDSGLTTDK 43
D TSR+W TYR F PI G+ G T+D
Sbjct: 301 DFTSRIWLTYRSQFYPIRDSTLSALESEMAVASQGPLPSSPQPKRWNWPVGGEKGWTSDA 360
Query: 44 GWGCMLRCGQMVIAQALLFLHLGR-DWQ---WNVNSKEEA-YLKILKMFEDRRT--APYS 96
GWGCMLR GQ ++A ALL LHLGR DW+ + V++ + A Y++I+ F D + +P+S
Sbjct: 361 GWGCMLRTGQSLLANALLHLHLGRADWRRPPYPVHTTDYATYVQIITWFFDTPSPQSPFS 420
Query: 97 IHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
+H++AL G GK VG+WFGP+T A ++ L + + VA D + + V
Sbjct: 421 VHRMALAGKDLGKDVGQWFGPSTAAGAIKTLVHAFPEAGLGVSVASDGVIFQSDVYAASN 480
Query: 157 T---NKRASSNPQW--QPLVLVIPLRLGIQDINPVYINGIK 192
+ R + W + ++++I +RLG+ +NP+Y + IK
Sbjct: 481 AYIGSPRRHAKVSWGGRAVIVLIGIRLGLDGVNPIYYDTIK 521
>gi|225554849|gb|EEH03143.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 425
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 53/131 (40%), Positives = 69/131 (52%), Gaps = 26/131 (19%)
Query: 18 DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
D S++W TYR F +P + G TTD GWGCM+R GQ
Sbjct: 127 DFESKIWLTYRSNFPLIPKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSGQS 186
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL L LGRDW+ KEE+ K+L +F D AP+SIH+ GAS GK GE
Sbjct: 187 LLANALAILSLGRDWRRGTKIKEES--KLLSLFADDPKAPFSIHRFVEHGASACGKYPGE 244
Query: 114 WFGPNTVAQVL 124
WFGP+ A+ +
Sbjct: 245 WFGPSATARCI 255
Score = 45.4 bits (106), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 37/69 (53%), Gaps = 3/69 (4%)
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STYHCPQASRLHI 288
I G+P+ + YFIG G+ +LDPH + VY D + +TYH + RLHI
Sbjct: 255 IHGRPSSSHYFIGAQGSHFFYLDPH-HTRPALVYRDAGDRPYTTEELNTYHTRRLRRLHI 313
Query: 289 LHMDPSIAV 297
MDPS+ +
Sbjct: 314 KDMDPSMLI 322
>gi|223590151|sp|A5DEF7.2|ATG4_PICGU RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|190345638|gb|EDK37561.2| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
6260]
Length = 402
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 131/333 (39%), Gaps = 103/333 (30%)
Query: 14 QIRRDITSRLWFTYRKGFVPI---------------------------------GDSGLT 40
++++ + R W +YR GF PI + T
Sbjct: 78 EVQKQVKKRYWMSYRSGFEPIKKHEDGPSPLSFVQSMIFNKNVGNTFANIHSLVDNDNFT 137
Query: 41 TDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI 100
TD GWGCM+R Q V+A A+ Y +++F D +A +S+H
Sbjct: 138 TDVGWGCMIRTSQSVLANAI---------------DRAGYEVDVELFADTSSAAFSLHNF 182
Query: 101 ALTGASEGKAV--GEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN 158
+ V G+WFGP+ + +++L + + S+ V L +C +
Sbjct: 183 VKVASDSPLRVRPGQWFGPSAASLSIKRLCEARNSSTNVPLSVL-----------VCESG 231
Query: 159 KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
Q P++L++PLRLGI +N VY + + + +P
Sbjct: 232 DIYDDQIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEVP-------------------- 271
Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTY 278
QS G+ GGKP+ +LYF GY G +++LDPH QN+ +Y
Sbjct: 272 -------QSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNVSAGV-----------GSY 313
Query: 279 HCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
H +L I MDPS I + + Y+D K
Sbjct: 314 HSSSYQKLDISDMDPSMMAGIVLKNNEDYTDLK 346
>gi|145481079|ref|XP_001426562.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124393637|emb|CAK59164.1| unnamed protein product [Paramecium tetraurelia]
Length = 391
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 94/332 (28%), Positives = 146/332 (43%), Gaps = 91/332 (27%)
Query: 13 EQIRRDITSR-LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
++I + I SR +WFTYRK F I +S T+D GWGCMLR GQM+ AQ +L +H+ + Q
Sbjct: 46 KEIIQQIYSRTIWFTYRKNFPQILNSQQTSDAGWGCMLRSGQMIWAQ-ILRVHIRQKKQ- 103
Query: 72 NVNSKEEAYLKIL------------KMFEDRRT---APYSIHQI-ALTGASEGKAVGEWF 115
+SK+ Y K+L KMF D +PYSI +I A++ +W+
Sbjct: 104 --HSKDYQY-KLLCAFSDDDDDEHKKMFTDNFKLCLSPYSIQKIEAISQIKFSMKPCQWY 160
Query: 116 GPNTVAQVLRKLAK-------------------YDDWSSIVFHVALDNTLVVNQVK---- 152
P+ + L L + YD S ++ + +D +VN++K
Sbjct: 161 RPDQILNALSLLHQQKQLEGSEDLEITISDSLLYDRLYSEMYGLKMDCEHIVNEIKQDKN 220
Query: 153 ----KLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKI 208
K+C ++ + L + R+G+ +IN Y LP + D++ +
Sbjct: 221 KEISKICNICQKKDP----KALAIFFITRIGLDEINKEY---------LPF--LNDLIDL 265
Query: 209 LSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ---NIGCVYD 265
PQ G+IGG+ + A Y +G V +I+LDPH Q N G V
Sbjct: 266 ----------------PQFQGIIGGRDDKAYYILGRVNKRLIYLDPHYIQEHINRGNVV- 308
Query: 266 KEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
L T+ C ++ M PSIA+
Sbjct: 309 -------MLKDTFFCKDVKYINEEQMSPSIAL 333
>gi|118390095|ref|XP_001028038.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89309808|gb|EAS07796.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 1216
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 131/301 (43%), Gaps = 80/301 (26%)
Query: 23 LWFTYRKGFVP-----IGD--SGLTTDKGWGCMLRCGQMVIAQA---------------L 60
+ FTYRK F P I D T+D GWGCM+R GQM+ AQ L
Sbjct: 263 ILFTYRKNFYPLLKDKINDPQKNQTSDAGWGCMIRAGQMIFAQTIKRHLKKTDYIEQHQL 322
Query: 61 LFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAV-GEWFGPNT 119
+ + +G + V + Y+ + + R PYSIHQI + K G+W+ PN
Sbjct: 323 INIIIGFLEEEEVQEGGKGYIFNQQSYIQDRIRPYSIHQITNRAFCKYKIQPGQWYTPNQ 382
Query: 120 VAQVLRKL-----------AKYDDWSS---IVFHVALDNTLVVNQ--VKKLCTTNKRASS 163
+A +L++L K D SS I+F L TL+ Q + C + S
Sbjct: 383 IAIILKELHKKNKIKGTENLKIDVHSSDKPIIFEKIL-QTLLGRQGKINLNCNHENQQSR 441
Query: 164 NPQWQPL-----VLVIPLRLGIQDINPVYING---------IKKCYA--------LP--- 198
N Q ++ P + I++ + Y K C+ LP
Sbjct: 442 NSINQDQDDSFEKIMPPNQQEIEEFSSQYEESKEDQTDNLCCKDCFKTDNKLFLLLPCRL 501
Query: 199 ----ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ISP++ ++IL + QS+G+IGGKPN A YF+G+VG+D+++LDP
Sbjct: 502 GLDEISPIH--IEILKKL---------LSLKQSVGMIGGKPNKAHYFLGFVGDDLLYLDP 550
Query: 255 H 255
H
Sbjct: 551 H 551
>gi|255722127|ref|XP_002545998.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240136487|gb|EER36040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 444
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 146/332 (43%), Gaps = 100/332 (30%)
Query: 19 ITSRLWFTYRKGFVPIGDS----------------------------------GLTTDKG 44
I SRLW +YR GF PI + T+D G
Sbjct: 82 IESRLWLSYRCGFDPIPKAEDGPQPIQFFPSIIFNKTTIYSNFANLKSLFDKENFTSDAG 141
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ-IALT 103
WGCM+R Q ++A LL L +SK++ ++ +F+D +++P+SIH I +
Sbjct: 142 WGCMIRTSQNLLANTLLQLLPP-------DSKQD----VIGLFQDNQSSPFSIHNFIKVA 190
Query: 104 GASEGKA-VGEWFGPNTVAQVLRKLAKYDDWSSI-------VFHVALDNTLVVNQVKKLC 155
G S + G+WFGPN + +++L I VF ++ ++ L ++ ++
Sbjct: 191 GESPLQVKPGQWFGPNAASLSIKRLTDTLQDKEIKGVKYPKVF-ISENSDLYDGEINEIL 249
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
+ R+ ++++ P+RLGI +N Y + I ++L S +
Sbjct: 250 SEEGRS--------VLVLFPIRLGIDKVNSYYYDSI--------------FQVLKSKF-- 285
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
S G+ GGKP+ + YF+GY +D+I+ DPH Q + + E
Sbjct: 286 -----------SCGISGGKPSSSFYFLGYDNSDLIYFDPHLPQLVENPINIE-------- 326
Query: 276 STYHCPQASRLHILHMDPSIAV-VSQRSYSDY 306
+YH +RL+I +DPS+ + + RS DY
Sbjct: 327 -SYHTRNYNRLNISLLDPSMMIGILLRSMDDY 357
>gi|410075557|ref|XP_003955361.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
gi|372461943|emb|CCF56226.1| hypothetical protein KAFR_0A07920 [Kazachstania africana CBS 2517]
Length = 463
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 126/314 (40%), Gaps = 79/314 (25%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SRL FTYR F PI G S L TD GWGCM
Sbjct: 64 DVNSRLAFTYRTKFQPILRSSEGPSPLNFRMIFRDNPINTLENVINNPDCFNTDIGWGCM 123
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LGR ++ + N +I+ F D P+SIH+ G
Sbjct: 124 IRTGQSLLGNALQLAKLGRHFRLD-NKMGIKDDEIISWFRDTTQEPFSIHKFVEKGNKLA 182
Query: 108 GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
K GEWFGP + ++ L + +D LV + + R
Sbjct: 183 NKKPGEWFGPAATSISIQSLIEE------FPECGIDKCLVSVSSGDIFEDDVREIFEENM 236
Query: 168 QPLVL-VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
+L ++ ++LG+ +N Y D++ IL S +
Sbjct: 237 DSKILFLMGVKLGLDAVNSFYWE--------------DILNILDSKF------------- 269
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI---GCVYDKEQDSEKKLDSTYHCPQA 283
S+G+ GG+P+ +LYF G+ GN++++ DPH Q VY+ T H
Sbjct: 270 SVGIAGGRPSSSLYFFGHQGNELLYFDPHRPQPSLVDPSVYE-----------TCHTTNF 318
Query: 284 SRLHILHMDPSIAV 297
+L I MDPS+ +
Sbjct: 319 GKLDIKDMDPSMLI 332
>gi|146420060|ref|XP_001485988.1| hypothetical protein PGUG_01658 [Meyerozyma guilliermondii ATCC
6260]
Length = 402
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 136/345 (39%), Gaps = 104/345 (30%)
Query: 2 RHANKLSHQDLEQIRRDITSRLWFTYRKGFVPI--------------------------- 34
R ++ DLE +++ + R W +YR GF PI
Sbjct: 67 REGDRDREGDLE-VQKQVKKRYWMSYRLGFEPIKKHEDGPLPLSFVQSMIFNKNVGNTFA 125
Query: 35 ------GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFE 88
+ TTD GWGCM+R Q V+A A+ Y +++F
Sbjct: 126 NIHSLVDNDNFTTDVGWGCMIRTSQSVLANAI---------------DRAGYEVDVELFA 170
Query: 89 DRRTAPYSIHQIALTGASEGKAV--GEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTL 146
D +A +S+H + V G+WFGP+ + +++L + + S+ V L
Sbjct: 171 DTSSAAFSLHNFVKVASDLPLRVRPGQWFGPSAASLSIKRLCEARNSSTNVPLSVL---- 226
Query: 147 VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMV 206
+C + Q P++L++PLRLGI +N VY + + + +P
Sbjct: 227 -------VCESGDIYDDQIQTFPVLLLLPLRLGIDHVNNVYHSSLLQLLEVP-------- 271
Query: 207 KILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK 266
QS G+ GGKP+ +LYF GY G +++LDPH QN+
Sbjct: 272 -------------------QSAGIAGGKPSSSLYFFGYQGTSLLYLDPHYPQNVSAGV-- 310
Query: 267 EQDSEKKLDSTYHCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
+YH +L I MDPS I + + Y+D K
Sbjct: 311 ---------GSYHSSLYQKLDISDMDPSMMAGIVLKNNEDYTDLK 346
>gi|241729578|ref|XP_002404604.1| cysteine protease, putative [Ixodes scapularis]
gi|215505492|gb|EEC14986.1| cysteine protease, putative [Ixodes scapularis]
Length = 433
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 86/182 (47%), Gaps = 41/182 (22%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
D +SRLWFTYR+ F I + + TD GWGCMLR QM++AQA + LGR W+W E
Sbjct: 69 DFSSRLWFTYRREFPAIPGTDIRTDCGWGCMLRSSQMILAQAFVMHLLGRQWRWQQVHTE 128
Query: 78 EAYLKI----------------------------------LKMFEDRRTA--PYSIHQIA 101
+++ + F D+ A P+S+H +
Sbjct: 129 AGEVRLPRHALWPLREGFRCTGGDGTAVLVRCSPKPVNDPPRWFGDKADASTPFSLHNLV 188
Query: 102 LTGASEGKAVGEWFGPNTVAQVLRKL---AKYDD--WSSIVFHVALDNTLVVNQVKKLCT 156
G GK G+W+GP++VA +L+ A + D + + +VA D T+ ++ V LC+
Sbjct: 189 QRGRESGKKAGDWYGPSSVAYILKDALEDAAHRDQRLAQLCIYVAQDCTIYMDDVTALCS 248
Query: 157 TN 158
Sbjct: 249 AG 250
>gi|19115683|ref|NP_594771.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe
972h-]
gi|62899818|sp|Q9P373.1|ATG4_SCHPO RecName: Full=Probable cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|9588465|emb|CAC00556.1| Atg8 deconjugator Atg4 (predicted) [Schizosaccharomyces pombe]
Length = 320
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 73/285 (25%), Positives = 117/285 (41%), Gaps = 60/285 (21%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+ D S + TYR G G +T+D GWGCM+R Q ++A L
Sbjct: 42 EKFLYDSFSLITITYRSGIE--GLENMTSDTGWGCMIRSTQTLLANCL-----------R 88
Query: 73 VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKA-VGEWFGPNTVAQVLRKLAKYD 131
+ E+ +IL +F D +AP+SIHQ G + G+WFGP T + +L+ +
Sbjct: 89 ICYPEKQLKEILALFADEPSAPFSIHQFVTMGKTLCDINPGQWFGPTTSCSCVARLSDQN 148
Query: 132 DWSSIVFHVALD-NTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
+ +VA + N + +Q+ K+ P++L+IP RLGI IN Y +
Sbjct: 149 PDVPLHVYVARNGNAIYRDQLSKVSF------------PVLLLIPTRLGIDSINESYYDQ 196
Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
+ + F +G+ GG+P A YF
Sbjct: 197 LLQV---------------------------FEIRSFVGITGGRPRSAHYFYARQNQYFF 229
Query: 251 FLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
+LDPH C + ++ + T+H R+ I +DP +
Sbjct: 230 YLDPH------CTHFAHTTTQPASEETFHSATLRRVAIQDLDPCM 268
>gi|116179672|ref|XP_001219685.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
gi|88184761|gb|EAQ92229.1| hypothetical protein CHGG_00464 [Chaetomium globosum CBS 148.51]
Length = 425
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 98/211 (46%), Gaps = 34/211 (16%)
Query: 18 DITSRLWFTYRKGFVPI----------------------GD-SGLTTDKGWGCMLRCGQM 54
D SR+W TYR GF PI GD +G ++D GWGCM+R GQ
Sbjct: 116 DFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRSGQS 175
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A ALL LGRDW+ + E I+ +F D APYS+ GA + GK GE
Sbjct: 176 LLANALLISQLGRDWRRTTDPGAER--NIVALFADDARAPYSLQNFVKHGAIACGKHPGE 233
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ LA + S ++ + V + L T + + P +++
Sbjct: 234 WFGPSATARCIQALADQHESSLRIYSTG--DLPDVYEDSFLATARPDGET---FHPTLIL 288
Query: 174 IPLRLGIQDINPV---YINGIKKCYALPISP 201
+ +GI P Y G+++ + + P
Sbjct: 289 MEQSIGIAGGRPSSSHYFVGVQRQWLFYLDP 319
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 41/73 (56%), Gaps = 2/73 (2%)
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQAS 284
QS+G+ GG+P+ + YF+G + +LDPH + + + + ++LDS H +
Sbjct: 291 QSIGIAGGRPSSSHYFVGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSC-HTRRLR 349
Query: 285 RLHILHMDPSIAV 297
LH+ MDPS+ +
Sbjct: 350 YLHVEDMDPSMLI 362
>gi|413917967|gb|AFW57899.1| hypothetical protein ZEAMMB73_419246 [Zea mays]
Length = 290
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/251 (28%), Positives = 113/251 (45%), Gaps = 49/251 (19%)
Query: 31 FVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK-EEAYLKILKMFED 89
F I G + G GCM+R QM++AQAL+F HLGR W+ Y+ +L++F D
Sbjct: 4 FWKISLPGYGSLTGKGCMVRSSQMLVAQALIFHHLGRSWRKPPEKPYNPDYIGVLRLFGD 63
Query: 90 RRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY-----------DDWSSIVF 138
+SIH + + G A G W GP + + + L + +++ ++
Sbjct: 64 SEACAFSIHNLLQARRNYGLAAGSWLGPYAMCRAWQTLIRTNREQADAVDGKENFPMALY 123
Query: 139 HVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
V+ D + ++ +LC+ + S W P++L++PL LG+ INP YI
Sbjct: 124 VVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPST--WSPILLLVPLVLGLDKINPRYIPL 181
Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
+K+ F FPQSLG++GGKP + Y G + +
Sbjct: 182 LKE---------------------------TFMFPQSLGILGGKPGTSTYIAGVQDDRAL 214
Query: 251 FLDPHTNQNIG 261
+LDPH Q G
Sbjct: 215 YLDPHEVQMFG 225
>gi|413941968|gb|AFW74617.1| hypothetical protein ZEAMMB73_836919 [Zea mays]
Length = 416
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 115/253 (45%), Gaps = 68/253 (26%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L+ D +SR+W TYRKGF I D LT+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 29 LQVFLEDFSSRIWITYRKGFDAISDFKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWR- 87
Query: 72 NVNSKEEAYLKILKMFED----RRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
E+ ++ + D + P ++ ++G +G+ G
Sbjct: 88 --KPPEKTLIRTNREQADAVDGKENFPMELY--VVSGDEDGERGG--------------- 128
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
+ +V+ ++ +LC+ + S W P++L++PL LG+ INP Y
Sbjct: 129 ------APVVY---------IDVAAQLCSDFNKGPST--WSPILLLVPLVLGLDKINPRY 171
Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
I +K+ F FPQSLG++G KP + Y G +
Sbjct: 172 IPLLKE---------------------------TFMFPQSLGILGVKPGTSTYIAGVQDD 204
Query: 248 DVIFLDPHTNQNI 260
++LDPH Q +
Sbjct: 205 RALYLDPHEVQMV 217
>gi|403216261|emb|CCK70758.1| hypothetical protein KNAG_0F00890 [Kazachstania naganishii CBS
8797]
Length = 448
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 133/316 (42%), Gaps = 77/316 (24%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSG-----------------------------LTTDKG 44
Q RD+ +RL FTYR FVPI S TD G
Sbjct: 42 QFYRDVCTRLNFTYRTKFVPISRSPDGPSPISFQLMIRDGPLSVIENALLHPDCFNTDIG 101
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
WGCM+R GQ ++ AL L GR+++ ++ ++ I++ F+D AP+S+H G
Sbjct: 102 WGCMIRTGQSLLGNALQRLRHGREFRVTESTHDD---DIIQWFKDTPDAPFSLHNFVKKG 158
Query: 105 ASEGK-AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA-- 161
G+WFGP ++ ++ L +D+ +V + +
Sbjct: 159 VELADMKPGQWFGPAATSRSIQSLI------CNFPQCGIDHCIVSVSSADIYKQDVEDMF 212
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++P L+++ ++LG+ +N Y I+ ++L+S +
Sbjct: 213 DADPDSN-LLILFGVKLGVSAVNASYWEDIR--------------RLLNSKF-------- 249
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
S+G+ GG+P+ +LYF GY ++++ DPHT Q + +T H
Sbjct: 250 -----SVGIAGGRPSSSLYFFGYQNQELLYFDPHTPQ--------PSLIDDAAFNTCHSI 296
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 297 EFGKLELRDMDPSMLI 312
>gi|256071263|ref|XP_002571960.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
gi|353229491|emb|CCD75662.1| family C54 unassigned peptidase (C54 family) [Schistosoma mansoni]
Length = 302
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 73/267 (27%), Positives = 133/267 (49%), Gaps = 35/267 (13%)
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAY-LKILKMFEDRRT--APYSIHQIALTG 104
M RCGQM++AQAL+ LGR+W+ N ++ + L+I+K F D + +P S+H+ L
Sbjct: 1 MFRCGQMLLAQALVVHFLGRNWRLTKNQRDSDFSLQIIKWFNDSWSPFSPLSLHR--LVQ 58
Query: 105 ASEGKAVGEWFGPNTV-AQVLRKLAKYDDWSS----IVFHVALDNTLVVNQVKKLCTTNK 159
S+ K GEW GP+++ + +LR +AK S + ++A D + ++ L +
Sbjct: 59 MSDRKP-GEWCGPSSICSAILRVMAKGSSLDSRLSQVQVYLARDRVIYREEIIDLA---R 114
Query: 160 RASSNPQWQPLVLVIPLRLGIQDINPVY---INGIKKCYALPISPVYDMVKILSSTYNMQ 216
++ Q+QP ++ D +Y + ++ + + ++ ++ N
Sbjct: 115 GLHTSYQYQP-------KIYFTDHTALYRSQSDQTNDSHSFKPTAILLLIPLMFGKGNRI 167
Query: 217 TPRY------EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDS 270
PRY F+ P +G+IGG+ H+ Y++G N +I+LDPH Q + +S
Sbjct: 168 NPRYIQVVLRLFSDPAFVGLIGGRRKHSSYYVGCQNNSLIYLDPHFTQPT-----QNLNS 222
Query: 271 EKKLDSTYHCPQASRLHILHMDPSIAV 297
K ++HCP + +++PS AV
Sbjct: 223 PKFSVDSWHCPIPKTMSAANLNPSCAV 249
>gi|414869447|tpg|DAA48004.1| TPA: hypothetical protein ZEAMMB73_510335 [Zea mays]
gi|414869466|tpg|DAA48023.1| TPA: hypothetical protein ZEAMMB73_786179 [Zea mays]
Length = 472
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 107/237 (45%), Gaps = 49/237 (20%)
Query: 31 FVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK-EEAYLKILKMFED 89
F I DS LT+D WGCM+R QM++AQAL+F HLGR + Y+ +L +F D
Sbjct: 34 FDAISDSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSCRKPPEKPYNPDYIGVLHLFGD 93
Query: 90 RRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY-----------DDWSSIVF 138
+SIH + G + G A G W GP + + + L +++ ++
Sbjct: 94 SEACAFSIHNLLQAGRNYGLAAGSWLGPYAMCRAWQTLIHTNREQADAVDGKENFPMALY 153
Query: 139 HVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
V+ D + ++ +LC+ + S W P++L++PL LG+ INP YI
Sbjct: 154 VVSGDEDGERGGAPVVYIDVAAQLCSDFNKGPST--WSPILLLVPLVLGLDKINPRYIPL 211
Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
+K+ F FPQSL ++GGKP + Y G + N
Sbjct: 212 LKE---------------------------TFMFPQSLCILGGKPGTSTYIAGVLAN 241
>gi|159465677|ref|XP_001691049.1| autophagy protein [Chlamydomonas reinhardtii]
gi|158279735|gb|EDP05495.1| autophagy protein [Chlamydomonas reinhardtii]
Length = 484
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/135 (39%), Positives = 72/135 (53%), Gaps = 10/135 (7%)
Query: 1 MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQ-- 58
+R ++L H LE + D SR+W TYRK F +G S LT+D GWGC LR GQM++A+
Sbjct: 33 LRKLSELMHA-LEAMLGDFRSRMWCTYRKDFPALGPSLLTSDVGWGCTLRSGQMLLAEVR 91
Query: 59 ------ALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVG 112
A++ + LGRDWQ + EA ++ D AP SIH+I G G G
Sbjct: 92 HGWRAGAMMRVALGRDWQ-RCSDNLEAVRPVVAALLDCAEAPLSIHRICDAGGPAGIVPG 150
Query: 113 EWFGPNTVAQVLRKL 127
W GP + + L L
Sbjct: 151 RWLGPWMLCKGLEAL 165
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 49/120 (40%), Gaps = 43/120 (35%)
Query: 179 GIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHA 238
G+ INPVYI +++ ++PQS+G++GG+P+ +
Sbjct: 339 GMDKINPVYIPQLQQV---------------------------LSWPQSVGIVGGRPSAS 371
Query: 239 LYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
LY G I+LDPH Q +G TY C L +DPS+A+
Sbjct: 372 LYVCGVQDASFIYLDPHEAQLALG---------------TYFCDVVRVLPSAQLDPSLAI 416
>gi|307201261|gb|EFN81130.1| Cysteine protease ATG4B [Harpegnathos saltator]
Length = 98
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 43/59 (72%), Positives = 51/59 (86%), Gaps = 2/59 (3%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
++L+ IRRDI S LWFTYRKGFVPIG +S T+DKGWGCMLRCGQMV+A+AL+ LHLG
Sbjct: 34 KELDIIRRDIRSILWFTYRKGFVPIGGCNSTFTSDKGWGCMLRCGQMVLARALITLHLG 92
>gi|298712912|emb|CBJ33424.1| Autophagy-related protein 4 [Ectocarpus siliculosus]
Length = 546
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/123 (39%), Positives = 66/123 (53%), Gaps = 6/123 (4%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+ R D+ S +W TYR GF + G T D GWGCMLR QM++ QAL LGR W+
Sbjct: 50 EERRLDVFSTMWITYRSGFPKMEPYGYTDDSGWGCMLRSAQMLMTQALQRHTLGRSWRVP 109
Query: 73 VNSKEE----AYLKILKMFEDRRTAP--YSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
+E Y ++++F D +SIH + G K GEW+GP T A VLR
Sbjct: 110 RTLEERLRVPEYRTLVRLFADHPGEANLFSIHNMCQVGIRYDKLPGEWYGPTTAACVLRD 169
Query: 127 LAK 129
+++
Sbjct: 170 ISE 172
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 65/129 (50%), Gaps = 29/129 (22%)
Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
+VL++PLRLG+ +++ YI + ++T R PQSLG
Sbjct: 380 VVLLVPLRLGLDELSTGYIPSL-----------------------LETLR----VPQSLG 412
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
+GG+PNHA++FIG GN + LDPHT Q + E ++ + HC A + +
Sbjct: 413 FLGGRPNHAIFFIGAQGNTLTGLDPHTTQPAADM--GEGFPSERYVHSLHCQSAVSMDVH 470
Query: 290 HMDPSIAVV 298
+DPS+A+
Sbjct: 471 RIDPSLALA 479
>gi|145526665|ref|XP_001449138.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416715|emb|CAK81741.1| unnamed protein product [Paramecium tetraurelia]
Length = 406
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 135/327 (41%), Gaps = 78/327 (23%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
+ +I++ + +W TYR+ F P+ S +D GWGCMLR GQM +AQ +L HL
Sbjct: 56 INKIKQLVQDTIWITYRRNFPPLYQSNYISDTGWGCMLRVGQMAMAQ-MLKKHLKN---- 110
Query: 72 NVNSKEEAYLKILKMFEDRRT----------------------APYSIHQIALTGASE-G 108
+ + ++E Y IL F D + P+SI +IA E
Sbjct: 111 HGDKRDEDYDNILLAFADNDSQECKEFIEFQNKKEKQKVHNFICPFSIQKIAYLAKKEFN 170
Query: 109 KAVGEWFGPNTVAQVLR--------------KLAKYDDWSSIVFHVALDNTL--VVNQVK 152
GEW+ PN + +L KL+ ++D S +F L N + + +
Sbjct: 171 LDPGEWYKPNYILFLLEELHNTIPIRASENLKLSVFND--SCLFLDQLMNRMFDIKFETD 228
Query: 153 KLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
K + L + + R+G+ + N Y+ K+L
Sbjct: 229 KDLEEQLEKTQLKSKNSLAIFVLTRIGLDEPNQKYL------------------KVLDEL 270
Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEK 272
M+ P ++ G++GG P A Y +G + + I+LDPH Q +K Q E
Sbjct: 271 --MELPYFQ-------GIVGGTPKRAFYILGRINDHYIYLDPHYVQE---AENKGQIIEN 318
Query: 273 KL--DSTYHCPQASRLHILHMDPSIAV 297
K+ ++Y C L+ H+D S+ +
Sbjct: 319 KMFNRTSYSCKYIHLLNQKHVDTSMGL 345
>gi|146161894|ref|XP_001008187.2| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|146146576|gb|EAR87942.2| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 516
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/355 (23%), Positives = 151/355 (42%), Gaps = 66/355 (18%)
Query: 1 MRHANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGD------------SGLTTDKGWGCM 48
M + ++ +++ + + +W TYRK F + + S +D GWGCM
Sbjct: 55 MNENKETYEKNYKEVLENFYNIIWITYRKNFPALLNMIDKANLKNQKMSEYISDTGWGCM 114
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRT----APYSIHQIALTG 104
+R GQM A+ L HL + + V KE+ + I +D + APYSI +I+
Sbjct: 115 VRVGQMAFAEGLR-RHLVENKKLVVKKKEDLRVIIEGFLDDDQKCIDFAPYSIQKISKIA 173
Query: 105 ASEGKAV-GEWFGPNTVAQVL------RKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTT 157
S+ + GEW+ P + +L RK K + + + + + ++++C
Sbjct: 174 LSDFNLLPGEWYTPIRICYILGLLHNERKAIKGTEDLKVAVFSSSRPIVFQDFLERMCKV 233
Query: 158 NKRASSNPQWQPLVLVI---------------PLRLGIQDINPVYINGIKKCYALP-ISP 201
+ + + Q P I ++L Q+ N + ++ L + P
Sbjct: 234 DPQRGKHAQICPNQCRIIKQDQKSKVDHDHHKDIKLEKQNSNSEILVVSEETPKLRLVCP 293
Query: 202 V-----YDMVKILSSTYNMQTPRYEF--------TFPQSLGVIGGKPNHALYFIGYVGND 248
+ Y M+ + + TP+ E+ F SLG+IGGKP ALYF+G + ++
Sbjct: 294 IHHELQYSMIVYIVCLIGLDTPQPEYLELAKKMMDFKYSLGLIGGKPKKALYFVGRIEDE 353
Query: 249 VIFLDPHTNQNIGCVYDKEQDSEKKLDS-----TYHCPQASRLHILHMDPSIAVV 298
I+LDPH Y +E +EK S TY C + ++D S +++
Sbjct: 354 FIYLDPH--------YVQEFSNEKNFQSSSQLETYFCKKFQTYPSKNIDSSFSLM 400
>gi|145500634|ref|XP_001436300.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403439|emb|CAK68903.1| unnamed protein product [Paramecium tetraurelia]
Length = 406
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 140/321 (43%), Gaps = 66/321 (20%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL-----G 66
+ +I++ + +W TYR+ + P+ S +D GWGCMLR GQM +AQ +L HL
Sbjct: 56 INKIKQLVQETIWITYRRNYPPLYQSNYISDTGWGCMLRVGQMAMAQ-MLKKHLKNHGDK 114
Query: 67 RDWQW--------NVNSKEEAYLKILKMFEDRRTA-----PYSIHQIALTGASE-GKAVG 112
RD + + +S+E + +D++ A P+SI +IA E G
Sbjct: 115 RDEDYDNIILAFADNDSQENKEFIEFQNSKDKQKAHNFICPFSIQKIAYLAKKEFNLDPG 174
Query: 113 EWFGPNTV---AQVLRKLAKYDDWSSIVFHVALDNTLVVNQV-KKLCTTNKRASSNPQWQ 168
EW+ PN + ++L ++ V D+ L ++Q+ ++ + + Q
Sbjct: 175 EWYRPNYILFLLELLHNTIPIRASENLKLSVFNDSCLFLDQLMNRMFEAKFETDKDLEEQ 234
Query: 169 ----------PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
L + + R+G+ + N Y+ KIL M+ P
Sbjct: 235 LEKTQLIGKNSLAIFVLTRIGLDEPNQKYL------------------KILDEI--MELP 274
Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL--DS 276
++ G++GG P A Y +G + + ++LDPH Q +K+Q +E K+ +
Sbjct: 275 YFQ-------GIVGGTPKRAFYILGKINDHYLYLDPHYVQE---AENKDQINENKMFNRT 324
Query: 277 TYHCPQASRLHILHMDPSIAV 297
+Y C L+ H+D S+ +
Sbjct: 325 SYSCKNIHLLNQKHVDTSMGL 345
>gi|367008068|ref|XP_003688763.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
gi|357527073|emb|CCE66329.1| hypothetical protein TPHA_0P01710 [Tetrapisispora phaffii CBS 4417]
Length = 356
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 119/265 (44%), Gaps = 54/265 (20%)
Query: 39 LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIH 98
T+D GWGCM+R GQ ++A AL + G +I+++F D P+SIH
Sbjct: 84 FTSDIGWGCMIRTGQTLLANALQRTNKGTPCS-----------EIIELFVDETKNPFSIH 132
Query: 99 QIALTGASEGKA-VGEWFGPNTVAQVLRKLAKYDDWSSI---VFHVALDNTLVVNQVKKL 154
G VGEWF P+ Q++ KL + ++ I + ++ + + + +L
Sbjct: 133 NFITVGKDLNLVKVGEWFSPSITIQIIEKLIENNNDHGIKKCIVSISSGDIYEQDVLDEL 192
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
+ +N + Q ++L+ ++LGI IN I+K Y D+ I ++ Y
Sbjct: 193 --DDSEPPANTKQQHILLLFGIKLGINTIN------IEK-YG------QDIKDITNNKY- 236
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGY--VGNDVIFLDPHTNQNIGCVYDKEQDSEK 272
+ G+ GG+P +L+F GY + +++ DPH N D
Sbjct: 237 ------------TCGISGGQPKSSLFFFGYNNTHDRILYFDPHKPNNFTTDNDY------ 278
Query: 273 KLDSTYHCPQASRLHILHMDPSIAV 297
STYH + + L + ++DPS+ +
Sbjct: 279 ---STYHSTEFNELEMFNLDPSMII 300
>gi|402593880|gb|EJW87807.1| hypothetical protein WUBG_01286, partial [Wuchereria bancrofti]
Length = 216
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 72/143 (50%), Gaps = 38/143 (26%)
Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
+W+PL+++IPLRLG+ IN Y I+ + LP
Sbjct: 28 EWRPLLIIIPLRLGLNTINRCYFPAIQAFFELP--------------------------- 60
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI------GCVYDKEQD-----SEKKL 274
Q +G+IGG+PNHALYF G V N++++LDPH QN D+ D +++
Sbjct: 61 QCVGIIGGRPNHALYFCGIVDNNLLYLDPHFCQNFVDLDETTTTRDERDDYVEIKNDEFK 120
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
DSTYHCP I +DPS+A+
Sbjct: 121 DSTYHCPFILSTKIDKVDPSLAL 143
>gi|281210274|gb|EFA84441.1| autophagy protein 4 [Polysphondylium pallidum PN500]
Length = 734
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 92/183 (50%), Gaps = 25/183 (13%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
++Q++++ D + LWF+YRK F PI ++ +TTD GWGCM+R GQM++A+ALL
Sbjct: 269 ANQEIDRFIADFKNILWFSYRKDFAPIENTNITTDIGWGCMVRTGQMLLARALLRHLYQN 328
Query: 68 DWQWNVNSKEEA--YLKILKMFED--RRTAPYSIHQIALTGASEGK-------------- 109
+ V+ + Y K++ F D R YSIHQI K
Sbjct: 329 ENIPEVDRTRPSSKYRKVMNWFCDLPTREHYYSIHQIVHKNKIIAKYHNSKLKDFDIETD 388
Query: 110 ------AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASS 163
V EWF P ++ VL+ L K S I +V D + + V+KLC T++R S
Sbjct: 389 ENIDLLNVDEWFAPTKISVVLKHLLKSHGLSDITMYVPSDGVVYKDYVRKLC-TDERLSF 447
Query: 164 NPQ 166
+P+
Sbjct: 448 DPE 450
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 68/144 (47%), Gaps = 34/144 (23%)
Query: 155 CTTNKRASSNPQ-WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTY 213
C+ +S PQ W+ +++++P++LG+ +N VY IK LP
Sbjct: 527 CSDFFSSSCIPQRWKSIIILVPIKLGLDKLNEVYFREIKSMLELP--------------- 571
Query: 214 NMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKK 273
QS+G+IGGKP + YF+GY +I+LDPH V+D ++
Sbjct: 572 ------------QSIGLIGGKPKQSFYFVGYQDEHIIYLDPHF------VHDTVSPNDIN 613
Query: 274 LDSTYHCPQASRLHILHMDPSIAV 297
+YH ++ I +DPS+A+
Sbjct: 614 FSDSYHHCVPQKMLISQLDPSMAI 637
>gi|322701885|gb|EFY93633.1| cysteine protease atg4 [Metarhizium acridum CQMa 102]
Length = 255
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/132 (36%), Positives = 64/132 (48%), Gaps = 26/132 (19%)
Query: 18 DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
D SR W TYR F +P + SG T+D GWGCM+R GQ
Sbjct: 126 DFASRFWMTYRSNFELIPKSTDPKAASALSLSMRIRSQLVDQSGFTSDSGWGCMIRSGQS 185
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL L LGRDW+ + E ++L +F D APYS+H G K GE
Sbjct: 186 LLANALAVLDLGRDWRRGMLPDRER--RLLALFADDPRAPYSVHNFVRHGEKYCSKYPGE 243
Query: 114 WFGPNTVAQVLR 125
WFGP+ A+ ++
Sbjct: 244 WFGPSATARCIQ 255
>gi|118349810|ref|XP_001008186.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89289953|gb|EAR87941.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 343
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/261 (31%), Positives = 122/261 (46%), Gaps = 35/261 (13%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD--- 68
L QI+ + ++F+YR GF + + +D GWGCMLR GQM+ A LL HL +
Sbjct: 16 LSQIKEAQHNLIYFSYRSGFSHQFQNHIFSDSGWGCMLRSGQMIFANGLL-RHLKENPQI 74
Query: 69 -WQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGK-AVGEWFGPNTVAQVLRK 126
Q + + + L I+K F + + P+SI QIA E K +G W+ PN +A L+K
Sbjct: 75 QNQLKIQNINDILLFIIKFFIENKDQPFSIQQIAAVALEEFKLEMGFWYSPNRIAYSLKK 134
Query: 127 LA-KYDDWS--SIVFHVAL-DNTLVVNQVKKLCTTNKRASSNPQ--WQPLVLVIPLRLGI 180
L + +S +IV V D L +Q T K S+ P+ Q L+ I ++ I
Sbjct: 135 LLNNFQTFSEMNIVSEVMYSDRPLYFSQCVTAMTGQKIDSTLPKQLLQILINNIEKQIKI 194
Query: 181 --QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHA 238
Q+ N IN K+ Y + I Y K L + FT S+G+IG
Sbjct: 195 MKQNSNKYQIN--KQNYKILIGLDYPEEKYLDILIKL------FTHRLSIGMIG------ 240
Query: 239 LYFIGYVGND-VIFLDPHTNQ 258
+ ND + +LDPH Q
Sbjct: 241 ------LNNDKLTYLDPHIVQ 255
>gi|238594668|ref|XP_002393548.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
gi|215461192|gb|EEB94478.1| hypothetical protein MPER_06700 [Moniliophthora perniciosa FA553]
Length = 142
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 68/146 (46%), Gaps = 51/146 (34%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDS--------------------------GLTTDKGWGC 47
+ D TS++W TYR F PI D+ G T+D GWGC
Sbjct: 16 EFTADFTSKVWLTYRSHFTPIRDTNLADLPLPSIFWKKWGWGLPGLGGERGWTSDSGWGC 75
Query: 48 MLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPY------SIHQIA 101
MLR GQ ++A AL+F+ LGR+W+ R AP S+H++A
Sbjct: 76 MLRTGQSLLANALVFMWLGREWR-------------------RPPAPMPTESYASVHRMA 116
Query: 102 LTGASEGKAVGEWFGPNTVAQVLRKL 127
L G GK VG+WFGP+T A ++ L
Sbjct: 117 LAGKELGKDVGQWFGPSTAAGAIKTL 142
>gi|403354729|gb|EJY76927.1| hypothetical protein OXYTRI_01553 [Oxytricha trifallax]
Length = 564
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 85/330 (25%), Positives = 133/330 (40%), Gaps = 109/330 (33%)
Query: 38 GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDR----RTA 93
LTTD WGC +R QM+IA AL + + VNS ILK+F+D +
Sbjct: 213 NLTTDCNWGCTIRSAQMMIANAL----QQSTFMYPVNS-------ILKLFDDNIRECTES 261
Query: 94 PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKYDDWSS----------IVFHVAL 142
+SI IA+ G G+ G+W+G +++ +L+ L Y +S IVF +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGFIVFETIM 321
Query: 143 ----------------DNTLVVNQVKKL---------------------CTT-------- 157
+++V+NQ + C
Sbjct: 322 KKGCQLVNEKQDQQLQKDSIVLNQKDQSEYDPQNRENYDDLTFSQMGLGCDRRINYDKLP 381
Query: 158 NKRASSNP----QWQPLVLVI-PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
N NP +W+ VLVI +RLG+Q I+P+Y I K
Sbjct: 382 NMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKY------------------ 423
Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND------VIFLDPHTNQNIGCVYDK 266
MQ P++ +G++GGKPN A YF G++ + ++FLDPH Q+ +
Sbjct: 424 --MQMPQF-------VGLVGGKPNKAFYFFGHIIDQDTNKVKLMFLDPHKVQDYTYNVET 474
Query: 267 EQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
D + K + +H +A L I +D +
Sbjct: 475 SYDLDVKEQAKFHTTEARLLKIKELDTCLG 504
>gi|213403524|ref|XP_002172534.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
gi|212000581|gb|EEB06241.1| peptidase family C54 [Schizosaccharomyces japonicus yFS275]
Length = 314
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 118/300 (39%), Gaps = 62/300 (20%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E +D L TYRK G ++D GWGCM+R Q ++A L R Q +
Sbjct: 42 EAFVQDTYDLLSLTYRKCIA--GMECFSSDAGWGCMIRSMQTMLANCL------RRVQPS 93
Query: 73 VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAV-GEWFGPNTVAQVLRKLAKYD 131
+ KIL F D A S+HQ G + G WFGP TV+ L
Sbjct: 94 LPVH-----KILHYFADEANAYLSLHQFVDAGHTLCNITPGNWFGPATVSHCAAHLCSTH 148
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
+ V+ D ++ + + P P +L+ LRLGI I+ Y +
Sbjct: 149 PQVGLNVCVSHDGAIMYR---------DQLRNTPY--PRLLLFTLRLGIDTIHTSYYEQL 197
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
C+ L T PQ++G++GG+P A YF +
Sbjct: 198 --CHVL-------------------------TIPQAIGIVGGRPRAAHYFYACQSQWFFY 230
Query: 252 LDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI----AVVSQRSYSDYK 307
LDPHT Q +D +S++H RL I +DP + A+ S+ +D++
Sbjct: 231 LDPHTTQTAH-TFDNPAP-----NSSFHVTTLRRLRINELDPCMVLGFAITSEECQTDFE 284
>gi|403370248|gb|EJY84987.1| hypothetical protein OXYTRI_17161 [Oxytricha trifallax]
Length = 564
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 85/330 (25%), Positives = 133/330 (40%), Gaps = 109/330 (33%)
Query: 38 GLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDR----RTA 93
LTTD WGC +R QM+IA AL + + VNS ILK+F+D +
Sbjct: 213 NLTTDCNWGCTIRSAQMMIANAL----QQSTFMYPVNS-------ILKLFDDNIRECTES 261
Query: 94 PYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL-AKYDDWSS----------IVFHVAL 142
+SI IA+ G G+ G+W+G +++ +L+ L Y +S IVF +
Sbjct: 262 AFSIQNIAIQGLQIGRFPGDWYGVSSITTILQSLNDNYKPFSQFEICTFQDGYIVFETIM 321
Query: 143 ----------------DNTLVVNQVKKL---------------------CTT-------- 157
+++V+NQ + C
Sbjct: 322 KKGCQLVNEKQDQQLQKDSIVLNQKDQSEYDPQNRENYDDLTFSQMGLGCDRRINYDKLP 381
Query: 158 NKRASSNP----QWQPLVLVI-PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSST 212
N NP +W+ VLVI +RLG+Q I+P+Y I K
Sbjct: 382 NMDQDQNPFNNQEWKNEVLVIVNVRLGLQKIDPIYHQIIVKY------------------ 423
Query: 213 YNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN------DVIFLDPHTNQNIGCVYDK 266
MQ P++ +G++GGKPN A YF G++ + ++FLDPH Q+ +
Sbjct: 424 --MQMPQF-------VGLVGGKPNKAFYFFGHIIDLDTNKVKLMFLDPHKVQDYTYDVET 474
Query: 267 EQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
D + K + +H +A L I +D +
Sbjct: 475 SYDLDVKEQAKFHTTEARLLKIKELDTCLG 504
>gi|123397031|ref|XP_001301012.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121882136|gb|EAX88082.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 297
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/249 (26%), Positives = 111/249 (44%), Gaps = 46/249 (18%)
Query: 9 HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH-LGR 67
D E++++ + + FTY KGF P+ G TTDK WGC +R GQ ++ Q + L+ L
Sbjct: 11 QSDTEKLKKVVDTIPRFTYHKGFSPLA-GGYTTDKNWGCCIRSGQGLLMQFVSKLYQLYG 69
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
D N+ + ++F D AP+ IH I + G GEW P+ +A V + L
Sbjct: 70 DKIKNIFPNGSKF----ELFFDHPQAPFGIHCICRELETFGVKAGEWVKPSMLAPVFKDL 125
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW-QPLVLVIPLRLGIQDINPV 186
+ HV + + C + + + P++L+ L LG +D +
Sbjct: 126 LSF-----FGIHVVI--------AENGCLSRESLREALSYGHPVLLLFTLMLGYKDFDLK 172
Query: 187 YINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG 246
Y+ ++ L +S +Y QS+GV+GG+ A Y +G+
Sbjct: 173 YLPFLR----LTLSLIY----------------------QSVGVVGGQQGKAYYLVGHQK 206
Query: 247 NDVIFLDPH 255
++++ DPH
Sbjct: 207 ENLLYFDPH 215
>gi|302915349|ref|XP_003051485.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256732424|gb|EEU45772.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 355
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 87/193 (45%), Gaps = 13/193 (6%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSG---LTTDKGWGCMLRC--GQMVIAQALLFLHLGRD 68
Q D SR+W TYR F I S T+ L+ G + + LGRD
Sbjct: 113 QFMADFESRIWMTYRSEFEAIPRSTNPQATSSLSLSMRLKSQLGDQSPFSSDSMIRLGRD 172
Query: 69 WQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKL 127
W+ + EE +I+K+F D APYS+H GAS GK GEWFGP+ A+ ++ L
Sbjct: 173 WRRGQSPHEE--REIIKLFADHPNAPYSLHSFVRHGASACGKYPGEWFGPSATARCIQAL 230
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
A + S V+ + ++ K+ A + P ++++ RLGI I PVY
Sbjct: 231 ANSHESSLRVYSTGDGPDVYEDEFMKIAKPEGEA-----FHPTLILVGTRLGIDKITPVY 285
Query: 188 INGIKKCYALPIS 200
+ +P S
Sbjct: 286 WEALIASLQMPQS 298
>gi|213514936|ref|NP_001135074.1| Cysteine protease ATG4A [Salmo salar]
gi|209738482|gb|ACI70110.1| Cysteine protease ATG4A [Salmo salar]
Length = 102
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/57 (61%), Positives = 45/57 (78%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++ D+ SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+ LGR W+
Sbjct: 44 ELLSDVHSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMILAQALVCSQLGRAWR 100
>gi|255711728|ref|XP_002552147.1| KLTH0B08272p [Lachancea thermotolerans]
gi|238933525|emb|CAR21709.1| KLTH0B08272p [Lachancea thermotolerans CBS 6340]
Length = 483
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 80/313 (25%), Positives = 132/313 (42%), Gaps = 83/313 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
++ S L FTYR F PI G S + +D GWGCM
Sbjct: 94 EVHSLLHFTYRTKFEPIPKDPNGPSPMNFGTLFRDNPLNSFESAINHPDCFCSDIGWGCM 153
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG-ASE 107
+R GQ ++ AL L + EE +++ FEDR +AP+S+H G A
Sbjct: 154 IRTGQALLGNALARLR---------SPPEEK--QLIGWFEDRSSAPFSLHNFVREGNALS 202
Query: 108 GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW 167
K GEWFGP+ ++ ++ L L++ ++ +T+
Sbjct: 203 RKPPGEWFGPSATSRSIQSLVH------AFPQCGLNHCII--------STDSGDVYEEDV 248
Query: 168 QPLVLVIP-LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
P++ P + + + +N + Y D+ IL S++
Sbjct: 249 GPILEREPQATILLLLGVKLGLNNVNSRYWP------DVKHILGSSF------------- 289
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ--NIGCVYDKEQDSEKKLDSTYHCPQAS 284
S+G+ GG+P+ +LYF GY G+ + +LDPHT+Q C D E K +S H + +
Sbjct: 290 SVGIAGGRPSSSLYFFGYQGDYLFYLDPHTSQLDLASCATDNE-----KYESV-HSARFN 343
Query: 285 RLHILHMDPSIAV 297
++H +DPS+ +
Sbjct: 344 KVHFSELDPSMLI 356
>gi|281208441|gb|EFA82617.1| hypothetical protein PPL_04309 [Polysphondylium pallidum PN500]
Length = 646
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 73/142 (51%), Gaps = 22/142 (15%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
+ + D ++++W +YR+GF IGD+ D GWG + GQ
Sbjct: 409 INEFLEDFSNKIWMSYRQGFPYIGDTMFENDCGWGYWKKSGQ------------------ 450
Query: 72 NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLA-K 129
N E I++MF D+ TAP+SIH IAL G + GK VGEWF P+ + ++ L K
Sbjct: 451 --NEYPELLYNIVRMFLDKPTAPFSIHNIALHGQNHLGKNVGEWFAPSNITHAIKSLVNK 508
Query: 130 YDDWSSIVFHVALDNTLVVNQV 151
++ +I ++ D +L V+Q+
Sbjct: 509 FNLQCNISVVISEDGSLYVDQM 530
>gi|123407417|ref|XP_001303004.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121884346|gb|EAX90074.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 298
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 117/256 (45%), Gaps = 53/256 (20%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQAL--LFLHLGRD 68
D E+ R+ + + FTY K F P+ G TTDK WGC +R Q +I Q + L+ HLG D
Sbjct: 13 DTEKQRKLLETIPRFTYHKNFAPL-QGGFTTDKNWGCCIRSAQGLIMQFITKLYKHLGDD 71
Query: 69 WQ--WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK 126
+ + NSK E +F D +P+ + I S G GEW P+ +A V+++
Sbjct: 72 IRNIFPTNSKYE-------LFYDLPHSPFGLPHICAELQSYGVMPGEWVKPSLLAPVIKE 124
Query: 127 LAKYDDWSSIVF-HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINP 185
+ + ++ H L ++ N+ S N P++L+ L LG ++
Sbjct: 125 IMNFFRIPVVIAEHGCLSREVL----------NEALSHN---IPVLLLFTLMLGYENFEL 171
Query: 186 VYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYV 245
Y+ +K L +S +Y QS+GV+GG+ A + +G+
Sbjct: 172 KYLPFLK----LTLSLIY----------------------QSVGVVGGQQGKAYFIVGHQ 205
Query: 246 GNDVIFLDPH-TNQNI 260
+++ DPH N++I
Sbjct: 206 KEKLLYFDPHDVNESI 221
>gi|28395487|gb|AAO39081.1| autophagy protein 4 [Dictyostelium discoideum]
Length = 745
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 78/182 (42%), Gaps = 42/182 (23%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL-FLHLGRDWQWNVNSK 76
D+ S +WF+YRK F PI ++ +TTD GWGCMLR GQM++A+AL+ L+ D + K
Sbjct: 233 DVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEIERK 292
Query: 77 E--EAYLKILKMFED--RRTAPYSIHQIALTGASEGK----------------------- 109
+ Y ++L F D + Y IHQI + K
Sbjct: 293 KPHSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQAMEKNNRKQQILREQVISLNRGGGGSS 352
Query: 110 --------------AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
V EW P ++ +LR+L K+ + +V D + + + LC
Sbjct: 353 KGKKKKEKEEEINDNVEEWLAPTRISNILRQLIKFQHLEDLEMYVPTDGVIYKDYINNLC 412
Query: 156 TT 157
Sbjct: 413 NN 414
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 63/139 (45%), Gaps = 37/139 (26%)
Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
+S P+W+ L+++IPL+LG +N YI + +
Sbjct: 497 SSIPPKWKSLIIMIPLKLGADKLNSTYI---------------------------EKLKL 529
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STY 278
PQSLG IGGKP + YFIG+ + VI+LDPH + +E + D +TY
Sbjct: 530 LLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLDPH--------FVQESVNPNSFDYSNTY 581
Query: 279 HCPQASRLHILHMDPSIAV 297
++ +DPS+++
Sbjct: 582 SGCIPQKMPFTQLDPSLSI 600
>gi|66822477|ref|XP_644593.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|66822607|ref|XP_644658.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|74857708|sp|Q557H7.1|ATG4_DICDI RecName: Full=Cysteine protease atg4; AltName:
Full=Autophagy-related protein 4
gi|60472726|gb|EAL70676.1| autophagy protein 4 [Dictyostelium discoideum AX4]
gi|60472781|gb|EAL70731.1| autophagy protein 4 [Dictyostelium discoideum AX4]
Length = 745
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 79/183 (43%), Gaps = 42/183 (22%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL-FLHLGRDWQWNVNSK 76
D+ S +WF+YRK F PI ++ +TTD GWGCMLR GQM++A+AL+ L+ D + K
Sbjct: 233 DVASMIWFSYRKDFPPIENTNITTDIGWGCMLRTGQMILARALIKHLYKENDMVPEIERK 292
Query: 77 E--EAYLKILKMFED--RRTAPYSIHQIALTGASEGK----------------------- 109
+ Y ++L F D + Y IHQI + K
Sbjct: 293 KPHSNYSQVLAWFSDYPSKEHVYGIHQIVNKKQAMEKNNRKQQILREQVISLNRGGGGSS 352
Query: 110 --------------AVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC 155
V EW P ++ +LR+L K+ + +V D + + + LC
Sbjct: 353 KGKKKKEKEEEINDNVEEWLAPTRISNILRQLIKFQHLEDLEMYVPTDGVIYKDYINNLC 412
Query: 156 TTN 158
+
Sbjct: 413 NNS 415
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 63/139 (45%), Gaps = 37/139 (26%)
Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
+S P+W+ L+++IPL+LG +N YI + +
Sbjct: 497 SSIPPKWKSLIIMIPLKLGADKLNSTYI---------------------------EKLKL 529
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STY 278
PQSLG IGGKP + YFIG+ + VI+LDPH + +E + D +TY
Sbjct: 530 LLKLPQSLGFIGGKPKQSFYFIGFQDDQVIYLDPH--------FVQESVNPNSFDYSNTY 581
Query: 279 HCPQASRLHILHMDPSIAV 297
++ +DPS+++
Sbjct: 582 SGCIPQKMPFTQLDPSLSI 600
>gi|395750455|ref|XP_002828707.2| PREDICTED: cysteine protease ATG4D [Pongo abelii]
Length = 296
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 93/222 (41%), Gaps = 54/222 (24%)
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSS 135
+E + +I+ F D AP+ +H++ G S GK G+W+GP+ VA +LRK
Sbjct: 50 QERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGPSLVAHILRK--------- 100
Query: 136 IVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCY 195
+ T +V V + CT V+ +R
Sbjct: 101 -AVESCSEVTRLVVYVSQDCT----------------VLHMR------------------ 125
Query: 196 ALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
+L I P D L S+ + R E LG++GGKP H+LYFIGY + +++LDPH
Sbjct: 126 SLAIDPSKDRSTCLPSSLQ-ELLRCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPH 180
Query: 256 TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
Q V E ++HC ++ MDPS V
Sbjct: 181 YCQPTVDVSQANFPLE-----SFHCTSPRKMAFAKMDPSCTV 217
>gi|37991904|gb|AAR06350.1| putative autophagy, 3'-partial [Oryza sativa Japonica Group]
Length = 207
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 38/74 (51%), Positives = 48/74 (64%), Gaps = 1/74 (1%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
D +SR+W TYRKGF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 131 EDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKP 190
Query: 76 KEEAYLKILKMFED 89
Y+ IL MF D
Sbjct: 191 YSPEYIGILHMFGD 204
>gi|149246610|ref|XP_001527730.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|166990616|sp|A5DSB4.1|ATG4_LODEL RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|146447684|gb|EDK42072.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 523
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 132/332 (39%), Gaps = 76/332 (22%)
Query: 23 LWFTYRKGFVPI--GDSG-------------------------------LTTDKGWGCML 49
LW +YR GF PI D G T+D GWGCM+
Sbjct: 130 LWLSYRCGFEPIPKSDDGPQPITFFPSIVFNRLTLVNLSNLRSLLDKDHFTSDAGWGCMI 189
Query: 50 RCGQ--MVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI--ALTGA 105
R Q + A LF G Q +K EA ++++F+D +AP+S+H A
Sbjct: 190 RTSQNLLANALLRLFHTTGGQPQNFAVTKTEA--DVIELFQDTLSAPFSLHNFIKAANSL 247
Query: 106 SEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP 165
S G+WFGP+ + ++KL +D++ I + + K+ T N + S
Sbjct: 248 SLNIKPGQWFGPSAASLSIKKLV--NDYNLIQQERRSERDSGRDSGHKVPTPNLKLHSKS 305
Query: 166 QWQPLVLVIPLRLGIQDINPVYI--------NGIKKCYALPISPVYDMVKI--------- 208
I VY+ + I + L P+ + I
Sbjct: 306 ADSDSDSDSDAISKRNSIPYVYVSENCDLYDDEINAIFELEQRPILFLFPIRLGIEQVNK 365
Query: 209 --LSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG-NDVIFLDPHTNQNIGCVYD 265
SS + ++ S+G+ GGKP+ + YFIGY G +D+I+ DPH Q + +
Sbjct: 366 YYYSSILQILASKF------SVGIAGGKPSSSFYFIGYEGEDDLIYFDPHLPQIVQTPVN 419
Query: 266 KEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
E +YH + S+L I +DPS+ +
Sbjct: 420 LE---------SYHTSEYSKLKIDQLDPSMMI 442
>gi|145521674|ref|XP_001446691.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414171|emb|CAK79294.1| unnamed protein product [Paramecium tetraurelia]
Length = 473
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 129/303 (42%), Gaps = 60/303 (19%)
Query: 13 EQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
E I + + FTYR+GF DS LTTD GWGC++R GQM++A+ LL HL ++
Sbjct: 42 EDILDVVVHTIRFTYRQGFQAYQCQDSALTTDSGWGCVIRVGQMMMAE-LLKRHLKCFYK 100
Query: 71 WNVNSKEEAYLKILKMFEDRRTAP------------YSIHQIALTGASE-GKAVGEWFGP 117
++ S +L+MF+D +SI +I E GK GEW+ P
Sbjct: 101 VDLFSFPPLLQDVLQMFKDDDDMESQKGFSKPSKYGFSIQKIMRVAYKEWGKKPGEWYSP 160
Query: 118 NTVAQVLRKLAK-------YDDWSSIVFHVALDNTLVVNQVKKL--CTTNKRASSNPQW- 167
N + Q + K+ + Y + +D + ++ + C K+ S Q+
Sbjct: 161 NQIVQAIYKILQEINIPYCYGLGFVPFYESQIDLRAIFQEMCMMEDCVCQKKVFSIEQFL 220
Query: 168 ----------QPLVLVIPLRLGIQDI--------NPVYINGI------KKCYALPISPVY 203
+ +V V+ I D+ N I + +KC+ +P+ V
Sbjct: 221 KSLEKLEIGKEEMVQVMHGNDSISDVCCEDQSEQNKKEIGNLLKKYICQKCF-VPVRAV- 278
Query: 204 DMVKILSST-YNMQTPRYEFTFPQSL------GVIGGKPNHALYFIGYVGNDVIFLDPHT 256
V +LS + P Y Q + G++GG+P A + +G+V N + LDPH
Sbjct: 279 -AVCLLSRIGCDEPNPDYLQAIRQFMKKKYFAGMLGGRPKEANFIVGFVDNKFVVLDPHL 337
Query: 257 NQN 259
Q
Sbjct: 338 VQE 340
>gi|154419947|ref|XP_001582989.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121917228|gb|EAY22003.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 284
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 107/262 (40%), Gaps = 46/262 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
+I+ I F YR + +S LTTDKGWGC R Q ++ Q +L LH R ++
Sbjct: 12 EIKDVIADIPRFCYRYNLSDLANSLLTTDKGWGCCFRSTQGLLCQYILKLH--RKFRSLY 69
Query: 74 NSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
+ + L +F D +AP+ I + + G VGEW P+ +A ++ + +
Sbjct: 70 DQVFGQNVNPLDLFLDIPSAPFGIQNLTKNAFAIGLPVGEWAKPSIMAATIKLIFDTLNL 129
Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
S I ++ D TL N +K P +++IP G+ ++ Y++ +
Sbjct: 130 SCI---ISQDLTLDSNDIKH------------TKYPALILIPSLFGLSKMDDSYLSFLLL 174
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
C + SLG + G+ A YF+G+ D + D
Sbjct: 175 CLCI---------------------------ESSLGFVSGQNASAYYFVGFDLEDFYYFD 207
Query: 254 PHTNQN--IGCVYDKEQDSEKK 273
PH + + YD D E K
Sbjct: 208 PHVTKEAVVSPPYDSFFDLELK 229
>gi|330840629|ref|XP_003292315.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
gi|325077457|gb|EGC31168.1| hypothetical protein DICPUDRAFT_99299 [Dictyostelium purpureum]
Length = 465
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/350 (23%), Positives = 127/350 (36%), Gaps = 136/350 (38%)
Query: 48 MLRCGQMVIAQALL------FLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAP--YSIHQ 99
MLR GQM++A+ALL + + + +NSK Y ++LK F D ++ Y IHQ
Sbjct: 1 MLRTGQMILARALLKHVYPDNVIINHQERIRINSK---YNQVLKWFSDYQSKEHLYGIHQ 57
Query: 100 IA-LTGASEGKA------------------------------------------------ 110
I + A E K
Sbjct: 58 IVHMKKAMEKKIRQKALENYARRKQQLQQQQQQRYGKNSVRVRIDNYSDSSSDSEDEWDN 117
Query: 111 VGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLC--------------- 155
V EW P ++ VLR+L K + + +V D + + +LC
Sbjct: 118 VEEWLAPTKISNVLRQLVKNQNLDDLEMYVPNDGVIYREYINQLCNPYYFNNYKNNDQNN 177
Query: 156 ----TTNKRASS-----------------------NP-QWQPLVLVIPLRLGIQDINPVY 187
+ N+ S NP +W+ L+++IPL+LG+ IN Y
Sbjct: 178 QNNLSMNQSPPSRVPSEVFNHPLSVNDDDQDYYHFNPNKWKSLIIMIPLKLGVDRINTSY 237
Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
I +K ++ PQSLG IGGKP + YFIG+ +
Sbjct: 238 IRKLKSILSI---------------------------PQSLGFIGGKPKQSFYFIGFQDD 270
Query: 248 DVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
VI+LDPH V D S T+ ++ ++DPS++V
Sbjct: 271 QVIYLDPH------FVQDTVDPSSNNYSETFCGCIPQKMSFSNIDPSLSV 314
>gi|167394648|ref|XP_001741038.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165894548|gb|EDR22516.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 200
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 91/177 (51%), Gaps = 16/177 (9%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
+ +D+++ R +W TYRK I + TTD GWGCM+R QMV+AQ L + LG
Sbjct: 27 NKKDIDEFARHT---IWITYRKNMPLIKEK--TTDSGWGCMIRSLQMVLAQTFLSIVLGN 81
Query: 68 DWQWN---VNSKEEAY--LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQ 122
+W++ +N++ + I+ +F D + +SIH++ ++ G G+W+GP+ +
Sbjct: 82 NWKYENNCMNTERNIFHIKSIINLFGDSTGSLFSIHRLVARASTRGVTEGQWWGPSFASD 141
Query: 123 VLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLG 179
+ + +VA ++V ++++L + + P ++ +PLRLG
Sbjct: 142 IAAEHINEMRVFRTRGYVAKLGSIVGPKIEEL------SKDEVGFNPCIIFVPLRLG 192
>gi|123479730|ref|XP_001323022.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121905878|gb|EAY10799.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 284
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 105/251 (41%), Gaps = 47/251 (18%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGR 67
++ D QI +I F YR F I +S L+ D GWGC R Q ++ Q +L LH +
Sbjct: 9 TNVDANQILAEIPR---FCYRNNFQAIENSTLSCDSGWGCCFRSSQGLVCQYILRLH--K 63
Query: 68 DWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
++ NS L +F D AP+ I I S G +G W P+ +A + +
Sbjct: 64 NFPDLYNSTFGIDKNPLDLFLDIPEAPFGIQNIVTHANSLGLPIGNWAKPSIIASAYKSI 123
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVY 187
+ + I V D+T + +++ +TN P++++IP G++ I Y
Sbjct: 124 FQSLHLNCI---VPQDSTFIYEELE---STN---------YPVLILIPGLFGLEKIEKPY 168
Query: 188 INGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN 247
I+ I LS N SLG + G + A YFIG+ +
Sbjct: 169 ISFI----------------FLSLCMN-----------SSLGFVSGHNDSAFYFIGFDSD 201
Query: 248 DVIFLDPHTNQ 258
+ DPH +
Sbjct: 202 YFYYFDPHVTK 212
>gi|149422017|ref|XP_001518728.1| PREDICTED: cysteine protease ATG4D-like [Ornithorhynchus anatinus]
Length = 286
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 89/186 (47%), Gaps = 39/186 (20%)
Query: 112 GEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLV 171
GEW P A + + S +V +VA D T+ + +L + +W+ ++
Sbjct: 64 GEWTRPPGKA-----VEGSSEVSGMVVYVAQDCTVYKADMARL--AGQPGDPEAEWKSII 116
Query: 172 LVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVI 231
+++P+RLG + +NP Y+ IK+ L + P LG+I
Sbjct: 117 ILVPVRLGGETLNPAYMPCIKE--LLRMEPC-------------------------LGII 149
Query: 232 GGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
GGKP H+LYFIGY + +++LDPH Q CV D +DS L+S +HC +L M
Sbjct: 150 GGKPKHSLYFIGYQDDFLLYLDPHYCQP--CV-DTMKDS-FPLES-FHCTAPRKLPFAKM 204
Query: 292 DPSIAV 297
DPS V
Sbjct: 205 DPSCTV 210
>gi|291238482|ref|XP_002739158.1| PREDICTED: Autophagy-specific gene 4-like [Saccoglossus
kowalevskii]
Length = 338
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 71/137 (51%), Gaps = 32/137 (23%)
Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
+ S+ W ++++IP+RLG +++NPVYI+ IK
Sbjct: 156 SRSSQLWCSVIILIPVRLGGEELNPVYISCIKSL-------------------------- 189
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
FT +G+IGGKP H+LYFIG+ + +I LDPH Q++ V + +D + ++HC
Sbjct: 190 -FTLKHCIGIIGGKPKHSLYFIGFQEDKLIHLDPHLCQDV--VDMRSRDFPLQ---SFHC 243
Query: 281 PQASRLHILHMDPSIAV 297
++ ++ MDPS +
Sbjct: 244 MSPRKMSLMKMDPSCTI 260
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/62 (53%), Positives = 45/62 (72%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
+E+ +RD TSRLW TYR+ F + + LTTD GWGCMLR GQM++AQ+ L LGR ++
Sbjct: 81 MERFKRDFTSRLWLTYRREFQQLAGTSLTTDCGWGCMLRSGQMMLAQSFLTHFLGRVYKQ 140
Query: 72 NV 73
+V
Sbjct: 141 DV 142
>gi|324519641|gb|ADY47439.1| Cysteine protease ATG4C, partial [Ascaris suum]
Length = 282
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 62/116 (53%), Gaps = 8/116 (6%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
D SR+W TYR P+ S TTD GWGC LR QM++AQAL+ LHLGR+W++ + +
Sbjct: 142 DYYSRIWLTYRTELSPLPGSSKTTDCGWGCTLRTCQMMLAQALVVLHLGREWRFWGDEEA 201
Query: 78 EAY------LKILKMFEDRRTAPYSIHQIALTGA--SEGKAVGEWFGPNTVAQVLR 125
Y I+ +F D A ++++ +E AVG W+ T ++R
Sbjct: 202 NRYRCGFGHYDIVSLFGDHLDADLGLYRLMKIAKERNEHDAVGNWYSACTAFGLIR 257
>gi|260823874|ref|XP_002606893.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
gi|229292238|gb|EEN62903.1| hypothetical protein BRAFLDRAFT_126356 [Branchiostoma floridae]
Length = 384
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 71/155 (45%), Gaps = 40/155 (25%)
Query: 151 VKKLCTTNKRASSNP--------QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
V LCT +R SSN W ++++IP+RLG + +NP+Y IK
Sbjct: 180 VVSLCTKRRRLSSNAADRDGSTENWCSVIILIPVRLGGESLNPIYEPCIKGL-------- 231
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
FT LGVIGG+P H+LYF+G+ + +I LDPH Q +
Sbjct: 232 -------------------FTMDHCLGVIGGRPKHSLYFVGFQEDKLIHLDPHFCQEVVD 272
Query: 263 VYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+ ++ E ++HC ++ I MDPS +
Sbjct: 273 MTPRDFPLE-----SFHCMNPRKMSIARMDPSCTI 302
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
+E +RD S++W TYR+ F + S TTD GWGCMLR GQM++A L+ LGR ++
Sbjct: 119 MELFKRDFASKVWLTYRREFPQLAGSMFTTDCGWGCMLRSGQMLLAGGLVMHFLGRVYKQ 178
Query: 72 NVNS 75
+V S
Sbjct: 179 DVVS 182
>gi|216963257|gb|ACJ73915.1| autophagy-related 4b variant 3 [Zea mays]
Length = 178
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/53 (60%), Positives = 40/53 (75%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D +SR+W TYRKGF I S LT+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 118 DFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWR 170
>gi|440291586|gb|ELP84849.1| hypothetical protein EIN_284050 [Entamoeba invadens IP1]
Length = 352
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 112/258 (43%), Gaps = 60/258 (23%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE 77
+I + +F YR F P+ ++ LT+D GWGC +R QM++A A +G+ + + ++ E
Sbjct: 83 EIINLFYFVYRNNFQPLPNTTLTSDSGWGCTIRSTQMLVANA-----IGKLFTNDFDTGE 137
Query: 78 EAYLKILKMFEDRRTA--PYSIHQIALTGAS-EGKAVGEWF-GPNTVAQVLRKLAKYDDW 133
++K F D + P+SIH + LT A +G G F P+ VA ++ K
Sbjct: 138 VTDKMVIKFFLDFFSVECPFSIHNLFLTKAILQGNINGNSFLPPSAVAAAFVEINK--KL 195
Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKK 193
++ F + + T +V QP +++IP+ + P N
Sbjct: 196 ANPKFGMEILTTTFTFRVYT--------------QPTIVLIPISI------PDSFN---- 231
Query: 194 CYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLD 253
D + ++ S Y G++GG A YF G + ++FLD
Sbjct: 232 ----------DKIAVIFSFYLFS------------GMVGGSGRKAFYFFGIHHDQLLFLD 269
Query: 254 PHTNQNI---GCVYDKEQ 268
PHT +N C +D ++
Sbjct: 270 PHTVRNTVINSCSFDPQE 287
>gi|151556001|gb|AAI49850.1| ATG4D protein [Bos taurus]
Length = 359
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/63 (50%), Positives = 44/63 (69%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L R ++
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRVYK 166
Query: 71 WNV 73
+V
Sbjct: 167 ADV 169
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 65/138 (47%), Gaps = 32/138 (23%)
Query: 160 RASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPR 219
R +W+ +V+++P+RLG + +NPVY+ +K+ R
Sbjct: 175 RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL-----------------------R 211
Query: 220 YEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYH 279
E LG++GGKP H+LYFIGY + +++LDPH Q V + E ++H
Sbjct: 212 SELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE-----SFH 262
Query: 280 CPQASRLHILHMDPSIAV 297
C ++ MDPS V
Sbjct: 263 CTSPRKMAFAKMDPSCTV 280
>gi|46136685|ref|XP_390034.1| hypothetical protein FG09858.1 [Gibberella zeae PH-1]
Length = 360
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 92/206 (44%), Gaps = 14/206 (6%)
Query: 18 DITSRLWFTYRKGFVPIGDSG---LTTDKGWGCMLRC--GQMVIAQALLFLHLGR-DWQW 71
D S++W TYR F PI S T+ L+ G + + LGR DW+
Sbjct: 120 DFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQLGDQSPFSSDTMVRLGRGDWRR 179
Query: 72 NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAKY 130
+ +EE ++LK F D APYSIH GAS GK GEWFGP+ A+ ++ L
Sbjct: 180 GESVEEEC--RLLKDFADDPRAPYSIHSFVRHGASACGKYPGEWFGPSATARCIQALTNS 237
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
+ S V+ + ++ ++ + P ++++ RLGI I PVY
Sbjct: 238 HESSIRVYSTGDGPDVYEDEFMQIAKPPGE-----DFHPTLVLVGTRLGIDKITPVYWEA 292
Query: 191 IKKCYALPISPVYDMVKILSSTYNMQ 216
+ +P S D ++ + ++Q
Sbjct: 293 LIAALQMPQSNEVDWQELKRNVKHVQ 318
>gi|145500036|ref|XP_001436002.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403139|emb|CAK68605.1| unnamed protein product [Paramecium tetraurelia]
Length = 469
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 132/302 (43%), Gaps = 60/302 (19%)
Query: 13 EQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
E I + + FTYR+GF +S LTTD GWGC++R GQM++A+ LL HL +
Sbjct: 42 EDILDVVIHTIRFTYREGFQAYQCQNSTLTTDSGWGCVIRVGQMMMAE-LLKRHLKCFYN 100
Query: 71 WNVNSKEEAYLKILKMFEDRRTAP------------YSIHQIALTGASE-GKAVGEWFGP 117
N+ ++L++F+D +SI +I E GK GEW+ P
Sbjct: 101 VNLFQFPPLMQEVLQLFKDDDEMESLKVQGKPSKYGFSIQKIMRIAYEEWGKKPGEWYSP 160
Query: 118 NTVAQVLRKLAKYDD------WSSIVFHVALDNTLVVNQ---VKKLCTTNKRASSNPQW- 167
N + Q + K+ ++ S + F+ + + V+ Q V + C +R ++
Sbjct: 161 NQIVQAIYKILSDNNIIYSCGLSLLPFYESQIDLKVILQEMCVMENCICEQRVFFIEKFL 220
Query: 168 QPLVL-------VIPLRLGIQDINPVYINGI-----------------KKCYALPISPVY 203
Q LV VI + G I+ VY + +KC+ +PI V
Sbjct: 221 QDLVRLEINKEEVIQVIHGNDSISDVYYEDLSQQNKQEIGMLLKKYVCQKCF-VPIRAV- 278
Query: 204 DMVKILSST-YNMQTPRYEFTFPQSL------GVIGGKPNHALYFIGYVGNDVIFLDPHT 256
+ +LS + P Y Q + G++GG+P A + +G+V + + LDPH
Sbjct: 279 -AICLLSRIGCDEPNPDYIQAIRQFMKKKYFAGLLGGRPREANFIVGFVDDKFVVLDPHL 337
Query: 257 NQ 258
Q
Sbjct: 338 VQ 339
>gi|216963264|gb|ACJ73916.1| autophagy-related 4b variant 4 [Zea mays]
Length = 208
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 32/53 (60%), Positives = 40/53 (75%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D +SR+W TYRKGF I S LT+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 148 DFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWR 200
>gi|440789707|gb|ELR11008.1| cysteine protease atg4a, putative, partial [Acanthamoeba
castellanii str. Neff]
Length = 180
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 71/145 (48%), Gaps = 37/145 (25%)
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W P+++++P+RLGIQ +NP+YI +K F+FPQ
Sbjct: 11 WHPVIILVPVRLGIQCLNPIYIPTLKAF---------------------------FSFPQ 43
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDS-TYHCPQASR 285
LGVIGGKP+ + YF+GY N V+++DPH Q + D ++S PQA
Sbjct: 44 CLGVIGGKPHSSFYFVGYQDNKVLYMDPHFVQP---TVKMDDDPLFPIESYRMEIPQAMS 100
Query: 286 LHILHMDPSIAV----VSQRSYSDY 306
+DPS+A+ SQ + D+
Sbjct: 101 FD--DIDPSLALGFLCSSQAEFDDF 123
>gi|216963276|gb|ACJ73918.1| autophagy-related 4b variant 6 [Zea mays]
Length = 271
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 32/53 (60%), Positives = 40/53 (75%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D +SR+W TYRKGF I S LT+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 148 DFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWR 200
>gi|403345460|gb|EJY72096.1| Cysteine protease family C54 putative [Oxytricha trifallax]
Length = 823
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 72/139 (51%), Gaps = 30/139 (21%)
Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
++++IP RLG+ +N Y + IK Y F ++G
Sbjct: 611 ILVIIPTRLGLNKVNKEYYSSIK---------------------------YVFQCRLNVG 643
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHIL 289
++GG+PN ALYF+G D+I LDPH Q+ V ++E+ S +L+ TYHC QA +L +
Sbjct: 644 IMGGRPNQALYFVGTQKTDLICLDPHLVQD--TVLNQEELSNVELNQTYHCDQAKKLSMT 701
Query: 290 HMDPSIAV-VSQRSYSDYK 307
+D S+A + Y+D++
Sbjct: 702 KLDTSLAFGFYLKDYNDFE 720
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 69/130 (53%), Gaps = 8/130 (6%)
Query: 40 TTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ-WNVNSKEEA---YLKILKMFEDR---RT 92
TTD GWGC +R GQM+I QAL+ +G D N++S E+ Y KI+++ D +T
Sbjct: 394 TTDVGWGCTIRVGQMMICQALMRHLIGLDHSVKNLSSTEQKRLNYAKIIQLIHDNDCSQT 453
Query: 93 APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK-YDDWSSIVFHVALDNTLVVNQV 151
+SI IA G K GEW+GP+ + +LR L + Y + + D + +++
Sbjct: 454 GAFSIQNIAKMGFCHDKLPGEWYGPHALTIMLRDLNRIYQPVENFQVCMFRDGNVYYDKI 513
Query: 152 KKLCTTNKRA 161
K T+ +A
Sbjct: 514 MKTAITDGKA 523
>gi|216963270|gb|ACJ73917.1| autophagy-related 4b variant 5 [Zea mays]
Length = 292
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 32/53 (60%), Positives = 40/53 (75%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D +SR+W TYRKGF I S LT+D WGCM+R QM++AQAL+F HLGR W+
Sbjct: 148 DFSSRIWITYRKGFDAIPGSKLTSDVNWGCMVRSSQMLVAQALIFHHLGRSWR 200
>gi|119604523|gb|EAW84117.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_b
[Homo sapiens]
Length = 228
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 33/62 (53%), Positives = 44/62 (70%), Gaps = 2/62 (3%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL--FLHLGRD 68
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL FL G+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRGKP 167
Query: 69 WQ 70
W+
Sbjct: 168 WR 169
>gi|119604525|gb|EAW84119.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_d
[Homo sapiens]
Length = 360
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 32/63 (50%), Positives = 44/63 (69%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L R ++
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRVYK 167
Query: 71 WNV 73
+V
Sbjct: 168 ADV 170
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 65/138 (47%), Gaps = 32/138 (23%)
Query: 160 RASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPR 219
R +W+ +V+++P+RLG + +NPVY+ +K+ R
Sbjct: 176 RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL-----------------------R 212
Query: 220 YEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYH 279
E LG++GGKP H+LYFIGY + +++LDPH Q V + E ++H
Sbjct: 213 CELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE-----SFH 263
Query: 280 CPQASRLHILHMDPSIAV 297
C ++ MDPS V
Sbjct: 264 CTSPRKMAFAKMDPSCTV 281
>gi|320588376|gb|EFX00845.1| cysteine protease atg4 [Grosmannia clavigera kw1407]
Length = 348
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 53/110 (48%), Gaps = 24/110 (21%)
Query: 18 DITSRLWFTYRKGFVPI---------------------GD-SGLTTDKGWGCMLRCGQMV 55
D SR W TYR GF PI GD S ++D GWGCM+R GQ +
Sbjct: 123 DFESRFWMTYRSGFEPIARSVDPKAPATLSFTMKLKALGDQSDFSSDSGWGCMIRSGQSL 182
Query: 56 IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA 105
+A A+ LGR W+ + E +I+ +F D APYSIH+ GA
Sbjct: 183 LANAMAMYELGRGWRLSDGGIAEK--EIISLFADDPRAPYSIHRFVGHGA 230
>gi|426387285|ref|XP_004060104.1| PREDICTED: cysteine protease ATG4D [Gorilla gorilla gorilla]
Length = 362
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/63 (50%), Positives = 44/63 (69%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L R ++
Sbjct: 110 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRVYK 169
Query: 71 WNV 73
+V
Sbjct: 170 ADV 172
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 68/138 (49%), Gaps = 32/138 (23%)
Query: 160 RASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPR 219
R +W+ +V+++P+RLG + +NPVY+ +K+ R
Sbjct: 178 RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL-----------------------R 214
Query: 220 YEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYH 279
E LG++GGKP H+LYFIGY + +++LDPH Q D Q ++ L+S +H
Sbjct: 215 CELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQ-ADFPLES-FH 265
Query: 280 CPQASRLHILHMDPSIAV 297
C ++ MDPS V
Sbjct: 266 CTSPRKMAFAKMDPSCTV 283
>gi|240274226|gb|EER37743.1| cysteine protease atg4 [Ajellomyces capsulatus H143]
Length = 454
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/307 (23%), Positives = 115/307 (37%), Gaps = 110/307 (35%)
Query: 18 DITSRLWFTYRKGF--VP---------------------IGDSGLTTDKGWGCMLRCGQM 54
D S++W TYR F +P + G TTD GWGCM+R G
Sbjct: 128 DFESKIWLTYRSNFPLIPKSNDPNALSAMTLGVRLRSQLVDSQGFTTDTGWGCMIRSG-- 185
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEW 114
Q+LL N+ L IL
Sbjct: 186 ---QSLL-----------ANA-----LAIL------------------------------ 196
Query: 115 FGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVV-NQVKKLCTTNKR-ASSNPQWQPLVL 172
++ + R L+ + + + +V D + V ++ + + + A ++ P ++
Sbjct: 197 ----SLGRACRALSSECEHAGLNVYVTSDGSDVYEDRFRAIASAGGTGAGTSTDVHPTLI 252
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
++ +RLGI + PVY +K +PQS+G+ G
Sbjct: 253 LLGIRLGIDRVTPVYWEALKAV---------------------------LKYPQSVGIAG 285
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD--STYHCPQASRLHILH 290
G+P+ + YFIG G+ +LDPH + VY D + +TYH + RLHI
Sbjct: 286 GRPSSSHYFIGAQGSHFFYLDPH-HTRPALVYHDAGDRPYTTEELNTYHTRRLRRLHIKD 344
Query: 291 MDPSIAV 297
MDPS+ +
Sbjct: 345 MDPSMLI 351
>gi|323450755|gb|EGB06635.1| hypothetical protein AURANDRAFT_65498 [Aureococcus anophagefferens]
Length = 426
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 50/113 (44%), Gaps = 17/113 (15%)
Query: 23 LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
LWFTYR GF + G T D GWGCMLR QM++ AL + L
Sbjct: 28 LWFTYRCGFEELAPYGFTDDAGWGCMLRSAQMLLGNAL------------TRNGAAPRLA 75
Query: 83 ILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
+F D +AP+ +H A G GEW+GP VLR L DW
Sbjct: 76 TAALFADAPGDSAPFGLHNFAKCGLRYDVLPGEWYGPGVACHVLRDLV---DW 125
>gi|428184439|gb|EKX53294.1| hypothetical protein GUITHDRAFT_133035 [Guillardia theta CCMP2712]
Length = 567
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 96/216 (44%), Gaps = 58/216 (26%)
Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD--W----SSIVFHVALDNTLVVNQVKK 153
+ L GA WFGP+T+ +VLR + ++ W + ++F D+ + + +
Sbjct: 337 LMLHGAISQPLCCRWFGPDTICRVLRHIWNMNEGVWPCHTAGMLF--VEDHCIYRDLAES 394
Query: 154 L-----------CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
+ C+ +A W+PL++V+P+RLG + + +++ I K
Sbjct: 395 VACSRQAYSGTNCSRMAQAREPCSWRPLIVVVPVRLGARSEDQ-HLSRIDKHL------- 446
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGC 262
QSLG IGG+P H+ YF+G G + +LDPH Q
Sbjct: 447 -----------------------QSLGFIGGRPRHSYYFVGVRGYNAYYLDPHITQPY-- 481
Query: 263 VYDKEQDSEKKLD-STYHCPQASRLHILHMDPSIAV 297
Q K ++ +++HC ++ + H+DPS+A+
Sbjct: 482 -----QSIRKNINVASFHCAHPGKMSLAHIDPSLAL 512
>gi|154343631|ref|XP_001567761.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065093|emb|CAM43207.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 398
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 66/267 (24%), Positives = 106/267 (39%), Gaps = 54/267 (20%)
Query: 23 LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
L+F+YR F P+ + G TTD WGC+LR QM+I LL H + E
Sbjct: 74 LYFSYRSCFPPLPN-GSTTDTRWGCLLRTTQMLIGTCLLRYHCKGAYVLPEADNAELKAN 132
Query: 83 ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK--LAKYDDWSSIVFHV 140
I ++F D +AP IH+ P + +A + + + F
Sbjct: 133 ISRLFMDVPSAPLGIHRAEDEAHKNCVKYASMLSPTEAGMAMAAALIACHAEGGDVPFTF 192
Query: 141 ALDNTLVVNQ--VKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
+ +N + V KL + Q ++L+IP+ LG+ P
Sbjct: 193 SCENRNIDEPAVVAKLL----------EGQHVILIIPVVLGLA----------------P 226
Query: 199 ISPVYD--MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH- 255
+S Y+ M+KIL +M+ G+ GG + Y G+ G V F+DPH
Sbjct: 227 LSDKYESMMLKIL----DMKA---------CCGIAGGFKQASFYMFGHQGRKVFFMDPHY 273
Query: 256 ------TNQNIGCVYDKEQD-SEKKLD 275
+++ G +Y D + +K D
Sbjct: 274 IQKAYTSDKTAGTLYGARGDLTARKFD 300
>gi|340054025|emb|CCC48319.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma vivax Y486]
Length = 326
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 60/252 (23%), Positives = 96/252 (38%), Gaps = 47/252 (18%)
Query: 20 TSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEA 79
TS TYR F P+ S LT+D+GWGC+ R QM++A L ++ E
Sbjct: 41 TSFYLLTYRMNFEPLPCSTLTSDRGWGCLARASQMLLAHVL-----------RRHAASEC 89
Query: 80 YLKILKMFEDRRTAPYSIHQIALTGASEGKAV-GEWFGPNTVAQVLRKLAKYDDWSSIVF 138
+LK D AP+S+H + G +++ P+ + +R + S V
Sbjct: 90 HLKFFCDMNDEHLAPFSLHCMTRAVIKHGTEFRADYWAPSQGCEAIRSCVE-----SAVR 144
Query: 139 HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV-IPLRLGIQDINPVYINGIKKCYAL 197
L L V + + + VLV +P+R G
Sbjct: 145 QGLLTQKLSVVVSSSGTIPEREIHEHLRGDGSVLVLVPVRCGTS---------------- 188
Query: 198 PISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT- 256
+ ++ T + P +GV+GG PN Y +G G+ +++LDPH
Sbjct: 189 ---------RRMTQTMFFAL-EHLLHIPSCMGVVGGVPNRGYYIVGTSGHRLLYLDPHCM 238
Query: 257 --NQNIGCVYDK 266
N + C K
Sbjct: 239 TQNAMVSCELGK 250
>gi|67482849|ref|XP_656724.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|56473943|gb|EAL51338.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449705841|gb|EMD45804.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 348
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 66/241 (27%), Positives = 102/241 (42%), Gaps = 61/241 (25%)
Query: 19 ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE 78
+TS ++F YR F + ++ LT+D GWGC +R QM++A A++ L G D N+N K
Sbjct: 84 LTSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANAIIKL-FGSD---NINRK-- 137
Query: 79 AYLKILKMFED--RRTAPYSIHQIALTG-ASEGKAVGEWFGP-NTVAQVLRKLAKYDDWS 134
++ F D PYSIH + T G G F P ++V L +L D
Sbjct: 138 ---TVIHWFLDFYNSECPYSIHSLFTTQIIVSGNPNGSSFLPFSSVIYALTELVNKD--- 191
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
F+ A + ++ N+ L + NK P ++ IP
Sbjct: 192 ---FNRAFECHVITNKF-LLKSINK---------PTIVFIP------------------- 219
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+ +P ++ I F+F G++GG A YF G N ++FLDP
Sbjct: 220 FTIPDKFDQRLITI-------------FSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDP 266
Query: 255 H 255
H
Sbjct: 267 H 267
>gi|47213810|emb|CAF92583.1| unnamed protein product [Tetraodon nigroviridis]
Length = 265
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 33/67 (49%), Positives = 45/67 (67%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLG 66
L+ ++E+ R SR+W TYRK F P+ S LTTD GWGCMLR GQM++AQ LL +
Sbjct: 69 LNLDEVERFRLAFVSRIWLTYRKDFPPLEGSTLTTDCGWGCMLRSGQMLLAQGLLVHLMH 128
Query: 67 RDWQWNV 73
R ++ +V
Sbjct: 129 RVYKEDV 135
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 59/131 (45%), Gaps = 32/131 (24%)
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
WQ +++++P+RLG + +NP YI +K L
Sbjct: 155 WQSVIILVPVRLGGESLNPSYIECVKNILKLDCC-------------------------- 188
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
+G+IGGKP H+LYFIG+ +++LDPH Q + V E ++HC ++
Sbjct: 189 -IGIIGGKPKHSLYFIGFQDEQLLYLDPHYCQPVVDVSQVNFSLE-----SFHCNSPKKM 242
Query: 287 HILHMDPSIAV 297
MDPS +
Sbjct: 243 PFSRMDPSCTI 253
>gi|336259147|ref|XP_003344378.1| hypothetical protein SMAC_08321 [Sordaria macrospora k-hell]
Length = 429
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 59/126 (46%), Gaps = 26/126 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F I S G ++D GWGCM+R GQ
Sbjct: 183 DFESRIWMTYRTDFALIPRSCDPQASYALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQS 242
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A A+L LGR+W+ + E I+ +F D AP+S+H GA+ GK GE
Sbjct: 243 LLANAILVARLGREWRRETDLDAEK--DIIALFADDPRAPFSLHNFVKYGATACGKYPGE 300
Query: 114 WFGPNT 119
P++
Sbjct: 301 CGRPSS 306
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 37/66 (56%), Gaps = 2/66 (3%)
Query: 233 GKPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHM 291
G+P+ + YFIG G + +LDP H + D + + ++LD T H + +LHI M
Sbjct: 302 GRPSSSHYFIGVQGQRLFYLDPHHPRPALPYREDPKGYTAEELD-TCHTRRLRQLHIDDM 360
Query: 292 DPSIAV 297
DPS+ +
Sbjct: 361 DPSMLI 366
>gi|407037690|gb|EKE38747.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 348
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 99/241 (41%), Gaps = 61/241 (25%)
Query: 19 ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE 78
+TS ++F YR F + ++ LT+D GWGC +R QM++A +++ L G D N+N K
Sbjct: 84 LTSLIYFVYRSNFSALPNTSLTSDGGWGCTIRACQMLLANSIIKL-FGSD---NINRK-- 137
Query: 79 AYLKILKMFED--RRTAPYSIHQIALTGASEGK-AVGEWFGP-NTVAQVLRKLAKYDDWS 134
++ F D PYSIH + T K G F P + V L +L D
Sbjct: 138 ---TVIHWFLDFYNSECPYSIHSLFTTQIIVSKNPNGSSFLPFSVVIYALTELVNKDFNR 194
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
+ H+ + N ++N + K P ++ IP
Sbjct: 195 AFECHI-ITNKFLLNSINK---------------PTIVFIP------------------- 219
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+ +P ++ I F+F G++GG A YF G N ++FLDP
Sbjct: 220 FTIPDEFEQRLITI-------------FSFNLFAGMVGGSKQKAFYFFGIHHNQLLFLDP 266
Query: 255 H 255
H
Sbjct: 267 H 267
>gi|255082892|ref|XP_002504432.1| predicted protein [Micromonas sp. RCC299]
gi|226519700|gb|ACO65690.1| predicted protein [Micromonas sp. RCC299]
Length = 196
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 64/139 (46%), Gaps = 35/139 (25%)
Query: 167 WQPLVLVIPLRLGI-QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
W PLV+++PL LG+ + +NP Y+ GI + LP
Sbjct: 75 WAPLVILVPLVLGLDRCVNPRYVPGIVRMLGLP--------------------------- 107
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHT-------NQNIGCVYDKEQDSEKKLDSTY 278
QS+G++GGKP +LYF+G ++ +LDPHT Q GC +S TY
Sbjct: 108 QSVGILGGKPCASLYFVGAQDEELFYLDPHTVQLAVPLEQIWGCAQTGSPESGPFPTETY 167
Query: 279 HCPQASRLHILHMDPSIAV 297
HC ++ +DPS+ +
Sbjct: 168 HCRSVLHMNARELDPSMVL 186
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 29/40 (72%)
Query: 21 SRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQAL 60
SR+W TYR+GF IG TTD GWGC LR GQM++A AL
Sbjct: 3 SRVWITYRRGFPQIGGGTYTTDAGWGCTLRSGQMLLANAL 42
>gi|146097214|ref|XP_001468076.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
gi|134072442|emb|CAM71152.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
Length = 388
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 100/251 (39%), Gaps = 42/251 (16%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E ++ L+F+YR F P+ +G TTD WGC++R QM++ LL H +
Sbjct: 58 EFVKAATKKLLYFSYRNCFPPL-PNGSTTDTRWGCLVRTTQMLVGTCLLRYHCQGTYVLP 116
Query: 73 VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK--LAKY 130
E +I ++F D +AP IH+ P + +A +
Sbjct: 117 EADNAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSPTEAGMAIAAALIAFH 176
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
+ F ++ + + K + + Q ++L+IP+ LGI
Sbjct: 177 AQGGDVPFTFCCES----RNIDEPAVMAKLS----EGQHVILIIPVVLGIA--------- 219
Query: 191 IKKCYALPISPVYD--MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
P+S Y+ M+KIL +M+ G+ GG +LY G+ G
Sbjct: 220 -------PMSDQYERMMLKIL----DMKA---------CCGIAGGLKRASLYMFGHQGRS 259
Query: 249 VIFLDPHTNQN 259
V F+DPH QN
Sbjct: 260 VFFMDPHYIQN 270
>gi|384496645|gb|EIE87136.1| hypothetical protein RO3G_11847 [Rhizopus delemar RA 99-880]
Length = 224
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/53 (56%), Positives = 35/53 (66%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D TSRLW TYR + PI S TD GWGCMLR GQ ++A L+ LGRDW+
Sbjct: 142 DFTSRLWMTYRHNYPPIRPSNHKTDIGWGCMLRSGQSLLANTLIIHFLGRDWR 194
>gi|71407017|ref|XP_806004.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70869620|gb|EAN84153.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
gi|111154177|gb|ABH07410.1| autophagin-1 [Trypanosoma cruzi]
Length = 328
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/237 (21%), Positives = 100/237 (42%), Gaps = 52/237 (21%)
Query: 25 FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
TYR F P+ S +T+DKGWGC++R QM++A AL W+++ N + L
Sbjct: 46 LTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSAN---DCRLDHF 94
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVG-EWFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
+ + + P+S+H++ + E++ P+ + +R + A+D
Sbjct: 95 RDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWTPSQGCEAIR----------CCVNNAVD 144
Query: 144 NTLV----VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
L+ V + C + SN ++ ++++ P+R G +
Sbjct: 145 RKLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCGAS-------------RRMTQ 191
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
+ + +L S+ +GV+GG P + Y +G G +++LDPH
Sbjct: 192 MMFFSLEHLLHSS-------------ACIGVVGGVPQRSYYILGTSGQRLLYLDPHC 235
>gi|398021304|ref|XP_003863815.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
gi|322502048|emb|CBZ37132.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
Length = 388
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 100/251 (39%), Gaps = 42/251 (16%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E ++ L+F+YR F P+ +G TTD WGC++R QM++ LL H +
Sbjct: 58 EFVKAATKKLLYFSYRNCFPPL-PNGSTTDTRWGCLVRTTQMLVGTCLLRYHCQGAYVLP 116
Query: 73 VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK--LAKY 130
E +I ++F D +AP IH+ P + +A +
Sbjct: 117 EADNAELKERISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSPTEAGMAIAAALIAFH 176
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
+ F ++ + + K + + Q ++L+IP+ LGI
Sbjct: 177 AQGGDVPFTFCCES----RNIDEPAVMAKLS----EGQHVILIIPVVLGIA--------- 219
Query: 191 IKKCYALPISPVYD--MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND 248
P+S Y+ M+KIL +M+ G+ GG +LY G+ G
Sbjct: 220 -------PMSDQYERMMLKIL----DMKA---------CCGIAGGLKRASLYMFGHQGRS 259
Query: 249 VIFLDPHTNQN 259
V F+DPH QN
Sbjct: 260 VFFMDPHYIQN 270
>gi|401427503|ref|XP_003878235.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322494482|emb|CBZ29784.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 388
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/252 (23%), Positives = 96/252 (38%), Gaps = 46/252 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E ++ L+F+YR F P+ + TTD WGC++R QM++ LL H +
Sbjct: 58 EFVKAAAKKLLYFSYRNCFPPLPNRS-TTDTRWGCLVRTTQMLVGSCLLRYHCKGAYVLP 116
Query: 73 VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
E +I ++F D +AP IH++ P +
Sbjct: 117 ERDNAELKERISRLFMDVPSAPLGIHKVEDEAHKNSVKYASMLSPTEAGMAIAA------ 170
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP------QWQPLVLVIPLRLGIQDINPV 186
+ I FH + C N+ + + Q ++L+IP+ LGI
Sbjct: 171 -ALIAFHAQGGDAPFT-----FCCENRNIDESAVMAKLSEGQHVILIIPVVLGIA----- 219
Query: 187 YINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVG 246
P+S Y+ ++L +M+ G+ GG +LY G+ G
Sbjct: 220 -----------PMSGQYE--RMLLKILDMKA---------CCGIAGGFKQASLYMFGHQG 257
Query: 247 NDVIFLDPHTNQ 258
+V F+DPH Q
Sbjct: 258 RNVFFMDPHYVQ 269
>gi|407417199|gb|EKF38000.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi marinkellei]
Length = 328
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/237 (21%), Positives = 100/237 (42%), Gaps = 52/237 (21%)
Query: 25 FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
TYR F P+ S +T+DKGWGC++R QM++A AL W+++ N + L
Sbjct: 46 LTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSAN---DCRLDHF 94
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVG-EWFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
+ + + P+S+H++ + E++ P+ + +R + A+D
Sbjct: 95 RDIDTEDSTPFSLHKMVRAVMKKADVFRPEYWTPSQGCEAIR----------CCVNNAVD 144
Query: 144 NTLV----VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
L+ V + C + SN ++ ++++ P+R G +
Sbjct: 145 RRLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCGAS-------------RRMTQ 191
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
+ + +L S+ +GV+GG P + Y +G G +++LDPH
Sbjct: 192 MMFFSLEHLLHSS-------------ACIGVVGGVPQRSYYILGTSGQRLLYLDPHC 235
>gi|119493442|ref|XP_001263911.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
gi|119412071|gb|EAW22014.1| peptidase family C54 protein [Neosartorya fischeri NRRL 181]
Length = 179
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 73/161 (45%), Gaps = 42/161 (26%)
Query: 143 DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
D+T V++ K L N S ++P +++I RLGI I PVY + +K LP
Sbjct: 3 DDTGDVHEDKFLDAANDERGS---FRPTLILIGTRLGIDRITPVYWDAVKTTLQLP---- 55
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN----- 257
QS+G+ GG+P+ + YF+G G+ + +LDPH
Sbjct: 56 -----------------------QSVGIAGGRPSASHYFVGVQGSHLFYLDPHQTRPALP 92
Query: 258 -QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+NI Y E+ TYH + R+HI MDPS+ +
Sbjct: 93 QRNIDERYTDEE------IETYHTRRLRRIHIRDMDPSMLI 127
>gi|407852207|gb|EKG05835.1| AUT2/APG4/ATG4 cysteine peptidase, putative,cysteine peptidase,
Clan CA, family C54, putative [Trypanosoma cruzi]
Length = 328
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/237 (22%), Positives = 99/237 (41%), Gaps = 52/237 (21%)
Query: 25 FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
TYR F P+ S +T+DKGWGC++R QM++A AL W+++ N + L
Sbjct: 46 LTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSAN---DCRLDHF 94
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVG-EWFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
+ + P+S+H++ + E++ P+ + +R + A+D
Sbjct: 95 CDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWTPSQGCEAIR----------CCVNNAVD 144
Query: 144 NTLV----VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
L+ V + C + SN ++ ++++ P+R G
Sbjct: 145 RKLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCG-------------------A 185
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
S +K S + + + +GV+GG P + Y +G G +++LDPH
Sbjct: 186 SRRMTQMKFFSLEHLLHS-------STCIGVVGGVPQRSYYILGTSGQRLLYLDPHC 235
>gi|157874465|ref|XP_001685715.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
gi|68128787|emb|CAJ08920.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
Length = 388
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/241 (25%), Positives = 92/241 (38%), Gaps = 42/241 (17%)
Query: 23 LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK 82
L+F+YR F P+ SG TTD WGC++R QM++ LL H + E +
Sbjct: 68 LYFSYRNCFPPL-PSGSTTDTHWGCLVRTTQMLVGTCLLRYHCKGAYVLPEADNAELKER 126
Query: 83 ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK--LAKYDDWSSIVFHV 140
I ++F D +AP IH+ P + +A + F
Sbjct: 127 ISRLFMDVPSAPLGIHKAEDEAHKNSVKYASMLSPTEAGMAIAAALIAFRAQGGDVPFTF 186
Query: 141 ALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
++ + + K + Q +VL+IP+ LGI P+S
Sbjct: 187 CCES----RHIDEPAVMAKLL----EGQHVVLIIPVVLGIA----------------PMS 222
Query: 201 PVYD--MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ 258
Y+ M+KIL G+ GG +LY G+ G V F+DPH Q
Sbjct: 223 DQYELVMLKILD-------------VKACCGIAGGFKQASLYMFGHQGRSVFFMDPHYVQ 269
Query: 259 N 259
N
Sbjct: 270 N 270
>gi|145507452|ref|XP_001439681.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406876|emb|CAK72284.1| unnamed protein product [Paramecium tetraurelia]
Length = 312
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 114/251 (45%), Gaps = 51/251 (20%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW 69
Q+ E + + +WF YR G + +D+GWGC++R GQM++A AL+ R+
Sbjct: 44 QNAEAFNQKKDTLIWFCYRANIQFEGKA--ISDQGWGCLVRVGQMMLANALM-----REC 96
Query: 70 QWNVNSKEEAYLKILKMFEDRR----TAPYSIHQIALTGA-SEGKAVGEWFGPNTVAQVL 124
+ +K +A I+ +F+D + AP+SI QI + + +G+W+ + V+
Sbjct: 97 KILAINKTKAM--IIHLFDDNQEYSTIAPFSIQQIIKRASINLNMKIGDWYTGPKIMSVI 154
Query: 125 RKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN 184
L K + + + +VN +++ C + + + +P +L+I +G + +
Sbjct: 155 EDLNKNN--------MNIKQINLVNFLEQ-CVLESQIDLSFK-KPHLLIIHAIIGDKSLG 204
Query: 185 PVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGY 244
+ I ++ +MQ ++ G I GK N A + IG+
Sbjct: 205 QLEIQNLQS--------------------HMQISQFA-------GAIIGKNNKAFFLIGF 237
Query: 245 VGNDVIFLDPH 255
N+ IF+DPH
Sbjct: 238 QKNNAIFMDPH 248
>gi|124088531|ref|XP_001347134.1| Cysteine protease required for autophagy-like [Paramecium
tetraurelia strain d4-2]
gi|145474259|ref|XP_001423152.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|50057523|emb|CAH03507.1| Cysteine protease required for autophagy-like [Paramecium
tetraurelia]
gi|124390212|emb|CAK55754.1| unnamed protein product [Paramecium tetraurelia]
Length = 277
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/242 (24%), Positives = 108/242 (44%), Gaps = 53/242 (21%)
Query: 23 LWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK-EEAYL 81
+WF+YR G + +D+GWGC++R GQM++A +L+ + + NSK +
Sbjct: 22 IWFSYRANIQYEGRA--ISDQGWGCLIRVGQMIVANSLI--------RESTNSKPNDLKT 71
Query: 82 KILKMFEDRRT----APYSIHQ-IALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSI 136
KI+ +F+D + AP+SI Q I +G+W+ + +L L +
Sbjct: 72 KIICLFDDNQCFSTLAPFSIQQIIKRADLVYNIKIGDWYTGPKIMCLLEDLLQSAK---- 127
Query: 137 VFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYA 196
+ ++N +++ C K+ + QP +L+I +G ++++ ++ ++K
Sbjct: 128 ----TIKQLKIINFLEQ-CVIEKQIDLQFK-QPQLLIIHAIIGNKELDQYFVAELQK--- 178
Query: 197 LPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
+MQ P++ G I GK A + IGY N I +DPH
Sbjct: 179 -----------------HMQIPQFA-------GAIVGKSKKAYFLIGYQNNQGIVMDPHY 214
Query: 257 NQ 258
Q
Sbjct: 215 VQ 216
>gi|71425372|ref|XP_813094.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma cruzi strain CL
Brener]
gi|70877946|gb|EAN91243.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma cruzi]
Length = 328
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/237 (21%), Positives = 99/237 (41%), Gaps = 52/237 (21%)
Query: 25 FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
TYR F P+ S +T+DKGWGC++R QM++A AL W+++ N + L
Sbjct: 46 LTYRMNFSPLPHSSVTSDKGWGCLVRSSQMLLAHAL--------WRYSAN---DCRLDHF 94
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVG-EWFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
+ + P+S+H++ + E++ P+ + +R + A+D
Sbjct: 95 CDMDTEDSTPFSLHKMVRAVMKKADVFRPEYWTPSQGCEAIR----------CCVNNAVD 144
Query: 144 NTLV----VNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
L+ V + C + SN ++ ++++ P+R G +
Sbjct: 145 RKLIPPIRVVVCSQGCLLAREICSNLEFGTVLILAPMRCGAS-------------RRMTQ 191
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHT 256
+ + +L S+ +GV+GG P + Y +G G +++LDPH
Sbjct: 192 MMFFSLEHLLHSS-------------ACIGVVGGVPQRSYYILGTSGQRLLYLDPHC 235
>gi|330846267|ref|XP_003294964.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
gi|325074459|gb|EGC28510.1| hypothetical protein DICPUDRAFT_85404 [Dictyostelium purpureum]
Length = 266
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/44 (61%), Positives = 36/44 (81%)
Query: 18 DITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL 61
D+ S +WF+YRK F PI ++ +TTD GWGCMLR GQM++A+ALL
Sbjct: 216 DVRSLIWFSYRKDFPPIENTTITTDIGWGCMLRTGQMILARALL 259
>gi|403364614|gb|EJY82073.1| hypothetical protein OXYTRI_20407 [Oxytricha trifallax]
Length = 806
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 65/129 (50%), Gaps = 30/129 (23%)
Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLG 229
L++++ +RLG+++I Y +K C++L Q +G
Sbjct: 641 LMIIMTIRLGLENIEQDYHKALKACFSLR---------------------------QCVG 673
Query: 230 VIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHI 288
++GGKPN ALYF+GY + +IFLDPH Q + EQ +++L TY + A ++ +
Sbjct: 674 ILGGKPNFALYFVGYQQDHMIFLDPHYVQQ--ALTSDEQLKDQELKDTYQSQRSAKKIKM 731
Query: 289 LHMDPSIAV 297
+DP I V
Sbjct: 732 ESLDPCIGV 740
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 6/67 (8%)
Query: 39 LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKE-EAYLKILKMFEDRRTAPYSI 97
+ +D GWGCM+RC QM++A + L L Q N N + + IL M D+ AP+ I
Sbjct: 393 INSDCGWGCMIRCQQMMLANSFLKL-----LQQNHNFHDILTHDSILSMILDQLDAPFGI 447
Query: 98 HQIALTG 104
HQI G
Sbjct: 448 HQITEEG 454
>gi|401425377|ref|XP_003877173.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322493418|emb|CBZ28705.1| cysteine peptidase, Clan CA, family C54,putative [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 394
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 105/255 (41%), Gaps = 54/255 (21%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
S ++LE+ D + L FTYR GF +P + TD+GWGC+LR QM++A L++H
Sbjct: 33 SREELEKALAD--TFLIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAH-FLWVH- 88
Query: 66 GRDWQWNVNSKEEAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
GR + L +F D TAP+SIH + + + E++ P+ +
Sbjct: 89 GRPAD-----------RKLSLFFDHSAETAPFSIHNMIRSVWNRRAFKAEYWSPSQGCEA 137
Query: 124 LRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI--- 180
+++ T + A Q Q V+V+ G
Sbjct: 138 IKR------------------------------TMQGAVKTEQLQTRVMVVTSTNGCIYA 167
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
++ + G L V ++ +Y +Q + PQ LGV+GG P + Y
Sbjct: 168 DEVQHTFKQGADVVLVLASVRVSAAAQLTQESY-LQIEKL-MEQPQCLGVVGGVPGRSYY 225
Query: 241 FIGYVGNDVIFLDPH 255
F + + +LDPH
Sbjct: 226 FFAHNQTQLFYLDPH 240
>gi|159128081|gb|EDP53196.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
fumigatus A1163]
Length = 226
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 72/161 (44%), Gaps = 42/161 (26%)
Query: 143 DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
D+T V + K L N S ++P +++I RLGI I PVY + +K LP
Sbjct: 3 DDTADVYEDKFLDAANDGRGS---FRPTLILIGTRLGIDRITPVYWDAVKTTLQLP---- 55
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN----- 257
QS+G+ GG+P+ + YF+G G+ + +LDPH
Sbjct: 56 -----------------------QSVGIAGGRPSASHYFVGVQGSHLFYLDPHQTRPALP 92
Query: 258 -QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+NI Y E+ TYH + R+HI MDPS+ +
Sbjct: 93 QRNIDDPYTDEE------IETYHTRRLRRIHIRDMDPSMLI 127
>gi|167385012|ref|XP_001737178.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900129|gb|EDR26546.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 348
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 97/241 (40%), Gaps = 61/241 (25%)
Query: 19 ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEE 78
+TS ++F YR F + ++ L +D GWGC +R QM++A A++ L G D N+N K
Sbjct: 84 LTSLIYFVYRSNFSALPNTSLKSDGGWGCTIRACQMLLANAIIKL-FGSD---NINRK-- 137
Query: 79 AYLKILKMFED--RRTAPYSIHQIALTG-ASEGKAVGEWFGP-NTVAQVLRKLAKYDDWS 134
++ F D PYSIH + T G G F P + V L +L D
Sbjct: 138 ---TVIHWFLDFYNVECPYSIHSLFTTQIIVSGNPNGSSFLPLSVVTYALTELVNKDLNR 194
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
HV + N ++N + K P ++ IP
Sbjct: 195 IFECHV-ITNKFLLNSINK---------------PTIIFIP------------------- 219
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+ +P ++ I F+F G++GG A YF G + ++FLDP
Sbjct: 220 FTIPDEFNQRLISI-------------FSFNLFAGMVGGCKQKAFYFFGIHHDQLLFLDP 266
Query: 255 H 255
H
Sbjct: 267 H 267
>gi|146093458|ref|XP_001466840.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
gi|134071204|emb|CAM69889.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania infantum
JPCM5]
Length = 394
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 105/255 (41%), Gaps = 54/255 (21%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
S ++LE+ D + L FTYR GF +P + TD+GWGC+LR QM++A L++H
Sbjct: 33 SREELEKALTD--TFLIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAH-FLWVH- 88
Query: 66 GRDWQWNVNSKEEAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
GR + L +F D TAP+SIH + + + E++ P+ +
Sbjct: 89 GRPAD-----------RRLSLFFDHSAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGCEA 137
Query: 124 LRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI--- 180
+++ T + A Q Q V+V+ G
Sbjct: 138 IKR------------------------------TVQGAVKTEQLQTRVMVVTSTNGCIYA 167
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
++ + G L V ++ +Y +Q + PQ LGV+GG P + Y
Sbjct: 168 DEVQHTFKQGADVVLVLASVRVSAAAQLTQESY-LQIEKL-MEQPQCLGVVGGVPGRSYY 225
Query: 241 FIGYVGNDVIFLDPH 255
F + + +LDPH
Sbjct: 226 FFAHNQTQLFYLDPH 240
>gi|71000771|ref|XP_755067.1| autophagy cysteine endopeptidase Atg4 [Aspergillus fumigatus Af293]
gi|66852704|gb|EAL93029.1| autophagy cysteine endopeptidase Atg4, putative [Aspergillus
fumigatus Af293]
Length = 226
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 72/161 (44%), Gaps = 42/161 (26%)
Query: 143 DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
D+T V + K L N S ++P +++I RLGI I PVY + +K LP
Sbjct: 3 DDTADVYEDKFLDAANDGRGS---FRPTLILIGTRLGIDRITPVYWDAVKTTLQLP---- 55
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN----- 257
QS+G+ GG+P+ + YF+G G+ + +LDPH
Sbjct: 56 -----------------------QSVGIAGGRPSASHYFVGVQGSHLFYLDPHQTRPALP 92
Query: 258 -QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+NI Y E+ TYH + R+HI MDPS+ +
Sbjct: 93 QRNIDDPYTDEE------IETYHTRRLRRIHIRDMDPSMLI 127
>gi|72389991|ref|XP_845290.1| AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359288|gb|AAX79730.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei]
gi|70801825|gb|AAZ11731.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
Length = 327
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 102/236 (43%), Gaps = 45/236 (19%)
Query: 25 FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
FTYR+ F P+ S LT+DKGWGC+ R QM++A +L +S ++ L+
Sbjct: 46 FTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL-----------RRHSAQDCKLQYF 94
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVGE-WFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
+D + AP+S+H + +G+++ ++ P+ + + K I L
Sbjct: 95 ADLDDEQVAPFSLHCMVRHILKQGESLRPVYWAPSQGCEAISGCVKRATERGI-----LS 149
Query: 144 NTL-VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
+ L VV V + + + + ++++ PLR G
Sbjct: 150 SPLSVVITVAGAVPAEEVSCHLKESRNVLILAPLRCGASR-------------------- 189
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND-VIFLDPHTN 257
Y K+ S ++ P+S+G++GG PN Y IG + +++LDPH
Sbjct: 190 YMSQKMFLSLEHL------LLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCK 239
>gi|398019156|ref|XP_003862742.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
gi|322500973|emb|CBZ36050.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Leishmania donovani]
Length = 394
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 105/255 (41%), Gaps = 54/255 (21%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
S ++LE+ D + L FTYR GF +P + TD+GWGC+LR QM++A L++H
Sbjct: 33 SREELEKALTD--TFLIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAH-FLWVH- 88
Query: 66 GRDWQWNVNSKEEAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
GR + L +F D TAP+SIH + + + E++ P+ +
Sbjct: 89 GRPAD-----------RRLSLFFDHSAETAPFSIHNMIRSLWNRRAFKAEYWSPSQGCEA 137
Query: 124 LRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI--- 180
+++ T + A Q Q V+V+ G
Sbjct: 138 IKR------------------------------TVQGAVKTEQLQTRVMVVTSANGCIYA 167
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
++ + G L V ++ +Y +Q + PQ LGV+GG P + Y
Sbjct: 168 DEVQHTFKQGADVVLVLASVRVSAAAQLTQESY-LQIEKL-MEQPQCLGVVGGVPGRSYY 225
Query: 241 FIGYVGNDVIFLDPH 255
F + + +LDPH
Sbjct: 226 FFAHNQTQLFYLDPH 240
>gi|326665689|ref|XP_002661113.2| PREDICTED: cysteine protease ATG4D-like, partial [Danio rerio]
Length = 149
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/53 (54%), Positives = 38/53 (71%), Gaps = 1/53 (1%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
E RR +S LW +YR+GF P+ S L++D GWGCMLR QM++AQ LL LH+
Sbjct: 95 ESFRRVFSSLLWMSYRRGFRPLDGSTLSSDAGWGCMLRSAQMLLAQGLL-LHI 146
>gi|261328682|emb|CBH11660.1| AUT2/APG4/ATG4 cysteine peptidase, putative [Trypanosoma brucei
gambiense DAL972]
Length = 327
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 104/236 (44%), Gaps = 45/236 (19%)
Query: 25 FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
FTYR+ F P+ S LT+DKGWGC+ R QM++A +L +S ++ L+
Sbjct: 46 FTYRRYFDPLPYSTLTSDKGWGCLARATQMLLACSL-----------RRHSAQDCKLQYF 94
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVGE-WFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
+D + AP+S+H + +G+++ ++ P+ + + K I L
Sbjct: 95 ADLDDEQVAPFSLHCMVRHILKQGESLRPVYWAPSQGCEAISGCVKRATERGI-----LS 149
Query: 144 NTL-VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPV 202
+ L VV V + + + + ++++ PLR G +C +
Sbjct: 150 SPLSVVITVAGAVPAEEVSCHLKESRNVLILAPLRC-----------GASRCMS------ 192
Query: 203 YDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND-VIFLDPHTN 257
K+ S ++ P+S+G++GG PN Y IG + +++LDPH
Sbjct: 193 ---QKMFLSLEHL------LLAPESVGMVGGVPNRGYYIIGTGAQELLLYLDPHCK 239
>gi|157872135|ref|XP_001684616.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
gi|68127686|emb|CAJ05824.1| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania major strain
Friedlin]
Length = 394
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 105/255 (41%), Gaps = 54/255 (21%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHL 65
S ++LE+ D + L FTYR GF +P + TD+GWGC+LR QM++A L++H
Sbjct: 33 SREELEKALTD--TFLIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAH-FLWVH- 88
Query: 66 GRDWQWNVNSKEEAYLKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQV 123
GR + L +F D TAP+SIH + + + E++ P+ +
Sbjct: 89 GRPAD-----------RKLSLFFDHSAETAPFSIHNMIRSVWNRRVFKAEYWSPSQGCEA 137
Query: 124 LRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI--- 180
+++ T + A Q Q V+V+ G
Sbjct: 138 IKR------------------------------TVQGAVKTEQLQTRVMVVTSTNGCIYA 167
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
++ + G L V ++ +Y +Q + PQ LGV+GG P + Y
Sbjct: 168 DEVQHTFKQGADVVLVLASVRVSAAAQLTQESY-LQIEKL-MEQPQCLGVVGGVPGRSYY 225
Query: 241 FIGYVGNDVIFLDPH 255
F + + +LDPH
Sbjct: 226 FFAHNQTQLFYLDPH 240
>gi|123497568|ref|XP_001327207.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
gi|121910133|gb|EAY14984.1| Clan CA, family C54, ATG4-like cysteine peptidase [Trichomonas
vaginalis G3]
Length = 296
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/247 (23%), Positives = 101/247 (40%), Gaps = 49/247 (19%)
Query: 25 FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALL-FLHLGRDWQWNVNSKEEAYLKI 83
FTYR F I +T+D GWGC R Q +IA L + + ++ + V ++ + +
Sbjct: 30 FTYRCNFQAIQPGNITSDSGWGCCYRSAQGLIASYFLNYAPVDAEYFFTVFNE----IPM 85
Query: 84 LKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
+FEDR P+SI + G G W P+ +A + + K S + ++ D
Sbjct: 86 FSLFEDRVEMPFSIQNLVYRSELFGVKPGTWAKPSQLAATIESIFKDLKLSVL---ISKD 142
Query: 144 NTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVY 203
+ ++ VK + RA + LG++D+ +I IK
Sbjct: 143 SNIIPEDVKTM-----RAPFLLLIP-------ILLGMKDVEQKFIPFIK----------- 179
Query: 204 DMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGN-DVIFLDPH-TNQNIG 261
Y F P+ LG + G + + + +G + +V++ DPH T Q +
Sbjct: 180 ----------------YTFQRPEFLGAVSGSSDFSYFLVGLSEDQNVVYFDPHVTKQAVA 223
Query: 262 CVYDKEQ 268
+D +
Sbjct: 224 SSFDHSE 230
>gi|389602150|ref|XP_001566661.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322505338|emb|CAM40177.2| putative AUT2/APG4/ATG4 cysteine peptidase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 398
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 99/250 (39%), Gaps = 52/250 (20%)
Query: 23 LWFTYRKGF--VPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAY 80
L FTYR GF +P + TD+GWGC+LR QM++A L W + +
Sbjct: 46 LIFTYRDGFEAIPAVTRLIETDQGWGCLLRTSQMLLAHFL--------WAYGRPAD---- 93
Query: 81 LKILKMFEDR--RTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVF 138
+ L +F D TAP+SIH + + ++ E++ P+ + +++
Sbjct: 94 -RRLALFFDHSAETAPFSIHNLIRSVWNQRAFKAEYWSPSQGCEAIKR------------ 140
Query: 139 HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLG---IQDINPVYINGIKKCY 195
T + A Q Q V V+ G +++ + G +
Sbjct: 141 ------------------TMQDAIKTEQLQTRVTVVTSTNGCVYADEVHHTFKQGAEVVL 182
Query: 196 ALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
L V ++ +Y +Q + PQ LG++GG P + YF + + +LDPH
Sbjct: 183 VLASVRVSAAAQLTQESY-LQIEKL-MEQPQCLGIVGGVPGRSYYFFAHNQTQLFYLDPH 240
Query: 256 TNQNIGCVYD 265
+ D
Sbjct: 241 QRTTAALLSD 250
>gi|440301471|gb|ELP93857.1| hypothetical protein EIN_176840 [Entamoeba invadens IP1]
Length = 362
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 102/247 (41%), Gaps = 50/247 (20%)
Query: 19 ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN---S 75
+++ LW TYR G+ + +S L TD GWGC +R QM+I+ A+ L D +
Sbjct: 75 MSNLLWMTYRSGYEKLPNSSLNTDVGWGCTIRAVQMMISNAMQTLVYKHDLTSSTTPYIP 134
Query: 76 KEEAYLKILKMFED--RRTAPYSIHQIALTG--ASEGKAVGEWFGPNTVAQVLRKLAKYD 131
K+ L ++ F D +T P SIH + + + K+ + P VA+ L +
Sbjct: 135 KQNEILNVVIPFVDFFEQTTPLSIHHVYESRFVVEQNKSGVNYLAPTIVAKAYSDLV--N 192
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGI 191
W AL + N LC K ++P ++ +P+ + + + + +
Sbjct: 193 SWK----MCALRCVMASNTSIPLCDIKKEP-----FKPTLVFLPIIM-----DQLVKSRL 238
Query: 192 KKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIF 251
++ Y +NM G++ G + A+Y G+ +F
Sbjct: 239 QQIYK----------------FNMFA-----------GIVSGIGDRAVYIFGFHVMRCLF 271
Query: 252 LDPHTNQ 258
LDPHT Q
Sbjct: 272 LDPHTVQ 278
>gi|118378680|ref|XP_001022514.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89304281|gb|EAS02269.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 371
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 56/259 (21%), Positives = 110/259 (42%), Gaps = 47/259 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIG--DSGLTTDKGWGCMLRCGQMVIAQAL---LFLHLGR 67
E ++ +++S ++ +Y+K + +TTD GWGC LR QM++AQ L L+ +
Sbjct: 48 ELLQEELSSLVFLSYKKNMKEFQYLSTTITTDNGWGCSLRTSQMMLAQGLKRHLYEKRVQ 107
Query: 68 DWQWNVNSKEEAYLKILKMF------EDRRTAPYSIHQIALTGASEGKA-VGEWFGPNTV 120
+ +N +K + + ++ MF E+ +P+ H + + + + + + P
Sbjct: 108 SFIYNDKTKLD-FQHLIMMFAESNSLENMDQSPFGFHSLLTQAINLFQVPLKQQYTPVQG 166
Query: 121 AQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGI 180
+ L++ K + + +T V+ Q + R + L+L++ +LG
Sbjct: 167 IKALKQQFKQQKLVKSL-KIVTSSTGVIFQ------EDIRQKMKNWEKSLLLILHFKLGT 219
Query: 181 QDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALY 240
+N +Y+ IK L +G IGG N +L+
Sbjct: 220 GKLNQIYVEQIKSLMDLEY---------------------------FVGAIGGIKNKSLF 252
Query: 241 FIGYVGNDVIFLDPHTNQN 259
+GY+ + + LDPH QN
Sbjct: 253 MVGYMNDQFLSLDPHVQQN 271
>gi|340508254|gb|EGR34000.1| peptidase family c54 protein, putative [Ichthyophthirius
multifiliis]
Length = 209
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 63/126 (50%), Gaps = 11/126 (8%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDS-------GLTTDKGWGCMLRCGQMVIAQALLF 62
++ +++ + + +W TYR+ F P+ + +D GWGCM+R GQM +A+ L
Sbjct: 17 KNCKKLIENFKNIIWMTYRRNFFPLLHNTKDHKIQNYISDTGWGCMVRVGQMALAEGLRH 76
Query: 63 LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAV-GEWFGPNTVA 121
HL + ++ +A+L F D APYSI +I E + V G+W+ P +
Sbjct: 77 -HLQQKGIYDNKRIIQAFLD--NDFGDDNIAPYSIQKICKIAYKEFQLVPGQWYSPVRIC 133
Query: 122 QVLRKL 127
VL L
Sbjct: 134 HVLSLL 139
>gi|294953189|ref|XP_002787639.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239902663|gb|EER19435.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 341
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 56/109 (51%), Gaps = 15/109 (13%)
Query: 25 FTYRKGFVPI-GDSGLTT--DKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYL 81
FTYR F PI G G T+ DKGWGC +R QM++AQA+ G+D +V
Sbjct: 69 FTYRCAFEPIEGCVGPTSVSDKGWGCAIRATQMLLAQAVKM--AGKDADDSV-------- 118
Query: 82 KILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAK 129
+L +F D AP S+H++ G K G WFGP + V +L K
Sbjct: 119 -VLSLFLDSPQAPLSLHRMVKMGQEVLAKRPGTWFGPTSGGMVASRLVK 166
>gi|154281231|ref|XP_001541428.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150411607|gb|EDN06995.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 463
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 63/138 (45%), Gaps = 28/138 (20%)
Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
A ++ P ++++ +RLGI + PVY +K
Sbjct: 249 AGTSTDVHPTLILLGIRLGIDRVTPVYWEALKAV-------------------------- 282
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK-EQDSEKKLDSTYH 279
+PQS+G+ GG+P+ + YFIG + +LDPH + +D ++ + +TYH
Sbjct: 283 -LKYPQSVGIAGGRPSSSHYFIGAQASHFFYLDPHHTRPALAYHDAGDRPYTTEELNTYH 341
Query: 280 CPQASRLHILHMDPSIAV 297
+ RLHI MDPS+ +
Sbjct: 342 TRRLRRLHIKDMDPSMLI 359
>gi|342181415|emb|CCC90894.1| putative AUT2/APG4/ATG4 cysteine peptidase [Trypanosoma congolense
IL3000]
Length = 327
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 101/239 (42%), Gaps = 53/239 (22%)
Query: 25 FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
FTYRK F P+ S +TTDKGWGC+ R QM++A AL H+ D+ +
Sbjct: 46 FTYRKDFEPLPRSVITTDKGWGCLARASQMLLACALR-RHMALDFSFQYFCD-------- 96
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVGEWF-----GPNTVAQVLRK-LAKYDDWSSIVF 138
+D R AP+S+H + + G+ + + G ++ +R+ + + S +
Sbjct: 97 --IDDERIAPFSLHCMVRSVLRPGEDLRPVYWTPSQGCEAISGCVRRAIHRGALHSQLRV 154
Query: 139 HVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALP 198
V + ++V + + A ++++P+R G +K +
Sbjct: 155 VVGAAGAIPKHEVNRHLEDSGNA---------LILVPVRCGTTR------RMTQKMF--- 196
Query: 199 ISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND-VIFLDPHT 256
LS + + T P +G++GG P Y IG G + +++LDPH
Sbjct: 197 ----------LSLEHLLLT-------PMCVGMVGGVPGRCYYIIGTGGQELLLYLDPHC 238
>gi|195350255|ref|XP_002041656.1| GM16787 [Drosophila sechellia]
gi|194123429|gb|EDW45472.1| GM16787 [Drosophila sechellia]
Length = 135
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/37 (72%), Positives = 30/37 (81%)
Query: 10 QDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWG 46
Q+LE IRRDI SRLW TYR GF P+G+ LTTDKGWG
Sbjct: 61 QELELIRRDIQSRLWCTYRHGFSPLGEVQLTTDKGWG 97
>gi|167381603|ref|XP_001735783.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165902089|gb|EDR28003.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 359
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 66/285 (23%), Positives = 110/285 (38%), Gaps = 60/285 (21%)
Query: 19 ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL----HLGRDWQWNVN 74
I++ W TYR G+ + +S LTTD GWGC +R QM+IA A+ + L +
Sbjct: 75 ISNLFWMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIP 134
Query: 75 SKEEAYLKILKMFED--RRTAPYSIHQIALTG--ASEGKAVGEWFGPNTVAQVLRKLAKY 130
+KEE + +L F D T P SIH + + + K+ + P+ VA+ L
Sbjct: 135 TKEEI-MNVLVPFIDSPNSTTPLSIHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLV-- 191
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
+ W KLC SN + IP
Sbjct: 192 NSW-------------------KLCPIRCVMCSN-------VSIPTH------------- 212
Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
+ LP P + I+ + + + G++GG + A++ G+ +
Sbjct: 213 --ELSKLPFKPTLVFLPIVLNHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGFHALQFL 270
Query: 251 FLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP-QASRLHILHMDPS 294
+LDPH Q S ++D+ + P +R + +DP+
Sbjct: 271 YLDPHIVQ-------PSFKSFTEIDTKSYSPISTNRFSVHTIDPT 308
>gi|118378678|ref|XP_001022513.1| Peptidase family C54 containing protein [Tetrahymena thermophila]
gi|89304280|gb|EAS02268.1| Peptidase family C54 containing protein [Tetrahymena thermophila
SB210]
Length = 649
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 123/321 (38%), Gaps = 83/321 (25%)
Query: 23 LWFTYRKGFVPIGD-----SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD----WQWNV 73
+WF+YR F I D ++ D GWGCM+RC QM++A+AL +L Q +
Sbjct: 145 IWFSYRNNFPLIRDVADDNQSVSNDYGWGCMIRCSQMLLAEALKRHYLNDQNIQIEQLSQ 204
Query: 74 NSKEEAYLKILKMF------EDRRTAPYSIHQIA---LTGASEGKAVGEWFGPNTVAQ-- 122
+ ++ Y I+K+F D P S I L + FG + Q
Sbjct: 205 DDEKHFYSNIIKLFLDCTSESDVLNQPGSYQDIQSKMLLNEQNLNNIYSLFGIQNICQSA 264
Query: 123 VLRKLAK-YDDW-SSIVFHVALDNTLVVNQ--------------------VKKLCTTNKR 160
+LR+ + +W +SI V L L +Q +K+L +++
Sbjct: 265 ILRQYQQNVKNWYTSIQVSVILQEILEESQSKLNSKLGFHILNFTDQIIFLKELEEASRK 324
Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
N + +++++ L+ GI + K Y + + + + V
Sbjct: 325 Q--NDRLNNILVMVHLKFGINKFEMQH-----KDYFIELLKIKNFV-------------- 363
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTY-- 278
G + G +Y IG+ + +I LDPH Q K + E+ LD Y
Sbjct: 364 --------GALSGTETKGMYIIGFQEDRLIVLDPHFIQ-------KSTEGEQGLDKDYCT 408
Query: 279 ---HCPQASRLHILHMDPSIA 296
P++ L L D S+
Sbjct: 409 YFNKTPRSISLECLSSDISLG 429
>gi|209880175|ref|XP_002141527.1| peptidase family C54 [Cryptosporidium muris RN66]
gi|209557133|gb|EEA07178.1| peptidase family C54, putative [Cryptosporidium muris RN66]
Length = 353
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 60/136 (44%), Gaps = 17/136 (12%)
Query: 4 ANKLSHQDLEQIRRDITSRLWFTYRKGFVPIGD---------SGLTTDKGWGCMLRCGQM 54
NK + + + F+YR F I S +TTD GWGCMLR QM
Sbjct: 36 GNKYDADKYDSFLKSFHEIILFSYRYNFPTIRSEWDFSIETGSSVTTDLGWGCMLRVIQM 95
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-EGKAVGE 113
+A LL + + ++++ IL+ F+D + +SIHQ G S K +
Sbjct: 96 SLALGLLRYCKMKKYTYSLD-------YILQNFQDLEESLFSIHQFVKVGCSIFNKKPKD 148
Query: 114 WFGPNTVAQVLRKLAK 129
WFGP + + + L K
Sbjct: 149 WFGPTSASTIADYLVK 164
>gi|343472883|emb|CCD15086.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 327
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 60/235 (25%), Positives = 98/235 (41%), Gaps = 47/235 (20%)
Query: 25 FTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKIL 84
FTYRK F P+ S +TTDKGWGC+ R QM++A AL H+ D+ +
Sbjct: 46 FTYRKDFEPLPRSVITTDKGWGCLARASQMLLACALR-RHMTLDFSFQYFCD-------- 96
Query: 85 KMFEDRRTAPYSIHQIALTGASEGKAVGE-WFGPNTVAQVLRKLAKYDDWSSIVFHVALD 143
+D R AP+S+H + + G+ + ++ P+ + + + S + AL
Sbjct: 97 --IDDERIAPFSLHCMVRSVLRPGEDLRPVYWTPSQGCEAISGCVR-----SAIHRGALH 149
Query: 144 NTL--VVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISP 201
+ L VV + L+LV P+R G +K +
Sbjct: 150 SQLRVVVGAAGAIPKHEVNRHLEDSGNALILV-PVRCGTTR------RMTQKMF------ 196
Query: 202 VYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGND-VIFLDPH 255
LS + + T P +G++GG P Y +G G + +++LDPH
Sbjct: 197 -------LSLEHLLLT-------PMCVGMVGGVPGRCYYIVGTGGQELLLYLDPH 237
>gi|66359342|ref|XP_626849.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
gi|46228139|gb|EAK89038.1| possible peptidase family C54 [Cryptosporidium parvum Iowa II]
Length = 348
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 68/151 (45%), Gaps = 22/151 (14%)
Query: 5 NKLSHQDLEQIRRDITSRLWFTYRKGFVPI------------GDSGLTTDKGWGCMLRCG 52
NK + ++ + ++ + FTYR F I + +D GWGCM R
Sbjct: 34 NKFAPEEKKYFLKEFHDIILFTYRNEFKNIIITRNTVQLTKNYSKNINSDVGWGCMYRVT 93
Query: 53 QMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAV 111
QM IA + + + N+N + KIL F+D +A +SIH + G SE G
Sbjct: 94 QMSIAHGI--CQFMKRFLGNLNIE-----KILNNFQDNESAKFSIHNMVNIGLSEFGIDP 146
Query: 112 GEWFGPNTVAQVLRKLAKYDDWSSIVFHVAL 142
W GP T + + KL +D SI+ ++ +
Sbjct: 147 TSWIGPTTSSMIANKLI--NDNRSIISNIQI 175
>gi|457866467|dbj|BAM93578.1| autophagy related protein 4 [Vigna unguiculata]
Length = 219
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 44/86 (51%), Gaps = 8/86 (9%)
Query: 218 PRY------EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSE 271
PRY F FPQSLG++GGKP + Y IG +LDPH Q + + Q E
Sbjct: 57 PRYIPLLHSTFKFPQSLGIMGGKPGASTYIIGVQSEKAFYLDPHDVQTVVNISGDTQ--E 114
Query: 272 KKLDSTYHCPQASRLHILHMDPSIAV 297
S+YHC + + +DPS+A+
Sbjct: 115 PNSTSSYHCNVMRHIPLDSIDPSLAI 140
>gi|183230042|ref|XP_653798.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169803042|gb|EAL48412.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449708555|gb|EMD47997.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 359
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 65/285 (22%), Positives = 110/285 (38%), Gaps = 60/285 (21%)
Query: 19 ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL----HLGRDWQWNVN 74
I++ W TYR G+ + +S LTTD GWGC +R QM+IA A+ + L +
Sbjct: 75 ISNLFWMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIP 134
Query: 75 SKEEAYLKILKMFED--RRTAPYSIHQIALTG--ASEGKAVGEWFGPNTVAQVLRKLAKY 130
+K+E + +L F D T P SIH + + + K+ + P+ VA+ L
Sbjct: 135 TKQEV-MNVLIPFIDSPNSTTPLSIHHVYESRFVVEKNKSGVNYLAPSVVAKAYSGLV-- 191
Query: 131 DDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYING 190
+ W KLC SN + IP
Sbjct: 192 NSW-------------------KLCPIRCVMCSN-------VSIPTH------------- 212
Query: 191 IKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVI 250
+ LP P + I+ + + + G++GG + A++ G+ +
Sbjct: 213 --ELSKLPFKPTLVFLPIVLNHLIHSKLQQIYKSKLFAGIVGGMGDRAIFVFGFHALQFL 270
Query: 251 FLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS-RLHILHMDPS 294
+LDPH Q S ++D+ + P + R + +DP+
Sbjct: 271 YLDPHIVQ-------PSFKSFTEIDTKSYSPIGTNRFSVHTIDPT 308
>gi|50291183|ref|XP_448024.1| hypothetical protein [Candida glabrata CBS 138]
gi|62899752|sp|Q6FP20.1|ATG4_CANGA RecName: Full=Probable cysteine protease ATG4; AltName:
Full=Autophagy-related protein 4
gi|49527335|emb|CAG60975.1| unnamed protein product [Candida glabrata]
Length = 483
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 62/147 (42%), Gaps = 31/147 (21%)
Query: 13 EQIRRDITSRLWFTYRKGFVPI-----GDSGLT------------------------TDK 43
E+I I S+L FTYR F PI G S + TD
Sbjct: 78 EEILNAIRSKLNFTYRTNFEPIERAPDGPSPINPLIMLRINPIDAIENVFNNRECFFTDV 137
Query: 44 GWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALT 103
GWGCM+R GQ ++ AL + Q + ++ +I +F+D + +S+
Sbjct: 138 GWGCMIRTGQSLLGNALQRVKSTVKDQPYIYEMDDTK-EITDLFKDNTKSAFSLQNFVKC 196
Query: 104 GASEGK-AVGEWFGPNTVAQVLRKLAK 129
G K A GEWFGP T A +R L +
Sbjct: 197 GRIYNKIAPGEWFGPATTATCIRYLIQ 223
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 9/71 (12%)
Query: 225 PQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQAS 284
P S+G+ GG+P+ +LYF GY + ++F DPH +Q + D D + H
Sbjct: 287 PFSVGIAGGRPSSSLYFFGYFDDTLLFFDPHNSQ-TALIDD--------FDESCHTENFG 337
Query: 285 RLHILHMDPSI 295
+L+ +DPS+
Sbjct: 338 KLNFSDLDPSM 348
>gi|312381461|gb|EFR27207.1| hypothetical protein AND_06241 [Anopheles darlingi]
Length = 307
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCG 52
+ +D+E RRD SR+W TYR+ F + DS T+D GWGCM+ G
Sbjct: 190 VEEEDIEAFRRDFVSRIWMTYRREFQTMDDSNYTSDCGWGCMIPAG 235
>gi|440300801|gb|ELP93248.1| hypothetical protein EIN_056230 [Entamoeba invadens IP1]
Length = 321
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/238 (23%), Positives = 94/238 (39%), Gaps = 55/238 (23%)
Query: 26 TYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLK--I 83
TYR+ + +G + LT+D GWGC +R QM++ +++ ++L + + S + +K
Sbjct: 67 TYRQKYATLGHTYLTSDAGWGCAIRSVQMLLVNSIV-VYLDKSFHPEYTSHDHIAIKNNA 125
Query: 84 LKMFEDRRTAPYSIHQIALTGA--SEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVA 141
++ D+ ++ SIH I + A + P+T A + L Y+ W F V
Sbjct: 126 KQLVFDKESSVLSIHNIYIQDAIIKHNPTGTNFLPPSTCATAVADL--YNFWEKRTFDVL 183
Query: 142 LDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISP 201
+CT + QP +L IP + + N +
Sbjct: 184 ------------MCTEYIPEVT----QPTLLFIPRIVTKSERNFI--------------- 212
Query: 202 VYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
QT + PQS G + G + A+Y G V FLDPH Q+
Sbjct: 213 --------------QTTSF---LPQSRGFVAGIGDAAIYCFGVQEKRVFFLDPHFVQD 253
>gi|407037201|gb|EKE38550.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 193
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 57/112 (50%), Gaps = 9/112 (8%)
Query: 19 ITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL----HLGRDWQWNVN 74
I++ W TYR G+ + +S LTTD GWGC +R QM+IA A+ + L +
Sbjct: 75 ISNLFWMTYRSGYEKLPNSSLTTDVGWGCTIRAMQMMIANAMETIVYSGALNNTQTPYIP 134
Query: 75 SKEEAYLKILKMFED--RRTAPYSIHQIALTG--ASEGKAVGEWFGPNTVAQ 122
+K+E + +L F D T P SIH + + + K+ + P+ VA+
Sbjct: 135 TKQEV-MNVLIPFIDSPNSTTPLSIHHVYESRFVVEKNKSGVNYLAPSVVAK 185
>gi|403222100|dbj|BAM40232.1| autophagy-related peptidase [Theileria orientalis strain Shintoku]
Length = 351
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 65/232 (28%), Positives = 98/232 (42%), Gaps = 40/232 (17%)
Query: 41 TDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN---VNSKEEAYLKILKMFEDRRTAPYSI 97
TDKGWGC++R QM +AQAL+ L +G ++ VN+K RT
Sbjct: 65 TDKGWGCVVRSTQMALAQALINLIIGPEFSMEDLLVNNKSP------------RTGHLDA 112
Query: 98 HQIALTGASEGKAVGEWFGPNTVAQVLRKLA-----KYDDWSSIVFHVALDNTLVVNQVK 152
++L G + + + A L KL+ YDD SS+ +L N +V + V
Sbjct: 113 KLLSLDG------LQQLLTEESHADELTKLSIILSQFYDDKSSL---FSLYNFIVADLVL 163
Query: 153 KLCTTNKRASSNPQWQPLVL---VIPLRLGIQDIN----PVYINGIKKCYAL-PISPVYD 204
K CT K S P + + + + I I+ YIN +K + V+
Sbjct: 164 KTCT--KFLSFGPTSTAVCISKVINDANIAISSISFPDGVFYINKVKDLFEKNKYLLVWV 221
Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFI-GYVGNDVIFLDPH 255
+K Y +T R F Q G++GG H Y+I G + + DPH
Sbjct: 222 SMKKKLDKYEKETVRSLFKLKQFNGIVGGNLLHRSYYIFGTSSKRLYYNDPH 273
>gi|238595999|ref|XP_002393933.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
gi|215462138|gb|EEB94863.1| hypothetical protein MPER_06258 [Moniliophthora perniciosa FA553]
Length = 158
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 41/78 (52%), Gaps = 27/78 (34%)
Query: 178 LGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNH 237
LG+ +NP+Y YD +KIL +TFPQS+G+ GG+P+
Sbjct: 1 LGLDGVNPIY---------------YDTIKIL------------YTFPQSVGIAGGRPSS 33
Query: 238 ALYFIGYVGNDVIFLDPH 255
+ YF+G +++ +LDPH
Sbjct: 34 SYYFVGSQADNLFYLDPH 51
>gi|156085180|ref|XP_001610073.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797325|gb|EDO06505.1| hypothetical protein BBOV_II005540 [Babesia bovis]
Length = 206
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 6/78 (7%)
Query: 37 SGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYS 96
+ + TD+GWGC LR QM +A+AL RD +++ +E +IL++F D AP+S
Sbjct: 69 TDIKTDRGWGCALRATQMALAEAL------RDVLSPLDNVQEQRSRILQLFYDTTEAPFS 122
Query: 97 IHQIALTGASEGKAVGEW 114
+ + + G V W
Sbjct: 123 LENLVMADVEHGANVVAW 140
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 3/51 (5%)
Query: 207 KILSSTYNMQTPRYEFTFPQSLGVIGGKPN--HALYFIGYVGNDVIFLDPH 255
K LS + N + +Y FT P G++G K + A YF+G+ GN ++LDPH
Sbjct: 145 KELSESQN-ECLKYLFTLPWFKGMVGAKKDKQRAYYFVGHHGNQALYLDPH 194
>gi|426336111|ref|XP_004029547.1| PREDICTED: uncharacterized protein LOC101129491 [Gorilla gorilla
gorilla]
Length = 351
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 42/71 (59%), Gaps = 1/71 (1%)
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWS 134
+E + +I+ F D AP+ +H++ G S GK G+W+GP+ VA +LRK + + +
Sbjct: 50 QERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVT 109
Query: 135 SIVFHVALDNT 145
+V +V+ D T
Sbjct: 110 RLVVYVSQDCT 120
>gi|84994978|ref|XP_952211.1| autophagy-related peptidase [Theileria annulata strain Ankara]
gi|65302372|emb|CAI74479.1| autophagy-related peptidase, putative [Theileria annulata]
Length = 350
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 98/241 (40%), Gaps = 46/241 (19%)
Query: 35 GDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAP 94
G + +DKGWGC+LR QM I+QALL L LG ++ ++ E R P
Sbjct: 58 GIVTIDSDKGWGCVLRSTQMAISQALLNLVLGPEFS-------------VEQLEIRNRTP 104
Query: 95 YS--IHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVK 152
+ I Q L + K + + V+ V LA++ D + VF + N ++ + V
Sbjct: 105 RNRKIDQSLLNIDTFEKLLNGLLDLDGVSAVSVILAQFYDDLNAVFSIY--NFVIADYVL 162
Query: 153 KLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALP--ISPVYDMVKILS 210
K CT + P + I D+N + IN I A P + + D+ +IL
Sbjct: 163 KTCT------KFLHFGPTSAALCASKIINDLN-LPINSI----AFPDGVFHISDVREILE 211
Query: 211 STYNM---------------QTPRYEFTFPQSLGVIGGKP-NHALYFIGYVGNDVIFLDP 254
N+ + R F Q G+IGG N + Y G + + DP
Sbjct: 212 EKRNLLVWVSNKKKLDRIERECVRSMFRLSQFNGIIGGNLFNKSYYIFGTTNKRLYYNDP 271
Query: 255 H 255
H
Sbjct: 272 H 272
>gi|224010768|ref|XP_002294341.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969836|gb|EED88175.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 658
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 82/215 (38%), Gaps = 74/215 (34%)
Query: 95 YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY------------------------ 130
YS+HQ+ G G GEW+GP T VLR+L +
Sbjct: 243 YSLHQMVAAGLGLGVLPGEWYGPTTACHVLRELNEIHCGCRERVAEVLKRRRKGGDKGDI 302
Query: 131 -------DD--WSSIVF--HVALDNTLVVNQVKKLCTTNKRA----------SSNPQWQP 169
DD ++ VF H+A + + ++ + KL T++ ++ N
Sbjct: 303 DEHNHVGDDSQYTCDVFRVHIATEGCIYLDAISKLMTSSNQSLQTESNDAPIQHNTDSAA 362
Query: 170 LVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQ------------- 216
V+ PL L + +P+ A + D +IL+ ++
Sbjct: 363 NVIDHPLSLPEEVFDPLR--------AQVTTQSSDKEQILNQQWDTSLLLLLPLRLGIQS 414
Query: 217 --TPRYEFT------FPQSLGVIGGKPNHALYFIG 243
TP Y T FPQS+G++GG P HAL+F G
Sbjct: 415 IPTPTYGSTLAKLLSFPQSVGMLGGTPRHALWFYG 449
Score = 44.3 bits (103), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 5/42 (11%)
Query: 24 WFTYRKGF-VPI----GDSGLTTDKGWGCMLRCGQMVIAQAL 60
W TYR VP+ G GL +D GWGCMLR QM++AQ +
Sbjct: 113 WLTYRSDLTVPLRPYNGGVGLKSDAGWGCMLRSAQMMMAQTV 154
>gi|14043289|gb|AAH07639.1| ATG4D protein [Homo sapiens]
gi|16877152|gb|AAH16845.1| ATG4D protein [Homo sapiens]
gi|119604522|gb|EAW84116.1| ATG4 autophagy related 4 homolog D (S. cerevisiae), isoform CRA_a
[Homo sapiens]
gi|325464017|gb|ADZ15779.1| ATG4 autophagy related 4 homolog D (S. cerevisiae) [synthetic
construct]
Length = 141
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 5/67 (7%)
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILH 290
+GGKP H+LYFIGY + +++LDPH Q D Q ++ L+S +HC ++
Sbjct: 1 MGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQ-ADFPLES-FHCTSPRKMAFAK 55
Query: 291 MDPSIAV 297
MDPS V
Sbjct: 56 MDPSCTV 62
>gi|408392897|gb|EKJ72185.1| hypothetical protein FPSE_07642 [Fusarium pseudograminearum CS3096]
Length = 389
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 34/73 (46%), Gaps = 23/73 (31%)
Query: 18 DITSRLWFTYRKGFVPI----------------------GD-SGLTTDKGWGCMLRCGQM 54
D S++W TYR F PI GD S ++D GWGCM+R GQ
Sbjct: 120 DFESKIWMTYRSEFEPIPRSTNPQATSALSLSMRLKSQLGDQSPFSSDSGWGCMIRSGQS 179
Query: 55 VIAQALLFLHLGR 67
++A + + LGR
Sbjct: 180 MLANTIAMVRLGR 192
Score = 41.6 bits (96), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 40/68 (58%), Gaps = 6/68 (8%)
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK---EQDSEKKLDSTYHCPQASRLHIL 289
G+P+ + YFIG G+ + +LDPH + + Y + E SE+ ++ H P+ R+H+
Sbjct: 262 GRPSSSHYFIGAQGSFLFYLDPH-HTRVALPYREDPIEYTSEEI--ASCHTPRLRRIHVR 318
Query: 290 HMDPSIAV 297
MDPS+ +
Sbjct: 319 EMDPSMLI 326
>gi|395756856|ref|XP_002834509.2| PREDICTED: cysteine protease ATG4D-like [Pongo abelii]
Length = 141
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 35/67 (52%), Gaps = 5/67 (7%)
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILH 290
+GGKP H+LYFIGY + +++LDPH Q V E ++HC ++
Sbjct: 1 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQANFPLE-----SFHCTSPRKMAFAK 55
Query: 291 MDPSIAV 297
MDPS V
Sbjct: 56 MDPSCTV 62
>gi|302833489|ref|XP_002948308.1| autophagy protein [Volvox carteri f. nagariensis]
gi|300266528|gb|EFJ50715.1| autophagy protein [Volvox carteri f. nagariensis]
Length = 391
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI---GCVYDKEQDSEKKLD--- 275
T+PQS+G++GG+P+ +LY G + +FLDPH Q G D E
Sbjct: 232 LTWPQSVGIVGGRPSASLYLCGVQDSSFLFLDPHEAQPTVRWGIAGDAGHTKEAGNGGSA 291
Query: 276 --------STYHCPQASRLHILHMDPSIAV 297
+TY C + +DPS+A+
Sbjct: 292 VVLPASSLATYFCDTVRLMPATALDPSMAI 321
>gi|407043625|gb|EKE42056.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 183
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/96 (26%), Positives = 48/96 (50%), Gaps = 2/96 (2%)
Query: 7 LSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL--H 64
L+H I + TYR+ + +G++ L++D GWGC +R QM++ AL+
Sbjct: 51 LNHLTFNDANLKIHDLIVATYRQKYSYLGNTYLSSDAGWGCAIRATQMMVVNALVIFKDQ 110
Query: 65 LGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI 100
+ + +N ++ + ++ DR ++ SIH I
Sbjct: 111 MQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHNI 146
>gi|340500608|gb|EGR27474.1| peptidase family c54 protein, putative [Ichthyophthirius
multifiliis]
Length = 384
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 86/214 (40%), Gaps = 43/214 (20%)
Query: 114 WFGPNTVAQVLRKL-------------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKR 160
W+ PN + +L KL KY ++F L +V +Q KLC N
Sbjct: 86 WYDPNRICFILEKLYNFSSIKGTENLKFKYFSNHKLIFFEDLIKLMVDSQA-KLCNQNIH 144
Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYING----------IKKCYALPISPVYDMVKILS 210
N Q Q L L I+D V KKC+ S + + L+
Sbjct: 145 ---NEQQQNLDLNNNSSQLIEDSFEVITKSSKQNTLDNLICKKCHQSDKSLLI-FISCLT 200
Query: 211 STYNMQTPRYEFTFPQ------SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
+T + + + S+G+IGG P A YF+G + ND I+LDPH Q
Sbjct: 201 NTNKISNKKQQEVVISLLKNQFSIGMIGGVPGKAYYFLGIIDNDFIYLDPHYIQ------ 254
Query: 265 DKEQDSEKKLDS--TYHCPQASRLHILHMDPSIA 296
+ +EK + + TY C +R+ ++ S+A
Sbjct: 255 -EAHQNEKTVQNIDTYFCKFINRVSQKKLESSLA 287
>gi|167391747|ref|XP_001739914.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165896205|gb|EDR23684.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 325
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 49/99 (49%), Gaps = 2/99 (2%)
Query: 6 KLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL-- 63
L+H I + TYR+ + +G++ L++D GWGC +R QM+I L+
Sbjct: 50 NLNHLTFNDANIKIHDLIVATYRQKYSCLGNTYLSSDAGWGCAIRATQMMIVNTLVIFKD 109
Query: 64 HLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIAL 102
+ + +N ++ L+ ++ D+ ++ SIH I +
Sbjct: 110 QMQQIIDYNSFEHQQNKLQAKELIYDKISSLLSIHNIYI 148
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 22/41 (53%)
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIG 261
T QS G +GG A++ GY G + FLDPH QN G
Sbjct: 219 SLTLSQSRGFVGGIGESAIFVFGYQGTTLFFLDPHYVQNAG 259
>gi|71030858|ref|XP_765071.1| hypothetical protein [Theileria parva strain Muguga]
gi|68352027|gb|EAN32788.1| hypothetical protein TP02_0505 [Theileria parva]
Length = 215
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 58/120 (48%), Gaps = 17/120 (14%)
Query: 39 LTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYS-- 96
+ TDKGWGC+LR QM I+QAL+ L LG ++ ++ E R +P +
Sbjct: 62 IDTDKGWGCVLRSTQMAISQALMNLVLGPEFS-------------VEQLEIRNRSPRNKK 108
Query: 97 IHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCT 156
I + L + K + + ++ V LA++ D + VF + N ++ + V K CT
Sbjct: 109 IDESILNLDTFEKLINGVVDLDEISAVSVILAQFYDDLNAVFSIY--NFIIADYVLKTCT 166
>gi|67470848|ref|XP_651386.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|56468115|gb|EAL46000.1| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
Length = 325
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 2/97 (2%)
Query: 6 KLSHQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFL-- 63
L+H I + TYR+ + +G++ L++D GWGC +R QM+I AL+
Sbjct: 50 NLNHLTFNDANIKIHDLIVATYRQKYSYLGNTYLSSDAGWGCAIRATQMMIVNALVIFKD 109
Query: 64 HLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI 100
+ + +N ++ + ++ DR ++ SIH I
Sbjct: 110 QMQQIVDYNSFEHQQNKSQAKELIYDRISSLLSIHNI 146
>gi|297601024|ref|NP_001050279.2| Os03g0391000 [Oryza sativa Japonica Group]
gi|255674556|dbj|BAF12193.2| Os03g0391000, partial [Oryza sativa Japonica Group]
Length = 81
Score = 45.8 bits (107), Expect = 0.026, Method: Composition-based stats.
Identities = 19/43 (44%), Positives = 26/43 (60%)
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
TFPQSLG++GGKP + Y G + ++LDPH Q + Y
Sbjct: 19 LTFPQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQLVKLYY 61
>gi|167386236|ref|XP_001737678.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165899448|gb|EDR26037.1| hypothetical protein EDI_014170 [Entamoeba dispar SAW760]
Length = 346
Score = 44.7 bits (104), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 62/153 (40%), Gaps = 29/153 (18%)
Query: 15 IRRDITSRLWFTYRKGFV-PIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
I + +++ TYR GF + LTTD GWGC LR QM+ +L+ L + N
Sbjct: 62 IAKHLSTMFRVTYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLIRLQ-----EPNP 116
Query: 74 NSKEEAYLKILKMF-----EDRRT---------------APYSIHQIALTGASEGKAVGE 113
E+A K+ K F E+RR + Y + + + + K
Sbjct: 117 GFGEDAAEKVQKNFIIHSMEERREYVQLIEDTPKQEAVLSLYKMFNLKIVRQNNQKGTN- 175
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTL 146
+ P+T A L +L + W HV NT
Sbjct: 176 YLSPSTCAIALSQLVEI--WDQRPCHVIYSNTF 206
>gi|429327650|gb|AFZ79410.1| hypothetical protein BEWA_022580 [Babesia equi]
Length = 385
Score = 44.7 bits (104), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 71/290 (24%), Positives = 109/290 (37%), Gaps = 62/290 (21%)
Query: 11 DLEQIRRDITSRLWFTYR-----KGFVPIG--------------------DSGLTTDKGW 45
+ + +R IT L FTYR K PIG + TDKGW
Sbjct: 46 EYRKFKRKITGILLFTYRSDLNYKVAKPIGLIKREHVIGIFKPFNVCLPSIQTIDTDKGW 105
Query: 46 GCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA 105
GC++R QM +AQ L+ L LG ++ KE D S + +G
Sbjct: 106 GCVIRATQMALAQTLISLILGDNFDIYSILKENT-------LPDSSGGAPSHRRSDRSGK 158
Query: 106 SEGKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFH--------VALDNTLVVNQVKKLCTT 157
SE N + + KYD + SI+ +L ++ + V K CT
Sbjct: 159 SEN-------FDNIITDGYQN--KYDAFCSILSQFYDSRESKFSLYKFIIADSVLKTCT- 208
Query: 158 NKRASSNPQWQPLVL-------VIPLRLGIQDINPVYINGIKKCYALPISPV--YDMVKI 208
K S P + + IPL+ YIN + K + + + + K
Sbjct: 209 -KFLSFGPTSSAICVNKMINDANIPLKSIAFPDGVFYINEVYKGFNKNRNVIVWLSLNKK 267
Query: 209 LSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFI-GYVGNDVIFLDPHTN 257
L + R F Q G++GG N+ Y+I G + + ++DPH +
Sbjct: 268 LDKNEKVAV-RSLFLLKQFNGIVGGNMNNRAYYICGCSSSRLYYVDPHVS 316
>gi|407038566|gb|EKE39191.1| peptidase, C54 family protein [Entamoeba nuttalli P19]
Length = 346
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 62/153 (40%), Gaps = 29/153 (18%)
Query: 15 IRRDITSRLWFTYRKGFV-PIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
+ + +++ TYR GF + LTTD GWGC LR QM+ +L+ L + N
Sbjct: 62 VAKHLSTMFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLIRLQ-----EPNP 116
Query: 74 NSKEEAYLKILKMF-----EDRRT---------------APYSIHQIALTGASEGKAVGE 113
E+A K+ + F E+RR + Y + + + + K
Sbjct: 117 GFGEDAAEKVQRNFIIHSMEERREYVQLIEDTPKQEAVLSLYKMFNLKIVRQNNQKGTN- 175
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTL 146
+ P+T A L +L + W HV NT
Sbjct: 176 YLSPSTCAIALSQLVEM--WDQRPCHVIYSNTF 206
>gi|294877403|ref|XP_002767983.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
gi|239870083|gb|EER00701.1| hypothetical protein Pmar_PMAR002136 [Perkinsus marinus ATCC 50983]
Length = 133
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 28/41 (68%), Gaps = 3/41 (7%)
Query: 23 LWFTYRKGFVPI-GDSGLTT--DKGWGCMLRCGQMVIAQAL 60
+ FTYR F PI G G T+ DKGWGC +R QM++AQA+
Sbjct: 67 ILFTYRCAFEPIEGCVGPTSVSDKGWGCAIRATQMLLAQAV 107
>gi|183234005|ref|XP_652043.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|169801304|gb|EAL46674.2| peptidase, C54 family [Entamoeba histolytica HM-1:IMSS]
gi|449707706|gb|EMD47317.1| peptidase C54 family protein [Entamoeba histolytica KU27]
Length = 346
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 67/148 (45%), Gaps = 19/148 (12%)
Query: 15 IRRDITSRLWFTYRKGFV-PIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH-----LGRD 68
I + +++ TYR GF + LTTD GWGC LR QM+ +L+ L G D
Sbjct: 62 IAKHLSTLFRITYRNGFTYHLPHCSLTTDAGWGCTLRSIQMLFLNSLIRLQEPNPGFGDD 121
Query: 69 ------WQWNVNSKEEAYLKILKMFED--RRTAPYSIHQI-ALTGASEGKAVG-EWFGPN 118
+ ++S EE + +++ ED ++ A S++++ L + G + P+
Sbjct: 122 AAEKVQQNFIIHSMEERR-EYVQLIEDTPKQEAVLSLYKMFNLKIVRQNNQKGTNYLSPS 180
Query: 119 TVAQVLRKLAKYDDWSSIVFHVALDNTL 146
T A L +L + W HV NT
Sbjct: 181 TCAIALSQLVEM--WDQRPCHVIYSNTF 206
>gi|440292697|gb|ELP85881.1| hypothetical protein EIN_133850 [Entamoeba invadens IP1]
Length = 348
Score = 42.7 bits (99), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 64/151 (42%), Gaps = 17/151 (11%)
Query: 11 DLEQIRRDITSRLWFTYRKGFV-PIGDSGLTTDKGWGCMLRCGQMVIAQALLFLH----- 64
D QI + +++ TYR GF + LTTD GWGC +R QM+ +L+ +
Sbjct: 60 DGSQIAKHLSTLFKVTYRNGFTYHLPHCSLTTDAGWGCTIRSVQMLFLNSLIRIQEPDPG 119
Query: 65 LGRDWQWNVNS-----KEEAYLKILKMFED--RRTAPYSIHQI-ALTGASEGKAVG-EWF 115
+D Q + + + +++ ED R+ A SIH++ L + G +
Sbjct: 120 FDKDSQTKMKKGFLVHPMDVRREYVQLIEDTPRKEAVLSIHKMFDLEVVRKNNQKGTNYL 179
Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTL 146
P+T A + L + W HV T
Sbjct: 180 SPSTCATAISVLM--EQWDERPCHVMFVQTF 208
>gi|422293936|gb|EKU21236.1| cysteine protease family, partial [Nannochloropsis gaditana
CCMP526]
Length = 91
Score = 41.2 bits (95), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 30/54 (55%), Gaps = 8/54 (14%)
Query: 80 YLKILKMFEDRRTAP-----YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
Y ++L F D AP +S+H + G S K GEW+GP TVA +LR LA
Sbjct: 23 YCQLLDSFVD---APGPNHVFSVHNMVQIGMSYDKLPGEWYGPTTVAYILRDLA 73
>gi|422295376|gb|EKU22675.1| cysteine protease family, partial [Nannochloropsis gaditana
CCMP526]
Length = 96
Score = 40.8 bits (94), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 30/54 (55%), Gaps = 8/54 (14%)
Query: 80 YLKILKMFEDRRTAP-----YSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA 128
Y ++L F D AP +S+H + G S K GEW+GP TVA +LR LA
Sbjct: 23 YCQLLDSFVD---APGPNHVFSVHNMVQIGMSYDKLPGEWYGPTTVAYILRDLA 73
>gi|401403014|ref|XP_003881388.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325115800|emb|CBZ51355.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 3465
Score = 40.4 bits (93), Expect = 1.00, Method: Composition-based stats.
Identities = 23/71 (32%), Positives = 34/71 (47%), Gaps = 17/71 (23%)
Query: 13 EQIRRDITSRLWFTYRKGFVPI----GDS-------------GLTTDKGWGCMLRCGQMV 55
+Q+ + + S FTYR GF P+ G+ + +D GWGC +R QM+
Sbjct: 941 QQLSQTVGSIARFTYRSGFSPMYKCCGEKKRRAGGGFEREWIAINSDVGWGCTVRAAQML 1000
Query: 56 IAQALLFLHLG 66
+ QAL LG
Sbjct: 1001 LMQALRRHFLG 1011
>gi|148682816|gb|EDL14763.1| mCG116861, isoform CRA_a [Mus musculus]
Length = 127
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 34/52 (65%), Gaps = 5/52 (9%)
Query: 247 NDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
+++IFLDPHT Q + ++S D T+HC Q+ R+ IL++DPS+A+
Sbjct: 1 DELIFLDPHTTQTFVDI----EESGLVDDQTFHCLQSPQRMSILNLDPSVAL 48
>gi|392343434|ref|XP_003754884.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
norvegicus]
gi|392355909|ref|XP_003752169.1| PREDICTED: cysteine protease ATG4A-like, partial [Rattus
norvegicus]
Length = 126
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 34/52 (65%), Gaps = 5/52 (9%)
Query: 247 NDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMDPSIAV 297
+++IFLDPHT Q + ++S D T+HC Q+ R+ IL++DPS+A+
Sbjct: 1 DELIFLDPHTTQ----TFVDTEESGLVDDHTFHCLQSPQRMSILNLDPSVAL 48
>gi|171318466|ref|ZP_02907620.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
ambifaria MEX-5]
gi|171096332|gb|EDT41235.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
ambifaria MEX-5]
Length = 189
Score = 38.1 bits (87), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 41/96 (42%), Gaps = 23/96 (23%)
Query: 46 GCMLRCGQMVIAQALLF-----LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ- 99
G +CG V+ LLF LHLG +W +V KE E RT PY + +
Sbjct: 71 GLQAQCGLAVLVAGLLFSVWARLHLGTNWSVSVTLKEN--------HELVRTGPYGLVRH 122
Query: 100 -------IALTGASEGKAVGEWFGPNTVAQVLRKLA 128
IAL GA+ GEW G VA V LA
Sbjct: 123 PIYTGCLIALAGAA--LIGGEWRGALGVALVFASLA 156
>gi|115359254|ref|YP_776392.1| isoprenylcysteine carboxyl methyltransferase [Burkholderia
ambifaria AMMD]
gi|115284542|gb|ABI90058.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
ambifaria AMMD]
Length = 189
Score = 38.1 bits (87), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 41/96 (42%), Gaps = 23/96 (23%)
Query: 46 GCMLRCGQMVIAQALLF-----LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ- 99
G +CG V+ LLF LHLG +W +V KE E RT PY + +
Sbjct: 71 GLQAQCGLAVLIAGLLFSVWARLHLGTNWSVSVTLKEN--------HELVRTGPYGLVRH 122
Query: 100 -------IALTGASEGKAVGEWFGPNTVAQVLRKLA 128
IAL GA+ GEW G VA V LA
Sbjct: 123 PIYTGCLIALVGAA--LIGGEWRGAFGVALVFASLA 156
>gi|170700470|ref|ZP_02891476.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
ambifaria IOP40-10]
gi|170134635|gb|EDT02957.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
ambifaria IOP40-10]
Length = 189
Score = 37.7 bits (86), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 41/96 (42%), Gaps = 23/96 (23%)
Query: 46 GCMLRCGQMVIAQALLF-----LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ- 99
G +CG V+ LLF LHLG +W +V KE E RT PY + +
Sbjct: 71 GLQAQCGLAVLIAGLLFSVWARLHLGTNWSVSVTLKEN--------HELVRTGPYGLVRH 122
Query: 100 -------IALTGASEGKAVGEWFGPNTVAQVLRKLA 128
IAL GA+ GEW G VA V LA
Sbjct: 123 PIYTGCLIALVGAA--LIGGEWRGALGVALVFASLA 156
>gi|307108757|gb|EFN56996.1| hypothetical protein CHLNCDRAFT_143632 [Chlorella variabilis]
Length = 538
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 19/90 (21%)
Query: 54 MVIAQALLFLHLGRDWQWNVNSKEEAYL-----------------KILKMFEDRRTA--P 94
M++AQ L+ LGR+W+W ++++ ++L++F D P
Sbjct: 1 MILAQGLVRHVLGREWRWPEAARQQQAAAAPALAAAPAEAPPRLARLLELFWDTPAERNP 60
Query: 95 YSIHQIALTGASEGKAVGEWFGPNTVAQVL 124
+S+H + G + G G W GP + + L
Sbjct: 61 FSLHSLCRAGQACGVVAGRWLGPWVMCKTL 90
Score = 37.0 bits (84), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 11/23 (47%), Positives = 19/23 (82%)
Query: 222 FTFPQSLGVIGGKPNHALYFIGY 244
PQS+G++GG+P+ +LYF+G+
Sbjct: 228 LAMPQSIGIVGGRPSSSLYFVGF 250
>gi|107025629|ref|YP_623140.1| isoprenylcysteine carboxyl methyltransferase [Burkholderia
cenocepacia AU 1054]
gi|116693189|ref|YP_838722.1| isoprenylcysteine carboxyl methyltransferase [Burkholderia
cenocepacia HI2424]
gi|105895003|gb|ABF78167.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
cenocepacia AU 1054]
gi|116651189|gb|ABK11829.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
cenocepacia HI2424]
Length = 189
Score = 37.4 bits (85), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 42/96 (43%), Gaps = 23/96 (23%)
Query: 46 GCMLRCGQMVIAQALLF-----LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ- 99
G +CG V+ LLF LHLG +W +V KE+ E RT PY++ +
Sbjct: 71 GLQAQCGLAVLVAGLLFSVWARLHLGTNWSVSVTLKED--------HELVRTGPYALVRH 122
Query: 100 -------IALTGASEGKAVGEWFGPNTVAQVLRKLA 128
IAL GA+ GEW G V V LA
Sbjct: 123 PIYTGCLIALVGAA--LIGGEWRGAIGVLLVFASLA 156
>gi|170737545|ref|YP_001778805.1| isoprenylcysteine carboxyl methyltransferase [Burkholderia
cenocepacia MC0-3]
gi|169819733|gb|ACA94315.1| Isoprenylcysteine carboxyl methyltransferase [Burkholderia
cenocepacia MC0-3]
Length = 189
Score = 37.4 bits (85), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 42/96 (43%), Gaps = 23/96 (23%)
Query: 46 GCMLRCGQMVIAQALLF-----LHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQ- 99
G +CG V+ LLF LHLG +W +V KE+ E RT PY++ +
Sbjct: 71 GLQAQCGLAVLVAGLLFSVWARLHLGTNWSVSVTLKED--------HELVRTGPYALVRH 122
Query: 100 -------IALTGASEGKAVGEWFGPNTVAQVLRKLA 128
IAL GA+ GEW G V V LA
Sbjct: 123 PIYTGCLIALVGAA--LIGGEWRGAIGVLLVFASLA 156
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.137 0.428
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,909,479,630
Number of Sequences: 23463169
Number of extensions: 201377412
Number of successful extensions: 395909
Number of sequences better than 100.0: 779
Number of HSP's better than 100.0 without gapping: 735
Number of HSP's successfully gapped in prelim test: 44
Number of HSP's that attempted gapping in prelim test: 392486
Number of HSP's gapped (non-prelim): 1251
length of query: 309
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 167
effective length of database: 9,027,425,369
effective search space: 1507580036623
effective search space used: 1507580036623
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)