BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10465
(309 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q640G7|ATG4B_XENLA Cysteine protease ATG4B OS=Xenopus laevis GN=atg4b PE=2 SV=1
Length = 384
Score = 291 bits (744), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 148/315 (46%), Positives = 193/315 (61%), Gaps = 52/315 (16%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
EQ+ DITSRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ H+GRDW+W+
Sbjct: 40 EQLLNDITSRLWFTYRRNFQAIGGTGPTSDTGWGCMLRCGQMIFAQALICRHVGRDWRWD 99
Query: 73 VNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
+ YL IL F D++ + YSIHQIA G EGK +G+W+GPNTVAQVLRKLA +D
Sbjct: 100 KQKPKGEYLNILTAFLDKKDSYYSIHQIAQMGVGEGKYIGQWYGPNTVAQVLRKLAVFDQ 159
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSN----------------PQWQPLVLVIPL 176
WSSI H+A+DNT+VV+++++LC SS+ QW+PLVL+IPL
Sbjct: 160 WSSIAVHIAMDNTVVVDEIRRLCRAGSGESSDAGALSNGYTGDSDPSCAQWKPLVLLIPL 219
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
RLG+ +IN YI +K C F PQSLGVIGG+PN
Sbjct: 220 RLGLSEINEAYIETLKHC---------------------------FMVPQSLGVIGGRPN 252
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHILHMDPSI 295
A YFIGYVG+++I+LDPHT Q + + D D ++HC R+H+ +DPSI
Sbjct: 253 SAHYFIGYVGDELIYLDPHTTQ----LSVEPSDCSFIEDESFHCQHPPCRMHVSEIDPSI 308
Query: 296 AV----VSQRSYSDY 306
AV SQ + D+
Sbjct: 309 AVGFFCSSQEDFEDW 323
>sp|Q6PZ05|ATG4A_BOVIN Cysteine protease ATG4A OS=Bos taurus GN=ATG4A PE=2 SV=1
Length = 398
Score = 285 bits (730), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 195/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP----------------------QWQPL 170
W+S+ +V++DNT+V+ +KK+C T ++ P W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRTLSLSADTPAERPLESLTASNQSKGPSACCTAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTAD-DQTFHCLQPPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>sp|Q8WYN0|ATG4A_HUMAN Cysteine protease ATG4A OS=Homo sapiens GN=ATG4A PE=1 SV=1
Length = 398
Score = 281 bits (720), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 197/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T R +SN W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D E++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTEENGTVN-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>sp|Q8C9S8|ATG4A_MOUSE Cysteine protease ATG4A OS=Mus musculus GN=Atg4a PE=2 SV=2
Length = 396
Score = 281 bits (719), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 135/305 (44%), Positives = 194/305 (63%), Gaps = 52/305 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWNWER 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQV++KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVIKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP-------------------QWQPLVLV 173
W+S+ +V++DNT+V+ +KK+C +++P W+PL+L+
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCCVLPVGAADPAGDFLTASNQSRDTSVPCSAWKPLLLI 223
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PLRLGI INPVY+ K+C F PQSLG +GG
Sbjct: 224 VPLRLGINQINPVYVEAFKEC---------------------------FKMPQSLGALGG 256
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
KPN+A YFIG++G+++IFLDPHT Q + ++S D T+HC Q+ R+ IL++D
Sbjct: 257 KPNNAYYFIGFLGDELIFLDPHTTQT----FVDIEESGLVDDQTFHCLQSPQRMSILNLD 312
Query: 293 PSIAV 297
PS+A+
Sbjct: 313 PSVAL 317
>sp|Q5R699|ATG4A_PONAB Cysteine protease ATG4A OS=Pongo abelii GN=ATG4A PE=2 SV=1
Length = 398
Score = 278 bits (712), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 196/308 (63%), Gaps = 55/308 (17%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNV 73
++ DI++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW W
Sbjct: 44 KLLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK 103
Query: 74 NSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDD 132
++ + Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+
Sbjct: 104 QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDE 163
Query: 133 WSSIVFHVALDNTLVVNQVKKLC--------TTNKR-----ASSN---------PQWQPL 170
W+S+ +V++DNT+V+ +KK+C T R +SN W+PL
Sbjct: 164 WNSLAVYVSMDNTVVIEDIKKMCRVLPLGADTAGDRPPDSLTASNLSKGTSAYCSAWKPL 223
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
+L++PLRLGI INPVY++ K+C F PQSLG
Sbjct: 224 LLIVPLRLGINQINPVYVDAFKEC---------------------------FKMPQSLGA 256
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHIL 289
+GGKPN+A YFIG++G+++IFLDPHT Q D ++ D T+HC Q+ R++IL
Sbjct: 257 LGGKPNNAYYFIGFLGDELIFLDPHTTQTF---VDTGENGTVN-DQTFHCLQSPQRMNIL 312
Query: 290 HMDPSIAV 297
++DPS+A+
Sbjct: 313 NLDPSVAL 320
>sp|Q6DG88|ATG4B_DANRE Cysteine protease ATG4B OS=Danio rerio GN=atg4b PE=2 SV=2
Length = 394
Score = 276 bits (707), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 190/313 (60%), Gaps = 59/313 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
+ I D+TSRLWFTYRK F PIG +G T+D GWGCMLRCGQM++ +AL+ HLGRDW+W+
Sbjct: 40 DDILADVTSRLWFTYRKNFQPIGGTGPTSDTGWGCMLRCGQMILGEALICRHLGRDWKWS 99
Query: 73 VNSKEE-AYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ Y+ IL F D++ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 PGQRQRPEYVSILNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTT----NKRA---SSNPQ------------------ 166
WS + HVA+DNT+V+ ++K+LC ++ A S P+
Sbjct: 160 SWSRLAVHVAMDNTVVIEEIKRLCMPWLDFDRGACAVSEEPREMNGDLEGACALAEEETA 219
Query: 167 -WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
W+PLVL+IPLRLG+ DIN YI +K+C F P
Sbjct: 220 LWKPLVLLIPLRLGLSDINEAYIEPLKQC---------------------------FMMP 252
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-AS 284
QSLGVIGGKPN A YFIG+VG+++I+LDPHT Q D +D D +YHC
Sbjct: 253 QSLGVIGGKPNSAHYFIGFVGDELIYLDPHTTQP---AVDPSEDGHFP-DDSYHCQHPPC 308
Query: 285 RLHILHMDPSIAV 297
R+HI +DPSIA
Sbjct: 309 RMHICELDPSIAA 321
>sp|Q6PZ02|ATG4B_CHICK Cysteine protease ATG4B OS=Gallus gallus GN=ATG4B PE=2 SV=1
Length = 393
Score = 276 bits (705), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 188/312 (60%), Gaps = 58/312 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
E+I D+TSRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 39 EEILLDVTSRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWI 98
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ + Y +L F D++ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 99 KGKRQTDNYFSVLNAFIDKKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLATFD 158
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------- 166
WSS+ H+A+DNT+V+ ++++LC +N A++ P
Sbjct: 159 TWSSLAVHIAMDNTVVMEEIRRLCQSNFSCAGAAACPAVEADVLYNGYPEEAGVRDKLSL 218
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
W+PLVL+IPLRLG+ +IN YI +K C F PQ
Sbjct: 219 WKPLVLLIPLRLGLTEINEAYIETLKHC---------------------------FMMPQ 251
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASR 285
SLGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC R
Sbjct: 252 SLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPSDSGCLPDESFHCQHPPCR 307
Query: 286 LHILHMDPSIAV 297
+ I +DPSIAV
Sbjct: 308 MSIAELDPSIAV 319
>sp|Q8BGE6|ATG4B_MOUSE Cysteine protease ATG4B OS=Mus musculus GN=Atg4b PE=1 SV=2
Length = 393
Score = 274 bits (701), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYR+ F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRRNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFNVLNAFLDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN-----------------------KRASSNPQ-W 167
WSS+ H+A+DNT+V+ ++++LC N ++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRANLPCAGAAALPTDSERHCNGFPAGAEVTNRPSAW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + DS D ++HC SR+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----ELTDSCFIPDESFHCQHPPSRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 GIGELDPSIAV 319
>sp|Q9Y4P1|ATG4B_HUMAN Cysteine protease ATG4B OS=Homo sapiens GN=ATG4B PE=1 SV=2
Length = 393
Score = 273 bits (699), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 187/311 (60%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WSS+ H+A+DNT+V+ ++++LC T+ A++ P W
Sbjct: 160 TWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ DIN Y+ +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLTDINEAYVETLKHC---------------------------FMMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q + D D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAV----EPTDGCFIPDESFHCQHPPCRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>sp|Q6PZ03|ATG4B_BOVIN Cysteine protease ATG4B OS=Bos taurus GN=ATG4B PE=2 SV=1
Length = 393
Score = 270 bits (690), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 185/311 (59%), Gaps = 57/311 (18%)
Query: 13 EQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWN 72
++I D+ SRLWFTYRK F IG +G T+D GWGCMLRCGQM+ AQAL+ HLGRDW+W
Sbjct: 40 DEILADVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWT 99
Query: 73 VNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD 131
++ ++Y +L+ F DR+ + YSIHQIA G EGK++G+W+GPNTVAQVL+KLA +D
Sbjct: 100 QRKRQPDSYCSVLQAFLDRKDSCYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFD 159
Query: 132 DWSSIVFHVALDNTLVVNQVKKLCTTN---KRASSNPQ---------------------W 167
WS++ HVA+DNT+V+ +++LC ++ A + P W
Sbjct: 160 TWSALAVHVAMDNTVVMADIRRLCRSSLPCAGAEAFPADSERHCNGFPAGAEGGGRAAPW 219
Query: 168 QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQS 227
+PLVL+IPLRLG+ D+N Y +K C F PQS
Sbjct: 220 RPLVLLIPLRLGLADVNAAYAGTLKHC---------------------------FRMPQS 252
Query: 228 LGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQ-ASRL 286
LGVIGGKPN A YFIGYVG ++I+LDPHT Q D+ D ++HC R+
Sbjct: 253 LGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVAAADR----CPVPDESFHCQHPPGRM 308
Query: 287 HILHMDPSIAV 297
I +DPSIAV
Sbjct: 309 SIAELDPSIAV 319
>sp|Q5ZIW7|ATG4A_CHICK Cysteine protease ATG4A OS=Gallus gallus GN=ATG4A PE=2 SV=1
Length = 380
Score = 265 bits (676), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 201/313 (64%), Gaps = 55/313 (17%)
Query: 9 HQDLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRD 68
++D ++ D+++RLWFTYR+ F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRD
Sbjct: 22 NEDKSKLLLDVSARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRD 81
Query: 69 WQWNVNSKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL 127
WQW + K+ E Y +IL F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KL
Sbjct: 82 WQWEKHKKQPEEYHRILHCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKL 141
Query: 128 AKYDDWSSIVFHVALDNTLVVNQVKKLCTT---------------------NKRASS-NP 165
A +D+W+S+ +V++DNT+V+ +KK+C + N+ A+
Sbjct: 142 ALFDEWNSLAVYVSMDNTVVIEDIKKMCRSPPQSSSTAHSSAHLHRSALGRNRNAAGLCT 201
Query: 166 QWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFP 225
W+PL+L+IPLRLGI INPVYI+ K+C F P
Sbjct: 202 GWKPLLLIIPLRLGINHINPVYIDAFKEC---------------------------FKMP 234
Query: 226 QSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-S 284
QSLG +GGKPN+A YFIG++GN++I+LDPHT Q+ D E++ D ++HC QA
Sbjct: 235 QSLGALGGKPNNAYYFIGFLGNELIYLDPHTTQSF---VDSEENGTVD-DQSFHCQQAPH 290
Query: 285 RLHILHMDPSIAV 297
R+ I+++DPS+A+
Sbjct: 291 RMKIMNLDPSVAL 303
>sp|Q6GPU1|ATG4A_XENLA Cysteine protease ATG4A OS=Xenopus laevis GN=atg4a PE=2 SV=1
Length = 397
Score = 261 bits (666), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 190/305 (62%), Gaps = 53/305 (17%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
++ DI SRLWFTYRK F PIG +G ++D GWGCMLRCGQM++AQAL+ HLGRDW+W +
Sbjct: 47 LQSDIVSRLWFTYRKKFSPIGGTGPSSDTGWGCMLRCGQMMLAQALVCQHLGRDWRWEKH 106
Query: 75 SKE-EAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
E Y +IL+ F DR+ YSIHQ+A G EGK++GEWFGPNTVAQVL+KLA +D+W
Sbjct: 107 KNHPEEYQQILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW 166
Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ--------------------WQPLVLV 173
+S+ +V++DNT+VV +K +C ++ S Q W+PL+LV
Sbjct: 167 NSLAVYVSMDNTVVVEDIKTMCKYQPQSCSMAQAASHQSTWSRCRDTSGHCSGWRPLLLV 226
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PLRLGI INPVY++ K C F PQSLG +GG
Sbjct: 227 VPLRLGINHINPVYVDAFKAC---------------------------FKMPQSLGALGG 259
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA-SRLHILHMD 292
KPNHA YFIG+ G+++I+LDPHT Q D E+ + D TYHC + + + +L++D
Sbjct: 260 KPNHAYYFIGFSGDEIIYLDPHTTQTF---VDTEEAGTVQ-DQTYHCQKGPNSMKVLNLD 315
Query: 293 PSIAV 297
PS+A+
Sbjct: 316 PSVAL 320
>sp|Q684M2|ATG4D_PIG Cysteine protease ATG4D OS=Sus scrofa GN=ATG4D PE=3 SV=1
Length = 469
Score = 175 bits (444), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 156/319 (48%), Gaps = 67/319 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 107 DIQRFQRDFVSRLWLTYRRDFPPLAGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 166
Query: 71 WNVN-------------------------------SKEEAYLKILKMFEDRRTAPYSIHQ 99
W+ +E + +I+ F D AP+ +H+
Sbjct: 167 WSQGVGLGPPESSPNRYRGPAHWMPPHWVQAAPELEQERRHRQIVSWFADHPRAPFGLHR 226
Query: 100 IALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKLCTTN 158
+ G S GK G+W+GP+ VA +LRK + + + +V +V+ D T+ V +L
Sbjct: 227 LVELGQSSGKKAGDWYGPSLVAHILRKAVESCSEVTRLVVYVSQDCTVYKADVARLVA-- 284
Query: 159 KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTP 218
R +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 285 -RPDPTAEWKAVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL----- 324
Query: 219 RYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTY 278
LG++GGKP H+LYFIGY + +++LDPH Q V + E ++
Sbjct: 325 --------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE-----SF 371
Query: 279 HCPQASRLHILHMDPSIAV 297
HC ++ MDPS V
Sbjct: 372 HCTSPRKMAFTKMDPSCTV 390
>sp|Q86TL0|ATG4D_HUMAN Cysteine protease ATG4D OS=Homo sapiens GN=ATG4D PE=2 SV=1
Length = 474
Score = 174 bits (440), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 154/323 (47%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWT 167
Query: 71 WNVN-----------------------------------SKEEAYLKILKMFEDRRTAPY 95
W +E + +I+ F D AP+
Sbjct: 168 WAEGMGLGPPELSGSASPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + D + +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGQSSGKKAGDWYGPSLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
R +W+ +V+++P+RLG + +NPVY+ +K+
Sbjct: 288 VA---RPDPTAEWKSVVILVPVRLGGETLNPVYVPCVKELL------------------- 325
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
R E LG++GGKP H+LYFIGY + +++LDPH Q V + E
Sbjct: 326 ----RCELC----LGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPTVDVSQADFPLE--- 374
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
++HC ++ MDPS V
Sbjct: 375 --SFHCTSPRKMAFAKMDPSCTV 395
>sp|Q8BGV9|ATG4D_MOUSE Cysteine protease ATG4D OS=Mus musculus GN=Atg4d PE=1 SV=1
Length = 474
Score = 171 bits (433), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 160/323 (49%), Gaps = 71/323 (21%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
D+++ +RD SRLW TYR+ F P+ LT+D GWGCMLR GQM++AQ LL L RDW+
Sbjct: 108 DIQRFQRDFVSRLWLTYRRDFPPLAGGSLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWR 167
Query: 71 WNVNS-----------------------------------KEEAYLKILKMFEDRRTAPY 95
W + ++ + +I+ F D AP+
Sbjct: 168 WVEGTGLASSEMPGPASPSRCRGPGRRGPPRWTQGALEMEQDRWHRRIVSWFADHPRAPF 227
Query: 96 SIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTLVVNQVKKL 154
+H++ G S GK G+W+GP+ VA +LRK + + S +V +V+ D T+ V +L
Sbjct: 228 GLHRLVELGRSSGKKAGDWYGPSVVAHILRKAVESCSEVSRLVVYVSQDCTVYKADVARL 287
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
+ + +W+ +V+++P+RLG + +NPVY+ +K ++L S
Sbjct: 288 LSWPDPTA---EWKSVVILVPVRLGGETLNPVYVPCVK--------------ELLRSEL- 329
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKL 274
LG++GGKP H+LYFIGY + +++LDPH Q D Q S L
Sbjct: 330 ------------CLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQP---TVDVSQPS-FPL 373
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
+S +HC ++ MDPS V
Sbjct: 374 ES-FHCTSPRKMAFAKMDPSCTV 395
>sp|Q8S929|ATG4A_ARATH Cysteine protease ATG4a OS=Arabidopsis thaliana GN=ATG4A PE=2 SV=1
Length = 467
Score = 164 bits (414), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 143/303 (47%), Gaps = 49/303 (16%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW 71
L ++ D +S++ TYRKGF P D+ T+D WGCM+R QM+ AQALLF LGR W
Sbjct: 135 LAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLGRAWTK 194
Query: 72 NVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLA--- 128
E+ YL+ L+ F D + +SIH + + GAS G A G W GP + + LA
Sbjct: 195 KSELPEQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWESLACKK 254
Query: 129 -KYDDWSSIVFHVAL-------------DNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
K D + +A+ L + K C + S +W P++L++
Sbjct: 255 RKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQS--EWTPIILLV 312
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
PL LG+ +NP YI + FTFPQS+G++GGK
Sbjct: 313 PLVLGLDSVNPRYIPSLVA---------------------------TFTFPQSVGILGGK 345
Query: 235 PNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPS 294
P + Y +G + +LDPH Q + V + D + S+YHC + + +DPS
Sbjct: 346 PGASTYIVGVQEDKGFYLDPHEVQQVVTVNKETPDVDT---SSYHCNVLRYVPLESLDPS 402
Query: 295 IAV 297
+A+
Sbjct: 403 LAL 405
>sp|A2QY50|ATG4_ASPNC Probable cysteine protease atg4 OS=Aspergillus niger (strain CBS
513.88 / FGSC A1513) GN=atg4 PE=3 SV=1
Length = 404
Score = 159 bits (403), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 101/305 (33%), Positives = 153/305 (50%), Gaps = 59/305 (19%)
Query: 18 DITSRLWFTYRKGFVPI----GDS-------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F PI GD G T+D GWGCM+R GQ
Sbjct: 81 DFESRIWMTYRSNFPPIPRVEGDDKSASMTLGVRLRSQLVDTQGFTSDTGWGCMIRSGQS 140
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A AL L LGRDW+ +EE+ ++L +F D TAP+S+H+ GA S GK GE
Sbjct: 141 LLANALSMLVLGRDWRRGARFEEES--QLLSLFADTPTAPFSVHRFVKHGAESCGKYPGE 198
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ + L+ ++ +V+ D + V K + S +QP +++
Sbjct: 199 WFGPSATAKCIEALSSQCGNPTLKVYVSNDTSEVYQD--KFMDIARNTSG--AFQPTLIL 254
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI +I PVY +G+K FPQS+G+ GG
Sbjct: 255 LGTRLGIDNITPVYWDGLKAA---------------------------LQFPQSVGIAGG 287
Query: 234 KPNHALYFIGYVGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YF+G G+ + +LDPH T + + E S++++D TYH + R+H+ MD
Sbjct: 288 RPSASHYFVGAQGSHLFYLDPHYTRPALPDRQEGELYSKEEVD-TYHTRRLRRIHVRDMD 346
Query: 293 PSIAV 297
PS+ +
Sbjct: 347 PSMLI 351
>sp|Q9M1Y0|ATG4B_ARATH Cysteine protease ATG4b OS=Arabidopsis thaliana GN=ATG4B PE=1 SV=1
Length = 477
Score = 158 bits (400), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 151/304 (49%), Gaps = 50/304 (16%)
Query: 12 LEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDW-Q 70
L R+D +S + TYR+GF PIGD+ T+D WGCMLR GQM+ AQALLF LGR W +
Sbjct: 138 LAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRK 197
Query: 71 WNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY 130
+ +E YL+IL++F D + +SIH + L G S G A G W GP V + LA+
Sbjct: 198 KDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARK 257
Query: 131 DDWS--------SIVFHVALDNT---------LVVNQVKKLCTTNKRASSNPQWQPLVLV 173
+ S+ H+ + L + V K C + + +W P++L+
Sbjct: 258 NKEETDDKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCL--EFSEGETEWPPILLL 315
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+PL LG+ +NP YI + FTFPQSLG++GG
Sbjct: 316 VPLVLGLDRVNPRYIPSLIA---------------------------TFTFPQSLGILGG 348
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
KP + Y +G + +LDPH Q + V + QD + S+YHC + + +DP
Sbjct: 349 KPGASTYIVGVQEDKGFYLDPHDVQQVVTVKKENQDVDT---SSYHCNTLRYVPLESLDP 405
Query: 294 SIAV 297
S+A+
Sbjct: 406 SLAL 409
>sp|A6SDQ3|ATG4_BOTFB Probable cysteine protease atg4 OS=Botryotinia fuckeliana (strain
B05.10) GN=atg4 PE=3 SV=1
Length = 439
Score = 157 bits (398), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 143/305 (46%), Gaps = 57/305 (18%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D +++W TYR F I S G T+D GWGCM+R GQ
Sbjct: 106 DFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRSGQS 165
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A ALL L +GR+W+ V+S EE KIL +F D APYSIH+ GAS GK GE
Sbjct: 166 LLANALLTLRMGREWRRGVSSNEER--KILSLFADDPRAPYSIHKFVEHGASACGKHPGE 223
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L+ S + ++ D + V K + K S+ + P +++
Sbjct: 224 WFGPSATARCIQALSNSQAKSELRVYITGDGSDVYED--KFMSIAKPNHSD--FTPTLIL 279
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLG+ I PVY +K Y PQS+G+ GG
Sbjct: 280 VGTRLGLDKITPVYWEALK---------------------------YSLQMPQSVGIAGG 312
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YFIG +D +LDPH + D +D + + H + RLHI MDP
Sbjct: 313 RPSSSHYFIGVQESDFFYLDPHQTRPALPYKDNVEDYTTEDIDSCHTRRLRRLHIKEMDP 372
Query: 294 SIAVV 298
S+ +
Sbjct: 373 SMLIA 377
>sp|Q0U199|ATG4_PHANO Probable cysteine protease ATG4 OS=Phaeosphaeria nodorum (strain
SN15 / ATCC MYA-4574 / FGSC 10173) GN=ATG4 PE=3 SV=1
Length = 467
Score = 156 bits (395), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/265 (32%), Positives = 128/265 (48%), Gaps = 53/265 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS---------------------GLTTDKGWGCMLRCGQMVI 56
D SR+W TYR GF PI S G T+D G+GCM+R GQ ++
Sbjct: 99 DFESRVWMTYRSGFSPIQKSQDPKATSAMSFRVRMQNLASPGFTSDAGFGCMIRSGQCIL 158
Query: 57 AQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWF 115
A AL L LGRDW+W N ++ + +IL +F D AP+SIH+ GA+ GK GEWF
Sbjct: 159 ANALQILRLGRDWRWQENHADKDHAEILSLFADDPQAPFSIHRFVEHGAAVCGKYPGEWF 218
Query: 116 GPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIP 175
GP+ A+ ++ LA + + +V+ D V K ++ WQP ++++
Sbjct: 219 GPSAAARCIQDLANKHREAGLKVYVSGDGADVYEDKLKQVAVDEDG----LWQPTLILVG 274
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
RLGI I PVY +K PQS+G+ GG+P
Sbjct: 275 TRLGIDKITPVYWEALKAS---------------------------LQIPQSIGIAGGRP 307
Query: 236 NHALYFIGYVGNDVIFLDPHTNQNI 260
+ + YF+G GN+ +LDPH+ + +
Sbjct: 308 SASHYFVGVQGNNFYYLDPHSTRPL 332
>sp|A7F045|ATG4_SCLS1 Probable cysteine protease atg4 OS=Sclerotinia sclerotiorum (strain
ATCC 18683 / 1980 / Ss-1) GN=atg4 PE=3 SV=2
Length = 439
Score = 152 bits (383), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 98/316 (31%), Positives = 147/316 (46%), Gaps = 58/316 (18%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D +++W TYR F I S G T+D GWGCM+R GQ
Sbjct: 106 DFEAKIWLTYRSNFPAIAKSQDPKALSAMSLSVRLRSQLVDQGGFTSDTGWGCMIRSGQS 165
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A ALL L +GR+W+ +S EE KIL +F D APYSIH+ GAS GK GE
Sbjct: 166 LLANALLTLRMGREWRRGSSSNEER--KILSLFADDPRAPYSIHKFVEHGASACGKHPGE 223
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L S + ++ D + V + K S+ ++ P +++
Sbjct: 224 WFGPSAAARCIQALTNSQVESELRVYITGDGSDVYEDT--FMSIAKPNST--KFTPTLIL 279
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLG+ I PVY +K +P QS+G+ GG
Sbjct: 280 VGTRLGLDKITPVYWEALKSSLQMP---------------------------QSVGIAGG 312
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YFIG +D +LDPH + D +D + + H + RLHI MDP
Sbjct: 313 RPSSSHYFIGVQESDFFYLDPHQTRPALPFNDNVEDYTPEDIDSCHTRRLRRLHIKEMDP 372
Query: 294 SIAVVSQ-RSYSDYKN 308
S+ + R +D+K+
Sbjct: 373 SMLIAFLIRDENDWKD 388
>sp|Q1E5M9|ATG4_COCIM Probable cysteine protease ATG4 OS=Coccidioides immitis (strain RS)
GN=ATG4 PE=3 SV=1
Length = 432
Score = 151 bits (382), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 97/305 (31%), Positives = 143/305 (46%), Gaps = 62/305 (20%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S+ WFTYR F I S G T D GWGCM+R GQ
Sbjct: 108 DFESKFWFTYRSNFPAIPKSRDPDTPLALTLSVRLRSQFLDTHGFTADTGWGCMIRSGQS 167
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A AL L+LGRDW+ KEE ++L +F D AP+SIH+ GAS GK GE
Sbjct: 168 LLANALSILNLGRDWRRGSKIKEEC--ELLSLFADNPQAPFSIHRFVDYGASACGKHPGE 225
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLV-VNQVKKLCTTNKRASSNPQWQPLVL 172
WFGP+ A+ + L+ + + +V D + V +Q +++ + +P ++
Sbjct: 226 WFGPSATARCIEALSNECKHTDLNVYVMSDGSDVHEDQFRQIAGPDG-------IRPTLI 278
Query: 173 VIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIG 232
++ +RLGI+ + PVY ++ +PQS+G+ G
Sbjct: 279 LLGVRLGIESVTPVYWEALRAI---------------------------IRYPQSVGIAG 311
Query: 233 GKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
G+P+ +LYFIG G +LDPH + S + LD TYH + RLHI MD
Sbjct: 312 GRPSSSLYFIGVQGPYFFYLDPHHTRPAVSWNPDSTLSPENLD-TYHTRRLRRLHIREMD 370
Query: 293 PSIAV 297
PS+ +
Sbjct: 371 PSMLI 375
>sp|Q96DT6|ATG4C_HUMAN Cysteine protease ATG4C OS=Homo sapiens GN=ATG4C PE=2 SV=1
Length = 458
Score = 150 bits (379), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 105/356 (29%), Positives = 157/356 (44%), Gaps = 90/356 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W----NV-NSKEEAYL-------------------------------------------- 81
W N+ NS E++
Sbjct: 135 WPDALNIENSDSESWTSHTVKKFTASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNE 194
Query: 82 ----KILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
KI+ F D A + +HQ+ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 VYHRKIISWFGDSPLALFGLHQLIEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
I +VA D T+ + V + + S N + +++++P+RLG + N Y+ +K
Sbjct: 255 GITIYVAQDCTVYNSDVIDKQSAS-MTSDNADDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
ILS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GILSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV-VSQRSYSDYKNV 309
H Q+ V K+ E T+HCP ++ MDPS + R+ D+K
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRA 397
>sp|A7KAL5|ATG4_PENCW Probable cysteine protease atg4 OS=Penicillium chrysogenum (strain
ATCC 28089 / DSM 1075 / Wisconsin 54-1255) GN=atg4 PE=3
SV=1
Length = 401
Score = 150 bits (379), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 144/304 (47%), Gaps = 58/304 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F PI + G T+D GWGCM+R GQ
Sbjct: 74 DFGSRIWITYRSNFTPIPRTKTPEATSSMTLGVRLRSQLMDPQGFTSDTGWGCMIRSGQS 133
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A L LGRDW+ +EE+ K++ MF D AP+SIH+ GA S GK GE
Sbjct: 134 LLANTFSVLLLGRDWRRGEKVEEES--KLISMFADHPEAPFSIHRFVNRGAESCGKYPGE 191
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ L+ + + ++ D + V ++ + QP +++
Sbjct: 192 WFGPSATAKCIQLLSTQSEVPQLRVYLTNDTSDVYEDKFAHVAHDESG----RIQPTLIL 247
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
I RLGI ++ P Y +G+ R T+PQS+G+ GG
Sbjct: 248 IGTRLGIDNVTPAYWDGL---------------------------RAALTYPQSVGIAGG 280
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YF+G + FLDPHT + ++++LDS Y+ + R+HI MDP
Sbjct: 281 RPSASHYFVGAQDCHLFFLDPHTTRPATLYRPDGLYTQEELDS-YYTSRLRRIHIKDMDP 339
Query: 294 SIAV 297
S+ +
Sbjct: 340 SMLI 343
>sp|A1CJ08|ATG4_ASPCL Probable cysteine protease atg4 OS=Aspergillus clavatus (strain
ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 / NRRL 1)
GN=atg4 PE=3 SV=1
Length = 400
Score = 149 bits (377), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 97/328 (29%), Positives = 151/328 (46%), Gaps = 61/328 (18%)
Query: 8 SHQDLEQIRRDITSRLWFTYRKGFVPI-----------------------GDSGLTTDKG 44
HQ E+ D+ SR+W TYR F PI G T+D G
Sbjct: 69 EHQWPEEFLDDVESRIWITYRSNFTPIPKPPNQEANPAMTLTVHLRSQLMDSQGFTSDTG 128
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
WGCM+R GQ ++A A+L L LGRDW+ + +EA ++L F D AP+SIH+ G
Sbjct: 129 WGCMIRSGQSLLANAMLILLLGRDWRRGTEAGKEA--QLLHQFADHPEAPFSIHRFVQHG 186
Query: 105 ASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASS 163
A K GEWFGP+ A+ ++ L S + ++ D+T + + K +
Sbjct: 187 AEFCNKYPGEWFGPSATARCIQALVAQQGSSELRVYIT-DDTADIYEDKFARIAQ---AE 242
Query: 164 NPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFT 223
+ + P ++++ RLGI + P Y + +K+ LP
Sbjct: 243 HGDFIPTLILVGTRLGIDHVTPAYWDALKEALQLP------------------------- 277
Query: 224 FPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQA 283
QS+G+ GG+P+ + YFIG G + +LDPH + D + +TYH +
Sbjct: 278 --QSVGIAGGRPSASHYFIGVHGQYLFYLDPHHTRPASLHQDVNDTLTHEEVNTYHTRRL 335
Query: 284 SRLHILHMDPSI----AVVSQRSYSDYK 307
R+HI MDPS+ + S+ ++D+K
Sbjct: 336 RRIHIKDMDPSMLIGFIIRSREDWTDWK 363
>sp|A2XHJ5|ATG4A_ORYSI Cysteine protease ATG4A OS=Oryza sativa subsp. indica GN=ATG4A PE=3
SV=1
Length = 473
Score = 149 bits (375), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 98/301 (32%), Positives = 146/301 (48%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
D +SR+W TYRKGF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 130 EDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKP 189
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---- 131
Y+ IL MF D +SIH + G S G A G W GP + + + L + +
Sbjct: 190 YSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVRTNREHH 249
Query: 132 -------DWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
++ ++ V+ D + ++ +LC + S W P++L++PL
Sbjct: 250 EAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 307
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ +NP YI +K+ FTFPQSLG++GGKP
Sbjct: 308 VLGLDKLNPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 340
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + V++LDPH Q D D+ + S+YHC L + +DPS+A
Sbjct: 341 TSTYVAGVQDDRVLYLDPHEVQ---LAVDIAADNLEADTSSYHCSTVRDLALDLIDPSLA 397
Query: 297 V 297
+
Sbjct: 398 I 398
>sp|Q2U5B0|ATG4_ASPOR Probable cysteine protease atg4 OS=Aspergillus oryzae (strain ATCC
42149 / RIB 40) GN=atg4 PE=3 SV=2
Length = 407
Score = 148 bits (374), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 147/310 (47%), Gaps = 59/310 (19%)
Query: 13 EQIRRDITSRLWFTYRKGFVPI-----------------------GDSGLTTDKGWGCML 49
E D S++W TYR F PI G T+D GWGCM+
Sbjct: 79 EAFVSDFESKIWMTYRSDFPPIPRLDNDEANHPMTLTVRIRTQLMDPQGFTSDTGWGCMI 138
Query: 50 RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEG 108
R GQ ++A A+L L LGRDW+ ++EEA ++L +F D AP SIH+ GA S G
Sbjct: 139 RSGQSLLANAMLTLCLGRDWRRGDKAEEEA--RLLSLFADHPDAPLSIHRFVKYGAESCG 196
Query: 109 KAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQ 168
K GEWFGP+ A+ + L+ +I V + N + V + S + Q
Sbjct: 197 KHPGEWFGPSATARCIEALSA--QCGNIAPRVYVTND--TSDVYEDSFLRVARSGSGSIQ 252
Query: 169 PLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSL 228
P ++++ RLGI ++ PVY +G+K L PQS+
Sbjct: 253 PTLILLGTRLGIDNVTPVYWDGLKAVLQL---------------------------PQSV 285
Query: 229 GVIGGKPNHALYFIGYVGNDVIFLDPHTNQ-NIGCVYDKEQDSEKKLDSTYHCPQASRLH 287
G+ GG+P+ + YFIG G +LDPHT + + D S+ ++ STYH + R+H
Sbjct: 286 GIAGGRPSASHYFIGTQGPHFFYLDPHTTRPAVPYSIDGRLLSKTEI-STYHTRRLRRIH 344
Query: 288 ILHMDPSIAV 297
I MDPS+ +
Sbjct: 345 IQDMDPSMLI 354
>sp|Q811C2|ATG4C_MOUSE Cysteine protease ATG4C OS=Mus musculus GN=Atg4c PE=2 SV=2
Length = 458
Score = 148 bits (373), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 99/343 (28%), Positives = 153/343 (44%), Gaps = 89/343 (25%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ R+D SR+W TYR+ F I S LTTD GWGC LR GQM++AQ L+ LGR W
Sbjct: 75 NVEEFRKDFISRIWLTYREEFPQIEASALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWT 134
Query: 71 W-------NVNS---------------------------------------------KEE 78
W N +S + E
Sbjct: 135 WPDALHIENADSDSWTSNTVKKFTASFEASLSGDRELRTPAVSLKETSGKCPDDHAVRNE 194
Query: 79 AY-LKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKL---AKYDDWS 134
AY KI+ F D A + +H++ G GK G+W+GP VA +LRK A++ D
Sbjct: 195 AYHRKIISWFGDSPVAVFGLHRLIEFGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQ 254
Query: 135 SIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKC 194
+ +VA D T+ + V T+ + + + + +++++P+RLG + N Y+ +K
Sbjct: 255 GLTIYVAQDCTVYNSDVIDK-QTDSVTAGDARDKAVIILVPVRLGGERTNTDYLEFVK-- 311
Query: 195 YALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDP 254
+LS Y +G+IGGKP + YF G+ + +I++DP
Sbjct: 312 ------------GVLSLEY-------------CVGIIGGKPKQSYYFAGFQDDSLIYMDP 346
Query: 255 HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
H Q+ V K+ E T+HCP ++ MDPS +
Sbjct: 347 HYCQSFVDVSIKDFPLE-----TFHCPSPKKMSFRKMDPSCTI 384
>sp|Q7XPW8|ATG4B_ORYSJ Cysteine protease ATG4B OS=Oryza sativa subsp. japonica GN=ATG4B
PE=2 SV=1
Length = 478
Score = 147 bits (372), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 144/301 (47%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSK 76
D +SR+W TYR+GF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 134 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPLEKP 193
Query: 77 -EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
Y+ IL MF D +SIH + G S G A G W GP + + + L +
Sbjct: 194 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 253
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+ + ++ V+ D + ++ +LC + S W P++L++PL
Sbjct: 254 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 311
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ FTFPQSLG++GGKP
Sbjct: 312 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 344
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH Q D D+ + S+YHC L + +DPS+A
Sbjct: 345 TSTYIAGVQDDRALYLDPHEVQ---MAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLA 401
Query: 297 V 297
+
Sbjct: 402 I 402
>sp|Q2XPP4|ATG4B_ORYSI Cysteine protease ATG4B OS=Oryza sativa subsp. indica GN=ATG4B PE=1
SV=2
Length = 478
Score = 147 bits (370), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 144/301 (47%), Gaps = 52/301 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
D +SR+W TYR+GF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 134 EDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKP 193
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKY----- 130
Y+ IL MF D +SIH + G S G A G W GP + + + L +
Sbjct: 194 YNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQH 253
Query: 131 ------DDWSSIVFHVALDN--------TLVVNQVKKLCTTNKRASSNPQWQPLVLVIPL 176
+ + ++ V+ D + ++ +LC + S W P++L++PL
Sbjct: 254 EVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQST--WSPILLLVPL 311
Query: 177 RLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPN 236
LG+ INP YI +K+ FTFPQSLG++GGKP
Sbjct: 312 VLGLDKINPRYIPLLKE---------------------------TFTFPQSLGILGGKPG 344
Query: 237 HALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIA 296
+ Y G + ++LDPH Q D D+ + S+YHC L + +DPS+A
Sbjct: 345 TSTYIAGVQDDRALYLDPHEVQ---MAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLA 401
Query: 297 V 297
+
Sbjct: 402 I 402
>sp|A7KAI3|ATG4_PICAN Probable cysteine protease ATG4 OS=Pichia angusta GN=ATG4 PE=3 SV=1
Length = 509
Score = 145 bits (366), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 154/323 (47%), Gaps = 74/323 (22%)
Query: 14 QIRRDITSRLWFTYRKGFVPIGDS------------------------GLTTDKGWGCML 49
+ RD+ SR+W TYR GF I + G TTD GWGCM+
Sbjct: 73 EFLRDVHSRIWLTYRSGFPLIKRAEDGPSPLSFGSLIRGTVDLATVTKGFTTDAGWGCMI 132
Query: 50 RCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-EG 108
R Q ++A +LL L LGR W+++ + + +I+ F D TAP+SIH GA+ G
Sbjct: 133 RTSQSLLANSLLQLRLGRGWRYDQTRECAKHAEIVSWFVDIPTAPFSIHNFVEQGANCAG 192
Query: 109 KAVGEWFGPNTVAQVLRKL--AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQ 166
K GEWFGP+ A+ ++ L A YD V+ A + +++ +L A +
Sbjct: 193 KKPGEWFGPSAAARSIQVLCEANYDKTGLKVYFTA-SGDIYEDELFEL------AQQGAE 245
Query: 167 WQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
+P++++ +RLG++++NP+Y + +KK +PQ
Sbjct: 246 LRPVLILAGIRLGVKNVNPLYWDFLKKTLG---------------------------WPQ 278
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDK------------EQDSEKKL 274
S+G+ GG+P+ + YF G+ G+ + +LDPH Q + + E +S L
Sbjct: 279 SVGIAGGRPSSSHYFFGFQGDYLFYLDPHVPQKALLIASEAPHESPDPNHYVEVESGLDL 338
Query: 275 DSTYHCPQASRLHILHMDPSIAV 297
DS H + +LH+ MDPS+ V
Sbjct: 339 DSV-HTNKIRKLHLDQMDPSMLV 360
>sp|Q5XH30|ATG4C_XENLA Cysteine protease ATG4C OS=Xenopus laevis GN=atg4c PE=2 SV=1
Length = 450
Score = 144 bits (364), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 101/340 (29%), Positives = 149/340 (43%), Gaps = 93/340 (27%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++++ R+D SR+W TYRK F I S TTD GWGC LR GQM++AQ LL LGRDW
Sbjct: 76 NVDEFRKDFISRIWLTYRKEFPQIESSSWTTDCGWGCTLRTGQMLLAQGLLVHFLGRDWT 135
Query: 71 WNV-------------------------------------------NSK-----EEAYLK 82
W NS+ E+ + K
Sbjct: 136 WTEALDIFCSESDFWTANTARKLDPSLEKSSPENEEYVSLGKQPLQNSEKKRYSEDLHRK 195
Query: 83 ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---DWSSIVFH 139
I+ F D A + +HQ+ G + GK G+W+GP V+ +LRK + + I +
Sbjct: 196 IISWFADYPLAYFGLHQLVKLGKNSGKVAGDWYGPAVVSHLLRKAIEESSDPELQGITIY 255
Query: 140 VALDNTLVVNQVKKL-CTT-NKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYAL 197
VA D T+ V L C N++A +V+++P+RLG + N Y +K +L
Sbjct: 256 VAQDCTIYNADVYDLQCNKGNEKA--------VVILVPVRLGGERTNMEYFEYVKGILSL 307
Query: 198 PISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTN 257
EF +G+IGGKP + YF+G+ + +I++DPH
Sbjct: 308 -----------------------EFC----IGIIGGKPKQSYYFVGFQDDSLIYMDPHYC 340
Query: 258 QNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
Q+ V K E ++HCP ++ MDPS V
Sbjct: 341 QSFVDVSIKNFPLE-----SFHCPSPKKMSFKKMDPSCTV 375
>sp|Q68EP9|ATG4C_XENTR Cysteine protease ATG4C OS=Xenopus tropicalis GN=atg4c PE=2 SV=1
Length = 450
Score = 143 bits (361), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 97/338 (28%), Positives = 147/338 (43%), Gaps = 89/338 (26%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++++ R+D SR+W TYR+ F I S TTD GWGC LR GQM++AQ L+ LGRDW
Sbjct: 76 NVDEFRKDFISRIWLTYREEFPQIETSSWTTDCGWGCTLRTGQMLLAQGLIVHFLGRDWT 135
Query: 71 W---------------------------------------------NVNSK---EEAYLK 82
W N + K E+ + K
Sbjct: 136 WTEALDIFSSESEFWTANTARKLTPSLETSFSENNECVSSNKQPLHNCDKKSNSEDFHQK 195
Query: 83 ILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---DWSSIVFH 139
I+ F D A + +HQ+ G + GK G+W+GP V+ +LRK + + I +
Sbjct: 196 IISWFADYPLAYFGLHQLVKLGKNSGKVAGDWYGPAVVSHLLRKAIEESSDPELQGITIY 255
Query: 140 VALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPI 199
VA D T+ V L NK + +V+++P+RLG + N Y +K +L
Sbjct: 256 VAQDCTIYSADVYDL-QCNKGTE-----KAVVILVPVRLGGERTNMEYFEFVKGILSL-- 307
Query: 200 SPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQN 259
EF +G+IGGKP + YF+G+ + +I++DPH Q+
Sbjct: 308 ---------------------EFC----IGIIGGKPKQSYYFVGFQDDSLIYMDPHYCQS 342
Query: 260 IGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
V K E ++HCP ++ MDPS +
Sbjct: 343 FVDVSVKNFPLE-----SFHCPSPKKMSFKKMDPSCTI 375
>sp|Q75KP8|ATG4A_ORYSJ Cysteine protease ATG4A OS=Oryza sativa subsp. japonica GN=ATG4A
PE=3 SV=1
Length = 474
Score = 142 bits (359), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 98/302 (32%), Positives = 145/302 (48%), Gaps = 54/302 (17%)
Query: 17 RDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQW-NVNS 75
D +SR+W TYRKGF I DS T+D WGCM+R QM++AQAL+F HLGR W+ +
Sbjct: 131 EDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKP 190
Query: 76 KEEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYD---- 131
Y+ IL MF D +SIH + G S G A G W GP + + + L +
Sbjct: 191 YSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVCTNREHH 250
Query: 132 -------DWSSIVFHVALDN--------TLVVNQVKKLCTT-NKRASSNPQWQPLVLVIP 175
++ ++ V+ D + ++ +LC NK S+ W P++L++P
Sbjct: 251 EAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKNQST---WSPILLLVP 307
Query: 176 LRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKP 235
L LG+ +NP YI +K+ TFPQSLG++GGKP
Sbjct: 308 LVLGLDKLNPRYIPLLKE---------------------------TLTFPQSLGILGGKP 340
Query: 236 NHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSI 295
+ Y G + ++LDPH Q D D+ + S+YHC L + +DPS+
Sbjct: 341 GTSTYIAGVQDDRALYLDPHEVQ---LAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSL 397
Query: 296 AV 297
A+
Sbjct: 398 AI 399
>sp|Q68FJ9|ATG4D_XENLA Cysteine protease ATG4D OS=Xenopus laevis GN=atg4d PE=2 SV=1
Length = 469
Score = 142 bits (357), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 155/334 (46%), Gaps = 78/334 (23%)
Query: 11 DLEQIRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ 70
++E+ ++D SR+W TYR+ F + + LTTD GWGCM+R GQM++AQ LL L R+W
Sbjct: 95 EIERFQKDFVSRVWLTYRRDFPALEGTALTTDCGWGCMIRSGQMLLAQGLLLHLLSREWT 154
Query: 71 WN-------------------------------------------VNSKEEAYLKILKMF 87
W+ ++ + I++ F
Sbjct: 155 WSEALYRHFVEMEPIRSSSPPSMPLSSLATGHSAGDYQPHTQCSGAPHGDQVHRNIMRWF 214
Query: 88 EDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRK-LAKYDDWSSIVFHVALDNTL 146
D +P+ +HQ+ G+ GK G+W+GP+ VA +++K + + + +V+ D T+
Sbjct: 215 SDHPGSPFGLHQLVTLGSIFGKKAGDWYGPSIVAHIIKKAIETSSEVPELSVYVSQDCTV 274
Query: 147 VVNQVKKLCTTN--KRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYD 204
+++L + +S + +++++P+RLG + NPVY + +K+ +
Sbjct: 275 YKADIEQLFAGDVPHAETSRGAGKAVIILVPVRLGGETFNPVYKHCLKEFLRM------- 327
Query: 205 MVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVY 264
P LG+IGGKP H+LYFIGY N +++LDPH Q Y
Sbjct: 328 --------------------PSCLGIIGGKPKHSLYFIGYQDNYLLYLDPHYCQP----Y 363
Query: 265 DKEQDSEKKLDSTYHCPQASRLHILHMDPSIAVV 298
++ L+S +HC ++ I MDPS
Sbjct: 364 IDTSKNDFPLES-FHCNSPRKISITRMDPSCTFA 396
>sp|Q523C3|ATG4_MAGO7 Cysteine protease ATG4 OS=Magnaporthe oryzae (strain 70-15 / ATCC
MYA-4617 / FGSC 8958) GN=ATG4 PE=3 SV=2
Length = 491
Score = 140 bits (353), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 97/311 (31%), Positives = 139/311 (44%), Gaps = 66/311 (21%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR GF PI S G TTD GWGCM+R GQ
Sbjct: 154 DFESRIWMTYRSGFEPIPRSTDPTASSRMSFAMRLKTMADQQAGFTTDSGWGCMIRTGQS 213
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A +LL LGR W+ EE K+L +F D APYSIH GA++ GK GE
Sbjct: 214 LLANSLLTCRLGRSWRRGQAPDEE--RKLLSLFADDPRAPYSIHNFVAHGAAKCGKYPGE 271
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ + LA + S V+ + + ++ + + + P +++
Sbjct: 272 WFGPSATARCIHALANATENSFRVYSTGDLPDVYEDSFMEVAKPDGKT-----FHPTLIL 326
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
I RLGI IN VY + L PQS+G+ GG
Sbjct: 327 ISTRLGIDKINQVYWESLTATLQL---------------------------PQSVGIAGG 359
Query: 234 KPNHALYFIGYVGND------VIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRL 286
+P+ + YF+G +D + +LDP HT + D + + +DS H + RL
Sbjct: 360 RPSSSHYFVGAQRSDEDQGSYLFYLDPHHTRPALPFHEDPQLYTPSDVDSC-HTRRLRRL 418
Query: 287 HILHMDPSIAV 297
HI MDPS+ +
Sbjct: 419 HIREMDPSMLI 429
>sp|A2Q1V6|ATG4_MEDTR Cysteine protease ATG4 OS=Medicago truncatula GN=ATG4 PE=3 SV=1
Length = 487
Score = 138 bits (348), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 95/300 (31%), Positives = 139/300 (46%), Gaps = 48/300 (16%)
Query: 15 IRRDITSRLWFTYRKGFVPIGDSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVN 74
+D SR+ TYRKGF I DS T+D WGCMLR QM++AQALLF LGR W+ V+
Sbjct: 142 FEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQMLVAQALLFHKLGRSWRKTVD 201
Query: 75 SK-EEAYLKILKMFEDRRTAPYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAKYDDW 133
++ Y+ IL++F D A +SIH + G G AVG W GP + + LA+
Sbjct: 202 KPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEVLAR---- 257
Query: 134 SSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDIN----PVYI- 188
N+R + Q L + I + G +D PV
Sbjct: 258 ------------------------NQREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCI 293
Query: 189 -NGIKKCYALPISPV----YDMVKILSSTYNMQTPRY------EFTFPQSLGVIGGKPNH 237
+ K+C V ++ L + RY F FPQSLG++GGKP
Sbjct: 294 EDACKRCLEFSRGLVPWTPLLLLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGA 353
Query: 238 ALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
+ Y IG + +LDPH + V + D+++ S+YHC + + + +DPS+A+
Sbjct: 354 STYIIGVQNDKAFYLDPH---EVKPVVNITGDTQEPNTSSYHCNISRHMPLDSIDPSLAI 410
>sp|Q5B7L0|ATG4_EMENI Cysteine protease atg4 OS=Emericella nidulans (strain FGSC A4 /
ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=atg4 PE=3
SV=2
Length = 402
Score = 135 bits (341), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 98/304 (32%), Positives = 144/304 (47%), Gaps = 58/304 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D S++W TYR F PI G T+D GWGCM+R GQ
Sbjct: 76 DFESKIWMTYRSNFPPIPKDAGQEGSLSLTLGVRLRSQLIDAQGFTSDTGWGCMIRSGQS 135
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A ++ L LGRDW+ +EE K+L +F D AP+SIH GA GK GE
Sbjct: 136 LLANSMAILLLGRDWRRGERLEEEG--KLLSLFADSPHAPFSIHSFVKHGADFCGKHPGE 193
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP A+ ++ LA D S++ ++A DN+ V+Q K + + + +P +++
Sbjct: 194 WFGPTATARCIQGLAARYDQSNLQVYIADDNS-DVHQDKFMSVSRDEKGTV---RPTLIL 249
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ LRLGI I VY NG+K L PQS+G+ GG
Sbjct: 250 LGLRLGIDRITAVYWNGLKAVLQL---------------------------PQSVGIAGG 282
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMDP 293
+P+ + YF+ G+ +LDPH N Y + + +TYH + RL+I MDP
Sbjct: 283 RPSASHYFVAVQGSHFFYLDPH-NTRPALRYSESGTYTEDEVNTYHTRRLRRLNIQDMDP 341
Query: 294 SIAV 297
S+ +
Sbjct: 342 SMLI 345
>sp|Q6BYP8|ATG4_DEBHA Probable cysteine protease ATG4 OS=Debaryomyces hansenii (strain
ATCC 36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968)
GN=ATG4 PE=3 SV=2
Length = 492
Score = 133 bits (335), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 93/341 (27%), Positives = 147/341 (43%), Gaps = 91/341 (26%)
Query: 15 IRRDITSRLWFTYRKGFVPIG----------------------------------DSGLT 40
I +DI S++W TYR GF PI + T
Sbjct: 85 IEQDIYSKIWLTYRTGFEPIAKCLDGPQPLSFVQSMVFNRNPISSTFNNFHGLLDNDNFT 144
Query: 41 TDKGWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQI 100
TD GWGCM+R Q ++A L LGR + + + + + +I+ MF D AP+S+H
Sbjct: 145 TDVGWGCMIRTSQALLANTYQLLFLGRGFSYGRD-RSPRHDEIIDMFMDEPRAPFSLHNF 203
Query: 101 ALTGASEGKAV--GEWFGPNTVAQVLRKLA----KYDDWSSIVFHVALDNTLVVNQVKKL 154
+ V G+WFGPN + +++L + + + ++ + L + + ++
Sbjct: 204 IKVASESPLKVKPGQWFGPNAASLSIKRLCDNVYESNGTGRVKVVISESSNLYDDIITQM 263
Query: 155 CTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYN 214
TT NP +++++P+RLGI +NP+Y + + AL
Sbjct: 264 FTT-----LNPVPDAILVLLPVRLGIDKVNPLYHASVLELLALR---------------- 302
Query: 215 MQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQ---NIGCVYDKEQDSE 271
QS+G+ GGKP+ + YF GY GND+++LDPH Q N VYD
Sbjct: 303 -----------QSVGIAGGKPSSSFYFFGYKGNDLLYLDPHYPQFVRNKTSVYD------ 345
Query: 272 KKLDSTYHCPQASRLHILHMDPS----IAVVSQRSYSDYKN 308
TYH +L + MDPS I + Y D+K+
Sbjct: 346 -----TYHTNSYQKLSVDDMDPSMMIGILIKDINDYEDFKS 381
>sp|Q4U3V5|ATG4_CRYPA Probable cysteine protease ATG4 OS=Cryphonectria parasitica GN=ATG4
PE=2 SV=1
Length = 459
Score = 131 bits (330), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 93/312 (29%), Positives = 143/312 (45%), Gaps = 67/312 (21%)
Query: 18 DITSRLWFTYRKGFVPIGDS----------------------GLTTDKGWGCMLRCGQMV 55
D SR+W TYR F PI S G ++D GWGCM+R GQ +
Sbjct: 127 DFESRVWMTYRSEFEPISKSNDPRASAALSFAMRLRTLADQGGFSSDTGWGCMIRSGQSL 186
Query: 56 IAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGEW 114
+A L+ LGRDW+ +++E +IL F D APYS+H GA + GK GEW
Sbjct: 187 LANTLVICQLGRDWRRGKAARQER--EILARFADDPRAPYSLHNFVRHGAVACGKFPGEW 244
Query: 115 FGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVI 174
FGP+ A+ ++ LA ++ S V+ + + + + + P ++++
Sbjct: 245 FGPSATARCIQALANSNESSLRVYSTGDLPDVYEDSFMAVAKPDGET-----FHPTLILV 299
Query: 175 PLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGK 234
RLGI IN VY + L++T M PQS+G+ GG+
Sbjct: 300 GTRLGIDKINQVYW------------------EALTATLQM---------PQSVGIAGGR 332
Query: 235 PNHALYFIGY--------VGNDVIFLDPH-TNQNIGCVYDKEQDSEKKLDSTYHCPQASR 285
P+ + YFIG G+ + +LDPH T + D +Q + ++ T H + R
Sbjct: 333 PSASHYFIGAQRSGDAYEPGSYLFYLDPHCTRPALPFHEDVDQYTSDDIN-TCHTRRLRR 391
Query: 286 LHILHMDPSIAV 297
LH+ MDPS+ +
Sbjct: 392 LHVRDMDPSMLI 403
>sp|Q7S3X7|ATG4_NEUCR Probable cysteine protease atg-4 OS=Neurospora crassa (strain ATCC
24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
GN=atg-4 PE=3 SV=1
Length = 506
Score = 130 bits (327), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 137/305 (44%), Gaps = 60/305 (19%)
Query: 18 DITSRLWFTYRKGFVPIGDS-----------------------GLTTDKGWGCMLRCGQM 54
D SR+W TYR F I S G ++D GWGCM+R GQ
Sbjct: 174 DFESRIWMTYRTDFALIPRSSDPQASSALSFAMRIKTTFSDLTGFSSDTGWGCMIRSGQS 233
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A A+L LGR+W+ + E I+ +F D APYS+H GA+ GK GE
Sbjct: 234 LLANAILIARLGREWRRGTDLDAEK--DIIALFADDPRAPYSLHNFVKYGATACGKYPGE 291
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ LA V+ + + + + R +QP +++
Sbjct: 292 WFGPSATARCIQALADEKQSGLRVYSTGDLPDVYEDSFMAVANPDGRG-----FQPTLIL 346
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI IN VY + L ST + PQS+G+ GG
Sbjct: 347 VCTRLGIDKINQVY------------------EEALISTLQL---------PQSIGIAGG 379
Query: 234 KPNHALYFIGYVGNDVIFLDP-HTNQNIGCVYDKEQDSEKKLDSTYHCPQASRLHILHMD 292
+P+ + YF+G G + +LDP H + D + ++LD T H + +LHI MD
Sbjct: 380 RPSSSHYFVGVQGQRLFYLDPHHPRPALPYREDPRGYTAEELD-TCHTRRLRQLHIGDMD 438
Query: 293 PSIAV 297
PS+ +
Sbjct: 439 PSMLI 443
>sp|P0CQ10|ATG4_CRYNJ Cysteine protease ATG4 OS=Cryptococcus neoformans var. neoformans
serotype D (strain JEC21 / ATCC MYA-565) GN=ATG4 PE=3
SV=1
Length = 1193
Score = 130 bits (326), Expect = 2e-29, Method: Composition-based stats.
Identities = 86/269 (31%), Positives = 126/269 (46%), Gaps = 85/269 (31%)
Query: 36 DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ---------WNVNSKEEAYLK---- 82
+ GLT+D GWGCMLR GQ ++ AL+ +HLGRDW+ ++E A LK
Sbjct: 559 ERGLTSDAGWGCMLRTGQSLLVNALIHIHLGRDWRVPSTPASFSEATTTQEIAALKDYAK 618
Query: 83 ---ILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK-------- 129
+L F D + P+S+H++AL G GK VGEWFGP+T A L+ LA
Sbjct: 619 YAQMLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANSFAPCGVA 678
Query: 130 ---------------------YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW- 167
DDW+SI + N KK + A +W
Sbjct: 679 VATATDSIIYKSDVYTASNLPSDDWNSI--------SPTFNSSKKKRRGDNEAKEE-KWG 729
Query: 168 -QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
+ +++++ +RLG+ +NP+Y YD +K L FTFPQ
Sbjct: 730 KRAVLILVGVRLGLDGVNPIY---------------YDSIKAL------------FTFPQ 762
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
S+G+ GG+P+ + YF+G N + +LDPH
Sbjct: 763 SVGIAGGRPSSSYYFVGSQANHLFYLDPH 791
>sp|P0CQ11|ATG4_CRYNB Cysteine protease ATG4 OS=Cryptococcus neoformans var. neoformans
serotype D (strain B-3501A) GN=ATG4 PE=3 SV=1
Length = 1193
Score = 130 bits (326), Expect = 2e-29, Method: Composition-based stats.
Identities = 86/269 (31%), Positives = 126/269 (46%), Gaps = 85/269 (31%)
Query: 36 DSGLTTDKGWGCMLRCGQMVIAQALLFLHLGRDWQ---------WNVNSKEEAYLK---- 82
+ GLT+D GWGCMLR GQ ++ AL+ +HLGRDW+ ++E A LK
Sbjct: 559 ERGLTSDAGWGCMLRTGQSLLVNALIHIHLGRDWRVPSTPASFSEATTTQEIAALKDYAK 618
Query: 83 ---ILKMFEDRRT--APYSIHQIALTGASEGKAVGEWFGPNTVAQVLRKLAK-------- 129
+L F D + P+S+H++AL G GK VGEWFGP+T A L+ LA
Sbjct: 619 YAQMLSWFLDDPSPLCPFSVHRMALIGKELGKEVGEWFGPSTAAGALKTLANSFAPCGVA 678
Query: 130 ---------------------YDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQW- 167
DDW+SI + N KK + A +W
Sbjct: 679 VATATDSIIYKSDVYTASNLPSDDWNSI--------SPTFNSSKKKRRGDNEAKEE-KWG 729
Query: 168 -QPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQ 226
+ +++++ +RLG+ +NP+Y YD +K L FTFPQ
Sbjct: 730 KRAVLILVGVRLGLDGVNPIY---------------YDSIKAL------------FTFPQ 762
Query: 227 SLGVIGGKPNHALYFIGYVGNDVIFLDPH 255
S+G+ GG+P+ + YF+G N + +LDPH
Sbjct: 763 SVGIAGGRPSSSYYFVGSQANHLFYLDPH 791
>sp|Q2HH40|ATG4_CHAGB Probable cysteine protease ATG4 OS=Chaetomium globosum (strain ATCC
6205 / CBS 148.51 / DSM 1962 / NBRC 6347 / NRRL 1970)
GN=ATG4 PE=3 SV=2
Length = 448
Score = 129 bits (324), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 142/305 (46%), Gaps = 60/305 (19%)
Query: 18 DITSRLWFTYRKGFVPI----------------------GD-SGLTTDKGWGCMLRCGQM 54
D SR+W TYR GF PI GD +G ++D GWGCM+R GQ
Sbjct: 116 DFGSRIWMTYRTGFEPIPRSTDPKAASALSFTMRLKTSFGDQTGFSSDTGWGCMIRSGQS 175
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGA-SEGKAVGE 113
++A ALL LGRDW+ + E I+ +F D APYS+ GA + GK GE
Sbjct: 176 LLANALLISQLGRDWRRTTDPGAER--NIVALFADDARAPYSLQNFVKHGAIACGKHPGE 233
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLV 173
WFGP+ A+ ++ LA + S ++ + V + L T + + P +++
Sbjct: 234 WFGPSATARCIQALADQHESSLRIYSTG--DLPDVYEDSFLATARPDGET---FHPTLIL 288
Query: 174 IPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGG 233
+ RLGI INPVY + L ST M+ QS+G+ GG
Sbjct: 289 VCTRLGIDKINPVY------------------EEALISTLQME---------QSIGIAGG 321
Query: 234 KPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHILHMD 292
+P+ + YF+G + +LDPH + + + + ++LDS H + LH+ MD
Sbjct: 322 RPSSSHYFVGVQRQWLFYLDPHHPRPALQYRENPLNYTLEELDSC-HTRRLRYLHVEDMD 380
Query: 293 PSIAV 297
PS+ +
Sbjct: 381 PSMLI 385
>sp|Q86ZL5|ATG4_PODAS Probable cysteine protease ATG4 OS=Podospora anserina GN=ATG4 PE=3
SV=1
Length = 500
Score = 127 bits (320), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 138/308 (44%), Gaps = 74/308 (24%)
Query: 18 DITSRLWFTYRKGF--VP--------------------IGD-SGLTTDKGWGCMLRCGQM 54
D SR+W TYR GF +P GD +G ++D GWGCM+R GQ
Sbjct: 176 DFESRIWMTYRTGFEVIPRSTDPKAAAALSFTMRFKTSFGDQTGFSSDTGWGCMIRSGQS 235
Query: 55 VIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGASE-GKAVGE 113
++A A+L GR W+ N E +I+ +F D APYSI GA+ GK GE
Sbjct: 236 LLANAMLISRAGRAWRRTTNPDIE--REIVCLFADDPRAPYSIQNFVNHGAAACGKYPGE 293
Query: 114 WFGPNTVAQVLRKLAKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRASSNP---QWQPL 170
WFGP+ A+ + L Y + + ++ N +++NP + P
Sbjct: 294 WFGPSATARCIHSLRVY----------------LTRDLPEVYEDNFMSTANPDGNHFHPT 337
Query: 171 VLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYEFTFPQSLGV 230
++++ RLGI INP+Y + L PQ++G+
Sbjct: 338 LILVSTRLGIDKINPIYHEALISTLQL---------------------------PQAIGI 370
Query: 231 IGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQD-SEKKLDSTYHCPQASRLHIL 289
GG+P+ + YFIG G + +LDPH + + D + ++LDS H + LH+
Sbjct: 371 AGGRPSSSHYFIGAQGQWLFYLDPHHPRPALPYRENPNDYTIEELDSC-HTRRLRHLHVE 429
Query: 290 HMDPSIAV 297
MDPS+ +
Sbjct: 430 DMDPSMLI 437
>sp|Q75E61|ATG4_ASHGO Probable cysteine protease ATG4 OS=Ashbya gossypii (strain ATCC
10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056) GN=ATG4
PE=3 SV=1
Length = 521
Score = 127 bits (319), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/317 (29%), Positives = 154/317 (48%), Gaps = 70/317 (22%)
Query: 13 EQIRRDITSRLWFTYRKGFVPI-----GDSGLT------------------------TDK 43
E+ D+ +RL FTYR FVPI G S ++ TD
Sbjct: 114 EEFLADVHTRLHFTYRTRFVPIPRHPNGPSPMSISVMLRDNPLNVIENVLNNPDCFQTDI 173
Query: 44 GWGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALT 103
GWGCM+R GQ ++A AL LGRD++ + N+ E L+I+K FED P+S+H+
Sbjct: 174 GWGCMIRTGQSLLANALQRACLGRDFRIDDNAANEHELRIIKWFEDDPKYPFSLHKFVQE 233
Query: 104 GAS-EGKAVGEWFGPNTVAQVLRKL-AKYDDWSSIVFHVALDNTLV-VNQVKKLCTTNKR 160
G S GK GEWFGP+ ++ ++ L AK+ ++ D+ V +++V+ L +
Sbjct: 234 GFSLSGKKPGEWFGPSATSRSIQALVAKFPACGIAHCVISTDSGDVYMDEVEPLFRADPS 293
Query: 161 ASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRY 220
A+ ++L++ +RLG+ +N VY I+ ILSS +
Sbjct: 294 AA-------VLLLLCVRLGVDVVNEVYWEHIR--------------HILSSEH------- 325
Query: 221 EFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHC 280
S+G+ GG+P+ +LYF GY + +LDPH Q Y ++ D L + H
Sbjct: 326 ------SVGIAGGRPSSSLYFFGYQDEHLFYLDPHKPQLNLASYQQDLD----LFRSVHT 375
Query: 281 PQASRLHILHMDPSIAV 297
+ +++H+ +DPS+ +
Sbjct: 376 QRFNKVHMSDIDPSMLI 392
>sp|P53867|ATG4_YEAST Cysteine protease ATG4 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=ATG4 PE=1 SV=2
Length = 494
Score = 124 bits (310), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 89 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357
>sp|A6ZRL7|ATG4_YEAS7 Cysteine protease ATG4 OS=Saccharomyces cerevisiae (strain YJM789)
GN=ATG4 PE=3 SV=1
Length = 494
Score = 124 bits (310), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 136/316 (43%), Gaps = 83/316 (26%)
Query: 18 DITSRLWFTYRKGFVPI-----GDSGLT------------------------TDKGWGCM 48
D+ SR+ FTYR FVPI G S L+ TD GWGCM
Sbjct: 89 DVQSRVNFTYRTRFVPIARAPDGPSPLSLNLLVRTNPISTIEDYIANPDCFNTDIGWGCM 148
Query: 49 LRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTGAS-E 107
+R GQ ++ AL LHLGRD++ N N E K + F D AP+S+H G
Sbjct: 149 IRTGQSLLGNALQILHLGRDFRVNGNESLERESKFVNWFNDTPEAPFSLHNFVSAGTELS 208
Query: 108 GKAVGEWFGPNTVAQVLRKL------AKYDDWSSIVFHVALDNTLVVNQVKKLCTTNKRA 161
K GEWFGP A+ ++ L DD IV + D + N+V+K+ N +
Sbjct: 209 DKRPGEWFGPAATARSIQSLIYGFPECGIDD--CIVSVSSGD--IYENEVEKVFAENPNS 264
Query: 162 SSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNMQTPRYE 221
++ ++ ++LGI +N Y I ILSST
Sbjct: 265 R-------ILFLLGVKLGINAVNESYRESI--------------CGILSST--------- 294
Query: 222 FTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLDSTYHCP 281
QS+G+ GG+P+ +LYF GY GN+ + DPH Q E + H
Sbjct: 295 ----QSVGIAGGRPSSSLYFFGYQGNEFLHFDPHIPQPA---------VEDSFVESCHTS 341
Query: 282 QASRLHILHMDPSIAV 297
+ +L + MDPS+ +
Sbjct: 342 KFGKLQLSEMDPSMLI 357
>sp|Q6CH28|ATG4_YARLI Probable cysteine protease ATG4 OS=Yarrowia lipolytica (strain CLIB
122 / E 150) GN=ATG4 PE=3 SV=1
Length = 545
Score = 122 bits (305), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 88/337 (26%), Positives = 139/337 (41%), Gaps = 101/337 (29%)
Query: 18 DITSRLWFTYRKGF--VPIGDS---------------------GLTTDKGWGCMLRCGQM 54
D+ SR+W +YR GF +P D G T+D GWGCM+R Q
Sbjct: 68 DVQSRIWLSYRTGFPLIPKSDGSGTIHLGKLKNMIRGGGFDPRGYTSDVGWGCMIRTSQS 127
Query: 55 VIAQALLFLHLGRDWQWN------------------------VNSKEEAYLK-------- 82
++A ALLF HLGR W+WN N ++E +
Sbjct: 128 LLANALLFRHLGRGWRWNKGDDFVYLSEGNTESRGGESRNGGANKEQETAVSEETAVSEE 187
Query: 83 -ILKMFEDRRTAPYSIHQIALTGASE-GKAVGEWFGPNTVAQVLRKLAKYDDWSSIVFHV 140
I+ F D +P+SIH+ G G+WFGP+ + L S + +
Sbjct: 188 TIISWFLDSPDSPFSIHKFVRHGEKACSTPAGDWFGPSAAGSSIYALCNEFPDSGLKVYY 247
Query: 141 ALDNTLVVNQVKKLCTTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPIS 200
+ V + + L T PL+++ LRLGI ++NP+Y + +++ +L
Sbjct: 248 NGNGGGDVYEDELLETGF----------PLLVLCGLRLGIDNVNPIYWDSLRQMLSL--- 294
Query: 201 PVYDMVKILSSTYNMQTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNI 260
PQS+G+ GG+P + YF G+ G + +LDPH +
Sbjct: 295 ------------------------PQSVGIAGGRPFTSHYFFGFQGEQLFYLDPHQPKPA 330
Query: 261 GCVYDKEQDSEKKLDSTYHCPQASRLHILHMDPSIAV 297
DK+ +++H + +LH+ MDPS+ V
Sbjct: 331 VKTTDKDT-------TSFHSSRIWKLHLKEMDPSMLV 360
>sp|Q59UG3|ATG4_CANAL Cysteine protease ATG4 OS=Candida albicans (strain SC5314 / ATCC
MYA-2876) GN=ATG4 PE=3 SV=1
Length = 446
Score = 115 bits (289), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 92/336 (27%), Positives = 144/336 (42%), Gaps = 102/336 (30%)
Query: 19 ITSRLWFTYRKGFVPIGDS----------------------------------GLTTDKG 44
I S+LW +YR GF PI S T+D G
Sbjct: 83 IESKLWLSYRCGFEPIPKSIDGPQPIQFFPSIIFNRSTIYSNFANLKSLFDKENFTSDAG 142
Query: 45 WGCMLRCGQMVIAQALLFLHLGRDWQWNVNSKEEAYLKILKMFEDRRTAPYSIHQIALTG 104
WGCM+R Q ++A LL L+ K E +I+K+F+D ++P+SIH
Sbjct: 143 WGCMIRTSQNLLANTLLKLY----------PKNEP--EIVKLFQDDTSSPFSIHNFIRVA 190
Query: 105 ASEGKAV--GEWFGPNTVAQVLRKLA-------KYDDWSSIVFHVALDNTLVVNQVKKLC 155
+ V GEWFGPN + +++LA + D ++ ++ L ++++ +
Sbjct: 191 SLSPLHVKPGEWFGPNAASLSIKRLASELLQDQEIDGIKIPRVFISENSDLFDDEIRDVF 250
Query: 156 TTNKRASSNPQWQPLVLVIPLRLGIQDINPVYINGIKKCYALPISPVYDMVKILSSTYNM 215
K AS ++++ P+RLGI +N Y N I +L+S Y
Sbjct: 251 AKEKNAS-------VLILFPIRLGIDKVNSYYYNSI--------------FHLLASKY-- 287
Query: 216 QTPRYEFTFPQSLGVIGGKPNHALYFIGYVGNDVIFLDPHTNQNIGCVYDKEQDSEKKLD 275
S G+ GGKP+ + YF+GY D+I+ DPH Q + ++ +D
Sbjct: 288 -----------SCGIAGGKPSSSFYFLGYEDTDLIYFDPHLPQVV--------ETPINMD 328
Query: 276 STYHCPQASRLHILHMDPS----IAVVSQRSYSDYK 307
S YH +RL+I +DPS I V + Y D+K
Sbjct: 329 S-YHTTNYNRLNISLLDPSMMIGILVTNIDEYIDFK 363
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.321 0.137 0.428
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 115,635,439
Number of Sequences: 539616
Number of extensions: 4756899
Number of successful extensions: 10295
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 60
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 9981
Number of HSP's gapped (non-prelim): 91
length of query: 309
length of database: 191,569,459
effective HSP length: 117
effective length of query: 192
effective length of database: 128,434,387
effective search space: 24659402304
effective search space used: 24659402304
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (28.1 bits)